Query         lcl|Aclame:protein:vir:94800|NCBI_annot:ORF012|genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580
Match_columns 319
No_of_seqs    127 out of 204
Neff          6.9
Searched_HMMs 1612
Date          Sun Dec 1 20:22:13 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_29 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_29_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:97331 Length: 319 100.0 2E-117 1E-120 660.3 30.4 319 1-319 1-319 (319)
2 protein:vir:94800 Length: 319 100.0 2E-117 1E-120 660.3 30.4 319 1-319 1-319 (319)
3 protein:vir:107120 Length: 329 100.0 6E-109 4E-112 614.0 30.1 318 1-318 12-329 (329)
4 protein:vir:105464 Length: 346 100.0 5.2E-88 3.2E-91 499.1 26.9 289 27-319 1-318 (346)
5 protein:vir:102335 Length: 312 100.0 9.5E-87 5.9E-90 492.2 26.5 275 25-299 1-312 (312)
6 protein:vir:79008 Length: 299 100.0 3.6E-83 2.2E-86 472.6 27.7 272 18-297 1-299 (299)
7 protein:vir:79712 Length: 285 100.0 1.3E-83 8.3E-87 474.9 24.7 269 27-296 1-285 (285)
8 protein:vir:78920 Length: 290 100.0 4.7E-83 2.9E-86 471.9 26.2 265 27-294 1-290 (290)
9 protein:vir:99523 Length: 311 100.0 3.2E-82 2E-85 467.4 25.2 271 22-295 1-311 (311)
10 protein:vir:78090 Length: 302 100.0 2.3E-81 1.4E-84 462.7 25.0 270 25-297 1-302 (302)
11 protein:vir:105822 Length: 273 100.0 1.4E-50 8.7E-54 293.9 26.1 265 25-294 1-273 (273)
12 protein:vir:102605 Length: 273 100.0 1.4E-50 8.7E-54 293.9 26.1 265 25-294 1-273 (273)
13 protein:vir:7990 Length: 273 # 100.0 2.4E-49 1.5E-52 287.2 25.5 265 25-294 1-273 (273)
14 protein:vir:78739 Length: 332 100.0 6.9E-47 4.3E-50 273.6 18.9 275 12-292 1-332 (332)
15 protein:vir:3136 Length: 322 # 100.0 4.5E-47 2.8E-50 274.7 14.7 271 7-299 1-322 (322)
16 protein:vir:3364 Length: 347 # 100.0 9.4E-45 5.8E-48 262.0 20.1 285 5-300 1-347 (347)
17 protein:vir:94622 Length: 341 100.0 1.6E-44 1E-47 260.6 21.4 280 12-297 1-341 (341)
18 protein:vir:1541 Length: 347 # 100.0 2.5E-44 1.6E-47 259.6 21.1 287 1-300 1-347 (347)
19 protein:vir:10450 Length: 344 100.0 1.9E-44 1.2E-47 260.3 19.7 284 1-295 1-344 (344)
20 protein:vir:94711 Length: 347 100.0 1.6E-44 9.7E-48 260.7 18.4 285 5-295 1-347 (347)
21 protein:vir:2201 Length: 345 # 100.0 4.2E-43 2.6E-46 252.9 20.6 284 5-294 1-345 (345)
22 protein:vir:8885 Length: 347 # 100.0 4.1E-42 2.5E-45 247.5 19.5 285 5-295 1-347 (347)
23 protein:vir:99675 Length: 324 100.0 6.2E-42 3.8E-45 246.5 18.8 257 54-315 1-324 (324)
24 protein:vir:100057 Length: 375 100.0 6.6E-39 4.1E-42 229.9 22.4 291 1-299 1-375 (375)
25 protein:vir:80180 Length: 381 100.0 1.5E-38 9.3E-42 227.9 23.4 305 1-319 1-350 (381)
26 protein:vir:94576 Length: 347 100.0 3.3E-39 2E-42 231.6 19.2 284 5-294 1-347 (347)
27 protein:vir:96262 Length: 274 100.0 2E-37 1.2E-40 221.8 22.8 268 19-319 1-274 (274)
28 protein:vir:95898 Length: 274 100.0 2E-37 1.2E-40 221.8 22.8 268 19-319 1-274 (274)
29 protein:vir:80930 Length: 278 100.0 4.7E-37 2.9E-40 219.7 23.5 270 19-296 1-278 (278)
30 protein:vir:93742 Length: 274 100.0 3.7E-36 2.3E-39 214.8 23.8 268 19-319 1-274 (274)
31 protein:vir:80213 Length: 334 100.0 7.5E-37 4.7E-40 218.6 19.7 282 1-296 1-334 (334)
32 protein:vir:1239 Length: 274 # 100.0 4E-36 2.5E-39 214.7 23.5 268 19-319 1-274 (274)
33 protein:vir:94494 Length: 274 100.0 1.1E-35 7E-39 212.2 24.1 268 19-319 1-274 (274)
34 protein:vir:97433 Length: 274 100.0 1.1E-35 7E-39 212.2 24.1 268 19-319 1-274 (274)
35 protein:vir:96123 Length: 274 100.0 3.8E-35 2.3E-38 209.3 24.5 268 19-304 1-274 (274)
36 protein:vir:103323 Length: 364 100.0 3.5E-35 2.2E-38 209.5 22.5 294 18-315 1-364 (364)
37 protein:vir:99075 Length: 392 100.0 9.4E-35 5.8E-38 207.1 20.9 289 25-319 1-327 (392)
38 protein:vir:96833 Length: 275 100.0 1.7E-33 1.1E-36 200.2 23.2 270 18-303 1-275 (275)
39 protein:vir:3613 Length: 272 # 100.0 5.5E-32 3.4E-35 192.0 22.5 264 19-294 1-272 (272)
40 protein:vir:108303 Length: 418 100.0 9.9E-32 6.1E-35 190.6 21.9 281 22-319 1-339 (418)
41 protein:vir:105334 Length: 276 100.0 2.9E-31 1.8E-34 188.0 23.6 271 19-303 1-276 (276)
42 protein:vir:97031 Length: 402 100.0 1.2E-31 7.6E-35 190.0 18.7 298 18-319 1-364 (402)
43 protein:vir:6324 Length: 335 # 100.0 6.2E-31 3.8E-34 186.2 20.8 284 11-309 1-335 (335)
44 protein:vir:78935 Length: 335 100.0 1.2E-30 7.7E-34 184.5 20.9 284 11-305 1-335 (335)
45 protein:vir:3525 Length: 423 # 99.9 9.2E-30 5.7E-33 179.8 21.4 287 19-319 1-339 (423)
46 protein:vir:9820 Length: 272 # 99.9 4.3E-29 2.7E-32 176.1 23.4 266 19-300 1-272 (272)
47 protein:vir:3033 Length: 272 # 99.9 4.3E-29 2.7E-32 176.1 23.4 266 19-300 1-272 (272)
48 protein:vir:105374 Length: 423 99.9 7.4E-29 4.6E-32 174.8 21.6 289 25-319 1-339 (423)
49 protein:vir:174 Length: 423 # 99.9 6.8E-29 4.2E-32 175.0 21.2 291 19-319 1-339 (423)
50 protein:vir:105522 Length: 423 99.9 2.8E-27 1.7E-30 166.2 21.9 291 19-319 1-339 (423)
51 protein:vir:102655 Length: 322 99.9 5.3E-27 3.3E-30 164.6 22.4 277 1-295 3-322 (322)
52 protein:vir:7019 Length: 401 # 99.9 1.9E-27 1.2E-30 167.1 18.6 299 11-319 1-358 (401)
53 protein:vir:105645 Length: 400 99.9 3.1E-25 2E-28 154.9 18.4 299 11-319 1-371 (400)
54 protein:vir:95107 Length: 270 99.9 1.3E-24 8.3E-28 151.5 21.1 267 14-319 1-270 (270)
55 protein:vir:739 Length: 231 # 99.9 3.9E-24 2.4E-27 148.9 19.8 227 57-294 1-231 (231)
56 protein:vir:1781 Length: 221 # 99.8 1.3E-22 8.2E-26 140.5 15.4 194 99-301 1-221 (221)
57 protein:vir:9309 Length: 324 # 99.1 1.2E-10 7.2E-14 75.1 22.7 285 1-305 1-324 (324)
58 protein:vir:96223 Length: 324 99.1 2.5E-10 1.5E-13 73.2 23.4 287 1-305 1-324 (324)
59 protein:vir:1583 Length: 351 # 99.1 6.9E-11 4.3E-14 76.3 19.0 292 1-319 1-316 (351)
60 protein:vir:2106 Length: 430 # 99.0 1.8E-10 1.1E-13 74.1 19.6 267 19-295 1-430 (430)
61 protein:vir:78830 Length: 324 99.0 9.6E-10 5.9E-13 70.0 23.1 285 1-305 1-324 (324)
62 protein:vir:96392 Length: 324 99.0 9.6E-10 5.9E-13 70.0 23.1 285 1-305 1-324 (324)
63 protein:vir:97148 Length: 324 99.0 1.3E-09 7.8E-13 69.4 22.9 286 1-305 1-324 (324)
64 protein:vir:99749 Length: 324 99.0 1.6E-09 9.9E-13 68.8 23.1 285 1-305 1-324 (324)
65 protein:vir:103955 Length: 324 99.0 1.9E-09 1.2E-12 68.5 23.0 285 1-305 1-324 (324)
66 protein:vir:98871 Length: 314 98.9 2.8E-10 1.7E-13 72.9 16.1 286 1-297 1-314 (314)
67 protein:vir:100939 Length: 430 98.9 2E-09 1.2E-12 68.3 19.7 268 19-295 1-430 (430)
68 protein:vir:9265 Length: 430 # 98.9 2E-09 1.2E-12 68.3 19.7 268 19-295 1-430 (430)
69 protein:vir:102944 Length: 330 98.8 5.6E-09 3.5E-12 65.8 18.9 293 1-319 1-318 (330)
70 protein:vir:3969 Length: 287 # 98.7 1.9E-09 1.2E-12 68.4 16.2 262 19-295 1-287 (287)
71 protein:vir:94528 Length: 286 98.7 2E-09 1.3E-12 68.3 15.8 266 11-297 1-286 (286)
72 protein:vir:94673 Length: 419 98.7 2.3E-08 1.4E-11 62.4 21.2 288 1-299 95-419 (419)
73 protein:vir:5974 Length: 324 # 98.7 2.1E-08 1.3E-11 62.7 20.2 291 1-319 1-312 (324)
74 protein:vir:1886 Length: 385 # 98.5 2.6E-07 1.6E-10 56.7 22.3 281 1-298 69-385 (385)
75 protein:vir:191 Length: 385 # 98.5 2.6E-07 1.6E-10 56.7 22.3 281 1-298 69-385 (385)
76 protein:vir:100135 Length: 418 98.5 3.4E-07 2.1E-10 56.0 23.1 285 1-299 109-418 (418)
77 protein:vir:95763 Length: 297 98.5 1.8E-07 1.1E-10 57.5 21.4 269 14-297 1-297 (297)
78 protein:vir:9927 Length: 295 # 98.5 4.1E-08 2.5E-11 61.1 15.9 274 11-303 1-295 (295)
79 protein:vir:94142 Length: 304 98.4 6.8E-07 4.2E-10 54.4 22.2 262 18-293 1-304 (304)
80 protein:vir:105905 Length: 304 98.4 6.8E-07 4.2E-10 54.4 22.2 262 18-293 1-304 (304)
81 protein:vir:78223 Length: 333 98.4 8.7E-07 5.4E-10 53.8 21.5 283 1-296 4-333 (333)
82 protein:vir:80684 Length: 315 98.4 6.1E-07 3.8E-10 54.6 20.5 278 1-306 1-315 (315)
83 protein:vir:4997 Length: 397 # 98.3 1.3E-06 8.3E-10 52.8 23.3 293 1-304 79-397 (397)
84 protein:vir:97053 Length: 390 98.3 1.5E-06 9.3E-10 52.5 21.2 279 1-293 71-390 (390)
85 protein:vir:9410 Length: 415 # 98.3 1.5E-06 9.3E-10 52.5 22.3 299 1-306 71-415 (415)
86 protein:vir:98339 Length: 415 98.3 1.5E-06 9.6E-10 52.4 23.3 293 1-305 92-415 (415)
87 protein:vir:79987 Length: 415 98.3 1.5E-06 9.6E-10 52.4 23.3 293 1-305 92-415 (415)
88 protein:vir:81100 Length: 415 98.3 1.5E-06 9.6E-10 52.4 23.3 293 1-305 92-415 (415)
89 protein:vir:4339 Length: 395 # 98.3 1.6E-06 9.7E-10 52.4 21.3 280 1-295 77-395 (395)
90 protein:vir:41 Length: 299 # N 98.2 2.1E-06 1.3E-09 51.7 21.0 271 9-299 1-299 (299)
91 protein:vir:94771 Length: 298 98.2 2.2E-06 1.4E-09 51.6 21.5 262 18-295 1-298 (298)
92 protein:vir:4856 Length: 293 # 98.2 2.3E-06 1.5E-09 51.4 23.0 279 14-306 1-293 (293)
93 protein:vir:3991 Length: 404 # 98.2 2.8E-06 1.7E-09 51.1 23.4 292 1-307 97-404 (404)
94 protein:vir:7409 Length: 408 # 98.2 2.8E-06 1.8E-09 51.0 23.9 295 1-311 97-408 (408)
95 protein:vir:6242 Length: 390 # 98.2 3.1E-06 1.9E-09 50.8 20.0 284 1-298 74-390 (390)
96 protein:vir:4830 Length: 397 # 98.2 3.5E-06 2.2E-09 50.5 23.5 294 1-306 79-397 (397)
97 protein:vir:9759 Length: 303 # 98.1 3.8E-06 2.3E-09 50.3 20.2 266 1-297 1-303 (303)
98 protein:vir:81070 Length: 390 98.1 4.8E-06 3E-09 49.7 20.8 279 1-293 91-390 (390)
99 protein:vir:9574 Length: 300 # 98.1 4.8E-06 3E-09 49.7 20.1 266 1-299 1-300 (300)
100 protein:vir:1025 Length: 408 # 98.1 4.9E-06 3E-09 49.7 24.1 295 1-314 97-408 (408)
101 protein:vir:102119 Length: 404 98.0 6.1E-06 3.8E-09 49.2 22.1 292 1-301 83-404 (404)
102 protein:vir:104085 Length: 320 98.0 6.3E-06 3.9E-09 49.1 22.5 283 5-297 1-320 (320)
103 protein:vir:95451 Length: 313 98.0 3.6E-07 2.2E-10 55.9 12.6 266 22-298 1-313 (313)
104 protein:vir:1638 Length: 298 # 98.0 7.4E-06 4.6E-09 48.7 21.3 262 23-296 1-298 (298)
105 protein:vir:95376 Length: 425 98.0 7.6E-06 4.7E-09 48.6 20.5 288 1-299 96-425 (425)
106 protein:vir:8102 Length: 543 # 98.0 6.5E-06 4E-09 49.0 18.9 284 1-298 225-543 (543)
107 protein:vir:4953 Length: 397 # 98.0 8E-06 5E-09 48.5 23.6 296 1-306 74-397 (397)
108 protein:vir:1084 Length: 437 # 98.0 8.8E-06 5.5E-09 48.3 21.6 290 1-305 125-437 (437)
109 protein:vir:102873 Length: 392 98.0 9E-06 5.6E-09 48.3 23.2 288 1-302 76-392 (392)
110 protein:vir:102082 Length: 392 98.0 9E-06 5.6E-09 48.3 23.2 288 1-302 76-392 (392)
111 protein:vir:105004 Length: 392 98.0 9E-06 5.6E-09 48.3 23.2 288 1-302 76-392 (392)
112 protein:vir:107593 Length: 392 98.0 9E-06 5.6E-09 48.3 23.2 288 1-302 76-392 (392)
113 protein:vir:4700 Length: 415 # 98.0 9.2E-06 5.7E-09 48.2 21.5 296 1-306 71-415 (415)
114 protein:vir:4600 Length: 415 # 98.0 9.2E-06 5.7E-09 48.2 21.5 296 1-306 71-415 (415)
115 protein:vir:1383 Length: 421 # 98.0 9.5E-06 5.9E-09 48.1 23.4 297 1-317 100-421 (421)
116 protein:vir:10364 Length: 390 97.9 1.1E-05 7.1E-09 47.7 21.8 279 1-293 71-390 (390)
117 protein:vir:7771 Length: 330 # 97.9 1.2E-05 7.7E-09 47.5 23.3 281 1-305 1-330 (330)
118 protein:vir:4511 Length: 409 # 97.9 1.3E-05 8.1E-09 47.4 20.5 290 1-299 92-409 (409)
119 protein:vir:1328 Length: 392 # 97.9 1.5E-05 9.1E-09 47.1 20.4 281 1-298 85-392 (392)
120 protein:vir:100172 Length: 394 97.8 1.5E-05 9.4E-09 47.0 23.6 283 1-303 96-394 (394)
121 protein:vir:3870 Length: 400 # 97.8 1.6E-05 1E-08 46.9 21.0 277 1-297 95-400 (400)
122 protein:vir:485 Length: 407 # 97.8 1.8E-05 1.1E-08 46.7 22.1 290 1-305 86-407 (407)
123 protein:vir:93881 Length: 387 97.7 2.5E-05 1.5E-08 45.9 19.2 275 1-301 81-387 (387)
124 protein:vir:4092 Length: 390 # 97.7 3E-05 1.9E-08 45.4 21.5 302 1-318 59-390 (390)
125 protein:vir:100884 Length: 389 97.7 3.1E-05 1.9E-08 45.3 24.3 282 1-302 95-389 (389)
126 protein:vir:100247 Length: 425 97.7 3.2E-05 2E-08 45.2 21.5 280 1-296 117-425 (425)
127 protein:vir:2770 Length: 318 # 97.6 2.5E-05 1.6E-08 45.8 16.9 227 18-254 1-318 (318)
128 protein:vir:81227 Length: 413 97.5 5.2E-05 3.2E-08 44.1 22.9 288 1-300 81-413 (413)
129 protein:vir:78523 Length: 338 97.5 5.3E-05 3.3E-08 44.0 23.8 286 1-302 4-338 (338)
130 protein:vir:4456 Length: 401 # 97.5 6.4E-05 4E-08 43.6 21.4 281 1-295 87-401 (401)
131 protein:vir:4786 Length: 295 # 97.4 2.7E-05 1.7E-08 45.6 14.2 263 18-291 1-295 (295)
132 protein:vir:9704 Length: 394 # 97.4 8E-05 4.9E-08 43.1 22.8 280 1-299 91-394 (394)
133 protein:vir:4226 Length: 326 # 97.4 8.3E-05 5.2E-08 42.9 23.1 288 1-297 3-326 (326)
134 protein:vir:9875 Length: 296 # 97.3 7.7E-05 4.8E-08 43.1 16.2 274 1-297 1-296 (296)
135 protein:vir:96978 Length: 387 97.3 9.4E-05 5.8E-08 42.7 17.8 278 1-301 78-387 (387)
136 protein:vir:94424 Length: 387 97.3 9.4E-05 5.8E-08 42.7 17.8 278 1-301 78-387 (387)
137 protein:vir:2685 Length: 387 # 97.3 9.4E-05 5.8E-08 42.7 17.8 278 1-301 78-387 (387)
138 protein:vir:2344 Length: 397 # 97.3 9.8E-05 6.1E-08 42.6 22.5 301 1-319 1-343 (397)
139 protein:vir:108211 Length: 318 97.3 0.0001 6.3E-08 42.5 17.7 274 5-295 1-318 (318)
140 protein:vir:81160 Length: 371 97.2 0.00014 8.7E-08 41.7 23.1 281 1-295 61-371 (371)
141 protein:vir:101607 Length: 379 97.2 0.00015 9.1E-08 41.6 21.5 284 1-296 64-379 (379)
142 protein:vir:3845 Length: 395 # 97.2 0.00015 9.2E-08 41.6 23.7 293 1-304 71-395 (395)
143 protein:vir:6212 Length: 434 # 97.1 0.00018 1.1E-07 41.1 19.7 292 1-299 102-434 (434)
144 protein:vir:9361 Length: 402 # 96.9 0.00025 1.6E-07 40.3 17.4 277 1-301 98-402 (402)
145 protein:vir:3158 Length: 321 # 96.9 0.00027 1.7E-07 40.1 19.8 282 14-304 1-321 (321)
146 protein:vir:8187 Length: 311 # 96.9 0.00028 1.8E-07 40.0 20.4 266 11-297 1-311 (311)
147 protein:vir:1268 Length: 397 # 96.8 0.0003 1.9E-07 39.9 22.3 282 1-296 94-397 (397)
148 protein:vir:106647 Length: 303 96.8 0.00032 2E-07 39.7 16.1 275 1-303 1-303 (303)
149 protein:vir:93696 Length: 364 96.8 0.00036 2.2E-07 39.4 18.1 271 1-307 1-364 (364)
150 protein:vir:962 Length: 397 # 96.7 0.00038 2.4E-07 39.3 18.1 273 1-295 87-397 (397)
151 protein:vir:80446 Length: 367 96.7 0.00038 2.4E-07 39.3 17.6 290 11-319 1-351 (367)
152 protein:vir:2430 Length: 318 # 96.7 0.0004 2.5E-07 39.2 23.0 284 5-302 1-318 (318)
153 protein:vir:1433 Length: 435 # 96.7 0.00043 2.7E-07 39.0 20.5 284 1-298 101-435 (435)
154 protein:vir:80376 Length: 435 96.6 0.00044 2.7E-07 39.0 20.7 282 1-298 101-435 (435)
155 protein:vir:78640 Length: 352 96.6 0.00045 2.8E-07 38.9 18.3 274 1-301 49-352 (352)
156 protein:vir:819 Length: 404 # 96.6 0.00048 3E-07 38.8 16.8 278 1-287 1-404 (404)
157 protein:vir:10123 Length: 404 96.6 0.00048 3E-07 38.8 16.8 278 1-287 1-404 (404)
158 protein:vir:3298 Length: 404 # 96.6 0.00048 3E-07 38.8 16.8 278 1-287 1-404 (404)
159 protein:vir:104439 Length: 404 96.6 0.00048 3E-07 38.8 16.8 278 1-287 1-404 (404)
160 protein:vir:2504 Length: 305 # 96.4 0.00069 4.3E-07 37.9 21.5 269 1-304 1-305 (305)
161 protein:vir:79642 Length: 329 95.9 0.0013 7.8E-07 36.5 16.5 279 1-291 6-329 (329)
162 protein:vir:99920 Length: 311 95.9 0.0013 8.3E-07 36.3 20.1 266 18-295 1-311 (311)
163 protein:vir:4197 Length: 314 # 95.8 0.0014 8.6E-07 36.3 21.8 279 1-303 4-314 (314)
164 protein:vir:96762 Length: 632 95.6 0.0018 1.1E-06 35.6 18.1 278 1-295 309-632 (632)
165 protein:vir:8420 Length: 477 # 95.5 0.002 1.2E-06 35.4 20.0 295 1-300 120-477 (477)
166 protein:vir:105610 Length: 430 95.4 0.0021 1.3E-06 35.3 16.6 267 18-302 1-430 (430)
167 protein:vir:104256 Length: 458 94.6 0.0039 2.4E-06 33.8 21.4 284 1-298 116-458 (458)
168 protein:vir:5739 Length: 366 # 94.0 0.0056 3.5E-06 32.9 21.1 281 1-294 20-366 (366)
169 protein:vir:103285 Length: 296 92.7 0.011 6.5E-06 31.4 17.2 269 12-297 1-296 (296)
170 protein:vir:107687 Length: 319 91.7 0.015 9E-06 30.7 17.7 282 1-294 1-319 (319)
171 protein:vir:95963 Length: 395 91.6 0.015 9.4E-06 30.6 19.9 289 1-317 62-395 (395)
172 protein:vir:105038 Length: 428 90.7 0.019 1.2E-05 30.0 21.0 276 1-294 96-428 (428)
173 protein:vir:101650 Length: 497 90.1 0.023 1.4E-05 29.6 22.9 287 1-299 118-497 (497)
174 protein:vir:7855 Length: 497 # 90.1 0.023 1.4E-05 29.6 22.9 287 1-299 118-497 (497)
175 protein:vir:5942 Length: 523 # 89.1 0.028 1.8E-05 29.1 13.7 276 1-295 162-523 (523)
176 protein:vir:78350 Length: 383 88.5 0.032 2E-05 28.8 18.8 280 1-302 55-383 (383)
177 protein:vir:98635 Length: 377 87.5 0.038 2.4E-05 28.4 13.7 273 1-294 54-377 (377)
178 protein:vir:93616 Length: 645 87.5 0.038 2.4E-05 28.4 22.7 290 1-303 298-645 (645)
179 protein:vir:95875 Length: 401 87.3 0.04 2.5E-05 28.3 15.3 275 11-296 1-401 (401)
180 protein:vir:94989 Length: 349 86.3 0.047 2.9E-05 27.9 21.0 290 1-319 1-331 (349)
181 protein:vir:80068 Length: 301 86.0 0.049 3E-05 27.8 17.9 256 19-292 1-301 (301)
182 protein:vir:9643 Length: 377 # 85.8 0.05 3.1E-05 27.7 18.7 277 1-294 54-377 (377)
183 protein:vir:5255 Length: 304 # 82.9 0.073 4.5E-05 26.8 13.4 266 18-292 1-304 (304)
184 protein:vir:103886 Length: 302 82.0 0.081 5E-05 26.6 20.3 257 18-298 1-302 (302)
185 protein:vir:4159 Length: 315 # 78.1 0.12 7.3E-05 25.7 19.8 277 1-293 8-315 (315)
186 protein:vir:78387 Length: 349 77.9 0.12 7.4E-05 25.6 20.8 288 1-319 1-331 (349)
187 protein:vir:95512 Length: 693 77.0 0.13 8E-05 25.5 17.1 279 1-294 371-693 (693)
188 protein:vir:9509 Length: 381 # 75.0 0.15 9.4E-05 25.1 19.4 282 1-312 52-381 (381)
189 protein:vir:101291 Length: 381 75.0 0.15 9.4E-05 25.1 19.4 282 1-312 52-381 (381)
190 protein:vir:104342 Length: 314 75.0 0.15 9.4E-05 25.1 15.2 285 1-297 3-314 (314)
191 protein:vir:100632 Length: 381 64.6 0.3 0.00018 23.5 19.4 285 1-307 41-381 (381)
192 protein:vir:79548 Length: 652 60.5 0.37 0.00023 22.9 19.1 272 1-293 341-652 (652)
193 protein:vir:8324 Length: 410 # 43.0 0.86 0.00054 20.9 14.5 273 1-294 86-410 (410)
194 protein:vir:99424 Length: 360 40.3 0.98 0.00061 20.6 18.3 283 1-297 1-360 (360)
195 protein:vir:95131 Length: 325 33.1 1.4 0.00086 19.8 19.0 286 11-319 1-316 (325)
196 protein:vir:106590 Length: 349 24.2 2.2 0.0014 18.7 15.2 287 5-314 1-349 (349)
197 protein:vir:80128 Length: 466 22.5 2.4 0.0015 18.5 17.3 298 1-319 105-466 (466)

No 1
>protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687
Probab=100.00 E-value=2.1e-117 Score=660.27
Aligned_cols=319 Identities=100% Similarity=1.399 Sum_probs=315.0 Q ss_pred CCcccccccceeeehhhhhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeecccccccccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYK 80 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~ 80 (319) |||+||||||||++||||||++.++||+++|++||+++||+++.+.++++++++|++|+|.+|++||||+|+++|++||+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY~ 80 (319) T protein:vir:97 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYK 80 (319) T ss_pred CCcccccccceeEeehhhhhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEeccCcEEEEeeeccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccCC Q lcl|Aclame:pro 81 RNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTG 160 (319) Q Consensus 81 r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~~~~T 160 (319) |++++++++++++|+|++|||||||+|.||+||++|+++.+++++++++|++++++||+|+|||++++++++.+.+.++| T Consensus 81 R~~g~~~g~vt~~~~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~~~~~~t 160 (319) T protein:vir:97 81 RNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTG 160 (319) T ss_pred CCCCcccCCcccceeEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcccccccccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEEEEecccccccceEE Q lcl|Aclame:pro 161 SDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAI 240 (319) Q Consensus 161 ~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i 240 (319) 
++|+|++|++++++|||++||++|||||||++|++|+++++|+++.+.++.++++|+||+||||+|++||+.+++++||| T Consensus 161 ~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~Vi~vps~~~k~in~i 240 (319) T protein:vir:97 161 SDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAI 240 (319) T ss_pred HHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeeeceeecCeEEEEecccccccceEE Confidence 99999999999999999999999999999999999999999999999988899999999999999999999999999999 Q ss_pred EEcCCceeeeeeeeeeeeecCCCCCccceeeeeeeeeEEEeccccceEEEEccccccCCCCCccccccccccccccccC Q lcl|Aclame:pro 241 AVVGEVLASPIQADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDGVDAHADNVAKPSGSLEM 319 (319) Q Consensus 241 ~~~~~A~~~~~k~~~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 319 (319) ++|++|+++++|++++++|+|+|++|||+|+||+|||+||++||++|||++..+++++.++|+.+|+++++||+||.|| T Consensus 241 ~~h~~A~~~~~k~~~~~~~~p~~~~~a~~v~gr~y~d~~V~~~k~~~Iy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (319) T protein:vir:97 241 AVVGEVLASPIQADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDGVDAHADNVAKPSGSLEM 319 (319) T ss_pred EEcCCeeeeeeeeeeeeccCCCccccceeeeeeeeeeeEEeccccceEEEeecCCcccCCCccccccccccCCcccccC Confidence 9999999999999999999989999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=100.00 E-value=2.1e-117 Score=660.27 Aligned_cols=319 Identities=100% Similarity=1.399 Sum_probs=315.0 Q ss_pred CCcccccccceeeehhhhhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeecccccccccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYK 80 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~ 80 (319) |||+||||||||++||||||++.++||+++|++||+++||+++.+.++++++++|++|+|.+|++||||+|+++|++||+ T Consensus 1 
~~~~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY~ 80 (319) T protein:vir:94 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYK 80 (319) T ss_pred CCcccccccceeEeehhhhhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEeccCcEEEEeeeccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccCC Q lcl|Aclame:pro 81 RNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTG 160 (319) Q Consensus 81 r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~~~~T 160 (319) |++++++++++++|+|++|||||||+|.||+||++|+++.+++++++++|++++++||+|+|||++++++++.+.+.++| T Consensus 81 R~~g~~~g~vt~~~~t~tidqdR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a~~~~~~~~t 160 (319) T protein:vir:94 81 RNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTG 160 (319) T ss_pred CCCCcccCCcccceeEEEeecccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhcccccccccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEEEEecccccccceEE Q lcl|Aclame:pro 161 SDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAI 240 (319) Q Consensus 161 ~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i 240 (319) ++|+|++|++++++|||++||++|||||||++|++|+++++|+++.+.++.++++|+||+||||+|++||+.+++++||| T Consensus 161 ~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~Vi~vps~~~k~in~i 240 (319) T protein:vir:94 161 SDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAI 240 (319) T ss_pred HHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeeeceeecCeEEEEecccccccceEE Confidence 99999999999999999999999999999999999999999999999988899999999999999999999999999999 Q ss_pred 
EEcCCceeeeeeeeeeeeecCCCCCccceeeeeeeeeEEEeccccceEEEEccccccCCCCCccccccccccccccccC Q lcl|Aclame:pro 241 AVVGEVLASPIQADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDGVDAHADNVAKPSGSLEM 319 (319) Q Consensus 241 ~~~~~A~~~~~k~~~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 319 (319) ++|++|+++++|++++++|+|+|++|||+|+||+|||+||++||++|||++..+++++.++|+.+|+++++||+||.|| T Consensus 241 ~~h~~A~~~~~k~~~~~~~~p~~~~~a~~v~gr~y~d~~V~~~k~~~Iy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 319 (319) T protein:vir:94 241 AVVGEVLASPIQADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDGVDAHADNVAKPSGSLEM 319 (319) T ss_pred EEcCCeeeeeeeeeeeeccCCCccccceeeeeeeeeeeEEeccccceEEEeecCCcccCCCccccccccccCCcccccC Confidence 9999999999999999999989999999999999999999999999999999999999999999999999999999999 No 3 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=100.00 E-value=5.8e-109 Score=613.97 Aligned_cols=318 Identities=83% Similarity=1.248 Sum_probs=305.1 Q ss_pred CCcccccccceeeehhhhhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeecccccccccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYK 80 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~ 80 (319) |||.||||||-|++||||||++.+|||+++|++||+++||+++.+.++++++++|++|++.+|++||||+|+++|++||+ T Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~~g~tVkIp~i~~~gl~DY~ 91 (329) T protein:vir:10 12 MNKEIKNATGKLKLNLQHFANKSVEPGDTLLKNKHVGILEKVTAANSYSAPAVISNDAIFMQGRSFTVIKGDVTELKDYK 91 (329) T ss_pred hhhhhhcccceeEEehhhhcCCccCCchhHHHHHHHHHHHHHHHhhceeeeeecccceeeccCcEEEEeeeccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccCC Q lcl|Aclame:pro 81 
RNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTG 160 (319) Q Consensus 81 r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~~~~T 160 (319) |++++++++++++|++++|||||||+|.||+||++|+++.+++++++++|++++++||+|+|||++|+++++.+.+.++| T Consensus 92 R~~g~~~g~vt~~~~t~tidqdR~~~F~VD~~D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~~skla~~a~~~~~~~~t 171 (329) T protein:vir:10 92 RNATNEFDHPQIQETTYFLDQEKYWGRFVDALDRRDTEGNIDINYVVAKQASEVVAPYLDNLRFATLARNKAKHLTVGSG 171 (329) T ss_pred CCCCccccccccceeEEEeecccceeeecchhhHhhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHHHHhhcccccccccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999998888999 Q ss_pred HhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEEEEecccccccceEE Q lcl|Aclame:pro 161 SDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAI 240 (319) Q Consensus 161 ~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i 240 (319) ++|+|++|++++++|||++||++|||||+|++|++|+++++|.+..+..+.++++|+||+||||+|++||+.+++++||| T Consensus 172 ~~nay~~i~~a~~~Lde~~vp~~Rvl~VtP~~~~~Lk~~~~f~~~~~~~~~~~~~g~Vg~idG~~Ii~vps~~~k~in~i 251 (329) T protein:vir:10 172 ADAQYDAVLDVSVELDEIGAGASRILFVTPKFYKGIKKFVIELPQGDNRQQVLGKGVQGELDGFTIVKVPSKMLQGVEAM 251 (329) T ss_pred HHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeeeeeeecCeEEEEecCCcccceeEE Confidence 99999999999999999999999999999999999999999998887778899999999999999999999999999999 Q ss_pred EEcCCceeeeeeeeeeeeecCCCCCccceeeeeeeeeEEEeccccceEEEEccccccCCCCCcccccccccccccccc Q lcl|Aclame:pro 241 AVVGEVLASPIQADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDGVDAHADNVAKPSGSLE 318 (319) Q Consensus 241 ~~~~~A~~~~~k~~~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 318 (319) ++|++|+++++|++++|+|+|+|++|||+|+||+|||+||++||++|||++..+++++.++++++|+++--.+....- T Consensus 252 i~~~~A~~~~~K~~~~~~~~p~~~~~a~~v~gr~yyd~~V~~~k~~~I~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~ 329 (329) T protein:vir:10 252 
AVIGEVMASPIQANEAKLNSNVPGMFGTLAEQMLYTGAFVPEHLQKYIFTIGGKEVETNRDGVDAHADETNASADTGA 329 (329) T ss_pred EEcCCceeeeeeeeeeeeeCCCCccchheeeeeeeeeeEEEccccCEEEEecccCcccCCCCCCccccccccccccCC Confidence 999999999999999999998999999999999999999999999999999999999999999988876444221111 No 4 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=100.00 E-value=5.2e-88 Score=499.13 Aligned_cols=289 Identities=17% Similarity=0.180 Sum_probs=262.0 Q ss_pred hhhhhhHhhHHHHHHHHHhhhhhhhcc----cCcceeeeCCceEEeeeccc-cccccccCCCCcc-cCCcccceeEEEEe Q lcl|Aclame:pro 27 GQTLLKNKHVGILERVTAVNAYSTPAL----ISNDAIFMEGRSFTVMKGDT-TELKDYKRNATNE-FDHPKIEETTYFLD 100 (319) Q Consensus 27 n~~~l~~ky~~lld~~~~~~sl~~~~~----~n~~~~~~~g~tVkIp~i~~-~g~~DY~r~~~~~-~~~~t~t~~tltid 100 (319) =+++|+++|++.||+.+.+.++++..+ .|.+++|+||++||||+|++ +|++||+|++++. .++++++|+|++|+ T Consensus 1 Mainya~~~~~~Ld~~~~~~~lts~~l~~~~~~~~v~~~ggktVkIp~is~tsGl~DY~R~~g~~~~g~v~~~~et~tl~ 80 (346) T protein:vir:10 1 MTINYAEKYQAAVQQAFYDGHLYSAELWNSPSNSIIKFDGAKHIKVPRLEITSGRKDRQRRTITTPVANYSNDWDSYELK 80 (346) T ss_pred CcchhHHHHHHHHHHHHHhhhccchhhcccccccceEecCCCEEEEEEeeeecccccccccCCcccccccccceeEEEee Confidence 345678999999999998887765544 56788999999999999985 7999999999997 59999999999999 Q ss_pred ecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCc-----cccccCCHhHHHHHHHHHHHHH Q lcl|Aclame:pro 101 QEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAK-----HLTVGTGSDAQYDAVLDVSVEL 175 (319) Q Consensus 101 qdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~-----~~~~~~T~~n~~~~i~~a~~~L 175 (319) |||+|+|.||.||++|+++.+++++++++|+|++++||+|+|||++|++.++. 
..+.++|++|+|++|++++++| T Consensus 81 qDR~~~F~vD~mDvDETn~~~~~anv~~ef~r~~vvPEiDayrfskLa~~a~~~~~~~~~~~a~T~~ni~~~i~~~~~~l 160 (346) T protein:vir:10 81 NERYWSTLVDPSDIDETNMVVSLANITKQFNLDSKMPEKDRYMFSHLYSGKEAAHDGGITTNTLDEKNILPAFDNMMLDF 160 (346) T ss_pred ccccceecccccchHHHHHHhHHHHHHHHHHHHhhcchhhHHHHHHHHHhhhhhccccccccccCHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999977633 2345679999999999999999 Q ss_pred HhccCC-CCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEEEEecccccc----------------cce Q lcl|Aclame:pro 176 DEIKAP-ENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQ----------------GLQ 238 (319) Q Consensus 176 de~~VP-~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~~----------------~~n 238 (319) +|++|| ++|||||||++|++|+++++|+++.++++.+.++|+||+||||+|++||++||+ .|| T Consensus 161 de~~vp~~~rvl~vTp~~~~lLk~s~~f~k~~~v~~~~~i~~~V~siDGv~Ii~VPs~r~~t~~~f~~G~~~~t~ak~IN 240 (346) T protein:vir:10 161 DEARIPSTNRILYVTPKTNAILKRAEAMNRALTLKDPNNIQRTVYSLDDVTIRVVPSDLMQTAYDFSDGSKIIDTAKQIE 240 (346) T ss_pred HHccCCCCCeEEEECHHHHHHHhhchhheeccccccccccceeeeeecCeEEEEcchhhcccchhhccCccccCCcccee Confidence 999999 799999999999999999999999998887889999999999999999999986 599 Q ss_pred EEEEcCCceeeeeeeeeeeeecCCCCC-ccceeeeeeeeeEEEeccccceEEEEccccccCCCCCccccccccccccccc Q lcl|Aclame:pro 239 AIAVVGEVLASPIQADLAKTNSNIPGM-FGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDGVDAHADNVAKPSGSL 317 (319) Q Consensus 239 ~i~~~~~A~~~~~k~~~~~~~~~~~~~-~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~~~~~~~~~~~~~~~~ 317 (319) ||++|++|+++++|++++++|.|.+++ .+|+++||.|||+||++||++|||++..++|++.++. ...++||++.. 
T Consensus 241 fiiv~~~A~ia~~K~~~~~if~P~~~~~g~~l~~~R~Y~D~fv~~nk~~~Iyv~~~~a~~~~~~~----~~~~~kpt~~~ 316 (346) T protein:vir:10 241 MFLIYNGVQIAPEKYSFVGFDQPSAATSGNYLYYEQSYDDVLLLNTKTKGIQFVVSDKPKKDQEQ----SGQDAKPTAES 316 (346) T ss_pred EEEECCceeeeeeeeeeeEeeCCCCCcccceeeeeeeeeeeeeeccccceEEEeeecccccCccC----cccccCccccc Confidence 999999999999999999999986554 4499999999999999999999999999998887665 44567888888 Q ss_pred cC Q lcl|Aclame:pro 318 EM 319 (319) Q Consensus 318 ~~ 319 (319) +| T Consensus 317 ~~ 318 (346) T protein:vir:10 317 TL 318 (346) T ss_pred ch Confidence 88 No 5 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=100.00 E-value=9.5e-87 Score=492.20 Aligned_cols=275 Identities=14% Similarity=0.043 Sum_probs=250.2 Q ss_pred chhhhhhhHhhHHHHHHHHHhhhhhhhcc-cCcceeeeCCceEEeeeccccccccccCCCC--cccCCcccceeEEEEee Q lcl|Aclame:pro 25 EPGQTLLKNKHVGILERVTAVNAYSTPAL-ISNDAIFMEGRSFTVMKGDTTELKDYKRNAT--NEFDHPKIEETTYFLDQ 101 (319) Q Consensus 25 ~~n~~~l~~ky~~lld~~~~~~sl~~~~~-~n~~~~~~~g~tVkIp~i~~~g~~DY~r~~~--~~~~~~t~t~~tltidq 101 (319) =+|+|+|+++|++.||+++.+.+++..+. 
.|.+++|.||+|||||+|+++|++||+|+++ +..|+++++|+|++|+| T Consensus 1 Mantl~ya~~~~~~LD~~~~~~~~s~~l~~~~~~v~~~ggktVkIp~i~~~gl~DY~R~~g~~~~~g~v~~~~et~tl~q 80 (312) T protein:vir:10 1 MANTLAYGQVLQQGLDKQATQELLTGWMDSNAKQIKYEGGKEVKIGKLSTDGLGDYSRGSANAYVGGDVKFEYETKTMTQ 80 (312) T ss_pred CCcchhHHHHHHHHHHHHHHhhhccccccCCCceEEEecCcEEEEEeeecccccccccccCCccccccccccceeEEeee Confidence 35889999999999999999999886553 4567899999999999999999999999987 77789999999999999 Q ss_pred cccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccc-------cccCCHhHHHHHHHHHHHH Q lcl|Aclame:pro 102 EKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHL-------TVGTGSDAQYDAVLDVSVE 174 (319) Q Consensus 102 dr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~-------~~~~T~~n~~~~i~~a~~~ 174 (319) ||+|+|.||.||++|+++.+++++++++|+|++++||+|+||||+|++.+.... ..++|++|+|++|++++++ T Consensus 81 DR~~~F~vD~mDvDETn~~~s~anv~~ef~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~~T~~ni~~~i~~~~~~ 160 (312) T protein:vir:10 81 DRGRKFTLDAMDVDETNFLVTATTVMGEFQRLKVIPEIDAYRLSRLATIAIGIKGDTNVEYSYSVNSSTIINKIKTGIKI 160 (312) T ss_pred cccceeeccccchhhHhhHHHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhccccccccccccccCHHHHHHHHHHHHHH Confidence 999999999999999999999999999999999999999999999998775432 3457999999999999999 Q ss_pred HHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEEEEecccccc------------------- Q lcl|Aclame:pro 175 LDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQ------------------- 235 (319) Q Consensus 175 Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~~------------------- 235 (319) |||++||++|||+|||+++.+|+++..+.......+.+.++|+|++||||+|++||++||+ T Consensus 161 lde~~vp~~rvl~vTp~~~~lLk~~~~~~~~~~~~~~~~i~~~V~~iDgv~Ii~VPs~r~~t~~~f~dG~t~~~~~gg~~ 240 (312) T protein:vir:10 161 IRENGYNGPLVCHLTYDSMFAIEEKVLEKLTAVTFAQGGIQTQVPSIDGCALIKTPQNRMYSSILLNDGTTSNQTAGGYL 240 (312) T ss_pred HHHccCCCceEEEeChHHHHHHhhhhhceecccccccceeeeeeeeecccEEEEchhhhccceeeeccCcccccccCcee Confidence 9999999999999999999999998777666666677889999999999999999999984 Q ss_pred 
------cceEEEEcCCceeeeeeeeeeeeecCC--CCCccceeeeeeeeeEEEeccccceEEEEccccccCC Q lcl|Aclame:pro 236 ------GLQAIAVVGEVLASPIQADLAKTNSNI--PGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATK 299 (319) Q Consensus 236 ------~~n~i~~~~~A~~~~~k~~~~~~~~~~--~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~ 299 (319) .||||++|++|+++++|++++++|.|. |+.+||+++||.|||+||++||++|||+|..++++-. T Consensus 241 ~~~~ak~INfiiv~~~a~i~~~K~~~~~if~P~~~~~~d~~~~~~R~Y~D~fv~~nk~~~Iyv~~k~a~~~~ 312 (312) T protein:vir:10 241 KGTKALDTNFIIAPVDVPLAITKQDKMRIFDPETNQTANAWSMDYRRYHDLWVTDNKANSVYANFKDAKPVG 312 (312) T ss_pred ecCcccccceEEeCCceeeceeeeeeeeeeCCCCCCCcceeeeeeeeeeeeeeeccccCeEEEEeecccCCC Confidence 499999999999999999999999984 4557899999999999999999999999997666554 No 6 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=100.00 E-value=3.6e-83 Score=472.58 Aligned_cols=272 Identities=13% Similarity=0.121 Sum_probs=244.7 Q ss_pred hhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhccc---CcceeeeCCceEEeeeccccccccccCCC-CcccCCcccc Q lcl|Aclame:pro 18 HFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALI---SNDAIFMEGRSFTVMKGDTTELKDYKRNA-TNEFDHPKIE 93 (319) Q Consensus 18 ~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~---n~~~~~~~g~tVkIp~i~~~g~~DY~r~~-~~~~~~~t~t 93 (319) |= +++|+++|++.||+.+.+.++++.+.. 
+.+++|.||++||||+|+++|++||+|++ ++..++++++ T Consensus 1 MA--------~~n~a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~gl~DY~R~~~g~~~g~~~~~ 72 (299) T protein:vir:79 1 MA--------ALNYAKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTTGRVDSNRDTIAVAQRNYDNA 72 (299) T ss_pred Cc--------cchhHHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEeccccccccccccCCCcccccccCcc Confidence 32 355778999999999988887664432 35678999999999999999999999987 8888899999 Q ss_pred eeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCc----cccccCCHhHHHHHHH Q lcl|Aclame:pro 94 ETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAK----HLTVGTGSDAQYDAVL 169 (319) Q Consensus 94 ~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~----~~~~~~T~~n~~~~i~ 169 (319) |++++|||||||+|.||+||++|+++.+++++++++|++++++||+|+|||++|++++.. .....+|++|+|++|+ T Consensus 73 ~~t~~ldqdr~~~f~vD~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~~g~~~~~~~~T~~n~y~~i~ 152 (299) T protein:vir:79 73 WEPKVLTNQRKWSTLVHPADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWTALGNTADTTVLTTTNVLEVFD 152 (299) T ss_pred eeEEEeeccccceeccchhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhhcCCcccccccCHHHHHHHHH Confidence 999999999999999999999999999999999999999999999999999999877632 3345679999999999 Q ss_pred HHHHHHHhccCC-CCcEEEEChHHHHHHhhhhhhhhccccccc-ceeeeeeeeecCeEEEEeccccccc----------- Q lcl|Aclame:pro 170 DVSVELDEIKAP-ENRVLFVSPTFYKGIKKFVIALPQGDTRQQ-VLGKGVQGELDGFVIVKVPTKLLQG----------- 236 (319) Q Consensus 170 ~a~~~Lde~~VP-~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~-~~~~g~Vg~idG~~I~~vps~~~~~----------- 236 (319) +++++|+|++|| ++|||+|||+++++|+++++|++..+..++ +.++|+||+||||+|++||+++|++ T Consensus 153 ~~~~~lde~~vP~~~rvl~vtp~~~~~L~~~~~f~k~~~~~~~~~~~~g~Vg~idG~~Ii~Vps~r~~t~~~~~~G~~~~ 232 (299) T protein:vir:79 153 KLMEKMTEARVPENGRILYVTPVVNTLIKNAKEIQRTVNIKDAGTSLNRQTTDIDTVKIIKVPSNLMKTAYDFTTGWKVG 232 (299) T ss_pred HHHHHHHhcCCCCCCeEEEeCHHHHHHHhhchhhhcccccccccceeeeeeeeecceEEEEechhhcCccceeccCcccc Confidence 999999999999 699999999999999999999998887644 6899999999999999999999864 Q ss_pred 
-----ceEEEEcCCceeeeeeeeeeeeecCCCCCcc-ceeeeeeeeeEEEeccccceEEEEcccccc Q lcl|Aclame:pro 237 -----LQAIAVVGEVLASPIQADLAKTNSNIPGMFG-TLAEQLLYTGAFVPEHLQKYIFTIGGTEVA 297 (319) Q Consensus 237 -----~n~i~~~~~A~~~~~k~~~~~~~~~~~~~~~-~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a 297 (319) ||||++|++|+++++|++.+++|+|.++..| |+++||.|||+||++||++|||+|..++++ T Consensus 233 ~~ak~in~ii~~~~a~~~~~K~~~~~~~~P~~~~~~~~~~~~r~y~d~~v~~nk~~~i~~~~~~a~~ 299 (299) T protein:vir:79 233 AGAKQIFMSLVHPSAIITPVSYQFSKLDEPTAVTEGKYFYFEESFEDVFILNKKADAIQFVVEGAGA 299 (299) T ss_pred CcccccceEEEcCCeeeeeEeeeeEEeecCCCCCccceeeeeeeeeeeeeeccccCeEEEEeeecCC Confidence 8999999999999999999999998666544 678999999999999999999999987777 No 7 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=100.00 E-value=1.3e-83 Score=474.94 Aligned_cols=269 Identities=20% Similarity=0.208 Sum_probs=249.2 Q ss_pred hhhhhhHhhHHHHHHHHHhhhhhhhcccC---cceeeeCCceEEeeecc-ccccccccCCCCcccCCcccceeEEEEeec Q lcl|Aclame:pro 27 GQTLLKNKHVGILERVTAVNAYSTPALIS---NDAIFMEGRSFTVMKGD-TTELKDYKRNATNEFDHPKIEETTYFLDQE 102 (319) Q Consensus 27 n~~~l~~ky~~lld~~~~~~sl~~~~~~n---~~~~~~~g~tVkIp~i~-~~g~~DY~r~~~~~~~~~t~t~~tltidqd 102 (319) =.++|+++|++.|++.+...++++.+++. 
.+++|.||++||||+|+ ++|++||+|+.|+..|+++++|+|++|+|| T Consensus 1 Main~~~k~~~~ld~~~~~~~~~~~l~~~~n~~~~~~~gak~VkIp~ist~~gl~dY~R~~g~~~g~v~~~~et~tl~~D 80 (285) T protein:vir:79 1 MTVVLDSKDLARIDEEYKADSQVWSYLTGGNGVTQRFRGHNEVRINKLSGFVDATAYKRGQDNARKTISVGKETVKLTHE 80 (285) T ss_pred CcchhhHHHHHHHHHHHHHhhhhhhhcccCCcceeEecCCCEEEEeeecccccccccccccCccccccceeeeEEEeecc Confidence 34558899999999999999998877543 45799999999999997 589999999999999999999999999999 Q ss_pred ccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccCCHhHHHHHHHHHHHHHHhccCCC Q lcl|Aclame:pro 103 KYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDVSVELDEIKAPE 182 (319) Q Consensus 103 r~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~~~~T~~n~~~~i~~a~~~Lde~~VP~ 182 (319) |+|+|.||.||++| ++.+++++++.+|++++++||+|+|||++|+++++...+.++|++|+|++|++++++|+|++||+ T Consensus 81 R~~~f~iD~mDvdE-n~~~~~~ni~~ef~~~~vvPEiDayrfskla~~a~~~~~~~~T~~nv~~~i~~~~~~lde~~vp~ 159 (285) T protein:vir:79 81 DWFGYDLDQFDMDE-NGAYTVENVVREHNKMITIPHRDKVAVQKLFDSAAKKATDSITKDNALDAYDTAEAYMFDNEVPG 159 (285) T ss_pred ccceecccccchhh-hhhhhHHHHHHHHHhhhhcchhhHHHHHHHHhhcccccccccCHHHHHHHHHHHHHHHHHcCCCC Confidence 99999999999999 67899999999999999999999999999999999988899999999999999999999999999 Q ss_pred CcEEEEChHHHHHHhhhhhhhhccccccc---ceeeeeeeeecC-eEEEEeccccccc------ceEEEEcCCceeeeee Q lcl|Aclame:pro 183 NRVLFVSPTFYKGIKKFVIALPQGDTRQQ---VLGKGVQGELDG-FVIVKVPTKLLQG------LQAIAVVGEVLASPIQ 252 (319) Q Consensus 183 ~R~l~VsP~~~~~L~~~~~f~~~~~~~~~---~~~~g~Vg~idG-~~I~~vps~~~~~------~n~i~~~~~A~~~~~k 252 (319) +|||||||++|++|+++++|++..++++. 
+.++|+|++||| ++|++||+++|++ ||||++|++|++.++| T Consensus 160 ~rvl~vTp~~~~~Lk~s~~~~r~~~~~~~~~~~~i~~~V~~lDg~v~ii~Vps~r~kt~~~~k~Infiiv~~~a~i~~~K 239 (285) T protein:vir:79 160 GFVMFVSSAYYTALKQSAAVTRTFSTDGTMVINGIDRRVAQLDGGVPIVRVSSDRLKGLGITNHVNFILTPLSAIAPIVK 239 (285) T ss_pred ceEEEEChHHHHHHHhhhhhheecccccceeccceeeeeccccceeEEEEcchhhccCcCcchhccEEEecCceecccee Confidence 99999999999999999999999887653 468999999999 9999999999964 8999999999999999 Q ss_pred eeeeeeecCCC--CCccceeeeeeeeeEEEeccccceEEEEccccc Q lcl|Aclame:pro 253 ADLAKTNSNIP--GMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEV 296 (319) Q Consensus 253 ~~~~~~~~~~~--~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~ 296 (319) ++.+++|.|++ +.+||+++||.|||+||++||++|||+|..++. T Consensus 240 ~~~~~~f~P~~~~~~d~~~~~~R~Y~d~fv~~nk~~~Iy~~~~a~~ 285 (285) T protein:vir:79 240 YDSVSVIDPSTDRSGNRWTIKGLSYYDAIVLDNAKKGIYVAATAGV 285 (285) T ss_pred eeeeEeECCCCCCCcceeeeeeeeeeeeeehhhccceeeeeecccC Confidence 99999999854 457799999999999999999999999976555 No 8 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=100.00 E-value=4.7e-83 Score=471.93 Aligned_cols=265 Identities=16% Similarity=0.170 Sum_probs=242.2 Q ss_pred hhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccccCCCCcccCCcccceeEEEEeecccce Q lcl|Aclame:pro 27 GQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWG 106 (319) Q Consensus 27 n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~ 106 (319) =.|+|+++|++.||+.+.+.++++.+ .|++++|.||+|||||+|+++|++||+|++|+..++++++|+|++|+|||||+ T Consensus 1 Main~a~~~~~~Ld~~~~~~~~t~~l-~~~~~~~~ggktVkI~~i~~~gl~DY~R~~g~~~g~v~~~~et~tl~qdR~~~ 79 (290) T protein:vir:78 1 MAINYVDKYGKELDQKLVFGTYTNEL-ETPNLLWLDAKTFKIQTITTTGLKAHTRNKGYNEGSASNTNKSYTIDFDRDVE 79 (290) T ss_pred CchhHHHHHHHHHHHHHHhhheeeec-cccceeeccCCEEEEeeeccCcccccccCCCcccCccccceeeEEeeccccce Confidence 34567899999999999999887655 
57789999999999999999999999999999999999999999999999999 Q ss_pred eecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc---ccccCCHhHHHHHHHHHHHHHHhccCC-C Q lcl|Aclame:pro 107 RFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH---LTVGTGSDAQYDAVLDVSVELDEIKAP-E 182 (319) Q Consensus 107 F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~---~~~~~T~~n~~~~i~~a~~~Lde~~VP-~ 182 (319) |.||.||++|+++.+++++++++|++++++||+|+|||++|++.++.. .+.++|++|+|++|++++.+|+| || + T Consensus 80 F~vD~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~~~~~~~~~t~t~~n~~~~i~~~~~~lde--vp~~ 157 (290) T protein:vir:78 80 FFVDVMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAKTNSNSVAEEITKDNVFTKLKAAIRKVKK--YGTQ 157 (290) T ss_pred eeccccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhhccCcccccccCHHHHHHHHHHHHHHHHh--cCCC Confidence 999999999999999999999999999999999999999999888653 34557899999999999999998 67 7 Q ss_pred CcEEEEChHHHHHHhhhhhhhhcccccc--cceeeeeeeeecCeEEEEecc-ccc----------------ccceEEEEc Q lcl|Aclame:pro 183 NRVLFVSPTFYKGIKKFVIALPQGDTRQ--QVLGKGVQGELDGFVIVKVPT-KLL----------------QGLQAIAVV 243 (319) Q Consensus 183 ~R~l~VsP~~~~~L~~~~~f~~~~~~~~--~~~~~g~Vg~idG~~I~~vps-~~~----------------~~~n~i~~~ 243 (319) ||||||||++|++|+++++|++..+.++ .+.++|+|+++|||+|++||+ +|| +.||||++| T Consensus 158 ~rvl~vtp~~~~lL~~~~~f~r~~~~~~~~~~~i~~~V~~idG~~ii~vps~~r~~t~~~f~~G~~~~~~ak~in~ii~~ 237 (290) T protein:vir:78 158 NLVMYVSPDVMAALELSDDFVRAINVQNIGPSSIETRITAIDGTRIVEVEAEDRFYDTFDFTDGYKPAAGAKKLNFLLVN 237 (290) T ss_pred CeEEEECHHHHHHHhhChhhhccccccccccccccceeeeecCcEEEEecccchhhhhhhhcccccccCCccceeEEEEc Confidence 9999999999999999999999877653 467899999999999999997 464 459999999 Q ss_pred CCceeeeeeeeeeeeecCCC--CCccceeeeeeeeeEEEeccccceEEEEccc Q lcl|Aclame:pro 244 GEVLASPIQADLAKTNSNIP--GMFGTLAEQLLYTGAFVPEHLQKYIFTIGGT 294 (319) Q Consensus 244 ~~A~~~~~k~~~~~~~~~~~--~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~ 294 (319) ++|+++++|++++++|.|.. +.+||+++||.|||+||++||++|||+|... 
T Consensus 238 ~~a~i~~~K~~~~~~~~P~~~~~~d~~~~~~r~y~d~~v~~nk~~~i~~~~~~ 290 (290) T protein:vir:78 238 KGSVVGGAKHASIYLHAPGSVGQGDGWLYQYRVYHDIFVLDQQKDGVIASTEV 290 (290) T ss_pred CCceeeeeeeeEEEeeCCCCCcCcceeeeeeeeeeeeeeeccccCeeEEEeeC Confidence 99999999999999999854 4478999999999999999999999999866 No 9 >protein:vir:99523 Length: 311 # NCBI annotation: putative protein # Family: family:all:701 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958538;genbank:gi:41179320;genbank:GeneID:2717161 Probab=100.00 E-value=3.2e-82 Score=467.39 Aligned_cols=271 Identities=17% Similarity=0.177 Sum_probs=242.8 Q ss_pred hhcchhh--hhhhHhhHHHHHHHHHhhhhhhhcccCcce-eeeCCceEEeeeccccccccccCCCCcccCCcccceeEEE Q lcl|Aclame:pro 22 KSVEPGQ--TLLKNKHVGILERVTAVNAYSTPALISNDA-IFMEGRSFTVMKGDTTELKDYKRNATNEFDHPKIEETTYF 98 (319) Q Consensus 22 ~~~~~n~--~~l~~ky~~lld~~~~~~sl~~~~~~n~~~-~~~~g~tVkIp~i~~~g~~DY~r~~~~~~~~~t~t~~tlt 98 (319) --...|+ ++|+++|++.||+++.+.+++..+.+ +++ .+.||+|||||+|+++|++||+|++|+..|+++++|+|++ T Consensus 1 ~~~~an~mAlnya~~~~~~Ld~~~~~~~~t~~l~~-~~~~~~~Gak~VkIp~i~~~gl~dY~R~~g~~~g~v~~~~et~t 79 (311) T protein:vir:99 1 MPTDAETRGFNYVTKDGNLLDQKITAGLFTAALGT-PEVDLVNGGRSFTLKTISTSGLKDHTRGKGFNSGTISDEKTIYT 79 (311) T ss_pred CCCcchhhHHHHHHHHHHHHHHHHHhhhcccceec-CchheeecCCEEEEEeeeeccccccccccCccccceeeeeeEEE Confidence 2245677 45799999999999999998876654 445 4579999999999999999999999999999999999999 Q ss_pred EeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCc---------------cccccCCHhH Q lcl|Aclame:pro 99 LDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAK---------------HLTVGTGSDA 163 (319) Q Consensus 99 idqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~---------------~~~~~~T~~n 163 (319) |+|||+|.|.||.||++|+++.+++++++++|+|++++||+|+|||++||+.+.. 
.....+|++| T Consensus 80 l~~DR~~~f~vD~mDvdETn~~~~~ani~~~f~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~~~~~~~~~lt~~n 159 (311) T protein:vir:99 80 MGQDRDVEFYLDRQDVDETDNELAMANISNVFITEHVQPELDSYRFSKIATSFDNLDGTDTEGTLLAKTHKTEETLDETN 159 (311) T ss_pred eeeccceeeecchhchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhcccccccchhhhccccccccccCHHH Confidence 9999999999999999999999999999999999999999999999999976532 2234579999 Q ss_pred HHHHHHHHHHHHHhccCC-CCcEEEEChHHHHHHhhhhhhhhccccc--ccceeeeeeeeecCeEEEEe-cccccc---- Q lcl|Aclame:pro 164 QYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKFVIALPQGDTR--QQVLGKGVQGELDGFVIVKV-PTKLLQ---- 235 (319) Q Consensus 164 ~~~~i~~a~~~Lde~~VP-~~R~l~VsP~~~~~L~~~~~f~~~~~~~--~~~~~~g~Vg~idG~~I~~v-ps~~~~---- 235 (319) +|+.|++++.+|+| || +||||||||+++++|+++++|.+..+.. ....++++|++|||++|++| |++||+ T Consensus 160 vl~~l~~~~~~~~~--v~~~~rvl~vTp~~~~lLk~~~~~~r~~~~~~~~~~~i~~~V~~lDgv~Ii~V~ps~r~~t~~~ 237 (311) T protein:vir:99 160 AYSQLKTGIGKVRK--YGTQNLVGYVSSEVMDALERSKEFTRNITNQNVGTTALESRITSIDGVQLIEVYESNRFMTKYD 237 (311) T ss_pred HHHHHHHHHHHHHh--cCCCCeEEEEChHHHHHHhhchhhheeeecccccccccccccceecCeEEEEecCchhhcchhh Confidence 99999999999998 56 8999999999999999999999877654 24568999999999999999 999875 Q ss_pred ------------cceEEEEcCCceeeeeeeeeeeeecCC--CCCccceeeeeeeeeEEEeccccceEEEEcccc Q lcl|Aclame:pro 236 ------------GLQAIAVVGEVLASPIQADLAKTNSNI--PGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTE 295 (319) Q Consensus 236 ------------~~n~i~~~~~A~~~~~k~~~~~~~~~~--~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~ 295 (319) .||||++|++|++.++|++++++|.|. 
++.+||+++||.|||+||++||++|||+|..++ T Consensus 238 ft~G~~~~~~ak~INfiiv~~~a~i~~~K~~~v~~f~P~~~~~gd~~l~~~R~Y~D~fv~~nk~~~Iyv~~k~A 311 (311) T protein:vir:99 238 FTDGAKPTEDAKAINFLVVAKPAVISIVKENAVFLFAPGQHTDGDGYLYQNRLYHDLFIKKHKRDGIFVSVKKA 311 (311) T ss_pred hcCCccccCcccccceEEeCCCeeeeeeeeeeeeeeCCCCCCCcceeeeeeeeeeeeeeeccccCeEEEeeecC Confidence 499999999999999999999999984 445789999999999999999999999999766 No 10 >protein:vir:78090 Length: 302 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468790;genbank:gi:157325371;genbank:GeneID:5601852 Probab=100.00 E-value=2.3e-81 Score=462.70 Aligned_cols=270 Identities=16% Similarity=0.189 Sum_probs=246.0 Q ss_pred chhhhhhhHhhHHHHHHHHHhhhhhhhc-ccCcceeeeCCceEEeeecc-----ccccccccCCCCcccCCcccceeEEE Q lcl|Aclame:pro 25 EPGQTLLKNKHVGILERVTAVNAYSTPA-LISNDAIFMEGRSFTVMKGD-----TTELKDYKRNATNEFDHPKIEETTYF 98 (319) Q Consensus 25 ~~n~~~l~~ky~~lld~~~~~~sl~~~~-~~n~~~~~~~g~tVkIp~i~-----~~g~~DY~r~~~~~~~~~t~t~~tlt 98 (319) =+|+|+|+++|++.||+.+.+.+++..+ ..|+.+.|.||+|||||+|+ ++|++||+|++|+..|+++++|+|++ T Consensus 1 Mantl~ya~~~~~~Ld~~~~~~~~t~~l~~~~~~v~~~Gak~vkIp~is~~~~~TsGl~dy~R~~g~~~g~v~~~~et~t 80 (302) T protein:vir:78 1 MANSLALAQIYQDNIDKAIAVNSKSAFLEANPNNVQYNGGNTIKIADISFGSGTTGDLKAYNRSTGFTQGSVTLAWSDYT 80 (302) T ss_pred CCchhHHHHHHHHHHHHHHHhhhceeecccCCceEEEecCcEEEEEEEEeeccccccccccccccCccccceeeeeeeEE Confidence 4589999999999999999999988666 45667899999999999998 46999999999999999999999999 Q ss_pred EeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc------ccccCCHhHHHHHHHHHH Q lcl|Aclame:pro 99 LDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH------LTVGTGSDAQYDAVLDVS 172 (319) Q Consensus 99 idqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~------~~~~~T~~n~~~~i~~a~ 172 (319) |+|||+|.|.||.||++|+++.+++++++++|+|++++||+|+||||+|++.+... 
.+..+|++|+|+.|+.++ T Consensus 81 lt~DR~~~f~vD~mDvdETn~~~~~ani~~ef~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~t~~nvl~~i~~~~ 160 (302) T protein:vir:78 81 LDYDLAQSFQIDAMDVDETKNLATVGNVLSEYQRTKIVPAIDKYRFTKLANDGTGVGGVIDLSKPDASAQALMGDIATAM 160 (302) T ss_pred eeeccceeeeccccchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHHhhhccCccccccccchhHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999766432 223468999999999999 Q ss_pred HHHHhccCCCCcEEEEChHHHHHHhhhhhhhhccccc--ccceeeeeeeeecCeEEEEeccccccc-------------- Q lcl|Aclame:pro 173 VELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTR--QQVLGKGVQGELDGFVIVKVPTKLLQG-------------- 236 (319) Q Consensus 173 ~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~--~~~~~~g~Vg~idG~~I~~vps~~~~~-------------- 236 (319) .+|+|+ ++|+|||+|+++.+|++++.|.+..+.. +.+.++++|++|||++|++||++||++ T Consensus 161 ~~~~e~---~~~vl~vtp~~~~~Lk~a~~~~~~~~~~~~~~~~i~~~V~~lDgv~Ii~VPs~r~~t~~~f~~G~~~~~~a 237 (302) T protein:vir:78 161 ELVDDS---NQLILVTSPTTLAGLLNTALIRESKNTQVLRRGEVDTKITFIQDVEVLQVPSEYLYDKVAPKVGVPDYTGA 237 (302) T ss_pred HHhhcc---CCeEEEEChHHHHHHhcchhhccceeccccccccccceeeeecccEEEEchhhhcccceeccCCccccCCc Confidence 999996 5999999999999999999999876553 456789999999999999999999854 Q ss_pred --ceEEEEcCCceeeeeeeeeeeeecCCCCCc--cceeeeeeeeeEEEeccccceEEEEcccccc Q lcl|Aclame:pro 237 --LQAIAVVGEVLASPIQADLAKTNSNIPGMF--GTLAEQLLYTGAFVPEHLQKYIFTIGGTEVA 297 (319) Q Consensus 237 --~n~i~~~~~A~~~~~k~~~~~~~~~~~~~~--~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a 297 (319) ||||++|++|++.++|++++++|.|.++++ +|+++||.|||+||++||++|||+|...+.| T Consensus 238 k~INfiiv~~~a~ia~~K~~~~~if~P~~~~~gd~~l~~~R~Y~D~fV~~nk~~gI~~~~~~~~~ 302 (302) T protein:vir:78 238 KKIPYMIFKRDAPTGIVKTDKVRVFEPDTNQSADAYKVDLRLYHDLIVPKNQRPGIIKASFGTIA 302 (302) T ss_pred cceeEEEECCCeeeeeeeeeeeEeeCCCCCCCcceeeeeeeeEeeeeeeccccCeEEEeeccccC Confidence 999999999999999999999999866664 6899999999999999999999999977777 No 11 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: 
genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=100.00 E-value=1.4e-50 Score=293.89 Aligned_cols=265 Identities=20% Similarity=0.183 Sum_probs=219.0 Q ss_pred chhhhhhhHhhHHHHHHHHHhhhhhhhcccCccee--eeCCceEEeeeccccccccccCCCC-cccCCcccceeEEEEee Q lcl|Aclame:pro 25 EPGQTLLKNKHVGILERVTAVNAYSTPALISNDAI--FMEGRSFTVMKGDTTELKDYKRNAT-NEFDHPKIEETTYFLDQ 101 (319) Q Consensus 25 ~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~--~~~g~tVkIp~i~~~g~~DY~r~~~-~~~~~~t~t~~tltidq 101 (319) =.+++.++|+|.+.+.+.+...+ ....++|++|. ...|++|+||+++..+++||++.++ ...++++.++.+++||+ T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~l-v~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~ 79 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQT-VFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQ 79 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhh-ccchhhccccccccccCceEEEeecccccccccccCCCccCccccccceEEEEEee Confidence 13344478899888888876554 45667888884 4569999999999999999998765 56678899999999999 Q ss_pred cccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc-ccccCCHhHHHHHHHHHHHHHHhccC Q lcl|Aclame:pro 102 EKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH-LTVGTGSDAQYDAVLDVSVELDEIKA 180 (319) Q Consensus 102 dr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~-~~~~~T~~n~~~~i~~a~~~Lde~~V 180 (319) +|++.|.||++|..++... +.. ..+++.+++++++|+++++.++..+... 
....++++++|+.|+++..+|||++| T Consensus 80 ~~~~~~~i~d~d~~~~~~~--~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~~~ld~~~v 156 (273) T protein:vir:10 80 EKSIDFLVDDIDRVQVAGS--LEA-YTRAGATALATDTDKFIADMLVDNGTALTGSAPTDADDAFDLIAKALKELTKANV 156 (273) T ss_pred eeecceEeecHHHhhhhcc--HHH-HHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHHHHHHHHHHHHhhhcCC Confidence 9999999999998887654 455 5678899999999999999987754433 34557889999999999999999999 Q ss_pred C-CCcEEEEChHHHHHHhhhhhhhhcccc-c-ccceeeeeeeeecCeEEEEeccc-ccccceEEEEcCCceeeeeeeeee Q lcl|Aclame:pro 181 P-ENRVLFVSPTFYKGIKKFVIALPQGDT-R-QQVLGKGVQGELDGFVIVKVPTK-LLQGLQAIAVVGEVLASPIQADLA 256 (319) Q Consensus 181 P-~~R~l~VsP~~~~~L~~~~~f~~~~~~-~-~~~~~~g~Vg~idG~~I~~vps~-~~~~~n~i~~~~~A~~~~~k~~~~ 256 (319) | ++||++|+|+++..|+++++|.+..+. + +..+++|.||+++||+|+++++- ...+..++++|++|++++.|++++ T Consensus 157 P~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~~~A~~~a~q~~~~ 236 (273) T protein:vir:10 157 PNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQIDTV 236 (273) T ss_pred CcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCCccEEEEEeccceeeeeeeehh Confidence 9 799999999999999999987765443 3 34678999999999999975321 113456899999999999999999 Q ss_pred eeecCCCCCccceeeeeeeeeEEEeccccceEEEEccc Q lcl|Aclame:pro 257 KTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGT 294 (319) Q Consensus 257 ~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~ 294 (319) |..+ .+++||+.|+||++||++|++|++..++-..++ T Consensus 237 e~~r-~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 237 EALR-DQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred hccc-CCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 9988 588899999999999999999998766655555 No 12 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=100.00 E-value=1.4e-50 Score=293.89 Aligned_cols=265 Identities=20% Similarity=0.183 Sum_probs=219.0 Q ss_pred 
chhhhhhhHhhHHHHHHHHHhhhhhhhcccCccee--eeCCceEEeeeccccccccccCCCC-cccCCcccceeEEEEee Q lcl|Aclame:pro 25 EPGQTLLKNKHVGILERVTAVNAYSTPALISNDAI--FMEGRSFTVMKGDTTELKDYKRNAT-NEFDHPKIEETTYFLDQ 101 (319) Q Consensus 25 ~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~--~~~g~tVkIp~i~~~g~~DY~r~~~-~~~~~~t~t~~tltidq 101 (319) =.+++.++|+|.+.+.+.+...+ ....++|++|. ...|++|+||+++..+++||++.++ ...++++.++.+++||+ T Consensus 1 MA~~~~~pe~~~~~v~~~~~~~l-v~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~ 79 (273) T protein:vir:10 1 MAFNNFIPELWSDMLLEEWTAQT-VFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQ 79 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhh-ccchhhccccccccccCceEEEeecccccccccccCCCccCccccccceEEEEEee Confidence 13344478899888888876554 45667888884 4569999999999999999998765 56678899999999999 Q ss_pred cccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc-ccccCCHhHHHHHHHHHHHHHHhccC Q lcl|Aclame:pro 102 EKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH-LTVGTGSDAQYDAVLDVSVELDEIKA 180 (319) Q Consensus 102 dr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~-~~~~~T~~n~~~~i~~a~~~Lde~~V 180 (319) +|++.|.||++|..++... +.. ..+++.+++++++|+++++.++..+... ....++++++|+.|+++..+|||++| T Consensus 80 ~~~~~~~i~d~d~~~~~~~--~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~~~ld~~~v 156 (273) T protein:vir:10 80 EKSIDFLVDDIDRVQVAGS--LEA-YTRAGATALATDTDKFIADMLVDNGTALTGSAPTDADDAFDLIAKALKELTKANV 156 (273) T ss_pred eeecceEeecHHHhhhhcc--HHH-HHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHHHHHHHHHHHHhhhcCC Confidence 9999999999998887654 455 5678899999999999999987754433 34557889999999999999999999 Q ss_pred C-CCcEEEEChHHHHHHhhhhhhhhcccc-c-ccceeeeeeeeecCeEEEEeccc-ccccceEEEEcCCceeeeeeeeee Q lcl|Aclame:pro 181 P-ENRVLFVSPTFYKGIKKFVIALPQGDT-R-QQVLGKGVQGELDGFVIVKVPTK-LLQGLQAIAVVGEVLASPIQADLA 256 (319) Q Consensus 181 P-~~R~l~VsP~~~~~L~~~~~f~~~~~~-~-~~~~~~g~Vg~idG~~I~~vps~-~~~~~n~i~~~~~A~~~~~k~~~~ 256 (319) | ++||++|+|+++..|+++++|.+..+. 
+ +..+++|.||+++||+|+++++- ...+..++++|++|++++.|++++ T Consensus 157 P~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~~~A~~~a~q~~~~ 236 (273) T protein:vir:10 157 PNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQIDTV 236 (273) T ss_pred CcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCCccEEEEEeccceeeeeeeehh Confidence 9 799999999999999999987765443 3 34678999999999999975321 113456899999999999999999 Q ss_pred eeecCCCCCccceeeeeeeeeEEEeccccceEEEEccc Q lcl|Aclame:pro 257 KTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGT 294 (319) Q Consensus 257 ~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~ 294 (319) |..+ .+++||+.|+||++||++|++|++..++-..++ T Consensus 237 e~~r-~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 237 EALR-DQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred hccc-CCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 9988 588899999999999999999998766655555 No 13 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=100.00 E-value=2.4e-49 Score=287.17 Aligned_cols=265 Identities=19% Similarity=0.176 Sum_probs=217.2 Q ss_pred chhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceee--eCCceEEeeeccccccccccCCCC-cccCCcccceeEEEEee Q lcl|Aclame:pro 25 EPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIF--MEGRSFTVMKGDTTELKDYKRNAT-NEFDHPKIEETTYFLDQ 101 (319) Q Consensus 25 ~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~--~~g~tVkIp~i~~~g~~DY~r~~~-~~~~~~t~t~~tltidq 101 (319) =++++.++|.|.+.+.+.+..++ +...++|++|.+ ..|+||+||+++..+++||++.++ ...++++.++.+++|+| T Consensus 1 MA~~~~~pei~~~~v~~~~~~~l-v~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid~ 79 (273) T protein:vir:79 1 MAFNNFIPELWSDMLLEEWTAQT-VFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLIDQ 79 (273) T ss_pred CcchhhhHHHHHHHHHHHHHhhc-cchhhhhccccccccCCcEEEEeecCcccccccccCCCccCccccccceEEEEEee Confidence 12344578899888877776554 455678888855 469999999999999999998765 56678999999999999 Q 
ss_pred cccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCc-cccccCCHhHHHHHHHHHHHHHHhccC Q lcl|Aclame:pro 102 EKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAK-HLTVGTGSDAQYDAVLDVSVELDEIKA 180 (319) Q Consensus 102 dr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~-~~~~~~T~~n~~~~i~~a~~~Lde~~V 180 (319) +|++.|.||++|..++... +.. ..+++.+++++++|+++++.++..+.. ......+++++|+.|+++..+|||++| T Consensus 80 ~~~~~~~i~d~d~~~~~~~--~~~-~~~~~~~ala~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~~~ld~~~v 156 (273) T protein:vir:79 80 EKSIDFLVDDIDRVQVAGS--LEA-YTRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTKANV 156 (273) T ss_pred ecccceeeccHHHHhhccc--HHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccccccchhhHHHHHHHHHHHhhhccC Confidence 9999999999999988764 455 567889999999999999988764433 234556889999999999999999999 Q ss_pred C-CCcEEEEChHHHHHHhhhhhhh-hcccccc-cceeeeeeeeecCeEEEEecccc-cccceEEEEcCCceeeeeeeeee Q lcl|Aclame:pro 181 P-ENRVLFVSPTFYKGIKKFVIAL-PQGDTRQ-QVLGKGVQGELDGFVIVKVPTKL-LQGLQAIAVVGEVLASPIQADLA 256 (319) Q Consensus 181 P-~~R~l~VsP~~~~~L~~~~~f~-~~~~~~~-~~~~~g~Vg~idG~~I~~vps~~-~~~~n~i~~~~~A~~~~~k~~~~ 256 (319) | ++|||+|+|+++..|+++++|. .....++ ..+++|.||+++||+|+++++-. 
..+..++++|++|++++.|++.+ T Consensus 157 P~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~i~~s~~lp~~~~~~~~a~~~~A~~~a~~~~~~ 236 (273) T protein:vir:79 157 PNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYVSQIDTV 236 (273) T ss_pred CccCcEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEeeEEeceEEEecccccccCceEEEEEeccceeeeeehhhh Confidence 9 7999999999999999988754 4444443 46889999999999999753321 23456899999999999999999 Q ss_pred eeecCCCCCccceeeeeeeeeEEEeccccceEEEEccc Q lcl|Aclame:pro 257 KTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGT 294 (319) Q Consensus 257 ~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~ 294 (319) |..| ++++|++.|+|+++||++|++|++..++-..++ T Consensus 237 e~~r-~~~~~~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 237 EALR-DQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred hccc-CcccceeeeeeeeeeeeEEecCceEEEEeccCC Confidence 9998 488899999999999999999998766555555 No 14 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=100.00 E-value=6.9e-47 Score=273.64 Aligned_cols=275 Identities=19% Similarity=0.078 Sum_probs=218.1 Q ss_pred eeehhhhhhhhhcc----------hhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccccC Q lcl|Aclame:pro 12 LKLNLQHFANKSVE----------PGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKR 81 (319) Q Consensus 12 ~~~~~~~~~~~~~~----------~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~r 81 (319) .+.+-+|.-+|... 
.+++-+ |+|.+.+++.|...+..-....-++ ..+|++|+||+++...+++|++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~d~~~al~l-e~~~geV~~~f~~~s~~~~~~~~r~--i~~G~tv~i~~ig~~~~~~~~~ 77 (332) T protein:vir:78 1 MTTLSNFSLPNQANGGARNADYDVRYATAL-KLFSGEVFTAFNNASIFKGLVRSYD--LRGGKSKQFMFTGKLSAGYHTP 77 (332) T ss_pred CcccccccCCccccCCccccccccchhhhh-hhhhhhHHHHHHHHhhhhhcccccc--ccccceEEEEeccceeEeeecC Confidence 45555777777662 123445 8999999999988877544433343 3489999999999999999999 Q ss_pred CCCcccC-CcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCc------- Q lcl|Aclame:pro 82 NATNEFD-HPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAK------- 153 (319) Q Consensus 82 ~~~~~~~-~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~------- 153 (319) ..+..+. +++.+..+++||+.+||+|.||++|+.|+.. ++...++++++++|+.++|++++..++..+.. T Consensus 78 g~~l~~~~~~~~~~~~l~ID~~ky~~~~VddiD~~q~~~--dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~~ 155 (332) T protein:vir:78 78 GTPIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQY--STRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGE 155 (332) T ss_pred CCCCCCCCCCCCceEEEEEehhhhhHHHHHhHHHHhcCc--chHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCccccc Confidence 8888775 5899999999999999999999999999875 56888999999999999999999988754322 Q ss_pred --------cccccCCHhHHHHHHHHHHHHHHhccCC-CCcEEEEChHHHHHHhh--hhhhhhccccc-ccceeeee-eee Q lcl|Aclame:pro 154 --------HLTVGTGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKK--FVIALPQGDTR-QQVLGKGV-QGE 220 (319) Q Consensus 154 --------~~~~~~T~~n~~~~i~~a~~~Lde~~VP-~~R~l~VsP~~~~~L~~--~~~f~~~~~~~-~~~~~~g~-Vg~ 220 (319) .....+++.++|++|+++.++|+|++|| ++||++|+|++|.+|++ +++|......+ +..+++|. 
|++ T Consensus 156 ~g~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~~~~~~g~~i~~ 235 (332) T protein:vir:78 156 PGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYS 235 (332) T ss_pred ccccccccCCccccCHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCceeeeeeccccccceecceeeeE Confidence 1123457889999999999999999999 79999999999999987 67787665444 34677875 999 Q ss_pred ecCeEEEEecccccc----------------------cceEEEEcCCceeeee----eeeeeeeecCCCCCccceeeeee Q lcl|Aclame:pro 221 LDGFVIVKVPTKLLQ----------------------GLQAIAVVGEVLASPI----QADLAKTNSNIPGMFGTLAEQLL 274 (319) Q Consensus 221 idG~~I~~vps~~~~----------------------~~n~i~~~~~A~~~~~----k~~~~~~~~~~~~~~~~~v~gr~ 274 (319) ++||+|+++++-... +.-.++.|++|+.+++ |+++.+.++ .+++++|.|+|++ T Consensus 236 i~G~~V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~-~~~~~~d~i~~~~ 314 (332) T protein:vir:78 236 IAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDF-NVQYQGDLIVGKL 314 (332) T ss_pred EeeeEEEecCccccCcccccccccccccccccccccccceEEeecccceeeeeeeccchhhhhccc-chhhhHhhhhhhh Confidence 999999986432110 1124788999999887 444666666 5899999999999 Q ss_pred eeeEEEeccccceEEEEc Q lcl|Aclame:pro 275 YTGAFVPEHLQKYIFTIG 292 (319) Q Consensus 275 ~yg~~V~~~k~~~Iy~~~ 292 (319) +||++++||+..+.+... 
T Consensus 315 ~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 315 AMGCGSLRTSVAGSFQAA 332 (332) T ss_pred hhcCceecccceEEEeeC Confidence 999999999997654333 No 15 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=100.00 E-value=4.5e-47 Score=274.67 Aligned_cols=271 Identities=13% Similarity=0.031 Sum_probs=205.9 Q ss_pred cccceeeehhhhhhhhhcchhhhhh--hHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccccCCCC Q lcl|Aclame:pro 7 NATGMLKLNLQHFANKSVEPGQTLL--KNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRNAT 84 (319) Q Consensus 7 ~~~~~~~~~~~~~~~~~~~~n~~~l--~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~r~~~ 84 (319) -++|- -.+|++++ .|.|++.+...+..+ |....+++ ...+..|++||||+|+.++++||+++++ T Consensus 1 ~~~~n------------~ts~~qafi~~EiWsa~il~~l~~~-Lv~~~~~~-~~d~g~GDtV~InsIg~~tV~dY~~~~~ 66 (322) T protein:vir:31 1 MSTGN------------NTSNTQALIVSEIWADEIEDILHEK-LLDVNIAR-VVDFPDGDKLTIPSVGTPVVRSRPEQGD 66 (322) T ss_pred CCCCC------------CcccceEEeehhhhHHHHHHHhhhh-hhhhhhhc-ccccCCCCeEEeccccccccccccCCCC Confidence 12222 24454433 578877776665544 34444434 4555679999999999999999999999 Q ss_pred cccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc---------- Q lcl|Aclame:pro 85 NEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH---------- 154 (319) Q Consensus 85 ~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~---------- 154 (319) +++++++.+..+++|||+|||+|.||| |..|++. ++.+.++++++++++.++|+|..+.|+.++... 
T Consensus 67 i~~d~ltt~~~~l~IDq~KYfaf~VdD-D~~Qa~~--dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~vin 143 (322) T protein:vir:31 67 FTFDNLDTGEISIILRDEVYAGNAISK-KLRQDSR--WISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNVIN 143 (322) T ss_pred cccccCCCceEEEEEehhhhhccccch-hHHHhhh--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceec Confidence 999999999999999999999999999 9999874 668999999999999999999999887655211 Q ss_pred ------ccccCCHhHHHHHHHHHHHHHHhccCC-CCcEEEEChHHHHH---------Hhhhhhhhhcccccccceeee-- Q lcl|Aclame:pro 155 ------LTVGTGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKG---------IKKFVIALPQGDTRQQVLGKG-- 216 (319) Q Consensus 155 ------~~~~~T~~n~~~~i~~a~~~Lde~~VP-~~R~l~VsP~~~~~---------L~~~~~f~~~~~~~~~~~~~g-- 216 (319) ...+..+.++|+.|+++..+|||++|| +|||++|+|.+++. |+++++|++....|. .+| T Consensus 144 ~~~~~iv~~gt~~~~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~---a~g~~ 220 (322) T protein:vir:31 144 GVPHRFVGTGTDQTMDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESGI---APDMQ 220 (322) T ss_pred CCccceeccCCCchhhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhccccccccccccc---hhhHH Confidence 112345678999999999999999999 79999999999764 577888886544442 223 Q ss_pred eeeeecCeEEEEecccccccceEEE---------------------EcCCceeeeeeeeeeeeecCCCCCccceeeeeee Q lcl|Aclame:pro 217 VQGELDGFVIVKVPTKLLQGLQAIA---------------------VVGEVLASPIQADLAKTNSNIPGMFGTLAEQLLY 275 (319) Q Consensus 217 ~Vg~idG~~I~~vps~~~~~~n~i~---------------------~~~~A~~~~~k~~~~~~~~~~~~~~~~~v~gr~~ 275 (319) .||++.||+|+++++-.-.++.+++ ++.+..++..++.+.|.|| .|++|++..+||.+ T Consensus 221 ~Vg~~~GF~V~~SN~l~~~~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~~~l~~~e~~r-~~~~~~d~~~~~~~ 299 (322) T protein:vir:31 221 FVRSVYGIDLFVSNLLADANETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEMPTTKSFI-DDYNDDLNTATTAR 299 (322) T ss_pred HHHHHhceeeeeeccccccccccccCcccccccceeecccccccchhhhhhhhHhhhhhhhhccc-Cccccccceeeeee Confidence 5999999999975432112222333 3555666667888889998 69999999999999 Q ss_pred eeEEEeccccceEEEEccccccCC Q lcl|Aclame:pro 276 TGAFVPEHLQKYIFTIGGTEVATK 299 (319) Q 
Consensus 276 yg~~V~~~k~~~Iy~~~~~~~a~~ 299 (319) ||..|++++..+..... ..|.+- T Consensus 300 ~g~g~~r~e~l~~~~a~-~~~~~~ 322 (322) T protein:vir:31 300 WGNGLVRDENLVCVLAN-ADKVTF 322 (322) T ss_pred ecceeecccceEEEEec-cccccC Confidence 99999999997664332 233332 No 16 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=100.00 E-value=9.4e-45 Score=261.96 Aligned_cols=285 Identities=14% Similarity=0.042 Sum_probs=215.3 Q ss_pred cccccceeee-hhhhhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccccCCC Q lcl|Aclame:pro 5 IKNATGMLKL-NLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRNA 83 (319) Q Consensus 5 ~~~~~~~~~~-~~~~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~r~~ 83 (319) .-|-.+-=+| ..++...-+.....+-+ |+|++.+++.|...+..- .+++. ....+|++|+||+++...+.+|++.. T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~i-e~~~g~V~~~f~~~s~~~-~~v~~-r~~~~G~sv~i~~iG~~t~~~~~~g~ 77 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFL-KVFGGEVLTAFARTSVTM-PRHML-RSIASGKSAQFPVIGRTKAAYLKPGE 77 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHH-HHHHHHHHHHHHHHHhhh-hhhcc-ccccccceeEeeeccceeeeeecCCC Confidence 1111111122 23344433444444555 999999998888777543 33442 24457999999999999999999877 Q ss_pred Ccc--cCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc---C------ Q lcl|Aclame:pro 84 TNE--FDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNK---A------ 152 (319) Q Consensus 84 ~~~--~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a---~------ 152 (319) ... ..+++.+..+++||+.+||.|.||++|+.|+.. ++...+++++.++|+...|++++..++..+ . 
T Consensus 78 ~l~~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~--D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~ 155 (347) T protein:vir:33 78 NLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHY--DVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNENI 155 (347) T ss_pred CCCCCCCCCccceEEEEechhhhhhHHHhhHHHHhcCC--chhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Confidence 553 456888999999999999999999999999864 567889999999999999999987654211 0 Q ss_pred --------ccc---cc------cCCHhHHHHHHHHHHHHHHhccCC-CCcEEEEChHHHHHHhhhhhhhhccccccccee Q lcl|Aclame:pro 153 --------KHL---TV------GTGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLG 214 (319) Q Consensus 153 --------~~~---~~------~~T~~n~~~~i~~a~~~Lde~~VP-~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~ 214 (319) ... +. ..++.++|+.|+++.++|+|++|| ++||++|+|++|.+|+++++|......++..+. T Consensus 156 ~~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~d~~~~~~~~ 235 (347) T protein:vir:33 156 EGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALLDPE 235 (347) T ss_pred ccccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhccccccccccccccccc Confidence 000 01 113568999999999999999999 799999999999999999999866555667789 Q ss_pred eeeeeeecCeEEEEeccccccc-------------------------------ceEEEEcCCceeeeeeee-eeeeecCC Q lcl|Aclame:pro 215 KGVQGELDGFVIVKVPTKLLQG-------------------------------LQAIAVVGEVLASPIQAD-LAKTNSNI 262 (319) Q Consensus 215 ~g~Vg~idG~~I~~vps~~~~~-------------------------------~n~i~~~~~A~~~~~k~~-~~~~~~~~ 262 (319) +|.|++++||+|+++++- ... 
.-.++.|++|+..+..++ ++|.++ + T Consensus 236 ~G~V~~i~G~~V~~Sn~l-p~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r-~ 313 (347) T protein:vir:33 236 RGTIRNVMGFEVVEVPHL-TAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERAR-R 313 (347) T ss_pred cceeEEEeceeEEEeccc-ccCccccccccccccccccccCCcccceeccccceeeeeecchhheeeeeeceeeeecc-c Confidence 999999999999986431 100 113688999998888877 899998 5 Q ss_pred CCCccceeeeeeeeeEEEeccccceEEEEccccccCCC Q lcl|Aclame:pro 263 PGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKR 300 (319) Q Consensus 263 ~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~ 300 (319) +.+++|+|+|+++||++|+||+..+.+.-.... + T Consensus 314 ~~~~~d~i~~~~~~G~~vlrP~~av~i~~~~~~----~ 347 (347) T protein:vir:33 314 ANYQADQIIAKYAMGHGGLRPEAAGAIVLPKVS----E 347 (347) T ss_pred hhhhhHhhhhhhhcCCceecccceEEEecCCCC----C Confidence 999999999999999999999996655433211 1 No 17 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=100.00 E-value=1.6e-44 Score=260.63 Aligned_cols=280 Identities=11% Similarity=0.030 Sum_probs=215.3 Q ss_pred eeehhhhhhhhhcchhhhh-hhHhhHHHHHHHHHhhhhhhhcccCcce--eeeCCceEEeeeccccccccccCCCCcccC Q lcl|Aclame:pro 12 LKLNLQHFANKSVEPGQTL-LKNKHVGILERVTAVNAYSTPALISNDA--IFMEGRSFTVMKGDTTELKDYKRNATNEFD 88 (319) Q Consensus 12 ~~~~~~~~~~~~~~~n~~~-l~~ky~~lld~~~~~~sl~~~~~~n~~~--~~~~g~tVkIp~i~~~g~~DY~r~~~~~~~ 88 (319) .+|.--..-..|.+++... 
.+|.|.+.+.+.+..++ ++..++ ++| .+..|++|+||+++..++.||+++....++ T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~-v~~~~~-~d~~~~~~~Gdtv~ip~~g~~~~~d~~~~~~i~~~ 78 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKM-LDTSVV-KTWGAQVKKGDTFHVPRISELGVEDKATDVPVGVQ 78 (341) T ss_pred CcchhhhccccccchhHHHHHHHHHHHHHHHHHHhhc-chhhcc-ccccccccCCceEEEeccCcceeeeecCCCccccc Confidence 1222122223344455443 46999988888886554 455544 566 445699999999999999999999899999 Q ss_pred CcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccc-------cccC-- Q lcl|Aclame:pro 89 HPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHL-------TVGT-- 159 (319) Q Consensus 89 ~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~-------~~~~-- 159 (319) +++.++.+++||++++++|.||++|..|+.. ++.....++++++++.++|+++++.++..+.... .... T Consensus 79 ~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~--d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~~~~~~~t~ 156 (341) T protein:vir:94 79 PVNDTDFVITVDTDRTTAVALDDLLEIQASY--DLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFSSSNGAITG 156 (341) T ss_pred cccCceEEEEEeeeeecceeechHHHHhhcc--chHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCccccCccccccC Confidence 9999999999999999999999999998864 6678888999999999999999998875543211 1111 Q ss_pred -CHhHHHHHHHHHHHHHHhccCC-CCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEEEEeccccc--- Q lcl|Aclame:pro 160 -GSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLL--- 234 (319) Q Consensus 160 -T~~n~~~~i~~a~~~Lde~~VP-~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~--- 234 (319) .....|+.|+++.++|||++|| ++|||+|+|+++..|+++++|.+....++..+++|.||+++||+|+++++-.. 
T Consensus 157 ~~~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~~~l~~G~ig~i~G~~V~~Sn~lp~~~~ 236 (341) T protein:vir:94 157 NGQAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQFISKDFINNAPIAQGQIGSLMGVRVIRTSLIGNNSA 236 (341) T ss_pred chhhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhhhccccchhheeeeeeEeceEEEEecccccccc Confidence 2345799999999999999999 79999999999999999999999887777778999999999999998643211 Q ss_pred --------------------------------ccceEEEEcCCceeeee------------eeeeeeeecCCCCCcccee Q lcl|Aclame:pro 235 --------------------------------QGLQAIAVVGEVLASPI------------QADLAKTNSNIPGMFGTLA 270 (319) Q Consensus 235 --------------------------------~~~n~i~~~~~A~~~~~------------k~~~~~~~~~~~~~~~~~v 270 (319) ..+.-+++|++|+..+. |...++..+ .+.+++|++ T Consensus 237 ~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~i 315 (341) T protein:vir:94 237 TGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQSF-ENREQVWLM 315 (341) T ss_pred ccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhccccccccccccc-hhhhhhhhh Confidence 01234788888865542 333333333 456789999 Q ss_pred eeeeeeeEEEeccccceEEEEcccccc Q lcl|Aclame:pro 271 EQLLYTGAFVPEHLQKYIFTIGGTEVA 297 (319) Q Consensus 271 ~gr~~yg~~V~~~k~~~Iy~~~~~~~a 297 (319) .||.+||++|+||+. ++.++++.++. 
T Consensus 316 ~~~~~~G~~~lrp~~-~v~~~~~~~~~ 341 (341) T protein:vir:94 316 VGRQAYGARLYRPLH-AVNIHTTGDTV 341 (341) T ss_pred hhhhhhcccccCcce-eEEEecCcCCC Confidence 999999999999999 57777765555 No 18 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=100.00 E-value=2.5e-44 Score=259.61 Aligned_cols=287 Identities=13% Similarity=0.014 Sum_probs=219.0 Q ss_pred CCcccccccceeeehhhhhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeecccccccccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYK 80 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~ 80 (319) |-.++- |-.--..+..++-+.+.+.+.+ |.|++.++..|...+..- .+++. ....+|++|+||+++...+.+|+ T Consensus 1 ma~~~~---~~~~~t~~~~~~~~~~~~a~~i-e~f~g~V~~~f~~~s~~~-~~~~~-~~~~~G~sv~i~~ig~~t~~~~~ 74 (347) T protein:vir:15 1 MANIQG---GQQIGTNQGKGQSAADKLALFL-KVFGGEVLTAFARTSVTM-PRHML-RSIASGKSAQFPVIGRTKAAYLK 74 (347) T ss_pred CCcccc---CCccccccccCCCcchHHHHHH-HHHHHHHHHHHHHhhhhh-hcccc-ccccccceeEeeeccceeeeeec Confidence 433332 1111133444444556666666 899999999998777543 33332 24457999999999999999999 Q ss_pred CCCCc--ccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCc----- Q lcl|Aclame:pro 81 RNATN--EFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAK----- 153 (319) Q Consensus 81 r~~~~--~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~----- 153 (319) +.... ...+++.+..+++||+.+||.|.||++|..|+.. ++....++++.++|+..+|++++..++..+.. 
T Consensus 75 ~g~~l~~~~~~~~~~e~~ltID~~~~~~~~VddlD~~q~~~--D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~ 152 (347) T protein:vir:15 75 PGENLDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHY--DVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASN 152 (347) T ss_pred cCCCCCCCCCCCccceEEEEechhhhhhHHhhhHHHHhcCC--cchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 87754 4466889999999999999999999999999875 56788999999999999999999877542110 Q ss_pred ---------c---cccc---------CCHhHHHHHHHHHHHHHHhccCC-CCcEEEEChHHHHHHhhhhhhhhccccccc Q lcl|Aclame:pro 154 ---------H---LTVG---------TGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKFVIALPQGDTRQQ 211 (319) Q Consensus 154 ---------~---~~~~---------~T~~n~~~~i~~a~~~Lde~~VP-~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~ 211 (319) . .... ....++++.|+++.++|+|++|| ++||++|+|++|..|+++++|......++. T Consensus 153 ~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~~~~~d~~~~~ 232 (347) T protein:vir:15 153 ENIEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALI 232 (347) T ss_pred ccccccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhcccccccccccccc Confidence 0 0000 11357899999999999999999 799999999999999999999877666666 Q ss_pred ceeeeeeeeecCeEEEEecccccc------------------------------cceEEEEcCCceeeeeeee-eeeeec Q lcl|Aclame:pro 212 VLGKGVQGELDGFVIVKVPTKLLQ------------------------------GLQAIAVVGEVLASPIQAD-LAKTNS 260 (319) Q Consensus 212 ~~~~g~Vg~idG~~I~~vps~~~~------------------------------~~n~i~~~~~A~~~~~k~~-~~~~~~ 260 (319) .+++|.|++++||+|+++++-... 
..-.++.|++|+..+..++ ++|.++ T Consensus 233 ~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~ 312 (347) T protein:vir:15 233 DHERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERAR 312 (347) T ss_pred cccceEEEEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeEeeceeeeecc Confidence 789999999999999986432100 0113788999999888776 899887 Q ss_pred CCCCCccceeeeeeeeeEEEeccccceEEEEccccccCCC Q lcl|Aclame:pro 261 NIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKR 300 (319) Q Consensus 261 ~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~ 300 (319) ++.+++|+|+|+++||++|+||+..+.+.-.... + T Consensus 313 -~~~~~~d~i~~~~~~G~~vlrP~~av~~~~~~~~----~ 347 (347) T protein:vir:15 313 -RANYQADQIIAKYAMGHGGLRPEAAGAIVLPKVS----E 347 (347) T ss_pred -cchhhhhhhehhhhcCCceeccccEEEEecCCCC----C Confidence 5999999999999999999999996655433211 1 No 19 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=100.00 E-value=1.9e-44 Score=260.25 Aligned_cols=284 Identities=14% Similarity=0.025 Sum_probs=215.8 Q ss_pred CCcccccccceeeeh--hhhhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeecccccccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLN--LQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKD 78 (319) Q Consensus 1 ~~~~~~~~~~~~~~~--~~~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~D 78 (319) |-.++ +-=+.| ..-=..-..+...+-+ |+|++.+++.|...+..-.. ++. ....+|++++||+++...++. 
T Consensus 1 ma~~~----~~~~~n~~~~~~~~~~~~~~al~i-e~~~geV~~~f~~~s~~~~~-~~~-r~i~~g~s~~~~~iG~~~~~~ 73 (344) T protein:vir:10 1 MANMT----GGQQLGTNQGKDVMAAGDKLALFL-KVFGGEVLTAFARTSVTTSR-HMV-RSISSGKSAQFPVLGRTQAAY 73 (344) T ss_pred Ccccc----ccccCCcccCCccCCccchhHHHH-HHHHHHHHHHHHHHhhhccc-cee-eeecccceEEEEeeceeEEEe Confidence 21111 000001 0001122334555555 99999999998888765433 332 356689999999999999999 Q ss_pred ccCCCCccc--CCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCc--- Q lcl|Aclame:pro 79 YKRNATNEF--DHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAK--- 153 (319) Q Consensus 79 Y~r~~~~~~--~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~--- 153 (319) |++++.... +++..+..+++||+.+|+.|.||++|+.|+.. ++...+++++.++|+...|++++..++..+.. T Consensus 74 ~~~G~~l~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~--D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~ 151 (344) T protein:vir:10 74 LAPGENLDDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHY--DVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQ 151 (344) T ss_pred eecCCCCCCCCCCcccceEEEEEcchhhhhhhhhhHHHHhcCc--chHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Confidence 999876543 56889999999999999999999999999874 56788999999999999999998877532210 Q ss_pred ----------c-------c-----cccCCHhHHHHHHHHHHHHHHhccCC-CCcEEEEChHHHHHHhhhhhhhhcccccc Q lcl|Aclame:pro 154 ----------H-------L-----TVGTGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKFVIALPQGDTRQ 210 (319) Q Consensus 154 ----------~-------~-----~~~~T~~n~~~~i~~a~~~Lde~~VP-~~R~l~VsP~~~~~L~~~~~f~~~~~~~~ 210 (319) . . 
....+++++|+.|+++.++|||++|| ++||++|+|++|.+|+++++|......++ T Consensus 152 ~~~~~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~ 231 (344) T protein:vir:10 152 YNENITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILAALMPNAANYAAL 231 (344) T ss_pred cccccccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhhcccccccccccc Confidence 0 0 11124568999999999999999999 79999999999999999999987766566 Q ss_pred cceeeeeeeeecCeEEEEeccccc------------c-----------------cceEEEEcCCceeeeeeee-eeeeec Q lcl|Aclame:pro 211 QVLGKGVQGELDGFVIVKVPTKLL------------Q-----------------GLQAIAVVGEVLASPIQAD-LAKTNS 260 (319) Q Consensus 211 ~~~~~g~Vg~idG~~I~~vps~~~------------~-----------------~~n~i~~~~~A~~~~~k~~-~~~~~~ 260 (319) ...++|+|++++||+|+++|+-.. . ..--+++||.|+..+.+++ ++|.++ T Consensus 232 ~~~~~G~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~r 311 (344) T protein:vir:10 232 IDPEKGSIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERAR 311 (344) T ss_pred cceeeeEEEEEeceEEEeccccccccCCcccccccCccccccCCcccceeeecceeEEEeechhhhhhhhhccceeeccc Confidence 678999999999999998753210 0 0112578999999999998 899998 Q ss_pred CCCCCccceeeeeeeeeEEEeccccceEEEEcccc Q lcl|Aclame:pro 261 NIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTE 295 (319) Q Consensus 261 ~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~ 295 (319) ++++|+|.++|+++||++|+||+..+...-+ ++ T Consensus 312 -~~~~~~d~i~g~~~~G~~vlRPe~a~~v~~~-~~ 344 (344) T protein:vir:10 312 -RANFQADQIIAKYAMGHGGLRPEAAGAVVFK-TK 344 (344) T ss_pred -chhHHHHHHHHHhhcccceecccceEEEEee-cC Confidence 5999999999999999999999987653322 22 No 20 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=100.00 E-value=1.6e-44 Score=260.74 Aligned_cols=285 Identities=14% Similarity=0.026 Sum_probs=217.0 Q ss_pred 
cccccceeeehhhhhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccccCCCC Q lcl|Aclame:pro 5 IKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRNAT 84 (319) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~r~~~ 84 (319) .-|++|-.---.+....-...+..+.+ |.|.+.+..-|...+..-.. ++. ....+|++|+||+++...+++|++++. T Consensus 1 m~~~~~~~~~t~~g~~~~~~d~~al~i-k~f~~eV~~~f~~~s~~~~~-~~~-r~i~~G~sv~i~~iG~~tv~~~t~G~~ 77 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKSSSDALALFL-KVFAGEVLTAFTRRSVTADK-HIV-RTIQNGKSAQFPVMGRTSGVYLAPGER 77 (347) T ss_pred CCCCCccccccccccCCccccHHHHHH-HHHhHHHHHHHHHHHhhhcc-ccc-ccccccceEEEecccceeeeeecCCCC Confidence 233333211123444444444455555 88988888887766654332 222 244589999999999999999999886 Q ss_pred c--ccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc---Ccc----- Q lcl|Aclame:pro 85 N--EFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNK---AKH----- 154 (319) Q Consensus 85 ~--~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a---~~~----- 154 (319) . +..+++.+..+++||+.+|+.|.||++|..|+.. ++...+++++.++++..+|++++..++..+ +.. T Consensus 78 l~~~~~~~~~~e~~itID~~~~~~~~VddiD~~q~~~--D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~~~~ 155 (347) T protein:vir:94 78 LSDKRKGIKHTEKVITIDGLLTADVMIFDIEDAMNHY--DVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASNENIA 155 (347) T ss_pred cCCCCCCCCcceEEEEecchhhhhHHhhhHHHHhcCc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccC Confidence 6 4567888999999999999999999999999875 567789999999999999999987664211 100 Q ss_pred ----------------ccccCCHhHHHHHHHHHHHHHHhccCC-CCcEEEEChHHHHHHhhhhhhhhcccccccceeeee Q lcl|Aclame:pro 155 ----------------LTVGTGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGV 217 (319) Q Consensus 155 ----------------~~~~~T~~n~~~~i~~a~~~Lde~~VP-~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~ 217 (319) .+.+.++.++++.|+++.++|+|++|| ++||++|+|++|.+|++++.|......++..+.+|. 
T Consensus 156 g~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~ 235 (347) T protein:vir:94 156 GLGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAANYAALIDPETGN 235 (347) T ss_pred CCcccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhhhccccccccccc Confidence 011123578899999999999999999 799999999999999999888876655666789999 Q ss_pred eeeecCeEEEEecccc---------cc-------------------------cceEEEEcCCceeeeeeee-eeeeecCC Q lcl|Aclame:pro 218 QGELDGFVIVKVPTKL---------LQ-------------------------GLQAIAVVGEVLASPIQAD-LAKTNSNI 262 (319) Q Consensus 218 Vg~idG~~I~~vps~~---------~~-------------------------~~n~i~~~~~A~~~~~k~~-~~~~~~~~ 262 (319) |+++.||+|+++|+-- .. ..-.++.||.|+..+.+++ ++|.++ + T Consensus 236 Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r-~ 314 (347) T protein:vir:94 236 IRNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALERDR-D 314 (347) T ss_pred eEEEeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhcccccccchh-c Confidence 9999999999864321 00 0124678999999999998 899998 5 Q ss_pred CCCccceeeeeeeeeEEEeccccceEEEEcccc Q lcl|Aclame:pro 263 PGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTE 295 (319) Q Consensus 263 ~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~ 295 (319) +++|+|+++|+++||++|+||+..+.+.-..+. 
T Consensus 315 ~~~~~d~i~~~~~~G~~~~rP~~a~~~~~~~A~ 347 (347) T protein:vir:94 315 VDAQGDLIVGKYAMGHGGLRPEAAGALVFSPAE 347 (347) T ss_pred hhhHHHHhhhhhhhcCcccccceeEEEEecCCC Confidence 999999999999999999999997765433222 No 21 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=100.00 E-value=4.2e-43 Score=252.92 Aligned_cols=284 Identities=13% Similarity=0.029 Sum_probs=218.0 Q ss_pred cccccceeeehhhhhhhh--hcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccccCC Q lcl|Aclame:pro 5 IKNATGMLKLNLQHFANK--SVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRN 82 (319) Q Consensus 5 ~~~~~~~~~~~~~~~~~~--~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~r~ 82 (319) .-+-+|-+..|++-=.-. +-....+-+ |+|.+.+++.|...+..... ++ .....+|++++||+++...++.|++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~l-e~f~geV~~~f~~~s~~~~~-~~-~r~i~~gks~~~~~iG~~~~~~~~~G 77 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFL-KVFGGEVLTAFARTSVTTSR-HM-VRSISSGKSAQFPVLGRTQAAYLAPG 77 (345) T ss_pred CcccccchhcccccccccccCCchhHHHH-HHHhHHHHHHHHHHhhhccc-ce-eeeccccceEEEeeecceEEEeeecC Confidence 222233333332211111 113334444 99999999999888775433 33 23566899999999999999999988 Q ss_pred CCccc--CCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc------ Q lcl|Aclame:pro 83 ATNEF--DHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH------ 154 (319) Q Consensus 83 ~~~~~--~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~------ 154 (319) +.... .++..+..+++||+.+|+.|.||++|..|+. .++...+++++.++|+..+|+..+..++..+... 
T Consensus 78 ~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~--~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~ 155 (345) T protein:vir:22 78 ENLDDKRKDIKHTEKVITIDGLLTADVLIYDIEDAMNH--YDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNEN 155 (345) T ss_pred CCCCCCCCCcccceEEEEecchhhhhhhHhhHHHHhcC--chhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Confidence 76544 3577889999999999999999999999887 4568889999999999999999987665322110 Q ss_pred -------------------ccccCCHhHHHHHHHHHHHHHHhccCC-CCcEEEEChHHHHHHhhhhhhhhccccccccee Q lcl|Aclame:pro 155 -------------------LTVGTGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLG 214 (319) Q Consensus 155 -------------------~~~~~T~~n~~~~i~~a~~~Lde~~VP-~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~ 214 (319) .....+++++|++|+++.++|+|++|| .+||++|+|++|.+|+++++|.+....+....+ T Consensus 156 ~~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~ 235 (345) T protein:vir:22 156 IEGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILAALMPNAANYAALIDPE 235 (345) T ss_pred ccccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhccccccccccccccccc Confidence 012235678999999999999999999 799999999999999999999876665666678 Q ss_pred eeeeeeecCeEEEEecccc-----------------c-------------ccceEEEEcCCceeeeeeee-eeeeecCCC Q lcl|Aclame:pro 215 KGVQGELDGFVIVKVPTKL-----------------L-------------QGLQAIAVVGEVLASPIQAD-LAKTNSNIP 263 (319) Q Consensus 215 ~g~Vg~idG~~I~~vps~~-----------------~-------------~~~n~i~~~~~A~~~~~k~~-~~~~~~~~~ 263 (319) +|+|++++||+|+++|+-. . 
...-.++.||+|+..+.+++ ++|.++ ++ T Consensus 236 ~G~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r-~~ 314 (345) T protein:vir:22 236 KGSIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERAR-RA 314 (345) T ss_pred cceEEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCceEEEEEehhheeeeeeecceeeeee-ch Confidence 9999999999999875311 0 01124688999999999998 799998 58 Q ss_pred CCccceeeeeeeeeEEEeccccceEEEEccc Q lcl|Aclame:pro 264 GMFGTLAEQLLYTGAFVPEHLQKYIFTIGGT 294 (319) Q Consensus 264 ~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~ 294 (319) .+|+|+++|++.||++|+||+..+...-.-. T Consensus 315 ~~~~d~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 315 NFQADQIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred hHHHHHHHHHHhcCCcccccceeEEEEEeeC Confidence 9999999999999999999999776544322 No 22 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=100.00 E-value=4.1e-42 Score=247.48 Aligned_cols=285 Identities=12% Similarity=0.034 Sum_probs=220.9 Q ss_pred cccccceeee-hhhhhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccccCCC Q lcl|Aclame:pro 5 IKNATGMLKL-NLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRNA 83 (319) Q Consensus 5 ~~~~~~~~~~-~~~~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~r~~ 83 (319) .-|.+|-..| ..|..+....++..+-+ |+|.+.++..|...+..- ..++. ....+|++|+||+++...+.+|++.. 
T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~i-e~~~geV~~~f~~~s~~~-~~~~~-r~i~~G~sv~~~~iG~~~~~~~~~g~ 77 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFL-KVFGGEVLTAFVRRSVTM-DKHMV-RTIQNGKSASFPVMGRTKGYYLAPGE 77 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHH-HHHHHHHHHHHHHHhhhh-hcccc-ccccCcceEEEeeecceeeeeecccc Confidence 4445554444 45566666666666655 999999998888776543 33332 24568999999999999999988776 Q ss_pred Ccc--cCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc------- Q lcl|Aclame:pro 84 TNE--FDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH------- 154 (319) Q Consensus 84 ~~~--~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~------- 154 (319) ... ..+++.+..+++||+.+|+.|.||++|..|+.. ++...+++++.++|+...|++++..++..+... T Consensus 78 ~l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~--D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~~ 155 (347) T protein:vir:88 78 NLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHY--DVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENI 155 (347) T ss_pred CCCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcC--CchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Confidence 543 356888999999999999999999999999874 567889999999999999999998775433111 Q ss_pred ------c----cc-------cCCHhHHHHHHHHHHHHHHhccCC-CCcEEEEChHHHHHHhhhhhhhhcccccccceeee Q lcl|Aclame:pro 155 ------L----TV-------GTGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKG 216 (319) Q Consensus 155 ------~----~~-------~~T~~n~~~~i~~a~~~Lde~~VP-~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g 216 (319) . .. 
...+..+|+.|+++.++|+|++|| ++||++|+|++|..|+++++|.+....+...+++| T Consensus 156 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G 235 (347) T protein:vir:88 156 AGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETG 235 (347) T ss_pred CCccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhcchhhhhhhhccccchhcc Confidence 0 00 112456799999999999999999 79999999999999999988887655556678899 Q ss_pred eeeeecCeEEEEecccccc---------------------------------cceEEEEcCCceeeeeeee-eeeeecCC Q lcl|Aclame:pro 217 VQGELDGFVIVKVPTKLLQ---------------------------------GLQAIAVVGEVLASPIQAD-LAKTNSNI 262 (319) Q Consensus 217 ~Vg~idG~~I~~vps~~~~---------------------------------~~n~i~~~~~A~~~~~k~~-~~~~~~~~ 262 (319) .|+++.||+|+++|+-... +.-.++.|++|+..+..++ ++|.++ + T Consensus 236 ~vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~~e~~r-~ 314 (347) T protein:vir:88 236 NIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERAR-R 314 (347) T ss_pred eeeeeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechhhhhheecccceeeeee-c Confidence 9999999999987643110 0112788999999998888 799998 5 Q ss_pred CCCccceeeeeeeeeEEEeccccceEEEEcccc Q lcl|Aclame:pro 263 PGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTE 295 (319) Q Consensus 263 ~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~ 295 (319) +.+|+|.++|++.||++|+||+..+.+.-..++ T Consensus 315 ~~~~~d~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 315 PEFQADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred hhhHHHHhhhhhhhcCceeccceEEEEEeCCCC Confidence 889999999999999999999986554333222 No 23 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=100.00 E-value=6.2e-42 Score=246.51 Aligned_cols=257 Identities=15% Similarity=0.085 Sum_probs=201.1 Q ss_pred 
cCcceeeeCCceEEeeeccccccccccCCCCc--ccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHH Q lcl|Aclame:pro 54 ISNDAIFMEGRSFTVMKGDTTELKDYKRNATN--EFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQG 131 (319) Q Consensus 54 ~n~~~~~~~g~tVkIp~i~~~g~~DY~r~~~~--~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~ 131 (319) .=| ...+|++++||+++...+++|++.... .+.++..+..+++||+.+|+.|.||++|..|+. +++...+++++ T Consensus 1 ~vr--~i~~g~s~~~~~iG~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~~qa~--~Dlr~e~s~~~ 76 (324) T protein:vir:99 1 MTR--TITSGKSAQFPVMGRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIEDAMNH--YDVRSEYSTQM 76 (324) T ss_pred Cee--eeecCceEEEeeeeeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhhhhhhhHHHHhcC--ccchhHHHHHH Confidence 001 345899999999999999999988865 457789999999999999999999999999976 45688999999 Q ss_pred HHHHHHHHHHHHHHHHHhccCc-----------------------cccccCCHhHHHHHHHHHHHHHHhccCC-CCcEEE Q lcl|Aclame:pro 132 AEVVAPYLDNLRFATLARNKAK-----------------------HLTVGTGSDAQYDAVLDVSVELDEIKAP-ENRVLF 187 (319) Q Consensus 132 ~~~vapeiD~~~~s~la~~a~~-----------------------~~~~~~T~~n~~~~i~~a~~~Lde~~VP-~~R~l~ 187 (319) .++|+..+|++.+..++..+.. ......+++++|+.|+++.++|||++|| .+||++ T Consensus 77 G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~~gR~~v 156 (324) T protein:vir:99 77 GEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIPAGDRTFY 156 (324) T ss_pred HHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEE Confidence 9999999999998876532210 0011234668999999999999999999 799999 Q ss_pred EChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEEEEeccccc--------------------------------- Q lcl|Aclame:pro 188 VSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLL--------------------------------- 234 (319) Q Consensus 188 VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~--------------------------------- 234 (319) |+|++|.+|++++.+......++....+|.|++++||+|+++++-.. 
T Consensus 157 v~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky~~d~ 236 (324) T protein:vir:99 157 TDPDTYSAILAALMPNAANYAALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKMTVGA 236 (324) T ss_pred eChHHHHHHhhcccccccccccccceecceEEEEeceEEEecCCcccccccccccccccccccccccccccccccccccc Confidence 99999999987766665544455678999999999999998643110 Q ss_pred ccceEEEEcCCceeeeeeee-eeeeecCCCCCccceeeeeeeeeEEEeccccceE-EEEccccccCCCC------Ccccc Q lcl|Aclame:pro 235 QGLQAIAVVGEVLASPIQAD-LAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYI-FTIGGTEVATKRD------GVDAH 306 (319) Q Consensus 235 ~~~n~i~~~~~A~~~~~k~~-~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~I-y~~~~~~~a~~~~------~~~~~ 306 (319) .+.--+++|++|+.....+. ++|.++ ++.+|+|+|+|++.||++++||+..+. ....+.+|+..++ +..+. T Consensus 237 ~~~~gl~~~~~a~~tv~~~~~~~e~~~-~~~~~~d~i~~~~a~G~~~lRPe~a~~v~l~~~~~~~~~~~~~~~~~~~~~~ 315 (324) T protein:vir:99 237 DNVVGLFVHRSAVATLKLKDMALERAR-RPEYQADQIIAKYAMGHGGLRPEAVGAIIFEDGETPAVAPDVITGVASFAAP 315 (324) T ss_pred CceeEEEEehhheEEEeeecceeccee-chhhHHHhhhhhhhhcCcccccceEEEEEEccCccccccchhhhhhccccCc Confidence 01113788999987777777 799998 588899999999999999999998774 4456655544442 55555 Q ss_pred ccccccccc Q lcl|Aclame:pro 307 ADNVAKPSG 315 (319) Q Consensus 307 ~~~~~~~~~ 315 (319) .++.++.+. 
T Consensus 316 ~~~~~~~~~ 324 (324) T protein:vir:99 316 ASTRAKSSA 324 (324) T ss_pred ccceeeecC Confidence 666666555 No 24 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=100.00 E-value=6.6e-39 Score=229.90 Aligned_cols=291 Identities=13% Similarity=0.056 Sum_probs=206.5 Q ss_pred CCcccccccceeeehhhhhhhhhc---chhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSV---EPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELK 77 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~---~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~ 77 (319) |-.+--.. |....+..-+.+. +...+-+ |.|.+.++.-|...+..-.. ++. ....+|++|+||+++...++ T Consensus 1 ~~~~~~~~---~~~~n~~t~~~~~~~~~~~al~l-e~f~geV~~~f~~~si~~~~-~~~-rti~~Gksv~f~~iG~~t~~ 74 (375) T protein:vir:10 1 MANANQVA---LGRSNLSTGTGYGGATDKYALYL-KLFSGEMFKGFQHETIARDL-VTK-RTLKNGKSLQFIYTGRMTSS 74 (375) T ss_pred Cccccccc---cCccccCCccccccccchHHHHH-HHHhHHHHHHHHHHHhhhcc-ccc-cccccCceEEEEeeeeeEEe Confidence 43332222 2222233333333 4444544 99999999999888765433 332 35568999999999999999 Q ss_pred cccCCCCcccC---CcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCc- Q lcl|Aclame:pro 78 DYKRNATNEFD---HPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAK- 153 (319) Q Consensus 78 DY~r~~~~~~~---~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~- 153 (319) +|++....... +...+..+++||+.+||.|.||++|+.|+.. ++...+++++.++|+...|++.+..++..+.. 
T Consensus 75 ~~t~G~~i~~~~~~d~~~te~~l~ID~~~y~~~~VdDiD~aqa~~--Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~ 152 (375) T protein:vir:10 75 FHTPGTPILGNADKAPPVAEKTIVMDDLLISSAFVYDLDETLAHY--ELRGEISKKIGYALAEKYDRLIFRSITRGARSA 152 (375) T ss_pred eecCCcCcCCccccCCCCCceEEEecchhhhhhhHhhHHHHhcCc--hhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Confidence 99987765443 5567888999999999999999999999874 56888999999999999999999888643311 Q ss_pred ----------------------cccccCCHhHHHHHHHHHHHHHHhccCC-CCcEEEEChHHHHHHhhh---hhhhhccc Q lcl|Aclame:pro 154 ----------------------HLTVGTGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKF---VIALPQGD 207 (319) Q Consensus 154 ----------------------~~~~~~T~~n~~~~i~~a~~~Lde~~VP-~~R~l~VsP~~~~~L~~~---~~f~~~~~ 207 (319) .....+|+.++|++|+++.++|+|++|| ++||++|+|++|.+|+++ ++|..... T Consensus 153 ~p~~~~~~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~~~~n~d~ 232 (375) T protein:vir:10 153 SPVSATNFVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSNGLVNRDV 232 (375) T ss_pred cccccccccccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCccceeeecc Confidence 1122357899999999999999999999 799999999999999876 34443333 Q ss_pred ccccceeeeeeeeecCeEEEEecccc---c---------------------------------------------ccceE Q lcl|Aclame:pro 208 TRQQVLGKGVQGELDGFVIVKVPTKL---L---------------------------------------------QGLQA 239 (319) Q Consensus 208 ~~~~~~~~g~Vg~idG~~I~~vps~~---~---------------------------------------------~~~n~ 239 (319) .++....+|.|++++||+|+++++-. . 
.+..- T Consensus 233 ~~~~~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~ 312 (375) T protein:vir:10 233 QGSALQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCG 312 (375) T ss_pred cccceeccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeeccccccccccccccCceEE Confidence 34556788999999999999863311 0 00113 Q ss_pred EEEcCCceeeeeeee-eeeeec--CCCCCccceeeeeeeeeEEEeccccceEEEEccccccCC Q lcl|Aclame:pro 240 IAVVGEVLASPIQAD-LAKTNS--NIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATK 299 (319) Q Consensus 240 i~~~~~A~~~~~k~~-~~~~~~--~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~ 299 (319) ++.|+.|...+.=++ .+++++ ....+.+|++.++.-+|+.++||...+.+.-.++++++= T Consensus 313 ~~~~~~A~g~v~~~~~~~~~~~~~~~~~~q~~~i~~~~a~G~~~lrp~~av~l~~~~~~~~~~ 375 (375) T protein:vir:10 313 LIFQKEAAGVVEAIGPQVQVTNGDVSVIYQGDVILGRMAMGADYLNPAAAVELYIGATAPSAF 375 (375) T ss_pred EEEchhheeeeeeeccccccccchhhheeeeeeeeeeeeeccCccCceeEEEEecCcCccccC Confidence 678888887552111 122221 134567899999999999999999955432222333332 No 25 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=100.00 E-value=1.5e-38 Score=227.94 Aligned_cols=305 Identities=15% Similarity=0.079 Sum_probs=210.2 Q ss_pred CCcccccccceeeehhhhhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeecccccccccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYK 80 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~ 80 (319) |- +|. 
.+| +.+-++.+-...+++- +|.|.+.+.+.+..++.-..+..++++.+..|++|+||+++..++.||+ T Consensus 1 ~~-~~~-~~~----~~~~~~~~~t~~~~fi-Pev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g~~~a~d~~ 73 (381) T protein:vir:80 1 MA-TIQ-GTG----GYKGSAVDLSNVQVFI-PEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNISRAAVYDKQ 73 (381) T ss_pred Cc-eec-ccc----cccCcccchhhHHhhh-hHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccCcceeeeec Confidence 21 222 111 2233444444444443 6899888888876554333344456788889999999999999999999 Q ss_pred CCCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc------ Q lcl|Aclame:pro 81 RNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH------ 154 (319) Q Consensus 81 r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~------ 154 (319) +.....+++++.+..+++||+++++.|.||++|..|+.. ++.....+++..+++..+|+++++.++...... T Consensus 74 ~g~~i~~~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~--D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t 151 (381) T protein:vir:80 74 PQTPVNLQARTDSEFTFTVTKYKESSFMIEDIVNTQASY--TLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYS 151 (381) T ss_pred CCCcccccccCCceEEEEEeeeeecceeechHHHHhhcc--ChHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Confidence 999999999999999999999999999999999888764 667888999999999999999998775432110 Q ss_pred ------------ccccCCHhHHHHHHHHHHHHHHhccCC-CCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeee Q lcl|Aclame:pro 155 ------------LTVGTGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGEL 221 (319) Q Consensus 155 ------------~~~~~T~~n~~~~i~~a~~~Lde~~VP-~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~i 221 (319) ...+.+....|+.|+++.++|||++|| ++||++|+|+++..|+++++|......+++.+++|.||++ T Consensus 152 ~~~~i~~~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~~~~l~~G~Ig~i 231 (381) T protein:vir:80 152 YDTTLGDGTVNAHLTGTPAPLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQVKPVTSGVVGTI 231 (381) T ss_pred ccccccccccccccccchhhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhccchhhhceeeeEE Confidence 011234567899999999999999999 
7999999999999999999999876666778999999999 Q ss_pred cCeEEEEeccc---ccccceEEEEcCCceeeeeeeeeeeeecCCCCCccceeeeeeeeeEEEec-cccceEEEEcccccc Q lcl|Aclame:pro 222 DGFVIVKVPTK---LLQGLQAIAVVGEVLASPIQADLAKTNSNIPGMFGTLAEQLLYTGAFVPE-HLQKYIFTIGGTEVA 297 (319) Q Consensus 222 dG~~I~~vps~---~~~~~n~i~~~~~A~~~~~k~~~~~~~~~~~~~~~~~v~gr~~yg~~V~~-~k~~~Iy~~~~~~~a 297 (319) +||+|+++++- ......+.++++.+... ++... .+++.+...+.+++.++.||+.+.. -...-+|.-.++.++ T Consensus 232 ~G~~Vv~Sn~lp~~~~t~~~~~agap~~~~~--~~~~~-~~~g~~s~~a~av~~~k~yd~~~~~~~~~~~~~~g~~~~~~ 308 (381) T protein:vir:80 232 LGMEVIVTTQIGINSLTGYVNGQGAPTQPTP--GVLGS-PYLPDQAGTANVVNTGSASDLAVSLSYFGLPVFSGAGATAA 308 (381) T ss_pred cceEEEeecccccccccceeeeccccccccc--ccccc-ccccccccceeeeeeeeeeceeeeeeeccceeeecceeeec Confidence 99999985322 12233445555544432 22222 2444556678999999999999854 333333333333222 Q ss_pred CCCCCcccccc----------------------ccccccccccC Q lcl|Aclame:pro 298 TKRDGVDAHAD----------------------NVAKPSGSLEM 319 (319) Q Consensus 298 ~~~~~~~~~~~----------------------~~~~~~~~~~~ 319 (319) ... ++..+- ...++-|+-|+ T Consensus 309 ~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 350 (381) T protein:vir:80 309 DGG--QTLGSFGGANRWATAVVCHPDWLAVGVQQNVKSESSRET 350 (381) T ss_pred CCC--ceeeeehhhhhhhhhcccccccccccceeEeecccchhh Confidence 211 111111 01111122222 No 26 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=100.00 E-value=3.3e-39 Score=231.55 Aligned_cols=284 Identities=12% Similarity=0.005 Sum_probs=215.0 Q ss_pred cccccceeee-hhhhhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccccCCC Q lcl|Aclame:pro 5 IKNATGMLKL-NLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRNA 83 (319) Q Consensus 5 ~~~~~~~~~~-~~~~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~r~~ 83 (319) .-|..+-=+| ..+...+-......+-+ |+|++.+++.|...+..-.. ++. ....+|++++||+++...+.+|++.. 
T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~i-e~~~geV~~~f~~~s~~~~~-~~~-rti~~G~sv~~~~iG~~~~~~~~~G~ 77 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFL-KVFGGEVLTAFTRTSVTMNK-HLV-RSIQSGKSAQFPVLGRTKAAYLQPGE 77 (347) T ss_pred CCccccccccccccccCCcccchHHHHH-HHHhHHHHHHHHHHHhhhhh-hhh-eeccccceEEeeeccceeEeeeecCc Confidence 3333333333 23344444444455545 99999999888877665433 332 34568999999999999999999877 Q ss_pred Ccc--cCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc------- Q lcl|Aclame:pro 84 TNE--FDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH------- 154 (319) Q Consensus 84 ~~~--~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~------- 154 (319) ... ..++..+..+++||+.+|+.|.||++|+.|+.. ++...+++++.++++.+.|++.+..++..+... T Consensus 78 ~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~--D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~~ 155 (347) T protein:vir:94 78 NLDDKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHY--DVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANNENI 155 (347) T ss_pred CCCCCcCCccccceEEEEcchhhhhhhhhhHHHHhcCc--chHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Confidence 543 357889999999999999999999999999874 567889999999999999999987665322110 Q ss_pred ------------------ccccCCHhHHHHHHHHHHHHHHhccCC-CCcEEEEChHHHHHHhhhhhhhhcccccccceee Q lcl|Aclame:pro 155 ------------------LTVGTGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGK 215 (319) Q Consensus 155 ------------------~~~~~T~~n~~~~i~~a~~~Lde~~VP-~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~ 215 (319) .+...++.++|+.|+++..+|+|++|| .+||++|+|++|..|++...+............+ T Consensus 156 ~g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~~~~~~~~~~~~ 235 (347) T protein:vir:94 156 AGLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILAALMPNAANYQALIDPST 235 (347) T ss_pred ccCCcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHHhhccccccccccccccc Confidence 011134678899999999999999999 6999999999999998865555433333346789 Q ss_pred 
eeeeeecCeEEEEeccccc--------------------------ccce-------EEEEcCCceeeeeeee-eeeeecC Q lcl|Aclame:pro 216 GVQGELDGFVIVKVPTKLL--------------------------QGLQ-------AIAVVGEVLASPIQAD-LAKTNSN 261 (319) Q Consensus 216 g~Vg~idG~~I~~vps~~~--------------------------~~~n-------~i~~~~~A~~~~~k~~-~~~~~~~ 261 (319) |.|++++||+|+++|+-.. ..+. -++.|++|+..+..++ .+|.++ T Consensus 236 G~V~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e~~~- 314 (347) T protein:vir:94 236 GSIRNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALERAR- 314 (347) T ss_pred ceeEEeeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEEEechhhhhhhhhcccceeeee- Confidence 9999999999998754211 0011 3789999999998888 588887 Q ss_pred CCCCccceeeeeeeeeEEEeccccceEEEEccc Q lcl|Aclame:pro 262 IPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGT 294 (319) Q Consensus 262 ~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~ 294 (319) .+.+++|.+.++..||+.++||...+.+.-..+ T Consensus 315 ~~~~~~~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 315 RANFQADQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred chhhhhhhhhhhhhhcCcccccceeEEEEecCC Confidence 588999999999999999999999876555433 No 27 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=100.00 E-value=2e-37 Score=221.79 Aligned_cols=268 Identities=17% Similarity=0.069 Sum_probs=213.9 Q ss_pred hhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhc--ccCcceeeeCCceEEeeecccc-ccccccCCCCcccCCccccee Q lcl|Aclame:pro 19 FANKSVEPGQTLLKNKHVGILERVTAVNAYSTPA--LISNDAIFMEGRSFTVMKGDTT-ELKDYKRNATNEFDHPKIEET 95 (319) Q Consensus 19 ~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~--~~n~~~~~~~g~tVkIp~i~~~-g~~DY~r~~~~~~~~~t~t~~ 95 (319) +|+--++...+--+|.|.+.+.+.+.. .+.+.. .+|+++.+.+|++|+||.+... ...+|+.+.+.+++.++.+.. 
T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~-~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~ 79 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEK-KLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDILETKKR 79 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHh-hhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccchhhccccee Confidence 566667777777888888777776643 333333 3578888888999999999864 577899988999999999999 Q ss_pred EEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccCCHhHHHHHHHHHHHHH Q lcl|Aclame:pro 96 TYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDVSVEL 175 (319) Q Consensus 96 tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~~~~T~~n~~~~i~~a~~~L 175 (319) +++|+| +++.|.++|+|..++.. +++....+++++.++.++|+++++.+........+.. ..|+.|++|..+| T Consensus 80 ~~~i~~-~~~a~~i~D~~~~~~~~--d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~~~~----~~~d~i~~A~~~l 152 (274) T protein:vir:96 80 EAKIRK-IAKGTSISDEALLSGYG--DPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVEADI----TKLTGLQTAIDKF 152 (274) T ss_pred EEEeee-eecceeehHHHHhhccc--hHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc----cCHHHHHHHHHHh Confidence 999988 79999999999888754 5577888999999999999999987765544333322 3488999999999 Q ss_pred HhccCCCCcEEEEChHHHHHHhhhh--hhhhcccccccceeeeeeeeecCeEEEEecccccccceEEEEcCCceeee-ee Q lcl|Aclame:pro 176 DEIKAPENRVLFVSPTFYKGIKKFV--IALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASP-IQ 252 (319) Q Consensus 176 de~~VP~~R~l~VsP~~~~~L~~~~--~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~~-~k 252 (319) ++.+. .+||++|+|++++.|++++ +|++..+.+..++++|.||++.||+|++++ ..+....++.+++|+.+. 
.+ T Consensus 153 gd~~~-~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~--~~~~~t~~l~~~gA~~~~~~~ 229 (274) T protein:vir:96 153 NDEDL-EPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGAVIVRSN--KLEAGTAILAKKGAVKLITKR 229 (274) T ss_pred ccccc-cccEEEeCHHHHHHHHhhccccccccccccccceeccccceecCeEEEEeC--CCCCceEEEEeccceeeeecC Confidence 88654 7899999999999999985 788888888889999999999999999743 333444555566666665 44 Q ss_pred eeeeeeecCCCCCccceeeeeeeeeEEEeccccceEEEEccccccCCCCCccccccccccccccccC Q lcl|Aclame:pro 253 ADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDGVDAHADNVAKPSGSLEM 319 (319) Q Consensus 253 ~~~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 319 (319) ...+|..| +++++.+.+.+|++||++|++|.+.. ..+|.+||+|| T Consensus 230 ~~~vE~~R-d~~~~~d~i~~~~~y~~~~~~~~~~v---------------------~~tk~~~~~~~ 274 (274) T protein:vir:96 230 DFFLETDR-DPSTKTTALYSDKHYVAYLYDESKAV---------------------KITKGSGSLEM 274 (274) T ss_pred Cccccccc-ccccccCEEEEeEEEEEEEEcCCcEE---------------------EEEcCCccccC Confidence 44888888 58889999999999999999998732 23567899999 No 28 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=100.00 E-value=2e-37 Score=221.79 Aligned_cols=268 Identities=17% Similarity=0.069 Sum_probs=213.9 Q ss_pred hhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhc--ccCcceeeeCCceEEeeecccc-ccccccCCCCcccCCccccee Q lcl|Aclame:pro 19 FANKSVEPGQTLLKNKHVGILERVTAVNAYSTPA--LISNDAIFMEGRSFTVMKGDTT-ELKDYKRNATNEFDHPKIEET 95 (319) Q Consensus 19 ~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~--~~n~~~~~~~g~tVkIp~i~~~-g~~DY~r~~~~~~~~~t~t~~ 95 (319) +|+--++...+--+|.|.+.+.+.+.. .+.+.. .+|+++.+.+|++|+||.+... ...+|+.+.+.+++.++.+.. 
T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~-~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~ 79 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEK-KLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDILETKKR 79 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHh-hhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccchhhccccee Confidence 566667777777888888777776643 333333 3578888888999999999864 577899988999999999999 Q ss_pred EEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccCCHhHHHHHHHHHHHHH Q lcl|Aclame:pro 96 TYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDVSVEL 175 (319) Q Consensus 96 tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~~~~T~~n~~~~i~~a~~~L 175 (319) +++|+| +++.|.++|+|..++.. +++....+++++.++.++|+++++.+........+.. ..|+.|++|..+| T Consensus 80 ~~~i~~-~~~a~~i~D~~~~~~~~--d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~~~~----~~~d~i~~A~~~l 152 (274) T protein:vir:95 80 EAKIRK-IAKGTSISDEALLSGYG--DPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVEADI----TKLTGLQTAIDKF 152 (274) T ss_pred EEEeee-eecceeehHHHHhhccc--hHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc----cCHHHHHHHHHHh Confidence 999988 79999999999888754 5577888999999999999999987765544333322 3488999999999 Q ss_pred HhccCCCCcEEEEChHHHHHHhhhh--hhhhcccccccceeeeeeeeecCeEEEEecccccccceEEEEcCCceeee-ee Q lcl|Aclame:pro 176 DEIKAPENRVLFVSPTFYKGIKKFV--IALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASP-IQ 252 (319) Q Consensus 176 de~~VP~~R~l~VsP~~~~~L~~~~--~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~~-~k 252 (319) ++.+. .+||++|+|++++.|++++ +|++..+.+..++++|.||++.||+|++++ ..+....++.+++|+.+. 
.+ T Consensus 153 gd~~~-~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~--~~~~~t~~l~~~gA~~~~~~~ 229 (274) T protein:vir:95 153 NDEDL-EPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGAVIVRSN--KLEAGTAILAKKGAVKLITKR 229 (274) T ss_pred ccccc-cccEEEeCHHHHHHHHhhccccccccccccccceeccccceecCeEEEEeC--CCCCceEEEEeccceeeeecC Confidence 88654 7899999999999999985 788888888889999999999999999743 333444555566666665 44 Q ss_pred eeeeeeecCCCCCccceeeeeeeeeEEEeccccceEEEEccccccCCCCCccccccccccccccccC Q lcl|Aclame:pro 253 ADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDGVDAHADNVAKPSGSLEM 319 (319) Q Consensus 253 ~~~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 319 (319) ...+|..| +++++.+.+.+|++||++|++|.+.. ..+|.+||+|| T Consensus 230 ~~~vE~~R-d~~~~~d~i~~~~~y~~~~~~~~~~v---------------------~~tk~~~~~~~ 274 (274) T protein:vir:95 230 DFFLETDR-DPSTKTTALYSDKHYVAYLYDESKAV---------------------KITKGSGSLEM 274 (274) T ss_pred Cccccccc-ccccccCEEEEeEEEEEEEEcCCcEE---------------------EEEcCCccccC Confidence 44888888 58889999999999999999998732 23567899999 No 29 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=100.00 E-value=4.7e-37 Score=219.75 Aligned_cols=270 Identities=15% Similarity=0.023 Sum_probs=211.1 Q ss_pred hhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhc--ccCcceeeeCCceEEeeeccccc-cccccCCCCcccCCccccee Q lcl|Aclame:pro 19 FANKSVEPGQTLLKNKHVGILERVTAVNAYSTPA--LISNDAIFMEGRSFTVMKGDTTE-LKDYKRNATNEFDHPKIEET 95 (319) Q Consensus 19 ~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~--~~n~~~~~~~g~tVkIp~i~~~g-~~DY~r~~~~~~~~~t~t~~ 95 (319) -|+-.+.+-++-.+|.|++++.+.+..+ +.+.. .+++.+...+|++|+||++...| ..+|..+.+.+++.++.+.. 
T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~-~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~g~~i~~~~lt~~~~ 79 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKA-IKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAEGAAIDYSALETESV 79 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHh-hhhcccceecccccCCCCCEEEEeeeccCCcceeecCCCcCccccccccee Confidence 2444445555556677776666665433 33333 34566677789999999998765 56899989999999999999 Q ss_pred EEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc--ccccCCHhHHHHHHHHHHH Q lcl|Aclame:pro 96 TYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH--LTVGTGSDAQYDAVLDVSV 173 (319) Q Consensus 96 tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~--~~~~~T~~n~~~~i~~a~~ 173 (319) +++|+| ++..|.++|++..++.. +++....+++++.++.++|+++++.+.+..... .....+.+++|+.|.++.. T Consensus 80 ~~~i~~-~~~a~~v~D~~~~~~~~--d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~~~~~~t~~~~~~~~~~~~da~~ 156 (278) T protein:vir:80 80 KHGIKK-AGKGVKLTDESVLSGYG--DPVEEAQKQIRMAIASKVDNDILEEALTTTLEVKGAINIGLIDKIENTFTDAPD 156 (278) T ss_pred eEeeeh-hhccccccHHHHhhccc--cHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhhhHHHHHHHHHH Confidence 999987 67899999999888754 567888999999999999999999886543221 1223356788999999999 Q ss_pred HHHhccCCCCcEEEEChHHHHHHhhhh--hhhhcccccccceeeeeeeeecCeEEEEecccccccceEEEEcCCceeee- Q lcl|Aclame:pro 174 ELDEIKAPENRVLFVSPTFYKGIKKFV--IALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASP- 250 (319) Q Consensus 174 ~Lde~~VP~~R~l~VsP~~~~~L~~~~--~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~~- 250 (319) +|++.++|..++++|+|++++.|+++. +|++....++..+++|.||++.|++|+++ +.++....++.|++|+.+. 
T Consensus 157 ~l~~~~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~~~G~~Vi~s--~~~p~~t~~l~~~gAi~~~~ 234 (278) T protein:vir:80 157 AIEDESITTTGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGELLGWEIVRT--KKLADGNALAVKAGALKTFL 234 (278) T ss_pred hhcccCCCcccEEEECHHHHHHHHhhhhhhccccccccccceeeccceeecceeEEEc--CCCCcceEEEEeccceeeee Confidence 999999998889999999999998875 78877777888899999999999999974 3455667788889999754 Q ss_pred eeeeeeeeecCCCCCccceeeeeeeeeEEEeccccceEEEEccccc Q lcl|Aclame:pro 251 IQADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEV 296 (319) Q Consensus 251 ~k~~~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~ 296 (319) .|...+|..| ++.++++.+++|++||++|++|.+. |-+.+++.. T Consensus 235 ~~~~~vE~~R-d~~~~~d~i~~~~~yg~~v~~~~~~-v~it~~a~~ 278 (278) T protein:vir:80 235 KRNLLAESGR-DMDHKLTKFNADQHYAVALVDETKA-VKVVPVAGN 278 (278) T ss_pred cCCccccccc-chhhccceeeeeeEEEEEEEcCcce-EEEeeccCC Confidence 5555888887 5888999999999999999999983 333333222 No 30 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=100.00 E-value=3.7e-36 Score=214.83 Aligned_cols=268 Identities=14% Similarity=0.071 Sum_probs=216.8 Q ss_pred hhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccC--cceeeeCCceEEeeeccc-cccccccCCCCcccCCccccee Q lcl|Aclame:pro 19 FANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALIS--NDAIFMEGRSFTVMKGDT-TELKDYKRNATNEFDHPKIEET 95 (319) Q Consensus 19 ~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n--~~~~~~~g~tVkIp~i~~-~g~~DY~r~~~~~~~~~t~t~~ 95 (319) -|++.+.+-.+-.+|.|.+.+.+.+... +....+++ +.+...+|++|+||++.. ....+|..+.+++++.++.+.. 
T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~-~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~~i~~~~it~~~~ 79 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKK-LRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKR 79 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhh-hhhcccccccccccCCCCCEEEEEeeccCCCcccccCCCccccccccccee Confidence 5778888888888899988777776444 34444444 445666799999999986 4688999888999999999999 Q ss_pred EEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccCCHhHHHHHHHHHHHHH Q lcl|Aclame:pro 96 TYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDVSVEL 175 (319) Q Consensus 96 tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~~~~T~~n~~~~i~~a~~~L 175 (319) ++++++ +++.|.++|.+..++.. +++....+++.+.++..+|+++++.+........+ ....++.|++|..+| T Consensus 80 ~~~i~~-~~~~~~i~D~~~~~~~~--d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~~~~----~~~~~d~i~dA~~~l 152 (274) T protein:vir:93 80 EAKIRK-IAKGTSITDEALLSGYG--DPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNA----DITKLNGLQSAIDKF 152 (274) T ss_pred EEEeee-ecccccccHHHHHhhcc--chHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc----cccCHHHHHHHHHHh Confidence 999987 78999999999999864 45777889999999999999999887554433222 233588899999999 Q ss_pred HhccCCCCcEEEEChHHHHHHhhhh--hhhhcccccccceeeeeeeeecCeEEEEecccccccceEEEEcCCceeeeeee Q lcl|Aclame:pro 176 DEIKAPENRVLFVSPTFYKGIKKFV--IALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQA 253 (319) Q Consensus 176 de~~VP~~R~l~VsP~~~~~L~~~~--~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~~~k~ 253 (319) ++.++ ++||++|+|++++.|+++. +|++....++..+++|.||++.|++|+++ +.++....+++|++|+.+..|. 
T Consensus 153 ~d~~~-~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s--~~~p~~t~~l~~~gai~~~~~~ 229 (274) T protein:vir:93 153 NDEDL-EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRT--NKLEAGTAILAKKGAVKLILKR 229 (274) T ss_pred hhccC-CccEEEeCHHHHHHHHhhhhhcccccccccccceeecccceecCeeEEEc--CCCCcceEEEEeCCeEEEEecC Confidence 98765 6899999999999999885 78888777888899999999999999974 3455667888899999988765 Q ss_pred e-eeeeecCCCCCccceeeeeeeeeEEEeccccceEEEEccccccCCCCCccccccccccccccccC Q lcl|Aclame:pro 254 D-LAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDGVDAHADNVAKPSGSLEM 319 (319) Q Consensus 254 ~-~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 319 (319) . .+|..| +++++.+.+++|+|||+++++|.+.. ..+++.||+|| T Consensus 230 ~~~vE~~R-d~~~~~d~i~~~~~y~~~~~~~~~~v---------------------~~t~~~~s~~~ 274 (274) T protein:vir:93 230 DFFLEVAR-DASTKTTALYSDKHYVAYLYDESKAV---------------------KITKGSGSLEM 274 (274) T ss_pred Cccccccc-chhhcccEEEEEEEEEEEEEcCCceE---------------------EEeeCccccCC Confidence 5 788877 58889999999999999999998732 23466788888 No 31 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=100.00 E-value=7.5e-37 Score=218.62 Aligned_cols=282 Identities=12% Similarity=0.040 Sum_probs=212.0 Q ss_pred CCcccccccceeeehhhhhhhhhcchhh--hhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeecccccccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPGQ--TLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKD 78 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~n~--~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~D 78 (319) |+-.-. +-|+ =+++-..+. .-+-|.|++.++..|...+..-.. .+. 
-...+|++++||+++...+++ T Consensus 1 m~~~~~---~~~t------~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~-~~~-r~i~~G~s~~~~~iG~~~~~~ 69 (334) T protein:vir:80 1 MTYPAA---NTHT------RPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASW-MNV-RSLRGTNQLRVDRVGASTIAG 69 (334) T ss_pred CCCCcC---CCcc------ccccccccchheehhhhhhhHHHHHHHHhhhhhcc-cee-eeccccceEEEeeecceeeee Confidence 321111 1111 112222222 122399999898888777664433 222 244679999999999999999 Q ss_pred ccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc---- Q lcl|Aclame:pro 79 YKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH---- 154 (319) Q Consensus 79 Y~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~---- 154 (319) |++++......+..+..+++||+.+|+.|.||++|..|+.. ++...+++++.++++.+.|++.+..++..+... T Consensus 70 ~~~g~~l~~~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~--D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~ 147 (334) T protein:vir:80 70 RKAGEELVVQKNVSDKLNLTVDTVLYARHFFDKFDEWTSNL--DVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAH 147 (334) T ss_pred ecCCCCCCCCCcccCceEEEEeeeeehhhhHhhHHHHhcCc--chHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc Confidence 99999999999999999999999999999999999999875 567899999999999999999987776443211 Q ss_pred -------------------ccccCCHhHHHHHHHHHHHHHHhccCC----CCcEEEEChHHHHHHhhhhhhhhcc--cc- Q lcl|Aclame:pro 155 -------------------LTVGTGSDAQYDAVLDVSVELDEIKAP----ENRVLFVSPTFYKGIKKFVIALPQG--DT- 208 (319) Q Consensus 155 -------------------~~~~~T~~n~~~~i~~a~~~Lde~~VP----~~R~l~VsP~~~~~L~~~~~f~~~~--~~- 208 (319) .....+++.++.+++++.+.|+|++|| .+||++|+|++|.+|+++++|.... .. 
T Consensus 148 ~~~~~~~G~~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~ 227 (334) T protein:vir:80 148 LKPAFHDGILLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKE 227 (334) T ss_pred ccccccCCcceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceecccc Confidence 011123556789999999999999999 2699999999999999999998652 11 Q ss_pred cccceeeeeeeeecCeEEEEecccc---------cccce----------EEEEcCCceeeeeeee-eeeeecCCCCCccc Q lcl|Aclame:pro 209 RQQVLGKGVQGELDGFVIVKVPTKL---------LQGLQ----------AIAVVGEVLASPIQAD-LAKTNSNIPGMFGT 268 (319) Q Consensus 209 ~~~~~~~g~Vg~idG~~I~~vps~~---------~~~~n----------~i~~~~~A~~~~~k~~-~~~~~~~~~~~~~~ 268 (319) +.....+|+|++++||+|+++++-- ....| .++.|++|+..+..++ ..|+++ ++.+++| T Consensus 228 ~~~~~~~g~i~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~-~~~~~~d 306 (334) T protein:vir:80 228 GGNSFVGGRIAMLNGVRVVETPRFPQSAITANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWE-EKKDFGH 306 (334) T ss_pred ccccccceeEEEEeceEEEeecCCCCccccccccccccccccccccceEEEEEeCceEEEEEEeecceeeee-chhhHHH Confidence 2345689999999999999853311 11112 3577999999988886 678887 5889999 Q ss_pred eeeeeeeeeEEEeccccceEEEEccccc Q lcl|Aclame:pro 269 LAEQLLYTGAFVPEHLQKYIFTIGGTEV 296 (319) Q Consensus 269 ~v~gr~~yg~~V~~~k~~~Iy~~~~~~~ 296 (319) .+++++.||+.++||+..++.--..+.| T Consensus 307 ~i~~~~a~G~g~lRPeaa~vv~~~~~~~ 334 (334) T protein:vir:80 307 YLDTFQSYNIGQRRPDAVAVHDITVTNP 334 (334) T ss_pred HHHHHHHcCCceeccceEEEEEEeeecC Confidence 9999999999999999877654454555 No 32 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=100.00 E-value=4e-36 Score=214.65 Aligned_cols=268 Identities=14% Similarity=0.054 Sum_probs=212.0 Q ss_pred hhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccC--cceeeeCCceEEeeecccc-ccccccCCCCcccCCccccee Q lcl|Aclame:pro 19 
FANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALIS--NDAIFMEGRSFTVMKGDTT-ELKDYKRNATNEFDHPKIEET 95 (319) Q Consensus 19 ~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n--~~~~~~~g~tVkIp~i~~~-g~~DY~r~~~~~~~~~t~t~~ 95 (319) +|+.-+..-.+-.+|.|.+.+.+.+.. .+.+..+++ ..+...+|++|+||.+... ...+|+.+.+.+++.++.+.. T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~-~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~ 79 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEK-KLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKR 79 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHh-hhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccchhhccccee Confidence 666667777777788887777766643 344444444 4456667999999999864 577899988999999999999 Q ss_pred EEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccCCHhHHHHHHHHHHHHH Q lcl|Aclame:pro 96 TYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDVSVEL 175 (319) Q Consensus 96 tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~~~~T~~n~~~~i~~a~~~L 175 (319) +.+|.| +++.|.++|++..++.. +++....+++++.++.++|+++++.+.+......+.. ..|+.|++|..+| T Consensus 80 ~~~i~~-~~~~~~i~D~~~~~~~~--d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~~~~~~a----~~~d~i~dA~~~l 152 (274) T protein:vir:12 80 EAKIRK-IAKGTSITDEALLSGYG--DPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADI----TKLNGLQSAIDKF 152 (274) T ss_pred eEEeee-ecceeeecHHHHHhccc--chHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc----cCHHHHHHHHHHh Confidence 999988 79999999999988865 4577788999999999999999988765544333322 3488999999999 Q ss_pred HhccCCCCcEEEEChHHHHHHhhhh--hhhhcccccccceeeeeeeeecCeEEEEecccccccceEEEEcCCceeeeee- Q lcl|Aclame:pro 176 DEIKAPENRVLFVSPTFYKGIKKFV--IALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQ- 252 (319) Q Consensus 176 de~~VP~~R~l~VsP~~~~~L~~~~--~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~~~k- 252 (319) ++.+. .+||++|+|++++.|++++ +|++..+.+..++++|.||++.|++|++. 
+.++....++.+++|+....| T Consensus 153 gd~~~-~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~G~ig~~~G~~Vi~s--~~~p~~t~~l~~~gA~~~~~~~ 229 (274) T protein:vir:12 153 NDEDL-EPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRS--NKLEAGTAILAKKGAVKLILKR 229 (274) T ss_pred ccccc-cccEEEeCHHHHHHHHhhhhhhccccccccccceecccceeecCeeEEEe--CCCCcceEEEEeccceeeeecC Confidence 88654 7899999999999999985 78988877888899999999999999974 344445556666777776544 Q ss_pred eeeeeeecCCCCCccceeeeeeeeeEEEeccccceEEEEccccccCCCCCccccccccccccccccC Q lcl|Aclame:pro 253 ADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDGVDAHADNVAKPSGSLEM 319 (319) Q Consensus 253 ~~~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 319 (319) -..+|..| +++++.+.+.+|++||++|++|.+..+ .+++.||+|| T Consensus 230 ~~~vE~~R-d~~~~~d~i~~~~~y~~~~~~~~~vv~---------------------~t~~~~~~~~ 274 (274) T protein:vir:12 230 DFFLEVAR-DASTKTTALYSDKHYVAYLYDESKAVK---------------------ITKGSGSLEM 274 (274) T ss_pred Cceecccc-chhhcccEEEeeeEEEEEEEcCCceEE---------------------EEcCCccccC Confidence 44788887 588899999999999999999887322 3567788888 No 33 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=100.00 E-value=1.1e-35 Score=212.16 Aligned_cols=268 Identities=15% Similarity=0.066 Sum_probs=214.7 Q ss_pred hhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccC--cceeeeCCceEEeeecccc-ccccccCCCCcccCCccccee Q lcl|Aclame:pro 19 FANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALIS--NDAIFMEGRSFTVMKGDTT-ELKDYKRNATNEFDHPKIEET 95 (319) Q Consensus 19 ~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n--~~~~~~~g~tVkIp~i~~~-g~~DY~r~~~~~~~~~t~t~~ 95 (319) -|+..+..-.+-.+|.|.+++.+.+. +.+.+..+++ +.+...+|++|+||.+... ...||+.+.+++++.++.+.. 
T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~-~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~lt~~~~ 79 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLE-KKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKR 79 (274) T ss_pred CCccceehhheechHHHHHHHHHhhh-hhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccccccccccee Confidence 46667777777788888887777764 4445444444 4456667999999999864 467899988999999999999 Q ss_pred EEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccCCHhHHHHHHHHHHHHH Q lcl|Aclame:pro 96 TYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDVSVEL 175 (319) Q Consensus 96 tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~~~~T~~n~~~~i~~a~~~L 175 (319) +.+++| +++.|.++|.+..++.. +++....+++++.++.++|++.++.+.+.+....+. ...|+.|++|..+| T Consensus 80 ~~~i~~-~~~~~~i~D~~~~~~~~--dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~~----~~~~d~i~dA~~~l 152 (274) T protein:vir:94 80 EAKIRK-IAKGTSITDEALLSGYG--DPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNAD----ITKLNGLQSAIDKF 152 (274) T ss_pred EEEeee-ecceecccHHHHHhccc--hHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccc----ccCHHHHHHHHHHh Confidence 999988 78999999999999765 457778899999999999999998876554333222 23488999999999 Q ss_pred HhccCCCCcEEEEChHHHHHHhhhh--hhhhcccccccceeeeeeeeecCeEEEEecccccccceEEEEcCCceeeeeee Q lcl|Aclame:pro 176 DEIKAPENRVLFVSPTFYKGIKKFV--IALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQA 253 (319) Q Consensus 176 de~~VP~~R~l~VsP~~~~~L~~~~--~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~~~k~ 253 (319) ++.++ .+||++|+|++++.|++++ +|++....++.++++|.||++.|++|++++ .++....++.+++|+....|. 
T Consensus 153 ~d~~~-~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~--~~p~~t~~l~~~gA~~~~~~~ 229 (274) T protein:vir:94 153 NDEDL-EPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTN--KLEAGTAILAKKGAVKLILKR 229 (274) T ss_pred hccCC-CceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCeeEEEcC--CCCcceEEEEeCcceEeeecC Confidence 98765 5899999999999999985 788888888888999999999999999743 455666778888888877665 Q ss_pred e-eeeeecCCCCCccceeeeeeeeeEEEeccccceEEEEccccccCCCCCccccccccccccccccC Q lcl|Aclame:pro 254 D-LAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDGVDAHADNVAKPSGSLEM 319 (319) Q Consensus 254 ~-~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 319 (319) . .+|..| +++++.+.+.+|+|||+++++|.+... .+++.||+|| T Consensus 230 ~~~vE~~R-d~~~~~d~i~~~~~y~~~~~~~~~vv~---------------------~t~~~~~~~~ 274 (274) T protein:vir:94 230 DFFLEVAR-DASTKTTALYSDKHYVAYLYDESKAVK---------------------ITKGSGSLEM 274 (274) T ss_pred Cceecccc-chhhcccEEEEEEEEEEEEEcCCceEE---------------------EecCcccccC Confidence 4 788887 588899999999999999999987322 3456778888 No 34 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=100.00 E-value=1.1e-35 Score=212.16 Aligned_cols=268 Identities=15% Similarity=0.066 Sum_probs=214.7 Q ss_pred hhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccC--cceeeeCCceEEeeecccc-ccccccCCCCcccCCccccee Q lcl|Aclame:pro 19 FANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALIS--NDAIFMEGRSFTVMKGDTT-ELKDYKRNATNEFDHPKIEET 95 (319) Q Consensus 19 ~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n--~~~~~~~g~tVkIp~i~~~-g~~DY~r~~~~~~~~~t~t~~ 95 (319) -|+..+..-.+-.+|.|.+++.+.+. +.+.+..+++ +.+...+|++|+||.+... ...||+.+.+++++.++.+.. 
T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~-~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~lt~~~~ 79 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLE-KKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKR 79 (274) T ss_pred CCccceehhheechHHHHHHHHHhhh-hhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccccccccccee Confidence 46667777777788888887777764 4445444444 4456667999999999864 467899988999999999999 Q ss_pred EEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccCCHhHHHHHHHHHHHHH Q lcl|Aclame:pro 96 TYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDVSVEL 175 (319) Q Consensus 96 tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~~~~T~~n~~~~i~~a~~~L 175 (319) +.+++| +++.|.++|.+..++.. +++....+++++.++.++|++.++.+.+.+....+. ...|+.|++|..+| T Consensus 80 ~~~i~~-~~~~~~i~D~~~~~~~~--dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~~----~~~~d~i~dA~~~l 152 (274) T protein:vir:97 80 EAKIRK-IAKGTSITDEALLSGYG--DPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNAD----ITKLNGLQSAIDKF 152 (274) T ss_pred EEEeee-ecceecccHHHHHhccc--hHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccc----ccCHHHHHHHHHHh Confidence 999988 78999999999999765 457778899999999999999998876554333222 23488999999999 Q ss_pred HhccCCCCcEEEEChHHHHHHhhhh--hhhhcccccccceeeeeeeeecCeEEEEecccccccceEEEEcCCceeeeeee Q lcl|Aclame:pro 176 DEIKAPENRVLFVSPTFYKGIKKFV--IALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQA 253 (319) Q Consensus 176 de~~VP~~R~l~VsP~~~~~L~~~~--~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~~~k~ 253 (319) ++.++ .+||++|+|++++.|++++ +|++....++.++++|.||++.|++|++++ .++....++.+++|+....|. 
T Consensus 153 ~d~~~-~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~--~~p~~t~~l~~~gA~~~~~~~ 229 (274) T protein:vir:97 153 NDEDL-EPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTN--KLEAGTAILAKKGAVKLILKR 229 (274) T ss_pred hccCC-CceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCeeEEEcC--CCCcceEEEEeCcceEeeecC Confidence 98765 5899999999999999985 788888888888999999999999999743 455666778888888877665 Q ss_pred e-eeeeecCCCCCccceeeeeeeeeEEEeccccceEEEEccccccCCCCCccccccccccccccccC Q lcl|Aclame:pro 254 D-LAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDGVDAHADNVAKPSGSLEM 319 (319) Q Consensus 254 ~-~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 319 (319) . .+|..| +++++.+.+.+|+|||+++++|.+... .+++.||+|| T Consensus 230 ~~~vE~~R-d~~~~~d~i~~~~~y~~~~~~~~~vv~---------------------~t~~~~~~~~ 274 (274) T protein:vir:97 230 DFFLEVAR-DASTKTTALYSDKHYVAYLYDESKAVK---------------------ITKGSGSLEM 274 (274) T ss_pred Cceecccc-chhhcccEEEEEEEEEEEEEcCCceEE---------------------EecCcccccC Confidence 4 788887 588899999999999999999987322 3456778888 No 35 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=100.00 E-value=3.8e-35 Score=209.31 Aligned_cols=268 Identities=13% Similarity=0.048 Sum_probs=209.9 Q ss_pred hhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhccc--CcceeeeCCceEEeeeccc-cccccccCCCCcccCCccccee Q lcl|Aclame:pro 19 FANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALI--SNDAIFMEGRSFTVMKGDT-TELKDYKRNATNEFDHPKIEET 95 (319) Q Consensus 19 ~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~--n~~~~~~~g~tVkIp~i~~-~g~~DY~r~~~~~~~~~t~t~~ 95 (319) +|+.-+....+--+|.|++++.+.+. +++....++ ++.+...+|++|+||.+.. ....||+.+.+.+++.++.+.. 
T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~-~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g~~i~~~~it~~~~ 79 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELD-KKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTSKR 79 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHH-hhhhhcccccccccccCCCCCEEEEEeeccCCCccccCCCCcCchhhccccee Confidence 56666777777778888777766664 333333333 3445666799999999985 4688999888999999999999 Q ss_pred EEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccCCHhHHHHHHHHHHHHH Q lcl|Aclame:pro 96 TYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDVSVEL 175 (319) Q Consensus 96 tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~~~~T~~n~~~~i~~a~~~L 175 (319) ++++++ +++.|.++|.+..++.. +++....+++++.++..+|.++++.+.......... ...|+.|++|..+| T Consensus 80 ~~~i~~-~~~~~~i~D~~~~~~~~--d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~~~~~~----~~~~d~i~dA~~~l 152 (274) T protein:vir:96 80 EAKVRK-IGKGTELTDEAVLSGFG--DPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEAD----ITKLDGLQTAIDKF 152 (274) T ss_pred EEEEEe-eeceeeecHHHHHhhcc--hHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCcc----cccHHHHHHHHHHh Confidence 999987 79999999999888754 567778899999999999999998875543322222 23489999999999 Q ss_pred HhccCCCCcEEEEChHHHHHHhhhh--hhhhcccccccceeeeeeeeecCeEEEEecccccccceEEEEcCCceeeeeee Q lcl|Aclame:pro 176 DEIKAPENRVLFVSPTFYKGIKKFV--IALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQA 253 (319) Q Consensus 176 de~~VP~~R~l~VsP~~~~~L~~~~--~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~~~k~ 253 (319) ++.++ .+||++|+|++++.|+++. +|.+..+.++..+++|.||++.|++|++. +.++....++.+++|+.+..|. 
T Consensus 153 ~d~~~-~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~g~ig~~~G~~Vi~s--~~~p~~t~~l~~~gA~~~~~~~ 229 (274) T protein:vir:96 153 NDEDL-EPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRS--NKLNKGEALLAKKGAVKLITKR 229 (274) T ss_pred cccCC-CceEEEeCHHHHHHHHhcccccccccccccccceeecccceecCeeEEEc--CCCCcceEEEEeCcceeeeecC Confidence 99875 6899999999999998874 78888887888899999999999999974 3455666778889999998887 Q ss_pred e-eeeeecCCCCCccceeeeeeeeeEEEeccccceEEEEccccccCCCCCcc Q lcl|Aclame:pro 254 D-LAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDGVD 304 (319) Q Consensus 254 ~-~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~~~ 304 (319) . .+|..| ++.++++.+.+|++||+++++|.+..+ +..+++. .++ T Consensus 230 ~~~vE~~R-d~~~~~d~i~~~~~yg~~~~~~~~vv~-~t~~~~~-----~~~ 274 (274) T protein:vir:96 230 DFFLEKDR-DASRKSTALYSDKHYVAYLYDESKVVK-ITKGAGD-----EVM 274 (274) T ss_pred Cccccccc-chhhcccEEEEeeEEEEEEEcCccEEE-EEcCccc-----ccC Confidence 6 788877 588899999999999999999988433 2222221 122 No 36 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=100.00 E-value=3.5e-35 Score=209.47 Aligned_cols=294 Identities=10% Similarity=-0.023 Sum_probs=215.0 Q ss_pred hhhhh-hcchh-------hhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccccCCCCcccCC Q lcl|Aclame:pro 18 HFANK-SVEPG-------QTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRNATNEFDH 89 (319) Q Consensus 18 ~~~~~-~~~~n-------~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~r~~~~~~~~ 89 (319) |.-.| ...|. ..-+-|+|.+.+++.|...+...... +.....+|++++||.++...+++|+..+...... 
T Consensus 1 ms~~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~--~~rti~~gkS~q~~~iG~~~~~~~~~G~~ld~~~ 78 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWF--DVQEVVGTNSVSNKYIGETELQVLSPGKSPDASP 78 (364) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcc--eeeeecccceEEeeeeeeeEEeeeccCcccCCCC Confidence 32222 22221 12244899999999998777654332 3345679999999999999999999888887788 Q ss_pred cccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----Cc------------ Q lcl|Aclame:pro 90 PKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNK----AK------------ 153 (319) Q Consensus 90 ~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a----~~------------ 153 (319) +..+..+++||+-+++.+.||++|..|+..+. +-..+++++.++++...|++.+..+...+ .. T Consensus 79 ~~~~k~~itID~ll~a~~~V~diDe~q~~~D~-vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~~~g~ 157 (364) T protein:vir:10 79 TEFDKNRLVVDTTVIARNTVAHFHDVQNDIDG-LKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRVAGHGF 157 (364) T ss_pred cccCcEEEEecceeeechhhhhHHHHhcCccc-hhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCcccCCcc Confidence 88888899999999999999999999887541 35688899999999999999876553221 00 Q ss_pred -------cccccCCHhHHHHHHHHHHHHHHhccCC-CCcEEEEChHHHHHHhhhhhhhhcc--cccccceeeeeeeeecC Q lcl|Aclame:pro 154 -------HLTVGTGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKFVIALPQG--DTRQQVLGKGVQGELDG 223 (319) Q Consensus 154 -------~~~~~~T~~n~~~~i~~a~~~Lde~~VP-~~R~l~VsP~~~~~L~~~~~f~~~~--~~~~~~~~~g~Vg~idG 223 (319) .....+++.+++++|.++.+.|||++|| ++||++|+|++|.+|+++++|.... 
..+....++|+|++++| T Consensus 158 ~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~~lvn~d~~~~~~~~~~~G~v~~v~G 237 (364) T protein:vir:10 158 SIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDADRIVDKSYTIAASDNTVDGFVLKSWN 237 (364) T ss_pred eeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCCccccccccccCCCccccceeEEEec Confidence 0011223467899999999999999999 6999999999999999999987432 22445678999999999 Q ss_pred eEEEEecccc----------------------c---------ccceEEEEcCCceeeeeee-eeeeeecCCCCCccceee Q lcl|Aclame:pro 224 FVIVKVPTKL----------------------L---------QGLQAIAVVGEVLASPIQA-DLAKTNSNIPGMFGTLAE 271 (319) Q Consensus 224 ~~I~~vps~~----------------------~---------~~~n~i~~~~~A~~~~~k~-~~~~~~~~~~~~~~~~v~ 271 (319) |+|+++|+-- . ...-.++.||.|+..++.+ -.++.++ .+.+++|.+. T Consensus 238 v~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~-~~~~~~~~id 316 (364) T protein:vir:10 238 TPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFY-EKKEKTWYID 316 (364) T ss_pred eEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeee-ccceeeeeee Confidence 9999864321 0 0123678999999988887 5677776 5888999999 Q ss_pred eeeeeeEEEeccccceEEEEccccccC-CCC---Cccccccccccccc Q lcl|Aclame:pro 272 QLLYTGAFVPEHLQKYIFTIGGTEVAT-KRD---GVDAHADNVAKPSG 315 (319) Q Consensus 272 gr~~yg~~V~~~k~~~Iy~~~~~~~a~-~~~---~~~~~~~~~~~~~~ 315 (319) .+.-||+.++||...+++...++...+ .-. 
.-.-.+.+-+|+++ T Consensus 317 a~~a~G~g~lRPeaa~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 364 (364) T protein:vir:10 317 TFLAEGAIPDRWEAVAVVTAADTAELATDHNAILARANRKVTLTKSVN 364 (364) T ss_pred eehcccCcccCccceEEEEecCCCCCccchhhhhhhccccEEEEEecC Confidence 999999999999987775433222221 111 11112334455666 No 37 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=100.00 E-value=9.4e-35 Score=207.13 Aligned_cols=289 Identities=12% Similarity=0.018 Sum_probs=189.6 Q ss_pred chhhhhhhHhhHHHHHHHHHhhhhhhhcccCcce--eee--CCceEEeeeccccccccccCC-----CCcccCCccccee Q lcl|Aclame:pro 25 EPGQTLLKNKHVGILERVTAVNAYSTPALISNDA--IFM--EGRSFTVMKGDTTELKDYKRN-----ATNEFDHPKIEET 95 (319) Q Consensus 25 ~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~--~~~--~g~tVkIp~i~~~g~~DY~r~-----~~~~~~~~t~t~~ 95 (319) =+|++--++.|.+.+.+.+. +++++..++||+| ++. .||+|+||......+.||+.. ....+++++.+.. T Consensus 1 Ma~~~~~p~~~a~~~l~~l~-~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (392) T protein:vir:99 1 MANAFSKPTAVVDTAIQMLQ-NELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTEDSF 79 (392) T ss_pred CccccccHHHHHHHHHHHHH-hhccchhhhccccccccccCCCCeEEEeecccccceeeeccccccCCcccccccccceE Confidence 23455566777766555553 4556778899998 443 599999999999999999753 2456678889999 Q ss_pred EEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC--ccccccCCHhHHHHHHHHHHH Q lcl|Aclame:pro 96 TYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKA--KHLTVGTGSDAQYDAVLDVSV 173 (319) Q Consensus 96 tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~--~~~~~~~T~~n~~~~i~~a~~ 173 (319) +++|||+||+.|.||+.|..+... ++.....+++.++|++++|.++++.++.... 
......+++.++|+.|.++.+ T Consensus 80 ~~~id~~k~~~~~i~d~e~~~~~~--~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~~~~~~~~~~~~~~~~~~i~~a~~ 157 (392) T protein:vir:99 80 PVTLTDVAYHLGVLTDEELTFDLE--SFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGARR 157 (392) T ss_pred EEEEeeeeecceeechHHHhhhhh--hhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccChhhhHHHHHHHHH Confidence 999999999999999888877654 5566778999999999999999987765332 233445678999999999999 Q ss_pred HHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccc---cceeeeeeeeecCeEEEEecccccccceEEEEcCCceeee Q lcl|Aclame:pro 174 ELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQ---QVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASP 250 (319) Q Consensus 174 ~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~---~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~~ 250 (319) +|+|++||.+||++++|+++..|+++++|.+....+. ..+++|.||+++||+|++.++. .....+++|+++.+++ T Consensus 158 ~L~~~~vP~~R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~~~--~~~t~~a~~~~a~~~a 235 (392) T protein:vir:99 158 ALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLI--PHGDAYLYHPTAFIMA 235 (392) T ss_pred HHhhcCCCCCCEEEEcHHHHHHHhcccceeecccccchhhhhhhcceeeeeeeeEEEeeccc--ccccceeeeccccccc Confidence 9999999999999999999999999999998766654 4578999999999999975432 2334577888888776 Q ss_pred eeeeeeeeecCCCCC--cc-ceeeeee-----------------eeeEEEecc-ccceEEEEcccc---ccCCCCCcccc Q lcl|Aclame:pro 251 IQADLAKTNSNIPGM--FG-TLAEQLL-----------------YTGAFVPEH-LQKYIFTIGGTE---VATKRDGVDAH 306 (319) Q Consensus 251 ~k~~~~~~~~~~~~~--~~-~~v~gr~-----------------~yg~~V~~~-k~~~Iy~~~~~~---~a~~~~~~~~~ 306 (319) .+......... ... .+ ..+.++. ++|..++.. ...+++...... ......+.... 
T Consensus 236 t~a~v~~~~~~-~~~s~s~~~~v~~~~~~~~~~t~~s~~~~v~~~~g~~~v~~~~~~~~~~~~~~~~~~~~v~v~~v~~~ 314 (392) T protein:vir:99 236 TRAPAPPMGAV-RSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPEAGA 314 (392) T ss_pred ccccccccccc-ceeEEecccceecceeecccceeeccccccceeEEEEEEeeccccceeeeeeeeeecceeeeeeeecc Confidence 65332211110 000 00 0111111 112222111 111121111000 00001122222 Q ss_pred ccccccccccccC Q lcl|Aclame:pro 307 ADNVAKPSGSLEM 319 (319) Q Consensus 307 ~~~~~~~~~~~~~ 319 (319) ....+...|...- T Consensus 315 ~~~~~~~~~~~~~ 327 (392) T protein:vir:99 315 NATITAAAGEDHT 327 (392) T ss_pred cceeEeeecccee Confidence 2233333333221 No 38 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=100.00 E-value=1.7e-33 Score=200.24 Aligned_cols=270 Identities=14% Similarity=0.029 Sum_probs=205.9 Q ss_pred hhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhc-ccCcceeeeCCceEEeeecccc-ccccccCCCCcccCCccccee Q lcl|Aclame:pro 18 HFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPA-LISNDAIFMEGRSFTVMKGDTT-ELKDYKRNATNEFDHPKIEET 95 (319) Q Consensus 18 ~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~-~~n~~~~~~~g~tVkIp~i~~~-g~~DY~r~~~~~~~~~t~t~~ 95 (319) |=--+.+....+--+|.|++++.+.+.....-+.. .+++++...+|++|+||.+... ...+|..+...+++.++.+.. 
T Consensus 1 ~~~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~~~ 80 (275) T protein:vir:96 1 MALENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEGEEIPIDLIETKKR 80 (275) T ss_pred CCCcccchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCccccccCCCCcchhhccccee Confidence 32224455666666777777666666443332222 3467778888999999999875 477899888999999999999 Q ss_pred EEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccCCHhHHHHHHHHHHHHH Q lcl|Aclame:pro 96 TYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDVSVEL 175 (319) Q Consensus 96 tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~~~~T~~n~~~~i~~a~~~L 175 (319) +.++.+ +++.|.++|.+..++.. ++.....+++++.++..+|+.+++.+.+......+.. .-|+.|++|..+| T Consensus 81 ~~~i~~-~~~~~~i~D~~~~~~~~--d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~~~~~~~----~~~d~i~dA~~~l 153 (275) T protein:vir:96 81 QATIRK-IGKGTVLTDEALLSGYG--DPKGEAVRQHGLAIANKVDNDVLEALQGATLKVEADI----TKLAGLQTAIDKF 153 (275) T ss_pred eEEeeh-hcccccccHHHHHhhcc--chHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc----cCHHHHHHHHHHh Confidence 999965 89999999999888754 4567778999999999999999987765443333332 3488899999999 Q ss_pred HhccCCCCcEEEEChHHHHHHhhhh--hhhhcccccccceeeeeeeeecCeEEEEecccccccceEEEEcCCceeeeeee Q lcl|Aclame:pro 176 DEIKAPENRVLFVSPTFYKGIKKFV--IALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQA 253 (319) Q Consensus 176 de~~VP~~R~l~VsP~~~~~L~~~~--~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~~~k~ 253 (319) .+.+. ++||++|+|++++.|+++. +|++....+...+++|.||++.|++|++. +..+....++.+++|+.+..|. 
T Consensus 154 gd~~~-~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~~~G~~Vi~s--~~~p~~t~~i~~~gA~~~~~~~ 230 (275) T protein:vir:96 154 NDEDL-EPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGEALGAIIVRS--NKIKEGEAILAKRGAVKLITKR 230 (275) T ss_pred ccccC-CccEEEeCHHHHHHHHhcccccccccccccccceeccccceecCeeEEEe--CCCCcceEEEEeccceeeeecC Confidence 87644 6899999999999998874 78888777888899999999999999974 3455556677788888888776 Q ss_pred e-eeeeecCCCCCccceeeeeeeeeEEEeccccceEEEEccccccCCCCCc Q lcl|Aclame:pro 254 D-LAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDGV 303 (319) Q Consensus 254 ~-~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~~ 303 (319) . .+|..| .++++.+.+.+|++||++|+++.+... -.. .|+.. |+ T Consensus 231 ~~~vE~~R-d~~~~~d~i~~~~~y~~~~~~~~~vv~--~t~-~~~~~--~~ 275 (275) T protein:vir:96 231 DFFLETER-HASHKSTALFSDKHYVAYLYDESKVVK--ITK-SASGL--GV 275 (275) T ss_pred Cccccccc-chhhcCcEEEEeEEEEEEEEcCccEEE--EEe-ccccc--CC Confidence 5 788888 588899999999999999999987432 221 12221 22 No 39 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=100.00 E-value=5.5e-32 Score=191.98 Aligned_cols=264 Identities=14% Similarity=0.048 Sum_probs=205.1 Q ss_pred hhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhc--ccCcceeeeCCceEEeeeccccc-cccccCCCCcccCCccccee Q lcl|Aclame:pro 19 FANKSVEPGQTLLKNKHVGILERVTAVNAYSTPA--LISNDAIFMEGRSFTVMKGDTTE-LKDYKRNATNEFDHPKIEET 95 (319) Q Consensus 19 ~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~--~~n~~~~~~~g~tVkIp~i~~~g-~~DY~r~~~~~~~~~t~t~~ 95 (319) -|+.-++...+--+|.|.+.+.+.+..+ +.... ..++.+...+|++|+||++...| ..+|..+...+++.++.+.. 
T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~-~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg~~i~~~~lt~~~~ 79 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKA-LRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEGGEISLDKIGTTTK 79 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhh-hhhccccccccccccCCCCEEEEeeeccCccccccCCCCccChhhcCCcce Confidence 4666677777777888877776665433 33333 33566777789999999998765 34688788889999999999 Q ss_pred EEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccCCHhHHHHHHHHHHHHH Q lcl|Aclame:pro 96 TYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDVSVEL 175 (319) Q Consensus 96 tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~~~~T~~n~~~~i~~a~~~L 175 (319) ++++.+ +++.|.++|+|..++.. +++....+++++.++.++|+++++.+.... ...+....++.|.+|..+| T Consensus 80 ~~~i~~-~~k~~~vtD~~~~~~~~--d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~-----~~~~~~~~~d~i~~A~~~l 151 (272) T protein:vir:36 80 SVTIKK-AAKGTEITDEAALSGYG--DPIGESNKQLGLSLANKVDDDLLSAAKTTS-----QTVSTKANVDGVQAALDIF 151 (272) T ss_pred eEeeeh-hhccccccHHHHhhccc--hHHHHHHHHHHHHHHHHHHHHHHHHhcccc-----ccccccccHHHHHHHHHHh Confidence 999976 68899999999988754 567788899999999999999988764322 1234455688999999999 Q ss_pred HhccCCCCcEEEEChHHHHHHhhhhhhhhcccc-cccceeeeeeeeecCeEEEEeccccccc---ceEEEEcCCceeeee Q lcl|Aclame:pro 176 DEIKAPENRVLFVSPTFYKGIKKFVIALPQGDT-RQQVLGKGVQGELDGFVIVKVPTKLLQG---LQAIAVVGEVLASPI 251 (319) Q Consensus 176 de~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~-~~~~~~~g~Vg~idG~~I~~vps~~~~~---~n~i~~~~~A~~~~~ 251 (319) .+.+++ .|+++|+|.++..|+++.+|....+. ++..+++|.||++.|++|+++++ ...+ +..++.+++|+.... 
T Consensus 152 gd~~~~-~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~-~p~~~~~~~~~~~~~gA~~~~~ 229 (272) T protein:vir:36 152 NDEDAQ-AYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKK-LAEGSALMFKIVSNSPALKLVL 229 (272) T ss_pred hhcCCC-ceEEEEcHHHHHHHhcccccccccccccccceeeeccceecCeeEEEeCC-CCCCceeEEEEEecccceeeee Confidence 998876 68999999999999999998876544 45678999999999999996433 2222 234667788886554 Q ss_pred eee-eeeeecCCCCCccceeeeeeeeeEEEeccccceEEEEccc Q lcl|Aclame:pro 252 QAD-LAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGT 294 (319) Q Consensus 252 k~~-~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~ 294 (319) |.. .+|..| .++++.+.+++|++||++|++|++......++. T Consensus 230 ~~~~~vE~~R-~~~~~~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 230 KRGVQVETDR-DIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred cCCccccccc-chhhcCcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 444 888887 588899999999999999999998666656655 No 40 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=99.96 E-value=9.9e-32 Score=190.56 Aligned_cols=281 Identities=9% Similarity=0.017 Sum_probs=173.8 Q ss_pred hhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceee---eCCceEEeeeccccccccccCCCCcccCCcccceeEEE Q lcl|Aclame:pro 22 KSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIF---MEGRSFTVMKGDTTELKDYKRNATNEFDHPKIEETTYF 98 (319) Q Consensus 22 ~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~---~~g~tVkIp~i~~~g~~DY~r~~~~~~~~~t~t~~tlt 98 (319) =....|+|--.+.|...+-+.+ .++++...++|++|+. ..||+|+||.++...++||. 
...+++++....+++ T Consensus 1 m~~~~N~~ltp~iia~~~l~~l-~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~~~v~dg~---~~~~~~~te~~v~l~ 76 (418) T protein:vir:10 1 MAVQDNNLLTDDVIAKEALRLL-KNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYRVKSASGR---TLVKQPMVDQTIPFK 76 (418) T ss_pred CCccccccccHHHHHHHHHHHH-HHhccchhhhcCCCchHHhhCCCEEEEeeCCceeecccC---CccccccccceEEEE Confidence 1233455544577765555555 3555677889999844 35899999999999999975 456788899999999 Q ss_pred EeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccCCHhHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 99 LDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDVSVELDEI 178 (319) Q Consensus 99 idqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~~~~T~~n~~~~i~~a~~~Lde~ 178 (319) |||+||+.|.||+.|..+.. ..+.....+++.++|++++|+++++.+... +.......+..+.|+.|.++.++|+++ T Consensus 77 id~~k~~~~~itD~e~a~~~--~d~~~~~l~~A~~aLA~~vD~~ia~l~~~a-~~~~gt~gt~~~~~~~i~~a~~~Ld~~ 153 (418) T protein:vir:10 77 IAYQEHVGLEYTVKDKTLDI--MQFSERYLKSGMVQIANQIDRSLALTLKKA-FHSSGTPGVRPGAFIDFANAGAKQTTY 153 (418) T ss_pred EecccccceeechHHHhhhh--hHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-ccccccCCcCcchHHHHHHHHHHHHhc Confidence 99999999999999987765 466777789999999999999998865543 333333445567899999999999999 Q ss_pred cCC-CC-cEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEEEEeccccccc------ceEEEEcCCceeee Q lcl|Aclame:pro 179 KAP-EN-RVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQG------LQAIAVVGEVLASP 250 (319) Q Consensus 179 ~VP-~~-R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~~~------~n~i~~~~~A~~~~ 250 (319) +|| ++ ||++++|++|..|+++++|.......+..+++|.||++.||+|+++++-...+ ..++.+. . 
T Consensus 154 ~VP~~G~R~lVv~P~~~~~L~~~~~~~~~~~~~~~~lr~G~IG~i~GF~V~~S~nip~~tag~~~~t~~v~ga---~--- 227 (418) T protein:vir:10 154 AVPQDGMRHAVLDPFTCASLSDEVTKLFKESMVEQAYKMGYRGNVAAYEVYESQNLPKHTVGDHGGTPLVNGT---V--- 227 (418) T ss_pred CCCCCCceEEEeCHHHHHHHhhhccccccccccchhhheeeeeeeeceEEEEecCCCcccccccccceeeecc---c--- Confidence 999 54 99999999999999999887765555567999999999999999864422111 1122111 0 Q ss_pred eeeeeeeeecCCCCCccceeeeeeeeeEEEecc------------ccceEEEEccccc---------------------- Q lcl|Aclame:pro 251 IQADLAKTNSNIPGMFGTLAEQLLYTGAFVPEH------------LQKYIFTIGGTEV---------------------- 296 (319) Q Consensus 251 ~k~~~~~~~~~~~~~~~~~v~gr~~yg~~V~~~------------k~~~Iy~~~~~~~---------------------- 296 (319) .....+.+.......-|-+. ..|.+.+.- ...--|+...... T Consensus 228 ~~~~~~~~~~~t~s~~g~l~----~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~~~~~~~tv~i~p~~~~~~~~~ 303 (418) T protein:vir:10 228 VNGDTVGFDGGTASTTGFLK----AGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDTDAGGAGSIKISPSLNDGTATI 303 (418) T ss_pred ccceeEEEeecceeecccee----eccEEEECceeecccccccccccceEEEEEeeccccccCcceeEeccccccccccc Confidence 11111111111000011111 122221110 0111232221110 Q ss_pred ------------cCCCCCcccccccccc-ccccccC Q lcl|Aclame:pro 297 ------------ATKRDGVDAHADNVAK-PSGSLEM 319 (319) Q Consensus 297 ------------a~~~~~~~~~~~~~~~-~~~~~~~ 319 (319) -...++..++.+..|. -.+++.. 
T Consensus 304 ~~~~~~~~~~~~~~~v~a~~a~~~~it~~~~a~~~~ 339 (418) T protein:vir:10 304 NNENGDPVSLTAYQNVTALPADNAPITVLGAANTTY 339 (418) T ss_pred cccccccccccCCCcccccccCcceeeeecccccce Confidence 0111111122222221 0000100 No 41 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.96 E-value=2.9e-31 Score=188.00 Aligned_cols=271 Identities=15% Similarity=0.053 Sum_probs=208.6 Q ss_pred hhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhh-cccCcceeeeCCceEEeeeccccc-cccccCCCCcccCCcccceeE Q lcl|Aclame:pro 19 FANKSVEPGQTLLKNKHVGILERVTAVNAYSTP-ALISNDAIFMEGRSFTVMKGDTTE-LKDYKRNATNEFDHPKIEETT 96 (319) Q Consensus 19 ~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~-~~~n~~~~~~~g~tVkIp~i~~~g-~~DY~r~~~~~~~~~t~t~~t 96 (319) +|+.-+....+--+|.|.+.+.+.+..+..-+. ..+++++...+|++|+||.+...| ..+|..+...+++.++.+..+ T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~i~~~~lt~~~~~ 80 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQKIPVDKIETNRRE 80 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCCCccCccccccceee Confidence 566667777777788887766666644433222 233566777789999999998764 456887888899999999999 Q ss_pred EEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccCCHhHHHHHHHHHHHHHH Q lcl|Aclame:pro 97 YFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDVSVELD 176 (319) Q Consensus 97 ltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~~~~T~~n~~~~i~~a~~~Ld 176 (319) .++. .+++.|.++|.+..++.. 
+++....++++..++..+|++.++.+........+..+ .|+.|++|..+|+ T Consensus 81 a~i~-~~~k~~~~tD~a~~~~~~--dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~~~~~~~----t~d~i~~A~~~lg 153 (276) T protein:vir:10 81 AKIH-KIGKGTDITDEALLSGYG--DPQGEAVRQHGLAIANKVDNDVLEALRGTKLTVSADIG----TLAGLEAAIDTFD 153 (276) T ss_pred EEee-hccccccccHHHHHhhcc--chHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc----CHHHHHHHHHHhc Confidence 9995 489999999999888754 55778889999999999999999887665544433333 3788999999998 Q ss_pred hccCCCCcEEEEChHHHHHHhhh--hhhhhcccccccceeeeeeeeecCeEEEEecccccccceEEEEcCCceeeeeeee Q lcl|Aclame:pro 177 EIKAPENRVLFVSPTFYKGIKKF--VIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQAD 254 (319) Q Consensus 177 e~~VP~~R~l~VsP~~~~~L~~~--~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~~~k~~ 254 (319) +.++ +.++++|+|.++..|+++ ++|++....++..+++|.||++.|++|+..+ ..+....++.+++|+....|-. T Consensus 154 d~~~-~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~--~~p~~t~~l~~~gAi~~~~~~~ 230 (276) T protein:vir:10 154 DEDL-EPMVLFINPKDAGKLRSSASDNFTRATELGDNIIVKGAFGEALGAVIVRSK--KLDEGEAILAKRGAVKLITKRD 230 (276) T ss_pred cccC-cccEEEEcHHHHHHHHHhccccccccccccccceeccccceecceeEEEcC--CCCcceEEEEeccceeeeecCC Confidence 8755 689999999999999775 6899887778788999999999999999743 3455666788889998776655 Q ss_pred -eeeeecCCCCCccceeeeeeeeeEEEeccccceEEEEccccccCCCCCc Q lcl|Aclame:pro 255 -LAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDGV 303 (319) Q Consensus 255 -~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~~ 303 (319) .+|..| .++++.+.+.+|++|+++++++.+..... . ++...++|. 
T Consensus 231 ~~vE~dR-d~~~~~d~i~~~~~y~~~~~~~~~vv~~t-~--~~~~~~~~~ 276 (276) T protein:vir:10 231 FFLETDR-DPSTKTTALYSDKHYVAYLYDESKAVKVT-K--GAGTTDSGA 276 (276) T ss_pred ceeeccc-chhhcccEEEEeeEEEEEEEcCcceEEEe-c--CCcCCcCCC Confidence 778777 58889999999999999999998743322 1 122323333 No 42 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=99.96 E-value=1.2e-31 Score=190.05 Aligned_cols=298 Identities=9% Similarity=0.010 Sum_probs=217.5 Q ss_pred hhhhhh-cchh-------hhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccccCCCCcccCC Q lcl|Aclame:pro 18 HFANKS-VEPG-------QTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRNATNEFDH 89 (319) Q Consensus 18 ~~~~~~-~~~n-------~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~r~~~~~~~~ 89 (319) |.-.|- ..|. ..-+-|+|.+.+++.|...+...... +.....+|++++||.++...++.|+......... T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~--~vrti~~GkS~qf~~iG~~~a~y~~~G~~ldg~~ 78 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYF--DVQTVTGTNTVSNKYLGETELQVLAPGQSPNATP 78 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcc--eeeeecccceEEEEEEeeeEEeeeccccccCCCC Confidence 222222 2221 12244999999999998777654332 3345669999999999999999998888777777 Q ss_pred cccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC--c-------------- Q lcl|Aclame:pro 90 PKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKA--K-------------- 153 (319) Q Consensus 90 ~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~--~-------------- 153 (319) +..+..+++||+-.++.+.||++|..|+..+. +-..+++++.++++...|++.+..+...+. . 
T Consensus 79 ~~~~k~~ItID~lL~a~~~V~diDeaq~~yD~-vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~~~~g~ 157 (402) T protein:vir:97 79 TQADKNQLVIDTTVIARNTVAHIHDVQGDIDS-LKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGF 157 (402) T ss_pred cccccEEEEeCceeechhhhhhHHHHHhcccc-hhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCccccccc Confidence 88888899999999999999999999987541 356888999999999999998876643221 0 Q ss_pred ----c---ccccCCHhHHHHHHHHHHHHHHhccCC-CCcEEEEChHHHHHHhhhhhhhhccc--ccccceeeeeeeeecC Q lcl|Aclame:pro 154 ----H---LTVGTGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKFVIALPQGD--TRQQVLGKGVQGELDG 223 (319) Q Consensus 154 ----~---~~~~~T~~n~~~~i~~a~~~Lde~~VP-~~R~l~VsP~~~~~L~~~~~f~~~~~--~~~~~~~~g~Vg~idG 223 (319) . ....+++.+++++|.++..+|||++|| .+|+++|+|++|.+|+++++|....- .+.+...+|+|++++| T Consensus 158 s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl~n~d~~~~~~g~~~~G~v~~v~G 237 (402) T protein:vir:97 158 SINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSYN 237 (402) T ss_pred ccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcccccchhhccccCCccccceeEEEec Confidence 0 001245678899999999999999999 69999999999999999999874422 2344578999999999 Q ss_pred eEEEEeccccc----------------c---------cceEEEEcCCceeeeeeee-eeeeecCCCCCccceeeeeeeee Q lcl|Aclame:pro 224 FVIVKVPTKLL----------------Q---------GLQAIAVVGEVLASPIQAD-LAKTNSNIPGMFGTLAEQLLYTG 277 (319) Q Consensus 224 ~~I~~vps~~~----------------~---------~~n~i~~~~~A~~~~~k~~-~~~~~~~~~~~~~~~v~gr~~yg 277 (319) |+|+++|+-.. . 
+.-.++.||.|+..+.=++ ..++++ .+.+++|.+....-|| T Consensus 238 v~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~-d~r~~~~~id~~~a~G 316 (402) T protein:vir:97 238 CPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFY-EKKEKTYYIDTFMAEG 316 (402) T ss_pred eEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhh-chhHHHHHHHHHHHhC Confidence 99998643210 0 0124567888877654332 234454 5788999999999999 Q ss_pred EEEeccccceEEE-EccccccCCCCCcccccccccccc-----ccccC Q lcl|Aclame:pro 278 AFVPEHLQKYIFT-IGGTEVATKRDGVDAHADNVAKPS-----GSLEM 319 (319) Q Consensus 278 ~~V~~~k~~~Iy~-~~~~~~a~~~~~~~~~~~~~~~~~-----~~~~~ 319 (319) +.++||...|+.. ..+.+++.++..++.|.+..+-.- -++|= T Consensus 317 ~g~~RPeaa~vv~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 364 (402) T protein:vir:97 317 AIPDRWEAVSVVTTKRDATTGDAGGPGDDHATVLARAQRKAVYVKTEG 364 (402) T ss_pred CcccCccceEEEEEecccccccCCccccchhhhhcccccceEEEeccc Confidence 9999999999854 344677777777777766554321 11221 No 43 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=99.96 E-value=6.2e-31 Score=186.21 Aligned_cols=284 Identities=14% Similarity=0.066 Sum_probs=209.2 Q ss_pred eeeeh---hhhhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccccCCCCccc Q lcl|Aclame:pro 11 MLKLN---LQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRNATNEF 87 (319) Q Consensus 11 ~~~~~---~~~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~r~~~~~~ 87 (319) |=.+| ..|.. +......+-| |.|++.+++.|..++..-....-+ ...+|++++||.++...+.+|++...... 
T Consensus 1 ms~~~~~tr~~~~-~s~~d~al~l-e~f~geV~~af~~~s~~~~~~~~r--ti~~g~s~~~~~iG~~~~~~~~pG~~l~~ 76 (335) T protein:vir:63 1 MSFLNDLTRPNYA-GKNADVDIHL-EEHLGIVDKHFAYTSKFAPLMNIR--DLRGSNVVRLDRLGNVEAKGRRAGEELER 76 (335) T ss_pred CCCcccchhhhcc-cccchhheeh-hhhhhhHHHHHHhhhhhcccccee--eeccceeEEEeeeeeeeeecccCCcCcCC Confidence 33332 22221 1222233334 999999988888777654332222 44689999999999999999999888888 Q ss_pred CCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccc----------- Q lcl|Aclame:pro 88 DHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLT----------- 156 (319) Q Consensus 88 ~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~----------- 156 (319) ..+..+..+++||.-.+..+.||++|..++.. ++...+++++.++++...|++.+-.++..+..... T Consensus 77 ~~~~~~k~~itVD~ll~a~~~I~dlDe~~~~y--DvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~G~ 154 (335) T protein:vir:63 77 SRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSF--DMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGV 154 (335) T ss_pred CCccccceEEEecceeechhhhhhHHHHhcCc--hhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCcCCCc Confidence 88888888999999999999999999888764 56788999999999999999998666554432110 Q ss_pred ------cc----CCHhHHHHHHHHHHHHHHhccCC-CC---cEEEEChHHHHHHhhhhhhhhcc-c-c-cccceeeeeee Q lcl|Aclame:pro 157 ------VG----TGSDAQYDAVLDVSVELDEIKAP-EN---RVLFVSPTFYKGIKKFVIALPQG-D-T-RQQVLGKGVQG 219 (319) Q Consensus 157 ------~~----~T~~n~~~~i~~a~~~Lde~~VP-~~---R~l~VsP~~~~~L~~~~~f~~~~-~-~-~~~~~~~g~Vg 219 (319) .+ ..++.+++++.++.++|+|++|| ++ ||++|+|++|.+|+++++|.... . . 
+.....+|+|+ T Consensus 155 ~~~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~ 234 (335) T protein:vir:63 155 LEKLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGATNDYVKSRVA 234 (335) T ss_pred ceeeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhccccccccccccccccccccCceeE Confidence 01 13456678888999999999999 34 99999999999999999988652 1 2 22356899999 Q ss_pred eecCeEEEEecccc---cc----------------cceEEEEcCCceeeeeeee-eeeeecCCCCCccceeeeeeeeeEE Q lcl|Aclame:pro 220 ELDGFVIVKVPTKL---LQ----------------GLQAIAVVGEVLASPIQAD-LAKTNSNIPGMFGTLAEQLLYTGAF 279 (319) Q Consensus 220 ~idG~~I~~vps~~---~~----------------~~n~i~~~~~A~~~~~k~~-~~~~~~~~~~~~~~~v~gr~~yg~~ 279 (319) ++.||+|+++|+-- .. ..-.++.|++|+..+..++ ..|.++ .+..++|++.+++-||+. T Consensus 235 ~v~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~-~~~~~~~~i~~~~a~G~g 313 (335) T protein:vir:63 235 ILNGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWE-DNEKFSWVLDTFQMYNIG 313 (335) T ss_pred EeeceEEEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEeecccceee-ccchhhHHhHHHHHcCCc Confidence 99999999864311 00 0135789999999998886 667776 477799999999999999 Q ss_pred EeccccceEEEEccccccCCCCCccccccc Q lcl|Aclame:pro 280 VPEHLQKYIFTIGGTEVATKRDGVDAHADN 309 (319) Q Consensus 280 V~~~k~~~Iy~~~~~~~a~~~~~~~~~~~~ 309 (319) ++||...++..-.+ . 
+..++|- T Consensus 314 ~lRPe~a~~i~~tg----~----~~~~~~~ 335 (335) T protein:vir:63 314 ARRPDTAGAIELKG----I----GAFDITA 335 (335) T ss_pred ccccceEEEEEEcC----C----CceeecC Confidence 99999876644322 1 1111111 No 44 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=99.95 E-value=1.2e-30 Score=184.54 Aligned_cols=284 Identities=14% Similarity=0.077 Sum_probs=209.9 Q ss_pred eeeeh---hhhhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccccCCCCccc Q lcl|Aclame:pro 11 MLKLN---LQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRNATNEF 87 (319) Q Consensus 11 ~~~~~---~~~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~r~~~~~~ 87 (319) |=..| .++. -+..+...+-| |.|++.+++.|..++..-.. .+.. ...+|++++||.++...+..++....... T Consensus 1 ms~~~~~t~~~~-~~s~~d~al~l-e~f~geV~~af~~~s~~~~~-~~~r-ti~~g~s~~~~~iG~~~~~~~~pG~~l~~ 76 (335) T protein:vir:78 1 MSFLNDLTRPNY-AGKNADVDIHL-EEHLGIVDKHFAYTSKFAPL-MNIR-DLRGSNVVRLDRLGNVEAKGRRAGEELER 76 (335) T ss_pred CCcccccccccc-ccccchhhhhh-hhhhhHHHHHHHHhhhhccc-ccee-eeccceeEEEeeeeeeeecccccCcccCC Confidence 32222 2222 12223344444 99999998888877765433 2322 44689999999999999988887777777 Q ss_pred CCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccc------------ Q lcl|Aclame:pro 88 DHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHL------------ 155 (319) Q Consensus 88 ~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~------------ 155 (319) ..+..+..+++||+-++..+.||++|..++.. ++...+++++.++++...|++.+..++..+.... 
T Consensus 77 ~~~~~~k~~itID~ll~a~~~VddlDe~~~~y--DvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~G~ 154 (335) T protein:vir:78 77 SRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSF--DMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGV 154 (335) T ss_pred CCcccCCeEEEecceeechhhHhhHHHhhcCc--hhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCCCc Confidence 78888999999999999999999999988874 5577899999999999999998877665442110 Q ss_pred ---------cccCCHhHHHHHHHHHHHHHHhccCCC----CcEEEEChHHHHHHhhhhhhhhcc-c-c-cccceeeeeee Q lcl|Aclame:pro 156 ---------TVGTGSDAQYDAVLDVSVELDEIKAPE----NRVLFVSPTFYKGIKKFVIALPQG-D-T-RQQVLGKGVQG 219 (319) Q Consensus 156 ---------~~~~T~~n~~~~i~~a~~~Lde~~VP~----~R~l~VsP~~~~~L~~~~~f~~~~-~-~-~~~~~~~g~Vg 219 (319) +....+..+.+++.++...|+|.+||+ +|+++|+|++|.+|+++++|.... . . +.....+|+|+ T Consensus 155 ~~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~ 234 (335) T protein:vir:78 155 LEKLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQATGATNDYVKSRVA 234 (335) T ss_pred ceeeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhcccccccccccccccccccccceeE Confidence 011234567888889999999999994 599999999999999999988652 1 2 22457899999 Q ss_pred eecCeEEEEeccccc---------c----------cceEEEEcCCceeeeeeee-eeeeecCCCCCccceeeeeeeeeEE Q lcl|Aclame:pro 220 ELDGFVIVKVPTKLL---------Q----------GLQAIAVVGEVLASPIQAD-LAKTNSNIPGMFGTLAEQLLYTGAF 279 (319) Q Consensus 220 ~idG~~I~~vps~~~---------~----------~~n~i~~~~~A~~~~~k~~-~~~~~~~~~~~~~~~v~gr~~yg~~ 279 (319) ++.||+|+++|+--. . .--..++|+.|+..+..++ ..|+++ .++.++|++.++.-||+. 
T Consensus 235 ~v~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~-~~~~~~~~i~~~~a~G~g 313 (335) T protein:vir:78 235 ILNGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWE-DHDQFSWVLDTFQMYNIG 313 (335) T ss_pred EeeceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEecccceee-ccchhhHhhhHHHHcCCc Confidence 999999998643110 0 0124679999999999988 558877 578899999999999999 Q ss_pred EeccccceEEEEccccccCCCCCccc Q lcl|Aclame:pro 280 VPEHLQKYIFTIGGTEVATKRDGVDA 305 (319) Q Consensus 280 V~~~k~~~Iy~~~~~~~a~~~~~~~~ 305 (319) ++||...++..-.+. .+-. .++ T Consensus 314 ~lRPe~a~~i~~tg~-~~~~---~~~ 335 (335) T protein:vir:78 314 ARRPDTAGAIELKGI-EAFD---ITA 335 (335) T ss_pred ccCcceEEEEEecCC-Cccc---ccC Confidence 999999776543321 1111 111 No 45 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=99.95 E-value=9.2e-30 Score=179.78 Aligned_cols=287 Identities=9% Similarity=-0.017 Sum_probs=172.1 Q ss_pred hhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCccee--e---eCCceEEeeeccccccccccCC--CCcccCCcc Q lcl|Aclame:pro 19 FANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAI--F---MEGRSFTVMKGDTTELKDYKRN--ATNEFDHPK 91 (319) Q Consensus 19 ~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~--~---~~g~tVkIp~i~~~g~~DY~r~--~~~~~~~~t 91 (319) -||+... ...+.+...+.+.+ +++++...++|++|. + ..||||+||+.....++||... 
++..+++++ T Consensus 1 MAN~llT----~iP~iia~~al~~l-~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~~~~~~~~~~~~~ 75 (423) T protein:vir:35 1 MANNLES----NISQIVLKKFLPGF-MSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSERTETGDITGKDKNGLF 75 (423) T ss_pred Cccchhh----hhHHHHHHHHHHHH-HhhcccchhcccCCCcccccccCCCEEEEeeCCcceeecccCcCCCCccccccc Confidence 2333221 12455544444444 456678888999994 3 2499999999999999999764 356778888 Q ss_pred cceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccCCHhHHHHHHHHH Q lcl|Aclame:pro 92 IEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDV 171 (319) Q Consensus 92 ~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~~~~T~~n~~~~i~~a 171 (319) ....+++|||.||+.|.+|+.|..+... ++.+.+ +.+..+++.++|.++++.++..+........+..+.|+.|.++ T Consensus 76 e~~v~l~id~~k~~a~~v~d~e~~l~i~--~~~~~l-~~a~~ala~~vd~~l~~~l~~~a~~~vgt~~t~~~~~~~i~~a 152 (423) T protein:vir:35 76 SAKATGKVGKYITVAVEWTQIEEALKLN--QLDQIL-SPIHERMVTDLETELAHFMMNNGALSLGSPNTAIKKWADVAQT 152 (423) T ss_pred cceeeEEeccceeccceeCHHHHHhhHH--HHHHHH-HHHHHHHHHHHHHHHHHHHhhccccccccccCCcchHHHHHHH Confidence 8889999999999999999999887654 444544 4566789999999999988776654333333555789999999 Q ss_pred HHHHHhccCC-CCcEEEEChHHHHHHhhhhh-hhhcccccccceeeeee-eeecCeEEEEecccccc---cceEEEEcCC Q lcl|Aclame:pro 172 SVELDEIKAP-ENRVLFVSPTFYKGIKKFVI-ALPQGDTRQQVLGKGVQ-GELDGFVIVKVPTKLLQ---GLQAIAVVGE 245 (319) Q Consensus 172 ~~~Lde~~VP-~~R~l~VsP~~~~~L~~~~~-f~~~~~~~~~~~~~g~V-g~idG~~I~~vps~~~~---~~n~i~~~~~ 245 (319) .++|++.+|| ++||++|+|+++..|+++++ |......+...+++|.| |++.||+|+++++-... +....+.... 
T Consensus 153 ~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnvp~~T~gt~~~~~~v~~ 232 (423) T protein:vir:35 153 ASFIKDIGIKTGENYAIMDPWSAQRLADAQSGLHAADQLVRTAWENAQISGNFGGIRALMSNGLASRKQGDFDGAITVKT 232 (423) T ss_pred HHHHHHhcCCcCCCEEEeCHHHHHHHhccccceeccccchhHHHhhccceeeecceEEEEcCCCccccccccccceeecc Confidence 9999999999 69999999999988887665 54444555667888876 99999999975332111 1111111112 Q ss_pred ceeeeeeeeeeeeecCCCCCc------------cceeeee--eeeeEEEeccccceE-----------EEEccccccCCC Q lcl|Aclame:pro 246 VLASPIQADLAKTNSNIPGMF------------GTLAEQL--LYTGAFVPEHLQKYI-----------FTIGGTEVATKR 300 (319) Q Consensus 246 A~~~~~k~~~~~~~~~~~~~~------------~~~v~gr--~~yg~~V~~~k~~~I-----------y~~~~~~~a~~~ 300 (319) +..... .... ....+ +-+..|. .|-|.+.+.+-.+.. |+........+. T Consensus 233 a~~v~~-----~a~~-~~~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~t~~~~~~~~t~~~~~~~V~~~~~~~a~ 306 (423) T protein:vir:35 233 APNVDY-----LSVK-DSYQFTVALTGATPSKTGFLKAGDQLKFTSTHWLNQQSKQTLYNGSTAMSFTATVLEETNSTAS 306 (423) T ss_pred cccccc-----cccc-ccccceeeeeeeeeccCCcEEecceEEeeeeeeccccccceeecccCCceeEEEEecccccccc Confidence 211110 0000 00001 1122222 233444444333322 222211111111 Q ss_pred CCccccc--------------cccccccccccC Q lcl|Aclame:pro 301 DGVDAHA--------------DNVAKPSGSLEM 319 (319) Q Consensus 301 ~~~~~~~--------------~~~~~~~~~~~~ 319 (319) .+.+.++ +-.+.|+.+.-+ T Consensus 307 g~~~v~i~p~~~~~~~~~~~~~v~a~~a~~~~v 339 (423) T protein:vir:35 307 GDVTVKLSGVPIYDEKNSQYNAVDAKVKAGDAV 339 (423) T ss_pred CceeEEccccccccCCCcccccccccccCCcee Confidence 1111111 111122222222 No 46 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=99.95 E-value=4.3e-29 Score=176.08 Aligned_cols=266 Identities=15% Similarity=0.066 Sum_probs=198.4 Q ss_pred hhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhccc--CcceeeeCCceEEeeeccc-cccccccCCCCcccCCccccee Q lcl|Aclame:pro 19 
FANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALI--SNDAIFMEGRSFTVMKGDT-TELKDYKRNATNEFDHPKIEET 95 (319) Q Consensus 19 ~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~--n~~~~~~~g~tVkIp~i~~-~g~~DY~r~~~~~~~~~t~t~~ 95 (319) -|+--+....+-.+|.|++++.+.+...+ ....+. +..+...+|++|+||++.. ....+|..+.....+.++.+.. T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~-~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~~~~ 79 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAI-RFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGFKKT 79 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHh-hhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccccccccceE Confidence 23222445556667777777766655443 333333 4445666899999999975 4677888777888899999999 Q ss_pred EEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccCCHhHHHHHHHHHHHHH Q lcl|Aclame:pro 96 TYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDVSVEL 175 (319) Q Consensus 96 tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~~~~T~~n~~~~i~~a~~~L 175 (319) ++++.+ ++..|.+++.+..++.. ++.....+++...++.++|+.+++.+.. +.. .++....|+.|.++..+| T Consensus 80 ~~~~~~-~~~~~~itd~~~~~s~~--d~~~~~~~~~~~~~a~~~d~~i~~~~~~-a~~----~~~~~~t~d~i~da~~~l 151 (272) T protein:vir:98 80 TMTIKK-AGKGVEITDEAILSGYG--DPVGQAAKQIVEAIDHKVDADVLDALSK-STQ----TVEATATVDGVSKALDIF 151 (272) T ss_pred EEEeee-eeeeeeecHHHHhhccc--cHHHHHHHHHHHHHHHHHHHHHHHHhcc-ccc----ccccccCHHHHHHHHHHH Confidence 999987 67889999999888764 5678889999999999999999886543 222 223344588899999999 Q ss_pred HhccCCCCcEEEEChHHHHHHhhh--hhhhhcccccccceeeeeeeeecCeEEEEecccccccceEEEEcCCceeeeeee Q lcl|Aclame:pro 176 DEIKAPENRVLFVSPTFYKGIKKF--VIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQA 253 (319) Q Consensus 176 de~~VP~~R~l~VsP~~~~~L~~~--~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~~~k~ 253 (319) ++.+. 
..++++|+|.+++.|+++ .+|.+....+....++|.||++.|++|+.+ +.++....++++++|+....+- T Consensus 152 ~~~~~-~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s--~~~p~~t~~~~~~~a~~~~~~~ 228 (272) T protein:vir:98 152 NDEDD-AETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRS--RKCPKGTAYMVRKGALRIMLKR 228 (272) T ss_pred hccCC-CccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEc--CCCCcceEEEEcCCeEEEEecC Confidence 88753 478999999999999877 566776667777889999999999999974 4456666778889998888665 Q ss_pred e-eeeeecCCCCCccceeeeeeeeeEEEeccccceEEEEccccccCCC Q lcl|Aclame:pro 254 D-LAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKR 300 (319) Q Consensus 254 ~-~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~ 300 (319) . .+|..| .+.++.+.+++|++||++|++|++.-. +.. +++.++ T Consensus 229 ~~~ve~~r-~~~~~~~~i~~~~~~~~~v~~~~~vv~-~t~--~~a~~~ 272 (272) T protein:vir:98 229 NTMVETDR-DITKAINQIVANKHYGVYLYKAEKAVK-ITL--KDAAKK 272 (272) T ss_pred Cceeeecc-ccccceeEEEEEEEEEEEEEcCCceEE-EEe--cccccC Confidence 5 677766 577889999999999999999987322 222 222222 No 47 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=99.95 E-value=4.3e-29 Score=176.08 Aligned_cols=266 Identities=15% Similarity=0.066 Sum_probs=198.4 Q ss_pred hhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhccc--CcceeeeCCceEEeeeccc-cccccccCCCCcccCCccccee Q lcl|Aclame:pro 19 FANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALI--SNDAIFMEGRSFTVMKGDT-TELKDYKRNATNEFDHPKIEET 95 (319) Q Consensus 19 ~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~--n~~~~~~~g~tVkIp~i~~-~g~~DY~r~~~~~~~~~t~t~~ 95 (319) -|+--+....+-.+|.|++++.+.+...+ ....+. +..+...+|++|+||++.. ....+|..+.....+.++.+.. 
T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~-~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~~~~ 79 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAI-RFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGFKKT 79 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHh-hhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccccccccceE Confidence 23222445556667777777766655443 333333 4445666899999999975 4677888777888899999999 Q ss_pred EEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccCCHhHHHHHHHHHHHHH Q lcl|Aclame:pro 96 TYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDVSVEL 175 (319) Q Consensus 96 tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~~~~T~~n~~~~i~~a~~~L 175 (319) ++++.+ ++..|.+++.+..++.. ++.....+++...++.++|+.+++.+.. +.. .++....|+.|.++..+| T Consensus 80 ~~~~~~-~~~~~~itd~~~~~s~~--d~~~~~~~~~~~~~a~~~d~~i~~~~~~-a~~----~~~~~~t~d~i~da~~~l 151 (272) T protein:vir:30 80 TMTIKK-AGKGVEITDEAILSGYG--DPVGQAAKQIVEAIDHKVDADVLDALSK-STQ----TVEATATVDGVSKALDIF 151 (272) T ss_pred EEEeee-eeeeeeecHHHHhhccc--cHHHHHHHHHHHHHHHHHHHHHHHHhcc-ccc----ccccccCHHHHHHHHHHH Confidence 999987 67889999999888764 5678889999999999999999886543 222 223344588899999999 Q ss_pred HhccCCCCcEEEEChHHHHHHhhh--hhhhhcccccccceeeeeeeeecCeEEEEecccccccceEEEEcCCceeeeeee Q lcl|Aclame:pro 176 DEIKAPENRVLFVSPTFYKGIKKF--VIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQA 253 (319) Q Consensus 176 de~~VP~~R~l~VsP~~~~~L~~~--~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~~~k~ 253 (319) ++.+. 
..++++|+|.+++.|+++ .+|.+....+....++|.||++.|++|+.+ +.++....++++++|+....+- T Consensus 152 ~~~~~-~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s--~~~p~~t~~~~~~~a~~~~~~~ 228 (272) T protein:vir:30 152 NDEDD-AETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRS--RKCPKGTAYMVRKGALRIMLKR 228 (272) T ss_pred hccCC-CccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEc--CCCCcceEEEEcCCeEEEEecC Confidence 88753 478999999999999877 566776667777889999999999999974 4456666778889998888665 Q ss_pred e-eeeeecCCCCCccceeeeeeeeeEEEeccccceEEEEccccccCCC Q lcl|Aclame:pro 254 D-LAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKR 300 (319) Q Consensus 254 ~-~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~ 300 (319) . .+|..| .+.++.+.+++|++||++|++|++.-. +.. +++.++ T Consensus 229 ~~~ve~~r-~~~~~~~~i~~~~~~~~~v~~~~~vv~-~t~--~~a~~~ 272 (272) T protein:vir:30 229 NTMVETDR-DITKAINQIVANKHYGVYLYKAEKAVK-ITL--KDAAKK 272 (272) T ss_pred Cceeeecc-ccccceeEEEEEEEEEEEEEcCCceEE-EEe--cccccC Confidence 5 677766 577889999999999999999987322 222 222222 No 48 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=99.94 E-value=7.4e-29 Score=174.81 Aligned_cols=289 Identities=8% Similarity=0.015 Sum_probs=165.9 Q ss_pred chhhhh--hhHhhHHHHHHHHHhhhhhhhcccCccee--e---eCCceEEeeeccccccccccCCC--CcccCCccccee Q lcl|Aclame:pro 25 EPGQTL--LKNKHVGILERVTAVNAYSTPALISNDAI--F---MEGRSFTVMKGDTTELKDYKRNA--TNEFDHPKIEET 95 (319) Q Consensus 25 ~~n~~~--l~~ky~~lld~~~~~~sl~~~~~~n~~~~--~---~~g~tVkIp~i~~~g~~DY~r~~--~~~~~~~t~t~~ 95 (319) =+|++. ..+.+...+.+.+ .++++...++|++|. + ..||||+||......++||+... +.++++++.... 
T Consensus 1 MaN~llT~~p~iia~~aL~~l-~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~~~~~~~~~dl~e~~v 79 (423) T protein:vir:10 1 MPNNLDSNVSQIVLKKFLPGF-MSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLISGKA 79 (423) T ss_pred CccchhhhhHHHHHHHHHHHH-HhhcccchhhcccCCCcccccccCCEEEEeeCCceeeeccCCccccccccCcccccee Confidence 234432 2344543333333 345667778999993 3 25999999999999999998532 457789999999 Q ss_pred EEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccCCHhHHHHHHHHHHHHH Q lcl|Aclame:pro 96 TYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDVSVEL 175 (319) Q Consensus 96 tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~~~~T~~n~~~~i~~a~~~L 175 (319) +++|||.||+.|.+|+.|..+... .+. ...+.+.++|+.++|+++++.++..+........++.+.|+.|.++..+| T Consensus 80 ~l~id~~k~va~~v~d~E~~~~i~--~~~-~~l~~A~~aLA~~vd~~ia~~~~~~~~~~~gt~~t~~~a~~~i~~a~~~L 156 (423) T protein:vir:10 80 TGRVGNYITVAVEYQQLEEAIKLN--QLE-EILAPVRQRIVTDLETELAHFMMNNGALSLGSPNTPITKWSDVAQTASFL 156 (423) T ss_pred EEEeeceeeeeeeechHHHhcChh--hHH-HHHHHHHHHHHHHHHHHHHHHHhhccccccccCCcccchHHHHHHHHHHH Confidence 999999999999999888765432 343 34567788999999999998776654443333445567899999999999 Q ss_pred HhccCC-CCcEEEEChHHHHHHhhhhhhh-hcccccccceeeeee-eeecCeEEEEecccccccceEEEEcCCcee-eee Q lcl|Aclame:pro 176 DEIKAP-ENRVLFVSPTFYKGIKKFVIAL-PQGDTRQQVLGKGVQ-GELDGFVIVKVPTKLLQGLQAIAVVGEVLA-SPI 251 (319) Q Consensus 176 de~~VP-~~R~l~VsP~~~~~L~~~~~f~-~~~~~~~~~~~~g~V-g~idG~~I~~vps~~~~~~n~i~~~~~A~~-~~~ 251 (319) ++.+|| ++||++|+|+++..|++++++. .....+.+.+++|.| |++.||+|+++++- ......- .|..+.. +.. 
T Consensus 157 d~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snni-p~~T~gt-~~~t~~~~~~~ 234 (423) T protein:vir:10 157 KDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNGL-ASRTQGA-FGGTLTVKTQP 234 (423) T ss_pred HhccCCcCCCEEEeChHHHHHHhccccceecccccchhhhhhccceeeecceEEEEeCCC-ccccccc-cccceeeeecc Confidence 999999 7999999999998888766644 444455677899987 99999999985332 2111110 1111000 000 Q ss_pred ee------eeeee---ec-CCCCCccceeeee--eeeeEEEeccccc-----------eEEEEccccccCCCCCcccccc Q lcl|Aclame:pro 252 QA------DLAKT---NS-NIPGMFGTLAEQL--LYTGAFVPEHLQK-----------YIFTIGGTEVATKRDGVDAHAD 308 (319) Q Consensus 252 k~------~~~~~---~~-~~~~~~~~~v~gr--~~yg~~V~~~k~~-----------~Iy~~~~~~~a~~~~~~~~~~~ 308 (319) .+ ..... .. .--..-+.+..|. .+-|.+.+.+-.+ --|+......+....+.+..+. T Consensus 235 ~v~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD~~t~aGv~~v~~~tk~~~~~~~t~~~~~~~v~a~~~~~~~g~~tv~i~ 314 (423) T protein:vir:10 235 TVTYNAVKDSYQFTVTLTGATASVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDSGGDVTVTLS 314 (423) T ss_pred eeccccccccceeeeeeeeccccccCceeecceEEecceeeecccccccccccccCcceEEEEEeeeeeccCCceeeecc Confidence 00 00000 00 0000001111111 1122222211111 1123222221111111111111 Q ss_pred -------------cccc-ccccccC Q lcl|Aclame:pro 309 -------------NVAK-PSGSLEM 319 (319) Q Consensus 309 -------------~~~~-~~~~~~~ 319 (319) +.+. 
|+.+.-+ T Consensus 315 p~~i~~~~~~~~~~v~a~~a~~~~v 339 (423) T protein:vir:10 315 GVPIYDTTNPQYNSVSRQVEAGDAV 339 (423) T ss_pred CccccccCCcccccccccccCCcee Confidence 1111 2222222 No 49 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=99.94 E-value=6.8e-29 Score=175.00 Aligned_cols=291 Identities=9% Similarity=0.005 Sum_probs=169.5 Q ss_pred hhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceee-----eCCceEEeeeccccccccccCCC--CcccCCcc Q lcl|Aclame:pro 19 FANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIF-----MEGRSFTVMKGDTTELKDYKRNA--TNEFDHPK 91 (319) Q Consensus 19 ~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~-----~~g~tVkIp~i~~~g~~DY~r~~--~~~~~~~t 91 (319) -||+... ...+.+...+.+.+ .++++...++|++|.. ..||||+||......++||.... +..+++++ T Consensus 1 MaN~llT----~ip~iia~~al~~l-~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~~~~~~~~~~~~l~ 75 (423) T protein:vir:17 1 MPNNLDS----NVSQIVLKKFLPGF-MSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLI 75 (423) T ss_pred Cccchhh----hhHHHHHHHHHHHH-HhhcccchhhcccCCcchhhcccCCEEEEeeCCcceeecccCcccCCcccCccc Confidence 2333321 12344543333333 3455677789999943 25999999999999999997533 45678888 Q ss_pred cceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccCCHhHHHHHHHHH Q lcl|Aclame:pro 92 IEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDV 171 (319) Q Consensus 92 ~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~~~~T~~n~~~~i~~a 171 (319) ....+++|||.||+.|.+++.|..+... .+. 
...+.+.++|+.++|+++++.++..+........++.+.|+.|.++ T Consensus 76 e~~v~l~id~~k~va~~v~d~E~~~~i~--~~~-~~l~~A~~aLA~~vd~~ia~~~~~~a~~~~gt~~t~~~a~~~i~~a 152 (423) T protein:vir:17 76 SGKATGRVGNYITVAVEYQQLEEAIKLN--QLE-EILAPVRQRIVTDLETELAHFMMNNGALSLGSPNTPITKWSDVAQT 152 (423) T ss_pred cceeEEEeeceeeeeeeecHHHHhcChh--HHH-HHHHHHHHHHHHHHHHHHHHHHhhccccccccCCcccccHHHHHHH Confidence 8899999999999999999888765432 343 3556778999999999999987766544444444566789999999 Q ss_pred HHHHHhccCC-CCcEEEEChHHHHHHhhhhhh-hhcccccccceeeeee-eeecCeEEEEecccccccceE----EEEcC Q lcl|Aclame:pro 172 SVELDEIKAP-ENRVLFVSPTFYKGIKKFVIA-LPQGDTRQQVLGKGVQ-GELDGFVIVKVPTKLLQGLQA----IAVVG 244 (319) Q Consensus 172 ~~~Lde~~VP-~~R~l~VsP~~~~~L~~~~~f-~~~~~~~~~~~~~g~V-g~idG~~I~~vps~~~~~~n~----i~~~~ 244 (319) ..+|++.+|| ++||++|+|+++..|++++++ ......+...+++|.| |++.||+|+++++ ....... -+... T Consensus 153 ~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdvy~Snn-ip~~T~gt~~~t~~~~ 231 (423) T protein:vir:17 153 ASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNG-LASRTQGAFGGTLTVK 231 (423) T ss_pred HHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchHHHhhccceeeecceEEEEeCC-Cccccccceeceeeec Confidence 9999999999 799999999999888877664 4434445667899987 9999999997533 2211111 11111 Q ss_pred Ccee-------eeeeeeeeeeecCCCCCccceeeee--eeeeEEEeccccce-----------EEEEccccccCCCCCcc Q lcl|Aclame:pro 245 EVLA-------SPIQADLAKTNSNIPGMFGTLAEQL--LYTGAFVPEHLQKY-----------IFTIGGTEVATKRDGVD 304 (319) Q Consensus 245 ~A~~-------~~~k~~~~~~~~~~~~~~~~~v~gr--~~yg~~V~~~k~~~-----------Iy~~~~~~~a~~~~~~~ 304 (319) .+.. ...+.......+-... -+.+..|. .+-|.+.+.+-.+. 
-|+......+....+.+ T Consensus 232 ~~~~v~~~a~~~~~~~~~~~~~~~~~~-~g~l~~GD~~t~aGv~~v~~~tk~v~~~~~t~~~~~~~v~~~~~~~a~~~~t 310 (423) T protein:vir:17 232 TQPTVTYNAVKDSYQFTVTLTGATTSV-TGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDSSGDVT 310 (423) T ss_pred ccccccccccccccceeeeeeeeeeec-cCceeecceEEecceeeecccccccccccccccceEEEEEecccccccCceE Confidence 1100 0011000000000000 01111121 33454544443332 22322211111111111 Q ss_pred cccc-------------cccc-ccccccC Q lcl|Aclame:pro 305 AHAD-------------NVAK-PSGSLEM 319 (319) Q Consensus 305 ~~~~-------------~~~~-~~~~~~~ 319 (319) ..+. +.+. |+.+.-+ T Consensus 311 v~i~p~~i~~~~~~~~~~v~a~~a~~~~v 339 (423) T protein:vir:17 311 VTLSGVPIYDTTNPQYNSVSRQVAAGDAV 339 (423) T ss_pred EEecCccccccCCcccccceecccCCcee Confidence 1111 1111 2222222 No 50 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=99.93 E-value=2.8e-27 Score=166.19 Aligned_cols=291 Identities=8% Similarity=-0.052 Sum_probs=174.2 Q ss_pred hhhhhcchhhh--hhhHhhHHHHHHHHHhhhhhhhcccCcceeee-----CCceEEeeeccccccccccCC--CCcccCC Q lcl|Aclame:pro 19 FANKSVEPGQT--LLKNKHVGILERVTAVNAYSTPALISNDAIFM-----EGRSFTVMKGDTTELKDYKRN--ATNEFDH 89 (319) Q Consensus 19 ~~~~~~~~n~~--~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~-----~g~tVkIp~i~~~g~~DY~r~--~~~~~~~ 89 (319) -| |++ .-.+.+...+.+.+. ++++...++||+|... .||||+||........|+... 
.+...++ T Consensus 1 MA------Nsl~~l~p~iia~~al~~l~-~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~~~d~~~~~~t~~~~~~ 73 (423) T protein:vir:10 1 MA------NNLDANVSQIVLKKFLPGFM-SDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFKSERTMDGDITGKSKNS 73 (423) T ss_pred Cc------cccccccHHHHHHHHHHHHH-hhcccchhhccCCCccccccccCCEEEEeeCCceeeecccCcccCcccccc Confidence 12 444 334555444444443 4556778899999432 489999999988888774332 2233455 Q ss_pred cccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccCCHhHHHHHHH Q lcl|Aclame:pro 90 PKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTGSDAQYDAVL 169 (319) Q Consensus 90 ~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~~~~T~~n~~~~i~ 169 (319) +.....+++|||.||+.|.+|+.|..+.- -.+. ...+.+.++|+.++|++++..++..+.......-+..+.|+.+. T Consensus 74 l~e~~v~l~id~~k~~a~~v~d~E~~l~i--~~~~-~~l~~A~~aLA~~vd~~ia~~~~~~~~~~vgt~~t~~~a~~~~a 150 (423) T protein:vir:10 74 LISAKATGEVGNYITVAVEYRQIEEALKL--NQLD-QILVPINERMVTDLETELALFMMKHGALSLGSPNTPIKKWSDVA 150 (423) T ss_pred cccceEEEEecceeeeeeeeChHHHhcCh--hHHH-HHHHHHHHHHHHHHHHHHHHHhhhcccccccccccccccHHHHH Confidence 66677899999999999999988876433 3343 45577788999999999987777665544333334557899999 Q ss_pred HHHHHHHhccCC-CCcEEEEChHHHHHHhhhhhhhhc-ccccccceeeeee-eeecCeEEEEeccccc--ccceEEEEcC Q lcl|Aclame:pro 170 DVSVELDEIKAP-ENRVLFVSPTFYKGIKKFVIALPQ-GDTRQQVLGKGVQ-GELDGFVIVKVPTKLL--QGLQAIAVVG 244 (319) Q Consensus 170 ~a~~~Lde~~VP-~~R~l~VsP~~~~~L~~~~~f~~~-~~~~~~~~~~g~V-g~idG~~I~~vps~~~--~~~n~i~~~~ 244 (319) ++..+|++.++| ++||++|+|+++..|++++.+... ...+.+.+++|.| |++.||+|+++++-.. +.-....+|. 
T Consensus 151 ~a~~~L~~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~~~i~G~~~GFdi~~Sn~vp~~T~g~~~ga~~~ 230 (423) T protein:vir:10 151 QTASFLKDLGINSGENYAVMDPWAAQRLADAQSGLHVSEQLVRTAWENAQISGNFGGIRALMSNGLASRTQGAFGGKLTV 230 (423) T ss_pred HHHHHHhhccCCcCCCEEEeCHHHHHHHhhhhhhhccccccchHHHHhcccceeecceEEEEecCCcccccccccceeee Confidence 999999999999 799999999999998876665443 3445667888876 9999999998543221 2222333444 Q ss_pred Cceeeeeeeee--eeeec-----CCCCCccceeeee--eeeeEEEeccccce-----------EEEEccccccCCCCCcc Q lcl|Aclame:pro 245 EVLASPIQADL--AKTNS-----NIPGMFGTLAEQL--LYTGAFVPEHLQKY-----------IFTIGGTEVATKRDGVD 304 (319) Q Consensus 245 ~A~~~~~k~~~--~~~~~-----~~~~~~~~~v~gr--~~yg~~V~~~k~~~-----------Iy~~~~~~~a~~~~~~~ 304 (319) .+..++..-.. .+..+ -....-|++.+|. .+-|.+.+++-.+- -|+......+....+.+ T Consensus 231 ~~~~~vt~a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~aGv~~v~~~tk~~l~~~~~~~~~~~~V~~~~~~~a~~~~t 310 (423) T protein:vir:10 231 KGTPEVNYDSVKDSYAFTATLTGATASKKGFLKVGDQLQFDDTHWLNQQSKQTLYNGASALSFTATVMEDANAHSSGDVT 310 (423) T ss_pred eeeeEEEecccccccccccceeeccceeceeEEecceEeecceeeecccccceeecccCCcceEEEEEecccccccCceE Confidence 44433322111 00110 0111224444433 34455555554432 23433222222222222 Q ss_pred ccccc--------------cccccccccC Q lcl|Aclame:pro 305 AHADN--------------VAKPSGSLEM 319 (319) Q Consensus 305 ~~~~~--------------~~~~~~~~~~ 319 (319) ..+.- .+.|+.+.-+ T Consensus 311 v~i~p~~~~~~~~~~~~~V~a~~a~~~~v 339 (423) T protein:vir:10 311 VKISGVPIFDAGYPQYNAVDRLLAEGDTV 339 (423) T ss_pred EEeccccccccCcccccceeccccCCcee Confidence 22111 1112222222 No 51 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=99.92 E-value=5.3e-27 Score=164.64 Aligned_cols=277 Identities=14% Similarity=0.095 Sum_probs=180.3 Q ss_pred CCcccccccceeeehhh---hhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeecccc-cc Q lcl|Aclame:pro 1 
MNKTIKNATGMLKLNLQ---HFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTT-EL 76 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~---~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~-g~ 76 (319) +|.+| +|..+|..| .|..+|..-....|+++-+.|. ..+-..-...++++++.+.+... .+ T Consensus 3 ~~~~~---~~~~~Ms~~i~~~fv~qy~~~v~~~~qq~~s~L~------------~tV~~~~~~~~~~~~~~~~~~~~~~~ 67 (322) T protein:vir:10 3 LNAIM---SMLPLIAGDIDQAFVQTYETTLRILSQQKSAKLK------------QYCQHKNESSESHNWETLASMDPDAV 67 (322) T ss_pred cccee---eeeeeeechhhhHHHHHHHHHHHHHHHHhhhhhh------------cccccccccccccceeeccccccccc Confidence 34433 455666664 3444444444444444322111 00111113345556666655332 12 Q ss_pred -----ccccCCCCc-ccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 77 -----KDYKRNATN-EFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARN 150 (319) Q Consensus 77 -----~DY~r~~~~-~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~ 150 (319) ..+.+++.+ ++.....-.....+.+|+|++|.||++|..+.. +++.....+.+.+++....|..+++.+... T Consensus 68 ~~~~~~~~~~d~~~dtp~~~~~~~~r~~~~~d~~~~~~VDd~D~~k~~--~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~ 145 (322) T protein:vir:10 68 KRKRSRQQSADGTYPTPVNNKPFAKRRTNVDTYDTGHVVEQEDISQML--LDPNSALITSQAYAMARKTDDLIIAGAWKP 145 (322) T ss_pred ccccccccccCcccCCCccccccceEEEeecccccceecchHHHHHhh--cCchHHHHHHHHHHhhhHHHHHHHhhhhcc Confidence 122222221 111122345667888999999999999998876 456788899999999999999988766544 Q ss_pred cCcccc---cc------C---CHhHHHHHHHHHHHHHHhccCC-C-CcEEEEChHHHHHHhhhhhhhhcccccccce-ee Q lcl|Aclame:pro 151 KAKHLT---VG------T---GSDAQYDAVLDVSVELDEIKAP-E-NRVLFVSPTFYKGIKKFVIALPQGDTRQQVL-GK 215 (319) Q Consensus 151 a~~~~~---~~------~---T~~n~~~~i~~a~~~Lde~~VP-~-~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~-~~ 215 (319) +..... .. 
+ +....++.|+++.+.|+|++|| + +||++|+|+++..|+++++|+...-.+.+.+ .+ T Consensus 146 a~~~~~gt~v~~~ss~~i~~g~~g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~~l~~~ 225 (322) T protein:vir:10 146 ASIKGTGQPVEFLATQEIGDGTKPISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQITEATSADYTSAMDLQSK 225 (322) T ss_pred ccccccccccccCCCcccccCccchhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhcccchhhhhc Confidence 432111 00 1 2244689999999999999999 4 5999999999999999999987665565555 67 Q ss_pred eeeeeecCeEEEEecccccc-----------------cceEEEEcCCceeeeeee-eeeeeecCCCCCccceeeeeeeee Q lcl|Aclame:pro 216 GVQGELDGFVIVKVPTKLLQ-----------------GLQAIAVVGEVLASPIQA-DLAKTNSNIPGMFGTLAEQLLYTG 277 (319) Q Consensus 216 g~Vg~idG~~I~~vps~~~~-----------------~~n~i~~~~~A~~~~~k~-~~~~~~~~~~~~~~~~v~gr~~yg 277 (319) |.||++.||.++... +... ....++.|++|+.++... .++++-..+...++|.|++.+.|| T Consensus 226 G~ig~~lGf~~i~s~-~lp~~~~t~~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~dv~~~i~~~~~~~~a~~I~~~~~~G 304 (322) T protein:vir:10 226 GIITNWMGYTWIVST-RLDKFDPTQWGMAAEDGPQGDEIWCIAMTDMALGYHSCKDIWTKVAEDPSASFAWRIYSAFTAD 304 (322) T ss_pred CeeeeeeeEEEEEec-cCCccccccccccccCCCCccceeEEEEecCceeEEEeeeeeEEeeccCCcchhhhhhhhhhhC Confidence 999999999999642 1110 134789999999999865 366665556777899999999999 Q ss_pred EEEeccccceEEEEcccc Q lcl|Aclame:pro 278 AFVPEHLQKYIFTIGGTE 295 (319) Q Consensus 278 ~~V~~~k~~~Iy~~~~~~ 295 (319) +.+++|++.-.+--..+- T Consensus 305 a~ri~~~gVv~i~~~e~~ 322 (322) T protein:vir:10 305 CVRVEDEHIFKLRLKNSL 322 (322) T ss_pred ceEeccCcEEEEEEeccC Confidence 999999763221112222 No 52 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=99.92 E-value=1.9e-27 Score=167.11 Aligned_cols=299 Identities=10% Similarity=0.002 Sum_probs=209.1 Q ss_pred eeeehhhhhhhhhcchhhh-------hhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccccCCC Q lcl|Aclame:pro 11 
MLKLNLQHFANKSVEPGQT-------LLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRNA 83 (319) Q Consensus 11 ~~~~~~~~~~~~~~~~n~~-------~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~r~~ 83 (319) |= ++++...|++- -+-|.|.+.++..|..++..-... +.....+|+++++|.++...++.|+++. T Consensus 1 Ms------~~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~--~vRti~~gkS~qf~~~G~s~~~~~~pG~ 72 (401) T protein:vir:70 1 MS------TPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYF--DVQTVTGTNTVSNKYLGETELQVLAPGQ 72 (401) T ss_pred CC------CCccccccccccccchhHhHHhHhcchHHHHHHHHhhhcccc--eeeeecccceEEEEEeeeeEeeeecCCC Confidence 21 22333333322 256788888888887777654332 2225679999999999999999999998 Q ss_pred CcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCc---------- Q lcl|Aclame:pro 84 TNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAK---------- 153 (319) Q Consensus 84 ~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~---------- 153 (319) ......+..+..+++||.-.+..+.|+++|..|+..+. .-..+++++.++++...|++.+..+...+.. T Consensus 73 ~ld~~~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~-vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~ 151 (401) T protein:vir:70 73 SPAATSTQADKNQLVIDATVIARNTVAHLHDVQGDIDS-LKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPR 151 (401) T ss_pred CcCCCCcccccEEEEeCceeehhhhhhhHHHHHhcccc-cchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCC Confidence 88888888888899999999999999999999987541 3468889999999999999887665322100 Q ss_pred -------------cccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEE-ChHHHHHHhhhhhhh-hcccc-cccceeeee Q lcl|Aclame:pro 154 -------------HLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFV-SPTFYKGIKKFVIAL-PQGDT-RQQVLGKGV 217 (319) Q Consensus 154 -------------~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~V-sP~~~~~L~~~~~f~-~~~~~-~~~~~~~g~ 217 (319) .....+++..+.++|.++...|+|.+||.+|++++ +|.+|.+|+..+++. +.... 
+....++|+ T Consensus 152 ~~~~G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~r~vvl~pp~~Ys~Ll~~d~L~nrd~~~s~~g~~~~G~ 231 (401) T protein:vir:70 152 VKGHGFSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKTYTISQSGATIQGF 231 (401) T ss_pred cCCCceEEeccccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhcCcccchhhccccCCccccce Confidence 01122356679999999999999999996666555 788888888776654 33222 234578999 Q ss_pred eeeecCeEEEEeccccc----------------cc---------ceEEEEcCCceeeeeeeee-eeeecCCCCCccceee Q lcl|Aclame:pro 218 QGELDGFVIVKVPTKLL----------------QG---------LQAIAVVGEVLASPIQADL-AKTNSNIPGMFGTLAE 271 (319) Q Consensus 218 Vg~idG~~I~~vps~~~----------------~~---------~n~i~~~~~A~~~~~k~~~-~~~~~~~~~~~~~~v~ 271 (319) |.+++||+|+++|+-.. .. .-.++.||+|+..+.=++- .++++ .+.+++|.+. T Consensus 232 v~~vaGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~~~~~-d~r~~~~~id 310 (401) T protein:vir:70 232 TLSSYNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVTGDIFY-EKKEKTYYID 310 (401) T ss_pred EEEEeceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeeccccchhh-hhhhhHHHHH Confidence 99999999998743210 01 1246778888877433222 23444 4778999999 Q ss_pred eeeeeeEEEeccccceEEEEccccccCCCCCccccccccccccccccC Q lcl|Aclame:pro 272 QLLYTGAFVPEHLQKYIFTIGGTEVATKRDGVDAHADNVAKPSGSLEM 319 (319) Q Consensus 272 gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 319 (319) ...-||+.++||...+|.....+.+...+-|++..--+.++--|.... 
T Consensus 311 ~~~a~g~g~~RPeaa~vv~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~ 358 (401) T protein:vir:70 311 TFMAEGAIPDRWEAVSVVTTKRNTTTGAVEGTDGAQHTIVKNRAQRKA 358 (401) T ss_pred HHHHhCCcccchhheEEEeecCcccccccccCCcchhhhhhhhcccee Confidence 999999999999998887666655544554555443344442222222 No 53 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=99.89 E-value=3.1e-25 Score=154.91 Aligned_cols=299 Identities=10% Similarity=0.028 Sum_probs=204.0 Q ss_pred eeeehhhhhhhhhcchhhh-------hhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccccCCC Q lcl|Aclame:pro 11 MLKLNLQHFANKSVEPGQT-------LLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRNA 83 (319) Q Consensus 11 ~~~~~~~~~~~~~~~~n~~-------~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~r~~ 83 (319) |= ++++...|++- -+-|.|.+.++..|..++..-... +.....+|+++++|.++...++.+++.. T Consensus 1 Ms------~~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~--~vRtI~~gkS~qf~~lG~s~a~y~~pG~ 72 (400) T protein:vir:10 1 MS------TPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYF--DVQTVTGTNTVSNKYLGETELQVLAPGQ 72 (400) T ss_pred CC------CCccccccccccccchhhhHHhHhcchHHHHHHHHhhhcccc--eeeeecccceEEEEEeeeeEEeeecCCC Confidence 21 22333333322 256889999988887777654332 2225679999999999999999999988 Q ss_pred CcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc------------ Q lcl|Aclame:pro 84 TNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNK------------ 151 (319) Q Consensus 84 ~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a------------ 151 (319) ......+..+...++||.-.+....||++|..+...+ ..-..+++++.++++...|++.+..+...+ T Consensus 73 ~ldg~~~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD-~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~ 151 (400) T protein:vir:10 73 SPAATSTQADKNQLVIDATVIARNTVAHLHDVQGDID-SLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPR 151 (400) T ss_pred 
CcCCCCcccCcEEEEeCceeeecchhhhHHHHhhccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCC Confidence 8777778888889999999999999999998887754 035788999999999999998876543221 Q ss_pred Cccc-----------cccCCHhHHHHHHHHHHHHHHhccCCCCc-EEEEChHHHHHHhhhhhhhh-ccc-ccccceeeee Q lcl|Aclame:pro 152 AKHL-----------TVGTGSDAQYDAVLDVSVELDEIKAPENR-VLFVSPTFYKGIKKFVIALP-QGD-TRQQVLGKGV 217 (319) Q Consensus 152 ~~~~-----------~~~~T~~n~~~~i~~a~~~Lde~~VP~~R-~l~VsP~~~~~L~~~~~f~~-~~~-~~~~~~~~g~ 217 (319) +... ...+++..+..+|.++...|+|.+||.+| +++++|.+|.+|+..+++.. ... .+......|+ T Consensus 152 g~~~g~s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~Ll~~dkLvnrdf~~s~~g~~~~g~ 231 (400) T protein:vir:10 152 VKGHGFSVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKSYTISQSGATIQGF 231 (400) T ss_pred ccccccceeecccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhCCcccchhccccCCCccccce Confidence 0000 01124456788899999999999999545 55668889988888776553 322 2234578999 Q ss_pred eeeecCeEEEEecccc----------------ccc---------ceEEEEcCCceeeeeeee-eeeeecCCCCCccceee Q lcl|Aclame:pro 218 QGELDGFVIVKVPTKL----------------LQG---------LQAIAVVGEVLASPIQAD-LAKTNSNIPGMFGTLAE 271 (319) Q Consensus 218 Vg~idG~~I~~vps~~----------------~~~---------~n~i~~~~~A~~~~~k~~-~~~~~~~~~~~~~~~v~ 271 (319) |.+++||+|+++|+-- +.. .-.++.|++|+..+.=++ ..++++ .+.+++|.+. 
T Consensus 232 v~~v~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~-d~r~~~~~id 310 (400) T protein:vir:10 232 VLSSYNCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIFY-EKKEKTYYID 310 (400) T ss_pred EEEEeceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEeecccccccc-chhhHHHHHH Confidence 9999999999874321 001 124677888887743222 234444 4778999999 Q ss_pred eeeeeeEEEeccccceEEEEccccccCCCCC-cccccc---------cccc---ccccccC Q lcl|Aclame:pro 272 QLLYTGAFVPEHLQKYIFTIGGTEVATKRDG-VDAHAD---------NVAK---PSGSLEM 319 (319) Q Consensus 272 gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~-~~~~~~---------~~~~---~~~~~~~ 319 (319) ...-||+.++||...++....++.+.+..+| +.-|.+ .-+| |.++-.- T Consensus 311 ~~~a~G~g~~RPeaa~vv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 371 (400) T protein:vir:10 311 TFMSEGAIPDRWEAVSVVTTKRQSTGAVDSGNAAQHTQVLNRAQRKAVYVKNAAPAGAFAA 371 (400) T ss_pred HHHHhCCcccchhheEEEEecCCcccccccCcchhHHHHHhhcccceEEEecccccccccc Confidence 9999999999999988877666554443322 222221 1111 2221111 No 54 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.89 E-value=1.3e-24 Score=151.45 Aligned_cols=267 Identities=17% Similarity=0.100 Sum_probs=192.3 Q ss_pred ehhhhhhhhhcchhhhhhhHhhHHHHHHHHHhhh-hhhhcccCcceeeeCCceEEeeeccccc-cccccCCCCcccCCcc Q lcl|Aclame:pro 14 LNLQHFANKSVEPGQTLLKNKHVGILERVTAVNA-YSTPALISNDAIFMEGRSFTVMKGDTTE-LKDYKRNATNEFDHPK 91 (319) Q Consensus 14 ~~~~~~~~~~~~~n~~~l~~ky~~lld~~~~~~s-l~~~~~~n~~~~~~~g~tVkIp~i~~~g-~~DY~r~~~~~~~~~t 91 (319) |=+-+ .-.+--+|.|.+.+.+.+..+. 
++....+++.+...+|++|+||.+.-.| ..++..+...+++.++ T Consensus 1 Ma~T~-------~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg~~i~~~~lt 73 (270) T protein:vir:95 1 MTQTK-------KANLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEGVAMDTTQMS 73 (270) T ss_pred CCcee-------hhhhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCCCccccccCCCccchhhcc Confidence 21111 1122234555444444433322 2333334455677799999999998654 5567777788999999 Q ss_pred cceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccCCHhHHHHHHHHH Q lcl|Aclame:pro 92 IEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDV 171 (319) Q Consensus 92 ~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~~~~T~~n~~~~i~~a 171 (319) .+..+.+|. .++..|.+.|.+...+-+ ++.....++++..++.++|+.+++.+...... .+.+ .-++.|.+| T Consensus 74 ~~~~~a~i~-~~gk~~~itD~a~~~~~~--dp~~~~~~q~a~~~a~~~d~~li~~l~~a~~~-~~~~----~t~~~~~dA 145 (270) T protein:vir:95 74 MTTTKVTVK-ETGKAVEVTQTAIITNVN--GTLQEASRQLAMSLADKVEIDYIAELNKSKQT-ATVS----ADATGILDA 145 (270) T ss_pred cchheeeee-hhhCcceecHHHHhhhcc--chHHHHHHHHHHHHHHHHHHHHHHHhcccccc-cccc----cCHHHHHHH Confidence 999999995 478899999988776643 55677788999999999999988876543222 2222 346677888 Q ss_pred HHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEEEEecccccccceEEEEcCCceeeee Q lcl|Aclame:pro 172 SVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPI 251 (319) Q Consensus 172 ~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~~~ 251 (319) ..+|.+. .....+++|+|..+..|+++. +......++..+++|.||.+.|++|+ |.++.......++.+++|+.+.. 
T Consensus 146 ~~~lgd~-~~~~~~i~vhs~~~~~Lrk~~-~~~~~~~~~~~~~~G~ig~~~G~~Vi-v~s~~~~~~~~~l~~~gAi~~~~ 222 (270) T protein:vir:95 146 IEVFNSE-NDEDYVLYVNPKDYNKLVKSL-FKVGGNVQDRAISKGDLVEIVGVSDI-VKSKRVSENTAFLQRYGAMEIVN 222 (270) T ss_pred HHHhccc-cCCCcEEEEcHHHHHHHHhhh-cccccccccchhcccccceecceeEE-EeCCCCCceeEEEEeccceeeee Confidence 8888664 244678999999999999876 44444556677899999999999988 44556667788899999999999 Q ss_pred eee-eeeeecCCCCCccceeeeeeeeeEEEeccccceEEEEccccccCCCCCccccccccccccccccC Q lcl|Aclame:pro 252 QAD-LAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDGVDAHADNVAKPSGSLEM 319 (319) Q Consensus 252 k~~-~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~ 319 (319) |-+ .+|..| .+..+.+.+.+|.+|++.++++.+... -. -+|+|++|| T Consensus 223 ~~~~~vEtdR-d~~~~~d~i~~~~~y~v~~~~~skvv~--~t------------------~~~a~~~~~ 270 (270) T protein:vir:95 223 KKKPEAYTDF-DILKRTHLLSTNYHYSVNLKDETGVVK--VT------------------FKPSGSLEM 270 (270) T ss_pred cCCceeeecc-chhhcccEEEeeeEEEEEEEccceEEE--EE------------------ecCCCCcCC Confidence 887 788777 577889999999999999999876321 11 137788888 No 55 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.87 E-value=3.9e-24 Score=148.92 Aligned_cols=227 Identities=13% Similarity=0.034 Sum_probs=177.4 Q ss_pred ceeeeCCceEEeeeccccccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 57 DAIFMEGRSFTVMKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVA 136 (319) Q Consensus 57 ~~~~~~g~tVkIp~i~~~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~va 136 (319) +==.++|+||+||++ .....++......+++.++.+.++.+|.| ++..|.|.|.+...+-+ ++....+++++..++ T Consensus 1 ~~~~~~Gdtit~P~~-iGda~~v~eG~~i~~~~l~~t~~~atIk~-~gk~~~itD~a~l~~~g--Dp~~ea~~Q~~~~iA 76 (231) T protein:vir:73 1 ENGINLANLCEYPND-IGDAADVAEGGEISLDKIGTTTKSVTIKK-AAKGTEITDEAALSGYG--DPIGESNKQLGLSLA 76 (231) T 
ss_pred CccccCCceEEeccc-ccchhhhcCCCcCChhhccccceeeeEee-eccceeeeHHHHhhccC--chHHHHHHHHHHHHH Confidence 112357999999988 33456787788899999999999999976 58899999988877643 567788999999999 Q ss_pred HHHHHHHHHHHHhccCccccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhccc-ccccceee Q lcl|Aclame:pro 137 PYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGD-TRQQVLGK 215 (319) Q Consensus 137 peiD~~~~s~la~~a~~~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~-~~~~~~~~ 215 (319) .++|..+++.+...+-+ .+..+ -++.|++|..+|.+.+ ..+++++|+|..+..|+++.+|....+ .++.++++ T Consensus 77 ~kvD~di~~~~~~a~l~-~~~~~----t~d~i~~A~~~fgde~-~~~~vivv~p~~~~~Lrk~~~~~~~~~~~g~~i~~~ 150 (231) T protein:vir:73 77 NKVDDDLLKAAKTTSQT-VSTKA----NVDGVQAALDIFNDED-AQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALIN 150 (231) T ss_pred HhhhHHHHHhhcccccc-ccccc----cHHHHHHHHHHhcccc-ccceEEEEcchHHHhhhhccchhhhhhhhccceeee Confidence 99999988755433221 22223 3888899999998854 568999999999999999998877553 46778999 Q ss_pred eeeeeecCeEEEEeccccccc--ceEEEEcCCceeeeeeee-eeeeecCCCCCccceeeeeeeeeEEEeccccceEEEEc Q lcl|Aclame:pro 216 GVQGELDGFVIVKVPTKLLQG--LQAIAVVGEVLASPIQAD-LAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIG 292 (319) Q Consensus 216 g~Vg~idG~~I~~vps~~~~~--~n~i~~~~~A~~~~~k~~-~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~ 292 (319) |.||++.|++|+.++.....+ ..-+++.++|+....|-+ .+|..| .++.+.+.+.++.||++.++++.+.....-. T Consensus 151 G~iG~i~G~~Vi~S~~~~~~~~~~~~~i~~~gAl~~~~k~~~~vEtdR-d~~~k~~~i~~~~~y~v~l~~~~~vv~~t~~ 229 (231) T protein:vir:73 151 GTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDR-DIVTKTTVITADEHYAAYLYDLTKVVNITFT 229 (231) T ss_pred cccceEcceEEEEcCCCCCCceeeeeEEeeccceeeeecccceeeccc-cccccccEEEEeEEEEEEEEcCccEEEEEee Confidence 999999999999743321111 122566789999999988 888887 5889999999999999999999986665555 Q ss_pred cc Q lcl|Aclame:pro 293 GT 294 (319) Q Consensus 293 ~~ 294 (319) +. 
T Consensus 230 g~ 231 (231) T protein:vir:73 230 GV 231 (231) T ss_pred cC Confidence 55 No 56 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=99.82 E-value=1.3e-22 Score=140.52 Aligned_cols=194 Identities=18% Similarity=0.091 Sum_probs=120.2 Q ss_pred EeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccc---------------cccCCHhH Q lcl|Aclame:pro 99 LDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHL---------------TVGTGSDA 163 (319) Q Consensus 99 idqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~---------------~~~~T~~n 163 (319) ||.-....|.|||+|+.|++ +++...+++++.++++.++|++++..++..+.... ..+.++++ T Consensus 1 iD~lL~a~~~VdDiD~aqa~--~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~~~~~a~~t~~~~~ 78 (221) T protein:vir:17 1 MDDLLVASQFVYDLDEILAQ--WNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGGFSVNIGAGNTNNAQA 78 (221) T ss_pred CCcchhHHHHHHhHHHHHhh--hHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcceeccccccCCHHH Confidence 99999999999999999987 56688999999999999999999987765432211 12345788 Q ss_pred HHHHHHHHHHHHHhccCC-CCcEEEEChHHH-HHHhhhhhhhhccccc--ccceeee-eeeeecCeEEEEeccccc-ccc Q lcl|Aclame:pro 164 QYDAVLDVSVELDEIKAP-ENRVLFVSPTFY-KGIKKFVIALPQGDTR--QQVLGKG-VQGELDGFVIVKVPTKLL-QGL 237 (319) Q Consensus 164 ~~~~i~~a~~~Lde~~VP-~~R~l~VsP~~~-~~L~~~~~f~~~~~~~--~~~~~~g-~Vg~idG~~I~~vps~~~-~~~ 237 (319) +|++|+++.++|||++|| ++||++|+|.+| .+|++++++..+.+.+ +...++| .|++++||+|+++++-.. .+. 
T Consensus 79 l~dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~v~G~~V~~SnnlP~~~gt 158 (221) T protein:vir:17 79 IVDGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYVNAGIRIYKSNVLASLYGT 158 (221) T ss_pred HHHHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeeeecCcEEEEeccCCccccc Confidence 999999999999999999 799999999755 5554444444333332 3457788 499999999998643211 112 Q ss_pred eEEEEcCCceeeeeeeeeeeeecCCCCCccceeeeeeee-----eEEEec-cccceEEEEccccccCCCC Q lcl|Aclame:pro 238 QAIAVVGEVLASPIQADLAKTNSNIPGMFGTLAEQLLYT-----GAFVPE-HLQKYIFTIGGTEVATKRD 301 (319) Q Consensus 238 n~i~~~~~A~~~~~k~~~~~~~~~~~~~~~~~v~gr~~y-----g~~V~~-~k~~~Iy~~~~~~~a~~~~ 301 (319) +. |..|-.+..+....+.+++ +....-|+.+| =++++- |...-+..+.=+--.+.+. T Consensus 159 ~~---~~~ag~~~~~~~~~~~yr~----~fs~~~glv~~~~Avgtvkl~~~~~~~~~~~~~~~~~~~~~~ 221 (221) T protein:vir:17 159 NL---VTDPGDATTSGENNGSYRP----AITDRAGLVFHKEAADTVEVLLPPSRPPLVISMFSIRRPDRR 221 (221) T ss_pred cc---ccCCccccccccccccccc----cccceEEEEEcchheeeeeeecCCCCCceeeeeeeccCCCCC Confidence 22 2333344444334444443 11233344333 223322 1222121221111111111 No 57 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=99.12 E-value=1.2e-10 Score=75.06 Aligned_cols=285 Identities=15% Similarity=0.032 Sum_probs=155.9 Q ss_pred CCcccccccceeeehhhhhhhhhcchhhh-------------hhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEE Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPGQT-------------LLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFT 67 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~n~~-------------~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVk 67 (319) |.|.-+ +++|++||+++-.++-.+ -+++.+...+-+.....+..... 
+ + .+..++.+++ T Consensus 1 ~~~~~~-----~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l-~-~-~~~~~~~~~~ 72 (324) T protein:vir:93 1 MEQTQK-----LKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQL-G-K-YEPMEGTEKK 72 (324) T ss_pred CchhHH-----HHHHHHHHHHhhhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhh-c-c-eeeccCCceE Confidence 555433 678999999987644333 24555543333333333322211 1 1 2345677899 Q ss_pred eeecccccccccc-CCCCcccCCcccceeEEEEeecccceeecchhhHHHHh-hhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 68 VMKGDTTELKDYK-RNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTE-GNIDINYVVARQGAEVVAPYLDNLRFA 145 (319) Q Consensus 68 Ip~i~~~g~~DY~-r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~-~~~~~~~~~~~~~~~~vapeiD~~~~s 145 (319) ||......-.... .++.....+ .++...++.-.|.-.+ +.--+ +-.. ....+...+.++...+++-.+|...+. T Consensus 73 ip~~~~~~~a~~v~Eg~~~~~~~--~~f~~i~~~~~k~~~~-~~iS~-ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~ 148 (324) T protein:vir:93 73 FTFWADKPGAYWVGEGQKIETSK--ATWVNATMRAFKLGVI-LPVTK-EFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL 148 (324) T ss_pred EEEEecCcceeeecCCccccccc--cceeEEEEEeEEEEEe-ehhhH-HHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 9998654333322 222233333 3444555544443332 22112 1111 124556777788888888888886552 Q ss_pred HHHhc-cCc-------cccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeee Q lcl|Aclame:pro 146 TLARN-KAK-------HLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGV 217 (319) Q Consensus 146 ~la~~-a~~-------~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~ 217 (319) --.+. .+. ........+..++.|.++...|..++.... .++++|..+..|++-.+ ..|......+. 
T Consensus 149 G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~-~~v~n~~~~~~L~~l~d-----~~G~~~~~~~~ 222 (324) T protein:vir:93 149 NQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEAN-AFISKTQNRSLLRKIVD-----PETKERIYDRN 222 (324) T ss_pred CCCCCCcCccccccccccceeccccccHHHHHHHHHhhhhccCCCC-EEEEcHHHHHHHHHhhC-----CCCCeeecCCC Confidence 11000 000 000111223458889999988988765533 58899999999875432 22344445667 Q ss_pred eeeecCeEEEEecccccccceEEEEcCCceee-eeeeeeeeeecCC-------CC--------CccceeeeeeeeeEEEe Q lcl|Aclame:pro 218 QGELDGFVIVKVPTKLLQGLQAIAVVGEVLAS-PIQADLAKTNSNI-------PG--------MFGTLAEQLLYTGAFVP 281 (319) Q Consensus 218 Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~-~~k~~~~~~~~~~-------~~--------~~~~~v~gr~~yg~~V~ 281 (319) .+++.|.+|+.+++.......+++|..+-... ..+--.+++.+.. ++ .+--+++...++|..|. T Consensus 223 ~~~l~G~PVv~~~~~~~~~~~i~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~ 302 (324) T protein:vir:93 223 SDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIA 302 (324) T ss_pred CCcccceeeEeecCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEe Confidence 78899999998777666666777777654332 2222233333210 10 12357889999999999 Q ss_pred ccccceEEEEccccccCCCCCccc Q lcl|Aclame:pro 282 EHLQKYIFTIGGTEVATKRDGVDA 305 (319) Q Consensus 282 ~~k~~~Iy~~~~~~~a~~~~~~~~ 305 (319) +|++..... ...+.+.+.++.. 
T Consensus 303 ~~~a~~~l~--~a~~~~~~~~~~~ 324 (324) T protein:vir:93 303 DDKAFAKLV--PADKRTDSVPGEV 324 (324) T ss_pred cccceEEEe--cccccCCCCCCCC Confidence 998755433 2333333344444 No 58 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=99.10 E-value=2.5e-10 Score=73.24 Aligned_cols=287 Identities=15% Similarity=0.032 Sum_probs=157.9 Q ss_pred CCcccccccceeeehhhhhhhhhcchhhh-------------hhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEE Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPGQT-------------LLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFT 67 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~n~~-------------~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVk 67 (319) |.|.= -++++++||+++-++.... -+++.+...+.+.....+..... +. ..-.++.+++ T Consensus 1 ~~~~~-----~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l-~~--~~~~~~~~~~ 72 (324) T protein:vir:96 1 MEQTQ-----KLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQL-GK--YEPMEGTEKK 72 (324) T ss_pred CCcch-----hhhHHHHHHHHhhhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhh-cc--eeeccCCceE Confidence 65542 3568999999987643221 34455543333333333322222 11 2345677899 Q ss_pred eeeccccccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 68 VMKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATL 147 (319) Q Consensus 68 Ip~i~~~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~l 147 (319) ||.....+-..... 
.+-....-+.++...++.-.|.-.+ +.--+.---....++...+.++...+++-.+|...+.-- T Consensus 73 ~p~~~~~~~a~~v~-Eg~~~~~~~~~f~~v~~~~~k~~~~-~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~~~l~G~ 150 (324) T protein:vir:96 73 FTFWADKPGAYWVG-EGQKIETSKATWVNATMRAFKLGVI-LPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ 150 (324) T ss_pred EEEEecCcceeeec-CCccccccccceeEEEEEeEEEEEe-ehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcC Confidence 99986554444332 2222222334455555554543333 222221111112456677788888889999988655311 Q ss_pred Hhc-cCc-------cccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeee Q lcl|Aclame:pro 148 ARN-KAK-------HLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQG 219 (319) Q Consensus 148 a~~-a~~-------~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg 219 (319) .+. .+. ...........|+.|+++...+..++.... .++++|..+..|++-.+ ..|......+..+ T Consensus 151 g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~~~~~~~-~~i~n~~~~~~L~~lkd-----~~G~~~~~~~~~~ 224 (324) T protein:vir:96 151 GNNPFGKSIAQSIKKTNKVIKGDFTQDNIIDLEALLEDDELEAN-AFISKTQNRSLLRKIVD-----PETKERIYDRNSD 224 (324) T ss_pred CCCCcCccccccccccceecccccchHHHHHHHHhhhhccCCCC-EEEEcHHHHHHHHHhhC-----CCCCeeecCCCCC Confidence 000 000 001111223458889999999988766533 47899999998875432 2233344556778 Q ss_pred eecCeEEEEecccccccceEEEEcCCceeeee-eeeeeeeecCC-------C--------CCccceeeeeeeeeEEEecc Q lcl|Aclame:pro 220 ELDGFVIVKVPTKLLQGLQAIAVVGEVLASPI-QADLAKTNSNI-------P--------GMFGTLAEQLLYTGAFVPEH 283 (319) Q Consensus 220 ~idG~~I~~vps~~~~~~n~i~~~~~A~~~~~-k~~~~~~~~~~-------~--------~~~~~~v~gr~~yg~~V~~~ 283 (319) ++.|.+|+.+++.......+++|+.+...... +--.+++.+.. 
+ ..+.-.++...++|..|.+| T Consensus 225 ~l~G~PV~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~~~r~d~~v~~~ 304 (324) T protein:vir:96 225 SLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADD 304 (324) T ss_pred cccceeeEeecCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecc Confidence 89999999877776777778877765443322 22233333210 0 11224678889999999999 Q ss_pred ccceEEEEccccccCCCCCccc Q lcl|Aclame:pro 284 LQKYIFTIGGTEVATKRDGVDA 305 (319) Q Consensus 284 k~~~Iy~~~~~~~a~~~~~~~~ 305 (319) ++... -..+.+.+..+++.. T Consensus 305 ~a~~~--l~~a~~~~~~~~~~~ 324 (324) T protein:vir:96 305 KAFAK--LVPADKRTDSVPGEV 324 (324) T ss_pred cceEE--EecccccCCCCCCCC Confidence 87443 233334444444444 No 59 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=99.06 E-value=6.9e-11 Score=76.30 Aligned_cols=292 Identities=8% Similarity=-0.077 Sum_probs=163.4 Q ss_pred CCcccccccceeeehhhhhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeecccc-c-ccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTT-E-LKD 78 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~-g-~~D 78 (319) |-- +-+..||-+..-.++..+-..+ ..+++....++.....+..+. .+|++|+||.+... 
| ..+ T Consensus 1 MA~---------T~lsd~i~PEvf~~yv~~~~~~----~~~l~qSG~i~~~~~l~~~~~-~~G~~it~P~~~~l~Gd~~~ 66 (351) T protein:vir:15 1 MAE---------THLSDLIVPEVFGNYVVNQIIK----TNRFVQSGILTPDPDLGPHLL-EAGTRITVPFLNDLTGDPDN 66 (351) T ss_pred CCc---------eeeeeeechhHHHHHHhhhhHH----hhhHhhcccccccHHHHHHhh-cCCCEEEecccccCCCcccc Confidence 321 2345677777666665432211 223332222222211122222 38999999999864 3 567 Q ss_pred ccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc---- Q lcl|Aclame:pro 79 YKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH---- 154 (319) Q Consensus 79 Y~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~---- 154 (319) |+.+...+++.++.....-++ +.|+..|.+.|....-+- .+++...+++.+...+...++.+++.|.+.-+.. T Consensus 67 ~~~~~~i~~~kitt~~~~a~i-~~~~kg~~~tD~a~~~sg--~dp~~~i~~q~a~~w~~~~q~~lla~l~gv~~~~~~~~ 143 (351) T protein:vir:15 67 WTDSDDIDVNNLTSGKQQGIK-FYQTKAYGYTDLGTMISG--APVQETIGNRFAAFWQRADQKTLLSVLKGVMGVTKIAN 143 (351) T ss_pred cCCCcccchheecccceeEEE-EeeccceehhhhhHhhcc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhhcc Confidence 877778888888877777766 446666777766655442 3666667777777888889999888774321110 Q ss_pred -----ccccCCHhHH--HHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhh--hhhhcccccccceeeeeeeeecCeE Q lcl|Aclame:pro 155 -----LTVGTGSDAQ--YDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFV--IALPQGDTRQQVLGKGVQGELDGFV 225 (319) Q Consensus 155 -----~~~~~T~~n~--~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~--~f~~~~~~~~~~~~~g~Vg~idG~~ 225 (319) .+...+.++. ++.|.+|..+|.+..-..-..++|.|.++..|++.. 
.|.+..+ .++.|+.+.|++ T Consensus 144 ~~~~d~t~~~~~~~~is~~~l~~A~~~~GD~~~~~~~~ivmhS~v~~~L~~~~li~~~~~s~------~~~~i~t~~G~~ 217 (351) T protein:vir:15 144 SKVYDQTKVSPSEPMFGAKGFTGAIGLMGDLQDTAFGAIAVNSATYSLMKVQGLIETIQPQN------GATPFEAYNGLR 217 (351) T ss_pred cceeccccccccccccCHHHHHHHHHHhccccccceEEEEEChHHHHHHHhhhhhhhccccc------cCcccceecceE Confidence 1111122233 588999999996642233467889999999998764 3443211 245689999999 Q ss_pred EEEeccc-------ccccceEEEEcCCceeeeeeeeeeeeecCCCCCcc-ceeeeeeeeeEEEeccccceEEEEcc-ccc Q lcl|Aclame:pro 226 IVKVPTK-------LLQGLQAIAVVGEVLASPIQADLAKTNSNIPGMFG-TLAEQLLYTGAFVPEHLQKYIFTIGG-TEV 296 (319) Q Consensus 226 I~~vps~-------~~~~~n~i~~~~~A~~~~~k~~~~~~~~~~~~~~~-~~v~gr~~yg~~V~~~k~~~Iy~~~~-~~~ 296 (319) |+...+. ....+..++..++|+..-.+-..+|+.|++....| +.+..|+.|. +-|+.-. |.... ... T Consensus 218 VivdD~~p~~~~~~~~~~ytsyl~~~GAi~~~~~~~~ve~~rd~~~~~g~d~l~~r~~~~---~hp~G~s-~~~~~~~~~ 293 (351) T protein:vir:15 218 IVLDDDIEIDLTDKTKPVSTSYIFAPGAVRYSTNMRSTETKYDPLINGGQDVIVQKRVGT---IHVAGTS-IKASFSPSK 293 (351) T ss_pred EEEcCCCccccCCCCCceeEEEEEecceeeeecCCcCcceeecccCCCCceEEEEeeeee---eeeeeee-ecccccccC Confidence 9964221 11235567777888887777777888875443333 4444555543 3333211 11110 011 Q ss_pred cCCCCCccccccccccccccccC Q lcl|Aclame:pro 297 ATKRDGVDAHADNVAKPSGSLEM 319 (319) Q Consensus 297 a~~~~~~~~~~~~~~~~~~~~~~ 319 (319) ...|+-..+....-|.-+++-|- T Consensus 294 ~~sPt~~~L~~~~NW~~v~~~d~ 316 (351) T protein:vir:15 294 ASFPTIDELAKSSTWEVVDGIDV 316 (351) T ss_pred cCCcChHHhcCCcccccccCCCc Confidence 11222233333334444433222 No 60 >protein:vir:2106 Length: 430 # NCBI annotation: coat protein # Family: family:all:1412 # MgeID: mge:46 # MgeName: P22 # Cross-refs: genbank:acc:NP_059630;genbank:gi:9635538;genbank:GeneID:1262831 Probab=99.02 E-value=1.8e-10 Score=74.06 Aligned_cols=267 Identities=10% Similarity=0.017 Sum_probs=146.3 Q ss_pred hhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceee---eCCceEEeeecccccccc-ccCCCCcccCCcccce Q lcl|Aclame:pro 19 
FANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIF---MEGRSFTVMKGDTTELKD-YKRNATNEFDHPKIEE 94 (319) Q Consensus 19 ~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~---~~g~tVkIp~i~~~g~~D-Y~r~~~~~~~~~t~t~ 94 (319) .|++..+..+|.. +.+..+++......++. ...++|.. ..|++|+||.-......+ -++++ ...++.-.. T Consensus 1 Ma~~~~~~lti~~-~eal~~~~n~lV~a~~~---~~~r~~d~~~~r~Gdti~ip~p~~~~~~~G~~~t~--~~~~~~e~~ 74 (430) T protein:vir:21 1 MALNEGQIVTLAV-DEIIETISAITPMAQKA---KKYTPPAASMQRSSNTIWMPVEQESPTQEGWDLTD--KATGLLELN 74 (430) T ss_pred CccccchhhHHHH-HHHHHHhhhhhhhhhhh---hccCCchhhhhcccceEEeeccccccccccccccC--CCccceeee Confidence 4666556666666 55555555444333321 23355532 468999999654322222 01121 123466678 Q ss_pred eEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCc----cccccCCHhHHHHHHHH Q lcl|Aclame:pro 95 TTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAK----HLTVGTGSDAQYDAVLD 170 (319) Q Consensus 95 ~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~----~~~~~~T~~n~~~~i~~ 170 (319) .+++|++.|...|.+.+-+ .. ..++....-+.+..+++-+||+.+++.++....- ......+..+.+..+-+ T Consensus 75 v~~~~~~~~~V~~~~~~kE--l~--~~~~~er~l~pAm~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A~ 150 (430) T protein:vir:21 75 VAVNMGEPDNDFFQLRADD--LR--DETAYRRRIQSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVAD 150 (430) T ss_pred EeEEEeeeccceEEeehhH--hc--ChhhHHHHHHHHHHHHHHHHHHHHHHHhhhhhhccccccCCCCCCCCcchhhHHH Confidence 8999999998777776222 11 2333333445666899999999999876554321 11222334456888889 Q ss_pred HHHHHHhccCC-C-CcEEEEChHHHHHHhh-hhhhhhcccccccceeeeeeee-ecCeE-EEEe---cc----------- Q lcl|Aclame:pro 171 VSVELDEIKAP-E-NRVLFVSPTFYKGIKK-FVIALPQGDTRQQVLGKGVQGE-LDGFV-IVKV---PT----------- 231 (319) Q Consensus 171 a~~~Lde~~VP-~-~R~l~VsP~~~~~L~~-~~~f~~~~~~~~~~~~~g~Vg~-idG~~-I~~v---ps----------- 231 (319) +.+.|++.++| + +|.++++|+.+..|.. ...+........+..++|.|++ +.||+ +++. |. 
T Consensus 151 a~~~L~~~~vP~~~~R~~~~~p~~~~~l~~~l~~~~~~~~~~~~A~r~g~i~r~~~Gfd~~~~s~~~~~~t~gt~t~~tv 230 (430) T protein:vir:21 151 AEEIMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITV 230 (430) T ss_pred HHHHHHHhcCCCCCCcEEEeChHHHHHHhhhhccccccccchhHHHhhcccccccchhhhhhhcCCcccccCccCcCcee Confidence 99999999999 3 7999999999987733 2222222222334455666654 55653 4431 00 Q ss_pred --c------------------------------c--c-c----------------------------------------- Q lcl|Aclame:pro 232 --K------------------------------L--L-Q----------------------------------------- 235 (319) Q Consensus 232 --~------------------------------~--~-~----------------------------------------- 235 (319) . . + + T Consensus 231 ~gA~~~~~~~~tv~~~g~~~~~d~~~~~it~s~tg~l~~GD~ftiaGV~~v~~itk~~~~~l~qf~V~a~~~~ttv~I~P 310 (430) T protein:vir:21 231 SGAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGMKRGDKISFAGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITP 310 (430) T ss_pred ccccccccccceeccccccccccccceeeeeecccceecccEEEecceeeeccccccccCCcceEEEEEecCCceeEEee Confidence 0 0 0 0 Q ss_pred ----------------------------cceE---------EEEcCCceeeeeeee----------------------ee Q lcl|Aclame:pro 236 ----------------------------GLQA---------IAVVGEVLASPIQAD----------------------LA 256 (319) Q Consensus 236 ----------------------------~~n~---------i~~~~~A~~~~~k~~----------------------~~ 256 (319) .+.| ++.|++|+..+..-- .+ T Consensus 311 ai~~~~~~~~~~~~~~y~nVsaspa~~aavT~v~~a~~~~Nl~fh~~A~~La~~pl~~p~~~~~~~~~~~~~~~~~Glsi 390 (430) T protein:vir:21 311 KPVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNG 390 (430) T ss_pred cccccccccccccccccceeccccccCceeEEeccCCcccceeEccceeEEEEecccCCCChhHhhheeeeeccccceEE Confidence 0001 445555555443311 01 Q ss_pred eeecC-CCCCccceeeeeeeeeEEEeccccceEEEEcccc Q lcl|Aclame:pro 257 KTNSN-IPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTE 295 (319) Q Consensus 257 ~~~~~-~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~ 295 (319) ++++. 
..+.....++--..||++.++|+..+|-..-.++ T Consensus 391 rv~~~yd~~~~~~~~r~DilyG~~~l~Pe~a~v~l~g~~~ 430 (430) T protein:vir:21 391 IFATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred EEEEccccccCceEEEEEeecCccccCcceEEEEcCCCCC Confidence 11100 0111123445556789999999998775544333 No 61 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=99.01 E-value=9.6e-10 Score=70.03 Aligned_cols=285 Identities=16% Similarity=0.060 Sum_probs=158.3 Q ss_pred CCcccccccceeeehhhhhhhhhcchhh-------------hhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCceE Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPGQ-------------TLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSF 66 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~n~-------------~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~tV 66 (319) |.|. --++|++++|+++..+.-. ..+++.+. .+++.+...+.+.. . +. ..-..+.++ T Consensus 1 ~~~~-----~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~-l-~~--~~~~~~~~~ 71 (324) T protein:vir:78 1 MEQT-----QKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQ-L-GK--YEPMEGTEK 71 (324) T ss_pred CCcc-----hhhhHHHHHHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhh-h-cc--eeeccCCce Confidence 5554 2367899999988764311 23455554 34444443333322 2 22 234457789 Q ss_pred EeeeccccccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHh-hhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 67 TVMKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTE-GNIDINYVVARQGAEVVAPYLDNLRFA 145 (319) Q Consensus 67 kIp~i~~~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~-~~~~~~~~~~~~~~~~vapeiD~~~~s 145 (319) +||.....+-.... ..+-.....+.++...++.-.|.-.+. . +..+-.. ....+...+.+..+.+++-.+|...+. 
T Consensus 72 ~~p~~~~~~~a~~v-~Eg~~~~~~~~~~~~v~~~~~k~~~~~-~-is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~ 148 (324) T protein:vir:78 72 KFTFWADKPGAYWV-GEGQKIETSKATWVNATMRAFKLGVIL-P-VTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL 148 (324) T ss_pred EEEEEecCcceeEe-cCCccccccccceeEEEEeeEEEEEee-h-hhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 99998765444332 222222222334445555444433322 1 2221111 124556777788888899899886653 Q ss_pred HHHhc-cCc-------cccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeee Q lcl|Aclame:pro 146 TLARN-KAK-------HLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGV 217 (319) Q Consensus 146 ~la~~-a~~-------~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~ 217 (319) --..+ ... ........+..|+.|.++...|..++.... .++++|..+..|.+-.+ ..|......+. T Consensus 149 G~g~~~~~~gi~~~~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~-~~vmn~~~~~~L~~l~d-----~~G~~~~~~~~ 222 (324) T protein:vir:78 149 NQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEAN-AFISKTQNRSLLRKIVD-----PETKERIYDRN 222 (324) T ss_pred cCCCCCcCccccccccccceeccccccHHHHHHHHHhhhhccCCCC-EEEEcHHHHHHHHHhhc-----cCCCeeecCCC Confidence 11100 000 001111234468889999998988765533 57899999998875432 12333445677 Q ss_pred eeeecCeEEEEecccccccceEEEEcCCceee-eeeeeeeeeecCC-------C--------CCccceeeeeeeeeEEEe Q lcl|Aclame:pro 218 QGELDGFVIVKVPTKLLQGLQAIAVVGEVLAS-PIQADLAKTNSNI-------P--------GMFGTLAEQLLYTGAFVP 281 (319) Q Consensus 218 Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~-~~k~~~~~~~~~~-------~--------~~~~~~v~gr~~yg~~V~ 281 (319) .+++.|.+|+.+++.......+++|+.+-... ..+--.+++.+.. + .++...++...++|..|. 
T Consensus 223 ~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~ 302 (324) T protein:vir:78 223 SDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIA 302 (324) T ss_pred CCcccceeeEeeCCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEe Confidence 78899999998877766666777776654322 2232234443211 0 113357788899999999 Q ss_pred ccccceEEEEccccccCCCCCccc Q lcl|Aclame:pro 282 EHLQKYIFTIGGTEVATKRDGVDA 305 (319) Q Consensus 282 ~~k~~~Iy~~~~~~~a~~~~~~~~ 305 (319) +|++..... +....+...++.. T Consensus 303 ~~~A~~~l~--~a~~~~~~~~~~~ 324 (324) T protein:vir:78 303 DDKAFAKLV--PADKRTDSVPGEV 324 (324) T ss_pred cccceEEEe--cccccCCCCCCCC Confidence 998754433 3334444445554 No 62 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=99.01 E-value=9.6e-10 Score=70.03 Aligned_cols=285 Identities=16% Similarity=0.060 Sum_probs=158.3 Q ss_pred CCcccccccceeeehhhhhhhhhcchhh-------------hhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCceE Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPGQ-------------TLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSF 66 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~n~-------------~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~tV 66 (319) |.|. --++|++++|+++..+.-. ..+++.+. .+++.+...+.+.. . +. 
..-..+.++ T Consensus 1 ~~~~-----~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~-l-~~--~~~~~~~~~ 71 (324) T protein:vir:96 1 MEQT-----QKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQ-L-GK--YEPMEGTEK 71 (324) T ss_pred CCcc-----hhhhHHHHHHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhh-h-cc--eeeccCCce Confidence 5554 2367899999988764311 23455554 34444443333322 2 22 234457789 Q ss_pred EeeeccccccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHh-hhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 67 TVMKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTE-GNIDINYVVARQGAEVVAPYLDNLRFA 145 (319) Q Consensus 67 kIp~i~~~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~-~~~~~~~~~~~~~~~~vapeiD~~~~s 145 (319) +||.....+-.... ..+-.....+.++...++.-.|.-.+. . +..+-.. ....+...+.+..+.+++-.+|...+. T Consensus 72 ~~p~~~~~~~a~~v-~Eg~~~~~~~~~~~~v~~~~~k~~~~~-~-is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~ 148 (324) T protein:vir:96 72 KFTFWADKPGAYWV-GEGQKIETSKATWVNATMRAFKLGVIL-P-VTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL 148 (324) T ss_pred EEEEEecCcceeEe-cCCccccccccceeEEEEeeEEEEEee-h-hhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 99998765444332 222222222334445555444433322 1 2221111 124556777788888899899886653 Q ss_pred HHHhc-cCc-------cccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeee Q lcl|Aclame:pro 146 TLARN-KAK-------HLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGV 217 (319) Q Consensus 146 ~la~~-a~~-------~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~ 217 (319) --..+ ... ........+..|+.|.++...|..++.... .++++|..+..|.+-.+ ..|......+. 
T Consensus 149 G~g~~~~~~gi~~~~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~-~~vmn~~~~~~L~~l~d-----~~G~~~~~~~~ 222 (324) T protein:vir:96 149 NQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEAN-AFISKTQNRSLLRKIVD-----PETKERIYDRN 222 (324) T ss_pred cCCCCCcCccccccccccceeccccccHHHHHHHHHhhhhccCCCC-EEEEcHHHHHHHHHhhc-----cCCCeeecCCC Confidence 11100 000 001111234468889999998988765533 57899999998875432 12333445677 Q ss_pred eeeecCeEEEEecccccccceEEEEcCCceee-eeeeeeeeeecCC-------C--------CCccceeeeeeeeeEEEe Q lcl|Aclame:pro 218 QGELDGFVIVKVPTKLLQGLQAIAVVGEVLAS-PIQADLAKTNSNI-------P--------GMFGTLAEQLLYTGAFVP 281 (319) Q Consensus 218 Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~-~~k~~~~~~~~~~-------~--------~~~~~~v~gr~~yg~~V~ 281 (319) .+++.|.+|+.+++.......+++|+.+-... ..+--.+++.+.. + .++...++...++|..|. T Consensus 223 ~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~ 302 (324) T protein:vir:96 223 SDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIA 302 (324) T ss_pred CCcccceeeEeeCCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEe Confidence 78899999998877766666777776654322 2232234443211 0 113357788899999999 Q ss_pred ccccceEEEEccccccCCCCCccc Q lcl|Aclame:pro 282 EHLQKYIFTIGGTEVATKRDGVDA 305 (319) Q Consensus 282 ~~k~~~Iy~~~~~~~a~~~~~~~~ 305 (319) +|++..... +....+...++.. 
T Consensus 303 ~~~A~~~l~--~a~~~~~~~~~~~ 324 (324) T protein:vir:96 303 DDKAFAKLV--PADKRTDSVPGEV 324 (324) T ss_pred cccceEEEe--cccccCCCCCCCC Confidence 998754433 3334444445554 No 63 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=98.98 E-value=1.3e-09 Score=69.39 Aligned_cols=286 Identities=15% Similarity=0.038 Sum_probs=156.2 Q ss_pred CCcccccccceeeehhhhhhhhhcch-------------hhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEE Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEP-------------GQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFT 67 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~-------------n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVk 67 (319) |.|.- -+.+++.+|++.-.++ ....+++.+...+-+.....+..... + ..+-.++.+++ T Consensus 1 ~~~~~-----~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~-~--~~~~~~~~~~~ 72 (324) T protein:vir:97 1 MEQTQ-----KLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQL-G--KYEPMEGTEKK 72 (324) T ss_pred Cccch-----hHHHHHHHHHHhhhhhhhhccccccccCCCcceechhHHHHHHHHHHhhcchhhh-c--ceeeccCCceE Confidence 66652 2567788888665532 22345666654333333333322222 1 22345677899 Q ss_pred eeeccccccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhh-hHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 68 VMKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDINYVVARQGAEVVAPYLDNLRFAT 146 (319) Q Consensus 68 Ip~i~~~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~~~~~~~~~~~vapeiD~~~~s~ 146 (319) ||.....+....... +-.....+.++...++...|.-.+. . +..+-.+. 
...+...+.+....+++-.+|...+.- T Consensus 73 ip~~~~~~~a~~v~E-g~~~~~~~~~f~~v~~~~~k~~~~~-~-is~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G 149 (324) T protein:vir:97 73 FTFWADKPGAYWVGE-GQKIETSKATWVNATMRAFKLGVIL-P-VTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILN 149 (324) T ss_pred EEEEecCcceeEecc-CccccccccceeEEEEeeEEEEEee-h-hhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 999875544433322 2223333445555555555544432 2 22221111 245567777888888888898876531 Q ss_pred HHhc-cCc-------cccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeee Q lcl|Aclame:pro 147 LARN-KAK-------HLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQ 218 (319) Q Consensus 147 la~~-a~~-------~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~V 218 (319) -..+ ... ........+..|+.|+++...|..++.... .++++|..+..|.+-.+ ..|+.....+.- T Consensus 150 ~g~~~~~~gi~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~-~~v~n~~~~~~L~~lkd-----~~g~~~~~~~~~ 223 (324) T protein:vir:97 150 QGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEAN-AFISKTQNRSLLRKIVD-----PETKERIYDRNS 223 (324) T ss_pred CCCCccCccccccccccceeccccCCHHHHHHHHHhhhhccCCCC-EEEEcHHHHHHHHHhhc-----CCCceeecCCCC Confidence 1100 000 000111133458889999999988766543 57899999988875432 123333445556 Q ss_pred eeecCeEEEEecccccccceEEEEcCCceee-eeeeeeeeeecCC-------C--------CCccceeeeeeeeeEEEec Q lcl|Aclame:pro 219 GELDGFVIVKVPTKLLQGLQAIAVVGEVLAS-PIQADLAKTNSNI-------P--------GMFGTLAEQLLYTGAFVPE 282 (319) Q Consensus 219 g~idG~~I~~vps~~~~~~n~i~~~~~A~~~-~~k~~~~~~~~~~-------~--------~~~~~~v~gr~~yg~~V~~ 282 (319) +.+.|.+|+.+++...+...+++|..+-... ..+--.+++.+.. 
+ .++--.++...++|..|.+ T Consensus 224 ~tl~G~PV~~~~~~~~~~~~~~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~ 303 (324) T protein:vir:97 224 DTLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIAD 303 (324) T ss_pred ccccceeeEeecCCCCCcceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEec Confidence 7899999998777666666777776654433 2332244444321 0 0123467888999999999 Q ss_pred cccceEEEEccccccCCCCCccc Q lcl|Aclame:pro 283 HLQKYIFTIGGTEVATKRDGVDA 305 (319) Q Consensus 283 ~k~~~Iy~~~~~~~a~~~~~~~~ 305 (319) |++..+..... +.+...++.. T Consensus 304 ~~a~~~l~~~~--~~~~~~~~~~ 324 (324) T protein:vir:97 304 DKAFAKLVPAD--KKTDSVPGEV 324 (324) T ss_pred ccceEEEEecc--CCCCCCCCCC Confidence 99865544432 2222222322 No 64 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=98.97 E-value=1.6e-09 Score=68.81 Aligned_cols=285 Identities=16% Similarity=0.062 Sum_probs=154.7 Q ss_pred CCcccccccceeeehhhhhhhhhcchhhh-------------hhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCceE Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPGQT-------------LLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSF 66 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~n~~-------------~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~tV 66 (319) |.|.=+ +++++++|+.+-.+..+. -+++.+. .+++.+...+.+.. . +. 
.+-.++.++ T Consensus 1 ~~k~~~-----~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~-~-~~--~~~~~~~~~ 71 (324) T protein:vir:99 1 MEQTQK-----LKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMR-L-GK--YEPMEGTEK 71 (324) T ss_pred CCCchH-----hhHHHHHHHHHhhhhhhccccceeccCCCcceechhHHHHHHHHHHhhchhhh-h-cc--eeeccCCce Confidence 777632 568999999887644332 3455554 33444433332221 1 21 234557789 Q ss_pred EeeeccccccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhh-hHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 67 TVMKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDINYVVARQGAEVVAPYLDNLRFA 145 (319) Q Consensus 67 kIp~i~~~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~~~~~~~~~~~vapeiD~~~~s 145 (319) +||.....+-...... +-.....+.++...++...|.-.+. . +..+-.+. ...+...+.+....+++-.+|.-.+. T Consensus 72 ~~p~~~~~~~a~~v~E-g~~~~~~~~~~~~v~~~~~k~~~~~-~-iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~ 148 (324) T protein:vir:99 72 KFTFWADKPGAYWVGE-GQKIETSKATWVNATMRAFKLGVIL-P-VTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL 148 (324) T ss_pred EEEEEecCcceeEecc-CccccccccceeEEEEeeEEEEEee-h-hhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 9999865444443322 2222222334555555555544432 2 22221111 23455666777788888888886653 Q ss_pred HHHhcc-Cc-------cccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeee Q lcl|Aclame:pro 146 TLARNK-AK-------HLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGV 217 (319) Q Consensus 146 ~la~~a-~~-------~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~ 217 (319) --..+. +. ........+..++.|.++...|...+.... .++++|..+..|.+-.+ ..++.....+. 
T Consensus 149 G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~-~~v~n~~~~~~L~~l~d-----~~g~~~~~~~~ 222 (324) T protein:vir:99 149 NQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEAN-AFISKTQNRSLLRKIVD-----PETKERIYDRN 222 (324) T ss_pred cCCCCccCccccccccccceeccccCCHHHHHHHHHhhhhccCCCC-EEEEcHHHHHHHHHhhc-----CCCceeecCCC Confidence 111100 00 001111234458889999999988765544 47899999998875432 12333334455 Q ss_pred eeeecCeEEEEecccccccceEEEEcCCceeee-eeeeeeeeecCC-------CC--------CccceeeeeeeeeEEEe Q lcl|Aclame:pro 218 QGELDGFVIVKVPTKLLQGLQAIAVVGEVLASP-IQADLAKTNSNI-------PG--------MFGTLAEQLLYTGAFVP 281 (319) Q Consensus 218 Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~~-~k~~~~~~~~~~-------~~--------~~~~~v~gr~~yg~~V~ 281 (319) -+++.|.+|+.+++.......+++++.+-.... .+=-.+++.+.. ++ ++.-+++...++|..|. T Consensus 223 ~~~l~G~PVv~~~~~~~~~~~~i~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~ 302 (324) T protein:vir:99 223 SDTLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIA 302 (324) T ss_pred CccccceeEEeecCCCCCcceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEe Confidence 567999999987776666666777776543322 222233333210 00 12347788899999999 Q ss_pred ccccceEEEEccccccCCCCCccc Q lcl|Aclame:pro 282 EHLQKYIFTIGGTEVATKRDGVDA 305 (319) Q Consensus 282 ~~k~~~Iy~~~~~~~a~~~~~~~~ 305 (319) +|++...... ..+.....++.. 
T Consensus 303 ~~~a~~~lt~--a~~~~~~~~~~~ 324 (324) T protein:vir:99 303 DDKAFAKLVP--ADKKTDSVPGEV 324 (324) T ss_pred cccceEEEEe--ccCCCCCCCCCC Confidence 9998555433 333333333333 No 65 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=98.96 E-value=1.9e-09 Score=68.45 Aligned_cols=285 Identities=17% Similarity=0.073 Sum_probs=154.4 Q ss_pred CCcccccccceeeehhhhhhhhhcchhhh-------------hhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCceE Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPGQT-------------LLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSF 66 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~n~~-------------~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~tV 66 (319) |.|.=| ++++++.|+.+-.+.... -+++.+. .+++.+.. .+..... + ..+-.++.++ T Consensus 1 ~~~~~~-----~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~-~s~l~~~-~--~~~~~~~~~~ 71 (324) T protein:vir:10 1 MEQTQK-----LKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVME-NSKIMQL-G--KYEPMEGTEK 71 (324) T ss_pred CCCchH-----HHHHHHHHHHHhhccceecccceeccCCCcceechhHHHHHHHHHHh-hchhhhh-c--ceeeccCCce Confidence 666533 568999999886533322 3455554 33433333 3322211 2 1244567789 Q ss_pred EeeeccccccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHh-hhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 67 TVMKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTE-GNIDINYVVARQGAEVVAPYLDNLRFA 145 (319) Q Consensus 67 kIp~i~~~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~-~~~~~~~~~~~~~~~~vapeiD~~~~s 145 (319) +||.....+...+... +-.....+.++...++...|.-.+. . +..+-.. ....+...+.+....+++-.+|.-.+. 
T Consensus 72 ~~p~~~~~~~a~~v~E-g~~~~~~~~~~~~v~~~~~k~~~~~-~-iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~a~l~ 148 (324) T protein:vir:10 72 KFTFWADKPGAYWVGE-GQKIETSKATWVNATMRAFKLGVIL-P-VTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL 148 (324) T ss_pred EEEEEeCCcceeEecc-CccccccccceeEEEEeeEEEEEee-h-hhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 9999865444443322 2222222344555555555544432 2 2221111 123455666778888888888886553 Q ss_pred HHHhc-cCcc-------ccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeee Q lcl|Aclame:pro 146 TLARN-KAKH-------LTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGV 217 (319) Q Consensus 146 ~la~~-a~~~-------~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~ 217 (319) --..+ .+.. .....+.+..++.|.++...|..++.... .++++|..+..|.+-.+ ..|+.....+. T Consensus 149 G~g~~~~~~~i~~~~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~-~~v~n~~~~~~L~~l~d-----~~g~~~~~~~~ 222 (324) T protein:vir:10 149 NQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEAN-AFISKTQNRSLLRKIVD-----PETKERIYDRN 222 (324) T ss_pred cCCCCccCccccccccccceeccccCCHHHHHHHHHhhhhccCCCC-EEEEcHHHHHHHHHhhc-----cCCceeecCCC Confidence 11110 0000 00111223458888999988888765534 47899999998875432 12333334455 Q ss_pred eeeecCeEEEEecccccccceEEEEcCCceeeee-eeeeeeeecCC-------C--------CCccceeeeeeeeeEEEe Q lcl|Aclame:pro 218 QGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPI-QADLAKTNSNI-------P--------GMFGTLAEQLLYTGAFVP 281 (319) Q Consensus 218 Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~~~-k~~~~~~~~~~-------~--------~~~~~~v~gr~~yg~~V~ 281 (319) -+++.|.+|+.+++....+..+++++.+-..... +=-.+++.+.. + .++.-+++...++|..|. 
T Consensus 223 ~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~ 302 (324) T protein:vir:10 223 SDTLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIA 302 (324) T ss_pred CccccceeEEeecCCCCCcceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEe Confidence 5679999999887777777777777765543322 22233333210 1 113357788899999999 Q ss_pred ccccceEEEEccccccCCCCCccc Q lcl|Aclame:pro 282 EHLQKYIFTIGGTEVATKRDGVDA 305 (319) Q Consensus 282 ~~k~~~Iy~~~~~~~a~~~~~~~~ 305 (319) +|++....... .+.+...++.. T Consensus 303 ~~~A~~~l~~a--~~~~~~~~~~~ 324 (324) T protein:vir:10 303 DDKAFAKLVPA--DKKTDSVPGEV 324 (324) T ss_pred cccceEEEEec--cCCCCCCCCCC Confidence 99875554433 23332233333 No 66 >protein:vir:98871 Length: 314 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164418;genbank:gi:56694908;genbank:GeneID:3197261 Probab=98.89 E-value=2.8e-10 Score=72.95 Aligned_cols=286 Identities=15% Similarity=0.146 Sum_probs=173.8 Q ss_pred CCcccccccceeee-hhhhhhhhhcchhhh--hhhHhhHHHHHHHHHhhhhhhhcccCccee-----eeCCc--eEEeee Q lcl|Aclame:pro 1 MNKTIKNATGMLKL-NLQHFANKSVEPGQT--LLKNKHVGILERVTAVNAYSTPALISNDAI-----FMEGR--SFTVMK 70 (319) Q Consensus 1 ~~~~~~~~~~~~~~-~~~~~~~~~~~~n~~--~l~~ky~~lld~~~~~~sl~~~~~~n~~~~-----~~~g~--tVkIp~ 70 (319) |.|..|- .|.+ |.|.||.--.-.|.. .|+++|.+||..++...++-...... .++ .++.. .||-.. 
T Consensus 1 ~~~~~~~---~~~~~~~~~~~~~t~N~n~avr~Y~Kqf~glL~~vf~~qa~F~~~FGg-~lQalDGV~~N~tafsvKtsD 76 (314) T protein:vir:98 1 MKKQFKP---FLPLNNIQFFASGTANQNKAARSYQKEFRQLLQAVFRSQAYFRDFFGG-GIEALDGVQHNDTAFYVKTSD 76 (314) T ss_pred Ccccccc---cccccceeeeeeccccCccceeeecHHHHHHHHHHHhhHhhhhhhccc-ceeeccCCCccceEEEEeecc Confidence 7776654 4555 467777433222222 58999999999999887765433222 121 11222 455555 Q ss_pred ccccccc-cccCCCCcccCCcc-----------cceeEEEEeecccce--eecchhhHHHHhhhHHH--HHHHHHHHHHH Q lcl|Aclame:pro 71 GDTTELK-DYKRNATNEFDHPK-----------IEETTYFLDQEKYWG--RFVDALDRKDTEGNIDI--NYVVARQGAEV 134 (319) Q Consensus 71 i~~~g~~-DY~r~~~~~~~~~t-----------~t~~tltidqdr~~~--F~VD~~D~~et~~~~~~--~~~~~~~~~~~ 134 (319) +.++ ++ .|+.+.-..+|+-+ .-++...+.++-.|. -.||.+- .|.++.+ ++- -+.++.+ T Consensus 77 ~pVV-ig~~Y~TdeNvaFGtGTg~SsRFGprkEi~y~dtdVpY~~~~~iHEGiD~~T---VNnd~~aaVAdR-L~LQA~A 151 (314) T protein:vir:98 77 IPVV-VGNEYNKDENVGFGEGTSRSTRFGPRREIIYQDTPVPYTWEWVYHEGIDKHT---VNNDFQAAVADR-LDLQANA 151 (314) T ss_pred ccee-ecCcccCCCCcccccCCccccccCceeEEEeecccccccccchhhhcccccc---ccCChhHHHHHH-HHHHHHH Confidence 4432 33 46543322222221 122222333333333 3455443 3333322 111 1234445 Q ss_pred HHHHHHHHHHHHHHhccCcccc-ccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccce Q lcl|Aclame:pro 135 VAPYLDNLRFATLARNKAKHLT-VGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVL 213 (319) Q Consensus 135 vapeiD~~~~s~la~~a~~~~~-~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~ 213 (319) -...+|.-.-.-|...|+.+.. 
+.++.+++...+-.+.+..-+.+|-.....+|+|++|.+|...+--+.+-.+.-++- T Consensus 152 kt~~~n~~~Gk~lS~~As~te~ltd~~~d~V~~LF~~as~~yvn~ev~~~~~AyV~~evYnaiiD~~l~TsaK~SsaNID 231 (314) T protein:vir:98 152 KIKQFNAQHSKFISSIAEKTETLTDYSADNVLRLFNELSKYYVNIEAIGTKAAKVSPELYNAIVDHPLTTSAKSSSANID 231 (314) T ss_pred HHHHHHHHHHHHHHhhhhhhhhhhhcchhhHHHHHHHHHhhhhcceeeEEEEEEEchhHHhHhhccccccccccceeeec Confidence 5555666433334444444333 456889999999999999888888877889999999999987765443222221222 Q ss_pred eeeeeeeecCeEEEEecccccccceEEEEc-CCceeeeeeeeeeeeecCCCCCccceeeeeeeeeEEEeccccceEEEEc Q lcl|Aclame:pro 214 GKGVQGELDGFVIVKVPTKLLQGLQAIAVV-GEVLASPIQADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIG 292 (319) Q Consensus 214 ~~g~Vg~idG~~I~~vps~~~~~~n~i~~~-~~A~~~~~k~~~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~ 292 (319) -+| +-++-||.|.++|...+.+-.+.+.+ .+-..+..=|...|+.+ .++-.|-+.+|--=||-|+++.-+.+|.=.. T Consensus 232 eng-i~~FkGf~i~e~P~~~~q~g~ia~~s~dnig~aftGIn~aR~Ie-sEdF~GValQgAGK~G~~I~edNk~Ai~k~t 309 (314) T protein:vir:98 232 QNG-IVNFKGFAIQEIPESMLQSGDVAYTYITNIGKAFTGINTSRIIE-SEDFDGVALQGAGKAGEFILDDNKKAVAKVT 309 (314) T ss_pred cCC-cceecceEEEecchhhcCCCcEEEEccccceeecccceeeeeee-cccccceeeecccccccccccccceeeEEEe Confidence 344 44688999999999999988877766 44455667788888887 7998999999999999999987777884444 Q ss_pred ccccc Q lcl|Aclame:pro 293 GTEVA 297 (319) Q Consensus 293 ~~~~a 297 (319) .++.+ T Consensus 310 ~tp~~ 314 (314) T protein:vir:98 310 STPEG 314 (314) T ss_pred cCCCC Confidence 44444 No 67 >protein:vir:100939 Length: 430 # NCBI annotation: Gp5 # Family: family:all:1412 # MgeID: mge:1509 # MgeName: ST104 # Cross-refs: genbank:acc:YP_006408;genbank:gi:46358700;genbank:GeneID:2777089 Probab=98.86 E-value=2e-09 Score=68.28 Aligned_cols=268 Identities=10% Similarity=0.013 Sum_probs=140.8 Q ss_pred hhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCccee---eeCCceEEeeeccccccccccCCCCcccCCccccee Q lcl|Aclame:pro 19 FANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAI---FMEGRSFTVMKGDTTELKDYKRNATNEFDHPKIEET 95 (319) Q Consensus 19 
~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~---~~~g~tVkIp~i~~~g~~DY~r~~~~~~~~~t~t~~ 95 (319) -|++....-++ .++....+|+......... ...++|. ...|++|+||.-......|= ..-.-...++.-... T Consensus 1 MAn~l~~~~~i-i~~eal~~l~n~~v~a~~~---~~~r~~d~~~~r~Gdti~~p~~~~~~~~~G-~~~t~~~~~i~e~~v 75 (430) T protein:vir:10 1 MALNEGQIVTL-AVDEIIETISAITPMAQKA---KKYTPPAASMQRSSNTIWMPVEQESPTQEG-WDLTDKATGLLELNV 75 (430) T ss_pred CccchhhHHHH-HHHHHHHHHhhhhhhhhhh---cccCCchhhhhcccceEEeccccccccccC-cccCCCCCccccceE Confidence 34443333222 4444455554443332221 1234442 24689999997754433330 000111234555688 Q ss_pred EEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCc----cccccCCHhHHHHHHHHH Q lcl|Aclame:pro 96 TYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAK----HLTVGTGSDAQYDAVLDV 171 (319) Q Consensus 96 tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~----~~~~~~T~~n~~~~i~~a 171 (319) +++|++-|...|.+.+-+ ... .-...+.+ +.+-.+++-+||+.+++.++.-..- .........+.+..+-.+ T Consensus 76 ~~~v~~~k~V~~~~~~ke--l~~-~~~~~~~i-~~Am~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A~a 151 (430) T protein:vir:10 76 AVNMGEPDNDFFQLRADD--LRD-ETAYRHRI-QSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADA 151 (430) T ss_pred EEEEeeeccceEEechhH--hcC-hhHHHHHh-HHHHHHHHHHHHHHHHHHhhhcccccccccccCCCcCCcchhhHHHH Confidence 999999999999998533 322 12223333 4555699999999999876554321 112223345678888889 Q ss_pred HHHHHhccCC-C-CcEEEEChHHHHHHhhh-hhhhhcccccccceeeeeeee-ecCeE-EEEe---cc------------ Q lcl|Aclame:pro 172 SVELDEIKAP-E-NRVLFVSPTFYKGIKKF-VIALPQGDTRQQVLGKGVQGE-LDGFV-IVKV---PT------------ 231 (319) Q Consensus 172 ~~~Lde~~VP-~-~R~l~VsP~~~~~L~~~-~~f~~~~~~~~~~~~~g~Vg~-idG~~-I~~v---ps------------ 231 (319) .+.|++.++| + +|.++++|+.+..|..+ .++........+..++|.||+ +.||+ +++. |. 
T Consensus 152 ~~~L~~~~vP~~~~R~~vldp~~~~~l~~~l~~l~~~~~~~~~A~r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~tv~ 231 (430) T protein:vir:10 152 EELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVS 231 (430) T ss_pred HHHHHHhcCCCCCCcEEEeChHHHHHHHhhhccccccccchhHHHhhccccccchhhhhhhhcCCcccccCccCcCceec Confidence 9999999999 3 79999999998777421 222222222233445555554 55552 4321 00 Q ss_pred -------------------------------------------------------------------------------- Q lcl|Aclame:pro 232 -------------------------------------------------------------------------------- 231 (319) Q Consensus 232 -------------------------------------------------------------------------------- 231 (319) T Consensus 232 gA~~~~~~~~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~~v~~~tkq~~~~l~~F~Vt~~~~atsv~I~pa 311 (430) T protein:vir:10 232 GAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPK 311 (430) T ss_pred cccccccccceecccccccccccccceeeeecccceecccEEEecceeeeccccccccCCccEEEEEEecCCceeEEecc Confidence Q ss_pred ccc-----------------------ccceE---------EEEcCCceeeeeeeeee----------------------e Q lcl|Aclame:pro 232 KLL-----------------------QGLQA---------IAVVGEVLASPIQADLA----------------------K 257 (319) Q Consensus 232 ~~~-----------------------~~~n~---------i~~~~~A~~~~~k~~~~----------------------~ 257 (319) -.+ ..+.| ++.|++|+..+..--.+ + T Consensus 312 ii~~~~~~~~~~~~~y~nVsaspa~~aavTvv~~a~~~~Nl~fhr~A~aLa~~pL~~~~~~~~~~~~~~~~~~~~Glsir 391 (430) T protein:vir:10 312 PVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGI 391 (430) T ss_pred ccccccccccccccccceeccccccCceeEEeccCCcccceeEcccceEEEEecccCCCCHHHhhhhheeccccceEEEE Confidence 000 00111 44555555544332100 0 Q ss_pred eecC-CCCCccceeeeeeeeeEEEeccccceEEEEcccc Q lcl|Aclame:pro 258 TNSN-IPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTE 295 (319) Q Consensus 258 ~~~~-~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~ 295 (319) +.+. 
..+.....++--..||++.++|+..+|-..-.++ T Consensus 392 v~~~yd~~~~~~~~r~DvLyG~~~v~Pe~a~v~l~g~~~ 430 (430) T protein:vir:10 392 FATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred EEEecccccCceEEEEeeeccceecCcceEEEEcCCCCC Confidence 0000 0011122344456789999999998775544333 No 68 >protein:vir:9265 Length: 430 # NCBI annotation: 5 # Family: family:all:1412 # MgeID: mge:164 # MgeName: ST64T # Cross-refs: genbank:acc:NP_720329;genbank:gi:24371587;genbank:GeneID:955820 Probab=98.86 E-value=2e-09 Score=68.28 Aligned_cols=268 Identities=10% Similarity=0.013 Sum_probs=140.8 Q ss_pred hhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCccee---eeCCceEEeeeccccccccccCCCCcccCCccccee Q lcl|Aclame:pro 19 FANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAI---FMEGRSFTVMKGDTTELKDYKRNATNEFDHPKIEET 95 (319) Q Consensus 19 ~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~---~~~g~tVkIp~i~~~g~~DY~r~~~~~~~~~t~t~~ 95 (319) -|++....-++ .++....+|+......... ...++|. ...|++|+||.-......|= ..-.-...++.-... T Consensus 1 MAn~l~~~~~i-i~~eal~~l~n~~v~a~~~---~~~r~~d~~~~r~Gdti~~p~~~~~~~~~G-~~~t~~~~~i~e~~v 75 (430) T protein:vir:92 1 MALNEGQIVTL-AVDEIIETISAITPMAQKA---KKYTPPAASMQRSSNTIWMPVEQESPTQEG-WDLTDKATGLLELNV 75 (430) T ss_pred CccchhhHHHH-HHHHHHHHHhhhhhhhhhh---cccCCchhhhhcccceEEeccccccccccC-cccCCCCCccccceE Confidence 34443333222 4444455554443332221 1234442 24689999997754433330 000111234555688 Q ss_pred EEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCc----cccccCCHhHHHHHHHHH Q lcl|Aclame:pro 96 TYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAK----HLTVGTGSDAQYDAVLDV 171 (319) Q Consensus 96 tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~----~~~~~~T~~n~~~~i~~a 171 (319) +++|++-|...|.+.+-+ ... 
.-...+.+ +.+-.+++-+||+.+++.++.-..- .........+.+..+-.+ T Consensus 76 ~~~v~~~k~V~~~~~~ke--l~~-~~~~~~~i-~~Am~~LA~~Vd~dl~~~~~~~~~~v~~~~~~t~~~~~~~~~~~A~a 151 (430) T protein:vir:92 76 AVNMGEPDNDFFQLRADD--LRD-ETAYRHRI-QSAARKLANNVELKVANMAAEMGSLVITSPDAIGTNTADAWNFVADA 151 (430) T ss_pred EEEEeeeccceEEechhH--hcC-hhHHHHHh-HHHHHHHHHHHHHHHHHHhhhcccccccccccCCCcCCcchhhHHHH Confidence 999999999999998533 322 12223333 4555699999999999876554321 112223345678888889 Q ss_pred HHHHHhccCC-C-CcEEEEChHHHHHHhhh-hhhhhcccccccceeeeeeee-ecCeE-EEEe---cc------------ Q lcl|Aclame:pro 172 SVELDEIKAP-E-NRVLFVSPTFYKGIKKF-VIALPQGDTRQQVLGKGVQGE-LDGFV-IVKV---PT------------ 231 (319) Q Consensus 172 ~~~Lde~~VP-~-~R~l~VsP~~~~~L~~~-~~f~~~~~~~~~~~~~g~Vg~-idG~~-I~~v---ps------------ 231 (319) .+.|++.++| + +|.++++|+.+..|..+ .++........+..++|.||+ +.||+ +++. |. T Consensus 152 ~~~L~~~~vP~~~~R~~vldp~~~~~l~~~l~~l~~~~~~~~~A~r~g~i~~~~~Gfd~~~~~~~~~~~t~g~~t~~tv~ 231 (430) T protein:vir:92 152 EELMFSRELNRDMGTSYFFNPQDYKKAGYDLTKRDIFGRIPEEAYRDGTIQRQVAGFDDVLRSPKLPVLTKSTATGITVS 231 (430) T ss_pred HHHHHHhcCCCCCCcEEEeChHHHHHHHhhhccccccccchhHHHhhccccccchhhhhhhhcCCcccccCccCcCceec Confidence 9999999999 3 79999999998777421 222222222233445555554 55552 4321 00 Q ss_pred -------------------------------------------------------------------------------- Q lcl|Aclame:pro 232 -------------------------------------------------------------------------------- 231 (319) Q Consensus 232 -------------------------------------------------------------------------------- 231 (319) T Consensus 232 gA~~~~~~~~~v~~~g~~~~~d~~~~tit~s~tg~l~~GD~ftiaGV~~v~~~tkq~~~~l~~F~Vt~~~~atsv~I~pa 311 (430) T protein:vir:92 232 GAQSFKPVAWQLDNDGNKVNVDNRFATVTLSATTGLKRGDKISFTGVKFLGQMAKNVLAQDATFSVVRVVDGTHVEITPK 311 (430) T ss_pred cccccccccceecccccccccccccceeeeecccceecccEEEecceeeeccccccccCCccEEEEEEecCCceeEEecc Confidence Q ss_pred ccc-----------------------ccceE---------EEEcCCceeeeeeeeee----------------------e Q lcl|Aclame:pro 232 
KLL-----------------------QGLQA---------IAVVGEVLASPIQADLA----------------------K 257 (319) Q Consensus 232 ~~~-----------------------~~~n~---------i~~~~~A~~~~~k~~~~----------------------~ 257 (319) -.+ ..+.| ++.|++|+..+..--.+ + T Consensus 312 ii~~~~~~~~~~~~~y~nVsaspa~~aavTvv~~a~~~~Nl~fhr~A~aLa~~pL~~~~~~~~~~~~~~~~~~~~Glsir 391 (430) T protein:vir:92 312 PVALDDVSLSPEQRAYANVNTSLADAMAVNILNVKDARTNVFWADDAIRIVSQPIPANHELFAGMKTTSFSIPDVGLNGI 391 (430) T ss_pred ccccccccccccccccceeccccccCceeEEeccCCcccceeEcccceEEEEecccCCCCHHHhhhhheeccccceEEEE Confidence 000 00111 44555555544332100 0 Q ss_pred eecC-CCCCccceeeeeeeeeEEEeccccceEEEEcccc Q lcl|Aclame:pro 258 TNSN-IPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTE 295 (319) Q Consensus 258 ~~~~-~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~ 295 (319) +.+. ..+.....++--..||++.++|+..+|-..-.++ T Consensus 392 v~~~yd~~~~~~~~r~DvLyG~~~v~Pe~a~v~l~g~~~ 430 (430) T protein:vir:92 392 FATQGDISTLSGLCRIALWYGVNATRPEAIGVGLPGQTA 430 (430) T ss_pred EEEecccccCceEEEEeeeccceecCcceEEEEcCCCCC Confidence 0000 0011122344456789999999998775544333 No 69 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=98.75 E-value=5.6e-09 Score=65.84 Aligned_cols=293 Identities=9% Similarity=-0.089 Sum_probs=158.2 Q ss_pred CCcccccccceeeehhhhhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeecccc-c-ccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTT-E-LKD 78 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~-g-~~D 78 (319) |-. -.+=+..||-++...++..+...+ ..+++....++.....+..+. .+|++|.||.+... 
| ..+ T Consensus 1 Ma~-------~~T~l~d~i~pevf~~yv~~~~~~----~~~l~qSG~i~~~~~i~~~~~-~~G~~i~~P~~~~l~G~~~~ 68 (330) T protein:vir:10 1 MAN-------ELTKILDTITPQQYNAYMQQYTAA----KSAFVQSGIAVSDERVSKNIT-SGGLLVNMPFWNDLTGDSEV 68 (330) T ss_pred CCC-------CceEeeeeechhHHHHHHHHHhHH----hhhhhhcccccccHHHHHHhh-cCCCEEEecccccCCCcccc Confidence 211 115556777777777765544432 223332222222222222223 38999999999853 4 335 Q ss_pred ccCC-CCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC----- Q lcl|Aclame:pro 79 YKRN-ATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKA----- 152 (319) Q Consensus 79 Y~r~-~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~----- 152 (319) |.-+ +..+++.++...+.-++-+ |+..|.+.|+-..- ...+++...+++.+.......+..+++.+.+--+ T Consensus 69 ~~dg~~~i~~~ki~t~~~~a~i~~-~~k~~~~tD~a~~~--~g~dp~~~i~~q~a~~w~~~~q~~lla~l~gvf~~~~~~ 145 (330) T protein:vir:10 69 LGNGDKALETGKITAGADIACVLY-RGRGWAANELTGVV--AGSDPVRAILNRIGAYWLREDQKALIATLNGIFATGTAG 145 (330) T ss_pred cCCCccccchhhcccceeEEEEEe-ecceeeehhhhhhh--cchhHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhhhhcc Confidence 6433 4577777777666665544 55566676555322 2345566666666666666777777775542110 Q ss_pred -cc---c---cccCCHhH--HHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecC Q lcl|Aclame:pro 153 -KH---L---TVGTGSDA--QYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDG 223 (319) Q Consensus 153 -~~---~---~~~~T~~n--~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG 223 (319) .. . ....+..+ -++.|.+|..+|.+.. ..-..++|.|.+|..|++.. +..... 
....++.|+.+.| T Consensus 146 ~~~~~~~~~~~~~~~~~a~~s~~~l~~A~~~~GD~~-~~~~~ivmhS~v~~~L~~~~-li~~~~---~s~~~~~i~~~~G 220 (330) T protein:vir:10 146 EKGALEETHVSDQSKASTGIDAGMVLDAKQLLGDSA-DQVTAIAMHSAVYTKLQKDN-LIQYIQ---PTTATINIPTYLG 220 (330) T ss_pred cchhhhhhheecccccccccCHHHHHHHHHHhcccc-ccceEEEEcHHHHHHHHHhh-hhhhhc---ccccCcccccccc Confidence 00 0 00111111 2577888988997643 23568999999999998753 221111 1122467899999 Q ss_pred eEEEEeccc--ccccceEEEEcCCceeeeee----eeeeeeecCCCCCccceeeeeeeeeEEEeccccceE-EEEccc-c Q lcl|Aclame:pro 224 FVIVKVPTK--LLQGLQAIAVVGEVLASPIQ----ADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYI-FTIGGT-E 295 (319) Q Consensus 224 ~~I~~vps~--~~~~~n~i~~~~~A~~~~~k----~~~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~I-y~~~~~-~ 295 (319) ++|+...+. ....+..++..++|+..... ...+|.-|. +..-.+.+..|..|.+-+ . |+ |..... . T Consensus 221 ~~VivdD~~p~~~~~yt~yl~~~GAi~~~~~~~~~~v~~EtdRd-~~~g~~~l~~r~~~~~hp---~--G~s~~~~~~~~ 294 (330) T protein:vir:10 221 YRVIIDDGIAPTGDIYTSYLFRTGSIGLNTGNPSGLTTFETSRE-AAKGNDMIYTRRALVMHP---Y--GVKWTGAEVDA 294 (330) T ss_pred eEEEEeCCCCCCCCceeEEEEecCceeeecccCCccccccccCC-ccccceEEEEeeEEEeee---e--eeeeccccccc Confidence 999964221 12345556666888877543 345565553 344346777787766554 2 23 222211 1 Q ss_pred ccCCCCCccccccccccccccccC Q lcl|Aclame:pro 296 VATKRDGVDAHADNVAKPSGSLEM 319 (319) Q Consensus 296 ~a~~~~~~~~~~~~~~~~~~~~~~ 319 (319) ....|+...+....-|.-+.+..- T Consensus 295 ~~~sPt~~~L~~~~NW~~v~~~k~ 318 (330) T protein:vir:10 295 GNITPSNADLAKFKNWKRVYEPKN 318 (330) T ss_pred CcCCcChHHhcCCcCcccccChhh Confidence 123344445455555554443332 No 70 >protein:vir:3969 Length: 287 # NCBI annotation: major capsid protein # Family: family:all:3269 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663677;genbank:gi:21716114;genbank:GeneID:951200 Probab=98.75 E-value=1.9e-09 Score=68.42 Aligned_cols=262 Identities=15% Similarity=0.141 Sum_probs=160.8 Q ss_pred hhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceee-----eCCc--eEEeeeccccccccccCCCCcccCCcc Q lcl|Aclame:pro 19 
FANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIF-----MEGR--SFTVMKGDTTELKDYKRNATNEFDHPK 91 (319) Q Consensus 19 ~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~-----~~g~--tVkIp~i~~~g~~DY~r~~~~~~~~~t 91 (319) .| .=.|+++|.+||..++...+.-...... .++- ++.. +||...+.+ -++.|+.+.-..+|+-+ T Consensus 1 ~a-------vr~y~Kq~~glL~~vf~~qa~F~~~FGg-~lQ~~DGV~~N~taf~vKtsD~pV-Vi~~Y~Td~Nv~FGtGT 71 (287) T protein:vir:39 1 MA-------IKYFTKQYAGMLPDLFAKKSAFLRAFGG-VLQVKDGVTENDTFMELKVSDTDV-VIQAYSTDANVGFGSGT 71 (287) T ss_pred CC-------cccccHHHHHHHHHHHHHHHhhhhhccc-ceeeecCCcccceEEEEEecCcce-EEecccCCCCcccccCC Confidence 11 1137899999999999887765433221 1211 1111 455555543 24577654322233221 Q ss_pred -----------cceeEEEEeecc--cceeecchhhHHHHhhhHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhccCcccc Q lcl|Aclame:pro 92 -----------IEETTYFLDQEK--YWGRFVDALDRKDTEGNIDI--NYVVARQGAEVVAPYLDNLRFATLARNKAKHLT 156 (319) Q Consensus 92 -----------~t~~tltidqdr--~~~F~VD~~D~~et~~~~~~--~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~ 156 (319) .-+....+.++- .++--||.+-+ |.++++ ++- -+.++.+-+..+|.-.-.-|...|....+ T Consensus 72 g~ssRFG~rkEi~y~dt~V~Y~~~~~ihEGiD~~TV---Nnd~~aaVAdR-L~Lqa~A~t~~~n~~~Gk~ls~~A~~t~~ 147 (287) T protein:vir:39 72 GNTSRFGQRKEVKSVNKQVSYDAPLAINEGIDDFTV---NDIKDQVVAER-LALHGVAWAQHVDKLLGKLLSDSASETLT 147 (287) T ss_pred CccccccceeEEEEecccccceeccccccccccccc---cCChhHHHHHH-HHhHHHHHHHHHHHHHHHHHHhhcchhee Confidence 122222233332 33345555543 222221 111 13445556666777544455666666666 Q ss_pred ccCCHhHHHHHHHHHHHHHHhccCC--CCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEEEEeccccc Q lcl|Aclame:pro 157 VGTGSDAQYDAVLDVSVELDEIKAP--ENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLL 234 (319) Q Consensus 157 ~~~T~~n~~~~i~~a~~~Lde~~VP--~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~ 234 (319) ..+|.+++...+-++.++.-.++|. ...+.+|+|++|.+|...+--+.+-.+.-++--+| +-++-||.|-++|.... 
T Consensus 148 ~~~t~d~V~~LF~~a~~~yvNn~v~~~~~~~AyV~aevYnaiiD~~l~TsaK~SsaNiDen~-i~kFkGf~l~e~P~~~~ 226 (287) T protein:vir:39 148 VKLDEDSVTKLFSDAHKKFVNNNVSIAVPWVAYVNADIYDLLIDSKLATTAKNSSANVDEQT-LYKFKGFILSELPDEKF 226 (287) T ss_pred eeecccchHHHHHHHHHHhhccceeeEEEEEEEEChhHHhHHhccccccccccceeeeccCC-cceecceEEEecchHhh Confidence 6789999999999999999887776 46779999999999987765443222221222344 44688999999996555 Q ss_pred ccceE-EEEcCCceeeeeeeeeeeeecCCCCCccceeeeeeeeeEEEeccccceEEEEcccc Q lcl|Aclame:pro 235 QGLQA-IAVVGEVLASPIQADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTE 295 (319) Q Consensus 235 ~~~n~-i~~~~~A~~~~~k~~~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~ 295 (319) ..-.+ +....+-..+..=|...|+.+ +++-.|-+.+|--=||-|+++.-+++|+-.+..+ T Consensus 227 q~g~~a~fs~dnig~af~GI~vaR~i~-sEdF~GvalQgAgK~G~~i~e~Nk~Ai~k~t~~k 287 (287) T protein:vir:39 227 QLNEGAYFAADNVGVAGVGIQVTRAMD-SEDFAGTALQAAAKYGKYLPEKNKKAILKATVTK 287 (287) T ss_pred ccCcEEEEccccceeecccceeEEeee-cccccceeeecccccccccccccceEEEEEecCC Confidence 44443 334444455567778888887 7998999999999999999987777887665433 No 71 >protein:vir:94528 Length: 286 # NCBI annotation: major head protein # Family: family:all:3269 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223889;genbank:gi:62327101;genbank:GeneID:5075544 Probab=98.73 E-value=2e-09 Score=68.25 Aligned_cols=266 Identities=15% Similarity=0.148 Sum_probs=156.7 Q ss_pred eeeehhhhhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccC---cceeeeCCc--eEEeeeccccccccccCCCCc Q lcl|Aclame:pro 11 MLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALIS---NDAIFMEGR--SFTVMKGDTTELKDYKRNATN 85 (319) Q Consensus 11 ~~~~~~~~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n---~~~~~~~g~--tVkIp~i~~~g~~DY~r~~~~ 85 (319) |-+-|++|=+.. |+++|.+||..++...++-...... -|-..++.. .||-..+.+ -++.|+.+.-. 
T Consensus 1 m~t~N~n~avr~--------Y~Kqf~glL~~vf~~qa~F~~~fgglQalDGV~~N~tafsvKt~D~pV-Vig~Y~TdeNv 71 (286) T protein:vir:94 1 MATTNNDLPVRV--------YSKEFLQLLSTVYQAQSVFTPTFGALQALDGVPNNATAFSVKTNDMAV-VVGEYSTDANT 71 (286) T ss_pred CCCCccccceee--------hhHHHHHHHHHHHhhHHHhhhhhcchhhhhCCCccceEEEEeecCcce-EEecccCCCcc Confidence 777777764433 6788888999988877765433222 011112222 455554433 24467654322 Q ss_pred ccCCcc-----------cceeEEEEeecccce--eecchhhHHHHhhhHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 86 EFDHPK-----------IEETTYFLDQEKYWG--RFVDALDRKDTEGNIDI--NYVVARQGAEVVAPYLDNLRFATLARN 150 (319) Q Consensus 86 ~~~~~t-----------~t~~tltidqdr~~~--F~VD~~D~~et~~~~~~--~~~~~~~~~~~vapeiD~~~~s~la~~ 150 (319) .+|+-+ .-++...+.++-.|. --||.+- .|.++.+ ++- -+.++.+-...+|.-.-.-|+.. T Consensus 72 ~FGtgTg~SsRFG~rkEi~y~dtdV~Y~~~~~iHEGiD~~T---VNnd~~aaVAdR-L~lQA~Akt~~~n~~~Gk~ls~~ 147 (286) T protein:vir:94 72 AFGTGTSNSSRFGEMKEVIYADTDVPYTAGWAIHEGLDQMT---VNNDLDAAVADR-LNLQAQAKTRLFNVAMGEALATA 147 (286) T ss_pred ccccCCccccccCceeeEEeecccccccccchhhhcccccc---ccCChhHHHHHH-HHHHHHHHHHHHHHHHHHHHHhh Confidence 233222 122222333333333 3454443 3333322 111 12344444555555332233333 Q ss_pred cCccccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEEEEec Q lcl|Aclame:pro 151 KAKHLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVP 230 (319) Q Consensus 151 a~~~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vp 230 (319) |.. 
++++ +++...+-.+.+..-+.+|-.....+|+|++|.+|...+--+.+-.+.-++--+| +-++-||.|.++| T Consensus 148 A~~--t~~~--D~V~~LF~~as~~yvn~ev~~~~~ayV~~evYnaiiD~~l~TsaK~SsaNiDeng-i~~FkGf~i~e~P 222 (286) T protein:vir:94 148 GTD--LGAV--DDVNALFESAVEKYTDLEVIAPVRAYVTASVYNAIIDLANVTTAKNSAVNIDTNG-MLSFRGIAITKVP 222 (286) T ss_pred hhh--hhhh--hhHHHHHHHHHHHhhhhheeeeeEEEEchhHHHHHhccccccccccceeeeccCC-cceecceEEeecc Confidence 332 2333 6777778888888888888766669999999999987765443222222222344 4468899999999 Q ss_pred ccccccceEEEEcCCceeeeeeeeeeeeecCCCCCccceeeeeeeeeEEEeccccceEEEEcccccc Q lcl|Aclame:pro 231 TKLLQGLQAIAVVGEVLASPIQADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVA 297 (319) Q Consensus 231 s~~~~~~n~i~~~~~A~~~~~k~~~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a 297 (319) ...+.+--.+..+.+-..+..=|...|+.+ .++-.|-+.+|--=||-|+++.-+.+|+..+. ++ T Consensus 223 ~~~~~g~~aifs~dnig~aftGIn~aR~Ie-sEdF~GValQgAGK~G~~I~edNk~Ai~~~~~--k~ 286 (286) T protein:vir:94 223 TQYMGGKAVIFAPDNVARVFTGINIARTIQ-AIDFAGVELQGAGKYGTFILDDNKKAIFTATP--KA 286 (286) T ss_pred hhhccCceEEEccccceeeeccceeeeeee-ccccCceeeeccccccccccccCceeEEEeec--CC Confidence 877775555555556666677788899987 79989999999999999999877778966553 33 No 72 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=98.71 E-value=2.3e-08 Score=62.43 Aligned_cols=288 Identities=8% Similarity=-0.029 Sum_probs=134.1 Q ss_pred CCcccccccceeeehhhhhhhhh-----------cchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEee Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKS-----------VEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVM 69 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~-----------~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp 69 (319) .+...+...|.+.+....+..+. ..+......+-...++.......+.. 
..+++ ..-..++++++| T Consensus 95 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i-~~~~~--~~~~~~~~~~~~ 171 (419) T protein:vir:94 95 REYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLV-ADLLD--QQNADYNVLEYI 171 (419) T ss_pred HHHHHhhhhhhhhHHHHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHhhhhhh-hhcce--eeeccCCceeee Confidence 00111111222222222222211 22222222333333333332222111 11111 123356778887 Q ss_pred eccccccc--cccCC-----CCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 70 KGDTTELK--DYKRN-----ATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNL 142 (319) Q Consensus 70 ~i~~~g~~--DY~r~-----~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~ 142 (319) +.....+. ...-. .+-....-+.+...++++-.+.-.+. . +..+-.+....+...+.+..+.+++-.+|.. T Consensus 172 ~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~-~-is~ell~d~~~l~~~i~~~la~a~~~~~d~a 249 (419) T protein:vir:94 172 RDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWL-P-ITRQAADDNSQLMGYIQGRLTYGLRFLRDRQ 249 (419) T ss_pred eeccccccccccCcccceecCCccccccccceeeEEeeeeeEEEee-h-hhHHHHHhHHHHHHHHHHHHHHHHHHHHHHH Confidence 75432211 11100 11111122233444555544444432 1 2222222223455667778888999999987 Q ss_pred HHHH---------HHhc-----cCccccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccc Q lcl|Aclame:pro 143 RFAT---------LARN-----KAKHLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDT 208 (319) Q Consensus 143 ~~s~---------la~~-----a~~~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~ 208 (319) ++.- +... +........+....|+.|.++...+...+.+.. .++++|..+..|++-.+-....-. 
T Consensus 250 ii~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~t~~~~~~~l~~~~~~~~~~~~~~~-~~v~n~~~~~~l~~~k~~~~~~~~ 328 (419) T protein:vir:94 250 LLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPD-GVVVHPQDWESIELDQAPGSGVFR 328 (419) T ss_pred HHhccCcccccceecccccccccccccccccccchhHHHHHHHHHhhhhccCCCC-EEEEcHHHHHHHHHHhhcCCCcee Confidence 7620 0000 000112233556779999999999988766533 689999999888654321111001 Q ss_pred cccceeeeeeeeecCeEEEEecccccccceEEEEcC-Cceeeee-eeeeeeeecCCCC---CccceeeeeeeeeEEEecc Q lcl|Aclame:pro 209 RQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVG-EVLASPI-QADLAKTNSNIPG---MFGTLAEQLLYTGAFVPEH 283 (319) Q Consensus 209 ~~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~-~A~~~~~-k~~~~~~~~~~~~---~~~~~v~gr~~yg~~V~~~ 283 (319) .+....+|..++|.|++|+.++ .++...+++|.- .+..... +--.++..+...+ .+...++...++|..|.+| T Consensus 329 ~~~~~~~~~~~~l~G~pV~~~~--~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~ 406 (419) T protein:vir:94 329 VIANVQGEATPRIWGLNVVSTV--AIAQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQP 406 (419) T ss_pred ecCCcccCCCccccceeeEEcC--CCCCccEEEeeccceEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecc Confidence 1223446677789999998743 444545555543 3433332 2222222221111 2345788999999999999 Q ss_pred ccceEEEEccccccCC Q lcl|Aclame:pro 284 LQKYIFTIGGTEVATK 299 (319) Q Consensus 284 k~~~Iy~~~~~~~a~~ 299 (319) ++... +... +++. 
T Consensus 407 ~a~~~-~~~~--aa~~ 419 (419) T protein:vir:94 407 KAFVR-VTFA--AATT 419 (419) T ss_pred ccEEE-EEec--cCCC Confidence 87433 2222 1221 No 73 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=98.69 E-value=2.1e-08 Score=62.66 Aligned_cols=291 Identities=10% Similarity=-0.066 Sum_probs=162.4 Q ss_pred CCcccccccceeeehhhhhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcce-eeeCCceEEeeecccc-c-cc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDA-IFMEGRSFTVMKGDTT-E-LK 77 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~-~~~~g~tVkIp~i~~~-g-~~ 77 (319) |-- +-+..||-+..-.++..+-..+-. +++....++.......-+ ...+|++|.+|.+... | -. T Consensus 1 MA~---------T~lsd~i~peVf~~yv~~~~~~~~----~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~ 67 (324) T protein:vir:59 1 MAY---------TKISDVIVPELFNPYVINTTTQLS----AFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQ 67 (324) T ss_pred CCc---------eeeeceechhHHHHHHHhhhHHHH----HHhhcccccccHHHHHHhhccCCCCEEEecccccCCCccc Confidence 321 234567777766666543222211 222212222111111222 3458999999999864 3 55 Q ss_pred cccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC----c Q lcl|Aclame:pro 78 DYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKA----K 153 (319) Q Consensus 78 DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~----~ 153 (319) +|+.+....++.++...+.-++-+ |+..|.+.|.-...+- -+++...+++.+...+...|+.+++.|.+..+ . 
T Consensus 68 ~v~~~~~i~~~~l~t~~~~a~i~~-~~k~~~~tD~a~~~sg--~dp~~~i~~q~a~~~~~~~~~~lia~l~g~~~~~~~~ 144 (324) T protein:vir:59 68 VLNDTDDLVPQKINAGQDKAVLIL-RGNAWSSHDLAATLSG--SDPMQAIGSRVAAYWAREMQKIVFAELAGVFSNDDMK 144 (324) T ss_pred ccCCCcccchhhcccceeeEEEEe-ecCceeehhhhhhhcc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Confidence 788777778888887777776654 6777777766655443 36677777888888888999999887743211 1 Q ss_pred --cccccCCHhH--HHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEEEEe Q lcl|Aclame:pro 154 --HLTVGTGSDA--QYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKV 229 (319) Q Consensus 154 --~~~~~~T~~n--~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~v 229 (319) ......+.+. -++.|.+|..+|.+.. ..-..++|.|.++..|++..- ..... ....++.|+.+.|++|+.. T Consensus 145 ~~~~dvsa~~~~~~s~~~l~~A~~~~GD~~-~~~~~ivmhS~v~~~L~~~~l-i~~~~---~s~~~~~i~~~~G~~Vivd 219 (324) T protein:vir:59 145 DNKLDISGTADGIYSAETFVDASYKLGDHE-SLLTAIGMHSATMASAVKQDL-IEFVK---DSQSGIRFPTYMNKRVIVD 219 (324) T ss_pred cceeeeeccccceecHHHHHHHHHHhCCcc-cCcEEEEEchHHHHHHHHhhh-hhhcc---ccccCceeeeecccEEEEe Confidence 1111112222 2578889999997742 345689999999999987642 21111 1112456899999999963 Q ss_pred ---cccc----cccceEEEEcCCceeeeeeee--eeeeecCCCCCccceeeeeeeeeEEEeccccceE-EEEccccccCC Q lcl|Aclame:pro 230 ---PTKL----LQGLQAIAVVGEVLASPIQAD--LAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYI-FTIGGTEVATK 299 (319) Q Consensus 230 ---ps~~----~~~~n~i~~~~~A~~~~~k~~--~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~I-y~~~~~~~a~~ 299 (319) |... ...+..++..++|+....+-. .+|..|. +..-.+.+..|+.|.+-+. . + |.... ..... 
T Consensus 220 D~~p~~~~~~~~~~y~s~l~~~GAi~~~~~~~~v~vE~dRd-~~~g~~~l~~r~~~~~~p~---G--~s~~~~~-~~~~s 292 (324) T protein:vir:59 220 DSMPVETLEDGTKVFTSYLFGAGALGYAEGQPEVPTETARN-ALGSQDILINRKHFVLHPR---G--VKFTENA-MAGTT 292 (324) T ss_pred CCCCccccCCCCceEEEEEEecCeEEEeecCCCcceecccC-ccccceEEEEeeEEEeEee---e--EEecccc-cCCCC Confidence 3221 123556777788888766433 3455443 3333456666777655443 2 2 11111 01122 Q ss_pred CCCccccccccccccccccC Q lcl|Aclame:pro 300 RDGVDAHADNVAKPSGSLEM 319 (319) Q Consensus 300 ~~~~~~~~~~~~~~~~~~~~ 319 (319) |+-......+-|.-+.+..- T Consensus 293 Pt~~~L~~~~NW~~v~~~k~ 312 (324) T protein:vir:59 293 PTDEELANGANWQRVYDPKK 312 (324) T ss_pred CChhhhcCCcccccccCccc Confidence 33333333444443333322 No 74 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=98.54 E-value=2.6e-07 Score=56.67 Aligned_cols=281 Identities=7% Similarity=-0.089 Sum_probs=129.6 Q ss_pred CCcccccccceeeehhhhhhhhh---------------cchh----hhhhhHhhH-HHHHHHHHhhhhhhhcccCcceee Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKS---------------VEPG----QTLLKNKHV-GILERVTAVNAYSTPALISNDAIF 60 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~---------------~~~n----~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~ 60 (319) .+....+..+ ....++|.... .... ..-++..+. .+++.+.....+ .. 
+++ ..- T Consensus 69 ~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l-~~-~~~--~~~ 142 (385) T protein:vir:18 69 ENPGEKKSFS--ERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTI-RD-LLA--QGR 142 (385) T ss_pred cccchhhhhH--HHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchhhhHHHHHhhhccch-hh-hcc--eec Confidence 0000000000 00001110000 0000 001222222 333333222222 11 122 123 Q ss_pred eCCceEEeeecccc-ccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 61 MEGRSFTVMKGDTT-ELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYL 139 (319) Q Consensus 61 ~~g~tVkIp~i~~~-g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapei 139 (319) .++.++++|..... .-..+... +-.....+.++...+++-.+.-.+..=....-+.. ..+.+.+.+..+.+++-.+ T Consensus 143 ~~~~~~~~~~~~~~~~~a~~v~E-~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~--~~l~~~i~~~la~a~~~~~ 219 (385) T protein:vir:18 143 TSSNALEYVREEVFTNNADVVAE-KALKPESDITFSKQTANVKTIAHWVQASRQVMDDA--PMLQSYINNRLMYGLALKE 219 (385) T ss_pred ccCcceEEEEEecCCcceeeecc-CccccccccceeEEEEeeeeEEEeehhhHHHHhhH--HHHHHHHHHHHHHHHHHHH Confidence 45668999988653 33333222 22222223345555555555444321111111111 2345666777788888888 Q ss_pred HHHHHHHH---------HhccC-ccccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhccccc Q lcl|Aclame:pro 140 DNLRFATL---------ARNKA-KHLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTR 209 (319) Q Consensus 140 D~~~~s~l---------a~~a~-~~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~ 209 (319) |..++.-- ...+. ...+...+.+..++.|.++...|.....+. -.++|+|..+..|.+-.+-.... +. 
T Consensus 220 d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~-~~~~~~~~~~~~l~~lkd~~G~~-l~ 297 (385) T protein:vir:18 220 EGQLLNGDGTGDNLEGLNKVATAYDTSLNATGDTRADIIAHAIYQVTESEFSA-SGIVLNPRDWHNIALLKDNEGRY-IF 297 (385) T ss_pred HHHHHhccCCCCcccccccccccccccccccccchHHHHHHHHHhhccccCCC-CEEEEcHHHHHHHHHhhcCCCce-ec Confidence 87655310 00000 011112234567899999999997765543 36899999999887644311110 01 Q ss_pred ccceeeeeeeeecCeEEEEecccccccceEEEEcCC-ceeeeeee-eeeeeecCCCC---CccceeeeeeeeeEEEeccc Q lcl|Aclame:pro 210 QQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGE-VLASPIQA-DLAKTNSNIPG---MFGTLAEQLLYTGAFVPEHL 284 (319) Q Consensus 210 ~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~-A~~~~~k~-~~~~~~~~~~~---~~~~~v~gr~~yg~~V~~~k 284 (319) .....|..+.+.|.+|+.+ ..++...+++|..+ +.....+. -.+++.....+ .+...++...++|..|.+|+ T Consensus 298 -~~~~~~~~~~l~G~pV~~~--~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~ 374 (385) T protein:vir:18 298 -GGPQAFTSNIMWGLPVVPT--KAQAAGTFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPT 374 (385) T ss_pred -cCcccCCCceecceeeEEc--CcCCCCcEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEeccc Confidence 1234566788999999864 45555567777643 44444332 22333221111 23357788889999999998 Q ss_pred cceEEEEccccccC Q lcl|Aclame:pro 285 QKYIFTIGGTEVAT 298 (319) Q Consensus 285 ~~~Iy~~~~~~~a~ 298 (319) +..+ ..-++++ T Consensus 375 a~~~---~~~~aa~ 385 (385) T protein:vir:18 375 AIIK---GTFSSGS 385 (385) T ss_pred ceEE---EEeccCC Confidence 7433 2222222 No 75 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=98.54 E-value=2.6e-07 Score=56.67 Aligned_cols=281 Identities=7% Similarity=-0.089 Sum_probs=129.6 Q ss_pred CCcccccccceeeehhhhhhhhh---------------cchh----hhhhhHhhH-HHHHHHHHhhhhhhhcccCcceee Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKS---------------VEPG----QTLLKNKHV-GILERVTAVNAYSTPALISNDAIF 60 (319) Q Consensus 1 
~~~~~~~~~~~~~~~~~~~~~~~---------------~~~n----~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~ 60 (319) .+....+..+ ....++|.... .... ..-++..+. .+++.+.....+ .. +++ ..- T Consensus 69 ~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l-~~-~~~--~~~ 142 (385) T protein:vir:19 69 ENPGEKKSFS--ERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTI-RD-LLA--QGR 142 (385) T ss_pred cccchhhhhH--HHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchhhhHHHHHhhhccch-hh-hcc--eec Confidence 0000000000 00001110000 0000 001222222 333333222222 11 122 123 Q ss_pred eCCceEEeeecccc-ccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 61 MEGRSFTVMKGDTT-ELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYL 139 (319) Q Consensus 61 ~~g~tVkIp~i~~~-g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapei 139 (319) .++.++++|..... .-..+... +-.....+.++...+++-.+.-.+..=....-+.. ..+.+.+.+..+.+++-.+ T Consensus 143 ~~~~~~~~~~~~~~~~~a~~v~E-~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~--~~l~~~i~~~la~a~~~~~ 219 (385) T protein:vir:19 143 TSSNALEYVREEVFTNNADVVAE-KALKPESDITFSKQTANVKTIAHWVQASRQVMDDA--PMLQSYINNRLMYGLALKE 219 (385) T ss_pred ccCcceEEEEEecCCcceeeecc-CccccccccceeEEEEeeeeEEEeehhhHHHHhhH--HHHHHHHHHHHHHHHHHHH Confidence 45668999988653 33333222 22222223345555555555444321111111111 2345666777788888888 Q ss_pred HHHHHHHH---------HhccC-ccccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhccccc Q lcl|Aclame:pro 140 DNLRFATL---------ARNKA-KHLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTR 209 (319) Q Consensus 140 D~~~~s~l---------a~~a~-~~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~ 209 (319) |..++.-- ...+. ...+...+.+..++.|.++...|.....+. -.++|+|..+..|.+-.+-.... +. 
T Consensus 220 d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~-~~~~~~~~~~~~l~~lkd~~G~~-l~ 297 (385) T protein:vir:19 220 EGQLLNGDGTGDNLEGLNKVATAYDTSLNATGDTRADIIAHAIYQVTESEFSA-SGIVLNPRDWHNIALLKDNEGRY-IF 297 (385) T ss_pred HHHHHhccCCCCcccccccccccccccccccccchHHHHHHHHHhhccccCCC-CEEEEcHHHHHHHHHhhcCCCce-ec Confidence 87655310 00000 011112234567899999999997765543 36899999999887644311110 01 Q ss_pred ccceeeeeeeeecCeEEEEecccccccceEEEEcCC-ceeeeeee-eeeeeecCCCC---CccceeeeeeeeeEEEeccc Q lcl|Aclame:pro 210 QQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGE-VLASPIQA-DLAKTNSNIPG---MFGTLAEQLLYTGAFVPEHL 284 (319) Q Consensus 210 ~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~-A~~~~~k~-~~~~~~~~~~~---~~~~~v~gr~~yg~~V~~~k 284 (319) .....|..+.+.|.+|+.+ ..++...+++|..+ +.....+. -.+++.....+ .+...++...++|..|.+|+ T Consensus 298 -~~~~~~~~~~l~G~pV~~~--~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~~ 374 (385) T protein:vir:19 298 -GGPQAFTSNIMWGLPVVPT--KAQAAGTFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRPT 374 (385) T ss_pred -cCcccCCCceecceeeEEc--CcCCCCcEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEeccc Confidence 1234566788999999864 45555567777643 44444332 22333221111 23357788889999999998 Q ss_pred cceEEEEccccccC Q lcl|Aclame:pro 285 QKYIFTIGGTEVAT 298 (319) Q Consensus 285 ~~~Iy~~~~~~~a~ 298 (319) +..+ ..-++++ T Consensus 375 a~~~---~~~~aa~ 385 (385) T protein:vir:19 375 AIIK---GTFSSGS 385 (385) T ss_pred ceEE---EEeccCC Confidence 7433 2222222 No 76 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=98.53 E-value=3.4e-07 Score=56.03 Aligned_cols=285 Identities=6% Similarity=-0.111 Sum_probs=130.3 Q ss_pred CCccccccc--ceeeehhhhhhh----hhc--chhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeecc Q lcl|Aclame:pro 1 MNKTIKNAT--GMLKLNLQHFAN----KSV--EPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGD 72 (319) Q Consensus 1 
~~~~~~~~~--~~~~~~~~~~~~----~~~--~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~ 72 (319) +.....+.. ++.......+.. .-. .-....+++.+...+.+.....+..... ++ ..-.++.++.+|... T Consensus 109 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~-~~--~~~~~~~~~~~~~~~ 185 (418) T protein:vir:10 109 MKGMDGSARKSVRVRVDRKSIMNVPATVGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDL-LM--PGQTSSSSIEYTVET 185 (418) T ss_pred HHHHHHHHhhhhhhhhHHHHHHHhhhhccCCCCCCccccchhHHHHHHHHHhhhhhHHhh-cc--eeeccCCceeEEEEe Confidence 000000000 111111111100 000 0001123444433332333322222222 22 233456778888876 Q ss_pred cc-ccccccCCC-CcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh- Q lcl|Aclame:pro 73 TT-ELKDYKRNA-TNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLAR- 149 (319) Q Consensus 73 ~~-g~~DY~r~~-~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~- 149 (319) .. ....+.-.+ .....++ +..+.++.-.+.-.+. .+..+-......+...+.+....+++-.+|..++.---. T Consensus 186 ~~~~~a~~v~E~~~~~~~~~--~f~~v~~~~~k~~~~~--~is~ell~ds~~l~~~i~~~l~~a~~~~~d~a~l~G~g~~ 261 (418) T protein:vir:10 186 GFTNNAAAVAEGAQKPTSDL--KFNLKNQPVRTIAHLF--KASRQILDDAPALQSYIDGRARYGLQLTEEGQILKGDGTG 261 (418) T ss_pred cCCCceeeeccCcccccccc--ceeeEEEeeeeEEEee--hhhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHhccCCCC Confidence 53 333333222 2222333 4445555545543332 122222122224556667778888888888876531000 Q ss_pred --------ccCc-cccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeee Q lcl|Aclame:pro 150 --------NKAK-HLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGE 220 (319) Q Consensus 150 --------~a~~-~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~ 220 (319) .++. ..+.+.+....++.|+++...+...+.+.. .++++|..+..|.+-.+-... -+.. 
...+|..++ T Consensus 262 ~~p~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~-~~v~n~~~~~~L~~lkd~~G~-~i~~-~~~~~~~~~ 338 (418) T protein:vir:10 262 ANILGILPQASAFMPSITLANATPIDKIRLALLQAVLAEFPAT-GIVLNPIDWASIELTKDSQGR-YIVG-NPVNGTTPR 338 (418) T ss_pred ccccccccccccccccccccccccHHHHHHHHHhhccccCCCC-EEEEcHHHHHHHHHhhcCCCc-eecc-ccccCCCce Confidence 0000 111112233457888888888877665544 477899999888653321100 0111 234566788 Q ss_pred ecCeEEEEecccccccceEEEEcCC-ceeeeee-eeeeeeecCCCC---CccceeeeeeeeeEEEeccccceEEEEcccc Q lcl|Aclame:pro 221 LDGFVIVKVPTKLLQGLQAIAVVGE-VLASPIQ-ADLAKTNSNIPG---MFGTLAEQLLYTGAFVPEHLQKYIFTIGGTE 295 (319) Q Consensus 221 idG~~I~~vps~~~~~~n~i~~~~~-A~~~~~k-~~~~~~~~~~~~---~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~ 295 (319) |.|++|+. ++.++...+++|..+ +.....+ =-.+++.+...+ .+...++...++|..+.+|++. +++.. ++ T Consensus 339 l~G~pV~~--~~~~p~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~~~d~~~~~~~a~-~~~~~-~~ 414 (418) T protein:vir:10 339 LWNLPVVE--TQAMTANEFLVGAFSMAAQIFDRMEIEVLLSTENVDDFEKNMVSIRAEERLALAVYRPESF-VTGAL-VE 414 (418) T ss_pred ecceeeEE--cCCCCCCcEEEeeccceEEEEEecceEEEEecccchhhhcCceEEEEEEeeccEEecccce-EEEEe-cc Confidence 99999986 445666667777654 3433332 112333221111 2335778888999999999873 22222 22 Q ss_pred ccCC Q lcl|Aclame:pro 296 VATK 299 (319) Q Consensus 296 ~a~~ 299 (319) ++.+ T Consensus 415 ~~~g 418 (418) T protein:vir:10 415 QAGG 418 (418) T ss_pred CCCC Confidence 3322 No 77 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=98.53 E-value=1.8e-07 Score=57.52 Aligned_cols=269 Identities=11% Similarity=0.005 Sum_probs=137.9 Q ss_pred ehhhhh-hhhhc--chhhhhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccccCCCCcccCC Q lcl|Aclame:pro 14 LNLQHF-ANKSV--EPGQTLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRNATNEFDH 89 (319) Q Consensus 14 ~~~~~~-~~~~~--~~n~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~r~~~~~~~~ 89 (319) 
|++|-| +.|.. ......+++.+. .+++.+...+.+.. . +.. +.-.++..+.+|......-..+...+ -.... T Consensus 1 m~~~~~~~~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~-~-~~~-~~~~~~~~~~~~~~~~~~~a~~v~Eg-~~~~~ 76 (297) T protein:vir:95 1 MTVQTFNPENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQ-L-GQY-QEMEGEQEKTVYVQTDGISAYWVNET-EKIKT 76 (297) T ss_pred CCccccccccccccCCCcceechhHHHHHHHHHHhhchhhh-h-cce-eecCCCccEEEEEEcCCceeEEeecC-ccccc Confidence 777733 23332 112223566664 34444433333222 2 222 22223445677766554333333222 22222 Q ss_pred cccceeEEEEeecccceeecchhhHHHHh-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccc-------cccCCH Q lcl|Aclame:pro 90 PKIEETTYFLDQEKYWGRFVDALDRKDTE-GNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHL-------TVGTGS 161 (319) Q Consensus 90 ~t~t~~tltidqdr~~~F~VD~~D~~et~-~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~-------~~~~T~ 161 (319) .+.++...++.-.|.-.+. . +..+-.+ ....+...+.++.+.+++-.+|.-.+.---+..+... ...... T Consensus 77 ~~~~f~~v~l~~~k~~~~~-~-is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~gi~~~~~~~~~~~~~ 154 (297) T protein:vir:95 77 DKPEVVPVTLKAHKLGIIL-V-TSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFANSVAKAAKDANKVIGG 154 (297) T ss_pred cccceeEEEEeeEEEEEee-h-hhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccccccccccccceeccc Confidence 2344445555545443332 2 2211111 1234566777888888888888876621000000000 000112 Q ss_pred hHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEEEEecccccccceEEE Q lcl|Aclame:pro 162 DAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIA 241 (319) Q Consensus 162 ~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~ 241 (319) ..-|+.|+++..+|..++.+.. 
.++++|..+..|++-.+ ..| ..+.++..+.+.|.+|+..++.......+++ T Consensus 155 ~~t~~~i~~~~~~l~~~~~~~~-~~v~~~~~~~~L~~l~d-----~~G-~~i~~~~~~~l~G~Pv~~~~~~~~~~~~~~~ 227 (297) T protein:vir:95 155 PINYDNILKLQDALYDADVEPN-AFVSKIQNRSALREARD-----GNK-VSIYDKAANTIDGITTVDLKSARFEKGDLLA 227 (297) T ss_pred ccCHHHHHHHHHHhhhccCCcC-EEEEcHHHHHHHHHhhc-----cCC-ceeecCCCCcccceeeEeecCCCCCCceEEE Confidence 2347888889999988776643 47889999998875332 112 3345666678999999877776666666777 Q ss_pred EcCCceeeeeee-eeeeeecCC---------------CCCccceeeeeeeeeEEEeccccceEEEEcccccc Q lcl|Aclame:pro 242 VVGEVLASPIQA-DLAKTNSNI---------------PGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVA 297 (319) Q Consensus 242 ~~~~A~~~~~k~-~~~~~~~~~---------------~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a 297 (319) +..+.......- -.+++++.. -..+.-.+|...++|..|.+|++... -..++|. T Consensus 228 gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~--l~~at~~ 297 (297) T protein:vir:95 228 GDFDNLIYGVPYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAK--LTPAERV 297 (297) T ss_pred EecccEEEEEecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeecccceEE--EeecCCC Confidence 766544332221 123333211 01123577888999999999987433 2222233 No 78 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=98.45 E-value=4.1e-08 Score=61.09 Aligned_cols=274 Identities=12% Similarity=0.043 Sum_probs=150.2 Q ss_pred eeeehhhhhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccc-cccccCCCCcccCC Q lcl|Aclame:pro 11 MLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTE-LKDYKRNATNEFDH 89 (319) Q Consensus 11 ~~~~~~~~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g-~~DY~r~~~~~~~~ 89 (319) |-.=||-+-++ -+.|-+|.+..+|..-+++++.... +.|-..-..|+++++|++.-.+ -+|+-....+.... 
T Consensus 1 mAe~nlt~~~d-L~~~~sidfv~~f~~~i~~L~~~Lg------i~r~~p~a~G~tIt~pK~~~tgda~dVaEGe~Iplsk 73 (295) T protein:vir:99 1 MAEKNLNTMAD-LGDIKSIDFVNKFSKNINDLLKLLG------VTRRETLTNDLKIQTYKWEVTLDQTDPGEGETIPLSK 73 (295) T ss_pred CCCcccccHhh-ccCceeehhhHHhhhhHHHHHHHhc------cccccccccCCeEEeeeeeeecccccccCCcccchhh Confidence 43334443332 2367788899999766666644222 2344455569999999998665 44665555566677 Q ss_pred cccce-eEEEEeecccceeecchhhHH-HHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccCCHhHHHHH Q lcl|Aclame:pro 90 PKIEE-TTYFLDQEKYWGRFVDALDRK-DTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTGSDAQYDA 167 (319) Q Consensus 90 ~t~t~-~tltidqdr~~~F~VD~~D~~-et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~~~~T~~n~~~~ 167 (319) ++.+. .+.++.-.|++. .+ .|++ |..+...+.....++...+++..+|...|+.+.....+. .+.+-...|+. T Consensus 74 vt~~~~~t~t~kikK~rK-~t--TdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat~t~--tg~~lq~a~a~ 148 (295) T protein:vir:99 74 VTRTKDKDYTVKWFKKRR-AT--TAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKPTKV--KGVGLQKALSA 148 (295) T ss_pred heeeeeeeeEEEeeeecc-cc--cHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCceee--ehhhHHHHHHH Confidence 76542 334444444444 23 4666 445667778888999999999999999999774322221 11112335666 Q ss_pred HHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhc--ccccccceeeeeeeeecCeE-EEEecccc--------ccc Q lcl|Aclame:pro 168 VLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQ--GDTRQQVLGKGVQGELDGFV-IVKVPTKL--------LQG 236 (319) Q Consensus 168 i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~--~~~~~~~~~~g~Vg~idG~~-I~~vps~~--------~~~ 236 (319) +.++...+.|. ...+.++||+|.=...|+++-....+ ...|-+.+.| +.|++ |+.++.-. 
..+ T Consensus 149 ~~~al~~f~Ee-~~~~~V~FVnP~D~a~yl~~A~~~~~~a~~fG~~~L~n-----fLG~q~II~S~kv~~G~~~aT~~~N 222 (295) T protein:vir:99 149 SWAKLATFNEF-EGSPLVSFVSPLDVANYLGDTKVGADASNVFGMTLLKN-----FLGMQNVIVMPSVPEGKIYSTAVEN 222 (295) T ss_pred hhhhhhhcccc-cCCceEEEEehHHHHHHHhccccccchhhhhhhhhhhh-----hhccceEEEcccCCCceEEEeeccc Confidence 66666666552 33468999999888777776654432 2344444443 88986 76543211 112 Q ss_pred ceEEEEcCC----ceeee---eeeeeeeeecCCCCCccceeeeeeeeeEEEeccccceEEEEccccccCCCCCc Q lcl|Aclame:pro 237 LQAIAVVGE----VLASP---IQADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDGV 303 (319) Q Consensus 237 ~n~i~~~~~----A~~~~---~k~~~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~~ 303 (319) +++-=..++ +-.|. .+.--+-+.. ++...---++-+..-|...+---.+||....-..++...-|+ T Consensus 223 i~~ay~~~~~g~l~~~f~~~~D~tglIg~~h-~~~~~~~t~et~~~~~~~lfpE~~dgiv~~tI~~~~~~~~~~ 295 (295) T protein:vir:99 223 LVFASLNVKGGDLGGLFADFTDETGLIAAAR-NRQLSNLTYESVFFGANVLFAEIPEGVVEATIEAAAVPGIGG 295 (295) T ss_pred eEEEEecCCchhhhhhhhhccCcccceEEEe-ccccceeeehhhhHhHHHhcccccceEEEEEEecCcCCCCCC Confidence 333222222 11111 1111111111 111111122233333444555566788777655555555455 No 79 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=98.43 E-value=6.8e-07 Score=54.40 Aligned_cols=262 Identities=8% Similarity=-0.055 Sum_probs=132.4 Q ss_pred hhhhhhcchh-------hhhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccccC-CCCcccC Q lcl|Aclame:pro 18 HFANKSVEPG-------QTLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKR-NATNEFD 88 (319) Q Consensus 18 ~~~~~~~~~n-------~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~r-~~~~~~~ 88 (319) |...-+...+ ...+++.++ .+++.+.....+ ... + .....++++++||......-..+.. ++..... 
T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l-~~~-~--~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~ 76 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAI-MKL-A--KNEPMTAQKKKFTYLAKGVGAYWVSETERIQTS 76 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccch-hhh-c--ceeeccCCceEEEEEeCCcceEEeecCcccccc Confidence 2222221111 122445553 333333332222 111 2 1234567789999987544443332 2223333 Q ss_pred CcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-------------ccCccc Q lcl|Aclame:pro 89 HPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLAR-------------NKAKHL 155 (319) Q Consensus 89 ~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~-------------~a~~~~ 155 (319) +++.+..+ +...|.-.+. .--+.........+.+.+.+....+++-.+|...+.---. .+.... T Consensus 77 ~~~~~~i~--~~~~k~~~~~-~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~ 153 (304) T protein:vir:94 77 KPEYAQAE--MEAKKIGVII-PLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKG 153 (304) T ss_pred cceeeEEE--EEEEEEEEee-hhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccccccc Confidence 44444444 4444433332 2112111112245667777888888888888876531000 001111 Q ss_pred cccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEEEEecccc-- Q lcl|Aclame:pro 156 TVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKL-- 233 (319) Q Consensus 156 ~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~-- 233 (319) ....+.+..|+.|.++..++..++.... .++++|..+..|++-.+ ..| ..+.....+++.|.+|+.+++-. 
T Consensus 154 ~~~~~~~~~~~~i~~~~~~l~~~~~~~~-~~v~~~~~~~~L~~lkd-----~~G-~~l~~~~~~~l~G~PV~~~~~~~~~ 226 (304) T protein:vir:94 154 NVVTDTNNLYVDLSALMATIEDEELDPN-GVLTTRSFRSKMRNALD-----AND-RPLFDANGNEIMGLPLSYTGADVYD 226 (304) T ss_pred cccccccchHHHHHHHHHHhhhccCCcC-EEEEcHHHHHHHHHhhc-----cCC-cEeecCCCccccceeeEEecccccC Confidence 1122345579999999999988766643 57899999998875432 112 23445556889999998654321 Q ss_pred cccceEEEEcCCceeeee-eeeeeeeecC-------CCC----------CccceeeeeeeeeEEEeccccceEEEEcc Q lcl|Aclame:pro 234 LQGLQAIAVVGEVLASPI-QADLAKTNSN-------IPG----------MFGTLAEQLLYTGAFVPEHLQKYIFTIGG 293 (319) Q Consensus 234 ~~~~n~i~~~~~A~~~~~-k~~~~~~~~~-------~~~----------~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~ 293 (319) ..+..++++..+-..... +=-.+++.+. .++ .+...++...++|..|++|++..+....+ T Consensus 227 ~~~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 227 KKKSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred CCCcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 122335555544332221 1112222211 011 12357788899999999999854433333 No 80 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=98.43 E-value=6.8e-07 Score=54.40 Aligned_cols=262 Identities=8% Similarity=-0.055 Sum_probs=132.4 Q ss_pred hhhhhhcchh-------hhhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccccC-CCCcccC Q lcl|Aclame:pro 18 HFANKSVEPG-------QTLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKR-NATNEFD 88 (319) Q Consensus 18 ~~~~~~~~~n-------~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~r-~~~~~~~ 88 (319) |...-+...+ ...+++.++ .+++.+.....+ ... + .....++++++||......-..+.. ++..... 
T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l-~~~-~--~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~ 76 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAI-MKL-A--KNEPMTAQKKKFTYLAKGVGAYWVSETERIQTS 76 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccch-hhh-c--ceeeccCCceEEEEEeCCcceEEeecCcccccc Confidence 2222221111 122445553 333333332222 111 2 1234567789999987544443332 2223333 Q ss_pred CcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-------------ccCccc Q lcl|Aclame:pro 89 HPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLAR-------------NKAKHL 155 (319) Q Consensus 89 ~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~-------------~a~~~~ 155 (319) +++.+..+ +...|.-.+. .--+.........+.+.+.+....+++-.+|...+.---. .+.... T Consensus 77 ~~~~~~i~--~~~~k~~~~~-~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~ 153 (304) T protein:vir:10 77 KPEYAQAE--MEAKKIGVII-PLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKG 153 (304) T ss_pred cceeeEEE--EEEEEEEEee-hhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccccccc Confidence 44444444 4444433332 2112111112245667777888888888888876531000 001111 Q ss_pred cccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEEEEecccc-- Q lcl|Aclame:pro 156 TVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKL-- 233 (319) Q Consensus 156 ~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~-- 233 (319) ....+.+..|+.|.++..++..++.... .++++|..+..|++-.+ ..| ..+.....+++.|.+|+.+++-. 
T Consensus 154 ~~~~~~~~~~~~i~~~~~~l~~~~~~~~-~~v~~~~~~~~L~~lkd-----~~G-~~l~~~~~~~l~G~PV~~~~~~~~~ 226 (304) T protein:vir:10 154 NVVTDTNNLYVDLSALMATIEDEELDPN-GVLTTRSFRSKMRNALD-----AND-RPLFDANGNEIMGLPLSYTGADVYD 226 (304) T ss_pred cccccccchHHHHHHHHHHhhhccCCcC-EEEEcHHHHHHHHHhhc-----cCC-cEeecCCCccccceeeEEecccccC Confidence 1122345579999999999988766643 57899999998875432 112 23445556889999998654321 Q ss_pred cccceEEEEcCCceeeee-eeeeeeeecC-------CCC----------CccceeeeeeeeeEEEeccccceEEEEcc Q lcl|Aclame:pro 234 LQGLQAIAVVGEVLASPI-QADLAKTNSN-------IPG----------MFGTLAEQLLYTGAFVPEHLQKYIFTIGG 293 (319) Q Consensus 234 ~~~~n~i~~~~~A~~~~~-k~~~~~~~~~-------~~~----------~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~ 293 (319) ..+..++++..+-..... +=-.+++.+. .++ .+...++...++|..|++|++..+....+ T Consensus 227 ~~~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 227 KKKSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred CCCcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 122335555544332221 1112222211 011 12357788899999999999854433333 No 81 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=98.38 E-value=8.7e-07 Score=53.81 Aligned_cols=283 Identities=11% Similarity=-0.027 Sum_probs=139.7 Q ss_pred CCcccccccceeeehhhhhhhhhcchhhhhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDY 79 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY 79 (319) ||....+.+|--.= ..++.....-+++.+. .+++.+...+.+.. ++. 
.+-.++.+++||......-..+ T Consensus 4 l~el~~~~~~~~~~------g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~--~~~--~~~~~~~~~~~p~~~~~~~a~~ 73 (333) T protein:vir:78 4 LNELLPNSAGSNHQ------GRLAHVPSDLLPKEIVGPIFDKAQESSLVLR--MGE--QIPISYGETIIPTTVKRPEVGQ 73 (333) T ss_pred hHHhhhhccccccc------CceecCCccccchhHHHHHHHHHHhhchhhh--hcc--eeeccCCceEEEEEeCCceeEe Confidence 44444443331100 0111111113455553 44444444333322 122 2445678899999977655444 Q ss_pred cCCCC-------cccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc- Q lcl|Aclame:pro 80 KRNAT-------NEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNK- 151 (319) Q Consensus 80 ~r~~~-------~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a- 151 (319) ...+. -....-+.++...++...|.-.+.. --+.---.....+...+.+..++.++-.+|.-.+.---... T Consensus 74 v~eg~~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~-is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~ 152 (333) T protein:vir:78 74 VGVGTSNEQREGGLKPLSGTAWDTRSVSPIKLATIVT-VSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTG 152 (333) T ss_pred ecCcccccccccccccccccceeEEEEeeEEEEEeeh-hhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCC Confidence 32211 1111223455555666555444332 22211111123456677788888899889887662100000 Q ss_pred ----------Cc-----cccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhh-ccc-cccccee Q lcl|Aclame:pro 152 ----------AK-----HLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALP-QGD-TRQQVLG 214 (319) Q Consensus 152 ----------~~-----~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~-~~~-~~~~~~~ 214 (319) +. ..+...+.+..++.|+++...+.....-....++++|..+..|++...... ++. ....... 
T Consensus 153 ~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~~~~~~ 232 (333) T protein:vir:78 153 SALQGIDTDNVIANTTNVDYLQETGDPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRDANGNVDPSRINL 232 (333) T ss_pred cccccccccccccccccccccccccchhHHHHHHHHHhhccccccCceEEEEcchHHHHHHHHhhhcCCCCceeecCccc Confidence 00 011112334568889998888766433345568889999888865432221 111 1123345 Q ss_pred eeeeeeecCeEEEEecc---cc----cccceEEEEcCCceeeeeeeeeeeeecCCC----C-------Cc---cceeeee Q lcl|Aclame:pro 215 KGVQGELDGFVIVKVPT---KL----LQGLQAIAVVGEVLASPIQADLAKTNSNIP----G-------MF---GTLAEQL 273 (319) Q Consensus 215 ~g~Vg~idG~~I~~vps---~~----~~~~n~i~~~~~A~~~~~k~~~~~~~~~~~----~-------~~---~~~v~gr 273 (319) .|..++|.|++|+.+++ +. .....+++|...-...... ..+++...++ + .| --.++.. T Consensus 233 ~~~~~~l~G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~~~~g~~-~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~r~~ 311 (333) T protein:vir:78 233 AAQTGDVLGLPAQFGRAVGGDLGAAVDSKTRIIGGDFSQLKFGFA-DEIRIKMSDTATLTDSGSATVSMWQTNQIAILIE 311 (333) T ss_pred cCCCceeeceeeEEccccCCCccccCCCccEEEEEecccEEEEEe-eccEEEEeccccccccccceeehhhcCcEEEEEE Confidence 56678999999986432 21 1234466666654333222 2222221111 1 11 2357888 Q ss_pred eeeeEEEeccccceEEEEccccc Q lcl|Aclame:pro 274 LYTGAFVPEHLQKYIFTIGGTEV 296 (319) Q Consensus 274 ~~yg~~V~~~k~~~Iy~~~~~~~ 296 (319) .++|..|++|++....... 
++| T Consensus 312 ~r~d~~v~~~~a~~~l~~~-~a~ 333 (333) T protein:vir:78 312 VTFGWLLGDKQAFVKFVDD-EQP 333 (333) T ss_pred EEEccEEecccceEEEecc-CCC Confidence 9999999999885443222 233 No 82 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=98.37 E-value=6.1e-07 Score=54.64 Aligned_cols=278 Identities=8% Similarity=-0.057 Sum_probs=132.1 Q ss_pred CCcccccccceeeehhhhhhhhhcchhhhhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDY 79 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY 79 (319) |. ....+.....+++.+. .+++.+...+.+.. ++ ..+-.+++.++||++...+-..+ T Consensus 1 Ma------------------~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~--l~--~~i~~~~~~~~ip~~~~~~~a~w 58 (315) T protein:vir:80 1 MA------------------DDFLSAGKLELPGSMIGAVRDRAIDSGVLAK--LS--PEQPTIFGPVKGAVFSGVPRAKI 58 (315) T ss_pred CC------------------CCcCCcCceEcchHHHHHHHHHHHhhchhhh--hc--ceeecCCCceEEEEEeCCcceEE Confidence 33 2233333334455553 34444433333221 22 23455677899999876544433 Q ss_pred cCCC-CcccCCcccceeEEEEeecccceeecchhhHH--HHhh--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc Q lcl|Aclame:pro 80 KRNA-TNEFDHPKIEETTYFLDQEKYWGRFVDALDRK--DTEG--NIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH 154 (319) Q Consensus 80 ~r~~-~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~--et~~--~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~ 154 (319) ...+ .....++ ++..+++.-.|.-. .+.--+.- ++.. .-.+...+.+..+.+++-.+|.-.+.---...+.. 
T Consensus 59 v~Eg~~~~~s~~--~f~~v~l~~~kl~~-~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~ 135 (315) T protein:vir:80 59 VGEGEVKPSASV--DVSAFTAQPIKVVT-QQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKA 135 (315) T ss_pred eeCCcccccccc--ceeeeEeeeeeEEe-eehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCcc Confidence 3222 2222333 44444444333222 22211110 0110 01133455666777777777765442100000000 Q ss_pred ----------ccccC-CHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccc---cceeeeeeee Q lcl|Aclame:pro 155 ----------LTVGT-GSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQ---QVLGKGVQGE 220 (319) Q Consensus 155 ----------~~~~~-T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~---~~~~~g~Vg~ 220 (319) .+... .....|+-|+++...+..++......++++|..+..|++-.........++ .....|..++ T Consensus 136 ~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~~~~g~~~t 215 (315) T protein:vir:80 136 ASAVHTSLNKTKNIVDATDSATADLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDN 215 (315) T ss_pred ccccccccccccceeeccccchHHHHHHHHHHhhccCccceEEEEcHHHHHHHHHHhhccCCcccccccccccccCCCce Confidence 00011 123457788888888776655544458899999999976654332222222 1234555678 Q ss_pred ecCeEEEEeccc---cc----ccceEEEEcCC-ceeeeeeeeeeeeecCC-CC--------CccceeeeeeeeeEEEecc Q lcl|Aclame:pro 221 LDGFVIVKVPTK---LL----QGLQAIAVVGE-VLASPIQADLAKTNSNI-PG--------MFGTLAEQLLYTGAFVPEH 283 (319) Q Consensus 221 idG~~I~~vps~---~~----~~~n~i~~~~~-A~~~~~k~~~~~~~~~~-~~--------~~~~~v~gr~~yg~~V~~~ 283 (319) |.|.+|+.+++- .. ...-+++|.-+ ...-..+--.+++.+.. 
++ ++.-.++...++|..|.+| T Consensus 216 l~G~PV~~~~~~~~~~~~~~~~~~~~~~GDfs~~~~g~~~~~~i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~ 295 (315) T protein:vir:80 216 WRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESL 295 (315) T ss_pred ecceeeEecCcCCcccccccccccEEEEeecccEEEEEecCeeEEEeccccccCcccchhhcCcEEEEEEEEecceeecc Confidence 999999964321 10 11234444322 22222222234433321 11 1224778889999999999 Q ss_pred ccceEEEEccccccCCCCCcccc Q lcl|Aclame:pro 284 LQKYIFTIGGTEVATKRDGVDAH 306 (319) Q Consensus 284 k~~~Iy~~~~~~~a~~~~~~~~~ 306 (319) ++..+.....+++++ +..+| T Consensus 296 ~a~~~l~~~~a~~~~---~~~~~ 315 (315) T protein:vir:80 296 DSFAVVKEKAAPKPN---PPAEN 315 (315) T ss_pred cceEEEeeccCCCCC---CCCCC Confidence 986554444333333 22233 No 83 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=98.32 E-value=1.3e-06 Score=52.78 Aligned_cols=293 Identities=9% Similarity=-0.014 Sum_probs=133.2 Q ss_pred CCc--------ccccccceeeehhh-hhhhh---hcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEe Q lcl|Aclame:pro 1 MNK--------TIKNATGMLKLNLQ-HFANK---SVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTV 68 (319) Q Consensus 1 ~~~--------~~~~~~~~~~~~~~-~~~~~---~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkI 68 (319) ..+ ..+....+|.-..+ .+.-. -........++.+...+.+.....+..... 
++......+..++.+ T Consensus 79 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~~~ 157 (397) T protein:vir:49 79 LTKNEEEVKANFVKDFKNLVRGRYQNLLDSKTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEY-VNVENVTTLTGSRVY 157 (397) T ss_pred ccchhhHHHHHHHHHHHHHhhcchhhHHHhhhccCCccCcceecHHHHHHHHHHHHhhhhHhhh-cceeeccCCcceEEE Confidence 000 00000011110000 00000 001111234555543333333323222211 222112223345667 Q ss_pred eeccc-cccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHh-hhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 69 MKGDT-TELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTE-GNIDINYVVARQGAEVVAPYLDNLRFAT 146 (319) Q Consensus 69 p~i~~-~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~-~~~~~~~~~~~~~~~~vapeiD~~~~s~ 146 (319) |+... .+...+...++-.++.-..++..++++-.|...+ +. +..+-.+ ....+.+.+.+.....++-.+|..++.- T Consensus 158 ~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~-~~-iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ail~G 235 (397) T protein:vir:49 158 EKWADITGLAKLDDEGGQIGQNDDPKLSLIRYAIKRYAGI-ST-VTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEA 235 (397) T ss_pred EeeccCCcceeeeccccccccccccceeeeEeeeeeeEee-hh-hHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 76654 3455555433322222222344555555554443 22 2222111 1234566677778888888888765431 Q ss_pred HHhccCccccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEE Q lcl|Aclame:pro 147 LARNKAKHLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVI 226 (319) Q Consensus 147 la~~a~~~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I 226 (319) .+.+......++ |+.|.++...|+.+..+ +-.++++|..+..|.+-.+-.... 
.......+|.-++|.|++| T Consensus 236 --~g~~~~~~~~~~----~d~i~~~~~~l~~~~~~-~a~~v~n~~~~~~l~~lkd~~g~~-l~~~~~~~g~~~~l~G~pV 307 (397) T protein:vir:49 236 --IGTLPNKPTLAK----WDDIIDLQAKVDPAIKQ-TSLFLTNTSGFTALKKVKNAMGDY-LMERDVKSPTGYSIDGFVV 307 (397) T ss_pred --cccccccccccC----HHHHHHHHHhhhhhhcC-CCEEEEcHHHHHHHHHhhccCCce-eecccccCCCCceecceee Confidence 122222222233 56677788888876655 346889999999887643211110 0111234566678999999 Q ss_pred EEecccc-----cccceEEEEcC-Cceeeee-eeeeeeeecCCC---CCccceeeeeeeeeEEEeccccceEEEE--ccc Q lcl|Aclame:pro 227 VKVPTKL-----LQGLQAIAVVG-EVLASPI-QADLAKTNSNIP---GMFGTLAEQLLYTGAFVPEHLQKYIFTI--GGT 294 (319) Q Consensus 227 ~~vps~~-----~~~~n~i~~~~-~A~~~~~-k~~~~~~~~~~~---~~~~~~v~gr~~yg~~V~~~k~~~Iy~~--~~~ 294 (319) +.+.+.. .....+++|.- .+..... +=-.++..+... ..+...++...++|..|.+|++..+..- ..+ T Consensus 308 ~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~~~ 387 (397) T protein:vir:49 308 KEISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQHLSLLSTNIGGGAFETDTTKVRVIDRFDVVSTDTEAFVPASFKAIAD 387 (397) T ss_pred EEecccccccccCCceeEEEeeccceEEEEeecccEEEEeccccchhhcCeeeEEEEEeeccEEecccceEEEEeccccc Confidence 8765433 33455677753 3444433 322333322111 1234578889999999999987544321 222 Q ss_pred cccCCCCCcc Q lcl|Aclame:pro 295 EVATKRDGVD 304 (319) Q Consensus 295 ~~a~~~~~~~ 304 (319) .+++..+++. 
T Consensus 388 ~~~~~~~~~~ 397 (397) T protein:vir:49 388 QKAKLSTAGA 397 (397) T ss_pred ccCcccccCC Confidence 2222222222 No 84 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=98.30 E-value=1.5e-06 Score=52.51 Aligned_cols=279 Identities=8% Similarity=-0.104 Sum_probs=130.8 Q ss_pred CCcccccccceee--ehhhhhhhhhc----------------------chhhhhhhHhhH-HHHHHHHHhhhhhhhcccC Q lcl|Aclame:pro 1 MNKTIKNATGMLK--LNLQHFANKSV----------------------EPGQTLLKNKHV-GILERVTAVNAYSTPALIS 55 (319) Q Consensus 1 ~~~~~~~~~~~~~--~~~~~~~~~~~----------------------~~n~~~l~~ky~-~lld~~~~~~sl~~~~~~n 55 (319) .+...+....+.. .....|..+.. ...-.-+.+.+. .+++.+..... .... ++ T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lip~~~~~~ii~~~~~~~~-i~~~-~~ 148 (390) T protein:vir:97 71 GDVQHVSVGDMFVASEQFQASTGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLT-VRDL-IG 148 (390) T ss_pred cccccccchhhhhhhHHHHHHHHHhhhhhhhhhhHHHHHHHhhhcccccccccccchhhhHHHHHHHhhhhh-hHhh-cc Confidence 1111111111100 00011111100 000001222222 23333322222 2221 22 Q ss_pred cceeeeCCceEEeeecccc-ccccccC-CCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHH Q lcl|Aclame:pro 56 NDAIFMEGRSFTVMKGDTT-ELKDYKR-NATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAE 133 (319) Q Consensus 56 ~~~~~~~g~tVkIp~i~~~-g~~DY~r-~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~ 133 (319) ....++.++++|..... +-..+.. ++.....+++ +...+++-.+.-.+ +. +..+-.+....+...+.+..+. 
T Consensus 149 --~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~--~~~i~~~~~k~~~~-~~-is~ell~ds~~l~~~i~~~la~ 222 (390) T protein:vir:97 149 --SGRTDSALIEYVQETGFVNNAAIVAEGALKPESSLK--FAKKTDTTHVIAHT-MK-ATRQILSDAPQLASYMNNRLIR 222 (390) T ss_pred --eeeccCCceEEEEEecCCcceeeecCCccccccccc--eeEEEEeeeeEEEe-eh-hhHHHHHhHHHHHHHHHHHHHH Confidence 23335678899988653 2233322 2223333344 44445544443332 21 2211111122345666777888 Q ss_pred HHHHHHHHHHHHHH-Hh--------ccCc-cccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhh Q lcl|Aclame:pro 134 VVAPYLDNLRFATL-AR--------NKAK-HLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIAL 203 (319) Q Consensus 134 ~vapeiD~~~~s~l-a~--------~a~~-~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~ 203 (319) .++-.+|..++.-- .. .++. ......+.+..++.+.++...+.....+.. .++|+|..+..|.+-.+-. T Consensus 223 a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~-~~v~n~~~~~~L~~lkd~~ 301 (390) T protein:vir:97 223 GLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYPAS-GIVINPIDWAAIELAKDAN 301 (390) T ss_pred HHHHHHHHHHhhcCCCCccccceeeccccccccccccccchHHHHHHHHHhhccccCCCC-EEEEcHHHHHHHHHhhcCC Confidence 88888888765310 00 0000 011223445678889999999988887755 4678999998887543211 Q ss_pred hcccccccceeeeeeeeecCeEEEEecccccccceEEEEcCC-ceeeee-eeeeeeeecCCCC--CccceeeeeeeeeEE Q lcl|Aclame:pro 204 PQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGE-VLASPI-QADLAKTNSNIPG--MFGTLAEQLLYTGAF 279 (319) Q Consensus 204 ~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~-A~~~~~-k~~~~~~~~~~~~--~~~~~v~gr~~yg~~ 279 (319) ... ..+. ...+..++|.|.+|+.++ .++...+++|..+ +..... +--.++..+.... .+...++...++|.. 
T Consensus 302 G~~-l~~~-~~~~~~~~l~G~pV~~~~--~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~f~~~~~~~r~~~r~d~~ 377 (390) T protein:vir:97 302 NQY-LIGN-ARGTLTPTLWGLPVVATQ--AMAPGEFLVGAFDLAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERLALV 377 (390) T ss_pred Cce-eecC-ccCCCCceecceeeEEcC--CCCCCcEEEEeccceEEEEEecceEEEEeecccccccCcEEEEEEEeeccE Confidence 000 0111 234555789999998753 4455556666543 444333 2223333332111 233567888999999 Q ss_pred EeccccceEEEEcc Q lcl|Aclame:pro 280 VPEHLQKYIFTIGG 293 (319) Q Consensus 280 V~~~k~~~Iy~~~~ 293 (319) |.+|++. +++..+ T Consensus 378 v~~~~a~-v~~~~a 390 (390) T protein:vir:97 378 VYRPEAL-ITGSFA 390 (390) T ss_pred EeccccE-EEEEeC Confidence 9999984 344443 No 85 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=98.30 E-value=1.5e-06 Score=52.51 Aligned_cols=299 Identities=11% Similarity=0.039 Sum_probs=135.3 Q ss_pred CCcccccccce--------------------eeehhhhhhhhh------------cchhhhhhhHhhHHHHHHHHHhhhh Q lcl|Aclame:pro 1 MNKTIKNATGM--------------------LKLNLQHFANKS------------VEPGQTLLKNKHVGILERVTAVNAY 48 (319) Q Consensus 1 ~~~~~~~~~~~--------------------~~~~~~~~~~~~------------~~~n~~~l~~ky~~lld~~~~~~sl 48 (319) .++........ ..-.+..|.+-- ........++.+...+.+.....+. 
T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~ 150 (415) T protein:vir:94 71 NQQSVEVNEASTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFN 150 (415) T ss_pred ccccccccchhhHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhhhhhhhhhccccccccccCcHHHHHHHHHHHHhhhh Confidence 00000000000 000111121100 0011122333443222222222221 Q ss_pred hhhcccCcceeeeCCceEEeeeccccccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHh-hhHHHHHHH Q lcl|Aclame:pro 49 STPALISNDAIFMEGRSFTVMKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTE-GNIDINYVV 127 (319) Q Consensus 49 ~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~-~~~~~~~~~ 127 (319) ... +++..-...+..++.+++....+-......++-.++.-..+...+++.-.+.-.+. . +..+-.+ ...++...+ T Consensus 151 l~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k~~~~~-~-is~ell~ds~~~~~~~i 227 (415) T protein:vir:94 151 LDK-YVTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYF-R-ISREAIEDAKVNVLQEL 227 (415) T ss_pred hhh-hcceeeccCCceeEEEEeecCCccceeccccccccccccccceeeEeeheeeeeec-h-hhHHHHhhchHHHHHHH Confidence 111 11111111233455566554433333322222112111123444555445444432 1 1211111 123455666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccCc---------cccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhh Q lcl|Aclame:pro 128 ARQGAEVVAPYLDNLRFATLARNKAK---------HLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKK 198 (319) Q Consensus 128 ~~~~~~~vapeiD~~~~s~la~~a~~---------~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~ 198 (319) .+..++.++-.+|..++.-...+... ..+...+...-|+.|.++...+....+... 
.++|+|..+..|.+ T Consensus 228 ~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~-~~vmn~~~~~~l~~ 306 (415) T protein:vir:94 228 KLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHN-VAIVSQTMFAKLDK 306 (415) T ss_pred HHHHHHHHHHHHHHHHhhccccCccccccccccccccccccccccchHHHHHHHHhhhhhccCCC-EEEEcHHHHHHHHH Confidence 77788888888887766533322111 112223444568889999988877666533 57899999998876 Q ss_pred hhhhhhcccccccceeeeeeeeecCeEEEEeccccc---ccceEEEEcCC-ceeeeeeeeeeeeecCCCCCccceeeeee Q lcl|Aclame:pro 199 FVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLL---QGLQAIAVVGE-VLASPIQADLAKTNSNIPGMFGTLAEQLL 274 (319) Q Consensus 199 ~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~---~~~n~i~~~~~-A~~~~~k~~~~~~~~~~~~~~~~~v~gr~ 274 (319) -.+-.... .......+|..++|.|.+|+.+++... .+..+++|.-+ +.....+. .+.+-..+...+...+++.. T Consensus 307 lkd~~G~~-l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~-~~~v~~~~~~~~~~~~r~~~ 384 (415) T protein:vir:94 307 MKDKLGNY-LIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRS-QYQASWTDYMHFGECLMIAV 384 (415) T ss_pred hhccCCCe-eeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeec-ceEEEEeccccCceEEEEEE Confidence 43211110 112223456678899999987765322 23457888644 45544432 22322223455667888899 Q ss_pred eeeEEEeccccceEEEEccccccCCCCCcccc Q lcl|Aclame:pro 275 YTGAFVPEHLQKYIFTIGGTEVATKRDGVDAH 306 (319) Q Consensus 275 ~yg~~V~~~k~~~Iy~~~~~~~a~~~~~~~~~ 306 (319) ++|..|.+|++. +++...++.....+-+-+. 
T Consensus 385 r~d~~~~~~~a~-~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:94 385 RQDCRILDYKSA-IVIEYDDSERGEGDLGLEA 415 (415) T ss_pred EeccEEeccccE-EEEEEeccCCCCCccccCC Confidence 999999998874 3333332222222211111 No 86 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=98.30 E-value=1.5e-06 Score=52.44 Aligned_cols=293 Identities=13% Similarity=0.070 Sum_probs=134.9 Q ss_pred CCcccccccceeeehhhhhhhh------------hcchhhhhhhHhhHH-HHHHHHHhhhhhhhcccCcceeeeCC--ce Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANK------------SVEPGQTLLKNKHVG-ILERVTAVNAYSTPALISNDAIFMEG--RS 65 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~------------~~~~n~~~l~~ky~~-lld~~~~~~sl~~~~~~n~~~~~~~g--~t 65 (319) +...+.. .....-....|..- -..-...-.++.+.. +++.+.....+. . +++ .+...+ .+ T Consensus 92 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~-~-~~~--~~~~~~~~~~ 166 (415) T protein:vir:98 92 LGISIQN-TKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLD-K-YVT--VKRVTNGSGK 166 (415) T ss_pred Hhhhhhh-hhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhh-h-hee--eeeccCCcee Confidence 0000000 00000011111100 001112223444432 333332222221 1 122 122222 34 Q ss_pred EEeeeccccccccccCCCCcccCCc-ccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 66 FTVMKGDTTELKDYKRNATNEFDHP-KIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRF 144 (319) Q Consensus 66 VkIp~i~~~g~~DY~r~~~~~~~~~-t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~ 144 (319) +.+|+........+. ..+-...+. ..+...+++.-.+.-.+. 
.--+.--.....++...+.+...+.++-.+|..++ T Consensus 167 ~~~~~~~~~~~~~~v-~E~~~~~~~~~~~~~~v~~~~~k~~~~~-~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il 244 (415) T protein:vir:98 167 YPVVRQSEVAALEKV-EELEENPELAVKPFFQLAYDINTHRGYF-RISREAIEDAKVNVLQELKLWMARTIAATRNKAII 244 (415) T ss_pred EEEEeecCCccceee-ccccccCcccccceeeEEeeeeeeEeee-hhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHh Confidence 555554443333332 222222211 234555555555544432 11111111112345666777777888888887766 Q ss_pred HHHHhccC---------ccccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceee Q lcl|Aclame:pro 145 ATLARNKA---------KHLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGK 215 (319) Q Consensus 145 s~la~~a~---------~~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~ 215 (319) .-.-.+.. ...+.+.+...-|+.|.++..++........ .++++|..+..|.+-.+-.... .......+ T Consensus 245 ~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~-~~v~n~~~~~~l~~lkd~~G~~-l~~~~~~~ 322 (415) T protein:vir:98 245 DVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHN-VAIVSQTMFAKLDKMKDKLGNY-LIQPDVKE 322 (415) T ss_pred hccccCccccccccccccccccccccccchhHHHHHHHhhhhhccCCC-EEEEcHHHHHHHHHhhccCCce-eeccCcCC Confidence 53322211 1112233445668999999999887766644 4788999999887532211100 11222345 Q ss_pred eeeeeecCeEEEEeccccc---ccceEEEEcCC-ceeeeeee-eeeeeecCCCCCccceeeeeeeeeEEEeccccceEEE Q lcl|Aclame:pro 216 GVQGELDGFVIVKVPTKLL---QGLQAIAVVGE-VLASPIQA-DLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFT 290 (319) Q Consensus 216 g~Vg~idG~~I~~vps~~~---~~~n~i~~~~~-A~~~~~k~-~~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~ 290 (319) |..++|.|++|+.+++... .+..+++|.-+ +.....+. -.++.. +...+...+++-.++|..|.+|++. 
+++ T Consensus 323 ~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~--~~~~~~~~~~~~~r~d~~v~~~~a~-~~~ 399 (415) T protein:vir:98 323 KTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWT--DYMHFGECLMIAVRQDCRILDYKSA-IVI 399 (415) T ss_pred CCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEe--ccccCceEEEEEEEeccEEeccccE-EEE Confidence 6677999999987765322 34557788644 44443332 223322 2445666788888999999998874 233 Q ss_pred EccccccCCCC-Cccc Q lcl|Aclame:pro 291 IGGTEVATKRD-GVDA 305 (319) Q Consensus 291 ~~~~~~a~~~~-~~~~ 305 (319) ...++.....+ |-.+ T Consensus 400 ~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:98 400 EYDDSERGEGDLGLEA 415 (415) T ss_pred EEeccCCCCCccccCC Confidence 33222222222 1122 No 87 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=98.30 E-value=1.5e-06 Score=52.44 Aligned_cols=293 Identities=13% Similarity=0.070 Sum_probs=134.9 Q ss_pred CCcccccccceeeehhhhhhhh------------hcchhhhhhhHhhHH-HHHHHHHhhhhhhhcccCcceeeeCC--ce Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANK------------SVEPGQTLLKNKHVG-ILERVTAVNAYSTPALISNDAIFMEG--RS 65 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~------------~~~~n~~~l~~ky~~-lld~~~~~~sl~~~~~~n~~~~~~~g--~t 65 (319) +...+.. .....-....|..- -..-...-.++.+.. +++.+.....+. . 
+++ .+...+ .+ T Consensus 92 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~-~-~~~--~~~~~~~~~~ 166 (415) T protein:vir:79 92 LGISIQN-TKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLD-K-YVT--VKRVTNGSGK 166 (415) T ss_pred Hhhhhhh-hhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhh-h-hee--eeeccCCcee Confidence 0000000 00000011111100 001112223444432 333332222221 1 122 122222 34 Q ss_pred EEeeeccccccccccCCCCcccCCc-ccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 66 FTVMKGDTTELKDYKRNATNEFDHP-KIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRF 144 (319) Q Consensus 66 VkIp~i~~~g~~DY~r~~~~~~~~~-t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~ 144 (319) +.+|+........+. ..+-...+. ..+...+++.-.+.-.+. .--+.--.....++...+.+...+.++-.+|..++ T Consensus 167 ~~~~~~~~~~~~~~v-~E~~~~~~~~~~~~~~v~~~~~k~~~~~-~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il 244 (415) T protein:vir:79 167 YPVVRQSEVAALEKV-EELEENPELAVKPFFQLAYDINTHRGYF-RISREAIEDAKVNVLQELKLWMARTIAATRNKAII 244 (415) T ss_pred EEEEeecCCccceee-ccccccCcccccceeeEEeeeeeeEeee-hhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHh Confidence 555554443333332 222222211 234555555555544432 11111111112345666777777888888887766 Q ss_pred HHHHhccC---------ccccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceee Q lcl|Aclame:pro 145 ATLARNKA---------KHLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGK 215 (319) Q Consensus 145 s~la~~a~---------~~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~ 215 (319) .-.-.+.. ...+.+.+...-|+.|.++..++........ .++++|..+..|.+-.+-.... 
.......+ T Consensus 245 ~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~-~~v~n~~~~~~l~~lkd~~G~~-l~~~~~~~ 322 (415) T protein:vir:79 245 DVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHN-VAIVSQTMFAKLDKMKDKLGNY-LIQPDVKE 322 (415) T ss_pred hccccCccccccccccccccccccccccchhHHHHHHHhhhhhccCCC-EEEEcHHHHHHHHHhhccCCce-eeccCcCC Confidence 53322211 1112233445668999999999887766644 4788999999887532211100 11222345 Q ss_pred eeeeeecCeEEEEeccccc---ccceEEEEcCC-ceeeeeee-eeeeeecCCCCCccceeeeeeeeeEEEeccccceEEE Q lcl|Aclame:pro 216 GVQGELDGFVIVKVPTKLL---QGLQAIAVVGE-VLASPIQA-DLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFT 290 (319) Q Consensus 216 g~Vg~idG~~I~~vps~~~---~~~n~i~~~~~-A~~~~~k~-~~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~ 290 (319) |..++|.|++|+.+++... .+..+++|.-+ +.....+. -.++.. +...+...+++-.++|..|.+|++. +++ T Consensus 323 ~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~--~~~~~~~~~~~~~r~d~~v~~~~a~-~~~ 399 (415) T protein:vir:79 323 KTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWT--DYMHFGECLMIAVRQDCRILDYKSA-IVI 399 (415) T ss_pred CCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEe--ccccCceEEEEEEEeccEEeccccE-EEE Confidence 6677999999987765322 34557788644 44443332 223322 2445666788888999999998874 233 Q ss_pred EccccccCCCC-Cccc Q lcl|Aclame:pro 291 IGGTEVATKRD-GVDA 305 (319) Q Consensus 291 ~~~~~~a~~~~-~~~~ 305 (319) ...++.....+ |-.+ T Consensus 400 ~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:79 400 EYDDSERGEGDLGLEA 415 (415) T ss_pred EEeccCCCCCccccCC Confidence 33222222222 1122 No 88 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=98.30 E-value=1.5e-06 Score=52.44 Aligned_cols=293 Identities=13% Similarity=0.070 Sum_probs=134.9 Q ss_pred CCcccccccceeeehhhhhhhh------------hcchhhhhhhHhhHH-HHHHHHHhhhhhhhcccCcceeeeCC--ce Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANK------------SVEPGQTLLKNKHVG-ILERVTAVNAYSTPALISNDAIFMEG--RS 65 (319) Q 
Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~------------~~~~n~~~l~~ky~~-lld~~~~~~sl~~~~~~n~~~~~~~g--~t 65 (319) +...+.. .....-....|..- -..-...-.++.+.. +++.+.....+. . +++ .+...+ .+ T Consensus 92 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~-~-~~~--~~~~~~~~~~ 166 (415) T protein:vir:81 92 LGISIQN-TKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLD-K-YVT--VKRVTNGSGK 166 (415) T ss_pred Hhhhhhh-hhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhh-h-hee--eeeccCCcee Confidence 0000000 00000011111100 001112223444432 333332222221 1 122 122222 34 Q ss_pred EEeeeccccccccccCCCCcccCCc-ccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 66 FTVMKGDTTELKDYKRNATNEFDHP-KIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRF 144 (319) Q Consensus 66 VkIp~i~~~g~~DY~r~~~~~~~~~-t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~ 144 (319) +.+|+........+. ..+-...+. ..+...+++.-.+.-.+. .--+.--.....++...+.+...+.++-.+|..++ T Consensus 167 ~~~~~~~~~~~~~~v-~E~~~~~~~~~~~~~~v~~~~~k~~~~~-~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il 244 (415) T protein:vir:81 167 YPVVRQSEVAALEKV-EELEENPELAVKPFFQLAYDINTHRGYF-RISREAIEDAKVNVLQELKLWMARTIAATRNKAII 244 (415) T ss_pred EEEEeecCCccceee-ccccccCcccccceeeEEeeeeeeEeee-hhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHh Confidence 555554443333332 222222211 234555555555544432 11111111112345666777777888888887766 Q ss_pred HHHHhccC---------ccccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceee Q lcl|Aclame:pro 145 ATLARNKA---------KHLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGK 215 (319) Q Consensus 145 s~la~~a~---------~~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~ 215 (319) .-.-.+.. ...+.+.+...-|+.|.++..++........ .++++|..+..|.+-.+-.... 
.......+ T Consensus 245 ~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~-~~v~n~~~~~~l~~lkd~~G~~-l~~~~~~~ 322 (415) T protein:vir:81 245 DVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHN-VAIVSQTMFAKLDKMKDKLGNY-LIQPDVKE 322 (415) T ss_pred hccccCccccccccccccccccccccccchhHHHHHHHhhhhhccCCC-EEEEcHHHHHHHHHhhccCCce-eeccCcCC Confidence 53322211 1112233445668999999999887766644 4788999999887532211100 11222345 Q ss_pred eeeeeecCeEEEEeccccc---ccceEEEEcCC-ceeeeeee-eeeeeecCCCCCccceeeeeeeeeEEEeccccceEEE Q lcl|Aclame:pro 216 GVQGELDGFVIVKVPTKLL---QGLQAIAVVGE-VLASPIQA-DLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFT 290 (319) Q Consensus 216 g~Vg~idG~~I~~vps~~~---~~~n~i~~~~~-A~~~~~k~-~~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~ 290 (319) |..++|.|++|+.+++... .+..+++|.-+ +.....+. -.++.. +...+...+++-.++|..|.+|++. +++ T Consensus 323 ~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~--~~~~~~~~~~~~~r~d~~v~~~~a~-~~~ 399 (415) T protein:vir:81 323 KTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWT--DYMHFGECLMIAVRQDCRILDYKSA-IVI 399 (415) T ss_pred CCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEe--ccccCceEEEEEEEeccEEeccccE-EEE Confidence 6677999999987765322 34557788644 44443332 223322 2445666788888999999998874 233 Q ss_pred EccccccCCCC-Cccc Q lcl|Aclame:pro 291 IGGTEVATKRD-GVDA 305 (319) Q Consensus 291 ~~~~~~a~~~~-~~~~ 305 (319) ...++.....+ |-.+ T Consensus 400 ~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:81 400 EYDDSERGEGDLGLEA 415 (415) T ss_pred EEeccCCCCCccccCC Confidence 33222222222 1122 No 89 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=98.30 E-value=1.6e-06 Score=52.41 Aligned_cols=280 Identities=7% Similarity=-0.091 Sum_probs=128.1 Q ss_pred CCcccccc----------c---------ceeeehhhhhhhhhcchhhhhhhHhhH-HHHHHHHHhhhhhhhcccCcceee Q lcl|Aclame:pro 1 MNKTIKNA----------T---------GMLKLNLQHFANKSVEPGQTLLKNKHV-GILERVTAVNAYSTPALISNDAIF 60 (319) Q Consensus 1 
~~~~~~~~----------~---------~~~~~~~~~~~~~~~~~n~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~ 60 (319) .....+.. . .++.+....+..-... .-.-++..+. .+++.+.....+ .. +++. .- T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~g~~vp~~~~~~ii~~~~~~~~l-~~-l~~~--~~ 151 (395) T protein:vir:43 77 GEEAPKTAGQMVAESLKEQGVTSSLRGSHRVSMPRSAITSIDGS-GGALVAPDRRPGVVAAPQRRLTI-RD-LVAP--GT 151 (395) T ss_pred ccchhhhHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhcccCCC-CccccchhhHHHHHHHHHhhhhH-Hh-hccc--ee Confidence 00000000 0 0011111111000000 0001222222 223222222222 11 1221 22 Q ss_pred eCCceEEeeecccc-ccccccC-CCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 61 MEGRSFTVMKGDTT-ELKDYKR-NATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPY 138 (319) Q Consensus 61 ~~g~tVkIp~i~~~-g~~DY~r-~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vape 138 (319) .++.++++|..... ....+.. ++.....+++ ....++.-.+.-.+.. +..+-.+....+...+.+..+..++-. T Consensus 152 ~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~--~~~i~~~~~k~~~~~~--is~ell~d~~~l~~~v~~~la~a~~~~ 227 (395) T protein:vir:43 152 TESNSVEYVRETGFVNNAAPVSEGTQKPYSDLT--FELENAPVRTIAHLFK--ASRQILDDASALQSYIDARARYGLMLV 227 (395) T ss_pred cCCCceEEEEEecCCCceeeecCCccccccccc--eeEEEEeeeeEEEeeh--hhHHHHHhHHHHHHHHHHHHHHHHHHH Confidence 35667899987553 2333222 2222333344 4444444444333321 221111111234556677777888888 Q ss_pred HHHHHHHH---------HHhccCc---cccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcc Q lcl|Aclame:pro 139 LDNLRFAT---------LARNKAK---HLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQG 206 (319) Q Consensus 139 iD~~~~s~---------la~~a~~---~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~ 206 (319) +|..++.- +....+. ..+.+.+....++.|.++...+.....+. -.++|+|..+..|.+-.+-... 
T Consensus 228 ~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~-~~~vmn~~~~~~l~~lkd~~G~- 305 (395) T protein:vir:43 228 EECQLLYGNGTGANLHGIIPQAQAYAPPSGVVVTAEQRIDRIRLAILQAQLAEFPA-SGIVLNPIDWALIELNKDAENR- 305 (395) T ss_pred HHHHHHhccCCCCccccccccccccccccccccccchhHHHHHHHHHhhccccCCC-cEEEEcHHHHHHHHHhhccCCc- Confidence 88865531 0000000 01112334567999999998888766553 3578999999888654321110 Q ss_pred cccccceeeeeeeeecCeEEEEecccccccceEEEEcCC-ceeeee-eeeeeeeecCCCC---CccceeeeeeeeeEEEe Q lcl|Aclame:pro 207 DTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGE-VLASPI-QADLAKTNSNIPG---MFGTLAEQLLYTGAFVP 281 (319) Q Consensus 207 ~~~~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~-A~~~~~-k~~~~~~~~~~~~---~~~~~v~gr~~yg~~V~ 281 (319) -+.. ...+|..+.|.|.+|+.+ +.++...+++|.-+ +..... +=-.+++.+.... .+...++...++|..|. T Consensus 306 ~i~~-~~~~~~~~~l~G~pVv~~--~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~ 382 (395) T protein:vir:43 306 YIIG-SPQNGTTPTLWRLPVVET--QAITQDEFLTGAFSLGAQIFDRMDIEVLVSTENDKDFENNMVTIRAEERLAFAVY 382 (395) T ss_pred eecc-ccccCCCceecceeeEEc--CCCCCCcEEEEeccceEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEe Confidence 0111 234666778999999864 45666666766643 333332 2222343331111 23457888899999999 Q ss_pred ccccceEEEEcccc Q lcl|Aclame:pro 282 EHLQKYIFTIGGTE 295 (319) Q Consensus 282 ~~k~~~Iy~~~~~~ 295 (319) +|++. 
+++.+.++ T Consensus 383 ~~~a~-~~~~~taa 395 (395) T protein:vir:43 383 RPEAF-VTGSLTAS 395 (395) T ss_pred cccce-EEEEeccC Confidence 99973 33333222 No 90 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=98.24 E-value=2.1e-06 Score=51.68 Aligned_cols=271 Identities=8% Similarity=-0.063 Sum_probs=128.9 Q ss_pred cceeeehhhhhhhhhcchhhhhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccccCCCCccc Q lcl|Aclame:pro 9 TGMLKLNLQHFANKSVEPGQTLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRNATNEF 87 (319) Q Consensus 9 ~~~~~~~~~~~~~~~~~~n~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~r~~~~~~ 87 (319) -|.=.|+.. ...--...+++.+. .+++.+...+.+ ... ++ ....++++.++|..+..+..-+........ T Consensus 1 ~g~~a~~~~-----~~~~~~~~iP~~~~~~ii~~~~~~s~l-~~~-~~--~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~ 71 (299) T protein:vir:41 1 MGFNPDTTT-----MQSAKTGSIPINISEQIITGVKNGSAA-MKL-AK--AVPMTKPEEEFTFMSGVGAFWVDEAERIQT 71 (299) T ss_pred CCcCCCccc-----ccCCCceecchhHHHHHHHHHHhcchh-hhh-ce--eeecCCCcEEEEEEcCCceeeeecCccccc Confidence 010001000 00000112333342 333333332222 211 21 244567788999887655444433333333 Q ss_pred CCcccceeEEEEeecccceeecchhhHHHHh-hhHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHhccCcccccc Q lcl|Aclame:pro 88 DHPKIEETTYFLDQEKYWGRFVDALDRKDTE-GNIDINYVVARQGAEVVAPYLDNLRFAT--------LARNKAKHLTVG 158 (319) Q Consensus 88 ~~~t~t~~tltidqdr~~~F~VD~~D~~et~-~~~~~~~~~~~~~~~~vapeiD~~~~s~--------la~~a~~~~~~~ 158 (319) .+++ ....+++..|.-.+.- +-.+-.+ ....+.+.+.+....+++-.+|..++.- +...+....+.. 
T Consensus 72 ~~~~--f~~v~l~~~k~~~~~~--is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~gil~~~~~~~~~~ 147 (299) T protein:vir:41 72 SKPT--FTKAKMRSKKMGVIIP--TTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNWNILKSATDASNLV 147 (299) T ss_pred cccc--eeEEEEeeEEEEEeeh--hhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccccccccccceee Confidence 3444 4444555444333321 1111111 1234566777888888888888765521 000011111111 Q ss_pred CCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEEEEeccccc--cc Q lcl|Aclame:pro 159 TGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLL--QG 236 (319) Q Consensus 159 ~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~--~~ 236 (319) ......|+.|.++..+|.+.+.+.. .++++|..+..|.+-.+-.... ..+.. ..+-.+++.|.+|+.+++-.. .. T Consensus 148 ~~~~~~~~~l~~~~~~l~~~~~~~~-~~v~n~~~~~~L~~lkd~~G~~-l~~~~-~~~~~~~l~G~PV~~~~~~~~~~~~ 224 (299) T protein:vir:41 148 EETANKYDDLNEAIGLIEAEDLEPN-GIATIRKQRVKYRSTKDGNGMP-IFNTA-TSNGVDDVLGLPIAYTPKYTFGDKD 224 (299) T ss_pred ccccccHHHHHHHHHhhhcccCCcC-EEEEcHHHHHHHHHhhccCCce-eecCC-cCCCCceecceeeEEecccCCCCCc Confidence 2234568889999999988777644 4789999998888643211100 01111 223346899999997653221 12 Q ss_pred ceEEEEcCCceeeee-eeeeeeeecCC-------C--------CCccceeeeeeeeeEEEeccccceEEEEccccccCC Q lcl|Aclame:pro 237 LQAIAVVGEVLASPI-QADLAKTNSNI-------P--------GMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATK 299 (319) Q Consensus 237 ~n~i~~~~~A~~~~~-k~~~~~~~~~~-------~--------~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~ 299 (319) ..+++|.-+-..... +=-.+++.+.. + ..+.-+++...++|..|.+|++....-. +++. 
T Consensus 225 ~~~~~gdfs~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~----~aa~ 299 (299) T protein:vir:41 225 ISELVGDWNQAYYGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQP----KAGN 299 (299) T ss_pred eEEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEe----ccCC Confidence 335666554332222 22233333210 0 0122456777899999999987444322 2222 No 91 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=98.23 E-value=2.2e-06 Score=51.56 Aligned_cols=262 Identities=12% Similarity=0.002 Sum_probs=131.9 Q ss_pred hhhhhhcchhhhhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccccCCC-CcccCCccccee Q lcl|Aclame:pro 18 HFANKSVEPGQTLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRNA-TNEFDHPKIEET 95 (319) Q Consensus 18 ~~~~~~~~~n~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~r~~-~~~~~~~t~t~~ 95 (319) |.. ....-.++.+. .+++.+...+.+.. . +. .+..++.+++||.+...+-..+...+ .....+++. . T Consensus 1 ma~-----~gG~lip~~~~~~ii~~~~~~s~i~~-~-~~--~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f--~ 69 (298) T protein:vir:94 1 MVL-----NKGTLFDPELVTDLISKVAGKSSIAR-L-SA--QKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTL--A 69 (298) T ss_pred Cee-----ccccccChhHHHHHHHHHHhhchhhh-h-cc--eeeccCCceEEEEEecCcceEEeeCCccccccccce--e Confidence 221 11222333342 33444433333221 1 21 24456677999998655444333222 223233444 4 Q ss_pred EEEEeecccceeecchhhHH--HH-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc----------------cc Q lcl|Aclame:pro 96 TYFLDQEKYWGRFVDALDRK--DT-EGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH----------------LT 156 (319) Q Consensus 96 tltidqdr~~~F~VD~~D~~--et-~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~----------------~~ 156 (319) ..++.-.|.-. .+.--+.- ++ .....+...+.+..++++.-.+|.-.+.-.....+.. .. 
T Consensus 70 ~v~l~~~k~~~-~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~ 148 (298) T protein:vir:94 70 PQTMVPIKVEY-GARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVE 148 (298) T ss_pred EEEEeeeEEEE-eeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccccccccccccc Confidence 44444333322 22211210 00 0112345556778888888889887663211111100 00 Q ss_pred ccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEEEEeccc---- Q lcl|Aclame:pro 157 VGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTK---- 232 (319) Q Consensus 157 ~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~---- 232 (319) ......++++.|+++..++..++.+.. .++++|..+..|++-.+-... -..+.....|..++|.|++|+.+++- T Consensus 149 ~~~~~~~~~~~i~~~~~~~~~~~~~~~-~~vmn~~~~~~l~~lkd~~G~-~l~~~~~~~~~~~tl~G~PV~~~~~v~~~~ 226 (298) T protein:vir:94 149 APRGIADPNGAIENAVELLTGVDADVT-GIAINPSFRSALAKQKDLQGN-ALFPELKWGATPDTINGLPVDVNKTVSDMS 226 (298) T ss_pred cccccccHHHHHHHHHHhhhhcCCCcc-EEEEcHHHHHHHHHhhccCCC-eeecCcccCCCCceecceeeEEeccccccc Confidence 111234578889999999988776633 589999999888764321110 01133445677789999999965421 Q ss_pred ccccceEEEEcCCc-e-eeeeeeeeeeeecCC-CC--------CccceeeeeeeeeEEEeccccceEEEEcccc Q lcl|Aclame:pro 233 LLQGLQAIAVVGEV-L-ASPIQADLAKTNSNI-PG--------MFGTLAEQLLYTGAFVPEHLQKYIFTIGGTE 295 (319) Q Consensus 233 ~~~~~n~i~~~~~A-~-~~~~k~~~~~~~~~~-~~--------~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~ 295 (319) ......+++|.-+- + ....+--.+++.+.. ++ .+.-.++...++|..|.+|++... 
-.+.+ T Consensus 227 ~~~~~~~~~Gdfs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~--l~~~t 298 (298) T protein:vir:94 227 LTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFAR--VTEAN 298 (298) T ss_pred CCCccEEEEeeccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccEeecccceEE--EEecC Confidence 11223466665542 2 333444444544321 11 122467888899999999987333 22222 No 92 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=98.23 E-value=2.3e-06 Score=51.45 Aligned_cols=279 Identities=10% Similarity=0.001 Sum_probs=133.1 Q ss_pred ehhhhhhhhhcchhhhhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCceEEeeecccc-ccccccCCCCcccCCcc Q lcl|Aclame:pro 14 LNLQHFANKSVEPGQTLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTT-ELKDYKRNATNEFDHPK 91 (319) Q Consensus 14 ~~~~~~~~~~~~~n~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~-g~~DY~r~~~~~~~~~t 91 (319) |+-.|..-.-. -...-+++.|. .+++.+.....+.. +++.--...+..+..||+.... +...+...++-..+.-. T Consensus 1 ~l~~~~~~t~~-~gg~liP~~~~~~Ii~~~~~~~~l~~--~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~ 77 (293) T protein:vir:48 1 MLDSKTDHSGS-DAGLTIPQDIRTAINTLVRQYDSLQE--YVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADIDD 77 (293) T ss_pred CceeecccccC-cCceEechhHHHHHHHHHHhhhhhhh--hceeeeccCCcceEEEEeecCCCcceeeecCCcccccccc Confidence 22222222211 11223355553 33333333222211 1221112223457778887653 44454433332222222 Q ss_pred cceeEEEEeecccceeecchhhHHHHh-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccCCHhHHHHHHHH Q lcl|Aclame:pro 92 IEETTYFLDQEKYWGRFVDALDRKDTE-GNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLD 170 (319) Q Consensus 92 ~t~~tltidqdr~~~F~VD~~D~~et~-~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~~~~T~~n~~~~i~~ 170 (319) .+....++...|.-.+ +. +..+-.+ ....+.+.+.++.+..++-..|+-+++.+-..+. .. 
...-|+.|.+ T Consensus 78 ~~~~~i~l~~~k~~~~-~~-iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~--~~----~~~~~d~i~~ 149 (293) T protein:vir:48 78 PKLSLIKYTIKRYAGI-ST-VTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLPT--KP----TLTKWDDIID 149 (293) T ss_pred cceeEEEEeeeEEEEe-eh-hhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhccccccc--cc----cccCHHHHHH Confidence 3344444444443332 22 1211111 1234556667777777877778766653322111 11 2223677777 Q ss_pred HHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEEEEecccccc-----cceEEEEcC- Q lcl|Aclame:pro 171 VSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQ-----GLQAIAVVG- 244 (319) Q Consensus 171 a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~~-----~~n~i~~~~- 244 (319) +..+|..+..+ +-.++++|..+..|++-.+-.... ..+....+|..++|.|.+|+.+++..+. ...++++.- T Consensus 150 ~~~~l~~~~~~-~a~~vmn~~~~~~L~~lkd~~g~~-l~~~~~~~~~~~~l~G~Pv~~~~~~~~~~~~~~~~~~~~gd~~ 227 (293) T protein:vir:48 150 LEAKVDPAIKQ-TSFFLTNTSGFTALKKVKNALGDY-LMERDVKSPTGYSIAGFAVKEISDRWLPNASSGVMPLYFGDLK 227 (293) T ss_pred HHHhhhhhhcC-CCEEEEcHHHHHHHHHhhccCCce-EeecCcCCCCCceecceeeEEecccccCCccCCceEEEEEecc Confidence 88888765443 446789999999887644211110 1122344566778999999876554433 345677764 Q ss_pred CceeeeeeeeeeeeecCCC--C---CccceeeeeeeeeEEEeccccceEEEEccccccCCCCCcccc Q lcl|Aclame:pro 245 EVLASPIQADLAKTNSNIP--G---MFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDGVDAH 306 (319) Q Consensus 245 ~A~~~~~k~~~~~~~~~~~--~---~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~~~~~ 306 (319) .+.....+ ..+++...++ + .+...++...++|..+.+|++....--...+.+....|.++- T Consensus 228 ~~~~~~~~-~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~~~~~~~~~~~ 293 (293) T protein:vir:48 228 QAVTLFDR-QQMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQKGNIGSTAV 293 (293) T ss_pred ceEEEEEe-cceEEEEecccchhhhcCeEEEEEEEeeCcEEecccceEEEEeeccccCCccccccCC Confidence 35444433 2333322222 1 233578899999999999987433222221111111222222 No 93 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # 
MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=98.20 E-value=2.8e-06 Score=51.06 Aligned_cols=292 Identities=10% Similarity=-0.056 Sum_probs=130.7 Q ss_pred CCcccccccceeeehhhhhhhhhcchh--hhhhhHhhHH-HHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccc-c Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPG--QTLLKNKHVG-ILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTE-L 76 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~n--~~~l~~ky~~-lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g-~ 76 (319) +.+.+++..+.+...-+. +.....+. ...+++.+.. +++.+.....+.. . ++..-...+..++.+++....+ . T Consensus 97 ~~~~~~~~~~~~~~~e~~-a~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~-~-~~~~~~~~~~~~~~~~~~~~~~~~ 173 (404) T protein:vir:39 97 FVNMVRNPMAFLNTVSSK-TETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQ-Y-VRVESVSTSNGSRVYEKWTDVTPL 173 (404) T ss_pred HHHHHhcchhhhhhhhhh-hhhcccccCCceeccHHHHHHHHHHHHhhhhHHh-h-cceeeccCCcceEEEEeecCCccc Confidence 111111111111000000 00000000 1124555543 3333333222221 1 2211122233455666665443 2 Q ss_pred ccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHh-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccc Q lcl|Aclame:pro 77 KDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTE-GNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHL 155 (319) Q Consensus 77 ~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~-~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~ 155 (319) ..+...++-.++.-..++..+++.-.+...+. . +..+-.+ ...++.+.+.+.....++-.+|..++.- .+.+... 
T Consensus 174 a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~-~-iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~g--~g~~~~~ 249 (404) T protein:vir:39 174 TVMDAEDGKIPDLDNPRLTIIKYLIKRYAGII-T-ATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAA--MGTVPKK 249 (404) T ss_pred eeeecCccccccccccceeeEEeeeeeEEeee-h-hHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhc--ccccccc Confidence 33322222222222234455555555544432 1 2221111 1234566677888888888888865542 1122222 Q ss_pred cccCCHhHHHHHHHHHHH-HHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEEEEeccccc Q lcl|Aclame:pro 156 TVGTGSDAQYDAVLDVSV-ELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLL 234 (319) Q Consensus 156 ~~~~T~~n~~~~i~~a~~-~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~ 234 (319) ....+ ++.+.++.. .++. .+-.+-.++++|..+..|.+-.+-.... ........+..++|.|++|+.+.+..+ T Consensus 250 ~~~~~----~~~i~~~~~~~~~~-~~~~~a~~v~n~~~~~~L~~lkd~~G~~-l~~~~~~~~~~~~l~G~pV~~~~~~~~ 323 (404) T protein:vir:39 250 PTIAK----FDDVITMINTSVDP-AIIATSSLLTNQSGLNKLALVKTAEGKY-LLEPDPTKPNSYLIKGKKVIVVADRWL 323 (404) T ss_pred ccccc----HHHHHHHHHHhhhh-hhccCCEEEEcHHHHHHHHHhhccCCce-eeccCcCCCCcceecceeEEEeccccc Confidence 22233 444444443 3333 3333557899999999998643211110 112223456667899999997654333 Q ss_pred c-----cceEEEEcCC-ceeeeee-eeeeeeecCCCC---CccceeeeeeeeeEEEeccccceEEEEccccccCCCCCcc Q lcl|Aclame:pro 235 Q-----GLQAIAVVGE-VLASPIQ-ADLAKTNSNIPG---MFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDGVD 304 (319) Q Consensus 235 ~-----~~n~i~~~~~-A~~~~~k-~~~~~~~~~~~~---~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~~~ 304 (319) + ...+++|... 
+.....+ =-.+++.+...+ .+...++....+|+.|++|++..+ ..-++.+.+..+.+ T Consensus 324 ~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~--~~~~~~a~~~~~~~ 401 (404) T protein:vir:39 324 PNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKTTDSEALVA--GSFTAIADQVGNFT 401 (404) T ss_pred CccCCCccEEEEEeccccEEEEeecceEEEEeccchhhhhhceeeEEEEeeeccEEecccceEE--EEeeccccCCCCCC Confidence 3 3457777654 4544433 223333321111 234578888999999999987422 22233333333344 Q ss_pred ccc Q lcl|Aclame:pro 305 AHA 307 (319) Q Consensus 305 ~~~ 307 (319) +++ T Consensus 402 ~~~ 404 (404) T protein:vir:39 402 AGK 404 (404) T ss_pred CCC Confidence 555 No 94 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=98.19 E-value=2.8e-06 Score=50.99 Aligned_cols=295 Identities=10% Similarity=-0.006 Sum_probs=126.5 Q ss_pred CCcccccccceeeehhhhhhhhhc--chhhhhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccc-c Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSV--EPGQTLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTE-L 76 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~--~~n~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g-~ 76 (319) +.+.+++..+.+...-+. |.+-. ......+++.|. .+++.+.....+. .+++..-...+..++.|++....+ . 
T Consensus 97 ~~~~~~~~~~~~~~~~~~-a~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~--~~~~~~~~~~~~~~~~~~~~~~~~~~ 173 (408) T protein:vir:74 97 FVNMVRNPMAFLNTVSSK-TETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQ--QYVRVESVSTSSGSRVYEKWTDVTPL 173 (408) T ss_pred HHHHHhcchhhhhhhhhh-hhcccccCCCceeechhHhhHHHHHHhhhcchh--hhcceeeccCCcceEEEEeecCCccc Confidence 000111111110000000 00000 001112455554 3333333333322 122221122344567777776543 4 Q ss_pred ccccCCCCcccC--CcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc Q lcl|Aclame:pro 77 KDYKRNATNEFD--HPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH 154 (319) Q Consensus 77 ~DY~r~~~~~~~--~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~ 154 (319) +.+...++-..+ .++....+++. .+...+. .--+.--.....++...+.+.....++-.+|..++.- .+.+.. T Consensus 174 ~~~v~E~~~~~~~~~~~~~~i~~~~--~k~~~~~-~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~G--~G~~~~ 248 (408) T protein:vir:74 174 KAMDEEDGKIPDLDNPRLTIIKYLI--KRYAGII-TATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAA--MGTVPK 248 (408) T ss_pred ccccccccccccccccceeeEEeee--eeEEeee-hhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhc--cccccc Confidence 444333222222 24444444444 4333321 1111100011234456666777778888888765431 111222 Q ss_pred ccccCCHhHHHHHHHHHH-HHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEEEEecccc Q lcl|Aclame:pro 155 LTVGTGSDAQYDAVLDVS-VELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKL 233 (319) Q Consensus 155 ~~~~~T~~n~~~~i~~a~-~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~ 233 (319) ....++ ++.+.++. ..|+.... .+-.++++|..+..|++-.+-.... ........|.-++|.|++|+.+++.. 
T Consensus 249 ~~~~~~----~~~i~~~~~~~l~~~~~-~~a~~v~n~~~~~~l~~lkd~~G~~-l~~~~~~~~~~~~l~G~pV~~~~~~~ 322 (408) T protein:vir:74 249 KPTIAN----FDDVITMINTSVDPAII-ATSSLLTNQSGLNKLALVKTAEGKY-LLEPDPTKPNSYLIKGKQVIVVADRW 322 (408) T ss_pred cccccc----HHHHHHHHHHhhhhhhc-CCCEEEEcHHHHHHHHHhhcCCCce-EeccCcCCCCCceecceeeEEecCcc Confidence 222233 44444443 35555433 3456889999999998643211110 11122334555789999998766544 Q ss_pred cc-----cceEEEEcCC-ceeeee-eeeeeeeecCCC---CCccceeeeeeeeeEEEeccccceEEEEccccccCCCCCc Q lcl|Aclame:pro 234 LQ-----GLQAIAVVGE-VLASPI-QADLAKTNSNIP---GMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDGV 303 (319) Q Consensus 234 ~~-----~~n~i~~~~~-A~~~~~-k~~~~~~~~~~~---~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~~ 303 (319) ++ ...+++|.-+ +..... +=-.++..+... ..+...++....+|..+++|++..+..-...+++. ... T Consensus 323 ~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~--~~~ 400 (408) T protein:vir:74 323 LPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFTAIADQV--GNF 400 (408) T ss_pred cccccCCcceEEEEehhccEEEEEecceEEEEeccccchhhcceeeEEEEEeeCcEEecccceEEEEeecccCCC--CCC Confidence 43 2346666543 444433 322233222111 12446788889999999999875443322222222 111 Q ss_pred cccccccc Q lcl|Aclame:pro 304 DAHADNVA 311 (319) Q Consensus 304 ~~~~~~~~ 311 (319) ...+.++- T Consensus 401 ~~~~~~~~ 408 (408) T protein:vir:74 401 KTTTSTAV 408 (408) T ss_pred CCCccccC Confidence 11111111 No 95 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=98.15 E-value=3.1e-06 Score=50.79 Aligned_cols=284 Identities=10% Similarity=-0.003 Sum_probs=132.2 Q ss_pred CCcccccc-------------cceeeeh-hhhhhhhh-----cchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeee Q lcl|Aclame:pro 1 MNKTIKNA-------------TGMLKLN-LQHFANKS-----VEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFM 61 (319) Q Consensus 1 
~~~~~~~~-------------~~~~~~~-~~~~~~~~-----~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~ 61 (319) ........ .|++..- ...++... +.....-..+.+..++.+.....+... .++ +..... T Consensus 74 ~~~~~~~~~~~~~~~~~~~~r~~~~~~~r~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~-~~~-~~~~~~ 151 (390) T protein:vir:62 74 LQGSGSGAQRSADVDDDATLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMR-GGA-TTFTTS 151 (390) T ss_pred cccccccchhhcchHHHHHHhhhhhhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhh-hcc-eeeecC Confidence 00000000 0111000 00001000 001111223344555555544333321 122 223344 Q ss_pred CCceEEeeeccccccccccCCCC-cccCCcccceeEEEEeecccceeecchhhHHHHh-hhHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 62 EGRSFTVMKGDTTELKDYKRNAT-NEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTE-GNIDINYVVARQGAEVVAPYL 139 (319) Q Consensus 62 ~g~tVkIp~i~~~g~~DY~r~~~-~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~-~~~~~~~~~~~~~~~~vapei 139 (319) +++.++||.....+-..+...++ ....++ ++...++.-.|...+. . +..+-.. ...++...+.+..+..++-.+ T Consensus 152 ~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~--~f~~i~~~~~k~~~~~-~-iS~ell~ds~~~l~~~i~~~l~~~i~~~~ 227 (390) T protein:vir:62 152 DANPLDFTVITGRSSASIVGETAEIPESYP--ATAQRSMGGFKYGFAS-V-VSYEFATDQVLDLVGFLVSDAGPAIGDAM 227 (390) T ss_pred CCceeEEEEEcCCcceeeeccccccccccc--ceeeeEeeeeeEEeeh-H-HHHHHHhhhhHHHHHHHHHHHHHHHHHHH Confidence 56779999887655555543222 222333 4455555555544432 2 2222111 123455667778888888888 Q ss_pred HHHHHHH------H-HhccCccc--cccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccc Q lcl|Aclame:pro 140 DNLRFAT------L-ARNKAKHL--TVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQ 210 (319) Q Consensus 140 D~~~~s~------l-a~~a~~~~--~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~ 210 (319) |...+.- + ...+.... ..+.+..-.|+.|.++...|+.. +..+-.++++|..+..|.+-.+-.... 
..+ T Consensus 228 d~~~l~G~G~p~Gi~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~l~~~-~~~~a~~vmn~~~~~~L~~lkd~~g~~-l~~ 305 (390) T protein:vir:62 228 GRHFITGTGQPRGILTDASPATATFLATDTDSKVSDALIDLFHEVPSA-YRANAKYVVNDLRAAQMRKLKDANGQY-LWQ 305 (390) T ss_pred HhhhhccCCccccccccccccccceecccccccchHHHHHHHHhhhhh-hhcCCEEEEchHHHHHHHHhhccCCCe-eec Confidence 8875531 0 00000001 11111223477788888888664 333556899999998886532211100 112 Q ss_pred cceeeeeeeeecCeEEEEecccccccceEEEEcCCceeeeeeeeeeeeecCCC---CCccceeeeeeeeeEEEeccccce Q lcl|Aclame:pro 211 QVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQADLAKTNSNIP---GMFGTLAEQLLYTGAFVPEHLQKY 287 (319) Q Consensus 211 ~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~~~k~~~~~~~~~~~---~~~~~~v~gr~~yg~~V~~~k~~~ 287 (319) .....|.-+.|.|.+|+.++ .++...+++|.-+....... ..+++..... ..+...++....+|+.|++|++.. T Consensus 306 ~~~~~g~~~~l~G~Pv~~~~--~~p~~~i~~gd~s~~~i~~~-~~~~v~~~~~~~~~~~~~~~~~~~r~d~~~~~~~A~~ 382 (390) T protein:vir:62 306 SGLTVGAPSLFNGKVVETDD--GMPADKILFADLSKYRVRFA-GSLRVDRSVDAKFSTDQIVYRFLQRADGLLVDARGAK 382 (390) T ss_pred CCcCCCccceecccceEEec--CCCCccEEEeeccceeEEee-cceEEEeeccccccCCcEEEEEEEEeCcEeechhheE Confidence 23445666689999998643 34444455565443222211 2233221111 123457788889999999999854 Q ss_pred EEEEccccccC Q lcl|Aclame:pro 288 IFTIGGTEVAT 298 (319) Q Consensus 288 Iy~~~~~~~a~ 298 (319) +...+. 
++ T Consensus 383 ~l~~~~---~a 390 (390) T protein:vir:62 383 VLTVTP---GA 390 (390) T ss_pred EEEeec---CC Confidence 443332 22 No 96 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=98.15 E-value=3.5e-06 Score=50.50 Aligned_cols=294 Identities=10% Similarity=-0.002 Sum_probs=128.4 Q ss_pred CCcc--------cccccceee----ehhhhhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCc--eE Q lcl|Aclame:pro 1 MNKT--------IKNATGMLK----LNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGR--SF 66 (319) Q Consensus 1 ~~~~--------~~~~~~~~~----~~~~~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~--tV 66 (319) +... .+....+|. ..++..+..-..-....+++.+...+-+.....+..... ++. ....+. .+ T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~-~~~--~~~~~~~~~~ 155 (397) T protein:vir:48 79 LTKSEEEVKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEY-VNV--ENVTTLTGSR 155 (397) T ss_pred ccchhhHHHHHHHHHHHHHHhhhhhHHHHHhhccCCccccccccHHHHHHHHHHHHHHHHHHhh-hce--eeccCCcceE Confidence 0000 000000000 000000000000011224445532222222222222211 221 112233 33 Q ss_pred Eeeeccc-cccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 67 TVMKGDT-TELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFA 145 (319) Q Consensus 67 kIp~i~~-~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s 145 (319) .++.... .+.......++-..+.-+.++..++++-.+...+ +.--+.--......+.+.+.+..+.+++-.+|..++. 
T Consensus 156 ~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~-~~iS~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il~ 234 (397) T protein:vir:48 156 VYEKWADITGLAKLDDEAGSIGTNDDPKLYPIRYAIKRYAGI-STVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILE 234 (397) T ss_pred EEEeecCCCcceeeeccccccccccccceeeEEeeheeeeee-hhhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 3444433 2333333222222222223455555555554433 2222211111123456667778888888888887654 Q ss_pred HHHhccCccccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeE Q lcl|Aclame:pro 146 TLARNKAKHLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFV 225 (319) Q Consensus 146 ~la~~a~~~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~ 225 (319) - .++. ...+...-++.|.++..+|..+..+ +-.++++|..+..|++-.+-.... ..+.....|.-++|.|.| T Consensus 235 G----~g~~--~~~~~~~~~d~i~~~~~~l~~~~~~-~a~~v~n~~~~~~L~~lkd~~G~~-i~~~~~~~~~~~~l~G~P 306 (397) T protein:vir:48 235 A----IATL--PTKPTLTKWDDIIDLQAKVDPAIKQ-TSFFLTNTSGFTALKKVKNAFGDY-LMERDVKSPTGYSIDGFA 306 (397) T ss_pred c----cccc--ccccccccHHHHHHHHHHhhhhhcC-CCEEEECHHHHHHHHHhhcCCCce-eeccCcCCCCCceeccce Confidence 2 1111 1112233467778888888876554 446789999999997643211110 112223456667899999 Q ss_pred EEEeccccc-----ccceEEEEcCC-ceeeeee-eeeeeeecCCCC---CccceeeeeeeeeEEEeccccceEEEEcccc Q lcl|Aclame:pro 226 IVKVPTKLL-----QGLQAIAVVGE-VLASPIQ-ADLAKTNSNIPG---MFGTLAEQLLYTGAFVPEHLQKYIFTIGGTE 295 (319) Q Consensus 226 I~~vps~~~-----~~~n~i~~~~~-A~~~~~k-~~~~~~~~~~~~---~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~ 295 (319) |+.+++..+ ....+++|.-+ +.....+ =-.+++.+-..+ .+.-.++.-.++|..+.+|+...+..-..++ T Consensus 307 V~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~ 386 (397) T protein:vir:48 307 VKEVADRWLANASSGAMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKIRVIDRFDVVATDTESFVPASFKAIA 386 (397) T ss_pred eEEecccccCCcCCCceEEEEEeccceEEEEeecceEEEEeccchhhhhcCceeEEEEeeeccEEecccceEEEEecccc Confidence 987765433 34567777644 4443332 222333321111 2335778888899999999874332222221 Q ss_pred ccCCCCCcccc Q 
lcl|Aclame:pro 296 VATKRDGVDAH 306 (319) Q Consensus 296 ~a~~~~~~~~~ 306 (319) .++...+..+- T Consensus 387 ~~~~~~~~~~~ 397 (397) T protein:vir:48 387 DQKGNLGSTAV 397 (397) T ss_pred cCCCCccccCC Confidence 11111111111 No 97 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=98.14 E-value=3.8e-06 Score=50.32 Aligned_cols=266 Identities=11% Similarity=-0.044 Sum_probs=128.3 Q ss_pred CCcccccccceeeehhhhhhhhhcchhhhhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDY 79 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY 79 (319) |- .. ...-.-+++.+. .+++.+...+.+ ... +. .+...+.+++||.....+-..+ T Consensus 1 m~-------------t~-------t~gg~liP~~~~~~ii~~l~~~s~i-~~l-~~--~~~~~~~~~~ip~~~~~~~a~w 56 (303) T protein:vir:97 1 MG-------------TE-------TSKASLFDKHLVSDLINKVKGHSSL-AKL-SS--QKPIPFNGSKEFTFTLDSDIDV 56 (303) T ss_pred Cc-------------cc-------CCCCeEcchhHHHHHHHHHHhhchh-hhh-cc--eeecCCCceEEEEEecCcceEE Confidence 11 00 111122333342 333333322222 222 21 2445677899999866554554 Q ss_pred cCCC-CcccCCcccceeEEEEeecccceeecchhhHH--HH-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCc-- Q lcl|Aclame:pro 80 KRNA-TNEFDHPKIEETTYFLDQEKYWGRFVDALDRK--DT-EGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAK-- 153 (319) Q Consensus 80 ~r~~-~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~--et-~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~-- 153 (319) ...+ .....+++.+..++.. .|.-. .+.--++- ++ .....+.+.+.+..+.+++-.+|+-.+.-.-...+. 
T Consensus 57 v~E~~~~~~s~~~f~~v~l~~--~kl~~-~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~ 133 (303) T protein:vir:97 57 VAENGKKTHGGLSLEPVTIVP--IKVEY-GARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKAS 133 (303) T ss_pred eecCccccccccceeeEEeee--EEEEE-eehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCcccc Confidence 4322 2333344444444433 33222 22222210 00 111344566778888888888888665321000000 Q ss_pred -------------cccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeee Q lcl|Aclame:pro 154 -------------HLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGE 220 (319) Q Consensus 154 -------------~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~ 220 (319) ......+....|+.|.++...+...+.... .++++|..+..|++-.+-....-...+....+..++ T Consensus 134 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~-~~vmn~~~~~~L~~lkd~~g~~~~~~~~~~~~~~~~ 212 (303) T protein:vir:97 134 DVIGTNHFDSKVTQVVKFTESEDADANIEAAVNLIQGAEGVVT-GLAMDTEFSTALAKVTNGEMGPKMYPELAWGANPDS 212 (303) T ss_pred ccccccccccccccccccccccchHHHHHHHHHHHhhcCCCcc-EEEEcHHHHHHHHHhhccCCCeEEecCccCCCCCce Confidence 001112345678999999988877665533 388899999888654321111011112223445578 Q ss_pred ecCeEEEEeccc---cc---ccceEEEEc-CC-ceeeeeeeeeeeeecCC-CC-----Cc---cceeeeeeeeeEEEecc Q lcl|Aclame:pro 221 LDGFVIVKVPTK---LL---QGLQAIAVV-GE-VLASPIQADLAKTNSNI-PG-----MF---GTLAEQLLYTGAFVPEH 283 (319) Q Consensus 221 idG~~I~~vps~---~~---~~~n~i~~~-~~-A~~~~~k~~~~~~~~~~-~~-----~~---~~~v~gr~~yg~~V~~~ 283 (319) |.|.+|+.+.+- .. ...-+++|. .. ......+--.+++.+.. 
++ .| --.++...++|..|++| T Consensus 213 l~G~Pv~~s~~v~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p 292 (303) T protein:vir:97 213 INGLKSSVNTTVGAGADEAESKDLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWGILDA 292 (303) T ss_pred ecceeeEEecccCCccccCCCccEEEEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEeccEeecc Confidence 999999864321 10 112244443 22 33444444444444310 11 12 23678888999999999 Q ss_pred ccceEEEEcccccc Q lcl|Aclame:pro 284 LQKYIFTIGGTEVA 297 (319) Q Consensus 284 k~~~Iy~~~~~~~a 297 (319) ++.... +.. +. T Consensus 293 ~af~~l--~~~-~~ 303 (303) T protein:vir:97 293 KSFARV--TKG-EV 303 (303) T ss_pred cceEEe--eCC-CC Confidence 874332 222 22 No 98 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=98.09 E-value=4.8e-06 Score=49.74 Aligned_cols=279 Identities=8% Similarity=-0.087 Sum_probs=130.6 Q ss_pred CCcccccccceeeehhhhhhhhhc----chhhhhhhHhh-HHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSV----EPGQTLLKNKH-VGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTE 75 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~----~~n~~~l~~ky-~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g 75 (319) +.+......+...+....+.+... .....-+...+ ..+++.+.....+.. +++ ....++.++++|.+.... 
T Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~--~~~--~~~~~~~~~~~~~~~~~~ 166 (390) T protein:vir:81 91 SAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRD--LIG--SGRTDSALIEYVQETGFV 166 (390) T ss_pred HHHHHhhhhhhhhhHHHHHHHhhccccccCCcceechhhhHHHHHHHhhhhhhhh--hcc--eeeccCCceEEEEEecCC Confidence 000000111111112212111110 00000112222 233433333332221 122 233467788999886543 Q ss_pred -cccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-c--- Q lcl|Aclame:pro 76 -LKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLAR-N--- 150 (319) Q Consensus 76 -~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~-~--- 150 (319) -..+. ..+-.....+.++...+++-.+.-.+. . +..+-......+...+.+..+..++-.+|..++.---. . T Consensus 167 ~~a~~v-~Eg~~~~~~~~~~~~i~~~~~k~~~~~-~-is~ell~d~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~ 243 (390) T protein:vir:81 167 NNAAIV-AEGALKPESSLKFAKKTDTTHVIAHTM-K-ATRQILSDAPQLASYMNNRLIRGLKVKEDAEILRGTGANDGLL 243 (390) T ss_pred cceeee-cCCcccccccceeeEEEEeeeEEEEee-h-hhHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCccc Confidence 23332 222223333344555555555544432 1 11111111123455566777778888888765531000 0 Q ss_pred -----cCc-cccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCe Q lcl|Aclame:pro 151 -----KAK-HLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGF 224 (319) Q Consensus 151 -----a~~-~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~ 224 (319) ++. ..+...+....++.|.++...+...+.+.. .++|+|..+..|.+-.+-.... ..+. ...+..++|.|. 
T Consensus 244 Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~v~~~~~~~~l~~lkd~~G~~-l~~~-~~~~~~~~l~G~ 320 (390) T protein:vir:81 244 GLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYNPS-GIVINPIDWAAIELAKDANNQY-LIGN-ARGTLTPTLWGL 320 (390) T ss_pred ceeecccccccccccccchhHHHHHHHHHhhccccCCCC-EEEEcHHHHHHHHHhhcCCCce-eecC-cccccCceecce Confidence 000 111223345678889999999988777654 4788999998887543211100 1111 233445689999 Q ss_pred EEEEecccccccceEEEEcCC-ceeeeeeeeeeeeecCCCC----CccceeeeeeeeeEEEeccccceEEEEcc Q lcl|Aclame:pro 225 VIVKVPTKLLQGLQAIAVVGE-VLASPIQADLAKTNSNIPG----MFGTLAEQLLYTGAFVPEHLQKYIFTIGG 293 (319) Q Consensus 225 ~I~~vps~~~~~~n~i~~~~~-A~~~~~k~~~~~~~~~~~~----~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~ 293 (319) +|+.+ +.++...+++|.-+ +.....+ ..+.+....+. .+.-.++...++|..|.+|++. +.+..+ T Consensus 321 pv~~~--~~~p~~~~~~gd~~~~~~~~~~-~~~~v~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~-v~~t~a 390 (390) T protein:vir:81 321 PVVAT--QAMAPGEFLVGAFDLAAQIFDQ-WDARVEIGYVGEDFQRNMITVLAEERLALVVYRPEAL-ISGSFA 390 (390) T ss_pred eeEEc--CCCCCCcEEEEehhceEEEEEe-cceEEEEecccchhhcCcEEEEEEEeeccEEecccce-EEEEeC Confidence 99864 34555556666654 3333332 12222211121 1234678888999999999984 334443 No 99 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=98.09 E-value=4.8e-06 Score=49.73 Aligned_cols=266 Identities=12% Similarity=-0.015 Sum_probs=132.9 Q ss_pred CCcccccccceeeehhhhhhhhhcchhhhhhhHhh-HHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKH-VGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDY 79 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~l~~ky-~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY 79 (319) |--+..+..+ + +...+ ..+++.+...+.+.. ++. 
.+...+..++||......-..+ T Consensus 1 ma~~t~~~G~------------------l-ip~~~~~~ii~~l~~~s~i~~--l~~--~~~~~~~~~~~p~~~~~~~a~w 57 (300) T protein:vir:95 1 MSEAQLSKGN------------------L-FNPELVTKVINKVKGHSSIAK--LSP--QKPIPFNGQREFVFDFDSDIDI 57 (300) T ss_pred CcccccCCcc------------------e-echhhHHHHHHHHHhhhhhhh--hcc--eeeccCCceEEEEEecCcceEE Confidence 3333322211 1 11122 234444443333322 121 2445566789998865544444 Q ss_pred cCCC-CcccCCcccceeEEEEeecccceeecchhhHHH--H-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC--- Q lcl|Aclame:pro 80 KRNA-TNEFDHPKIEETTYFLDQEKYWGRFVDALDRKD--T-EGNIDINYVVARQGAEVVAPYLDNLRFATLARNKA--- 152 (319) Q Consensus 80 ~r~~-~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~e--t-~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~--- 152 (319) ...+ .....+++.+..+++. .|.-. .+.--+.-. + .....+.+...+..+++++-.+|...+.-.-...+ T Consensus 58 v~Eg~~~~~s~~~f~~v~l~~--~k~~~-~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~ 134 (300) T protein:vir:95 58 VAENGKKTHGGVSLDPVTIVP--LKVEY-GARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQAS 134 (300) T ss_pred eeCCcccccccccceeeEeee--EEEEE-eehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCc Confidence 4322 2222334444444433 33222 222222111 0 11234556677788889999999987632110000 Q ss_pred -----------ccccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeee Q lcl|Aclame:pro 153 -----------KHLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGEL 221 (319) Q Consensus 153 -----------~~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~i 221 (319) ...+...+....|+.|.++..++...+.... .++++|..+..|.+-.+-..+. 
........|..++| T Consensus 135 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~-~~vmn~~~~~~L~~lkd~~G~~-i~~~~~~~~~~~~l 212 (300) T protein:vir:95 135 TIIGDNCFDKKVTQTVPFKDTNPDESMEDAVGMIDGSERDIT-GAILDPIFTTALSKMKNAEGGK-LYPELAWGGVPDAI 212 (300) T ss_pred ccccccccccccceeecccccchHHHHHHHHHHhhhcCCCcc-EEEECHHHHHHHHHhhccCCCe-eccCccccCCCcee Confidence 0111223456778999999999988765533 4789999998887544211110 11233445677899 Q ss_pred cCeEEEEeccc----ccccceEEEEcCC-ce-eeeeeeeeeeeecCC-CC--------CccceeeeeeeeeEEEeccccc Q lcl|Aclame:pro 222 DGFVIVKVPTK----LLQGLQAIAVVGE-VL-ASPIQADLAKTNSNI-PG--------MFGTLAEQLLYTGAFVPEHLQK 286 (319) Q Consensus 222 dG~~I~~vps~----~~~~~n~i~~~~~-A~-~~~~k~~~~~~~~~~-~~--------~~~~~v~gr~~yg~~V~~~k~~ 286 (319) .|.+|+.++.- ..++.-++++.-+ ++ ....+--.+++.+.. ++ .+.-.++...++|..|.+|++. T Consensus 213 ~G~Pv~~s~~v~~~~~~~~~~~~~GDf~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~ 292 (300) T protein:vir:95 213 NGLAVDKNRTVSYSQTDPKNTAIVGDFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASF 292 (300) T ss_pred cceeeEEecCCCCCCCCCccEEEEeeccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecceeecccce Confidence 99999864331 1122234555432 22 223332334433210 11 1235778888999999999984 Q ss_pred eEEEEccccccCC Q lcl|Aclame:pro 287 YIFTIGGTEVATK 299 (319) Q Consensus 287 ~Iy~~~~~~~a~~ 299 (319) .....+ +. 
T Consensus 293 ~~l~~~-----~g 300 (300) T protein:vir:95 293 ARIVKT-----GG 300 (300) T ss_pred EEEecC-----CC Confidence 433222 11 No 100 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=98.09 E-value=4.9e-06 Score=49.70 Aligned_cols=295 Identities=8% Similarity=-0.059 Sum_probs=129.0 Q ss_pred CCcccccccceeeehhhhhhhhhcch--hhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeecccc-ccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEP--GQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTT-ELK 77 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~--n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~-g~~ 77 (319) +-+.+++..+.+... ..-|.+...+ ....+++.+...+-+.....+... .+++..-...+...+.|++.... +.. T Consensus 97 ~~~~~~~~~~~~~~~-~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~-~~~~~~~~~~~~~~~~~~~~~~~~~~a 174 (408) T protein:vir:10 97 FVNMVRNPMAFMNTV-SSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQ-QYVRVESVSTSNGSRVYEKWTDVTPLT 174 (408) T ss_pred HHHHhhcchhhhhhh-hhhhhhcccccCCceeccHhHHHHHHHHHHhhchhh-hhcceeeccCCcceEEEeeccccccce Confidence 111111111111000 0000000000 112245555432323322222211 11222112223345666666543 333 Q ss_pred cccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccc Q lcl|Aclame:pro 78 DYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLT 156 (319) Q Consensus 78 DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~ 156 (319) .+...++-.++.-..++..+++...+...+. . +..+-.+. ..++...+.+..+..++-.+|..++.-..+ +.... 
T Consensus 175 ~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~-~-iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~--~~~~~ 250 (408) T protein:vir:10 175 VMDAEDGKIPDLDNPQLTIIKYLIKRYAGII-T-ATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVMKA--APKKP 250 (408) T ss_pred eeecCccccccccCcceeeEEeeeeeEEeee-h-hHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccc--ccccc Confidence 3332222222211224455555555544432 2 22222221 234566667777778888888765542221 11222 Q ss_pred ccCCHhHHHHHHHHHH-HHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEEEEecccccc Q lcl|Aclame:pro 157 VGTGSDAQYDAVLDVS-VELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQ 235 (319) Q Consensus 157 ~~~T~~n~~~~i~~a~-~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~~ 235 (319) ...+ ++.|.++. ..|+. .+..+-.++++|..+..|.+-.+-.... ..+....+|..++|.|+||+.+++..++ T Consensus 251 ~~~~----~~~l~~~~~~~~~~-~~~~~a~~v~n~~~~~~l~~lkd~~G~~-i~~~~~~~~~~~~l~G~PV~~~~~~~~~ 324 (408) T protein:vir:10 251 TIAK----FDDVITMINTAVDP-AIIATSSLLTNQSGLNKLALVKTAEGKY-LLEPDPTKPNSYLIKGKQVIVVADRWLP 324 (408) T ss_pred cccc----HHHHHHHHHHhhhh-hhccCCEEEEcHHHHHHHHHhhccCCce-EeccCcCCCCCceecceeeEEecccccC Confidence 2223 44555544 34544 3344567899999999998654322111 1122234566678999999977654443 Q ss_pred c-----ceEEEEcCC-ceeeeeeeeeeeeecCCC--C---CccceeeeeeeeeEEEeccccceEEEEccc-cccCCCCCc Q lcl|Aclame:pro 236 G-----LQAIAVVGE-VLASPIQADLAKTNSNIP--G---MFGTLAEQLLYTGAFVPEHLQKYIFTIGGT-EVATKRDGV 303 (319) Q Consensus 236 ~-----~n~i~~~~~-A~~~~~k~~~~~~~~~~~--~---~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~-~~a~~~~~~ 303 (319) . ..+++|.-+ +.....+ ..+.+-..++ . .+...++...++|+.|.+|++.. ++...+ +++....+. 
T Consensus 325 ~~~~~~~~i~~gd~~~~~~~~~~-~~~~v~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~-~~~~~~~~~~~~~~~~ 402 (408) T protein:vir:10 325 NTGSTVYPLYYGDMSQAITLFDR-ENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALV-AGSFSAIADQVGNFKT 402 (408) T ss_pred ccCCCceEEEEEehhccEEEEEe-cceEEEEcccccchhhcCceEEEEEEeeccEEeccccEE-EEEeeccccCCCCCCC Confidence 3 336677644 4444443 2233322222 1 23457888899999999998742 233221 111111111 Q ss_pred ccccccccccc Q lcl|Aclame:pro 304 DAHADNVAKPS 314 (319) Q Consensus 304 ~~~~~~~~~~~ 314 (319) + ++-.+ T Consensus 403 ~-----~~~~~ 408 (408) T protein:vir:10 403 T-----TSTAV 408 (408) T ss_pred C-----CcccC Confidence 1 11111 No 101 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=98.05 E-value=6.1e-06 Score=49.19 Aligned_cols=292 Identities=11% Similarity=0.009 Sum_probs=129.3 Q ss_pred CCcccc-------cccceeeehhhhhhhhh--cchhhhhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCceEEeee Q lcl|Aclame:pro 1 MNKTIK-------NATGMLKLNLQHFANKS--VEPGQTLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSFTVMK 70 (319) Q Consensus 1 ~~~~~~-------~~~~~~~~~~~~~~~~~--~~~n~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~ 70 (319) +.+..+ +..++-......-|.+. .....+..++.+. .+++.+.....+.. 
++...-.......+.||+ T Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~--l~~~~~~~~~~g~~~~~~ 160 (404) T protein:vir:10 83 FVRAIADNLLKQKNQRGLNLSEKEINAISENIDEDGGYAVPEDIQTKINTRLKDTTDLYN--MVDYEPVFTRSGSRTYEK 160 (404) T ss_pred HHHHHHHHHHHHHHhhhhcchhhHHhhhccccCCCCceeechhHHHHHHHHHhhhhhHhh--hhceeeccCCccceEEEE Confidence 000000 00000000000000000 0111112334443 23333322222211 122211223345677777 Q ss_pred ccccccccccCCCCcccC-CcccceeEEEEeecccceeecchhhHHHH-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 71 GDTTELKDYKRNATNEFD-HPKIEETTYFLDQEKYWGRFVDALDRKDT-EGNIDINYVVARQGAEVVAPYLDNLRFATLA 148 (319) Q Consensus 71 i~~~g~~DY~r~~~~~~~-~~t~t~~tltidqdr~~~F~VD~~D~~et-~~~~~~~~~~~~~~~~~vapeiD~~~~s~la 148 (319) .....-......++.... ..+.+...++++..+.-.|. .+..+-. .....+.+.+.+..++.++-.+|..++.--- T Consensus 161 ~~~~~~~~~v~e~~~~~~~~~~~~f~~i~~~~~k~~~~~--~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~G~g 238 (404) T protein:vir:10 161 RSKQKPMKPLSENQQIPTNGDNGKLERFNFKLKDLADFM--SIPNDLLKFADKSLEDWIINWFVDKVRITRNAEILYGAG 238 (404) T ss_pred ecCCcceeeccccccccccccccceeeeEeeheeeEeee--hhhHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 644332222222222222 23344455555554443322 1111111 1123456667788888888888887653211 Q ss_pred hcc--------CccccccCCHhHHHHHHHHHHH-HHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeee Q lcl|Aclame:pro 149 RNK--------AKHLTVGTGSDAQYDAVLDVSV-ELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQG 219 (319) Q Consensus 149 ~~a--------~~~~~~~~T~~n~~~~i~~a~~-~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg 219 (319) .+. ....+...+....++.+.+++. .|.. +...+-.++++|..+..|.+-.+..... 
.......+|..+ T Consensus 239 ~~~~~~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~-~~~~~~~~v~n~~~~~~L~~lkd~~G~~-l~~~~~~~~~~~ 316 (404) T protein:vir:10 239 GDEHATGIMTANKFKKITLPKSPALKDFKKCKNVELLN-VFKATSSWIVNQDGFNYLDSLEDKTGRP-YLQPDPKDPTQY 316 (404) T ss_pred CCCcccceeeccccceeeccccccHHHHHHHHHhhhhc-cccCCCEEEEcHHHHHHHHHhhccCCce-eeccCcCCCCCc Confidence 100 0011122334456777777665 3443 4454556899999999887644322111 111223456667 Q ss_pred eecCeEEEEeccccc----ccceEEEEcCC-ceeeee-eeeeeeeecCCCC---CccceeeeeeeeeEEEeccccceEEE Q lcl|Aclame:pro 220 ELDGFVIVKVPTKLL----QGLQAIAVVGE-VLASPI-QADLAKTNSNIPG---MFGTLAEQLLYTGAFVPEHLQKYIFT 290 (319) Q Consensus 220 ~idG~~I~~vps~~~----~~~n~i~~~~~-A~~~~~-k~~~~~~~~~~~~---~~~~~v~gr~~yg~~V~~~k~~~Iy~ 290 (319) +|.|.+|+.+++... .+..+++|+-+ +..... +--.++..+.+.. .+.-.++...++|..|.+|++..+.. T Consensus 317 ~l~G~PV~~~~~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~ 396 (404) T protein:vir:10 317 RFLGLPVIELPNDLLLSTESAIPVLLGDTKEAYKYVSDGAYELATTNIGAGAFETNTTKARIIMRIDGNVKDSEALLIAE 396 (404) T ss_pred cccceeeEEecccccCCCCCccEEEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEE Confidence 899999987766432 23447777654 443332 3222333221111 23457888899999999998843322 Q ss_pred EccccccCCCC Q lcl|Aclame:pro 291 IGGTEVATKRD 301 (319) Q Consensus 291 ~~~~~~a~~~~ 301 (319) -. +++.+. 
T Consensus 397 ~~---~aa~~~ 404 (404) T protein:vir:10 397 IP---VESVQA 404 (404) T ss_pred ee---cccCCC Confidence 22 222221 No 102 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=98.04 E-value=6.3e-06 Score=49.10 Aligned_cols=283 Identities=7% Similarity=-0.133 Sum_probs=127.6 Q ss_pred cccccceeeehhhhhhhhhcchhh--hhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccccC Q lcl|Aclame:pro 5 IKNATGMLKLNLQHFANKSVEPGQ--TLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKR 81 (319) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~n~--~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~r 81 (319) .... -.|+.+.++..-+.... --+...+. .+++.+.....+.. ++. .+...+.+++||......-..+.. T Consensus 1 ~~~~---~~~~~~~~~~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~--~~~--~~~~~~~~~~~p~~~~~~~a~~v~ 73 (320) T protein:vir:10 1 MAAG---TAFQVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQ--FAQ--KVPMGTTGQKIPHWIGDVSAQWIG 73 (320) T ss_pred CCCC---ccCCHHHHHhhccccccccccccHHHHHHHHHHHHhccchhh--hcc--eeeccCCceEEEEEeCCcceEEec Confidence 1111 12233333332221111 11333343 34444433222211 222 344467789999987543333332 Q ss_pred CCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-----------c Q lcl|Aclame:pro 82 NATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLAR-----------N 150 (319) Q Consensus 82 ~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~-----------~ 150 (319) . +-.....+.++.+.++...|.-.+ +.--+.---.....+...+.+..+.+++-.+|...+.---. . 
T Consensus 74 E-~~~~~~~~~~f~~v~~~~~k~~~~-~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~~ 151 (320) T protein:vir:10 74 E-GDMKPITKGNMTSQNIAPHKIATI-FVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTKS 151 (320) T ss_pred C-CccccccccceeEEEEeeEEEEEe-ehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCccccccccc Confidence 2 222223334455555555554443 22222111111245567777888888888898876531000 0 Q ss_pred cCccccccCCHhH--HHH-HHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhh----cccccccceeeeeeeeecC Q lcl|Aclame:pro 151 KAKHLTVGTGSDA--QYD-AVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALP----QGDTRQQVLGKGVQGELDG 223 (319) Q Consensus 151 a~~~~~~~~T~~n--~~~-~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~----~~~~~~~~~~~g~Vg~idG 223 (319) +....+...+.++ .++ .+.++...+..... ..-.++++|..+..|++-.+-.. .............-+.+.| T Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g 230 (320) T protein:vir:10 152 VSLADPGGATASDLTAYDAVAVNGLSLLVNAKK-KWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSPFRAGRIVS 230 (320) T ss_pred ccceecccccccccccHHHHHHHHHhhhhcccC-CCcEEEEcHHHHHHHHHhhccCCceeeccccccCccccccCceeee Confidence 0001111222222 222 34555656655433 35578999999998875332111 0001111111111236889 Q ss_pred eEEEEecccccccceEEEEcCCceee-eeeeeeeeeecC-------CC--------CCccceeeeeeeeeEEEeccccce Q lcl|Aclame:pro 224 FVIVKVPTKLLQGLQAIAVVGEVLAS-PIQADLAKTNSN-------IP--------GMFGTLAEQLLYTGAFVPEHLQKY 287 (319) Q Consensus 224 ~~I~~vps~~~~~~n~i~~~~~A~~~-~~k~~~~~~~~~-------~~--------~~~~~~v~gr~~yg~~V~~~k~~~ 287 (319) ++|+.+++-...+.-++.|+.+-... ..+--.+++.+. ++ .++.-.++...++|..|.+|++.. 
T Consensus 231 ~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a~~ 310 (320) T protein:vir:10 231 RPTILSDHVADGTTVGYMGDFRNVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDAFV 310 (320) T ss_pred eeeEecCCCCCCceEEEEeecceEEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEEecccceE Confidence 99986543222233344555433322 212222222221 00 112246677789999999999865 Q ss_pred EEEEcccccc Q lcl|Aclame:pro 288 IFTIGGTEVA 297 (319) Q Consensus 288 Iy~~~~~~~a 297 (319) ....+.+++| T Consensus 311 ~l~~~~ap~~ 320 (320) T protein:vir:10 311 KLTNVVTPDA 320 (320) T ss_pred EEEeccCCCC Confidence 5555554444 No 103 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=98.03 E-value=3.6e-07 Score=55.89 Aligned_cols=266 Identities=12% Similarity=0.129 Sum_probs=144.6 Q ss_pred hhcchhhhhh--hHhhHHHHHHHHHhhhhhhhcccCc-ceeeeCCceEEeeeccccccccccCCCCcccCCcccceeEEE Q lcl|Aclame:pro 22 KSVEPGQTLL--KNKHVGILERVTAVNAYSTPALISN-DAIFMEGRSFTVMKGDTTELKDYKRNATNEFDHPKIEETTYF 98 (319) Q Consensus 22 ~~~~~n~~~l--~~ky~~lld~~~~~~sl~~~~~~n~-~~~~~~g~tVkIp~i~~~g~~DY~r~~~~~~~~~t~t~~tlt 98 (319) --..+|+.++ .|+|...++..+..+.|--. 
.-| -..|..|++.+||+++++.+..--..+.+.++++++-..++- T Consensus 1 ~~~TSNT~A~I~SE~~s~~I~~~LH~~LL~~~--~~R~V~DF~~G~~L~I~tiGs~~~~~~~E~~~~~~~~i~TGEIt~~ 78 (313) T protein:vir:95 1 MQLTSNTRAFIESEQYSKFILLNLHDGLLPET--FYRNVSDFGSGETLHIKTIGSVTLQEAEEDTPLIYNPIETGEITFQ 78 (313) T ss_pred CcccccchheehhhhHHHHHHHHhhccccchh--hhhhhccCCCCCEEEecccCceeeeccccCCCeeecccccceEEEE Confidence 1123455443 45666655555544444211 123 347889999999999999999888888999999999999999 Q ss_pred EeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-------cCccc---------cccCCHh Q lcl|Aclame:pro 99 LDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARN-------KAKHL---------TVGTGSD 162 (319) Q Consensus 99 idqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~-------a~~~~---------~~~~T~~ 162 (319) |..-++-+++|-+-=.+... .+.+++++..+++.-.-...|--..|+.+ -+..+ ..+.... T Consensus 79 i~~Y~G~A~~vt~~LR~D~~---~I~~~~A~~~AE~~RAI~E~~~TD~L~~G~~~FA~~~~P~~vNG~PH~~V~~~T~~~ 155 (313) T protein:vir:95 79 ITEYKGDAWYVTDDLREDGT---DIDRLMAERAAESTRAIQETFETDFLKTGAEYFAANPGPHNVNGFPHVIVSAETNGV 155 (313) T ss_pred EEeecCChhhhhhhhhhcch---hHHHHhhhcchhhHHHHHHHHhhHHHhhchhhhccCCCCcccccccceEEeccCCce Confidence 98877777766543333322 33445544444443333333333333222 12111 1222334 Q ss_pred HHHHHHHHHHHHHHhccCC-CCcEEEEChHHHHHHhhhhhhhhcccc-c----ccce--eeeeeeeecCeEEEEeccccc Q lcl|Aclame:pro 163 AQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKFVIALPQGDT-R----QQVL--GKGVQGELDGFVIVKVPTKLL 234 (319) Q Consensus 163 n~~~~i~~a~~~Lde~~VP-~~R~l~VsP~~~~~L~~~~~f~~~~~~-~----~~~~--~~g~Vg~idG~~I~~vps~~~ 234 (319) -.+.-+..+.-.|+++++| +||+.+|.|....-|.---..+....- + .++. -...|-++.|..|+. 
|+++ T Consensus 156 ~~~~~~~~~~~~~~~a~~P~~G~v~IvDP~~~~~L~~l~~It~~vt~~~k~I~ESG~A~~~~Fi~~~YG~Di~~--SN~L 233 (313) T protein:vir:95 156 FALKHLIAMRLAFDKANVPAEGRVFIVDPVAEATLNGLVTITHDVTDFGKMILESGMARGQRFIMNLYGWDILT--SNRL 233 (313) T ss_pred ehhhHHHHhhhhhhhccCCccceEEEEcchhhhhhhhhheeecccccccceeeeccCCchhHHHHHHhhhhhhh--hhhh Confidence 4567788888899999999 899999999998888644333321111 1 1111 234566788888874 2222 Q ss_pred c-------------cc-eEEE-EcC----CceeeeeeeeeeeeecCC-CCCccceeeeeeeeeEEEeccccceEEEEccc Q lcl|Aclame:pro 235 Q-------------GL-QAIA-VVG----EVLASPIQADLAKTNSNI-PGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGT 294 (319) Q Consensus 235 ~-------------~~-n~i~-~~~----~A~~~~~k~~~~~~~~~~-~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~ 294 (319) . .. |.++ +.. +-..+-...-+.+-+++. ....-..++-|+=+|.-=++ ..++.+. + T Consensus 234 ~~AN~~D~~tT~~G~~~NlFM~i~D~~~~P~~~AWr~MP~s~~~~~~~~~~~~~~~~~R~G~Gi~R~~--~L~~~~~--~ 309 (313) T protein:vir:95 234 HVANYNDGTTTGNGYVGNLFMCILDDQTKPIMGAWRRMPKSEGERNKDRARDEHVVRCRYGFGIQRLD--TLGLLAT--S 309 (313) T ss_pred hhccccccccccCceeeeeeeeeecccccceeeeeccccccccccccccccccceeeeeecccceeec--ceeEEEe--c Confidence 1 01 4333 222 222222333333433321 11123466666555554433 3333222 1 Q ss_pred cccC Q lcl|Aclame:pro 295 EVAT 298 (319) Q Consensus 295 ~~a~ 298 (319) +++- T Consensus 310 A~~~ 313 (313) T protein:vir:95 310 ATAY 313 (313) T ss_pred cccC Confidence 2222 No 104 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=98.00 E-value=7.4e-06 Score=48.70 Aligned_cols=262 Identities=12% Similarity=-0.013 Sum_probs=132.9 Q ss_pred hcchhhhhhhHhh-HHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccccCCCCcccCCcccceeEEEEee Q lcl|Aclame:pro 23 SVEPGQTLLKNKH-VGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKRNATNEFDHPKIEETTYFLDQ 101 (319) Q Consensus 23 ~~~~n~~~l~~ky-~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~r~~~~~~~~~t~t~~tltidq 101 (319) -+.....-.++.+ ..+++.+...+.+ 
... + ..+...+..++||.....+-..+.. .+-....-+.++...+++- T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~~~s~i-~~l-~--~~~~~~~~~~~ip~~~~~~~a~~v~-E~~~~~~~~~~f~~v~l~~ 75 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVAGKSSI-ARL-S--AQKPIPFNGEKVFTFTMDSEIDVVA-ESGKKTHGGVTLAPQTMVP 75 (298) T ss_pred CcccCcceechhHHHHHHHHHHhhhhh-hhh-c--ceeeccCCceEEEEEecCcceEEec-CCccccccccceeEEEEee Confidence 1222222222223 2344444333222 222 1 1244456778999987655444332 2222222233445555555 Q ss_pred cccceeecchhhHHHH---h-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc-------c---------cccCCH Q lcl|Aclame:pro 102 EKYWGRFVDALDRKDT---E-GNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH-------L---------TVGTGS 161 (319) Q Consensus 102 dr~~~F~VD~~D~~et---~-~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~-------~---------~~~~T~ 161 (319) .|.-.+ +. +..+-. . ....+...+.+..++++.-.+|...+.-.....+.. . ...... T Consensus 76 ~k~a~~-~~-iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (298) T protein:vir:16 76 IKVEYG-AR-ISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGI 153 (298) T ss_pred eeEEEe-eh-hhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCccccccccccccccccccccccccc Confidence 553332 22 222111 1 113445567778888888888887764211111100 0 011123 Q ss_pred hHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEEEEeccc----ccccc Q lcl|Aclame:pro 162 DAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTK----LLQGL 237 (319) Q Consensus 162 ~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~----~~~~~ 237 (319) .+.++.|+++..++..++.+.. .++++|..+..|++-.+-.... ..+.....|..++|.|.+|+.+++- ..... 
T Consensus 154 ~~~~~~i~~~~~~~~~~~~~~~-~~vmn~~~~~~l~~lkd~~G~~-i~~~~~~~~~~~~l~G~PV~~~~~v~~~~~~~~~ 231 (298) T protein:vir:16 154 ADPNGAIENAVELLTGVDADVT-GIAINPSFRSALAKQKDLQDNA-LFPELKWGATPDTINGLPVDVNKTVSDMSLTQRD 231 (298) T ss_pred ccHHHHHHHHHHHhhhcCCCcc-EEEEcHHHHHHHHHhhccCCCe-eecCcccCCCCceecceeeEEecccccccCCCcc Confidence 4568889999999988877644 3778999998887644221110 1133445677789999999964321 11234 Q ss_pred eEEEEcCC-ce-eeeeeeeeeeeecCC-CC--------CccceeeeeeeeeEEEeccccceEEEEccccc Q lcl|Aclame:pro 238 QAIAVVGE-VL-ASPIQADLAKTNSNI-PG--------MFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEV 296 (319) Q Consensus 238 n~i~~~~~-A~-~~~~k~~~~~~~~~~-~~--------~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~ 296 (319) .+++|.=+ ++ ....+--.+++.+.. ++ .+.-.++...++|..|.+|++....... + T Consensus 232 ~~~~GDfs~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~a---t 298 (298) T protein:vir:16 232 RAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEA---N 298 (298) T ss_pred EEEEeeccceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeec---C Confidence 46666532 22 333443445554321 11 1224677888999999999874332211 1 No 105 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=98.00 E-value=7.6e-06 Score=48.64 Aligned_cols=288 Identities=11% Similarity=0.012 Sum_probs=125.6 Q ss_pred CCcccccccce-eeehh---------hhhhhh--------------hcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCc Q lcl|Aclame:pro 1 MNKTIKNATGM-LKLNL---------QHFANK--------------SVEPGQTLLKNKHVGILERVTAVNAYSTPALISN 56 (319) Q Consensus 1 ~~~~~~~~~~~-~~~~~---------~~~~~~--------------~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~ 56 (319) ..+.+....+. ..++. +.+..+ -+.-...-.++.+...+-+.....+..... +. 
T Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~Ii~~l~~~~~i~~~-~~- 173 (425) T protein:vir:95 96 SRQKMQGSKGDVVEMNRLQVREMLKTGEYYKRSEVVEFYEKFRNLRAVAGGELTIPEVVVNRIMDIMGDYTTLYPL-VD- 173 (425) T ss_pred hhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhcccccCceeccHHHHHHHHHHHHhhhhHHHh-hc- Confidence 00000000000 00000 000000 001111123444443232222222222222 21 Q ss_pred ceeeeCCceEEeeeccccccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 57 DAIFMEGRSFTVMKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVA 136 (319) Q Consensus 57 ~~~~~~g~tVkIp~i~~~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~va 136 (319) ..-..| .++||.....+-..+...++-.+..-..+...++++..+.-.+ +.--+.--......+.....+..++.++ T Consensus 174 -~~~~~g-~~~ip~~~~~~~a~~v~E~~~~~~~~~~~f~~i~l~~~k~~~~-~~iS~ell~ds~~~l~~~i~~~l~~~i~ 250 (425) T protein:vir:95 174 -KIRVKG-TTRILVDTDTSPATWIEQSGALPTGDVGTIASIDFDGFKVGKV-TFVDNYLLQDSIINLDDYVTKKIARAIA 250 (425) T ss_pred -eeecCc-eeEEEEecCCccccccccccccccccccccceeeeeheeeeee-ehhhHHHHhccHHHHHHHHHHHHHHHHH Confidence 122233 5689988776666665443322222212344555555554432 2222211111113456677788888999 Q ss_pred HHHHHHHHHHHH-----------hccCccccccCCHhHHHHHHHHHHHHHHhccCC-CCcEEEEChHH-HHHHhhhhhhh Q lcl|Aclame:pro 137 PYLDNLRFATLA-----------RNKAKHLTVGTGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTF-YKGIKKFVIAL 203 (319) Q Consensus 137 peiD~~~~s~la-----------~~a~~~~~~~~T~~n~~~~i~~a~~~Lde~~VP-~~R~l~VsP~~-~~~L~~~~~f~ 203 (319) -.+|..++.--- ..++............|+.++++...+..+..+ .+-+++++|.. +..|..- +.. 
T Consensus 251 ~~~d~~il~G~G~~~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l-~~~ 329 (425) T protein:vir:95 251 KALDLAIVKGTGAANKQPLGIIPSLPPENQVTVEADNNLLKNLVKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEF-SIQ 329 (425) T ss_pred HHHHHHhhccCCCCccccceeecccccccccccccccchHHHHHHHHHhhhhhccccCceEEEEeChHHHHHHHHH-Hhh Confidence 999987664110 000111111112344678888888777665444 45566777664 4433211 111 Q ss_pred hcccccccc--eeeeeeeeecCeEEEEecccccccceEEEEcCCceeeeeeeeeeeeecCCCCCc---cceeeeeeeeeE Q lcl|Aclame:pro 204 PQGDTRQQV--LGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQADLAKTNSNIPGMF---GTLAEQLLYTGA 278 (319) Q Consensus 204 ~~~~~~~~~--~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~~~k~~~~~~~~~~~~~~---~~~v~gr~~yg~ 278 (319) +.. .|.-. .-.+..++|.|.+|+.+ +.++...+++|.-+-..... ...+++-...+..| ...+++..++|. T Consensus 330 kd~-~g~~i~~~~~~~~~~l~G~pvv~~--~~~~~~~i~~Gd~~~~~~~~-~~~~~i~~~~~~~f~~~~~~~~~~~r~d~ 405 (425) T protein:vir:95 330 VDS-NGNVVGKLPNLRTPDLLGLRVVFN--NFLDDDTVLFGEFEQYTLVE-RENITIDSSTHVKFTEDQTAFRGKGRFDG 405 (425) T ss_pred cCC-CCceeeccCCCCCccccceeeEEc--CcCCCccEEEEecccEEEEe-ecceEEEeecccccccCceEEEEEEeeCc Confidence 100 01100 12445567899999863 34444455555543322211 22233322123333 357788889999 Q ss_pred EEeccccceEEEEccccccCC Q lcl|Aclame:pro 279 FVPEHLQKYIFTIGGTEVATK 299 (319) Q Consensus 279 ~V~~~k~~~Iy~~~~~~~a~~ 299 (319) .+.+|++.. 
++.+.++.+.+ T Consensus 406 ~~~~~~a~~-~~~i~~~~~g~ 425 (425) T protein:vir:95 406 KPVKPEAFV-LVTITDPVQGA 425 (425) T ss_pred EeecccceE-EEEecCcCCCC Confidence 999999843 33333322222 No 106 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=97.99 E-value=6.5e-06 Score=49.01 Aligned_cols=284 Identities=11% Similarity=-0.032 Sum_probs=126.8 Q ss_pred CCcccccccc-eeeehhhhhhhhhc------chhhhhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCceEEeeecc Q lcl|Aclame:pro 1 MNKTIKNATG-MLKLNLQHFANKSV------EPGQTLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGD 72 (319) Q Consensus 1 ~~~~~~~~~~-~~~~~~~~~~~~~~------~~n~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~ 72 (319) ..+..++..+ .++-..++...+.. .....-.++.|. .++.+.+...+... .+++ ...+...+.+|... T Consensus 225 ~~~~~~~~~~~~l~~~e~~~~~~~~~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~-~~~~---~~~~~g~~~~~~~~ 300 (543) T protein:vir:81 225 WSKMARNPHAAILTEEEKRAINEVRAMGLTKADGGYLVPFQLDPTVIITSNGSLNDIR-RFAR---QVVATGDVWHGVSS 300 (543) T ss_pred HHHHHHhhHHHHhhhhhhhhhhhhhhcccccccCcccCchhhhhHHHHHHHhhhchhh-hhcc---cccCCcceEEEEec Confidence 0000000000 00000111111100 011111223332 22223222222111 1111 12234567788765 Q ss_pred ccccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHH------ Q lcl|Aclame:pro 73 TTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFAT------ 146 (319) Q Consensus 73 ~~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~------ 146 (319) ..+...+... +-....-+.++...+++-.+.-.+. 
.+..+-.+....+.+.+.+.....++-.+|..++.- T Consensus 301 ~~~~a~~v~E-g~~~~~~~~~~~~i~~~~~k~~~~~--~is~ell~d~~~~~~~i~~~l~~~~~~~~d~ail~G~Gt~~~ 377 (543) T protein:vir:81 301 AAVQWSWDAE-FEEVSDDSPEFGQPEIPVKKAQGFV--PISIEALQDEANVTETVALLFAEGKDELEAVTLTTGTGQGNQ 377 (543) T ss_pred CCcceeeccc-CccccccccccceeeeeeeeeEeee--hhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcc Confidence 5544444322 2222223344555555555544432 122211122245667777788888888888865420 Q ss_pred ----HHhccC--ccccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeee Q lcl|Aclame:pro 147 ----LARNKA--KHLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGE 220 (319) Q Consensus 147 ----la~~a~--~~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~ 220 (319) +...++ ...+...+.+..|+.++++...+.... ..+-.++++|.++..|.+-.+-.... +. .....|..++ T Consensus 378 p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~~~~~~v~n~~~~~~l~~lkd~~G~~-l~-~~~~~g~~~~ 454 (543) T protein:vir:81 378 PTGIVTALAGTAAEIAPVTAETFALADVYAVYEQLAARH-RRQGAWLANNLIYNKIRQFDTQGGAG-LW-TTIGNGEPSQ 454 (543) T ss_pred cccchhhcccccccccccccccccHHHHHHHHHhhhccc-cCCcEEEEcHHHHHHHHHhhcCCCce-ec-cCcCCCCCcc Confidence 000000 011112233456888888888886543 33446889999999887533211000 01 1234556678 Q ss_pred ecCeEEEEeccccc--------ccceEEEEcCCceeeeeeeeeeeeecCCCCC-------ccceeeeeeeeeEEEecccc Q lcl|Aclame:pro 221 LDGFVIVKVPTKLL--------QGLQAIAVVGEVLASPIQADLAKTNSNIPGM-------FGTLAEQLLYTGAFVPEHLQ 285 (319) Q Consensus 221 idG~~I~~vps~~~--------~~~n~i~~~~~A~~~~~k~~~~~~~~~~~~~-------~~~~v~gr~~yg~~V~~~k~ 285 (319) |.|.+|+.+++-.. ....+++|+.+....... ..+++-..+... 
+...++...++|..|++|++ T Consensus 455 l~G~pv~~~~~~~~~~~~~~~~~~~~i~~gd~~~~~i~~~-~~~~i~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~A 533 (543) T protein:vir:81 455 LLGRPVGEAEAMDANWNTSASADNFVLLYGNFQNYVIADR-IGMTVEFIPHLFGTNRRPNGSRGWFAYYRMGADVVNPNA 533 (543) T ss_pred ccceeeEEeccccccccccccCCcceEEEeeccceeEEee-cccEEEEeccccccchhhcCceEEEEEEeeccEeecccc Confidence 99999997643111 123366666654433322 123332211211 23467778889999999987 Q ss_pred ceEEEEccccccC Q lcl|Aclame:pro 286 KYIFTIGGTEVAT 298 (319) Q Consensus 286 ~~Iy~~~~~~~a~ 298 (319) ..+.... +++ T Consensus 534 ~~~l~~~---~~a 543 (543) T protein:vir:81 534 FRLLNVE---TAS 543 (543) T ss_pred eEEEEec---ccC Confidence 4332222 222 No 107 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=97.99 E-value=8e-06 Score=48.51 Aligned_cols=296 Identities=10% Similarity=-0.003 Sum_probs=132.3 Q ss_pred CCcccccccce--eeehhhhhhh-------------h--hcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCC Q lcl|Aclame:pro 1 MNKTIKNATGM--LKLNLQHFAN-------------K--SVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEG 63 (319) Q Consensus 1 ~~~~~~~~~~~--~~~~~~~~~~-------------~--~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g 63 (319) .++.......+ ..-...+|.. + -.....+-+++.+...+-+.....+.... +++..-...+. 
T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~-~~~~~~~~~~~ 152 (397) T protein:vir:49 74 EEKKPLTKSEEEVKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQE-YVNVENVTTLT 152 (397) T ss_pred ccccccccchhHHHHHHHHHHHHHHhcchhHHHHHhhccccccCcccccHhHHHHHHHHHHhhhhHHh-hhceeecccCc Confidence 00000000000 0000000000 0 00111223445554333333222222211 12221122233 Q ss_pred ceEEeeecccc-ccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 64 RSFTVMKGDTT-ELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNL 142 (319) Q Consensus 64 ~tVkIp~i~~~-g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~ 142 (319) .++.||+..+. +...+...++-....-+.++..++++-.|...+. .--+.--......+...+.++.+.+++-.+|.- T Consensus 153 ~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~-~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~a 231 (397) T protein:vir:49 153 GSRVYEKWTDITGLANIDDEAGKIADVDDPKLSLIKYTIKRYAGIS-TVTNSLLADSAENILAWLSGWIAKKVVVTRNKA 231 (397) T ss_pred cceEEEeeccCCcceeeecCccccccccccceeeEEeeeeeEEeee-hhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 45667766553 3344433222222222234444455545444432 111111111123456667778888888888876 Q ss_pred HHHHHHhccCccccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeec Q lcl|Aclame:pro 143 RFATLARNKAKHLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELD 222 (319) Q Consensus 143 ~~s~la~~a~~~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~id 222 (319) ++.-.- .+...... .-|+.|.++...|..+..+ +-.++++|..+..|.+-.+-.... ........|.-++|. 
T Consensus 232 i~~G~g--~~~~~~~~----~~~d~i~~~~~~l~~~~~~-~a~~vmn~~~~~~l~~lkd~~G~~-l~~~~~~~~~~~~l~ 303 (397) T protein:vir:49 232 ILEAIA--ALPTKPTL----TKWDDIIDLEAKVDPAIKQ-TSFFLTNTSGFTALKKVKNALGDY-LMERDVKSPTGYSID 303 (397) T ss_pred HHhhcc--cccccccc----ccHHHHHHHHHhhhhhhcC-CCEEEEcHHHHHHHHHhhcCCCce-eeccCcCCCCCceec Confidence 554221 11111111 2366777888888776554 456889999999887643211110 111223456667899 Q ss_pred CeEEEEeccccc-----ccceEEEEcCC-ceeeeee-eeeeeeecCCCC---CccceeeeeeeeeEEEeccccceEEEEc Q lcl|Aclame:pro 223 GFVIVKVPTKLL-----QGLQAIAVVGE-VLASPIQ-ADLAKTNSNIPG---MFGTLAEQLLYTGAFVPEHLQKYIFTIG 292 (319) Q Consensus 223 G~~I~~vps~~~-----~~~n~i~~~~~-A~~~~~k-~~~~~~~~~~~~---~~~~~v~gr~~yg~~V~~~k~~~Iy~~~ 292 (319) |.||+.+++..+ ....+++|.-+ +.....+ =-.++..+-..+ .+...++...++|..+++|++..+..-. T Consensus 304 G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~ 383 (397) T protein:vir:49 304 GFAVKEVADRWLANGTGGAMPLYFGDLKQAVTLFDRQHMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFK 383 (397) T ss_pred ceeeEEecccccccccCCceeEEEeeccceEEEEeecceEEEEeccccchhhcCceeEEEEeeeCcEEecccceEEEEee Confidence 999987654333 34457777544 4444333 222333221111 2335788888999999999874333223 Q ss_pred cccccCCCCCcccc Q lcl|Aclame:pro 293 GTEVATKRDGVDAH 306 (319) Q Consensus 293 ~~~~a~~~~~~~~~ 306 (319) .++..+...|.++- T Consensus 384 ~~~~~~~~~~~~~~ 397 (397) T protein:vir:49 384 AIADQKGNLGSTAV 397 (397) T ss_pred cccCCCCCcccccC Confidence 22333322333333 No 108 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=97.97 E-value=8.8e-06 Score=48.29 Aligned_cols=290 Identities=11% Similarity=0.059 Sum_probs=123.0 Q ss_pred CCccccccc-ceeeehhhhhhh----------h--hcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEE Q lcl|Aclame:pro 1 MNKTIKNAT-GMLKLNLQHFAN----------K--SVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFT 67 (319) Q Consensus 1 
~~~~~~~~~-~~~~~~~~~~~~----------~--~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVk 67 (319) +........ ..-....+.|.. . -........++.+...+.+......+.. .++ .....+...+ T Consensus 125 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~g~lvp~~~~~~i~~~~~~~~l~~--~~~--~~~~~~~~~~ 200 (437) T protein:vir:10 125 LQDMKLKVGGEIADKKVTAFADYLKTGEVRDVTGIALKDGKVIIPETILTPEKEVHQFPRLGS--LVR--TESVTTTTGK 200 (437) T ss_pred HhHHHHHHHHHHHHhhhhhhHHHHHhhhhhhhhhcccccccccchHHHHHHHHHhhhhhhhhh--cce--eEeeccCcee Confidence 000000000 000000000000 0 0011111233344444433322222111 111 1223445677 Q ss_pred eeecccc-ccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhh-hHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 68 VMKGDTT-ELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDINYVVARQGAEVVAPYLDNLRFA 145 (319) Q Consensus 68 Ip~i~~~-g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~~~~~~~~~~~vapeiD~~~~s 145 (319) +|..... +...+...++...+.-+.++..+++...+...+ + .+..+-... ..++...+.+..+..+.-.+|..++. T Consensus 201 ~~~~~~~~~~~~~~~e~~~~~e~~~~~~~~v~~~~~k~~~~-~-~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~ 278 (437) T protein:vir:10 201 LPIFNNSTDLLTAHTEYGQTTKNATPVITPILWDLKTYTGG-Y-VFSQELISDSSYDWQAELQSRLIELRDNTDDSLIIT 278 (437) T ss_pred eEEeeccccccccccccccccccccccceeeeeehhheeee-h-hhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 7766433 233332222222222234555666665554433 2 222111111 12445566677777787777776655 Q ss_pred HHHhccCccccccCCHhHHHHHHHHHHH-HHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCe Q lcl|Aclame:pro 146 TLARNKAKHLTVGTGSDAQYDAVLDVSV-ELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGF 224 (319) Q Consensus 146 ~la~~a~~~~~~~~T~~n~~~~i~~a~~-~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~ 224 (319) -...+ .. ..+....++.|.++.. .|+.. +..+-.++++|..+..|.+-.+-.... .......+|.-++|.|. 
T Consensus 279 g~g~~----~~-~~~~~~~~~~~~~~~~~~l~~~-~~~~~~~~~~~~~~~~l~~lkd~~g~~-~~~~~~~~~~~~~l~G~ 351 (437) T protein:vir:10 279 ALTDG----IK-KTTSTYLLGDLKKVLNVTLKPQ-DSAAASIVMSQSAYNLFDMATDAMGRP-LLQPNVTAATGYTLLGK 351 (437) T ss_pred hhccc----cc-ccccccchhhHHHHHHhhhhhh-hhcCCEEEEcHHHHHHHHHhhccCCCe-eeccCccCCCCcccccc Confidence 32211 11 1122223344444432 45443 333456799999999887643211100 11122335566789999 Q ss_pred EEEEecccccc-----cceEEEEcCC-ceeeee-eeeeeeeecCCCCCccceeeeeeeeeEEEeccccceEEEEcccccc Q lcl|Aclame:pro 225 VIVKVPTKLLQ-----GLQAIAVVGE-VLASPI-QADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVA 297 (319) Q Consensus 225 ~I~~vps~~~~-----~~n~i~~~~~-A~~~~~-k~~~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a 297 (319) ||+.+++..++ +..+++|.-+ ++.... +--.++..+ .-..+...+++-..+|..|++|++..+.. ...++. T Consensus 352 pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~r~~~~~~~~~-~~~~~~~~~~~~~r~d~~~~~~~a~~~l~-~~~~~~ 429 (437) T protein:vir:10 352 TVVIVDDKLFPSASAGDVNIVVAPLKKAVINFKLTEITGQFQD-TYDIWYKQLGIFLRQNVVQASKDLIVNLT-GKLKAV 429 (437) T ss_pred eeEEecccccCCcCCCceEEEEeeccccEEEEeeeceEEEEec-ccccccceeeEEEEEccEEecccceEEEE-eecccc Confidence 99987654332 3346666543 443332 222222221 12234456666777899999999854332 222222 Q ss_pred CCCCCccc Q lcl|Aclame:pro 298 TKRDGVDA 305 (319) Q Consensus 298 ~~~~~~~~ 305 (319) +...+.++ T Consensus 430 ~~~~~~~~ 437 (437) T protein:vir:10 430 TVVQSTAV 437 (437) T ss_pred ccCCCCCC Confidence 22233333 No 109 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=97.96 E-value=9e-06 Score=48.26 Aligned_cols=288 Identities=11% Similarity=0.008 Sum_probs=127.0 Q ss_pred CCccccc--ccceeeehhhhhhhhh---------cch-hhhhhhHhhHH-HHHHHHHhhhhhhhcccCcceeeeCCceEE Q lcl|Aclame:pro 1 MNKTIKN--ATGMLKLNLQHFANKS---------VEP-GQTLLKNKHVG-ILERVTAVNAYSTPALISNDAIFMEGRSFT 67 (319) Q Consensus 1 
~~~~~~~--~~~~~~~~~~~~~~~~---------~~~-n~~~l~~ky~~-lld~~~~~~sl~~~~~~n~~~~~~~g~tVk 67 (319) +.+...+ ..+.++-.-..|-... ..+ .....++.|.. +++.....+.+.. +++..-...+..+.. T Consensus 76 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~--~~~~~~~~~~~~~~~ 153 (392) T protein:vir:10 76 YRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQ--YVTVEPVRTRSGSRV 153 (392) T ss_pred HHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhh--hceeeeccCCceeEE Confidence 0000000 0011111111111000 011 12234555543 3333333222221 122111222333566 Q ss_pred eeeccccccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhh-hHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 68 VMKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDINYVVARQGAEVVAPYLDNLRFAT 146 (319) Q Consensus 68 Ip~i~~~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~~~~~~~~~~~vapeiD~~~~s~ 146 (319) ||.....+-..+...++-..+.-+.+...++++-.|...+. . +..+-... ...+...+.+..++++.-.+|..++.- T Consensus 154 ~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~-~-iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g 231 (392) T protein:vir:10 154 LEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGIL-P-LSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGV 231 (392) T ss_pred EEeecCCccceeecccccccccccccceeEEeeeeeEEEee-h-hhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 66655443333332222222221234445555444433322 1 22111111 234566777888888888888876542 Q ss_pred HHhccCccccccCCHhHHHHHHHHHH-HHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeE Q lcl|Aclame:pro 147 LARNKAKHLTVGTGSDAQYDAVLDVS-VELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFV 225 (319) Q Consensus 147 la~~a~~~~~~~~T~~n~~~~i~~a~-~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~ 225 (319) .... .....++ |+.|.++. ..|+... -.+-.++++|..+..|.+-.+-.... 
.......+|.-++|.|.+ T Consensus 232 ~g~~---~~~~~~~----~d~i~~~~~~~l~~~~-~~~a~~vm~~~~~~~L~~lkd~~G~~-l~~~~~~~~~~~tllG~~ 302 (392) T protein:vir:10 232 IEKL---TKQAIKS----LDDIKDVLNVKLDPAI-SPNAILLTNQDGFNYLDKLKDKDGKY-ILQSDPTQKNKKLFAGTN 302 (392) T ss_pred cccc---cccCccC----HHHHHHHHHHhhhhhh-ccCCEEEEcHHHHHHHHHhhccCCCe-EeecCccCCccccccCcc Confidence 2111 1112222 44555554 3454433 33566899999999997643211100 011122345567889987 Q ss_pred EEE-eccccc-------ccceEEEEcCC-ceeeeeeeeeeeeecCCCC--C---ccceeeeeeeeeEEEeccccceEEEE Q lcl|Aclame:pro 226 IVK-VPTKLL-------QGLQAIAVVGE-VLASPIQADLAKTNSNIPG--M---FGTLAEQLLYTGAFVPEHLQKYIFTI 291 (319) Q Consensus 226 I~~-vps~~~-------~~~n~i~~~~~-A~~~~~k~~~~~~~~~~~~--~---~~~~v~gr~~yg~~V~~~k~~~Iy~~ 291 (319) ++. +++..+ .+..+++|.-+ +.....+ ..+.+-..++. . +.-.++...++|..|.+|+......- T Consensus 303 ~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~-~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~ 381 (392) T protein:vir:10 303 PVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKR-EDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEI 381 (392) T ss_pred cEEEecccccCCCcccCCceEEEEEehhceEEEEee-cceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEe Confidence 664 333322 23445666544 2222221 22222211221 1 23468888889999999988655445 Q ss_pred ccccccCCCCC Q lcl|Aclame:pro 292 GGTEVATKRDG 302 (319) Q Consensus 292 ~~~~~a~~~~~ 302 (319) ..++|+..++| T Consensus 382 ~~~a~~~~~~~ 392 (392) T protein:vir:10 382 DLSAPVEQPQG 392 (392) T ss_pred cccccccCCCC Confidence 56677777788 No 110 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=97.96 E-value=9e-06 Score=48.26 Aligned_cols=288 Identities=11% Similarity=0.008 Sum_probs=127.0 Q ss_pred CCccccc--ccceeeehhhhhhhhh---------cch-hhhhhhHhhHH-HHHHHHHhhhhhhhcccCcceeeeCCceEE Q lcl|Aclame:pro 1 MNKTIKN--ATGMLKLNLQHFANKS---------VEP-GQTLLKNKHVG-ILERVTAVNAYSTPALISNDAIFMEGRSFT 67 (319) Q Consensus 1 
~~~~~~~--~~~~~~~~~~~~~~~~---------~~~-n~~~l~~ky~~-lld~~~~~~sl~~~~~~n~~~~~~~g~tVk 67 (319) +.+...+ ..+.++-.-..|-... ..+ .....++.|.. +++.....+.+.. +++..-...+..+.. T Consensus 76 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~--~~~~~~~~~~~~~~~ 153 (392) T protein:vir:10 76 YRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQ--YVTVEPVRTRSGSRV 153 (392) T ss_pred HHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhh--hceeeeccCCceeEE Confidence 0000000 0011111111111000 011 12234555543 3333333222221 122111222333566 Q ss_pred eeeccccccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhh-hHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 68 VMKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDINYVVARQGAEVVAPYLDNLRFAT 146 (319) Q Consensus 68 Ip~i~~~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~~~~~~~~~~~vapeiD~~~~s~ 146 (319) ||.....+-..+...++-..+.-+.+...++++-.|...+. . +..+-... ...+...+.+..++++.-.+|..++.- T Consensus 154 ~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~-~-iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g 231 (392) T protein:vir:10 154 LEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGIL-P-LSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGV 231 (392) T ss_pred EEeecCCccceeecccccccccccccceeEEeeeeeEEEee-h-hhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 66655443333332222222221234445555444433322 1 22111111 234566777888888888888876542 Q ss_pred HHhccCccccccCCHhHHHHHHHHHH-HHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeE Q lcl|Aclame:pro 147 LARNKAKHLTVGTGSDAQYDAVLDVS-VELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFV 225 (319) Q Consensus 147 la~~a~~~~~~~~T~~n~~~~i~~a~-~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~ 225 (319) .... .....++ |+.|.++. ..|+... -.+-.++++|..+..|.+-.+-.... 
.......+|.-++|.|.+ T Consensus 232 ~g~~---~~~~~~~----~d~i~~~~~~~l~~~~-~~~a~~vm~~~~~~~L~~lkd~~G~~-l~~~~~~~~~~~tllG~~ 302 (392) T protein:vir:10 232 IEKL---TKQAIKS----LDDIKDVLNVKLDPAI-SPNAILLTNQDGFNYLDKLKDKDGKY-ILQSDPTQKNKKLFAGTN 302 (392) T ss_pred cccc---cccCccC----HHHHHHHHHHhhhhhh-ccCCEEEEcHHHHHHHHHhhccCCCe-EeecCccCCccccccCcc Confidence 2111 1112222 44555554 3454433 33566899999999997643211100 011122345567889987 Q ss_pred EEE-eccccc-------ccceEEEEcCC-ceeeeeeeeeeeeecCCCC--C---ccceeeeeeeeeEEEeccccceEEEE Q lcl|Aclame:pro 226 IVK-VPTKLL-------QGLQAIAVVGE-VLASPIQADLAKTNSNIPG--M---FGTLAEQLLYTGAFVPEHLQKYIFTI 291 (319) Q Consensus 226 I~~-vps~~~-------~~~n~i~~~~~-A~~~~~k~~~~~~~~~~~~--~---~~~~v~gr~~yg~~V~~~k~~~Iy~~ 291 (319) ++. +++..+ .+..+++|.-+ +.....+ ..+.+-..++. . +.-.++...++|..|.+|+......- T Consensus 303 ~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~-~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~ 381 (392) T protein:vir:10 303 PVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKR-EDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEI 381 (392) T ss_pred cEEEecccccCCCcccCCceEEEEEehhceEEEEee-cceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEe Confidence 664 333322 23445666544 2222221 22222211221 1 23468888889999999988655445 Q ss_pred ccccccCCCCC Q lcl|Aclame:pro 292 GGTEVATKRDG 302 (319) Q Consensus 292 ~~~~~a~~~~~ 302 (319) ..++|+..++| T Consensus 382 ~~~a~~~~~~~ 392 (392) T protein:vir:10 382 DLSAPVEQPQG 392 (392) T ss_pred cccccccCCCC Confidence 56677777788 No 111 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=97.96 E-value=9e-06 Score=48.26 Aligned_cols=288 Identities=11% Similarity=0.008 Sum_probs=127.0 Q ss_pred CCccccc--ccceeeehhhhhhhhh---------cch-hhhhhhHhhHH-HHHHHHHhhhhhhhcccCcceeeeCCceEE Q lcl|Aclame:pro 1 MNKTIKN--ATGMLKLNLQHFANKS---------VEP-GQTLLKNKHVG-ILERVTAVNAYSTPALISNDAIFMEGRSFT 67 (319) Q Consensus 1 
~~~~~~~--~~~~~~~~~~~~~~~~---------~~~-n~~~l~~ky~~-lld~~~~~~sl~~~~~~n~~~~~~~g~tVk 67 (319) +.+...+ ..+.++-.-..|-... ..+ .....++.|.. +++.....+.+.. +++..-...+..+.. T Consensus 76 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~--~~~~~~~~~~~~~~~ 153 (392) T protein:vir:10 76 YRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQ--YVTVEPVRTRSGSRV 153 (392) T ss_pred HHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhh--hceeeeccCCceeEE Confidence 0000000 0011111111111000 011 12234555543 3333333222221 122111222333566 Q ss_pred eeeccccccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhh-hHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 68 VMKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDINYVVARQGAEVVAPYLDNLRFAT 146 (319) Q Consensus 68 Ip~i~~~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~~~~~~~~~~~vapeiD~~~~s~ 146 (319) ||.....+-..+...++-..+.-+.+...++++-.|...+. . +..+-... ...+...+.+..++++.-.+|..++.- T Consensus 154 ~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~-~-iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g 231 (392) T protein:vir:10 154 LEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGIL-P-LSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGV 231 (392) T ss_pred EEeecCCccceeecccccccccccccceeEEeeeeeEEEee-h-hhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 66655443333332222222221234445555444433322 1 22111111 234566777888888888888876542 Q ss_pred HHhccCccccccCCHhHHHHHHHHHH-HHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeE Q lcl|Aclame:pro 147 LARNKAKHLTVGTGSDAQYDAVLDVS-VELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFV 225 (319) Q Consensus 147 la~~a~~~~~~~~T~~n~~~~i~~a~-~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~ 225 (319) .... .....++ |+.|.++. ..|+... -.+-.++++|..+..|.+-.+-.... 
.......+|.-++|.|.+ T Consensus 232 ~g~~---~~~~~~~----~d~i~~~~~~~l~~~~-~~~a~~vm~~~~~~~L~~lkd~~G~~-l~~~~~~~~~~~tllG~~ 302 (392) T protein:vir:10 232 IEKL---TKQAIKS----LDDIKDVLNVKLDPAI-SPNAILLTNQDGFNYLDKLKDKDGKY-ILQSDPTQKNKKLFAGTN 302 (392) T ss_pred cccc---cccCccC----HHHHHHHHHHhhhhhh-ccCCEEEEcHHHHHHHHHhhccCCCe-EeecCccCCccccccCcc Confidence 2111 1112222 44555554 3454433 33566899999999997643211100 011122345567889987 Q ss_pred EEE-eccccc-------ccceEEEEcCC-ceeeeeeeeeeeeecCCCC--C---ccceeeeeeeeeEEEeccccceEEEE Q lcl|Aclame:pro 226 IVK-VPTKLL-------QGLQAIAVVGE-VLASPIQADLAKTNSNIPG--M---FGTLAEQLLYTGAFVPEHLQKYIFTI 291 (319) Q Consensus 226 I~~-vps~~~-------~~~n~i~~~~~-A~~~~~k~~~~~~~~~~~~--~---~~~~v~gr~~yg~~V~~~k~~~Iy~~ 291 (319) ++. +++..+ .+..+++|.-+ +.....+ ..+.+-..++. . +.-.++...++|..|.+|+......- T Consensus 303 ~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~-~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~ 381 (392) T protein:vir:10 303 PVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKR-EDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEI 381 (392) T ss_pred cEEEecccccCCCcccCCceEEEEEehhceEEEEee-cceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEe Confidence 664 333322 23445666544 2222221 22222211221 1 23468888889999999988655445 Q ss_pred ccccccCCCCC Q lcl|Aclame:pro 292 GGTEVATKRDG 302 (319) Q Consensus 292 ~~~~~a~~~~~ 302 (319) ..++|+..++| T Consensus 382 ~~~a~~~~~~~ 392 (392) T protein:vir:10 382 DLSAPVEQPQG 392 (392) T ss_pred cccccccCCCC Confidence 56677777788 No 112 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=97.96 E-value=9e-06 Score=48.26 Aligned_cols=288 Identities=11% Similarity=0.008 Sum_probs=127.0 Q ss_pred CCccccc--ccceeeehhhhhhhhh---------cch-hhhhhhHhhHH-HHHHHHHhhhhhhhcccCcceeeeCCceEE Q lcl|Aclame:pro 1 MNKTIKN--ATGMLKLNLQHFANKS---------VEP-GQTLLKNKHVG-ILERVTAVNAYSTPALISNDAIFMEGRSFT 67 (319) Q Consensus 1 
~~~~~~~--~~~~~~~~~~~~~~~~---------~~~-n~~~l~~ky~~-lld~~~~~~sl~~~~~~n~~~~~~~g~tVk 67 (319) +.+...+ ..+.++-.-..|-... ..+ .....++.|.. +++.....+.+.. +++..-...+..+.. T Consensus 76 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~--~~~~~~~~~~~~~~~ 153 (392) T protein:vir:10 76 YRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQ--YVTVEPVRTRSGSRV 153 (392) T ss_pred HHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhh--hceeeeccCCceeEE Confidence 0000000 0011111111111000 011 12234555543 3333333222221 122111222333566 Q ss_pred eeeccccccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhh-hHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 68 VMKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDINYVVARQGAEVVAPYLDNLRFAT 146 (319) Q Consensus 68 Ip~i~~~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~~~~~~~~~~~vapeiD~~~~s~ 146 (319) ||.....+-..+...++-..+.-+.+...++++-.|...+. . +..+-... ...+...+.+..++++.-.+|..++.- T Consensus 154 ~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~-~-iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g 231 (392) T protein:vir:10 154 LEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGIL-P-LSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGV 231 (392) T ss_pred EEeecCCccceeecccccccccccccceeEEeeeeeEEEee-h-hhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 66655443333332222222221234445555444433322 1 22111111 234566777888888888888876542 Q ss_pred HHhccCccccccCCHhHHHHHHHHHH-HHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeE Q lcl|Aclame:pro 147 LARNKAKHLTVGTGSDAQYDAVLDVS-VELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFV 225 (319) Q Consensus 147 la~~a~~~~~~~~T~~n~~~~i~~a~-~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~ 225 (319) .... .....++ |+.|.++. ..|+... -.+-.++++|..+..|.+-.+-.... 
.......+|.-++|.|.+ T Consensus 232 ~g~~---~~~~~~~----~d~i~~~~~~~l~~~~-~~~a~~vm~~~~~~~L~~lkd~~G~~-l~~~~~~~~~~~tllG~~ 302 (392) T protein:vir:10 232 IEKL---TKQAIKS----LDDIKDVLNVKLDPAI-SPNAILLTNQDGFNYLDKLKDKDGKY-ILQSDPTQKNKKLFAGTN 302 (392) T ss_pred cccc---cccCccC----HHHHHHHHHHhhhhhh-ccCCEEEEcHHHHHHHHHhhccCCCe-EeecCccCCccccccCcc Confidence 2111 1112222 44555554 3454433 33566899999999997643211100 011122345567889987 Q ss_pred EEE-eccccc-------ccceEEEEcCC-ceeeeeeeeeeeeecCCCC--C---ccceeeeeeeeeEEEeccccceEEEE Q lcl|Aclame:pro 226 IVK-VPTKLL-------QGLQAIAVVGE-VLASPIQADLAKTNSNIPG--M---FGTLAEQLLYTGAFVPEHLQKYIFTI 291 (319) Q Consensus 226 I~~-vps~~~-------~~~n~i~~~~~-A~~~~~k~~~~~~~~~~~~--~---~~~~v~gr~~yg~~V~~~k~~~Iy~~ 291 (319) ++. +++..+ .+..+++|.-+ +.....+ ..+.+-..++. . +.-.++...++|..|.+|+......- T Consensus 303 ~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~-~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~ 381 (392) T protein:vir:10 303 PVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKR-EDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEI 381 (392) T ss_pred cEEEecccccCCCcccCCceEEEEEehhceEEEEee-cceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEe Confidence 664 333322 23445666544 2222221 22222211221 1 23468888889999999988655445 Q ss_pred ccccccCCCCC Q lcl|Aclame:pro 292 GGTEVATKRDG 302 (319) Q Consensus 292 ~~~~~a~~~~~ 302 (319) ..++|+..++| T Consensus 382 ~~~a~~~~~~~ 392 (392) T protein:vir:10 382 DLSAPVEQPQG 392 (392) T ss_pred cccccccCCCC Confidence 56677777788 No 113 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=97.96 E-value=9.2e-06 Score=48.19 Aligned_cols=296 Identities=11% Similarity=0.012 Sum_probs=134.7 Q ss_pred CCccccc-------------c-------cceeeehhhhhhhh------------hcchhhhhhhHhhHHHHHHHHHhhhh Q lcl|Aclame:pro 1 MNKTIKN-------------A-------TGMLKLNLQHFANK------------SVEPGQTLLKNKHVGILERVTAVNAY 48 (319) Q Consensus 1 
~~~~~~~-------------~-------~~~~~~~~~~~~~~------------~~~~n~~~l~~ky~~lld~~~~~~sl 48 (319) .++..+- . .....-....|... -........++.+...+-+.....+. T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~ 150 (415) T protein:vir:47 71 NQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFN 150 (415) T ss_pred cccccccchhhhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhh Confidence 0000000 0 00000000011000 01112223445554323222222222 Q ss_pred hhhcccCcceeeeCCceEEeeecc--ccccccccCCCCcccCCc-ccceeEEEEeecccceeecchhhHHHHhh-hHHHH Q lcl|Aclame:pro 49 STPALISNDAIFMEGRSFTVMKGD--TTELKDYKRNATNEFDHP-KIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDIN 124 (319) Q Consensus 49 ~~~~~~n~~~~~~~g~tVkIp~i~--~~g~~DY~r~~~~~~~~~-t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~ 124 (319) .... ++ .+...+.+.++|... ...-... ...+-..... ..++..+++.-.+.-.+. .+..+-.+. ..++. T Consensus 151 l~~~-~~--~~~~~~~~~~~~~~~~~~~~~~~~-v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~--~iS~ell~ds~~~l~ 224 (415) T protein:vir:47 151 LDKY-VT--VKRVTNGSGKYPVVRQSEVAALEK-VEELEENPELAVKPFFQLAYDINTHRGYF--RISREAIEDAKVNVL 224 (415) T ss_pred hhhh-cc--eeeccCCceeEEEEEecCCcceee-cccccccccccccceeeEEeeeeeeEeee--hhhHHHHhhchHHHH Confidence 1111 11 233333444554432 2221112 1222222222 234556666555544432 222222121 23456 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccCc---------cccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHH Q lcl|Aclame:pro 125 YVVARQGAEVVAPYLDNLRFATLARNKAK---------HLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKG 195 (319) Q Consensus 125 ~~~~~~~~~~vapeiD~~~~s~la~~a~~---------~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~ 195 (319) +.+.+..+++++-.+|..++.-.-.+... ......+....|+.|.++...+....... -.++++|..+.. 
T Consensus 225 ~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~-~~~v~n~~~~~~ 303 (415) T protein:vir:47 225 QELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEH-NVAIVSQTMFAK 303 (415) T ss_pred HHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccceeccccccchHHHHHHHHhhhhhccCC-CEEEEcHHHHHH Confidence 67778888888888888776543222111 11122234456888889988887766553 357899999998 Q ss_pred HhhhhhhhhcccccccceeeeeeeeecCeEEEEeccccc---ccceEEEEcCC-ceeeeeeeeeeeeecCCCCCccceee Q lcl|Aclame:pro 196 IKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLL---QGLQAIAVVGE-VLASPIQADLAKTNSNIPGMFGTLAE 271 (319) Q Consensus 196 L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~---~~~n~i~~~~~-A~~~~~k~~~~~~~~~~~~~~~~~v~ 271 (319) |.+-.+-.... .......+|..++|.|++|+.+++... .+..+++|.-+ +.....+. .+.+-..+...+...++ T Consensus 304 L~~lkd~~G~~-i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~-~~~v~~~~~~~~~~~~~ 381 (415) T protein:vir:47 304 LDKMKDKLGNY-LIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRS-QYQASWTDYMHFGECLM 381 (415) T ss_pred HHHhhccCCCe-eeccCcCCCCCccccceeeEEeccccccCCCccEEEEEehhccEEEEeec-ceEEEeeccccCceEEE Confidence 86532211100 111223466678899999987654322 23457777644 44444432 22222223444566788 Q ss_pred eeeeeeEEEeccccceEEEEccccccCCCCCcccc Q lcl|Aclame:pro 272 QLLYTGAFVPEHLQKYIFTIGGTEVATKRDGVDAH 306 (319) Q Consensus 272 gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~~~~~ 306 (319) +...+|+.|++|++. +++...++.....+-+-+. 
T Consensus 382 ~~~r~d~~v~~~~a~-~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:47 382 IAVRQDCRILDYKSA-IVIEYDDSERGEGDLGLEA 415 (415) T ss_pred EEEEeccEEeccccE-EEEEeeccCCCCCCccCCC Confidence 888999999999763 3344432222222211111 No 114 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=97.96 E-value=9.2e-06 Score=48.19 Aligned_cols=296 Identities=11% Similarity=0.012 Sum_probs=134.7 Q ss_pred CCccccc-------------c-------cceeeehhhhhhhh------------hcchhhhhhhHhhHHHHHHHHHhhhh Q lcl|Aclame:pro 1 MNKTIKN-------------A-------TGMLKLNLQHFANK------------SVEPGQTLLKNKHVGILERVTAVNAY 48 (319) Q Consensus 1 ~~~~~~~-------------~-------~~~~~~~~~~~~~~------------~~~~n~~~l~~ky~~lld~~~~~~sl 48 (319) .++..+- . .....-....|... -........++.+...+-+.....+. T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~ 150 (415) T protein:vir:46 71 NQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFN 150 (415) T ss_pred cccccccchhhhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhh Confidence 0000000 0 00000000011000 01112223445554323222222222 Q ss_pred hhhcccCcceeeeCCceEEeeecc--ccccccccCCCCcccCCc-ccceeEEEEeecccceeecchhhHHHHhh-hHHHH Q lcl|Aclame:pro 49 STPALISNDAIFMEGRSFTVMKGD--TTELKDYKRNATNEFDHP-KIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDIN 124 (319) Q Consensus 49 ~~~~~~n~~~~~~~g~tVkIp~i~--~~g~~DY~r~~~~~~~~~-t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~ 124 (319) .... ++ .+...+.+.++|... ...-... ...+-..... ..++..+++.-.+.-.+. .+..+-.+. ..++. 
T Consensus 151 l~~~-~~--~~~~~~~~~~~~~~~~~~~~~~~~-v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~--~iS~ell~ds~~~l~ 224 (415) T protein:vir:46 151 LDKY-VT--VKRVTNGSGKYPVVRQSEVAALEK-VEELEENPELAVKPFFQLAYDINTHRGYF--RISREAIEDAKVNVL 224 (415) T ss_pred hhhh-cc--eeeccCCceeEEEEEecCCcceee-cccccccccccccceeeEEeeeeeeEeee--hhhHHHHhhchHHHH Confidence 1111 11 233333444554432 2221112 1222222222 234556666555544432 222222121 23456 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccCc---------cccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHH Q lcl|Aclame:pro 125 YVVARQGAEVVAPYLDNLRFATLARNKAK---------HLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKG 195 (319) Q Consensus 125 ~~~~~~~~~~vapeiD~~~~s~la~~a~~---------~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~ 195 (319) +.+.+..+++++-.+|..++.-.-.+... ......+....|+.|.++...+....... -.++++|..+.. T Consensus 225 ~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~-~~~v~n~~~~~~ 303 (415) T protein:vir:46 225 QELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEH-NVAIVSQTMFAK 303 (415) T ss_pred HHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccceeccccccchHHHHHHHHhhhhhccCC-CEEEEcHHHHHH Confidence 67778888888888888776543222111 11122234456888889988887766553 357899999998 Q ss_pred HhhhhhhhhcccccccceeeeeeeeecCeEEEEeccccc---ccceEEEEcCC-ceeeeeeeeeeeeecCCCCCccceee Q lcl|Aclame:pro 196 IKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLL---QGLQAIAVVGE-VLASPIQADLAKTNSNIPGMFGTLAE 271 (319) Q Consensus 196 L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~---~~~n~i~~~~~-A~~~~~k~~~~~~~~~~~~~~~~~v~ 271 (319) |.+-.+-.... .......+|..++|.|++|+.+++... .+..+++|.-+ +.....+. 
.+.+-..+...+...++ T Consensus 304 L~~lkd~~G~~-i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~-~~~v~~~~~~~~~~~~~ 381 (415) T protein:vir:46 304 LDKMKDKLGNY-LIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRS-QYQASWTDYMHFGECLM 381 (415) T ss_pred HHHhhccCCCe-eeccCcCCCCCccccceeeEEeccccccCCCccEEEEEehhccEEEEeec-ceEEEeeccccCceEEE Confidence 86532211100 111223466678899999987654322 23457777644 44444432 22222223444566788 Q ss_pred eeeeeeEEEeccccceEEEEccccccCCCCCcccc Q lcl|Aclame:pro 272 QLLYTGAFVPEHLQKYIFTIGGTEVATKRDGVDAH 306 (319) Q Consensus 272 gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~~~~~ 306 (319) +...+|+.|++|++. +++...++.....+-+-+. T Consensus 382 ~~~r~d~~v~~~~a~-~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:46 382 IAVRQDCRILDYKSA-IVIEYDDSERGEGDLGLEA 415 (415) T ss_pred EEEEeccEEeccccE-EEEEeeccCCCCCCccCCC Confidence 888999999999763 3344432222222211111 No 115 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=97.95 E-value=9.5e-06 Score=48.12 Aligned_cols=297 Identities=9% Similarity=0.016 Sum_probs=129.1 Q ss_pred CCcccccccceeeehhhhhhhhhcch-hhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEP-GQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDY 79 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~-n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY 79 (319) +.+.++. +.++. .+..-.... .....++.+...+-+.....+..... ++ .+...+.++++|.......... 
T Consensus 100 ~~~~~~~--~~~~~---~~ra~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l-~~--~~~~~~~~~~~~~~~~~~~~~~ 171 (421) T protein:vir:13 100 MSKTIRG--IQLSE---EERDIMSSTNNGAVIPQEFVNEFEKLKEGYPSLKEH-CH--VIPVNRNAGKMPVRAGASVDKL 171 (421) T ss_pred HHHhhhc--cchhH---HHhhccccCCcceecchhhHHHHHHHHHhhhhhhhh-ce--eeeccCCceEEEEeecCCccce Confidence 1111100 00000 000000000 11123344432222222222222211 22 2334566777776544332211 Q ss_pred -cCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccc Q lcl|Aclame:pro 80 -KRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTV 157 (319) Q Consensus 80 -~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~~ 157 (319) ....+-....-+.++..+++.-.+...+. . +..+-... ...+.+.+.+..++++.-.+|.-.++.+.+.. .. T Consensus 172 ~~~~E~~~~~~s~~~f~~i~~~~~k~~~~v-~-iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~~~~g~~--~~-- 245 (421) T protein:vir:13 172 ANLAKDTELVKAMLKTQPMAYDIDDYGLLA-P-IDNSLLEDSEINFLEFVNEEFAEFAVNTENAEIVKQAKAVL--AE-- 245 (421) T ss_pred eeccccccccccccceeEEEeeeeeeEeeh-h-hhHHHHhhhHHHHHHHHHHHHHHHHHHHhhhhHhhhhhhcc--cc-- Confidence 11111111222234444444444433322 1 11111111 12345556666666666666655444332111 11 Q ss_pred cCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEEEEeccccc--- Q lcl|Aclame:pro 158 GTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLL--- 234 (319) Q Consensus 158 ~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~--- 234 (319) +...-|+.|.++...|..+..+. -.++++|..+..|.+-.+-.... +.. ....|..++|.|.+|+.+++... 
T Consensus 246 --~~~~~~d~i~~~~~~l~~~~~~~-a~~v~n~~~~~~l~~lkd~~G~~-i~~-~~~~~~~~tl~G~pV~~~~~~~~~~~ 320 (421) T protein:vir:13 246 --ETINDYAGLVKTINSLVPNARKR-AIIVTNSDGRAYLDGLMDKQGRP-LLK-ELSDGGDLVFKGRPVIELEESIFDVG 320 (421) T ss_pred --ccccchHHHHHHHHHhhhhhcCC-CEEEEcHHHHHHHHHhhcCCCce-eec-CcCCCCCceecceeeEEeccccccCC Confidence 11223777888888887765554 35788999998887533211100 011 12345667899999998765322 Q ss_pred ccceEEEEcCCc-eeeeeeeeeeeeecCCCCCcc---ceeeeeeeeeEEEeccccceE--------EEEc-cccccCCCC Q lcl|Aclame:pro 235 QGLQAIAVVGEV-LASPIQADLAKTNSNIPGMFG---TLAEQLLYTGAFVPEHLQKYI--------FTIG-GTEVATKRD 301 (319) Q Consensus 235 ~~~n~i~~~~~A-~~~~~k~~~~~~~~~~~~~~~---~~v~gr~~yg~~V~~~k~~~I--------y~~~-~~~~a~~~~ 301 (319) ....+++|.-+. .....+ ..+++-...+..|. ..++....+|..+.+++.... |+.. ++++++.++ T Consensus 321 ~~~~~~~gd~~~~~~~~~~-~~~~v~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~a~v~~~~~~~~~~~~ 399 (421) T protein:vir:13 321 DETKFIVSDFKTLIKFMDR-KQYLIDQSKEAGYTKNETIARIIERFDVNSPLDKSSDAEKIRKFGVIVKLQEVLKSSPRS 399 (421) T ss_pred CceEEEEEeccccEEEEEe-cceEEEeecccccccCeeEEEEEeeecceeecchhhheeeecccceeeccccccCCCCcC Confidence 234577776553 333222 23333322233333 588999999999999887532 3332 223333233 Q ss_pred Ccc--cccccccc----ccccc Q lcl|Aclame:pro 302 GVD--AHADNVAK----PSGSL 317 (319) Q Consensus 302 ~~~--~~~~~~~~----~~~~~ 317 (319) +.+ ++++..+- +.|+. 
T Consensus 400 ~~~~~~~~~~~~~~~~~~~~~~ 421 (421) T protein:vir:13 400 GKNKNESKEEIKEEGEATQQNE 421 (421) T ss_pred CCCccccchheeeccccccCCC Confidence 322 44444432 11111 No 116 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=97.91 E-value=1.1e-05 Score=47.67 Aligned_cols=279 Identities=8% Similarity=-0.078 Sum_probs=129.2 Q ss_pred CCccccccccee--------------------eehhhhhhhhh----cc-hhhhhhhHhhHHHHHHHHHhhhhhhhcccC Q lcl|Aclame:pro 1 MNKTIKNATGML--------------------KLNLQHFANKS----VE-PGQTLLKNKHVGILERVTAVNAYSTPALIS 55 (319) Q Consensus 1 ~~~~~~~~~~~~--------------------~~~~~~~~~~~----~~-~n~~~l~~ky~~lld~~~~~~sl~~~~~~n 55 (319) .+...+.....+ .+.+..+.... .. -...--++....+++.+.....+.. +++ T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~--~~~ 148 (390) T protein:vir:10 71 GDVQHVSVGDLFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITQPDARLTVRD--LIG 148 (390) T ss_pred ccccccchhhhhhhhHHHHHHHHhhhhhhhhhhhHHHHHHHhhhcccccccccccchhHHHHHHHHHHhhchhhh--hcc Confidence 111111111111 11111110000 00 0000111122344444433333221 222 Q ss_pred cceeeeCCceEEeeecccc-ccccccCCC-CcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHH Q lcl|Aclame:pro 56 NDAIFMEGRSFTVMKGDTT-ELKDYKRNA-TNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAE 133 (319) Q Consensus 56 ~~~~~~~g~tVkIp~i~~~-g~~DY~r~~-~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~ 133 (319) ..-.++.++++|.+... +-..+...+ .....+++ ...+++.-.+.-.+. .--+ +-......+...+.+..+. 
T Consensus 149 --~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~--~~~i~~~~~k~~~~~-~is~-ell~d~~~l~~~i~~~l~~ 222 (390) T protein:vir:10 149 --SGRTDSALIEYVQETGFVNNAAIVAEGALKPESSLK--FAKKTDTTHVIAHTM-KATR-QILSDAPQLASYMNNRLIR 222 (390) T ss_pred --eeeccCCceEEEEEecCCcceeeecCCccccccccc--eeEEEEeeEEEEEee-hhhH-HHHHhHHHHHHHHHHHHHH Confidence 23345668999988653 333433222 22223344 444444444433322 1111 1111112345566777777 Q ss_pred HHHHHHHHHHHHHH-Hhc--------cC-ccccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhh Q lcl|Aclame:pro 134 VVAPYLDNLRFATL-ARN--------KA-KHLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIAL 203 (319) Q Consensus 134 ~vapeiD~~~~s~l-a~~--------a~-~~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~ 203 (319) .++-.+|..++.-- ... ++ ...+.+.+..+.++.+.++...|.....+... ++++|..+..|.+-.+-. T Consensus 223 ~~~~~~~~~il~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~-~v~n~~~~~~L~~lkd~~ 301 (390) T protein:vir:10 223 GLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYPASG-IVINPIDWAAIELAKDAN 301 (390) T ss_pred HHHHHHHHHHhhcCCCCccccccccccccccccccccccchHHHHHHHHHhhccccCCCCE-EEEcHHHHHHHHHhhcCC Confidence 88888888765310 000 00 01112223455788899999999888777554 679999998887543211 Q ss_pred hcccccccceeeeeeeeecCeEEEEecccccccceEEEEcCC-ceeee-eeeeeeeeecCCC--CCccceeeeeeeeeEE Q lcl|Aclame:pro 204 PQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGE-VLASP-IQADLAKTNSNIP--GMFGTLAEQLLYTGAF 279 (319) Q Consensus 204 ~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~-A~~~~-~k~~~~~~~~~~~--~~~~~~v~gr~~yg~~ 279 (319) ... +.+. ...+..++|.|.+|+.+ ..++...+++|.-+ +.... .+--.++..+... ..+...++...++|+. 
T Consensus 302 g~~-l~~~-~~~~~~~~l~G~pv~~~--~~~p~~~~~~gdf~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~~~r~d~~ 377 (390) T protein:vir:10 302 NQY-LIGN-ARGTLTPTLWGLPVVAT--QAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERLALV 377 (390) T ss_pred Cce-eecC-CcCcCCceecceeeEEc--CCCCCCcEEEEeccceEEEEEecceEEEEeecccccccCcEEEEEEEeeccE Confidence 100 1111 12344567999999864 34555556666543 33322 2322333332111 1234577888999999 Q ss_pred EeccccceEEEEcc Q lcl|Aclame:pro 280 VPEHLQKYIFTIGG 293 (319) Q Consensus 280 V~~~k~~~Iy~~~~ 293 (319) |.+|++. +++..+ T Consensus 378 v~~~~a~-~~~~~a 390 (390) T protein:vir:10 378 VYRPEAL-ISGSFA 390 (390) T ss_pred EeccccE-EEEEeC Confidence 9999984 334443 No 117 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=97.89 E-value=1.2e-05 Score=47.48 Aligned_cols=281 Identities=8% Similarity=-0.089 Sum_probs=132.5 Q ss_pred CCcccccccceeeehhhhhhhhhcch--hhhhhhH-hhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEP--GQTLLKN-KHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELK 77 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~--n~~~l~~-ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~ 77 (319) |-.-.- -+.+...+ ..-.+.. ....+++.+.....+.. ++. ..-..+.+++||.....+-. 
T Consensus 1 m~~~~~------------~a~~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~--~~~--~~~~~~~~~~~p~~~~~~~a 64 (330) T protein:vir:77 1 MAGSTV------------PSTQVALTGDFSAFLTPEQSQDYFAEIEKTSIVQR--IAR--KVPMGPTGISIPHWTGAVSA 64 (330) T ss_pred Cccccc------------chhhccccCCCcceechhHHHHHHHHHHhccchhh--hcc--eeeccCCceEEEEEcCCcce Confidence 111110 11111100 0001222 22344444443332221 122 24456778999998765444 Q ss_pred cccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhh-hHHHHHHHHHHHHHHHHHHHHHHHHHH---------H Q lcl|Aclame:pro 78 DYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDINYVVARQGAEVVAPYLDNLRFAT---------L 147 (319) Q Consensus 78 DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~~~~~~~~~~~vapeiD~~~~s~---------l 147 (319) .+.. .+-....-+.++...+++-.|.-.+. . +..+-.+. ...+...+.+..+..++-.+|+..+.- + T Consensus 65 ~~v~-Eg~~~~~~~~~f~~i~~~~~k~~~~~-~-is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~ 141 (330) T protein:vir:77 65 SWTG-EAERKPITKGSFGKQELEPVKITTIF-A-ESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGY 141 (330) T ss_pred eEec-CCCccccccceeeEEEEeEEEEEEee-h-hhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccc Confidence 4432 22222222334444555555433332 2 22222121 245667778888889999999876620 0 Q ss_pred HhccC--------ccccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhh----hcccccccceee Q lcl|Aclame:pro 148 ARNKA--------KHLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIAL----PQGDTRQQVLGK 215 (319) Q Consensus 148 a~~a~--------~~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~----~~~~~~~~~~~~ 215 (319) ...+. ...+...+..+.|+.|.++...+..++.+.. .++|+|..+..|++-.+-. ............ 
T Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~-~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~ 220 (330) T protein:vir:77 142 LAETTKVVSLADTNLTTASGPQGNAYLAVNNALSLLVNSGKKWT-GTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVGA 220 (330) T ss_pred cccccccceeecccccccccccchhHHHHHHHHHhhhhcCCCcc-EEEEcHHHHHHHHHHhccCCceeecCccccccccc Confidence 00000 0011122345678999999988888766543 5789999998887533211 110111111112 Q ss_pred eeeeeecCeEEEEecccc----cccceEEEEcCCceeeee-eeeeeeeecCC-------------------CCCccceee Q lcl|Aclame:pro 216 GVQGELDGFVIVKVPTKL----LQGLQAIAVVGEVLASPI-QADLAKTNSNI-------------------PGMFGTLAE 271 (319) Q Consensus 216 g~Vg~idG~~I~~vps~~----~~~~n~i~~~~~A~~~~~-k~~~~~~~~~~-------------------~~~~~~~v~ 271 (319) ..-+++.|++|+.+++-. ..+.-++++..+...... +=-.+++.+.. -..+...++ T Consensus 221 ~~~~~l~G~PV~~~~~~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r 300 (330) T protein:vir:77 221 IREGRILGRPTYVADNVVNGTVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVR 300 (330) T ss_pred cCCceecceeeEEeccccCCCCCCccEEEEEecceEEEEEecCcEEEEeecceeeecccccccccccccchhhcCcEEEE Confidence 233578999998754311 112346666654443222 21122222210 011235778 Q ss_pred eeeeeeEEEeccccceEEEEccccccCCCCCccc Q lcl|Aclame:pro 272 QLLYTGAFVPEHLQKYIFTIGGTEVATKRDGVDA 305 (319) Q Consensus 272 gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~~~~ 305 (319) ...++|..|.+|++.... .. 
+++...|-.+ T Consensus 301 ~~~r~d~~v~~~~a~~~i-~~---~~~~~~~~~~ 330 (330) T protein:vir:77 301 CEAEFAFMVNDKDAFVKL-TD---QVAGTDPEEE 330 (330) T ss_pred EEEEeccEEecccceEEE-Ee---ccCCcCCCCC Confidence 889999999999874332 22 1122222222 No 118 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=97.88 E-value=1.3e-05 Score=47.37 Aligned_cols=290 Identities=10% Similarity=0.020 Sum_probs=123.7 Q ss_pred CCcccccccceeeehhh-hh-hhhh---cch--hhhhhhHhhHH-HHHHHHHhhhhhhhcccCcceeeeCCceEEeeecc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQ-HF-ANKS---VEP--GQTLLKNKHVG-ILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGD 72 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~-~~-~~~~---~~~--n~~~l~~ky~~-lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~ 72 (319) ..+.+++....|+-.-+ .| ..+. ..+ .-...++.+.. +++.+.....+.. +++. +.-.++..+.+|..+ T Consensus 92 ~~~~l~~~~~~~~~~e~~~~~~~~a~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~--~~~~-~~~~~~~~~~~~~~~ 168 (409) T protein:vir:45 92 FDKWMRHGASELTSEERKALRELRAQGVAQDEKGGYTVPETFLAKVVEKMKSYGGIAS--VAQI-LTTSDGRTMEWATAD 168 (409) T ss_pred HHHHHHhhhhhccHHHHHHHHHHhhccCccCcCCceeccHhHHHHHHHHHHhhhhhhh--hcee-eecCCCceEEEEeec Confidence 11111111111111111 00 0000 000 01123444433 3333322222211 1221 222345567777765 Q ss_pred cc-ccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 73 TT-ELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNK 151 (319) Q Consensus 73 ~~-g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a 151 (319) .. ..+.+... 
+-....-+.++...++...|.....|.--+.---....++...+.+..+..+.-.+|..++.- .++ T Consensus 169 ~~~~~~~~v~E-~~~~~~~~~~f~~~~l~~~k~~~~~i~is~ell~ds~~~l~~~i~~~la~a~~~~~~~a~l~G--~G~ 245 (409) T protein:vir:45 169 GTSEVGVLLGE-NEEAGEEDTDFGMGSLGALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRGEARYLIQG--TGA 245 (409) T ss_pred cCccccccccc-cccccccccccceeeeeeeeeeeeehhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhcc--CCC Confidence 43 23333322 222222333455555554443322221111111011234556666777777877777765420 111 Q ss_pred C-------------ccccccCCHhHHHHHHHHHHHHHHhccCCCCcE-EEEChHHHHHHhhhhhhhhcccccccceeeee Q lcl|Aclame:pro 152 A-------------KHLTVGTGSDAQYDAVLDVSVELDEIKAPENRV-LFVSPTFYKGIKKFVIALPQGDTRQQVLGKGV 217 (319) Q Consensus 152 ~-------------~~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~-l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~ 217 (319) + .......+..--++.|+++...|.........| ++++|..+..|.+-.+-.... ..+....+|. T Consensus 246 ~~~~~p~Gil~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a~~~~~~n~~~~~~l~~lkd~~G~~-i~~~~~~~~~ 324 (409) T protein:vir:45 246 GTPKQPKGLAASVTGTTQTAAANAVKWQEILALKHSIDPAYRRGPKFRLAFNDNTLKLISEMEDGQGRP-LWLPDIVGVA 324 (409) T ss_pred CCccccceeeeccccccccccccccchHHHHHHHHhhhhhhccCCeEEEEECHHHHHHHHHhhcCCCce-eeccCcCCCC Confidence 1 001111122234777888888887754334555 577999988876533211100 1122234455 Q ss_pred eeeecCeEEEEecccc--ccc-ceEEEEcCC-ceeeeeeeeeeeeecCCC-CCccceeeeeeeeeEEEeccccceEEEEc Q lcl|Aclame:pro 218 QGELDGFVIVKVPTKL--LQG-LQAIAVVGE-VLASPIQADLAKTNSNIP-GMFGTLAEQLLYTGAFVPEHLQKYIFTIG 292 (319) Q Consensus 218 Vg~idG~~I~~vps~~--~~~-~n~i~~~~~-A~~~~~k~~~~~~~~~~~-~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~ 292 (319) .++|.|.||+.+.+-. ..+ .-+++|.-+ ..+...+-..++....+- ..+.-.++....+|..+.+|++..++... 
T Consensus 325 ~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~i~~~~~~~~~~~~d~~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~k 404 (409) T protein:vir:45 325 PASVLNVPYVIDQEIDDIGAGKKFMFCGDFDRFIIRRVRYMILKRLVERYAEYDQTGFLAFHRFDCILEDTSAIKALVGK 404 (409) T ss_pred CceecceeeEEecCcCCccCCccEEEEeehhhhheeeccceEEEEeecccccCCcEEEEEEEEeccEeechhheEEEEec Confidence 6789999998643211 112 224445533 333322222233322111 11224688888999999999975443333 Q ss_pred cccccCC Q lcl|Aclame:pro 293 GTEVATK 299 (319) Q Consensus 293 ~~~~a~~ 299 (319) . .+.+ T Consensus 405 ~--s~~~ 409 (409) T protein:vir:45 405 G--SVGG 409 (409) T ss_pred c--CCCC Confidence 2 1111 No 119 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=97.85 E-value=1.5e-05 Score=47.08 Aligned_cols=281 Identities=11% Similarity=0.031 Sum_probs=128.9 Q ss_pred CC----cccccccceeeehhhhh--hhh-----hcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEee Q lcl|Aclame:pro 1 MN----KTIKNATGMLKLNLQHF--ANK-----SVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVM 69 (319) Q Consensus 1 ~~----~~~~~~~~~~~~~~~~~--~~~-----~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp 69 (319) .. +.++ .|++.- +.-+ +.. ......+--.+.+..++.+.....+... 
.++ +-+.-.++..+.|| T Consensus 85 ~~~~~~~~~r--~g~~~~-~~~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~-~~~-~~~~~~~~~~~~~~ 159 (392) T protein:vir:13 85 ADHDDDAVLR--AGNLGE-ARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMR-GGA-STFTTSDANPMDFT 159 (392) T ss_pred hhHHHHHHHh--ccchhh-hHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhh-hcc-eeeecCCCceeEEE Confidence 00 0000 011110 0000 000 0000011112233444555444333321 111 22233467789999 Q ss_pred eccccccccccC-CCCcccCCcccceeEEEEeecccceeecchhhHHHHh-hhHHHHHHHHHHHHHHHHHHHHHHHHHH- Q lcl|Aclame:pro 70 KGDTTELKDYKR-NATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTE-GNIDINYVVARQGAEVVAPYLDNLRFAT- 146 (319) Q Consensus 70 ~i~~~g~~DY~r-~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~-~~~~~~~~~~~~~~~~vapeiD~~~~s~- 146 (319) ......-..+.. ++.....++ ++...++.-.|.-.+. . +..+-.. ...++...+.+..+..++-.+|..++.- T Consensus 160 ~~~~~~~a~~v~E~~~~~~~~~--~f~~v~~~~~k~~~~~-~-iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~ 235 (392) T protein:vir:13 160 VITGRATAGIVGETAEIPESYP--ATTQRSMGGFKYGFAS-V-VSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFLTGT 235 (392) T ss_pred EEcCCcceeeeccccccccccc--ceeeEEeeeeeEEeee-h-hHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccc Confidence 887644333322 222222333 4445555545543332 2 2222111 1234556677788888888888876530 Q ss_pred --------HHhccCccc--cccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeee Q lcl|Aclame:pro 147 --------LARNKAKHL--TVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKG 216 (319) Q Consensus 147 --------la~~a~~~~--~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g 216 (319) +........ +.+......|+.|.++...|... ...+-.++++|..+..|.+-.+-.... 
..+.....| T Consensus 236 Gt~~p~Gil~~~~~~~~~~~~~~~~~~~~d~l~~~~~~l~~~-~~~~a~~v~n~~~~~~l~~lkd~~G~~-l~~~~~~~g 313 (392) T protein:vir:13 236 GTGQPRGILTDATGANAAFGEADADSKVSDALIDLFHEVPSA-YRKNAKFVVNDLRAAQMRKLKDANGQY-LWQSALTVG 313 (392) T ss_pred CCccccccccccccccccccccccccccHHHHHHHHHhhhhh-hhcCCEEEEcHHHHHHHHHhhccCCce-eecCCcCCC Confidence 000000000 00111223478888888777654 333445788999998886532211100 012233445 Q ss_pred eeeeecCeEEEEecccccccceEEEEcCCceeeeeeeeeeeeecCCCCC---ccceeeeeeeeeEEEeccccceEEEEcc Q lcl|Aclame:pro 217 VQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQADLAKTNSNIPGM---FGTLAEQLLYTGAFVPEHLQKYIFTIGG 293 (319) Q Consensus 217 ~Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~~~k~~~~~~~~~~~~~---~~~~v~gr~~yg~~V~~~k~~~Iy~~~~ 293 (319) .-++|.|.||+.+ ..++...+++|+-+....... ..+++-...... +...++...+.|+.|.+|++..+.-.. T Consensus 314 ~~~~l~G~Pv~~~--~~~~~~~i~~Gdf~~~~i~~~-~~~~i~~~~~~~~~~~~~~~r~~~r~d~~~~~~~A~~~~~~~- 389 (392) T protein:vir:13 314 APDTFNGKVVETD--DGMPADKVLFADLSKYRVRFA-GSLRVDRSVDAKFSTDQIVYRFLQRADGLLVDARGAKVLTVT- 389 (392) T ss_pred CCceecceeeEEc--CCCCCCcEEEeeccceeEEee-cceEEEeeccccccCCcEEEEEEEEeccEEecccceEEEEee- Confidence 5568999999864 344455566677554322222 334433211222 335788889999999999984332222 Q ss_pred ccccC Q lcl|Aclame:pro 294 TEVAT 298 (319) Q Consensus 294 ~~~a~ 298 (319) +++ T Consensus 390 --~aa 392 (392) T protein:vir:13 390 --PAA 392 (392) T ss_pred --ccC Confidence 222 No 120 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=97.84 E-value=1.5e-05 Score=47.00 Aligned_cols=283 Identities=14% Similarity=0.107 Sum_probs=127.8 Q ss_pred CCcccccccceeeehhhhhhhhhcch--hhhhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccc-ccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEP--GQTLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDT-TEL 76 (319) Q Consensus 1 
~~~~~~~~~~~~~~~~~~~~~~~~~~--n~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~-~g~ 76 (319) +++.++. +.. ....+-+-..+ .....++.|. .+++.+.....+.. +++ ..-..+.+.++|.... .+- T Consensus 96 ~~~~l~~---~~~--~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~--~~~--~~~~~~~~~~~~~~~~~~~~ 166 (394) T protein:vir:10 96 INDFIHS---HGK--VIDNAAGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLST--LVT--KTPVTTPKGTYPILKRATDR 166 (394) T ss_pred HHHHHhc---cch--hhhhhhcccccccCceeccHHHHHHHHHHHHhhhhhhh--hce--eeeccCCceEEEEEecCCCc Confidence 0000000 000 00000000000 1122444554 33434333333321 122 2334566777776643 233 Q ss_pred ccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccc Q lcl|Aclame:pro 77 KDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHL 155 (319) Q Consensus 77 ~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~ 155 (319) ..+...++-..+.-..+...++++-.+...| +. +..+-.+. ..++.+.+.+..+..++-.+|.-++... +... T Consensus 167 ~~~~~E~~~~~~~~~~~~~~v~l~~~k~~~~-~~-iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~----g~~~ 240 (394) T protein:vir:10 167 FSSVAELAENPALAEPEFEQVDWSVSTYRGA-IP-LSEEAIADSAVDLTSLVGQSINEKSVNTYNAMIAPVL----QSFT 240 (394) T ss_pred cccccccccccccccccceeEEeeeeeeEee-eh-hHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcc----cccc Confidence 3443333332322223344444444444443 22 22221111 1345666777888888888887655433 2222 Q ss_pred cccCCHhHHHHHHHHHHH-HHHhccCCCCcEEEEChHHHHHHhhhhhhhhc----ccccccceeeeeeeeecCeEEEEec Q lcl|Aclame:pro 156 TVGTGSDAQYDAVLDVSV-ELDEIKAPENRVLFVSPTFYKGIKKFVIALPQ----GDTRQQVLGKGVQGELDGFVIVKVP 230 (319) Q Consensus 156 ~~~~T~~n~~~~i~~a~~-~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~----~~~~~~~~~~g~Vg~idG~~I~~vp 230 (319) ..+.+....++.|.++.. .++.+ . +-.++++|..+..|.+-.+-... ... 
......|.-++|.|.+|+.++ T Consensus 241 ~~~~~~~~~~d~l~~~~~~~~~~~-~--~a~~vmn~~~~~~l~~lkd~~G~~i~~~~~-~~~~~~~~~~~L~G~PV~~~~ 316 (394) T protein:vir:10 241 AKATTTDTLVDSLKHILNVDLDPA-Y--SRALVVTQSLFNTLDTLKDKNGRYLLHDAS-DSITDGTAKGTVLGVPVYVVG 316 (394) T ss_pred cccccccccHHHHHHHHHhhhhhh-c--cCEEEecHHHHHHHHHhhccCCCeeeeccc-cccccCCcccccccceeEEec Confidence 233333445566665543 33332 2 34688999999888754321111 000 111222444679999998776 Q ss_pred ccccc----cceEEEEcCC-ceeeeeeeeeeeeecCCCCCccceeeeeeeeeEEEeccccceEEEEc-cccccCCCCCc Q lcl|Aclame:pro 231 TKLLQ----GLQAIAVVGE-VLASPIQADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIG-GTEVATKRDGV 303 (319) Q Consensus 231 s~~~~----~~n~i~~~~~-A~~~~~k~~~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~-~~~~a~~~~~~ 303 (319) +..+. +..++++.-+ +.....+- .+++-......|...++....+|+.|.++++..+..-. .++.++..+|. T Consensus 317 ~~~~~~~~~~~~i~~gd~s~~~~~~~~~-~~~v~~~~~~~~~~~~~~~~r~d~~~~~~~ai~~~~~~~~~~~~~~~~~~ 394 (394) T protein:vir:10 317 DALLGSAAGDQKAFVGDLKRGVLFADRQ-QVTLAWEDSKIYGRYLGAAFRFGVKQADSNAGYFVTNTDAASGSTSGTGK 394 (394) T ss_pred ccccCCCCCceEEEEeeccccEEEEeec-ceEEEEecccccceeEEEEEEeccEEeccccEEEEEeecccCCCCCCCCC Confidence 54332 3446666533 44444332 23332223445666777788889999998875332221 12222222333 No 121 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=97.83 E-value=1.6e-05 Score=46.87 Aligned_cols=277 Identities=13% Similarity=0.079 Sum_probs=118.6 Q ss_pred CCc------------ccccc-cceeeehhh--hh----hhhhcch--hhhhhhHhhHHHHHHHHHhhhhhhhcccCccee Q lcl|Aclame:pro 1 MNK------------TIKNA-TGMLKLNLQ--HF----ANKSVEP--GQTLLKNKHVGILERVTAVNAYSTPALISNDAI 59 (319) Q Consensus 1 ~~~------------~~~~~-~~~~~~~~~--~~----~~~~~~~--n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~ 59 (319) +.. ..+.. .+.....+. +. ...-..+ .....++.|...+.+.....+.... 
.++ .+ T Consensus 95 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~-~~~--~~ 171 (400) T protein:vir:38 95 LNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNAGVKAADAASTIPETISNTPQRELQTVVDLKP-FTN--VF 171 (400) T ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhcccccCCcccccHHHHHHHHHHHHhhhhhhh-cce--eE Confidence 000 00000 000000000 00 0000000 1122334443323222222221111 111 23 Q ss_pred eeCCceEEeeeccc-cccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhh-hHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 60 FMEGRSFTVMKGDT-TELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDINYVVARQGAEVVAP 137 (319) Q Consensus 60 ~~~g~tVkIp~i~~-~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~~~~~~~~~~~vap 137 (319) ..++.++++|.... .+-..+-..++-....-+.+...++++-.+.-.+ + .+..+-.+. ..++...+.+..+..+.- T Consensus 172 ~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~i~~~~~k~~~~-~-~is~ell~ds~~~~~~~i~~~l~~~~~~ 249 (400) T protein:vir:38 172 QASTQKGTYPTVANATTKMVTVAELEKNPAMAKPEFKPVNWSVETYRQA-L-PVSQESIDDSAIDLVGLIAQNGQQIKVN 249 (400) T ss_pred eccCcceEEEEEecCCCccccccccccccccccccceeeEeehhheeee-h-hhHHHHHhhhHHHHHHHHHHHHHHHHHH Confidence 34566777777643 2322332222222222223445555555554432 2 222211111 133455556666666666 Q ss_pred HHHHHHHHHHHhccCccccccCCHhHHHHHHHHHHH-HHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeee Q lcl|Aclame:pro 138 YLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDVSV-ELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKG 216 (319) Q Consensus 138 eiD~~~~s~la~~a~~~~~~~~T~~n~~~~i~~a~~-~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g 216 (319) .+|...+...-+. ......+ ++.|.++.. .++.+ .+-.++++|..+..|.+-.+-.... 
.......+| T Consensus 250 ~~~~~i~~~~~~~---~~~~~~~----~~~~~~~~~~~~~~~---~~a~~v~~~~~~~~l~~lkd~~G~~-i~~~~~~~~ 318 (400) T protein:vir:38 250 TTNGAVATLLKGF---TAKTISS----VDDLKHINNVDLDPA---YSRVIIASQSFYNFLDTVKDGNGRY-LLQDSILTP 318 (400) T ss_pred HHHHhhhhccccc---ccccccc----HHHHHHHHHhhhhhh---hCcEEEEcHHHHHHHHHhhccCCCe-eeecCcCCC Confidence 6666544322111 1112223 333333332 22211 2457889999999887643211110 111223455 Q ss_pred eeeeecCeEEEEecccc---cccceEEEEcCC-ceeee-eeeeeeeeecCCCCCccceeeeeeeeeEEEeccccceEEEE Q lcl|Aclame:pro 217 VQGELDGFVIVKVPTKL---LQGLQAIAVVGE-VLASP-IQADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTI 291 (319) Q Consensus 217 ~Vg~idG~~I~~vps~~---~~~~n~i~~~~~-A~~~~-~k~~~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~ 291 (319) .-++|.|++|+.+++.. ..+..+++|.-+ +.... .+--.++.. ....|...+++..++|..|.+|+... ++. T Consensus 319 ~~~~l~G~pv~~~~~~~~~~~g~~~~~~gd~s~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~r~d~~~~~~~a~~-~l~ 395 (400) T protein:vir:38 319 SGKSVLGMPIAVVSDDTLGAAGEAHAFLGDIKRAILFANRADFMVRWV--DDQIYGQFLQAGMRFGVSVADEKAGY-FLT 395 (400) T ss_pred CccccccceeEEecccccCCCCceEEEEEeccccEEEEeecceEEEEe--cccccceeEEEEEEeccEEecccceE-EEE Confidence 66789999999765422 223456777654 33333 332333332 35667889999999999999988732 233 Q ss_pred cccccc Q lcl|Aclame:pro 292 GGTEVA 297 (319) Q Consensus 292 ~~~~~a 297 (319) . 
++.| T Consensus 396 ~-~~~a 400 (400) T protein:vir:38 396 Y-TPKA 400 (400) T ss_pred e-ecCC Confidence 2 2222 No 122 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=97.81 E-value=1.8e-05 Score=46.66 Aligned_cols=290 Identities=10% Similarity=0.013 Sum_probs=127.8 Q ss_pred CCcccccccceeeehhhhhhhhhc---ch--hhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSV---EP--GQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTE 75 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~---~~--n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g 75 (319) ..+-+++..+ -.+..+-.+.. .+ .-...++.+...+-+.....+.... +++ .+..++..+++|...... T Consensus 86 ~~~~l~~g~~---~~~~~~e~~a~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~-~~~--~~~~~~~~~~~~~~~~~~ 159 (407) T protein:vir:48 86 FIGFMRKGRE---DGLRELERKALQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQ-EAT--VITLGGSDYKKLVNLGGT 159 (407) T ss_pred HHHHHhccch---hhhhHHHHHhhhcccCCCCcccccHhHHHHHHHHHHhhhhhhh-hce--eeecCCCceEEEEecCCc Confidence 1111111000 00011111111 00 0112345554333333333332221 222 233456688888765544 Q ss_pred cccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHh-hhHHHHHHHHHHHHHHHHHHHHHHHHHH-------- Q lcl|Aclame:pro 76 LKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTE-GNIDINYVVARQGAEVVAPYLDNLRFAT-------- 146 (319) Q Consensus 76 ~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~-~~~~~~~~~~~~~~~~vapeiD~~~~s~-------- 146 (319) -..+...++-.++.-..++...++.-.|...|. . +..+-.. 
...++...+.+...+.++-.+|..++.- T Consensus 160 ~a~~v~E~~~~~~~~~~~f~~i~~~~~k~~~~~-~-iS~ell~ds~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~~~p~G 237 (407) T protein:vir:48 160 TSGWVGETDARPETATSKLGLIEPFMGEIYGNP-Q-ATQKMLDDAFFNVEDWINSELALEFAEQEEIAFTSGDGSKKPKG 237 (407) T ss_pred ceeeecccccccccccccceeEEeeeeeeEeeh-h-hHHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccce Confidence 333332222222221223444455545544432 1 2221111 1234555666777777777777764420 Q ss_pred -HHhccCcc------------ccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccce Q lcl|Aclame:pro 147 -LARNKAKH------------LTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVL 213 (319) Q Consensus 147 -la~~a~~~------------~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~ 213 (319) +...+... .+......-.|+.|.++...|..+..+.. .++++|..+..|.+-.+-.... ..+... T Consensus 238 il~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a-~~v~n~~~~~~L~~lkD~~Gr~-l~~~~~ 315 (407) T protein:vir:48 238 FLAYESTDEDDKTRAFGKLQHIASGAASGVTADAIIKLIYTLRKAHRSGA-KFMMNNSSLFAIRLLKDNDGNY-LWRPGI 315 (407) T ss_pred eeecccccccccccccccccccccccccccChHHHHHHHHhhchhhhcCC-EEEEcHHHHHHHHHhhccCCce-eeccCc Confidence 00000000 00011112237888888888877644433 4679999998886543211110 112233 Q ss_pred eeeeeeeecCeEEEEecccc--cccce-EEEEcCC-ceeeeeeeeeeeeecCCC-CCccceeeeeeeeeEEEeccccceE Q lcl|Aclame:pro 214 GKGVQGELDGFVIVKVPTKL--LQGLQ-AIAVVGE-VLASPIQADLAKTNSNIP-GMFGTLAEQLLYTGAFVPEHLQKYI 288 (319) Q Consensus 214 ~~g~Vg~idG~~I~~vps~~--~~~~n-~i~~~~~-A~~~~~k~~~~~~~~~~~-~~~~~~v~gr~~yg~~V~~~k~~~I 288 (319) ..|..++|.|.||+.+++-. ..+-. 
+++|.-+ +.....+ ..+++.+.+- ..+--.++....+|+.|++|++..+ T Consensus 316 ~~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~-~~~~i~~d~~~~~~~~~~~~~~r~d~~v~~~~a~~~ 394 (407) T protein:vir:48 316 ELGQPSSLAGYGIVENEQMPDIAADAKAIAFGNFKRGYTIVDR-IGTRILRDPYTNKPFVGFYTTKRTGGMLVDSQAIKL 394 (407) T ss_pred CCCCCceecceeeEEecCcCCccCCccEEEEEeccccEEEEEe-eceEEEeeccccCCcEEEEEEEEeccEEecccceEE Confidence 45666789999998654311 11222 4445543 4444433 3455544211 1223467777889999999997433 Q ss_pred EEEccccccCCCCCccc Q lcl|Aclame:pro 289 FTIGGTEVATKRDGVDA 305 (319) Q Consensus 289 y~~~~~~~a~~~~~~~~ 305 (319) +-.. +++...+ .+ T Consensus 395 l~~~---aa~~~~~-~~ 407 (407) T protein:vir:48 395 MKIG---AATRQKA-AA 407 (407) T ss_pred EEee---ccCCCCC-CC Confidence 2222 1111111 11 No 123 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=97.73 E-value=2.5e-05 Score=45.85 Aligned_cols=275 Identities=12% Similarity=0.021 Sum_probs=112.2 Q ss_pred CC---cccccccceeee----------hhhhhhh-hh----c-chhhhhhhHhhH-HHHHHHHHhhhhhhhcccCcceee Q lcl|Aclame:pro 1 MN---KTIKNATGMLKL----------NLQHFAN-KS----V-EPGQTLLKNKHV-GILERVTAVNAYSTPALISNDAIF 60 (319) Q Consensus 1 ~~---~~~~~~~~~~~~----------~~~~~~~-~~----~-~~n~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~ 60 (319) ++ +..++...++.- ..+.+.. +. . ...-.-+++.+. .+++.+.....+.. +++ +.. 
T Consensus 81 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~al~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~--~~~--v~~ 156 (387) T protein:vir:93 81 LNDHEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLRE--KAR--LTN 156 (387) T ss_pred cchhhHHHHHHHHHHHHHhhhhhhhhhhhhhHHHHHhhccCcCCCCceeechhHHHHHHHHHHhhchhhh--hee--eee Confidence 11 000100000000 0000000 00 0 000112344443 34444433333211 111 122 Q ss_pred eCCceEEeeeccccccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhh-hHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 61 MEGRSFTVMKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDINYVVARQGAEVVAPYL 139 (319) Q Consensus 61 ~~g~tVkIp~i~~~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~~~~~~~~~~~vapei 139 (319) .+ ..++|.+...+-..+-...+-.....+.+...++++..++..|. .+..+-... ..++...+.+..++.+.-.. T Consensus 157 ~~--~~~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~--~iS~ell~Ds~~~l~~~i~~~la~~~~~~e 232 (387) T protein:vir:93 157 IK--GLEIPRVSYTLDDDDFITDVETAKELKLKGDTVKFTTNKFKVFA--AISDTVIHGSDVDLVNWVENALQSGLAAKE 232 (387) T ss_pred cC--CceEEEEeecCCccccccCcccccccccccceeeeeheeeeeec--hhhHHHHhhhHHHHHHHHHHHHHHHHHHHH Confidence 22 24567664332211111222222222344455555555554432 112111111 12334444555555555443 Q ss_pred HHHHHHHHHhccCcc---------ccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChH-HHHHHhhhhhhhhccccc Q lcl|Aclame:pro 140 DNLRFATLARNKAKH---------LTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPT-FYKGIKKFVIALPQGDTR 209 (319) Q Consensus 140 D~~~~s~la~~a~~~---------~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~-~~~~L~~~~~f~~~~~~~ 209 (319) +...|. .+.+.. ....++.++.|+.|+++...|+.+-.....| +|++. ++.++++-.. 
+ T Consensus 233 ~~~~~~---~g~g~g~p~g~l~~~~~~~v~~~~~~d~i~~~~~~l~~~~~~~a~~-~mn~~t~~~~~~~~~d-------~ 301 (387) T protein:vir:93 233 RKDALA---VSPKSGLDHMSFYNGSVKEVEGADMYDAIINALADLHEDYRDNATI-YMRYADYVKIISVLSN-------G 301 (387) T ss_pred HHhHhh---cCCCccccceeeeccccccccccchHHHHHHHHhccChhhhcCCEE-EEechHHHHHHHHHhc-------C Confidence 443332 221111 1123355667999999988888764444454 56655 4555543221 1 Q ss_pred ccceeeeeeeeecCeEEEEecccccccceEEEEcCCceeeeeeeeeeeeecCCC-CCccceeeeeeeeeEEEeccccceE Q lcl|Aclame:pro 210 QQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQADLAKTNSNIP-GMFGTLAEQLLYTGAFVPEHLQKYI 288 (319) Q Consensus 210 ~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~~~k~~~~~~~~~~~-~~~~~~v~gr~~yg~~V~~~k~~~I 288 (319) +.....|.=.+|.|.||+.+.. ++ .+++|.-+-.... ++.+.+.+..+ ...-..+..+..+|++|+++++.- T Consensus 302 ~~~~~~~~~~~llG~PV~~~~~--~~--~~~~GDf~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~eA~~- 374 (387) T protein:vir:93 302 TTNFFDTPAEKVFGKPVVFTDA--AV--KPIVGDFNYFGIN--YDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFR- 374 (387) T ss_pred CCcccccCCccccccceEEecC--CC--ceeeeehhhhhee--hhhheeeecccccCCceeEEEEeeeCceeechhheE- Confidence 1122233334789999997542 22 2344443221111 11122211111 122356778889999999998743 Q ss_pred EEEccccccCCCC Q lcl|Aclame:pro 289 FTIGGTEVATKRD 301 (319) Q Consensus 289 y~~~~~~~a~~~~ 301 (319) ++...+++++.++ T Consensus 375 ~l~~k~~~~~~~~ 387 (387) T protein:vir:93 375 IAKAKENTGSLPS 387 (387) T ss_pred EEEeecCCCCCCC Confidence 3455444554444 No 124 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=97.67 E-value=3e-05 Score=45.36 Aligned_cols=302 Identities=11% Similarity=-0.026 Sum_probs=132.8 Q ss_pred CCcccccccce--eeehhhhhhhhhcc-----hhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccc Q lcl|Aclame:pro 1 MNKTIKNATGM--LKLNLQHFANKSVE-----PGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDT 73 (319) Q 
Consensus 1 ~~~~~~~~~~~--~~~~~~~~~~~~~~-----~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~ 73 (319) .........|. |+.-..-|.+++.+ -.-.-+++.++..+-+.....+..... ++ .+-.++..++||.... T Consensus 59 ~~~~~~~~~~~~~l~~~~r~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~-~~--~~~~~~~~~~i~~~~~ 135 (390) T protein:vir:40 59 NDNNVLASRGANALTSDESKYYNEVIAGNGFAGVTALLPPTVFERVFEDLTVEHPLLSK-IN--FVNTTATTEWIISVGD 135 (390) T ss_pred HHHHHHHhcCchhccHHHHHHHHHHHhccCcccCcccccHHHHHHHHHHHHhhhhhhhh-ce--eeecCCceeEEEEEcC Confidence 00000000000 11111111111111 111123444433222222222222211 22 2445778899998876 Q ss_pred cccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHh-hhHHHHHHHHHHHHHHHHHHHHHHHHHH------ Q lcl|Aclame:pro 74 TELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTE-GNIDINYVVARQGAEVVAPYLDNLRFAT------ 146 (319) Q Consensus 74 ~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~-~~~~~~~~~~~~~~~~vapeiD~~~~s~------ 146 (319) .+-..+.-.++-..+..+.++..+++.-.+...+.. +..+-.. ...++...+.+..+..++-.+|...+.- T Consensus 136 ~~~a~~~~E~~~~~~~~~~~f~~i~l~~~k~~~~i~--iS~ell~ds~~~l~~~i~~~la~~i~~~~~~a~l~G~G~~~P 213 (390) T protein:vir:40 136 VATAWWGPLCAEIKEVLDNGFDKIQTGMYKLSAYIP--VCNAMLDLGPSWLDQYVRTILGEAMALGLEAGIVNGSGKDQP 213 (390) T ss_pred CcceeeeccccccCccccccceeeEeeeeeEEEeeh--hhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhcccCCCcc Confidence 665555433332222223344445555555544322 1111111 1134556677888888888888876531 Q ss_pred ---HHhccC-------ccccccCCHhHHHHHHHHHHHHHHhccCC--CCcEEEEChHHHHHHhhhhhhhhccccccccee Q lcl|Aclame:pro 147 ---LARNKA-------KHLTVGTGSDAQYDAVLDVSVELDEIKAP--ENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLG 214 (319) Q Consensus 147 ---la~~a~-------~~~~~~~T~~n~~~~i~~a~~~Lde~~VP--~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~ 214 (319) +...+. ......++..++.+.+......+....-. .+-+++|+|..+..+++..+..+. 
..|..+ T Consensus 214 ~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~~~~~d-~~G~~v-- 290 (390) T protein:vir:40 214 IGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAATSYMT-PQGVWV-- 290 (390) T ss_pred ceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHHhhccC-CCCccc-- Confidence 000000 01112244555566665555555553322 356788998764333222221111 111111 Q ss_pred eeeeeeecCeEEEEecccccccceEEEEcCCceeeeeeeeeeeeecCCCCC---ccceeeeeeeeeEEEeccccceEEEE Q lcl|Aclame:pro 215 KGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQADLAKTNSNIPGM---FGTLAEQLLYTGAFVPEHLQKYIFTI 291 (319) Q Consensus 215 ~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~~~k~~~~~~~~~~~~~---~~~~v~gr~~yg~~V~~~k~~~Iy~~ 291 (319) .+ ....|.+|+. ++.++...+++|..+-..... ...+++-..++.. +...++....+|+.|.++++..++-- T Consensus 291 ~~--~~~~g~pvv~--~~~~p~~~i~~Gd~s~~~i~~-~~~~~v~~~~~~~f~~~~~~~r~~~r~dg~v~~~~A~~~l~~ 365 (390) T protein:vir:40 291 TG--ILPVPLEIVQ--SVAVPVGKAVAGRAKDYFMGI-GSEQVIRTSTEYRLLDDETLYYAKQYANGRPKDNSSFLVFDI 365 (390) T ss_pred cc--cCCCceeEEE--cCCCCCCcEEEEeeceEEEEe-ecceEEEecchhhhhcCcEEEEEEEEeCCEEecccceEEEEe Confidence 11 1235888875 455666667777665433332 2344443323333 33688999999999999998433221 Q ss_pred ccccccCCCCCcccccccccc-cccccc Q lcl|Aclame:pro 292 GGTEVATKRDGVDAHADNVAK-PSGSLE 318 (319) Q Consensus 292 ~~~~~a~~~~~~~~~~~~~~~-~~~~~~ 318 (319) . ++......+.+.++.+. 
|--..| T Consensus 366 ~---~~~~~~~~~~~~~~~~~~~~~~~~ 390 (390) T protein:vir:40 366 T---GLEGSPAIDVNVVNNATPSETPAE 390 (390) T ss_pred e---ccCCCCCCCcceeeCCCCCCCCCC Confidence 1 11122234444444444 222333 No 125 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=97.67 E-value=3.1e-05 Score=45.32 Aligned_cols=282 Identities=12% Similarity=0.092 Sum_probs=124.4 Q ss_pred CCcccccccceeeehhhhhhhhhcchhhhhhhHhhHH-HHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccc-ccccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVG-ILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDT-TELKD 78 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~l~~ky~~-lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~-~g~~D 78 (319) +++.++.-.-... ..+-.-..-.....++.|.. +++.+.....+ ... ++ .+...+.+.++|.... .+-.. T Consensus 95 ~~~~lr~~~~~~~----~~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l-~~~-~~--~~~~~~~~~~~~~~~~~~~~~~ 166 (389) T protein:vir:10 95 INDFIHSHGKVID----ATSKVTSTEAGVLIPEEIIYDPTAEVNSVVDL-STL-VT--KTPVTTPKGTYPILKRATDRFS 166 (389) T ss_pred HHHHhhcchhhhh----hhcccccCCcceeehHHHHHHHHHHHHhhhhH-Hhh-cc--eeeccCCeeEEEEEecCCCccc Confidence 1111110000000 00000000011223444433 33333222222 111 22 2334555677776643 22222 Q ss_pred ccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccc Q lcl|Aclame:pro 79 YKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTV 157 (319) Q Consensus 79 Y~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~~ 157 (319) +...++-....-..++..+++.-.+.-.+. .+..+-.+. ..++...+.+..+..++-..|..+++.+... ... 
T Consensus 167 ~~~E~~~~~~~~~~~~~~i~~~~~k~~~~~--~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~----~~~ 240 (389) T protein:vir:10 167 SVAELAENPKLAEPEFNKVDWSVATYRGAI--PLSEEAIADSAVDLTALVGQSIKEKSVNTYNAMIAPVLQSF----TAK 240 (389) T ss_pred cccccccccccccccceeeeeeheeeEeee--hhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhhhccc----ccc Confidence 222222222111233444444444433322 111111111 1234556666777777777777655433322 223 Q ss_pred cCCHhHHHHHHHHHHH-HHHhccCCCCcEEEEChHHHHHHhhhhhh----hhcccccccceeeeeeeeecCeEEEEeccc Q lcl|Aclame:pro 158 GTGSDAQYDAVLDVSV-ELDEIKAPENRVLFVSPTFYKGIKKFVIA----LPQGDTRQQVLGKGVQGELDGFVIVKVPTK 232 (319) Q Consensus 158 ~~T~~n~~~~i~~a~~-~Lde~~VP~~R~l~VsP~~~~~L~~~~~f----~~~~~~~~~~~~~g~Vg~idG~~I~~vps~ 232 (319) +.+...-|+.|.++.. .++.. .+-.++++|..+..|++-.+- ...... .+....|..++|.|.+|+.+++. T Consensus 241 ~~~~~~~~d~l~~~~~~~~~~~---~~a~~~~n~~~~~~L~~lkd~~G~~i~~~~~-~~~~~~~~~~~l~G~pV~~~~~~ 316 (389) T protein:vir:10 241 KTTTDTLVDSLKHILNVDLDPA---YSRALVVTQSLFNTLDTLKDKNGRYLLHDAS-DSITDGTAKGTILGVPVYVVGDT 316 (389) T ss_pred cccccccHHHHHHHHHhhhhhh---hCcEEEecHHHHHHHHHhhccCCCeeeecCc-ccccccccccccccceeEEeccc Confidence 3334445666666543 33332 145789999999988764421 111111 12223455568999999877654 Q ss_pred ccc----cceEEEEcCC-ceeeeeeeeeeeeecCCCCCccceeeeeeeeeEEEeccccceEEEEccccccCCCCC Q lcl|Aclame:pro 233 LLQ----GLQAIAVVGE-VLASPIQADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDG 302 (319) Q Consensus 233 ~~~----~~n~i~~~~~-A~~~~~k~~~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~ 302 (319) .+. +..+++|.-+ +.....+ ..+.+....+..|...++.-..+|..|++|++.. 
++.....+++++.- T Consensus 317 ~~~~~~~~~~~~~gd~~~~~~~~~~-~~~~i~~~~~~~~~~~~~~~~r~d~~~~~~~a~~-~~~~~~~~~~~~~~ 389 (389) T protein:vir:10 317 LLGSLAGDQKAFVGDLKRGVLFTDR-QQVTLAWEDSKIYGKYLGAAFRFGVQKADSKAGY-FVTNTDVPGSALGK 389 (389) T ss_pred ccCCCCCceEEEEeeccccEEEEee-cceEEEeeccccccceEEEEEEeccEEecccceE-EEEeeccCCCCCCC Confidence 433 2346666543 4433332 1233332235556777888888999999988743 33332222221111 No 126 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=97.65 E-value=3.2e-05 Score=45.22 Aligned_cols=280 Identities=10% Similarity=0.012 Sum_probs=127.4 Q ss_pred CCcccccccceeeehhhhhhhhh-cc-hhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeecccccccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKS-VE-PGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKD 78 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~-~~-~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~D 78 (319) +...++. |-.+= +.+- .. -.-+..++.+...+-+.....+.... +++ .....+..+++|.....+-.. T Consensus 117 f~~~l~~--~e~~~-----al~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~-l~~--~~~~~~~~~~~~~~~~~~~a~ 186 (425) T protein:vir:10 117 FKAHVKR--GDVQA-----ALNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQ-LCR--VQPVSKAGFSKLFNMGGTTSG 186 (425) T ss_pred HHHHhhh--hhhHH-----HhhcCcCCCCceeccHhHHHHHHHHHHhhhhhhh-hce--eeeccCCceEEEEEcCCccee Confidence 0000000 00000 0000 00 01112344443322222222222221 222 233455678888765443333 Q ss_pred ccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhh-hHHHHHHHHHHHHHHHHHHHHHHHHHH---------HH Q lcl|Aclame:pro 79 YKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDINYVVARQGAEVVAPYLDNLRFAT---------LA 148 (319) Q Consensus 79 Y~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~~~~~~~~~~~vapeiD~~~~s~---------la 148 (319) +...++-.++.-..++..+++...+...+. . +..+-... ..++.+...+.....++-.+|..++.- +. 
T Consensus 187 wv~E~~~~~~~~~~~f~~v~~~~~k~~~~i-~-iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l~G~G~~~p~Gil~ 264 (425) T protein:vir:10 187 WVGEASQRPQTNAATFQPLSFASGEIYANP-A-ATQQILDDAEIDLESWLATEVQTEFAKQEGKAFLAGDGTNKPNGLLT 264 (425) T ss_pred eeccccccccccccccceeeeeheeeEeeh-H-hHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhhcccCCCCcceeee Confidence 322222222221224455555555543332 1 12111111 245566777888888888888865531 00 Q ss_pred hccCcc------------ccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeee Q lcl|Aclame:pro 149 RNKAKH------------LTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKG 216 (319) Q Consensus 149 ~~a~~~------------~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g 216 (319) ..+... .....+....++.|+++...|..... .+-.++++|..+..|.+-.+-... -..+....+| T Consensus 265 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~-~~a~~vmn~~~~~~L~~lkD~~G~-~l~~~~~~~g 342 (425) T protein:vir:10 265 YIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPSAFT-GNARFAMNRNTQRQVRKLKDGQGN-YLWQPSYVAG 342 (425) T ss_pred ccccccccccccccccccccccccccccHHHHHHHHhhhhhhhc-cCCEEEEchHHHHHHHHhhcCCCc-eeeccCccCC Confidence 000000 00011223457888888887766432 344678999999988754321110 0112234456 Q ss_pred eeeeecCeEEEEecccc--c-ccceEEEEcC-CceeeeeeeeeeeeecCCC-CCccceeeeeeeeeEEEeccccceEEEE Q lcl|Aclame:pro 217 VQGELDGFVIVKVPTKL--L-QGLQAIAVVG-EVLASPIQADLAKTNSNIP-GMFGTLAEQLLYTGAFVPEHLQKYIFTI 291 (319) Q Consensus 217 ~Vg~idG~~I~~vps~~--~-~~~n~i~~~~-~A~~~~~k~~~~~~~~~~~-~~~~~~v~gr~~yg~~V~~~k~~~Iy~~ 291 (319) .-++|.|.+|+.+++-. . ....+++|.- .+.....+ ..+++.+.+- ..+-..++...++|..|++|++..+. . 
T Consensus 343 ~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~-~~~~v~~d~~~~~~~~~~~~~~r~d~~v~~~~A~~~l-~ 420 (425) T protein:vir:10 343 QPATLAGYPVTEVPDMPDVAANSTPILFGDFQQTYLIIDR-IGVRVLRDPYTAKPYVLFYTTKRVGGGLLNPEPMRAM-K 420 (425) T ss_pred CCceecceeeEEecCcCCccCCccEEEEEehhccEEEEEe-cceEEEecccccCCcEEEEEEEEeccEeecccceEEE-E Confidence 66789999998754311 1 1223555643 34433333 3455544211 12335777888899999999975443 3 Q ss_pred ccccc Q lcl|Aclame:pro 292 GGTEV 296 (319) Q Consensus 292 ~~~~~ 296 (319) ..+.. T Consensus 421 ~~as~ 425 (425) T protein:vir:10 421 VAASE 425 (425) T ss_pred eeccC Confidence 32222 No 127 >protein:vir:2770 Length: 318 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612887;genbank:gi:20065804;genbank:GeneID:935710 Probab=97.64 E-value=2.5e-05 Score=45.80 Aligned_cols=227 Identities=11% Similarity=0.034 Sum_probs=122.9 Q ss_pred hhhhhhcchhhhh-------------hhHhhHHHHHHHHHhhhhhhhc---------ccCcceeeeCCceEEeeeccc-- Q lcl|Aclame:pro 18 HFANKSVEPGQTL-------------LKNKHVGILERVTAVNAYSTPA---------LISNDAIFMEGRSFTVMKGDT-- 73 (319) Q Consensus 18 ~~~~~~~~~n~~~-------------l~~ky~~lld~~~~~~sl~~~~---------~~n~~~~~~~g~tVkIp~i~~-- 73 (319) |+-.-+..||-.- -.+.|...+..-..+++.--.. 
..-+++....|++|+++=+.- T Consensus 1 mt~~~~~~~~~~~~~~~ft~~~~~~~~vk~ws~~l~~~~~~~~~~~~~~g~~~~~~I~r~~dL~K~~GD~Vtf~L~~~L~ 80 (318) T protein:vir:27 1 MTTVTSAQANKLFQVALFTAANRNRSMVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80 (318) T ss_pred CCccCCCChHHHHHHHHHHHHhcCChHHHHHHHhhhhHHHhhhhhhcccCCCCCceEEEeccCCCCCccEEEEeEeeccc Confidence 3333333333111 1234444443332222211111 111344567899999875532 Q ss_pred --cccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 74 --TELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNK 151 (319) Q Consensus 74 --~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a 151 (319) ...+|....+ ..+.++..+.+++|||-|.---.=..|+..-+. .++.....+....-+....|+-.|-.|++.. T Consensus 81 g~gv~Gd~~lEG--nee~L~~~~d~l~IDq~r~~V~~gg~msqqRt~--~dlR~~ar~~L~~w~~~~~Dq~~~v~laGar 156 (318) T protein:vir:27 81 KRPTMGDERVEG--RGEDLSHADFSLKINQGRHLVDAGGRMSQQRTK--FNLASSARTLLGTYFNDLQDQCAIVHLAGAR 156 (318) T ss_pred cCccccCceeec--cccceEEEeeEEEEeeeccccccccchhhhhhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 2355654332 346678888999999987432222455555444 3445666667777777778887777776432 Q ss_pred Cc--------------------------------------cccccCCHhHH--HHHHHHHHHHHHhccC-------CC-C Q lcl|Aclame:pro 152 AK--------------------------------------HLTVGTGSDAQ--YDAVLDVSVELDEIKA-------PE-N 183 (319) Q Consensus 152 ~~--------------------------------------~~~~~~T~~n~--~~~i~~a~~~Lde~~V-------P~-~ 183 (319) +. +....++.+++ ++.|+.+...+++..- .+ . 
T Consensus 157 g~~~n~~~~~p~~~~~~~~~~~~N~v~aPt~~r~~~~g~at~~~~l~stD~~s~~lid~~~~~~~~~a~pi~PV~v~g~~ 236 (318) T protein:vir:27 157 GDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDE 236 (318) T ss_pred cccccccceEecccCccchhhhhcccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceeecccc Confidence 20 00112333332 5667777777766322 21 1 Q ss_pred -------cEEEEChHHHHHHhhhh------hhhhcccc----cccceeeeeeeeecCeEEEEecccccccceEEEEcCCc Q lcl|Aclame:pro 184 -------RVLFVSPTFYKGIKKFV------IALPQGDT----RQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEV 246 (319) Q Consensus 184 -------R~l~VsP~~~~~L~~~~------~f~~~~~~----~~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~A 246 (319) ++|+++|..+.-|+.+. ++.+.... .++.+..|.+|+++|+-|.+-|. .-|.|.+|.. T Consensus 237 ~~~~~~~yV~~~~p~q~~~Lrtdt~~~~w~d~q~~A~~r~~g~knPLF~G~~gm~ngvil~~~~~---vpIrf~~G~~-- 311 (318) T protein:vir:27 237 LHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAG---MPIRFYQGQR-- 311 (318) T ss_pred ccCCcceEEEEechHHHHHHhhcCCCHHHHHHHHHHHhcccccCCCceecceeeecCEEEeecCC---ccEEEcCCCe-- Confidence 67899999999999875 33332111 24578999999999999986432 1244543322 Q ss_pred eeeeeeee Q lcl|Aclame:pro 247 LASPIQAD 254 (319) Q Consensus 247 ~~~~~k~~ 254 (319) +.+..+. 
T Consensus 312 -v~~~~~~ 318 (318) T protein:vir:27 312 -FWYQRIT 318 (318) T ss_pred -eeeeecC Confidence 2222222 No 128 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=97.52 E-value=5.2e-05 Score=44.07 Aligned_cols=288 Identities=10% Similarity=-0.043 Sum_probs=122.6 Q ss_pred CCcc----cccccceeeehh--------hhhhh-------hhcchhhhhhhHhhH-HHHHHHHHhhhhhhhcccCcceee Q lcl|Aclame:pro 1 MNKT----IKNATGMLKLNL--------QHFAN-------KSVEPGQTLLKNKHV-GILERVTAVNAYSTPALISNDAIF 60 (319) Q Consensus 1 ~~~~----~~~~~~~~~~~~--------~~~~~-------~~~~~n~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~ 60 (319) ..+. .+...+.....+ .+.+. .........+++.+. .+++.......+.. . ++ ... T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~-~-~~--~~~ 156 (413) T protein:vir:81 81 FAKRAGDQIKQQAGGAQLNYSVGEYVAPRVKAASDPASTATLTDEFQGGYGTTWNRNIIYRRREKLVVAD-L-MD--NLT 156 (413) T ss_pred hhhhhhhHHHHHHHHHHhhhhhhhhhhhHHHhhhhhhhhcccccccccccchhhHHHHHHHHhhhhhHHh-h-cc--eee Confidence 0000 000000000000 00000 000001111222232 23333322222211 1 11 233 Q ss_pred eCCceEEeeeccccccc----cccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 61 MEGRSFTVMKGDTTELK----DYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVA 136 (319) Q Consensus 61 ~~g~tVkIp~i~~~g~~----DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~va 136 (319) ..+.++++|........ .+...++-.++.-..+....++...+.-.+. 
.+..+-.+....+...+.+..+.+++ T Consensus 157 ~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~--~iS~ell~ds~~l~~~i~~~la~~~~ 234 (413) T protein:vir:81 157 MTNTTIKYLMEKANRVVEGGFKTVAEGGKKPYMRFADFDIVTESLSKIAGLT--KITDEMIEDYDFLVSYINARLLEELA 234 (413) T ss_pred ccCCceeEEEeccccccccccceecCcccccccCcccceeeEeeeeeEEEee--hhhHHHHHHHHHHHHHHHHHHHHHHH Confidence 45667788776532221 1111111111111123344455444443332 12211111111244556677778888 Q ss_pred HHHHHHHHHH---------HHhccCccccccCCHhHHHHHHHHHHHHHHhcc-CCCCcEEEEChHHHHHHhhhhh----h Q lcl|Aclame:pro 137 PYLDNLRFAT---------LARNKAKHLTVGTGSDAQYDAVLDVSVELDEIK-APENRVLFVSPTFYKGIKKFVI----A 202 (319) Q Consensus 137 peiD~~~~s~---------la~~a~~~~~~~~T~~n~~~~i~~a~~~Lde~~-VP~~R~l~VsP~~~~~L~~~~~----f 202 (319) -.+|..++.- +...++.......+....++.+.++...+.... .+.+. ++|+|..+..|.+-.+ + T Consensus 235 ~~~d~~~l~G~G~~~~~~Gi~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~-~vmn~~~~~~l~~lkd~~G~~ 313 (413) T protein:vir:81 235 IEEERQLLLGDGTGNNLTGLLKRDGIQTLAVSNKDELADSIYKAMTNISLATPFQADA-LVINPLDYQELRLAKDANGQY 313 (413) T ss_pred HHHHHHHhccCCCCCcccccccccccccccccccchhHHHHHHHHHHhhhhccCCCcE-EEEcHHHHHHHHHhhccCCce Confidence 8888876531 111111111112245567888888887775543 34444 7899999988764332 1 Q ss_pred hhccccc--ccceeeeeeeeecCeEEEEecccccccceEEEEcCC-ceeeee-eeeeeeeecCCCC---Cccceeeeeee Q lcl|Aclame:pro 203 LPQGDTR--QQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGE-VLASPI-QADLAKTNSNIPG---MFGTLAEQLLY 275 (319) Q Consensus 203 ~~~~~~~--~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~-A~~~~~-k~~~~~~~~~~~~---~~~~~v~gr~~ 275 (319) ....... ......+..+++.|.+|+.+ ..++...+++|.-+ +..... 
+=-.+++.+...+ .+.-.++...+ T Consensus 314 l~~~~~~~~~~~~~~~~~~~l~G~pv~~s--~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r 391 (413) T protein:vir:81 314 YGGGVFQGQYGSGGIMLDPAPWGLRTVQS--QVVPVGKPVVGAFRSAASVLRKGGVRIDSTNTNVDDFENNLITVRAEER 391 (413) T ss_pred eccccccccccccccccCceecceeeEEc--CCCCcccEEEEecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEe Confidence 1111110 01112223357889999864 44455566766543 443333 2233444432111 23357888889 Q ss_pred eeEEEeccccceEEEEccccccCCC Q lcl|Aclame:pro 276 TGAFVPEHLQKYIFTIGGTEVATKR 300 (319) Q Consensus 276 yg~~V~~~k~~~Iy~~~~~~~a~~~ 300 (319) +|..+.+|++..+. ... ++++| T Consensus 392 ~d~~~~~~~a~~~l-~~~--~~~~p 413 (413) T protein:vir:81 392 VGLMVTFPEAIVQL-DVA--EVVTP 413 (413) T ss_pred eccEEecccceEEE-Eec--CCCCC Confidence 99999999875432 332 22222 No 129 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=97.51 E-value=5.3e-05 Score=44.04 Aligned_cols=286 Identities=11% Similarity=-0.016 Sum_probs=132.6 Q ss_pred CCcccccccceeeehhhhhhhhhcchhhhhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDY 79 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY 79 (319) ||..-.+.+|--.-+ .+.....--+++.+. .+++.+...+.+. .. +. .+...+++++||.+........ 
T Consensus 4 ~~e~~~~~~~~~~~~------~~~~~~~~liP~~~~~~ii~~~~~~s~l~-~l-~~--~~~~~~~~~~ip~~~~~~~a~~ 73 (338) T protein:vir:78 4 LNELAPNTAGSNHQG------RLAHVPSDLLPKEIVGPIFDKAQESSLVL-RL-GE--NIPISYGETIIPTTVKRPEVGQ 73 (338) T ss_pred hHHhhhhhccccccc------ceecccccccchHHHHHHHHHHHhhchhh-hh-cc--eeeccCCceEEEEEecCcccee Confidence 333333333311111 000000112444443 3444443333332 22 22 3556788999999865322211 Q ss_pred cC-------CCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH----- Q lcl|Aclame:pro 80 KR-------NATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATL----- 147 (319) Q Consensus 80 ~r-------~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~l----- 147 (319) .. ..+-....-+.++...++...|... .+.--+.--.....++...+.+..+.+++-.+|..++.-- T Consensus 74 v~~~~~~~~~Eg~~~~~~~~~f~~v~l~~~k~~~-~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~ 152 (338) T protein:vir:78 74 VGVGTSNEQREGGTKPLSGTAWDTRSVAPIKLAT-IVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTG 152 (338) T ss_pred ecccccccccccccccccccceeEEEEEEEEEEE-eehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcc Confidence 11 1111111122334444444444332 2222222111112455677778888899999998766311 Q ss_pred ------HhccCccc-----cccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhh-ccc-cccccee Q lcl|Aclame:pro 148 ------ARNKAKHL-----TVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALP-QGD-TRQQVLG 214 (319) Q Consensus 148 ------a~~a~~~~-----~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~-~~~-~~~~~~~ 214 (319) ...+.... .........|+.|.++...+....--....++++|..+..|.+-..... ++. ....... 
T Consensus 153 ~~~~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~~~~~ 232 (338) T protein:vir:78 153 SALQGIDTNNVIVNTTNVDYLQTGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRDANGNVDPTRINL 232 (338) T ss_pred ccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhccCCCceeeccccc Confidence 00000000 1112234578888888877765322245579999999988755432211 111 1123345 Q ss_pred eeeeeeecCeEEEEe---cccc----cccceEEEEcCCceeee-eeeeeeeeecC-------CC-----C---Cccceee Q lcl|Aclame:pro 215 KGVQGELDGFVIVKV---PTKL----LQGLQAIAVVGEVLASP-IQADLAKTNSN-------IP-----G---MFGTLAE 271 (319) Q Consensus 215 ~g~Vg~idG~~I~~v---ps~~----~~~~n~i~~~~~A~~~~-~k~~~~~~~~~-------~~-----~---~~~~~v~ 271 (319) .|..++|.|.||+.. |.+. ....-+++|.-+-.... .+--.+++.+. .+ + .+-..++ T Consensus 233 ~~~~~~l~G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r 312 (338) T protein:vir:78 233 AASAGDLLGLPVQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRVKMSDTATLTDNTSPTPQTVSMWQTNQIAIL 312 (338) T ss_pred CCCCceeeeeeEEEccccCccccccCCcccEEEEEecceEEEEeecccEEEEeecccccccccccccchhhhhcCcEEEE Confidence 667788999999863 3221 12234555554432222 12122333221 00 0 1224677 Q ss_pred eeeeeeEEEeccccceEEEEccccccCCCCC Q lcl|Aclame:pro 272 QLLYTGAFVPEHLQKYIFTIGGTEVATKRDG 302 (319) Q Consensus 272 gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~ 302 (319) .-.++|..|++|++...... +++++. 
T Consensus 313 ~~~r~d~~v~~~~a~~~l~~-----~~~~~~ 338 (338) T protein:vir:78 313 IEVTFGWLLGDKQAFVKFVD-----DEDPDA 338 (338) T ss_pred EEEEeccEeecccceEEEec-----ccCCCC Confidence 88899999999987433222 222222 No 130 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=97.45 E-value=6.4e-05 Score=43.59 Aligned_cols=281 Identities=8% Similarity=-0.015 Sum_probs=127.2 Q ss_pred CCcccccccceeeehhhhhhhhhc-----chhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSV-----EPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTE 75 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~-----~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g 75 (319) +.+-+++.. ...+..+..+.. ...-...++.+...+-+.....+.... +++ ..-.++.+.++|...... T Consensus 87 ~~~~lr~~~---~~~~~~~e~~a~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~-~~~--~~~~~~~~~~~~~~~~~~ 160 (401) T protein:vir:44 87 FVGFLRKGR---EDGLRDLERKALQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQ-EAT--VITVGGSDYKKLVNLGGT 160 (401) T ss_pred HHHHHhhhh---hhhhHHHHHHHhhcCCCCCCceeccHhHHHHHHHHHHhhhhhhh-hce--eeecCCCceEEEEecCCc Confidence 111111110 001111111110 001123445554333333333222211 222 233456777888765433 Q ss_pred cccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCc- Q lcl|Aclame:pro 76 LKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDINYVVARQGAEVVAPYLDNLRFATLARNKAK- 153 (319) Q Consensus 76 ~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~- 153 (319) ...+...++-.+..-+.++..++++..|...|. . +..+-.+. ..++...+.+..+..++-.+|..++.- .+.+. 
T Consensus 161 ~a~wv~E~~~~~~~~~~~~~~v~~~~~k~~~~~-~-iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~G--~G~~~p 236 (401) T protein:vir:44 161 ASGWVGETDTRSQTATSRLGLIEPFMGEIYGNP-Q-ATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTTG--DGTKKP 236 (401) T ss_pred cceeeccccccCccccccceeeeeehhheeeeh-h-hhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhcc--CCCCcc Confidence 333322222222222234455555555544432 2 22221111 234556667777788888888765521 00000 Q ss_pred ----------c------------ccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhccccccc Q lcl|Aclame:pro 154 ----------H------------LTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQ 211 (319) Q Consensus 154 ----------~------------~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~ 211 (319) . .+.+....-.|+.|+++...|.... ..+-+++++|..+..|.+-.+-... -..+. T Consensus 237 ~Gil~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~~~~l~~~~-~~~a~~v~n~~~~~~L~~lkd~~G~-~l~~~ 314 (401) T protein:vir:44 237 KGFLAYESTEESDKARAFGKLQHIVSGEATAVTADAIIKLIYTLRKAH-RTGAKFMMNNNSLFAIRLLKDTEGN-YLWRP 314 (401) T ss_pred ceeeccccccccccccccccccccccccccccCHHHHHHHHHhcchhh-hcCCEEEEcHHHHHHHHHhhccCCc-eeecC Confidence 0 0000111223888888888886642 2345678999999888654321110 01122 Q ss_pred ceeeeeeeeecCeEEEEeccc--ccc-cceEEEEcCC-ceeeeeeeeeeeeecCCC-CCccceeeeeeeeeEEEeccccc Q lcl|Aclame:pro 212 VLGKGVQGELDGFVIVKVPTK--LLQ-GLQAIAVVGE-VLASPIQADLAKTNSNIP-GMFGTLAEQLLYTGAFVPEHLQK 286 (319) Q Consensus 212 ~~~~g~Vg~idG~~I~~vps~--~~~-~~n~i~~~~~-A~~~~~k~~~~~~~~~~~-~~~~~~v~gr~~yg~~V~~~k~~ 286 (319) ...+|..++|.|.||+.+++- ... +.-+++|+-+ +....... .+++.+.+- ...-..++....+|+.|+++++. 
T Consensus 315 ~~~~g~~~~l~G~PVv~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~-~~~~~~~~~~~~~~v~~~a~~r~d~~~~~~~a~ 393 (401) T protein:vir:44 315 GLELGQPSSLAGYGIAENEQMPDIAADAKAIAFGNFKRGYTIVDRI-GTRILRDPYTNKPFVGFYTTKRTGGMLVDSQAI 393 (401) T ss_pred CcCCCCCceecceeeEEecCcCCccCCccEEEEeehhccEEEEEec-ceEEeeeccccCCcEEEEEEEEeccEEecccce Confidence 234566778999999865331 111 2224456543 44433332 244433211 12335677777899999999885 Q ss_pred eEEEEcccc Q lcl|Aclame:pro 287 YIFTIGGTE 295 (319) Q Consensus 287 ~Iy~~~~~~ 295 (319) .++ ...++ T Consensus 394 ~~l-~~~aa 401 (401) T protein:vir:44 394 KLL-KIAAA 401 (401) T ss_pred EEE-EeecC Confidence 432 22222 No 131 >protein:vir:4786 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:3269 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150166;swissprot:trembl:q94m45;genbank:gi:15088777;uniprot:Q94M45;genbank:GeneID:955980 Probab=97.39 E-value=2.7e-05 Score=45.60 Aligned_cols=263 Identities=13% Similarity=0.115 Sum_probs=137.7 Q ss_pred hhhhh--hcchhhhhhhHhhHHHHHHHHHhhhhhhhcccC---cceeeeCCc--eEEeeeccccccccccCCCC---ccc Q lcl|Aclame:pro 18 HFANK--SVEPGQTLLKNKHVGILERVTAVNAYSTPALIS---NDAIFMEGR--SFTVMKGDTTELKDYKRNAT---NEF 87 (319) Q Consensus 18 ~~~~~--~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n---~~~~~~~g~--tVkIp~i~~~g~~DY~r~~~---~~~ 87 (319) |=+|| .++ .|+++|.+||..++...++-...... -|-..++.. .||-..+.+ -++.|+.+.. +.. 
T Consensus 1 mp~N~n~avr----~Y~Kqf~glL~~vf~~qa~F~~~FGglQalDGV~~N~tafsvKt~D~pV-Vig~Y~TdeNvagFGt 75 (295) T protein:vir:47 1 MPSNQNNAVR----RYEKQYAGILETVFGVRAAFSNALAPIQILDGVQENSKAFSVKTNNTPV-VIGEYKTGENDGGFGD 75 (295) T ss_pred CCCCCCccch----hhhHHHHHHHHHHHhHHHHHhhhhcchhhhhCCCccceEEEEeecCcce-EeecccCCCccccccc Confidence 33332 222 47899999999999887765443222 111122222 455554443 2446765332 222 Q ss_pred CCcc---------cceeEEEEeecccce--eecchhhHHHHhhhHHH--HHHHHHHHHHHHHHHHHHHHHHHHHhccCcc Q lcl|Aclame:pro 88 DHPK---------IEETTYFLDQEKYWG--RFVDALDRKDTEGNIDI--NYVVARQGAEVVAPYLDNLRFATLARNKAKH 154 (319) Q Consensus 88 ~~~t---------~t~~tltidqdr~~~--F~VD~~D~~et~~~~~~--~~~~~~~~~~~vapeiD~~~~s~la~~a~~~ 154 (319) |+-. .-+....+.++-.|. -.||.+- .|.++.+ ++- -+.++.+-+..+|.-.-.-|...|... T Consensus 76 GTg~SsRFG~rkEi~y~dtdV~Y~~~~~iHEGiD~~T---VNnd~~aaVAdR-L~LQA~Akt~~~n~~~Gk~ls~~A~~t 151 (295) T protein:vir:47 76 NSGAQSRFGGVTEVKYENTDVNYDYTLTIHEGLDRYT---VNNDLNAAVADR-LKLQSEAQTRTVNKRIGKYLSDTATKT 151 (295) T ss_pred CCccccccCceeeEEeecccccccccchhhhcccccc---ccCChhHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhhhhh Confidence 2211 122223333333333 3454443 3333322 111 134445555666664433444555443 Q ss_pred c-cccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEEEEecccc Q lcl|Aclame:pro 155 L-TVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKL 233 (319) Q Consensus 155 ~-~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~ 233 (319) . .+.+|.+++...+-.+.+..-+.+|-.....+|+|++|.+|...+--+.+-.+.-++--+| +-++-||.|.++|... 
T Consensus 152 e~~td~t~d~V~~LF~~as~~yvn~ev~~~~~AyV~~evYnaiiD~~l~TsaK~SsaNiDeng-i~~FkGf~i~e~P~~~ 230 (295) T protein:vir:47 152 EALADFTDDKVKALFNKLSAFYTNNEVTAPITVYLRSEFYNAIVDMASVTSAKGATISLDENG-LPKYKGFTLEETPAQY 230 (295) T ss_pred hhhhcccchhHHHHHHHHHHHhhhhheeeeeEEEEchhHHHHHhccccccccccceeeeccCC-cceecceEEEeccHhh Confidence 3 3467889999999999999999899876669999999999987765443222222222344 4468899999999999 Q ss_pred cccceEEEEcC-CceeeeeeeeeeeeecCCCCCccceeeeee--eeeEEE-ecc---c-cceEEEE Q lcl|Aclame:pro 234 LQGLQAIAVVG-EVLASPIQADLAKTNSNIPGMFGTLAEQLL--YTGAFV-PEH---L-QKYIFTI 291 (319) Q Consensus 234 ~~~~n~i~~~~-~A~~~~~k~~~~~~~~~~~~~~~~~v~gr~--~yg~~V-~~~---k-~~~Iy~~ 291 (319) +..-.+.+..+ +-..+..=|...|+.+ +++-.|-+.+-+. +--+.+ ++. | ...+|-. T Consensus 231 ~q~G~~aifs~dnig~aftGIn~aR~Ie-sEdF~GValQ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 295 (295) T protein:vir:47 231 FETGVIAIFSPNGIIIPFVGISTARVIE-AENFDGVNCKLLLRVVLTLLMTIRKQFTKLQELLYRR 295 (295) T ss_pred ccCCcEEEEccccceeecccceeeeeee-cccccchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 97655555444 3344556677777776 5665553222110 000000 000 0 0001111 No 132 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=97.38 E-value=8e-05 Score=43.05 Aligned_cols=280 Identities=11% Similarity=0.035 Sum_probs=119.0 Q ss_pred CCccccc----ccceeeeh--------hhhhhhhhcch-------hhhhhhHhhHH-HHHHHHHhhhhhhhcccCcceee Q lcl|Aclame:pro 1 MNKTIKN----ATGMLKLN--------LQHFANKSVEP-------GQTLLKNKHVG-ILERVTAVNAYSTPALISNDAIF 60 (319) Q Consensus 1 ~~~~~~~----~~~~~~~~--------~~~~~~~~~~~-------n~~~l~~ky~~-lld~~~~~~sl~~~~~~n~~~~~ 60 (319) +++.++. ...+.... ........... ...-.++.|.. +++.+.....+.. 
+++ ..- T Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~--~~~--~~~ 166 (394) T protein:vir:97 91 VNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKP--FTT--VYQ 166 (394) T ss_pred HHHHHHHHHHHhhhhhhhhhHHHHHHHHHhhhhhhhhccccccccccccChHHHHHHHHHHhhhhhhhhh--hce--eee Confidence 0000000 00000000 00000000000 11123344432 2322222222211 111 123 Q ss_pred eCCceEEeeeccccccccccCCCCcccCC-cccceeEEEEeecccceeecchhhHHHHhh-hHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 61 MEGRSFTVMKGDTTELKDYKRNATNEFDH-PKIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDINYVVARQGAEVVAPY 138 (319) Q Consensus 61 ~~g~tVkIp~i~~~g~~DY~r~~~~~~~~-~t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~~~~~~~~~~~vape 138 (319) ..+.+.++|.....+-.-+....+-...+ -+.++..++++-.+.-.+ |.--+ +-.+. ..++.+.+.+..+..++-. T Consensus 167 ~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~-i~is~-ell~ds~~~~~~~i~~~la~~~~~~ 244 (394) T protein:vir:97 167 AKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGA-IPLSQ-ESIDDADVDLVGIVSESISQIKVNT 244 (394) T ss_pred ccCcceEEEEEecCCCccceecccccccccccccceeEEeehhheeee-hhhHH-HHHhhhhHHHHHHHHHHHHHHHHHH Confidence 34556777766433222111122212211 223445555555544332 22222 11111 1234555666666677766 Q ss_pred HHHHHHHHHHhccCccccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeee Q lcl|Aclame:pro 139 LDNLRFATLARNKAKHLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQ 218 (319) Q Consensus 139 iD~~~~s~la~~a~~~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~V 218 (319) .|..++..+.+. ....+.+ ++.|.++...+-... .+-.++++|..+..|.+-.+-.... 
.......+|.- T Consensus 245 ~~~~i~~g~~~~---~~~~~~~----~~~~~~~~~~~~~~~--~~a~~v~n~~~~~~l~~lkd~~G~~-i~~~~~~~~~~ 314 (394) T protein:vir:97 245 TNDAIAKVLKSF---TTKTVKN----LDEIKALLNGGFDPA--YNVSLIVSQSFYQTLDTLKDGNGRY-LLQDDITAVSG 314 (394) T ss_pred HHHHHhhccccc---ccccccc----HHHHHHHHHhhhhhh--hCCEEEEcHHHHHHHHHhhccCCCe-eeecCcCCCCC Confidence 666544322111 1111222 344444443221111 2345789999998887543211100 01112334555 Q ss_pred eeecCeEEEEecccccccceEEEEcCC--ceeeeeeeeeeeeecCCCCCccceeeeeeeeeEEEeccccceEEEEccccc Q lcl|Aclame:pro 219 GELDGFVIVKVPTKLLQGLQAIAVVGE--VLASPIQADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEV 296 (319) Q Consensus 219 g~idG~~I~~vps~~~~~~n~i~~~~~--A~~~~~k~~~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~ 296 (319) ++|.|++|+.+++.......+++|.-+ ..++..+--.++.. ....+...+++..++|..|.+|+.... +....++ T Consensus 315 ~~l~G~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~r~d~~v~~~~a~~~-~~~~~~~ 391 (394) T protein:vir:97 315 KVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWA--DNEIYGQYLQAVLRFGVSKVDDKAGYY-VTFTPEP 391 (394) T ss_pred ceeccceeEEecccccCCccEEEeeccccEEEEEecceEEEEe--cccccceeEEEEEEEccEEecccceEE-EEecccc Confidence 689999999888766666667777632 23443443344432 244567788999999999999987322 2222222 Q ss_pred cCC Q lcl|Aclame:pro 297 ATK 299 (319) Q Consensus 297 a~~ 299 (319) ++- T Consensus 392 ~p~ 394 (394) T protein:vir:97 392 LPL 394 (394) T ss_pred cCC Confidence 221 No 133 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=97.37 E-value=8.3e-05 Score=42.95 Aligned_cols=288 Identities=8% Similarity=-0.099 Sum_probs=118.9 Q ss_pred CCcccccccceeeehhhhhhhhhcchhhhhhhHhh-HHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKH-VGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDY 79 (319) Q Consensus 1 
~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~l~~ky-~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY 79 (319) +|+.-.. -.|..+.+-=..--.....--+++.+ ..+++.+.....+.. ++. ....++++.+||......-..+ T Consensus 3 ~~~~r~~--~~~~~~e~~a~~~~~~~~g~~ip~~~~~~ii~~~~~~s~i~~--~~~--~~~~~~~~~~~p~~~~~~~a~~ 76 (326) T protein:vir:42 3 VNPDRTT--PFLGVNDPKVAQTGDSMFEGYLEPEQAQDYFAEAEKISIVQQ--FAQ--KIPMGTTGQKIPHWTGDVSASW 76 (326) T ss_pred CCccchh--hhcCcchhhheeccccCCcceechhhHHHHHHHHHhcchhhh--hcc--eeeccCCceEEEEEeCCcceEE Confidence 2221100 00101000000000000111123333 234444444333322 122 2344677899998865433333 Q ss_pred cCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--------c Q lcl|Aclame:pro 80 KRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARN--------K 151 (319) Q Consensus 80 ~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~--------a 151 (319) . ..+-.....+.++.+.++.-.|.-. .|.--+.--......+...+.++.+.+++-.+|...+.---++ . T Consensus 77 v-~Eg~~~~~~~~~f~~i~~~~~k~~~-~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p~gi~~~~ 154 (326) T protein:vir:42 77 I-GEGDMKPITKGNMTSQTIAPHKIAT-IFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAINGTDSPFPTFLAQTT 154 (326) T ss_pred e-cCCccccccccceeEEEEeeEEEEE-eehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccc Confidence 2 2222222233445555555444322 2322221111112456677788888899999998766210000 0 Q ss_pred ---Ccccc--ccCCHhHHH-HH-HHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhh----cccccccceeeeeeee Q lcl|Aclame:pro 152 ---AKHLT--VGTGSDAQY-DA-VLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALP----QGDTRQQVLGKGVQGE 220 (319) Q Consensus 152 ---~~~~~--~~~T~~n~~-~~-i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~----~~~~~~~~~~~g~Vg~ 220 (319) ..... ...+.+-.+ +. +..+...+.... -.+-.++++|..+..|++-.+-.. .............-+. 
T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~ 233 (326) T protein:vir:42 155 KEVSLVDPDGTGSNADLTVYDAVAVNALSLLVNAG-KKWTHTLLDDITEPILNGAKDKSGRPLFIESTYTEENSPFRLGR 233 (326) T ss_pred cccceeecccccccccchhHHHHHHHHHhhhhhhc-cCccEEEEeHHHHHHHHHhhccCCceeeccccccCccccccCce Confidence 00000 011111111 21 222222333222 234567899999998875332111 0011111112223357 Q ss_pred ecCeEEEEecccccccceEEEEcCC-ceeeeeeeeeeeeecC-------CCC--------CccceeeeeeeeeEEEeccc Q lcl|Aclame:pro 221 LDGFVIVKVPTKLLQGLQAIAVVGE-VLASPIQADLAKTNSN-------IPG--------MFGTLAEQLLYTGAFVPEHL 284 (319) Q Consensus 221 idG~~I~~vps~~~~~~n~i~~~~~-A~~~~~k~~~~~~~~~-------~~~--------~~~~~v~gr~~yg~~V~~~k 284 (319) +.|++|+.+++-...+.-++.|.-+ +.....+--.+++.+. ++. ++...++...++|++|.+|+ T Consensus 234 l~G~pv~~~~~~~~~~~~~~~Gd~s~~~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v~~~~ 313 (326) T protein:vir:42 234 IVARPTILSDHVASGTVVGYQGDFRQLVWGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHCNDKD 313 (326) T ss_pred eeeeeEEEcCCCCCCceEEEEeecceEEEEEecceEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEeccc Confidence 8999998654322222222222222 1111111112222110 111 13367899999999999998 Q ss_pred cceEEEEcccccc Q lcl|Aclame:pro 285 QKYIFTIGGTEVA 297 (319) Q Consensus 285 ~~~Iy~~~~~~~a 297 (319) +......+..+++ T Consensus 314 a~~~l~~~~~~~~ 326 (326) T protein:vir:42 314 AFVKLTNVDATEA 326 (326) T ss_pred ceEEEeeccccCC Confidence 8544444433333 No 134 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=97.34 E-value=7.7e-05 Score=43.13 Aligned_cols=274 Identities=12% Similarity=0.044 Sum_probs=132.4 Q ss_pred CCcccccccceeeehhhhhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEee-eccccccc-c Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVM-KGDTTELK-D 78 (319) Q Consensus 1 
~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp-~i~~~g~~-D 78 (319) |-..-.- -.=||-+-.+ -...-+|.+.++|+.-+++++..- .+-|-.--..|.++|++ ++...+-. | T Consensus 1 ~~~~~~~----~e~nlt~~~d-l~~~~siDf~~~f~~~i~~L~~~L------Gv~r~~pla~GstIkt~k~~~y~gda~d 69 (296) T protein:vir:98 1 MVTSRTY----PEENLIKSTD-LKYPITIDVTNKFQENISKLLEML------GVTRKISVSEGMTLKTYAGYDVTLAEGN 69 (296) T ss_pred CCCcccc----CcCCCcchhh-hhhhhhhhhHHHHhhhHHHHHHHh------hhcccccccCCCEEeeccceeeeecccc Confidence 1000000 0011111111 112446778888876666664322 12233344459999774 57655433 4 Q ss_pred ccCCCCcccCCcccc---eeEEEEeecccceeecchhhHH-HHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc Q lcl|Aclame:pro 79 YKRNATNEFDHPKIE---ETTYFLDQEKYWGRFVDALDRK-DTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH 154 (319) Q Consensus 79 Y~r~~~~~~~~~t~t---~~tltidqdr~~~F~VD~~D~~-et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~ 154 (319) --....+....++.+ ..++++. ||+.= + .|++ |..+...+.....++...+++..+|...|+.+....++. T Consensus 70 VaEGe~Iplskvt~~~~~t~t~~ik--K~rK~-t--TdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~LktaT~t~ 144 (296) T protein:vir:98 70 VPEGEVIPLSKVERKIHSEKKIELK--KYRKA-T--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQ 144 (296) T ss_pred ccCCcccchhhheeeecceEEEEee--ccccc-c--CHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhccccee Confidence 433334455556654 3455554 44432 2 4666 556667778888999999999999999999875544332 Q ss_pred ccccCC-HhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEEEEecccc Q lcl|Aclame:pro 155 LTVGTG-SDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKL 233 (319) Q Consensus 155 ~~~~~T-~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~ 233 (319) ...+-+ ...+...+.++..++.+.+ ..+.++||+|.=...++++..+..+...|-.-+. .+.|.+|+.++.-. 
T Consensus 145 ~~t~~~lQ~Ala~~~~~l~~~feded-~~~~V~FVnP~D~a~ylg~a~it~qt~fG~tyl~-----nfLG~~II~S~kV~ 218 (296) T protein:vir:98 145 DALGAGLQGALASAWGKLQVLFEDYG-SERAIVFANSLDVAEYIAKAGITTQTAFGLTYLV-----DFTGTVIISTNDVT 218 (296) T ss_pred eechhhHHHHHHHHhhhhhhhccccC-CCceEEEEehHHHHHHhcCCccchhheechhhhh-----hccccEEEEcCcCC Confidence 211111 1223445566666776642 2478999999877777777666543333322222 26677777543211 Q ss_pred --------cccceEEEEcCC----ceeeee---eeeeeeeecCCCCCccceeeeeeeeeEEEeccccceEEEEcccccc Q lcl|Aclame:pro 234 --------LQGLQAIAVVGE----VLASPI---QADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVA 297 (319) Q Consensus 234 --------~~~~n~i~~~~~----A~~~~~---k~~~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a 297 (319) ..++++.=..++ +..|.. +.--+-+.. ++...---++-+..-|...+---.+||....-++-. T Consensus 219 ~G~~~~T~~~Ni~~ay~~~~~~~l~~~f~~~~d~tglIGv~h-~~~~~~~t~eT~~~~~~~lfpE~~dgiv~~tI~~~~ 296 (296) T protein:vir:98 219 KGEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNH-FQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) T ss_pred CceEEEeeecceEEEeecccccchhhhhccccccccceEEEe-ccccceeeehhHhHhHHHhcccccceEEEEEecCCC Confidence 112333322221 111111 111111111 111111122222233444444455677665432222 No 135 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=97.33 E-value=9.4e-05 Score=42.66 Aligned_cols=278 Identities=12% Similarity=-0.011 Sum_probs=110.7 Q ss_pred CC------cccccccceee--ehhhhhhhh---------h----c-chhhhhhhHhhH-HHHHHHHHhhhhhhhcccCcc Q lcl|Aclame:pro 1 MN------KTIKNATGMLK--LNLQHFANK---------S----V-EPGQTLLKNKHV-GILERVTAVNAYSTPALISND 57 (319) Q Consensus 1 ~~------~~~~~~~~~~~--~~~~~~~~~---------~----~-~~n~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~ 57 (319) .+ +..++-.+.++ +.-+.+... . + .-.-.-+++-+. .+++.+.....+.. 
+++ T Consensus 78 ~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~--~~~-- 153 (387) T protein:vir:96 78 YQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLRE--KAR-- 153 (387) T ss_pred CCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhh--hce-- Confidence 00 00000000000 000000000 0 0 000112333332 23333333332211 111 Q ss_pred eeeeCCceEEeeeccccccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhh-hHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 58 AIFMEGRSFTVMKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDINYVVARQGAEVVA 136 (319) Q Consensus 58 ~~~~~g~tVkIp~i~~~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~~~~~~~~~~~va 136 (319) +.-.+ ..++|.+...+-..+-...+-.....+.+...++++..+...|. . +..+-... ..++...+.+..++.++ T Consensus 154 ~~~~~--~~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i-~-iS~ell~ds~~~l~~~i~~~la~~~~ 229 (387) T protein:vir:96 154 LTNIK--GLEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFA-A-ISDTVIHGSDVDLVNWVENALQSGLA 229 (387) T ss_pred eeecC--CceeeeeeccCCccccccccccccccccccceeeechheeeeec-h-hhHHHHhhhHHHHHHHHHHHHHHHHH Confidence 12222 24567664332111111112222222344455556656555542 2 22111111 12334445555555555 Q ss_pred HHHHHHHHHHHHhccC---c---cccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHH-HHHHhhhhhhhhccccc Q lcl|Aclame:pro 137 PYLDNLRFATLARNKA---K---HLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTF-YKGIKKFVIALPQGDTR 209 (319) Q Consensus 137 peiD~~~~s~la~~a~---~---~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~-~~~L~~~~~f~~~~~~~ 209 (319) -..+...|........ . .....++.++.|+.|.++...|+.+-.+...| +|++.. ..++.+-+. 
+ T Consensus 230 ~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d~i~~~~~~l~~~y~~na~~-imn~~t~~~~~~~~~~-------~ 301 (387) T protein:vir:96 230 AKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINALADLHEDYRDNATI-YMRYADYVKIISVLSN-------G 301 (387) T ss_pred HHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHHHhccChhhhcCCEE-EEechHHHHHHHHHhc-------C Confidence 4444444432111100 0 11123355677999999998887754444454 566554 454443221 1 Q ss_pred ccceeeeeeeeecCeEEEEecccccccceEEEEcCC-ceeeeeeeeeeeeecCCCCCccceeeeeeeeeEEEeccccceE Q lcl|Aclame:pro 210 QQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGE-VLASPIQADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYI 288 (319) Q Consensus 210 ~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~-A~~~~~k~~~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~I 288 (319) +..+..|.=..|.|.||+.+.. +.. +++|.-+ +...... .....++ .....-..++....+|++|++|++..+ T Consensus 302 ~~~~~~~~~~~llG~PV~~~~~--~~~--~~~GDf~~~~~~~~~-~~~~~~~-~~~~~~~~~~~~~r~Dg~v~~~~A~~~ 375 (387) T protein:vir:96 302 TTNFFDTPAEKVFGKPVVFTDA--AVK--PIVGDFNYFGINYDG-TTYDTDK-DVKKGEYLFVLTAWYDQQRTLDSAFRI 375 (387) T ss_pred CCcccccCCccccccceEEecC--CCc--eeeechhhhhhhhhh-hhheecc-cccCCceEEEEEEEeCcEeechhheEE Confidence 1223334345788999987543 222 3343322 1111110 0111111 112233567788899999999998543 Q ss_pred EEEccccccCCCC Q lcl|Aclame:pro 289 FTIGGTEVATKRD 301 (319) Q Consensus 289 y~~~~~~~a~~~~ 301 (319) +....+++..++ T Consensus 376 -l~~ka~~~~~~~ 387 (387) T protein:vir:96 376 -AKAKENTGPLPS 387 (387) T ss_pred -EEeecCCCCCCC Confidence 344333433333 No 136 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=97.33 E-value=9.4e-05 Score=42.66 Aligned_cols=278 Identities=12% Similarity=-0.011 Sum_probs=110.7 Q ss_pred CC------cccccccceee--ehhhhhhhh---------h----c-chhhhhhhHhhH-HHHHHHHHhhhhhhhcccCcc Q lcl|Aclame:pro 1 MN------KTIKNATGMLK--LNLQHFANK---------S----V-EPGQTLLKNKHV-GILERVTAVNAYSTPALISND 57 (319) Q Consensus 1 
~~------~~~~~~~~~~~--~~~~~~~~~---------~----~-~~n~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~ 57 (319) .+ +..++-.+.++ +.-+.+... . + .-.-.-+++-+. .+++.+.....+.. +++ T Consensus 78 ~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~--~~~-- 153 (387) T protein:vir:94 78 YQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLRE--KAR-- 153 (387) T ss_pred CCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhh--hce-- Confidence 00 00000000000 000000000 0 0 000112333332 23333333332211 111 Q ss_pred eeeeCCceEEeeeccccccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhh-hHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 58 AIFMEGRSFTVMKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDINYVVARQGAEVVA 136 (319) Q Consensus 58 ~~~~~g~tVkIp~i~~~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~~~~~~~~~~~va 136 (319) +.-.+ ..++|.+...+-..+-...+-.....+.+...++++..+...|. . +..+-... ..++...+.+..++.++ T Consensus 154 ~~~~~--~~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i-~-iS~ell~ds~~~l~~~i~~~la~~~~ 229 (387) T protein:vir:94 154 LTNIK--GLEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFA-A-ISDTVIHGSDVDLVNWVENALQSGLA 229 (387) T ss_pred eeecC--CceeeeeeccCCccccccccccccccccccceeeechheeeeec-h-hhHHHHhhhHHHHHHHHHHHHHHHHH Confidence 12222 24567664332111111112222222344455556656555542 2 22111111 12334445555555555 Q ss_pred HHHHHHHHHHHHhccC---c---cccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHH-HHHHhhhhhhhhccccc Q lcl|Aclame:pro 137 PYLDNLRFATLARNKA---K---HLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTF-YKGIKKFVIALPQGDTR 209 (319) Q Consensus 137 peiD~~~~s~la~~a~---~---~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~-~~~L~~~~~f~~~~~~~ 209 (319) -..+...|........ . .....++.++.|+.|.++...|+.+-.+...| +|++.. ..++.+-+. 
+ T Consensus 230 ~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d~i~~~~~~l~~~y~~na~~-imn~~t~~~~~~~~~~-------~ 301 (387) T protein:vir:94 230 AKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINALADLHEDYRDNATI-YMRYADYVKIISVLSN-------G 301 (387) T ss_pred HHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHHHhccChhhhcCCEE-EEechHHHHHHHHHhc-------C Confidence 4444444432111100 0 11123355677999999998887754444454 566554 454443221 1 Q ss_pred ccceeeeeeeeecCeEEEEecccccccceEEEEcCC-ceeeeeeeeeeeeecCCCCCccceeeeeeeeeEEEeccccceE Q lcl|Aclame:pro 210 QQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGE-VLASPIQADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYI 288 (319) Q Consensus 210 ~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~-A~~~~~k~~~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~I 288 (319) +..+..|.=..|.|.||+.+.. +.. +++|.-+ +...... .....++ .....-..++....+|++|++|++..+ T Consensus 302 ~~~~~~~~~~~llG~PV~~~~~--~~~--~~~GDf~~~~~~~~~-~~~~~~~-~~~~~~~~~~~~~r~Dg~v~~~~A~~~ 375 (387) T protein:vir:94 302 TTNFFDTPAEKVFGKPVVFTDA--AVK--PIVGDFNYFGINYDG-TTYDTDK-DVKKGEYLFVLTAWYDQQRTLDSAFRI 375 (387) T ss_pred CCcccccCCccccccceEEecC--CCc--eeeechhhhhhhhhh-hhheecc-cccCCceEEEEEEEeCcEeechhheEE Confidence 1223334345788999987543 222 3343322 1111110 0111111 112233567788899999999998543 Q ss_pred EEEccccccCCCC Q lcl|Aclame:pro 289 FTIGGTEVATKRD 301 (319) Q Consensus 289 y~~~~~~~a~~~~ 301 (319) +....+++..++ T Consensus 376 -l~~ka~~~~~~~ 387 (387) T protein:vir:94 376 -AKAKENTGPLPS 387 (387) T ss_pred -EEeecCCCCCCC Confidence 344333433333 No 137 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=97.33 E-value=9.4e-05 Score=42.66 Aligned_cols=278 Identities=12% Similarity=-0.011 Sum_probs=110.7 Q ss_pred CC------cccccccceee--ehhhhhhhh---------h----c-chhhhhhhHhhH-HHHHHHHHhhhhhhhcccCcc Q lcl|Aclame:pro 1 MN------KTIKNATGMLK--LNLQHFANK---------S----V-EPGQTLLKNKHV-GILERVTAVNAYSTPALISND 57 (319) Q Consensus 1 
~~------~~~~~~~~~~~--~~~~~~~~~---------~----~-~~n~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~ 57 (319) .+ +..++-.+.++ +.-+.+... . + .-.-.-+++-+. .+++.+.....+.. +++ T Consensus 78 ~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~--~~~-- 153 (387) T protein:vir:26 78 YQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLRE--KAR-- 153 (387) T ss_pred CCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhh--hce-- Confidence 00 00000000000 000000000 0 0 000112333332 23333333332211 111 Q ss_pred eeeeCCceEEeeeccccccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhh-hHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 58 AIFMEGRSFTVMKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDINYVVARQGAEVVA 136 (319) Q Consensus 58 ~~~~~g~tVkIp~i~~~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~~~~~~~~~~~va 136 (319) +.-.+ ..++|.+...+-..+-...+-.....+.+...++++..+...|. . +..+-... ..++...+.+..++.++ T Consensus 154 ~~~~~--~~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i-~-iS~ell~ds~~~l~~~i~~~la~~~~ 229 (387) T protein:vir:26 154 LTNIK--GLEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFA-A-ISDTVIHGSDVDLVNWVENALQSGLA 229 (387) T ss_pred eeecC--CceeeeeeccCCccccccccccccccccccceeeechheeeeec-h-hhHHHHhhhHHHHHHHHHHHHHHHHH Confidence 12222 24567664332111111112222222344455556656555542 2 22111111 12334445555555555 Q ss_pred HHHHHHHHHHHHhccC---c---cccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHH-HHHHhhhhhhhhccccc Q lcl|Aclame:pro 137 PYLDNLRFATLARNKA---K---HLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTF-YKGIKKFVIALPQGDTR 209 (319) Q Consensus 137 peiD~~~~s~la~~a~---~---~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~-~~~L~~~~~f~~~~~~~ 209 (319) -..+...|........ . .....++.++.|+.|.++...|+.+-.+...| +|++.. ..++.+-+. 
+ T Consensus 230 ~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d~i~~~~~~l~~~y~~na~~-imn~~t~~~~~~~~~~-------~ 301 (387) T protein:vir:26 230 AKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINALADLHEDYRDNATI-YMRYADYVKIISVLSN-------G 301 (387) T ss_pred HHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHHHhccChhhhcCCEE-EEechHHHHHHHHHhc-------C Confidence 4444444432111100 0 11123355677999999998887754444454 566554 454443221 1 Q ss_pred ccceeeeeeeeecCeEEEEecccccccceEEEEcCC-ceeeeeeeeeeeeecCCCCCccceeeeeeeeeEEEeccccceE Q lcl|Aclame:pro 210 QQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGE-VLASPIQADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYI 288 (319) Q Consensus 210 ~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~-A~~~~~k~~~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~I 288 (319) +..+..|.=..|.|.||+.+.. +.. +++|.-+ +...... .....++ .....-..++....+|++|++|++..+ T Consensus 302 ~~~~~~~~~~~llG~PV~~~~~--~~~--~~~GDf~~~~~~~~~-~~~~~~~-~~~~~~~~~~~~~r~Dg~v~~~~A~~~ 375 (387) T protein:vir:26 302 TTNFFDTPAEKVFGKPVVFTDA--AVK--PIVGDFNYFGINYDG-TTYDTDK-DVKKGEYLFVLTAWYDQQRTLDSAFRI 375 (387) T ss_pred CCcccccCCccccccceEEecC--CCc--eeeechhhhhhhhhh-hhheecc-cccCCceEEEEEEEeCcEeechhheEE Confidence 1223334345788999987543 222 3343322 1111110 0111111 112233567788899999999998543 Q ss_pred EEEccccccCCCC Q lcl|Aclame:pro 289 FTIGGTEVATKRD 301 (319) Q Consensus 289 y~~~~~~~a~~~~ 301 (319) +....+++..++ T Consensus 376 -l~~ka~~~~~~~ 387 (387) T protein:vir:26 376 -AKAKENTGPLPS 387 (387) T ss_pred -EEeecCCCCCCC Confidence 344333433333 No 138 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=97.31 E-value=9.8e-05 Score=42.55 Aligned_cols=301 Identities=10% Similarity=-0.063 Sum_probs=134.5 Q ss_pred CCcccccccceeeehhhhhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeecccccccccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYK 80 (319) Q Consensus 1 
~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~ 80 (319) |--.-.+.....+.-- .. -.+..| +....+++++.....+.. ++. .+..++.+++||+.....-..+. T Consensus 1 ~g~~~e~~~~~~~~t~-~~-~g~l~~------~~~~~ii~~l~~~s~i~~--l~~--~~~~~~~~~~ip~~~~~~~a~wv 68 (397) T protein:vir:23 1 MGFSADHSQIAQTKDT-MF-TGYLDP------VQAKDYFAEAEKTSIVQR--VAQ--KIPMGATGIVIPHWTGDVSAQWI 68 (397) T ss_pred CCcCHHHHHHhhccCC-CC-ccccch------hHHHHHHHHHHhccchhh--hcc--eeeccCCceEEEEEcCCcceEEe Confidence 3222222211111000 00 001111 122344555544333322 222 24456778999998765444443 Q ss_pred CCCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc-------Cc Q lcl|Aclame:pro 81 RNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNK-------AK 153 (319) Q Consensus 81 r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a-------~~ 153 (319) ..+-....-+.++...++.-.|...+ +.--+.--......+...+.++.+.+++-.+|+..+.---... .. T Consensus 69 -~Eg~~~~~s~~~f~~v~l~~~k~~~~-v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~~~~~~~~ 146 (397) T protein:vir:23 69 -GEGDMKPITKGNMTKRDVHPAKIATI-FVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGTNAPSAFQGYLDQS 146 (397) T ss_pred -cCCccccccccceeEEEEeeEEEEEe-ehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccCCcccccccccc Confidence 22222333344555555555554443 3332321111224567888899999999999997663111100 00 Q ss_pred cccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhh----cccccccceeeeeeeeecCeEEEEe Q lcl|Aclame:pro 154 HLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALP----QGDTRQQVLGKGVQGELDGFVIVKV 229 (319) Q Consensus 154 ~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~----~~~~~~~~~~~g~Vg~idG~~I~~v 229 (319) ......+....++.+.++...|.++..+ +-.++++|..+..|++-.+-.. ...........+..+++.|.+|+.. 
T Consensus 147 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~-~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~tl~G~Pv~~s 225 (397) T protein:vir:23 147 NKTQSISPNAYQGLGVSGLTKLVTDGKK-WTHTLLDDTVEPVLNGSVDANGRPLFVESTYESLTTPFREGRILGRPTILS 225 (397) T ss_pred cceeeecccchhHHHHHHHHhhhhcccC-CCEEEEcHHHHHHHHHhhccCCceeecccccccccccccCceeeeeeEEEe Confidence 1111223344567777888888876554 3458999999988876432111 1111122233344568999999875 Q ss_pred cccccccceEEEEcCC-ceeeeeeeeeeeeecC--------CC-------CCccceeeeeeeeeEEEeccccceEEEEcc Q lcl|Aclame:pro 230 PTKLLQGLQAIAVVGE-VLASPIQADLAKTNSN--------IP-------GMFGTLAEQLLYTGAFVPEHLQKYIFTIGG 293 (319) Q Consensus 230 ps~~~~~~n~i~~~~~-A~~~~~k~~~~~~~~~--------~~-------~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~ 293 (319) ++-...++-++++..+ +.....+--.+++.+. +. .++.-.+|-..++|..|.+|++... +... T Consensus 226 ~~~~~g~~~~~~gDfs~~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~-~~~~ 304 (397) T protein:vir:23 226 DHVAEGDVVGYAGDFSQIIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVK-LTFD 304 (397) T ss_pred CCCCCCceEEEEeecceEEEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccceecccceEE-Eeec Confidence 4322222333333322 2221111112222221 00 0112366777899999999998432 2211 Q ss_pred cc-ccCC--CCCcccccccccccc-cccc-----------C Q lcl|Aclame:pro 294 TE-VATK--RDGVDAHADNVAKPS-GSLE-----------M 319 (319) Q Consensus 294 ~~-~a~~--~~~~~~~~~~~~~~~-~~~~-----------~ 319 (319) .. .... ..+.+.. +-+++. 
|.++ + T Consensus 305 ~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~a~~~~~ 343 (397) T protein:vir:23 305 PVLTTYALDLDGASAG--NFTLSLDGKTSANIAYNASTATV 343 (397) T ss_pred cccceeeecccccCcc--eEEEEecCccccCcccccchhhh Confidence 11 0000 0111111 111111 1111 1 No 139 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=97.30 E-value=0.0001 Score=42.48 Aligned_cols=274 Identities=9% Similarity=-0.052 Sum_probs=137.8 Q ss_pred cccccceee------ehhhhhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeecccc-ccc Q lcl|Aclame:pro 5 IKNATGMLK------LNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTT-ELK 77 (319) Q Consensus 5 ~~~~~~~~~------~~~~~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~-g~~ 77 (319) .-|.||..+ +.+-++.+ +|- -+..+-..+++..+....+ -+++.-.++..|+..+..-. ... T Consensus 1 ~~~~~~i~s~~~~~~itv~~ll~---~P~--~I~~~i~e~~~~~~iad~l------f~~~~a~~~~~v~f~~~~p~~~~~ 69 (318) T protein:vir:10 1 MTAPTGIVSVSDGPAITVRELVG---NPL--WIPTALKKMMVNQFISESL------FRNGGANPNGVVAYNEGNPSFLED 69 (318) T ss_pred CCCCCcceeeecCCceehHHhhC---Cch--hHHHHHHHHHhccchhhhh------hhcccccccceeEEEecccccccC Confidence 222333322 22222211 111 1111112222222211111 12222334556666544322 123 Q ss_pred ccc-CCCCccc--CCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc Q lcl|Aclame:pro 78 DYK-RNATNEF--DHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH 154 (319) Q Consensus 78 DY~-r~~~~~~--~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~ 154 (319) |+. +..+-++ ...+.....+-.-+..+..|.|- |+.+..+.....+-..++++-.++-.+|.-.+..|.+..... 
T Consensus 70 d~e~VaEggEiP~~~~~~G~~~ia~~~K~G~~~~vS--~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~ 147 (318) T protein:vir:10 70 DVADVAEFGEIPVSAGARGLPRTAFAVKKALGVRVS--KEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPT 147 (318) T ss_pred cHhhccCcccccccCCCCCchhhhhhehhccceecc--HHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Confidence 332 2222222 22222222222222444444443 666665666777777788888999999999888775543222 Q ss_pred ccccC-------CHhH---HHHHHHHHHHHHHh-------ccCC-CCcEEEEChHHHHHHhhhhhhhhcccc-----ccc Q lcl|Aclame:pro 155 LTVGT-------GSDA---QYDAVLDVSVELDE-------IKAP-ENRVLFVSPTFYKGIKKFVIALPQGDT-----RQQ 211 (319) Q Consensus 155 ~~~~~-------T~~n---~~~~i~~a~~~Lde-------~~VP-~~R~l~VsP~~~~~L~~~~~f~~~~~~-----~~~ 211 (319) .+.+. +..+ +.+.+..+.-.+.. .+.. ..-.++|.|..+..|.+++.+.+.-.- +.. T Consensus 148 ~~~s~~w~~~~~~~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~~ 227 (318) T protein:vir:10 148 LAVPTAWDNGGKVRTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENFMKVYERNANYVSTA 227 (318) T ss_pred ccCCcCCCCcccccccchhhhhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhhhhhhhccchhhhhc Confidence 22111 1112 23333322222211 1222 345799999999999999988653321 111 Q ss_pred ceeeeee-eeecCeEEEEecccccccceEEEEcCCceeeee---eeeeeeeecCCC-------CCccceeeeeeeeeEEE Q lcl|Aclame:pro 212 VLGKGVQ-GELDGFVIVKVPTKLLQGLQAIAVVGEVLASPI---QADLAKTNSNIP-------GMFGTLAEQLLYTGAFV 280 (319) Q Consensus 212 ~~~~g~V-g~idG~~I~~vps~~~~~~n~i~~~~~A~~~~~---k~~~~~~~~~~~-------~~~~~~v~gr~~yg~~V 280 (319) ....|-+ |++.|++|+.+| ..+.-..+++.++.+.+.. .+... 
..++ + ....|.++.+++-..-| T Consensus 228 ~~~tg~~~g~~lGl~vi~s~--~~p~~~alvlq~g~vG~~~d~~pl~~t-~~~~-egg~~~g~~~~s~~~~~~~~~~~~V 303 (318) T protein:vir:10 228 PDWTGNFPGSVMGLNVIRSR--TFPIDRVLIMERGTVGFYSDTRPLQFT-ALYP-EGNGPNGGPTESYRADASHKRALAV 303 (318) T ss_pred ccccccccceeeceEEeecC--ccCCCeeEEEecCCcceeeccccceee-eccc-CCCCCCCCcchhhheehheeeeeee Confidence 2233444 578999998654 3334446777787777664 22222 2221 2 23569999999999999 Q ss_pred eccccceEEEEcccc Q lcl|Aclame:pro 281 PEHLQKYIFTIGGTE 295 (319) Q Consensus 281 ~~~k~~~Iy~~~~~~ 295 (319) .+||+........++ T Consensus 304 ~~PkA~~~itgi~~~ 318 (318) T protein:vir:10 304 DQPKAALWLTGIVTP 318 (318) T ss_pred eCcceeEEEeeccCC Confidence 999986666677766 No 140 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=97.18 E-value=0.00014 Score=41.71 Aligned_cols=281 Identities=12% Similarity=0.009 Sum_probs=119.5 Q ss_pred CCcccccccceeeehhhhhhhhh-------c-----chhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEe Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKS-------V-----EPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTV 68 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~-------~-----~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkI 68 (319) .+...+...-+.....+.|+... . .......++.|...+-+.....+..... 
++..-...+..++.+ T Consensus 61 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~-~~~~~~~~~~~~~~~ 139 (371) T protein:vir:81 61 DKEPLKPTVQVKENEVEAFVNHIRTRFRNAMSEGSNQDGGYTVPQDIQTRINELRESKDALQNL-ITVEPVTTLSGSRVF 139 (371) T ss_pred cccccccchhhHHHHHHHHHHHHHHHHHHhhccCCCccCceeecHhHHHHHHHHHHhhhhhhhh-ceeeeccCCceeEEE Confidence 11000000001111122222110 0 1111224555544333333333322222 221111223345667 Q ss_pred eeccccccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhh-hHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 69 MKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDINYVVARQGAEVVAPYLDNLRFATL 147 (319) Q Consensus 69 p~i~~~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~~~~~~~~~~~vapeiD~~~~s~l 147 (319) +.....+-..+...++-.++.-+.+...++++-.|.-.+. . +..+-.+. ...+...+.+..+.+++-.+|..++. T Consensus 140 ~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~-~-iS~ell~ds~~~l~~~i~~~l~~a~~~~~~~~i~~-- 215 (371) T protein:vir:81 140 KKRSQQTGFVEVAEGAAIGEKATPQFTLLQYQVKKYAGFF-R-VTNELLNDSTEAIVNTLVRWIGDESRVTRNGLIIN-- 215 (371) T ss_pred EeecCCcceeeeccccccccccccceeeEEeeeeEEEEee-h-hhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHh-- Confidence 7665544343332222222212233444444444433322 1 22211111 13445666677777777777765444 Q ss_pred HhccCccccccCCHhHHHHHHHHHH-HHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEE Q lcl|Aclame:pro 148 ARNKAKHLTVGTGSDAQYDAVLDVS-VELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVI 226 (319) Q Consensus 148 a~~a~~~~~~~~T~~n~~~~i~~a~-~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I 226 (319) +.++....+. .-++.+..+. ..|... +-.+-.++++|..+..|++-.+-.... 
........|..++|.|.+| T Consensus 216 --g~g~~~~~~~---~~~~~i~~~~~~~l~~~-~~~~a~~vmn~~~~~~L~~lkd~~g~~-l~~~~~~~~~~~~l~G~pV 288 (371) T protein:vir:81 216 --VLNTKAKTAI---ADLDGLKQIINVQLDPV-FRSTSSVIVNQDAFNWLDTLKDQNGQY-LLQPSISSPTGRQLLGLPV 288 (371) T ss_pred --hccccccccc---ccHHHHHHHHHhhcchh-hhcCCEEEEcHHHHHHHHHhhccCCCe-eeecccCCCCCceecceeE Confidence 2222222221 1234444433 234332 223456889999999887643211110 1122234566789999999 Q ss_pred EEecccc----------cccceEEEEcCC-ceeeeeeeeeeeeecCCCC-----CccceeeeeeeeeEEEeccccceEEE Q lcl|Aclame:pro 227 VKVPTKL----------LQGLQAIAVVGE-VLASPIQADLAKTNSNIPG-----MFGTLAEQLLYTGAFVPEHLQKYIFT 290 (319) Q Consensus 227 ~~vps~~----------~~~~n~i~~~~~-A~~~~~k~~~~~~~~~~~~-----~~~~~v~gr~~yg~~V~~~k~~~Iy~ 290 (319) +.+++-. .....+++|.-. +.....+ ..+++-..++. .+...++...++|..+.+|+...+ + T Consensus 289 ~~~~~~~~~~~~~~~~~~~~~~i~~Gd~~~~~~~~~~-~~~~i~~~~~~~~~f~~~~v~~~~~~r~d~~~~~~~a~~~-~ 366 (371) T protein:vir:81 289 VIVSNKVLANRVDGGTGAQFAPIIVGDLKEAVVMFDR-QRTEIMSSNVAMDAFETDATLWRAIERMDVKMRDDEAFVF-G 366 (371) T ss_pred EEecccccCccccccccCCcceEEEEehhceEEEEee-cceEEEEeccccchhhcCceEEEEEEeeccEEecccceEE-E Confidence 9764321 112346666533 2333322 22222222222 233588888999999999987432 2 Q ss_pred Ecccc Q lcl|Aclame:pro 291 IGGTE 295 (319) Q Consensus 291 ~~~~~ 295 (319) ...++ T Consensus 367 ~~~~A 371 (371) T protein:vir:81 367 EVQLA 371 (371) T ss_pred EEecC Confidence 33222 No 141 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=97.16 E-value=0.00015 Score=41.61 Aligned_cols=284 Identities=11% Similarity=-0.022 Sum_probs=124.8 Q ss_pred CC-cccccccceee-------------------ehhhhhhhhh---cchhh--hhhhHhhHHHHHHHHHhhhhhhhcccC Q lcl|Aclame:pro 1 MN-KTIKNATGMLK-------------------LNLQHFANKS---VEPGQ--TLLKNKHVGILERVTAVNAYSTPALIS 55 (319) Q Consensus 1 
~~-~~~~~~~~~~~-------------------~~~~~~~~~~---~~~n~--~~l~~ky~~lld~~~~~~sl~~~~~~n 55 (319) +. +..+.+-+... ....+.+.-. ..+.. ....+-|...+.+.....+..... ++ T Consensus 64 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~-~~ 142 (379) T protein:vir:10 64 LDVKLKEKAKSEDKSDSLVKSITENFNDIKEVRNGKSIQVKAVGDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDI-VG 142 (379) T ss_pred HHHHHHhcccccccchhHHHHHHHHHHhHHHHHhhhhhhhhhhcccccCCCCccccchhhhhHHHHhHHhhhhHHhh-ce Confidence 00 00000000000 0000000000 00110 012223322222222222211111 11 Q ss_pred cceeeeCCceEEeeeccccc-cccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHH Q lcl|Aclame:pro 56 NDAIFMEGRSFTVMKGDTTE-LKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEV 134 (319) Q Consensus 56 ~~~~~~~g~tVkIp~i~~~g-~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~ 134 (319) ..-..+.+++||+....+ -..+.-..+-.....+.+...+++.-.++-.+. .+..+-.+....+...+....+.. T Consensus 143 --~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~--~iS~ell~D~~~l~~~i~~~la~~ 218 (379) T protein:vir:10 143 --AVSISGGTYTFVRENGAGEGAIGAQVEGATKGQKDYDISMIDVNTDFIAGFT--RYSKKMANNLPFLTSFIPNALRRD 218 (379) T ss_pred --eeeccCCceEEEEeecCCCcccccccCCccccccccceeeeEeeeeeEEeee--hhhHHHHhhHHHHHHHHHHHHHHH Confidence 123356789999765322 122111222222222334445555555554443 122222221122345555666677 Q ss_pred HHHHHHHHHHHHHHhccCccccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccc-c-c Q lcl|Aclame:pro 135 VAPYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQ-Q-V 212 (319) Q Consensus 135 vapeiD~~~~s~la~~a~~~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~-~-~ 212 (319) ++-.+|...++-+.+. +.....+.+....++.|.++...+...+.+.+. ++++|..+..|.+-.+-.... ..+ + . 
T Consensus 219 ~~~~~~~~~~~g~~~~-~~~~~~~~~~~~~~d~i~~~~~~~~~~~~~~~~-~vmn~~~~~~l~~lkd~~G~~-l~~~~~~ 295 (379) T protein:vir:10 219 YAKAENAAFNAVLAAN-ATASTEIITNKNKVEMLINEIAKQENLDFPVTA-IVLRPTDYYDILVTQKSVGAG-YGLPGVV 295 (379) T ss_pred HHHHHHHHHhcccccc-cccccccccCcccHHHHHHHHHhhhhccCCCCE-EEEcHHHHHHHHHhhccCCce-eccCCcc Confidence 7777777655432222 112223334455678888888888887776554 678999998886543211110 011 1 1 Q ss_pred eeeeeeeeecCeEEEEecccccccceEEEEcCCcee-eeeeeeeeeeecCCCCCc---cceeeeeeeeeEEEeccccceE Q lcl|Aclame:pro 213 LGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLA-SPIQADLAKTNSNIPGMF---GTLAEQLLYTGAFVPEHLQKYI 288 (319) Q Consensus 213 ~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~-~~~k~~~~~~~~~~~~~~---~~~v~gr~~yg~~V~~~k~~~I 288 (319) ...|...++.|++|+.+ ..++.-.+++|.-+... ...+--.+++.+...+.| -..++...-+|+.|.+|++. + T Consensus 296 ~~~~~~~~l~G~pvv~s--~~~~ag~~~~gdf~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~R~~~~v~~p~a~-v 372 (379) T protein:vir:10 296 TQDNGVLRINGIPLFRA--TWLAANKYYVGDWTRVTKVTTEGLSLEFSEVEGTNFVKNNITARIEAQVALAVEQPAAL-I 372 (379) T ss_pred CCCCCcceecceeeEec--CCCCCCceEEeecccEEEEEEeceEEEEeecccccccCCcEEEEEEEEeccEEecCccE-E Confidence 12344457999999864 44555456666554432 222222344433222223 34777777899999999873 4 Q ss_pred EEEccccc Q lcl|Aclame:pro 289 FTIGGTEV 296 (319) Q Consensus 289 y~~~~~~~ 296 (319) ++... +. 
T Consensus 373 ~~~~~-~~ 379 (379) T protein:vir:10 373 FGDFT-AV 379 (379) T ss_pred EEEec-CC Confidence 44432 11 No 142 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=97.16 E-value=0.00015 Score=41.58 Aligned_cols=293 Identities=12% Similarity=-0.033 Sum_probs=126.0 Q ss_pred CCcccccccceeeehh--------hhhhhh--------hcc--hhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeC Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNL--------QHFANK--------SVE--PGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFME 62 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~--------~~~~~~--------~~~--~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~ 62 (319) .+...++..++-.... +.|... ... ....-+++.|...+-+.....+.... +++..-...+ T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~-~~~~~~~~~~ 149 (395) T protein:vir:38 71 LNAEPVNKKPLPVKDGKPDAQAMKNQFVKDFKNLVTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLES-LANVENVTTS 149 (395) T ss_pred hhhccccccccchhhhhHHHHHHHHHHHHHHHHHHhhccCccCCCceecchhHhhHHHHHHHhhcchhh-hcceeeccCC Confidence 1111111111111110 111100 000 11122344443222222222222111 1221111122 Q ss_pred CceEEeeecccc-ccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhh-hHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 63 GRSFTVMKGDTT-ELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDINYVVARQGAEVVAPYLD 140 (319) Q Consensus 63 g~tVkIp~i~~~-g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~~~~~~~~~~~vapeiD 140 (319) ...+.++..... +...+...++-..+..+.+...+++.-.|.-.+. .+..+-.+. 
..++.+.+.+..+..++-.+| T Consensus 150 ~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~v~~~~~k~~~~~--~iS~ell~ds~~~l~~~i~~~la~~~~~~~~ 227 (395) T protein:vir:38 150 HGSRVYEKLADITPLKDLDDESALIGDNDDPELTVVKYLIHRYAGIT--TVTNTLLKDTVDNIIQWLVNWAAKKDVVTRN 227 (395) T ss_pred cceEEEEeeccCCccccccccccccccccccceeeEEeeeeeeEeeh--hhHHHHHhhhHHHHHHHHHHHHHHHHHHHHH Confidence 334555555432 3334333333323333344555666655554442 233222221 234566677777778887777 Q ss_pred HHHHHHHHhccCccccccCCHhHHHHHHHHHHH-HHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeee Q lcl|Aclame:pro 141 NLRFATLARNKAKHLTVGTGSDAQYDAVLDVSV-ELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQG 219 (319) Q Consensus 141 ~~~~s~la~~a~~~~~~~~T~~n~~~~i~~a~~-~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg 219 (319) .-++.-. +.+.......+ |+.|.++.. .|+.. +-.+-.++++|..+..|++-.+-.... .......+|..+ T Consensus 228 ~~il~g~--g~~~~~~~~~~----~~~i~~~~~~~l~~~-~~~~a~~v~n~~~~~~L~~lkd~~G~~-l~~~~~~~~~~~ 299 (395) T protein:vir:38 228 AKILEVM--GKAPKKPTISQ----FDNIKDLENNTLDPA-IESTSSFITNQSGYNILSKVKDADGRY-LMQPDVTSPDKY 299 (395) T ss_pred HHHhhcc--ccccccccccc----HHHHHHHHHHhhhhh-hcCCCEEEEcHHHHHHHHHhhccCCce-eeccCcCCCCcc Confidence 7654311 11122222223 444554443 34332 223557899999999987643211100 112223456667 Q ss_pred eecCeEEEEeccccc----ccceEEEEcCC-ceeeeee-eeeeeeecCCCC---CccceeeeeeeeeEEEeccccceEEE Q lcl|Aclame:pro 220 ELDGFVIVKVPTKLL----QGLQAIAVVGE-VLASPIQ-ADLAKTNSNIPG---MFGTLAEQLLYTGAFVPEHLQKYIFT 290 (319) Q Consensus 220 ~idG~~I~~vps~~~----~~~n~i~~~~~-A~~~~~k-~~~~~~~~~~~~---~~~~~v~gr~~yg~~V~~~k~~~Iy~ 290 (319) +|.|++|+.+++..+ .+..+++|.-+ +.....+ =-.++..+.... .+...++...++|..|++|++..+.. 
T Consensus 300 ~l~G~pV~~~~~~~~~~~~~~~~i~~gd~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~ 379 (395) T protein:vir:38 300 LIDGKPVIRIADKWLPDVSGSHPLYFGDLKQGITLFDRQQMQIDTTNVGAGSFEHDTTKLRFIDRFDVQLIDDGAFAAAS 379 (395) T ss_pred eeccceeEEecccccCcCCCcceEEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEE Confidence 899999997654322 23456777644 4443332 223443331111 23457788888999999998854433 Q ss_pred Ecc--ccccCCCCCcc Q lcl|Aclame:pro 291 IGG--TEVATKRDGVD 304 (319) Q Consensus 291 ~~~--~~~a~~~~~~~ 304 (319) -.. +.++++...|. T Consensus 380 ~~~~~~~~~~~~~~~~ 395 (395) T protein:vir:38 380 FKTVANQAQGTAGTGK 395 (395) T ss_pred eecccCCCCCccCCCC Confidence 221 11111111222 No 143 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=97.08 E-value=0.00018 Score=41.11 Aligned_cols=292 Identities=9% Similarity=-0.062 Sum_probs=127.0 Q ss_pred CCcccccccce---e----eehhhhhhhhhcchh----------------hhhhhHhhHHHHHHHHHhhhhhhhcccCcc Q lcl|Aclame:pro 1 MNKTIKNATGM---L----KLNLQHFANKSVEPG----------------QTLLKNKHVGILERVTAVNAYSTPALISND 57 (319) Q Consensus 1 ~~~~~~~~~~~---~----~~~~~~~~~~~~~~n----------------~~~l~~ky~~lld~~~~~~sl~~~~~~n~~ 57 (319) +++.+.....+ . ....+.|. +|...+ -.-.++.|...+-+.....+.... +++. 
T Consensus 102 ~~~~~~~~~~~~~~~~~~~~e~r~a~~-~~l~~~~~~~e~~a~~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~-~~~~- 178 (434) T protein:vir:62 102 ISASIAAALSTKGHRTNKETEIRSVFA-NYIVGNIDEKEARALGLVTGNGSVTIPDFLSKEIITYAQEENFLRR-LGTG- 178 (434) T ss_pred HHHHHHhhhhhccccchHHHHHHHHHH-HHhccccchhhhhhhcccccccceecchhhHHHHHHhhhhhhhhhh-hcce- Confidence 11111111000 0 00111121 111100 011234443322233222222211 2221 Q ss_pred eeeeCCceEEeeeccccccccc--cCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 58 AIFMEGRSFTVMKGDTTELKDY--KRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVV 135 (319) Q Consensus 58 ~~~~~g~tVkIp~i~~~g~~DY--~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~v 135 (319) +.. +..+++|.....+-... ....+-....-+.++...+++-.+.-.+ +.--+.--.....++...+.+..++.+ T Consensus 179 -~~~-~~~~~~p~~~~~~~a~~~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~-~~iS~ell~ds~~~l~~~i~~~la~~~ 255 (434) T protein:vir:62 179 -VKT-KENIKYPVLVKKAEAQGHKNERTNNEMPETDIEFDEIELSPTEFDAL-ATVTKKLLARTGLPIEQIVMDELKKAY 255 (434) T ss_pred -ecc-CCceEEEEEecCCcccceecccccccccccccceeeEEeeheeeEee-hhhHHHHHhcchHHHHHHHHHHHHHHH Confidence 222 33578887654332221 1111111111122344444444443332 221111110112345667778888888 Q ss_pred HHHHHHHHHHHHHhcc-------CccccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccc Q lcl|Aclame:pro 136 APYLDNLRFATLARNK-------AKHLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDT 208 (319) Q Consensus 136 apeiD~~~~s~la~~a-------~~~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~ 208 (319) +-.+|..++.---.+. ....+...+....++.|+++...|+...-+ +-.++++|..+..|.+-.+-....-. 
T Consensus 256 ~~~~d~~~l~G~G~~~~~~g~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~~-~a~~v~n~~~~~~L~~lkd~~G~~l~ 334 (434) T protein:vir:62 256 VRKETQYMVNGDEANNINDGALAKKAVEFKTDEKNLYDALVKMKNTPVKEVRK-KARWVLNTAALTKIETMKTDDGFPLL 334 (434) T ss_pred HHHHHHHHhccCCCCccccceeecccccccccccchhhHHHHHHhhcchhhhc-CCEEEEcHHHHHHHHHhhccCCCEee Confidence 8888887652100000 011122234456799999999888775434 44568899999888654321111000 Q ss_pred ccc-ceeeeeeeeecCeEEEEecccc---cccce-EEEEcCCceeeeeeeeeeeeecCCCCCcc-c--eeeeeeeeeEEE Q lcl|Aclame:pro 209 RQQ-VLGKGVQGELDGFVIVKVPTKL---LQGLQ-AIAVVGEVLASPIQADLAKTNSNIPGMFG-T--LAEQLLYTGAFV 280 (319) Q Consensus 209 ~~~-~~~~g~Vg~idG~~I~~vps~~---~~~~n-~i~~~~~A~~~~~k~~~~~~~~~~~~~~~-~--~v~gr~~yg~~V 280 (319) ... ....|.-.+|.|.+|+.++.-. ..+.. ++.|.-+..........+++-...+..+. . .++.....|+++ T Consensus 335 ~~~~~~~~g~~~tl~G~pV~~~~~~~~~~~~~~~~i~~Gdfs~~~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~ 414 (434) T protein:vir:62 335 RPFNQAEGGIGYTLLGFPVEEEDAIDIPDSPDTPVFYFGDFSKFYIQDVIGSLEVQKLVELFSRTNRVGFRIWNLLDAQL 414 (434) T ss_pred ccCCCccCCCCceecceeeEEecCccCccCCCceEEEEeeccceEEEEeeceeEEEeehhhhcccCceEEEEEeeeccee Confidence 001 1223444579999998754321 11122 44455554433333333333332222222 2 456667778886 Q ss_pred e-ccccceEEEEccccccCC Q lcl|Aclame:pro 281 P-EHLQKYIFTIGGTEVATK 299 (319) Q Consensus 281 ~-~~k~~~Iy~~~~~~~a~~ 299 (319) + .|....||-....++..+ T Consensus 415 i~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:62 415 IHSPFEVPVYKYVLKAPTGA 434 (434) T ss_pred ecCcccceEEEEEeccCCCC Confidence 5 599988886654433332 No 144 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=96.93 E-value=0.00025 Score=40.32 Aligned_cols=277 Identities=12% Similarity=0.024 Sum_probs=109.8 Q ss_pred CCc-cccccc-----ceeeehhhhhhhh------hc----ch-hhhhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeC Q lcl|Aclame:pro 1 
MNK-TIKNAT-----GMLKLNLQHFANK------SV----EP-GQTLLKNKHV-GILERVTAVNAYSTPALISNDAIFME 62 (319) Q Consensus 1 ~~~-~~~~~~-----~~~~~~~~~~~~~------~~----~~-n~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~ 62 (319) ++. ..+... .++....+....+ .. .. .-...++-+. .+++.+.....+.. +++ +.-.+ T Consensus 98 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~--~~~--v~~~~ 173 (402) T protein:vir:93 98 DNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLRE--KAR--LTNIK 173 (402) T ss_pred hhHHHHHHHHHHHHHHHhhhhHHHHHHhHHHHHhhhccCCCcCCccccchhHHHHHHHhHHhhhhhhh--hce--eeecC Confidence 110 000000 0000000000000 00 00 0112333332 33433333333211 111 12222 Q ss_pred CceEEeeeccccc-cccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhh-hHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 63 GRSFTVMKGDTTE-LKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDINYVVARQGAEVVAPYLD 140 (319) Q Consensus 63 g~tVkIp~i~~~g-~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~~~~~~~~~~~vapeiD 140 (319) ..++|.+...+ -..+. ..+-.....+.+...++++..+...+. . +..+-... ..++...+.+..+++++-..+ T Consensus 174 --~~~~p~~~~~~~~a~~v-~Eg~~~~~~~~~f~~i~~~~~k~~~~i-~-iS~ell~Ds~~~l~~~i~~~la~~~~~~e~ 248 (402) T protein:vir:93 174 --GLEIPRVSYTLDDDDFI-TDVETAKELKAKGDTVKFTTNKFKVFA-A-ISDTVIHGSDVDLVNWVENALQSGLAAKER 248 (402) T ss_pred --CceeeeeeccCCccccc-cccccccccccccceeeecceeeeeec-h-hhHHHHhhhHHHHHHHHHHHHHHHHHHHHH Confidence 24567664332 12222 122222222334455555555554432 1 11111111 123344455555555555444 Q ss_pred HHHHHHHHhcc---Cc---cccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChH-HHHHHhhhhhhhhcccccccce Q lcl|Aclame:pro 141 NLRFATLARNK---AK---HLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPT-FYKGIKKFVIALPQGDTRQQVL 213 (319) Q Consensus 141 ~~~~s~la~~a---~~---~~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~-~~~~L~~~~~f~~~~~~~~~~~ 213 (319) ...|....... +. .....++.++.|+.|.++...|+..-.....| +|++. +..++++-+. 
++..+ T Consensus 249 ~~~~~~g~g~g~p~g~~~~~~~~~~~~~~~~d~l~~~~~~l~~~y~~na~~-imn~~t~~~~~~~~~d-------~~~~~ 320 (402) T protein:vir:93 249 KDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINALADLHEDYRDNATI-YMRYADYVKIISVLSN-------GTTNF 320 (402) T ss_pred HhHhhcCCCccccceeeeccccccccccchHHHHHHHHhccChhhhcCCEE-EEechHHHHHHHHHhc-------CCCcc Confidence 43343211100 00 11122355677999999998887754344454 56655 4555543221 11222 Q ss_pred eeeeeeeecCeEEEEecccccccceEEEEcCC-ceeeeeeeeeeeeecCCCCCccceeeeeeeeeEEEeccccceEEEEc Q lcl|Aclame:pro 214 GKGVQGELDGFVIVKVPTKLLQGLQAIAVVGE-VLASPIQADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIG 292 (319) Q Consensus 214 ~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~-A~~~~~k~~~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~ 292 (319) ..|.=.+|.|.||+.+.. ++. +++|.-+ +..-... ..++.++. ....-..++...++|.+|++|++..| +.. T Consensus 321 ~~~~~~~llG~PV~~t~~--~~~--i~~GDf~~~~~~~~~-~~~~~~~~-~~~~~~~~~~~~r~Dg~v~~~~A~~~-l~i 393 (402) T protein:vir:93 321 FDTPAEKVFGKPVVFTDA--AVK--PIVGDFNYFGINYDG-TTYDTDKD-VKKGEYLFVLTAWYDQQRTLDSAFRI-AKA 393 (402) T ss_pred cccCCccccccceEEecC--CCc--eeeechhhhhhhhhh-hhhhhhhc-ccCCceEEEEEEEeCcEEechhheEE-EEe Confidence 233334688999997543 332 2333221 1111111 11112221 12233577888899999999988544 333 Q ss_pred cccccCCCC Q lcl|Aclame:pro 293 GTEVATKRD 301 (319) Q Consensus 293 ~~~~a~~~~ 301 (319) ..+++..++ T Consensus 394 k~~~~~~~~ 402 (402) T protein:vir:93 394 KENTGPLPS 402 (402) T ss_pred ecCCCCCCC Confidence 333333333 No 145 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=96.90 E-value=0.00027 Score=40.14 Aligned_cols=282 Identities=8% Similarity=-0.057 Sum_probs=125.2 Q ss_pred ehhhhhhhh---hcchhhh---------hhhHhh-HHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeecccccccccc Q lcl|Aclame:pro 14 LNLQHFANK---SVEPGQT---------LLKNKH-VGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYK 80 (319) Q 
Consensus 14 ~~~~~~~~~---~~~~n~~---------~l~~ky-~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~ 80 (319) |-.+.|.++ .++.+.+ .+...+ +.+++++...+.+- ..++ .......+.+||.++..+..... T Consensus 1 ~~~k~~~~~l~~~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l--~~i~--v~~v~~~~~~i~~~~~~~~~~~~ 76 (321) T protein:vir:31 1 MASRTINNDLSRITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEETPLL--DAIR--TETVGAKKTRIPTLNIGERHRRP 76 (321) T ss_pred CchHHHHHHHHHHHHhccccccccCCcceeCHHHHHHHHHHHHHhhhhh--hhce--eeeccCcceeeeeeccCCccccc Confidence 222222221 1111211 222222 44555544432221 1222 24445566778887654433322 Q ss_pred CCCC---cccCCcccceeEEEEeecccce-eecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHH-HHHHhcc---- Q lcl|Aclame:pro 81 RNAT---NEFDHPKIEETTYFLDQEKYWG-RFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRF-ATLARNK---- 151 (319) Q Consensus 81 r~~~---~~~~~~t~t~~tltidqdr~~~-F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~-s~la~~a---- 151 (319) ...+ .....++....++.+.+=.... ..=+-+| +....-++.+.+....+.+++-.++...+ +.-.+.. T Consensus 77 ~~e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~--d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~~~~ 154 (321) T protein:vir:31 77 QDEGEWNENESDVSTGTIDISTEKATVAWDLPREVVQ--ENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDSFEN 154 (321) T ss_pred ccccccccccccceeeeeeeeeEEEEeehhccHHHHH--hhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCcccc Confidence 2111 1122344444455444322111 1111122 11111234555566666666655555433 2111000 Q ss_pred ---C--------ccccccCCHhHHHHHHHHHHHHHHhccCC-CCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeee Q lcl|Aclame:pro 152 ---A--------KHLTVGTGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQG 219 (319) Q Consensus 152 ---~--------~~~~~~~T~~n~~~~i~~a~~~Lde~~VP-~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg 219 (319) | .......+....++.|.++...|++.--. .+.+++|+++.+..+++.-. .+....++..+..|... 
T Consensus 155 ~n~G~l~~a~~~~~~~~~~~~~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~-~~~~~~~~~~l~~~~~~ 233 (321) T protein:vir:31 155 QNDGFITVAEGDVETIDAADDILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLT-DRDTPLGDNVIMGEADV 233 (321) T ss_pred cchhhhhhhccccccccccccccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHHh-cCCCccccchhhccccc Confidence 0 00001112234467788888888775332 35578999998765432111 11223445556677777 Q ss_pred eecCeEEEEecccccccceEEEEcCCceeeeeeee-eeeeecC-CCCCcc-cee--eeeeeeeEEEeccccceEEEEccc Q lcl|Aclame:pro 220 ELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQAD-LAKTNSN-IPGMFG-TLA--EQLLYTGAFVPEHLQKYIFTIGGT 294 (319) Q Consensus 220 ~idG~~I~~vps~~~~~~n~i~~~~~A~~~~~k~~-~~~~~~~-~~~~~~-~~v--~gr~~yg~~V~~~k~~~Iy~~~~~ 294 (319) ++.|.+|+.+| .|++-.+++++..-++.....+ .++..+. .+..+. ..+ ..+.-.|+-|-+..+.++...... T Consensus 234 tl~G~pvv~~~--~mP~~~il~t~~~nl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~~~a~a~~~~i~~ 311 (321) T protein:vir:31 234 NPFSFPIIGSG--LWPDDKAMFTDPQNLIYALYRDLEIDVLTESDKVSERDLHARYFMRGDDDFAIENTEAVVLAEGLGD 311 (321) T ss_pred cccceeEEEcC--CCCCCcEEEeccccEEEEEeeccEEEEeecCccccccceeeEeeeeeecceeEeccccEEEEecCCc Confidence 89999998765 5777778888887766544322 2332221 111111 112 233334444455555545444544 Q ss_pred cccCCCCCcc Q lcl|Aclame:pro 295 EVATKRDGVD 304 (319) Q Consensus 295 ~~a~~~~~~~ 304 (319) +..+-..... 
T Consensus 312 ~~~~~~~~~~ 321 (321) T protein:vir:31 312 PLEHLEEETS 321 (321) T ss_pred chhcccCCCC Confidence 3333222211 No 146 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=96.87 E-value=0.00028 Score=40.02 Aligned_cols=266 Identities=10% Similarity=-0.086 Sum_probs=124.2 Q ss_pred eeeehhhhhhhhhcchhhhhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCceEEeeecccccccccc-CCCCcccC Q lcl|Aclame:pro 11 MLKLNLQHFANKSVEPGQTLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYK-RNATNEFD 88 (319) Q Consensus 11 ~~~~~~~~~~~~~~~~n~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~-r~~~~~~~ 88 (319) |-+. .....-.++.|. .+++.+...+.+. .+++ .+...+..++||.+....-..+. .++..... T Consensus 1 mat~----------~~gg~lvP~~~~~~ii~~~~~~s~i~--~~~~--~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~ 66 (311) T protein:vir:81 1 MVAL----------ATGTFQLPKHLVPGVWQKAQGQSVLA--RLSM--AEPQEFGEQQYMTLTAPPRGEVVGEGAQKSES 66 (311) T ss_pred Ccee----------cCCceEcchhHHHHHHHHHHhcchhh--hhcc--eeecCCCceEEEEEeCCceeEEeecCcccccc Confidence 1111 111122233332 3444443333222 1222 24456678999998654433332 22233333 Q ss_pred CcccceeEEEEeecccceeecchhhHHH--H-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc----------- Q lcl|Aclame:pro 89 HPKIEETTYFLDQEKYWGRFVDALDRKD--T-EGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH----------- 154 (319) Q Consensus 89 ~~t~t~~tltidqdr~~~F~VD~~D~~e--t-~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~----------- 154 (319) +++.+..++.. .|.-. .+.--++-- + .....+.+.+.+..+++++..+|...+.---.+.+.. 
T Consensus 67 ~~~f~~v~l~~--~kl~~-~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~ 143 (311) T protein:vir:81 67 TATFAPVTAIP--RKVQV-TQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDT 143 (311) T ss_pred cceeeEEEEee--EEEEE-eehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCccccccccccccc Confidence 44444444433 33222 222222110 0 0112345667788888899899887653211111110 Q ss_pred ----ccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEEEEe- Q lcl|Aclame:pro 155 ----LTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKV- 229 (319) Q Consensus 155 ----~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~v- 229 (319) .....++...+..|.++...+...+... ..++++|..+..|.+-.+-.... ..+.....+..+++.|.+|+.. T Consensus 144 ~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~-~~~vmn~~~~~~l~~lkd~~G~~-l~~~~~~~~~~~tl~G~Pv~~~~ 221 (311) T protein:vir:81 144 TNIVELTTGTSATPDLAVEAAVGLVLGDNLSP-DGVALDNTFSFMLATQRDSQGRK-LYPELGFGTDVASFAGLNAAVSD 221 (311) T ss_pred ceeeeecccccchHHHHHHHHHHHhhhcCCCc-eEEEEcHHHHHHHHhhhccCCCe-eecCccccCCCceecceeEEecc Confidence 0111233455666777777777666543 34789999998887643211100 1122334566788999999852 Q ss_pred --ccccc-------------ccceEEEEcCCc-eeeeeeeeeeeeecCC-CC-------CccceeeeeeeeeEEEecccc Q lcl|Aclame:pro 230 --PTKLL-------------QGLQAIAVVGEV-LASPIQADLAKTNSNI-PG-------MFGTLAEQLLYTGAFVPEHLQ 285 (319) Q Consensus 230 --ps~~~-------------~~~n~i~~~~~A-~~~~~k~~~~~~~~~~-~~-------~~~~~v~gr~~yg~~V~~~k~ 285 (319) |.... ....+++|.-+- .+-..+=-.+++.+.. 
++ ++.-.++...++|..|.+|++ T Consensus 222 ~i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a 301 (311) T protein:vir:81 222 TVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDA 301 (311) T ss_pred cccccccccccccchhcccCCccEEEEEecccEEEEEeccceEEEeccCCCCcchhhhhcCcEEEEEEEEeccEeecccc Confidence 21110 112233443322 1111221233333211 10 122477888999999999987 Q ss_pred ceEEEEcccccc Q lcl|Aclame:pro 286 KYIFTIGGTEVA 297 (319) Q Consensus 286 ~~Iy~~~~~~~a 297 (319) .... .+...| T Consensus 302 ~~~l--~~a~~~ 311 (311) T protein:vir:81 302 FAVV--RDADES 311 (311) T ss_pred eEEE--EeeccC Confidence 4433 222222 No 147 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=96.84 E-value=0.0003 Score=39.88 Aligned_cols=282 Identities=13% Similarity=-0.005 Sum_probs=128.6 Q ss_pred CCcc-cccccceeeehhhhhhh-----hh-----cchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEee Q lcl|Aclame:pro 1 MNKT-IKNATGMLKLNLQHFAN-----KS-----VEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVM 69 (319) Q Consensus 1 ~~~~-~~~~~~~~~~~~~~~~~-----~~-----~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp 69 (319) +.+. .+...|-...+-..-.. +. ......-.++.|...+-+.....+..... 
++..-...+..++.|| T Consensus 94 ~~~a~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~-~~~~~~~~~~~~~~~~ 172 (397) T protein:vir:12 94 YSKAFLKGLRGKRLTDEERDLLDSPEFRAMSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQY-VTVEPVTTRSGTRLLE 172 (397) T ss_pred HHHHHHHHHhccCCcHHHHHHHhhhhhhhccccccccCcccCchhHHHHHHHhhhhhhhHHhh-cceeeccCCceeEEEE Confidence 0000 01111111111110000 00 11122234566644333333333322222 2221122233467777 Q ss_pred eccccccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 70 KGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDINYVVARQGAEVVAPYLDNLRFATLA 148 (319) Q Consensus 70 ~i~~~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~~~~~~~~~~~vapeiD~~~~s~la 148 (319) +....+-..+...++-.++.-..++..+++...|.-.+ +. +..+-.+. ..++.+.+.+..+..++-.+|..++.- T Consensus 173 ~~~~~~~a~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~-~~-is~e~l~ds~~~l~~~i~~~l~~~~~~~~d~~il~G-- 248 (397) T protein:vir:12 173 KNADMVPFSPVEELGNLPEIDQPRFTKVSYSIIDYGGI-MT-LSNSMLNDSDQAIMTYVAKWFAKKSVVTRNNLILAA-- 248 (397) T ss_pred EecCCcceeeecccccccccccccceeEEeeheeeEee-eh-hhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhc-- Confidence 76554434333222222222223455555555554443 22 22222221 235566777788888888888865542 Q ss_pred hccCccccccCCHhHHHHHHHHHH-HHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEEE Q lcl|Aclame:pro 149 RNKAKHLTVGTGSDAQYDAVLDVS-VELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIV 227 (319) Q Consensus 149 ~~a~~~~~~~~T~~n~~~~i~~a~-~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~ 227 (319) .+.....+++ -|+.|.++. ..|+.. +-.+-.++++|..+..|.+-.+-.... 
..+....+|.-++|.|.+|+ T Consensus 249 --~g~~~~~g~~---~~~~i~~~~~~~l~~~-~~~~a~~~~n~~~~~~L~~lkd~~G~~-l~~~~~~~g~~~~l~G~pv~ 321 (397) T protein:vir:12 249 --IASLKKVDID---GLDGIKKALNVTLDPM-VAPGSIVLTNQDGYDWLDTLKDGTGRY-LLQPDPTNPTKKLLDGRPVV 321 (397) T ss_pred --cccccccccc---cHHHHHHHHhhccchh-hhCCCEEEEcHHHHHHHHHhhccCCce-eecccccCCCCccccceeeE Confidence 2222222221 144455444 344432 224556889999999886543211110 11223456767789999998 Q ss_pred Eecccc----cccceEEEEcCC-ceeeee-eeeeeeeecCCCC---CccceeeeeeeeeEEEeccccceEEEEccccc Q lcl|Aclame:pro 228 KVPTKL----LQGLQAIAVVGE-VLASPI-QADLAKTNSNIPG---MFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEV 296 (319) Q Consensus 228 ~vps~~----~~~~n~i~~~~~-A~~~~~-k~~~~~~~~~~~~---~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~ 296 (319) .+++.. .....+++|.-+ +..... +--.++..+.... .+...++...++|..+.+|+... .+.. +.+ T Consensus 322 ~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~-~~~~-t~~ 397 (397) T protein:vir:12 322 PFTNRVLKTQKGKAPLIIGNLKEAIVLFDREQQSIASTDTGAGAFETNSTKVRGIEREDVRKWDEDAVV-FGQI-TVE 397 (397) T ss_pred EecccccccCCCccEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceE-EEEE-eeC Confidence 765422 123447777644 444443 3223333321111 23468889999999999998732 2233 222 No 148 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=96.81 E-value=0.00032 Score=39.72 Aligned_cols=275 Identities=11% Similarity=-0.037 Sum_probs=131.4 Q ss_pred CCcccccccceeeehhhhhhhhhc-chhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeecccc---c- Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSV-EPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTT---E- 75 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~-~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~---g- 75 (319) |-+ --++....-. 
..-+|.+.++|+.-+++++..-- +-|-.--..|.+++++++..- + T Consensus 1 M~~-----------e~nl~~~~dL~~a~siDF~~~f~~~i~~L~~~LG------v~r~~pla~Gt~iktyK~~~~~y~gd 63 (303) T protein:vir:10 1 MSA-----------ENNLINVEALGKAKSIDFANKLGVGLNKLFEALA------IQNKIPMNVGSALKQYRFKVEDSEKP 63 (303) T ss_pred CCC-----------CcCCcchhhcccceeehhhhhhhhhHHHHHHHhh------hhccccccCCceeeeeeeeceeeccc Confidence 111 0111111111 45678888999766666644221 112222235778887776421 1 Q ss_pred cccccCCCCcccCCcccc-eeEEEEeecccceeecchhhHH-HHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCc Q lcl|Aclame:pro 76 LKDYKRNATNEFDHPKIE-ETTYFLDQEKYWGRFVDALDRK-DTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAK 153 (319) Q Consensus 76 ~~DY~r~~~~~~~~~t~t-~~tltidqdr~~~F~VD~~D~~-et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~ 153 (319) -+|--....+....++.+ ..+.++.-.||+. .+ .|++ |..+.-.+.....++....++-.+|...|+.+....++ T Consensus 64 a~dVaEGe~Iplskvt~~~~~t~~~~~kK~rK-~t--TdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT~t 140 (303) T protein:vir:10 64 NGDVAEGDVIPLTKVTREQVDITELQFAKYRK-ST--SAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAIEN 140 (303) T ss_pred cccccCCcccchhhheeeecceEEEEeecccc-cc--cHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhcccc Confidence 112222223333445443 2344555556555 33 4655 55566677788889999999999999999987665443 Q ss_pred cc---cccCCHhHHHHHHHHHHHHH---HhccCCCCcEEEEChHHHHHHhhhhhhhhc-ccccccceeeeeeeeecCeEE Q lcl|Aclame:pro 154 HL---TVGTGSDAQYDAVLDVSVEL---DEIKAPENRVLFVSPTFYKGIKKFVIALPQ-GDTRQQVLGKGVQGELDGFVI 226 (319) Q Consensus 154 ~~---~~~~T~~n~~~~i~~a~~~L---de~~VP~~R~l~VsP~~~~~L~~~~~f~~~-~~~~~~~~~~g~Vg~idG~~I 226 (319) .. +...+.+++-.++.....+| +|.. .+-++||+|.=.+-++.+...... 
...|-+.+.| +.|++| T Consensus 141 ~~~t~~t~~s~~glq~Al~~~~~kl~~~~ed~--~~~V~FvNP~Daa~yl~~A~i~~~~t~fG~n~L~n-----fLG~~I 213 (303) T protein:vir:10 141 GKRTNKTKLSAENLQGALSKGRANLSVLLDDE--ITPIAFVNPNDTAEYLANGFINSTGAQFGVNLLTP-----YVGVKI 213 (303) T ss_pred cccccceeecHHHHHHHHHhhhhhcccccccc--ccEEEEEchHHHHHHhhcCCcchhhhhhhhhhhhh-----hhcceE Confidence 22 22344555555555444443 4432 356999999877666666554422 3344444443 888998 Q ss_pred EEecccc--------cccceEEEEcCCc---eeee---eeeeeeeeecCCCCCccceeeeeeeeeEEEeccccceEEEEc Q lcl|Aclame:pro 227 VKVPTKL--------LQGLQAIAVVGEV---LASP---IQADLAKTNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFTIG 292 (319) Q Consensus 227 ~~vps~~--------~~~~n~i~~~~~A---~~~~---~k~~~~~~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~ 292 (319) +.++.-. ..++++--+.++. -.|- .+.--+-+.. ++...---++-+..-|...+---.+||.... T Consensus 214 I~S~kv~~G~~~~T~~~Ni~~ay~~~~g~l~~~f~~t~D~tglIGv~h-~~~~~~~t~eT~~~~~~~lfpE~~dgiv~~t 292 (303) T protein:vir:10 214 VEFADVPQGEVWMTVAENLNVAYANPRGELSRAFAFATDATGFVGVLH-DIQPQRLTSDTIYASAISMFPENIDAVIKVT 292 (303) T ss_pred EEeccCCCceEEEeeccceEEEEecCchhhhhhhhhccccccceEEEe-ccccceeeehhHhHhHHHhcccccceEEEEE Confidence 7643211 1223432222221 0000 0111111111 1111111122222233444444556776654 Q ss_pred cccccCCCCCc Q lcl|Aclame:pro 293 GTEVATKRDGV 303 (319) Q Consensus 293 ~~~~a~~~~~~ 303 (319) -++-...+-+. 
T Consensus 293 i~~~e~~~~~~ 303 (303) T protein:vir:10 293 IKKDEAGELPS 303 (303) T ss_pred EeccccCCCCC Confidence 32222222121 No 149 >protein:vir:93696 Length: 364 # NCBI annotation: Bcep22gp55 # Family: family:all:974 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944284;genbank:gi:38640361;genbank:GeneID:2658350 Probab=96.75 E-value=0.00036 Score=39.45 Aligned_cols=271 Identities=11% Similarity=0.045 Sum_probs=131.8 Q ss_pred CCcccccccceeeehhhhhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhh-cc---------cCcceeeeCCceEEeee Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTP-AL---------ISNDAIFMEGRSFTVMK 70 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~-~~---------~n~~~~~~~g~tVkIp~ 70 (319) |-+|.-.+ -.|+ -.++|...|.....+.+.-.. +. .-+++..+.|++|+++= T Consensus 1 Ma~T~~~~---------------~~p~---a~~~ws~~l~~~~~~~s~f~~~l~G~~~~~~I~~~~dL~k~~Gd~v~f~L 62 (364) T protein:vir:93 1 MSQTVIPF---------------GDPK---AVKRWSADLAVDVRKKSYFEQRFIGTSENAVIQRKTELESDAGDRITFDL 62 (364) T ss_pred CceeccCc---------------CCHH---HHHHHHHHHHHHHHhhCccccccccCCCCCcEEEeeecCCCCCceEEeee Confidence 43332211 1222 123444333333333332221 11 11334556799999875 Q ss_pred ccc----cccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 71 GDT----TELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFAT 146 (319) Q Consensus 71 i~~----~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~ 146 (319) +.- ...+|.... ...++++..+.+++|||-|---..=..|+...+.. ++.....+....-+....|+-.|-. 
T Consensus 63 ~~~L~g~gv~Gd~~le--Gnee~L~~~~~~i~idq~r~~V~~~g~ms~qRt~~--dlr~~ar~~L~~w~~~~~d~~~f~~ 138 (364) T protein:vir:93 63 SVHLRGKPTYGDARVE--GKEESLRFYQDEVRIDQVRHSVSAGGRMSRKRTVH--NIRRIARDRLGDYFYKFTDELLFIY 138 (364) T ss_pred eeecccCCcccCceee--ccccceeEEeeEEEEeeccccccccCchhhhhhHH--HHHHHHHHHHHHHHHHHHHHHHHHH Confidence 432 224454322 23467788899999999874211113466666553 4455555666666667777777777 Q ss_pred HHhccCc--------------------------------cccccCCHhH--HHHHHHHHHHHHHhccCC--C-------- Q lcl|Aclame:pro 147 LARNKAK--------------------------------HLTVGTGSDA--QYDAVLDVSVELDEIKAP--E-------- 182 (319) Q Consensus 147 la~~a~~--------------------------------~~~~~~T~~n--~~~~i~~a~~~Lde~~VP--~-------- 182 (319) |+...+. +....+++++ -++.|+.+...++..+.+ . T Consensus 139 laGarg~~~~~~~~~~~~~~~~N~v~aPt~~r~~~~~~at~~~~l~stD~~sl~~id~a~~~a~~~~~~~~~~~~~~Pv~ 218 (364) T protein:vir:93 139 LSGARGINLDFIETPDFTGYAGNPLDAPDVDHLLYGGVATSKASLAATDIMAPLVIEKAVEKAAMMQAENPDVANMVPVS 218 (364) T ss_pred hhcccccccccccccCcccccccccCCCCCCcEEeccccCchhhccccccccHHHHHHHHHHHHHhCCCCCCCcccceeE Confidence 7652211 1112233333 378888888887765432 1 Q ss_pred ----C-cEEEEChHHHHHHhhhh-----hhhhcc--ccc-ccceeeeeeeeecCeEEEEeccc-----c--ccc----ce Q lcl|Aclame:pro 183 ----N-RVLFVSPTFYKGIKKFV-----IALPQG--DTR-QQVLGKGVQGELDGFVIVKVPTK-----L--LQG----LQ 238 (319) Q Consensus 183 ----~-R~l~VsP~~~~~L~~~~-----~f~~~~--~~~-~~~~~~g~Vg~idG~~I~~vps~-----~--~~~----~n 238 (319) . -++++.|..+.-|+.+. ++.+.. ..+ ++.+..|.+|+++|+.|++-+.. . ..+ .. 
T Consensus 219 ~~g~~~yV~~l~p~q~~~Lr~~t~~~w~d~qk~A~~~~g~~nPlF~G~~gm~ngvii~~~~~vi~~~~~~~~~~v~~~ra 298 (364) T protein:vir:93 219 IDGDDHYVCVMSEYQATDMRTAAGGTWIDFQKAAAAAEGRNNPIFKGGLGMINNVVLHKHRNVIRFNDYGAGANVEAARA 298 (364) T ss_pred ecCcceeEEEEcchhhhhhhhcCCHHHHHHHHHhhhcccccCCceecCeeeEcCeEEeccCCcccccccccCccccchhh Confidence 1 27889999999998643 355532 222 35789999999999999874321 1 111 23 Q ss_pred EEEEcCCceeeeeeeeeeeee------cC--CCCCccceeeee---eeeeEEEeccccceEEEEccccccCCCCCccccc Q lcl|Aclame:pro 239 AIAVVGEVLASPIQADLAKTN------SN--IPGMFGTLAEQL---LYTGAFVPEHLQKYIFTIGGTEVATKRDGVDAHA 307 (319) Q Consensus 239 ~i~~~~~A~~~~~k~~~~~~~------~~--~~~~~~~~v~gr---~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~~~~~~ 307 (319) +++|...+..+.-+..-.+.+ +. ......+.+.|. +| +.+--|+.+--..++ +|+ T Consensus 299 lllGaQA~~~a~g~~~g~~~~w~Ee~~D~gn~~~i~~~~i~G~kK~rF------~~~DfGvi~idtaa~--------~~~ 364 (364) T protein:vir:93 299 LFMGRQAGVIAYGTANGLRFDWEETVKDYGNEPAIAAGFIAGMKKARF------NNKDFGVISIDTAAK--------KHS 364 (364) T ss_pred heecceeeEEEeecCCCCCceeeecccCCCCchhhhhhhHhhhhhccc------CCccceEEEeccccc--------ccC Confidence 555554444443332222211 10 000111111110 11 122223322211111 122 No 150 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=96.73 E-value=0.00038 Score=39.33 Aligned_cols=273 Identities=10% Similarity=0.057 Sum_probs=108.9 Q ss_pred CCccc-ccccc-------eee-------ehhhhh----------hhh--hcchhhhhhhHhhHHHHHHHHHhhhhhhhcc Q lcl|Aclame:pro 1 MNKTI-KNATG-------MLK-------LNLQHF----------ANK--SVEPGQTLLKNKHVGILERVTAVNAYSTPAL 53 (319) Q Consensus 1 ~~~~~-~~~~~-------~~~-------~~~~~~----------~~~--~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~ 53 (319) ..... +.... .+. ...+.| ... -........++.+...+-+.....++. .. 
T Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~~l~--~~ 164 (397) T protein:vir:96 87 AADPTDQKPKDGEKRKMKKFKVTEEELAEKRSAINAFVKSKGAEKRDGFTSVEGGALIPQELLQPQLEPKDIVDLS--KY 164 (397) T ss_pred hhhhhhhhhHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHhhhhhhhhcccccccccchhHHHHHHHHHhhhhhhHH--Hh Confidence 00000 00000 000 000000 000 001111112222221121111111111 01 Q ss_pred cCcceeeeCCceEEeeec--cccccccccCCCCcccC--CcccceeEEEEeecccceeecchhhHHHHhh-hHHHHHHHH Q lcl|Aclame:pro 54 ISNDAIFMEGRSFTVMKG--DTTELKDYKRNATNEFD--HPKIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDINYVVA 128 (319) Q Consensus 54 ~n~~~~~~~g~tVkIp~i--~~~g~~DY~r~~~~~~~--~~t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~~~~~ 128 (319) ++ ..-..+...++|.. +....+-.. .++-.+. .++....++++ .+...+ +. +..+-.+. ..++...+. T Consensus 165 ~~--~~~~~~~~~~~~~~~~~~~~~~~~~-E~~~~~~~~~~~~~~i~~~~--~~~~~~-~~-~s~ell~ds~~~l~~~i~ 237 (397) T protein:vir:96 165 VR--SVPVNSASGKFPVISKSGSKMATVQ-QLEKNPQLANPKMVEIDYSV--ATRRGY-IP-ISQEMIDDASYDVTGLIA 237 (397) T ss_pred hh--hccccccceeEEEEeccCCcccccc-ccccccccccccccceeecH--hHhhcc-hh-hHHHHHhhhHHHHHHHHH Confidence 11 11123334444433 333333222 1222221 23333334433 332221 11 11111111 123344555 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCccc-cccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhccc Q lcl|Aclame:pro 129 RQGAEVVAPYLDNLRFATLARNKAKHL-TVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGD 207 (319) Q Consensus 129 ~~~~~~vapeiD~~~~s~la~~a~~~~-~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~ 207 (319) +..+..++-..|..++. +.+... +..++ |+.|.++...+... . .+-.++++|..+..|.+-.+-.... 
T Consensus 238 ~~l~~~~~~~~~~~i~~----g~g~~~~~~~~~----~d~~~~~~~~~~~~-~-~~a~~v~n~~~~~~l~~lkd~~G~~- 306 (397) T protein:vir:96 238 DEIQDQSLNTKNADIAA----VLKTATAKSVVG----VDGLKDLINKEIKK-V-YDVKLFISASMYSELDKLKDKNGRY- 306 (397) T ss_pred HHHHHHHHHHHHHHHhh----cccccccccccc----hHHHHHHHHHhhhh-h-cCcEEEEcHHHHHHHHHhhccCCCe- Confidence 66666777666665443 222222 22223 44444444332221 1 2456899999999887643211100 Q ss_pred ccccceeeeeeeeecCeEEEEeccc----ccccceEEEEcCC-ceeeeeeeeeeeeecCCCCCccceeeeeeeeeEEEec Q lcl|Aclame:pro 208 TRQQVLGKGVQGELDGFVIVKVPTK----LLQGLQAIAVVGE-VLASPIQADLAKTNSNIPGMFGTLAEQLLYTGAFVPE 282 (319) Q Consensus 208 ~~~~~~~~g~Vg~idG~~I~~vps~----~~~~~n~i~~~~~-A~~~~~k~~~~~~~~~~~~~~~~~v~gr~~yg~~V~~ 282 (319) .......+|.-++|.|.||+.+++. ...+..+++|.-+ +...... ..+.+-...+..|...+++-..+|..|.+ T Consensus 307 ~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~r~d~~~~~ 385 (397) T protein:vir:96 307 LLQDSITAASGKQLLGKEVVVLDDDVIGKSVGNVVGFIGDAKAFASFFDR-KQVSVSWVDNNIYGQLLAGIIRYDVKATD 385 (397) T ss_pred EeccCccCCCcccccccceEEecccccCCCCCceEEEEeehhcceEeEee-cceEEEEecccccceeEEEEEEEccEEec Confidence 1122234555578999999976542 2233456777655 3333332 22333322456677788899999999999 Q ss_pred cccceEEEEcccc Q lcl|Aclame:pro 283 HLQKYIFTIGGTE 295 (319) Q Consensus 283 ~k~~~Iy~~~~~~ 295 (319) |+.. 
+.+...++ T Consensus 386 ~~a~-~~~~~~~a 397 (397) T protein:vir:96 386 KKAG-FYVTFTIG 397 (397) T ss_pred ccce-EEEEeecC Confidence 9983 22333222 No 151 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=96.72 E-value=0.00038 Score=39.32 Aligned_cols=290 Identities=11% Similarity=-0.009 Sum_probs=137.7 Q ss_pred eeeehhh-----hhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeecccc-ccc-cccCCC Q lcl|Aclame:pro 11 MLKLNLQ-----HFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTT-ELK-DYKRNA 83 (319) Q Consensus 11 ~~~~~~~-----~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~-g~~-DY~r~~ 83 (319) |=+.|-+ +|.+..-.+|..+...+ ..+++....++.....+. ....+|+.|.||-+... |-. +|..+. T Consensus 1 M~~~~~~T~l~Dii~pEvF~~Yv~~~~~e----~~~l~qSGiv~~d~~l~~-~~~~gG~~v~iPf~~~L~g~~~n~~~d~ 75 (367) T protein:vir:80 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPE----LTAFFLSGAVASNDFLSQ-FLSAPGRLINIPFWRDLDSLEPNYGSDN 75 (367) T ss_pred CcchhhhhhhhhccchhhhhHHHhhhhhh----hhhhhhcceeecCHHHHH-HhhcCCCEEEeeeeccCCCCccccCCCC Confidence 5554433 68777777776665543 333333222222111111 12468999999999763 433 464433 Q ss_pred C---cccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC-------- Q lcl|Aclame:pro 84 T---NEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKA-------- 152 (319) Q Consensus 84 ~---~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~-------- 152 (319) + .+++.++...+. -.-+.|...|...|+-..-+. 
-+++...+.+-+.--.....+.+++.|.+--+ T Consensus 76 ~~~~~t~~kittg~~~-a~v~~r~kaw~~~Dla~~lsG--~dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~~~a~~~~ 152 (367) T protein:vir:80 76 PNVEAPIDGLGSGEMK-TTKTWLNKAYGAMDLTAELAG--SNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFA 152 (367) T ss_pred Ccccccccccccchhe-eeeehhcccchhhhHHHHhhC--chHHHHHHHHHHHHhhhhhHHHHHHHHHHhhccccccchh Confidence 2 344554443332 233456667778887766663 24443333332222233333444554432110 Q ss_pred ----------------ccccccCC-----HhH--HHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhh--hhhccc Q lcl|Aclame:pro 153 ----------------KHLTVGTG-----SDA--QYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVI--ALPQGD 207 (319) Q Consensus 153 ----------------~~~~~~~T-----~~n--~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~--f~~~~~ 207 (319) ...+..++ +++ -.+.+.+|...|.++.- .=..++|.|.++..|++..- |.+..+ T Consensus 153 ~~~~~~~~~a~~~~~~~~~~~Dis~~t~~~~~~~s~~~~~~A~~~lGD~~~-~l~~i~mHS~V~~~L~~~~li~~i~~sd 231 (367) T protein:vir:80 153 TIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVG-SIAAIAVHSMVYKRMTNNDEIEFIPDSK 231 (367) T ss_pred hhhhhhccccccccccCceeeeeeccCCCccceecHHHHHHHHHHhccccc-cccEEEEchHHHHHHHhccccccccCCC Confidence 00001111 122 26678889888877421 23578999999999987642 232221 Q ss_pred ccccceeeeeeeeecCeEEEEe---cccc---cccceEEEEcCCceeeeeee--eeeeeecCCCCCcc---ceeeeeeee Q lcl|Aclame:pro 208 TRQQVLGKGVQGELDGFVIVKV---PTKL---LQGLQAIAVVGEVLASPIQA--DLAKTNSNIPGMFG---TLAEQLLYT 276 (319) Q Consensus 208 ~~~~~~~~g~Vg~idG~~I~~v---ps~~---~~~~n~i~~~~~A~~~~~k~--~~~~~~~~~~~~~~---~~v~gr~~y 276 (319) -+..|+.+.|.+|+.- |-.. .+.+-..+.-++|+.+-..- .-+|..|++-...+ +.+-.|+. 
T Consensus 232 ------~~~~i~ty~G~~VIvDD~~Pv~~~~a~~~yttYlfg~GAi~~~~~~~~~~~E~~Rd~~~~~~gG~d~L~~Rr~- 304 (367) T protein:vir:80 232 ------GQLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGAAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKE- 304 (367) T ss_pred ------CccccceecceeEEEeCCCcccccCCCceEEEEEEecceeeecccCCccceecccchhhhcCCceEEEEeeee- Confidence 1456888889888852 2111 12344455556777655333 22466554322222 33444433 Q ss_pred eEEEeccccceEEEEcccc-cc-----CCCCCcccccccccc-ccccccC Q lcl|Aclame:pro 277 GAFVPEHLQKYIFTIGGTE-VA-----TKRDGVDAHADNVAK-PSGSLEM 319 (319) Q Consensus 277 g~~V~~~k~~~Iy~~~~~~-~a-----~~~~~~~~~~~~~~~-~~~~~~~ 319 (319) +++-|.... |...... |. ........+.+.+-+ ...+-|. T Consensus 305 --~~~hP~G~s-~~~~~v~~~~~~~~~~~~~~~~~sPt~~eLa~~~NW~~ 351 (367) T protein:vir:80 305 --WIVHPGGFN-WLDADVTIPDNTGSPSGITSGPPAITLANLANPDNWER 351 (367) T ss_pred --EEeecceee-ecccccccccccccccccccccCCCChHHhcCCccccc Confidence 666666421 2222111 10 011112222333322 2222222 No 152 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=96.70 E-value=0.0004 Score=39.21 Aligned_cols=284 Identities=6% Similarity=-0.127 Sum_probs=126.5 Q ss_pred cccccceeeehhhhhhhhhcc--hhhhhhhHhh-HHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccccC Q lcl|Aclame:pro 5 IKNATGMLKLNLQHFANKSVE--PGQTLLKNKH-VGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKR 81 (319) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~--~n~~~l~~ky-~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~r 81 (319) ++. | =.|+.+..+..-.. ....-+++.+ ..+++.+.....+.. ++. .+-..+.+++||.....+-.... 
T Consensus 1 ~~~--~-~~~~~e~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~--~~~--~~~~~~~~~~ip~~~~~~~a~~v- 72 (318) T protein:vir:24 1 MAA--G-TAFAVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQ--FAQ--KVPMGTTGQKIPHWVGDVSAQWI- 72 (318) T ss_pred CCC--C-CCCCHHHHHhhcccCcccceeechhHHHHHHHHHHhhchhhh--hcc--eeeccCCceEEEEEeCCcceEEe- Confidence 111 1 12222222211111 1111233333 334444433333321 222 34456778999988764433322 Q ss_pred CCCcccCCcccceeEEEEeecccceeecchhhHHHHh-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCc------- Q lcl|Aclame:pro 82 NATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTE-GNIDINYVVARQGAEVVAPYLDNLRFATLARNKAK------- 153 (319) Q Consensus 82 ~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~-~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~------- 153 (319) +.+-....-+.++...+++-.|.-.+. . +..+-.. ...++...+.+.....++-.+|.-.+.---...+. T Consensus 73 ~Eg~~~~~~~~~f~~i~~~~~k~~~~~-~-iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~~~~~~~ 150 (318) T protein:vir:24 73 GEGDMKPITKGNMTSQTIAPHKIATIF-V-ASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTYIGQTTK 150 (318) T ss_pred cCCccccccccceeEEEEeeEEEEEee-h-hhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCcccccccc Confidence 222222223344555555555543322 2 2221111 12456677778888888888888765311000000 Q ss_pred --cccc-cCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhh----cccccccceeeeeeeeecCeEE Q lcl|Aclame:pro 154 --HLTV-GTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALP----QGDTRQQVLGKGVQGELDGFVI 226 (319) Q Consensus 154 --~~~~-~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~----~~~~~~~~~~~g~Vg~idG~~I 226 (319) .... ..+.....+.+.++...+.....+ ...++++|..+..|++-.+-.. 
.............-+.+.|+++ T Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv 229 (318) T protein:vir:24 151 AISIADTTGATTVYDQVAVNGLSLLVNDGKK-WTHTLLDDITEPILNGAKDQNGRPLFIESTYGEAASPFRSGRIVARPT 229 (318) T ss_pred cccccccccccchHHHHHHHHHHhhccccCC-CCEEEEcHHHHHHHHHhhccCCceeecCccccCccccccCceEEEEee Confidence 0000 111223344455666565554333 4467999999998875332110 0011111111111135778888 Q ss_pred EEecccccccceEEEEcCCceeeee-eeeeeeeecC-------CC--------CCccceeeeeeeeeEEEeccccceEEE Q lcl|Aclame:pro 227 VKVPTKLLQGLQAIAVVGEVLASPI-QADLAKTNSN-------IP--------GMFGTLAEQLLYTGAFVPEHLQKYIFT 290 (319) Q Consensus 227 ~~vps~~~~~~n~i~~~~~A~~~~~-k~~~~~~~~~-------~~--------~~~~~~v~gr~~yg~~V~~~k~~~Iy~ 290 (319) +.+++....+.-++++..+...... +--.+++.+. .+ .++.-.++...++|..|.+|++... T Consensus 230 ~~~~~~~~~~~~~~~gdfs~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~-- 307 (318) T protein:vir:24 230 ILSDHVVEGTTVGFMGDFSQLIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVA-- 307 (318) T ss_pred EEeCCCCCCccEEEEeecceEEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEecccceEE-- Confidence 7655443333344555544333222 2122332221 00 1133578899999999999988433 Q ss_pred EccccccCCCCC Q lcl|Aclame:pro 291 IGGTEVATKRDG 302 (319) Q Consensus 291 ~~~~~~a~~~~~ 302 (319) ....+++...| T Consensus 308 -i~~~~a~~~~~ 318 (318) T protein:vir:24 308 -LTNVVSGGGEG 318 (318) T ss_pred -EEeeccCCCCC Confidence 22223333333 No 153 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=96.66 E-value=0.00043 Score=39.04 Aligned_cols=284 Identities=11% Similarity=0.041 Sum_probs=125.7 Q ss_pred CCcccccccceeeehhhhhhh--------hhc---chh--hhhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCceE Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFAN--------KSV---EPG--QTLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSF 66 (319) Q Consensus 1 
~~~~~~~~~~~~~~~~~~~~~--------~~~---~~n--~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~tV 66 (319) +-+.+..+.|-+......+.. +.. .+. -.-.++.+. .+++.+.....+.. +..+. .-.....+ T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~l~~~~~i~~--~~~~~-~~~~~~~~ 177 (435) T protein:vir:14 101 MVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRK--LGART-LPLSNGNI 177 (435) T ss_pred HHHHHHhhcchhhHHHHHHHhhhhhhhhhhhcccCCcCCCccccchhHHHHHHHHHhhhchhhh--hccee-eecCCCce Confidence 111111222222111111110 000 000 011233332 23333322222211 11122 22334479 Q ss_pred EeeeccccccccccCCC-CcccCCcccceeEEEEeecccceeecchhhH--HHHhhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 67 TVMKGDTTELKDYKRNA-TNEFDHPKIEETTYFLDQEKYWGRFVDALDR--KDTEGNIDINYVVARQGAEVVAPYLDNLR 143 (319) Q Consensus 67 kIp~i~~~g~~DY~r~~-~~~~~~~t~t~~tltidqdr~~~F~VD~~D~--~et~~~~~~~~~~~~~~~~~vapeiD~~~ 143 (319) +||.....+-..+...+ .....++ ++...+++..+.-.+. .--+. +++.....+.+.+.......+.-.+|..+ T Consensus 178 ~~p~~~~~~~a~~v~E~~~~~~~~~--~f~~i~~~~~k~~~~~-~iS~ell~ds~~~~~l~~~i~~~l~~ai~~~~d~a~ 254 (435) T protein:vir:14 178 TIPRLKGGAIVGYIGADTDIPTTQQ--QFDDLKLTAKKMAALV-PIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAF 254 (435) T ss_pred EEEEEeCCcceeeeccCcccccccc--ceeEEEeeeEEEEEee-hhhHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHHh Confidence 99998665444443222 2222334 4444455545444432 21111 11211223556667778888888888866 Q ss_pred HHHH-HhccC------------ccccccCCHhHHHHHHHHHHHHHHhccCC-CCcEEEEChHHHHHHhhhhhhhhccccc Q lcl|Aclame:pro 144 FATL-ARNKA------------KHLTVGTGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKFVIALPQGDTR 209 (319) Q Consensus 144 ~s~l-a~~a~------------~~~~~~~T~~n~~~~i~~a~~~Lde~~VP-~~R~l~VsP~~~~~L~~~~~f~~~~~~~ 209 (319) +.-- ..+.. 
...+.+.+.+.++..+.++...+..+..- .+..++++|..+..|.+-.+- .| T Consensus 255 l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~-----~G 329 (435) T protein:vir:14 255 IRDDGTANTPKGLRFWALPSNVITASDASTLQKIETDLGKVILALENADANLTQPGWIMAPRTFRFLEGLRDG-----NG 329 (435) T ss_pred hccCCCCccccceeecccccceeccccccchhhHHHHHHHHHHHhhhccccccCCEEEEcHHHHHHHHHhhcc-----CC Confidence 5210 00000 01112234556677778887777765433 466789999999888754321 12 Q ss_pred ccceeeeeeeeecCeEEEEecc---ccc---ccceEEEEcCCceeeeeeeeeeeeecCCCC--------------Cccce Q lcl|Aclame:pro 210 QQVLGKGVQGELDGFVIVKVPT---KLL---QGLQAIAVVGEVLASPIQADLAKTNSNIPG--------------MFGTL 269 (319) Q Consensus 210 ~~~~~~g~Vg~idG~~I~~vps---~~~---~~~n~i~~~~~A~~~~~k~~~~~~~~~~~~--------------~~~~~ 269 (319) .-......=|+|.|.||+.++. +.. +...+++|.-+-...... ..+++...++. .+.-. T Consensus 330 ~~l~~~~~~g~l~G~Pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~-~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~ 408 (435) T protein:vir:14 330 NKVYPELANGMLKGYPVGKTTQVPINLGETGKESEIYFTDFGDVFIGEE-ETLEIDYSKEATYKDADGHMVSAFQRDQTL 408 (435) T ss_pred ceeccCCCCCeeecceeEeeccccccccCCCccceEEEeecccEEEEEe-cccEEEEeccccccccccchhhhhhcChhh Confidence 1111111124689999987532 211 222466665544332211 22222211111 12357 Q ss_pred eeeeeeeeEEEeccccceEEEEccccccC Q lcl|Aclame:pro 270 AEQLLYTGAFVPEHLQKYIFTIGGTEVAT 298 (319) Q Consensus 270 v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~ 298 (319) ++...++|..|.+|++..+.... 
+-.+ T Consensus 409 ~r~~~r~d~~~~~~~a~~~l~~~--~~~~ 435 (435) T protein:vir:14 409 IRVIAKNDFGPRHVESIAVLAGV--AWGA 435 (435) T ss_pred eeeeeeeCceeecccceEEEecC--CCCC Confidence 78899999999999974332211 1111 No 154 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=96.65 E-value=0.00044 Score=38.98 Aligned_cols=282 Identities=11% Similarity=0.065 Sum_probs=122.2 Q ss_pred CCcccccccceeeehhhhhhhhh-----------cch--hhhhhhHhhHH-HHHHHHHhhhhhhhcccCcceeeeCCceE Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKS-----------VEP--GQTLLKNKHVG-ILERVTAVNAYSTPALISNDAIFMEGRSF 66 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~-----------~~~--n~~~l~~ky~~-lld~~~~~~sl~~~~~~n~~~~~~~g~tV 66 (319) +-+.+....|.+......+.... ..+ .-.-.++.+.. +++.+.....+.. +..+ .+-.....+ T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lvP~~~~~~ii~~l~~~~~i~~--~~~~-~v~~~~~~~ 177 (435) T protein:vir:80 101 MVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRK--LGAR-TLPLSNGNI 177 (435) T ss_pred HHHHHHhccchhHHHHHHHHhhhhhhhhhhhhcccCCCCCccccchhHHHHHHHHHhhhchhhh--ccce-eeecCCCce Confidence 00111111111111100000000 000 00012233322 2222222121111 1111 122334468 Q ss_pred EeeeccccccccccCCC-CcccCCcccceeEEEEeecccceeecchhhH--HHHhhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 67 TVMKGDTTELKDYKRNA-TNEFDHPKIEETTYFLDQEKYWGRFVDALDR--KDTEGNIDINYVVARQGAEVVAPYLDNLR 143 (319) Q Consensus 67 kIp~i~~~g~~DY~r~~-~~~~~~~t~t~~tltidqdr~~~F~VD~~D~--~et~~~~~~~~~~~~~~~~~vapeiD~~~ 143 (319) +||......-..+...+ .....+++.+ ..+++-.+...+. .--+. 
+++.....+...+.+.....++-.+|..+ T Consensus 178 ~~p~~~~~~~a~~v~E~~~~~~~~~~f~--~i~~~~~k~~~~~-~is~ell~ds~~~~~l~~~i~~~l~~a~~~~~d~a~ 254 (435) T protein:vir:80 178 TIPRLKGGAIVGYIGADTDIPTTQQQFD--DLKLTAKKMAALV-PIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAF 254 (435) T ss_pred EEEEEeCCcceeeeccCcccccccccee--eEEEeeEEEEEee-hhhHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHh Confidence 99988654444333222 2233334444 4444444433322 21111 11211223456677888888888888866 Q ss_pred HHHHHhccCc---------------cccccCCHhHHHHHHHHHHHHHHhccCC-CCcEEEEChHHHHHHhhhhhhhhccc Q lcl|Aclame:pro 144 FATLARNKAK---------------HLTVGTGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKFVIALPQGD 207 (319) Q Consensus 144 ~s~la~~a~~---------------~~~~~~T~~n~~~~i~~a~~~Lde~~VP-~~R~l~VsP~~~~~L~~~~~f~~~~~ 207 (319) +.- .+.+. ..+.+.+...++..+.++...|..+... .+-.++++|..+..|.+-.+-. T Consensus 255 l~G--~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~---- 328 (435) T protein:vir:80 255 IRD--DGTANTPKGLRFWALPGNVITASDGSTLQKIETDLGKAILALENADANLTQPGWIMAPRTFRFLEGLRDGN---- 328 (435) T ss_pred hcc--CCCCCcccceeecccccceeecccccchhhHHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHhhhccC---- Confidence 531 11100 0011123445566677777777766544 5667789999998886543211 Q ss_pred ccccceeeeeeeeecCeEEEEecc---ccc---ccceEEEEcCCceeeeeeeeeeeeecCCCC--------------Ccc Q lcl|Aclame:pro 208 TRQQVLGKGVQGELDGFVIVKVPT---KLL---QGLQAIAVVGEVLASPIQADLAKTNSNIPG--------------MFG 267 (319) Q Consensus 208 ~~~~~~~~g~Vg~idG~~I~~vps---~~~---~~~n~i~~~~~A~~~~~k~~~~~~~~~~~~--------------~~~ 267 (319) |........=++|.|.+|+.++. ... ....+++|+.+-.... ....+++-...+. ++. 
T Consensus 329 -G~~l~~~~~~~~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~-~~~~~~i~~~~~~~~~~~~~~~~~~f~~n~ 406 (435) T protein:vir:80 329 -GNKVYPELANGMLKGYPVGKTTQVPINLGEAGKESEIYFTDFGDVFIG-EEETLEIDYSKEATYKDADGHMVSAFQRDQ 406 (435) T ss_pred -CceeccCCCCCeEeeeeeEEeccccccccCCCCcceEEEEEcccEEEE-eecceEEEEeccccccccccchhhhhhcCc Confidence 11111100113689999986432 111 1123555655433222 1122222211121 123 Q ss_pred ceeeeeeeeeEEEeccccceEEEEccccccC Q lcl|Aclame:pro 268 TLAEQLLYTGAFVPEHLQKYIFTIGGTEVAT 298 (319) Q Consensus 268 ~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~ 298 (319) ..++...++|..|.+|+...+...+. -.+ T Consensus 407 ~~~r~~~r~d~~~~~~~a~~~l~~~~--~~~ 435 (435) T protein:vir:80 407 TLIRVIAKNDFGPRHVESIAVLSGVA--WGA 435 (435) T ss_pred ceeeeeeeeCcEeecccceEEEeccC--CCC Confidence 57789999999999999854433322 111 No 155 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=96.63 E-value=0.00045 Score=38.91 Aligned_cols=274 Identities=13% Similarity=0.046 Sum_probs=112.2 Q ss_pred CCcccccc-----cceee---------ehhhhhhhhhc--chhhhhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCC Q lcl|Aclame:pro 1 MNKTIKNA-----TGMLK---------LNLQHFANKSV--EPGQTLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEG 63 (319) Q Consensus 1 ~~~~~~~~-----~~~~~---------~~~~~~~~~~~--~~n~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g 63 (319) -.+.++.. .+++. .+...-|.+.. .-.-.-.++.+. .+++.+.....+.. 
+++ +.-.++ T Consensus 49 ~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~al~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~--~~~--v~~~~~ 124 (352) T protein:vir:78 49 NEKLVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLRE--KAR--LTNIKG 124 (352) T ss_pred hhhHHHHHHHHHHHHhhhhHHHHHHhhHHHHHHHhccCCCCCCceeccHhHHHHHHHHHHhhcchhh--hee--eEecCC Confidence 00000000 00110 01111111110 001112344443 34444333333211 111 122233 Q ss_pred ceEEeeecccc-ccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhh-hHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 64 RSFTVMKGDTT-ELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDINYVVARQGAEVVAPYLDN 141 (319) Q Consensus 64 ~tVkIp~i~~~-g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~~~~~~~~~~~vapeiD~ 141 (319) .++|.+... +-..+. ..+-.....+.++..+++.-.|+-.+ |. +..+-... ..++...+.+..+..+.-..+. T Consensus 125 --~~~p~~~~~~~~a~~v-~E~~~~~~~~~~f~~v~~~~~k~~~~-i~-is~ell~Ds~~~l~~~i~~~la~~~~~~e~~ 199 (352) T protein:vir:78 125 --LEIPRVSYTLDDDDFI-TDVETAKELKLKGDTVKFTTNKFKVF-AA-ISDTVIHGSDVDLVNWVENALQSGLAAKERK 199 (352) T ss_pred --ceEEEEecCCCccccc-ccccccccccccceeeeecceeEEee-ch-hhHHHHhhhhHHHHHHHHHHHHHHHHHHHHH Confidence 456766433 222222 22222222344555566666665553 22 22222221 1234444455555555433233 Q ss_pred HHHHHHHhccCcc---------ccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccc Q lcl|Aclame:pro 142 LRFATLARNKAKH---------LTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQV 212 (319) Q Consensus 142 ~~~s~la~~a~~~---------~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~ 212 (319) ..|. .+.+.. ....++.++.|+.|+++...|+..... +-.++|+|..+..|++-.+ + ++.. 
T Consensus 200 ~~~~---~g~g~~~~~g~l~~~~~~~~t~~~~~d~i~~~~~~l~~~~~~-~a~~~mn~~t~~~l~~~~~-----~-~~~~ 269 (352) T protein:vir:78 200 DALA---VSPKSGLEHMSFYNGSVKEVEGANMYDAIINALADLHEDYRD-NATIYMRYADYVKIISVLS-----N-GTTN 269 (352) T ss_pred hhhh---cCCCCcccccceeccccccccccchHHHHHHHHhccChhhhc-CCEEEEehHHHHHHHHHHh-----c-cCCc Confidence 3332 111111 112335566799999999888775433 4456777766544433211 0 1122 Q ss_pred eeeeeeeeecCeEEEEecccccccceEEEEcCCceeeeeeeeeee--eecCCCCCccceeeeeeeeeEEEeccccceEEE Q lcl|Aclame:pro 213 LGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQADLAK--TNSNIPGMFGTLAEQLLYTGAFVPEHLQKYIFT 290 (319) Q Consensus 213 ~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~~~k~~~~~--~~~~~~~~~~~~v~gr~~yg~~V~~~k~~~Iy~ 290 (319) +..|.=.+|.|.||+.+.. ++. +++|.-+-... -.+.+. .++. ....--.++++.++|+.|++|++.-+ + T Consensus 270 ~~~~~~~~llG~PV~~~~~--~~~--~~~Gdf~~~~~--~~~~~~~~~~~~-~~~g~~~f~~~~r~Dg~~~~~eA~~~-l 341 (352) T protein:vir:78 270 FFDTPAEKVFGKPVVFTDA--AVK--PIVGDFNYFGI--NYDGTTYDTDKD-VKKGEYLFVLTAWYDQQRTLDSAFRI-A 341 (352) T ss_pred ccccCCccccccceEEecC--CCc--eeEeehhhhhh--hhhhheeeeecc-ccCCeeEEEEEeeeCceeechhheEE-E Confidence 2333334688999997542 332 33443321111 111121 1111 11222467778899999999998433 3 Q ss_pred EccccccCCCC Q lcl|Aclame:pro 291 IGGTEVATKRD 301 (319) Q Consensus 291 ~~~~~~a~~~~ 301 (319) ...+++.+.|+ T Consensus 342 ~~~a~~~~~~~ 352 (352) T protein:vir:78 342 KAKESTGSLPS 352 (352) T ss_pred EeecccCCCCC Confidence 33333333333 No 156 >protein:vir:819 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050552;genbank:gi:9633449;genbank:GeneID:1262254 Probab=96.60 E-value=0.00048 Score=38.77 Aligned_cols=278 Identities=13% Similarity=0.064 Sum_probs=133.9 Q ss_pred CCcccccccceeeehhhhhhhhh-cchhhhhhhHhhHHHHHHHHHhhh---------hhhhcccCcceeeeCCceEEeee Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKS-VEPGQTLLKNKHVGILERVTAVNA---------YSTPALISNDAIFMEGRSFTVMK 70 (319) Q Consensus 1 
~~~~~~~~~~~~~~~~~~~~~~~-~~~n~~~l~~ky~~lld~~~~~~s---------l~~~~~~n~~~~~~~g~tVkIp~ 70 (319) |- ++-.+ .-|-+|+-|.=- ...|+- ..++|.+.+..-...++ ...+-..-+++....|++|+++= T Consensus 1 ~~-~~~~~---~a~~~~~~~lft~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L 75 (404) T protein:vir:81 1 MT-TVTSA---QANKLYQVALFTAANRNRS-MVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSI 75 (404) T ss_pred CC-CcCCc---chhhhHHHHHHHHHhcCCh-hHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeE Confidence 11 11000 111122111100 000111 12333222211111111 11111112344667899999875 Q ss_pred ccc----cccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 71 GDT----TELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFAT 146 (319) Q Consensus 71 i~~----~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~ 146 (319) +.- ...+|.... .+.++++..+.+++|||-|---..=..|+..-+. .++..........-+....|+-.|-. T Consensus 76 ~~~L~g~gv~Gd~~lE--Gnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~--~dlr~~ar~~L~~w~~~~~d~~~~~~ 151 (404) T protein:vir:81 76 MHKLSKRPTMGDERVE--GRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTK--FNLASSARTLLGTYFNDLQDQCAIVH 151 (404) T ss_pred eeecccCCcccCceee--ccccceeEEeeEEEEeeecccccccCchhhhhhH--HHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 532 234454322 2356788889999999987442222456644444 45566666777777778888887776 Q ss_pred HHhccCc----------------------------cc----------cccCCHhH--HHHHHHHHHHHHHhccCC-C--- Q lcl|Aclame:pro 147 LARNKAK----------------------------HL----------TVGTGSDA--QYDAVLDVSVELDEIKAP-E--- 182 (319) Q Consensus 147 la~~a~~----------------------------~~----------~~~~T~~n--~~~~i~~a~~~Lde~~VP-~--- 182 (319) |+...+. .+ ...++.++ .++.|+.+.+.+++..-| . 
T Consensus 152 laG~rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~ 231 (404) T protein:vir:81 152 LAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVR 231 (404) T ss_pred HhccccccccccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceE Confidence 6633210 00 01122222 266677888888774333 1 Q ss_pred --C---------cEEEEChHHHHHHhhhhh------hhhcccc----cccceeeeeeeeecCeEEEEeccc---c----- Q lcl|Aclame:pro 183 --N---------RVLFVSPTFYKGIKKFVI------ALPQGDT----RQQVLGKGVQGELDGFVIVKVPTK---L----- 233 (319) Q Consensus 183 --~---------R~l~VsP~~~~~L~~~~~------f~~~~~~----~~~~~~~g~Vg~idG~~I~~vps~---~----- 233 (319) + ++|+++|..+..|+.++. +.+.... .++.+..|.+|+++|+.|.+-|+. . T Consensus 232 ~~g~~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~ 311 (404) T protein:vir:81 232 LSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSK 311 (404) T ss_pred eccccccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccce Confidence 1 678999999999999863 3332111 246789999999999999863321 1 Q ss_pred --------cc-----------cceEEEEcCCceeeeeeeeeeeee------cC--CCCCccc------eeeee------e Q lcl|Aclame:pro 234 --------LQ-----------GLQAIAVVGEVLASPIQADLAKTN------SN--IPGMFGT------LAEQL------L 274 (319) Q Consensus 234 --------~~-----------~~n~i~~~~~A~~~~~k~~~~~~~------~~--~~~~~~~------~v~gr------~ 274 (319) .. ...+++|...+..+..|..-.+.+ +. ......+ +.+.- . 
T Consensus 312 ~~~~~n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~i~~~~i~G~kK~rF~~~~g~~~ 391 (404) T protein:vir:81 312 VLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQ 391 (404) T ss_pred eeecCCccccccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCchhhhhhHHHhhhhhccccCCCCcee Confidence 00 023566665554444443222110 10 0000011 11111 2 Q ss_pred eeeEEEeccccce Q lcl|Aclame:pro 275 YTGAFVPEHLQKY 287 (319) Q Consensus 275 ~yg~~V~~~k~~~ 287 (319) =||+.|++.-.+- T Consensus 392 DfGvi~idta~~~ 404 (404) T protein:vir:81 392 DHGVIAVDTAVKL 404 (404) T ss_pred eEEEEEecccccC Confidence 4555555543332 No 157 >protein:vir:10123 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859253;genbank:gi:32171009;genbank:GeneID:2653345 Probab=96.60 E-value=0.00048 Score=38.77 Aligned_cols=278 Identities=13% Similarity=0.064 Sum_probs=133.9 Q ss_pred CCcccccccceeeehhhhhhhhh-cchhhhhhhHhhHHHHHHHHHhhh---------hhhhcccCcceeeeCCceEEeee Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKS-VEPGQTLLKNKHVGILERVTAVNA---------YSTPALISNDAIFMEGRSFTVMK 70 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~-~~~n~~~l~~ky~~lld~~~~~~s---------l~~~~~~n~~~~~~~g~tVkIp~ 70 (319) |- ++-.+ .-|-+|+-|.=- ...|+- ..++|.+.+..-...++ ...+-..-+++....|++|+++= T Consensus 1 ~~-~~~~~---~a~~~~~~~lft~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L 75 (404) T protein:vir:10 1 MT-TVTSA---QANKLYQVALFTAANRNRS-MVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSI 75 (404) T ss_pred CC-CcCCc---chhhhHHHHHHHHHhcCCh-hHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeE Confidence 11 11000 111122111100 000111 12333222211111111 11111112344667899999875 Q ss_pred ccc----cccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 71 GDT----TELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFAT 146 (319) Q Consensus 71 
i~~----~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~ 146 (319) +.- ...+|.... .+.++++..+.+++|||-|---..=..|+..-+. .++..........-+....|+-.|-. T Consensus 76 ~~~L~g~gv~Gd~~lE--Gnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~--~dlr~~ar~~L~~w~~~~~d~~~~~~ 151 (404) T protein:vir:10 76 MHKLSKRPTMGDERVE--GRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTK--FNLASSARTLLGTYFNDLQDQCAIVH 151 (404) T ss_pred eeecccCCcccCceee--ccccceeEEeeEEEEeeecccccccCchhhhhhH--HHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 532 234454322 2356788889999999987442222456644444 45566666777777778888887776 Q ss_pred HHhccCc----------------------------cc----------cccCCHhH--HHHHHHHHHHHHHhccCC-C--- Q lcl|Aclame:pro 147 LARNKAK----------------------------HL----------TVGTGSDA--QYDAVLDVSVELDEIKAP-E--- 182 (319) Q Consensus 147 la~~a~~----------------------------~~----------~~~~T~~n--~~~~i~~a~~~Lde~~VP-~--- 182 (319) |+...+. .+ ...++.++ .++.|+.+.+.+++..-| . T Consensus 152 laG~rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~ 231 (404) T protein:vir:10 152 LAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVR 231 (404) T ss_pred HhccccccccccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceE Confidence 6633210 00 01122222 266677888888774333 1 Q ss_pred --C---------cEEEEChHHHHHHhhhhh------hhhcccc----cccceeeeeeeeecCeEEEEeccc---c----- Q lcl|Aclame:pro 183 --N---------RVLFVSPTFYKGIKKFVI------ALPQGDT----RQQVLGKGVQGELDGFVIVKVPTK---L----- 233 (319) Q Consensus 183 --~---------R~l~VsP~~~~~L~~~~~------f~~~~~~----~~~~~~~g~Vg~idG~~I~~vps~---~----- 233 (319) + ++|+++|..+..|+.++. +.+.... .++.+..|.+|+++|+.|.+-|+. . 
T Consensus 232 ~~g~~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~ 311 (404) T protein:vir:10 232 LSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSK 311 (404) T ss_pred eccccccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccce Confidence 1 678999999999999863 3332111 246789999999999999863321 1 Q ss_pred --------cc-----------cceEEEEcCCceeeeeeeeeeeee------cC--CCCCccc------eeeee------e Q lcl|Aclame:pro 234 --------LQ-----------GLQAIAVVGEVLASPIQADLAKTN------SN--IPGMFGT------LAEQL------L 274 (319) Q Consensus 234 --------~~-----------~~n~i~~~~~A~~~~~k~~~~~~~------~~--~~~~~~~------~v~gr------~ 274 (319) .. ...+++|...+..+..|..-.+.+ +. ......+ +.+.- . T Consensus 312 ~~~~~n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~i~~~~i~G~kK~rF~~~~g~~~ 391 (404) T protein:vir:10 312 VLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQ 391 (404) T ss_pred eeecCCccccccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCchhhhhhHHHhhhhhccccCCCCcee Confidence 00 023566665554444443222110 10 0000011 11111 2 Q ss_pred eeeEEEeccccce Q lcl|Aclame:pro 275 YTGAFVPEHLQKY 287 (319) Q Consensus 275 ~yg~~V~~~k~~~ 287 (319) =||+.|++.-.+- T Consensus 392 DfGvi~idta~~~ 404 (404) T protein:vir:10 392 DHGVIAVDTAVKL 404 (404) T ss_pred eEEEEEecccccC Confidence 4555555543332 No 158 >protein:vir:3298 Length: 404 # NCBI annotation: hypothetical protein # Family: family:all:974 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049514;genbank:gi:9632520;genbank:GeneID:1262006 Probab=96.60 E-value=0.00048 Score=38.77 Aligned_cols=278 Identities=13% Similarity=0.064 Sum_probs=133.9 Q ss_pred CCcccccccceeeehhhhhhhhh-cchhhhhhhHhhHHHHHHHHHhhh---------hhhhcccCcceeeeCCceEEeee Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKS-VEPGQTLLKNKHVGILERVTAVNA---------YSTPALISNDAIFMEGRSFTVMK 70 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~-~~~n~~~l~~ky~~lld~~~~~~s---------l~~~~~~n~~~~~~~g~tVkIp~ 70 (319) |- ++-.+ 
.-|-+|+-|.=- ...|+- ..++|.+.+..-...++ ...+-..-+++....|++|+++= T Consensus 1 ~~-~~~~~---~a~~~~~~~lft~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L 75 (404) T protein:vir:32 1 MT-TVTSA---QANKLYQVALFTAANRNRS-MVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSI 75 (404) T ss_pred CC-CcCCc---chhhhHHHHHHHHHhcCCh-hHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeE Confidence 11 11000 111122111100 000111 12333222211111111 11111112344667899999875 Q ss_pred ccc----cccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 71 GDT----TELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFAT 146 (319) Q Consensus 71 i~~----~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~ 146 (319) +.- ...+|.... .+.++++..+.+++|||-|---..=..|+..-+. .++..........-+....|+-.|-. T Consensus 76 ~~~L~g~gv~Gd~~lE--Gnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~--~dlr~~ar~~L~~w~~~~~d~~~~~~ 151 (404) T protein:vir:32 76 MHKLSKRPTMGDERVE--GRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTK--FNLASSARTLLGTYFNDLQDQCAIVH 151 (404) T ss_pred eeecccCCcccCceee--ccccceeEEeeEEEEeeecccccccCchhhhhhH--HHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 532 234454322 2356788889999999987442222456644444 45566666777777778888887776 Q ss_pred HHhccCc----------------------------cc----------cccCCHhH--HHHHHHHHHHHHHhccCC-C--- Q lcl|Aclame:pro 147 LARNKAK----------------------------HL----------TVGTGSDA--QYDAVLDVSVELDEIKAP-E--- 182 (319) Q Consensus 147 la~~a~~----------------------------~~----------~~~~T~~n--~~~~i~~a~~~Lde~~VP-~--- 182 (319) |+...+. .+ ...++.++ .++.|+.+.+.+++..-| . 
T Consensus 152 laG~rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~ 231 (404) T protein:vir:32 152 LAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVR 231 (404) T ss_pred HhccccccccccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceE Confidence 6633210 00 01122222 266677888888774333 1 Q ss_pred --C---------cEEEEChHHHHHHhhhhh------hhhcccc----cccceeeeeeeeecCeEEEEeccc---c----- Q lcl|Aclame:pro 183 --N---------RVLFVSPTFYKGIKKFVI------ALPQGDT----RQQVLGKGVQGELDGFVIVKVPTK---L----- 233 (319) Q Consensus 183 --~---------R~l~VsP~~~~~L~~~~~------f~~~~~~----~~~~~~~g~Vg~idG~~I~~vps~---~----- 233 (319) + ++|+++|..+..|+.++. +.+.... .++.+..|.+|+++|+.|.+-|+. . T Consensus 232 ~~g~~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~ 311 (404) T protein:vir:32 232 LSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSK 311 (404) T ss_pred eccccccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccce Confidence 1 678999999999999863 3332111 246789999999999999863321 1 Q ss_pred --------cc-----------cceEEEEcCCceeeeeeeeeeeee------cC--CCCCccc------eeeee------e Q lcl|Aclame:pro 234 --------LQ-----------GLQAIAVVGEVLASPIQADLAKTN------SN--IPGMFGT------LAEQL------L 274 (319) Q Consensus 234 --------~~-----------~~n~i~~~~~A~~~~~k~~~~~~~------~~--~~~~~~~------~v~gr------~ 274 (319) .. ...+++|...+..+..|..-.+.+ +. ......+ +.+.- . 
T Consensus 312 ~~~~~n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~i~~~~i~G~kK~rF~~~~g~~~ 391 (404) T protein:vir:32 312 VLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQ 391 (404) T ss_pred eeecCCccccccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCchhhhhhHHHhhhhhccccCCCCcee Confidence 00 023566665554444443222110 10 0000011 11111 2 Q ss_pred eeeEEEeccccce Q lcl|Aclame:pro 275 YTGAFVPEHLQKY 287 (319) Q Consensus 275 ~yg~~V~~~k~~~ 287 (319) =||+.|++.-.+- T Consensus 392 DfGvi~idta~~~ 404 (404) T protein:vir:32 392 DHGVIAVDTAVKL 404 (404) T ss_pred eEEEEEecccccC Confidence 4555555543332 No 159 >protein:vir:104439 Length: 404 # NCBI annotation: putative virion structural protein # Family: family:all:974 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794063;genbank:gi:116222008;genbank:GeneID:4397504 Probab=96.60 E-value=0.00048 Score=38.77 Aligned_cols=278 Identities=13% Similarity=0.064 Sum_probs=133.9 Q ss_pred CCcccccccceeeehhhhhhhhh-cchhhhhhhHhhHHHHHHHHHhhh---------hhhhcccCcceeeeCCceEEeee Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKS-VEPGQTLLKNKHVGILERVTAVNA---------YSTPALISNDAIFMEGRSFTVMK 70 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~-~~~n~~~l~~ky~~lld~~~~~~s---------l~~~~~~n~~~~~~~g~tVkIp~ 70 (319) |- ++-.+ .-|-+|+-|.=- ...|+- ..++|.+.+..-...++ ...+-..-+++....|++|+++= T Consensus 1 ~~-~~~~~---~a~~~~~~~lft~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~I~~~~dL~K~aGd~vtf~L 75 (404) T protein:vir:10 1 MT-TVTSA---QANKLYQVALFTAANRNRS-MVNILTEQQEAPKAVSPDKKSTKQTSAGAPVVRITDLNKQAGDEVTFSI 75 (404) T ss_pred CC-CcCCc---chhhhHHHHHHHHHhcCCh-hHhhhhhhhhhhhhhccchhhccCCCCCccEEEeecCCCCCCcEEEEeE Confidence 11 11000 111122111100 000111 12333222211111111 11111112344667899999875 Q ss_pred ccc----cccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 71 GDT----TELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFAT 146 (319) Q Consensus 71 
i~~----~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~ 146 (319) +.- ...+|.... .+.++++..+.+++|||-|---..=..|+..-+. .++..........-+....|+-.|-. T Consensus 76 ~~~L~g~gv~Gd~~lE--Gnee~L~~~s~~i~Idq~r~~V~~~g~msqQRt~--~dlr~~ar~~L~~w~~~~~d~~~~~~ 151 (404) T protein:vir:10 76 MHKLSKRPTMGDERVE--GRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTK--FNLASSARTLLGTYFNDLQDQCAIVH 151 (404) T ss_pred eeecccCCcccCceee--ccccceeEEeeEEEEeeecccccccCchhhhhhH--HHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 532 234454322 2356788889999999987442222456644444 45566666777777778888887776 Q ss_pred HHhccCc----------------------------cc----------cccCCHhH--HHHHHHHHHHHHHhccCC-C--- Q lcl|Aclame:pro 147 LARNKAK----------------------------HL----------TVGTGSDA--QYDAVLDVSVELDEIKAP-E--- 182 (319) Q Consensus 147 la~~a~~----------------------------~~----------~~~~T~~n--~~~~i~~a~~~Lde~~VP-~--- 182 (319) |+...+. .+ ...++.++ .++.|+.+.+.+++..-| . T Consensus 152 laG~rg~~~n~~~~vp~~~~~~~~~~~~N~v~APt~~r~~~~g~at~~~~l~stD~~s~~~Id~~~~~~~~~~~pi~Pv~ 231 (404) T protein:vir:10 152 LAGARGDFVADDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFEQIEAADIFSIGLVDNLSLFIDEMAHPLQPVR 231 (404) T ss_pred HhccccccccccceeeccccccccceeecccCCCCCCcEEeccCccchhhhhhcccccHHHHHHHHHHHHHhCCCCcceE Confidence 6633210 00 01122222 266677888888774333 1 Q ss_pred --C---------cEEEEChHHHHHHhhhhh------hhhcccc----cccceeeeeeeeecCeEEEEeccc---c----- Q lcl|Aclame:pro 183 --N---------RVLFVSPTFYKGIKKFVI------ALPQGDT----RQQVLGKGVQGELDGFVIVKVPTK---L----- 233 (319) Q Consensus 183 --~---------R~l~VsP~~~~~L~~~~~------f~~~~~~----~~~~~~~g~Vg~idG~~I~~vps~---~----- 233 (319) + ++|+++|..+..|+.++. +.+.... .++.+..|.+|+++|+.|.+-|+. . 
T Consensus 232 ~~g~~~~~~~~~yV~~~~p~q~~~Lr~dt~~~~w~d~q~~A~a~~rg~~nPlF~G~~gm~ngvii~~~~~~~Irf~~g~~ 311 (404) T protein:vir:10 232 LSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYAGMPIRFYQGSK 311 (404) T ss_pred eccccccCccceEEEEechHHHHHHhhCCCcHHHHHHHHHHhhccccccCCceecCeeEEcCEEEEecCCceeeecccce Confidence 1 678999999999999863 3332111 246789999999999999863321 1 Q ss_pred --------cc-----------cceEEEEcCCceeeeeeeeeeeee------cC--CCCCccc------eeeee------e Q lcl|Aclame:pro 234 --------LQ-----------GLQAIAVVGEVLASPIQADLAKTN------SN--IPGMFGT------LAEQL------L 274 (319) Q Consensus 234 --------~~-----------~~n~i~~~~~A~~~~~k~~~~~~~------~~--~~~~~~~------~v~gr------~ 274 (319) .. ...+++|...+..+..|..-.+.+ +. ......+ +.+.- . T Consensus 312 ~~~~~n~~~a~~~~~aa~~~v~RallLGaQAl~~A~g~~~g~~~~w~Ee~~D~g~~~~i~~~~i~G~kK~rF~~~~g~~~ 391 (404) T protein:vir:10 312 VLVSENNLTATTKEVAAATNIDRAMLLGAQALANAYGQKAGGHFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSGKMQ 391 (404) T ss_pred eeecCCccccccccccccccchhheeecceeEEEEeeccCCCCceeEeeccccCchhhhhhHHHhhhhhccccCCCCcee Confidence 00 023566665554444443222110 10 0000011 11111 2 Q ss_pred eeeEEEeccccce Q lcl|Aclame:pro 275 YTGAFVPEHLQKY 287 (319) Q Consensus 275 ~yg~~V~~~k~~~ 287 (319) =||+.|++.-.+- T Consensus 392 DfGvi~idta~~~ 404 (404) T protein:vir:10 392 DHGVIAVDTAVKL 404 (404) T ss_pred eEEEEEecccccC Confidence 4555555543332 No 160 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=96.37 E-value=0.00069 Score=37.91 Aligned_cols=269 Identities=9% Similarity=-0.024 Sum_probs=120.5 Q ss_pred CCcccccccceeeehhhhhhhhhcchhhhhhhHhh-HHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKH-VGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDY 79 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~l~~ky-~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY 79 (319) |+-+=....| 
...++.+ ..+++.+.....+.. +++ .+...+++++||......-..+ T Consensus 1 ma~~t~~~gg------------------~liP~~~~~~Ii~~~~~~s~l~~--l~~--~~~~~~~~~~~p~~~~~~~a~w 58 (305) T protein:vir:25 1 MADISRAEVA------------------SLIQEAYSDTLLAAAKQGSTVLS--AFQ--NVNMGTKTTHLPVLATLPEADW 58 (305) T ss_pred CCCccCCccc------------------eecCHHHHHHHHHHHHhhchhhh--hcc--eeeccCCcEEEEEEeCCcceEE Confidence 3222111111 1133334 344444444333322 222 3445677899999876544433 Q ss_pred cCCC-CcccCC---cccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh------ Q lcl|Aclame:pro 80 KRNA-TNEFDH---PKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLAR------ 149 (319) Q Consensus 80 ~r~~-~~~~~~---~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~------ 149 (319) ...+ ....+. -+.++...++.-.|...+ +.--+.-.-....++...+.+...++++-.+|...+.---+ T Consensus 59 v~E~~~~~~~~~~~s~~~f~~i~~~~~k~~~~-~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~ 137 (305) T protein:vir:25 59 VGESATDPKGVKPTSKVTWANRTLVAEEIAVI-IPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVS 137 (305) T ss_pred eecccccccccccccccceeeEEeeeEEEEEe-ehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheeccCCCCCccc Confidence 2221 111111 123444555555553333 22112111111245567778888889999999877631000 Q ss_pred -----ccCc--c----ccccCCHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeee Q lcl|Aclame:pro 150 -----NKAK--H----LTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQ 218 (319) Q Consensus 150 -----~a~~--~----~~~~~T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~V 218 (319) .+.. . .....+..++++.+..+...+...+..... ++++|..+..|.+-.+ ..|...... 
T Consensus 138 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~v~~~~~~~~l~~lkd-----~~G~~i~~~--- 208 (305) T protein:vir:25 138 PALIPAAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGWAPDT-LLSSLALRYEVANIRD-----ANGNPVFRD--- 208 (305) T ss_pred cccccccccccccccccccchhhhHHHHHHHHHHHhhhhcccccce-eEecHHHHHHHHHhhc-----cCCceeecC--- Confidence 0000 0 001112234555555555555544333223 6789999988864322 112222223 Q ss_pred eeecCeEEEEeccc--ccccceEEEEcCCceeeee-eeeeeeeec-----CCCC------CccceeeeeeeeeEEEeccc Q lcl|Aclame:pro 219 GELDGFVIVKVPTK--LLQGLQAIAVVGEVLASPI-QADLAKTNS-----NIPG------MFGTLAEQLLYTGAFVPEHL 284 (319) Q Consensus 219 g~idG~~I~~vps~--~~~~~n~i~~~~~A~~~~~-k~~~~~~~~-----~~~~------~~~~~v~gr~~yg~~V~~~k 284 (319) +.+.|.+++.+++. ...+..++++..+-..... +=-.+++.+ .... ++.-.+|-..++|..|.+|+ T Consensus 209 ~~l~G~Pv~~~~~~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~ 288 (305) T protein:vir:25 209 DSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSA 288 (305) T ss_pred CcccccceEEcCccCCCCCccEEEEEecceEEEEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEEeecceeeCcc Confidence 36899999865331 1123345555544332221 111122221 1110 12236677788999999998 Q ss_pred cceEEEEccccccCCCCCcc Q lcl|Aclame:pro 285 QKYIFTIGGTEVATKRDGVD 304 (319) Q Consensus 285 ~~~Iy~~~~~~~a~~~~~~~ 304 (319) .... 
....+ .+...+.+ T Consensus 289 a~v~--~~~~~-~~~~~pa~ 305 (305) T protein:vir:25 289 TAQG--ANKTP-VAVVAPAA 305 (305) T ss_pred cEEE--Ecccc-ccccCCCC Confidence 6333 22221 11222222 No 161 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=95.92 E-value=0.0013 Score=36.49 Aligned_cols=279 Identities=10% Similarity=0.044 Sum_probs=129.5 Q ss_pred CCcccccccc---eeeehhhhhhhhhcchh-hhhhhHhhHHHHHHHHH--hhhhhhhcccCcceeee-CCceEEeeeccc Q lcl|Aclame:pro 1 MNKTIKNATG---MLKLNLQHFANKSVEPG-QTLLKNKHVGILERVTA--VNAYSTPALISNDAIFM-EGRSFTVMKGDT 73 (319) Q Consensus 1 ~~~~~~~~~~---~~~~~~~~~~~~~~~~n-~~~l~~ky~~lld~~~~--~~sl~~~~~~n~~~~~~-~g~tVkIp~i~~ 73 (319) |+|.+++..- ++-|--||........- -.-+++..+.+..+++. ...+++..+..=.-... +..++....... T Consensus 6 ~~~~~~~d~~~~~~~a~~~~~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~~~~~~~t~~~~~~ 85 (329) T protein:vir:79 6 MSKEMKYDEFEANVIANHMQLRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELSDTDKTFEYQTFDK 85 (329) T ss_pred hhhhhccchhhhhhHhhhcccccceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCCCceeEEEeeeeec Confidence 7777776532 22223333322222111 12234443434444442 22233333222100111 334777787777 Q ss_pred ccccc-ccC-CCCcccCCcccceeEEEEee-cccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 74 TELKD-YKR-NATNEFDHPKIEETTYFLDQ-EKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARN 150 (319) Q Consensus 74 ~g~~D-Y~r-~~~~~~~~~t~t~~tltidq-dr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~ 150 (319) .|... |.- +.....-+++.++....+-+ ..+|.+.+.++...+..+ +++.....+-++.++.-..|+..|-=-... 
T Consensus 86 ~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g-~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~ 164 (329) T protein:vir:79 86 VGHAKIIADYTDDLSTVDALMTSEFGKVFRLGNAFLISIDEIKAGQRTG-KSLSTRKANAAQNAHDQLVNHLVFKGSKPH 164 (329) T ss_pred ceeeeeecCcccccceeecccceeEEEEEEEEEEEEecHHHHHHHHHhC-CChHHHHHHHHHHHHHHhhccEEEeecccc Confidence 66432 322 12233345666666666655 567777777777776544 444444455566666666666544110100 Q ss_pred c---------------Ccc---ccccCCHhHHHHHHHHHHHHHHhc--cCCCCcEEEEChHHHHHHhh-hhhhhhccccc Q lcl|Aclame:pro 151 K---------------AKH---LTVGTGSDAQYDAVLDVSVELDEI--KAPENRVLFVSPTFYKGIKK-FVIALPQGDTR 209 (319) Q Consensus 151 a---------------~~~---~~~~~T~~n~~~~i~~a~~~Lde~--~VP~~R~l~VsP~~~~~L~~-~~~f~~~~~~~ 209 (319) . +.. .-...|.+.+++-|.++..++.+. ++-..-.|+++|+.|..|.. .+.. ..... T Consensus 165 g~~GLlN~p~v~~~~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~Lpp~~~~~L~~~~~~~--~~tvl 242 (329) T protein:vir:79 165 KIISVFEHPNLTTINSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILIPPSMRKVLMVRMPET--TMSYL 242 (329) T ss_pred cceeeecCCCccccccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHHhhcccCCC--CccHH Confidence 0 000 011236788999999999999875 43344579999999988842 1111 00111 Q ss_pred ccceeeeeeeeecCeEEEEecccc---cccce-EEEEcCCc--e--eeeeeeeeeeeecCCCCC-ccceee-eeeeeeEE Q lcl|Aclame:pro 210 QQVLGKGVQGELDGFVIVKVPTKL---LQGLQ-AIAVVGEV--L--ASPIQADLAKTNSNIPGM-FGTLAE-QLLYTGAF 279 (319) Q Consensus 210 ~~~~~~g~Vg~idG~~I~~vps~~---~~~~n-~i~~~~~A--~--~~~~k~~~~~~~~~~~~~-~~~~v~-gr~~yg~~ 279 (319) +-...++ ...+|..+|--. ..+.+ +++...+. + ..+.++. .. |.|.+ ..+.+. .-.++|+. 
T Consensus 243 ~~lk~~~-----~~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~---~l-~~q~~~~~~~v~~~~r~~Gv~ 313 (329) T protein:vir:79 243 DYFKQQN-----GGITIESISELEDIDGAGTKAALVYEKDPMNMSIEIPEAFN---ML-TAQPKDLHFKVPCTSKCTGLT 313 (329) T ss_pred HHHHHhC-----CCcEEEEcccccccCCCCceEEEEEecCCceEEEecCccee---ee-eceecCceEEEceeeeEEEEE Confidence 1111111 123344333211 11112 22222211 1 1122222 22 22322 345554 35677899 Q ss_pred Eeccccc----eEEEE Q lcl|Aclame:pro 280 VPEHLQK----YIFTI 291 (319) Q Consensus 280 V~~~k~~----~Iy~~ 291 (319) |.+|... ||.+. T Consensus 314 i~~P~ai~~~dGI~~~ 329 (329) T protein:vir:79 314 IYRPLTLVLIKGLVVG 329 (329) T ss_pred EECcceeeeeeeeeeC Confidence 9999863 34333 No 162 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=95.86 E-value=0.0013 Score=36.33 Aligned_cols=266 Identities=8% Similarity=-0.070 Sum_probs=122.3 Q ss_pred hhhhhhcchhhhhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccccccccC-CCCcccCCccccee Q lcl|Aclame:pro 18 HFANKSVEPGQTLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKR-NATNEFDHPKIEET 95 (319) Q Consensus 18 ~~~~~~~~~n~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~r-~~~~~~~~~t~t~~ 95 (319) |-... ......+++.|. .+++.+ ...+.... ++. .+..+++.++||.+...+-..+.. ++.....+++. . 
T Consensus 1 Mat~t--t~~g~~vP~~~~~~ii~~~-~~~s~l~~-~~~--~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f--~ 72 (311) T protein:vir:99 1 MATFG--TGNLKNLPRNIADGMVKDV-VQGSTVAV-LSA--RKPQRFGNEDIITFNGRPKAEFVGEGQQKSSTTGEF--D 72 (311) T ss_pred Cceec--CCCceeccHHHHHHHHHHH-Hhhchhhh-hcc--eeeccCCceEEEEEeCCceeEEeecCccccccccee--e Confidence 32211 122223344443 333333 32332221 222 244556778999986554444332 22233334444 4 Q ss_pred EEEEeecccceeecchhhHHH--H-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcc------------cc---c Q lcl|Aclame:pro 96 TYFLDQEKYWGRFVDALDRKD--T-EGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKH------------LT---V 157 (319) Q Consensus 96 tltidqdr~~~F~VD~~D~~e--t-~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~------------~~---~ 157 (319) ..+++-.|.-.+ +.--+.-. + .....+.+.+.+..+++++-.+|+-.+.-.-...+.. .. . T Consensus 73 ~v~l~~~k~~~~-~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~ 151 (311) T protein:vir:99 73 FVTSTPKKAQVT-MRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVELT 151 (311) T ss_pred EEEEeeEEEEEe-ehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceeecc Confidence 444444443332 22222110 0 0123456777888889999999987663211111100 00 1 Q ss_pred cCCHhHHHHHHHHHHHHHHhccCC-CCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEEEEecccc--- Q lcl|Aclame:pro 158 GTGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKL--- 233 (319) Q Consensus 158 ~~T~~n~~~~i~~a~~~Lde~~VP-~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~--- 233 (319) ..+....+..+.++...+..++.. ..--++++|..+..|.+-.+-... -..+.....+..+++.|.+++.+++-. 
T Consensus 152 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~-~l~~~~~~~~~~~~l~G~Pv~~s~~i~~~~ 230 (311) T protein:vir:99 152 ADTIANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYTDGR-KKFPELGLGIGVSSFEGIDASVSDTVNGGD 230 (311) T ss_pred ccccchhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhccCCC-eeecCcccCCCCceecceeeEeeccccccc Confidence 112233455566666666655444 222378899999888654321100 011222334566789999998643211 Q ss_pred c-----------ccceEEEEcCC-ce-eeeeeeeeeeeecCCC-C-------CccceeeeeeeeeEEEeccccceEEEEc Q lcl|Aclame:pro 234 L-----------QGLQAIAVVGE-VL-ASPIQADLAKTNSNIP-G-------MFGTLAEQLLYTGAFVPEHLQKYIFTIG 292 (319) Q Consensus 234 ~-----------~~~n~i~~~~~-A~-~~~~k~~~~~~~~~~~-~-------~~~~~v~gr~~yg~~V~~~k~~~Iy~~~ 292 (319) . ....+++|.-+ ++ ....+--.++..+... + ++.-.+|...++|..|.+|+ ++.+.. T Consensus 231 ~~~~~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~~~~--~v~~~~ 308 (311) T protein:vir:99 231 EADPDDEDLDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYGWYVFTDR--FVVIEN 308 (311) T ss_pred ccccccchhhccCcceEEEeeccccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEeecceecChh--Heeeec Confidence 0 11233444322 22 2233333444433111 1 12236788899999988764 232222 Q ss_pred ccc Q lcl|Aclame:pro 293 GTE 295 (319) Q Consensus 293 ~~~ 295 (319) .++ T Consensus 309 ~~A 311 (311) T protein:vir:99 309 AVA 311 (311) T ss_pred ccC Confidence 222 No 163 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=95.84 E-value=0.0014 Score=36.26 Aligned_cols=279 Identities=12% Similarity=0.010 Sum_probs=124.7 Q ss_pred CCcccccccceeeehhhhhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccc--ccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTE--LKD 78 (319) Q Consensus 1 
~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g--~~D 78 (319) |||-.. .++-+ +..--.--|+.|. .++.+++++...+.+-+ .+ +.+.-.+..+.+||+++... ..- T Consensus 4 ~~~~~~-~~k~i--t~~d~~gG~L~P~------~~~~~i~~l~e~s~i~~--~a-~vi~t~~s~~~~i~~i~~g~~~~~~ 71 (314) T protein:vir:41 4 LNKPFQ-ITPKI--DVPDLGKGILAVQ------RFGEFVREVRENSAIIK--DA-RVLNALKSYEVDISRISLGVELEPG 71 (314) T ss_pred hhhHHH-hhccc--ccccCCCceeChH------HHHHHHHHHHhccchhh--he-eeecccCccceeecccccCcccccc Confidence 454433 22211 1111112233443 23334444433222221 11 11122356789999987532 112 Q ss_pred ccCCCC---cccCCcccceeEEEEeecccceeecchhh--HHHHhhhHHHHHHHHHHHHHHHHH-HHHHHHHHHH----- Q lcl|Aclame:pro 79 YKRNAT---NEFDHPKIEETTYFLDQEKYWGRFVDALD--RKDTEGNIDINYVVARQGAEVVAP-YLDNLRFATL----- 147 (319) Q Consensus 79 Y~r~~~---~~~~~~t~t~~tltidqdr~~~F~VD~~D--~~et~~~~~~~~~~~~~~~~~vap-eiD~~~~s~l----- 147 (319) .+-.+. .+..+++....+|.+.+=. ..|+--+ .+++....++.+.+....+..+.- +.+.++-+.- T Consensus 72 ~~~~~~~~~~~~~~~tf~~~~l~~~kl~---~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~ 148 (314) T protein:vir:41 72 RNTSGTKVAPTADEVTVSTNTLEMKELV---TKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTG 148 (314) T ss_pred cccccCCccCCcccccccceeeeeEEEE---EeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCc Confidence 221111 1223455555555553322 2233211 111111123444444444444443 3344332211 Q ss_pred ----------HhccCcc--ccccCCHhHHHHHHHHHHHHHHhc--cCCCCcEEEEChHHHHHHhhhhhhhhcccccccce Q lcl|Aclame:pro 148 ----------ARNKAKH--LTVGTGSDAQYDAVLDVSVELDEI--KAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVL 213 (319) Q Consensus 148 ----------a~~a~~~--~~~~~T~~n~~~~i~~a~~~Lde~--~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~ 213 (319) ...++.. ...+.+..+..+.+.++...|... +-+.+-+++|+++.+..+++-..- +....++... 
T Consensus 149 ~~~~~~p~G~l~~a~~~~~~~~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~-~~~~l~~~~~ 227 (314) T protein:vir:41 149 RELYRINDGWMKLAGNQYTDAEPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLV-RETGLGDSAL 227 (314) T ss_pred ccchhcchhhhhhcccceeecCccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHhc-cCCcccchhh Confidence 0111111 111223345666677777777553 123355788899998877643211 1122355566 Q ss_pred eeeeeeeecCeEEEEeccc---ccccceEEEEcCCceeeeeeeeeeeeecC-CCCCccceeeeeeeeeEEEeccccceEE Q lcl|Aclame:pro 214 GKGVQGELDGFVIVKVPTK---LLQGLQAIAVVGEVLASPIQADLAKTNSN-IPGMFGTLAEQLLYTGAFVPEHLQKYIF 289 (319) Q Consensus 214 ~~g~Vg~idG~~I~~vps~---~~~~~n~i~~~~~A~~~~~k~~~~~~~~~-~~~~~~~~v~gr~~yg~~V~~~k~~~Iy 289 (319) ..|.-.++.|++|+.+|.- .+...-++++++.-++.... ..+++.+. .....-..+.-++..|+.+.+.....+. T Consensus 228 ~~~~~~~l~G~PV~~~~~~~~~~~~~~~i~fgd~~nlv~~~~-~~ir~~~~~~a~~~~~~~~~~~r~d~~~~~~~aa~~~ 306 (314) T protein:vir:41 228 IGATGLQYDGIPIQYVPALDALGDDKARALLTVPTNLVYGFW-RNIRIEPKRDAAMRRTEYIASLRADCNYEDENAAVAA 306 (314) T ss_pred hCCCCceecceeeEecccccccCCCCceEEEechhheEEEee-ceeEEeecccCcCCeEEEEEEEEeceEEEEcCcEEEE Confidence 6777778999999987642 23445677888877655443 33444321 1122234555566667777665543332 Q ss_pred E-EccccccCCCCCc Q lcl|Aclame:pro 290 T-IGGTEVATKRDGV 303 (319) Q Consensus 290 ~-~~~~~~a~~~~~~ 303 (319) + ...++ + T Consensus 307 ~~~~~~~-------~ 314 (314) T protein:vir:41 307 VIDMSSG-------G 314 (314) T ss_pred EeeccCC-------C Confidence 2 22111 1 No 164 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=95.57 E-value=0.0018 Score=35.59 Aligned_cols=278 Identities=12% Similarity=0.053 Sum_probs=123.6 Q ss_pred CCccccc--------ccceeeehhhh------------h-----hhhhc-----chhhhhhhHh-hH-HHHHHHHHhhhh Q lcl|Aclame:pro 1 MNKTIKN--------ATGMLKLNLQH------------F-----ANKSV-----EPGQTLLKNK-HV-GILERVTAVNAY 48 (319) Q Consensus 1 
~~~~~~~--------~~~~~~~~~~~------------~-----~~~~~-----~~n~~~l~~k-y~-~lld~~~~~~sl 48 (319) |.+.++. +.....++... + ..+.. .....-.+.. +. .+++.+. ..+. T Consensus 309 l~rai~a~a~~~~~~a~~~~e~a~~~a~~~G~~arg~~~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr-~~s~ 387 (632) T protein:vir:96 309 LMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILR-NKAI 387 (632) T ss_pred HHHHHHhhhccchhhhhhhhHHHHHHHHhhhhhhhhhhhhHHHHHHhhhhcccccccccccccccchHHHHHHHh-hcch Confidence 1111110 00000000000 0 00000 0000001111 11 1222221 1111 Q ss_pred hhhcccCcceeeeCCceEEeeeccccccccccCCC-CcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHH Q lcl|Aclame:pro 49 STPALISNDAIFMEGRSFTVMKGDTTELKDYKRNA-TNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVV 127 (319) Q Consensus 49 ~~~~~~n~~~~~~~g~tVkIp~i~~~g~~DY~r~~-~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~ 127 (319) .. .+..+ ..-.....++||+....+-..+...+ .....+ .++..+++.-.+.-.+ |.--..---.....+...+ T Consensus 388 i~-~l~~~-~~~~~~g~~~ip~~~~~~~a~wv~E~~~~~~s~--~~f~~i~l~~~k~~~~-v~iS~ell~ds~~~~~~~i 462 (632) T protein:vir:96 388 IG-QMGAR-MLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSD--FDFTTLSFSPKTIAGA-VPVTRKLRKQSSIHVENLI 462 (632) T ss_pred hh-hhcce-EeecCCcceEEEEEeCCceeEeecCCccccccc--cceeeEEeeeeEEEEe-hhhHHHHHhccchHHHHHH Confidence 11 11111 12223456899988764444433222 222233 3444444444443332 2221111001123556667 Q ss_pred HHHHHHHHHHHHHHHHHHHHH-hccCccc-----ccc---CCHhHHHHHHHHHHHHHHhccCC-CCcEEEEChHHHHHHh Q lcl|Aclame:pro 128 ARQGAEVVAPYLDNLRFATLA-RNKAKHL-----TVG---TGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIK 197 (319) Q Consensus 128 ~~~~~~~vapeiD~~~~s~la-~~a~~~~-----~~~---~T~~n~~~~i~~a~~~Lde~~VP-~~R~l~VsP~~~~~L~ 197 (319) .......++-.+|...+.--. ++..... ... .+...-|+.+.++...+..+++. 
.+-.++++|..+..|+ T Consensus 463 ~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~ 542 (632) T protein:vir:96 463 REDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAK 542 (632) T ss_pred HHHHHHHHHHHHHHHhhcccCCCCccceeeecccccceecccccCCHHHHHHHHHHHhhcccccCccEEEEchhHHHHHH Confidence 778888888889987653100 0100000 011 11223478888999899888776 4556788999888887 Q ss_pred hhhhhhhcccccccceeeeeeeeecCeEEEEecccccccceEEEEcCCceeeeeeeeeeeeecCCCC---Cccceeeeee Q lcl|Aclame:pro 198 KFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQADLAKTNSNIPG---MFGTLAEQLL 274 (319) Q Consensus 198 ~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~~~k~~~~~~~~~~~~---~~~~~v~gr~ 274 (319) +..-.. ..|.-+... +.+.|.+|+. ++.++...+++|.-+-+. .-....+++...++. ...-.++... T Consensus 543 ~~~l~d---~~G~~i~~~---~~l~G~pv~~--s~~ip~~~~~~gd~s~~~-i~~~~~~~i~~~~~~~~~~~~v~~~~~~ 613 (632) T protein:vir:96 543 KAQVFD---NTGERIWQN---NEVNGYRAEA--SNQIPADTWIFGDWSQIV-IAMWGVLDLKVDPYTKAASDGLVLRVFQ 613 (632) T ss_pred HHhccC---CCCceeecC---CeecccceEe--ccccccCcEEEeecceEE-EEEecceEEEEccccccccCceEEEEEe Confidence 643221 122222223 4688999985 344444445555443322 222233444332232 2345888999 Q ss_pred eeeEEEeccccceEEEEcccc Q lcl|Aclame:pro 275 YTGAFVPEHLQKYIFTIGGTE 295 (319) Q Consensus 275 ~yg~~V~~~k~~~Iy~~~~~~ 295 (319) ++|..|.+|+...+ ....+ T Consensus 614 ~~d~~v~~~~af~~--~k~~A 632 (632) T protein:vir:96 614 DVDAGVRRKEAFCI--AKKGA 632 (632) T ss_pred ecCceeechhhhhh--eeecC Confidence 99999999987432 22112 No 165 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=95.48 E-value=0.002 Score=35.41 Aligned_cols=295 Identities=12% Similarity=0.055 Sum_probs=121.0 Q ss_pred CCcccccc-----cceeeehh----hhh-----hhhhc----ch-hhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeee Q lcl|Aclame:pro 1 
MNKTIKNA-----TGMLKLNL----QHF-----ANKSV----EP-GQTLLKNKHVGILERVTAVNAYSTPALISNDAIFM 61 (319) Q Consensus 1 ~~~~~~~~-----~~~~~~~~----~~~-----~~~~~----~~-n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~ 61 (319) ..+.-..+ ..+..... .++ ..+.. .. ...-.++.+...+-+.+...+... .++....... T Consensus 120 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lv~~~~~~~~ii~~l~~~~~i~-~~~~~~~~~~ 198 (477) T protein:vir:84 120 VGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYRDLDRNGGTGGYAVPPLWMMNRFIELARAGRTYA-NLCPTEPLPG 198 (477) T ss_pred hhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhccccccCCCcceeeccchhHHHHHHHhhhcchHH-HhhceeeecC Confidence 00000000 00000000 000 00000 00 000111222222222222222221 1223222344 Q ss_pred CCceEEeeeccccccccc-cCCCCc-c---cCCcccceeEEEEeecccceeecchhhHHHHhh-hHHHHHHHHHHHHHHH Q lcl|Aclame:pro 62 EGRSFTVMKGDTTELKDY-KRNATN-E---FDHPKIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDINYVVARQGAEVV 135 (319) Q Consensus 62 ~g~tVkIp~i~~~g~~DY-~r~~~~-~---~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~~~~~~~~~~~v 135 (319) ++..++||++...+..-| ...++- . ....+.++..++++-.+.-.+. . +..+-... ...+...+.+..++.+ T Consensus 199 ~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~f~~i~~~~~k~~~~~-~-iS~ell~ds~~~l~~~i~~~l~~~~ 276 (477) T protein:vir:84 199 GTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLTDGFVQANVKTIAGQQ-G-IAIQLLDQAAVSVDEFVFRDLAADY 276 (477) T ss_pred CcceeEEEEEecCcceeeeeccCcccccccccccccceeeEEEeeeeEEeee-H-HHHHHHhccchhHHHHHHHHHHHHH Confidence 667899999865433222 222111 1 1122334555566655544432 2 23222111 2355667778888889 Q ss_pred HHHHHHHHHHH-HHhcc--------Cccc-c---ccCC---HhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhh Q lcl|Aclame:pro 136 APYLDNLRFAT-LARNK--------AKHL-T---VGTG---SDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKF 199 (319) Q Consensus 136 apeiD~~~~s~-la~~a--------~~~~-~---~~~T---~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~ 199 (319) +-.+|..++.- -.++. +... 
+ .+.| .+..++.|.++...++....-....++++|..+..|.+- T Consensus 277 ~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~t~~~~~~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l 356 (477) T protein:vir:84 277 ANKLNVQVISGTGSNNQVVGVRATAGITQVTATSAGSALEKHQIIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHAI 356 (477) T ss_pred HHHHHHHHhccCCCCCccceeeeccccccccccccccchhhHHHHHHHHHHHHhhccccccCCccEEEEcHHHHHHHHHh Confidence 88898865520 00000 0000 0 0111 124566667766666554333456789999988887654 Q ss_pred hhhhh------ccccc------ccceeeeeeeeecCeEEEEec---ccccc---cceEEEEcCCceeeeeeeeeeeeecC Q lcl|Aclame:pro 200 VIALP------QGDTR------QQVLGKGVQGELDGFVIVKVP---TKLLQ---GLQAIAVVGEVLASPIQADLAKTNSN 261 (319) Q Consensus 200 ~~f~~------~~~~~------~~~~~~g~Vg~idG~~I~~vp---s~~~~---~~n~i~~~~~A~~~~~k~~~~~~~~~ 261 (319) .+-.. ..... .....+|..|++.|.+|+.++ .+... ...++++.-+...... ..+++... T Consensus 357 kd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~pVv~s~~~p~~~~~~~d~~~i~~gd~~~~~i~~--~~~~~~~~ 434 (477) T protein:vir:84 357 FAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHGLPVVTDPTLPTTLGTGTDQDVIHVLRASDLALFE--SSVRMRAL 434 (477) T ss_pred hccCCCeeeecCcccccccccccccccccccchhcccceEecCcccccccccCCcceEEEEEeceEEEEe--eceeEEec Confidence 32111 11111 112445667899999999753 22111 1234555544333222 22333332 Q ss_pred CCCCccc---eeeeeeeeeEEEec-cccceEEEEccccccCCC Q lcl|Aclame:pro 262 IPGMFGT---LAEQLLYTGAFVPE-HLQKYIFTIGGTEVATKR 300 (319) Q Consensus 262 ~~~~~~~---~v~gr~~yg~~V~~-~k~~~Iy~~~~~~~a~~~ 300 (319) +....+. .++-+.|++...++ |+...+....+.+.++-. 
T Consensus 435 ~~~~~~~~~~~~~v~~~~~~~~~r~~~afv~~t~~~~~~~~~~ 477 (477) T protein:vir:84 435 QETRAENLSVLLQVYGYLAFTAARFPQSVVEIGGTALTAPTFA 477 (477) T ss_pred cccccccceeeeeehhhhhhhhhccccceEEeecccccccccC Confidence 2333332 33333445555556 776544444433332222 No 166 >protein:vir:105610 Length: 430 # NCBI annotation: virion structural protein # Family: family:all:974 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164307;genbank:gi:56692923;genbank:GeneID:3197221 Probab=95.42 E-value=0.0021 Score=35.26 Aligned_cols=267 Identities=14% Similarity=0.075 Sum_probs=129.5 Q ss_pred hhhh----hhcchhhhhhhHhhHHHHH-HHHHhhhhh-------------------------hhcccCcceeeeCCceEE Q lcl|Aclame:pro 18 HFAN----KSVEPGQTLLKNKHVGILE-RVTAVNAYS-------------------------TPALISNDAIFMEGRSFT 67 (319) Q Consensus 18 ~~~~----~~~~~n~~~l~~ky~~lld-~~~~~~sl~-------------------------~~~~~n~~~~~~~g~tVk 67 (319) |+|- -+-.|... +.|+..|. +..+++++. .+-..-+++....|++|. T Consensus 1 ~~~a~T~~~~~~p~a~---~~ws~~l~~~~~k~~~~~~kl~G~~~~~~~~~~~~~~~~ts~~~pI~r~~dL~K~~GD~Vt 77 (430) T protein:vir:10 1 MTASKTTMRYGDPNAM---IQQAAGLFALCQGRNSTLNRLTGKMPSGTSDAEKKTKGQSSLELPIVQAQDLGRNKGDEVR 77 (430) T ss_pred CcceeeecccCChhHH---HHHHHHHHHHHhhhhhhHHHhhccccccccchhhhccCCCCCCccEEEeccCCCCCccEEE Confidence 3332 12233322 22222221 111111111 112222455677899999 Q ss_pred eeeccc----cccccccCCCCcccCCcccceeEEEEeecccc-eeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 68 VMKGDT----TELKDYKRNATNEFDHPKIEETTYFLDQEKYW-GRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNL 142 (319) Q Consensus 68 Ip~i~~----~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~-~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~ 142 (319) ++-+.- ...+|....+ ..+.++..+..++|||-|-- ... ..|+..-+. 
.++.....+....-+....|+- T Consensus 78 f~L~~~L~g~gv~Gd~~lEG--nee~L~~~~d~l~IDq~R~~V~~g-g~msqQRt~--~dlR~~ar~~L~~w~~~~~Dq~ 152 (430) T protein:vir:10 78 FHFVQPANAFPIMGSEYAEG--KGTGLKIGSDQLRVNQARFPVDLG-DVMSQIRNP--YDLRRLGRPKAKWFMDAYLDQS 152 (430) T ss_pred EeEeeccccCceecCceeec--cccceEEEeeEEEEeeeccccccC-Cchhhhhhh--hHHHHHHHHHHHHHHHHHHHHH Confidence 975542 3356654332 34567888999999998732 211 234444343 3445555556666666677776 Q ss_pred HHHHHHhcc----------------------------Cccc------------------cccCCHhHH--HHHHHHHHHH Q lcl|Aclame:pro 143 RFATLARNK----------------------------AKHL------------------TVGTGSDAQ--YDAVLDVSVE 174 (319) Q Consensus 143 ~~s~la~~a----------------------------~~~~------------------~~~~T~~n~--~~~i~~a~~~ 174 (319) .|-.|+..- .+.+ ...++.+++ ++.|+.+... T Consensus 153 ~~v~laGarg~~~~~~~~~~~~~~~~~~~~~~N~v~aPt~nrh~~~~G~at~~~~~~~~~~sl~stD~~s~~~id~a~~~ 232 (430) T protein:vir:10 153 MLVHLAGARGNHYNKEWCLPLETHPKLADMLVNRVKAPTKNRHFVASADAITGVAPNAGEYNITTADVLDVDVVDSIATY 232 (430) T ss_pred HHHHHhhhhcccccccccccccCCcchhhhhccccCCCCCceeEeecccccccccccccccchhhhcccCHHHHHHHHHH Confidence 666665321 1111 011233333 7778888888 Q ss_pred HHhccCC-------C-C-------cEEEEChHHHHHHhhhhhhhh-c------cccc-ccceeeeeeeeecCeEEEEecc Q lcl|Aclame:pro 175 LDEIKAP-------E-N-------RVLFVSPTFYKGIKKFVIALP-Q------GDTR-QQVLGKGVQGELDGFVIVKVPT 231 (319) Q Consensus 175 Lde~~VP-------~-~-------R~l~VsP~~~~~L~~~~~f~~-~------~~~~-~~~~~~g~Vg~idG~~I~~vps 231 (319) ++..+.| + . ++||++|..+..|+.++.|.. + ...+ ++.+..|.+|+++|+-|++-|. 
T Consensus 233 a~~~~~~i~Pv~v~gd~~~g~~~~yV~~~~p~q~~~Lr~dt~~~~wq~~~~a~a~~g~~nPlF~G~~gm~ngvii~~~~~ 312 (430) T protein:vir:10 233 MDQIELPPPPVKFEGDEAAEDSPIRVLLCSPAQYNSFAKQEKFRSWQAAALARASNAKQHPIFRVDAGLWSNTLIIKMPK 312 (430) T ss_pred HHhhCCCCcceEeecccccCCccEEEEEechHHHHHHhhCcchHHHHHHHHHhhcccccCCceecceeeecCeEEecCCc Confidence 8886532 1 2 678999999999999988742 1 1112 4678999999999999986321 Q ss_pred c----------c------cc----------------cceEEEEcCCceeeeeee--eeee------eecC-C-CCCccc- Q lcl|Aclame:pro 232 K----------L------LQ----------------GLQAIAVVGEVLASPIQA--DLAK------TNSN-I-PGMFGT- 268 (319) Q Consensus 232 ~----------~------~~----------------~~n~i~~~~~A~~~~~k~--~~~~------~~~~-~-~~~~~~- 268 (319) - . .. ...+++|...+..+.-+. ..++ .++. + .....+ T Consensus 313 virf~~g~~~~~~a~~~~~~~~~~~~~a~~~~~~~v~RalllGaQA~~~A~g~~~~~g~~f~w~Ee~~D~g~~~~i~~~~ 392 (430) T protein:vir:10 313 PIRFYAGDTIKYCAAYNSEAESSAVVSDSFGNQYAVDRALLLGGQALAQAWAASEHSGMPFFWSEKDMDHGDKLELLIGA 392 (430) T ss_pred eeeecCCCccccccCCcccccccccccccccccccchhhhhccchhheeeeeccCCCCcceeeeeeccccCchhhhhhhH Confidence 1 0 00 012344444433333331 1111 1111 0 000111 Q ss_pred -----eeeee---------eeeeEEEeccccceEEEEccccccCCCCC Q lcl|Aclame:pro 269 -----LAEQL---------LYTGAFVPEHLQKYIFTIGGTEVATKRDG 302 (319) Q Consensus 269 -----~v~gr---------~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~ 302 (319) +.+.. .=||+.+++.-.+ .-.+.. 
T Consensus 393 i~G~kK~rF~~~~~~~~~~~DfGvi~idtaa~----------~~~~~~ 430 (430) T protein:vir:10 393 ILGCSKIRFAVEATNGLEYTDHGVMAIDTAVK----------IIGPRK 430 (430) T ss_pred HhccceeeecCCCCCCceeeeeEEEEhhhhhh----------hhcCCC Confidence 11111 1244444443222 111111 No 167 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=94.63 E-value=0.0039 Score=33.79 Aligned_cols=284 Identities=11% Similarity=0.011 Sum_probs=118.1 Q ss_pred CCcccccccceeeeh-----hhhhhhhhc--------------------ch---hhhhhhHhhH-HHHHHHHHhhhhhhh Q lcl|Aclame:pro 1 MNKTIKNATGMLKLN-----LQHFANKSV--------------------EP---GQTLLKNKHV-GILERVTAVNAYSTP 51 (319) Q Consensus 1 ~~~~~~~~~~~~~~~-----~~~~~~~~~--------------------~~---n~~~l~~ky~-~lld~~~~~~sl~~~ 51 (319) .+...+...++...+ ..+|..+.. .. .....++.+. .+++.+.....+ . T Consensus 116 ~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l-~- 193 (458) T protein:vir:10 116 GDSVAKALYGTQENFEDEVEKLVLLSYVMEKGVFETEHGQRHLKAVNQSSSVEVSSESYETIFSQRIIRDLQKELVV-G- 193 (458) T ss_pred hhhhhccchhhhhhHHHHHHHHHHHHHHHhhccchhhhhhhhhhhhhhcccCccccceehhhHhHHHHHHHHhhhhH-H- Confidence 111111111110000 011111111 00 1112233332 233333222222 1 Q ss_pred cccCcceeeeCCceEEeeeccccccccccCCCCcccC-----CcccceeEEEEeecccceeecchhhHHHHhhhHHHHHH Q lcl|Aclame:pro 52 ALISNDAIFMEGRSFTVMKGDTTELKDYKRNATNEFD-----HPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYV 126 (319) Q Consensus 52 ~~~n~~~~~~~g~tVkIp~i~~~g~~DY~r~~~~~~~-----~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~ 126 (319) .+++ .+-.+++..++|.....+-..+...++...+ ....++..++++-.|.-.+. .--+.--......+... 
T Consensus 194 ~~~~--~~~~~~~~~~~~~~~~~~~a~~v~e~~~~~~~~~~~~~~~~~~~i~~~~~k~~~~v-~is~ell~ds~~~~~~~ 270 (458) T protein:vir:10 194 ALFE--ELPMSSKILTMLVEPDAGKATWVAASTYGTDTTTGEEVKGALKEIHFSTYKLAAKS-FITDETEEDAIFSLLPL 270 (458) T ss_pred hhcc--eeecCCcceEEEEecCCcceeecccccccccccccccccccceeeEeeeeeEEeee-hhhHHHHhcchHHHHHH Confidence 1122 2334566777777655443333222222221 12234555556555544432 11111111112345667 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCc----------c---ccccCC----HhHHHHHHHHHHHHHHhccCCCCcEEEEC Q lcl|Aclame:pro 127 VARQGAEVVAPYLDNLRFATLARNKAK----------H---LTVGTG----SDAQYDAVLDVSVELDEIKAPENRVLFVS 189 (319) Q Consensus 127 ~~~~~~~~vapeiD~~~~s~la~~a~~----------~---~~~~~T----~~n~~~~i~~a~~~Lde~~VP~~R~l~Vs 189 (319) +.+....+++-.+|..++.- .+.+. . .....+ ..-.|+.|.++...|...... +-.++++ T Consensus 271 i~~~l~~~i~~~~d~~~l~G--~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~-~~~~v~~ 347 (458) T protein:vir:10 271 LRKRLIEAHAVSIEEAFMTG--DGSGKPKGLLTLASEDSAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLK-LSKLVLI 347 (458) T ss_pred HHHHHHHHHHHHHHHHhhcC--CCCCccceeeecccccccceeecccccccccccHHHHHHHHHhhhhhhcC-CCEEEEc Confidence 77888888888888876531 01100 0 000111 112378888888888876544 4457899 Q ss_pred hHHHHHHhhhhhhhhcc--c-ccccceeeeeeeeecCeEEEEe---cccccccceEEEE-cCCceeeeeeeeeeeeecCC Q lcl|Aclame:pro 190 PTFYKGIKKFVIALPQG--D-TRQQVLGKGVQGELDGFVIVKV---PTKLLQGLQAIAV-VGEVLASPIQADLAKTNSNI 262 (319) Q Consensus 190 P~~~~~L~~~~~f~~~~--~-~~~~~~~~g~Vg~idG~~I~~v---ps~~~~~~n~i~~-~~~A~~~~~k~~~~~~~~~~ 262 (319) |..+..|.+-.+-.... . ........|..++|.|.+|+.+ |... ....++++ ...+...... 
..+.+.+++ T Consensus 348 ~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~-~~~~~~~~~f~~~~~~~~~-~~~~v~~d~ 425 (458) T protein:vir:10 348 VSMDAYYDLLEDEEWQDVAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAKA-NSAEFAVIVYKDNFVMPRQ-RAVTVERER 425 (458) T ss_pred HHHHHHHHhhcccCCceeeccccccccccCcCceecceeeEEcccccccc-CCcceEEEEecccEEEEEe-eceEEEeec Confidence 99998886533221110 0 1123455666778999999864 2211 11223333 2233333333 223332211 Q ss_pred CC-CccceeeeeeeeeEEEeccccceEEEEccccccC Q lcl|Aclame:pro 263 PG-MFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVAT 298 (319) Q Consensus 263 ~~-~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~ 298 (319) -. ..--.++...=.|.-|++|.+ + +. ++..|+ T Consensus 426 ~~~~~~~~~~~~~r~~~~v~~~~a--~-v~-~~~aa~ 458 (458) T protein:vir:10 426 QAGKQRDAYYVTQRVNLQRYFANG--V-VS-GTYAAS 458 (458) T ss_pred ccCCCceEEEEEEEecceEecccc--e-EE-EeeccC Confidence 11 111123333334667778865 3 22 222222 No 168 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=94.03 E-value=0.0056 Score=32.92 Aligned_cols=281 Identities=9% Similarity=-0.012 Sum_probs=116.7 Q ss_pred CC------------cc---cccccceee----e--------hhhhhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcc Q lcl|Aclame:pro 1 MN------------KT---IKNATGMLK----L--------NLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPAL 53 (319) Q Consensus 1 ~~------------~~---~~~~~~~~~----~--------~~~~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~ 53 (319) .. +. +-.+.|-+. + .+++...--+.....-.++.+...+-+.....+... 
.+ T Consensus 20 ~~~~~~~~kg~~~~~~~~a~a~~~g~~~~a~~~a~~~~~~~~~~~a~~~~~~~Gg~lvP~~~~~~ii~~l~~~s~l~-~l 98 (366) T protein:vir:57 20 IKEELQQYKGAGMTRMVMSIAAGKGNLADAAKFAATELGDTGLSMAISTAAGSGGALIPQNMQNEVIELLRDRTVVR-IL 98 (366) T ss_pred cccccccccchhHHHHHHHHHhcccchhHHHHHHHHhhcchhhhhhccccccCCccccchhHHHHHHHHHhhhcchh-hh Confidence 00 00 000001000 0 000000000111111234444433333333222221 11 Q ss_pred cCcceeeeCCceEEeeeccccccccccCCC-CcccCCcccceeEEEEeecccceeecchhhH--HHHhhhHHHHHHHHHH Q lcl|Aclame:pro 54 ISNDAIFMEGRSFTVMKGDTTELKDYKRNA-TNEFDHPKIEETTYFLDQEKYWGRFVDALDR--KDTEGNIDINYVVARQ 130 (319) Q Consensus 54 ~n~~~~~~~g~tVkIp~i~~~g~~DY~r~~-~~~~~~~t~t~~tltidqdr~~~F~VD~~D~--~et~~~~~~~~~~~~~ 130 (319) .-+. .......++||+.....-..+...+ .....++ +...+++.-.|.-. .+.--+. +++. .++.+.+.+. T Consensus 99 g~~~-v~~~~g~~~~p~~t~~~~a~wv~E~~~~~~s~~--~f~~i~~~~~k~~~-~~~iS~ell~ds~--~~~~~~i~~~ 172 (366) T protein:vir:57 99 GARS-IPLPNGNLSMPRLSGGATAGYVGEGKDVVATGA--TFDDVKLSAKTMIA-LVPVSNQLIGRAG--FNVEQLLLGD 172 (366) T ss_pred ceee-eecCCCceEEEEEeCCcceeeeccCcccccccc--ceeEEEEeeEEEEE-eehhhHHHHhhhh--HHHHHHHHHH Confidence 1121 2233446999988654433333222 2222333 34444444444333 2222221 1232 3456777888 Q ss_pred HHHHHHHHHHHHHHHHHH-hccCc---------ccc-----ccCCHhHHHHHHHHHHHHHHhccCC-CCcEEEEChHHHH Q lcl|Aclame:pro 131 GAEVVAPYLDNLRFATLA-RNKAK---------HLT-----VGTGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYK 194 (319) Q Consensus 131 ~~~~vapeiD~~~~s~la-~~a~~---------~~~-----~~~T~~n~~~~i~~a~~~Lde~~VP-~~R~l~VsP~~~~ 194 (319) ....++..+|...+.--. +.... ... .+.+...+...+..+.......+.. .+-.++++|..+. 
T Consensus 173 l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~ 252 (366) T protein:vir:57 173 ILSAIATREDKAFLRDDGTGDTPKGMKAVATAANRLVAWTGTAINLTTIDEYLDSLILKHMDSNSNMIRCGWGLSNRTYM 252 (366) T ss_pred HHHHHHHHHHHHhhccCCCCccccceeeccccccceeeccccccchhhHHHHHHHHHHhhhccccccccCEEEecHHHHH Confidence 888898888886552100 00000 000 0111222222222222222222222 3445789999998 Q ss_pred HHhhhhhhhhcccccccceeeeeeeeecCeEEEEec---cccc---ccceEEEEcCCceeeeeeeeee--eeecC----- Q lcl|Aclame:pro 195 GIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVP---TKLL---QGLQAIAVVGEVLASPIQADLA--KTNSN----- 261 (319) Q Consensus 195 ~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vp---s~~~---~~~n~i~~~~~A~~~~~k~~~~--~~~~~----- 261 (319) .|.+-.+ ..|........-|.|.|.+|+.++ .+.. ....+++|..+-..... ...+ ++.+. T Consensus 253 ~L~~lkd-----~~G~~l~~~~~~g~l~G~Pvv~s~~ip~~~~~~~~~~~i~~gdfs~~~i~~-~~~i~i~~~~ea~~~~ 326 (366) T protein:vir:57 253 TLFGLRD-----GNGNKVYPEMSQGILKGYPIQRTSAIPANLGDDGNESEIYFCDFNDVVIGE-DGMMKVDFSTEATYKD 326 (366) T ss_pred HHHhhhc-----cCCceeccCCCCCeecceeeEEccccccccccCCCccEEEEEecceEEEEE-ecceEEEEeecccccc Confidence 8875332 112222112223579999998643 2211 12335555554433221 1122 22211 Q ss_pred CC-------CCccceeeeeeeeeEEEeccccceEEEEccc Q lcl|Aclame:pro 262 IP-------GMFGTLAEQLLYTGAFVPEHLQKYIFTIGGT 294 (319) Q Consensus 262 ~~-------~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~ 294 (319) .. 
..+.-+++-..++|..|.+|+...+.....= T Consensus 327 ~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 327 ADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred ccccchhhhhcCceeEEeeeeeCcEeeccccEEEEecccC Confidence 00 0123478899999999999998665544433 No 169 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=92.65 E-value=0.011 Score=31.43 Aligned_cols=269 Identities=13% Similarity=0.050 Sum_probs=125.4 Q ss_pred eeehhhhhhhhhcchhhhhhhHhhHHHHHHHHH--hhhhhhhcccCcce-eeeCCceEEeeeccccccc-cccC-CCCcc Q lcl|Aclame:pro 12 LKLNLQHFANKSVEPGQTLLKNKHVGILERVTA--VNAYSTPALISNDA-IFMEGRSFTVMKGDTTELK-DYKR-NATNE 86 (319) Q Consensus 12 ~~~~~~~~~~~~~~~n~~~l~~ky~~lld~~~~--~~sl~~~~~~n~~~-~~~~g~tVkIp~i~~~g~~-DY~r-~~~~~ 86 (319) ++|+. . +.--.-+++..+.+..+++. ...+++..+..=.- ...+..+|..+.....|.. -|.- +.... T Consensus 1 ~~~~~------a-~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip 73 (296) T protein:vir:10 1 MGVDK------A-DAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQIVADYTDDLP 73 (296) T ss_pred Ccccc------h-hhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCceeEeCCCccccc Confidence 23322 1 22223355555555555553 22333333222000 1112447777777665533 2321 22333 Q ss_pred cCCcccceeEEEEee-cccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc--------Cccccc Q lcl|Aclame:pro 87 FDHPKIEETTYFLDQ-EKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNK--------AKHLTV 157 (319) Q Consensus 87 ~~~~t~t~~tltidq-dr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a--------~~~~~~ 157 (319) .-+++.++....+-. ..+|.+.+.++...+..+ .++...-...++.++.-..|+..|-=-.... +..... 
T Consensus 74 ~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g-~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g~~GLlN~p~v~~~~ 152 (296) T protein:vir:10 74 LVDALATERQGKVFRFGNAFLISIDEIKVGQATG-QSLSTRKQSLAFEAHDKLLDKLVWSGSTAHGIPSVFDYPNINNVV 152 (296) T ss_pred eeeccceeEEEEEEEEEeeeeecHHHHHHHHHhC-CChHHHHHHHHHHHHHHhhceEEEeecccccceeEeecCCCcccc Confidence 446677777777765 577888877777666543 3444444455666666666665441111110 000111 Q ss_pred -c---CCHhHHHHHHHHHHHHHHhc--cCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEEEEecc Q lcl|Aclame:pro 158 -G---TGSDAQYDAVLDVSVELDEI--KAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPT 231 (319) Q Consensus 158 -~---~T~~n~~~~i~~a~~~Lde~--~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps 231 (319) . .+++++++-|..+...|... ++-..-.|+++|+.|..|..-. .+ .+-.. .+-.-....+.+|..+|- T Consensus 153 ~~~~W~~~t~i~~Di~~~~~~l~~~s~g~~~p~~l~L~p~~~~~L~~~~---~~--~~~t~-l~~ik~~~~~l~i~~~~~ 226 (296) T protein:vir:10 153 SGGSWSQPTTAVSDITSLLDIIETSTNGQHRATHLLLPTTARRIMQNLV---PG--TSVSY-GEFFRQNNSGVTVEFVQY 226 (296) T ss_pred ccCCccCHHHHHHHHHHHHHHHHHhhCceecceeEEeCHHHHHHHhhcc---CC--CCccH-HHHHHHhcCCceEEEeee Confidence 1 24678999999999877654 4444457888999999885321 11 11110 000000123444544432 Q ss_pred ---cccccce-EEEEc--CCceeeeeeeeeeeeecCCCCCccceeeeee-eeeEEEeccccceEEEEcccccc Q lcl|Aclame:pro 232 ---KLLQGLQ-AIAVV--GEVLASPIQADLAKTNSNIPGMFGTLAEQLL-YTGAFVPEHLQKYIFTIGGTEVA 297 (319) Q Consensus 232 ---~~~~~~n-~i~~~--~~A~~~~~k~~~~~~~~~~~~~~~~~v~gr~-~yg~~V~~~k~~~Iy~~~~~~~a 297 (319) ....+.. +++.. +.-+.++.- ..++.....+....+.+++.. +.|+.|.+|......--. 
+=| T Consensus 227 l~~a~~~g~~~~v~~~~~~~~~~~~v~-~~~~~~~~e~~~l~~~~~~~~~~~Gv~i~~P~ai~~~dGI--~~~ 296 (296) T protein:vir:10 227 LNDYNGTGTSAAIAYEKDPNNMAIEIP-EATNALPAQPKDLHFKIPVTSKATGLIVYRPLTMAVMKGI--TFA 296 (296) T ss_pred eccCCCCcceEEEEEEcCCceEEEEcC-cceeeecccccCceEEEeeEeeEEEEEEECCceeEEEeee--ecC Confidence 1111122 22222 112221111 122222212233557777666 678999999873322111 111 No 170 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=91.71 E-value=0.015 Score=30.65 Aligned_cols=282 Identities=10% Similarity=-0.007 Sum_probs=125.5 Q ss_pred CCcccccccceee----ehhhhhhhhhcc-hhhhhhhHhhHHHHHHHHHh--hhhhhhcccCcceeee-CCceEEeeecc Q lcl|Aclame:pro 1 MNKTIKNATGMLK----LNLQHFANKSVE-PGQTLLKNKHVGILERVTAV--NAYSTPALISNDAIFM-EGRSFTVMKGD 72 (319) Q Consensus 1 ~~~~~~~~~~~~~----~~~~~~~~~~~~-~n~~~l~~ky~~lld~~~~~--~sl~~~~~~n~~~~~~-~g~tVkIp~i~ 72 (319) |. +.+-..-++. --+||-..+-+. .--+-+++.++.+..+++.. ..+++..+..=.-... +..++...... T Consensus 1 ~~-~~~~~~~~~~~~~~~~~~~~~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~ 79 (319) T protein:vir:10 1 MT-TKKFDEADKSNVEMYLIQAGVKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTDKTFEYMTFD 79 (319) T ss_pred CC-CcchhHHhhHHHHHHHhhccchhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCCCceEEEEeeeec Confidence 21 1110000110 001122222211 11123344444444444432 1222222221000111 23367777776 Q ss_pred ccccc-cccC-CCCcccCCcccceeEEEEee-cccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 73 TTELK-DYKR-NATNEFDHPKIEETTYFLDQ-EKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLAR 149 (319) Q Consensus 73 ~~g~~-DY~r-~~~~~~~~~t~t~~tltidq-dr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~ 149 (319) ..|.. -|.- +.....-+++.++....+-. ..+|.+.+.++...+..+ +++...-...++.++.-..|...|---.. 
T Consensus 80 ~~G~a~~~~d~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g-~~l~~~k~~aA~~~~~~~~n~i~f~G~~~ 158 (319) T protein:vir:10 80 KVGTAQIIADYTDDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATG-RPLSTRKASACQLAHDQLVNRLVFKGSAP 158 (319) T ss_pred cccceeeecCccccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhC-CChHHHHHHHHHHHHHHhhceEEEeeccc Confidence 65533 2321 22334456677777777765 578888888887777543 44444445566666766666655411111 Q ss_pred cc--------Cc---c-----ccccCCHhHHHHHHHHHHHHHHhc--cCCCCcEEEEChHHHHHHhh-hhhhhhcccccc Q lcl|Aclame:pro 150 NK--------AK---H-----LTVGTGSDAQYDAVLDVSVELDEI--KAPENRVLFVSPTFYKGIKK-FVIALPQGDTRQ 210 (319) Q Consensus 150 ~a--------~~---~-----~~~~~T~~n~~~~i~~a~~~Lde~--~VP~~R~l~VsP~~~~~L~~-~~~f~~~~~~~~ 210 (319) .. +. . ...+.|.+++++-|..+..+|... ++-..-.|+++|+.|..|.. -+.. .....+ T Consensus 159 ~g~~GLlN~p~~~~~~~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~--~~t~l~ 236 (319) T protein:vir:10 159 HKIVSVFNHPNITKITSGKWIDVSTMKPETAEAELTQAIETIETITRGQHRATNILIPPSMRKVLAIRMPET--TMSYLD 236 (319) T ss_pred ccceeEEeCCCceeeecCCCCCccccCHHHHHHHHHHHHHHHHHhcCceeeceEEEecHHHHHhhhcccCCC--CeeHHH Confidence 00 00 0 001135678999999999988754 55455679999999998842 1110 000111 Q ss_pred cceeeeeeeeecCeEEEEecccc---cccceE-EEEc--CCceeeeeeeeeeeeecCCCCCccceeee-eeeeeEEEecc Q lcl|Aclame:pro 211 QVLGKGVQGELDGFVIVKVPTKL---LQGLQA-IAVV--GEVLASPIQADLAKTNSNIPGMFGTLAEQ-LLYTGAFVPEH 283 (319) Q Consensus 211 ~~~~~g~Vg~idG~~I~~vps~~---~~~~n~-i~~~--~~A~~~~~k~~~~~~~~~~~~~~~~~v~g-r~~yg~~V~~~ 283 (319) -...+ ..+.+|..+|--. ..+.+. ++.. 
+.-+.++.- ..++.....+....+.+.+ -++.|+.|.+| T Consensus 237 ~lk~~-----~~~l~I~~~pel~~ag~~g~~~~v~y~~~~~~~~~~v~-~~~~~~~~e~~~l~~~~~~~~r~~Gv~i~~P 310 (319) T protein:vir:10 237 YFKSQ-----NSGIEIDSIAELEDIDGAGTKGVLVYEKNPMNMSIEIP-EAFNMLPAQPKDLHFKVPCTSKCTGLTIYRP 310 (319) T ss_pred HHHHh-----cCCceEEEeeeecccCCCcceEEEEEecCCceEEEecC-cceeeeeeeecCceEEEeeeeeeEEEEEEcc Confidence 11111 1234454444211 111222 2222 122221110 1222221112335567655 44788999999 Q ss_pred ccceEEEEccc Q lcl|Aclame:pro 284 LQKYIFTIGGT 294 (319) Q Consensus 284 k~~~Iy~~~~~ 294 (319) ...+. -.+- T Consensus 311 ~ai~~--~dGI 319 (319) T protein:vir:10 311 MTIVL--ITGV 319 (319) T ss_pred ceeEe--eecC Confidence 87322 2221 No 171 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=91.58 E-value=0.015 Score=30.55 Aligned_cols=289 Identities=12% Similarity=0.032 Sum_probs=121.2 Q ss_pred CCcccccccc--eeeehhhhhhhhhcchhhh-----hhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCceEEeeecc Q lcl|Aclame:pro 1 MNKTIKNATG--MLKLNLQHFANKSVEPGQT-----LLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGD 72 (319) Q Consensus 1 ~~~~~~~~~~--~~~~~~~~~~~~~~~~n~~-----~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~ 72 (319) ....+....| .|+....-|-+ ++...+- -.++.++ .+++.+...+.+.. 
+++ .+-.+| .++||..+ T Consensus 62 ~~~~~~~~r~~~~l~~ee~~~~~-~~~~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~--~~~--v~~~~~-~~~i~~~~ 135 (395) T protein:vir:95 62 VDNGILAKRSQDPLTSEERKFFN-DINYDVGYTDEKILPETVVERVFDDLQKDHPLLS--KIN--FQNAGI-KTRVIKAD 135 (395) T ss_pred HHHHHHhhcCccccchHHHHHHH-HHhhccCCCCceeccHHHHHHHHHHHHhhhhhhh--hce--eEecCC-ceEEEEec Confidence 1111111111 12222222221 2111111 1344443 33333333222221 222 222344 57898877 Q ss_pred ccccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHh-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 73 TTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTE-GNIDINYVVARQGAEVVAPYLDNLRFATLARNK 151 (319) Q Consensus 73 ~~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~-~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a 151 (319) ..+..-+....+-..+..+.+...+++...+.-.+.. +..+-.. ...++...+.+..+.+++-.+|+..+.- .+. T Consensus 136 ~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~--iS~ell~ds~~~ie~~i~~~la~~ia~~~~~a~i~G--~G~ 211 (395) T protein:vir:95 136 PAGQAVWGKVFGEIKGQLDAAFREENFTQYKLTCFVV--LPDDLSTFGPAWIERFVRTQIQEAISVALESAIING--GGA 211 (395) T ss_pred CCcceEEeecccccCccccccceeeeeceeeEEEeec--ccHHHHhcchhHHHHHHHHHHHHHHHHHHhhheeec--cCC Confidence 6655444222222223334445555555555443321 1111111 1123455667777778888888754420 010 Q ss_pred Cc-------------c---c----cccCCHh---HHHHHHHHHHHHHH------hccCCCCcEEEEChHHHHHHhhhhhh Q lcl|Aclame:pro 152 AK-------------H---L----TVGTGSD---AQYDAVLDVSVELD------EIKAPENRVLFVSPTFYKGIKKFVIA 202 (319) Q Consensus 152 ~~-------------~---~----~~~~T~~---n~~~~i~~a~~~Ld------e~~VP~~R~l~VsP~~~~~L~~~~~f 202 (319) +. . . +..++.+ ..++.+.++...+. ......+..++++|..+.-+. 
.++ T Consensus 212 ~~~qP~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~~~~--g~~ 289 (395) T protein:vir:95 212 AKTQPVGLMKDVNTNSGAVTDKASSGTLTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSWDVQ--ARY 289 (395) T ss_pred CCcCceeeeecccccccccccccccchhhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhhhcC--Ccc Confidence 00 0 0 0001112 22333444333331 112234456788887765332 122 Q ss_pred hhcccccccceeeeeeeeec--CeEEEEecccccccceEEEEcCCceeeeeeeeeeeeecCCCCC---ccceeeeeeeee Q lcl|Aclame:pro 203 LPQGDTRQQVLGKGVQGELD--GFVIVKVPTKLLQGLQAIAVVGEVLASPIQADLAKTNSNIPGM---FGTLAEQLLYTG 277 (319) Q Consensus 203 ~~~~~~~~~~~~~g~Vg~id--G~~I~~vps~~~~~~n~i~~~~~A~~~~~k~~~~~~~~~~~~~---~~~~v~gr~~yg 277 (319) .... .+|...++. |.+|+. ++.++...+++|.-+-..... ...+++-...+.. +...++.+.++| T Consensus 290 ~~~~-------~~G~~~~~lg~g~~v~~--~~~~p~~~i~fgdfs~y~i~~-r~~~~i~~~~~~~~~~d~~~f~~~~r~d 359 (395) T protein:vir:95 290 TYLT-------ANGGFVTVLPYNVTIIT--SEFVPEGKLVAFVTDRYNAVR-GGGLTVKKFDQTLALEDAVLFTAKTFAY 359 (395) T ss_pred eecc-------CCCcceeccCCcceEEE--cCCCCCCcEEEEecccEEEEE-ecceEEEeccchhhhCCcEEEEEEEEEC Confidence 2211 134444554 445654 556666666766665432222 2233333222322 346799999999 Q ss_pred EEEeccccceEEEEc-cccccCCC-CCccccccccccccccc Q lcl|Aclame:pro 278 AFVPEHLQKYIFTIG-GTEVATKR-DGVDAHADNVAKPSGSL 317 (319) Q Consensus 278 ~~V~~~k~~~Iy~~~-~~~~a~~~-~~~~~~~~~~~~~~~~~ 317 (319) .+++++++.-++... .+++.+.. .|++-. 
|..+- T Consensus 360 g~~~~~~A~~~l~i~~~~~~~~~~~~~~~~~------~~~~~ 395 (395) T protein:vir:95 360 GQPDDNKASAVYDLKVASAPRRQTSAGGTTD------GIAEA 395 (395) T ss_pred CEEeccccEEEEEeeccCCCCCCCCCCCCCC------ccccC Confidence 999999987554432 22222222 121111 22222 No 172 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=90.74 E-value=0.019 Score=29.99 Aligned_cols=276 Identities=11% Similarity=0.039 Sum_probs=114.1 Q ss_pred CCcccccccceeeehhhhhhhh-------------hcchhhhhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCceE Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANK-------------SVEPGQTLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSF 66 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~-------------~~~~n~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~tV 66 (319) +.+.+..+.|.++ +...|+.. .+.....-.++.+. .+++.+ ...+..... ..+ ..-.....+ T Consensus 96 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~gg~liP~~~~~~ii~~l-~~~~~l~~~-~~~-~~~~~~g~~ 171 (428) T protein:vir:10 96 MVMSIAAAQGNLQ-DAAKFASDELNDQSVSMAISTAAGSGGVLIPQNIHSEVIELL-RDRTIVRKL-GAR-SIPLPNGNM 171 (428) T ss_pred HHHHHHHhhhhHH-HHHHHhhhhhhhhhHhhhhcccccCCccccchhHHHHHHHHH-hhhchhhhh-cce-eeecCCcce Confidence 0000000111110 00000000 00011111233333 233322 222221111 111 122233458 Q ss_pred EeeeccccccccccCC-CCcccCCcccceeEEEEeecccceeecchhhHHHHhh-hHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 67 TVMKGDTTELKDYKRN-ATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDINYVVARQGAEVVAPYLDNLRF 144 (319) Q Consensus 67 kIp~i~~~g~~DY~r~-~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~~~~~~~~~~~vapeiD~~~~ 144 (319) +||++....-..+... +.....+++.+ ..++...+...+ +. +..+-.+. 
...+.+.+.+....+++-.+|...+ T Consensus 172 ~~p~~~~~~~a~~v~Eg~~~~~~~~~f~--~i~~~~~k~~~~-v~-is~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l 247 (428) T protein:vir:10 172 SLPRLAGGATASYTGENQDAKVSEARFD--DVKLTAKTMIAM-VP-ISNALIGRAGFNVEQLVLQDILTAISVREDKAFM 247 (428) T ss_pred EEEEEeCCcceeeeccCcccccccccee--eEEeeeEEEEEe-eh-hhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHh Confidence 9998865433333322 22333334444 444444443332 22 22221111 2455677788888888888888655 Q ss_pred HHHHhccCcc-------------cc---ccCCHh---HHHHHHHHHHHHHHhc-cC-CCCcEEEEChHHHHHHhhhhhhh Q lcl|Aclame:pro 145 ATLARNKAKH-------------LT---VGTGSD---AQYDAVLDVSVELDEI-KA-PENRVLFVSPTFYKGIKKFVIAL 203 (319) Q Consensus 145 s~la~~a~~~-------------~~---~~~T~~---n~~~~i~~a~~~Lde~-~V-P~~R~l~VsP~~~~~L~~~~~f~ 203 (319) . +.++. .. .+.... ...+.+.++...+... .. ..+-.++++|..+..|.+-.+ T Consensus 248 ~----G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd-- 321 (428) T protein:vir:10 248 R----DDGTGDTPIGMKARATQWNRLLPWAADAAVNLDTIDTYLDSIILMSMDGNSNMISSGWGMSNRTYMKLFGLRD-- 321 (428) T ss_pred c----cCCCCccccccccccccccccccccccccccHHHHHHHHHHHHHhhhccccccccCEEEEcHHHHHHHHHhhc-- Confidence 2 11110 00 000111 1222222322222221 11 234567889999988865432 Q ss_pred hcccccccceeeeeeeeecCeEEEEec---cccc---ccceEEEEcCCceeeeeeeeeeeeecCCCC-----------C- Q lcl|Aclame:pro 204 PQGDTRQQVLGKGVQGELDGFVIVKVP---TKLL---QGLQAIAVVGEVLASPIQADLAKTNSNIPG-----------M- 265 (319) Q Consensus 204 ~~~~~~~~~~~~g~Vg~idG~~I~~vp---s~~~---~~~n~i~~~~~A~~~~~k~~~~~~~~~~~~-----------~- 265 (319) ..|.-......-|+|.|.+|+.++ .+.. +...+++|..+-..... ...+++...++. . 
T Consensus 322 ---~~G~~i~~~~~~g~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~-~~~i~i~~~~~~~~~~~~~~~~~~f 397 (428) T protein:vir:10 322 ---GNGNKVYPEMAQGMLKGYPIQRTSAIPANLGEGGKESEIYFADFNDVVIGE-DGNMKVDFSKEASYIDTDGKLVSAF 397 (428) T ss_pred ---cCCceeccCCCCCeeeceeeEEeccccccccCCCccceEEEEecceEEEEE-ecceEEEeecccccccccccccchh Confidence 112211111122469999998643 2222 22346667665433322 223333221121 1 Q ss_pred --ccceeeeeeeeeEEEeccccceEEEEccc Q lcl|Aclame:pro 266 --FGTLAEQLLYTGAFVPEHLQKYIFTIGGT 294 (319) Q Consensus 266 --~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~ 294 (319) +.-.++....+|..|.+|++..+.....= T Consensus 398 ~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 398 SRNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred hcchhheeeeeeeCceeeccceEEEEeccCC Confidence 22477899999999999998655333322 No 173 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=90.12 E-value=0.023 Score=29.61 Aligned_cols=287 Identities=11% Similarity=-0.009 Sum_probs=115.7 Q ss_pred CCcccccccceeeehh--h-----hhhhhhc--------chhhhhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNL--Q-----HFANKSV--------EPGQTLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGR 64 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~--~-----~~~~~~~--------~~n~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~ 64 (319) ++.....+..-..... . +.+.... .-....+.+.|. .+++.... .+... .+++. ...++. 
T Consensus 118 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~~~-~~~i~-~l~~~--~~~~~~ 193 (497) T protein:vir:10 118 FNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFY-ELSLA-DLISS--RPVTSP 193 (497) T ss_pred hhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccccccchhhhHHHHHHHHh-hhhHH-hhccc--cccCCC Confidence 1100000000000000 0 0000000 001112344443 33333322 22222 22222 334566 Q ss_pred eEEeeecccc-ccccccCCC-CcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 65 SFTVMKGDTT-ELKDYKRNA-TNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNL 142 (319) Q Consensus 65 tVkIp~i~~~-g~~DY~r~~-~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~ 142 (319) +++||..... +-..+...+ .....++ ++..+++.-.|.-.+. .+..+-.+....+...+.+..++.++-.+|.. T Consensus 194 ~~~~~~~~~~~~~a~wv~E~~~~~~s~~--~f~~i~~~~~k~a~~~--~iS~ell~d~~~l~~~i~~~l~~~i~~~~d~~ 269 (497) T protein:vir:10 194 NLSYLTESAAHNNAAAVAEAGTYPFSSE--EFARVYEQVGKVANAL--TITDEGLRDAPELFNFVQGRLLEGIQRKEEVQ 269 (497) T ss_pred ceEEEEEcCCCCcceeeccCcccccccc--cceeeEeeeeeeEeec--HhHHHHHHhHHHHHHHHHHHHHHHHHHHHHHH Confidence 7999987542 233333222 2222333 3444555545544432 22222211112234556677777888888876 Q ss_pred HHHH---------HHhccCcccccc------------------------------------------------------C Q lcl|Aclame:pro 143 RFAT---------LARNKAKHLTVG------------------------------------------------------T 159 (319) Q Consensus 143 ~~s~---------la~~a~~~~~~~------------------------------------------------------~ 159 (319) ++.- +....+...+.+ . 
T Consensus 270 ~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 349 (497) T protein:vir:10 270 LLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYP 349 (497) T ss_pred hhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhcccc Confidence 5420 000000000000 0 Q ss_pred CHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhh----hhccccc-ccceeeeeeeeecCeEEEEeccccc Q lcl|Aclame:pro 160 GSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIA----LPQGDTR-QQVLGKGVQGELDGFVIVKVPTKLL 234 (319) Q Consensus 160 T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f----~~~~~~~-~~~~~~g~Vg~idG~~I~~vps~~~ 234 (319) +.......+..+...+...+.=..-.++++|..+..|.+-.+- ......+ ..+...+....|.|.+|+.+++ + T Consensus 350 ~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~--~ 427 (497) T protein:vir:10 350 TAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPL--I 427 (497) T ss_pred chhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCC--C Confidence 0111122223333333322221122578999988888654432 1111111 1111223344788999987543 4 Q ss_pred ccceEEEEcCC--ceeeeeeeeeeeeecCCCCC-----ccceeeeeeeeeEEEeccccceEEEEccccccCC Q lcl|Aclame:pro 235 QGLQAIAVVGE--VLASPIQADLAKTNSNIPGM-----FGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATK 299 (319) Q Consensus 235 ~~~n~i~~~~~--A~~~~~k~~~~~~~~~~~~~-----~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~ 299 (319) +.-.+++|.-+ ++..... ..+.+-..++.. +-..++....+|..|.+|++. +++...++..+. 
T Consensus 428 ~~~~~~~Gd~~~~~~~i~~r-~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~-~~l~~~~~~~~~ 497 (497) T protein:vir:10 428 PLGTILVGHFAPSVIQTARR-EGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAF-QLIQLKKGATGS 497 (497) T ss_pred CCCceEEeecccceEEEEEe-cccEEEeecccchhhhcCcEEEEEEEeecceeeccccE-EEEEecCCccCC Confidence 44455555432 3332222 222222222322 335788888999999999874 334443222222 No 174 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=90.12 E-value=0.023 Score=29.61 Aligned_cols=287 Identities=11% Similarity=-0.009 Sum_probs=115.7 Q ss_pred CCcccccccceeeehh--h-----hhhhhhc--------chhhhhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNL--Q-----HFANKSV--------EPGQTLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGR 64 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~--~-----~~~~~~~--------~~n~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~ 64 (319) ++.....+..-..... . +.+.... .-....+.+.|. .+++.... .+... .+++. ...++. T Consensus 118 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~~~-~~~i~-~l~~~--~~~~~~ 193 (497) T protein:vir:78 118 FNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFY-ELSLA-DLISS--RPVTSP 193 (497) T ss_pred hhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccccccchhhhHHHHHHHHh-hhhHH-hhccc--cccCCC Confidence 1100000000000000 0 0000000 001112344443 33333322 22222 22222 334566 Q ss_pred eEEeeecccc-ccccccCCC-CcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 65 SFTVMKGDTT-ELKDYKRNA-TNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNL 142 (319) Q Consensus 65 tVkIp~i~~~-g~~DY~r~~-~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~ 142 (319) +++||..... +-..+...+ .....++ ++..+++.-.|.-.+. .+..+-.+....+...+.+..++.++-.+|.. 
T Consensus 194 ~~~~~~~~~~~~~a~wv~E~~~~~~s~~--~f~~i~~~~~k~a~~~--~iS~ell~d~~~l~~~i~~~l~~~i~~~~d~~ 269 (497) T protein:vir:78 194 NLSYLTESAAHNNAAAVAEAGTYPFSSE--EFARVYEQVGKVANAL--TITDEGLRDAPELFNFVQGRLLEGIQRKEEVQ 269 (497) T ss_pred ceEEEEEcCCCCcceeeccCcccccccc--cceeeEeeeeeeEeec--HhHHHHHHhHHHHHHHHHHHHHHHHHHHHHHH Confidence 7999987542 233333222 2222333 3444555545544432 22222211112234556677777888888876 Q ss_pred HHHH---------HHhccCcccccc------------------------------------------------------C Q lcl|Aclame:pro 143 RFAT---------LARNKAKHLTVG------------------------------------------------------T 159 (319) Q Consensus 143 ~~s~---------la~~a~~~~~~~------------------------------------------------------~ 159 (319) ++.- +....+...+.+ . T Consensus 270 ~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 349 (497) T protein:vir:78 270 LLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYP 349 (497) T ss_pred hhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhcccc Confidence 5420 000000000000 0 Q ss_pred CHhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhh----hhccccc-ccceeeeeeeeecCeEEEEeccccc Q lcl|Aclame:pro 160 GSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIA----LPQGDTR-QQVLGKGVQGELDGFVIVKVPTKLL 234 (319) Q Consensus 160 T~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f----~~~~~~~-~~~~~~g~Vg~idG~~I~~vps~~~ 234 (319) +.......+..+...+...+.=..-.++++|..+..|.+-.+- ......+ ..+...+....|.|.+|+.+++ + T Consensus 350 ~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~--~ 427 (497) T protein:vir:78 350 TAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPL--I 427 (497) T ss_pred chhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCC--C Confidence 0111122223333333322221122578999988888654432 1111111 1111223344788999987543 4 Q ss_pred ccceEEEEcCC--ceeeeeeeeeeeeecCCCCC-----ccceeeeeeeeeEEEeccccceEEEEccccccCC Q lcl|Aclame:pro 235 
QGLQAIAVVGE--VLASPIQADLAKTNSNIPGM-----FGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATK 299 (319) Q Consensus 235 ~~~n~i~~~~~--A~~~~~k~~~~~~~~~~~~~-----~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~ 299 (319) +.-.+++|.-+ ++..... ..+.+-..++.. +-..++....+|..|.+|++. +++...++..+. T Consensus 428 ~~~~~~~Gd~~~~~~~i~~r-~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~-~~l~~~~~~~~~ 497 (497) T protein:vir:78 428 PLGTILVGHFAPSVIQTARR-EGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAF-QLIQLKKGATGS 497 (497) T ss_pred CCCceEEeecccceEEEEEe-cccEEEeecccchhhhcCcEEEEEEEeecceeeccccE-EEEEecCCccCC Confidence 44455555432 3332222 222222222322 335788888999999999874 334443222222 No 175 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=89.10 E-value=0.028 Score=29.07 Aligned_cols=276 Identities=11% Similarity=0.024 Sum_probs=114.7 Q ss_pred CCcccc------------------------cccceeeehhhhhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCc Q lcl|Aclame:pro 1 MNKTIK------------------------NATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISN 56 (319) Q Consensus 1 ~~~~~~------------------------~~~~~~~~~~~~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~ 56 (319) |.-.|. ..+|-..+...|....+-+.... ..+.+........ .. T Consensus 162 ~s~si~k~~vTa~s~agta~~~li~A~~~q~itg~tga~fa~s~~~an~astA---------ss~Al~gEA~t~~---sT 229 (523) T protein:vir:59 162 SSGAVYYVDVPVASLPGVADVNTVRFWQYDDASGDPENTVAYPLPRYNRIVGA---------VGSALYARLFFVT---GS 229 (523) T ss_pred cccceeeeeccccccccccccccccccccccccccccccccchhhcccccccc---------ccccccccccccc---cc Confidence 221110 11111111111111111100000 0000000000000 00 Q ss_pred ceeeeC-Cc------eEEee---ecc---ccccccccCCCCcccCCcccceeEEEEeecc------cce----eecchhh Q lcl|Aclame:pro 57 DAIFME-GR------SFTVM---KGD---TTELKDYKRNATNEFDHPKIEETTYFLDQEK------YWG----RFVDALD 113 (319) Q Consensus 57 ~~~~~~-g~------tVkIp---~i~---~~g~~DY~r~~~~~~~~~t~t~~tltidqdr------~~~----F~VD~~D 113 (319) +..... |. ..-.. .+. ....+++.. 
.++.+..|-++.+.-|| .|+ +.+.-.- T Consensus 230 d~at~~~Gtt~t~~~~~lyt~~~g~~t~~~~~~~~~~~-----~~~~~~~~~eM~FsIeK~tVtAkSRaLKAeYT~ELAQ 304 (523) T protein:vir:59 230 DFATVAGGTPSTQDLDLVYYIDARNDFEDQSTDPDYPD-----PGFQSLDIPEINLELRSRPVATKTRKLRAAWTPEAMQ 304 (523) T ss_pred cccccCCCcccccccccccccccccchhhccccccccc-----cccccccccceeeEEEeEEEeeecccccccccHHHHH Confidence 000000 00 00000 000 001111110 01122223333222221 222 2222111 Q ss_pred HHHHhh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccc--------cC----CH--------hHHHHHHHHHH Q lcl|Aclame:pro 114 RKDTEG-NIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTV--------GT----GS--------DAQYDAVLDVS 172 (319) Q Consensus 114 ~~et~~-~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~~--------~~----T~--------~n~~~~i~~a~ 172 (319) .-.+-. -+++-..++.-..+.+.-||...++..|...+...... .+ ++ ....+++..+. T Consensus 305 DLKAiH~GLDAE~ELanILStEImlEINR~ii~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~e~~~~l~ 384 (523) T protein:vir:59 305 DLAAYHKGVDLENEIVTLMSQYIAREIDLEILSTIMAHARRTDNYGFWSEVVGEYYDETSGNFVAGNFYGSKQEWLATLM 384 (523) T ss_pred HHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheeeeeccccccceeeecccccchhhhhhhhhhhHHHHHHHH Confidence 112211 24455555666667788899999998886554221110 11 11 12356666665 Q ss_pred HHHHhc--cC------CCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeec-CeEEEEecccccccceEEEEc Q lcl|Aclame:pro 173 VELDEI--KA------PENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELD-GFVIVKVPTKLLQGLQAIAVV 243 (319) Q Consensus 173 ~~Lde~--~V------P~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~id-G~~I~~vps~~~~~~n~i~~~ 243 (319) .++++. .+ -.+-+|++||.+-++|..++.+....+.-....-...+|.+. |..|+.=|. ..--.+++|. 
T Consensus 385 ~~~~~~~n~i~~~t~~~~~~~~~~s~~v~~~l~~~~~~~~~~~~~~~~~~~~~~g~l~~~~~vy~d~~--~~~dy~~~g~ 462 (523) T protein:vir:59 385 IELNKVSNRIQQKTAVAGANFLVTSPQVAALLESMPGFTPGNDNRDGGTGIFYVGMVQGRYRLYKNIY--QNQPVIIMGN 462 (523) T ss_pred HHHHHHHHHHHHhcccccccEEEEchhHHHHHHhccccccCCccccccccceeEEEecCceEEEecCC--CCcceEEEEe Confidence 555432 11 135689999999999988887754322111112233567776 477775333 3334466666 Q ss_pred CCce------eeeeeeeeeeeecC--CCCCccceeeeeeeeeEEEeccccceE-EEEcccc Q lcl|Aclame:pro 244 GEVL------ASPIQADLAKTNSN--IPGMFGTLAEQLLYTGAFVPEHLQKYI-FTIGGTE 295 (319) Q Consensus 244 ~~A~------~~~~k~~~~~~~~~--~~~~~~~~v~gr~~yg~~V~~~k~~~I-y~~~~~~ 295 (319) +.+. .|..-+..+..++- .|++|.-.+--+-=|++.|.+|-.-|+ |+..-.+ T Consensus 463 k~~~~~~~~~~~y~Py~~l~~~~~~~dp~s~qp~~~~~tRY~l~v~nP~~~~~~~~~~~~~ 523 (523) T protein:vir:59 463 QDLNTPWQTGAVYAPYVPLLFTPTIVDPVNFSYRRGLMTRYALEVVRPEFYGLLYVKLLQP 523 (523) T ss_pred cccCCcccccceecccchhhcccccccCCcccceeeeeeehhheecchhHhhhhhhhhcCC Confidence 6533 23333333322221 467888888888889999989887776 5554433 No 176 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=88.51 E-value=0.032 Score=28.79 Aligned_cols=280 Identities=14% Similarity=0.044 Sum_probs=118.9 Q ss_pred CCcccccccceeeehhh-------hhhhhhcc----hhhhhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCceEEe Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQ-------HFANKSVE----PGQTLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSFTV 68 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~-------~~~~~~~~----~n~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkI 68 (319) ++... ++..+....++ -|-+...+ -.-.-.++.+. .+++.+...+.+.. 
+++ .+-.+|+ ++| T Consensus 55 ~~~~~-~~~~~~~~g~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~--~~~--v~~~~~~-~~i 128 (383) T protein:vir:78 55 ARQEA-DAYISASRTDKNITNEEIKFFNDINKEVGYKEETLLPQTVVDEIFEDLTTEHPFLA--SIG--MRTTGLR-TKF 128 (383) T ss_pred HHHHH-HHHHHhcCChhhhhHHHHHHHHHHhccCCCCCccccCHHHHHHHHHHHHhhcccee--eee--eEecCCc-eEE Confidence 11110 01111111111 11111110 00112344443 33333333222211 122 2334555 689 Q ss_pred eeccccccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHh-hhHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 69 MKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTE-GNIDINYVVARQGAEVVAPYLDNLRFATL 147 (319) Q Consensus 69 p~i~~~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~-~~~~~~~~~~~~~~~~vapeiD~~~~s~l 147 (319) |.....+..-+....+-..+..+.++..+++...|.-.|- . +..+--+ ...++.+.+.+..+..++-.+|+..+. T Consensus 129 ~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~i-~-is~ell~Ds~~~ie~~i~~~l~~~~a~~~~~a~i~-- 204 (383) T protein:vir:78 129 LKSETSGVAVWGKIFGEIKGQLDATFSDEESIQNKLTAFV-V-VPKDLEKFGPAWVKRFVVTQIEEAFAVALESAYIV-- 204 (383) T ss_pred EEEcCCcceEEeecccccccccCcceeeEeecceeeEeec-c-chHHHhhccHHHHHHHHHHHHHHHHHHHHhhheEe-- Confidence 9887766655543323223344556677777777765542 2 1111111 113445566677777777777776442 Q ss_pred HhccCcc--------------cc----------ccCCHhH---HHHHHHHHHHH----HHhcc--CCCCcEEEEChHHHH Q lcl|Aclame:pro 148 ARNKAKH--------------LT----------VGTGSDA---QYDAVLDVSVE----LDEIK--APENRVLFVSPTFYK 194 (319) Q Consensus 148 a~~a~~~--------------~~----------~~~T~~n---~~~~i~~a~~~----Lde~~--VP~~R~l~VsP~~~~ 194 (319) +.|.. .+ ..++..+ .++.+...... +.... +-....++++|.-+. 
T Consensus 205 --G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 282 (383) T protein:vir:78 205 --GDGNDKPIGLNRKVGKGSTVVDGVYAEKAATGTLTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDAW 282 (383) T ss_pred --ccCCCCceeeeeccCCcccccccccccccccchhhhhhhHHHHHHHHHHHhccchhcccchhhhcCceEEEEcCcchh Confidence 11110 00 0011111 12222111110 01000 112345778875442 Q ss_pred HHhhhhhhhhcccccccceeeeeeeeecCeEEEEecccccccceEEEEcCCceeeeeeeeeeeeecCCCCC---ccceee Q lcl|Aclame:pro 195 GIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQADLAKTNSNIPGM---FGTLAE 271 (319) Q Consensus 195 ~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~~~k~~~~~~~~~~~~~---~~~~v~ 271 (319) .+. +.... . ..+|....+.|+++..+-+..++...++.+..+......+ ..+++-+..+.. +...++ T Consensus 283 ~~~--~~~~~----~---~~~G~~~t~l~~~~~iv~s~~~p~~~iifgdfs~Y~i~~r-~~~~i~~~~~~~f~~d~~~f~ 352 (383) T protein:vir:78 283 DVK--KQYTS----L---NANGVYVTALPFNLNIIESLFVPEKKAISYVAERYDALIG-GPLDIGTYDQTLAIEDLNLYA 352 (383) T ss_pred hhc--cchhc----c---CCCCceeeecCCCceEEecCCCCcccEEEeeccceEEEec-ccceEEecchhhhhcCceEEE Confidence 221 11110 0 1123333444555432335556655566665554333332 233332212222 335899 Q ss_pred eeeeeeEEEeccccceEEEEccccccCCCCC Q lcl|Aclame:pro 272 QLLYTGAFVPEHLQKYIFTIGGTEVATKRDG 302 (319) Q Consensus 272 gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~ 302 (319) ++.++|.+++++++..++--.-.++...+.| T Consensus 353 ~~~r~dG~~~~~~A~~vl~~~~~~~~~~~~~ 383 (383) T protein:vir:78 353 AKQFAYGKAKDDKAAAVWTLNINPAEQTPEG 383 (383) T ss_pred EEEEEcCEEecCCeEEEEEEEecCCCCCCCC Confidence 9999999999999976666555555555555 No 177 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=87.55 E-value=0.038 Score=28.37 Aligned_cols=273 Identities=13% Similarity=0.014 Sum_probs=114.1 Q ss_pred CCccccc--ccceeeehhhhhhhhhcchhh-----hhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCceEEeeecc Q lcl|Aclame:pro 1 
MNKTIKN--ATGMLKLNLQHFANKSVEPGQ-----TLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGD 72 (319) Q Consensus 1 ~~~~~~~--~~~~~~~~~~~~~~~~~~~n~-----~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~ 72 (319) +...+.. ...-|+.--.-|-+++.+... .-.++-+. .+++.+...+.+.. . ++ .+-.+|+ ++||.-. T Consensus 54 ~~~~~~~~~~~~~lt~ee~~~~~~~~~~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~-~-~~--v~~~~~~-~~~~~~~ 128 (377) T protein:vir:98 54 MERMFDLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLK-V-IN--FKNTSLR-LKALTAE 128 (377) T ss_pred HHHHHHhccCCcccCHHHHHHHHHHHhccCCCCCccccCHHHHHHHHHHHHHhhhhhh-h-ee--eEecCcc-eEEEEec Confidence 0000000 000011111112222221111 11233333 33333332222221 1 22 2333444 6888765 Q ss_pred ccccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHh-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 73 TTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTE-GNIDINYVVARQGAEVVAPYLDNLRFATLARNK 151 (319) Q Consensus 73 ~~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~-~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a 151 (319) ..+-.-+....+-..+..+.+...+++...+...|.. +..+--. ...++...+.+..+..++-.+|...+. +. T Consensus 129 ~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~a~~~--is~elL~ds~~~ie~~i~~~la~~~a~~~~~a~i~----G~ 202 (377) T protein:vir:98 129 TSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVV--IPKDALKFGPKWIKQFITEQLKEAIAVALELAIVK----GD 202 (377) T ss_pred CCcceeEeecccccCcccCccceeEeecceeEEeeec--ccHHhhhccHhHHHHHHHHHHHHHHHHHHhhceEe----cc Confidence 5544433222222223344566777777777665532 2211111 112344555666666777666664332 11 Q ss_pred Ccccc------------------ccCCHhHHHHHHHHH--------------------HHHHHhccCCCCc-EEEEChHH Q lcl|Aclame:pro 152 AKHLT------------------VGTGSDAQYDAVLDV--------------------SVELDEIKAPENR-VLFVSPTF 192 (319) Q Consensus 152 ~~~~~------------------~~~T~~n~~~~i~~a--------------------~~~Lde~~VP~~R-~l~VsP~~ 192 (319) |.... 
.+.+.....+.+.++ ...+++..-..|+ +|+++|.- T Consensus 203 G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~klkd~~G~~i~~~n~~~ 282 (377) T protein:vir:98 203 GLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLTPDNAPKKLVPVMKHLSVNDKKRPLKIAGQVKLILNPED 282 (377) T ss_pred CCCcceeeeecccccccccccccccccccchhhhHhhhhhhchhHHHHHHHHHHHHHHHHHHhhhhccCCceEEEecccc Confidence 11100 000100011111111 1112222223444 45567754 Q ss_pred HHHHhhhhhhhhcccccccceeeeeeeeecCeEEEEecccccccceEEEEcCCceeeeeeeeeeeeecCCCCC---ccce Q lcl|Aclame:pro 193 YKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQADLAKTNSNIPGM---FGTL 269 (319) Q Consensus 193 ~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~~~k~~~~~~~~~~~~~---~~~~ 269 (319) +..+. +... ....+|.-..+.|.++..+-+..++...++++..+......+ ..+++.+..+-. +... T Consensus 283 ~~~~~--p~~~-------~~~~~G~~~t~lg~p~~vv~s~~~p~~~i~fgdf~~Y~i~~r-~~~~i~~~~~~~~~~d~~~ 352 (377) T protein:vir:98 283 RWALE--AQFT-------SRNQFGEYVTVLPHGITILESLAVETGKAIAFVANRYDAFMA-TASTIEEYDQTFAMEDLQL 352 (377) T ss_pred hhhcc--cccc-------ccCCCCccccccCCCceEEecCCCCcccEEEEEecceeEEee-cceEEEeechhhhhcCceE Confidence 43331 1111 111234444556666543446677776777776655444433 234443322322 3468 Q ss_pred eeeeeeeeEEEeccccceEEEEccc Q lcl|Aclame:pro 270 AEQLLYTGAFVPEHLQKYIFTIGGT 294 (319) Q Consensus 270 v~gr~~yg~~V~~~k~~~Iy~~~~~ 294 (319) ++++.++|.+++++++..++.-.+- T Consensus 353 f~~~~r~dg~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 353 YLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred EEEEEEEcCEEeccCcEEEEEEecC Confidence 9999999999999999776654432 No 178 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=87.52 E-value=0.038 Score=28.36 Aligned_cols=290 Identities=11% Similarity=-0.012 Sum_probs=127.1 Q ss_pred 
CCcccccccceee-----------------ehhhhh--hhhh---cchhhhhhhHhhH-HHHHHHHHhhhhhhhcccCcc Q lcl|Aclame:pro 1 MNKTIKNATGMLK-----------------LNLQHF--ANKS---VEPGQTLLKNKHV-GILERVTAVNAYSTPALISND 57 (319) Q Consensus 1 ~~~~~~~~~~~~~-----------------~~~~~~--~~~~---~~~n~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~ 57 (319) +-+.+-.+.|... +.+.+- +..- .....+...+.|. .+++.+...+.+.. . ..+. T Consensus 298 ~~~al~~~~g~~~~a~e~a~~~~~~~~~~~~~~~~a~~~~~~~~~~~~Gg~~vp~~~~~~ii~~l~~~svv~~-l-~~~~ 375 (645) T protein:vir:93 298 FAKSLAAAKGVRSEALEVARRQYPDDSRLHHVLKSAVGAGTTTDPQWAGSLSEYQEYAQDFIDYLRPQTIIGR-F-GQGG 375 (645) T ss_pred HHHHHHhcccchhHHHHHHHhhcccchhhhhhhhhhhhccccccccccCCccCchhhHHHHHHhhhhhhhHHh-h-cccc Confidence 0000111111110 000000 0000 0001112223332 23333322222211 1 1111 Q ss_pred eee-e-CCceEEeeeccccccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhh-hHHHHHHHHHHHHHH Q lcl|Aclame:pro 58 AIF-M-EGRSFTVMKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEG-NIDINYVVARQGAEV 134 (319) Q Consensus 58 ~~~-~-~g~tVkIp~i~~~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~-~~~~~~~~~~~~~~~ 134 (319) ... . ....++||+....+..-|...++- ...-+.++..+++.-.|--. .+. +..+-... ...+...+.+..... T Consensus 376 ~~~~~~~~~~~~ip~~t~~~~a~wv~Eg~~-~~~s~~~f~~v~l~~~kla~-~~~-iS~ell~ds~~~~~~~i~~~l~~a 452 (645) T protein:vir:93 376 IPALRQVPFNIRVHAQVSGGAAGWVGEGKT-KPLTKFDFESITFSHAKVSA-IAV-LTEELIRFSSPAADALVRNALAEA 452 (645) T ss_pred ccccccccCceeeeeeecCcceEEeccCcc-ccccccceeEEEEeeEEEEE-eeh-hHHHHHhhchHHHHHHHHHHHHHH Confidence 111 1 123688998766555555433222 22223344455554444322 222 22221111 134456677888889 Q ss_pred HHHHHHHHHHHHHH-hcc---Ccccc---cc-CCHhHHHHHHHHHHHHHHhccCC-CCcEEEEChHHHHHHhhhhhhhhc Q lcl|Aclame:pro 135 VAPYLDNLRFATLA-RNK---AKHLT---VG-TGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGIKKFVIALPQ 205 (319) Q Consensus 135 vapeiD~~~~s~la-~~a---~~~~~---~~-~T~~n~~~~i~~a~~~Lde~~VP-~~R~l~VsP~~~~~L~~~~~f~~~ 205 (319) ++-.+|...+.--- ..+ +.... .+ .+..+.+.-+..+...+..+++. 
.+-+++++|..+..|.+-.+-..+ T Consensus 453 ia~~~d~a~l~g~g~~~~~~~p~gi~~~~~~~~~~~~~~~d~~~~~~~~~~a~~~~~~a~~vmn~~~~~~L~~lkd~~G~ 532 (645) T protein:vir:93 453 VVARLDTDFVDPKKAAVADVSPASITHDVKGTASSGNPDADAEAAFGQFVAANLQPTGAVWLMSSTNALALSMRKNALGQ 532 (645) T ss_pred HHHHHHHHhhcCCCcccCCccccceeccccccccccchHHHHHHHHHHHHhcCCCccccEEEEcHHHHHHHHhccccCCc Confidence 99999987663111 111 11111 01 12234455677777788887776 567889999999888765432111 Q ss_pred ccccccceeeeeeeeecCeEEEEeccccccc-------ceEEEEcC-CceeeeeeeeeeeeecCCCC------------- Q lcl|Aclame:pro 206 GDTRQQVLGKGVQGELDGFVIVKVPTKLLQG-------LQAIAVVG-EVLASPIQADLAKTNSNIPG------------- 264 (319) Q Consensus 206 ~~~~~~~~~~g~Vg~idG~~I~~vps~~~~~-------~n~i~~~~-~A~~~~~k~~~~~~~~~~~~------------- 264 (319) . ........| ++|.|.||+.+. .+++ -.++++.. ...+.......+++...+.+ T Consensus 533 ~-~~~~~~~~~--~tL~G~PV~~s~--~vp~~~~~gd~s~~~ig~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~l 607 (645) T protein:vir:93 533 K-EYPDMTLLG--GSFQGLPVIVSQ--YVGDQLVLVNAPDIYLADDGGVAVDMSREASLEMQSEPTGDSTTPSPVELVSM 607 (645) T ss_pred e-eecCCCCCC--ceeeceeeEEec--cCCcceeEeccccEEEEEecceEEEeecceeEEEeecccccccccccccchhH Confidence 0 011111122 579999998642 2222 12233333 23334444444544332111 Q ss_pred --CccceeeeeeeeeEEEeccccceEEEEccccccCCCCCc Q lcl|Aclame:pro 265 --MFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDGV 303 (319) Q Consensus 265 --~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~~ 303 (319) .+--+++-..++|..+.+|++.++...+ +=-+..|+ T Consensus 608 f~~d~vaira~~r~d~~~~~p~a~~~lt~~---~~g~~~~~ 645 (645) T protein:vir:93 608 FQTGSVAIRAERWINWRRRRTAAVAVITGV---NYGSASGG 645 (645) T ss_pred hhcCceEEEEEEEEcceeeCccceEEEecc---cCCcccCC Confidence 1234788889999999999985443322 22223333 No 179 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=87.26 E-value=0.04 Score=28.25 Aligned_cols=275 Identities=13% Similarity=0.079 Sum_probs=114.3 Q ss_pred 
eeeehhhhhhhhhcchh------hhhhhHhh--HHHHHHHHHhhhhhhhcccC-cceeeeCCceEEeeeccccccccccC Q lcl|Aclame:pro 11 MLKLNLQHFANKSVEPG------QTLLKNKH--VGILERVTAVNAYSTPALIS-NDAIFMEGRSFTVMKGDTTELKDYKR 81 (319) Q Consensus 11 ~~~~~~~~~~~~~~~~n------~~~l~~ky--~~lld~~~~~~sl~~~~~~n-~~~~~~~g~tVkIp~i~~~g~~DY~r 81 (319) ||+-|- +.....- .-+....| ...|...-. -++...... ...-.+.|+||+..+-.-..-..--. T Consensus 1 ~~~~~a----~~~~~~~s~~g~~~~~~~t~y~~~k~L~~Aa~--~lv~~~fA~~~piPkn~GkTIk~r~y~pl~~~~~pl 74 (401) T protein:vir:95 1 MLNYNA----PTDGQKSSIDGANSDQMQTFFWLKKAIITARK--EQYFMPLASVTNMPKHYGKTIKVYEYVPLLDDRNIN 74 (401) T ss_pred CCccCC----CcccccccccccccceeeehhhHHHHHhhhhh--hhhhhhcccccccccccCCeEEEEecccccccccch Confidence 333331 1111111 11122222 222322211 133333322 33467789999986543211000000 Q ss_pred CCCccc-----------------CCcc-----------------cceeEEEEeecccceeecchhhHHH----HhhhHH- Q lcl|Aclame:pro 82 NATNEF-----------------DHPK-----------------IEETTYFLDQEKYWGRFVDALDRKD----TEGNID- 122 (319) Q Consensus 82 ~~~~~~-----------------~~~t-----------------~t~~tltidqdr~~~F~VD~~D~~e----t~~~~~- 122 (319) ..|.++ ++++ .+.++....-.+ +.|.+.-.|+.+ -+.... T Consensus 75 ~eGv~a~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~q-yG~~~e~Td~~~dt~~D~~l~~h 153 (401) T protein:vir:95 75 DQGIDASGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHK-FGFFYEFTQESIDFDSDDGLMEH 153 (401) T ss_pred hcCCCcccccccCccccccccccceeecccccccccccccccccceeeeeeeeeee-ccCccchhhhhhhhhcchHHHHH Confidence 011111 1111 112222211122 222222222222 111010 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhc-------cCccc-----cccCCHhHHHHHHHHHHHHHHhccC---------- Q lcl|Aclame:pro 123 INYVVARQGAEVVAPYLDNLRFATLARN-------KAKHL-----TVGTGSDAQYDAVLDVSVELDEIKA---------- 180 (319) Q Consensus 123 ~~~~~~~~~~~~vapeiD~~~~s~la~~-------a~~~~-----~~~~T~~n~~~~i~~a~~~Lde~~V---------- 180 (319) +...+...+...- .|..+...|+++ +.+.. ....+..-.++.+..+...|+++.. 
T Consensus 154 ~s~ell~g~~~~t---~d~i~~dll~ag~~viyAg~ats~At~~~~~~~~t~vt~~~l~rl~~~L~~nRapk~t~~i~~s 230 (401) T protein:vir:95 154 LSRELMNGATQIT---EAVLQKDLLAAAGTVLYAGAATSDATITGEGSTPSVVSYKNLMRLDQILTENRTPTQTTIITGS 230 (401) T ss_pred HHHHHhhhhhhhH---HHHHHHHHHhhcCeeecCCccceeeeccccccccceechhHHHHHHHHHHhcccccchhhhhhh Confidence 1112223332222 222222333222 21111 1122233357788888888886332 Q ss_pred --------CCCcEEEEChHHHHH------Hhhhhhhhhcccccc-cceeeeeeeeecCeEEEEecccc----------cc Q lcl|Aclame:pro 181 --------PENRVLFVSPTFYKG------IKKFVIALPQGDTRQ-QVLGKGVQGELDGFVIVKVPTKL----------LQ 235 (319) Q Consensus 181 --------P~~R~l~VsP~~~~~------L~~~~~f~~~~~~~~-~~~~~g~Vg~idG~~I~~vps~~----------~~ 235 (319) +.-||.||-|+.-.. |..++.|+.....+. +...+|.||++.+++++.+|.-. .. T Consensus 231 ~~~dTk~i~~s~va~~h~~L~~di~a~~D~~~~~~fi~v~kYa~~~~i~~gEiG~i~~vR~i~~p~~~~w~~ag~~a~~~ 310 (401) T protein:vir:95 231 RMIDTKVIGATRVMYVGSELVPELKAMKDLFGNKAFIETQHYADAGTIMNGEVGSIDKFRIIQVPEMLHWAGAGAQATGA 310 (401) T ss_pred hccCccccccceEEEEecCchhHHHHHHHhcCCCCceehhhcCCccccccccccccCceeEEecccceeecCCccccccc Confidence 235788987744433 446688998777764 56889999999999999876411 00 Q ss_pred ----------------cceEEEEcCCceeeee-----eee--eeeeecCCCC--Cccc------eeeeeeeeeEEEeccc Q lcl|Aclame:pro 236 ----------------GLQAIAVVGEVLASPI-----QAD--LAKTNSNIPG--MFGT------LAEQLLYTGAFVPEHL 284 (319) Q Consensus 236 ----------------~~n~i~~~~~A~~~~~-----k~~--~~~~~~~~~~--~~~~------~v~gr~~yg~~V~~~k 284 (319) =+..+++-+.|.+... +.. .+-+-+|... .++| .+-.-.||++.|++++ T Consensus 311 ~~~y~~~~~~~gg~~dVyp~lV~G~dAf~~~~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQ~g~vgwK~~~a~~vL~~e 390 (401) T protein:vir:95 311 NPGYRTSMVSGQEHYDVYPMLVVGDDSFTSIGFQTDGKSLKFTVMTKMPGKETADRNDPYGETGFSSIKWYYGILVKRPE 390 (401) T ss_pred ccccccccccCCCcceeeeeeEEccccceecccccCCccccceeEeecCCcCCCCCCCcccceehhhhhhhhhhheeccc Confidence 1345666565554331 100 0111122110 0111 2334468889999887 Q ss_pred cceEEEEccccc Q lcl|Aclame:pro 285 QKYIFTIGGTEV 296 (319) Q Consensus 285 ~~~Iy~~~~~~~ 296 (319) -... 
+...++- T Consensus 391 ~m~~-ies~a~~ 401 (401) T protein:vir:95 391 RLAL-IKTVAPL 401 (401) T ss_pred eeEE-EEeecCC Confidence 6433 1221111 No 180 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=86.27 E-value=0.047 Score=27.87 Aligned_cols=290 Identities=9% Similarity=-0.101 Sum_probs=126.1 Q ss_pred CCcccccccceeeehhhhhhhh--hcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcce---eeeCCceEEeeeccc-c Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANK--SVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDA---IFMEGRSFTVMKGDT-T 74 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~--~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~---~~~~g~tVkIp~i~~-~ 74 (319) |--| -+..|..+| .-.+|..+... .+.+++....++ .|.++ ...+|+.|.||-+.. + T Consensus 1 Ma~T---------~l~D~iipe~~vf~~Yv~~~~~----e~~~l~qSGii~----~d~~l~~~~~~gG~~~~iPf~~~l~ 63 (349) T protein:vir:94 1 MAIT---------TIGNIVTGNIPVLASYMTEDPV----EKTAFFNSGILT----PTPYAAEIARGPSNIANLPFWKAID 63 (349) T ss_pred CCce---------EEeeeeccChHHHHHHHHHhHH----Hhhhhhhcccee----ccHHHHHHHhcCCCEEEeeeeecCC Confidence 2211 122333333 23344333332 233333322222 12223 236899999999975 4 Q ss_pred ccc--cccCCCC---cccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 75 ELK--DYKRNAT---NEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLAR 149 (319) Q Consensus 75 g~~--DY~r~~~---~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~ 149 (319) |-. +|+.++. .+++.++. .+..-+-+.|...|...|+-.+-+.. 
+++...+++-+.--.....+.+++.|.+ T Consensus 64 g~~e~n~~~dt~~~~~t~~kit~-~~~~a~~~~r~kaw~~~Dla~~lsG~--dpm~~Ia~~va~yW~r~~q~~Lia~L~G 140 (349) T protein:vir:94 64 TSIEPNYSNDVYQDIATPRAIQT-GEMMARVAYLNEGFGQADLTVELTSQ--NPLQSVASRLDNFWQRQAQRRLIATALG 140 (349) T ss_pred CCcccccCCCCcccccccccccc-cceeeeeeeeccccchhHHHHHhhCc--hHHHHHHHHHHHHHhhHHHHHHHHHHHh Confidence 543 4654432 23343332 33334445666677777776655532 4443333332222333334445554432 Q ss_pred c-----cCccc-------cccC--CHhHHHHHHHHHHHHHHhc--cCCCC--cEEEEChHHHHHHhhhhhhhhccccccc Q lcl|Aclame:pro 150 N-----KAKHL-------TVGT--GSDAQYDAVLDVSVELDEI--KAPEN--RVLFVSPTFYKGIKKFVIALPQGDTRQQ 211 (319) Q Consensus 150 ~-----a~~~~-------~~~~--T~~n~~~~i~~a~~~Lde~--~VP~~--R~l~VsP~~~~~L~~~~~f~~~~~~~~~ 211 (319) - +..+. +..+ ++..-.+.+.++..+|-++ +-..+ ..++|-+.+|..|++.....- -+. T Consensus 141 vf~~~~~~~~~~~~~~~~~~d~~~~a~~~~~~~~~A~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~~li~~----i~~ 216 (349) T protein:vir:94 141 LYNDNVSATDAYHEQNDMVVDVSATSGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLIDF----IRD 216 (349) T ss_pred hhcccccccccccccCceeEEecccCCCChhhHHHHHHHHHHHhccccccceeEEEEchHHHHHHHhcchhhh----ccC Confidence 1 11110 0111 1111234555566666553 22333 468899999999987653221 112 Q ss_pred ceeeeeeeeecCeEEEEe---cccc---cccceEEEEcCCceeeeeeeee--eeeecCCCCCc--c-ceeeeeeeeeEEE Q lcl|Aclame:pro 212 VLGKGVQGELDGFVIVKV---PTKL---LQGLQAIAVVGEVLASPIQADL--AKTNSNIPGMF--G-TLAEQLLYTGAFV 280 (319) Q Consensus 212 ~~~~g~Vg~idG~~I~~v---ps~~---~~~~n~i~~~~~A~~~~~k~~~--~~~~~~~~~~~--~-~~v~gr~~yg~~V 280 (319) ...+..|+.+.|.+|+.. |-.. ...+-..+.-++|+.+-..-.. +|..|.+-+.. 
| +.+..|+.| + T Consensus 217 s~~~~~i~ty~G~~VivDD~~Pv~~~g~~~~yttylfg~GAi~~~~~~~~~~~E~~rd~~~g~~~G~d~L~~R~~~---~ 293 (349) T protein:vir:94 217 AENNTMFATYQGYRVIVDDSMTVVGQDTSRKFISIIFGQGAIGYGEGNPEMPLEYEREASRANGGGVETLWTRKTW---L 293 (349) T ss_pred cccCcccceecCcEEEEeCCCccccCCCCceEEEEEeecceEEeecCCCCcceeeecccccCCcceeEEEEEeeEE---E Confidence 224556788889999853 2111 1123344555778877766432 55555322221 2 455555443 3 Q ss_pred eccccceEEEEccccccCCCCCcccccccccc-ccccccC Q lcl|Aclame:pro 281 PEHLQKYIFTIGGTEVATKRDGVDAHADNVAK-PSGSLEM 319 (319) Q Consensus 281 ~~~k~~~Iy~~~~~~~a~~~~~~~~~~~~~~~-~~~~~~~ 319 (319) +-|+.-. |.....+. ...++.+.+.+.+-+ ...+-|. T Consensus 294 ~hp~G~s-~~~a~v~~-~~~~~~~~sPt~aeLa~~~NW~~ 331 (349) T protein:vir:94 294 LHPFGYS-FTSAVITG-NGTETIARSASWQDLANAANWNR 331 (349) T ss_pred eeeeeee-ecccccCC-CccccccCCCChHHhcCCcCccc Confidence 4444311 22211110 000112222232222 1122222 No 181 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=86.02 E-value=0.049 Score=27.79 Aligned_cols=256 Identities=9% Similarity=-0.005 Sum_probs=124.3 Q ss_pred hhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcce--ee---e-CCceEEeeecccccccc-ccC-CCCcccCCc Q lcl|Aclame:pro 19 FANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDA--IF---M-EGRSFTVMKGDTTELKD-YKR-NATNEFDHP 90 (319) Q Consensus 19 ~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~--~~---~-~g~tVkIp~i~~~g~~D-Y~r-~~~~~~~~~ 90 (319) ..+....+ -+++.++.++.+++.... ..+.-+++ +. . +..++..+..+..|... 
|.- ......-++ T Consensus 1 ~~~~~~g~---f~~~~l~~id~~v~e~~~---~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~~~~ 74 (301) T protein:vir:80 1 MQGKITAT---IEARDLQAIDNVIYEPKQ---EELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGADDLPLVDV 74 (301) T ss_pred CCccccch---hhHHHHHHHHHHHHHhhh---hhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCcccccccccc Confidence 12222222 355555555555543221 12223343 11 1 33467777777666443 321 122334466 Q ss_pred ccceeEEEEee-cccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc--------C--c-----c Q lcl|Aclame:pro 91 KIEETTYFLDQ-EKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNK--------A--K-----H 154 (319) Q Consensus 91 t~t~~tltidq-dr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a--------~--~-----~ 154 (319) +.++....+-+ ..+|.+...++...+..+ .++.......++.++.-..|...|--..... + . . T Consensus 75 ~~~~~~~~i~~~~~~~~~~~~El~~a~~~g-~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g~~GLlN~p~~~~~~~~~~ 153 (301) T protein:vir:80 75 DMVRKSVPIYSIGIGLSYTIQDLRAARMQG-TTVDAAKATTVRRAIAEKENSIAFRGEKKYAIKGAFEATGIQIDVSPTT 153 (301) T ss_pred cceeEEEEEEEEEeeeeecHHHHHHHHHhC-CChHHHHHHHHHHHHHHhhceEEeeecccccceeeecCCCcccccccCc Confidence 67777777776 678888888888887644 4444445566777777777776552111100 0 0 0 Q ss_pred cc------ccCCHhHHHHHHHHHHHHHHhc--cCCCCcEEEEChHHHHHHhhh---hhhhhcccccccceeeeeeeeecC Q lcl|Aclame:pro 155 LT------VGTGSDAQYDAVLDVSVELDEI--KAPENRVLFVSPTFYKGIKKF---VIALPQGDTRQQVLGKGVQGELDG 223 (319) Q Consensus 155 ~~------~~~T~~n~~~~i~~a~~~Lde~--~VP~~R~l~VsP~~~~~L~~~---~~f~~~~~~~~~~~~~g~Vg~idG 223 (319) .. ...|++.+++-|.++..+|.+. 
++-..-.|+++|+.|..|..- +..- ....+-...+ .-+ T Consensus 154 ~~~~~~~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~~--~tvl~~l~~~-----~~~ 226 (301) T protein:vir:80 154 GVGNVSKWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINKKRYSNEDS--RSVLKVLQDN-----AWF 226 (301) T ss_pred ccccccccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhhccccCCCC--eeHHHHHHHH-----cCc Confidence 00 1236788999999999999775 443456799999999988521 1110 0001101111 112 Q ss_pred eEEEEecccc---cccceE-EEEcCC----ceeeeeeeeeeeeecCCCCC-ccceee-eeeeeeEEEeccccceEEEEc Q lcl|Aclame:pro 224 FVIVKVPTKL---LQGLQA-IAVVGE----VLASPIQADLAKTNSNIPGM-FGTLAE-QLLYTGAFVPEHLQKYIFTIG 292 (319) Q Consensus 224 ~~I~~vps~~---~~~~n~-i~~~~~----A~~~~~k~~~~~~~~~~~~~-~~~~v~-gr~~yg~~V~~~k~~~Iy~~~ 292 (319) ..|+.+|--. ..+.+. ++...+ .+.++.++ +.. +.|.+ -.+.+. .-.+.|+.|.+|......--. T Consensus 227 ~~I~~~p~L~~~g~~g~~~~v~~~~~~d~~~~~v~~~~---~~~-~~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 227 SAIVRVPDLAGMGTAGSDSFAVIHDSNETAELIIPMDI---TRH-PEEYSFPRTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred ceEEEcceeccCCCCcccEEEEEecCCcEEEEEecCce---eee-cceecCceeEeeeeeeeEEEEEEccceEEEEecC Confidence 3343333211 111222 222211 11222222 222 22322 234442 345678999999873221112 No 182 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=85.82 E-value=0.05 Score=27.72 Aligned_cols=277 Identities=12% Similarity=0.032 Sum_probs=122.0 Q ss_pred CCccc-ccc-cceeeehhhhhhhhhcc-----hhhhhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCceEEeeecc Q lcl|Aclame:pro 1 MNKTI-KNA-TGMLKLNLQHFANKSVE-----PGQTLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGD 72 (319) Q Consensus 1 ~~~~~-~~~-~~~~~~~~~~~~~~~~~-----~n~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~ 72 (319) ++.-+ .+. .--|+---.-|-+.+.+ -.-.-+++.+. .+++.+...+.+.. +++ ++ .-+..++||.-. 
T Consensus 54 ~~~~~~~~~~~~~lt~ee~~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~--~~~--v~-~~~~~~~i~~~~ 128 (377) T protein:vir:96 54 MERMFDLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLK--VIN--FK-NTSLRLKALTAE 128 (377) T ss_pred HHHHHHhccCCcccCHHHHHHHHHHHhcCCCCCCceecCHHHHHHHHHHHHhhhhhhh--hce--eE-ecCCceEEEEec Confidence 00000 000 00000000111111110 00111333332 34444333222222 122 22 223457888766 Q ss_pred ccccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHh-hhHHHHHHHHHHHHHHHHHHHHHHHHH-H---- Q lcl|Aclame:pro 73 TTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTE-GNIDINYVVARQGAEVVAPYLDNLRFA-T---- 146 (319) Q Consensus 73 ~~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~-~~~~~~~~~~~~~~~~vapeiD~~~~s-~---- 146 (319) ..+-..+....+-..+..+.+...+++...|.-.|.. +..+--. ...++...+.+..+..++-.+|+..+. . T Consensus 129 ~~~~a~wv~e~~~~~~~~~~~f~~i~l~~~kl~~~~~--is~~ll~ds~~~le~~i~~~l~~~~~~~~~~a~i~G~G~~~ 206 (377) T protein:vir:96 129 TSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVV--IPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQ 206 (377) T ss_pred CCcceeEeecccccccccCccceeEeeeeeeEEeech--hhHHHhhcchhhHHHHHHHHHHHHHHHHHhhceEeccCCCc Confidence 5554444322222223344567777777777665532 3322221 123345556666666777666665442 0 Q ss_pred ----HHhccCcc--------------------ccccCCHhHHHHHHHHHHHHHHhcc------CCCCcEEEEChHHHHHH Q lcl|Aclame:pro 147 ----LARNKAKH--------------------LTVGTGSDAQYDAVLDVSVELDEIK------APENRVLFVSPTFYKGI 196 (319) Q Consensus 147 ----la~~a~~~--------------------~~~~~T~~n~~~~i~~a~~~Lde~~------VP~~R~l~VsP~~~~~L 196 (319) +...+... 
.....+.+.+++.+..+...+...+ ...+-+++++|..+..+ T Consensus 207 P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~ 286 (377) T protein:vir:96 207 PVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTL 286 (377) T ss_pred ceeeeeccccccccccccccccceeeccccccccccCChhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHHhc Confidence 00000000 0011234556666666665554332 12345688999877654 Q ss_pred hhhhhhhhcccccccceeeeeeeeecCeEEEEecccccccceEEEEcCCceeeeeeeeeeeeecCCCCC---ccceeeee Q lcl|Aclame:pro 197 KKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQADLAKTNSNIPGM---FGTLAEQL 273 (319) Q Consensus 197 ~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~~~k~~~~~~~~~~~~~---~~~~v~gr 273 (319) . +++..... +|.-..+.|+++..+-+..++...++.+..+-.....+ ..+++-+-.+-. +...+++. T Consensus 287 ~--~~~~~~~~-------~G~~~~~l~~p~~v~~s~~~p~~~i~fgdf~~Y~i~~r-~~~~i~~~~~~~~~~d~~~f~~~ 356 (377) T protein:vir:96 287 E--AKFTSRNQ-------FGEYVTVLPHGITILESLAVETGKAIAFVANRYDAFMA-TASTIEEYDQTFAMEDLQLYLTK 356 (377) T ss_pred c--ccccccCC-------CCCceeccCCCceEEecCCCCcccEEEEEcCcEEEEEe-cccEEEeehhhhhhcCCeEEEEE Confidence 2 12221111 23333455555433335566665566665554333333 233332222222 34689999 Q ss_pred eeeeEEEeccccceEEEEccc Q lcl|Aclame:pro 274 LYTGAFVPEHLQKYIFTIGGT 294 (319) Q Consensus 274 ~~yg~~V~~~k~~~Iy~~~~~ 294 (319) .++|.+++++++..++.-..- T Consensus 357 ~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 357 NYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred EEEcCEEecCCcEEEEEEecC Confidence 999999999999666554432 No 183 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=82.87 E-value=0.073 Score=26.81 Aligned_cols=266 Identities=9% Similarity=-0.043 Sum_probs=120.5 Q ss_pred hhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeee-CCceEEeeeccccccc-cc---cCCCCcccCCccc Q lcl|Aclame:pro 18 
HFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFM-EGRSFTVMKGDTTELK-DY---KRNATNEFDHPKI 92 (319) Q Consensus 18 ~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~-~g~tVkIp~i~~~g~~-DY---~r~~~~~~~~~t~ 92 (319) |-|.-|+.. +++.-+ +.+.+.. ...+.+..++.-+-... ...++....++..|.. +| ++......-+++. T Consensus 1 ~~~lafl~~-qL~~id--~~vye~~--~~~~~~~~lipv~t~~~~~~~~~~~~~~d~~G~a~~~~i~~~a~dip~vd~~~ 75 (304) T protein:vir:52 1 MSLLAYVKN-GLTAVS--KDIAETK--YPEIVFPQFVYVDQQTAVGITEKLHYGADEHGSLDDGLITVGTSTLDQVEVGF 75 (304) T ss_pred CchHHHHHH-HHHHHh--hhhhccc--cccchhhhhccccCCCCcccceEEEeeeeccCcccccccCCcCCccceeeccc Confidence 555545433 222211 1111111 12334433333211222 3446777777776644 44 3344444456666 Q ss_pred ceeEEEEee-cccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC---------cc-------- Q lcl|Aclame:pro 93 EETTYFLDQ-EKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKA---------KH-------- 154 (319) Q Consensus 93 t~~tltidq-dr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~---------~~-------- 154 (319) ++...++.. ..++.+.+.++-.++..+ .++...-.+-++.++...+|+..|--.....+ .. T Consensus 76 ~~~~~~i~~~~~~~~y~~~El~~a~~~g-~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g~~GllN~p~v~~~~~~~~~ 154 (304) T protein:vir:52 76 TPTRSYIVPWAKSVTWTKPELEQGKLLG-LALNTAKIMALNKNAQQTLQKVAFLGHAKDSRLTGLLNNKSVEVYAIKGAA 154 (304) T ss_pred ceeEEEEEEEeeeeeecHHHHHHHHHhC-CCcHHHHHHHHHHHHHhhhceEEEEeeccccceEEEEeCCCcceeeecCCc Confidence 777666655 567888887777666543 23322222223333433333332200010001 00 Q ss_pred ---ccccCCHhHHHHHHHHHHHHHHhc--cCCCCcEEEEChHHHHHHhhhhhhhh-cccccccceeeeeeeeecCeEEEE Q lcl|Aclame:pro 155 ---LTVGTGSDAQYDAVLDVSVELDEI--KAPENRVLFVSPTFYKGIKKFVIALP-QGDTRQQVLGKGVQGELDGFVIVK 228 (319) Q Consensus 155 ---~~~~~T~~n~~~~i~~a~~~Lde~--~VP~~R~l~VsP~~~~~L~~~~~f~~-~~~~~~~~~~~g~Vg~idG~~I~~ 228 (319) .=...|++.|++.|.++..++... ++-..-.|.++|+.+..|..- +... ..++.+-...+..-.+--+.+|.. 
T Consensus 155 a~~~w~~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp~~~~~l~~~-~~~~~~~Tvl~~l~~n~~~~~g~~l~I~~ 233 (304) T protein:vir:52 155 QNTKVQAMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDSLDLAHLALV-QRANTDTTALEFLTKHLSAAAGRQVAIKA 233 (304) T ss_pred cCCccccCCHHHHHHHHHHHHHHHHhccCceecCceEEeCHHHHHHHhhc-cCCCCCchHHHHHHHhcccccCCcceEEE Confidence 001236788999999998888654 222344699999999888431 1000 001111111222111112346666 Q ss_pred ecccccc----cceEEEEcCCceeeeeee--eeeeeecCCCCC--ccceee-eeeeeeEEEeccccceEEEEc Q lcl|Aclame:pro 229 VPTKLLQ----GLQAIAVVGEVLASPIQA--DLAKTNSNIPGM--FGTLAE-QLLYTGAFVPEHLQKYIFTIG 292 (319) Q Consensus 229 vps~~~~----~~n~i~~~~~A~~~~~k~--~~~~~~~~~~~~--~~~~v~-gr~~yg~~V~~~k~~~Iy~~~ 292 (319) +|....+ +.+.+++.....-..... ...+.. |.|.+ ..+.+- .-++.|+.|..|.. .+|+.. T Consensus 234 v~~~~~~~g~~g~~r~vvY~~d~~~~~~~vP~p~~~l-~~q~~~~~~~~vp~~~r~gGv~v~~P~a-~~y~D~ 304 (304) T protein:vir:52 234 LPSNYGTRVTDGKTRAMVYVNSKEHVIFDVPMSPTVL-DAQPKGLLAFESGLRMAFGGVTFMEPDS-ALYVDY 304 (304) T ss_pred ecccccccCCCCceEEEEEecChhheEEecCcccccc-chhhcCCceEEecceeeeeeEEEEccce-eeeecC Confidence 6643321 223233222211111111 011111 22332 346663 44589999999997 467777 No 184 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=81.98 E-value=0.081 Score=26.57 Aligned_cols=257 Identities=12% Similarity=-0.046 Sum_probs=121.2 Q ss_pred hhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccc-cccccccCCCCcccCCcccceeE Q lcl|Aclame:pro 18 HFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDT-TELKDYKRNATNEFDHPKIEETT 96 (319) Q Consensus 18 ~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~-~g~~DY~r~~~~~~~~~t~t~~t 96 (319) |..+. .+--.|.+-+...+.+-|....-+|...+.+ .. ...++-+...++. +++... 
.+.+..+++.....+ T Consensus 1 m~it~---~~l~~l~~~~~~~~~~~y~~a~~~~~~~a~~-~~-sdf~~~~~~~lg~~p~l~e~--~Ge~~~~~l~~~~~~ 73 (302) T protein:vir:10 1 MLINK---QSLNAAFVAIKTIFNNAFAAAPTTWQKIAME-VP-SNTSSNDYKWLSTFPKMRRW--IGAKVVKNLKAYKYV 73 (302) T ss_pred CcccH---HHHHHHHHHHHHHHHHHHHhhhhhhhceeee-cC-CCcceeeceecCCCCCcccc--ccceeecccccccee Confidence 22211 1122455555666667777777677665532 21 3334444444432 344443 244556666555555 Q ss_pred EEEe-ecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccc-------------------c Q lcl|Aclame:pro 97 YFLD-QEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHL-------------------T 156 (319) Q Consensus 97 ltid-qdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~-------------------~ 156 (319) +++. +++-+++.=+++..++... -..+.+.+....+...|..+++.|.++...+- . T Consensus 74 i~~~~~g~~v~i~R~~i~nDdlg~----~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~~DG~~fF~~dH~~g~~~~~N 149 (302) T protein:vir:10 74 VENEDFEATVEVDRNDIEDDQIGI----YSPQAKMAGYSAAQLPDELVYEAVNGAFTKPCFDGQYFIDTDHPVGDASVSN 149 (302) T ss_pred EEeecccceecccHHhhcccccch----hHHHHHHHHHHHHhhHHHHHHHHHhccCCCcccCCcceeccccccccccccc Confidence 5543 3666666666666666432 23344566677788889999998876432110 0 Q ss_pred ----------ccCCHhHHHHHHHHHHHHH-HhccCC---CCcEEEEChHHHHHHhh---hhhhhhcccccccceeeeeee Q lcl|Aclame:pro 157 ----------VGTGSDAQYDAVLDVSVEL-DEIKAP---ENRVLFVSPTFYKGIKK---FVIALPQGDTRQQVLGKGVQG 219 (319) Q Consensus 157 ----------~~~T~~n~~~~i~~a~~~L-de~~VP---~~R~l~VsP~~~~~L~~---~~~f~~~~~~~~~~~~~g~Vg 219 (319) ..++ ...|++.+.++.++ +..|-| ..++|+|+|.....-++ +... ..+......|. 
T Consensus 150 ~g~~~~~~~~~~l~-~~~~~aa~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~ll~~~~~----~~g~~Np~~g~-- 222 (302) T protein:vir:10 150 KGTAPLSNASQAAA-KAGYGAARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKMLLTNPKL----ADNTPNPYVGT-- 222 (302) T ss_pred ccchhhhhcccccc-hHHHHHHHHHHHHHhhhcccccccCCCEEEecchhHHHHHHHhhcccc----CCCCcceeccc-- Confidence 0112 23455555555444 444556 36899999987754432 2221 11111112233 Q ss_pred eecCeEEEEecccccccceEEEEcCCce--eeeeeeeeeeeecC-CCCCccceeeeeeeeeEEEeccccce----EEEEc Q lcl|Aclame:pro 220 ELDGFVIVKVPTKLLQGLQAIAVVGEVL--ASPIQADLAKTNSN-IPGMFGTLAEQLLYTGAFVPEHLQKY----IFTIG 292 (319) Q Consensus 220 ~idG~~I~~vps~~~~~~n~i~~~~~A~--~~~~k~~~~~~~~~-~~~~~~~~v~gr~~yg~~V~~~k~~~----Iy~~~ 292 (319) ++++..|.-...+-.+++..+.++ ++.+....+++... .++.++-.++-++.||+.---+-.-+ .|.+. T Consensus 223 ----~~~vv~p~L~s~~aWyL~a~~~~i~~~~l~g~~~P~~~~~~~~~~dgv~~k~~~d~Gvd~R~~~G~~~wq~a~~s~ 298 (302) T protein:vir:10 223 ----AELVVDGRIESDTAWFLLDTTKPVKPFIFQPRKQPEFVSQVNLDSDDVFNLRKLKFGAEARAAAGYGFWQLAYGST 298 (302) T ss_pred ----eEEEEeeccCCCCceEEEecCCccceEEEcCccccEEEeccCCCCCceEEEEEEEEeeeeeeecchhhhhhhhccC Confidence 456654433223334455444431 12222233333321 23444556666666665221111111 13333 Q ss_pred cccccC Q lcl|Aclame:pro 293 GTEVAT 298 (319) Q Consensus 293 ~~~~a~ 298 (319) + +++ T Consensus 299 g--~~~ 302 (302) T protein:vir:10 299 G--TGA 302 (302) T ss_pred c--cCC Confidence 3 121 No 185 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=78.08 E-value=0.12 Score=25.67 Aligned_cols=277 Identities=10% Similarity=-0.029 Sum_probs=111.4 Q ss_pred CCcccccccceeeehhhhhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccccc--ccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTE--LKD 78 (319) Q Consensus 1 
~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~g--~~D 78 (319) ++.-.-+.+..++ ..--.-=|+.| +.++.+++++...+.+-. .++ -..-.++.+..|++++... ..- T Consensus 8 ~~~~~~~~~k~~t--~~d~~Gg~l~P------~~~~~~i~~~~e~s~~l~--~~~-vi~~~~~~~~~i~~~g~~~~~~~g 76 (315) T protein:vir:41 8 RGGKPFEIVPKID--VPDLGRGVLSV------DRFGEFVKAVRDSAVIIP--EAR-IDNALKSYEKDISRLSLVLDVGPG 76 (315) T ss_pred hcCChhhhhhhcC--CcCCCCceech------HHHHHHHHHHHhhhhhhh--hce-eeeccccccccccccccCcccccc Confidence 1111111111100 00000001222 223344444443322221 111 1112234455566654321 111 Q ss_pred ccCCC-Cc--ccCCcccceeEEEEeecccceeecchhhH--HHHhhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHh--- Q lcl|Aclame:pro 79 YKRNA-TN--EFDHPKIEETTYFLDQEKYWGRFVDALDR--KDTEGNIDINYVVARQGAEVVAPY-LDNLRFATLAR--- 149 (319) Q Consensus 79 Y~r~~-~~--~~~~~t~t~~tltidqdr~~~F~VD~~D~--~et~~~~~~~~~~~~~~~~~vape-iD~~~~s~la~--- 149 (319) ++-.+ .- +...++....++.+. +... .+.--+. +++....++.+.+....+.+++-. .+.+.-+.=++ T Consensus 77 ~~~~~~~~~~~~~~~~f~~~~l~~~--~l~~-~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s~~p 153 (315) T protein:vir:41 77 RDETGQKLAPPESTAEVKTNTLYMR--EMVT-KVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDTSSSDP 153 (315) T ss_pred cccccCcCCCCCCccccceeeecee--eeee-eccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccCCcCcCc Confidence 11111 11 112244444444333 2222 2221111 111111234455555555555543 33433331110 Q ss_pred ----------ccCccc-cccC---CHhHHHHHHHHHHHHHHhcc--CCCCcEEEEChHHHHHHhhhhhhhhcccccccce Q lcl|Aclame:pro 150 ----------NKAKHL-TVGT---GSDAQYDAVLDVSVELDEIK--APENRVLFVSPTFYKGIKKFVIALPQGDTRQQVL 213 (319) Q Consensus 150 ----------~a~~~~-~~~~---T~~n~~~~i~~a~~~Lde~~--VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~ 213 (319) .+.... .... +.....+.|.++...|...= -..+-+++|+++.+..+++-..-. 
..-.++..+ T Consensus 154 ~~~~~~G~l~~a~~~~~~~~~~~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~~~-g~~lw~~~~ 232 (315) T protein:vir:41 154 LLRMSDGWLKLASEKLTESDVDPEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDALKGR-ETGLGDQAL 232 (315) T ss_pred cccccccceecccccccccccccccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHhccC-CCccccchh Confidence 000000 0111 11223566777776665421 123557899999998876533211 112345566 Q ss_pred eeeeeeeecCeEEEEeccc---ccccceEEEEcCCceeeeeeeeeeeeecC-CCCCccceeeeeeeeeEEEeccccceEE Q lcl|Aclame:pro 214 GKGVQGELDGFVIVKVPTK---LLQGLQAIAVVGEVLASPIQADLAKTNSN-IPGMFGTLAEQLLYTGAFVPEHLQKYIF 289 (319) Q Consensus 214 ~~g~Vg~idG~~I~~vps~---~~~~~n~i~~~~~A~~~~~k~~~~~~~~~-~~~~~~~~v~gr~~yg~~V~~~k~~~Iy 289 (319) ..|....|.|.+|+.+|.- .....-+++++..-++.... ..+++.+. .....-..+.-++..|+.+......++- T Consensus 233 ~~g~~~tl~G~PV~~~~~m~~~~~~~~~ilf~d~~nl~~~~~-~~i~i~~~~~a~~~~~~~~~~~r~d~~~~~~~~~a~~ 311 (315) T protein:vir:41 233 TGANSILYDGRPVQYVPALEALNDGKSRALFVVPTQLVYGFW-RNIKVVPDYDAEMRLTKYVASLRTDNHYEDEEGAVSA 311 (315) T ss_pred hcCCCceecccceEecccccccCCCCccEEEecccceEEEec-cccEEEeeecCCCCceEEEEEEEeceeEEeccceeEe Confidence 7777788999999887642 22344567777765544433 23444321 1122224455556667765544444453 Q ss_pred EEcc Q lcl|Aclame:pro 290 TIGG 293 (319) Q Consensus 290 ~~~~ 293 (319) +-.. 
T Consensus 312 ~~~v 315 (315) T protein:vir:41 312 TITV 315 (315) T ss_pred eeeC Confidence 3332 No 186 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=77.89 E-value=0.12 Score=25.63 Aligned_cols=288 Identities=9% Similarity=-0.082 Sum_probs=123.7 Q ss_pred CCcccccccceeeehhhhhhhh--hcchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcce---eeeCCceEEeeeccc-c Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANK--SVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDA---IFMEGRSFTVMKGDT-T 74 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~--~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~~---~~~~g~tVkIp~i~~-~ 74 (319) |--| -+..|..+| .-.+|..+... ...+++....++ .|.++ ...+|+.|.||-+.. + T Consensus 1 Ma~T---------~l~D~iipe~~vf~~Yv~~~~~----e~~~l~qSGii~----~d~~l~~~~~~gG~~~~iPf~~~L~ 63 (349) T protein:vir:78 1 MAIT---------TIGDIVTGNIPVLASYMTEDPV----EKTAFFDSGILT----STPYAAEIANGPSNIANLPFWKAID 63 (349) T ss_pred CCce---------EEeeeeccCHHHHHHHHHHhhH----Hhhhhhhcccee----ccHHHHHHhhcCCCEEEeeeeecCC Confidence 2211 112233333 23334333332 233333322222 12222 236899999999976 4 Q ss_pred ccc--cccCCC---CcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 75 ELK--DYKRNA---TNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLAR 149 (319) Q Consensus 75 g~~--DY~r~~---~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~ 149 (319) |-. +|..++ ..+++.++. .+..-+-+.|...|...|+-.+-+. 
-+++...+.+-+.--.....+.+++.|.+ T Consensus 64 g~~e~nv~~D~~~~~~t~~kitt-~~~~a~~~~r~kaw~~~Dla~~lsG--~dpm~~Ia~~va~yW~r~~q~~Lia~L~G 140 (349) T protein:vir:78 64 TSIEPNYSNDVYQDIATPRAIQT-GEMMARVAYLNEGFGQADLTVELTS--QNPLQSVASRLDNFWQRQAQRRLIATALG 140 (349) T ss_pred CCcccccCCCCcccccccccccc-cceeeeeeeeccccchhHHHHHhhC--chHHHHHHHHHHHHHhhHHHHHHHHHHHH Confidence 543 353322 223444443 3333344566667777766655543 24443333322222223333444454431 Q ss_pred c-----cCccc-------cccCCH--hHHHHHHHHHHHHHHhc--cCCCC--cEEEEChHHHHHHhhhhhhhhccccccc Q lcl|Aclame:pro 150 N-----KAKHL-------TVGTGS--DAQYDAVLDVSVELDEI--KAPEN--RVLFVSPTFYKGIKKFVIALPQGDTRQQ 211 (319) Q Consensus 150 ~-----a~~~~-------~~~~T~--~n~~~~i~~a~~~Lde~--~VP~~--R~l~VsP~~~~~L~~~~~f~~~~~~~~~ 211 (319) - ++... +..++. ..-.+.+.++..+|.++ +-..+ ..++|-|.+|..|++..... .-+. T Consensus 141 vf~~~~~a~~~~~~~~~~t~d~s~~a~~~~~~~~dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~li~----~i~~ 216 (349) T protein:vir:78 141 LYNDNVSATDAYHEQNDMVVDVSATLGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLID----FIRD 216 (349) T ss_pred hhcccccccchhhhcccceeeeccccCCChhhhhhhHHHHHHHhccccccceeEEEEchHHHHHHHhhhhhh----hccC Confidence 1 11100 001111 11234556666666554 22333 46899999999998664321 1122 Q ss_pred ceeeeeeeeecCeEEEEeccc------ccccceEEEEcCCceeeeeee--eeeeeecCCCCCc--c-ceeeeeeeeeEEE Q lcl|Aclame:pro 212 VLGKGVQGELDGFVIVKVPTK------LLQGLQAIAVVGEVLASPIQA--DLAKTNSNIPGMF--G-TLAEQLLYTGAFV 280 (319) Q Consensus 212 ~~~~g~Vg~idG~~I~~vps~------~~~~~n~i~~~~~A~~~~~k~--~~~~~~~~~~~~~--~-~~v~gr~~yg~~V 280 (319) ...+..|+.+.|.+|+..-+- ..+.+-.++.-++|+.+-..- .-+|..|.+-... 
| +.+..|+.| + T Consensus 217 s~~~~~i~ty~G~~VivDD~~Pv~~~g~~~~yttylfg~GAi~~~~~~~~~~~et~rd~~~g~~~G~d~l~~R~~~---~ 293 (349) T protein:vir:78 217 AENNTMFATYQGYRVIVDDSMTVVGQGAQRKFISIIFGQGAIGYGEGNPVMPLEYEREASRANGGGVETLWTRKTW---L 293 (349) T ss_pred cccCcccceecCeEEEEeCCCccccCCCCceEEEEEeecceEEEccCCCccceeeecccccCCcceeEEEEEeeEE---E Confidence 234556788899999863111 112233345557777665432 2355555432221 2 555555544 3 Q ss_pred eccccceEEEEccccccCC--CCCcccccccccc-ccccccC Q lcl|Aclame:pro 281 PEHLQKYIFTIGGTEVATK--RDGVDAHADNVAK-PSGSLEM 319 (319) Q Consensus 281 ~~~k~~~Iy~~~~~~~a~~--~~~~~~~~~~~~~-~~~~~~~ 319 (319) +-|+.-. |... ..+. .+.++.+.+.+-+ ...+-|. T Consensus 294 ~hp~G~s-~~~a---~v~~~~~~~~~~sPt~aeLa~~~NW~~ 331 (349) T protein:vir:78 294 LHPFGYR-FTSA---VITGNGTETIARSASWQDLANATNWNR 331 (349) T ss_pred eeeeeee-eccc---cccCCccccccCCCChHHhcCCcCccc Confidence 3444311 2221 1111 0112222222222 1112222 No 187 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=77.00 E-value=0.13 Score=25.45 Aligned_cols=279 Identities=10% Similarity=0.073 Sum_probs=121.7 Q ss_pred CCc-cc-ccccceeeehhhhhhhh-h---cchhhhhhhHhhHHHHHHHHHhhhhhhhcccCcc-e-eeeCCceEEeeecc Q lcl|Aclame:pro 1 MNK-TI-KNATGMLKLNLQHFANK-S---VEPGQTLLKNKHVGILERVTAVNAYSTPALISND-A-IFMEGRSFTVMKGD 72 (319) Q Consensus 1 ~~~-~~-~~~~~~~~~~~~~~~~~-~---~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~-~-~~~~g~tVkIp~i~ 72 (319) |-+ .+ ....+.-.|+--..+.. + ..-.-.-|..--...|.+-|....-+|...+.+. 
+ .|...+.|.+-..+ T Consensus 371 lAr~~L~~rg~~~~~~~~~~~~~~a~~htTSDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~ 450 (693) T protein:vir:95 371 LARASLVDRGIGVASLNAPQMVGLAFTHTSSDFGLILLDVANKSVLAGWEEAEETFPLWTKSGILTDFKPARRVGLGEFS 450 (693) T ss_pred HHHHHHHhcCCccCCCCHHHHHHHHHhcCcchhHHHHHHHHHHHHHHHHHhhhhHHHHHhccCCCCcccccceeecCCCC Confidence 000 00 01111112221111111 1 1111112222233344444555555555443311 1 45555556553332 Q ss_pred ccccccccCCCCcccCCcccceeEEEEe-ecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 73 TTELKDYKRNATNEFDHPKIEETTYFLD-QEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNK 151 (319) Q Consensus 73 ~~g~~DY~r~~~~~~~~~t~t~~tltid-qdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a 151 (319) .+.....++.|.++.+....+++.+. +.|-|++.=-.+= |.++.+..-+-.....+.+..++..+|+.|..+. T Consensus 451 --~L~~V~E~gEyk~~t~~e~~e~~~l~tyG~~~~iTRqaiI----NDDLga~~~ip~~~g~aA~~~~~~~vy~~L~~Np 524 (693) T protein:vir:95 451 --SLRQVREGAEYKYVTLGERGEQIILATYGELFSITRQAII----NDDLQMLSDIPFKLGQAAKATIGDLVYAVLTGNP 524 (693) T ss_pred --ChhhcCCCCceeeeecCCccceeehhhcCCeeeecHHhhh----ccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCc Confidence 34444445666677776666666653 3555555432222 4444444444556666777888899999888663 Q ss_pred Cc-----------cc--c---ccCCHhHHHHHHHHHHHHHH---h-ccCC---CCcEEEEChHHHHHHhhhhhhhhcccc Q lcl|Aclame:pro 152 AK-----------HL--T---VGTGSDAQYDAVLDVSVELD---E-IKAP---ENRVLFVSPTFYKGIKKFVIALPQGDT 208 (319) Q Consensus 152 ~~-----------~~--~---~~~T~~n~~~~i~~a~~~Ld---e-~~VP---~~R~l~VsP~~~~~L~~~~~f~~~~~~ 208 (319) .- .+ + .+++.+.+-.....+.+.-+ + .+-+ ..++|+|+|+.....++ +..+... 
T Consensus 525 ~m~DGk~LFhadH~Nl~tga~sals~~sl~~a~~am~~qk~~~~~~~g~~L~i~P~~llvP~~le~~a~~---l~~s~~~ 601 (693) T protein:vir:95 525 AMSDGKTLFHADHSNLLTGAASALSIDSLSKAKTQMATQKAQVEKGKGRTLNIRPGFVLTPVALEDKANQ---IINSESV 601 (693) T ss_pred cccCCcceeeccccccccccccccChHHHHHHHHHHHHhhcchhccCCceeecccceEEecchHHHHHHH---Hhccccc Confidence 21 11 1 12333333333333332211 1 1222 36788888887764442 2211111 Q ss_pred cccceeeeeeeeecCe-EEEEecccccc---cceE-EEEcCCceeeeeeeeeee------eecC-CCCCccceeeeeeee Q lcl|Aclame:pro 209 RQQVLGKGVQGELDGF-VIVKVPTKLLQ---GLQA-IAVVGEVLASPIQADLAK------TNSN-IPGMFGTLAEQLLYT 276 (319) Q Consensus 209 ~~~~~~~g~Vg~idG~-~I~~vps~~~~---~~n~-i~~~~~A~~~~~k~~~~~------~~~~-~~~~~~~~v~gr~~y 276 (319) -....-.|.|--+.|+ +++..| ++. ...| ++..+..- ...+...+ +... .-+.+|-.+|=|+-| T Consensus 602 ~~a~~~~~~~NP~~~~~~vi~~p--rL~~~s~~~Wyl~a~~~~d--tie~~yL~G~~~P~ie~~~gf~~dG~~~kvr~D~ 677 (693) T protein:vir:95 602 PGADVNSGIVNPIRAFAQVIGEP--RLDDASATAWYMAAKKGSD--TIEVAYLDGVDTPYLEQQEGFTVDGVASKVRIDA 677 (693) T ss_pred cccccccccccchhccccccccc--eecCCCCCceEEecCCCCC--eEEEEEecCCCCCeEeecCCCCcceEEEEEEEec Confidence 0001112333334443 454333 332 2234 44444331 12212211 1110 112345577788889 Q ss_pred eEEEeccccceEEEEccc Q lcl|Aclame:pro 277 GAFVPEHLQKYIFTIGGT 294 (319) Q Consensus 277 g~~V~~~k~~~Iy~~~~~ 294 (319) |+.++|-+. 
+|=+.++ T Consensus 678 G~~~iD~Rg--~~kn~GA 693 (693) T protein:vir:95 678 GVAPLDFRG--LQKSNGA 693 (693) T ss_pred cCceeeccc--cccCCCC Confidence 999999774 5555554 No 188 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=74.99 E-value=0.15 Score=25.07 Aligned_cols=282 Identities=12% Similarity=0.026 Sum_probs=119.9 Q ss_pred CCcccc-ccc-ceeeehhhhhhhhhcc----hhhhhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccc Q lcl|Aclame:pro 1 MNKTIK-NAT-GMLKLNLQHFANKSVE----PGQTLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDT 73 (319) Q Consensus 1 ~~~~~~-~~~-~~~~~~~~~~~~~~~~----~n~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~ 73 (319) ++..+. +.. ..|+-.-+-|-+...+ ..-.-.++.+. .+++.+...+.+.. +++ .+-.+| ..+||.-+. T Consensus 52 ~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~--~~~--v~~~~~-~~~i~~~~~ 126 (381) T protein:vir:95 52 AERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLA--DLG--IKNAGL-RLKFLKSET 126 (381) T ss_pred HHHHHHhccCcccccHHHHHHHHHHhcccCCCCceecCHHHHHHHHHHHHhhcccee--hee--eEecCc-ceEEEEecC Confidence 000000 000 0111122222221111 00112344443 33333333333322 222 233344 468888766 Q ss_pred cccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHh-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|Aclame:pro 74 TELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTE-GNIDINYVVARQGAEVVAPYLDNLRFATLARNKA 152 (319) Q Consensus 74 ~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~-~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~ 152 (319) .+...+....+-..++.+.++..+++...+.-.|. . +..+-.. ...++...+.+..+.+++-.+|.-.+. 
+.| T Consensus 127 ~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~-~-is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~----G~G 200 (381) T protein:vir:95 127 SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFV-V-LPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK----GTG 200 (381) T ss_pred CcceeeecccccccccccccceeeeecceeEEeec-h-hhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEe----ccC Confidence 55554433222222333445666777766665442 1 1111111 112334445556666666666654321 111 Q ss_pred cc------------------------ccccCC---HhHHHHHHHHHHHHHHhcc-----CC-CCcEEEEChHHHHHHhhh Q lcl|Aclame:pro 153 KH------------------------LTVGTG---SDAQYDAVLDVSVELDEIK-----AP-ENRVLFVSPTFYKGIKKF 199 (319) Q Consensus 153 ~~------------------------~~~~~T---~~n~~~~i~~a~~~Lde~~-----VP-~~R~l~VsP~~~~~L~~~ 199 (319) .. ....++ ..+.++.|.+....+.... .+ .+-+++++|..+..|... T Consensus 201 ~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~ 280 (381) T protein:vir:95 201 KDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQ 280 (381) T ss_pred CCCceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccc Confidence 10 001111 2334666666666554321 12 456788999888766533 Q ss_pred hhhhhcccccccceeeeeeeeec--CeEEEEecccccccceEEEEcCCceeeeeeeeeeeeecCCCCC---ccceeeeee Q lcl|Aclame:pro 200 VIALPQGDTRQQVLGKGVQGELD--GFVIVKVPTKLLQGLQAIAVVGEVLASPIQADLAKTNSNIPGM---FGTLAEQLL 274 (319) Q Consensus 200 ~~f~~~~~~~~~~~~~g~Vg~id--G~~I~~vps~~~~~~n~i~~~~~A~~~~~k~~~~~~~~~~~~~---~~~~v~gr~ 274 (319) ..+.. .+|..-... |.+|+. +..++...+++|.-+-.....+ ..+++-.-.+-. +...++.+. 
T Consensus 281 ~~~~~---------~~G~~v~~l~~g~~vv~--s~~~p~~~iifgDfs~Y~i~~r-~~~~i~~~~~~~~~~d~~~f~a~~ 348 (381) T protein:vir:95 281 YTHLN---------ANGVYVTALPFNLNVIE--STVQEAGKVLTYVKGLYDGYLA-GGINVQKFKETLALDDMDLYTAKQ 348 (381) T ss_pred cccCC---------CCCceeecCCCCceEEe--cCCCCcCcEEEEecccEEEEEe-cccEEEeechhHhhcCCeEEEEEE Confidence 32111 122211222 445554 4556655566665544333332 222222212222 335899999 Q ss_pred eeeEEEeccccceEEEEcc--ccccCCCCCcccccccccc Q lcl|Aclame:pro 275 YTGAFVPEHLQKYIFTIGG--TEVATKRDGVDAHADNVAK 312 (319) Q Consensus 275 ~yg~~V~~~k~~~Iy~~~~--~~~a~~~~~~~~~~~~~~~ 312 (319) ++|.+++++++..++--.. .++++. ...|. + T Consensus 349 r~dg~~~~~~A~~v~~l~~~~~~~~~~---~~~~~----~ 381 (381) T protein:vir:95 349 FAYGKAKDNKVAAVWKLDLKGHKPALE---GTEET----L 381 (381) T ss_pred EEcCEEecCceEEEEEEEecCCCcCcc---ccccc----C Confidence 9999999999865544322 222221 11111 1 No 189 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=74.99 E-value=0.15 Score=25.07 Aligned_cols=282 Identities=12% Similarity=0.026 Sum_probs=119.9 Q ss_pred CCcccc-ccc-ceeeehhhhhhhhhcc----hhhhhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeCCceEEeeeccc Q lcl|Aclame:pro 1 MNKTIK-NAT-GMLKLNLQHFANKSVE----PGQTLLKNKHV-GILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDT 73 (319) Q Consensus 1 ~~~~~~-~~~-~~~~~~~~~~~~~~~~----~n~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~ 73 (319) ++..+. +.. ..|+-.-+-|-+...+ ..-.-.++.+. .+++.+...+.+.. +++ .+-.+| ..+||.-+. 
T Consensus 52 ~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~--~~~--v~~~~~-~~~i~~~~~ 126 (381) T protein:vir:10 52 AERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLA--DLG--IKNAGL-RLKFLKSET 126 (381) T ss_pred HHHHHHhccCcccccHHHHHHHHHHhcccCCCCceecCHHHHHHHHHHHHhhcccee--hee--eEecCc-ceEEEEecC Confidence 000000 000 0111122222221111 00112344443 33333333333322 222 233344 468888766 Q ss_pred cccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHh-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|Aclame:pro 74 TELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTE-GNIDINYVVARQGAEVVAPYLDNLRFATLARNKA 152 (319) Q Consensus 74 ~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~-~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~ 152 (319) .+...+....+-..++.+.++..+++...+.-.|. . +..+-.. ...++...+.+..+.+++-.+|.-.+. +.| T Consensus 127 ~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~-~-is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~----G~G 200 (381) T protein:vir:10 127 SGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFV-V-LPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK----GTG 200 (381) T ss_pred CcceeeecccccccccccccceeeeecceeEEeec-h-hhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEe----ccC Confidence 55554433222222333445666777766665442 1 1111111 112334445556666666666654321 111 Q ss_pred cc------------------------ccccCC---HhHHHHHHHHHHHHHHhcc-----CC-CCcEEEEChHHHHHHhhh Q lcl|Aclame:pro 153 KH------------------------LTVGTG---SDAQYDAVLDVSVELDEIK-----AP-ENRVLFVSPTFYKGIKKF 199 (319) Q Consensus 153 ~~------------------------~~~~~T---~~n~~~~i~~a~~~Lde~~-----VP-~~R~l~VsP~~~~~L~~~ 199 (319) .. ....++ ..+.++.|.+....+.... .+ .+-+++++|..+..|... 
T Consensus 201 ~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~ 280 (381) T protein:vir:10 201 KDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQ 280 (381) T ss_pred CCCceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccc Confidence 10 001111 2334666666666554321 12 456788999888766533 Q ss_pred hhhhhcccccccceeeeeeeeec--CeEEEEecccccccceEEEEcCCceeeeeeeeeeeeecCCCCC---ccceeeeee Q lcl|Aclame:pro 200 VIALPQGDTRQQVLGKGVQGELD--GFVIVKVPTKLLQGLQAIAVVGEVLASPIQADLAKTNSNIPGM---FGTLAEQLL 274 (319) Q Consensus 200 ~~f~~~~~~~~~~~~~g~Vg~id--G~~I~~vps~~~~~~n~i~~~~~A~~~~~k~~~~~~~~~~~~~---~~~~v~gr~ 274 (319) ..+.. .+|..-... |.+|+. +..++...+++|.-+-.....+ ..+++-.-.+-. +...++.+. T Consensus 281 ~~~~~---------~~G~~v~~l~~g~~vv~--s~~~p~~~iifgDfs~Y~i~~r-~~~~i~~~~~~~~~~d~~~f~a~~ 348 (381) T protein:vir:10 281 YTHLN---------ANGVYVTALPFNLNVIE--STVQEAGKVLTYVKGLYDGYLA-GGINVQKFKETLALDDMDLYTAKQ 348 (381) T ss_pred cccCC---------CCCceeecCCCCceEEe--cCCCCcCcEEEEecccEEEEEe-cccEEEeechhHhhcCCeEEEEEE Confidence 32111 122211222 445554 4556655566665544333332 222222212222 335899999 Q ss_pred eeeEEEeccccceEEEEcc--ccccCCCCCcccccccccc Q lcl|Aclame:pro 275 YTGAFVPEHLQKYIFTIGG--TEVATKRDGVDAHADNVAK 312 (319) Q Consensus 275 ~yg~~V~~~k~~~Iy~~~~--~~~a~~~~~~~~~~~~~~~ 312 (319) ++|.+++++++..++--.. .++++. ...|. 
+ T Consensus 349 r~dg~~~~~~A~~v~~l~~~~~~~~~~---~~~~~----~ 381 (381) T protein:vir:10 349 FAYGKAKDNKVAAVWKLDLKGHKPALE---GTEET----L 381 (381) T ss_pred EEcCEEecCceEEEEEEEecCCCcCcc---ccccc----C Confidence 9999999999865544322 222221 11111 1 No 190 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=74.96 E-value=0.15 Score=25.07 Aligned_cols=285 Identities=11% Similarity=0.010 Sum_probs=122.5 Q ss_pred CCcccccccceeeehhhhhhhhhcchhhhhhhHhhHHHHHHHHH--hhhhhhhcccCcceeee-CCceEEeeeccccccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTA--VNAYSTPALISNDAIFM-EGRSFTVMKGDTTELK 77 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~l~~ky~~lld~~~~--~~sl~~~~~~n~~~~~~-~g~tVkIp~i~~~g~~ 77 (319) ||-- .++-.+-.-.++|.....-+-. +-+++....+..+++. ...+++..+..=.-... +..++..+.....|.. T Consensus 3 ~~~~-~~~~~~~~~~~~~~~~~~d~~~-~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~~~~et~~~~~~e~~G~a 80 (314) T protein:vir:10 3 IKFD-AEQAKITTHLEQMGVEKADAAG-IWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIPGHAKYFEYPEFDGVGIA 80 (314) T ss_pred cchH-HHHHHHHHHHHhhcccchhhhH-HHHHHHHHHHHHHHhhhhccccccceeeccccCCCCceeEEEeeeeccccce Confidence 2222 1111111222233322222111 1233333333333442 12233332221000111 2337777777776643 Q ss_pred c-ccC-CCCcccCCcccceeEEEEee-cccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc--- Q lcl|Aclame:pro 78 D-YKR-NATNEFDHPKIEETTYFLDQ-EKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNK--- 151 (319) Q Consensus 78 D-Y~r-~~~~~~~~~t~t~~tltidq-dr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a--- 151 (319) . |.- +.....-+++.+++...+-. ..+|.+.+.++...+..+ .++...-..-++.++.-..|+..|---+... 
T Consensus 81 ~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g-~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~~G 159 (314) T protein:vir:10 81 QIIADYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATG-QSLSARKQALAFEAHDNLLDKLVWSGSAPHGIVS 159 (314) T ss_pred eeeCCcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhC-CChHHHHHHHHHHHHHHhhceEEEeeccccccee Confidence 2 322 22233346666777776655 567777777776666543 3443333445555555555554331011000 Q ss_pred -----Ccc----ccccCCHhHHHHHHHHHHHHHHhc--cCCCCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeee Q lcl|Aclame:pro 152 -----AKH----LTVGTGSDAQYDAVLDVSVELDEI--KAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGE 220 (319) Q Consensus 152 -----~~~----~~~~~T~~n~~~~i~~a~~~Lde~--~VP~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~ 220 (319) +.. .+.-.|++++++-|..+..+|.+. ++-..-.|+++|+.|.+|..-..-. ..+..+-...++ T Consensus 160 LlN~p~v~~~~~~~~WaT~~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~~L~~~~~~~-~~tvl~~l~~n~---- 234 (314) T protein:vir:10 160 VFDQPNINNVVATPNWSVPQNAIDDVTAMIDAVESSTQGLHHVTDILLPASARRVMQGLVPQT-NLSYGELFTRNN---- 234 (314) T ss_pred EeecCCCccccCCCCcccHHHHHHHHHHHHHHHHHhcCccccceeEEecHHHHHhhcccccCC-CccHHHHHHHhC---- Confidence 000 011137889999999999999875 4333456889999998774211000 000111111111 Q ss_pred ecCeEEEEeccc---ccccce-EEEEcCCceeeeeee-eeeeeecCCC-CCccceeee-eeeeeEEEeccccceEEEEcc Q lcl|Aclame:pro 221 LDGFVIVKVPTK---LLQGLQ-AIAVVGEVLASPIQA-DLAKTNSNIP-GMFGTLAEQ-LLYTGAFVPEHLQKYIFTIGG 293 (319) Q Consensus 221 idG~~I~~vps~---~~~~~n-~i~~~~~A~~~~~k~-~~~~~~~~~~-~~~~~~v~g-r~~yg~~V~~~k~~~Iy~~~~ 293 (319) -+.+|..+|.- ...+.+ +++...+.-..-.-+ ..++.. |.| ....+.+.+ -++.|+.|.+|...+. 
+ .+ T Consensus 235 -~~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l-~~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~-~-dG 310 (314) T protein:vir:10 235 -PGLTIRFLQFLDNYDGAGGKAALAFEKSPLNMSIEIPEVTNVL-PAQPKDLHFRYPVTSKATGLIVYRPLTMAV-I-KG 310 (314) T ss_pred -CCcEEEEcccccccCCCcceEEEEEecCCcEEEEecCccceee-cceecCceEEEcceeeeEEEEEECcceeEe-e-ee Confidence 23445444321 111222 223222221111111 122222 223 334566633 3667899999987321 1 11 Q ss_pred cccc Q lcl|Aclame:pro 294 TEVA 297 (319) Q Consensus 294 ~~~a 297 (319) -+=| T Consensus 311 I~~~ 314 (314) T protein:vir:10 311 ITFA 314 (314) T ss_pred eecC Confidence 1111 No 191 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=64.64 E-value=0.3 Score=23.48 Aligned_cols=285 Identities=11% Similarity=0.016 Sum_probs=117.0 Q ss_pred CCc-----------cccccc--ceeeehhhhhhhhhcchh----hhhhhHhhH-HHHHHHHHhhhhhhhcccCcceeeeC Q lcl|Aclame:pro 1 MNK-----------TIKNAT--GMLKLNLQHFANKSVEPG----QTLLKNKHV-GILERVTAVNAYSTPALISNDAIFME 62 (319) Q Consensus 1 ~~~-----------~~~~~~--~~~~~~~~~~~~~~~~~n----~~~l~~ky~-~lld~~~~~~sl~~~~~~n~~~~~~~ 62 (319) .+. .+-... ..|+-...-|-+...+.- -.-.++.+. .+++.+...+.+.. 
+++ .+-.+ T Consensus 41 ~~~~~~~~~~e~~~~~~~~~~~~~l~~~e~~~~~~~~~~t~~~Gg~lvP~~~~~~I~~~l~~~spir~--~a~--v~~~~ 116 (381) T protein:vir:10 41 FEETKLQAKAEAERVSSLPKSAQTLSANQRNFFMDINKSVGYKEEKLLPEETIDRIFEDLTTNHPLLA--DLG--IKNAG 116 (381) T ss_pred hhhHHHHHHHHHHHHHHhcccccccCHHHHHHHHHHhhcCCCCCceecCHHHHHHHHHHHHhhcceee--eee--eEecC Confidence 000 000000 011111111111111000 012334443 33333332222211 222 23334 Q ss_pred CceEEeeeccccccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHh-hhHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 63 GRSFTVMKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTE-GNIDINYVVARQGAEVVAPYLDN 141 (319) Q Consensus 63 g~tVkIp~i~~~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~-~~~~~~~~~~~~~~~~vapeiD~ 141 (319) | ..+||.-+..+...+....+-..+..+.+...++++..|.-.+- . +..+--. ...++...+.+..+..++-.+|+ T Consensus 117 ~-~~~i~~~~~~~~a~W~~e~~~~~~~~~~~f~~i~l~~~kl~a~i-~-is~elL~Ds~~~le~~i~~~la~~~a~~~~~ 193 (381) T protein:vir:10 117 L-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFV-V-LPKDLNDFGPAWIERFVRVQIEEAFAVALET 193 (381) T ss_pred c-ceEEEeecCCcceEEeecccccccccCccceeEeecceeEEeec-c-ccHHHHhccHHHHHHHHHHHHHHHHHHHhhc Confidence 4 56788776655554422222222333445666777766654432 1 2211111 01233444555666666666665 Q ss_pred HHHHHHHhccCccc------------------------cccCC---HhHHHHHHHHHHHHHHhc------cCCCCcEEEE Q lcl|Aclame:pro 142 LRFATLARNKAKHL------------------------TVGTG---SDAQYDAVLDVSVELDEI------KAPENRVLFV 188 (319) Q Consensus 142 ~~~s~la~~a~~~~------------------------~~~~T---~~n~~~~i~~a~~~Lde~------~VP~~R~l~V 188 (319) ..+. +.|+.. ...+| ....++.+......+.-. 
.+-.+.++++ T Consensus 194 afi~----GdG~~qP~Gil~~~~~~~~~~~g~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vm 269 (381) T protein:vir:10 194 AFLK----GTGKDQPIGLNRQVQKGVSVTDGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVV 269 (381) T ss_pred eeEe----cccCCCceeeeecCCccccccccccccccccccccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEE Confidence 4321 111110 00111 122344444433333211 1124667889 Q ss_pred ChHHHHHHhhhhhhhhcccccccceeeeeeeee-cCeEEEEecccccccceEEEEcCCceeeeeeeeeeeeecCCCCC-- Q lcl|Aclame:pro 189 SPTFYKGIKKFVIALPQGDTRQQVLGKGVQGEL-DGFVIVKVPTKLLQGLQAIAVVGEVLASPIQADLAKTNSNIPGM-- 265 (319) Q Consensus 189 sP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~i-dG~~I~~vps~~~~~~n~i~~~~~A~~~~~k~~~~~~~~~~~~~-- 265 (319) +|..+..|.....+.. ..|+ .|..+ .|.+|+. +..++...+++|.-+-...... ..+++-...+-. T Consensus 270 n~~t~~~l~~~~~~~~--~~G~------~v~~lp~g~~vv~--~~~~p~~~i~fGDfs~Y~i~~r-~~~~i~~~~~~~~~ 338 (381) T protein:vir:10 270 NPSDAFEVQAQYTHLN--ANGV------YVTALPFNLNVIE--STVQEAGKVLTYVKGLYDGYLA-GGINVQKFKETLAL 338 (381) T ss_pred chhhHHhhccccccCC--CCCc------eeecCCCCceeEE--cCCCCcCcEEEEEcccEEEEEe-cccEEEeechhhhh Confidence 9998877754332211 1111 11112 2666765 4455555566555444332222 222222212222 Q ss_pred -ccceeeeeeeeeEEEeccccceEEEEccccccCCCCCccccc Q lcl|Aclame:pro 266 -FGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDGVDAHA 307 (319) Q Consensus 266 -~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~~~~~~ 307 (319) +...++.+.++|.++++|++..+|--.-..+.+...+..+.- T Consensus 339 ~d~~~f~a~~r~dG~~~~~~A~~v~~l~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:10 339 DDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEDTEETL 381 (381) T ss_pred cCceEEEEEEEEcCEEecCCcEEEEEEeecCCccccccccccC Confidence 335899999999999999998776554333333222222211 No 192 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 
Probab=60.45 E-value=0.37 Score=22.94 Aligned_cols=272 Identities=8% Similarity=0.050 Sum_probs=121.1 Q ss_pred CCcccccccceeeehhhhhhhhhc----chhhhhhhHhhHHHHHHHHHhhhhhhhcccCcc-e-eeeCCceEEeeecccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSV----EPGQTLLKNKHVGILERVTAVNAYSTPALISND-A-IFMEGRSFTVMKGDTT 74 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~----~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n~~-~-~~~~g~tVkIp~i~~~ 74 (319) +...-....| |+..-.+..+. .---.-|..--...|.+-|....-+|...+... + .|...+.|.+-. .. T Consensus 341 L~~~G~~~~~---~~~~~~v~~A~~hsTsDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~--~~ 415 (652) T protein:vir:79 341 LTERGIGVSS---YNPMQMVGAAFTHSTSDFGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDFKIAHRVGMGG--FS 415 (652) T ss_pred HHhhccCCCC---CCHHHHHHHHhhcCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHhccCCCccccccceeecCC--CC Confidence 1111111112 22111122211 111112222222333444444455555444321 1 455566666643 23 Q ss_pred ccccccCCCCcccCCcccceeEEEEee-cccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCc Q lcl|Aclame:pro 75 ELKDYKRNATNEFDHPKIEETTYFLDQ-EKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAK 153 (319) Q Consensus 75 g~~DY~r~~~~~~~~~t~t~~tltidq-dr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~ 153 (319) .+....-++.|.++.+....+++.+.. .|-|++.=-.+= |.++.+-.-+-.....+.+..++..+|+.|..+..- T Consensus 416 ~L~~V~E~gEyk~~t~~e~~e~~~l~tyG~~~~iTRqaiI----NDDL~a~~~ip~~~g~aA~~~~~~~vy~~l~~Np~~ 491 (652) T protein:vir:79 416 ALRQVREGAEYKYVTTGDKQATIALATYGELFSITRQAII----NDDLNMLTDVPMKLGRAAKSTIADLVYAILTSNPKI 491 (652) T ss_pred CccccCCCCccceeeecCccceeeeecccCeeeeehheee----ccchhHHHHHHHHHHHHHHHHHHHHHHHHHhcCccc Confidence 344444566777777777777777753 566666533332 222333333344455567777888889888766421 Q ss_pred c-------------c---cccCCHhHHHHHHHHHHHHHHhc---c--CC-CCcEEEEChHHHHHHhhhhhhhhccccccc Q lcl|Aclame:pro 154 H-------------L---TVGTGSDAQYDAVLDVSVELDEI---K--AP-ENRVLFVSPTFYKGIKKFVIALPQGDTRQQ 211 (319) Q Consensus 154 ~-------------~---~~~~T~~n~~~~i~~a~~~Lde~---~--VP-~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~ 211 (319) . 
+ +.+++ .+.|..++..|..+ + .. .+++|+|+|+.....++ +..+...... T Consensus 492 ~~DGk~LF~hA~H~Nl~~~aa~~----~~~l~~ar~aM~~Qk~g~~~l~i~P~~llvp~~le~~a~~---ll~s~~v~~a 564 (652) T protein:vir:79 492 STDNVSLFDKAKHANVLESAAMD----VASLDKARQLMRVQKEGERHLNIRPAFVLVPTAMESVANQ---VIRSSSVKGA 564 (652) T ss_pred ccCCceeecccccccccccccCC----HHHHHHHHHHHHHhccCCccccccccEEEecchhHHHHHH---HhccCCCccc Confidence 1 1 11122 23344444444322 2 22 47899999987654432 2222111111 Q ss_pred ceeeeeeeeecCe-EEEEeccccc---ccceEEEEcCCceeeeeeeeee------eeecC-CCCCccceeeeeeeeeEEE Q lcl|Aclame:pro 212 VLGKGVQGELDGF-VIVKVPTKLL---QGLQAIAVVGEVLASPIQADLA------KTNSN-IPGMFGTLAEQLLYTGAFV 280 (319) Q Consensus 212 ~~~~g~Vg~idG~-~I~~vps~~~---~~~n~i~~~~~A~~~~~k~~~~------~~~~~-~~~~~~~~v~gr~~yg~~V 280 (319) ..-.|.|--+.|+ +|+..| ++ ....|++..+... -...+... .+.+. .-+.+|-.+|=|+=||+.+ T Consensus 565 ~~~~~~~Np~~~~~~~i~ep--rL~~~s~~~wylaa~~~~-dtiev~yL~G~~~P~ie~~~gf~~dG~~~kvrlD~G~~~ 641 (652) T protein:vir:79 565 DINAGIINPVKDFATVIAEP--RLDDNSQTTFYLAASKGS-DTIEVAYLNGVDTPYIDQMEGFSVDGVTTKVRIDAGVAP 641 (652) T ss_pred cccccccccccccccccccc--ccCCCCcccEEEecCCCC-CeEEEEEecCCCCCeeeecCCCCcceEEEEEEEeccCce Confidence 1112333334443 555433 22 2233544433322 11111111 11110 1233566778888899999 Q ss_pred eccccceEEEEcc Q lcl|Aclame:pro 281 PEHLQKYIFTIGG 293 (319) Q Consensus 281 ~~~k~~~Iy~~~~ 293 (319) +|-+. +|=... 
T Consensus 642 iD~RG--~~k~t~ 652 (652) T protein:vir:79 642 VDHRG--LVKCTA 652 (652) T ss_pred eeccc--eeeecC Confidence 99775 332221 No 193 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=42.96 E-value=0.86 Score=20.93 Aligned_cols=273 Identities=12% Similarity=0.066 Sum_probs=115.5 Q ss_pred CCccccccccee-------eehhhh----------------hhhhhcchh----hhhhhHhhH----HHHHHHHHhhhhh Q lcl|Aclame:pro 1 MNKTIKNATGML-------KLNLQH----------------FANKSVEPG----QTLLKNKHV----GILERVTAVNAYS 49 (319) Q Consensus 1 ~~~~~~~~~~~~-------~~~~~~----------------~~~~~~~~n----~~~l~~ky~----~lld~~~~~~sl~ 49 (319) .++.-....++. .+.+-| |+.-+..-. +..+...|. .++++.-...++- T Consensus 86 ~~~~r~~p~~~~veyRSaGE~lkal~~~~~Gd~~A~~~~e~~r~a~~~~~Tgd~~~~i~~~~v~d~i~li~q~r~i~slf 165 (410) T protein:vir:83 86 ISAMRGSPVGTEVEYRSAGEYMLDMWNSAQGNASAADRLEVYARAADHQKTGDLQGVIPDPIVGPVIDFIDSARPLVSTL 165 (410) T ss_pred hccCcCCCCCCCcccccHHHHHHHHhccCCchHHHHHHHHHHHHhhccCcccccccccchhHhhhHHHHHhhccchhhhh Confidence 000000000000 001111 111111100 001222222 2233222211211 Q ss_pred hhcccCcceeeeCCceEEeeecc-cccccccc-------CCCCcccCCcccceeEEEEeeccccee----ecchhhHHHH Q lcl|Aclame:pro 50 TPALISNDAIFMEGRSFTVMKGD-TTELKDYK-------RNATNEFDHPKIEETTYFLDQEKYWGR----FVDALDRKDT 117 (319) Q Consensus 50 ~~~~~n~~~~~~~g~tVkIp~i~-~~g~~DY~-------r~~~~~~~~~t~t~~tltidqdr~~~F----~VD~~D~~et 117 (319) ..+ -..|.|+..|-.. +.+++.|. ..+...++.++.+..+-.|+---+..+ .|+-.+..-. 
T Consensus 166 ~tL-------P~~g~T~eY~v~t~~~tV~~q~~~~kqa~EGd~L~~gKl~~~t~tA~ikTyGGyt~LSRQ~IERs~v~~L 238 (410) T protein:vir:83 166 GTL-------PLNNATFYRPIVSQRPAVGLQGVAGGASDEKTELDSQKMVIDRLTVNAKTLGGYVNVSRQAIDFSSPSAL 238 (410) T ss_pred hhC-------CCCCCeeEEeeecccccccccccccccccccccccccceeeeeccceeehhcCcccccceeeecCChhhH Confidence 111 1125566665442 34566553 122345566666666666654333332 2333333222 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccCCHhHHHHHHHHHHHHHHhccCC-CCcEEEEChHHHHHH Q lcl|Aclame:pro 118 EGNIDINYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDVSVELDEIKAP-ENRVLFVSPTFYKGI 196 (319) Q Consensus 118 ~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~~a~~~~~~~~T~~n~~~~i~~a~~~Lde~~VP-~~R~l~VsP~~~~~L 196 (319) +..+.+. ...-+-+.--.+-+++.+.+... .....+|+++....|-++....+++.-. .-++|.|+|++++-+ T Consensus 239 ~~~lraL---~~AYA~atea~vra~L~~t~t~~---~a~~~~Tad~~~~~i~da~~~v~da~~~~~~~~i~vS~DVl~~~ 312 (410) T protein:vir:83 239 DLVVNGL---GQQYAIETEALVGAALASTSTGA---VGYGNATADNVASAIWQAAGAVYTAVKGMGRLVIAIAPDVLGDF 312 (410) T ss_pred HHHHHHH---HHHHHHHHHHHHHHHHHHhhhhh---hhhhhccHHHHHHHHHHHHHHHhhhhccceeeeEEechhhhhhc Confidence 2222221 11111111112223333333322 2344568899888888988888886323 346799999996433 Q ss_pred hhhhhhhhcccccc-----cc--eeeeeeeeecCeEEEEecccccccceEEEEcCCceeeeeee-eeeeeecCCCCCccc Q lcl|Aclame:pro 197 KKFVIALPQGDTRQ-----QV--LGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQA-DLAKTNSNIPGMFGT 268 (319) Q Consensus 197 ~~~~~f~~~~~~~~-----~~--~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~~~k~-~~~~~~~~~~~~~~~ 268 (319) .+.|..-...+. +. 
+-.|.=|++.|++|+..|.....+..|| .+.|+.....- --+.+-+ ++.+.- T Consensus 313 --~~~f~~~~~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm~~~a~AgTA~f~--~~~Ai~~~eS~~gp~qL~d--~~i~nL 386 (410) T protein:vir:83 313 --GPLFAPVNPTNAHSTGFEAGRFGQGVMGSISGIPVVMSAALGSGDAYLF--STAAIECFEQRVGTLQVVE--PSVFGL 386 (410) T ss_pred --cceeeccCCCCcccccccccccccchhhhhcccceEEecCCCcCeeeEe--ccceeeeeecCCceeEeeC--Cchhhh Confidence 244422111111 11 1256668899999998887776677776 45555544331 1122221 222221 Q ss_pred eeeeeeeeeEEEeccccceEEEEccc Q lcl|Aclame:pro 269 LAEQLLYTGAFVPEHLQKYIFTIGGT 294 (319) Q Consensus 269 ~v~gr~~yg~~V~~~k~~~Iy~~~~~ 294 (319) ...+--||..-+..+ +||.-..++ T Consensus 387 t~~ySgY~a~a~~~~--~gliPv~g~ 410 (410) T protein:vir:83 387 QVAYAGYFSTLVVNE--DAIVPLVGS 410 (410) T ss_pred hhhheeeeeeccccc--cceeeeccC Confidence 222224444444444 456555554 No 194 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=40.27 E-value=0.98 Score=20.63 Aligned_cols=283 Identities=13% Similarity=-0.020 Sum_probs=109.6 Q ss_pred CCcccccccceeeehhhhhhhhhcchhhhhhh------HhhHHHHHHHHHhhhhhhhcccCcceeeeCCceEEeeecccc Q lcl|Aclame:pro 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLK------NKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTT 74 (319) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~l~------~ky~~lld~~~~~~sl~~~~~~n~~~~~~~g~tVkIp~i~~~ 74 (319) |. .|+|----||++++...-+....-.|. ++....+++++.. +..+---+.+....++..|++++.. 
T Consensus 1 ~~---~~~~~~~~~n~~~~~i~k~~it~~~l~~g~L~p~~a~~Fl~~v~~~----t~iL~~~r~~~~~s~~~ei~kig~G 73 (360) T protein:vir:99 1 MS---SNSTIDSVRNQNMNSLSQKDIGLAELDGFQLPVDVTEEFLERMQKG----VQILGMADTMTLARLEMEVPQFGVP 73 (360) T ss_pred Cc---chhHHHHHhhhHHHHHHhhhccccccCceeecHHHHHHHHHHHhhc----cchhhhcceeecccccccccccccc Confidence 21 122222224555555544332222221 2223333444332 2222112235566778888887763 Q ss_pred ccc--cccCCCCcccCCcccce--eEE-EEeecccceeecchhhHH--H---HhhhHHHHHHHHHHHHHHH---HHHH-- Q lcl|Aclame:pro 75 ELK--DYKRNATNEFDHPKIEE--TTY-FLDQEKYWGRFVDALDRK--D---TEGNIDINYVVARQGAEVV---APYL-- 139 (319) Q Consensus 75 g~~--DY~r~~~~~~~~~t~t~--~tl-tidqdr~~~F~VD~~D~~--e---t~~~~~~~~~~~~~~~~~v---apei-- 139 (319) .+. -++-++. ..+..+.+. ..+ .++.-+.+.-.+.+-..+ + .++.-.+++.+++..+.-+ .-.- T Consensus 74 ~r~~r~~~e~~~-~~~~~~~~~~~v~~~~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~g~~ 152 (360) T protein:vir:99 74 RLSGHTRDEEGS-RTENSEAESGSVKFNATDKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIRAGA 152 (360) T ss_pred eeeccccccCCC-CCcCCcCccccCccccccceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhhccc Confidence 322 2222211 111111111 111 122112222111111111 1 0000122333332222211 0000 Q ss_pred -----------HHHH---HH--HHHhc-------cCccccccCCH------------------------hHHHHHHHHHH Q lcl|Aclame:pro 140 -----------DNLR---FA--TLARN-------KAKHLTVGTGS------------------------DAQYDAVLDVS 172 (319) Q Consensus 140 -----------D~~~---~s--~la~~-------a~~~~~~~~T~------------------------~n~~~~i~~a~ 172 (319) |.|. -+ +.|.+ |+. ...++. ..-...|.++. 
T Consensus 153 ds~d~~~~~~~d~fl~~~dGwlKka~~~~~~id~a~d--~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~~~~ 230 (360) T protein:vir:99 153 SSGNLQSIGGAAELDNTFKGWIARAEGDAQSVDDAGD--STRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLFNETI 230 (360) T ss_pred hhcccccCcccchhhhhhHHHHHHhhcccchhhcccc--ccccccccccccccccchhhhccccccccccchHHHHHHHH Confidence 1110 00 11100 000 000000 00122345666 Q ss_pred HHHHhcc--CC-CCcEEEEChHHHHHHhhhhhhhhcccccccceeeeeeeeecCeEEEEecccccccceEEEEcCCceee Q lcl|Aclame:pro 173 VELDEIK--AP-ENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLAS 249 (319) Q Consensus 173 ~~Lde~~--VP-~~R~l~VsP~~~~~L~~~~~f~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~ 249 (319) ..|...- =| ...+++++|..+...+..-. .|.+..|.+.+..+..-.+.|++|+.|| .+++-.+|+.+|.-+++ T Consensus 231 ~~Lp~kyr~~~~~~~~~~~s~~~~~~yr~~L~-~R~t~LGd~~l~g~~~~~~~Gipi~~v~--~~pd~~~mlT~p~NLi~ 307 (360) T protein:vir:99 231 QTLDSRYRESDAYSPVLMTSPNQVQSYTMSLT-EREDPLGSAVIFGDSDITPFSYDLVGVN--GFPDEYMMFTDPNNLAF 307 (360) T ss_pred HhcchhhhcCcccceEEEccCchHHHHHHHHh-ccCcccchhheecccccccceeeeEEcC--CCCCCceEEeccCceeE Confidence 6664431 12 23367888876544432211 2334566667775555568899998887 44555688888877744 Q ss_pred eeeeeeeeeecC-CCCC-cc---ceeeeeeeeeEEEecc-ccceEEEEcccccc Q lcl|Aclame:pro 250 PIQADLAKTNSN-IPGM-FG---TLAEQLLYTGAFVPEH-LQKYIFTIGGTEVA 297 (319) Q Consensus 250 ~~k~~~~~~~~~-~~~~-~~---~~v~gr~~yg~~V~~~-k~~~Iy~~~~~~~a 297 (319) .-...+|+..- .+.. .. +.+..+..+--++++. 
.+.++-.....|+| T Consensus 308 -g~~~~iri~~~~e~~~~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~~~~~~~ 360 (360) T protein:vir:99 308 -GLYEEMELDQSTDTDKVHEQRLHSRNWLEGQFDFQIKEQQAGVLVTDLETPTA 360 (360) T ss_pred -EeeeeeEEeecccchhhhhhceeeeEEEEEEeeEEEEecccEEEEecCCCCCC Confidence 34455555311 1111 11 1222222234455555 45445556666666 No 195 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=33.06 E-value=1.4 Score=19.81 Aligned_cols=286 Identities=8% Similarity=-0.079 Sum_probs=116.5 Q ss_pred eeeehhhhhhhhhcchhhhhhhHhhHHHHHHH-HHhhhhhhhccc-CcceeeeCCceEEeeeccc-c-cc---ccccCCC Q lcl|Aclame:pro 11 MLKLNLQHFANKSVEPGQTLLKNKHVGILERV-TAVNAYSTPALI-SNDAIFMEGRSFTVMKGDT-T-EL---KDYKRNA 83 (319) Q Consensus 11 ~~~~~~~~~~~~~~~~n~~~l~~ky~~lld~~-~~~~sl~~~~~~-n~~~~~~~g~tVkIp~i~~-~-g~---~DY~r~~ 83 (319) |-.-.++-|- .|+ ...|...+.+- -+.+..+...++ .+ ....|+-|.+|-+.. . +. .+|+-++ T Consensus 1 m~lsD~~vfN-~~~-------~~a~~e~~~q~~~~fn~as~gai~l~~--~~~~Gd~~~~pf~~~l~g~~~~~~~~~~~~ 70 (325) T protein:vir:95 1 MALSDLAVYS-EYA-------YSAFSETLRQQVDLFNTATGGAIMLQS--AAHQGDFSDVAFFAKVTGGLVRRRNAYGSG 70 (325) T ss_pred Cchhhhhhhh-hhh-------hhhhhhhhhhhHhhhhhcccceeEecc--ccccCceeeccccccccccccccccCCCCc Confidence 4333444332 222 22332222221 111111111111 11 222599999998864 2 22 3454444 Q ss_pred CcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---ccCccc----- Q lcl|Aclame:pro 84 TNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLAR---NKAKHL----- 155 (319) Q Consensus 84 ~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~~vapeiD~~~~s~la~---~a~~~~----- 155 (319) ..+++.++ +.+...+-..|.+.|..- |..+..-..+.+....+..+..+++...+.++..+.+ .+.... 
T Consensus 71 ~vt~~kit-t~~~~av~~~r~~g~~~~--d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a~~~~~~~v~ 147 (325) T protein:vir:95 71 TVAEKVLK-HLVDTSVKVAAGTPPVRL--DPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYSALSQVSDVVY 147 (325) T ss_pred eeccceec-cccceeeEEecccCcccc--cHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccee Confidence 44455544 333333334455554433 3333222222333333444444544444443433321 111111 Q ss_pred --cccCCHhH---HHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhh--hhcccccccceeeeeeeeecCeEEEE Q lcl|Aclame:pro 156 --TVGTGSDA---QYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIA--LPQGDTRQQVLGKGVQGELDGFVIVK 228 (319) Q Consensus 156 --~~~~T~~n---~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f--~~~~~~~~~~~~~g~Vg~idG~~I~~ 228 (319) +...+..+ -.+.|.++..+|.+.. ..=..++|.+.+|.-|++..-. .+.++.+. ++ .|+.+.|.+|+. T Consensus 148 dis~~~~~~~~~~s~~~l~~A~~klGD~~-~~l~~~~MHS~v~~~L~~~~L~~~~~~~~~~g---~~-~i~t~~G~~VIV 222 (325) T protein:vir:95 148 DATANTDAADKLPTWNNLNNGQAKFGDQS-SQIAAWIMHSTPMHKLYGSNLTNGERLFTYGT---VN-VVRDPFGKLLVM 222 (325) T ss_pred eeecccCcccccccHHHHHHHHHHhcccc-cceeEEEEchHHHHHHHHhhccccccccccCC---cc-cccccCCcEEEE Confidence 11122111 3578889999986642 1113578899999888864221 11111111 11 345567888886 Q ss_pred ecccc------cccceEEEEcCCceeeeeeeeeeeeecCCCC-CccceeeeeeeeeEEEeccccceEEEEccccccCCCC Q lcl|Aclame:pro 229 VPTKL------LQGLQAIAVVGEVLASPIQADLAKTNSNIPG-MFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRD 301 (319) Q Consensus 229 vps~~------~~~~n~i~~~~~A~~~~~k~~~~~~~~~~~~-~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~ 301 (319) .-+-. ...+..+..-++|+.....-+ .+.. +.+. ...-+..+.+-...|++.|+.-. |. . 
+.....|+ T Consensus 223 dD~~p~~~~g~~~~ytty~lg~GAi~~~~~~~-~~~~-~~~~~~~~~~~~~~~~~~tf~lhp~G~s-w~-~-s~~g~sPt 297 (325) T protein:vir:95 223 TDSPNLFAAGTPNVYHILGLVPGGVLIGQNND-FDAN-EETKNGDENIIRTYQAEWSYNIGVKGFA-WD-K-ANGGKSPT 297 (325) T ss_pred eCCCCCCCccCceeEEEEEEecCeEEecCCCC-cccc-ccccCcccceeeeeeeeeeEEeecceee-ee-c-ccccCCcC Confidence 42211 113444555577766555322 1111 1111 11111111112224666776521 21 1 11112233 Q ss_pred Cccccccccccccc-cccC Q lcl|Aclame:pro 302 GVDAHADNVAKPSG-SLEM 319 (319) Q Consensus 302 ~~~~~~~~~~~~~~-~~~~ 319 (319) -.++...+-|..+- +... T Consensus 298 ~aeL~~~~NW~rv~~~~K~ 316 (325) T protein:vir:95 298 DAALFTSTNWDKYATSHKD 316 (325) T ss_pred hHhhcCCcCcceecCCCcc Confidence 33444444453222 2222 No 196 >protein:vir:106590 Length: 349 # NCBI annotation: putative major head protein # Family: family:all:1083 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958585;genbank:gi:41179245;genbank:GeneID:2717126 Probab=24.19 E-value=2.2 Score=18.70 Aligned_cols=287 Identities=11% Similarity=-0.025 Sum_probs=90.4 Q ss_pred cccccceeeehhhhhhhhhcchhhhhhhHhhHHHHHHHHHhhhhhhhcccC------cceeee-CCc--eEEeeecccc- Q lcl|Aclame:pro 5 IKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALIS------NDAIFM-EGR--SFTVMKGDTT- 74 (319) Q Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~n~~~l~~ky~~lld~~~~~~sl~~~~~~n------~~~~~~-~g~--tVkIp~i~~~- 74 (319) .+| -||.|+||-|++.+.+..+...-..|...+ ...++-+..+.. .++... +++ -+--|-+... 
T Consensus 1 ~~~--~~~~~~~~~~~~~~~d~~~~~~l~~~~~~~----~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~ 74 (349) T protein:vir:10 1 MKN--QKLQLDLQRFATPILDMFSQNTVLDYTRNR----QYPEMLGDTLFPAVKVPTLEVDILKAGSRVPTIASVSAFDA 74 (349) T ss_pred CCc--chhhHHHHHHHHHhhcccCHHHHHHHHHhc----CcchhhHhhcCCccccccceeEEEeeccCcceeeeeecCCC Confidence 344 699999999999988775543333332111 001111111111 111111 111 1111111111 Q ss_pred ccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhH------HHHHHH---HHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 75 ELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNI------DINYVV---ARQGAEVVAPYLDNLRFA 145 (319) Q Consensus 75 g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~------~~~~~~---~~~~~~~vapeiD~~~~s 145 (319) +..=-+|++... +.+.-.+. -...++.-|........ .+.+.+ .++++..+.-.++..+.. T Consensus 75 ~~~~~~r~~~~~------~~~~p~ik----~~~~i~e~dl~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~q 144 (349) T protein:vir:10 75 EAEIGTREASKM------TAELAYVK----RKMQITEEMLIKLQSPRNTAEENYLKQYVFDDIDAMVQAVKARGEKMTME 144 (349) T ss_pred CcceecccceeE------Eeeccccc----cccccCHHHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000011221100 11101111 12334433322211100 001111 112222233333333333 Q ss_pred HHHhccC--------------ccccccCC--------HhHHHHHHHHHHHHHHhccCCCCcEEEEChHHHHHHhhhhhhh Q lcl|Aclame:pro 146 TLARNKA--------------KHLTVGTG--------SDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIAL 203 (319) Q Consensus 146 ~la~~a~--------------~~~~~~~T--------~~n~~~~i~~a~~~Lde~~VP~~R~l~VsP~~~~~L~~~~~f~ 203 (319) .|..+.. ......+| ..++++-|.++. +..+.. ..+++++++++..|++++.+. 
T Consensus 145 ~l~~Gki~~~~~g~~vD~g~~~~~~~~lt~~~~Ws~~~adpi~Di~~~~---~~~g~~-p~~~vm~~~~~~~l~~~~~i~ 220 (349) T protein:vir:10 145 MFATGKITDKKNGIAIDYGVPKKHQETLSGTKTWDKSDASIIDNLQDWS---DSLDVT-PTRALTSKKVLRILMRSTEIK 220 (349) T ss_pred HHhCCeeEEcCCcEEEecccCccceeEecCcccCCCCCCCHHHHHHHHH---HHhCCC-ccEEEeCHHHHHHHhcCHHHH Confidence 3332210 01111111 234555555444 333443 356889999999999999887 Q ss_pred hccccccc-c-----eeeeeeeeecCeEEEEecccc--------------cccceEEEEcCCceeeeeeeeee-eeecCC Q lcl|Aclame:pro 204 PQGDTRQQ-V-----LGKGVQGELDGFVIVKVPTKL--------------LQGLQAIAVVGEVLASPIQADLA-KTNSNI 262 (319) Q Consensus 204 ~~~~~~~~-~-----~~~g~Vg~idG~~I~~vps~~--------------~~~~n~i~~~~~A~~~~~k~~~~-~~~~~~ 262 (319) ......+. . .++...+...|.+|+...... .+.-.+++++.... ....+-.+ +..++. T Consensus 221 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~i~~yd~~y~d~~~~~~~t~~~~~p~~~v~l~~~~~~-G~~~yG~~~e~~~~~ 299 (349) T protein:vir:10 221 EAIFGKDTGRVVGQADLDQWMTAQGLPIIRAYDGKYRDEDSRGNLTTNSYFPEDRIVLFNDEVP-GQKIYGPTPEENRLI 299 (349) T ss_pred HHhcccccccccCHHHHHHHHHhcCCceEEEEeeEEEeecCCCceeecccccCCeEEEecCCCc-eeEEeeccchhhhhc Confidence 66432221 1 123444566677776432110 11111222221111 00000000 000000 Q ss_pred CCCccceeeeeeeeeEEEeccccceEEEEccccccCCCCCcccccccccccc Q lcl|Aclame:pro 263 PGMFGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDGVDAHADNVAKPS 314 (319) Q Consensus 263 ~~~~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~~~~~~~~~~~~~ 314 (319) .+......-+-.+...+..+..-...++...+.+=+ .....+..-.++=. 
T Consensus 300 ~g~~~~~~~~~~~~~~~~~~~dP~~~~~~~~s~~lP--v~~~~~~~~~a~Vl 349 (349) T protein:vir:10 300 SSNAQVSNVGNIMAKIYETSEDPIGTWILASATMLP--SFASADDVFQAKVL 349 (349) T ss_pred ccccceeeccceEEEeeeecCCCceEEEEEeeeeee--eecCCCcEEEEEeC Confidence 000000000111111111111111222222211111 00111111111111 No 197 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=22.45 E-value=2.4 Score=18.45 Aligned_cols=298 Identities=9% Similarity=0.018 Sum_probs=110.6 Q ss_pred CCccccc----ccceee-------------ehhhhhhhhh--------cchh-hhhhhHhhHHHH-HHHHHhhhhhhhcc Q lcl|Aclame:pro 1 MNKTIKN----ATGMLK-------------LNLQHFANKS--------VEPG-QTLLKNKHVGIL-ERVTAVNAYSTPAL 53 (319) Q Consensus 1 ~~~~~~~----~~~~~~-------------~~~~~~~~~~--------~~~n-~~~l~~ky~~ll-d~~~~~~sl~~~~~ 53 (319) .+..++. ..++++ -..+.|.... +..+ ....++-+...+ +.+.....+.. . T Consensus 105 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~vP~~~~~~i~~~l~~~~~l~~--~ 182 (466) T protein:vir:80 105 RTQQFVGGETRMKGFFRNMPYEQRAALIARSEVKEFLAQVRTLAQQKRAVSGAELTIPDVMLELLRDNMHRYSKLIS--K 182 (466) T ss_pred hhhHHhhHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhccccccccHHHHHHHHHhhhhhhhhhh--h Confidence 0000000 000000 0000000000 0000 011223232222 11111111110 1 Q ss_pred cCcceeeeCCceEEeeeccccccccccCCCCcccCCcccceeEEEEeecccceeecchhhHHHHhhhHHHHHHHHHHHHH Q lcl|Aclame:pro 54 ISNDAIFMEGRSFTVMKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRFVDALDRKDTEGNIDINYVVARQGAE 133 (319) Q Consensus 54 ~n~~~~~~~g~tVkIp~i~~~g~~DY~r~~~~~~~~~t~t~~tltidqdr~~~F~VD~~D~~et~~~~~~~~~~~~~~~~ 133 (319) ++ +.-. ...+++|.-+..+.+.... .+-.....+.++..+++.-.+.-.|. .--+.--.-...++...+.+..+. 
T Consensus 183 ~~--v~~~-~g~~~~~~~~~~~~a~wv~-E~~~~~~~~~~f~~i~~~~~k~~~~~-~iS~ell~ds~~~l~~~i~~~la~ 257 (466) T protein:vir:80 183 VR--LRPL-KGTARQNIAGAIPEGVWTE-AVANLNELSLSFSQIEVDGYKVGGFI-PIPNSTLEDSDLNLADEILDAIGQ 257 (466) T ss_pred ee--eeec-CceeEeeeecCCcceeecc-cccccccccccccceeecceeeeeeh-hhhHHHHhcchHHHHHHHHHHHHH Confidence 11 1111 2345666544333232221 11112222334444555555544432 211111111112445566777778 Q ss_pred HHHHHHHHHHHHH---------HHhccCcc---cc-------ccCCH----------hHHHHHHHHHHHHH--HhccCCC Q lcl|Aclame:pro 134 VVAPYLDNLRFAT---------LARNKAKH---LT-------VGTGS----------DAQYDAVLDVSVEL--DEIKAPE 182 (319) Q Consensus 134 ~vapeiD~~~~s~---------la~~a~~~---~~-------~~~T~----------~n~~~~i~~a~~~L--de~~VP~ 182 (319) +++-.+|.-++.- +...+... .. ..++. .+.+..+.++...+ ....... T Consensus 258 ~~~~~~~~ail~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 337 (466) T protein:vir:80 258 AIGFALDKAILYGTGTKMPVGIVTRLAQTTQPPNWGTKAPAWTNLSTTNLLKIDPTGKSAEEFFSELVLKLSKARANYSN 337 (466) T ss_pred HHHHHHhhheeeccCCCCcceeeecccccccccccccccccccccchhhhhhhhhhccchhhHHHHHHHHHHhhhccccC Confidence 8888888765420 00000000 00 00111 11122222322222 2223333 Q ss_pred -CcEEEEChHHHHHHhhhhhh-hhcccccccceeeeeeeeecCeEEEEecccccccceEEEEcCCceeeeeeeeeeeeec Q lcl|Aclame:pro 183 -NRVLFVSPTFYKGIKKFVIA-LPQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGEVLASPIQADLAKTNS 260 (319) Q Consensus 183 -~R~l~VsP~~~~~L~~~~~f-~~~~~~~~~~~~~g~Vg~idG~~I~~vps~~~~~~n~i~~~~~A~~~~~k~~~~~~~~ 260 (319) ..++++++..+..|..-.-. ...+..... ..++ ..+.|.+|+. +..+++..++.+...+...... ..+++-. 
T Consensus 338 ~~~~w~~~~~~~~~l~~~~~~~~~~g~~~~~-~~~~--~~i~G~pvv~--s~~~~~~~~~~g~~~~y~i~~r-~~~~i~~ 411 (466) T protein:vir:80 338 GMKFWAMSSNTHAVLMSKAITFNSAGALVAS-LNNT--MPIVGGDIVI--LDFIPDNDIIGGYGSLYLLAER-ADIKLAQ 411 (466) T ss_pred CceeEEecchhHHHhhcccccccCCcccccc-CCCc--ccccccceee--cCccCccceeeeccccEEEEee-cceEEEe Confidence 34578899888777533211 111110000 0111 2377999975 4455555577777666544432 2344433 Q ss_pred CCCCC---ccceeeeeeeeeEEEeccccceEEEEccccccCCCCCcccccccccccccc-ccC Q lcl|Aclame:pro 261 NIPGM---FGTLAEQLLYTGAFVPEHLQKYIFTIGGTEVATKRDGVDAHADNVAKPSGS-LEM 319 (319) Q Consensus 261 ~~~~~---~~~~v~gr~~yg~~V~~~k~~~Iy~~~~~~~a~~~~~~~~~~~~~~~~~~~-~~~ 319 (319) ..+.. +-..++...++|.+|.+++..-+..-....+.+ + ..++.+.|. -|+ T Consensus 412 ~~~~~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~~~~~~-----~---~~~~~~~~~~~~~ 466 (466) T protein:vir:80 412 SEHVRFIEDQTVFKGTARYDGKPVFGEGFVAVNIANANPTT-----S---ITFAPDEANVPEV 466 (466) T ss_pred chhhhhhcCcEEEEEEEEEccEEeccCceEEEEecCCCccc-----c---eeeecCcCcCCCC Confidence 23333 336789999999999999874332211111111 1 112222222 223 Done!