Query lcl|Aclame:protein:vir:78830|NCBI_annot:major head protein|genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Match_columns 324 No_of_seqs 121 out of 1241 Neff 9.6 Searched_HMMs 1612 Date Mon Dec 2 22:24:31 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_43 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_43_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:9309 Length: 324 # 100.0 3.8E-76 2.4E-79 434.0 33.4 324 1-324 1-324 (324) 2 protein:vir:97148 Length: 324 100.0 1.2E-75 7.7E-79 431.2 33.3 324 1-324 1-324 (324) 3 protein:vir:78830 Length: 324 100.0 3E-75 1.9E-78 429.1 33.2 324 1-324 1-324 (324) 4 protein:vir:96392 Length: 324 100.0 3E-75 1.9E-78 429.1 33.2 324 1-324 1-324 (324) 5 protein:vir:96223 Length: 324 100.0 5E-75 3.1E-78 427.9 32.9 324 1-324 1-324 (324) 6 protein:vir:103955 Length: 324 100.0 1.3E-74 7.8E-78 425.7 32.8 324 1-324 1-324 (324) 7 protein:vir:99749 Length: 324 100.0 1.6E-74 1E-77 425.1 33.1 324 1-324 1-324 (324) 8 protein:vir:95763 Length: 297 100.0 1.1E-63 6.7E-67 365.8 29.6 296 19-316 1-297 (297) 9 protein:vir:41 Length: 299 # N 100.0 6.7E-63 4.1E-66 361.5 29.6 294 22-316 1-299 (299) 10 protein:vir:105905 Length: 304 100.0 5.7E-62 3.6E-65 356.3 29.5 295 19-314 1-304 (304) 11 protein:vir:94142 Length: 304 100.0 5.7E-62 3.6E-65 356.3 29.5 295 19-314 1-304 (304) 12 protein:vir:7771 Length: 330 # 100.0 3.9E-61 2.4E-64 351.8 30.5 304 19-324 1-330 (330) 13 protein:vir:5739 Length: 366 # 100.0 2.9E-61 1.8E-64 352.4 28.7 312 1-315 20-366 (366) 14 protein:vir:2430 Length: 318 # 100.0 6E-61 3.8E-64 350.7 30.0 307 11-320 1-318 (318) 15 protein:vir:2344 Length: 397 # 100.0 4.5E-61 2.8E-64 351.4 28.9 306 15-324 1-315 (397) 16 protein:vir:1433 Length: 435 # 100.0 3.6E-60 2.2E-63 346.5 30.0 314 1-317 91-435 (435) 17 protein:vir:80376 Length: 435 100.0 6.1E-60 3.8E-63 345.2 30.4 314 1-317 92-435 (435) 18 protein:vir:105038 Length: 428 100.0 4.2E-60 2.6E-63 346.1 28.9 312 1-315 83-428 (428) 19 protein:vir:104085 Length: 320 100.0 1E-59 6.3E-63 344.0 29.7 305 13-319 1-320 (320) 20 protein:vir:4226 Length: 326 # 100.0 5E-60 3.1E-63 345.7 27.9 308 1-318 3-326 (326) 21 protein:vir:485 Length: 407 # 100.0 1.6E-59 1E-62 342.9 29.8 306 1-322 80-407 (407) 22 protein:vir:80684 Length: 315 100.0 1.2E-59 7.7E-63 343.5 28.8 292 27-324 1-315 (315) 23 protein:vir:4456 Length: 401 # 100.0 9.4E-59 5.9E-62 338.7 29.0 298 1-315 81-401 (401) 24 protein:vir:9574 Length: 300 # 100.0 4E-58 2.5E-61 335.2 29.2 282 28-315 1-300 (300) 25 protein:vir:100247 Length: 425 100.0 4.7E-58 2.9E-61 334.9 29.1 299 1-316 102-425 (425) 26 protein:vir:1638 Length: 298 # 100.0 8.7E-58 5.4E-61 333.4 29.6 278 31-314 1-298 (298) 27 protein:vir:9759 Length: 303 # 100.0 9.5E-58 5.9E-61 333.2 29.6 281 29-315 1-303 (303) 28 protein:vir:94771 Length: 298 100.0 1.4E-57 8.7E-61 332.3 29.6 278 31-314 1-298 (298) 29 protein:vir:93616 Length: 645 100.0 3.9E-57 2.4E-60 329.8 30.6 315 1-321 289-645 (645) 30 protein:vir:2504 Length: 305 # 100.0 3.9E-57 2.4E-60 329.8 28.6 289 27-323 1-305 (305) 31 protein:vir:100135 Length: 418 100.0 9.7E-57 6E-60 327.7 30.4 303 1-318 103-418 (418) 32 protein:vir:8187 Length: 311 # 100.0 7.1E-57 4.4E-60 328.4 29.6 281 29-316 1-311 (311) 33 protein:vir:78223 Length: 333 100.0 1.6E-56 9.7E-60 326.5 29.6 302 11-316 1-333 (333) 34 protein:vir:78523 Length: 338 100.0 2.4E-56 1.5E-59 325.5 30.0 307 8-318 1-338 (338) 35 protein:vir:7855 Length: 497 # 100.0 5.9E-56 3.7E-59 323.4 30.0 303 1-319 116-497 (497) 36 protein:vir:101650 Length: 497 100.0 5.9E-56 3.7E-59 323.4 30.0 303 1-319 116-497 (497) 37 protein:vir:1328 Length: 392 # 100.0 4.8E-56 3E-59 323.8 29.4 300 1-316 83-392 (392) 38 protein:vir:1886 Length: 385 # 100.0 7.6E-56 4.7E-59 322.8 29.9 301 1-316 75-385 (385) 39 protein:vir:191 Length: 385 # 100.0 7.6E-56 4.7E-59 322.8 29.9 301 1-316 75-385 (385) 40 protein:vir:4339 Length: 395 # 100.0 1.1E-55 6.7E-59 321.9 30.4 299 1-315 88-395 (395) 41 protein:vir:6242 Length: 390 # 100.0 8.7E-56 5.4E-59 322.5 29.0 297 1-316 79-390 (390) 42 protein:vir:97053 Length: 390 100.0 1.8E-55 1.1E-58 320.7 30.3 297 1-313 82-390 (390) 43 protein:vir:10364 Length: 390 100.0 2.7E-55 1.6E-58 319.8 30.4 297 1-313 82-390 (390) 44 protein:vir:81070 Length: 390 100.0 2.8E-55 1.8E-58 319.6 30.4 297 1-313 82-390 (390) 45 protein:vir:8102 Length: 543 # 100.0 3.1E-55 1.9E-58 319.4 29.6 304 1-316 216-543 (543) 46 protein:vir:102119 Length: 404 100.0 1.2E-54 7.1E-58 316.3 30.9 307 1-319 78-404 (404) 47 protein:vir:95376 Length: 425 100.0 1.2E-54 7.6E-58 316.1 28.4 301 1-319 110-425 (425) 48 protein:vir:4997 Length: 397 # 100.0 3.9E-54 2.4E-57 313.4 30.5 298 1-324 81-394 (397) 49 protein:vir:6212 Length: 434 # 100.0 2.2E-54 1.4E-57 314.7 29.2 301 1-318 114-434 (434) 50 protein:vir:4953 Length: 397 # 100.0 4E-54 2.5E-57 313.3 30.4 298 1-324 79-395 (397) 51 protein:vir:4511 Length: 409 # 100.0 4.1E-54 2.6E-57 313.3 29.4 304 1-318 85-409 (409) 52 protein:vir:4600 Length: 415 # 100.0 1.3E-53 8.2E-57 310.5 31.3 304 1-324 96-411 (415) 53 protein:vir:4700 Length: 415 # 100.0 1.3E-53 8.2E-57 310.5 31.3 304 1-324 96-411 (415) 54 protein:vir:81160 Length: 371 100.0 1E-53 6.4E-57 311.1 30.4 287 1-315 66-371 (371) 55 protein:vir:102873 Length: 392 100.0 6.2E-54 3.9E-57 312.3 29.2 296 1-323 74-392 (392) 56 protein:vir:105004 Length: 392 100.0 6.2E-54 3.9E-57 312.3 29.2 296 1-323 74-392 (392) 57 protein:vir:102082 Length: 392 100.0 6.2E-54 3.9E-57 312.3 29.2 296 1-323 74-392 (392) 58 protein:vir:107593 Length: 392 100.0 6.2E-54 3.9E-57 312.3 29.2 296 1-323 74-392 (392) 59 protein:vir:81227 Length: 413 100.0 1.4E-53 8.6E-57 310.4 29.8 303 1-318 85-413 (413) 60 protein:vir:1025 Length: 408 # 100.0 2.7E-53 1.7E-56 308.8 31.1 298 1-324 84-408 (408) 61 protein:vir:4830 Length: 397 # 100.0 2.5E-53 1.6E-56 308.9 30.7 297 1-324 79-395 (397) 62 protein:vir:4856 Length: 293 # 100.0 1.5E-53 9.5E-57 310.1 28.9 276 23-324 1-291 (293) 63 protein:vir:3991 Length: 404 # 100.0 7.3E-53 4.6E-56 306.4 31.4 297 1-323 87-404 (404) 64 protein:vir:3845 Length: 395 # 100.0 6.1E-53 3.8E-56 306.8 30.7 295 1-324 81-392 (395) 65 protein:vir:98339 Length: 415 100.0 9.9E-53 6.1E-56 305.7 31.4 304 1-324 96-411 (415) 66 protein:vir:79987 Length: 415 100.0 9.9E-53 6.1E-56 305.7 31.4 304 1-324 96-411 (415) 67 protein:vir:81100 Length: 415 100.0 9.9E-53 6.1E-56 305.7 31.4 304 1-324 96-411 (415) 68 protein:vir:104256 Length: 458 100.0 5.8E-53 3.6E-56 307.0 30.1 298 1-315 126-458 (458) 69 protein:vir:96762 Length: 632 100.0 1.5E-53 9.5E-57 310.1 26.7 294 1-314 304-632 (632) 70 protein:vir:7409 Length: 408 # 100.0 9.1E-53 5.6E-56 305.9 30.8 298 1-324 84-408 (408) 71 protein:vir:1268 Length: 397 # 100.0 3.7E-53 2.3E-56 308.0 28.6 288 1-315 89-397 (397) 72 protein:vir:9410 Length: 415 # 100.0 1.5E-52 9.6E-56 304.6 31.0 304 1-324 96-411 (415) 73 protein:vir:99920 Length: 311 100.0 6.4E-53 3.9E-56 306.7 28.3 280 28-315 1-311 (311) 74 protein:vir:101607 Length: 379 100.0 1.3E-52 8E-56 305.1 29.2 290 1-315 76-379 (379) 75 protein:vir:4092 Length: 390 # 100.0 2.2E-52 1.4E-55 303.8 27.2 303 1-324 39-379 (390) 76 protein:vir:1383 Length: 421 # 100.0 1.1E-51 7E-55 299.9 27.7 294 1-324 87-392 (421) 77 protein:vir:9704 Length: 394 # 100.0 6.7E-51 4.2E-54 295.7 28.9 286 1-319 85-394 (394) 78 protein:vir:98635 Length: 377 100.0 3.5E-52 2.2E-55 302.7 21.6 299 1-315 29-377 (377) 79 protein:vir:100172 Length: 394 100.0 5.3E-50 3.3E-53 290.7 30.1 293 1-324 84-393 (394) 80 protein:vir:94673 Length: 419 100.0 4.3E-50 2.6E-53 291.3 29.1 301 1-317 88-419 (419) 81 protein:vir:100884 Length: 389 100.0 1.1E-49 7.1E-53 288.9 29.8 289 1-323 83-389 (389) 82 protein:vir:3870 Length: 400 # 100.0 8.8E-50 5.4E-53 289.5 28.2 283 1-316 89-400 (400) 83 protein:vir:1084 Length: 437 # 100.0 7E-50 4.3E-53 290.1 27.4 293 1-324 126-437 (437) 84 protein:vir:101291 Length: 381 100.0 6E-50 3.7E-53 290.4 26.1 301 1-324 37-378 (381) 85 protein:vir:9509 Length: 381 # 100.0 6E-50 3.7E-53 290.4 26.1 301 1-324 37-378 (381) 86 protein:vir:78640 Length: 352 100.0 3.8E-50 2.3E-53 291.5 24.8 294 1-321 46-352 (352) 87 protein:vir:2685 Length: 387 # 100.0 9.5E-50 5.9E-53 289.3 24.3 294 1-321 85-387 (387) 88 protein:vir:96978 Length: 387 100.0 9.5E-50 5.9E-53 289.3 24.3 294 1-321 85-387 (387) 89 protein:vir:94424 Length: 387 100.0 9.5E-50 5.9E-53 289.3 24.3 294 1-321 85-387 (387) 90 protein:vir:95963 Length: 395 100.0 6.9E-49 4.3E-52 284.6 28.7 303 1-324 45-389 (395) 91 protein:vir:100632 Length: 381 100.0 4.8E-49 3E-52 285.5 24.2 301 1-324 20-380 (381) 92 protein:vir:80128 Length: 466 100.0 1.2E-48 7.4E-52 283.3 26.1 306 1-324 103-457 (466) 93 protein:vir:8420 Length: 477 # 100.0 9.9E-49 6.2E-52 283.8 25.6 305 1-321 103-477 (477) 94 protein:vir:93881 Length: 387 100.0 7.9E-49 4.9E-52 284.3 24.8 294 1-321 81-387 (387) 95 protein:vir:9643 Length: 377 # 100.0 2.5E-48 1.5E-51 281.6 26.5 293 1-315 39-377 (377) 96 protein:vir:9361 Length: 402 # 100.0 7.9E-49 4.9E-52 284.3 23.5 294 1-321 98-402 (402) 97 protein:vir:78350 Length: 383 100.0 1.2E-48 7.6E-52 283.3 22.8 299 1-323 43-383 (383) 98 protein:vir:962 Length: 397 # 100.0 2.3E-47 1.4E-50 276.3 26.2 280 1-315 93-397 (397) 99 protein:vir:4197 Length: 314 # 100.0 5.4E-40 3.4E-43 235.9 24.9 287 8-318 1-314 (314) 100 protein:vir:4159 Length: 315 # 100.0 2.8E-38 1.8E-41 226.4 22.9 286 1-312 1-315 (315) 101 protein:vir:3158 Length: 321 # 100.0 1.5E-35 9.6E-39 211.4 25.6 301 1-324 1-320 (321) 102 protein:vir:97397 Length: 517 100.0 1E-35 6.4E-39 212.4 21.9 294 1-318 190-517 (517) 103 protein:vir:3033 Length: 272 # 100.0 3.2E-34 2E-37 204.2 24.6 261 27-318 1-272 (272) 104 protein:vir:9820 Length: 272 # 100.0 3.2E-34 2E-37 204.2 24.6 261 27-318 1-272 (272) 105 protein:vir:4074 Length: 480 # 99.9 2.7E-30 1.7E-33 182.6 15.9 284 1-318 151-480 (480) 106 protein:vir:93742 Length: 274 99.9 6.3E-26 3.9E-29 158.7 22.6 263 24-319 1-274 (274) 107 protein:vir:3613 Length: 272 # 99.9 1.7E-25 1E-28 156.4 20.9 259 26-315 1-272 (272) 108 protein:vir:105334 Length: 276 99.9 8.9E-25 5.5E-28 152.4 21.8 265 26-323 1-276 (276) 109 protein:vir:96833 Length: 275 99.9 1.2E-24 7.6E-28 151.7 21.5 264 24-319 1-275 (275) 110 protein:vir:96123 Length: 274 99.9 1.1E-23 6.6E-27 146.5 22.4 263 24-319 1-274 (274) 111 protein:vir:97433 Length: 274 99.9 2.5E-23 1.6E-26 144.5 22.8 263 26-319 1-274 (274) 112 protein:vir:94494 Length: 274 99.9 2.5E-23 1.6E-26 144.5 22.8 263 26-319 1-274 (274) 113 protein:vir:79928 Length: 393 99.9 6.5E-24 4E-27 147.7 18.6 311 1-324 42-386 (393) 114 protein:vir:94933 Length: 330 99.9 2.5E-23 1.6E-26 144.5 19.4 296 1-316 1-330 (330) 115 protein:vir:80930 Length: 278 99.9 7.8E-23 4.8E-26 141.8 21.8 266 26-316 1-278 (278) 116 protein:vir:95107 Length: 270 99.8 1.4E-22 8.8E-26 140.4 21.0 262 29-320 1-270 (270) 117 protein:vir:95898 Length: 274 99.8 4.9E-22 3E-25 137.4 22.4 263 24-324 1-274 (274) 118 protein:vir:96262 Length: 274 99.8 4.9E-22 3E-25 137.4 22.4 263 24-324 1-274 (274) 119 protein:vir:1239 Length: 274 # 99.8 5.1E-22 3.1E-25 137.3 22.2 263 24-319 1-274 (274) 120 protein:vir:739 Length: 231 # 99.7 7.5E-20 4.6E-23 125.4 17.7 223 61-315 1-231 (231) 121 protein:vir:97255 Length: 310 99.7 1.9E-18 1.2E-21 117.7 21.6 274 27-315 1-310 (310) 122 protein:vir:99424 Length: 360 99.7 6.4E-17 4E-20 109.4 22.5 298 1-318 1-360 (360) 123 protein:vir:108211 Length: 318 99.6 9.1E-17 5.6E-20 108.5 17.9 282 23-316 1-318 (318) 124 protein:vir:7990 Length: 273 # 99.6 1.4E-15 8.7E-19 102.0 20.3 258 31-315 1-273 (273) 125 protein:vir:8324 Length: 410 # 99.5 4.2E-16 2.6E-19 104.9 16.6 280 1-313 59-410 (410) 126 protein:vir:105822 Length: 273 99.5 4.4E-15 2.7E-18 99.3 20.9 258 31-315 1-273 (273) 127 protein:vir:102605 Length: 273 99.5 4.4E-15 2.7E-18 99.3 20.9 258 31-315 1-273 (273) 128 protein:vir:94622 Length: 341 99.5 3.1E-15 1.9E-18 100.1 19.0 293 24-319 1-341 (341) 129 protein:vir:5974 Length: 324 # 99.4 4E-14 2.5E-17 94.0 19.5 275 30-324 1-298 (324) 130 protein:vir:80213 Length: 334 99.4 4.5E-14 2.8E-17 93.8 17.8 293 15-317 1-334 (334) 131 protein:vir:80180 Length: 381 99.3 1E-12 6.3E-16 86.3 19.2 292 8-324 1-314 (381) 132 protein:vir:2201 Length: 345 # 99.3 1E-12 6.3E-16 86.3 17.3 281 15-315 1-345 (345) 133 protein:vir:102944 Length: 330 99.2 7E-12 4.3E-15 81.7 20.3 278 24-324 1-304 (330) 134 protein:vir:94576 Length: 347 99.2 1.2E-12 7.2E-16 86.0 15.9 284 15-315 1-347 (347) 135 protein:vir:1583 Length: 351 # 99.2 4.3E-12 2.7E-15 82.9 18.6 275 30-324 1-302 (351) 136 protein:vir:100057 Length: 375 99.2 1.3E-11 7.9E-15 80.3 20.5 291 15-321 1-375 (375) 137 protein:vir:10450 Length: 344 99.2 1.6E-12 1E-15 85.2 14.8 282 15-315 1-344 (344) 138 protein:vir:78739 Length: 332 99.2 5.4E-12 3.4E-15 82.4 17.5 296 12-313 1-332 (332) 139 protein:vir:8885 Length: 347 # 99.2 4.4E-12 2.7E-15 82.8 16.2 285 1-316 1-347 (347) 140 protein:vir:6324 Length: 335 # 99.1 2.9E-11 1.8E-14 78.3 19.5 296 15-322 1-335 (335) 141 protein:vir:9927 Length: 295 # 99.1 2E-11 1.2E-14 79.2 17.6 269 24-322 1-295 (295) 142 protein:vir:78935 Length: 335 99.1 7.3E-11 4.5E-14 76.2 19.9 295 15-321 1-335 (335) 143 protein:vir:93858 Length: 400 99.1 9.6E-12 5.9E-15 81.0 14.9 292 1-313 77-400 (400) 144 protein:vir:103323 Length: 364 99.1 2.6E-10 1.6E-13 73.1 22.6 290 17-324 1-349 (364) 145 protein:vir:1541 Length: 347 # 99.1 3.8E-11 2.4E-14 77.7 17.5 285 15-317 1-347 (347) 146 protein:vir:3364 Length: 347 # 99.0 5E-11 3.1E-14 77.1 16.4 285 15-317 1-347 (347) 147 protein:vir:94711 Length: 347 99.0 1.5E-11 9.5E-15 79.9 13.4 284 15-316 1-347 (347) 148 protein:vir:106647 Length: 303 99.0 5.2E-11 3.3E-14 76.9 15.8 279 23-322 1-303 (303) 149 protein:vir:3136 Length: 322 # 99.0 7.7E-11 4.8E-14 76.0 16.7 289 24-319 1-322 (322) 150 protein:vir:95318 Length: 328 99.0 1.3E-10 8.1E-14 74.8 17.0 224 20-245 1-328 (328) 151 protein:vir:9875 Length: 296 # 99.0 1E-10 6.4E-14 75.3 16.4 275 18-316 1-296 (296) 152 protein:vir:97031 Length: 402 99.0 2.4E-10 1.5E-13 73.3 18.4 298 17-324 1-342 (402) 153 protein:vir:105645 Length: 400 98.9 3.1E-10 1.9E-13 72.7 18.3 293 23-324 1-342 (400) 154 protein:vir:99675 Length: 324 98.9 3.2E-10 2E-13 72.6 15.7 246 60-324 1-306 (324) 155 protein:vir:107120 Length: 329 98.9 1.4E-09 8.6E-13 69.1 19.1 286 1-324 12-314 (329) 156 protein:vir:7019 Length: 401 # 98.8 1.1E-09 7.1E-13 69.6 16.6 291 23-324 1-340 (401) 157 protein:vir:97331 Length: 319 98.8 6E-09 3.7E-12 65.6 20.3 285 1-324 1-303 (319) 158 protein:vir:94800 Length: 319 98.8 6E-09 3.7E-12 65.6 20.3 285 1-324 1-303 (319) 159 protein:vir:103285 Length: 296 98.8 3.3E-09 2E-12 67.1 18.5 271 24-316 1-296 (296) 160 protein:vir:107826 Length: 331 98.8 2.8E-09 1.7E-12 67.5 18.0 224 20-245 1-331 (331) 161 protein:vir:107388 Length: 331 98.8 2.8E-09 1.7E-12 67.5 18.0 224 20-245 1-331 (331) 162 protein:vir:98525 Length: 331 98.8 2.8E-09 1.7E-12 67.5 18.0 224 20-245 1-331 (331) 163 protein:vir:107687 Length: 319 98.7 1.4E-08 8.7E-12 63.6 20.2 288 1-313 1-319 (319) 164 protein:vir:95512 Length: 693 98.7 1.2E-08 7.4E-12 64.0 19.7 302 1-313 366-693 (693) 165 protein:vir:102655 Length: 322 98.7 5.7E-09 3.5E-12 65.8 17.9 276 24-316 1-322 (322) 166 protein:vir:80068 Length: 301 98.7 1E-08 6.5E-12 64.3 19.1 264 30-313 1-301 (301) 167 protein:vir:8843 Length: 317 # 98.7 4E-08 2.5E-11 61.1 21.1 276 23-317 1-317 (317) 168 protein:vir:79548 Length: 652 98.7 3.2E-08 2E-11 61.7 20.5 301 1-312 331-652 (652) 169 protein:vir:103759 Length: 330 98.6 4.2E-09 2.6E-12 66.5 14.6 224 20-245 1-330 (330) 170 protein:vir:104342 Length: 314 98.6 3.2E-08 2E-11 61.7 17.8 287 1-316 3-314 (314) 171 protein:vir:99075 Length: 392 98.5 9.2E-08 5.7E-11 59.2 18.7 281 31-324 1-325 (392) 172 protein:vir:7324 Length: 335 # 98.4 2.5E-08 1.5E-11 62.3 14.6 225 20-246 1-335 (335) 173 protein:vir:79642 Length: 329 98.4 3.3E-07 2E-10 56.1 20.0 294 1-316 1-329 (329) 174 protein:vir:108303 Length: 418 98.2 1.6E-06 1E-09 52.3 19.7 276 30-324 1-319 (418) 175 protein:vir:95131 Length: 325 98.2 5.6E-07 3.5E-10 54.9 17.0 281 24-324 1-301 (325) 176 protein:vir:80446 Length: 367 98.1 3.7E-06 2.3E-09 50.3 18.7 279 1-324 1-343 (367) 177 protein:vir:78387 Length: 349 97.6 4.4E-05 2.7E-08 44.5 20.4 279 30-324 1-323 (349) 178 protein:vir:95875 Length: 401 97.5 1.7E-05 1E-08 46.7 14.5 289 21-316 1-401 (401) 179 protein:vir:105522 Length: 423 97.5 6.2E-05 3.9E-08 43.6 19.9 278 31-324 1-329 (423) 180 protein:vir:94989 Length: 349 97.4 7E-05 4.3E-08 43.4 21.1 279 30-324 1-323 (349) 181 protein:vir:95603 Length: 463 97.4 3.7E-05 2.3E-08 44.9 15.3 313 1-324 1-360 (463) 182 protein:vir:99311 Length: 463 97.4 3.7E-05 2.3E-08 44.9 15.3 313 1-324 1-360 (463) 183 protein:vir:3525 Length: 423 # 97.4 8.4E-05 5.2E-08 42.9 18.2 272 31-324 1-309 (423) 184 protein:vir:3643 Length: 336 # 97.3 3.4E-05 2.1E-08 45.1 14.2 291 1-313 1-336 (336) 185 protein:vir:5255 Length: 304 # 97.3 7.3E-05 4.6E-08 43.2 15.8 264 33-312 1-304 (304) 186 protein:vir:94070 Length: 339 97.3 3.5E-05 2.2E-08 45.0 13.7 290 1-313 5-339 (339) 187 protein:vir:103886 Length: 302 97.2 0.00014 8.6E-08 41.7 16.9 266 1-319 1-302 (302) 188 protein:vir:105374 Length: 423 97.2 0.00015 9E-08 41.6 19.8 273 31-324 1-329 (423) 189 protein:vir:79008 Length: 299 97.1 0.00017 1E-07 41.3 17.0 274 31-319 1-299 (299) 190 protein:vir:174 Length: 423 # 97.1 0.00017 1.1E-07 41.2 19.4 278 31-324 1-329 (423) 191 protein:vir:101557 Length: 336 97.1 9.2E-05 5.7E-08 42.7 14.2 292 1-313 1-336 (336) 192 protein:vir:99576 Length: 388 97.0 8.6E-05 5.4E-08 42.9 13.7 298 1-313 33-388 (388) 193 protein:vir:96666 Length: 462 96.9 0.00025 1.5E-07 40.4 15.4 298 1-324 3-331 (462) 194 protein:vir:1781 Length: 221 # 96.9 0.00015 9.2E-08 41.6 13.6 184 109-324 1-209 (221) 195 protein:vir:78558 Length: 336 96.5 0.00059 3.7E-07 38.3 14.5 293 1-313 1-336 (336) 196 protein:vir:1829 Length: 355 # 96.2 0.00088 5.5E-07 37.3 18.0 298 1-324 1-351 (355) 197 protein:vir:107732 Length: 379 96.0 0.00093 5.7E-07 37.2 13.1 295 1-313 21-379 (379) 198 protein:vir:96792 Length: 315 95.6 0.0018 1.1E-06 35.6 17.5 268 27-324 1-288 (315) 199 protein:vir:106734 Length: 336 95.3 0.0023 1.4E-06 35.0 13.6 292 1-313 1-336 (336) 200 protein:vir:861 Length: 318 # 95.3 0.00074 4.6E-07 37.7 10.1 286 1-313 5-318 (318) 201 protein:vir:98566 Length: 355 95.0 0.003 1.9E-06 34.4 17.9 298 1-324 1-351 (355) 202 protein:vir:78777 Length: 358 94.8 0.0034 2.1E-06 34.1 17.7 298 1-324 1-353 (358) 203 protein:vir:63741 Length: 468 94.8 0.0035 2.2E-06 34.1 15.0 314 1-324 1-360 (468) 204 protein:vir:96079 Length: 382 94.6 0.004 2.5E-06 33.7 15.9 301 1-313 27-382 (382) 205 protein:vir:95451 Length: 313 94.5 0.0041 2.6E-06 33.7 16.0 272 28-316 1-313 (313) 206 protein:vir:1663 Length: 393 # 94.4 0.0016 1E-06 35.9 9.7 287 1-313 63-393 (393) 207 protein:vir:80835 Length: 464 94.3 0.0049 3E-06 33.2 15.8 314 1-324 3-357 (464) 208 protein:vir:94870 Length: 318 94.2 0.0028 1.7E-06 34.6 10.5 288 1-313 5-318 (318) 209 protein:vir:6061 Length: 357 # 94.1 0.0053 3.3E-06 33.1 16.4 298 1-324 1-351 (357) 210 protein:vir:2016 Length: 357 # 93.9 0.0061 3.8E-06 32.7 16.2 299 1-324 1-351 (357) 211 protein:vir:93966 Length: 400 93.8 0.0026 1.6E-06 34.7 9.7 287 1-313 87-400 (400) 212 protein:vir:100851 Length: 514 93.6 0.0071 4.4E-06 32.4 11.8 303 1-324 1-363 (514) 213 protein:vir:5694 Length: 357 # 93.6 0.0071 4.4E-06 32.4 16.2 299 1-324 1-351 (357) 214 protein:vir:102823 Length: 470 92.9 0.0021 1.3E-06 35.3 7.6 287 1-324 1-313 (470) 215 protein:vir:1153 Length: 338 # 91.8 0.014 8.8E-06 30.7 17.2 291 1-317 1-338 (338) 216 protein:vir:100331 Length: 342 91.8 0.014 8.9E-06 30.7 16.2 293 1-319 1-342 (342) 217 protein:vir:348 Length: 321 # 91.7 0.015 9.1E-06 30.6 17.1 276 1-313 1-321 (321) 218 protein:vir:79157 Length: 339 89.1 0.028 1.8E-05 29.1 16.9 292 1-319 1-339 (339) 219 protein:vir:78920 Length: 290 89.1 0.028 1.8E-05 29.1 21.2 268 1-315 1-290 (290) 220 protein:vir:104011 Length: 337 88.2 0.034 2.1E-05 28.7 18.2 292 1-318 1-337 (337) 221 protein:vir:79171 Length: 337 87.7 0.037 2.3E-05 28.4 18.2 292 1-318 1-337 (337) 222 protein:vir:99888 Length: 309 85.5 0.052 3.2E-05 27.6 14.0 277 32-316 1-309 (309) 223 protein:vir:105464 Length: 346 85.5 0.053 3.3E-05 27.6 21.4 280 30-324 1-310 (346) 224 protein:vir:3746 Length: 336 # 85.2 0.054 3.4E-05 27.5 19.0 291 1-324 1-336 (336) 225 protein:vir:79712 Length: 285 84.9 0.057 3.5E-05 27.4 18.4 256 35-316 1-285 (285) 226 protein:vir:78186 Length: 337 84.6 0.059 3.7E-05 27.3 16.6 291 1-318 1-337 (337) 227 protein:vir:99523 Length: 311 84.3 0.061 3.8E-05 27.2 16.8 253 31-324 1-278 (311) 228 protein:vir:80491 Length: 467 83.7 0.067 4.1E-05 27.0 16.8 313 1-324 1-359 (467) 229 protein:vir:98856 Length: 343 81.4 0.086 5.3E-05 26.4 18.6 297 1-324 1-342 (343) 230 protein:vir:270 Length: 341 # 79.0 0.11 6.7E-05 25.9 15.2 299 1-324 1-340 (341) 231 protein:vir:3783 Length: 336 # 77.6 0.12 7.6E-05 25.6 19.1 290 1-324 1-336 (336) 232 protein:vir:103370 Length: 418 77.5 0.12 7.7E-05 25.5 14.6 307 1-323 31-418 (418) 233 protein:vir:5942 Length: 523 # 72.1 0.19 0.00012 24.6 12.6 290 1-317 171-523 (523) 234 protein:vir:2736 Length: 348 # 70.4 0.21 0.00013 24.3 20.0 285 31-316 1-348 (348) 235 protein:vir:96442 Length: 418 70.1 0.21 0.00013 24.3 12.5 308 1-324 12-416 (418) 236 protein:vir:106286 Length: 534 69.5 0.22 0.00014 24.2 17.3 303 1-324 36-523 (534) 237 protein:vir:96490 Length: 348 64.0 0.31 0.00019 23.4 20.0 285 31-316 1-348 (348) 238 protein:vir:102335 Length: 312 61.3 0.36 0.00022 23.0 20.6 275 31-317 1-312 (312) 239 protein:vir:107882 Length: 307 58.1 0.42 0.00026 22.7 16.5 264 30-315 1-307 (307) 240 protein:vir:106590 Length: 349 49.7 0.63 0.00039 21.7 21.8 289 2-313 1-349 (349) 241 protein:vir:78090 Length: 302 48.6 0.66 0.00041 21.6 15.1 271 31-319 1-302 (302) 242 protein:vir:79078 Length: 307 45.9 0.75 0.00047 21.3 15.8 270 30-315 1-307 (307) 243 protein:vir:4902 Length: 348 # 45.5 0.77 0.00048 21.2 18.9 285 31-316 1-348 (348) 244 protein:vir:98480 Length: 348 37.1 1.1 0.0007 20.3 18.6 285 29-314 1-348 (348) 245 protein:vir:104549 Length: 462 29.3 1.7 0.001 19.4 15.8 294 1-324 1-445 (462) 246 protein:vir:100603 Length: 529 27.4 1.8 0.0011 19.1 17.2 290 1-324 38-505 (529) 247 protein:vir:1991 Length: 305 # 23.3 2.3 0.0014 18.6 12.0 228 1-268 1-305 (305) No 1 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=100.00 E-value=3.8e-76 Score=434.04 Aligned_cols=324 Identities=99% Similarity=1.408 Sum_probs=313.0 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCc Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~ 80 (324) |||++++|.++++|+......+++++++++++++++++||++++++|++.+++.++|++++++++++++.++||+.++.+ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~ 80 (324) T protein:vir:93 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) T ss_pred CchhHHHHHHHHHHHHhhhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccc Q lcl|Aclame:pro 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIA 160 (324) Q Consensus 81 ~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~ 160 (324) .++|++|++.+|+++++|++++++++|++++++||+|+++||.++++++|.+++++++++++|+++|+|+|++..+.++. T Consensus 81 ~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~~~~~~ 160 (324) T protein:vir:93 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIA 160 (324) T ss_pred ceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999988888888 Q ss_pred cccccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeeccCCcceeecceeEeecCCCCC Q lcl|Aclame:pro 161 QSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNLK 240 (324) Q Consensus 161 ~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~ 240 (324) ......+....+..+++++.+++.++...++.+++|+||++++..|++++|++|+|++..+.+++|+|+||+++++...+ T Consensus 161 ~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~G~~~~~~~~~~~l~G~PVv~~~~~~~~ 240 (324) T protein:vir:93 161 QSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNLK 240 (324) T ss_pred ccccccceeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCCCeeecCCCCCcccceeeEeecCCCCC Confidence 77777777777888999999999999999999999999999999999999999999999999999999999998888889 Q ss_pred CceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCC Q lcl|Aclame:pro 241 RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSV 320 (324) Q Consensus 241 ~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~ 320 (324) ++.+++|||++++++.+++++++++++.++....++++.++++|++|+++||+++|+||.+.||+||++|+.+.+++++| T Consensus 241 ~~~i~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a~~~~~~~ 320 (324) T protein:vir:93 241 RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSV 320 (324) T ss_pred cceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccccCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCC Q lcl|Aclame:pro 321 PGEV 324 (324) Q Consensus 321 ~~~~ 324 (324) |||| T Consensus 321 ~~~~ 324 (324) T protein:vir:93 321 PGEV 324 (324) T ss_pred CCCC Confidence 9999 No 2 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=100.00 E-value=1.2e-75 Score=431.25 Aligned_cols=324 Identities=99% Similarity=1.408 Sum_probs=313.3 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCc Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~ 80 (324) |+|+++.+..+++|+.+....+++++++++.+++++++||++++++|++.+++.++|++++++++++++.++||+.++.+ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~ 80 (324) T protein:vir:97 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) T ss_pred CccchhHHHHHHHHHHhhhhhhhhccccccccCCCcceechhHHHHHHHHHHhhcchhhhcceeeccCCceEEEEEecCc Confidence 99999999999999999999999999999988999999999999999999999999999999999999999999999999 Q ss_pred ceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccc Q lcl|Aclame:pro 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIA 160 (324) Q Consensus 81 ~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~ 160 (324) .+.|++|++++|+++++|++++++++|++++++||+|+++|+.++++++|.+++++++++++|+++|+|+|++..+.++. T Consensus 81 ~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~~~gi~ 160 (324) T protein:vir:97 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIA 160 (324) T ss_pred ceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccCcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999998888888 Q ss_pred cccccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeeccCCcceeecceeEeecCCCCC Q lcl|Aclame:pro 161 QSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNLK 240 (324) Q Consensus 161 ~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~ 240 (324) ......+....+.+++++|.+++.++...++.+++|+||++++..|++++|.+|+|++.++..++|+|+||+++++.+.+ T Consensus 161 ~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~~~~~~~~~tl~G~PV~~~~~~~~~ 240 (324) T protein:vir:97 161 QSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDTLDGLPVVNLKSSNLK 240 (324) T ss_pred ccccccceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCceeecCCCCccccceeeEeecCCCCC Confidence 77777777777888999999999999999999999999999999999999999999999999999999999999888899 Q ss_pred CceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCC Q lcl|Aclame:pro 241 RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSV 320 (324) Q Consensus 241 ~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~ 320 (324) ++++++|||++++++++++++++++++.++....+.++.+|++|++|+++||+++|+||++.+|+||++|+++.+++++| T Consensus 241 ~~~~~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~ 320 (324) T protein:vir:97 241 RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSV 320 (324) T ss_pred cceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCCCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCC Q lcl|Aclame:pro 321 PGEV 324 (324) Q Consensus 321 ~~~~ 324 (324) |||| T Consensus 321 ~~~~ 324 (324) T protein:vir:97 321 PGEV 324 (324) T ss_pred CCCC Confidence 9999 No 3 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=100.00 E-value=3e-75 Score=429.11 Aligned_cols=324 Identities=100% Similarity=1.410 Sum_probs=312.7 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCc Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~ 80 (324) |+|.++.++++++|+.+....+.+++.+.+++++++++||+++.++|++.+++.++|++++++++++++.++||+.++.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~ 80 (324) T protein:vir:78 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) T ss_pred CCcchhhhHHHHHHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCc Confidence 99999999999999999999889999999989999999999999999999999999999999999999999999999999 Q ss_pred ceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccc Q lcl|Aclame:pro 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIA 160 (324) Q Consensus 81 ~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~ 160 (324) .++|++|++++|+++++|++++++++|++++++||+|+++|+.++++++|.++|++++++++|+++|+|+|++..+.++. T Consensus 81 ~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~~gi~ 160 (324) T protein:vir:78 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIA 160 (324) T ss_pred ceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcCcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999988888888 Q ss_pred cccccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeeccCCcceeecceeEeecCCCCC Q lcl|Aclame:pro 161 QSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNLK 240 (324) Q Consensus 161 ~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~ 240 (324) ......+....++.++++|.+++.++...++.+++|+||++++..|++++|.+|+|++..+.+++|+|+||+++++...+ T Consensus 161 ~~~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~~~~~~~~~~~l~G~PV~~~~~~~~~ 240 (324) T protein:vir:78 161 QSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNLK 240 (324) T ss_pred ccccccceeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCCeeecCCCCCcccceeeEeeCCCCCC Confidence 77777777777888999999999999999999999999999999999999999999999999999999999999888899 Q ss_pred CceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCC Q lcl|Aclame:pro 241 RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSV 320 (324) Q Consensus 241 ~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~ 320 (324) ++.+++|||++++++++++++++++++.+.....+.++.+|++|++|+++||+++|+||.+.||+||++|+++.|++++| T Consensus 241 ~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~ 320 (324) T protein:vir:78 241 RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSV 320 (324) T ss_pred cceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccccCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCC Q lcl|Aclame:pro 321 PGEV 324 (324) Q Consensus 321 ~~~~ 324 (324) |||| T Consensus 321 ~~~~ 324 (324) T protein:vir:78 321 PGEV 324 (324) T ss_pred CCCC Confidence 9999 No 4 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=100.00 E-value=3e-75 Score=429.11 Aligned_cols=324 Identities=100% Similarity=1.410 Sum_probs=312.7 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCc Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~ 80 (324) |+|.++.++++++|+.+....+.+++.+.+++++++++||+++.++|++.+++.++|++++++++++++.++||+.++.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~ 80 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) T ss_pred CCcchhhhHHHHHHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCc Confidence 99999999999999999999889999999989999999999999999999999999999999999999999999999999 Q ss_pred ceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccc Q lcl|Aclame:pro 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIA 160 (324) Q Consensus 81 ~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~ 160 (324) .++|++|++++|+++++|++++++++|++++++||+|+++|+.++++++|.++|++++++++|+++|+|+|++..+.++. T Consensus 81 ~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~~gi~ 160 (324) T protein:vir:96 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIA 160 (324) T ss_pred ceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcCcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999988888888 Q ss_pred cccccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeeccCCcceeecceeEeecCCCCC Q lcl|Aclame:pro 161 QSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNLK 240 (324) Q Consensus 161 ~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~ 240 (324) ......+....++.++++|.+++.++...++.+++|+||++++..|++++|.+|+|++..+.+++|+|+||+++++...+ T Consensus 161 ~~~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~~~~~~~~~~~l~G~PV~~~~~~~~~ 240 (324) T protein:vir:96 161 QSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNLK 240 (324) T ss_pred ccccccceeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCCeeecCCCCCcccceeeEeeCCCCCC Confidence 77777777777888999999999999999999999999999999999999999999999999999999999999888899 Q ss_pred CceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCC Q lcl|Aclame:pro 241 RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSV 320 (324) Q Consensus 241 ~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~ 320 (324) ++.+++|||++++++++++++++++++.+.....+.++.+|++|++|+++||+++|+||.+.||+||++|+++.|++++| T Consensus 241 ~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~ 320 (324) T protein:vir:96 241 RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSV 320 (324) T ss_pred cceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccccCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCC Q lcl|Aclame:pro 321 PGEV 324 (324) Q Consensus 321 ~~~~ 324 (324) |||| T Consensus 321 ~~~~ 324 (324) T protein:vir:96 321 PGEV 324 (324) T ss_pred CCCC Confidence 9999 No 5 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=100.00 E-value=5e-75 Score=427.91 Aligned_cols=324 Identities=99% Similarity=1.405 Sum_probs=312.3 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCc Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~ 80 (324) |+|+++++..+++|+......+..++++++.+++++++||++++++|++.+++.++|++++++++++++.++||+.++.+ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~ 80 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) T ss_pred CCcchhhhHHHHHHHHhhhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCc Confidence 99999999999999999999999999999888889999999999999999999999999999999999999999999999 Q ss_pred ceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccc Q lcl|Aclame:pro 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIA 160 (324) Q Consensus 81 ~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~ 160 (324) .+.|++|++.+|+++++|+++++.++|++++++||+|+++|+.++++++|.+++++++++++|+++|+|+|++..+.++. T Consensus 81 ~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~~~l~G~g~~~~~~~~~ 160 (324) T protein:vir:96 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIA 160 (324) T ss_pred ceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCCcCcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999988888888 Q ss_pred cccccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeeccCCcceeecceeEeecCCCCC Q lcl|Aclame:pro 161 QSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNLK 240 (324) Q Consensus 161 ~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~ 240 (324) ...........+.+++++|.+++.++...++.+++|+||++++..|++++|.+|+|++..+.+++|+|+||+++++...+ T Consensus 161 ~~~~~~~~~~~~~~~~~~i~~~~~~i~~~~~~~~~~i~n~~~~~~L~~lkd~~G~~~~~~~~~~~l~G~PV~~~~~~~~~ 240 (324) T protein:vir:96 161 QSIKKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNLK 240 (324) T ss_pred ccccccceecccccchHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCCCeeecCCCCCcccceeeEeecCCCCC Confidence 77776666777888999999999999999999999999999999999999999999999999999999999999888899 Q ss_pred CceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCC Q lcl|Aclame:pro 241 RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSV 320 (324) Q Consensus 241 ~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~ 320 (324) ++.+++|||++++++++++++++++++.++....+.++.++++|++|+++||+++|+||.+.+|+||++|+.+.++++++ T Consensus 241 ~~~~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a~~~~~~~ 320 (324) T protein:vir:96 241 RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSV 320 (324) T ss_pred cceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccccCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCC Q lcl|Aclame:pro 321 PGEV 324 (324) Q Consensus 321 ~~~~ 324 (324) |||| T Consensus 321 ~~~~ 324 (324) T protein:vir:96 321 PGEV 324 (324) T ss_pred CCCC Confidence 9999 No 6 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=100.00 E-value=1.3e-74 Score=425.73 Aligned_cols=324 Identities=99% Similarity=1.405 Sum_probs=312.8 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCc Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~ 80 (324) |+|.++.+.++|+|+......+.+++++++++++++++||++++++|++.+++.++|++++++++++++.++||+.++.+ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~ 80 (324) T protein:vir:10 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) T ss_pred CCCchHHHHHHHHHHHHhhccceecccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999899 Q ss_pred ceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccc Q lcl|Aclame:pro 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIA 160 (324) Q Consensus 81 ~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~ 160 (324) .+.|++|++++|+++++|+++++.++|++++++||+|+++|+.++++++|.+++++++++++|+++|+|+|++..+.++. T Consensus 81 ~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~a~l~G~g~~~~~~~i~ 160 (324) T protein:vir:10 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIA 160 (324) T ss_pred ceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCccCcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999998888888 Q ss_pred cccccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeeccCCcceeecceeEeecCCCCC Q lcl|Aclame:pro 161 QSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNLK 240 (324) Q Consensus 161 ~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~ 240 (324) ......+....++++++++.+++.++...++.+++|+||++++..|++++|.+|++++.++.+++|+|+||+++++.+.+ T Consensus 161 ~~~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~~~~~~~~~~~l~G~PV~~~~~~~~~ 240 (324) T protein:vir:10 161 QSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDTLDGLPVVNLKSSNLK 240 (324) T ss_pred ccccccceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCceeecCCCCccccceeEEeecCCCCC Confidence 77777777777889999999999999999999999999999999999999999999999999999999999999888899 Q ss_pred CceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCC Q lcl|Aclame:pro 241 RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSV 320 (324) Q Consensus 241 ~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~ 320 (324) ++.+++|||++++++.+++++++++++.++....++++.++++|++|++++|+++|+||.+.+|+||++|++++|++.+| T Consensus 241 ~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~ 320 (324) T protein:vir:10 241 RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSV 320 (324) T ss_pred cceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCC Q lcl|Aclame:pro 321 PGEV 324 (324) Q Consensus 321 ~~~~ 324 (324) |||| T Consensus 321 ~~~~ 324 (324) T protein:vir:10 321 PGEV 324 (324) T ss_pred CCCC Confidence 9999 No 7 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=100.00 E-value=1.6e-74 Score=425.09 Aligned_cols=324 Identities=98% Similarity=1.404 Sum_probs=313.0 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCc Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~ 80 (324) |+|+++++.++++|+......+.+++++++++++++++||++++++|++.+++.++|++++++++++++.++||+.++.+ T Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~ 80 (324) T protein:vir:99 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEPMEGTEKKFTFWADKP 80 (324) T ss_pred CCCchHhhHHHHHHHHHhhhhhhccccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccc Q lcl|Aclame:pro 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIA 160 (324) Q Consensus 81 ~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~ 160 (324) .+.|++|++.+|+++++|++++++++|++++++||+|+++|+.++++++|.+++++++++++|+++|+|+|++..+.++. T Consensus 81 ~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~ 160 (324) T protein:vir:99 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIA 160 (324) T ss_pred ceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCccCcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999998888888 Q ss_pred cccccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeeccCCcceeecceeEeecCCCCC Q lcl|Aclame:pro 161 QSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNLK 240 (324) Q Consensus 161 ~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~ 240 (324) ......+....++++++++.+++.++.+.++.+++|+|||+++..|++++|.+|++++.++.+++|+|+||+++++.+.+ T Consensus 161 ~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~~~~~~~~~~~l~G~PVv~~~~~~~~ 240 (324) T protein:vir:99 161 QSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDTLDGLPVVNLKSSNLK 240 (324) T ss_pred ccccccceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCceeecCCCCccccceeEEeecCCCCC Confidence 87777777777889999999999999999999999999999999999999999999999999999999999999888889 Q ss_pred CceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCC Q lcl|Aclame:pro 241 RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSV 320 (324) Q Consensus 241 ~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~ 320 (324) ++.+++|||++++++++++++++++++.++....++++.++++|++|++++|+++|+||.+.||+||++|+++++++.++ T Consensus 241 ~~~~i~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~~~~~~ 320 (324) T protein:vir:99 241 RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSV 320 (324) T ss_pred cceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCC Q lcl|Aclame:pro 321 PGEV 324 (324) Q Consensus 321 ~~~~ 324 (324) |||| T Consensus 321 ~~~~ 324 (324) T protein:vir:99 321 PGEV 324 (324) T ss_pred CCCC Confidence 9999 No 8 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=100.00 E-value=1.1e-63 Score=365.79 Aligned_cols=296 Identities=49% Similarity=0.800 Sum_probs=271.8 Q ss_pred hhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCCcceeeeccCccccccccc Q lcl|Aclame:pro 19 VKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTE-KKFTFWADKPGAYWVGEGQKIETSKAT 97 (324) Q Consensus 19 ~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~-~~ip~~~~~~~a~~v~Eg~~~~~~~~~ 97 (324) ...+.+++.+++++++++++||++++++|++.+++.++|++++++++++++. ..+|+..+.+.+.|++|++.+|+++++ T Consensus 1 m~~~~~~~~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~ 80 (297) T protein:vir:95 1 MTVQTFNPENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGISAYWVNETEKIKTDKPE 80 (297) T ss_pred CCccccccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCceeEEeecCccccccccc Confidence 3346678889999999999999999999999999999999999999987654 678888888999999999999999999 Q ss_pred eeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhhh Q lcl|Aclame:pro 98 WVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQD 177 (324) Q Consensus 98 ~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 177 (324) |++++++++|++++++||+|+++||.++++++|.+++++++++++|+++|+|+|++.+ .++...........++.++++ T Consensus 81 f~~v~l~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~-~gi~~~~~~~~~~~~~~~t~~ 159 (297) T protein:vir:95 81 VVPVTLKAHKLGIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFA-NSVAKAAKDANKVIGGPINYD 159 (297) T ss_pred eeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCccc-ccccccccccceecccccCHH Confidence 9999999999999999999999999999999999999999999999999999997654 444444445555566788999 Q ss_pred HHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeeccCCcceeecceeEeecCCCCCCceeEEeecccEEEEEe Q lcl|Aclame:pro 178 NIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIP 257 (324) Q Consensus 178 ~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~ 257 (324) ++.+++.++...++.+++|+||++++.+|++++|.+|+|++.+. +++++|+||+.+++.+.+++++++|||++++++.+ T Consensus 160 ~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~G~~i~~~~-~~~l~G~Pv~~~~~~~~~~~~~~~gd~s~~~~~~~ 238 (297) T protein:vir:95 160 NILKLQDALYDADVEPNAFVSKIQNRSALREARDGNKVSIYDKA-ANTIDGITTVDLKSARFEKGDLLAGDFDNLIYGVP 238 (297) T ss_pred HHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCceeecCC-CCcccceeeEeecCCCCCCceEEEEecccEEEEEe Confidence 99999999999999999999999999999999999999998654 56899999999888889999999999999999999 Q ss_pred cceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCC Q lcl|Aclame:pro 258 QLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKR 316 (324) Q Consensus 258 ~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~ 316 (324) ++++++++++.++....+.++..+++|++|++++|+++|+||++.+|+||++|+.++.. T Consensus 239 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~at~~ 297 (297) T protein:vir:95 239 YNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKLTPAERV 297 (297) T ss_pred cCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEeecCCC Confidence 99999999999999999999999999999999999999999999999999999988777 No 9 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=100.00 E-value=6.7e-63 Score=361.45 Aligned_cols=294 Identities=34% Similarity=0.523 Sum_probs=269.3 Q ss_pred HhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccCccccccccceeeE Q lcl|Aclame:pro 22 QVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNA 101 (324) Q Consensus 22 ~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v 101 (324) -..++++.+++++++++||++++++|++.+++.++|+++++++|++++..++|+.+ .+.+.|++|++++|+++++|+++ T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~f~~v 79 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKPEEEFTFMS-GVGAFWVDEAERIQTSKPTFTKA 79 (299) T ss_pred CCcCCCcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecCCCcEEEEEEc-CCceeeeecCccccccccceeEE Confidence 34678888888899999999999999999999999999999999999999999876 47799999999999999999999 Q ss_pred EeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhhhHHHH Q lcl|Aclame:pro 102 TMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIID 181 (324) Q Consensus 102 ~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~ 181 (324) ++.++|++++++||+|+++||.++++++|.+.|++++++++|+++|+|+|++.+.+.+.......+....+..+++++.+ T Consensus 80 ~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~gil~~~~~~~~~~~~~~~~~~~l~~ 159 (299) T protein:vir:41 80 KMRSKKMGVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNWNILKSATDASNLVEETANKYDDLNE 159 (299) T ss_pred EEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccccccccccceeeccccccHHHHHH Confidence 99999999999999999999999999999999999999999999999999877666555544455555667788999999 Q ss_pred HHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeeccC---CcceeecceeEeecCCCCCC--ceeEEeecccEEEEE Q lcl|Aclame:pro 182 LEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDR---NSDSLDGLPVVNLKSSNLKR--GELITGDFDKLIYGI 256 (324) Q Consensus 182 ~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~---~~~~l~G~pv~~~~~~~~~~--~~~i~gd~s~~~~~~ 256 (324) ++.++...++.+++|+||++++.+|++++|.+|+|++.+. ..++|+|+||++++.++... ..+++|||++++++. T Consensus 160 ~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~gdfs~~~i~~ 239 (299) T protein:vir:41 160 AIGLIEAEDLEPNGIATIRKQRVKYRSTKDGNGMPIFNTATSNGVDDVLGLPIAYTPKYTFGDKDISELVGDWNQAYYGI 239 (299) T ss_pred HHHhhhcccCCcCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCceecceeeEEecccCCCCCceEEEEEecccEEEEE Confidence 9999999999999999999999999999999999999653 34689999999988776543 348999999999999 Q ss_pred ecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCC Q lcl|Aclame:pro 257 PQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKR 316 (324) Q Consensus 257 ~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~ 316 (324) +++++++++++.+.....+.++.++++|++|++++|+++|+||++.||+||++|+.++|- T Consensus 240 ~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa~ 299 (299) T protein:vir:41 240 LRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKAGN 299 (299) T ss_pred ecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCC Confidence 999999999999999999999999999999999999999999999999999999999998 No 10 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=100.00 E-value=5.7e-62 Score=356.33 Aligned_cols=295 Identities=39% Similarity=0.625 Sum_probs=263.7 Q ss_pred hhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccCccccccccce Q lcl|Aclame:pro 19 VKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATW 98 (324) Q Consensus 19 ~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~ 98 (324) .+.+.+++.+++++++++++||++++++|++.+++.++|++++++++++++.++||+.++.+.+.|++|++++|+++++| T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~ 80 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSETERIQTSKPEY 80 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeecCccccccccee Confidence 55677788889999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccccc----c-cccccccccch Q lcl|Aclame:pro 99 VNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQ----S-IEKTNKVIKGD 173 (324) Q Consensus 99 ~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~----~-~~~~~~~~~~~ 173 (324) ++++++++|++++++||+|+++||.++++++|.++|++++++++|+++|+|+|++.+.+.... . .........+. T Consensus 81 ~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (304) T protein:vir:10 81 AQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGNVVTDTN 160 (304) T ss_pred eEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999999997654433221 1 12222234456 Q ss_pred hhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeeccCCcceeecceeEeecCCCC--CCceeEEeeccc Q lcl|Aclame:pro 174 FTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNL--KRGELITGDFDK 251 (324) Q Consensus 174 ~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~~~~~l~G~pv~~~~~~~~--~~~~~i~gd~s~ 251 (324) .++++|.+++.++...++.+++|+||++++..|++++|++|+|+|.+. +++|+|+||+++++++. +++.+++|||++ T Consensus 161 ~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~-~~~l~G~PV~~~~~~~~~~~~~~~~~gd~~~ 239 (304) T protein:vir:10 161 NLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDAN-GNEIMGLPLSYTGADVYDKKKSLALMGDWDY 239 (304) T ss_pred chHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCC-CccccceeeEEecccccCCCCcEEEEEehhh Confidence 789999999999999999999999999999999999999999998654 47899999998877654 456789999999 Q ss_pred EEEEEecceEEEEeecccee--ccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeec Q lcl|Aclame:pro 252 LIYGIPQLIEYKIDETAQLS--TVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPAD 314 (324) Q Consensus 252 ~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~ 314 (324) ++++++++++++++++.++. .+.+.++..+++|++|++++|+++|+|+.+.||+||++|+.+. T Consensus 240 ~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 240 ARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred EEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 99999999999999998765 4455678889999999999999999999999999999999988 No 11 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=100.00 E-value=5.7e-62 Score=356.33 Aligned_cols=295 Identities=39% Similarity=0.625 Sum_probs=263.7 Q ss_pred hhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccCccccccccce Q lcl|Aclame:pro 19 VKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATW 98 (324) Q Consensus 19 ~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~ 98 (324) .+.+.+++.+++++++++++||++++++|++.+++.++|++++++++++++.++||+.++.+.+.|++|++++|+++++| T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~~ 80 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSETERIQTSKPEY 80 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeecCccccccccee Confidence 55677788889999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccccc----c-cccccccccch Q lcl|Aclame:pro 99 VNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQ----S-IEKTNKVIKGD 173 (324) Q Consensus 99 ~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~----~-~~~~~~~~~~~ 173 (324) ++++++++|++++++||+|+++||.++++++|.++|++++++++|+++|+|+|++.+.+.... . .........+. T Consensus 81 ~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (304) T protein:vir:94 81 AQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGNVVTDTN 160 (304) T ss_pred eEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999999997654433221 1 12222234456 Q ss_pred hhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeeccCCcceeecceeEeecCCCC--CCceeEEeeccc Q lcl|Aclame:pro 174 FTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNL--KRGELITGDFDK 251 (324) Q Consensus 174 ~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~~~~~l~G~pv~~~~~~~~--~~~~~i~gd~s~ 251 (324) .++++|.+++.++...++.+++|+||++++..|++++|++|+|+|.+. +++|+|+||+++++++. +++.+++|||++ T Consensus 161 ~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~-~~~l~G~PV~~~~~~~~~~~~~~~~~gd~~~ 239 (304) T protein:vir:94 161 NLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDAN-GNEIMGLPLSYTGADVYDKKKSLALMGDWDY 239 (304) T ss_pred chHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCC-CccccceeeEEecccccCCCCcEEEEEehhh Confidence 789999999999999999999999999999999999999999998654 47899999998877654 456789999999 Q ss_pred EEEEEecceEEEEeecccee--ccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeec Q lcl|Aclame:pro 252 LIYGIPQLIEYKIDETAQLS--TVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPAD 314 (324) Q Consensus 252 ~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~ 314 (324) ++++++++++++++++.++. .+.+.++..+++|++|++++|+++|+|+.+.||+||++|+.+. T Consensus 240 ~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 240 ARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred EEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 99999999999999998765 4455678889999999999999999999999999999999988 No 12 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=100.00 E-value=3.9e-61 Score=351.79 Aligned_cols=304 Identities=19% Similarity=0.265 Sum_probs=264.9 Q ss_pred hhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccCccccccccce Q lcl|Aclame:pro 19 VKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATW 98 (324) Q Consensus 19 ~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~ 98 (324) .+.+++++..++++.++|+++|++++++|++.+++.++|++++++++++++.++||+.++.+.+.|++|++++|+++++| T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f 80 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGPTGISIPHWTGAVSASWTGEAERKPITKGSF 80 (330) T ss_pred CcccccchhhccccCCCcceechhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceeEecCCCcccccccee Confidence 55567788888888889999999999999999999999999999999999889999999999999999999999999999 Q ss_pred eeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccccccccc---------cccc Q lcl|Aclame:pro 99 VNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEK---------TNKV 169 (324) Q Consensus 99 ~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~---------~~~~ 169 (324) ++++++++|++++++||+|+++|+.++++++|.++|++++++++|+++|+|+|++.++.++.+.... .... T Consensus 81 ~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~~~~~~~~~~~~ 160 (330) T protein:vir:77 81 GKQELEPVKITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVVSLADTNLTTAS 160 (330) T ss_pred eEEEEeEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccceeecccccccc Confidence 9999999999999999999999999999999999999999999999999999998877666543221 1122 Q ss_pred ccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeeccC---------CcceeecceeEeecCCCCC Q lcl|Aclame:pro 170 IKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDR---------NSDSLDGLPVVNLKSSNLK 240 (324) Q Consensus 170 ~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~---------~~~~l~G~pv~~~~~~~~~ 240 (324) ......++++.+++.++...+..+++|+||++++..|+++||.+|+|+|.+. .+++|+|+||+++.+++.. T Consensus 161 ~~~~~~~~~l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~PV~~~~~~p~~ 240 (330) T protein:vir:77 161 GPQGNAYLAVNNALSLLVNSGKKWTGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVGAIREGRILGRPTYVADNVVNG 240 (330) T ss_pred cccchhHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHHhccCCceeecCccccccccccCCceecceeeEEeccccCC Confidence 2334567899999999999999999999999999999999999999998642 3468999999998776542 Q ss_pred ----CceeEEeecccEEEEEecceEEEEeeccceecccccc----ccchhhhhcCcEEEEEEEEeccEEeccCceEEEEe Q lcl|Aclame:pro 241 ----RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNED----GTPVNLFEQDMVALRATMHVALHIADDKAFAKLVP 312 (324) Q Consensus 241 ----~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~ 312 (324) +..+++|||++++++++++++++++++.++.+..+.. ...+++|++|+++||+++|+||++.||+||++|+. T Consensus 241 ~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~ 320 (330) T protein:vir:77 241 TVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDKDAFVKLTD 320 (330) T ss_pred CCCCccEEEEEecceEEEEEecCcEEEEeecceeeecccccccccccccchhhcCcEEEEEEEEeccEEecccceEEEEe Confidence 2348999999999999999999999999887665543 45778999999999999999999999999999999 Q ss_pred ecCCCCCCCCCC Q lcl|Aclame:pro 313 ADKRTDSVPGEV 324 (324) Q Consensus 313 ~~~~~~~~~~~~ 324 (324) ++|+ ++|-|- T Consensus 321 ~~~~--~~~~~~ 330 (330) T protein:vir:77 321 QVAG--TDPEEE 330 (330) T ss_pred ccCC--cCCCCC Confidence 8855 334444 No 13 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=100.00 E-value=2.9e-61 Score=352.44 Aligned_cols=312 Identities=12% Similarity=0.172 Sum_probs=247.9 Q ss_pred Cc------hhHHHHHHHHH-------------HHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhh- Q lcl|Aclame:pro 1 ME------QTQKLKLNLQH-------------FASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQL- 60 (324) Q Consensus 1 ~~------~~~~~k~~~~~-------------~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l- 60 (324) .. +--.+.++.+. ++............-.+++++||.+||+++.++|++.+++.++++++ T Consensus 20 ~~~~~~~~kg~~~~~~~~a~a~~~g~~~~a~~~a~~~~~~~~~~~a~~~~~~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg 99 (366) T protein:vir:57 20 IKEELQQYKGAGMTRMVMSIAAGKGNLADAAKFAATELGDTGLSMAISTAAGSGGALIPQNMQNEVIELLRDRTVVRILG 99 (366) T ss_pred cccccccccchhHHHHHHHHHhcccchhHHHHHHHHhhcchhhhhhccccccCCccccchhHHHHHHHHHhhhcchhhhc Confidence 00 00001111111 11111111111111223445688899999999999999999999998 Q ss_pred cceeecCCCceEEEEEeCCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 61 GKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYK 140 (324) Q Consensus 61 ~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~ 140 (324) ++++++.++.+++|+.++.+.++|++|++.+|+++++|++++++++|++++++||+|+++||.++++++|+++|++++++ T Consensus 100 ~~~v~~~~g~~~~p~~t~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~~~i~~~l~~a~~~ 179 (366) T protein:vir:57 100 ARSIPLPNGNLSMPRLSGGATAGYVGEGKDVVATGATFDDVKLSAKTMIALVPVSNQLIGRAGFNVEQLLLGDILSAIAT 179 (366) T ss_pred eeeeecCCCceEEEEEeCCcceeeeccCccccccccceeEEEEeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHH Confidence 78899988899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhccCccccccccccccccccccc---cchhhhh------HHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhc Q lcl|Aclame:pro 141 KFDEAGILNQGNNPFGKSIAQSIEKTNKVI---KGDFTQD------NIIDLEALLEDDELEANAFISKTQNRSLLRKIVD 211 (324) Q Consensus 141 ~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~---~~~~~~~------~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d 211 (324) ++|+++|+|+|++..+.++.+.....+... .+..+++ ++..+.......+...+.|+||+.++..|++++| T Consensus 180 ~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~lkd 259 (366) T protein:vir:57 180 REDKAFLRDDGTGDTPKGMKAVATAANRLVAWTGTAINLTTIDEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLFGLRD 259 (366) T ss_pred HHHHHhhccCCCCccccceeeccccccceeeccccccchhhHHHHHHHHHHhhhccccccccCEEEecHHHHHHHHhhhc Confidence 999999999998777777765433222111 1122222 2233333444556778999999999999999999 Q ss_pred cCCceeeccCCcceeecceeEeecCCCC------CCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhh Q lcl|Aclame:pro 212 PETKERIYDRNSDSLDGLPVVNLKSSNL------KRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFE 285 (324) Q Consensus 212 ~~g~~~~~~~~~~~l~G~pv~~~~~~~~------~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~ 285 (324) ++|+|+|.+...++|+|+||++++.++. +...++||||++++++.+++++++++++.+ +.+.++..+++|+ T Consensus 260 ~~G~~l~~~~~~g~l~G~Pvv~s~~ip~~~~~~~~~~~i~~gdfs~~~i~~~~~i~i~~~~ea~---~~~~~g~~~~~f~ 336 (366) T protein:vir:57 260 GNGNKVYPEMSQGILKGYPIQRTSAIPANLGDDGNESEIYFCDFNDVVIGEDGMMKVDFSTEAT---YKDADGQLVSAFA 336 (366) T ss_pred cCCceeccCCCCCeecceeeEEccccccccccCCCccEEEEEecceEEEEEecceEEEEeeccc---cccccccchhhhh Confidence 9999999888888999999999876653 235689999999999999999999999874 5667788889999 Q ss_pred cCcEEEEEEEEeccEEeccCceEEEEeecC Q lcl|Aclame:pro 286 QDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) Q Consensus 286 ~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~ 315 (324) +|++++|+++|+||++.||+||++|+++.| T Consensus 337 ~~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 337 RNQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred cCceeEEeeeeeCcEeeccccEEEEecccC Confidence 999999999999999999999999999999 No 14 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=100.00 E-value=6e-61 Score=350.72 Aligned_cols=307 Identities=18% Similarity=0.207 Sum_probs=263.8 Q ss_pred HHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccCcc Q lcl|Aclame:pro 11 LQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQK 90 (324) Q Consensus 11 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~ 90 (324) |++- ......+ +....+++++++++||++++++|++.+++.++|++++++++++++.++||+.++.+.++|++|+++ T Consensus 1 ~~~~--~~~~~e~-~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~ 77 (318) T protein:vir:24 1 MAAG--TAFAVDH-AQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWVGDVSAQWIGEGDM 77 (318) T ss_pred CCCC--CCCCHHH-HHhhcccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCcceEEecCCcc Confidence 2221 1222222 223345566788899999999999999999999999999999999999999999999999999999 Q ss_pred ccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccccccccccc--c Q lcl|Aclame:pro 91 IETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTN--K 168 (324) Q Consensus 91 ~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~--~ 168 (324) +|+++++|++++++++|+++++++|+|+++||.++++++|.++|++++++++|+++++|+|++.+.+.......... . T Consensus 78 ~~~~~~~f~~i~~~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~ 157 (318) T protein:vir:24 78 KPITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTYIGQTTKAISIADT 157 (318) T ss_pred ccccccceeEEEEeeEEEEEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999997654433322211111 1 Q ss_pred cccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeeccCC---------cceeecceeEeecCCCC Q lcl|Aclame:pro 169 VIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRN---------SDSLDGLPVVNLKSSNL 239 (324) Q Consensus 169 ~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~~---------~~~l~G~pv~~~~~~~~ 239 (324) ........+++.+++..+...++.+++|+|||+++..|+++||++|+|++.+.. ..+++|+|++++++.+. T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~~ 237 (318) T protein:vir:24 158 TGATTVYDQVAVNGLSLLVNDGKKWTHTLLDDITEPILNGAKDQNGRPLFIESTYGEAASPFRSGRIVARPTILSDHVVE 237 (318) T ss_pred ccccchHHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhccCCceeecCccccCccccccCceEEEEeeEEeCCCCC Confidence 122233446678899999999999999999999999999999999999986432 35799999999988877 Q ss_pred CCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCC Q lcl|Aclame:pro 240 KRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDS 319 (324) Q Consensus 240 ~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~ 319 (324) ++..+++|||++++++.++++.++++++.++....++++.++++|++|+++||+++|+||++.+|+||++|+.+++++.. T Consensus 238 ~~~~~~~gdfs~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~a~~~~ 317 (318) T protein:vir:24 238 GTTVGFMGDFSQLIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVALTNVVSGGGE 317 (318) T ss_pred CccEEEEeecceEEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeeccCCCC Confidence 77778999999999999999999999999999999999999999999999999999999999999999999999999877 Q ss_pred C Q lcl|Aclame:pro 320 V 320 (324) Q Consensus 320 ~ 320 (324) - T Consensus 318 ~ 318 (318) T protein:vir:24 318 G 318 (318) T ss_pred C Confidence 6 No 15 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=100.00 E-value=4.5e-61 Score=351.44 Aligned_cols=306 Identities=18% Similarity=0.198 Sum_probs=264.7 Q ss_pred HhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccCcccccc Q lcl|Aclame:pro 15 ASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETS 94 (324) Q Consensus 15 a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~ 94 (324) +.-. .+.+....+++++++++||++++++|++.+++.++|++++++++++++.++||+.+..+.+.|++|++++|++ T Consensus 1 ~g~~---~e~~~~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s 77 (397) T protein:vir:23 1 MGFS---ADHSQIAQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGIVIPHWTGDVSAQWIGEGDMKPIT 77 (397) T ss_pred CCcC---HHHHHHhhccCCCCccccchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceEEecCCcccccc Confidence 1111 1112222344455677889999999999999999999999999999999999999999999999999999999 Q ss_pred ccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchh Q lcl|Aclame:pro 95 KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDF 174 (324) Q Consensus 95 ~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~ 174 (324) +++|+++++++||++++++||+|+++|+.++++++|++++++++++++|+++|+|+|+.....++....... ...++.. T Consensus 78 ~~~f~~v~l~~~k~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~~~~~~~~~~~-~~~~~~~ 156 (397) T protein:vir:23 78 KGNMTKRDVHPAKIATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGTNAPSAFQGYLDQSNKT-QSISPNA 156 (397) T ss_pred ccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccCCcccccccccccce-eeecccc Confidence 999999999999999999999999999999999999999999999999999999999877766665544432 2334556 Q ss_pred hhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeeccCC---------cceeecceeEeecCCCCCCceeE Q lcl|Aclame:pro 175 TQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRN---------SDSLDGLPVVNLKSSNLKRGELI 245 (324) Q Consensus 175 ~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~~---------~~~l~G~pv~~~~~~~~~~~~~i 245 (324) .++++.+++.++...++.+++|+||++++..|+++||++|+|+|.+.. .++++|+||+++++++.++..++ T Consensus 157 ~~~~~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~tl~G~Pv~~s~~~~~g~~~~~ 236 (397) T protein:vir:23 157 YQGLGVSGLTKLVTDGKKWTHTLLDDTVEPVLNGSVDANGRPLFVESTYESLTTPFREGRILGRPTILSDHVAEGDVVGY 236 (397) T ss_pred hhHHHHHHHHhhhhcccCCCEEEEcHHHHHHHHHhhccCCceeecccccccccccccCceeeeeeEEEeCCCCCCceEEE Confidence 778888999999999999999999999999999999999999986532 35799999999988777666678 Q ss_pred EeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 246 TGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) Q Consensus 246 ~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~~~~~ 324 (324) +|||++++++.++++.++++++.++....++.+.++++|++|+++||+++|+||+++||+||++++..+.....+..+- T Consensus 237 ~gDfs~~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~ 315 (397) T protein:vir:23 237 AGDFSQIIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPVLTTYALDLD 315 (397) T ss_pred EeecceEEEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccceecccceEEEeeccccceeeeccc Confidence 9999999999999999999999999999999999999999999999999999999999999999998777655443322 No 16 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=100.00 E-value=3.6e-60 Score=346.47 Aligned_cols=314 Identities=15% Similarity=0.200 Sum_probs=254.3 Q ss_pred Cc-hhHHHHHHHHHHHh--------------hhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhh-ccee Q lcl|Aclame:pro 1 ME-QTQKLKLNLQHFAS--------------NNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQL-GKYE 64 (324) Q Consensus 1 ~~-~~~~~k~~~~~~a~--------------~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l-~~~~ 64 (324) ++ +...+..+++.... ....+...++.+..+...||.+||+++.++|++.+++.++++++ ++.+ T Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~l~~~~~i~~~~~~~~ 170 (435) T protein:vir:14 91 LEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTL 170 (435) T ss_pred hhhhHHHHHHHHHHHHhhcchhhHHHHHHHhhhhhhhhhhhcccCCcCCCccccchhHHHHHHHHHhhhchhhhhcceee Confidence 11 00111111111110 01111122344555666788899999999999999999999998 6788 Q ss_pred ecCCCceEEEEEeCCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcCh--HHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 65 PMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKF 142 (324) Q Consensus 65 ~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~--~~~~~~i~~~l~~ai~~~~ 142 (324) ++.++.++||+.++.+.+.|++|++.+|+++++|+++++.++|++++++||+|+++|+. ++++++|.++|++++++++ T Consensus 171 ~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~l~~~i~~~l~~ai~~~~ 250 (435) T protein:vir:14 171 PLSNGNITIPRLKGGAIVGYIGADTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGARE 250 (435) T ss_pred ecCCCceEEEEEeCCcceeeeccCccccccccceeEEEeeeEEEEEeehhhHHHHHhhccCHHHHHHHHHHHHHHHHHHH Confidence 98888999999999999999999999999999999999999999999999999999984 5699999999999999999 Q ss_pred HHHHHhccCcccccccccccccccc-----ccccchhhhhHHHHHHHHhhhh--cCCCcEEEEcHHHHHHHHHhhccCCc Q lcl|Aclame:pro 143 DEAGILNQGNNPFGKSIAQSIEKTN-----KVIKGDFTQDNIIDLEALLEDD--ELEANAFISKTQNRSLLRKIVDPETK 215 (324) Q Consensus 143 d~~~l~G~g~~~~~~~~~~~~~~~~-----~~~~~~~~~~~i~~~~~~l~~~--~~~~~~~v~~~~~~~~l~~~~d~~g~ 215 (324) |+++++|+|++..+.++........ ...+......++.+++..+... ++.+++|+||+.++..|++++|.+|+ T Consensus 251 d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~ 330 (435) T protein:vir:14 251 DKAFIRDDGTANTPKGLRFWALPSNVITASDASTLQKIETDLGKVILALENADANLTQPGWIMAPRTFRFLEGLRDGNGN 330 (435) T ss_pred HHHhhccCCCCccccceeecccccceeccccccchhhHHHHHHHHHHHhhhccccccCCEEEEcHHHHHHHHHhhccCCc Confidence 9999999998877777755332211 1122233456778888877654 55678999999999999999999999 Q ss_pred eeeccCCcceeecceeEeecCCCC------CCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcE Q lcl|Aclame:pro 216 ERIYDRNSDSLDGLPVVNLKSSNL------KRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMV 289 (324) Q Consensus 216 ~~~~~~~~~~l~G~pv~~~~~~~~------~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v 289 (324) |+|.....++|+|+||++++.++. +.+.+++|||++++++.+++++++++++.. +.+..+.++.+|++|++ T Consensus 331 ~l~~~~~~g~l~G~Pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~~~~~~~~~~~---~~~~~~~~~~~f~~~~~ 407 (435) T protein:vir:14 331 KVYPELANGMLKGYPVGKTTQVPINLGETGKESEIYFTDFGDVFIGEEETLEIDYSKEAT---YKDADGHMVSAFQRDQT 407 (435) T ss_pred eeccCCCCCeeecceeEeeccccccccCCCccceEEEeecccEEEEEecccEEEEecccc---ccccccchhhhhhcChh Confidence 999877788999999999876543 345789999999999999999999999875 45556778899999999 Q ss_pred EEEEEEEeccEEeccCceEEEEeecCCC Q lcl|Aclame:pro 290 ALRATMHVALHIADDKAFAKLVPADKRT 317 (324) Q Consensus 290 ~~r~~~r~d~~v~~~~A~~~l~~~~~~~ 317 (324) +||+++|+||++.+|+||++|++++|+. T Consensus 408 ~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:14 408 LIRVIAKNDFGPRHVESIAVLAGVAWGA 435 (435) T ss_pred heeeeeeeCceeecccceEEEecCCCCC Confidence 9999999999999999999999999999 No 17 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=100.00 E-value=6.1e-60 Score=345.22 Aligned_cols=314 Identities=15% Similarity=0.181 Sum_probs=253.2 Q ss_pred CchhHHHHHHHHHHHh--------------hhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhh-cceee Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFAS--------------NNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQL-GKYEP 65 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~--------------~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l-~~~~~ 65 (324) ..+...+...++.... ....+...++.+..++..||.+||+++.++|++.+++.++++++ +++++ T Consensus 92 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~~~v~ 171 (435) T protein:vir:80 92 EVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLP 171 (435) T ss_pred hhhHHHHHHHHHHHHhccchhHHHHHHHHhhhhhhhhhhhhcccCCCCCccccchhHHHHHHHHHhhhchhhhccceeee Confidence 1111111111111110 01111112334455666788899999999999999999999998 67889 Q ss_pred cCCCceEEEEEeCCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcCh--HHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 66 MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFD 143 (324) Q Consensus 66 ~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~--~~~~~~i~~~l~~ai~~~~d 143 (324) +.++.++||+.++.+.+.|++|++.+|+++++|+++++.++|++++++||+|+++||. ++++++|.++++++++.++| T Consensus 172 ~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~l~~~i~~~l~~a~~~~~d 251 (435) T protein:vir:80 172 LSNGNITIPRLKGGAIVGYIGADTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGARED 251 (435) T ss_pred cCCCceEEEEEeCCcceeeeccCccccccccceeeEEEeeEEEEEeehhhHHHHHhhcccHHHHHHHHHHHHHHHHHHHH Confidence 9998999999999999999999999999999999999999999999999999999984 57999999999999999999 Q ss_pred HHHHhccCccccccccccccccccc--c---ccchhhhhHHHHHHHHhhhh--cCCCcEEEEcHHHHHHHHHhhccCCce Q lcl|Aclame:pro 144 EAGILNQGNNPFGKSIAQSIEKTNK--V---IKGDFTQDNIIDLEALLEDD--ELEANAFISKTQNRSLLRKIVDPETKE 216 (324) Q Consensus 144 ~~~l~G~g~~~~~~~~~~~~~~~~~--~---~~~~~~~~~i~~~~~~l~~~--~~~~~~~v~~~~~~~~l~~~~d~~g~~ 216 (324) +++|+|+|++..|.++......... . ........++.+++..+... ++.+++|+||+.++..|++++|++|+| T Consensus 252 ~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~ 331 (435) T protein:vir:80 252 KAFIRDDGTANTPKGLRFWALPGNVITASDGSTLQKIETDLGKAILALENADANLTQPGWIMAPRTFRFLEGLRDGNGNK 331 (435) T ss_pred HHhhccCCCCCcccceeecccccceeecccccchhhHHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHhhhccCCce Confidence 9999999987777776554322111 1 11223345677777777644 556789999999999999999999999 Q ss_pred eeccCCcceeecceeEeecCCCC------CCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEE Q lcl|Aclame:pro 217 RIYDRNSDSLDGLPVVNLKSSNL------KRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVA 290 (324) Q Consensus 217 ~~~~~~~~~l~G~pv~~~~~~~~------~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~ 290 (324) +|.....++|+|+||+++..++. +.+.++||||++++++.+++++++++++.+. .+..+..+++|++|+++ T Consensus 332 l~~~~~~~~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~~~i~~~~~~~~---~~~~~~~~~~f~~n~~~ 408 (435) T protein:vir:80 332 VYPELANGMLKGYPVGKTTQVPINLGEAGKESEIYFTDFGDVFIGEEETLEIDYSKEATY---KDADGHMVSAFQRDQTL 408 (435) T ss_pred eccCCCCCeEeeeeeEEeccccccccCCCCcceEEEEEcccEEEEeecceEEEEeccccc---cccccchhhhhhcCcce Confidence 99877788999999998876543 3357899999999999999999999998754 45667788999999999 Q ss_pred EEEEEEeccEEeccCceEEEEeecCCC Q lcl|Aclame:pro 291 LRATMHVALHIADDKAFAKLVPADKRT 317 (324) Q Consensus 291 ~r~~~r~d~~v~~~~A~~~l~~~~~~~ 317 (324) ||++.|+||++.||+||+.|++..|+. T Consensus 409 ~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:80 409 IRVIAKNDFGPRHVESIAVLSGVAWGA 435 (435) T ss_pred eeeeeeeCcEeecccceEEEeccCCCC Confidence 999999999999999999999999999 No 18 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=100.00 E-value=4.2e-60 Score=346.13 Aligned_cols=312 Identities=13% Similarity=0.160 Sum_probs=246.0 Q ss_pred CchhHH----HHHHHHHH-------------HhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhh-cc Q lcl|Aclame:pro 1 MEQTQK----LKLNLQHF-------------ASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQL-GK 62 (324) Q Consensus 1 ~~~~~~----~k~~~~~~-------------a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l-~~ 62 (324) ++..+. +....+.+ +..............+.+++||.+||+++.++|++.+++.++|+++ ++ T Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~liP~~~~~~ii~~l~~~~~l~~~~~~ 162 (428) T protein:vir:10 83 AEPKQYTGAGMTRMVMSIAAAQGNLQDAAKFASDELNDQSVSMAISTAAGSGGVLIPQNIHSEVIELLRDRTIVRKLGAR 162 (428) T ss_pred cccchhhhHHHHHHHHHHHHhhhhHHHHHHHhhhhhhhhhHhhhhcccccCCccccchhHHHHHHHHHhhhchhhhhcce Confidence 111111 11111111 0000111111111223344678899999999999999999999999 67 Q ss_pred eeecCCCceEEEEEeCCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 63 YEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKF 142 (324) Q Consensus 63 ~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~ 142 (324) ++++.++.++||+.++.+.+.|++|++.+|+++++|+++++.++|++++++||+|+++||.++++++|.++|+++++.++ T Consensus 163 ~~~~~~g~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~is~ell~ds~~~l~~~i~~~l~~ai~~~~ 242 (428) T protein:vir:10 163 SIPLPNGNMSLPRLAGGATASYTGENQDAKVSEARFDDVKLTAKTMIAMVPISNALIGRAGFNVEQLVLQDILTAISVRE 242 (428) T ss_pred eeecCCcceEEEEEeCCcceeeeccCccccccccceeeEEeeeEEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHH Confidence 78888888999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhccCccccccccccccccccc----cccchhhhhHH------HHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhcc Q lcl|Aclame:pro 143 DEAGILNQGNNPFGKSIAQSIEKTNK----VIKGDFTQDNI------IDLEALLEDDELEANAFISKTQNRSLLRKIVDP 212 (324) Q Consensus 143 d~~~l~G~g~~~~~~~~~~~~~~~~~----~~~~~~~~~~i------~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~ 212 (324) |+++|+|+|++..|.|+.+....... ......+.+.+ ..+.......+..+++|+||+.++..|++++|+ T Consensus 243 d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~ 322 (428) T protein:vir:10 243 DKAFMRDDGTGDTPIGMKARATQWNRLLPWAADAAVNLDTIDTYLDSIILMSMDGNSNMISSGWGMSNRTYMKLFGLRDG 322 (428) T ss_pred HHHHhccCCCCccccccccccccccccccccccccccHHHHHHHHHHHHHhhhccccccccCEEEEcHHHHHHHHHhhcc Confidence 99999999998777777654332211 11122222222 223334455566788999999999999999999 Q ss_pred CCceeeccCCcceeecceeEeecCCCC------CCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhc Q lcl|Aclame:pro 213 ETKERIYDRNSDSLDGLPVVNLKSSNL------KRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQ 286 (324) Q Consensus 213 ~g~~~~~~~~~~~l~G~pv~~~~~~~~------~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~ 286 (324) +|+|+|.+...++|+|+||+++..++. ..+.++||||++++++.++++++++++++. +.+..+..+++|++ T Consensus 323 ~G~~i~~~~~~g~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~i~i~~~~~~~---~~~~~~~~~~~f~~ 399 (428) T protein:vir:10 323 NGNKVYPEMAQGMLKGYPIQRTSAIPANLGEGGKESEIYFADFNDVVIGEDGNMKVDFSKEAS---YIDTDGKLVSAFSR 399 (428) T ss_pred CCceeccCCCCCeeeceeeEEeccccccccCCCccceEEEEecceEEEEEecceEEEeecccc---cccccccccchhhc Confidence 999999888888999999998876543 344689999999999999999999999875 34455677789999 Q ss_pred CcEEEEEEEEeccEEeccCceEEEEeecC Q lcl|Aclame:pro 287 DMVALRATMHVALHIADDKAFAKLVPADK 315 (324) Q Consensus 287 ~~v~~r~~~r~d~~v~~~~A~~~l~~~~~ 315 (324) |+++||+++|+||++.||+||+.+++..| T Consensus 400 ~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 400 NQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred chhheeeeeeeCceeeccceEEEEeccCC Confidence 99999999999999999999999999999 No 19 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=100.00 E-value=1e-59 Score=344.02 Aligned_cols=305 Identities=18% Similarity=0.234 Sum_probs=255.1 Q ss_pred HHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccCcccc Q lcl|Aclame:pro 13 HFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIE 92 (324) Q Consensus 13 ~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~ 92 (324) ..+. ....-+.+....+++++++++||++++++|++.+++.++|++++++++++++.++||+.++.+.+.|++|++++| T Consensus 1 ~~~~-~~~~~~~~~~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~ 79 (320) T protein:vir:10 1 MAAG-TAFQVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWIGDVSAQWIGEGDMKP 79 (320) T ss_pred CCCC-ccCCHHHHHhhccccccccccccHHHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEecCCcccc Confidence 1111 111112222334556677889999999999999999999999999999999999999999999999999999999 Q ss_pred ccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccc--ccccc--c Q lcl|Aclame:pro 93 TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQS--IEKTN--K 168 (324) Q Consensus 93 ~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~--~~~~~--~ 168 (324) +++++|++++++++|++++++||+|+++||.++++++|.+++++++++++|+++|+|+|++.+....... ..... . T Consensus 80 ~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~ 159 (320) T protein:vir:10 80 ITKGNMTSQNIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTKSVSLADPGG 159 (320) T ss_pred ccccceeEEEEeeEEEEEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCcccccccccccceeccc Confidence 9999999999999999999999999999999999999999999999999999999999976544332211 11111 1 Q ss_pred cccchhh--hhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeeccC---------CcceeecceeEeecCC Q lcl|Aclame:pro 169 VIKGDFT--QDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDR---------NSDSLDGLPVVNLKSS 237 (324) Q Consensus 169 ~~~~~~~--~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~---------~~~~l~G~pv~~~~~~ 237 (324) ...+.+. .+++.+++..+...+..+++|+|||+++.+|+++||++|+|++... ..++++|+||++++.+ T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~ 239 (320) T protein:vir:10 160 ATASDLTAYDAVAVNGLSLLVNAKKKWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSPFRAGRIVSRPTILSDHV 239 (320) T ss_pred ccccccccHHHHHHHHHhhhhcccCCCcEEEEcHHHHHHHHHhhccCCceeeccccccCccccccCceeeeeeeEecCCC Confidence 1111222 2346788888999999999999999999999999999999998632 2357999999998876 Q ss_pred CCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCC Q lcl|Aclame:pro 238 NLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRT 317 (324) Q Consensus 238 ~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~ 317 (324) +.++..+++|||++++++.+++++++++++.++....+.++.++++|++|+++||+++|+||++.||+||++|+.+++ | T Consensus 240 ~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~~~~a-p 318 (320) T protein:vir:10 240 ADGTTVGYMGDFRNVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDAFVKLTNVVT-P 318 (320) T ss_pred CCCceEEEEeecceEEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEEecccceEEEEeccC-C Confidence 666556789999999999999999999999999999999999999999999999999999999999999999998886 4 Q ss_pred CC Q lcl|Aclame:pro 318 DS 319 (324) Q Consensus 318 ~~ 319 (324) ++ T Consensus 319 ~~ 320 (320) T protein:vir:10 319 DA 320 (320) T ss_pred CC Confidence 44 No 20 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=100.00 E-value=5e-60 Score=345.67 Aligned_cols=308 Identities=19% Similarity=0.237 Sum_probs=257.4 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCc Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~ 80 (324) ||+... .++ ....+.++.++ +++++|++||++++++|++.+++.+++++++++++++++.+++|+.++.+ T Consensus 3 ~~~~r~-~~~--------~~~~e~~a~~~-~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~ 72 (326) T protein:vir:42 3 VNPDRT-TPF--------LGVNDPKVAQT-GDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMGTTGQKIPHWTGDV 72 (326) T ss_pred CCccch-hhh--------cCcchhhheec-cccCCcceechhhHHHHHHHHHhcchhhhhcceeeccCCceEEEEEeCCc Confidence 333321 111 11123344333 33456778999999999999999999999999999999999999999999 Q ss_pred ceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccc Q lcl|Aclame:pro 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIA 160 (324) Q Consensus 81 ~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~ 160 (324) .+.|++|++++|+++++|++++++++|+++++++|+|+++||.++++++|.++|++++++++|+++|+|+|++.+.+... T Consensus 73 ~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p~gi~~ 152 (326) T protein:vir:42 73 SASWIGEGDMKPITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAINGTDSPFPTFLAQ 152 (326) T ss_pred ceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999875544332 Q ss_pred ccccc-----ccccccchhhhhH--HHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeeccC---------Ccc Q lcl|Aclame:pro 161 QSIEK-----TNKVIKGDFTQDN--IIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDR---------NSD 224 (324) Q Consensus 161 ~~~~~-----~~~~~~~~~~~~~--i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~---------~~~ 224 (324) ..... ......+..+..+ +.+++..+...++.+++|+||++++..|+++||++|+|+|.+. ..+ T Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~ 232 (326) T protein:vir:42 153 TTKEVSLVDPDGTGSNADLTVYDAVAVNALSLLVNAGKKWTHTLLDDITEPILNGAKDKSGRPLFIESTYTEENSPFRLG 232 (326) T ss_pred cccccceeecccccccccchhHHHHHHHHHhhhhhhccCccEEEEeHHHHHHHHHhhccCCceeeccccccCccccccCc Confidence 22111 1111222233333 4566677778888999999999999999999999999998653 234 Q ss_pred eeecceeEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEecc Q lcl|Aclame:pro 225 SLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADD 304 (324) Q Consensus 225 ~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~ 304 (324) +++|+||++++..+.+...+++|||++++++.+++++++++++.++....++++.++++|++|+++||+++|+||++.|| T Consensus 233 ~l~G~pv~~~~~~~~~~~~~~~Gd~s~~~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v~~~ 312 (326) T protein:vir:42 233 RIVARPTILSDHVASGTVVGYQGDFRQLVWGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHCNDK 312 (326) T ss_pred eeeeeeEEEcCCCCCCceEEEEeecceEEEEEecceEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEecc Confidence 79999999988877766667899999999999999999999999999999999999999999999999999999999999 Q ss_pred CceEEEEeecCCCC Q lcl|Aclame:pro 305 KAFAKLVPADKRTD 318 (324) Q Consensus 305 ~A~~~l~~~~~~~~ 318 (324) +||++|+.++++.. T Consensus 313 ~a~~~l~~~~~~~~ 326 (326) T protein:vir:42 313 DAFVKLTNVDATEA 326 (326) T ss_pred cceEEEeeccccCC Confidence 99999999999877 No 21 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=100.00 E-value=1.6e-59 Score=342.85 Aligned_cols=306 Identities=13% Similarity=0.065 Sum_probs=254.3 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCc Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~ 80 (324) .+....+..++++.......+.+.++.+.++.++||++||++++++|++.+++.++|++++++++++++.+++|+..+++ T Consensus 80 ~e~~~a~~~~l~~g~~~~~~~~e~~a~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 159 (407) T protein:vir:48 80 SEHKEAFIGFMRKGREDGLRELERKALQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATVITLGGSDYKKLVNLGGT 159 (407) T ss_pred hHHHHHHHHHHhccchhhhhHHHHHhhhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEecCCc Confidence 22222233334433334444455666777777889999999999999999999999999999999999999999999999 Q ss_pred ceeeeccCccccccc-cceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccc Q lcl|Aclame:pro 81 GAYWVGEGQKIETSK-ATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSI 159 (324) Q Consensus 81 ~a~~v~Eg~~~~~~~-~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~ 159 (324) .+.|++|++.+|+++ ++|+++++.++|++++++||+|+++||.++++++|.++|+++++.++|+++++|+|++.+.+.+ T Consensus 160 ~a~~v~E~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~~~p~Gil 239 (407) T protein:vir:48 160 TSGWVGETDARPETATSKLGLIEPFMGEIYGNPQATQKMLDDAFFNVEDWINSELALEFAEQEEIAFTSGDGSKKPKGFL 239 (407) T ss_pred ceeeecccccccccccccceeEEeeeeeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceee Confidence 999999999999865 7999999999999999999999999999999999999999999999999999999987544333 Q ss_pred ccccc-------------cccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc----CC Q lcl|Aclame:pro 160 AQSIE-------------KTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RN 222 (324) Q Consensus 160 ~~~~~-------------~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~~ 222 (324) ..... ......++.+++++|.+++..|...++.+++|+||++++..|++++|.+|||+|++ +. T Consensus 240 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lkD~~Gr~l~~~~~~~g~ 319 (407) T protein:vir:48 240 AYESTDEDDKTRAFGKLQHIASGAASGVTADAIIKLIYTLRKAHRSGAKFMMNNSSLFAIRLLKDNDGNYLWRPGIELGQ 319 (407) T ss_pred ecccccccccccccccccccccccccccChHHHHHHHHhhchhhhcCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCC Confidence 22111 11223445678999999999999999999999999999999999999999999864 45 Q ss_pred cceeecceeEeecCCCC---CCceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEec Q lcl|Aclame:pro 223 SDSLDGLPVVNLKSSNL---KRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVA 298 (324) Q Consensus 223 ~~~l~G~pv~~~~~~~~---~~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d 298 (324) +++|+|+||+++++++. ....++||||+. +.++++.++++..++ +|++|++.||++.|+| T Consensus 320 ~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~i~~d~----------------~~~~~~~~~~~~~r~d 383 (407) T protein:vir:48 320 PSSLAGYGIVENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRDP----------------YTNKPFVGFYTTKRTG 383 (407) T ss_pred CceecceeeEEecCcCCccCCccEEEEEeccccEEEEEeeceEEEeec----------------cccCCcEEEEEEEEec Confidence 67899999999876553 334578899986 667888888886542 4678999999999999 Q ss_pred cEEeccCceEEEEeecCCCCCCCC Q lcl|Aclame:pro 299 LHIADDKAFAKLVPADKRTDSVPG 322 (324) Q Consensus 299 ~~v~~~~A~~~l~~~~~~~~~~~~ 322 (324) +++++|+||++|+.+++......+ T Consensus 384 ~~v~~~~a~~~l~~~aa~~~~~~~ 407 (407) T protein:vir:48 384 GMLVDSQAIKLMKIGAATRQKAAA 407 (407) T ss_pred cEEecccceEEEEeeccCCCCCCC Confidence 999999999999999998777766 No 22 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=100.00 E-value=1.2e-59 Score=343.53 Aligned_cols=292 Identities=16% Similarity=0.144 Sum_probs=247.4 Q ss_pred ccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccCccccccccceeeEEeehe Q lcl|Aclame:pro 27 DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAF 106 (324) Q Consensus 27 ~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~ 106 (324) ....++++||++||++++++|++.+++.|++++++++++++++.++||+.++.+.++|++|++.+|+++++|+++++.++ T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~ 80 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQPI 80 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeCCccccccccceeeeEeeee Confidence 33456667999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeEEeeeehHHHhhcChHH----HHHHHHHHHHHHHHHHHHHHHHhccCc--ccccccccccccc-ccccccchhhhhHH Q lcl|Aclame:pro 107 KLGVILPVTKEFLNYTYSQ----FFEEMKPMIAEAFYKKFDEAGILNQGN--NPFGKSIAQSIEK-TNKVIKGDFTQDNI 179 (324) Q Consensus 107 k~~~~~~iS~e~l~ds~~~----~~~~i~~~l~~ai~~~~d~~~l~G~g~--~~~~~~~~~~~~~-~~~~~~~~~~~~~i 179 (324) |++++++||+|+++++..+ ++++|.+++++++++++|+++|+|++. +..+.++...... .+....+...++++ T Consensus 81 kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 160 (315) T protein:vir:80 81 KVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTKNIVDATDSATADL 160 (315) T ss_pred eEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccccccccccccccceeeccccchHHH Confidence 9999999999999988765 789999999999999999999999763 2233333332222 22333445567899 Q ss_pred HHHHHHhhhh-cCCCcEEEEcHHHHHHHHHhhccCCce-----eec---cCCcceeecceeEeecCCCCC-------Cce Q lcl|Aclame:pro 180 IDLEALLEDD-ELEANAFISKTQNRSLLRKIVDPETKE-----RIY---DRNSDSLDGLPVVNLKSSNLK-------RGE 243 (324) Q Consensus 180 ~~~~~~l~~~-~~~~~~~v~~~~~~~~l~~~~d~~g~~-----~~~---~~~~~~l~G~pv~~~~~~~~~-------~~~ 243 (324) .+++.++... +..+++|+|||+++..|+++++.+|++ ++. .+.+++|+|+||+++.+++.. ... T Consensus 161 ~~~~~~~~~~~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~~~~g~~~tl~G~PV~~~~~~~~~~~~~~~~~~~ 240 (315) T protein:vir:80 161 VKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASGVK 240 (315) T ss_pred HHHHHHHhhccCccceEEEEcHHHHHHHHHHhhccCCcccccccccccccCCCceecceeeEecCcCCcccccccccccE Confidence 9999888654 445678999999999999998877654 332 234579999999998776532 345 Q ss_pred eEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCCCCC Q lcl|Aclame:pro 244 LITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGE 323 (324) Q Consensus 244 ~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~~~~ 323 (324) +++|||++++++.+++++++++++.+ .++..+++|++|+++||+++|+||.+.||+||++|+.++++...+|+| T Consensus 241 ~~~GDfs~~~~g~~~~~~i~i~~~~~------~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a~~~~~~~~ 314 (315) T protein:vir:80 241 AIVGDFSRVHWGFQRNFPIELIEYGD------PDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPNPPAE 314 (315) T ss_pred EEEeecccEEEEEecCeeEEEecccc------ccCcccchhhcCcEEEEEEEEecceeecccceEEEeeccCCCCCCCCC Confidence 78999999999999999999988653 456678999999999999999999999999999999999999999999 Q ss_pred C Q lcl|Aclame:pro 324 V 324 (324) Q Consensus 324 ~ 324 (324) - T Consensus 315 ~ 315 (315) T protein:vir:80 315 N 315 (315) T ss_pred C Confidence 9 No 23 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=100.00 E-value=9.4e-59 Score=338.69 Aligned_cols=298 Identities=14% Similarity=0.068 Sum_probs=248.2 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCc Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~ 80 (324) .+....+..+++..........+.++...++.++||.+||++++++|++.+++.++|++++++++++++.+++|+..+++ T Consensus 81 ~e~~~a~~~~lr~~~~~~~~~~e~~a~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 160 (401) T protein:vir:44 81 AEHKDAFVGFLRKGREDGLRDLERKALQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQEATVITVGGSDYKKLVNLGGT 160 (401) T ss_pred HHHHHHHHHHHhhhhhhhhHHHHHHHhhcCCCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEEEecCCc Confidence 33333334444443334444445566666677788999999999999999999999999999999999999999999999 Q ss_pred ceeeeccCcccccc-ccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccc Q lcl|Aclame:pro 81 GAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSI 159 (324) Q Consensus 81 ~a~~v~Eg~~~~~~-~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~ 159 (324) .+.|++|++.+|.+ .++|+++++.++|++++++||+|+++||.++++++|.++|+++++.++|.++|+|+|++.+ .|+ T Consensus 161 ~a~wv~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~ai~~~~~~~~l~G~G~~~p-~Gi 239 (401) T protein:vir:44 161 ASGWVGETDTRSQTATSRLGLIEPFMGEIYGNPQATQKMLDDAFFNVEAWINSELATEFAEQEEIAFTTGDGTKKP-KGF 239 (401) T ss_pred cceeeccccccCccccccceeeeeehhheeeehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCcc-cee Confidence 99999999999875 5799999999999999999999999999999999999999999999999999999998654 443 Q ss_pred cccccc--------------ccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc----C Q lcl|Aclame:pro 160 AQSIEK--------------TNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----R 221 (324) Q Consensus 160 ~~~~~~--------------~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~ 221 (324) ...... .....++.++++++.++++.|...++.+++|+||++++..|++++|.+|+|+|.+ + T Consensus 240 l~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~g 319 (401) T protein:vir:44 240 LAYESTEESDKARAFGKLQHIVSGEATAVTADAIIKLIYTLRKAHRTGAKFMMNNNSLFAIRLLKDTEGNYLWRPGLELG 319 (401) T ss_pred eccccccccccccccccccccccccccccCHHHHHHHHHhcchhhhcCCEEEEcHHHHHHHHHhhccCCceeecCCcCCC Confidence 322111 1122345678999999999999999999999999999999999999999999864 4 Q ss_pred CcceeecceeEeecCCCC---CCceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEe Q lcl|Aclame:pro 222 NSDSLDGLPVVNLKSSNL---KRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHV 297 (324) Q Consensus 222 ~~~~l~G~pv~~~~~~~~---~~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~ 297 (324) .+++|+|+||+++.+++. +...++||||++ +.++++.++++..++ +|++|++.||++.|+ T Consensus 320 ~~~~l~G~PVv~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~~~~~~----------------~~~~~~v~~~a~~r~ 383 (401) T protein:vir:44 320 QPSSLAGYGIAENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRDP----------------YTNKPFVGFYTTKRT 383 (401) T ss_pred CCceecceeeEEecCcCCccCCccEEEEeehhccEEEEEecceEEeeec----------------cccCCcEEEEEEEEe Confidence 567899999998876543 334578899986 568888888876542 478999999999999 Q ss_pred ccEEeccCceEEEEeecC Q lcl|Aclame:pro 298 ALHIADDKAFAKLVPADK 315 (324) Q Consensus 298 d~~v~~~~A~~~l~~~~~ 315 (324) |+++.+++||++|+.+++ T Consensus 384 d~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 384 GGMLVDSQAIKLLKIAAA 401 (401) T ss_pred ccEEecccceEEEEeecC Confidence 999999999999999999 No 24 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=100.00 E-value=4e-58 Score=335.25 Aligned_cols=282 Identities=16% Similarity=0.190 Sum_probs=240.0 Q ss_pred cccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccCccccccccceeeEEeehee Q lcl|Aclame:pro 28 NVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFK 107 (324) Q Consensus 28 ~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k 107 (324) =..+++++|.+||++++.+|++.+++.|++++++++++++++..++|+.++.+.++|++|++++|+++++|+++++++|| T Consensus 1 ma~~t~~~G~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~k 80 (300) T protein:vir:95 1 MSEAQLSKGNLFNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQREFVFDFDSDIDIVAENGKKTHGGVSLDPVTIVPLK 80 (300) T ss_pred CcccccCCcceechhhHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcceEEeeCCcccccccccceeeEeeeEE Confidence 12344557889999999999999999999999999999998889999999999999999999999999999999999999 Q ss_pred eEEeeeehHHHhh---cChHHHHHHHHHHHHHHHHHHHHHHHHhccCc----ccccccccc--ccccccccccchhhhhH Q lcl|Aclame:pro 108 LGVILPVTKEFLN---YTYSQFFEEMKPMIAEAFYKKFDEAGILNQGN----NPFGKSIAQ--SIEKTNKVIKGDFTQDN 178 (324) Q Consensus 108 ~~~~~~iS~e~l~---ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~----~~~~~~~~~--~~~~~~~~~~~~~~~~~ 178 (324) +++.++||+|+++ ++.++++++|.+++++++++++|+++|+|+++ +..+.+... ..........+..++++ T Consensus 81 ~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (300) T protein:vir:95 81 VEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQTVPFKDTNPDES 160 (300) T ss_pred EEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccceeecccccchHHH Confidence 9999999999994 66799999999999999999999999999543 222222222 11122223345677899 Q ss_pred HHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc----CCcceeecceeEeecCCCC----CCceeEEeecc Q lcl|Aclame:pro 179 IIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVVNLKSSNL----KRGELITGDFD 250 (324) Q Consensus 179 i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~~~~~l~G~pv~~~~~~~~----~~~~~i~gd~s 250 (324) +.+++..+...++.+++|+|||+++.+|+++||++|+|+|.. +.+++|+|+||++++..+. ++..+++|||+ T Consensus 161 i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~GDf~ 240 (300) T protein:vir:95 161 MEDAVGMIDGSERDITGAILDPIFTTALSKMKNAEGGKLYPELAWGGVPDAINGLAVDKNRTVSYSQTDPKNTAIVGDFE 240 (300) T ss_pred HHHHHHHhhhcCCCccEEEECHHHHHHHHHhhccCCCeeccCccccCCCceecceeeEEecCCCCCCCCCccEEEEeecc Confidence 999999999999999999999999999999999999999853 4678999999998876643 23347889999 Q ss_pred cEE-EEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecC Q lcl|Aclame:pro 251 KLI-YGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) Q Consensus 251 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~ 315 (324) +++ ++.+++++++++++. +.++..+++|++|++++|+++|+||.+.||+||++|++++- T Consensus 241 ~~~~~~~~~~~~~~v~~~~------~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~g 300 (300) T protein:vir:95 241 TMFKWGYAKEVPMEIIKYG------DPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKTGG 300 (300) T ss_pred ceEEEEEecccEEEEeecc------CCCCcchhhhhcCcEEEEEEEeecceeecccceEEEecCCC Confidence 765 899999999998754 34567789999999999999999999999999999998765 No 25 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=100.00 E-value=4.7e-58 Score=334.89 Aligned_cols=299 Identities=16% Similarity=0.140 Sum_probs=243.3 Q ss_pred CchhH--HHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeC Q lcl|Aclame:pro 1 MEQTQ--KLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWAD 78 (324) Q Consensus 1 ~~~~~--~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~ 78 (324) .+..+ +..+..+.|.......++.++.+..++++||++||++++++|++.+++.++|+++|++++++++..++|+.++ T Consensus 102 ~~~~~~~~~~~~~~af~~~l~~~e~~~al~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~~~~~~ 181 (425) T protein:vir:10 102 ANGVKPLRDPEYTEAFKAHVKRGDVQAALNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQLCRVQPVSKAGFSKLFNMG 181 (425) T ss_pred cccccccccHHHHHHHHHHhhhhhhHHHhhcCcCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEcC Confidence 00000 0001111222211122334455666777889999999999999999999999999999999999999999999 Q ss_pred CcceeeeccCccccccc-cceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccc Q lcl|Aclame:pro 79 KPGAYWVGEGQKIETSK-ATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGK 157 (324) Q Consensus 79 ~~~a~~v~Eg~~~~~~~-~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~ 157 (324) .+.+.|++|++.+|+++ ++|+++++.++|++++++||+|+++|+.++++++|.++|+++++.++|+++|+|+|++. |. T Consensus 182 ~~~a~wv~E~~~~~~~~~~~f~~v~~~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l~G~G~~~-p~ 260 (425) T protein:vir:10 182 GTTSGWVGEASQRPQTNAATFQPLSFASGEIYANPAATQQILDDAEIDLESWLATEVQTEFAKQEGKAFLAGDGTNK-PN 260 (425) T ss_pred CcceeeeccccccccccccccceeeeeheeeEeehHhHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhhcccCCCC-cc Confidence 99999999999999876 79999999999999999999999999999999999999999999999999999999875 44 Q ss_pred cccccccc--------------ccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc--- Q lcl|Aclame:pro 158 SIAQSIEK--------------TNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD--- 220 (324) Q Consensus 158 ~~~~~~~~--------------~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~--- 220 (324) |+.+.... .....++.+++++|.+++..|...++.+++|+||++++.+|++++|.+|+|+|.+ T Consensus 261 Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~~~a~~vmn~~~~~~L~~lkD~~G~~l~~~~~~ 340 (425) T protein:vir:10 261 GLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPSAFTGNARFAMNRNTQRQVRKLKDGQGNYLWQPSYV 340 (425) T ss_pred eeeeccccccccccccccccccccccccccccHHHHHHHHhhhhhhhccCCEEEEchHHHHHHHHhhcCCCceeeccCcc Confidence 44332211 1122445678999999999999999999999999999999999999999999964 Q ss_pred -CCcceeecceeEeecCCCC---CCceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEE Q lcl|Aclame:pro 221 -RNSDSLDGLPVVNLKSSNL---KRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATM 295 (324) Q Consensus 221 -~~~~~l~G~pv~~~~~~~~---~~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~ 295 (324) +.+++|+|+||+++++++. ....++||||++ +.++++.++++..+. +|.+|++.||++. T Consensus 341 ~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~v~~d~----------------~~~~~~~~~~~~~ 404 (425) T protein:vir:10 341 AGQPATLAGYPVTEVPDMPDVAANSTPILFGDFQQTYLIIDRIGVRVLRDP----------------YTAKPYVLFYTTK 404 (425) T ss_pred CCCCceecceeeEEecCcCCccCCccEEEEEehhccEEEEEecceEEEecc----------------cccCCcEEEEEEE Confidence 4567999999999876653 334589999997 567888887765432 3678999999999 Q ss_pred EeccEEeccCceEEEEeecCC Q lcl|Aclame:pro 296 HVALHIADDKAFAKLVPADKR 316 (324) Q Consensus 296 r~d~~v~~~~A~~~l~~~~~~ 316 (324) |+|+++.||+||++|+.++.- T Consensus 405 r~d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 405 RVGGGLLNPEPMRAMKVAASE 425 (425) T ss_pred EeccEeecccceEEEEeeccC Confidence 999999999999999998877 No 26 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=100.00 E-value=8.7e-58 Score=333.41 Aligned_cols=278 Identities=19% Similarity=0.193 Sum_probs=238.2 Q ss_pred ccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccCccccccccceeeEEeeheeeEE Q lcl|Aclame:pro 31 MHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGV 110 (324) Q Consensus 31 ~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~ 110 (324) +.+++|.+||++++++|++.+++.++|++++++++++++..+||+.++.+.++|++|++++|+++++|++++++++|+++ T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~f~~v~l~~~k~a~ 80 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) T ss_pred CcccCcceechhHHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcceEEecCCccccccccceeEEEEeeeeEEE Confidence 66778999999999999999999999999999999998889999999999999999999999999999999999999999 Q ss_pred eeeehHHHhh---cChHHHHHHHHHHHHHHHHHHHHHHHHhccC----ccccccccccccc----cccccccchhhhhHH Q lcl|Aclame:pro 111 ILPVTKEFLN---YTYSQFFEEMKPMIAEAFYKKFDEAGILNQG----NNPFGKSIAQSIE----KTNKVIKGDFTQDNI 179 (324) Q Consensus 111 ~~~iS~e~l~---ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g----~~~~~~~~~~~~~----~~~~~~~~~~~~~~i 179 (324) .+++|+|+++ ++..+++++|.+++++++++++|+++++|.+ ......+...... .......+...++++ T Consensus 81 ~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 160 (298) T protein:vir:16 81 GARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAI 160 (298) T ss_pred eehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcccccccccccccccccccccccccccHHHHH Confidence 9999999995 5568999999999999999999999999953 2222222211111 111122234457789 Q ss_pred HHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc----CCcceeecceeEeecCCCC----CCceeEEeeccc Q lcl|Aclame:pro 180 IDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVVNLKSSNL----KRGELITGDFDK 251 (324) Q Consensus 180 ~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~~~~~l~G~pv~~~~~~~~----~~~~~i~gd~s~ 251 (324) .+++.++...++++++|+||++++..|+++||.+|+|+|.+ +.+++|+|+||+++...+. ++..+++|||++ T Consensus 161 ~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~v~~~~~~~~~~~~~GDfs~ 240 (298) T protein:vir:16 161 ENAVELLTGVDADVTGIAINPSFRSALAKQKDLQDNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFAN 240 (298) T ss_pred HHHHHHhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceecceeeEEecccccccCCCccEEEEeeccc Confidence 99999999999999999999999999999999999999864 4467999999998876543 345689999997 Q ss_pred E-EEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeec Q lcl|Aclame:pro 252 L-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPAD 314 (324) Q Consensus 252 ~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~ 314 (324) + .++.+++++++++++. ++++..+++|++|+++||+++|+||++.||+||++|++++ T Consensus 241 ~~~~~~~~~~~~~~~~~~------~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 241 GFKWGYAKEVPLEVIQYG------DPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred eEEEEEecCceEEEeecc------CCcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 6 4889999999988754 4556788999999999999999999999999999999988 No 27 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=100.00 E-value=9.5e-58 Score=333.19 Aligned_cols=281 Identities=17% Similarity=0.211 Sum_probs=241.3 Q ss_pred ccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccCccccccccceeeEEeeheee Q lcl|Aclame:pro 29 VMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKL 108 (324) Q Consensus 29 ~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~ 108 (324) ..+.+++|++||++++++|++.+++.|++++++++++++++..+||+.++.+.+.|++|++++|+++++|+++++++||+ T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~~v~l~~~kl 80 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEFTFTLDSDIDVVAENGKKTHGGLSLEPVTIVPIKV 80 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEecCcceEEeecCccccccccceeeEEeeeEEE Confidence 44556789999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeeeehHHHhh---cChHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccc----cc--cc-ccccccccccchhhhhH Q lcl|Aclame:pro 109 GVILPVTKEFLN---YTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGK----SI--AQ-SIEKTNKVIKGDFTQDN 178 (324) Q Consensus 109 ~~~~~iS~e~l~---ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~----~~--~~-~~~~~~~~~~~~~~~~~ 178 (324) ++.+++|+|+++ ++.++++++|.+++++++++++|+++++|+++..... +. .. .........++...+++ T Consensus 81 ~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (303) T protein:vir:97 81 EYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVTQVVKFTESEDADAN 160 (303) T ss_pred EEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccccccccccccccccccccccccchHHH Confidence 999999999994 6678999999999999999999999999975322221 11 11 11222233345677899 Q ss_pred HHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc-----CCcceeecceeEeecCCCC------CCceeEEe Q lcl|Aclame:pro 179 IIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD-----RNSDSLDGLPVVNLKSSNL------KRGELITG 247 (324) Q Consensus 179 i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~-----~~~~~l~G~pv~~~~~~~~------~~~~~i~g 247 (324) +.+++.++...++.++.|+|||+++.+|+++||++|+|++++ ..+++|+|+||+++..++. +...+++| T Consensus 161 i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~g~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~G 240 (303) T protein:vir:97 161 IEAAVNLIQGAEGVVTGLAMDTEFSTALAKVTNGEMGPKMYPELAWGANPDSINGLKSSVNTTVGAGADEAESKDLVIIG 240 (303) T ss_pred HHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCCeEEecCccCCCCCceecceeeEEecccCCccccCCCccEEEEe Confidence 999999999999999999999999999999999999999864 3456899999999876543 33458999 Q ss_pred eccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecC Q lcl|Aclame:pro 248 DFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) Q Consensus 248 d~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~ 315 (324) ||+. +.++.+++++++++++ .+.++.++++|++|+++||+++|+||+++||+||++|+.+.- T Consensus 241 df~~~~~~~~~~~~~~~~~~~------~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 241 DFESMFKWGYAKQIPMEIIKY------GDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKGEV 303 (303) T ss_pred eccccEEEEEecCcEEEEeec------cCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCCCC Confidence 9965 6799999999998864 456777889999999999999999999999999999998766 No 28 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=100.00 E-value=1.4e-57 Score=332.26 Aligned_cols=278 Identities=19% Similarity=0.203 Sum_probs=239.3 Q ss_pred ccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccCccccccccceeeEEeeheeeEE Q lcl|Aclame:pro 31 MHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGV 110 (324) Q Consensus 31 ~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~ 110 (324) ++.++|.+||++++++|++.+++.|++++++++++++++..+||+.++.+.++|++|++++|+++++|+++++.++|+++ T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~ 80 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceEEeeCCccccccccceeEEEEeeeEEEE Confidence 66678999999999999999999999999999999999889999999999999999999999999999999999999999 Q ss_pred eeeehHHHhh---cChHHHHHHHHHHHHHHHHHHHHHHHHhcc----Ccccccccccccccc----ccccccchhhhhHH Q lcl|Aclame:pro 111 ILPVTKEFLN---YTYSQFFEEMKPMIAEAFYKKFDEAGILNQ----GNNPFGKSIAQSIEK----TNKVIKGDFTQDNI 179 (324) Q Consensus 111 ~~~iS~e~l~---ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~----g~~~~~~~~~~~~~~----~~~~~~~~~~~~~i 179 (324) .++||+|+++ ++..+++++|.+++++++++++|.++++|. |+...+.+....... ......+...++++ T Consensus 81 ~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 160 (298) T protein:vir:94 81 GARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNGAI 160 (298) T ss_pred eeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccccccccccccccccccccHHHHH Confidence 9999999996 445789999999999999999999999984 333333332221111 11222344567899 Q ss_pred HHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc----CCcceeecceeEeecCCCC----CCceeEEeeccc Q lcl|Aclame:pro 180 IDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVVNLKSSNL----KRGELITGDFDK 251 (324) Q Consensus 180 ~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~~~~~l~G~pv~~~~~~~~----~~~~~i~gd~s~ 251 (324) .+++.++...+.++++|+||++++.+|+++||.+|+|+|.+ +.+++|+|+||++++..+. +...+++|||++ T Consensus 161 ~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~PV~~~~~v~~~~~~~~~~~~~Gdfs~ 240 (298) T protein:vir:94 161 ENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFAN 240 (298) T ss_pred HHHHHhhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceecceeeEEecccccccCCCccEEEEeeccc Confidence 99999999999999999999999999999999999999864 5567999999998876543 345689999998 Q ss_pred EE-EEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeec Q lcl|Aclame:pro 252 LI-YGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPAD 314 (324) Q Consensus 252 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~ 314 (324) ++ ++.+++++++++++. ++++.++++|++|++++|++.|+||.+.||+||++|++++ T Consensus 241 ~~~~~~~~~~~~~~~~~~------~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 241 GFKWGYAKEVPLEVIQYG------DPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred eEEEEEecCceEEEeecC------CCcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 64 899999999998754 3556788999999999999999999999999999999988 No 29 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=100.00 E-value=3.9e-57 Score=329.85 Aligned_cols=315 Identities=14% Similarity=0.121 Sum_probs=242.4 Q ss_pred CchhHHHHHHHHHHHhh--------hhhHH-----------hh----ccccccccccCccccchHHHHHHHHHHHhhhhh Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASN--------NVKPQ-----------VF----NPDNVMMHEKKDGTLMNEFTTPILQEVMENSKI 57 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~--------~~~~~-----------~~----~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l 57 (324) +.+...+.+.++..+.. ..+++ .. ++.+..+...|+.++|+++.++|++.+++.+++ T Consensus 289 ~~kg~~f~~~~~al~~~~g~~~~a~e~a~~~~~~~~~~~~~~~~a~~~~~~~~~~~~Gg~~vp~~~~~~ii~~l~~~svv 368 (645) T protein:vir:93 289 LDKGIGFARFAKSLAAAKGVRSEALEVARRQYPDDSRLHHVLKSAVGAGTTTDPQWAGSLSEYQEYAQDFIDYLRPQTII 368 (645) T ss_pred hhhhhhHHHHHHHHHhcccchhHHHHHHHhhcccchhhhhhhhhhhhccccccccccCCccCchhhHHHHHHhhhhhhhH Confidence 22222222222221110 00000 00 111112223366788999999999999999999 Q ss_pred hhhcceeecC----CCceEEEEEeCCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHH Q lcl|Aclame:pro 58 MQLGKYEPME----GTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPM 133 (324) Q Consensus 58 ~~l~~~~~~~----~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~ 133 (324) ++++..+..+ .+.+++|+.++++.++|++|++.+|+++++|++++++++|+++++++|+|+++||.++++++|.++ T Consensus 369 ~~l~~~~~~~~~~~~~~~~ip~~t~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~kla~~~~iS~ell~ds~~~~~~~i~~~ 448 (645) T protein:vir:93 369 GRFGQGGIPALRQVPFNIRVHAQVSGGAAGWVGEGKTKPLTKFDFESITFSHAKVSAIAVLTEELIRFSSPAADALVRNA 448 (645) T ss_pred HhhccccccccccccCceeeeeeecCcceEEeccCccccccccceeEEEEeeEEEEEeehhHHHHHhhchHHHHHHHHHH Confidence 9997654222 245789999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhccCccc---cccccccccccccccccchhhhhHHHHHHHHhhhhcC--CCcEEEEcHHHHHHHHH Q lcl|Aclame:pro 134 IAEAFYKKFDEAGILNQGNNP---FGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDEL--EANAFISKTQNRSLLRK 208 (324) Q Consensus 134 l~~ai~~~~d~~~l~G~g~~~---~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~--~~~~~v~~~~~~~~l~~ 208 (324) |+++++.++|+++|+|+|++. .+.++.+... ...++..+..|+.+++.++..++. ..++|+|||.++.+|++ T Consensus 449 l~~aia~~~d~a~l~g~g~~~~~~~p~gi~~~~~---~~~~~~~~~~d~~~~~~~~~~a~~~~~~a~~vmn~~~~~~L~~ 525 (645) T protein:vir:93 449 LAEAVVARLDTDFVDPKKAAVADVSPASITHDVK---GTASSGNPDADAEAAFGQFVAANLQPTGAVWLMSSTNALALSM 525 (645) T ss_pred HHHHHHHHHHHHhhcCCCcccCCccccceecccc---ccccccchHHHHHHHHHHHHhcCCCccccEEEEcHHHHHHHHh Confidence 999999999999999987643 2344433222 222333456788888888876644 35689999999999999 Q ss_pred hhccCCceeec--cCCcceeecceeEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccc--------cc Q lcl|Aclame:pro 209 IVDPETKERIY--DRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNE--------DG 278 (324) Q Consensus 209 ~~d~~g~~~~~--~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~ 278 (324) +||++|++++. ...+++|+|+||+++..++ + .+++|||++++++.++++.+.+++++++.....+ .+ T Consensus 526 lkd~~G~~~~~~~~~~~~tL~G~PV~~s~~vp--~-~~~~gd~s~~~ig~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~~ 602 (645) T protein:vir:93 526 RKNALGQKEYPDMTLLGGSFQGLPVIVSQYVG--D-QLVLVNAPDIYLADDGGVAVDMSREASLEMQSEPTGDSTTPSPV 602 (645) T ss_pred ccccCCceeecCCCCCCceeeceeeEEeccCC--c-ceeEeccccEEEEEecceEEEeecceeEEEeecccccccccccc Confidence 99999999874 2346799999999987654 2 5789999999999999999999999988655433 33 Q ss_pred cchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCCC Q lcl|Aclame:pro 279 TPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVP 321 (324) Q Consensus 279 ~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~~ 321 (324) ..+++|++|+++||+++|+||++.||+||++|+++.|+...-- T Consensus 603 ~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt~~~~g~~~~~ 645 (645) T protein:vir:93 603 ELVSMFQTGSVAIRAERWINWRRRRTAAVAVITGVNYGSASGG 645 (645) T ss_pred cchhHhhcCceEEEEEEEEcceeeCccceEEEecccCCcccCC Confidence 5789999999999999999999999999999999999854332 No 30 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=100.00 E-value=3.9e-57 Score=329.81 Aligned_cols=289 Identities=20% Similarity=0.241 Sum_probs=240.2 Q ss_pred ccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccCcc-----ccccccceeeE Q lcl|Aclame:pro 27 DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQK-----IETSKATWVNA 101 (324) Q Consensus 27 ~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~-----~~~~~~~~~~v 101 (324) ...++++++|++||++++++|++.+++.++|++++++++++++.++||+.++.+.+.|++|++. +|.++++|+++ T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~~s~~~f~~i 80 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEEEEEeCCcceEEeecccccccccccccccceeeE Confidence 4556777899999999999999999999999999999999999999999999999999999986 45678999999 Q ss_pred EeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccc--cccccc---ccccccchhhh Q lcl|Aclame:pro 102 TMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSI--AQSIEK---TNKVIKGDFTQ 176 (324) Q Consensus 102 ~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~--~~~~~~---~~~~~~~~~~~ 176 (324) +++++|++++++||+|+++||.++++++|.++|++++++++|+++|+|+|+....... ...... ......+.... T Consensus 81 ~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (305) T protein:vir:25 81 TLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVANE 160 (305) T ss_pred EeeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheeccCCCCCccccccccccccccccccccccchhh Confidence 9999999999999999999999999999999999999999999999999864332211 111111 11111122223 Q ss_pred ----hHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeeccCCcceeecceeEeecCCCC--CCceeEEeecc Q lcl|Aclame:pro 177 ----DNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNL--KRGELITGDFD 250 (324) Q Consensus 177 ----~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~~~~~l~G~pv~~~~~~~~--~~~~~i~gd~s 250 (324) +++.++...+...+...+.|+||+.++..|+++||++|+|+|.+ ++++|+||+++...+. +++.+++|||+ T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~---~~l~G~Pv~~~~~~~~~~~~~~~~~gd~s 237 (305) T protein:vir:25 161 SDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRD---DSFAGFRTFFNRNGAWDADAAIEVIADSS 237 (305) T ss_pred hHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHHhhccCCceeecC---CcccccceEEcCccCCCCCccEEEEEecc Confidence 33445555555666677789999999999999999999999965 4799999998876543 45688999999 Q ss_pred cEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCCCCC Q lcl|Aclame:pro 251 KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGE 323 (324) Q Consensus 251 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~~~~ 323 (324) +++++.+++++++++++.++.. ++..+++|++|++++|++.|+||.+.||+||++++..+++ .++|+. T Consensus 238 ~~~i~~~~~~~i~~~~~~~~~~----~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~~-~~~pa~ 305 (305) T protein:vir:25 238 RVKIGVRQDITVKFLDQATLGT----GENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVA-VVAPAA 305 (305) T ss_pred eEEEEEecCeEEEEeeeeeeec----CCceeeeeecCcEEEEEEEeecceeeCcccEEEEcccccc-ccCCCC Confidence 9999999999999999987654 3457889999999999999999999999999999998876 444444 No 31 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=100.00 E-value=9.7e-57 Score=327.68 Aligned_cols=303 Identities=13% Similarity=0.084 Sum_probs=247.7 Q ss_pred CchhHHHHHH---HHHHHh---hhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEE Q lcl|Aclame:pro 1 MEQTQKLKLN---LQHFAS---NNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFT 74 (324) Q Consensus 1 ~~~~~~~k~~---~~~~a~---~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip 74 (324) .....+++.+ ++.... .............++++++|++||++++++|++.+++.++|++++++++++++.+++| T Consensus 103 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~ 182 (418) T protein:vir:10 103 VTESEEMKGMDGSARKSVRVRVDRKSIMNVPATVGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTSSSSIEYT 182 (418) T ss_pred hhhHHHHHHHHHHHhhhhhhhhHHHHHHHhhhhccCCCCCCccccchhHHHHHHHHHhhhhhHHhhcceeeccCCceeEE Confidence 1111111111 111000 0000011122223445568889999999999999999999999999999999889999 Q ss_pred EEeC-CcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCcc Q lcl|Aclame:pro 75 FWAD-KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNN 153 (324) Q Consensus 75 ~~~~-~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~ 153 (324) +.+. .+.+.|++|++++|+++++|+++++.++|++++++||+|+++++ .+++++|.++++++++.++|.++|+|+|++ T Consensus 183 ~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds-~~l~~~i~~~l~~a~~~~~d~a~l~G~g~~ 261 (418) T protein:vir:10 183 VETGFTNNAAAVAEGAQKPTSDLKFNLKNQPVRTIAHLFKASRQILDDA-PALQSYIDGRARYGLQLTEEGQILKGDGTG 261 (418) T ss_pred EEecCCCceeeeccCccccccccceeeEEEeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHhccCCCC Confidence 9866 67899999999999999999999999999999999999999987 589999999999999999999999999998 Q ss_pred ccccccccccccccc--cccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc---CCcceeec Q lcl|Aclame:pro 154 PFGKSIAQSIEKTNK--VIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD---RNSDSLDG 228 (324) Q Consensus 154 ~~~~~~~~~~~~~~~--~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~---~~~~~l~G 228 (324) ..+.++.+....... ..++..+++++.+++..+...++.+++|+|||+++..|++++|.+|+|+|.. +.+++|+| T Consensus 262 ~~p~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~l~G 341 (418) T protein:vir:10 262 ANILGILPQASAFMPSITLANATPIDKIRLALLQAVLAEFPATGIVLNPIDWASIELTKDSQGRYIVGNPVNGTTPRLWN 341 (418) T ss_pred ccccccccccccccccccccccccHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccccccCCCceecc Confidence 877777665443322 3345567899999999999999999999999999999999999999999853 45779999 Q ss_pred ceeEeecCCCCCCceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCce Q lcl|Aclame:pro 229 LPVVNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAF 307 (324) Q Consensus 229 ~pv~~~~~~~~~~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~ 307 (324) +||+++.. ++.+++++|||++ +.++++++++++++++.. .+|++|++.||++.|+||.+.+|+|| T Consensus 342 ~pV~~~~~--~p~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~------------~~f~~~~~~~r~~~~~d~~~~~~~a~ 407 (418) T protein:vir:10 342 LPVVETQA--MTANEFLVGAFSMAAQIFDRMEIEVLLSTENV------------DDFEKNMVSIRAEERLALAVYRPESF 407 (418) T ss_pred eeeEEcCC--CCCCcEEEeeccceEEEEEecceEEEEecccc------------hhhhcCceEEEEEEeeccEEecccce Confidence 99998765 5678899999997 567889999999876532 46999999999999999999999999 Q ss_pred EEEEeecCCCC Q lcl|Aclame:pro 308 AKLVPADKRTD 318 (324) Q Consensus 308 ~~l~~~~~~~~ 318 (324) ++++.+++.++ T Consensus 408 ~~~~~~~~~~g 418 (418) T protein:vir:10 408 VTGALVEQAGG 418 (418) T ss_pred EEEEeccCCCC Confidence 99999988877 No 32 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=100.00 E-value=7.1e-57 Score=328.43 Aligned_cols=281 Identities=19% Similarity=0.182 Sum_probs=236.5 Q ss_pred ccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccCccccccccceeeEEeeheee Q lcl|Aclame:pro 29 VMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKL 108 (324) Q Consensus 29 ~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~ 108 (324) .-+.++||++||+++.++|++.+++.|++++++++++++++..++|+.++.+.++|++|++++|+++++|+++++.++|+ T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v~l~~~kl 80 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPVTAIPRKV 80 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhcceeecCCCceEEEEEeCCceeEEeecCcccccccceeeEEEEeeEEE Confidence 34455688999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeeeehHHHhh---cChHHHHHHHHHHHHHHHHHHHHHHHHhccCc--ccccccccccc----cccccccc-chhhhhH Q lcl|Aclame:pro 109 GVILPVTKEFLN---YTYSQFFEEMKPMIAEAFYKKFDEAGILNQGN--NPFGKSIAQSI----EKTNKVIK-GDFTQDN 178 (324) Q Consensus 109 ~~~~~iS~e~l~---ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~--~~~~~~~~~~~----~~~~~~~~-~~~~~~~ 178 (324) +++++||+|+++ ++..+++++|.+++++++++++|+++++|+++ +..+.++.... ........ ....+.+ T Consensus 81 ~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~~~~~~~~~~~ 160 (311) T protein:vir:81 81 QVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPDLA 160 (311) T ss_pred EEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeeeecccccchHHHH Confidence 999999999996 55678999999999999999999999999753 32333332221 11111112 2233566 Q ss_pred HHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc----CCcceeecceeEeecCCC---------------- Q lcl|Aclame:pro 179 IIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVVNLKSSN---------------- 238 (324) Q Consensus 179 i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~~~~~l~G~pv~~~~~~~---------------- 238 (324) +.+++.++.....++++|+||+.++.+|+++||.+|+|+|.. +.+++|+|+||+++..++ T Consensus 161 i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~Pv~~~~~i~~~~~~~~~~~~~~~~~ 240 (311) T protein:vir:81 161 VEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTT 240 (311) T ss_pred HHHHHHHhhhcCCCceEEEEcHHHHHHHHhhhccCCCeeecCccccCCCceecceeEEecccccccccccccccchhccc Confidence 778888888878888899999999999999999999999864 457899999999865443 Q ss_pred CCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCC Q lcl|Aclame:pro 239 LKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKR 316 (324) Q Consensus 239 ~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~ 316 (324) ..+..+++|||++++++.+++++++++++.. . ...+++|++|+++||++.|+||++.||+||++|++++.. T Consensus 241 ~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~------~-~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a~~~ 311 (311) T protein:vir:81 241 NPNVKAIAGDFSAFRWGVQVSIPLELIEFGD------P-DGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) T ss_pred CCccEEEEEecccEEEEEeccceEEEeccCC------C-CcchhhhhcCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 2345679999999999999999999987652 2 235688999999999999999999999999999998887 No 33 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=100.00 E-value=1.6e-56 Score=326.53 Aligned_cols=302 Identities=19% Similarity=0.279 Sum_probs=242.3 Q ss_pred HHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccC-- Q lcl|Aclame:pro 11 LQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEG-- 88 (324) Q Consensus 11 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg-- 88 (324) |......+........++.. .+.++++||+++.++|++.+++.+++++++++++++++.+++|+.++.+.+.|++|+ T Consensus 1 ~a~l~el~~~~~~~~~~g~~-~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~eg~~ 79 (333) T protein:vir:78 1 MATLNELLPNSAGSNHQGRL-AHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISYGETIIPTTVKRPEVGQVGVGTS 79 (333) T ss_pred CchhHHhhhhcccccccCce-ecCCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCceeEeecCccc Confidence 11111100011111112211 223455999999999999999999999999999999999999999999988888776 Q ss_pred ------ccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccc--cccc Q lcl|Aclame:pro 89 ------QKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFG--KSIA 160 (324) Q Consensus 89 ------~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~--~~~~ 160 (324) +.+|+++++|+++++.++|++++++||+|+++|+.++++++|+++|++++++++|+++|+|+|...+. .++. T Consensus 80 ~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~g~~ 159 (333) T protein:vir:78 80 NEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQGID 159 (333) T ss_pred ccccccccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCccccccc Confidence 56788999999999999999999999999999999999999999999999999999999999875432 2222 Q ss_pred c------ccccccccccchhhhhHHHHHHHHhhhh-cCCCcEEEEcHHHHHHHHH---hhccCCceeecc----CCccee Q lcl|Aclame:pro 161 Q------SIEKTNKVIKGDFTQDNIIDLEALLEDD-ELEANAFISKTQNRSLLRK---IVDPETKERIYD----RNSDSL 226 (324) Q Consensus 161 ~------~~~~~~~~~~~~~~~~~i~~~~~~l~~~-~~~~~~~v~~~~~~~~l~~---~~d~~g~~~~~~----~~~~~l 226 (324) + ..........+..+++++.+++..+... ++..+.|+|||.++..|++ ++|.+|+|++.. +.+++| T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~~~~~~~~~~~~l 239 (333) T protein:vir:78 160 TDNVIANTTNVDYLQETGDPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRDANGNVDPSRINLAAQTGDV 239 (333) T ss_pred ccccccccccccccccccchhHHHHHHHHHhhccccccCceEEEEcchHHHHHHHHhhhcCCCCceeecCccccCCCcee Confidence 2 1112222344566789999999988765 4556789999999987765 678999999864 456799 Q ss_pred ecceeEeecCCCCC-------CceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEecc Q lcl|Aclame:pro 227 DGLPVVNLKSSNLK-------RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVAL 299 (324) Q Consensus 227 ~G~pv~~~~~~~~~-------~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~ 299 (324) +|+||+++...+.. +..+++|||++++++++++++++++++++ ..+.++.++++|++|++.||+++|+|| T Consensus 240 ~G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~~~~g~~~~~~i~~~~~~~---~~~~~~~~~~~~~~~~v~~r~~~r~d~ 316 (333) T protein:vir:78 240 LGLPAQFGRAVGGDLGAAVDSKTRIIGGDFSQLKFGFADEIRIKMSDTAT---LTDSGSATVSMWQTNQIAILIEVTFGW 316 (333) T ss_pred eceeeEEccccCCCccccCCCccEEEEEecccEEEEEeeccEEEEecccc---ccccccceeehhhcCcEEEEEEEEEcc Confidence 99999987665432 44789999999999999999999999875 356667888999999999999999999 Q ss_pred EEeccCceEEEEeecCC Q lcl|Aclame:pro 300 HIADDKAFAKLVPADKR 316 (324) Q Consensus 300 ~v~~~~A~~~l~~~~~~ 316 (324) .+.||+||++|+.++++ T Consensus 317 ~v~~~~a~~~l~~~~a~ 333 (333) T protein:vir:78 317 LLGDKQAFVKFVDDEQP 333 (333) T ss_pred EEecccceEEEeccCCC Confidence 99999999999999988 No 34 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=100.00 E-value=2.4e-56 Score=325.52 Aligned_cols=307 Identities=20% Similarity=0.267 Sum_probs=245.5 Q ss_pred HHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcc------ Q lcl|Aclame:pro 8 KLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPG------ 81 (324) Q Consensus 8 k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~------ 81 (324) ...++.+...... ....+ ..++.++++||++++++|++.+++.++|+++|++++++++.++||+.+..+. T Consensus 1 ~~~~~e~~~~~~~---~~~~~-~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~~ 76 (338) T protein:vir:78 1 MATLNELAPNTAG---SNHQG-RLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPISYGETIIPTTVKRPEVGQVGV 76 (338) T ss_pred CcchHHhhhhhcc---ccccc-ceecccccccchHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCccceeecc Confidence 1111111110000 01111 1222456699999999999999999999999999999999999999876543 Q ss_pred --eeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccc--ccc Q lcl|Aclame:pro 82 --AYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP--FGK 157 (324) Q Consensus 82 --a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~--~~~ 157 (324) +.|++|++++|+++++|++++++++|++++++||+|+++|+.++++++|.++|++++++++|+++|+|+|++. .+. T Consensus 77 ~~~~~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~~~ 156 (338) T protein:vir:78 77 GTSNEQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGSALQ 156 (338) T ss_pred cccccccccccccccccceeEEEEEEEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccc Confidence 5677899999999999999999999999999999999999999999999999999999999999999998643 233 Q ss_pred cccccccc------ccccccchhhhhHHHHHHHHhhh-hcCCCcEEEEcHHHHHHHH---HhhccCCceeecc----CCc Q lcl|Aclame:pro 158 SIAQSIEK------TNKVIKGDFTQDNIIDLEALLED-DELEANAFISKTQNRSLLR---KIVDPETKERIYD----RNS 223 (324) Q Consensus 158 ~~~~~~~~------~~~~~~~~~~~~~i~~~~~~l~~-~~~~~~~~v~~~~~~~~l~---~~~d~~g~~~~~~----~~~ 223 (324) ++...... ..........++++.++..++.. .....++|+||+.++..|+ +++|.+|+|+|.+ +.+ T Consensus 157 gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~~~~~~~~~ 236 (338) T protein:vir:78 157 GIDTNNVIVNTTNVDYLQTGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRDANGNVDPTRINLAASA 236 (338) T ss_pred ccccccccccccccccccccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhccCCCceeecccccCCCC Confidence 33222111 11112234567888888888864 3456678999999988774 5679999999853 456 Q ss_pred ceeecceeEeecCCCC-------CCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEE Q lcl|Aclame:pro 224 DSLDGLPVVNLKSSNL-------KRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMH 296 (324) Q Consensus 224 ~~l~G~pv~~~~~~~~-------~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r 296 (324) ++|+|+||+++..++. .+..+++|||+.++++++++++++++++.++....++.+..+++|++|++++|++.| T Consensus 237 ~~l~G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r 316 (338) T protein:vir:78 237 GDLLGLPVQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRVKMSDTATLTDNTSPTPQTVSMWQTNQIAILIEVT 316 (338) T ss_pred ceeeeeeEEEccccCccccccCCcccEEEEEecceEEEEeecccEEEEeecccccccccccccchhhhhcCcEEEEEEEE Confidence 7999999998765542 345689999999999999999999999999999999999999999999999999999 Q ss_pred eccEEeccCceEEEEeecCCCC Q lcl|Aclame:pro 297 VALHIADDKAFAKLVPADKRTD 318 (324) Q Consensus 297 ~d~~v~~~~A~~~l~~~~~~~~ 318 (324) +||+++||+||++|+.++++.- T Consensus 317 ~d~~v~~~~a~~~l~~~~~~~~ 338 (338) T protein:vir:78 317 FGWLLGDKQAFVKFVDDEDPDA 338 (338) T ss_pred eccEeecccceEEEecccCCCC Confidence 9999999999999999777654 No 35 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=100.00 E-value=5.9e-56 Score=323.37 Aligned_cols=303 Identities=15% Similarity=0.139 Sum_probs=232.9 Q ss_pred Cchh---------HHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCce Q lcl|Aclame:pro 1 MEQT---------QKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEK 71 (324) Q Consensus 1 ~~~~---------~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~ 71 (324) ++.+ .+..+....+........+.+....++++++|++||+++..+|++.+++.++|++++++++++++.+ T Consensus 116 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~ 195 (497) T protein:vir:78 116 VSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNL 195 (497) T ss_pred hhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCCce Confidence 0000 0111112222222222223344445666788999999999999999999999999999999999999 Q ss_pred EEEEEeC-CcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 72 KFTFWAD-KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ 150 (324) Q Consensus 72 ~ip~~~~-~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~ 150 (324) +||+.++ .+.+.|++|++.+|+++++|+++++.+||++++++||+|+++|+ ++++++|.++|+++|+.++|.++|+|+ T Consensus 196 ~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~-~~l~~~i~~~l~~~i~~~~d~~~l~G~ 274 (497) T protein:vir:78 196 SYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGG 274 (497) T ss_pred EEEEEcCCCCcceeeccCcccccccccceeeEeeeeeeEeecHhHHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 9999876 46899999999999999999999999999999999999999987 579999999999999999999999999 Q ss_pred Cccccccccccccccccccc--------------------------------------------------------cchh Q lcl|Aclame:pro 151 GNNPFGKSIAQSIEKTNKVI--------------------------------------------------------KGDF 174 (324) Q Consensus 151 g~~~~~~~~~~~~~~~~~~~--------------------------------------------------------~~~~ 174 (324) |++. +.++........... .... T Consensus 275 G~~~-p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 353 (497) T protein:vir:78 275 GYPG-VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAE 353 (497) T ss_pred Cccc-ccccccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhh Confidence 9875 444332211110000 0011 Q ss_pred hhhHHHHHHHHhhhh-cCCCcEEEEcHHHHHHHHHhhccCCceeeccC----------CcceeecceeEeecCCCCCCce Q lcl|Aclame:pro 175 TQDNIIDLEALLEDD-ELEANAFISKTQNRSLLRKIVDPETKERIYDR----------NSDSLDGLPVVNLKSSNLKRGE 243 (324) Q Consensus 175 ~~~~i~~~~~~l~~~-~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~----------~~~~l~G~pv~~~~~~~~~~~~ 243 (324) ..+++..++..+... ++.+++|+||+.+|..|+++||.+|+|+|.+. ..++|+|+||++++++ +.++ T Consensus 354 ~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~--~~~~ 431 (497) T protein:vir:78 354 IAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLI--PLGT 431 (497) T ss_pred hhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCC--CCCc Confidence 122334444444433 45567899999999999999999999999643 2348999999998765 5678 Q ss_pred eEEeeccc--EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCC Q lcl|Aclame:pro 244 LITGDFDK--LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDS 319 (324) Q Consensus 244 ~i~gd~s~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~ 319 (324) +++|||+. +.++++++++++++++.. .+|++|+++||++.|+||.|.+|+||++|+.++..... T Consensus 432 ~~~Gd~~~~~~~i~~r~~~~v~~~~~~~------------~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~~~~ 497 (497) T protein:vir:78 432 ILVGHFAPSVIQTARREGVTMQMTNSNG------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) T ss_pred eEEeecccceEEEEEecccEEEeecccc------------hhhhcCcEEEEEEEeecceeeccccEEEEEecCCccCC Confidence 99999986 346789999999876532 35999999999999999999999999999998888766 No 36 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=100.00 E-value=5.9e-56 Score=323.37 Aligned_cols=303 Identities=15% Similarity=0.139 Sum_probs=232.9 Q ss_pred Cchh---------HHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCce Q lcl|Aclame:pro 1 MEQT---------QKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEK 71 (324) Q Consensus 1 ~~~~---------~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~ 71 (324) ++.+ .+..+....+........+.+....++++++|++||+++..+|++.+++.++|++++++++++++.+ T Consensus 116 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~ 195 (497) T protein:vir:10 116 VSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNL 195 (497) T ss_pred hhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCCce Confidence 0000 0111112222222222223344445666788999999999999999999999999999999999999 Q ss_pred EEEEEeC-CcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 72 KFTFWAD-KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ 150 (324) Q Consensus 72 ~ip~~~~-~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~ 150 (324) +||+.++ .+.+.|++|++.+|+++++|+++++.+||++++++||+|+++|+ ++++++|.++|+++|+.++|.++|+|+ T Consensus 196 ~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~-~~l~~~i~~~l~~~i~~~~d~~~l~G~ 274 (497) T protein:vir:10 196 SYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGG 274 (497) T ss_pred EEEEEcCCCCcceeeccCcccccccccceeeEeeeeeeEeecHhHHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 9999876 46899999999999999999999999999999999999999987 579999999999999999999999999 Q ss_pred Cccccccccccccccccccc--------------------------------------------------------cchh Q lcl|Aclame:pro 151 GNNPFGKSIAQSIEKTNKVI--------------------------------------------------------KGDF 174 (324) Q Consensus 151 g~~~~~~~~~~~~~~~~~~~--------------------------------------------------------~~~~ 174 (324) |++. +.++........... .... T Consensus 275 G~~~-p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 353 (497) T protein:vir:10 275 GYPG-VNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAE 353 (497) T ss_pred Cccc-ccccccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhh Confidence 9875 444332211110000 0011 Q ss_pred hhhHHHHHHHHhhhh-cCCCcEEEEcHHHHHHHHHhhccCCceeeccC----------CcceeecceeEeecCCCCCCce Q lcl|Aclame:pro 175 TQDNIIDLEALLEDD-ELEANAFISKTQNRSLLRKIVDPETKERIYDR----------NSDSLDGLPVVNLKSSNLKRGE 243 (324) Q Consensus 175 ~~~~i~~~~~~l~~~-~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~----------~~~~l~G~pv~~~~~~~~~~~~ 243 (324) ..+++..++..+... ++.+++|+||+.+|..|+++||.+|+|+|.+. ..++|+|+||++++++ +.++ T Consensus 354 ~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~--~~~~ 431 (497) T protein:vir:10 354 IAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLI--PLGT 431 (497) T ss_pred hhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCC--CCCc Confidence 122334444444433 45567899999999999999999999999643 2348999999998765 5678 Q ss_pred eEEeeccc--EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCC Q lcl|Aclame:pro 244 LITGDFDK--LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDS 319 (324) Q Consensus 244 ~i~gd~s~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~ 319 (324) +++|||+. +.++++++++++++++.. .+|++|+++||++.|+||.|.+|+||++|+.++..... T Consensus 432 ~~~Gd~~~~~~~i~~r~~~~v~~~~~~~------------~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~~~~ 497 (497) T protein:vir:10 432 ILVGHFAPSVIQTARREGVTMQMTNSNG------------TDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) T ss_pred eEEeecccceEEEEEecccEEEeecccc------------hhhhcCcEEEEEEEeecceeeccccEEEEEecCCccCC Confidence 99999986 346789999999876532 35999999999999999999999999999998888766 No 37 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=100.00 E-value=4.8e-56 Score=323.85 Aligned_cols=300 Identities=13% Similarity=0.103 Sum_probs=237.1 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhc-cccccccccCccccchHHHHHHHHHHHh-hhhhhhhcceeecCC-CceEEEEEe Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFN-PDNVMMHEKKDGTLMNEFTTPILQEVME-NSKIMQLGKYEPMEG-TEKKFTFWA 77 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~-~~~~~~~~~~~~~vp~~~~~~i~~~~~~-~s~l~~l~~~~~~~~-~~~~ip~~~ 77 (324) -......+.++++.........+.. .....+++.+|+++|+++..+++..+.. .++++++++++++++ +.+.+|+.+ T Consensus 83 ~~~~~~~~~~~r~g~~~~~~~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 162 (392) T protein:vir:13 83 RSADHDDDAVLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTVIT 162 (392) T ss_pred hhhhHHHHHHHhccchhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEc Confidence 0001111222222211111111111 1122344556678888888887766555 556777888888754 458899999 Q ss_pred CCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccc Q lcl|Aclame:pro 78 DKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGK 157 (324) Q Consensus 78 ~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~ 157 (324) +.+.++|++|++++|+++++|+++++.++|++++++||+|+++|+.++++++|.++|+++++.++|.++|+|+|++.+.+ T Consensus 163 ~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt~~p~G 242 (392) T protein:vir:13 163 GRATAGIVGETAEIPESYPATTQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFLTGTGTGQPRG 242 (392) T ss_pred CCcceeeecccccccccccceeeEEeeeeeEEeeehhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhcccCCccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999875443 Q ss_pred cccccccc---ccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc----CCcceeecce Q lcl|Aclame:pro 158 SIAQSIEK---TNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLP 230 (324) Q Consensus 158 ~~~~~~~~---~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~~~~~l~G~p 230 (324) .+...... .....++..+++++.+++..|...++.+++|+||++++..|++++|++|+|+|.+ +.+++|+|+| T Consensus 243 il~~~~~~~~~~~~~~~~~~~~d~l~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~g~~~~l~G~P 322 (392) T protein:vir:13 243 ILTDATGANAAFGEADADSKVSDALIDLFHEVPSAYRKNAKFVVNDLRAAQMRKLKDANGQYLWQSALTVGAPDTFNGKV 322 (392) T ss_pred cccccccccccccccccccccHHHHHHHHHhhhhhhhcCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCCceeccee Confidence 33322211 2223346678999999999999999999999999999999999999999999864 4457899999 Q ss_pred eEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEE Q lcl|Aclame:pro 231 VVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKL 310 (324) Q Consensus 231 v~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l 310 (324) |+++.. +++++++||||++++++.+++++++.+.+. +|.+|++.||++.|+|+++.||+||+.+ T Consensus 323 v~~~~~--~~~~~i~~Gdf~~~~i~~~~~~~i~~~~~~--------------~~~~~~~~~r~~~r~d~~~~~~~A~~~~ 386 (392) T protein:vir:13 323 VETDDG--MPADKVLFADLSKYRVRFAGSLRVDRSVDA--------------KFSTDQIVYRFLQRADGLLVDARGAKVL 386 (392) T ss_pred eEEcCC--CCCCcEEEeeccceeEEeecceEEEeeccc--------------cccCCcEEEEEEEEeccEEecccceEEE Confidence 998765 457789999999999999999999877543 4899999999999999999999999999 Q ss_pred EeecCC Q lcl|Aclame:pro 311 VPADKR 316 (324) Q Consensus 311 ~~~~~~ 316 (324) +.+++. T Consensus 387 ~~~~aa 392 (392) T protein:vir:13 387 TVTPAA 392 (392) T ss_pred EeeccC Confidence 998887 No 38 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=100.00 E-value=7.6e-56 Score=322.77 Aligned_cols=301 Identities=13% Similarity=0.086 Sum_probs=248.2 Q ss_pred CchhHHHHHHHHH-HHhh--hhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe Q lcl|Aclame:pro 1 MEQTQKLKLNLQH-FASN--NVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWA 77 (324) Q Consensus 1 ~~~~~~~k~~~~~-~a~~--~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~ 77 (324) +...++.....++ +... .....+.+....++++.+|++||++++..|++.+++.++|++++++++++++.+++|+.+ T Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 154 (385) T protein:vir:18 75 KSFSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVREE 154 (385) T ss_pred hhhHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEEEEEe Confidence 1111111111111 1111 111112232333455667889999999999999999999999999999999889999987 Q ss_pred C-CcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccc Q lcl|Aclame:pro 78 D-KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFG 156 (324) Q Consensus 78 ~-~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~ 156 (324) . .+.+.|++|++.+|+++++|+++++.++|++++++||+|+++|+ ++++++|.++|+++++.++|+++|+|+|++.++ T Consensus 155 ~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~-~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~ 233 (385) T protein:vir:18 155 VFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQVMDDA-PMLQSYINNRLMYGLALKEEGQLLNGDGTGDNL 233 (385) T ss_pred cCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcc Confidence 5 57889999999999999999999999999999999999999986 579999999999999999999999999999888 Q ss_pred ccccccccccc--ccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc---CCcceeeccee Q lcl|Aclame:pro 157 KSIAQSIEKTN--KVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD---RNSDSLDGLPV 231 (324) Q Consensus 157 ~~~~~~~~~~~--~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~---~~~~~l~G~pv 231 (324) .++........ ...++..++++|.+++.++...++.+++|+|||+++..|++++|++|+|++.+ +.+++|+|+|| T Consensus 234 ~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~l~G~pV 313 (385) T protein:vir:18 234 EGLNKVATAYDTSLNATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIALLKDNEGRYIFGGPQAFTSNIMWGLPV 313 (385) T ss_pred cccccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccCcccCCCceecceee Confidence 88766544332 23345678899999999999999999999999999999999999999999864 56789999999 Q ss_pred EeecCCCCCCceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEE Q lcl|Aclame:pro 232 VNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKL 310 (324) Q Consensus 232 ~~~~~~~~~~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l 310 (324) ++++. ++++++++|||+. +.++.+++++++++++.. .+|++|++.||++.|+||++.+|+||+++ T Consensus 314 ~~~~~--~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~------------~~~~~~~~~~~~~~r~~~~v~~~~a~~~~ 379 (385) T protein:vir:18 314 VPTKA--QAAGTFTVGGFDMASQVWDRMDATVEVSREDR------------DNFVKNMLTILCEERLALAHYRPTAIIKG 379 (385) T ss_pred EEcCc--CCCCcEEEeecccEEEEEEecceEEEEecccc------------chhhcCcEEEEEEEeeccEEecccceEEE Confidence 98765 5678999999986 667889999998876542 46999999999999999999999999999 Q ss_pred EeecCC Q lcl|Aclame:pro 311 VPADKR 316 (324) Q Consensus 311 ~~~~~~ 316 (324) +.+++. T Consensus 380 ~~~aa~ 385 (385) T protein:vir:18 380 TFSSGS 385 (385) T ss_pred EeccCC Confidence 999988 No 39 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=100.00 E-value=7.6e-56 Score=322.77 Aligned_cols=301 Identities=13% Similarity=0.086 Sum_probs=248.2 Q ss_pred CchhHHHHHHHHH-HHhh--hhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe Q lcl|Aclame:pro 1 MEQTQKLKLNLQH-FASN--NVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWA 77 (324) Q Consensus 1 ~~~~~~~k~~~~~-~a~~--~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~ 77 (324) +...++.....++ +... .....+.+....++++.+|++||++++..|++.+++.++|++++++++++++.+++|+.+ T Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 154 (385) T protein:vir:19 75 KSFSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVREE 154 (385) T ss_pred hhhHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEEEEEe Confidence 1111111111111 1111 111112232333455667889999999999999999999999999999999889999987 Q ss_pred C-CcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccc Q lcl|Aclame:pro 78 D-KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFG 156 (324) Q Consensus 78 ~-~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~ 156 (324) . .+.+.|++|++.+|+++++|+++++.++|++++++||+|+++|+ ++++++|.++|+++++.++|+++|+|+|++.++ T Consensus 155 ~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~-~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~ 233 (385) T protein:vir:19 155 VFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQVMDDA-PMLQSYINNRLMYGLALKEEGQLLNGDGTGDNL 233 (385) T ss_pred cCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcc Confidence 5 57889999999999999999999999999999999999999986 579999999999999999999999999999888 Q ss_pred ccccccccccc--ccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc---CCcceeeccee Q lcl|Aclame:pro 157 KSIAQSIEKTN--KVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD---RNSDSLDGLPV 231 (324) Q Consensus 157 ~~~~~~~~~~~--~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~---~~~~~l~G~pv 231 (324) .++........ ...++..++++|.+++.++...++.+++|+|||+++..|++++|++|+|++.+ +.+++|+|+|| T Consensus 234 ~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~l~G~pV 313 (385) T protein:vir:19 234 EGLNKVATAYDTSLNATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIALLKDNEGRYIFGGPQAFTSNIMWGLPV 313 (385) T ss_pred cccccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeccCcccCCCceecceee Confidence 88766544332 23345678899999999999999999999999999999999999999999864 56789999999 Q ss_pred EeecCCCCCCceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEE Q lcl|Aclame:pro 232 VNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKL 310 (324) Q Consensus 232 ~~~~~~~~~~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l 310 (324) ++++. ++++++++|||+. +.++.+++++++++++.. .+|++|++.||++.|+||++.+|+||+++ T Consensus 314 ~~~~~--~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~------------~~~~~~~~~~~~~~r~~~~v~~~~a~~~~ 379 (385) T protein:vir:19 314 VPTKA--QAAGTFTVGGFDMASQVWDRMDATVEVSREDR------------DNFVKNMLTILCEERLALAHYRPTAIIKG 379 (385) T ss_pred EEcCc--CCCCcEEEeecccEEEEEEecceEEEEecccc------------chhhcCcEEEEEEEeeccEEecccceEEE Confidence 98765 5678999999986 667889999998876542 46999999999999999999999999999 Q ss_pred EeecCC Q lcl|Aclame:pro 311 VPADKR 316 (324) Q Consensus 311 ~~~~~~ 316 (324) +.+++. T Consensus 380 ~~~aa~ 385 (385) T protein:vir:19 380 TFSSGS 385 (385) T ss_pred EeccCC Confidence 999988 No 40 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=100.00 E-value=1.1e-55 Score=321.93 Aligned_cols=299 Identities=14% Similarity=0.103 Sum_probs=245.3 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeC-C Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWAD-K 79 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~-~ 79 (324) .......+.+.+.......... .+....+++.++|+++|++++++|++.+++.++|++++++++++++.+++|+.++ . T Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~ 166 (395) T protein:vir:43 88 VAESLKEQGVTSSLRGSHRVSM-PRSAITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTESNSVEYVRETGFV 166 (395) T ss_pred HHHHHHHHHHHHHhhhhhhhhh-hhhhhcccCCCCccccchhhHHHHHHHHHhhhhHHhhccceecCCCceEEEEEecCC Confidence 0011111111111111111111 1223345556788999999999999999999999999999999998899999876 4 Q ss_pred cceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccc Q lcl|Aclame:pro 80 PGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSI 159 (324) Q Consensus 80 ~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~ 159 (324) +.+.|++|++.+|+++++|++++++++|++++++||+|+++|+ ++++++|.++|+++++.++|.++|+|+|++.++.++ T Consensus 167 ~~a~~v~E~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~l~~~v~~~la~a~~~~~d~~~l~G~g~~~~~~Gi 245 (395) T protein:vir:43 167 NNAAPVSEGTQKPYSDLTFELENAPVRTIAHLFKASRQILDDA-SALQSYIDARARYGLMLVEECQLLYGNGTGANLHGI 245 (395) T ss_pred CceeeecCCccccccccceeEEEEeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccc Confidence 6899999999999999999999999999999999999999986 579999999999999999999999999998888777 Q ss_pred ccccccccc----cccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc---CCcceeecceeE Q lcl|Aclame:pro 160 AQSIEKTNK----VIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD---RNSDSLDGLPVV 232 (324) Q Consensus 160 ~~~~~~~~~----~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~---~~~~~l~G~pv~ 232 (324) .+....... ...+...++++.+++..+...++.+++|+|||+++..|++++|++|+|++.+ +.+++|+|+||+ T Consensus 246 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~l~G~pVv 325 (395) T protein:vir:43 246 IPQAQAYAPPSGVVVTAEQRIDRIRLAILQAQLAEFPASGIVLNPIDWALIELNKDAENRYIIGSPQNGTTPTLWRLPVV 325 (395) T ss_pred cccccccccccccccccchhHHHHHHHHHhhccccCCCcEEEEcHHHHHHHHHhhccCCceeccccccCCCceecceeeE Confidence 664443222 2334557899999999999999999999999999999999999999999864 456789999999 Q ss_pred eecCCCCCCceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEE Q lcl|Aclame:pro 233 NLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLV 311 (324) Q Consensus 233 ~~~~~~~~~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~ 311 (324) ++.. ++++++++|||+. +.++++++++++++++.. .+|++|+++||++.|+||++.+|+||++++ T Consensus 326 ~~~~--~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~ 391 (395) T protein:vir:43 326 ETQA--ITQDEFLTGAFSLGAQIFDRMDIEVLVSTEND------------KDFENNMVTIRAEERLAFAVYRPEAFVTGS 391 (395) T ss_pred EcCC--CCCCcEEEEeccceEEEEEecceEEEEecccc------------chhhcCcEEEEEEEeeccEEecccceEEEE Confidence 8765 5678899999997 457788999999876532 469999999999999999999999999999 Q ss_pred eecC Q lcl|Aclame:pro 312 PADK 315 (324) Q Consensus 312 ~~~~ 315 (324) .+++ T Consensus 392 ~taa 395 (395) T protein:vir:43 392 LTAS 395 (395) T ss_pred eccC Confidence 9888 No 41 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=100.00 E-value=8.7e-56 Score=322.46 Aligned_cols=297 Identities=14% Similarity=0.107 Sum_probs=233.9 Q ss_pred Cchh----HHHHHHHHHHHhhhhhHHhhcc-ccccccccCccccchHHHHHHH-HHHHhhhhhhhhcceeecCCC-ceEE Q lcl|Aclame:pro 1 MEQT----QKLKLNLQHFASNNVKPQVFNP-DNVMMHEKKDGTLMNEFTTPIL-QEVMENSKIMQLGKYEPMEGT-EKKF 73 (324) Q Consensus 1 ~~~~----~~~k~~~~~~a~~~~~~~~~~~-~~~~~~~~~~~~vp~~~~~~i~-~~~~~~s~l~~l~~~~~~~~~-~~~i 73 (324) .... ...+.++|..........+... ....+++.+|+++|+++..+++ +.++..+++++++++++++++ .+.| T Consensus 79 ~~~~~~~~~~~~~~~r~~~~~~~r~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~ 158 (390) T protein:vir:62 79 SGAQRSADVDDDATLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDF 158 (390) T ss_pred ccchhhcchHHHHHHhhhhhhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEE Confidence 0000 0112223332211111111111 1223344455666666665555 556677778889999988664 5889 Q ss_pred EEEeCCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCcc Q lcl|Aclame:pro 74 TFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNN 153 (324) Q Consensus 74 p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~ 153 (324) |+.++.+.+.|++|++.+|+++++|++++++++|++++++||+|+++||.++++++|.+.++++++.++|.++|+|+|. T Consensus 159 p~~~~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~- 237 (390) T protein:vir:62 159 TVITGRSSASIVGETAEIPESYPATAQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAGPAIGDAMGRHFITGTGQ- 237 (390) T ss_pred EEEcCCcceeeecccccccccccceeeeEeeeeeEEeehHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHhhhhccCCc- Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999999984 Q ss_pred cccccccccc----ccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc----CCcce Q lcl|Aclame:pro 154 PFGKSIAQSI----EKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDS 225 (324) Q Consensus 154 ~~~~~~~~~~----~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~~~~~ 225 (324) |.++.+.. .......++.++++++++++.+|...+..+++|+||++++.+|+++||.+|+|+|++ +.+++ T Consensus 238 --p~Gi~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~g~~~~ 315 (390) T protein:vir:62 238 --PRGILTDASPATATFLATDTDSKVSDALIDLFHEVPSAYRANAKYVVNDLRAAQMRKLKDANGQYLWQSGLTVGAPSL 315 (390) T ss_pred --cccccccccccccceecccccccchHHHHHHHHhhhhhhhcCCEEEEchHHHHHHHHhhccCCCeeecCCcCCCccce Confidence 34443322 122223345678999999999999999999999999999999999999999999864 44568 Q ss_pred eecceeEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccC Q lcl|Aclame:pro 226 LDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDK 305 (324) Q Consensus 226 l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~ 305 (324) |+|+||++++. ++++.++||||++++++.++++.++.+.+. +|.+|+++||++.|+|+++.||+ T Consensus 316 l~G~Pv~~~~~--~p~~~i~~gd~s~~~i~~~~~~~v~~~~~~--------------~~~~~~~~~~~~~r~d~~~~~~~ 379 (390) T protein:vir:62 316 FNGKVVETDDG--MPADKILFADLSKYRVRFAGSLRVDRSVDA--------------KFSTDQIVYRFLQRADGLLVDAR 379 (390) T ss_pred ecccceEEecC--CCCccEEEeeccceeEEeecceEEEeeccc--------------cccCCcEEEEEEEEeCcEeechh Confidence 99999998765 456789999999999999999999987653 58999999999999999999999 Q ss_pred ceEEEEeecCC Q lcl|Aclame:pro 306 AFAKLVPADKR 316 (324) Q Consensus 306 A~~~l~~~~~~ 316 (324) ||+.|+.+++. T Consensus 380 A~~~l~~~~~a 390 (390) T protein:vir:62 380 GAKVLTVTPGA 390 (390) T ss_pred heEEEEeecCC Confidence 99999998888 No 42 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=100.00 E-value=1.8e-55 Score=320.66 Aligned_cols=297 Identities=14% Similarity=0.105 Sum_probs=246.6 Q ss_pred CchhHHHHHHHHHHHhhhhh-----HHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEE Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVK-----PQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTF 75 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~-----~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~ 75 (324) .......+.++......... ....+....++++++|+++|++++.+|++.+++.++|++++++++++++.+++|+ T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~ 161 (390) T protein:vir:97 82 FVASEQFQASTGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQ 161 (390) T ss_pred hhhhHHHHHHHHHhhhhhhhhhhHHHHHHHhhhcccccccccccchhhhHHHHHHHhhhhhhHhhcceeeccCCceEEEE Confidence 11111222222222111111 1112334445667788999999999999999999999999999999999999999 Q ss_pred EeC-CcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccc Q lcl|Aclame:pro 76 WAD-KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP 154 (324) Q Consensus 76 ~~~-~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~ 154 (324) .++ .+.+.|++||+++|+++++|+++++.++|++++++||+|+++|+ ++++++|.+++++++++++|+++|+|+|++. T Consensus 162 ~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds-~~l~~~i~~~la~a~~~~~d~a~l~G~g~~~ 240 (390) T protein:vir:97 162 ETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDA-PQLASYMNNRLIRGLKVKEDAEILRGTGAND 240 (390) T ss_pred EecCCcceeeecCCccccccccceeEEEEeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCc Confidence 876 46899999999999999999999999999999999999999997 5899999999999999999999999999988 Q ss_pred ccccccccccccc--ccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc---CCcceeecc Q lcl|Aclame:pro 155 FGKSIAQSIEKTN--KVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD---RNSDSLDGL 229 (324) Q Consensus 155 ~~~~~~~~~~~~~--~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~---~~~~~l~G~ 229 (324) .+.++.+...... ...++...++++.+++..+...++.+++|+|||++|..|++++|++|+|+|.+ +.+++|+|+ T Consensus 241 ~p~Gi~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~l~G~ 320 (390) T protein:vir:97 241 GLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWGL 320 (390) T ss_pred cccceeeccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCccCCCCceecce Confidence 8888766543332 23445677899999999999999999999999999999999999999999964 456689999 Q ss_pred eeEeecCCCCCCceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceE Q lcl|Aclame:pro 230 PVVNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFA 308 (324) Q Consensus 230 pv~~~~~~~~~~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~ 308 (324) ||++++. ++++++++|||++ +.++.+.+++++.+++. .+|++|+++||++.|+||.+.||+||+ T Consensus 321 pV~~~~~--~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-------------~~f~~~~~~~r~~~r~d~~v~~~~a~v 385 (390) T protein:vir:97 321 PVVATQA--MAPGEFLVGAFDLAAQIFDQWDARVEIGYVN-------------DDFQRNMVTVLAEERLALVVYRPEALI 385 (390) T ss_pred eeEEcCC--CCCCcEEEEeccceEEEEEecceEEEEeecc-------------cccccCcEEEEEEEeeccEEeccccEE Confidence 9999765 5678999999997 56788999999987643 248999999999999999999999999 Q ss_pred EEEee Q lcl|Aclame:pro 309 KLVPA 313 (324) Q Consensus 309 ~l~~~ 313 (324) +++.+ T Consensus 386 ~~~~a 390 (390) T protein:vir:97 386 TGSFA 390 (390) T ss_pred EEEeC Confidence 99998 No 43 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=100.00 E-value=2.7e-55 Score=319.79 Aligned_cols=297 Identities=14% Similarity=0.102 Sum_probs=244.4 Q ss_pred CchhHHHHHHHHHHHhhhhhH-----HhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEE Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKP-----QVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTF 75 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~-----~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~ 75 (324) .......+.++.......... ...+.....++..+|+++|+++..+|++.+++.++|++++++++++++.+++|+ T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~ 161 (390) T protein:vir:10 82 FVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTDSALIEYVQ 161 (390) T ss_pred hhhhHHHHHHHHhhhhhhhhhhhHHHHHHHhhhcccccccccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEE Confidence 011112222222221111111 011223334455678899999999999999999999999999999999999999 Q ss_pred EeCC-cceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccc Q lcl|Aclame:pro 76 WADK-PGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP 154 (324) Q Consensus 76 ~~~~-~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~ 154 (324) .++. +.+.|++|++.+|+++++|+++++.++|++++++||+|+++|+. +++++|.++|+++++.++|+++|+|+|++. T Consensus 162 ~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-~l~~~i~~~l~~~~~~~~~~~il~G~G~~~ 240 (390) T protein:vir:10 162 ETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAP-QLASYMNNRLIRGLKVKEDAEILRGTGAND 240 (390) T ss_pred EecCCcceeeecCCccccccccceeEEEEeeEEEEEeehhhHHHHHhHH-HHHHHHHHHHHHHHHHHHHHHHhhcCCCCc Confidence 8764 67999999999999999999999999999999999999999975 899999999999999999999999999988 Q ss_pred cccccccccccc--cccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc---CCcceeecc Q lcl|Aclame:pro 155 FGKSIAQSIEKT--NKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD---RNSDSLDGL 229 (324) Q Consensus 155 ~~~~~~~~~~~~--~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~---~~~~~l~G~ 229 (324) .+.++.+..... ....++...++++.+++..+...++.+++|+|||++|..|++++|++|+|+|.+ +.+++|+|+ T Consensus 241 ~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~l~~~~~~~~~~~l~G~ 320 (390) T protein:vir:10 241 GLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWGL 320 (390) T ss_pred cccccccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCCcCcCCceecce Confidence 888877654433 233445667899999999999999999999999999999999999999999974 345689999 Q ss_pred eeEeecCCCCCCceeEEeecccE-EEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceE Q lcl|Aclame:pro 230 PVVNLKSSNLKRGELITGDFDKL-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFA 308 (324) Q Consensus 230 pv~~~~~~~~~~~~~i~gd~s~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~ 308 (324) ||+++.. ++++++++|||+++ .++.++++.++.+++. .+|++|++.||++.|+||++.+|+||+ T Consensus 321 pv~~~~~--~p~~~~~~gdf~~~~~~~~~~~~~i~~~~~~-------------~~~~~~~~~~r~~~r~d~~v~~~~a~~ 385 (390) T protein:vir:10 321 PVVATQA--MAPGEFLVGAFDLAAQIFDQWDARVEIGYVN-------------DDFQRNMVTVLAEERLALVVYRPEALI 385 (390) T ss_pred eeEEcCC--CCCCcEEEEeccceEEEEEecceEEEEeecc-------------cccccCcEEEEEEEeeccEEeccccEE Confidence 9998765 55788999999974 5788999999987643 248999999999999999999999999 Q ss_pred EEEee Q lcl|Aclame:pro 309 KLVPA 313 (324) Q Consensus 309 ~l~~~ 313 (324) +++.+ T Consensus 386 ~~~~a 390 (390) T protein:vir:10 386 SGSFA 390 (390) T ss_pred EEEeC Confidence 99998 No 44 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=100.00 E-value=2.8e-55 Score=319.64 Aligned_cols=297 Identities=14% Similarity=0.102 Sum_probs=244.9 Q ss_pred CchhHHHHHHHHHHHhhhhhH-----HhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEE Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKP-----QVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTF 75 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~-----~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~ 75 (324) .......+.+........... ...+....+++.++|+++|++++.+|++.+++.++|++++++++++++.+++|+ T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~ 161 (390) T protein:vir:81 82 FVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQ 161 (390) T ss_pred hhhhHHHHHHHHHHhhhhhhhhhHHHHHHHhhccccccCCcceechhhhHHHHHHHhhhhhhhhhcceeeccCCceEEEE Confidence 011112222222211111110 111223334556788899999999999999999999999999999999999999 Q ss_pred EeCC-cceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccc Q lcl|Aclame:pro 76 WADK-PGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP 154 (324) Q Consensus 76 ~~~~-~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~ 154 (324) .++. +.+.|++|++.+|+++++|+++++.++|++++++||+|+++|+ ++++++|.++|++++++++|+++|+|+|++. T Consensus 162 ~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~ 240 (390) T protein:vir:81 162 ETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDA-PQLASYMNNRLIRGLKVKEDAEILRGTGAND 240 (390) T ss_pred EecCCcceeeecCCcccccccceeeEEEEeeeEEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC Confidence 8764 5789999999999999999999999999999999999999997 5899999999999999999999999999988 Q ss_pred cccccccccccc--cccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc---CCcceeecc Q lcl|Aclame:pro 155 FGKSIAQSIEKT--NKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD---RNSDSLDGL 229 (324) Q Consensus 155 ~~~~~~~~~~~~--~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~---~~~~~l~G~ 229 (324) .+.++....... ....++...++++.+++.++...++.+++|+|||+++..|++++|++|+|+|.+ +.+++|+|+ T Consensus 241 ~~~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~l~G~ 320 (390) T protein:vir:81 241 GLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYNPSGIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWGL 320 (390) T ss_pred cccceeecccccccccccccchhHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCcccccCceecce Confidence 888876654433 233445677899999999999999999999999999999999999999999964 445689999 Q ss_pred eeEeecCCCCCCceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceE Q lcl|Aclame:pro 230 PVVNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFA 308 (324) Q Consensus 230 pv~~~~~~~~~~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~ 308 (324) ||+++.. ++++++++|||++ +.++.+++++++.+++. .+|++|++.||++.|+||++.+|+||+ T Consensus 321 pv~~~~~--~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~-------------~~~~~~~v~~r~~~r~d~~v~~~~a~v 385 (390) T protein:vir:81 321 PVVATQA--MAPGEFLVGAFDLAAQIFDQWDARVEIGYVG-------------EDFQRNMITVLAEERLALVVYRPEALI 385 (390) T ss_pred eeEEcCC--CCCCcEEEEehhceEEEEEecceEEEEeccc-------------chhhcCcEEEEEEEeeccEEecccceE Confidence 9998765 5678999999997 56788999999987643 259999999999999999999999999 Q ss_pred EEEee Q lcl|Aclame:pro 309 KLVPA 313 (324) Q Consensus 309 ~l~~~ 313 (324) +++.+ T Consensus 386 ~~t~a 390 (390) T protein:vir:81 386 SGSFA 390 (390) T ss_pred EEEeC Confidence 99998 No 45 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=100.00 E-value=3.1e-55 Score=319.44 Aligned_cols=304 Identities=10% Similarity=0.048 Sum_probs=242.3 Q ss_pred CchhHHHHH---HHHHHHhhhhhHHh---h--ccccccccccCccccchHHHHHHH-HHHHhhhhhhhhcceeecCCCce Q lcl|Aclame:pro 1 MEQTQKLKL---NLQHFASNNVKPQV---F--NPDNVMMHEKKDGTLMNEFTTPIL-QEVMENSKIMQLGKYEPMEGTEK 71 (324) Q Consensus 1 ~~~~~~~k~---~~~~~a~~~~~~~~---~--~~~~~~~~~~~~~~vp~~~~~~i~-~~~~~~s~l~~l~~~~~~~~~~~ 71 (324) .++....+. .++..........+ + ......++++||.+||++++..+| +.+++.++++++++++++ ++.+ T Consensus 216 ~~~~~~~~a~~~~~~~~~~~~l~~~e~~~~~~~~~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~-~g~~ 294 (543) T protein:vir:81 216 TSSPAYLRAWSKMARNPHAAILTEEEKRAINEVRAMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQVVA-TGDV 294 (543) T ss_pred hhhhhhhhHHHHHHHhhHHHHhhhhhhhhhhhhhhcccccccCcccCchhhhhHHHHHHHhhhchhhhhcccccC-Ccce Confidence 010011010 11111111111111 1 111234566788999999998876 557788999999987765 4568 Q ss_pred EEEEEeCCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|Aclame:pro 72 KFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) Q Consensus 72 ~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g 151 (324) .+|+.++.+.+.|++|++.+|+++++|++++++++|++++++||+|+++|+ +++.++|.+.|+++++.++|.++|+|+| T Consensus 295 ~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~~~~~i~~~l~~~~~~~~d~ail~G~G 373 (543) T protein:vir:81 295 WHGVSSAAVQWSWDAEFEEVSDDSPEFGQPEIPVKKAQGFVPISIEALQDE-ANVTETVALLFAEGKDELEAVTLTTGTG 373 (543) T ss_pred EEEEecCCcceeecccCccccccccccceeeeeeeeeEeeehhhHHHHhcc-HHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 899999999999999999999999999999999999999999999999997 6999999999999999999999999999 Q ss_pred ccccccccccccc----cccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc---CCcc Q lcl|Aclame:pro 152 NNPFGKSIAQSIE----KTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD---RNSD 224 (324) Q Consensus 152 ~~~~~~~~~~~~~----~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~---~~~~ 224 (324) ++..+.|+..... ......++.++++++.+++..++..+..+++|+||++++..|++++|++|+|+|.+ +.++ T Consensus 374 t~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~g~~~ 453 (543) T protein:vir:81 374 QGNQPTGIVTALAGTAAEIAPVTAETFALADVYAVYEQLAARHRRQGAWLANNLIYNKIRQFDTQGGAGLWTTIGNGEPS 453 (543) T ss_pred CCcccccchhhcccccccccccccccccHHHHHHHHHhhhccccCCcEEEEcHHHHHHHHHhhcCCCceeccCcCCCCCc Confidence 8877777654322 22333456778999999999999999999999999999999999999999999964 4567 Q ss_pred eeecceeEeecCCCC--------CCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEE Q lcl|Aclame:pro 225 SLDGLPVVNLKSSNL--------KRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMH 296 (324) Q Consensus 225 ~l~G~pv~~~~~~~~--------~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r 296 (324) +|+|+||+++.+++. +...++||||++++++.++++++.++.+.. .-+.|.+|+++||++.| T Consensus 454 ~l~G~pv~~~~~~~~~~~~~~~~~~~~i~~gd~~~~~i~~~~~~~i~~~~~~~----------~~~~~~~~~~~~~~~~r 523 (543) T protein:vir:81 454 QLLGRPVGEAEAMDANWNTSASADNFVLLYGNFQNYVIADRIGMTVEFIPHLF----------GTNRRPNGSRGWFAYYR 523 (543) T ss_pred cccceeeEEeccccccccccccCCcceEEEeeccceeEEeecccEEEEecccc----------ccchhhcCceEEEEEEe Confidence 899999999877553 333589999999999999999999886542 11347889999999999 Q ss_pred eccEEeccCceEEEEeecCC Q lcl|Aclame:pro 297 VALHIADDKAFAKLVPADKR 316 (324) Q Consensus 297 ~d~~v~~~~A~~~l~~~~~~ 316 (324) +||.+.+|+||++|+.+++. T Consensus 524 ~d~~v~~~~A~~~l~~~~~a 543 (543) T protein:vir:81 524 MGADVVNPNAFRLLNVETAS 543 (543) T ss_pred eccEeecccceEEEEecccC Confidence 99999999999999998888 No 46 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=100.00 E-value=1.2e-54 Score=316.30 Aligned_cols=307 Identities=12% Similarity=0.141 Sum_probs=250.1 Q ss_pred CchhHHHHHH----HHHHHh--hhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCC--CceE Q lcl|Aclame:pro 1 MEQTQKLKLN----LQHFAS--NNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEG--TEKK 72 (324) Q Consensus 1 ~~~~~~~k~~----~~~~a~--~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~--~~~~ 72 (324) .+.....+.. ++.... ......+.++.+.++.++||.+||++++++|++.+++.++|+++++++++++ +.+. T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~ 157 (404) T protein:vir:10 78 YNGALFVRAIADNLLKQKNQRGLNLSEKEINAISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRT 157 (404) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhcchhhHHhhhccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceeeccCCccceE Confidence 1111111111 111100 0111223455556667788999999999999999999999999999988764 5577 Q ss_pred EEEEeCCcceeeeccCcccccc--ccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 73 FTFWADKPGAYWVGEGQKIETS--KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ 150 (324) Q Consensus 73 ip~~~~~~~a~~v~Eg~~~~~~--~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~ 150 (324) +|+.++.+.+.|++|++.+|.+ +++|++++++++|++++++||+|+++|+.++++++|.+.++++++.++|+++|+|+ T Consensus 158 ~~~~~~~~~~~~v~e~~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~G~ 237 (404) T protein:vir:10 158 YEKRSKQKPMKPLSENQQIPTNGDNGKLERFNFKLKDLADFMSIPNDLLKFADKSLEDWIINWFVDKVRITRNAEILYGA 237 (404) T ss_pred EEEecCCcceeeccccccccccccccceeeeEeeheeeEeeehhhHHHHhhcHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 8888888999999999999875 58899999999999999999999999999999999999999999999999999999 Q ss_pred CccccccccccccccccccccchhhhhHHHHHHH-HhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc----CCcce Q lcl|Aclame:pro 151 GNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEA-LLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDS 225 (324) Q Consensus 151 g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~-~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~~~~~ 225 (324) |++.++.++............+..+++++.+++. .+...+..+++|+|||++|..|+++||++|+|+|.+ +.+++ T Consensus 238 g~~~~~~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~ 317 (404) T protein:vir:10 238 GGDEHATGIMTANKFKKITLPKSPALKDFKKCKNVELLNVFKATSSWIVNQDGFNYLDSLEDKTGRPYLQPDPKDPTQYR 317 (404) T ss_pred CCCCcccceeeccccceeeccccccHHHHHHHHHhhhhccccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCcc Confidence 9988888887766666566666778899988876 677888888999999999999999999999999864 45678 Q ss_pred eecceeEeecCC----CCCCceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccE Q lcl|Aclame:pro 226 LDGLPVVNLKSS----NLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALH 300 (324) Q Consensus 226 l~G~pv~~~~~~----~~~~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~ 300 (324) |+|+||+++++. ...+..+++|||++ +.++.+++++++++++. +..|++|++.||++.|+|+. T Consensus 318 l~G~PV~~~~~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~~i~~~~~~------------~~~~~~~~~~~~~~~r~d~~ 385 (404) T protein:vir:10 318 FLGLPVIELPNDLLLSTESAIPVLLGDTKEAYKYVSDGAYELATTNIG------------AGAFETNTTKARIIMRIDGN 385 (404) T ss_pred ccceeeEEecccccCCCCCccEEEEEeccccEEEEEecceEEEEeccc------------cchhhcCceEEEEEEeeccE Confidence 999999865442 23455689999996 56788999999987643 24699999999999999999 Q ss_pred EeccCceEEEEeecCCCCC Q lcl|Aclame:pro 301 IADDKAFAKLVPADKRTDS 319 (324) Q Consensus 301 v~~~~A~~~l~~~~~~~~~ 319 (324) +.+|+||++++.+++...+ T Consensus 386 v~~~~a~~~~~~~~aa~~~ 404 (404) T protein:vir:10 386 VKDSEALLIAEIPVESVQA 404 (404) T ss_pred EecccceEEEEeecccCCC Confidence 9999999999998887666 No 47 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=100.00 E-value=1.2e-54 Score=316.14 Aligned_cols=301 Identities=16% Similarity=0.130 Sum_probs=238.6 Q ss_pred CchhHHHHHHHHHHHh-hhhhHHh--hccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFAS-NNVKPQV--FNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWA 77 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~-~~~~~~~--~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~ 77 (324) +++.+. +..++.... ....... ......++++++|++||+++.+.|++.+++.++++++++++++++ ..+||+.. T Consensus 110 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~Ii~~l~~~~~i~~~~~~~~~~g-~~~ip~~~ 187 (425) T protein:vir:95 110 MNRLQV-REMLKTGEYYKRSEVVEFYEKFRNLRAVAGGELTIPEVVVNRIMDIMGDYTTLYPLVDKIRVKG-TTRILVDT 187 (425) T ss_pred HHHHHH-HHHHhhhhhhhhhHHHHHHHHHHhhcccccCceeccHHHHHHHHHHHHhhhhHHHhhceeecCc-eeEEEEec Confidence 111100 111111100 0000000 111122344568889999999999999999999999999999865 57899999 Q ss_pred CCcceeeeccCccccccc-cceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCcc-cc Q lcl|Aclame:pro 78 DKPGAYWVGEGQKIETSK-ATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNN-PF 155 (324) Q Consensus 78 ~~~~a~~v~Eg~~~~~~~-~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~-~~ 155 (324) +.+.+.|++|++++|+++ ++|++++++++|++++++||+|+++|+.++++++|.++++++++.++|+++|+|+|++ .. T Consensus 188 ~~~~a~~v~E~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~G~G~~~~~ 267 (425) T protein:vir:95 188 DTSPATWIEQSGALPTGDVGTIASIDFDGFKVGKVTFVDNYLLQDSIINLDDYVTKKIARAIAKALDLAIVKGTGAANKQ 267 (425) T ss_pred CCccccccccccccccccccccceeeeeheeeeeeehhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccc Confidence 999999999999999877 6899999999999999999999999999999999999999999999999999999974 44 Q ss_pred ccccccccccc--cccccchhhhhHHHHHHHHhhhhcC--CCcEEEEcHHHH----HHHHHhhccCCceeec--cCCcce Q lcl|Aclame:pro 156 GKSIAQSIEKT--NKVIKGDFTQDNIIDLEALLEDDEL--EANAFISKTQNR----SLLRKIVDPETKERIY--DRNSDS 225 (324) Q Consensus 156 ~~~~~~~~~~~--~~~~~~~~~~~~i~~~~~~l~~~~~--~~~~~v~~~~~~----~~l~~~~d~~g~~~~~--~~~~~~ 225 (324) |.|+....... ....++..+++++.+++..+...+. .+++|+||+.++ ..|++++|.+|+|++. ....++ T Consensus 268 p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l~~~kd~~g~~i~~~~~~~~~~ 347 (425) T protein:vir:95 268 PLGIIPSLPPENQVTVEADNNLLKNLVKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEFSIQVDSNGNVVGKLPNLRTPD 347 (425) T ss_pred cceeecccccccccccccccchHHHHHHHHHhhhhhccccCceEEEEeChHHHHHHHHHHhhcCCCCceeeccCCCCCcc Confidence 55555433222 2234467789999999998877664 456799999874 3567889999999986 345678 Q ss_pred eecceeEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccC Q lcl|Aclame:pro 226 LDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDK 305 (324) Q Consensus 226 l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~ 305 (324) |+|+||+++.++ ++++++||||++++++.++++++.++++. .|.+|++.||++.|+|+++.||+ T Consensus 348 l~G~pvv~~~~~--~~~~i~~Gd~~~~~~~~~~~~~i~~~~~~--------------~f~~~~~~~~~~~r~d~~~~~~~ 411 (425) T protein:vir:95 348 LLGLRVVFNNFL--DDDTVLFGEFEQYTLVERENITIDSSTHV--------------KFTEDQTAFRGKGRFDGKPVKPE 411 (425) T ss_pred ccceeeEEcCcC--CCccEEEEecccEEEEeecceEEEeeccc--------------ccccCceEEEEEEeeCcEeeccc Confidence 999999987754 56789999999999999999999998764 38999999999999999999999 Q ss_pred ceEEEEeecCCCCC Q lcl|Aclame:pro 306 AFAKLVPADKRTDS 319 (324) Q Consensus 306 A~~~l~~~~~~~~~ 319 (324) ||+.++.+++..++ T Consensus 412 a~~~~~i~~~~~g~ 425 (425) T protein:vir:95 412 AFVLVTITDPVQGA 425 (425) T ss_pred ceEEEEecCcCCCC Confidence 99999999887777 No 48 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=100.00 E-value=3.9e-54 Score=313.38 Aligned_cols=298 Identities=16% Similarity=0.117 Sum_probs=238.8 Q ss_pred CchhHHHHHHHHHHHhhhhh--HHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceE--EEEE Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVK--PQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKK--FTFW 76 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~--~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~--ip~~ 76 (324) .++.+......+.|...... ....+.....++++||.+||++++.+|++.+++.++|++++++++++++..+ +|+. T Consensus 81 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 160 (397) T protein:vir:49 81 KNEEEVKANFVKDFKNLVRGRYQNLLDSKTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKW 160 (397) T ss_pred chhhHHHHHHHHHHHHHhhcchhhHHHhhhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEee Confidence 22222222333333321111 1122334455667788999999999999999999999999999988766554 4555 Q ss_pred eC-CcceeeeccCccccccc-cceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccc Q lcl|Aclame:pro 77 AD-KPGAYWVGEGQKIETSK-ATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP 154 (324) Q Consensus 77 ~~-~~~a~~v~Eg~~~~~~~-~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~ 154 (324) .+ .+.+.|++|++.+|+++ ++|++++++++|++++++||+|+++|+.++++++|.+++++++++++|+++|+|+|++. T Consensus 161 ~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ail~G~g~~~ 240 (397) T protein:vir:49 161 ADITGLAKLDDEGGQIGQNDDPKLSLIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIGTLP 240 (397) T ss_pred ccCCcceeeeccccccccccccceeeeEeeeeeeEeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Confidence 43 46789999999999865 79999999999999999999999999999999999999999999999999999998764 Q ss_pred cccccccccccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc----CCcceeecce Q lcl|Aclame:pro 155 FGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLP 230 (324) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~~~~~l~G~p 230 (324) +. .+.++++++.+++.++...++.+++|+|||+++..|++++|++|+|+|.+ +.+++|+|+| T Consensus 241 ~~--------------~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~g~~l~~~~~~~g~~~~l~G~p 306 (397) T protein:vir:49 241 NK--------------PTLAKWDDIIDLQAKVDPAIKQTSLFLTNTSGFTALKKVKNAMGDYLMERDVKSPTGYSIDGFV 306 (397) T ss_pred cc--------------ccccCHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceeecccccCCCCceeccee Confidence 32 23457899999999999999999999999999999999999999999853 4567999999 Q ss_pred eEeecCCC-----CCCceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEecc Q lcl|Aclame:pro 231 VVNLKSSN-----LKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADD 304 (324) Q Consensus 231 v~~~~~~~-----~~~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~ 304 (324) |+++.+.. .+...++||||++ +.++++++++++++++.. ++|++|++.||++.|+|+.+.+| T Consensus 307 V~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~~~~~r~d~~~~~~ 374 (397) T protein:vir:49 307 VKEISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQHLSLLSTNIGG------------GAFETDTTKVRVIDRFDVVSTDT 374 (397) T ss_pred eEEecccccccccCCceeEEEeeccceEEEEeecccEEEEecccc------------chhhcCeeeEEEEEeeccEEecc Confidence 98765433 3455689999997 568999999999887542 46999999999999999999999 Q ss_pred CceEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 305 KAFAKLVPADKRTDSVPGEV 324 (324) Q Consensus 305 ~A~~~l~~~~~~~~~~~~~~ 324 (324) +||++++.++.....+.-.. T Consensus 375 ~a~~~~~~~~~~~~~~~~~~ 394 (397) T protein:vir:49 375 EAFVPASFKAIADQKAKLST 394 (397) T ss_pred cceEEEEecccccccCcccc Confidence 99999997765543333333 No 49 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=100.00 E-value=2.2e-54 Score=314.72 Aligned_cols=301 Identities=12% Similarity=0.074 Sum_probs=236.0 Q ss_pred CchhHHHHHHHHHHH---hhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFA---SNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWA 77 (324) Q Consensus 1 ~~~~~~~k~~~~~~a---~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~ 77 (324) .++.....+..+.|. .......+.++.+ +++++||++||++++++|++.+++.++|+++++++++++ ..++|+.. T Consensus 114 ~~~~~~~~e~r~a~~~~l~~~~~~~e~~a~~-~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~~~~-~~~~p~~~ 191 (434) T protein:vir:62 114 GHRTNKETEIRSVFANYIVGNIDEKEARALG-LVTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGVKTKE-NIKYPVLV 191 (434) T ss_pred cccchHHHHHHHHHHHHhccccchhhhhhhc-ccccccceecchhhHHHHHHhhhhhhhhhhhcceeccCC-ceEEEEEe Confidence 111111111111121 1111222233322 234568889999999999999999999999999888765 58899988 Q ss_pred CCcceeee---ccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccc Q lcl|Aclame:pro 78 DKPGAYWV---GEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP 154 (324) Q Consensus 78 ~~~~a~~v---~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~ 154 (324) ..+.+.|. +|++.+|.++++|+++++.++|++++++||+|+++|+.++++++|.++|+++++.++|+++|+|+|++. T Consensus 192 ~~~~a~~~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~ 271 (434) T protein:vir:62 192 KKAEAQGHKNERTNNEMPETDIEFDEIELSPTEFDALATVTKKLLARTGLPIEQIVMDELKKAYVRKETQYMVNGDEANN 271 (434) T ss_pred cCCcccceecccccccccccccceeeEEeeheeeEeehhhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCc Confidence 77777664 667889999999999999999999999999999999999999999999999999999999999999988 Q ss_pred cccccccccccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc------CCcceeec Q lcl|Aclame:pro 155 FGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD------RNSDSLDG 228 (324) Q Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~------~~~~~l~G 228 (324) ++.++...... ....++..++++|++++.++...++.+++|+||++++.+|+++||++|+|+|.+ +.+++|+| T Consensus 272 ~~~g~~~~~~~-~~~~~~~~~~d~l~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~g~~~tl~G 350 (434) T protein:vir:62 272 INDGALAKKAV-EFKTDEKNLYDALVKMKNTPVKEVRKKARWVLNTAALTKIETMKTDDGFPLLRPFNQAEGGIGYTLLG 350 (434) T ss_pred cccceeecccc-cccccccchhhHHHHHHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCCEeeccCCCccCCCCceecc Confidence 77776554333 333455678999999999999999999999999999999999999999999853 34568999 Q ss_pred ceeEeecCCCCCC----ceeEEeecccEEEEEec-ceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEec Q lcl|Aclame:pro 229 LPVVNLKSSNLKR----GELITGDFDKLIYGIPQ-LIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIAD 303 (324) Q Consensus 229 ~pv~~~~~~~~~~----~~~i~gd~s~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~ 303 (324) +||+++..++.+. ..++||||++++++.+. .++++++.+ .+|.+|+|+||++.|+|+++++ T Consensus 351 ~pV~~~~~~~~~~~~~~~~i~~Gdfs~~~i~~~~g~~~i~~~~~--------------~~~~~~~v~~~~~~r~Dgk~i~ 416 (434) T protein:vir:62 351 FPVEEEDAIDIPDSPDTPVFYFGDFSKFYIQDVIGSLEVQKLVE--------------LFSRTNRVGFRIWNLLDAQLIH 416 (434) T ss_pred eeeEEecCccCccCCCceEEEEeeccceEEEEeeceeEEEeehh--------------hhcccCceEEEEEeeecceeec Confidence 9999987765433 23789999999888865 576776543 3478999999999999999886 Q ss_pred -cCceEEE--EeecCCCC Q lcl|Aclame:pro 304 -DKAFAKL--VPADKRTD 318 (324) Q Consensus 304 -~~A~~~l--~~~~~~~~ 318 (324) |.+++.+ +++++.+. T Consensus 417 ~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:62 417 SPFEVPVYKYVLKAPTGA 434 (434) T ss_pred CcccceEEEEEeccCCCC Confidence 8887765 44444444 No 50 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=100.00 E-value=4e-54 Score=313.32 Aligned_cols=298 Identities=16% Similarity=0.133 Sum_probs=236.6 Q ss_pred CchhHH--HHHHHHHHHhhhhhH--HhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCc--eEEE Q lcl|Aclame:pro 1 MEQTQK--LKLNLQHFASNNVKP--QVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTE--KKFT 74 (324) Q Consensus 1 ~~~~~~--~k~~~~~~a~~~~~~--~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~--~~ip 74 (324) .+...+ ...+.+.|....... .........++++||++||++++.+|++.+++.++|+++|+++++++.. +.+| T Consensus 79 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 158 (397) T protein:vir:49 79 LTKSEEEVKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYE 158 (397) T ss_pred cccchhHHHHHHHHHHHHHHhcchhHHHHHhhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeecccCccceEEE Confidence 111111 112223332211111 1122234456677899999999999999999999999999999887544 4556 Q ss_pred EEeC-CcceeeeccCccccc-cccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCc Q lcl|Aclame:pro 75 FWAD-KPGAYWVGEGQKIET-SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGN 152 (324) Q Consensus 75 ~~~~-~~~a~~v~Eg~~~~~-~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~ 152 (324) +... .+.+.|++|++.+|+ +.++|++++++++|+++.++||+|+++||.++++++|.+++++++++++|+++++|+|+ T Consensus 159 ~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~ 238 (397) T protein:vir:49 159 KWTDITGLANIDDEAGKIADVDDPKLSLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIAA 238 (397) T ss_pred eeccCCcceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 6543 467899999999997 67999999999999999999999999999999999999999999999999999999987 Q ss_pred cccccccccccccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc----CCcceeec Q lcl|Aclame:pro 153 NPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDG 228 (324) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~~~~~l~G 228 (324) +.... +..+++++.+++.++...+..+++|+||++++..|+++||++|+|+|.+ +.+++|+| T Consensus 239 ~~~~~--------------~~~~~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G 304 (397) T protein:vir:49 239 LPTKP--------------TLTKWDDIIDLEAKVDPAIKQTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGYSIDG 304 (397) T ss_pred ccccc--------------ccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeeccCcCCCCCceecc Confidence 65432 2346899999999999999999999999999999999999999999864 45679999 Q ss_pred ceeEeecC-----CCCCCceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEe Q lcl|Aclame:pro 229 LPVVNLKS-----SNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIA 302 (324) Q Consensus 229 ~pv~~~~~-----~~~~~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~ 302 (324) +||+++.+ ...++..+++|||+. +.++.+++++++++++.. ++|++|++.||++.|+|+++. T Consensus 305 ~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~r~~~r~d~~~~ 372 (397) T protein:vir:49 305 FAVKEVADRWLANGTGGAMPLYFGDLKQAVTLFDRQHMSLLSTNIGG------------GAFETDTTKVRVIDRFDVVAT 372 (397) T ss_pred eeeEEecccccccccCCceeEEEeeccceEEEEeecceEEEEecccc------------chhhcCceeEEEEeeeCcEEe Confidence 99987543 233455689999997 568889999999887542 469999999999999999999 Q ss_pred ccCceEEEEeecCC-CCCCCCCC Q lcl|Aclame:pro 303 DDKAFAKLVPADKR-TDSVPGEV 324 (324) Q Consensus 303 ~~~A~~~l~~~~~~-~~~~~~~~ 324 (324) +|+||++++.+++. +..+-+-+ T Consensus 373 ~~~a~~~~~~~~~~~~~~~~~~~ 395 (397) T protein:vir:49 373 DTEAFVPASFKAIADQKGNLGST 395 (397) T ss_pred cccceEEEEeecccCCCCCcccc Confidence 99999999987655 22233333 No 51 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=100.00 E-value=4.1e-54 Score=313.26 Aligned_cols=304 Identities=16% Similarity=0.143 Sum_probs=242.4 Q ss_pred CchhH-HHHHHHHHHHh-----hhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCce-EE Q lcl|Aclame:pro 1 MEQTQ-KLKLNLQHFAS-----NNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEK-KF 73 (324) Q Consensus 1 ~~~~~-~~k~~~~~~a~-----~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~-~i 73 (324) .++.. -+..+++.... ....-.+.++...++..+||++||+++.++|++.+++.++|++++++++++++.. .+ T Consensus 85 ~~~~~~a~~~~l~~~~~~~~~~e~~~~~~~~a~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~ 164 (409) T protein:vir:45 85 DEKRAQVFDKWMRHGASELTSEERKALRELRAQGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILTTSDGRTMEW 164 (409) T ss_pred hHHHHHHHHHHHHhhhhhccHHHHHHHHHHhhccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEEE Confidence 11110 11111221100 0111123455666677788999999999999999999999999999999977654 44 Q ss_pred EEEeCC-cceeeeccCccccccccceeeEEeeheeeE-EeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|Aclame:pro 74 TFWADK-PGAYWVGEGQKIETSKATWVNATMRAFKLG-VILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) Q Consensus 74 p~~~~~-~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~-~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g 151 (324) |+..+. ..+.|++|++.+|+++++|.++++.++|++ ++++||+|+++|+.++++++|.++|+++++.++|+++|+|+| T Consensus 165 ~~~~~~~~~~~~v~E~~~~~~~~~~f~~~~l~~~k~~~~~i~is~ell~ds~~~l~~~i~~~la~a~~~~~~~a~l~G~G 244 (409) T protein:vir:45 165 ATADGTSEVGVLLGENEEAGEEDTDFGMGSLGALKMTSKIIRVSNELLQDSAIDMEAYLARRIAERIGRGEARYLIQGTG 244 (409) T ss_pred EeeccCccccccccccccccccccccceeeeeeeeeeeeehhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHhhccCC Confidence 555443 457899999999999999999999999985 578999999999999999999999999999999999999998 Q ss_pred ccc--cccccccc-cccccccccchhhhhHHHHHHHHhhhhcCCCcEE--EEcHHHHHHHHHhhccCCceeecc----CC Q lcl|Aclame:pro 152 NNP--FGKSIAQS-IEKTNKVIKGDFTQDNIIDLEALLEDDELEANAF--ISKTQNRSLLRKIVDPETKERIYD----RN 222 (324) Q Consensus 152 ~~~--~~~~~~~~-~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~--v~~~~~~~~l~~~~d~~g~~~~~~----~~ 222 (324) ++. .+.++... ........++.++++++.+++..|...++.++.| +||++++.+|++++|.+|+|+|++ +. T Consensus 245 ~~~~~~p~Gil~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~ 324 (409) T protein:vir:45 245 AGTPKQPKGLAASVTGTTQTAAANAVKWQEILALKHSIDPAYRRGPKFRLAFNDNTLKLISEMEDGQGRPLWLPDIVGVA 324 (409) T ss_pred CCCccccceeeeccccccccccccccchHHHHHHHHhhhhhhccCCeEEEEECHHHHHHHHHhhcCCCceeeccCcCCCC Confidence 753 34555443 3334444556788999999999999999888865 779999999999999999999864 45 Q ss_pred cceeecceeEeecCCCC---CCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEecc Q lcl|Aclame:pro 223 SDSLDGLPVVNLKSSNL---KRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVAL 299 (324) Q Consensus 223 ~~~l~G~pv~~~~~~~~---~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~ 299 (324) +.+|+|+||+++..++. ....++||||++++++.++++.++.+++. +|++|++.||++.|+|+ T Consensus 325 ~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~i~~~~~~~~~~~~d~--------------~~~~~~~~~~~~~r~d~ 390 (409) T protein:vir:45 325 PASVLNVPYVIDQEIDDIGAGKKFMFCGDFDRFIIRRVRYMILKRLVER--------------YAEYDQTGFLAFHRFDC 390 (409) T ss_pred CceecceeeEEecCcCCccCCccEEEEeehhhhheeeccceEEEEeecc--------------cccCCcEEEEEEEEecc Confidence 57899999999877653 34458899999999999999999877643 47889999999999999 Q ss_pred EEeccCceEEEEeecCCCC Q lcl|Aclame:pro 300 HIADDKAFAKLVPADKRTD 318 (324) Q Consensus 300 ~v~~~~A~~~l~~~~~~~~ 318 (324) ++.+|+||+.|+.+++++. T Consensus 391 ~~~~~~A~~~l~~k~s~~~ 409 (409) T protein:vir:45 391 ILEDTSAIKALVGKGSVGG 409 (409) T ss_pred EeechhheEEEEeccCCCC Confidence 9999999999999888877 No 52 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=100.00 E-value=1.3e-53 Score=310.49 Aligned_cols=304 Identities=13% Similarity=0.085 Sum_probs=244.5 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEE--eC Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFW--AD 78 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~--~~ 78 (324) .........+.+.|........... ...+++++++.+||+++.++|++.+++.++|++++++++++++..++|+. ++ T Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 174 (415) T protein:vir:46 96 IQNTKVTSQEVRDFTEYLETRNDIQ-GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSE 174 (415) T ss_pred hhhhhhhHHHHHHHHHHHhhhhhhh-hccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecC Confidence 1111222333444433222222222 22344556888999999999999999999999999999999888888875 45 Q ss_pred CcceeeeccCccccc-cccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccc Q lcl|Aclame:pro 79 KPGAYWVGEGQKIET-SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGK 157 (324) Q Consensus 79 ~~~a~~v~Eg~~~~~-~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~ 157 (324) ...+.|++|++.+|+ +.++|+++++.++|++++++||+|+++|+.++++++|.++|++++++++|+++++|+|++.... T Consensus 175 ~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~ 254 (415) T protein:vir:46 175 VAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGS 254 (415) T ss_pred CcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccc Confidence 667899999999997 5689999999999999999999999999999999999999999999999999999998876655 Q ss_pred cccccccc-ccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc----CCcceeecceeE Q lcl|Aclame:pro 158 SIAQSIEK-TNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVV 232 (324) Q Consensus 158 ~~~~~~~~-~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~~~~~l~G~pv~ 232 (324) +....... .....++..++++|.+++.++...++.+++|+||+++|.+|++++|++|+|+|.+ +.+++|+|+||+ T Consensus 255 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~pV~ 334 (415) T protein:vir:46 255 TSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIE 334 (415) T ss_pred cccccccccceeccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCccccceeeE Confidence 54443332 3333456678999999999999999999999999999999999999999999863 456789999999 Q ss_pred eecCCCCC---CceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceE Q lcl|Aclame:pro 233 NLKSSNLK---RGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFA 308 (324) Q Consensus 233 ~~~~~~~~---~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~ 308 (324) ++++++.+ +..++||||++ +.++.+++++++.++ |.++.+.+|+++|+|+++.+|+||+ T Consensus 335 ~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~-----------------~~~~~~~~~~~~r~d~~v~~~~a~~ 397 (415) T protein:vir:46 335 ILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----------------YMHFGECLMIAVRQDCRILDYKSAI 397 (415) T ss_pred EeccccccCCCccEEEEEehhccEEEEeecceEEEeec-----------------cccCceEEEEEEEeccEEeccccEE Confidence 98776643 33589999997 457888999988764 5667788999999999999999999 Q ss_pred EEEeecCCCCCCCCCC Q lcl|Aclame:pro 309 KLVPADKRTDSVPGEV 324 (324) Q Consensus 309 ~l~~~~~~~~~~~~~~ 324 (324) +++..++ ...||.. T Consensus 398 ~~~~~~~--~~~~~~~ 411 (415) T protein:vir:46 398 VIEYDDS--ERGEGDL 411 (415) T ss_pred EEEeecc--CCCCCCc Confidence 9998554 4557777 No 53 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=100.00 E-value=1.3e-53 Score=310.49 Aligned_cols=304 Identities=13% Similarity=0.085 Sum_probs=244.5 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEE--eC Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFW--AD 78 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~--~~ 78 (324) .........+.+.|........... ...+++++++.+||+++.++|++.+++.++|++++++++++++..++|+. ++ T Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 174 (415) T protein:vir:47 96 IQNTKVTSQEVRDFTEYLETRNDIQ-GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSE 174 (415) T ss_pred hhhhhhhHHHHHHHHHHHhhhhhhh-hccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecC Confidence 1111222333444433222222222 22344556888999999999999999999999999999999888888875 45 Q ss_pred CcceeeeccCccccc-cccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccc Q lcl|Aclame:pro 79 KPGAYWVGEGQKIET-SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGK 157 (324) Q Consensus 79 ~~~a~~v~Eg~~~~~-~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~ 157 (324) ...+.|++|++.+|+ +.++|+++++.++|++++++||+|+++|+.++++++|.++|++++++++|+++++|+|++.... T Consensus 175 ~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~ 254 (415) T protein:vir:47 175 VAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGS 254 (415) T ss_pred CcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccc Confidence 667899999999997 5689999999999999999999999999999999999999999999999999999998876655 Q ss_pred cccccccc-ccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc----CCcceeecceeE Q lcl|Aclame:pro 158 SIAQSIEK-TNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVV 232 (324) Q Consensus 158 ~~~~~~~~-~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~~~~~l~G~pv~ 232 (324) +....... .....++..++++|.+++.++...++.+++|+||+++|.+|++++|++|+|+|.+ +.+++|+|+||+ T Consensus 255 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~pV~ 334 (415) T protein:vir:47 255 TSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIE 334 (415) T ss_pred cccccccccceeccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCccccceeeE Confidence 54443332 3333456678999999999999999999999999999999999999999999863 456789999999 Q ss_pred eecCCCCC---CceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceE Q lcl|Aclame:pro 233 NLKSSNLK---RGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFA 308 (324) Q Consensus 233 ~~~~~~~~---~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~ 308 (324) ++++++.+ +..++||||++ +.++.+++++++.++ |.++.+.+|+++|+|+++.+|+||+ T Consensus 335 ~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~-----------------~~~~~~~~~~~~r~d~~v~~~~a~~ 397 (415) T protein:vir:47 335 ILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----------------YMHFGECLMIAVRQDCRILDYKSAI 397 (415) T ss_pred EeccccccCCCccEEEEEehhccEEEEeecceEEEeec-----------------cccCceEEEEEEEeccEEeccccEE Confidence 98776643 33589999997 457888999988764 5667788999999999999999999 Q ss_pred EEEeecCCCCCCCCCC Q lcl|Aclame:pro 309 KLVPADKRTDSVPGEV 324 (324) Q Consensus 309 ~l~~~~~~~~~~~~~~ 324 (324) +++..++ ...||.. T Consensus 398 ~~~~~~~--~~~~~~~ 411 (415) T protein:vir:47 398 VIEYDDS--ERGEGDL 411 (415) T ss_pred EEEeecc--CCCCCCc Confidence 9998554 4557777 No 54 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=100.00 E-value=1e-53 Score=311.09 Aligned_cols=287 Identities=17% Similarity=0.121 Sum_probs=236.9 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEE--EEEeC Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKF--TFWAD 78 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~i--p~~~~ 78 (324) -...+...+..+.|+.... ..+.++.+..++++||.+||++++.+|++.+++.++|++++++++++++..++ |+..+ T Consensus 66 ~~~~~~~~~~~~~~~~~l~-~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~~~ 144 (371) T protein:vir:81 66 KPTVQVKENEVEAFVNHIR-TRFRNAMSEGSNQDGGYTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKKRSQ 144 (371) T ss_pred ccchhhHHHHHHHHHHHHH-HHHHHhhccCCCccCceeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecC Confidence 1122222334444443222 22345556667778899999999999999999999999999999988766554 55556 Q ss_pred CcceeeeccCccccc-cccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccc Q lcl|Aclame:pro 79 KPGAYWVGEGQKIET-SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGK 157 (324) Q Consensus 79 ~~~a~~v~Eg~~~~~-~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~ 157 (324) .+.+.|++||+.+|+ +.++|++++++++|+++.++||+|+++|+.++++++|.+.|++++++++|.++++|+|++.+. T Consensus 145 ~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~a~~~~~~~~i~~g~g~~~~~- 223 (371) T protein:vir:81 145 QTGFVEVAEGAAIGEKATPQFTLLQYQVKKYAGFFRVTNELLNDSTEAIVNTLVRWIGDESRVTRNGLIINVLNTKAKT- 223 (371) T ss_pred CcceeeeccccccccccccceeeEEeeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc- Confidence 778999999999986 679999999999999999999999999999999999999999999999999999999875432 Q ss_pred ccccccccccccccchhhhhHHHHHH-HHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc----CCcceeecceeE Q lcl|Aclame:pro 158 SIAQSIEKTNKVIKGDFTQDNIIDLE-ALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVV 232 (324) Q Consensus 158 ~~~~~~~~~~~~~~~~~~~~~i~~~~-~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~~~~~l~G~pv~ 232 (324) +..+++++..++ ..+...+..+++|+|||+++..|++++|++|+|+|.+ +.+++|+|+||+ T Consensus 224 --------------~~~~~~~i~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~l~G~pV~ 289 (371) T protein:vir:81 224 --------------AIADLDGLKQIINVQLDPVFRSTSSVIVNQDAFNWLDTLKDQNGQYLLQPSISSPTGRQLLGLPVV 289 (371) T ss_pred --------------ccccHHHHHHHHHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCCeeeecccCCCCCceecceeEE Confidence 234567777766 4688888899999999999999999999999999863 456799999999 Q ss_pred eecCCC----------CCCceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEE Q lcl|Aclame:pro 233 NLKSSN----------LKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHI 301 (324) Q Consensus 233 ~~~~~~----------~~~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v 301 (324) ++.+++ .+...++||||+. +.++.+++++++++++.. ++|++|++.||++.|+||.+ T Consensus 290 ~~~~~~~~~~~~~~~~~~~~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~------------~~f~~~~v~~~~~~r~d~~~ 357 (371) T protein:vir:81 290 IVSNKVLANRVDGGTGAQFAPIIVGDLKEAVVMFDRQRTEIMSSNVAM------------DAFETDATLWRAIERMDVKM 357 (371) T ss_pred EecccccCccccccccCCcceEEEEehhceEEEEeecceEEEEecccc------------chhhcCceEEEEEEeeccEE Confidence 987654 2455789999997 567889999999887542 46999999999999999999 Q ss_pred eccCceEEEEeecC Q lcl|Aclame:pro 302 ADDKAFAKLVPADK 315 (324) Q Consensus 302 ~~~~A~~~l~~~~~ 315 (324) .+|+||++++.+++ T Consensus 358 ~~~~a~~~~~~~~A 371 (371) T protein:vir:81 358 RDDEAFVFGEVQLA 371 (371) T ss_pred ecccceEEEEEecC Confidence 99999999999988 No 55 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=100.00 E-value=6.2e-54 Score=312.27 Aligned_cols=296 Identities=15% Similarity=0.094 Sum_probs=237.0 Q ss_pred CchhHHHHHHHHHHHhhhh------hHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCce--E Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNV------KPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEK--K 72 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~------~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~--~ 72 (324) .++...+...++....... ...+.+..+..++++||.+||+++..+|++.+++.++|++++++++++++.. . T Consensus 74 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~ 153 (392) T protein:vir:10 74 MEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRV 153 (392) T ss_pred HHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEE Confidence 2222222222221111000 0112233445566678889999999999999999999999999999876554 4 Q ss_pred EEEEeCCcceeeeccCcccccc-ccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|Aclame:pro 73 FTFWADKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) Q Consensus 73 ip~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g 151 (324) +|+.++.+.+.|++|++.+|++ .++|+++++.++|+++.++||+|+++||.++++++|.+.|++++++++|.++++|+| T Consensus 154 ~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g 233 (392) T protein:vir:10 154 LEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIE 233 (392) T ss_pred EEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 6666777889999999999976 589999999999999999999999999999999999999999999999999999988 Q ss_pred ccccccccccccccccccccchhhhhHHHHHH-HHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc----CCccee Q lcl|Aclame:pro 152 NNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLE-ALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSL 226 (324) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~~~~~l 226 (324) +.... +..+++++.+++ ..+...++.+++|+|||+++..|+++||++|+|+|.+ +.+++| T Consensus 234 ~~~~~---------------~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tl 298 (392) T protein:vir:10 234 KLTKQ---------------AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLF 298 (392) T ss_pred ccccc---------------CccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccc Confidence 65332 335678898877 5788889999999999999999999999999999864 456789 Q ss_pred ecceeEeecC--------CCCCCceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEe Q lcl|Aclame:pro 227 DGLPVVNLKS--------SNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHV 297 (324) Q Consensus 227 ~G~pv~~~~~--------~~~~~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~ 297 (324) +|+|+++..+ ...+...+++|||+. +.++.+++++++++++.. .+|++|++.||++.|+ T Consensus 299 lG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~------------~~f~~~~~~~r~~~r~ 366 (392) T protein:vir:10 299 AGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG------------KAFTRNTLDLRAIQRD 366 (392) T ss_pred cCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEecccc------------chhhcCceEEEEEEee Confidence 9987765321 223445589999997 568889999999887532 3599999999999999 Q ss_pred ccEEeccCceEEEEeecCCCCCCCCC Q lcl|Aclame:pro 298 ALHIADDKAFAKLVPADKRTDSVPGE 323 (324) Q Consensus 298 d~~v~~~~A~~~l~~~~~~~~~~~~~ 323 (324) ||++.||+||++++.+++.+.++|.- T Consensus 367 d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 367 DVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred ccEEecccceEEEEecccccccCCCC Confidence 99999999999999999888886655 No 56 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=100.00 E-value=6.2e-54 Score=312.27 Aligned_cols=296 Identities=15% Similarity=0.094 Sum_probs=237.0 Q ss_pred CchhHHHHHHHHHHHhhhh------hHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCce--E Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNV------KPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEK--K 72 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~------~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~--~ 72 (324) .++...+...++....... ...+.+..+..++++||.+||+++..+|++.+++.++|++++++++++++.. . T Consensus 74 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~ 153 (392) T protein:vir:10 74 MEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRV 153 (392) T ss_pred HHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEE Confidence 2222222222221111000 0112233445566678889999999999999999999999999999876554 4 Q ss_pred EEEEeCCcceeeeccCcccccc-ccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|Aclame:pro 73 FTFWADKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) Q Consensus 73 ip~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g 151 (324) +|+.++.+.+.|++|++.+|++ .++|+++++.++|+++.++||+|+++||.++++++|.+.|++++++++|.++++|+| T Consensus 154 ~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g 233 (392) T protein:vir:10 154 LEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIE 233 (392) T ss_pred EEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 6666777889999999999976 589999999999999999999999999999999999999999999999999999988 Q ss_pred ccccccccccccccccccccchhhhhHHHHHH-HHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc----CCccee Q lcl|Aclame:pro 152 NNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLE-ALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSL 226 (324) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~~~~~l 226 (324) +.... +..+++++.+++ ..+...++.+++|+|||+++..|+++||++|+|+|.+ +.+++| T Consensus 234 ~~~~~---------------~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tl 298 (392) T protein:vir:10 234 KLTKQ---------------AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLF 298 (392) T ss_pred ccccc---------------CccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccc Confidence 65332 335678898877 5788889999999999999999999999999999864 456789 Q ss_pred ecceeEeecC--------CCCCCceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEe Q lcl|Aclame:pro 227 DGLPVVNLKS--------SNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHV 297 (324) Q Consensus 227 ~G~pv~~~~~--------~~~~~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~ 297 (324) +|+|+++..+ ...+...+++|||+. +.++.+++++++++++.. .+|++|++.||++.|+ T Consensus 299 lG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~------------~~f~~~~~~~r~~~r~ 366 (392) T protein:vir:10 299 AGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG------------KAFTRNTLDLRAIQRD 366 (392) T ss_pred cCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEecccc------------chhhcCceEEEEEEee Confidence 9987765321 223445589999997 568889999999887532 3599999999999999 Q ss_pred ccEEeccCceEEEEeecCCCCCCCCC Q lcl|Aclame:pro 298 ALHIADDKAFAKLVPADKRTDSVPGE 323 (324) Q Consensus 298 d~~v~~~~A~~~l~~~~~~~~~~~~~ 323 (324) ||++.||+||++++.+++.+.++|.- T Consensus 367 d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 367 DVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred ccEEecccceEEEEecccccccCCCC Confidence 99999999999999999888886655 No 57 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=100.00 E-value=6.2e-54 Score=312.27 Aligned_cols=296 Identities=15% Similarity=0.094 Sum_probs=237.0 Q ss_pred CchhHHHHHHHHHHHhhhh------hHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCce--E Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNV------KPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEK--K 72 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~------~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~--~ 72 (324) .++...+...++....... ...+.+..+..++++||.+||+++..+|++.+++.++|++++++++++++.. . T Consensus 74 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~ 153 (392) T protein:vir:10 74 MEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRV 153 (392) T ss_pred HHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEE Confidence 2222222222221111000 0112233445566678889999999999999999999999999999876554 4 Q ss_pred EEEEeCCcceeeeccCcccccc-ccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|Aclame:pro 73 FTFWADKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) Q Consensus 73 ip~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g 151 (324) +|+.++.+.+.|++|++.+|++ .++|+++++.++|+++.++||+|+++||.++++++|.+.|++++++++|.++++|+| T Consensus 154 ~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g 233 (392) T protein:vir:10 154 LEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIE 233 (392) T ss_pred EEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 6666777889999999999976 589999999999999999999999999999999999999999999999999999988 Q ss_pred ccccccccccccccccccccchhhhhHHHHHH-HHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc----CCccee Q lcl|Aclame:pro 152 NNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLE-ALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSL 226 (324) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~~~~~l 226 (324) +.... +..+++++.+++ ..+...++.+++|+|||+++..|+++||++|+|+|.+ +.+++| T Consensus 234 ~~~~~---------------~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tl 298 (392) T protein:vir:10 234 KLTKQ---------------AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLF 298 (392) T ss_pred ccccc---------------CccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccc Confidence 65332 335678898877 5788889999999999999999999999999999864 456789 Q ss_pred ecceeEeecC--------CCCCCceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEe Q lcl|Aclame:pro 227 DGLPVVNLKS--------SNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHV 297 (324) Q Consensus 227 ~G~pv~~~~~--------~~~~~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~ 297 (324) +|+|+++..+ ...+...+++|||+. +.++.+++++++++++.. .+|++|++.||++.|+ T Consensus 299 lG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~------------~~f~~~~~~~r~~~r~ 366 (392) T protein:vir:10 299 AGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG------------KAFTRNTLDLRAIQRD 366 (392) T ss_pred cCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEecccc------------chhhcCceEEEEEEee Confidence 9987765321 223445589999997 568889999999887532 3599999999999999 Q ss_pred ccEEeccCceEEEEeecCCCCCCCCC Q lcl|Aclame:pro 298 ALHIADDKAFAKLVPADKRTDSVPGE 323 (324) Q Consensus 298 d~~v~~~~A~~~l~~~~~~~~~~~~~ 323 (324) ||++.||+||++++.+++.+.++|.- T Consensus 367 d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 367 DVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred ccEEecccceEEEEecccccccCCCC Confidence 99999999999999999888886655 No 58 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=100.00 E-value=6.2e-54 Score=312.27 Aligned_cols=296 Identities=15% Similarity=0.094 Sum_probs=237.0 Q ss_pred CchhHHHHHHHHHHHhhhh------hHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCce--E Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNV------KPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEK--K 72 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~------~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~--~ 72 (324) .++...+...++....... ...+.+..+..++++||.+||+++..+|++.+++.++|++++++++++++.. . T Consensus 74 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~ 153 (392) T protein:vir:10 74 MEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRV 153 (392) T ss_pred HHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEE Confidence 2222222222221111000 0112233445566678889999999999999999999999999999876554 4 Q ss_pred EEEEeCCcceeeeccCcccccc-ccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|Aclame:pro 73 FTFWADKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) Q Consensus 73 ip~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g 151 (324) +|+.++.+.+.|++|++.+|++ .++|+++++.++|+++.++||+|+++||.++++++|.+.|++++++++|.++++|+| T Consensus 154 ~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g 233 (392) T protein:vir:10 154 LEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKKSKVTRNVLILGVIE 233 (392) T ss_pred EEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 6666777889999999999976 589999999999999999999999999999999999999999999999999999988 Q ss_pred ccccccccccccccccccccchhhhhHHHHHH-HHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc----CCccee Q lcl|Aclame:pro 152 NNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLE-ALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSL 226 (324) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~~~~~l 226 (324) +.... +..+++++.+++ ..+...++.+++|+|||+++..|+++||++|+|+|.+ +.+++| T Consensus 234 ~~~~~---------------~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tl 298 (392) T protein:vir:10 234 KLTKQ---------------AIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLF 298 (392) T ss_pred ccccc---------------CccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccc Confidence 65332 335678898877 5788889999999999999999999999999999864 456789 Q ss_pred ecceeEeecC--------CCCCCceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEe Q lcl|Aclame:pro 227 DGLPVVNLKS--------SNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHV 297 (324) Q Consensus 227 ~G~pv~~~~~--------~~~~~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~ 297 (324) +|+|+++..+ ...+...+++|||+. +.++.+++++++++++.. .+|++|++.||++.|+ T Consensus 299 lG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~------------~~f~~~~~~~r~~~r~ 366 (392) T protein:vir:10 299 AGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGG------------KAFTRNTLDLRAIQRD 366 (392) T ss_pred cCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEecccc------------chhhcCceEEEEEEee Confidence 9987765321 223445589999997 568889999999887532 3599999999999999 Q ss_pred ccEEeccCceEEEEeecCCCCCCCCC Q lcl|Aclame:pro 298 ALHIADDKAFAKLVPADKRTDSVPGE 323 (324) Q Consensus 298 d~~v~~~~A~~~l~~~~~~~~~~~~~ 323 (324) ||++.||+||++++.+++.+.++|.- T Consensus 367 d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 367 DVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred ccEEecccceEEEEecccccccCCCC Confidence 99999999999999999888886655 No 59 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=100.00 E-value=1.4e-53 Score=310.36 Aligned_cols=303 Identities=15% Similarity=0.109 Sum_probs=237.5 Q ss_pred CchhHHHHHHHH--HHHhhhhhHHhh-----ccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEE Q lcl|Aclame:pro 1 MEQTQKLKLNLQ--HFASNNVKPQVF-----NPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKF 73 (324) Q Consensus 1 ~~~~~~~k~~~~--~~a~~~~~~~~~-----~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~i 73 (324) ++...+.+.... .+........+. ......+++++++++|++++++|++.+++.++|++++++++++++.+++ T Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 164 (413) T protein:vir:81 85 AGDQIKQQAGGAQLNYSVGEYVAPRVKAASDPASTATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNLTMTNTTIKY 164 (413) T ss_pred hhhHHHHHHHHHHhhhhhhhhhhhHHHhhhhhhhhcccccccccccchhhHHHHHHHHhhhhhHHhhcceeeccCCceeE Confidence 000000000000 001111111111 1223345567889999999999999999999999999999999999999 Q ss_pred EEEeCC----cceeeeccCccccccc-cceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 74 TFWADK----PGAYWVGEGQKIETSK-ATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL 148 (324) Q Consensus 74 p~~~~~----~~a~~v~Eg~~~~~~~-~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~ 148 (324) |+.... ..+.|++||+.+|+++ ++|+++++.++|++++++||+|+++|+. +++++|.+.|+++++.++|+++|+ T Consensus 165 ~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~-~l~~~i~~~la~~~~~~~d~~~l~ 243 (413) T protein:vir:81 165 LMEKANRVVEGGFKTVAEGGKKPYMRFADFDIVTESLSKIAGLTKITDEMIEDYD-FLVSYINARLLEELAIEEERQLLL 243 (413) T ss_pred EEeccccccccccceecCcccccccCcccceeeEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhc Confidence 997643 4578999999999987 6899999999999999999999999985 699999999999999999999999 Q ss_pred ccCcccccccccccccccccccc-chhhhhHHHHHHHHhhhh-cCCCcEEEEcHHHHHHHHHhhccCCceeeccC----- Q lcl|Aclame:pro 149 NQGNNPFGKSIAQSIEKTNKVIK-GDFTQDNIIDLEALLEDD-ELEANAFISKTQNRSLLRKIVDPETKERIYDR----- 221 (324) Q Consensus 149 G~g~~~~~~~~~~~~~~~~~~~~-~~~~~~~i~~~~~~l~~~-~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~----- 221 (324) |+|++.++.++.+.......... +...++++.+++..+... .+..++|+||++++..|+++||++|+|+|.+. T Consensus 244 G~G~~~~~~Gi~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~ 323 (413) T protein:vir:81 244 GDGTGNNLTGLLKRDGIQTLAVSNKDELADSIYKAMTNISLATPFQADALVINPLDYQELRLAKDANGQYYGGGVFQGQY 323 (413) T ss_pred cCCCCCcccccccccccccccccccchhHHHHHHHHHHhhhhccCCCcEEEEcHHHHHHHHHhhccCCceeccccccccc Confidence 99999888787766555444333 344577777888776544 45566799999999999999999999998532 Q ss_pred ------CcceeecceeEeecCCCCCCceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEE Q lcl|Aclame:pro 222 ------NSDSLDGLPVVNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRAT 294 (324) Q Consensus 222 ------~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~ 294 (324) ..++|+|+||+++.+ +++++++||||++ +.++.+++++++++++.. .+|++|++.||++ T Consensus 324 ~~~~~~~~~~l~G~pv~~s~~--~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~------------~~~~~~~~~~r~~ 389 (413) T protein:vir:81 324 GSGGIMLDPAPWGLRTVQSQV--VPVGKPVVGAFRSAASVLRKGGVRIDSTNTNV------------DDFENNLITVRAE 389 (413) T ss_pred cccccccCceecceeeEEcCC--CCcccEEEEecccEEEEEEecceEEEEecccc------------chhhcCcEEEEEE Confidence 234799999998765 4578999999996 567888999999887543 3599999999999 Q ss_pred EEeccEEeccCceEEEEeecCCCC Q lcl|Aclame:pro 295 MHVALHIADDKAFAKLVPADKRTD 318 (324) Q Consensus 295 ~r~d~~v~~~~A~~~l~~~~~~~~ 318 (324) +|+|+.+.+|+||++|+.+++.++ T Consensus 390 ~r~d~~~~~~~a~~~l~~~~~~~p 413 (413) T protein:vir:81 390 ERVGLMVTFPEAIVQLDVAEVVTP 413 (413) T ss_pred EeeccEEecccceEEEEecCCCCC Confidence 999999999999999998776655 No 60 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=100.00 E-value=2.7e-53 Score=308.76 Aligned_cols=298 Identities=13% Similarity=0.080 Sum_probs=236.7 Q ss_pred CchhHHHHHHHHHHHhhh------hhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEE Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNN------VKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFT 74 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~------~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip 74 (324) ..+.+......+.|.... ....+.++...++..+||++||++++++|++.+++.++|+++++++++++....+| T Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~ 163 (408) T protein:vir:10 84 KSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRV 163 (408) T ss_pred cchhhhHHHHHHHHHHHhhcchhhhhhhhhhhhhcccccCCceeccHhHHHHHHHHHHhhchhhhhcceeeccCCcceEE Confidence 222222222333332211 11123455566777789999999999999999999999999999999987766665 Q ss_pred EE--e-CCcceeeeccCcccccc-ccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 75 FW--A-DKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ 150 (324) Q Consensus 75 ~~--~-~~~~a~~v~Eg~~~~~~-~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~ 150 (324) +. . ..+.+.|++|++++|++ .++|+++++.++|++++++||+|+++|+.+++.++|.++|+++++.++|++|++|+ T Consensus 164 ~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~ 243 (408) T protein:vir:10 164 YEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWIAKKVVVTRNQAIIEVM 243 (408) T ss_pred EeeccccccceeeecCccccccccCcceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 54 3 34678999999999975 58999999999999999999999999999999999999999999999999999999 Q ss_pred CccccccccccccccccccccchhhhhHHHHHH-HHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc----CCcce Q lcl|Aclame:pro 151 GNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLE-ALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDS 225 (324) Q Consensus 151 g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~~~~~ 225 (324) |++.+. .+..+++++.+++ ..+...++.++.|+||++++..|+++||++|+|+|++ +.+++ T Consensus 244 g~~~~~--------------~~~~~~~~l~~~~~~~~~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~ 309 (408) T protein:vir:10 244 KAAPKK--------------PTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYL 309 (408) T ss_pred cccccc--------------cccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceEeccCcCCCCCce Confidence 875432 1234678888876 5788889889999999999999999999999999964 34579 Q ss_pred eecceeEeecCCCCC-----CceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEecc Q lcl|Aclame:pro 226 LDGLPVVNLKSSNLK-----RGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVAL 299 (324) Q Consensus 226 l~G~pv~~~~~~~~~-----~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~ 299 (324) |+|+||+++.+..++ ...+++|||+. +.++.+++++++++++.. ..|++|++.||++.|+|+ T Consensus 310 l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~------------~~f~~~~~~~r~~~r~d~ 377 (408) T protein:vir:10 310 IKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA------------GAFETDTTKIRVIDRFDV 377 (408) T ss_pred ecceeeEEecccccCccCCCceEEEEEehhccEEEEEecceEEEEccccc------------chhhcCceEEEEEEeecc Confidence 999999987644333 33489999997 568899999999887542 459999999999999999 Q ss_pred EEeccCceEEEEeecCCC------CCCCCCC Q lcl|Aclame:pro 300 HIADDKAFAKLVPADKRT------DSVPGEV 324 (324) Q Consensus 300 ~v~~~~A~~~l~~~~~~~------~~~~~~~ 324 (324) .+.+|+||++++.++..+ +.+-+.| T Consensus 378 ~v~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 408 (408) T protein:vir:10 378 KATDSEALVAGSFSAIADQVGNFKTTTSTAV 408 (408) T ss_pred EEeccccEEEEEeeccccCCCCCCCCCcccC Confidence 999999999999877542 1122222 No 61 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=100.00 E-value=2.5e-53 Score=308.92 Aligned_cols=297 Identities=14% Similarity=0.105 Sum_probs=237.5 Q ss_pred Cchh--HHHHHHHHHHHh---hhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEE Q lcl|Aclame:pro 1 MEQT--QKLKLNLQHFAS---NNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTF 75 (324) Q Consensus 1 ~~~~--~~~k~~~~~~a~---~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~ 75 (324) ++.. ....+..+.|.. ..... ....-...++++||.+||++++++|++.+++.++|++++++++++++..++|+ T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~ 157 (397) T protein:vir:48 79 LTKSEEEVKAGFVKDFKNLVRGRYQN-LLDSKTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVY 157 (397) T ss_pred ccchhhHHHHHHHHHHHHHHhhhhhH-HHHHhhccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeeccCCcceEEE Confidence 1111 111222222222 11111 11222334566788999999999999999999999999999999887777665 Q ss_pred Ee---CCcceeeeccCcccccc-ccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|Aclame:pro 76 WA---DKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) Q Consensus 76 ~~---~~~~a~~v~Eg~~~~~~-~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g 151 (324) .. ..+.+.|++|++.++++ .++|++++++++|++++++||+|+++|+.++++++|.++++++++.++|+++++|+| T Consensus 158 ~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~v~~~l~~~~~~~~d~~il~G~g 237 (397) T protein:vir:48 158 EKWADITGLAKLDDEAGSIGTNDDPKLYPIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILEAIA 237 (397) T ss_pred EeecCCCcceeeeccccccccccccceeeEEeeheeeeeehhhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 42 34568999999999986 589999999999999999999999999999999999999999999999999999998 Q ss_pred ccccccccccccccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc----CCcceee Q lcl|Aclame:pro 152 NNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLD 227 (324) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~~~~~l~ 227 (324) ++... .+..++++|.+++.+|...++.+++|+||++++..|+++||++|+|++.+ +.+++|+ T Consensus 238 ~~~~~--------------~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~ 303 (397) T protein:vir:48 238 TLPTK--------------PTLTKWDDIIDLQAKVDPAIKQTSFFLTNTSGFTALKKVKNAFGDYLMERDVKSPTGYSID 303 (397) T ss_pred ccccc--------------cccccHHHHHHHHHHhhhhhcCCCEEEECHHHHHHHHHhhcCCCceeeccCcCCCCCceec Confidence 76432 23457899999999999999999999999999999999999999999864 4567999 Q ss_pred cceeEeecC-----CCCCCceeEEeecccE-EEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEE Q lcl|Aclame:pro 228 GLPVVNLKS-----SNLKRGELITGDFDKL-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHI 301 (324) Q Consensus 228 G~pv~~~~~-----~~~~~~~~i~gd~s~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v 301 (324) |+||+++.+ ...++..+++|||+.+ .++.+++++++++++.. ++|++|++.||+++|+|+++ T Consensus 304 G~PV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~r~~~r~d~~~ 371 (397) T protein:vir:48 304 GFAVKEVADRWLANASSGAMPLYFGDLKQAVTLFDRQQMSLLSTNIGG------------GAFETDTTKIRVIDRFDVVA 371 (397) T ss_pred cceeEEecccccCCcCCCceEEEEEeccceEEEEeecceEEEEeccch------------hhhhcCceeEEEEeeeccEE Confidence 999987643 2345667899999975 58889999999887532 46999999999999999999 Q ss_pred eccCceEEEEeecCC-CCCCCCCC Q lcl|Aclame:pro 302 ADDKAFAKLVPADKR-TDSVPGEV 324 (324) Q Consensus 302 ~~~~A~~~l~~~~~~-~~~~~~~~ 324 (324) .+|+||++++.+++. +..+-+-+ T Consensus 372 ~~~~a~~~~~~~~~~~~~~~~~~~ 395 (397) T protein:vir:48 372 TDTESFVPASFKAIADQKGNLGST 395 (397) T ss_pred ecccceEEEEecccccCCCCcccc Confidence 999999999987664 33333333 No 62 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=100.00 E-value=1.5e-53 Score=310.14 Aligned_cols=276 Identities=16% Similarity=0.123 Sum_probs=235.8 Q ss_pred hhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCc--eEEEEEe-CCcceeeeccCccccc-cccce Q lcl|Aclame:pro 23 VFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTE--KKFTFWA-DKPGAYWVGEGQKIET-SKATW 98 (324) Q Consensus 23 ~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~--~~ip~~~-~~~~a~~v~Eg~~~~~-~~~~~ 98 (324) ..++...+++++||.+||+++.++|++.+++.++|+++++++++++.. +.+|+.. ..+.+.|++|++++|+ ++++| T Consensus 1 ~l~~~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~ 80 (293) T protein:vir:48 1 MLDSKTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADIDDPKL 80 (293) T ss_pred CceeecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCcccccccccce Confidence 667777778888999999999999999999999999999999887654 5566664 4567999999999997 57999 Q ss_pred eeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhhhH Q lcl|Aclame:pro 99 VNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDN 178 (324) Q Consensus 99 ~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (324) ++++++++|+++.++||+|+++|+.++++++|.++++++++.++|+++++|.++... ..+..++++ T Consensus 81 ~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~--------------~~~~~~~d~ 146 (293) T protein:vir:48 81 SLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLPT--------------KPTLTKWDD 146 (293) T ss_pred eEEEEeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhccccccc--------------cccccCHHH Confidence 999999999999999999999999999999999999999999999999998875432 234567999 Q ss_pred HHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc----CCcceeecceeEeecCCCC-----CCceeEEeec Q lcl|Aclame:pro 179 IIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVVNLKSSNL-----KRGELITGDF 249 (324) Q Consensus 179 i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~~~~~l~G~pv~~~~~~~~-----~~~~~i~gd~ 249 (324) |.++++++...++.+++|+||++++..|+++||.+|+|+|++ +.+++|+|+||+++.+... ++..++|||| T Consensus 147 i~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~l~G~Pv~~~~~~~~~~~~~~~~~~~~gd~ 226 (293) T protein:vir:48 147 IIDLEAKVDPAIKQTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGYSIAGFAVKEISDRWLPNASSGVMPLYFGDL 226 (293) T ss_pred HHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceEeecCcCCCCCceecceeeEEecccccCCccCCceEEEEEec Confidence 999999999999999999999999999999999999999964 4567999999987654333 3446899999 Q ss_pred cc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCC-CCCCCCCC Q lcl|Aclame:pro 250 DK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKR-TDSVPGEV 324 (324) Q Consensus 250 s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~-~~~~~~~~ 324 (324) ++ +.++++++++++++++.. ++|++|++.||++.|+|+++.+|+||++++.+++. +..+-|-. T Consensus 227 ~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~~~~~~~~~ 291 (293) T protein:vir:48 227 KQAVTLFDRQQMSLLSTNIGG------------GAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQKGNIGST 291 (293) T ss_pred cceEEEEEecceEEEEecccc------------hhhhcCeEEEEEEEeeCcEEecccceEEEEeeccccCCcccccc Confidence 97 468889999999887542 46999999999999999999999999999965544 44444443 No 63 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=100.00 E-value=7.3e-53 Score=306.40 Aligned_cols=297 Identities=14% Similarity=0.111 Sum_probs=237.3 Q ss_pred CchhHHHHHHHHHHHhhh---hhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEE- Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNN---VKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFW- 76 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~---~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~- 76 (324) ++...+....+..+.... ....+.++...+++++||.+||++++.+|++.+++.++|++++++++++++...+|+. T Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 166 (404) T protein:vir:39 87 YELKDKFVKEFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEK 166 (404) T ss_pred hhhHHHHHHHHHHHHhcchhhhhhhhhhhhhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcceeeccCCcceEEEEe Confidence 222222222222222111 1123345566677788899999999999999999999999999999988766666553 Q ss_pred -e-CCcceeeeccCccccc-cccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCcc Q lcl|Aclame:pro 77 -A-DKPGAYWVGEGQKIET-SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNN 153 (324) Q Consensus 77 -~-~~~~a~~v~Eg~~~~~-~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~ 153 (324) . ..+.+.|++|++.+|+ +.++|++++++++|++++++||+|+++|+.++++++|.++|+++++.++|+++|+|+|++ T Consensus 167 ~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~g~g~~ 246 (404) T protein:vir:39 167 WTDVTPLTVMDAEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTV 246 (404) T ss_pred ecCCccceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhccccc Confidence 3 3467899999999997 579999999999999999999999999999999999999999999999999999999876 Q ss_pred ccccccccccccccccccchhhhhHHHHHHH-HhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc----CCcceeec Q lcl|Aclame:pro 154 PFGKSIAQSIEKTNKVIKGDFTQDNIIDLEA-LLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDG 228 (324) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~-~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~~~~~l~G 228 (324) .+. .+..+++++.+++. .+...+..+++|+|||++|..|+++||++|+|++.+ +.+++|+| T Consensus 247 ~~~--------------~~~~~~~~i~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G 312 (404) T protein:vir:39 247 PKK--------------PTIAKFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKG 312 (404) T ss_pred ccc--------------cccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCcceecc Confidence 432 13345788888765 677888888999999999999999999999999864 45579999 Q ss_pred ceeEeecCCCC-----CCceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEe Q lcl|Aclame:pro 229 LPVVNLKSSNL-----KRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIA 302 (324) Q Consensus 229 ~pv~~~~~~~~-----~~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~ 302 (324) +||+++.+... +...+++|||+. +.++++++++++++++.. ++|++|++.||++.|+|+.+. T Consensus 313 ~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------~~~~~~~~~~r~~~r~d~~~~ 380 (404) T protein:vir:39 313 KKVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA------------GAFETDTTKIRVIDRFDVKTT 380 (404) T ss_pred eeEEEecccccCccCCCccEEEEEeccccEEEEeecceEEEEeccch------------hhhhhceeeEEEEeeeccEEe Confidence 99998755333 344689999996 567889999999887542 469999999999999999999 Q ss_pred ccCceEEEEeecCC---CCCCCCC Q lcl|Aclame:pro 303 DDKAFAKLVPADKR---TDSVPGE 323 (324) Q Consensus 303 ~~~A~~~l~~~~~~---~~~~~~~ 323 (324) +|+||++++.++.. ++.+.|- T Consensus 381 ~~~a~~~~~~~~~a~~~~~~~~~~ 404 (404) T protein:vir:39 381 DSEALVAGSFTAIADQVGNFTAGK 404 (404) T ss_pred cccceEEEEeeccccCCCCCCCCC Confidence 99999999865543 4566666 No 64 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=100.00 E-value=6.1e-53 Score=306.83 Aligned_cols=295 Identities=17% Similarity=0.153 Sum_probs=233.6 Q ss_pred Cchh---HHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEE-- Q lcl|Aclame:pro 1 MEQT---QKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTF-- 75 (324) Q Consensus 1 ~~~~---~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~-- 75 (324) +... ...+...+.|.. .. ........+++++||.+||++++++|++.+++.++|++++++++++++...+++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~-~~--~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 157 (395) T protein:vir:38 81 LPVKDGKPDAQAMKNQFVK-DF--KNLVTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEK 157 (395) T ss_pred cchhhhhHHHHHHHHHHHH-HH--HHHHhhccCccCCCceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEe Confidence 1111 111111222211 11 111122344556788999999999999999999999999999998776656554 Q ss_pred EeC-CcceeeeccCcccccc-ccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCcc Q lcl|Aclame:pro 76 WAD-KPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNN 153 (324) Q Consensus 76 ~~~-~~~a~~v~Eg~~~~~~-~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~ 153 (324) ..+ .+.+.|++|++.+|++ .++|++++++++|++++++||+|+++|+.++++++|.++|++++++++|+++++|+|++ T Consensus 158 ~~~~~~~a~~v~E~~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~ 237 (395) T protein:vir:38 158 LADITPLKDLDDESALIGDNDDPELTVVKYLIHRYAGITTVTNTLLKDTVDNIIQWLVNWAAKKDVVTRNAKILEVMGKA 237 (395) T ss_pred eccCCccccccccccccccccccceeeEEeeeeeeEeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 333 4567899999999976 58999999999999999999999999999999999999999999999999999999876 Q ss_pred ccccccccccccccccccchhhhhHHHHHHH-HhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc----CCcceeec Q lcl|Aclame:pro 154 PFGKSIAQSIEKTNKVIKGDFTQDNIIDLEA-LLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDG 228 (324) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~-~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~~~~~l~G 228 (324) .+.. +..+++++.+++. .+...++.+++|+|||+++..|++++|++|+|+|.+ +.+++|+| T Consensus 238 ~~~~--------------~~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G 303 (395) T protein:vir:38 238 PKKP--------------TISQFDNIKDLENNTLDPAIESTSSFITNQSGYNILSKVKDADGRYLMQPDVTSPDKYLIDG 303 (395) T ss_pred cccc--------------ccccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCcceecc Confidence 4322 2235778888775 688889999999999999999999999999999864 45678999 Q ss_pred ceeEeecCCCC----CCceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEec Q lcl|Aclame:pro 229 LPVVNLKSSNL----KRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIAD 303 (324) Q Consensus 229 ~pv~~~~~~~~----~~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~ 303 (324) +||+++.++.. ++..++||||+. +.++.+++++++++++.. .+|++|++.||++.|+|+++.+ T Consensus 304 ~pV~~~~~~~~~~~~~~~~i~~gd~~~~~~i~~~~~~~i~~~~~~~------------~~~~~~~~~~r~~~r~d~~~~~ 371 (395) T protein:vir:38 304 KPVIRIADKWLPDVSGSHPLYFGDLKQGITLFDRQQMQIDTTNVGA------------GSFEHDTTKLRFIDRFDVQLID 371 (395) T ss_pred ceeEEecccccCcCCCcceEEEEeccccEEEEEecceEEEEecccc------------chhhcCceEEEEEEeeccEEec Confidence 99998865433 345589999996 668999999999987642 4599999999999999999999 Q ss_pred cCceEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 304 DKAFAKLVPADKRTDSVPGEV 324 (324) Q Consensus 304 ~~A~~~l~~~~~~~~~~~~~~ 324 (324) |+||++++.+++.+.++..-. T Consensus 372 ~~a~~~~~~~~~~~~~~~~~~ 392 (395) T protein:vir:38 372 DGAFAAASFKTVANQAQGTAG 392 (395) T ss_pred ccceEEEEeecccCCCCCccC Confidence 999999998877655444433 No 65 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=100.00 E-value=9.9e-53 Score=305.70 Aligned_cols=304 Identities=14% Similarity=0.077 Sum_probs=241.8 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEE--EeC Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTF--WAD 78 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~--~~~ 78 (324) ....+....+.+.|........... ....++++|+.+||+++.+.|++.+++.++|++++++++++++..++|+ .++ T Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 174 (415) T protein:vir:98 96 IQNTKVTSQEVRDFTEYLETRNDIQ-GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSE 174 (415) T ss_pred hhhhhhHHHHHHHHHHHHhhhhhhh-hccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecC Confidence 1111122233444433222222222 2334455678899999999999999999999999999999877666655 456 Q ss_pred CcceeeeccCcccccc-ccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccc Q lcl|Aclame:pro 79 KPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGK 157 (324) Q Consensus 79 ~~~a~~v~Eg~~~~~~-~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~ 157 (324) ...+.|++|++.+|+. .++|+++++.++|++++++||+|+++||.++++++|.++|+++++.++|+++++|+|++.... T Consensus 175 ~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~ 254 (415) T protein:vir:98 175 VAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGS 254 (415) T ss_pred CccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccc Confidence 6788999999999975 689999999999999999999999999999999999999999999999999999998766554 Q ss_pred cccccc-ccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc----CCcceeecceeE Q lcl|Aclame:pro 158 SIAQSI-EKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVV 232 (324) Q Consensus 158 ~~~~~~-~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~~~~~l~G~pv~ 232 (324) ...... ........+..++++|.+++.++...++.+++|+||+++|..|+++||++|+|+|.+ +.+++|+|+||+ T Consensus 255 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~ 334 (415) T protein:vir:98 255 TSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIE 334 (415) T ss_pred ccccccccccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeE Confidence 443322 233344556788999999999999999999999999999999999999999999864 345799999999 Q ss_pred eecCCCCC---CceeEEeecccE-EEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceE Q lcl|Aclame:pro 233 NLKSSNLK---RGELITGDFDKL-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFA 308 (324) Q Consensus 233 ~~~~~~~~---~~~~i~gd~s~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~ 308 (324) ++++.+.+ +..++||||+.+ .++.+++++++.++ |.++.+.+|+.+|+|+.+.||+||+ T Consensus 335 ~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~-----------------~~~~~~~~~~~~r~d~~v~~~~a~~ 397 (415) T protein:vir:98 335 ILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----------------YMHFGECLMIAVRQDCRILDYKSAI 397 (415) T ss_pred EecccccCCCCccEEEEEehhccEEEEeecceEEEEec-----------------cccCceEEEEEEEeccEEeccccEE Confidence 98776543 345899999974 57889999998754 4556678999999999999999999 Q ss_pred EEEeecCCCCCCCCCC Q lcl|Aclame:pro 309 KLVPADKRTDSVPGEV 324 (324) Q Consensus 309 ~l~~~~~~~~~~~~~~ 324 (324) +++..+. ..+||.. T Consensus 398 ~~~~~~~--~~~~~~~ 411 (415) T protein:vir:98 398 VIEYDDS--ERGEGDL 411 (415) T ss_pred EEEEecc--CCCCCcc Confidence 9998554 4567777 No 66 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=100.00 E-value=9.9e-53 Score=305.70 Aligned_cols=304 Identities=14% Similarity=0.077 Sum_probs=241.8 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEE--EeC Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTF--WAD 78 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~--~~~ 78 (324) ....+....+.+.|........... ....++++|+.+||+++.+.|++.+++.++|++++++++++++..++|+ .++ T Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 174 (415) T protein:vir:79 96 IQNTKVTSQEVRDFTEYLETRNDIQ-GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSE 174 (415) T ss_pred hhhhhhHHHHHHHHHHHHhhhhhhh-hccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecC Confidence 1111122233444433222222222 2334455678899999999999999999999999999999877666655 456 Q ss_pred CcceeeeccCcccccc-ccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccc Q lcl|Aclame:pro 79 KPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGK 157 (324) Q Consensus 79 ~~~a~~v~Eg~~~~~~-~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~ 157 (324) ...+.|++|++.+|+. .++|+++++.++|++++++||+|+++||.++++++|.++|+++++.++|+++++|+|++.... T Consensus 175 ~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~ 254 (415) T protein:vir:79 175 VAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGS 254 (415) T ss_pred CccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccc Confidence 6788999999999975 689999999999999999999999999999999999999999999999999999998766554 Q ss_pred cccccc-ccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc----CCcceeecceeE Q lcl|Aclame:pro 158 SIAQSI-EKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVV 232 (324) Q Consensus 158 ~~~~~~-~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~~~~~l~G~pv~ 232 (324) ...... ........+..++++|.+++.++...++.+++|+||+++|..|+++||++|+|+|.+ +.+++|+|+||+ T Consensus 255 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~ 334 (415) T protein:vir:79 255 TSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIE 334 (415) T ss_pred ccccccccccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeE Confidence 443322 233344556788999999999999999999999999999999999999999999864 345799999999 Q ss_pred eecCCCCC---CceeEEeecccE-EEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceE Q lcl|Aclame:pro 233 NLKSSNLK---RGELITGDFDKL-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFA 308 (324) Q Consensus 233 ~~~~~~~~---~~~~i~gd~s~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~ 308 (324) ++++.+.+ +..++||||+.+ .++.+++++++.++ |.++.+.+|+.+|+|+.+.||+||+ T Consensus 335 ~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~-----------------~~~~~~~~~~~~r~d~~v~~~~a~~ 397 (415) T protein:vir:79 335 ILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----------------YMHFGECLMIAVRQDCRILDYKSAI 397 (415) T ss_pred EecccccCCCCccEEEEEehhccEEEEeecceEEEEec-----------------cccCceEEEEEEEeccEEeccccEE Confidence 98776543 345899999974 57889999998754 4556678999999999999999999 Q ss_pred EEEeecCCCCCCCCCC Q lcl|Aclame:pro 309 KLVPADKRTDSVPGEV 324 (324) Q Consensus 309 ~l~~~~~~~~~~~~~~ 324 (324) +++..+. ..+||.. T Consensus 398 ~~~~~~~--~~~~~~~ 411 (415) T protein:vir:79 398 VIEYDDS--ERGEGDL 411 (415) T ss_pred EEEEecc--CCCCCcc Confidence 9998554 4567777 No 67 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=100.00 E-value=9.9e-53 Score=305.70 Aligned_cols=304 Identities=14% Similarity=0.077 Sum_probs=241.8 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEE--EeC Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTF--WAD 78 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~--~~~ 78 (324) ....+....+.+.|........... ....++++|+.+||+++.+.|++.+++.++|++++++++++++..++|+ .++ T Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 174 (415) T protein:vir:81 96 IQNTKVTSQEVRDFTEYLETRNDIQ-GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSE 174 (415) T ss_pred hhhhhhHHHHHHHHHHHHhhhhhhh-hccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecC Confidence 1111122233444433222222222 2334455678899999999999999999999999999999877666655 456 Q ss_pred CcceeeeccCcccccc-ccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccc Q lcl|Aclame:pro 79 KPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGK 157 (324) Q Consensus 79 ~~~a~~v~Eg~~~~~~-~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~ 157 (324) ...+.|++|++.+|+. .++|+++++.++|++++++||+|+++||.++++++|.++|+++++.++|+++++|+|++.... T Consensus 175 ~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~ 254 (415) T protein:vir:81 175 VAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGS 254 (415) T ss_pred CccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccc Confidence 6788999999999975 689999999999999999999999999999999999999999999999999999998766554 Q ss_pred cccccc-ccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc----CCcceeecceeE Q lcl|Aclame:pro 158 SIAQSI-EKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVV 232 (324) Q Consensus 158 ~~~~~~-~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~~~~~l~G~pv~ 232 (324) ...... ........+..++++|.+++.++...++.+++|+||+++|..|+++||++|+|+|.+ +.+++|+|+||+ T Consensus 255 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~ 334 (415) T protein:vir:81 255 TSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIE 334 (415) T ss_pred ccccccccccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeE Confidence 443322 233344556788999999999999999999999999999999999999999999864 345799999999 Q ss_pred eecCCCCC---CceeEEeecccE-EEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceE Q lcl|Aclame:pro 233 NLKSSNLK---RGELITGDFDKL-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFA 308 (324) Q Consensus 233 ~~~~~~~~---~~~~i~gd~s~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~ 308 (324) ++++.+.+ +..++||||+.+ .++.+++++++.++ |.++.+.+|+.+|+|+.+.||+||+ T Consensus 335 ~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~-----------------~~~~~~~~~~~~r~d~~v~~~~a~~ 397 (415) T protein:vir:81 335 ILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----------------YMHFGECLMIAVRQDCRILDYKSAI 397 (415) T ss_pred EecccccCCCCccEEEEEehhccEEEEeecceEEEEec-----------------cccCceEEEEEEEeccEEeccccEE Confidence 98776543 345899999974 57889999998754 4556678999999999999999999 Q ss_pred EEEeecCCCCCCCCCC Q lcl|Aclame:pro 309 KLVPADKRTDSVPGEV 324 (324) Q Consensus 309 ~l~~~~~~~~~~~~~~ 324 (324) +++..+. ..+||.. T Consensus 398 ~~~~~~~--~~~~~~~ 411 (415) T protein:vir:81 398 VIEYDDS--ERGEGDL 411 (415) T ss_pred EEEEecc--CCCCCcc Confidence 9998554 4567777 No 68 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=100.00 E-value=5.8e-53 Score=306.97 Aligned_cols=298 Identities=13% Similarity=0.076 Sum_probs=236.0 Q ss_pred CchhHHHHHHHHHHHh----hhhhH-----Hhhccc-cccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCc Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFAS----NNVKP-----QVFNPD-NVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTE 70 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~----~~~~~-----~~~~~~-~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~ 70 (324) ++++.......+.+.. +...+ ....+. ..++.+.++.+||++++++|++.+++.++|+++++++|++++. T Consensus 126 ~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 205 (458) T protein:vir:10 126 TQENFEDEVEKLVLLSYVMEKGVFETEHGQRHLKAVNQSSSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSKI 205 (458) T ss_pred hhhhHHHHHHHHHHHHHHHhhccchhhhhhhhhhhhhhcccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCcc Confidence 1111111111111111 10000 011111 2233456888999999999999999999999999999999999 Q ss_pred eEEEEEeCCcceeeeccCcccccc------ccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 71 KKFTFWADKPGAYWVGEGQKIETS------KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDE 144 (324) Q Consensus 71 ~~ip~~~~~~~a~~v~Eg~~~~~~------~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~ 144 (324) ..+|+.++.+.+.|++|++.++++ +++|+++++.++|++++++||+|+++|+.++++++|.++|+++++.++|. T Consensus 206 ~~~~~~~~~~~a~~v~e~~~~~~~~~~~~~~~~~~~i~~~~~k~~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~ 285 (458) T protein:vir:10 206 LTMLVEPDAGKATWVAASTYGTDTTTGEEVKGALKEIHFSTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEE 285 (458) T ss_pred eEEEEecCCcceeecccccccccccccccccccceeeEeeeeeEEeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHH Confidence 999999999999999999988854 56899999999999999999999999999999999999999999999999 Q ss_pred HHHhccCccccccccccccc--------cccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCce Q lcl|Aclame:pro 145 AGILNQGNNPFGKSIAQSIE--------KTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKE 216 (324) Q Consensus 145 ~~l~G~g~~~~~~~~~~~~~--------~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~ 216 (324) ++|+|+|++.+. |+.+... .......+.+++++|.+++..+...++.+++|+||+++|..|++++|++|+| T Consensus 286 ~~l~G~G~~~p~-Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~ 364 (458) T protein:vir:10 286 AFMTGDGSGKPK-GLLTLASEDSAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQD 364 (458) T ss_pred HhhcCCCCCccc-eeeecccccccceeecccccccccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHhhcccCCce Confidence 999999987544 4433221 1122233456899999999999999999999999999999999999999999 Q ss_pred eecc--------CCcceeecceeEeecCCCC--CCceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhh Q lcl|Aclame:pro 217 RIYD--------RNSDSLDGLPVVNLKSSNL--KRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFE 285 (324) Q Consensus 217 ~~~~--------~~~~~l~G~pv~~~~~~~~--~~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~ 285 (324) ++.. +.+++|+|+||+++..++. ....+++|||+. +.++++.++++..++ +++ T Consensus 365 i~~~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~~~~~~~f~~~~~~~~~~~~~v~~d~----------------~~~ 428 (458) T protein:vir:10 365 VAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAKANSAEFAVIVYKDNFVMPRQRAVTVERER----------------QAG 428 (458) T ss_pred eeccccccccccCcCceecceeeEEccccccccCCcceEEEEecccEEEEEeeceEEEeec----------------ccC Confidence 9742 3346899999999877665 345789999964 678999999887653 357 Q ss_pred cCcEEEEEEEEeccEEeccCceEEEEeecC Q lcl|Aclame:pro 286 QDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) Q Consensus 286 ~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~ 315 (324) +|++.||++.|+|+.+.+|+||++.+.++. T Consensus 429 ~~~~~~~~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 429 KQRDAYYVTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred CCceEEEEEEEecceEecccceEEEeeccC Confidence 899999999999999999999999988887 No 69 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=100.00 E-value=1.5e-53 Score=310.13 Aligned_cols=294 Identities=13% Similarity=0.172 Sum_probs=239.8 Q ss_pred CchhHHHH-------------HHHHHHHhhhhh---------------HHhhccccccccccCccccchHH-HHHHHHHH Q lcl|Aclame:pro 1 MEQTQKLK-------------LNLQHFASNNVK---------------PQVFNPDNVMMHEKKDGTLMNEF-TTPILQEV 51 (324) Q Consensus 1 ~~~~~~~k-------------~~~~~~a~~~~~---------------~~~~~~~~~~~~~~~~~~vp~~~-~~~i~~~~ 51 (324) |......+ .+.+.+ ....+ ..+.++....++++||++||+++ ..+|++.+ T Consensus 304 ~~~~~l~rai~a~a~~~~~~a~~~~e~-a~~~a~~~G~~arg~~~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~l 382 (632) T protein:vir:96 304 LQQYSLMRAINAAATGDWSKAGFEREV-SLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDIL 382 (632) T ss_pred HHHHHHHHHHHhhhccchhhhhhhhHH-HHHHHHhhhhhhhhhhhhHHHHHHhhhhcccccccccccccccchHHHHHHH Confidence 00000000 000000 00000 01124445566678899999886 68899999 Q ss_pred Hhhhhhhhh-cceeecCCCceEEEEEeCCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHH Q lcl|Aclame:pro 52 MENSKIMQL-GKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEM 130 (324) Q Consensus 52 ~~~s~l~~l-~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i 130 (324) ++.++++++ ++.+++.++.++||+.++++.++|++|++.+|+++++|++++++++|++++++||+|+++||.++++++| T Consensus 383 r~~s~i~~l~~~~~~~~~g~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~~i~l~~~k~~~~v~iS~ell~ds~~~~~~~i 462 (632) T protein:vir:96 383 RNKAIIGQMGARMLPGLVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLI 462 (632) T ss_pred hhcchhhhhcceEeecCCcceEEEEEeCCceeEeecCCccccccccceeeEEeeeeEEEEehhhHHHHHhccchHHHHHH Confidence 999999998 5778988889999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhccCccccccccccccccccc-cccchhhhhHHHHHHHHhhhhcC--CCcEEEEcHHHHHHHH Q lcl|Aclame:pro 131 KPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNK-VIKGDFTQDNIIDLEALLEDDEL--EANAFISKTQNRSLLR 207 (324) Q Consensus 131 ~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~-~~~~~~~~~~i~~~~~~l~~~~~--~~~~~v~~~~~~~~l~ 207 (324) .+.|.++++.++|+++|+|+|++..+.|+.+....... ..++.++++++.++..++...+. .++.|+||+.++..|+ T Consensus 463 ~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~ 542 (632) T protein:vir:96 463 REDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAK 542 (632) T ss_pred HHHHHHHHHHHHHHHhhcccCCCCccceeeecccccceecccccCCHHHHHHHHHHHhhcccccCccEEEEchhHHHHHH Confidence 99999999999999999999987777777665443322 23456789999999999987764 4568999999988776 Q ss_pred H--hhccCCceeeccCCcceeecceeEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhh Q lcl|Aclame:pro 208 K--IVDPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFE 285 (324) Q Consensus 208 ~--~~d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~ 285 (324) + ++|.+|+|+|.+ ++++|+||+++.. ++.++++||||+++++++++++++.++++. .|. T Consensus 543 ~~~l~d~~G~~i~~~---~~l~G~pv~~s~~--ip~~~~~~gd~s~~~i~~~~~~~i~~~~~~--------------~~~ 603 (632) T protein:vir:96 543 KAQVFDNTGERIWQN---NEVNGYRAEASNQ--IPADTWIFGDWSQIVIAMWGVLDLKVDPYT--------------KAA 603 (632) T ss_pred HHhccCCCCceeecC---CeecccceEeccc--cccCcEEEeecceEEEEEecceEEEEcccc--------------ccc Confidence 5 779999999964 5899999998765 557789999999999999999999998764 378 Q ss_pred cCcEEEEEEEEeccEEeccCceEEEEeec Q lcl|Aclame:pro 286 QDMVALRATMHVALHIADDKAFAKLVPAD 314 (324) Q Consensus 286 ~~~v~~r~~~r~d~~v~~~~A~~~l~~~~ 314 (324) +|++.||++.|+|++++||+||++++.+| T Consensus 604 ~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 604 SDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred cCceEEEEEeecCceeechhhhhheeecC Confidence 99999999999999999999999999998 No 70 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=100.00 E-value=9.1e-53 Score=305.89 Aligned_cols=298 Identities=14% Similarity=0.113 Sum_probs=236.1 Q ss_pred CchhHHHHHHHHHHHhhh------hhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEE Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNN------VKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFT 74 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~------~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip 74 (324) ..+.+....+.+.|.... ....+.++.+.++..+||.+||++++++|++.+++.++|++++++++++++...++ T Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~ 163 (408) T protein:vir:74 84 KSENELKDKFVKDFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSSGSRV 163 (408) T ss_pred chhhhhHHHHHHHHHHHHhcchhhhhhhhhhhhcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeeccCCcceEE Confidence 333333333344343211 11123445556677778999999999999999999999999999999887665554 Q ss_pred --EEeC-CcceeeeccCccccc-cccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 75 --FWAD-KPGAYWVGEGQKIET-SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ 150 (324) Q Consensus 75 --~~~~-~~~a~~v~Eg~~~~~-~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~ 150 (324) +..+ .+.+.|++|++.+|+ +.++|++++++++|++++++||+|+++|+.++++++|.++|+++++.++|+++|+|+ T Consensus 164 ~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~ 243 (408) T protein:vir:74 164 YEKWTDVTPLKAMDEEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIAKKVVVTRNQAIIAAM 243 (408) T ss_pred EEeecCCcccccccccccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 4443 456789999999997 569999999999999999999999999999999999999999999999999999999 Q ss_pred CccccccccccccccccccccchhhhhHHHHHH-HHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc----CCcce Q lcl|Aclame:pro 151 GNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLE-ALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDS 225 (324) Q Consensus 151 g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~~~~~ 225 (324) |++.+.. +..+++++.+++ ..+...++.+++|+|||+++.+|+++||++|+|+|.+ +.+++ T Consensus 244 G~~~~~~--------------~~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~ 309 (408) T protein:vir:74 244 GTVPKKP--------------TIANFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYL 309 (408) T ss_pred ccccccc--------------ccccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceEeccCcCCCCCce Confidence 9765432 334578888876 5888999999999999999999999999999999964 45679 Q ss_pred eecceeEeecC-----CCCCCceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEecc Q lcl|Aclame:pro 226 LDGLPVVNLKS-----SNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVAL 299 (324) Q Consensus 226 l~G~pv~~~~~-----~~~~~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~ 299 (324) |+|+||+++++ ...+...+++|||+. +.++.+++++++++++. +..|++|++.||++.|+|| T Consensus 310 l~G~pV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~------------~~~f~~~~~~~r~~~r~d~ 377 (408) T protein:vir:74 310 IKGKQVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIG------------AGAFETDTTKIRVIDRFDV 377 (408) T ss_pred ecceeeEEecCcccccccCCcceEEEEehhccEEEEEecceEEEEeccc------------cchhhcceeeEEEEEeeCc Confidence 99999988654 234456789999997 56888999999988753 2458999999999999999 Q ss_pred EEeccCceEEEEeecCCC--CCCCC----CC Q lcl|Aclame:pro 300 HIADDKAFAKLVPADKRT--DSVPG----EV 324 (324) Q Consensus 300 ~v~~~~A~~~l~~~~~~~--~~~~~----~~ 324 (324) ++.+|+||++++.++..+ ..+|. .| T Consensus 378 ~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 408 (408) T protein:vir:74 378 KATDSEALVAGSFTAIADQVGNFKTTTSTAV 408 (408) T ss_pred EEecccceEEEEeecccCCCCCCCCCccccC Confidence 999999999998744321 11111 11 No 71 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=100.00 E-value=3.7e-53 Score=308.05 Aligned_cols=288 Identities=14% Similarity=0.077 Sum_probs=233.1 Q ss_pred CchhHHHHHHHHHHHhhhh--------hHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCC--Cc Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNV--------KPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEG--TE 70 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~--------~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~--~~ 70 (324) -......+.+.+....+.. ...+.++...+++++||.+||++++..|++.+++.++|+++++++++++ +. T Consensus 89 ~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 168 (397) T protein:vir:12 89 ERQQQYSKAFLKGLRGKRLTDEERDLLDSPEFRAMSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVEPVTTRSGT 168 (397) T ss_pred HHHHHHHHHHHHHHhccCCcHHHHHHHhhhhhhhccccccccCcccCchhHHHHHHHhhhhhhhHHhhcceeeccCCcee Confidence 0001111111111111111 1112344455667788999999999999999999999999999999875 44 Q ss_pred eEEEEEeCCcceeeeccCcccccc-ccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 71 KKFTFWADKPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILN 149 (324) Q Consensus 71 ~~ip~~~~~~~a~~v~Eg~~~~~~-~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G 149 (324) +.+|+.++.+.+.|++|++++|++ .++|+++++.++|+++.++||+|+++|+.++++++|.+.|++++++++|.++++| T Consensus 169 ~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~is~e~l~ds~~~l~~~i~~~l~~~~~~~~d~~il~G 248 (397) T protein:vir:12 169 RLLEKNADMVPFSPVEELGNLPEIDQPRFTKVSYSIIDYGGIMTLSNSMLNDSDQAIMTYVAKWFAKKSVVTRNNLILAA 248 (397) T ss_pred EEEEEecCCcceeeecccccccccccccceeEEeeheeeEeeehhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 556777778889999999999974 6899999999999999999999999999999999999999999999999999999 Q ss_pred cCccccccccccccccccccccchhhhhHHHHHH-HHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc----CCcc Q lcl|Aclame:pro 150 QGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLE-ALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSD 224 (324) Q Consensus 150 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~-~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~~~~ 224 (324) +|++.+. +..+++++.+++ ..+...+..+++|+|||+++.+|++++|++|+|+|.+ +.++ T Consensus 249 ~g~~~~~---------------g~~~~~~i~~~~~~~l~~~~~~~a~~~~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~~ 313 (397) T protein:vir:12 249 IASLKKV---------------DIDGLDGIKKALNVTLDPMVAPGSIVLTNQDGYDWLDTLKDGTGRYLLQPDPTNPTKK 313 (397) T ss_pred ccccccc---------------ccccHHHHHHHHhhccchhhhCCCEEEEcHHHHHHHHHhhccCCceeecccccCCCCc Confidence 9875432 234578888877 4888899999999999999999999999999999863 4567 Q ss_pred eeecceeEeecCC----CCCCceeEEeecccE-EEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEecc Q lcl|Aclame:pro 225 SLDGLPVVNLKSS----NLKRGELITGDFDKL-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVAL 299 (324) Q Consensus 225 ~l~G~pv~~~~~~----~~~~~~~i~gd~s~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~ 299 (324) +|+|+||+++++. ..++..+++|||+.+ .++.+++++++++++. ...|++|++.||++.|+|+ T Consensus 314 ~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~------------~~~f~~~~~~~r~~~r~d~ 381 (397) T protein:vir:12 314 LLDGRPVVPFTNRVLKTQKGKAPLIIGNLKEAIVLFDREQQSIASTDTG------------AGAFETNSTKVRGIEREDV 381 (397) T ss_pred cccceeeEEecccccccCCCccEEEEEehhceEEEEeecceEEEEeccc------------cchhhcCceEEEEEEeecc Confidence 8999999876542 234556899999975 5788999999987653 2569999999999999999 Q ss_pred EEeccCceEEEEeecC Q lcl|Aclame:pro 300 HIADDKAFAKLVPADK 315 (324) Q Consensus 300 ~v~~~~A~~~l~~~~~ 315 (324) .+.+|+||++++.++. T Consensus 382 ~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 382 RKWDEDAVVFGQITVE 397 (397) T ss_pred EEecccceEEEEEeeC Confidence 9999999999999988 No 72 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=100.00 E-value=1.5e-52 Score=304.63 Aligned_cols=304 Identities=13% Similarity=0.079 Sum_probs=242.4 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEE--EeC Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTF--WAD 78 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~--~~~ 78 (324) .........+.+.|..........+ ...+++++|+.+||+++..+|++.+++.++|++++++++++++..++|+ .++ T Consensus 96 ~~~~~~~~~e~~~~~~~~~~~~~~~-~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 174 (415) T protein:vir:94 96 IQNTKVTSQEVRDFTEYLETRNDIQ-GGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSE 174 (415) T ss_pred hhhhhhhHHHHHHHHHHhhhhhhhh-hhccccccccccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEeecC Confidence 1111122233444433222222222 2334556788899999999999999999999999999999877666654 456 Q ss_pred CcceeeeccCcccccc-ccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccc Q lcl|Aclame:pro 79 KPGAYWVGEGQKIETS-KATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGK 157 (324) Q Consensus 79 ~~~a~~v~Eg~~~~~~-~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~ 157 (324) .+.+.|++|++.+|+. .++|+++++.++|++++++||+|+++||.++++++|.++|+++++.++|+++++|+|++.... T Consensus 175 ~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~ 254 (415) T protein:vir:94 175 VAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGS 254 (415) T ss_pred CccceeccccccccccccccceeeEeeheeeeeechhhHHHHhhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccc Confidence 7789999999999974 689999999999999999999999999999999999999999999999999999998776655 Q ss_pred ccccccc-cccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc----CCcceeecceeE Q lcl|Aclame:pro 158 SIAQSIE-KTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVV 232 (324) Q Consensus 158 ~~~~~~~-~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~~~~~l~G~pv~ 232 (324) +...... ......++..++++|.+++.++...++.+++|+||+++|.+|+++||++|+|++.+ +.+++|+|+||+ T Consensus 255 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~ 334 (415) T protein:vir:94 255 TSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIE 334 (415) T ss_pred ccccccccccccccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCceecceeeE Confidence 5443332 23334445678999999999999999999999999999999999999999999854 345789999999 Q ss_pred eecCCCCC---CceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceE Q lcl|Aclame:pro 233 NLKSSNLK---RGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFA 308 (324) Q Consensus 233 ~~~~~~~~---~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~ 308 (324) ++++++.+ +..++||||++ +.++.+++++++.++ |.++.+.+|++.|+|+++.+|+||+ T Consensus 335 ~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~-----------------~~~~~~~~r~~~r~d~~~~~~~a~~ 397 (415) T protein:vir:94 335 ILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-----------------YMHFGECLMIAVRQDCRILDYKSAI 397 (415) T ss_pred EecccccCCCCccEEEEEehhccEEEEeecceEEEEec-----------------cccCceEEEEEEEeccEEeccccEE Confidence 98776543 33589999997 457788999988754 5567788999999999999999999 Q ss_pred EEEeecCCCCCCCCCC Q lcl|Aclame:pro 309 KLVPADKRTDSVPGEV 324 (324) Q Consensus 309 ~l~~~~~~~~~~~~~~ 324 (324) +++..++ ...||.. T Consensus 398 ~~~~~~~--~~~~~~~ 411 (415) T protein:vir:94 398 VIEYDDS--ERGEGDL 411 (415) T ss_pred EEEEecc--CCCCCcc Confidence 9998554 4557777 No 73 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=100.00 E-value=6.4e-53 Score=306.74 Aligned_cols=280 Identities=15% Similarity=0.120 Sum_probs=225.0 Q ss_pred cccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccCccccccccceeeEEeehee Q lcl|Aclame:pro 28 NVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFK 107 (324) Q Consensus 28 ~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k 107 (324) =+++++++|++||++++++|++.+++.++|++++++++++++..+||+.++.+.+.|++|++++|+++++|+++++.++| T Consensus 1 Mat~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v~l~~~k 80 (311) T protein:vir:99 1 MATFGTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNEDIITFNGRPKAEFVGEGQQKSSTTGEFDFVTSTPKK 80 (311) T ss_pred CceecCCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCceeEEeecCcccccccceeeEEEEeeEE Confidence 12445678899999999999999999999999999999998889999999999999999999999999999999999999 Q ss_pred eEEeeeehHHHhh---cChHHHHHHHHHHHHHHHHHHHHHHHHhccCcccc--ccccccc----cccccccccc-hhhhh Q lcl|Aclame:pro 108 LGVILPVTKEFLN---YTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPF--GKSIAQS----IEKTNKVIKG-DFTQD 177 (324) Q Consensus 108 ~~~~~~iS~e~l~---ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~--~~~~~~~----~~~~~~~~~~-~~~~~ 177 (324) ++++++||+|+++ |+.++++++|.++|++++++++|+++|+|+|++.. +.++... .........+ ....+ T Consensus 81 ~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 160 (311) T protein:vir:99 81 AQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVELTADTIANPDL 160 (311) T ss_pred EEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceeeccccccchhHH Confidence 9999999999995 66799999999999999999999999999875432 2222221 1111111222 23346 Q ss_pred HHHHHHHHhhhhc--CCCcEEEEcHHHHHHHHHhhccCCceeecc----CCcceeecceeEeecCCC------------- Q lcl|Aclame:pro 178 NIIDLEALLEDDE--LEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVVNLKSSN------------- 238 (324) Q Consensus 178 ~i~~~~~~l~~~~--~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~~~~~l~G~pv~~~~~~~------------- 238 (324) ++.+++..+.... ...+.|+||++++..|+++||.+|+|+|.+ +.+++|+|+||+++...+ T Consensus 161 ~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~Pv~~s~~i~~~~~~~~~~~~~~ 240 (311) T protein:vir:99 161 AIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYTDGRKKFPELGLGIGVSSFEGIDASVSDTVNGGDEADPDDEDLD 240 (311) T ss_pred HHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhccCCCeeecCcccCCCCceecceeeEeecccccccccccccchhh Confidence 6777777766553 344579999999999999999999999864 345689999999875432 Q ss_pred -CCCceeEEeecccE-EEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecC Q lcl|Aclame:pro 239 -LKRGELITGDFDKL-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) Q Consensus 239 -~~~~~~i~gd~s~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~ 315 (324) .+...+++|||+.. .++.+++++++++++.. ....+++|++|+++||+++|+||++.|| +|++++.++| T Consensus 241 ~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~d~~~~r~~~r~d~~v~~~-~~v~~~~~~A 311 (311) T protein:vir:99 241 AARAVRGIVGDFANGIHWGVQRDIPVELIKYGD-------PDGQGDLKRHNQIALRLEIVYGWYVFTD-RFVVIENAVA 311 (311) T ss_pred ccCcceEEEeeccccEEEEEecCceEEEeecCC-------CCcchhhhhcCcEEEEEEEeecceecCh-hHeeeecccC Confidence 12334577888864 47888888888876542 2346789999999999999999999996 6777888777 No 74 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=100.00 E-value=1.3e-52 Score=305.07 Aligned_cols=290 Identities=10% Similarity=0.042 Sum_probs=233.6 Q ss_pred CchhHHHHHHHH------HHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEE Q lcl|Aclame:pro 1 MEQTQKLKLNLQ------HFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFT 74 (324) Q Consensus 1 ~~~~~~~k~~~~------~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip 74 (324) .......+...+ ++....... .......++++++++++|+++...|++.++..+++++++++++++++.++|| T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~ 154 (379) T protein:vir:10 76 DKSDSLVKSITENFNDIKEVRNGKSIQ-VKAVGDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSISGGTYTFV 154 (379) T ss_pred ccchhHHHHHHHHHHhHHHHHhhhhhh-hhhhcccccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeeccCCceEEE Confidence 111111111111 111100000 1111233556667789999999999999999999999999999999999999 Q ss_pred EEeCC--cceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCc Q lcl|Aclame:pro 75 FWADK--PGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGN 152 (324) Q Consensus 75 ~~~~~--~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~ 152 (324) +.++. ..+.|++||+.+|+++++|+++++.++|++++++||+|+++|+ ++++++|.++|+++++.++|.+++.|+|+ T Consensus 155 ~~~~~~~~~~~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~D~-~~l~~~i~~~la~~~~~~~~~~~~~g~~~ 233 (379) T protein:vir:10 155 RENGAGEGAIGAQVEGATKGQKDYDISMIDVNTDFIAGFTRYSKKMANNL-PFLTSFIPNALRRDYAKAENAAFNAVLAA 233 (379) T ss_pred EeecCCCcccccccCCccccccccceeeeEeeeeeEEeeehhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHhccccc Confidence 98754 4567999999999999999999999999999999999999997 47999999999999999999999998876 Q ss_pred cccccccccccccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeeccC------Cccee Q lcl|Aclame:pro 153 NPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDR------NSDSL 226 (324) Q Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~------~~~~l 226 (324) +.... ....++..+++++.+++.++...++.++.|+|||++|..|+++||++|+|+++++ .+.+| T Consensus 234 ~~~~~---------~~~~~~~~~~d~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~l 304 (379) T protein:vir:10 234 NATAS---------TEIITNKNKVEMLINEIAKQENLDFPVTAIVLRPTDYYDILVTQKSVGAGYGLPGVVTQDNGVLRI 304 (379) T ss_pred ccccc---------cccccCcccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCceeccCCccCCCCCccee Confidence 43211 1112334557889999999999999999999999999999999999999998643 34589 Q ss_pred ecceeEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCc Q lcl|Aclame:pro 227 DGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKA 306 (324) Q Consensus 227 ~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A 306 (324) +|+||++++. ++++++++|||+++++..++++.++++++.. ++|++|++.||++.|+||.|.||+| T Consensus 305 ~G~pvv~s~~--~~ag~~~~gdf~~~~~~~~~~~~i~~~~~~~------------~~f~~~~~~~r~~~R~~~~v~~p~a 370 (379) T protein:vir:10 305 NGIPLFRATW--LAANKYYVGDWTRVTKVTTEGLSLEFSEVEG------------TNFVKNNITARIEAQVALAVEQPAA 370 (379) T ss_pred cceeeEecCC--CCCCceEEeecccEEEEEEeceEEEEeeccc------------ccccCCcEEEEEEEEeccEEecCcc Confidence 9999998765 4578899999999999999999999876532 4599999999999999999999999 Q ss_pred eEEEEeecC Q lcl|Aclame:pro 307 FAKLVPADK 315 (324) Q Consensus 307 ~~~l~~~~~ 315 (324) |++++.++- T Consensus 371 ~v~~~~~~~ 379 (379) T protein:vir:10 371 LIFGDFTAV 379 (379) T ss_pred EEEEEecCC Confidence 999998777 No 75 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=100.00 E-value=2.2e-52 Score=303.77 Aligned_cols=303 Identities=11% Similarity=0.074 Sum_probs=229.3 Q ss_pred Cch-----hHHHHHHHHH----HHh------hhhh---HHhh-ccccccccccCccccchHHHHHHHHHHHhhhhhhhhc Q lcl|Aclame:pro 1 MEQ-----TQKLKLNLQH----FAS------NNVK---PQVF-NPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLG 61 (324) Q Consensus 1 ~~~-----~~~~k~~~~~----~a~------~~~~---~~~~-~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~ 61 (324) ++. ..+.+...++ +.. .... ++.+ ......+.+++|++||++++++|++.+++.++|++++ T Consensus 39 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~ 118 (390) T protein:vir:40 39 AEQIQNNIIAQARKEVNREMNDNNVLASRGANALTSDESKYYNEVIAGNGFAGVTALLPPTVFERVFEDLTVEHPLLSKI 118 (390) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhccHHHHHHHHHHHhccCcccCcccccHHHHHHHHHHHHhhhhhhhhc Confidence 000 0000000000 000 0000 0001 1112234567889999999999999999999999999 Q ss_pred ceeecCCCceEEEEEeCCcceeeeccCccccc-cccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 62 KYEPMEGTEKKFTFWADKPGAYWVGEGQKIET-SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYK 140 (324) Q Consensus 62 ~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~-~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~ 140 (324) ++++++++...+|+.++.+.+.|++|++++++ ++++|++++++++|++++++||+|+++|+.++++++|.++++++++. T Consensus 119 ~~~~~~~~~~~i~~~~~~~~a~~~~E~~~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~i~~ 198 (390) T protein:vir:40 119 NFVNTTATTEWIISVGDVATAWWGPLCAEIKEVLDNGFDKIQTGMYKLSAYIPVCNAMLDLGPSWLDQYVRTILGEAMAL 198 (390) T ss_pred eeeecCCceeEEEEEcCCcceeeeccccccCccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999998875 68999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhccCcccccccccccccccc-----ccccchhhhhHHHHHHHHhhhh-------cCCCcEEEEcHHHH-H--- Q lcl|Aclame:pro 141 KFDEAGILNQGNNPFGKSIAQSIEKTN-----KVIKGDFTQDNIIDLEALLEDD-------ELEANAFISKTQNR-S--- 204 (324) Q Consensus 141 ~~d~~~l~G~g~~~~~~~~~~~~~~~~-----~~~~~~~~~~~i~~~~~~l~~~-------~~~~~~~v~~~~~~-~--- 204 (324) ++|+++|+|+|++.+ .|+.+...... ...+..+++++..++...+... ...+++|+||+.++ . T Consensus 199 ~~~~a~l~G~G~~~P-~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~ 277 (390) T protein:vir:40 199 GLEAGIVNGSGKDQP-IGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIY 277 (390) T ss_pred HHHhhhhcccCCCcc-ceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHH Confidence 999999999998754 44443322111 1123345555666665555433 34577899999874 3 Q ss_pred HHHHhhccCCceeeccCCcceeecceeEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhh Q lcl|Aclame:pro 205 LLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLF 284 (324) Q Consensus 205 ~l~~~~d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f 284 (324) .++.++|.+|+|++.. .++|+||+.++. +++++++||||++++++++++++++++++. +| T Consensus 278 ~~~~~~d~~G~~v~~~----~~~g~pvv~~~~--~p~~~i~~Gd~s~~~i~~~~~~~v~~~~~~--------------~f 337 (390) T protein:vir:40 278 AATSYMTPQGVWVTGI----LPVPLEIVQSVA--VPVGKAVAGRAKDYFMGIGSEQVIRTSTEY--------------RL 337 (390) T ss_pred HHhhccCCCCcccccc----CCCceeEEEcCC--CCCCcEEEEeeceEEEEeecceEEEecchh--------------hh Confidence 4457899999998643 357999998765 457789999999999999999999988754 48 Q ss_pred hcCcEEEEEEEEeccEEeccCceEEEEeecCCC--CCCCCCC Q lcl|Aclame:pro 285 EQDMVALRATMHVALHIADDKAFAKLVPADKRT--DSVPGEV 324 (324) Q Consensus 285 ~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~--~~~~~~~ 324 (324) .+|++.||+..|+|+++.|++||++|+.++..+ ..+|..| T Consensus 338 ~~~~~~~r~~~r~dg~v~~~~A~~~l~~~~~~~~~~~~~~~~ 379 (390) T protein:vir:40 338 LDDETLYYAKQYANGRPKDNSSFLVFDITGLEGSPAIDVNVV 379 (390) T ss_pred hcCcEEEEEEEEeCCEEecccceEEEEeeccCCCCCCCccee Confidence 999999999999999999999999998777743 4555544 No 76 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=100.00 E-value=1.1e-51 Score=299.91 Aligned_cols=294 Identities=12% Similarity=0.107 Sum_probs=233.8 Q ss_pred CchhHHHHHHHHHHHhh---hhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASN---NVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWA 77 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~---~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~ 77 (324) -.+..+.....+.|+.. .....+.++ ..++++||.+||++++.+|++.+++.++|++++++++++++.+++|+.. T Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ra--~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~ 164 (421) T protein:vir:13 87 DSKEEKRSLQLSAMSKTIRGIQLSEEERD--IMSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVIPVNRNAGKMPVRA 164 (421) T ss_pred chhHHHHHHHHHHHHHhhhccchhHHHhh--ccccCCcceecchhhHHHHHHHHHhhhhhhhhceeeeccCCceEEEEee Confidence 00011111112222211 111112222 2455678899999999999999999999999999999999999999987 Q ss_pred CCcc--eeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCcccc Q lcl|Aclame:pro 78 DKPG--AYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPF 155 (324) Q Consensus 78 ~~~~--a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~ 155 (324) ..+. +.|++|++.+|+++++|++++++++|++++++||+|+++|+.++++++|.++|++++..++|.++++.. T Consensus 165 ~~~~~~~~~~~E~~~~~~s~~~f~~i~~~~~k~~~~v~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~~~----- 239 (421) T protein:vir:13 165 GASVDKLANLAKDTELVKAMLKTQPMAYDIDDYGLLAPIDNSLLEDSEINFLEFVNEEFAEFAVNTENAEIVKQA----- 239 (421) T ss_pred cCCccceeeccccccccccccceeEEEeeeeeeEeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHhhhhHhhhh----- Confidence 6544 577999999999999999999999999999999999999999999999999999999999998887421 Q ss_pred ccccccccccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc---CCcceeecceeE Q lcl|Aclame:pro 156 GKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD---RNSDSLDGLPVV 232 (324) Q Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~---~~~~~l~G~pv~ 232 (324) .++. ..++..++++|.+++.++...++.+++|+||+++|..|++++|++|+|+|.+ +.+++|+|+||+ T Consensus 240 -~g~~--------~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~tl~G~pV~ 310 (421) T protein:vir:13 240 -KAVL--------AEETINDYAGLVKTINSLVPNARKRAIIVTNSDGRAYLDGLMDKQGRPLLKELSDGGDLVFKGRPVI 310 (421) T ss_pred -hhcc--------ccccccchHHHHHHHHHhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeecCcCCCCCceecceeeE Confidence 1111 1123346899999999999999999999999999999999999999999964 456789999999 Q ss_pred eecCCCCC---CceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceE Q lcl|Aclame:pro 233 NLKSSNLK---RGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFA 308 (324) Q Consensus 233 ~~~~~~~~---~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~ 308 (324) ++++++.+ ...+++|||+. +.++.+++++++++++. .|++|++.||++.|+|+++.+++||+ T Consensus 311 ~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~--------------~f~~~~~~~r~~~r~d~~~~~~~a~~ 376 (421) T protein:vir:13 311 ELEESIFDVGDETKFIVSDFKTLIKFMDRKQYLIDQSKEA--------------GYTKNETIARIIERFDVNSPLDKSSD 376 (421) T ss_pred EeccccccCCCceEEEEEeccccEEEEEecceEEEeeccc--------------ccccCeeEEEEEeeecceeecchhhh Confidence 88776543 34679999997 56899999999988753 49999999999999999999999998 Q ss_pred EEEeecCCCCCCCCCC Q lcl|Aclame:pro 309 KLVPADKRTDSVPGEV 324 (324) Q Consensus 309 ~l~~~~~~~~~~~~~~ 324 (324) .+....++..++.-++ T Consensus 377 ~~~~~~~~a~v~~~~~ 392 (421) T protein:vir:13 377 AEKIRKFGVIVKLQEV 392 (421) T ss_pred eeeecccceeeccccc Confidence 8877765544443332 No 77 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=100.00 E-value=6.7e-51 Score=295.65 Aligned_cols=286 Identities=14% Similarity=0.062 Sum_probs=226.3 Q ss_pred CchhHHHHHHHHHHH-------------hhhhh----HHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcce Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFA-------------SNNVK----PQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKY 63 (324) Q Consensus 1 ~~~~~~~k~~~~~~a-------------~~~~~----~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~ 63 (324) +++...+.+.++... ..... ..........+..+||++||+++.+.|++.+++.+++++++++ T Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~ 164 (394) T protein:vir:97 85 KTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTV 164 (394) T ss_pred HHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHhhhhhhhhccccccccccccChHHHHHHHHHHhhhhhhhhhhcee Confidence 111111111111110 00000 0001111224556688899999999999999999999999999 Q ss_pred eecCCCceEEEEEe-CCcceeeeccCccccc-cccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 64 EPMEGTEKKFTFWA-DKPGAYWVGEGQKIET-SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKK 141 (324) Q Consensus 64 ~~~~~~~~~ip~~~-~~~~a~~v~Eg~~~~~-~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~ 141 (324) ++++++..++|+.. ++..+.|++|++.+|+ +.++|+++++.++|++++++||+|+++|+.++++++|.+.++++++.+ T Consensus 165 ~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~i~is~ell~ds~~~~~~~i~~~la~~~~~~ 244 (394) T protein:vir:97 165 YQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNT 244 (394) T ss_pred eeccCcceEEEEEecCCCccceecccccccccccccceeEEeehhheeeehhhHHHHHhhhhHHHHHHHHHHHHHHHHHH Confidence 99999889999976 4567899999999997 569999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhccCccccccccccccccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc- Q lcl|Aclame:pro 142 FDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD- 220 (324) Q Consensus 142 ~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~- 220 (324) +|.++++|.++... .+..+++++.+++....+. ..++.|+|||++|..|++++|++|+|+|.+ T Consensus 245 ~~~~i~~g~~~~~~---------------~~~~~~~~~~~~~~~~~~~-~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~ 308 (394) T protein:vir:97 245 TNDAIAKVLKSFTT---------------KTVKNLDEIKALLNGGFDP-AYNVSLIVSQSFYQTLDTLKDGNGRYLLQDD 308 (394) T ss_pred HHHHHhhccccccc---------------cccccHHHHHHHHHhhhhh-hhCCEEEEcHHHHHHHHHhhccCCCeeeecC Confidence 99999998765421 1334578888888765443 345789999999999999999999999964 Q ss_pred ---CCcceeecceeEeecCCCCCCceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEE Q lcl|Aclame:pro 221 ---RNSDSLDGLPVVNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMH 296 (324) Q Consensus 221 ---~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r 296 (324) +.+++|+|+||+++++...+.++++||||+. +.++.+++++++.+++ .++...+|+++| T Consensus 309 ~~~~~~~~l~G~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~-----------------~~~~~~~~~~~r 371 (394) T protein:vir:97 309 ITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADN-----------------EIYGQYLQAVLR 371 (394) T ss_pred cCCCCCceeccceeEEecccccCCccEEEeeccccEEEEEecceEEEEecc-----------------cccceeEEEEEE Confidence 3456899999999888889999999999987 5688899999987653 334467999999 Q ss_pred eccEEeccCceEEEEeecCCCCC Q lcl|Aclame:pro 297 VALHIADDKAFAKLVPADKRTDS 319 (324) Q Consensus 297 ~d~~v~~~~A~~~l~~~~~~~~~ 319 (324) +|+.+.+|+||++|+..+++.+. T Consensus 372 ~d~~v~~~~a~~~~~~~~~~~p~ 394 (394) T protein:vir:97 372 FGVSKVDDKAGYYVTFTPEPLPL 394 (394) T ss_pred EccEEecccceEEEEecccccCC Confidence 99999999999999997777666 No 78 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=100.00 E-value=3.5e-52 Score=302.68 Aligned_cols=299 Identities=12% Similarity=0.079 Sum_probs=234.7 Q ss_pred CchhHHHH---------------HHHHHHHhhh-----hhHHhh----ccccccccccCccccchHHHHHHHHHHHhhhh Q lcl|Aclame:pro 1 MEQTQKLK---------------LNLQHFASNN-----VKPQVF----NPDNVMMHEKKDGTLMNEFTTPILQEVMENSK 56 (324) Q Consensus 1 ~~~~~~~k---------------~~~~~~a~~~-----~~~~~~----~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~ 56 (324) .+..+.+. .+.+...... ....+. ......+.+++|++||+++.++|++.+.+.++ T Consensus 29 ee~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~ee~~~~~~~~~~~~~~~gg~~vP~~~~~~I~~~l~~~s~ 108 (377) T protein:vir:98 29 EEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHP 108 (377) T ss_pred HHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHhccCCcccCHHHHHHHHHHHhccCCCCCccccCHHHHHHHHHHHHHhhh Confidence 00000000 0000000000 000000 11123445678899999999999999999999 Q ss_pred hhhhcceeecCCCceEEEEEeCCcceeeeccCcccc-ccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHH Q lcl|Aclame:pro 57 IMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIA 135 (324) Q Consensus 57 l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~ 135 (324) ++++|+++++++. .++|+.++.+.+.|++|+++.+ +++++|+++++.+||+++.++||+|+++||.++++++|+++++ T Consensus 109 i~~~~~v~~~~~~-~~~~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~a~~~is~elL~ds~~~ie~~i~~~la 187 (377) T protein:vir:98 109 LLKVINFKNTSLR-LKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWIKQFITEQLK 187 (377) T ss_pred hhhheeeEecCcc-eEEEEecCCcceeEeecccccCcccCccceeEeecceeEEeeecccHHhhhccHhHHHHHHHHHHH Confidence 9999999988764 7999999999999999988776 5789999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhccCcccccccccccccc-cccc-----c-cchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHH Q lcl|Aclame:pro 136 EAFYKKFDEAGILNQGNNPFGKSIAQSIEK-TNKV-----I-KGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRK 208 (324) Q Consensus 136 ~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~-~~~~-----~-~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~ 208 (324) +++++++|.+|++|+|++. |.|+.+.... .... + ......+.+.++...+...++.+++|+||..++..+++ T Consensus 188 ~~~a~~~~~a~i~G~G~~q-P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~k 266 (377) T protein:vir:98 188 EAIAVALELAIVKGDGLLQ-PVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLTPDNAPKKLVPVMKHLSVNDKKR 266 (377) T ss_pred HHHHHHHhhceEeccCCCc-ceeeeecccccccccccccccccccchhhhHhhhhhhchhHHHHHHHHHHHHHHHHHHhh Confidence 9999999999999999875 4444432211 1110 1 11123466888888999999999999999999999999 Q ss_pred hhccCCceeec------------------cCCcceeecceeEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccce Q lcl|Aclame:pro 209 IVDPETKERIY------------------DRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQL 270 (324) Q Consensus 209 ~~d~~g~~~~~------------------~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~ 270 (324) +||.+|+++|. .+.+.+++|+|+.+..+..+++++++||||++|.++++++++++++++. T Consensus 267 lkd~~G~~i~~~n~~~~~~~~p~~~~~~~~G~~~t~lg~p~~vv~s~~~p~~~i~fgdf~~Y~i~~r~~~~i~~~~~~-- 344 (377) T protein:vir:98 267 PLKIAGQVKLILNPEDRWALEAQFTSRNQFGEYVTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQT-- 344 (377) T ss_pred hhccCCceEEEecccchhhccccccccCCCCccccccCCCceEEecCCCCcccEEEEEecceeEEeecceEEEeechh-- Confidence 99999999983 2334578999987666777888999999999999999999999988764 Q ss_pred eccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecC Q lcl|Aclame:pro 271 STVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) Q Consensus 271 ~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~ 315 (324) +|.+|++.||+..|+|++++|++||++|+.+-- T Consensus 345 ------------~~~~d~~~f~~~~r~dg~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 345 ------------FAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred ------------hhhcCceEEEEEEEEcCEEeccCcEEEEEEecC Confidence 489999999999999999999999999998655 No 79 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=100.00 E-value=5.3e-50 Score=290.71 Aligned_cols=293 Identities=13% Similarity=0.116 Sum_probs=230.2 Q ss_pred Cchh--HHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeC Q lcl|Aclame:pro 1 MEQT--QKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWAD 78 (324) Q Consensus 1 ~~~~--~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~ 78 (324) +... ...+..++.|........+ ++....++++||.+||++++++|++.+++.++|++++++++++++..++|+... T Consensus 84 ~~~~~~~~~~~~~~~~l~~~~~~~~-~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 162 (394) T protein:vir:10 84 LKKKPIDAKKKAINDFIHSHGKVID-NAAGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKR 162 (394) T ss_pred hhhhHHHHHHHHHHHHHhccchhhh-hhhcccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEec Confidence 1111 1112223333322222222 233456667788999999999999999999999999999999999899998764 Q ss_pred -CcceeeeccCccccc-cccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccc Q lcl|Aclame:pro 79 -KPGAYWVGEGQKIET-SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFG 156 (324) Q Consensus 79 -~~~a~~v~Eg~~~~~-~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~ 156 (324) ...+.|++|++++|+ +.++|+++++.++|++++++||+|+++||.++++++|.++|+++++.++|+++++|.|++... T Consensus 163 ~~~~~~~~~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~~ 242 (394) T protein:vir:10 163 ATDRFSSVAELAENPALAEPEFEQVDWSVSTYRGAIPLSEEAIADSAVDLTSLVGQSINEKSVNTYNAMIAPVLQSFTAK 242 (394) T ss_pred CCCccccccccccccccccccceeEEeeeeeeEeeehhHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Confidence 467899999999996 679999999999999999999999999999999999999999999999999999998864322 Q ss_pred cccccccccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeeccC--------Ccceeec Q lcl|Aclame:pro 157 KSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDR--------NSDSLDG 228 (324) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~--------~~~~l~G 228 (324) ...+..+++++.+++.......+ +++|+||+++|.+|++++|++|+|+|.++ .+++|+| T Consensus 243 ------------~~~~~~~~d~l~~~~~~~~~~~~-~a~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~L~G 309 (394) T protein:vir:10 243 ------------ATTTDTLVDSLKHILNVDLDPAY-SRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTVLG 309 (394) T ss_pred ------------cccccccHHHHHHHHHhhhhhhc-cCEEEecHHHHHHHHHhhccCCCeeeeccccccccCCccccccc Confidence 22344567888888765444443 57999999999999999999999998642 3368999 Q ss_pred ceeEeecCCCC----CCceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEec Q lcl|Aclame:pro 229 LPVVNLKSSNL----KRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIAD 303 (324) Q Consensus 229 ~pv~~~~~~~~----~~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~ 303 (324) +||+++++... ++..++||||++ +.++.++++++.++++.+ |. ..+|+..|+|+++.+ T Consensus 310 ~PV~~~~~~~~~~~~~~~~i~~gd~s~~~~~~~~~~~~v~~~~~~~--------------~~---~~~~~~~r~d~~~~~ 372 (394) T protein:vir:10 310 VPVYVVGDALLGSAAGDQKAFVGDLKRGVLFADRQQVTLAWEDSKI--------------YG---RYLGAAFRFGVKQAD 372 (394) T ss_pred ceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEEecccc--------------cc---eeEEEEEEeccEEec Confidence 99998765433 334589999997 567778999998766432 33 458999999999999 Q ss_pred cCceEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 304 DKAFAKLVPADKRTDSVPGEV 324 (324) Q Consensus 304 ~~A~~~l~~~~~~~~~~~~~~ 324 (324) |+||+.|+..++..+.+-|.= T Consensus 373 ~~ai~~~~~~~~~~~~~~~~~ 393 (394) T protein:vir:10 373 SNAGYFVTNTDAASGSTSGTG 393 (394) T ss_pred cccEEEEEeecccCCCCCCCC Confidence 999999998888877777666 No 80 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=100.00 E-value=4.3e-50 Score=291.25 Aligned_cols=301 Identities=13% Similarity=0.083 Sum_probs=228.8 Q ss_pred CchhHHHHHHHHHHHhhhhh--------H-Hhhccccccc-cccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCc Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVK--------P-QVFNPDNVMM-HEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTE 70 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~--------~-~~~~~~~~~~-~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~ 70 (324) ......++............ . .......... .+.++.++|..+...|....+..+.++++++++++.++. T Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~~~~~~ 167 (419) T protein:vir:94 88 FADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNV 167 (419) T ss_pred hhhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHhhhhhhhhcceeeeccCCc Confidence 11111111111111000000 0 0011111112 223334566666666777778888999999999999999 Q ss_pred eEEEEEeC--------CcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 71 KKFTFWAD--------KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKF 142 (324) Q Consensus 71 ~~ip~~~~--------~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~ 142 (324) +++|+.++ ...+.|++||+.+|+++++|+++++.++|++++++||+|+++|+ .+++++|.++|+++++.++ T Consensus 168 ~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~l~~~i~~~la~a~~~~~ 246 (419) T protein:vir:94 168 LEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN-SQLMGYIQGRLTYGLRFLR 246 (419) T ss_pred eeeeeeccccccccccCcccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHHHHHHHHH Confidence 99988654 23578999999999999999999999999999999999999986 5899999999999999999 Q ss_pred HHHHHhccCcccccccccccccc-------ccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCc Q lcl|Aclame:pro 143 DEAGILNQGNNPFGKSIAQSIEK-------TNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETK 215 (324) Q Consensus 143 d~~~l~G~g~~~~~~~~~~~~~~-------~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~ 215 (324) |.++|+|+|++.+. |+.+.... ......+...++++.+++..+...++.+++|+||++++..|++++|.+|+ T Consensus 247 d~aii~G~G~~~p~-Gi~~~~~~~~~~~~~~~~~~t~~~~~~~l~~~~~~~~~~~~~~~~~v~n~~~~~~l~~~k~~~~~ 325 (419) T protein:vir:94 247 DRQLLNGNGSTEMQ-GILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSG 325 (419) T ss_pred HHHHHhccCccccc-ceecccccccccccccccccccchhHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHHhhcCCC Confidence 99999999987544 44332211 11122345568999999999999999999999999999999999998776 Q ss_pred ee-ec----cCCcceeecceeEeecCCCCCCceeEEeecccE-EEEEecceEEEEeeccceeccccccccchhhhhcCcE Q lcl|Aclame:pro 216 ER-IY----DRNSDSLDGLPVVNLKSSNLKRGELITGDFDKL-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMV 289 (324) Q Consensus 216 ~~-~~----~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v 289 (324) +. ++ .+.+++|+|+||+++.. ++++++++|||+++ .++++++++++++++.. ++|++|++ T Consensus 326 ~~~~~~~~~~~~~~~l~G~pV~~~~~--~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~------------~~~~~~~~ 391 (419) T protein:vir:94 326 VFRVIANVQGEATPRIWGLNVVSTVA--IAQGTALVGGFRQGATLWSRQGITVLMTDSHA------------DFFTANTL 391 (419) T ss_pred ceeecCCcccCCCccccceeeEEcCC--CCCccEEEeeccceEEEEEecceEEEEecccc------------chhhcCcE Confidence 54 33 34567999999999765 56788999999975 57788999998876532 46999999 Q ss_pred EEEEEEEeccEEeccCceEEEEeecCCC Q lcl|Aclame:pro 290 ALRATMHVALHIADDKAFAKLVPADKRT 317 (324) Q Consensus 290 ~~r~~~r~d~~v~~~~A~~~l~~~~~~~ 317 (324) +||++.|+|+++.+|+||++++.+++++ T Consensus 392 ~~r~~~r~d~~v~~~~a~~~~~~~aa~~ 419 (419) T protein:vir:94 392 VILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) T ss_pred EEEEEEeeccEEeccccEEEEEeccCCC Confidence 9999999999999999999999999999 No 81 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=100.00 E-value=1.1e-49 Score=288.90 Aligned_cols=289 Identities=13% Similarity=0.130 Sum_probs=227.4 Q ss_pred CchhHHH--HHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeC Q lcl|Aclame:pro 1 MEQTQKL--KLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWAD 78 (324) Q Consensus 1 ~~~~~~~--k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~ 78 (324) +++.+.. +..++.|.... ..+.++....++++||++||++++.+|++.+++.+++++++++++++++..++|+... T Consensus 83 ~~~~~~~~~~~~~~~~lr~~--~~~~~~~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 160 (389) T protein:vir:10 83 LSKKPIDAKKKAINDFIHSH--GKVIDATSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKR 160 (389) T ss_pred cchhHHHHHHHHHHHHhhcc--hhhhhhhcccccCCcceeehHHHHHHHHHHHHhhhhHHhhcceeeccCCeeEEEEEec Confidence 2222111 11122222111 1223444556677889999999999999999999999999999999999999999864 Q ss_pred -CcceeeeccCccccc-cccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccc Q lcl|Aclame:pro 79 -KPGAYWVGEGQKIET-SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFG 156 (324) Q Consensus 79 -~~~a~~v~Eg~~~~~-~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~ 156 (324) ...+.|++|++++|+ ++++|+++++.++|+++++++|+|+++||.++++++|.+.|+++++.++|.+|++|.++... T Consensus 161 ~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~- 239 (389) T protein:vir:10 161 ATDRFSSVAELAENPKLAEPEFNKVDWSVATYRGAIPLSEEAIADSAVDLTALVGQSIKEKSVNTYNAMIAPVLQSFTA- 239 (389) T ss_pred CCCccccccccccccccccccceeeeeeheeeEeeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc- Confidence 455689999999985 78999999999999999999999999999999999999999999999999999998876432 Q ss_pred cccccccccccccccchhhhhHHHHHHHH-hhhhcCCCcEEEEcHHHHHHHHHhhccCCceeeccC--------Ccceee Q lcl|Aclame:pro 157 KSIAQSIEKTNKVIKGDFTQDNIIDLEAL-LEDDELEANAFISKTQNRSLLRKIVDPETKERIYDR--------NSDSLD 227 (324) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~-l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~--------~~~~l~ 227 (324) ....+..+++++.++++. +...+ +++|+||+++|..|+++||++|+|+|.++ .+++|+ T Consensus 240 -----------~~~~~~~~~d~l~~~~~~~~~~~~--~a~~~~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~l~ 306 (389) T protein:vir:10 240 -----------KKTTTDTLVDSLKHILNVDLDPAY--SRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTIL 306 (389) T ss_pred -----------ccccccccHHHHHHHHHhhhhhhh--CcEEEecHHHHHHHHHhhccCCCeeeecCcccccccccccccc Confidence 122345568888888764 44433 57999999999999999999999999643 235899 Q ss_pred cceeEeecCCCC----CCceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEe Q lcl|Aclame:pro 228 GLPVVNLKSSNL----KRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIA 302 (324) Q Consensus 228 G~pv~~~~~~~~----~~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~ 302 (324) |+||+++++... ++..++||||++ +.+++++++++.++++.+ |. ..+|+..|+|+++. T Consensus 307 G~pV~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--------------~~---~~~~~~~r~d~~~~ 369 (389) T protein:vir:10 307 GVPVYVVGDTLLGSLAGDQKAFVGDLKRGVLFTDRQQVTLAWEDSKI--------------YG---KYLGAAFRFGVQKA 369 (389) T ss_pred cceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEeecccc--------------cc---ceEEEEEEeccEEe Confidence 999988765432 233489999997 578889999999876432 33 35899999999999 Q ss_pred ccCceEEEEeecCCCCCCCCC Q lcl|Aclame:pro 303 DDKAFAKLVPADKRTDSVPGE 323 (324) Q Consensus 303 ~~~A~~~l~~~~~~~~~~~~~ 323 (324) +|+||++++.. ..+.++||+ T Consensus 370 ~~~a~~~~~~~-~~~~~~~~~ 389 (389) T protein:vir:10 370 DSKAGYFVTNT-DVPGSALGK 389 (389) T ss_pred cccceEEEEee-ccCCCCCCC Confidence 99999999864 456678888 No 82 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=100.00 E-value=8.8e-50 Score=289.53 Aligned_cols=283 Identities=13% Similarity=0.094 Sum_probs=221.9 Q ss_pred Cchh-----------------HHHHHHHHHHHhhhhhHHhhc-c-ccccccccCccccchHHHHHHHHHHHhhhhhhhhc Q lcl|Aclame:pro 1 MEQT-----------------QKLKLNLQHFASNNVKPQVFN-P-DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLG 61 (324) Q Consensus 1 ~~~~-----------------~~~k~~~~~~a~~~~~~~~~~-~-~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~ 61 (324) .+.. ...+...+.+........+.+ . ...+++++||.+||+++.++|++.+++.+++++++ T Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~ 168 (400) T protein:vir:38 89 HSYRDALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNAGVKAADAASTIPETISNTPQRELQTVVDLKPFT 168 (400) T ss_pred hhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhcccccCCcccccHHHHHHHHHHHHhhhhhhhcc Confidence 0000 000111111111111111111 1 12235667889999999999999999999999999 Q ss_pred ceeecCCCceEEEEEeC-CcceeeeccCccccc-cccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 62 KYEPMEGTEKKFTFWAD-KPGAYWVGEGQKIET-SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFY 139 (324) Q Consensus 62 ~~~~~~~~~~~ip~~~~-~~~a~~v~Eg~~~~~-~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~ 139 (324) ++++++++..++|+.+. .+.+.|++|++..|+ +.++|+++++.++|++++++||+|+++||.++++++|.+.++++++ T Consensus 169 ~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~ 248 (400) T protein:vir:38 169 NVFQASTQKGTYPTVANATTKMVTVAELEKNPAMAKPEFKPVNWSVETYRQALPVSQESIDDSAIDLVGLIAQNGQQIKV 248 (400) T ss_pred eeEeccCcceEEEEEecCCCccccccccccccccccccceeeEeehhheeeehhhHHHHHhhhHHHHHHHHHHHHHHHHH Confidence 99999999999999864 567899999999986 6799999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhccCccccccccccccccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeec Q lcl|Aclame:pro 140 KKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIY 219 (324) Q Consensus 140 ~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~ 219 (324) .++|.++++|+|+... .+..+++++.+++....+.. .+++|+|||+++.+|++++|++|+|+|. T Consensus 249 ~~~~~~i~~~~~~~~~---------------~~~~~~~~~~~~~~~~~~~~-~~a~~v~~~~~~~~l~~lkd~~G~~i~~ 312 (400) T protein:vir:38 249 NTTNGAVATLLKGFTA---------------KTISSVDDLKHINNVDLDPA-YSRVIIASQSFYNFLDTVKDGNGRYLLQ 312 (400) T ss_pred HHHHHhhhhccccccc---------------cccccHHHHHHHHHhhhhhh-hCcEEEEcHHHHHHHHHhhccCCCeeee Confidence 9999999999876432 12335778888877554433 3689999999999999999999999996 Q ss_pred c----CCcceeecceeEeecCCCC---CCceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEE Q lcl|Aclame:pro 220 D----RNSDSLDGLPVVNLKSSNL---KRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVAL 291 (324) Q Consensus 220 ~----~~~~~l~G~pv~~~~~~~~---~~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ 291 (324) + +.+++|+|+||+++++++. ++..++||||++ +.++.++++++.++++. .+...| T Consensus 313 ~~~~~~~~~~l~G~pv~~~~~~~~~~~g~~~~~~gd~s~~~~~~~~~~~~~~~~~~~-----------------~~~~~~ 375 (400) T protein:vir:38 313 DSILTPSGKSVLGMPIAVVSDDTLGAAGEAHAFLGDIKRAILFANRADFMVRWVDDQ-----------------IYGQFL 375 (400) T ss_pred cCcCCCCccccccceeEEecccccCCCCceEEEEEeccccEEEEeecceEEEEeccc-----------------ccceeE Confidence 4 4567899999999876654 344589999997 46777999999887643 233579 Q ss_pred EEEEEeccEEeccCceEEEEeecCC Q lcl|Aclame:pro 292 RATMHVALHIADDKAFAKLVPADKR 316 (324) Q Consensus 292 r~~~r~d~~v~~~~A~~~l~~~~~~ 316 (324) |+.+|+|+.+.||+||++|+.+++. T Consensus 376 ~~~~r~d~~~~~~~a~~~l~~~~~a 400 (400) T protein:vir:38 376 QAGMRFGVSVADEKAGYFLTYTPKA 400 (400) T ss_pred EEEEEeccEEecccceEEEEeecCC Confidence 9999999999999999999998877 No 83 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=100.00 E-value=7e-50 Score=290.07 Aligned_cols=293 Identities=12% Similarity=0.039 Sum_probs=226.7 Q ss_pred CchhHHHHHHHH-----HHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEE Q lcl|Aclame:pro 1 MEQTQKLKLNLQ-----HFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTF 75 (324) Q Consensus 1 ~~~~~~~k~~~~-----~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~ 75 (324) .+...+.....+ .|.. .....+.+.....+++++|++||+++.+.|. .++..+.+++++++++++++.+++|+ T Consensus 126 ~~~~~~~~~~~~~~~~~~~~~-~~~~~e~~~~~~~~~~~~g~lvp~~~~~~i~-~~~~~~~l~~~~~~~~~~~~~~~~~~ 203 (437) T protein:vir:10 126 QDMKLKVGGEIADKKVTAFAD-YLKTGEVRDVTGIALKDGKVIIPETILTPEK-EVHQFPRLGSLVRTESVTTTTGKLPI 203 (437) T ss_pred hHHHHHHHHHHHHhhhhhhHH-HHHhhhhhhhhhcccccccccchHHHHHHHH-HhhhhhhhhhcceeEeeccCceeeEE Confidence 000000000000 0110 1111223344455677899999999987765 56788899999999999998899999 Q ss_pred Ee-CCcceeeeccCccccc-cccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCcc Q lcl|Aclame:pro 76 WA-DKPGAYWVGEGQKIET-SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNN 153 (324) Q Consensus 76 ~~-~~~~a~~v~Eg~~~~~-~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~ 153 (324) .. ..+.+.|++|++..|+ +.++|+++++.++|++++++||+|+++|+.++++++|.+.|+++++.++|.++++|+|++ T Consensus 204 ~~~~~~~~~~~~e~~~~~e~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~ 283 (437) T protein:vir:10 204 FNNSTDLLTAHTEYGQTTKNATPVITPILWDLKTYTGGYVFSQELISDSSYDWQAELQSRLIELRDNTDDSLIITALTDG 283 (437) T ss_pred eeccccccccccccccccccccccceeeeeehhheeeehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhccc Confidence 85 4567899999999996 568999999999999999999999999999999999999999999999999999999865 Q ss_pred ccccccccccccccccccchhhhhHHHHHHH-HhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc----CCcceeec Q lcl|Aclame:pro 154 PFGKSIAQSIEKTNKVIKGDFTQDNIIDLEA-LLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD----RNSDSLDG 228 (324) Q Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~-~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~----~~~~~l~G 228 (324) .+. ..+..+++++.+++. .+...+..+++|+||++++..|++++|++|+|+|.+ +.+++|+| T Consensus 284 ~~~-------------~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G 350 (437) T protein:vir:10 284 IKK-------------TTSTYLLGDLKKVLNVTLKPQDSAAASIVMSQSAYNLFDMATDAMGRPLLQPNVTAATGYTLLG 350 (437) T ss_pred ccc-------------cccccchhhHHHHHHhhhhhhhhcCCEEEEcHHHHHHHHHhhccCCCeeeccCccCCCCccccc Confidence 321 123345677777764 788889899999999999999999999999999964 45678999 Q ss_pred ceeEeecCCCC-----CCceeEEeecccE-EEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEe Q lcl|Aclame:pro 229 LPVVNLKSSNL-----KRGELITGDFDKL-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIA 302 (324) Q Consensus 229 ~pv~~~~~~~~-----~~~~~i~gd~s~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~ 302 (324) +||++++++.. ++..++||||+.+ .++++.++++..++ .|..+.+.+|+..|+|++++ T Consensus 351 ~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~r~~~~~~~~~----------------~~~~~~~~~~~~~r~d~~~~ 414 (437) T protein:vir:10 351 KTVVIVDDKLFPSASAGDVNIVVAPLKKAVINFKLTEITGQFQD----------------TYDIWYKQLGIFLRQNVVQA 414 (437) T ss_pred ceeEEecccccCCcCCCceEEEEeeccccEEEEeeeceEEEEec----------------ccccccceeeEEEEEccEEe Confidence 99998766433 3445899999974 57889999887653 14556678999999999999 Q ss_pred ccCceEEEEeec-CCCCCCCCCC Q lcl|Aclame:pro 303 DDKAFAKLVPAD-KRTDSVPGEV 324 (324) Q Consensus 303 ~~~A~~~l~~~~-~~~~~~~~~~ 324 (324) ||+||++|++.. +.++..|.-| T Consensus 415 ~~~a~~~l~~~~~~~~~~~~~~~ 437 (437) T protein:vir:10 415 SKDLIVNLTGKLKAVTVVQSTAV 437 (437) T ss_pred cccceEEEEeeccccccCCCCCC Confidence 999999999764 4455566666 No 84 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=100.00 E-value=6e-50 Score=290.44 Aligned_cols=301 Identities=14% Similarity=0.062 Sum_probs=224.3 Q ss_pred Cc-----hhHHHHHHHHHHHhhhhhH--------HhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecC Q lcl|Aclame:pro 1 ME-----QTQKLKLNLQHFASNNVKP--------QVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPME 67 (324) Q Consensus 1 ~~-----~~~~~k~~~~~~a~~~~~~--------~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~ 67 (324) ++ ...+.+.+.+.+....... +.++.....++++||++||++++++|++.+++.|+++++|++++++ T Consensus 37 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~ 116 (381) T protein:vir:10 37 INQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG 116 (381) T ss_pred HHhhhhhHHHHHHHHHHHHHHhccCcccccHHHHHHHHHHhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecC Confidence 11 0111111122221111000 1122334455668899999999999999999999999999999987 Q ss_pred CCceEEEEEeCCcceeeeccCcccc-ccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 68 GTEKKFTFWADKPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAG 146 (324) Q Consensus 68 ~~~~~ip~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~ 146 (324) +. .+||+.++.+.+.|++|+++++ +++++|+++++.+||++++++||+|+++|+.++++++|.++++++++.++|++| T Consensus 117 ~~-~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~ 195 (381) T protein:vir:10 117 LR-LKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAF 195 (381) T ss_pred cc-eEEEEecCCcceeeecccccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhhee Confidence 64 7899999899999999998876 568999999999999999999999999999999999999999999999999999 Q ss_pred HhccCccccccccccccccc---c------ccccchh-------hhhHHHHHHHHhhh-------hcCCCcEEEEcHHHH Q lcl|Aclame:pro 147 ILNQGNNPFGKSIAQSIEKT---N------KVIKGDF-------TQDNIIDLEALLED-------DELEANAFISKTQNR 203 (324) Q Consensus 147 l~G~g~~~~~~~~~~~~~~~---~------~~~~~~~-------~~~~i~~~~~~l~~-------~~~~~~~~v~~~~~~ 203 (324) ++|+|++.+. |+....... . ....+.+ .++.+.+++..+.. .+..++.|+||+.++ T Consensus 196 i~G~G~~qP~-Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~ 274 (381) T protein:vir:10 196 LKGTGKDQPI-GLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDA 274 (381) T ss_pred EeccCCCCce-eeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccH Confidence 9999987654 443321110 0 0111222 23445555555532 345667899999999 Q ss_pred HHHHHhh---ccCCceeeccCCcceeecceeEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccc Q lcl|Aclame:pro 204 SLLRKIV---DPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTP 280 (324) Q Consensus 204 ~~l~~~~---d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 280 (324) ..|+.++ +.+|+|++..+ .|.+|+.+ ..+++++++||||++|.++++++++++++++. T Consensus 275 ~~l~~~~~~~~~~G~~v~~l~-----~g~~vv~s--~~~p~~~iifgDfs~Y~i~~r~~~~i~~~~~~------------ 335 (381) T protein:vir:10 275 FEVQAQYTHLNANGVYVTALP-----FNLNVIES--TVQEAGKVLTYVKGLYDGYLAGGINVQKFKET------------ 335 (381) T ss_pred HhhccccccCCCCCceeecCC-----CCceEEec--CCCCcCcEEEEecccEEEEEecccEEEeechh------------ Confidence 8887665 55677765422 24445554 45678889999999999999999999998764 Q ss_pred hhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCC-CCCCCCCC Q lcl|Aclame:pro 281 VNLFEQDMVALRATMHVALHIADDKAFAKLVPADKR-TDSVPGEV 324 (324) Q Consensus 281 ~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~-~~~~~~~~ 324 (324) +|.+|++.||+..|+|++++|++||+.++.+... +.++|+.= T Consensus 336 --~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~~~~~~~~ 378 (381) T protein:vir:10 336 --LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTE 378 (381) T ss_pred --HhhcCCeEEEEEEEEcCEEecCceEEEEEEEecCCCcCccccc Confidence 4999999999999999999999999998876655 66666666 No 85 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=100.00 E-value=6e-50 Score=290.44 Aligned_cols=301 Identities=14% Similarity=0.062 Sum_probs=224.3 Q ss_pred Cc-----hhHHHHHHHHHHHhhhhhH--------HhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecC Q lcl|Aclame:pro 1 ME-----QTQKLKLNLQHFASNNVKP--------QVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPME 67 (324) Q Consensus 1 ~~-----~~~~~k~~~~~~a~~~~~~--------~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~ 67 (324) ++ ...+.+.+.+.+....... +.++.....++++||++||++++++|++.+++.|+++++|++++++ T Consensus 37 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~ 116 (381) T protein:vir:95 37 INQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMDINKNVNYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAG 116 (381) T ss_pred HHhhhhhHHHHHHHHHHHHHHhccCcccccHHHHHHHHHHhcccCCCCceecCHHHHHHHHHHHHhhccceeheeeEecC Confidence 11 0111111122221111000 1122334455668899999999999999999999999999999987 Q ss_pred CCceEEEEEeCCcceeeeccCcccc-ccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 68 GTEKKFTFWADKPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAG 146 (324) Q Consensus 68 ~~~~~ip~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~ 146 (324) +. .+||+.++.+.+.|++|+++++ +++++|+++++.+||++++++||+|+++|+.++++++|.++++++++.++|++| T Consensus 117 ~~-~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~ 195 (381) T protein:vir:95 117 LR-LKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAF 195 (381) T ss_pred cc-eEEEEecCCcceeeecccccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhhee Confidence 64 7899999899999999998876 568999999999999999999999999999999999999999999999999999 Q ss_pred HhccCccccccccccccccc---c------ccccchh-------hhhHHHHHHHHhhh-------hcCCCcEEEEcHHHH Q lcl|Aclame:pro 147 ILNQGNNPFGKSIAQSIEKT---N------KVIKGDF-------TQDNIIDLEALLED-------DELEANAFISKTQNR 203 (324) Q Consensus 147 l~G~g~~~~~~~~~~~~~~~---~------~~~~~~~-------~~~~i~~~~~~l~~-------~~~~~~~~v~~~~~~ 203 (324) ++|+|++.+. |+....... . ....+.+ .++.+.+++..+.. .+..++.|+||+.++ T Consensus 196 i~G~G~~qP~-Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~ 274 (381) T protein:vir:95 196 LKGTGKDQPI-GLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDA 274 (381) T ss_pred EeccCCCCce-eeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccH Confidence 9999987654 443321110 0 0111222 23445555555532 345667899999999 Q ss_pred HHHHHhh---ccCCceeeccCCcceeecceeEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccc Q lcl|Aclame:pro 204 SLLRKIV---DPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTP 280 (324) Q Consensus 204 ~~l~~~~---d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 280 (324) ..|+.++ +.+|+|++..+ .|.+|+.+ ..+++++++||||++|.++++++++++++++. T Consensus 275 ~~l~~~~~~~~~~G~~v~~l~-----~g~~vv~s--~~~p~~~iifgDfs~Y~i~~r~~~~i~~~~~~------------ 335 (381) T protein:vir:95 275 FEVQAQYTHLNANGVYVTALP-----FNLNVIES--TVQEAGKVLTYVKGLYDGYLAGGINVQKFKET------------ 335 (381) T ss_pred HhhccccccCCCCCceeecCC-----CCceEEec--CCCCcCcEEEEecccEEEEEecccEEEeechh------------ Confidence 8887665 55677765422 24445554 45678889999999999999999999998764 Q ss_pred hhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCC-CCCCCCCC Q lcl|Aclame:pro 281 VNLFEQDMVALRATMHVALHIADDKAFAKLVPADKR-TDSVPGEV 324 (324) Q Consensus 281 ~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~-~~~~~~~~ 324 (324) +|.+|++.||+..|+|++++|++||+.++.+... +.++|+.= T Consensus 336 --~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~~~~~~~~ 378 (381) T protein:vir:95 336 --LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTE 378 (381) T ss_pred --HhhcCCeEEEEEEEEcCEEecCceEEEEEEEecCCCcCccccc Confidence 4999999999999999999999999998876655 66666666 No 86 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=100.00 E-value=3.8e-50 Score=291.54 Aligned_cols=294 Identities=13% Similarity=0.099 Sum_probs=228.0 Q ss_pred CchhHH----HHHHHHHHHhhh-------hhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCC Q lcl|Aclame:pro 1 MEQTQK----LKLNLQHFASNN-------VKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGT 69 (324) Q Consensus 1 ~~~~~~----~k~~~~~~a~~~-------~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~ 69 (324) +...++ ..+++++..... ....+.++.+..++++||++||++++++|++.+++.++|+++++++++++ T Consensus 46 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~al~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~~~- 124 (352) T protein:vir:78 46 LNDNEKLVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG- 124 (352) T ss_pred cchhhhHHHHHHHHHHHHhhhhHHHHHHhhHHHHHHHhccCCCCCCceeccHhHHHHHHHHHHhhcchhhheeeEecCC- Confidence 111111 111122211110 11112344555667788999999999999999999999999999888765 Q ss_pred ceEEEEEe-CCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHH-HHH Q lcl|Aclame:pro 70 EKKFTFWA-DKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDE-AGI 147 (324) Q Consensus 70 ~~~ip~~~-~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~-~~l 147 (324) ..+|+.+ ..+.+.|++|++.+++++++|+++++.++|++++++||+|+++||.++++++|.++|+++++.+++. .+. T Consensus 125 -~~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~i~is~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~ 203 (352) T protein:vir:78 125 -LEIPRVSYTLDDDDFITDVETAKELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALA 203 (352) T ss_pred -ceEEEEecCCCcccccccccccccccccceeeeecceeEEeechhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 4578765 4568999999999999999999999999999999999999999999999999999999999988655 444 Q ss_pred hccCccccccccccccccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeeccCCcceee Q lcl|Aclame:pro 148 LNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLD 227 (324) Q Consensus 148 ~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~~~~~l~ 227 (324) .|+|++.+.+++...... ..++..++++|+++++.|...++.+++|+||+.++..|.++++.+|++++.+ .+.+|+ T Consensus 204 ~g~g~~~~~g~l~~~~~~---~~t~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~~~~~-~~~~ll 279 (352) T protein:vir:78 204 VSPKSGLEHMSFYNGSVK---EVEGANMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDT-PAEKVF 279 (352) T ss_pred cCCCCcccccceeccccc---cccccchHHHHHHHHhccChhhhcCCEEEEehHHHHHHHHHHhccCCccccc-CCcccc Confidence 677766655554442222 2234456899999999999999999999999999999999999999999864 456899 Q ss_pred cceeEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCce Q lcl|Aclame:pro 228 GLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAF 307 (324) Q Consensus 228 G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~ 307 (324) |+||+++.+ ...++||||+++++. +.++.++.+++ ..++++.|++..|+|+++.+|+|| T Consensus 280 G~PV~~~~~----~~~~~~Gdf~~~~~~-~~~~~~~~~~~----------------~~~g~~~f~~~~r~Dg~~~~~eA~ 338 (352) T protein:vir:78 280 GKPVVFTDA----AVKPIVGDFNYFGIN-YDGTTYDTDKD----------------VKKGEYLFVLTAWYDQQRTLDSAF 338 (352) T ss_pred ccceEEecC----CCceeEeehhhhhhh-hhhheeeeecc----------------ccCCeeEEEEEeeeCceeechhhe Confidence 999998764 346899999987664 45566555443 236889999999999999999999 Q ss_pred EEEEeecCCCCCCC Q lcl|Aclame:pro 308 AKLVPADKRTDSVP 321 (324) Q Consensus 308 ~~l~~~~~~~~~~~ 321 (324) +.|+.+++.+..|- T Consensus 339 ~~l~~~a~~~~~~~ 352 (352) T protein:vir:78 339 RIAKAKESTGSLPS 352 (352) T ss_pred EEEEeecccCCCCC Confidence 99999888766665 No 87 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=100.00 E-value=9.5e-50 Score=289.34 Aligned_cols=294 Identities=12% Similarity=0.091 Sum_probs=227.3 Q ss_pred CchhHHHHHHHHHHHhhhhhH-------HhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEE Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKP-------QVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKF 73 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~-------~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~i 73 (324) ++......++++.+....... .+.++.+..++++||++||++++++|++.+++.++|+++++++++++ ..+ T Consensus 85 ~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~ 162 (387) T protein:vir:26 85 EKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEI 162 (387) T ss_pred HHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC--cee Confidence 111112223334332211111 22344455666778999999999999999999999999999988865 457 Q ss_pred EEEe-CCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHH-hccC Q lcl|Aclame:pro 74 TFWA-DKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGI-LNQG 151 (324) Q Consensus 74 p~~~-~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l-~G~g 151 (324) |+.. +...+.|++|++.+++++++|+++++.++|++++++||+|+++||.++++++|.++|+++++.+++..+| .|+| T Consensus 163 p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g 242 (387) T protein:vir:26 163 PRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPK 242 (387) T ss_pred eeeeccCCccccccccccccccccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCC Confidence 8765 4577999999999999999999999999999999999999999999999999999999999999776544 5666 Q ss_pred ccccccccccccccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeeccCCcceeeccee Q lcl|Aclame:pro 152 NNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPV 231 (324) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~~~~~l~G~pv 231 (324) ++.+.+.+..... ...++..++++|+++++.|...|+.+++|+||+.++..+.++++..|++++. +.+.+|+|+|| T Consensus 243 ~g~~~g~~~~~~~---~~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~-~~~~~llG~PV 318 (387) T protein:vir:26 243 SGLEHMSFYNGSV---KEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD-TPAEKVFGKPV 318 (387) T ss_pred ccccceeeecccc---ccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc-cCCccccccce Confidence 6554443333211 2234566799999999999999999999999999988877777777888875 45578999999 Q ss_pred EeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEE Q lcl|Aclame:pro 232 VNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLV 311 (324) Q Consensus 232 ~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~ 311 (324) +++.+ ..+++||||+++++. +.++.++.+++ ...+++.|++..|+|+++.+|+||+.|+ T Consensus 319 ~~~~~----~~~~~~GDf~~~~~~-~~~~~~~~~~~----------------~~~~~~~~~~~~r~Dg~v~~~~A~~~l~ 377 (387) T protein:vir:26 319 VFTDA----AVKPIVGDFNYFGIN-YDGTTYDTDKD----------------VKKGEYLFVLTAWYDQQRTLDSAFRIAK 377 (387) T ss_pred EEecC----CCceeeechhhhhhh-hhhhhheeccc----------------ccCCceEEEEEEEeCcEeechhheEEEE Confidence 99765 346899999987654 45555554433 2358899999999999999999999999 Q ss_pred eecCCCCCCC Q lcl|Aclame:pro 312 PADKRTDSVP 321 (324) Q Consensus 312 ~~~~~~~~~~ 321 (324) .+++.+..|- T Consensus 378 ~ka~~~~~~~ 387 (387) T protein:vir:26 378 AKENTGPLPS 387 (387) T ss_pred eecCCCCCCC Confidence 9888766666 No 88 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=100.00 E-value=9.5e-50 Score=289.34 Aligned_cols=294 Identities=12% Similarity=0.091 Sum_probs=227.3 Q ss_pred CchhHHHHHHHHHHHhhhhhH-------HhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEE Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKP-------QVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKF 73 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~-------~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~i 73 (324) ++......++++.+....... .+.++.+..++++||++||++++++|++.+++.++|+++++++++++ ..+ T Consensus 85 ~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~ 162 (387) T protein:vir:96 85 EKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEI 162 (387) T ss_pred HHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC--cee Confidence 111112223334332211111 22344455666778999999999999999999999999999988865 457 Q ss_pred EEEe-CCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHH-hccC Q lcl|Aclame:pro 74 TFWA-DKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGI-LNQG 151 (324) Q Consensus 74 p~~~-~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l-~G~g 151 (324) |+.. +...+.|++|++.+++++++|+++++.++|++++++||+|+++||.++++++|.++|+++++.+++..+| .|+| T Consensus 163 p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g 242 (387) T protein:vir:96 163 PRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPK 242 (387) T ss_pred eeeeccCCccccccccccccccccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCC Confidence 8765 4577999999999999999999999999999999999999999999999999999999999999776544 5666 Q ss_pred ccccccccccccccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeeccCCcceeeccee Q lcl|Aclame:pro 152 NNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPV 231 (324) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~~~~~l~G~pv 231 (324) ++.+.+.+..... ...++..++++|+++++.|...|+.+++|+||+.++..+.++++..|++++. +.+.+|+|+|| T Consensus 243 ~g~~~g~~~~~~~---~~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~-~~~~~llG~PV 318 (387) T protein:vir:96 243 SGLEHMSFYNGSV---KEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD-TPAEKVFGKPV 318 (387) T ss_pred ccccceeeecccc---ccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc-cCCccccccce Confidence 6554443333211 2234566799999999999999999999999999988877777777888875 45578999999 Q ss_pred EeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEE Q lcl|Aclame:pro 232 VNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLV 311 (324) Q Consensus 232 ~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~ 311 (324) +++.+ ..+++||||+++++. +.++.++.+++ ...+++.|++..|+|+++.+|+||+.|+ T Consensus 319 ~~~~~----~~~~~~GDf~~~~~~-~~~~~~~~~~~----------------~~~~~~~~~~~~r~Dg~v~~~~A~~~l~ 377 (387) T protein:vir:96 319 VFTDA----AVKPIVGDFNYFGIN-YDGTTYDTDKD----------------VKKGEYLFVLTAWYDQQRTLDSAFRIAK 377 (387) T ss_pred EEecC----CCceeeechhhhhhh-hhhhhheeccc----------------ccCCceEEEEEEEeCcEeechhheEEEE Confidence 99765 346899999987654 45555554433 2358899999999999999999999999 Q ss_pred eecCCCCCCC Q lcl|Aclame:pro 312 PADKRTDSVP 321 (324) Q Consensus 312 ~~~~~~~~~~ 321 (324) .+++.+..|- T Consensus 378 ~ka~~~~~~~ 387 (387) T protein:vir:96 378 AKENTGPLPS 387 (387) T ss_pred eecCCCCCCC Confidence 9888766666 No 89 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=100.00 E-value=9.5e-50 Score=289.34 Aligned_cols=294 Identities=12% Similarity=0.091 Sum_probs=227.3 Q ss_pred CchhHHHHHHHHHHHhhhhhH-------HhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEE Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKP-------QVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKF 73 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~-------~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~i 73 (324) ++......++++.+....... .+.++.+..++++||++||++++++|++.+++.++|+++++++++++ ..+ T Consensus 85 ~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~--~~~ 162 (387) T protein:vir:94 85 EKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--LEI 162 (387) T ss_pred HHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhhhceeeecCC--cee Confidence 111112223334332211111 22344455666778999999999999999999999999999988865 457 Q ss_pred EEEe-CCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHH-hccC Q lcl|Aclame:pro 74 TFWA-DKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGI-LNQG 151 (324) Q Consensus 74 p~~~-~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l-~G~g 151 (324) |+.. +...+.|++|++.+++++++|+++++.++|++++++||+|+++||.++++++|.++|+++++.+++..+| .|+| T Consensus 163 p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g 242 (387) T protein:vir:94 163 PRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVSPK 242 (387) T ss_pred eeeeccCCccccccccccccccccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCC Confidence 8765 4577999999999999999999999999999999999999999999999999999999999999776544 5666 Q ss_pred ccccccccccccccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeeccCCcceeeccee Q lcl|Aclame:pro 152 NNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGLPV 231 (324) Q Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~~~~~l~G~pv 231 (324) ++.+.+.+..... ...++..++++|+++++.|...|+.+++|+||+.++..+.++++..|++++. +.+.+|+|+|| T Consensus 243 ~g~~~g~~~~~~~---~~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~-~~~~~llG~PV 318 (387) T protein:vir:94 243 SGLEHMSFYNGSV---KEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD-TPAEKVFGKPV 318 (387) T ss_pred ccccceeeecccc---ccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc-cCCccccccce Confidence 6554443333211 2234566799999999999999999999999999988877777777888875 45578999999 Q ss_pred EeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEE Q lcl|Aclame:pro 232 VNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLV 311 (324) Q Consensus 232 ~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~ 311 (324) +++.+ ..+++||||+++++. +.++.++.+++ ...+++.|++..|+|+++.+|+||+.|+ T Consensus 319 ~~~~~----~~~~~~GDf~~~~~~-~~~~~~~~~~~----------------~~~~~~~~~~~~r~Dg~v~~~~A~~~l~ 377 (387) T protein:vir:94 319 VFTDA----AVKPIVGDFNYFGIN-YDGTTYDTDKD----------------VKKGEYLFVLTAWYDQQRTLDSAFRIAK 377 (387) T ss_pred EEecC----CCceeeechhhhhhh-hhhhhheeccc----------------ccCCceEEEEEEEeCcEeechhheEEEE Confidence 99765 346899999987654 45555554433 2358899999999999999999999999 Q ss_pred eecCCCCCCC Q lcl|Aclame:pro 312 PADKRTDSVP 321 (324) Q Consensus 312 ~~~~~~~~~~ 321 (324) .+++.+..|- T Consensus 378 ~ka~~~~~~~ 387 (387) T protein:vir:94 378 AKENTGPLPS 387 (387) T ss_pred eecCCCCCCC Confidence 9888766666 No 90 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=100.00 E-value=6.9e-49 Score=284.61 Aligned_cols=303 Identities=14% Similarity=0.103 Sum_probs=224.0 Q ss_pred Cch--hHHHHHHHHHHHhhhhh-------------HHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceee Q lcl|Aclame:pro 1 MEQ--TQKLKLNLQHFASNNVK-------------PQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEP 65 (324) Q Consensus 1 ~~~--~~~~k~~~~~~a~~~~~-------------~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~ 65 (324) ++. .++.+.+.+....+... ++.++.....+..++|++||++++++|++.+++.+++++++++++ T Consensus 45 ~~~~~~~~~~~e~~~~~~~~~~~~~r~~~~l~~ee~~~~~~~~~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~~ 124 (395) T protein:vir:95 45 LSNDLQEEITAEINNRVVDNGILAKRSQDPLTSEERKFFNDINYDVGYTDEKILPETVVERVFDDLQKDHPLLSKINFQN 124 (395) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcCccccchHHHHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEe Confidence 100 01111111111110000 000122233456678899999999999999999999999999999 Q ss_pred cCCCceEEEEEeCCcceeeeccCccc-cccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 66 MEGTEKKFTFWADKPGAYWVGEGQKI-ETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDE 144 (324) Q Consensus 66 ~~~~~~~ip~~~~~~~a~~v~Eg~~~-~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~ 144 (324) +++. .++|+.++.+.+.|++|.++. ++++++|+++++.+||++++++||+|+++|+.++++++|.+.|+++++.++|+ T Consensus 125 ~~~~-~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~iS~ell~ds~~~ie~~i~~~la~~ia~~~~~ 203 (395) T protein:vir:95 125 AGIK-TRVIKADPAGQAVWGKVFGEIKGQLDAAFREENFTQYKLTCFVVLPDDLSTFGPAWIERFVRTQIQEAISVALES 203 (395) T ss_pred cCCc-eEEEEecCCcceEEeecccccCccccccceeeeeceeeEEEeecccHHHHhcchhHHHHHHHHHHHHHHHHHHhh Confidence 8764 689999999999999987766 46789999999999999999999999999999999999999999999999999 Q ss_pred HHHhccCccc-ccccccccccccc-----ccccchhhhhHHHHHHHHhhh--------------hcCCCcEEEEcHHHHH Q lcl|Aclame:pro 145 AGILNQGNNP-FGKSIAQSIEKTN-----KVIKGDFTQDNIIDLEALLED--------------DELEANAFISKTQNRS 204 (324) Q Consensus 145 ~~l~G~g~~~-~~~~~~~~~~~~~-----~~~~~~~~~~~i~~~~~~l~~--------------~~~~~~~~v~~~~~~~ 204 (324) +||+|+|++. +|.|+.+...... ...++..+++++..++..+.+ .+..+..|+||+.++. T Consensus 204 a~i~G~G~~~~qP~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~ 283 (395) T protein:vir:95 204 AIINGGGAAKTQPVGLMKDVNTNSGAVTDKASSGTLTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSW 283 (395) T ss_pred heeeccCCCCcCceeeeecccccccccccccccchhhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhh Confidence 9999999863 5666655432221 222334455555444443322 2344568999998764 Q ss_pred HHHHhhccCCceeecc--CCcceeecceeEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchh Q lcl|Aclame:pro 205 LLRKIVDPETKERIYD--RNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVN 282 (324) Q Consensus 205 ~l~~~~d~~g~~~~~~--~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 282 (324) +..|+|+|++ +.+.+++|+|+.+..+..+++++++||||++|+++++++++++++++. T Consensus 284 ------~~~g~~~~~~~~G~~~~~lg~g~~v~~~~~~p~~~i~fgdfs~y~i~~r~~~~i~~~~~~-------------- 343 (395) T protein:vir:95 284 ------DVQARYTYLTANGGFVTVLPYNVTIITSEFVPEGKLVAFVTDRYNAVRGGGLTVKKFDQT-------------- 343 (395) T ss_pred ------hcCCcceeccCCCcceeccCCcceEEEcCCCCCCcEEEEecccEEEEEecceEEEeccch-------------- Confidence 5578888865 445577766654444556778899999999999999999999988764 Q ss_pred hhhcCcEEEEEEEEeccEEeccCceEEEEee----cCCCCCCCCCC Q lcl|Aclame:pro 283 LFEQDMVALRATMHVALHIADDKAFAKLVPA----DKRTDSVPGEV 324 (324) Q Consensus 283 ~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~----~~~~~~~~~~~ 324 (324) +|.+|+++||+..|+|+++.|++||+.|+.. +..+..+||+. T Consensus 344 ~~~~d~~~f~~~~r~dg~~~~~~A~~~l~i~~~~~~~~~~~~~~~~ 389 (395) T protein:vir:95 344 LALEDAVLFTAKTFAYGQPDDNKASAVYDLKVASAPRRQTSAGGTT 389 (395) T ss_pred hhhCCcEEEEEEEEECCEEeccccEEEEEeeccCCCCCCCCCCCCC Confidence 4899999999999999999999999998875 33355555655 No 91 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=100.00 E-value=4.8e-49 Score=285.48 Aligned_cols=301 Identities=15% Similarity=0.042 Sum_probs=219.9 Q ss_pred Cc----------------------hhHHHHHHHHHHH-hhhh-------hHHhhccccccccccCccccchHHHHHHHHH Q lcl|Aclame:pro 1 ME----------------------QTQKLKLNLQHFA-SNNV-------KPQVFNPDNVMMHEKKDGTLMNEFTTPILQE 50 (324) Q Consensus 1 ~~----------------------~~~~~k~~~~~~a-~~~~-------~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~ 50 (324) |+ ...+.+.+.++.. ..+. ..+.++..+..+..+||++||+++.++|++. T Consensus 20 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~l~~~e~~~~~~~~~~t~~~Gg~lvP~~~~~~I~~~ 99 (381) T protein:vir:10 20 VNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQTLSANQRNFFMDINKSVGYKEEKLLPEETIDRIFED 99 (381) T ss_pred HHhhhHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHhcccccccCHHHHHHHHHHhhcCCCCCceecCHHHHHHHHHH Confidence 00 0000111111110 0000 0011223345566778999999999999999 Q ss_pred HHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccCcccc-ccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHH Q lcl|Aclame:pro 51 VMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEE 129 (324) Q Consensus 51 ~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~ 129 (324) +.+.|+++++|+++++++ ..++|+.++.+.+.|++|.++.+ +++++|+++++.+||++++++||+|+++|+.++++++ T Consensus 100 l~~~spir~~a~v~~~~~-~~~i~~~~~~~~a~W~~e~~~~~~~~~~~f~~i~l~~~kl~a~i~is~elL~Ds~~~le~~ 178 (381) T protein:vir:10 100 LTTNHPLLADLGIKNAGL-RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERF 178 (381) T ss_pred HHhhcceeeeeeeEecCc-ceEEEeecCCcceEEeecccccccccCccceeEeecceeEEeeccccHHHHhccHHHHHHH Confidence 999999999999998865 47899998889999999988765 5689999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHhccCccccccccccccccc---------cccccchhhhhHHHHHHHHhh------------- Q lcl|Aclame:pro 130 MKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKT---------NKVIKGDFTQDNIIDLEALLE------------- 187 (324) Q Consensus 130 i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~---------~~~~~~~~~~~~i~~~~~~l~------------- 187 (324) |.++++++++.++|++|++|+|++.+. |+....... .....+.+++.++..++..+. T Consensus 179 i~~~la~~~a~~~~~afi~GdG~~qP~-Gil~~~~~~~~~~~g~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~ 257 (381) T protein:vir:10 179 VRVQIEEAFAVALETAFLKGTGKDQPI-GLNRQVQKGVSVTDGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGK 257 (381) T ss_pred HHHHHHHHHHHHhhceeEecccCCCce-eeeecCCccccccccccccccccccccccchhhHHHHHHHHHHhhhhhhccc Confidence 999999999999999999999987654 443221110 011122333334333332221 Q ss_pred -hhcCCCcEEEEcHHHHHHHHHhh---ccCCceeeccCCcceeecceeEeecCCCCCCceeEEeecccEEEEEecceEEE Q lcl|Aclame:pro 188 -DDELEANAFISKTQNRSLLRKIV---DPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYK 263 (324) Q Consensus 188 -~~~~~~~~~v~~~~~~~~l~~~~---d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~ 263 (324) ..+..+..|+||+.++..+++++ +.+|+|++..+ .|+||+.++ .+++++++||||++|.++++.+++++ T Consensus 258 ~~~~~~~~~~vmn~~t~~~l~~~~~~~~~~G~~v~~lp-----~g~~vv~~~--~~p~~~i~fGDfs~Y~i~~r~~~~i~ 330 (381) T protein:vir:10 258 SVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTALP-----FNLNVIEST--VQEAGKVLTYVKGLYDGYLAGGINVQ 330 (381) T ss_pred cccccCceEEEEchhhHHhhccccccCCCCCceeecCC-----CCceeEEcC--CCCcCcEEEEEcccEEEEEecccEEE Confidence 13445678999999998887644 77888886532 466777655 45678899999999999999999999 Q ss_pred EeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCC---CCCCC Q lcl|Aclame:pro 264 IDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDS---VPGEV 324 (324) Q Consensus 264 ~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~---~~~~~ 324 (324) ++++. +|.+|+++||+..|+|++++|++||+.++.+..+... .|-|. T Consensus 331 ~~~~~--------------~~~~d~~~f~a~~r~dG~~~~~~A~~v~~l~~~~~~~~~~~~~~~ 380 (381) T protein:vir:10 331 KFKET--------------LALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEDTEET 380 (381) T ss_pred eechh--------------hhhcCceEEEEEEEEcCEEecCCcEEEEEEeecCCcccccccccc Confidence 98764 4999999999999999999999999998887555222 22222 No 92 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=100.00 E-value=1.2e-48 Score=283.31 Aligned_cols=306 Identities=12% Similarity=0.127 Sum_probs=225.1 Q ss_pred Cch-------hHHHHHHHHHH------------HhhhhhHHhhccccc-cccccCccccchHHHHHHHHHHHhhhhhhhh Q lcl|Aclame:pro 1 MEQ-------TQKLKLNLQHF------------ASNNVKPQVFNPDNV-MMHEKKDGTLMNEFTTPILQEVMENSKIMQL 60 (324) Q Consensus 1 ~~~-------~~~~k~~~~~~------------a~~~~~~~~~~~~~~-~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l 60 (324) .++ +...+...+.. ..+....+....... ...++++.+||++++..|++.+++.++++++ T Consensus 103 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~vP~~~~~~i~~~l~~~~~l~~~ 182 (466) T protein:vir:80 103 GARTQQFVGGETRMKGFFRNMPYEQRAALIARSEVKEFLAQVRTLAQQKRAVSGAELTIPDVMLELLRDNMHRYSKLISK 182 (466) T ss_pred hhhhhHHhhHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhccccccccHHHHHHHHHhhhhhhhhhhh Confidence 000 00001100000 000000000111111 1223355689999999999999999999999 Q ss_pred cceeecCCCceEEEEEeCCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 61 GKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYK 140 (324) Q Consensus 61 ~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~ 140 (324) ++++++++ ..++|+....+.+.|++|++.+|+++++|+++++.+||++++++||+|+++||.++++++|.++|+++++. T Consensus 183 ~~v~~~~g-~~~~~~~~~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~ 261 (466) T protein:vir:80 183 VRLRPLKG-TARQNIAGAIPEGVWTEAVANLNELSLSFSQIEVDGYKVGGFIPIPNSTLEDSDLNLADEILDAIGQAIGF 261 (466) T ss_pred eeeeecCc-eeEeeeecCCcceeecccccccccccccccceeecceeeeeehhhhHHHHhcchHHHHHHHHHHHHHHHHH Confidence 99998875 46889988888999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhccCcccccccccccccccccc--------ccchhhh-----------------hHHHHHHHHhhhhcCCC-c Q lcl|Aclame:pro 141 KFDEAGILNQGNNPFGKSIAQSIEKTNKV--------IKGDFTQ-----------------DNIIDLEALLEDDELEA-N 194 (324) Q Consensus 141 ~~d~~~l~G~g~~~~~~~~~~~~~~~~~~--------~~~~~~~-----------------~~i~~~~~~l~~~~~~~-~ 194 (324) ++|++||+|+|++.+. |+.+........ ....++. .++...+..+...+..+ . T Consensus 262 ~~~~ail~G~G~~~P~-Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 340 (466) T protein:vir:80 262 ALDKAILYGTGTKMPV-GIVTRLAQTTQPPNWGTKAPAWTNLSTTNLLKIDPTGKSAEEFFSELVLKLSKARANYSNGMK 340 (466) T ss_pred HHhhheeeccCCCCcc-eeeecccccccccccccccccccccchhhhhhhhhhccchhhHHHHHHHHHHhhhccccCCce Confidence 9999999999987644 554332111100 0011111 12222233333444444 4 Q ss_pred EEEEcHHHHHHHHHhh---ccCCceeeccCCcceeecceeEeecCCCCCCceeEEeecccEEEEEecceEEEEeecccee Q lcl|Aclame:pro 195 AFISKTQNRSLLRKIV---DPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLS 271 (324) Q Consensus 195 ~~v~~~~~~~~l~~~~---d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~ 271 (324) .|+||+.++..|..++ +.+|.+.+.......++|+||+.+++ ++++.+++|||+.++++++.++++.++++. T Consensus 341 ~w~~~~~~~~~l~~~~~~~~~~g~~~~~~~~~~~i~G~pvv~s~~--~~~~~~~~g~~~~y~i~~r~~~~i~~~~~~--- 415 (466) T protein:vir:80 341 FWAMSSNTHAVLMSKAITFNSAGALVASLNNTMPIVGGDIVILDF--IPDNDIIGGYGSLYLLAERADIKLAQSEHV--- 415 (466) T ss_pred eEEecchhHHHhhcccccccCCccccccCCCcccccccceeecCc--cCccceeeeccccEEEEeecceEEEechhh--- Confidence 6999999999998887 56677777766667799999998775 456789999999999999999999988653 Q ss_pred ccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 272 TVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) Q Consensus 272 ~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~~~~~ 324 (324) .|.+|++.||+..|+|+++.+++||++++.+...+.+++-.+ T Consensus 416 -----------~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~~~~~~~~~~~ 457 (466) T protein:vir:80 416 -----------RFIEDQTVFKGTARYDGKPVFGEGFVAVNIANANPTTSITFA 457 (466) T ss_pred -----------hhhcCcEEEEEEEEEccEEeccCceEEEEecCCCcccceeee Confidence 489999999999999999999999999998887666655544 No 93 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=100.00 E-value=9.9e-49 Score=283.75 Aligned_cols=305 Identities=10% Similarity=0.009 Sum_probs=221.8 Q ss_pred CchhHHHHHHHH-------------------HHHhh---------hhhHHhhccccccccccCccccchHH-HHHHHHHH Q lcl|Aclame:pro 1 MEQTQKLKLNLQ-------------------HFASN---------NVKPQVFNPDNVMMHEKKDGTLMNEF-TTPILQEV 51 (324) Q Consensus 1 ~~~~~~~k~~~~-------------------~~a~~---------~~~~~~~~~~~~~~~~~~~~~vp~~~-~~~i~~~~ 51 (324) .++. ....+++ +.... .....+.+....++++.||++||+++ .++|++.+ T Consensus 103 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lv~~~~~~~~ii~~l 181 (477) T protein:vir:84 103 YEKG-NGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYRDLDRNGGTGGYAVPPLWMMNRFIELA 181 (477) T ss_pred hhhh-HHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhccccccCCCcceeeccchhHHHHHHHh Confidence 0000 0000000 00000 00000112223344556788888875 67899999 Q ss_pred HhhhhhhhhcceeecC--CCceEEEEEeCCc-ceeeeccCcc-----ccccccceeeEEeeheeeEEeeeehHHHhhcCh Q lcl|Aclame:pro 52 MENSKIMQLGKYEPME--GTEKKFTFWADKP-GAYWVGEGQK-----IETSKATWVNATMRAFKLGVILPVTKEFLNYTY 123 (324) Q Consensus 52 ~~~s~l~~l~~~~~~~--~~~~~ip~~~~~~-~a~~v~Eg~~-----~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~ 123 (324) ++.+++++++++++++ ++.++||+..+++ .+.|++||+. +|+++++|++++++++|++++++||+|+++||. T Consensus 182 ~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~ 261 (477) T protein:vir:84 182 RAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLTDGFVQANVKTIAGQQGIAIQLLDQAA 261 (477) T ss_pred hhcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccCcccccccccccccceeeEEEeeeeEEeeeHHHHHHHhccc Confidence 9999999999988765 4568899976554 5679999864 578889999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccc-c-------chhhhhHHHHHHHHhhhhcCCC-c Q lcl|Aclame:pro 124 SQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVI-K-------GDFTQDNIIDLEALLEDDELEA-N 194 (324) Q Consensus 124 ~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~-~-------~~~~~~~i~~~~~~l~~~~~~~-~ 194 (324) ++++++|.++|+++++.++|.++|+|+|++..|.|+.+......... . ....++++.+++..+...+..+ + T Consensus 262 ~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~t~~~~~~~~~~i~~~~~~~~~~~~~~~~ 341 (477) T protein:vir:84 262 VSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGITQVTATSAGSALEKHQIIYQKIADAIQRVHTSRFLEPE 341 (477) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeeccccccccccccccchhhHHHHHHHHHHHHhhccccccCCcc Confidence 99999999999999999999999999998777777765433221111 1 1234566777777777776654 5 Q ss_pred EEEEcHHHHHHHHHhhccCCceeeccC-----------------CcceeecceeEeecCCCCC------CceeEEeeccc Q lcl|Aclame:pro 195 AFISKTQNRSLLRKIVDPETKERIYDR-----------------NSDSLDGLPVVNLKSSNLK------RGELITGDFDK 251 (324) Q Consensus 195 ~~v~~~~~~~~l~~~~d~~g~~~~~~~-----------------~~~~l~G~pv~~~~~~~~~------~~~~i~gd~s~ 251 (324) .|+|||.++..|++++|.+|+|+|.++ ..++|+|+||++++.++.+ ...++||||++ T Consensus 342 ~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~pVv~s~~~p~~~~~~~d~~~i~~gd~~~ 421 (477) T protein:vir:84 342 VIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHGLPVVTDPTLPTTLGTGTDQDVIHVLRASD 421 (477) T ss_pred EEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhcccceEecCcccccccccCCcceEEEEEece Confidence 799999999999999999999998642 3458999999998776642 23689999999 Q ss_pred EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEe-ccCceEEEEeecCCCCCCC Q lcl|Aclame:pro 252 LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIA-DDKAFAKLVPADKRTDSVP 321 (324) Q Consensus 252 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~-~~~A~~~l~~~~~~~~~~~ 321 (324) ++++. .++.++++++. ++.++.+.||+..++++... ||+||+.+|+++...+.-. T Consensus 422 ~~i~~-~~~~~~~~~~~--------------~~~~~~~~~~v~~~~~~~~~r~~~afv~~t~~~~~~~~~~ 477 (477) T protein:vir:84 422 LALFE-SSVRMRALQET--------------RAENLSVLLQVYGYLAFTAARFPQSVVEIGGTALTAPTFA 477 (477) T ss_pred EEEEe-eceeEEecccc--------------ccccceeeeeehhhhhhhhhccccceEEeecccccccccC Confidence 88876 46777776543 34567778888888887554 6999999998776643322 No 94 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=100.00 E-value=7.9e-49 Score=284.30 Aligned_cols=294 Identities=12% Similarity=0.103 Sum_probs=224.1 Q ss_pred CchhHH----HHHHHHHHHhhh-------hhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCC Q lcl|Aclame:pro 1 MEQTQK----LKLNLQHFASNN-------VKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGT 69 (324) Q Consensus 1 ~~~~~~----~k~~~~~~a~~~-------~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~ 69 (324) ++..+. +.+++|++.... ....+.++.+..++++||++||+++.++|++.+++.++|+++++++++++ T Consensus 81 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~al~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~- 159 (387) T protein:vir:93 81 LNDHEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG- 159 (387) T ss_pred cchhhHHHHHHHHHHHHHhhhhhhhhhhhhhHHHHHhhccCcCCCCceeechhHHHHHHHHHHhhchhhhheeeeecCC- Confidence 211111 112233332211 11223455566677788999999999999999999999999999988865 Q ss_pred ceEEEEEe-CCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHH- Q lcl|Aclame:pro 70 EKKFTFWA-DKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGI- 147 (324) Q Consensus 70 ~~~ip~~~-~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l- 147 (324) ..+|+.. +...+.|++|++..++++++|+++++.++|++++++||+|+++||.++++++|.+.++++++.+++..+| T Consensus 160 -~~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~ 238 (387) T protein:vir:93 160 -LEIPRVSYTLDDDDFITDVETAKELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALA 238 (387) T ss_pred -ceEEEEeecCCccccccCcccccccccccceeeeeheeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhh Confidence 4578764 4577999999999999999999999999999999999999999999999999999999999999876544 Q ss_pred hccCccccccccccccccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeeccCCcceee Q lcl|Aclame:pro 148 LNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLD 227 (324) Q Consensus 148 ~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~~~~~l~ 227 (324) .|+|++.+.+.+..... ...++..++++|+++++.+...|+.+++|+||+.++..+.++++..|++++. +.+.+|+ T Consensus 239 ~g~g~g~p~g~l~~~~~---~~v~~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~~~~~~~d~~~~~~~-~~~~~ll 314 (387) T protein:vir:93 239 VSPKSGLDHMSFYNGSV---KEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD-TPAEKVF 314 (387) T ss_pred cCCCccccceeeecccc---ccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc-cCCcccc Confidence 56666655444433221 2234455789999999999999999999999999986665444444455553 4557899 Q ss_pred cceeEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCce Q lcl|Aclame:pro 228 GLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAF 307 (324) Q Consensus 228 G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~ 307 (324) |+||+++.+ ...++||||+++++. +.++.++.+++ +.++++.|++..|+|+++.+|+|| T Consensus 315 G~PV~~~~~----~~~~~~GDf~~~~~~-~~~~~~~~~~~----------------~~~~~~~~~~~~r~d~~v~~~eA~ 373 (387) T protein:vir:93 315 GKPVVFTDA----AVKPIVGDFNYFGIN-YDGTTYDTDKD----------------VKKGEYLFVLTAWYDQQRTLDSAF 373 (387) T ss_pred ccceEEecC----CCceeeeehhhhhee-hhhheeeeccc----------------ccCCceeEEEEeeeCceeechhhe Confidence 999999764 346899999998765 45566654432 356889999999999999999999 Q ss_pred EEEEeecCCCCCCC Q lcl|Aclame:pro 308 AKLVPADKRTDSVP 321 (324) Q Consensus 308 ~~l~~~~~~~~~~~ 321 (324) +.++.+++.+..|- T Consensus 374 ~~l~~k~~~~~~~~ 387 (387) T protein:vir:93 374 RIAKAKENTGSLPS 387 (387) T ss_pred EEEEeecCCCCCCC Confidence 99998877766555 No 95 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=100.00 E-value=2.5e-48 Score=281.57 Aligned_cols=293 Identities=12% Similarity=0.071 Sum_probs=217.9 Q ss_pred CchhH-----HHHHHHHHHHhhhh-----hHHhh---c-cccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeec Q lcl|Aclame:pro 1 MEQTQ-----KLKLNLQHFASNNV-----KPQVF---N-PDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPM 66 (324) Q Consensus 1 ~~~~~-----~~k~~~~~~a~~~~-----~~~~~---~-~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~ 66 (324) ++.-+ +.+.+.+....... ..++. + .....+.+++|++||+++.++|++.+.+.|+++++|+++++ T Consensus 39 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~ee~~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~ 118 (377) T protein:vir:96 39 FTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNT 118 (377) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCcccCHHHHHHHHHHHhcCCCCCCceecCHHHHHHHHHHHHhhhhhhhhceeEec Confidence 10000 00111111110000 00011 1 11234456788899999999999999999999999999988 Q ss_pred CCCceEEEEEeCCcceeeeccCcccc-ccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 67 EGTEKKFTFWADKPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEA 145 (324) Q Consensus 67 ~~~~~~ip~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~ 145 (324) ++ ..++|+.++.+.+.|++|+++++ .++++|+++++.+||++++++||+|+++||.++++++|.+++++++++++|++ T Consensus 119 ~~-~~~i~~~~~~~~a~wv~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~~ll~ds~~~le~~i~~~l~~~~~~~~~~a 197 (377) T protein:vir:96 119 SL-RLKALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELA 197 (377) T ss_pred CC-ceEEEEecCCcceeEeecccccccccCccceeEeeeeeeEEeechhhHHHhhcchhhHHHHHHHHHHHHHHHHHhhc Confidence 65 57899998899999999998876 56899999999999999999999999999999999999999999999999999 Q ss_pred HHhccCcccccccccccccccc-c--------------c---ccchhhhhHHHHHHHHhhhhcC-----------CCcEE Q lcl|Aclame:pro 146 GILNQGNNPFGKSIAQSIEKTN-K--------------V---IKGDFTQDNIIDLEALLEDDEL-----------EANAF 196 (324) Q Consensus 146 ~l~G~g~~~~~~~~~~~~~~~~-~--------------~---~~~~~~~~~i~~~~~~l~~~~~-----------~~~~~ 196 (324) +++|+|++.+. |+.+...... . . .....+.+.+.+++..|...+. .++.| T Consensus 198 ~i~G~G~~~P~-Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~ 276 (377) T protein:vir:96 198 IVKGNGLLQPV-GLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKL 276 (377) T ss_pred eEeccCCCcce-eeeeccccccccccccccccceeeccccccccccCChhHHHHHHHHHHHhhccccccccccccCceEE Confidence 99999987544 4443211100 0 0 0112344566666666654443 34569 Q ss_pred EEcHHHHHHHHHhhccCCceeecc--CCcceeecceeEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccc Q lcl|Aclame:pro 197 ISKTQNRSLLRKIVDPETKERIYD--RNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVK 274 (324) Q Consensus 197 v~~~~~~~~l~~~~d~~g~~~~~~--~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~ 274 (324) +||+.++..+ .|++.+.+ +.+.+++|+|+.+..+..++++.++||||++|.++++++++++.+++. T Consensus 277 ~mn~~t~~~~------~~~~~~~~~~G~~~~~l~~p~~v~~s~~~p~~~i~fgdf~~Y~i~~r~~~~i~~~~~~------ 344 (377) T protein:vir:96 277 LLNPEDRWTL------EAKFTSRNQFGEYVTVLPHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQT------ 344 (377) T ss_pred EEchhhHHhc------cccccccCCCCCceeccCCCceEEecCCCCcccEEEEEcCcEEEEEecccEEEeehhh------ Confidence 9999997665 23333332 344578899987777777888999999999999999999999998764 Q ss_pred cccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecC Q lcl|Aclame:pro 275 NEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) Q Consensus 275 ~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~ 315 (324) +|.+|++.||+..|+|++++|++||++|+.+-- T Consensus 345 --------~~~~d~~~f~~~~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 345 --------FAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred --------hhhcCCeEEEEEEEEcCEEecCCcEEEEEEecC Confidence 499999999999999999999999999998554 No 96 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=100.00 E-value=7.9e-49 Score=284.30 Aligned_cols=294 Identities=12% Similarity=0.085 Sum_probs=222.3 Q ss_pred CchhH--HHHHHHHHHHhh-------hhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCce Q lcl|Aclame:pro 1 MEQTQ--KLKLNLQHFASN-------NVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEK 71 (324) Q Consensus 1 ~~~~~--~~k~~~~~~a~~-------~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~ 71 (324) ++... ...+++|.+... .....+.++....++++||++||++++++|++.+++.++|+++++++++++ . T Consensus 98 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~--~ 175 (402) T protein:vir:93 98 DNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG--L 175 (402) T ss_pred hhHHHHHHHHHHHHHHHhhhhHHHHHHhHHHHHhhhccCCCcCCccccchhHHHHHHHhHHhhhhhhhhceeeecCC--c Confidence 11110 011122222111 111122344455666778999999999999999999999999999988865 4 Q ss_pred EEEEEe-CCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHH-Hhc Q lcl|Aclame:pro 72 KFTFWA-DKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAG-ILN 149 (324) Q Consensus 72 ~ip~~~-~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~-l~G 149 (324) .+|+.. +.+.+.|++|++.+++++++|+++++.++|++++++||+|+++||.++++++|.++|+++++.+++..+ ..| T Consensus 176 ~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~i~iS~ell~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g 255 (402) T protein:vir:93 176 EIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLAAKERKDALAVS 255 (402) T ss_pred eeeeeeccCCccccccccccccccccccceeeecceeeeeechhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcC Confidence 578765 456789999999999999999999999999999999999999999999999999999999999977654 456 Q ss_pred cCccccccccccccccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeeccCCcceeecc Q lcl|Aclame:pro 150 QGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSLDGL 229 (324) Q Consensus 150 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~~~~~l~G~ 229 (324) +|++.+.+.+.... ....++..++++|+++++.|...|+.+++|+||+.++..+.++++..|++++. +.+.+|+|+ T Consensus 256 ~g~g~p~g~~~~~~---~~~~~~~~~~d~l~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~d~~~~~~~-~~~~~llG~ 331 (402) T protein:vir:93 256 PKSGLEHMSFYNGS---VKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD-TPAEKVFGK 331 (402) T ss_pred CCccccceeeeccc---cccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc-cCCcccccc Confidence 66665444333222 12233555789999999999999999999999999988877677777777775 455789999 Q ss_pred eeEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEE Q lcl|Aclame:pro 230 PVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAK 309 (324) Q Consensus 230 pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~ 309 (324) ||+++.+ ..+++||||+++++.. .++.++.+++ ...+++.|++..|+|+++.+|+||+. T Consensus 332 PV~~t~~----~~~i~~GDf~~~~~~~-~~~~~~~~~~----------------~~~~~~~~~~~~r~Dg~v~~~~A~~~ 390 (402) T protein:vir:93 332 PVVFTDA----AVKPIVGDFNYFGINY-DGTTYDTDKD----------------VKKGEYLFVLTAWYDQQRTLDSAFRI 390 (402) T ss_pred ceEEecC----CCceeeechhhhhhhh-hhhhhhhhhc----------------ccCCceEEEEEEEeCcEEechhheEE Confidence 9998764 3468999999876543 4444443332 12588999999999999999999999 Q ss_pred EEeecCCCCCCC Q lcl|Aclame:pro 310 LVPADKRTDSVP 321 (324) Q Consensus 310 l~~~~~~~~~~~ 321 (324) |+.+++.+..|- T Consensus 391 l~ik~~~~~~~~ 402 (402) T protein:vir:93 391 AKAKENTGPLPS 402 (402) T ss_pred EEeecCCCCCCC Confidence 999877654444 No 97 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=100.00 E-value=1.2e-48 Score=283.26 Aligned_cols=299 Identities=16% Similarity=0.114 Sum_probs=217.7 Q ss_pred Cc--hhHHHHHHHHH----HHh-----hhhhHHh---hccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeec Q lcl|Aclame:pro 1 ME--QTQKLKLNLQH----FAS-----NNVKPQV---FNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPM 66 (324) Q Consensus 1 ~~--~~~~~k~~~~~----~a~-----~~~~~~~---~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~ 66 (324) ++ .....+...+. +.. ......+ ++.-...++++||++||++++++|++.+++.|+++++++++++ T Consensus 43 ~~~~~~~~~~~~~~~~~~~~~~~~~g~~~lt~~e~~~~~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~ 122 (383) T protein:vir:78 43 MAADIMEQAKKEARQEADAYISASRTDKNITNEEIKFFNDINKEVGYKEETLLPQTVVDEIFEDLTTEHPFLASIGMRTT 122 (383) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCChhhhhHHHHHHHHHHhccCCCCCccccCHHHHHHHHHHHHhhccceeeeeeEec Confidence 10 00001111111 100 0000011 1233345677899999999999999999999999999999998 Q ss_pred CCCceEEEEEeCCcceeeeccCcccc-ccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 67 EGTEKKFTFWADKPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEA 145 (324) Q Consensus 67 ~~~~~~ip~~~~~~~a~~v~Eg~~~~-~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~ 145 (324) ++. .+||+.++.+.+.|++|+++++ .++++|+++++.+||++++++||+|+++||.++++++|.+.++++++.++|++ T Consensus 123 ~~~-~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~i~is~ell~Ds~~~ie~~i~~~l~~~~a~~~~~a 201 (383) T protein:vir:78 123 GLR-TKFLKSETSGVAVWGKIFGEIKGQLDATFSDEESIQNKLTAFVVVPKDLEKFGPAWVKRFVVTQIEEAFAVALESA 201 (383) T ss_pred CCc-eEEEEEcCCcceEEeecccccccccCcceeeEeecceeeEeeccchHHHhhccHHHHHHHHHHHHHHHHHHHHhhh Confidence 765 6899999999999999988775 57899999999999999999999999999999999999999999999999999 Q ss_pred HHhccCcccccccccccccccc---------ccccchhhhhHHHHHHHHhhhhcCCCc---------------EEEEcHH Q lcl|Aclame:pro 146 GILNQGNNPFGKSIAQSIEKTN---------KVIKGDFTQDNIIDLEALLEDDELEAN---------------AFISKTQ 201 (324) Q Consensus 146 ~l~G~g~~~~~~~~~~~~~~~~---------~~~~~~~~~~~i~~~~~~l~~~~~~~~---------------~~v~~~~ 201 (324) |++|+|++.+. |+........ ....+..+++++..+...+.. ++.+. +|+||+. T Consensus 202 ~i~G~G~~qP~-Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~-~~~~~~~~~~~~~~~~~~~~~~~~n~~ 279 (383) T protein:vir:78 202 YIVGDGNDKPI-GLNRKVGKGSTVVDGVYAEKAATGTLTFANPKTTVNELTD-VYKYHSVKENGHPLNVAGKVTLLVNPT 279 (383) T ss_pred eEeccCCCCce-eeeeccCCcccccccccccccccchhhhhhhHHHHHHHHH-HHhccchhcccchhhhcCceEEEEcCc Confidence 99999987644 4433221111 112344556666666665542 33333 4555654 Q ss_pred HHHHHHH---hhccCCceeeccCCcceeecceeEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccc Q lcl|Aclame:pro 202 NRSLLRK---IVDPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDG 278 (324) Q Consensus 202 ~~~~l~~---~~d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (324) ++..+.. ..+.+| ...+++|+|+.+..+..+++++++||||++|.++++++++++++++. T Consensus 280 ~~~~~~~~~~~~~~~G-------~~~t~l~~~~~iv~s~~~p~~~iifgdfs~Y~i~~r~~~~i~~~~~~---------- 342 (383) T protein:vir:78 280 DAWDVKKQYTSLNANG-------VYVTALPFNLNIIESLFVPEKKAISYVAERYDALIGGPLDIGTYDQT---------- 342 (383) T ss_pred chhhhccchhccCCCC-------ceeeecCCCceEEecCCCCcccEEEeeccceEEEecccceEEecchh---------- Confidence 4322211 122233 33467888876655667788899999999999999999999988764 Q ss_pred cchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCCCCC Q lcl|Aclame:pro 279 TPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGE 323 (324) Q Consensus 279 ~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~~~~ 323 (324) +|.+|++.||+..|+|++++|++||+.|+.+-+....+|.- T Consensus 343 ----~f~~d~~~f~~~~r~dG~~~~~~A~~vl~~~~~~~~~~~~~ 383 (383) T protein:vir:78 343 ----LAIEDLNLYAAKQFAYGKAKDDKAAAVWTLNINPAEQTPEG 383 (383) T ss_pred ----hhhcCceEEEEEEEEcCEEecCCeEEEEEEEecCCCCCCCC Confidence 49999999999999999999999999999887775555555 No 98 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=100.00 E-value=2.3e-47 Score=276.28 Aligned_cols=280 Identities=13% Similarity=0.037 Sum_probs=219.2 Q ss_pred CchhHHHHHH--------------HHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeec Q lcl|Aclame:pro 1 MEQTQKLKLN--------------LQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPM 66 (324) Q Consensus 1 ~~~~~~~k~~--------------~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~ 66 (324) .+.....+.. ...+.. .............+..+++.++|+++.+.|++ ++..+.++++++++++ T Consensus 93 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~-~~~~~~l~~~~~~~~~ 170 (397) T protein:vir:96 93 QKPKDGEKRKMKKFKVTEEELAEKRSAINA-FVKSKGAEKRDGFTSVEGGALIPQELLQPQLE-PKDIVDLSKYVRSVPV 170 (397) T ss_pred hhhHHHHHHHHHHHhhhhHHHHHHHHHHHH-HHHhhhhhhhhcccccccccchhHHHHHHHHH-hhhhhhHHHhhhhccc Confidence 0000000000 000110 00011122233455667889999999999987 5778889999999999 Q ss_pred CCCceEEEEEeC-CcceeeeccCccccc-cccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 67 EGTEKKFTFWAD-KPGAYWVGEGQKIET-SKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDE 144 (324) Q Consensus 67 ~~~~~~ip~~~~-~~~a~~v~Eg~~~~~-~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~ 144 (324) +++++.+|+... ...+.|++|++..|+ +.++|++++++++++++++++|+|+++||.++++++|.+.++++++.++|. T Consensus 171 ~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~~~~~~~~~s~ell~ds~~~l~~~i~~~l~~~~~~~~~~ 250 (397) T protein:vir:96 171 NSASGKFPVISKSGSKMATVQQLEKNPQLANPKMVEIDYSVATRRGYIPISQEMIDDASYDVTGLIADEIQDQSLNTKNA 250 (397) T ss_pred cccceeEEEEeccCCccccccccccccccccccccceeecHhHhhcchhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHH Confidence 998899998754 567889999999996 689999999999999999999999999999999999999999999999999 Q ss_pred HHHhccCccccccccccccccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeecc---- Q lcl|Aclame:pro 145 AGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYD---- 220 (324) Q Consensus 145 ~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~---- 220 (324) ++++|+|.+.+ .+..+++++.++++.....++ +++|+|||++|..|++++|++|+|+|.+ T Consensus 251 ~i~~g~g~~~~---------------~~~~~~d~~~~~~~~~~~~~~-~a~~v~n~~~~~~l~~lkd~~G~~~~~~~~~~ 314 (397) T protein:vir:96 251 DIAAVLKTATA---------------KSVVGVDGLKDLINKEIKKVY-DVKLFISASMYSELDKLKDKNGRYLLQDSITA 314 (397) T ss_pred HHhhccccccc---------------ccccchHHHHHHHHHhhhhhc-CcEEEEcHHHHHHHHHhhccCCCeEeccCccC Confidence 99999886532 234568899999887655544 6799999999999999999999999864 Q ss_pred CCcceeecceeEeecCCC----CCCceeEEeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEE Q lcl|Aclame:pro 221 RNSDSLDGLPVVNLKSSN----LKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATM 295 (324) Q Consensus 221 ~~~~~l~G~pv~~~~~~~----~~~~~~i~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~ 295 (324) +.+++|+|+||+++++.. .++..++||||++ +.+++++++++..+++. .+.+.+|+.. T Consensus 315 ~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~~-----------------~~~~~~~~~~ 377 (397) T protein:vir:96 315 ASGKQLLGKEVVVLDDDVIGKSVGNVVGFIGDAKAFASFFDRKQVSVSWVDNN-----------------IYGQLLAGII 377 (397) T ss_pred CCcccccccceEEecccccCCCCCceEEEEeehhcceEeEeecceEEEEeccc-----------------ccceeEEEEE Confidence 455789999999875432 3344689999997 56888999999876542 2345789999 Q ss_pred EeccEEeccCceEEEEeecC Q lcl|Aclame:pro 296 HVALHIADDKAFAKLVPADK 315 (324) Q Consensus 296 r~d~~v~~~~A~~~l~~~~~ 315 (324) |+|+.+.||+||++|+.+++ T Consensus 378 r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 378 RYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred EEccEEecccceEEEEeecC Confidence 99999999999999999888 No 99 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=100.00 E-value=5.4e-40 Score=235.86 Aligned_cols=287 Identities=14% Similarity=0.081 Sum_probs=220.8 Q ss_pred HHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceee-cCCCceEEEEEeCC----cce Q lcl|Aclame:pro 8 KLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEP-MEGTEKKFTFWADK----PGA 82 (324) Q Consensus 8 k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~-~~~~~~~ip~~~~~----~~a 82 (324) ++++|+-. +.....+ .+...||+|+|+++ +++++.+++.+++++++++++ +.+...+||+...+ +.. T Consensus 1 ~~~~~~~~------~~~k~it-~~d~~gG~L~P~~~-~~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~~~ 72 (314) T protein:vir:41 1 MDFLNKPF------QITPKID-VPDLGKGILAVQRF-GEFVREVRENSAIIKDARVLNALKSYEVDISRISLGVELEPGR 72 (314) T ss_pred CchhhhHH------Hhhcccc-cccCCCceeChHHH-HHHHHHHHhccchhhheeeecccCccceeecccccCccccccc Confidence 23333311 1122222 23445778888887 579999999999999999985 57777889987533 234 Q ss_pred eeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcCh--HHHHHHHHHHHHHHHHHHHHHHHHhccCccc------ Q lcl|Aclame:pro 83 YWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNNP------ 154 (324) Q Consensus 83 ~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~--~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~------ 154 (324) .|.+|....++++++|+++++.+||+...+.||+|+|+|+. ++++++|..+++++++..++.++++|+|+.. T Consensus 73 ~~~~~~~~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~ 152 (314) T protein:vir:41 73 NTSGTKVAPTADEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELY 152 (314) T ss_pred ccccCCccCCcccccccceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccch Confidence 57778888899999999999999999999999999999996 4999999999999999999999999998632 Q ss_pred -ccccccccc-ccccc--cccchhhhhHHHHHHHHhhhhcCC---CcEEEEcHHHHHHHHHhhccCCceeec----cCCc Q lcl|Aclame:pro 155 -FGKSIAQSI-EKTNK--VIKGDFTQDNIIDLEALLEDDELE---ANAFISKTQNRSLLRKIVDPETKERIY----DRNS 223 (324) Q Consensus 155 -~~~~~~~~~-~~~~~--~~~~~~~~~~i~~~~~~l~~~~~~---~~~~v~~~~~~~~l~~~~d~~g~~~~~----~~~~ 223 (324) .+.|....+ ..... ..+...+.+.+.+++..|+..|+. +.+|+||++++.+++++++.++++++. .+.+ T Consensus 153 ~~p~G~l~~a~~~~~~~~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~~~~l~~~~~~~~~~ 232 (314) T protein:vir:41 153 RINDGWMKLAGNQYTDAEPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVRETGLGDSALIGATG 232 (314) T ss_pred hcchhhhhhcccceeecCccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHhccCCcccchhhhCCCC Confidence 233433322 22111 122345667788999999999875 457999999999999999999888763 5677 Q ss_pred ceeecceeEeecCCC---CCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccE Q lcl|Aclame:pro 224 DSLDGLPVVNLKSSN---LKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALH 300 (324) Q Consensus 224 ~~l~G~pv~~~~~~~---~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~ 300 (324) .+++|+||+.++.++ .++..++||||++++++.+..++++.++.. .++++.|.+..|+|+. T Consensus 233 ~~l~G~PV~~~~~~~~~~~~~~~i~fgd~~nlv~~~~~~ir~~~~~~a----------------~~~~~~~~~~~r~d~~ 296 (314) T protein:vir:41 233 LQYDGIPIQYVPALDALGDDKARALLTVPTNLVYGFWRNIRIEPKRDA----------------AMRRTEYIASLRADCN 296 (314) T ss_pred ceecceeeEecccccccCCCCceEEEechhheEEEeeceeEEeecccC----------------cCCeEEEEEEEEeceE Confidence 789999999877654 578899999999999999888888766532 5788999999999999 Q ss_pred EeccCceEEEEeecCCCC Q lcl|Aclame:pro 301 IADDKAFAKLVPADKRTD 318 (324) Q Consensus 301 v~~~~A~~~l~~~~~~~~ 318 (324) +.+.+|.++.....+..+ T Consensus 297 ~~~~~aa~~~~~~~~~~~ 314 (314) T protein:vir:41 297 YEDENAAVAAVIDMSSGG 314 (314) T ss_pred EEEcCcEEEEEeeccCCC Confidence 998877776654444333 No 100 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=100.00 E-value=2.8e-38 Score=226.43 Aligned_cols=286 Identities=13% Similarity=0.088 Sum_probs=210.3 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceee-cCCCceEEEEEeCC Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEP-MEGTEKKFTFWADK 79 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~-~~~~~~~ip~~~~~ 79 (324) |-.-.. ++ ..... ......+ .+..+||+++|++ ..++++.+.+.|++++++++++ +++....+++..-+ T Consensus 1 ~~~~~~----~~---~~~~~-~~~k~~t-~~d~~Gg~l~P~~-~~~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~ 70 (315) T protein:vir:41 1 MLTIED----IR---GGKPF-EIVPKID-VPDLGRGVLSVDR-FGEFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLV 70 (315) T ss_pred Ccccch----hh---cCChh-hhhhhcC-CcCCCCceechHH-HHHHHHHHHhhhhhhhhceeeeccccccccccccccC Confidence 111111 11 11111 1112222 2233455555655 5679999999999999999864 55555556554211 Q ss_pred ----cceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcCh--HHHHHHHHHHHHHHHHHHHHHHHHhccCcc Q lcl|Aclame:pro 80 ----PGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNN 153 (324) Q Consensus 80 ----~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~--~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~ 153 (324) ....|.+|.++.++++++|+++++.++++...+.||+|+|+|+. ++++++|..++++++++.++.++++|+|+. T Consensus 71 ~~~~~g~~~~~~~~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s 150 (315) T protein:vir:41 71 LDVGPGRDETGQKLAPPESTAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDTSS 150 (315) T ss_pred cccccccccccCcCCCCCCccccceeeeceeeeeeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccCCcC Confidence 23458889999999999999999999999999999999999985 599999999999999999999999999864 Q ss_pred ccc-----ccccccc-ccc----cccccchhhhhHHHHHHHHhhhhcCC---CcEEEEcHHHHHHHHHhhccCCceeec- Q lcl|Aclame:pro 154 PFG-----KSIAQSI-EKT----NKVIKGDFTQDNIIDLEALLEDDELE---ANAFISKTQNRSLLRKIVDPETKERIY- 219 (324) Q Consensus 154 ~~~-----~~~~~~~-~~~----~~~~~~~~~~~~i~~~~~~l~~~~~~---~~~~v~~~~~~~~l~~~~d~~g~~~~~- 219 (324) ..+ .|....+ ... ....+...+.+.+.+|+..|+..|+. +.+|+||++++..++++++.+|+|+|. T Consensus 151 ~~p~~~~~~G~l~~a~~~~~~~~~~~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~~~g~~lw~~ 230 (315) T protein:vir:41 151 SDPLLRMSDGWLKLASEKLTESDVDPEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDALKGRETGLGDQ 230 (315) T ss_pred cCccccccccceecccccccccccccccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHhccCCCccccc Confidence 322 3333211 111 11122345678899999999999874 568999999999999999999999985 Q ss_pred ---cCCcceeecceeEeecCCC---CCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEE Q lcl|Aclame:pro 220 ---DRNSDSLDGLPVVNLKSSN---LKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRA 293 (324) Q Consensus 220 ---~~~~~~l~G~pv~~~~~~~---~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~ 293 (324) .+.+.+|+|+||+.+++++ .+++.++||||++++++.+.+++++++++. .++.+.|.+ T Consensus 231 ~~~~g~~~tl~G~PV~~~~~m~~~~~~~~~ilf~d~~nl~~~~~~~i~i~~~~~a----------------~~~~~~~~~ 294 (315) T protein:vir:41 231 ALTGANSILYDGRPVQYVPALEALNDGKSRALFVVPTQLVYGFWRNIKVVPDYDA----------------EMRLTKYVA 294 (315) T ss_pred hhhcCCCceecccceEecccccccCCCCccEEEecccceEEEeccccEEEeeecC----------------CCCceEEEE Confidence 4577899999999887664 477889999999999999999999877643 346677888 Q ss_pred EEEeccEEeccCc--eEEEEe Q lcl|Aclame:pro 294 TMHVALHIADDKA--FAKLVP 312 (324) Q Consensus 294 ~~r~d~~v~~~~A--~~~l~~ 312 (324) ..|+|+.+.++++ ++.+++ T Consensus 295 ~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 295 SLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred EEEeceeEEeccceeEeeeeC Confidence 9999998877666 444444 No 101 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=100.00 E-value=1.5e-35 Score=211.43 Aligned_cols=301 Identities=11% Similarity=0.075 Sum_probs=219.7 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCc Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~ 80 (324) |-.+ .+..++++.+. .+. -...+.++|++||+++.+.+++.+.+.++++++++++++.+....+|....++ T Consensus 1 ~~~k-~~~~~l~~~~~-------~~~-~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~~~~~~ 71 (321) T protein:vir:31 1 MASR-TINNDLSRITE-------KNA-LTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKTRIPTLNIGE 71 (321) T ss_pred CchH-HHHHHHHHHHH-------hcc-ccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcceeeeeeccCC Confidence 4433 33333333221 111 12234567889999999999999999999999999999999889999987667 Q ss_pred ceeeecc-C-ccccccccceeeEEeeheeeEEeeeehHHHhhcCh--HHHHHHHHHHHHHHHHHHHHHHHHhccCccccc Q lcl|Aclame:pro 81 GAYWVGE-G-QKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFG 156 (324) Q Consensus 81 ~a~~v~E-g-~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~--~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~ 156 (324) ...|+++ + ...+.++++|+++++.++|+...+.||+|+|+|+. ++++++|.+.++++++..++.++|+|+|...++ T Consensus 72 ~~~~~~~e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~ 151 (321) T protein:vir:31 72 RHRRPQDEGEWNENESDVSTGTIDISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDS 151 (321) T ss_pred cccccccccccccccccceeeeeeeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCc Confidence 7778763 3 34556789999999999999999999999999974 699999999999999999999999999876554 Q ss_pred c-----ccccccc-c--ccccccchhhhhHHHHHHHHhhhhcCC--CcEEEEcHHHHHHHHH-hhccCCceee----ccC Q lcl|Aclame:pro 157 K-----SIAQSIE-K--TNKVIKGDFTQDNIIDLEALLEDDELE--ANAFISKTQNRSLLRK-IVDPETKERI----YDR 221 (324) Q Consensus 157 ~-----~~~~~~~-~--~~~~~~~~~~~~~i~~~~~~l~~~~~~--~~~~v~~~~~~~~l~~-~~d~~g~~~~----~~~ 221 (324) + |...... . ......+..+++.+.+++..|+..|++ ..+|+||++++..++. +++.++ +++ ..+ T Consensus 152 ~~~~n~G~l~~a~~~~~~~~~~~~~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~~~~~-~~~~~~l~~~ 230 (321) T protein:vir:31 152 FENQNDGFITVAEGDVETIDAADDILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLTDRDT-PLGDNVIMGE 230 (321) T ss_pred ccccchhhhhhhccccccccccccccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHHhcCCC-ccccchhhcc Confidence 2 3222111 1 111233456778899999999998875 3589999999887765 555443 444 345 Q ss_pred CcceeecceeEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEE Q lcl|Aclame:pro 222 NSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHI 301 (324) Q Consensus 222 ~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v 301 (324) .+.+++|+||+..+. +|++.++++||+++.++.+.++++++..+.... ....+.+......++|+.| T Consensus 231 ~~~tl~G~pvv~~~~--mP~~~il~t~~~nl~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~v 297 (321) T protein:vir:31 231 ADVNPFSFPIIGSGL--WPDDKAMFTDPQNLIYALYRDLEIDVLTESDKV-----------SERDLHARYFMRGDDDFAI 297 (321) T ss_pred ccccccceeEEEcCC--CCCCcEEEeccccEEEEEeeccEEEEeecCccc-----------cccceeeEeeeeeecceeE Confidence 667899999998775 567899999999999999999888876553210 0123344445667899999 Q ss_pred eccCceEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 302 ADDKAFAKLVPADKRTDSVPGEV 324 (324) Q Consensus 302 ~~~~A~~~l~~~~~~~~~~~~~~ 324 (324) .+.+|++.+++...+--...=|- T Consensus 298 e~~~a~a~~~~i~~~~~~~~~~~ 320 (321) T protein:vir:31 298 ENTEAVVLAEGLGDPLEHLEEET 320 (321) T ss_pred eccccEEEEecCCcchhcccCCC Confidence 99999999997643210000000 No 102 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=100.00 E-value=1e-35 Score=212.39 Aligned_cols=294 Identities=8% Similarity=0.002 Sum_probs=197.9 Q ss_pred CchhHHH---------------HHHHHHHHhh--hhhH------HhhccccccccccCccccchHHHHHHHHHHHhhhhh Q lcl|Aclame:pro 1 MEQTQKL---------------KLNLQHFASN--NVKP------QVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKI 57 (324) Q Consensus 1 ~~~~~~~---------------k~~~~~~a~~--~~~~------~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l 57 (324) .+..++. .+++..+... .... ....+.+......++.+.|+.+...+...+...+++ T Consensus 190 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i 269 (517) T protein:vir:97 190 LMKQRESEKILGVEALKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGSL 269 (517) T ss_pred HHHHHHhhhhcccccccccchhhHHHHHHHHHHHHHHhcccccccceeeeecccccccccccchHHHHHHHHhhhhhccc Confidence 0000000 0001110000 0000 000111112223356788999999999999998888 Q ss_pred hhhcceeecCCCceEEEEEeCCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHH----HHHHHHHH Q lcl|Aclame:pro 58 MQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQ----FFEEMKPM 133 (324) Q Consensus 58 ~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~----~~~~i~~~ 133 (324) .+.++..++. ...+|..+....+.|+.||+.+|+++++|+.+++.++++++++++|+|+|+|+..| +++||.++ T Consensus 270 ~~~~~~~~i~--~~~~~~~~~~~~a~~~~eG~~kp~s~~tf~~~~~~~~~ia~~~~~S~qll~Ds~~dd~~~l~s~i~~~ 347 (517) T protein:vir:97 270 LPFIRHENLP--TLVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNR 347 (517) T ss_pred eeeeeecccc--ceeeecccccceeeeeecCCcccccccceeeEEeeHhhhhhhhhhhHHHHHHhhhccHHHHHHHHHHH Confidence 8877654443 35567777777788999999999999999999999999999999999999998777 99999999 Q ss_pred HHHHHHHHHHHHHHhccCcccccccccccccc-ccccccchhhhhHHHHHHHHhhhhcC--CCcEEEEcHHHHHHHHHhh Q lcl|Aclame:pro 134 IAEAFYKKFDEAGILNQGNNPFGKSIAQSIEK-TNKVIKGDFTQDNIIDLEALLEDDEL--EANAFISKTQNRSLLRKIV 210 (324) Q Consensus 134 l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~-~~~~~~~~~~~~~i~~~~~~l~~~~~--~~~~~v~~~~~~~~l~~~~ 210 (324) |+++++.++|.++++|+|++....++...... ......++.+ +.+++..+...+. .++.|+||+.+|.+|+++| T Consensus 348 l~~~l~~~ee~a~l~GdGtg~~~~gi~~~a~~~~~~~~~~~~~---~~d~i~~l~~a~~~a~~a~~vmn~~t~~~I~klK 424 (517) T protein:vir:97 348 LPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTN---IQELLEKLSVATPKAADSTLVIHRNDLAAIRFLK 424 (517) T ss_pred HHHHHHHHHHHHHhcccCCCcccccccccccccccccccccch---HHHHHHHHHHHhhhccCCEEEECHHHHHHHHHhh Confidence 99999999999999999987655444432221 1111222233 3344444443333 3678999999999999999 Q ss_pred ccCCceeecc----CCcceeecceeEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhc Q lcl|Aclame:pro 211 DPETKERIYD----RNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQ 286 (324) Q Consensus 211 d~~g~~~~~~----~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~ 286 (324) |++|+|+|.+ +...+++|..-+ .+...+ +...+++.+.+.++.+.++.+..+-+ +.+ T Consensus 425 D~~G~Yl~~~~~~~~~~~~l~G~~~~-~~~~~~--~~~~~~~~~~y~i~~~~g~~~~~~fd----------------~~~ 485 (517) T protein:vir:97 425 DKNGNYVFPVGVSNQTIATHFGFNRL-VQSVAV--DEKTAVSLSGYVTNGSRGMEFEQGTI----------------LVE 485 (517) T ss_pred cCCCCeeccCcCCcccccccCCcccc-cccccc--CceeEeeccccEEEeecceeeeeeee----------------ccc Confidence 9999999964 345667774211 122222 23334556677776666655432210 346 Q ss_pred CcEEEEEEEEeccEEeccCceEEEEeecCCCC Q lcl|Aclame:pro 287 DMVALRATMHVALHIADDKAFAKLVPADKRTD 318 (324) Q Consensus 287 ~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~ 318 (324) |+..|+.++|+++.|..+++|++.+..++..+ T Consensus 486 n~~~f~~~~~~~g~i~~~~r~a~~~~~p~~~~ 517 (517) T protein:vir:97 486 NNKEYLFEMPISGSLEYKGTTAYGTYTPPVAG 517 (517) T ss_pred CceeEeeeeeeccccccccceEEEEEcCCCCC Confidence 78889999999999999999999887776665 No 103 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=100.00 E-value=3.2e-34 Score=204.20 Aligned_cols=261 Identities=16% Similarity=0.140 Sum_probs=213.9 Q ss_pred ccccccccCccccchHHHHHHHHHHHhhhhhhhhccee----ecCCCceEEEEEeCCcceeeeccCccccccccceeeEE Q lcl|Aclame:pro 27 DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYE----PMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNAT 102 (324) Q Consensus 27 ~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~----~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 102 (324) .-.++++.+..++|+.++..+++.+.+.+.+.+++... ...+.+++||+++..+.+.|++||+.+|.+++++++++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~~~~~ 80 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGFKKTT 80 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccccccccceEE Confidence 21223445667899999999999999999888876542 23466799999988889999999999999999999999 Q ss_pred eeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhhhHHHHH Q lcl|Aclame:pro 103 MRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDL 182 (324) Q Consensus 103 l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~ 182 (324) +.+++++..+.+|++++.++.+++.+.+.+++++++++++|+.++....+ .....++..+++.+.++ T Consensus 81 ~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~-------------a~~~~~~~~t~d~i~da 147 (272) T protein:vir:30 81 MTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSK-------------STQTVEATATVDGVSKA 147 (272) T ss_pred EEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcc-------------cccccccccCHHHHHHH Confidence 99999999999999999999999999999999999999999999853211 11122345678999999 Q ss_pred HHHhhhhcCCCcEEEEcHHHHHHHHHhhccC-------CceeeccCCcceeecceeEeecCCCCCCceeEEeecccEEEE Q lcl|Aclame:pro 183 EALLEDDELEANAFISKTQNRSLLRKIVDPE-------TKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYG 255 (324) Q Consensus 183 ~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~-------g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~ 255 (324) +.++.+.+.....|+|||.++..|++.+..+ |...+..+..++++|+||++++. +++++.++.+...+.++ T Consensus 148 ~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~--~p~~t~~~~~~~a~~~~ 225 (272) T protein:vir:30 148 LDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRK--CPKGTAYMVRKGALRIM 225 (272) T ss_pred HHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCC--CCcceEEEEcCCeEEEE Confidence 9999999888899999999999998754221 22334456667999999999875 55778888888888888 Q ss_pred EecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCC Q lcl|Aclame:pro 256 IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTD 318 (324) Q Consensus 256 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~ 318 (324) .+++++++.+++. .++...+++..|+++++.+|+++++++.++++-. T Consensus 226 ~~~~~~ve~~r~~----------------~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:30 226 LKRNTMVETDRDI----------------TKAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred ecCCceeeecccc----------------ccceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 8888888877654 2355778999999999999999999999999877 No 104 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=100.00 E-value=3.2e-34 Score=204.20 Aligned_cols=261 Identities=16% Similarity=0.140 Sum_probs=213.9 Q ss_pred ccccccccCccccchHHHHHHHHHHHhhhhhhhhccee----ecCCCceEEEEEeCCcceeeeccCccccccccceeeEE Q lcl|Aclame:pro 27 DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYE----PMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNAT 102 (324) Q Consensus 27 ~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~----~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 102 (324) .-.++++.+..++|+.++..+++.+.+.+.+.+++... ...+.+++||+++..+.+.|++||+.+|.+++++++++ T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~~~~~ 80 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGFKKTT 80 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCcccccccccceEE Confidence 21223445667899999999999999999888876542 23466799999988889999999999999999999999 Q ss_pred eeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhhhHHHHH Q lcl|Aclame:pro 103 MRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDL 182 (324) Q Consensus 103 l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~ 182 (324) +.+++++..+.+|++++.++.+++.+.+.+++++++++++|+.++....+ .....++..+++.+.++ T Consensus 81 ~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~-------------a~~~~~~~~t~d~i~da 147 (272) T protein:vir:98 81 MTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSK-------------STQTVEATATVDGVSKA 147 (272) T ss_pred EEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcc-------------cccccccccCHHHHHHH Confidence 99999999999999999999999999999999999999999999853211 11122345678999999 Q ss_pred HHHhhhhcCCCcEEEEcHHHHHHHHHhhccC-------CceeeccCCcceeecceeEeecCCCCCCceeEEeecccEEEE Q lcl|Aclame:pro 183 EALLEDDELEANAFISKTQNRSLLRKIVDPE-------TKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYG 255 (324) Q Consensus 183 ~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~-------g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~ 255 (324) +.++.+.+.....|+|||.++..|++.+..+ |...+..+..++++|+||++++. +++++.++.+...+.++ T Consensus 148 ~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~--~p~~t~~~~~~~a~~~~ 225 (272) T protein:vir:98 148 LDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRK--CPKGTAYMVRKGALRIM 225 (272) T ss_pred HHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCC--CCcceEEEEcCCeEEEE Confidence 9999999888899999999999998754221 22334456667999999999875 55778888888888888 Q ss_pred EecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCC Q lcl|Aclame:pro 256 IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTD 318 (324) Q Consensus 256 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~ 318 (324) .+++++++.+++. .++...+++..|+++++.+|+++++++.++++-. T Consensus 226 ~~~~~~ve~~r~~----------------~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:98 226 LKRNTMVETDRDI----------------TKAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred ecCCceeeecccc----------------ccceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 8888888877654 2355778999999999999999999999999877 No 105 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=99.94 E-value=2.7e-30 Score=182.65 Aligned_cols=284 Identities=11% Similarity=0.007 Sum_probs=163.6 Q ss_pred CchhHHHHHH---------------------------HHHHHh--hhhhHHh-hc-c--ccccccccCccccchHHHHHH Q lcl|Aclame:pro 1 MEQTQKLKLN---------------------------LQHFAS--NNVKPQV-FN-P--DNVMMHEKKDGTLMNEFTTPI 47 (324) Q Consensus 1 ~~~~~~~k~~---------------------------~~~~a~--~~~~~~~-~~-~--~~~~~~~~~~~~vp~~~~~~i 47 (324) .+...++.+. ++.+.. +...++. .+ . ........+++.+|+++.+.+ T Consensus 151 ~el~akl~el~k~~ee~k~~~~~~~~~~~~~~~~~~e~r~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 230 (480) T protein:vir:40 151 RELEAKVEELNKEREELKKEREASIPSEKPEDAERKFMRELGSKMAEMPEQGFLREFANGADLNVVNSLGSITSKYARKS 230 (480) T ss_pred hhHHHHHHHHHhHHHHHhhhhhhhccccchhhhhhHHHHHHHHHhccchhhhhhhhhhhhccccccccccccccchhhhe Confidence 1100000000 000000 0000000 00 0 000111112223333333332 Q ss_pred HHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccCcccccc--ccceeeEEee---heeeEEeeeehHHHhhcC Q lcl|Aclame:pro 48 LQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETS--KATWVNATMR---AFKLGVILPVTKEFLNYT 122 (324) Q Consensus 48 ~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~--~~~~~~v~l~---~~k~~~~~~iS~e~l~ds 122 (324) .......++....++. ...+.....|++|....+.. ...+....+. .++++....+|+++++|+ T Consensus 231 ~~~~~~~~~~~~~~~~-----------~~~g~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~~~k~t~~lLDDa 299 (480) T protein:vir:40 231 GIYDGAMKARFQGLTL-----------AEDGVDDTFISGTFKAGTDKNKSQTATKRSLRPQMAEAYLQMDKATVRGVNDS 299 (480) T ss_pred eechhhhhhhhhccee-----------eeccccceeeeeeeecccccccccccccchhhHHHHHHHHHhHHHHHHHhhhh Confidence 2222222222221111 11233456687776544432 2233444444 478888899999999987 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhccCcccc-ccccccccccccccccchhhh-hHHHHHHHHhhhhcCCCc-EEEEc Q lcl|Aclame:pro 123 YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPF-GKSIAQSIEKTNKVIKGDFTQ-DNIIDLEALLEDDELEAN-AFISK 199 (324) Q Consensus 123 ~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~-~~~~~~~~~~~~~~~~~~~~~-~~i~~~~~~l~~~~~~~~-~~v~~ 199 (324) . ++++||..+|++.++.+++.+|++|+|++.. +.++..... . .+...+. +.|.+++.++...++.++ .|+|| T Consensus 300 ~-~l~~~i~~~l~~~~~~~ee~a~l~G~g~g~~~~~g~~~~~~---~-~~~~~~~~d~id~L~~al~~~y~~~a~~~vmn 374 (480) T protein:vir:40 300 G-ALSEYVMSEMVNRVIQKVEYNMILGSVDGSNGFYGLKTATD---G-WTKQIEYTDLFEGITDAVAECSISDAITIVMS 374 (480) T ss_pred H-HHHHHHHHHHHHHHHHHHHHHhhccCCCCccccccceeecc---c-ccccchhHHHHHHHHHhhhHHhhCCCCEEEEC Confidence 6 8999999999999999999999999665532 222222111 1 1122333 445568899998888877 69999 Q ss_pred HHHHHHHHHhhccCCceeecc----CCcceeecceeEeecCCCCCCceeEEeecc-cEEEEEecceEEEEeeccceeccc Q lcl|Aclame:pro 200 TQNRSLLRKIVDPETKERIYD----RNSDSLDGLPVVNLKSSNLKRGELITGDFD-KLIYGIPQLIEYKIDETAQLSTVK 274 (324) Q Consensus 200 ~~~~~~l~~~~d~~g~~~~~~----~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s-~~~~~~~~~~~~~~~~~~~~~~~~ 274 (324) +.+|++|+++||++|+|+|++ +.+.+|+|+||++... .++.+...++.++ ++.+++++ ++. .+. T Consensus 375 ~~t~~~I~klKD~~G~Yi~q~~~~~~~~~~llG~pvv~~~~-~~~~~~~~~~~~~~~~~~~d~~-~~~--~~~------- 443 (480) T protein:vir:40 375 PQTFAELRKAKGTDGHSRFNELATKEQIAQSFGAVNLETRV-WMPKDEVAVYNHDEYVLIGDLN-VEN--YND------- 443 (480) T ss_pred HHHHHHHHHhhcCCCCeeccCcccccCcceecccceeeeec-cccCCcceeeeCCccEEEEecc-cce--ecc------- Confidence 999999999999999999975 4578999999876532 2333333344444 55677763 221 111 Q ss_pred cccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCC Q lcl|Aclame:pro 275 NEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTD 318 (324) Q Consensus 275 ~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~ 318 (324) ..+..++..|+++.|+++.+.+|+|++.|+.+.-=+. T Consensus 444 -------~~~~~~~~~~~~e~~v~g~~~~~~~~~~~~~~~~~~~ 480 (480) T protein:vir:40 444 -------FDLRYNVEQWLSETLVGGSIRGKNRSAYLKKKGSLGV 480 (480) T ss_pred -------cccccchhhhhhhhhhceeeEccccEEEEEeccCcCC Confidence 0145788889999999999999999999998665555 No 106 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.91 E-value=6.3e-26 Score=158.73 Aligned_cols=263 Identities=14% Similarity=0.089 Sum_probs=208.1 Q ss_pred hccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceee---c-CCCceEEEEEeCCcceeeeccCcccccccccee Q lcl|Aclame:pro 24 FNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEP---M-EGTEKKFTFWADKPGAYWVGEGQKIETSKATWV 99 (324) Q Consensus 24 ~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~---~-~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~ 99 (324) +. + ..+.-+.-++|+.|...+.+.+.+...+.+++.... . .+..++||+++..+.+.++.||+.++.++.+.+ T Consensus 1 ma--~-~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~~i~~~~it~~ 77 (274) T protein:vir:93 1 MP--Q-GITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETK 77 (274) T ss_pred CC--c-cceehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCCCcccccccccc Confidence 11 1 223345668899999999999999888888875532 1 355799999987778899999999999999999 Q ss_pred eEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhhhHH Q lcl|Aclame:pro 100 NATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNI 179 (324) Q Consensus 100 ~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i 179 (324) ..++..++.+..+.++++....+..++.+.+.+++++++++++|+.++..-.+.. .......++++.+ T Consensus 78 ~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~------------~~~~~~~~~~d~i 145 (274) T protein:vir:93 78 KREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK------------LTVNADITKLNGL 145 (274) T ss_pred eeEEEeeeecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccc------------ccccccccCHHHH Confidence 9999999999999999999999989999999999999999999999886432211 1112234578999 Q ss_pred HHHHHHhhhhcCCCcEEEEcHHHHHHHHHhh------cc-CCceeeccCCcceeecceeEeecCCCCCCceeEEeecccE Q lcl|Aclame:pro 180 IDLEALLEDDELEANAFISKTQNRSLLRKIV------DP-ETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKL 252 (324) Q Consensus 180 ~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~------d~-~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~ 252 (324) .++..++.+.......++|||.++..|++.. ++ .|..++..+..++++|++|++++. ++.++.++.....+ T Consensus 146 ~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~--~p~~t~~l~~~gai 223 (274) T protein:vir:93 146 QSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNK--LEAGTAILAKKGAV 223 (274) T ss_pred HHHHHHhhhccCCccEEEeCHHHHHHHHhhhhhcccccccccccceeecccceecCeeEEEcCC--CCcceEEEEeCCeE Confidence 9999999988878889999999999997532 11 234556667788999999999765 55778887777777 Q ss_pred EEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCC Q lcl|Aclame:pro 253 IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDS 319 (324) Q Consensus 253 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~ 319 (324) .++...++.++.+|+.. +....+++..++++++.+|+++++++.+++.--- T Consensus 224 ~~~~~~~~~vE~~Rd~~----------------~~~d~i~~~~~y~~~~~~~~~~v~~t~~~~s~~~ 274 (274) T protein:vir:93 224 KLILKRDFFLEVARDAS----------------TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred EEEecCCcccccccchh----------------hcccEEEEEEEEEEEEEcCCceEEEeeCccccCC Confidence 77777888888776542 2345789999999999999999999987776533 No 107 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.90 E-value=1.7e-25 Score=156.44 Aligned_cols=259 Identities=14% Similarity=0.115 Sum_probs=197.9 Q ss_pred cccccccccCccccchHHHHHHHHHHHhhhhhhhhcceee----cCCCceEEEEEeCCcceeeeccCccccccccceeeE Q lcl|Aclame:pro 26 PDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEP----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNA 101 (324) Q Consensus 26 ~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~----~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v 101 (324) ..+ +.+..+..++|+.|...+.+.+.+...+.+++...+ ..+..++||++.....+.++.||+.++.++.+.++. T Consensus 1 ma~-~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg~~i~~~~lt~~~~ 79 (272) T protein:vir:36 1 MSK-QKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEGGEISLDKIGTTTK 79 (272) T ss_pred CCC-cceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccccccCCCCccChhhcCCcce Confidence 111 123335567799999988899988888888875533 236779999998777889999999999999999999 Q ss_pred EeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhhhHHHH Q lcl|Aclame:pro 102 TMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIID 181 (324) Q Consensus 102 ~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~ 181 (324) ++..++++..+.++++....+..++.+.+.++++.++++++|+.++....+ .....+...+++.+.+ T Consensus 80 ~~~i~~~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~-------------~~~~~~~~~~~d~i~~ 146 (272) T protein:vir:36 80 SVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKT-------------TSQTVSTKANVDGVQA 146 (272) T ss_pred eEeeehhhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcc-------------ccccccccccHHHHHH Confidence 999999999999999999999899999999999999999999998853211 1112234567899999 Q ss_pred HHHHhhhhcCCCcEEEEcHHHHHHHHHhhc------cCCceeeccCCcceeecceeEeecCCCCCCce---eEEeecccE Q lcl|Aclame:pro 182 LEALLEDDELEANAFISKTQNRSLLRKIVD------PETKERIYDRNSDSLDGLPVVNLKSSNLKRGE---LITGDFDKL 252 (324) Q Consensus 182 ~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d------~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~---~i~gd~s~~ 252 (324) +..++.+.......++|||.++..|++... ..|..++..+..++++|++|+++...+...+. ++++. ..+ T Consensus 147 A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~p~~~~~~~~~~~~~-gA~ 225 (272) T protein:vir:36 147 ALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNS-PAL 225 (272) T ss_pred HHHHhhhcCCCceEEEEcHHHHHHHhcccccccccccccccceeeeccceecCeeEEEeCCCCCCceeEEEEEecc-cce Confidence 999999888888899999999999976542 33445555666789999999998876655442 22221 223 Q ss_pred EEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecC Q lcl|Aclame:pro 253 IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) Q Consensus 253 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~ 315 (324) .++..+++.+|.+|+.. +....+++..+++.++.+|+++++++.+-- T Consensus 226 ~~~~~~~~~vE~~R~~~----------------~~~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 226 KLVLKRGVQVETDRDIV----------------TKTTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred eeeecCCcccccccchh----------------hcCcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 34556677777665432 223468899999999999999999987655 No 108 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.89 E-value=8.9e-25 Score=152.44 Aligned_cols=265 Identities=18% Similarity=0.128 Sum_probs=206.4 Q ss_pred cccccccccCccccchHHHHHHHHHHHhhhhhhhhcceee----cCCCceEEEEEeCCcceeeeccCccccccccceeeE Q lcl|Aclame:pro 26 PDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEP----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNA 101 (324) Q Consensus 26 ~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~----~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v 101 (324) ..+. .+.-+.-++|+.+..-+.+.+.+...+.+++.... ..+..++||.+.....+.++.||+.++..+.+.++. T Consensus 1 Ma~~-~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~i~~~~lt~~~~ 79 (276) T protein:vir:10 1 MAQG-TTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQKIPVDKIETNRR 79 (276) T ss_pred CCcc-eeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCCCccCcccccccee Confidence 1111 23345567799999999999999999888876532 357789999998778889999999999999999999 Q ss_pred EeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhhhHHHH Q lcl|Aclame:pro 102 TMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIID 181 (324) Q Consensus 102 ~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~ 181 (324) +...++++..+.++++....+..|+.+.+.++++.++++++|+.++.-..+ ......++.++++.+.+ T Consensus 80 ~a~i~~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~------------~~~~~~~~~~t~d~i~~ 147 (276) T protein:vir:10 80 EAKIHKIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRG------------TKLTVSADIGTLAGLEA 147 (276) T ss_pred eEEeehccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhc------------ccccccccccCHHHHHH Confidence 999999999999999999999899999999999999999999988842111 11112234567899999 Q ss_pred HHHHhhhhcCCCcEEEEcHHHHHHHHHhhcc-------CCceeeccCCcceeecceeEeecCCCCCCceeEEeecccEEE Q lcl|Aclame:pro 182 LEALLEDDELEANAFISKTQNRSLLRKIVDP-------ETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIY 254 (324) Q Consensus 182 ~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~-------~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~ 254 (324) +..++.+.......++|||..+..|++.... .|..++..+..++++|++|+++.. ++.++.++.....+.+ T Consensus 148 A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~--~p~~t~~l~~~gAi~~ 225 (276) T protein:vir:10 148 AIDTFDDEDLEPMVLFINPKDAGKLRSSASDNFTRATELGDNIIVKGAFGEALGAVIVRSKK--LDEGEAILAKRGAVKL 225 (276) T ss_pred HHHHhccccCcccEEEEcHHHHHHHHHhccccccccccccccceeccccceecceeEEEcCC--CCcceEEEEeccceee Confidence 9999988777888999999999999875422 234455566778999999999775 4566766655555556 Q ss_pred EEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCCCCC Q lcl|Aclame:pro 255 GIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGE 323 (324) Q Consensus 255 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~~~~ 323 (324) +..+++.+|.+|+.. +....+++..+++.++.+|+.+++++.+++..++ |. T Consensus 226 ~~~~~~~vE~dRd~~----------------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~--~~ 276 (276) T protein:vir:10 226 ITKRDFFLETDRDPS----------------TKTTALYSDKHYVAYLYDESKAVKVTKGAGTTDS--GA 276 (276) T ss_pred eecCCceeecccchh----------------hcccEEEEeeEEEEEEEcCcceEEEecCCcCCcC--CC Confidence 777888888887653 2345688899999999999999999987644333 33 No 109 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=99.89 E-value=1.2e-24 Score=151.66 Aligned_cols=264 Identities=16% Similarity=0.087 Sum_probs=205.6 Q ss_pred hccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceee----cCCCceEEEEEeCCcceeeeccCcccccccccee Q lcl|Aclame:pro 24 FNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEP----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWV 99 (324) Q Consensus 24 ~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~----~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~ 99 (324) +...+ .+.-+.-++|+.+...+.+.+.+...+.+++..-+ ..+..++||++...+.+.++.||+.++..+.+.+ T Consensus 1 ~~~~~--~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~ 78 (275) T protein:vir:96 1 MALEN--MTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEGEEIPIDLIETK 78 (275) T ss_pred CCCcc--cchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeeccCCccccccCCCCcchhhcccc Confidence 22222 22334467799999999999999888888875533 2466799999987778889999999999999999 Q ss_pred eEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhhhHH Q lcl|Aclame:pro 100 NATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNI 179 (324) Q Consensus 100 ~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i 179 (324) +.++..++++..+.++++....+..|+.+.+.++++.++++++|+.++.--++. .....+..++++.+ T Consensus 79 ~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a------------~~~~~~~~~~~d~i 146 (275) T protein:vir:96 79 KRQATIRKIGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGA------------TLKVEADITKLAGL 146 (275) T ss_pred eeeEEeehhcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcc------------cccccccccCHHHH Confidence 999999999999999999998888899999999999999999999988533221 11122345678999 Q ss_pred HHHHHHhhhhcCCCcEEEEcHHHHHHHHHhh-------ccCCceeeccCCcceeecceeEeecCCCCCCceeEEeecccE Q lcl|Aclame:pro 180 IDLEALLEDDELEANAFISKTQNRSLLRKIV-------DPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKL 252 (324) Q Consensus 180 ~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~-------d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~ 252 (324) .++..++.+.......++|||..+..|++.. +..|..++..+..++++|++|+++... +.++.++.....+ T Consensus 147 ~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~~--p~~t~~i~~~gA~ 224 (275) T protein:vir:96 147 QTAIDKFNDEDLEPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGEALGAIIVRSNKI--KEGEAILAKRGAV 224 (275) T ss_pred HHHHHHhccccCCccEEEeCHHHHHHHHhcccccccccccccccceeccccceecCeeEEEeCCC--CcceEEEEeccce Confidence 9999999887777889999999999997753 122455566777889999999998764 4555555444455 Q ss_pred EEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCC Q lcl|Aclame:pro 253 IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDS 319 (324) Q Consensus 253 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~ 319 (324) .++...++.+|.+|+.. +....+++..+++.++.+|+++++++..+++=++ T Consensus 225 ~~~~~~~~~vE~~Rd~~----------------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 275 (275) T protein:vir:96 225 KLITKRDFFLETERHAS----------------HKSTALFSDKHYVAYLYDESKVVKITKSASGLGV 275 (275) T ss_pred eeeecCCcccccccchh----------------hcCcEEEEeEEEEEEEEcCccEEEEEecccccCC Confidence 56667777888777542 2345688999999999999999999998888666 No 110 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.88 E-value=1.1e-23 Score=146.52 Aligned_cols=263 Identities=14% Similarity=0.097 Sum_probs=202.1 Q ss_pred hccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceee----cCCCceEEEEEeCCcceeeeccCcccccccccee Q lcl|Aclame:pro 24 FNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEP----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWV 99 (324) Q Consensus 24 ~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~----~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~ 99 (324) +. +. ++..+.-++|+.+...+.+.+.+...+.+++...+ -.+..++||++...+.+..+.||+.++.++.+.+ T Consensus 1 ma--~~-~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~~g~~i~~~~it~~ 77 (274) T protein:vir:96 1 MA--QG-TTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQVIAEGEKIPVDQIGTS 77 (274) T ss_pred CC--cc-ccchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCccccCCCCcCchhhcccc Confidence 11 11 12335667899999999999888887777765432 1366799999986678888999999999999999 Q ss_pred eEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhhhHH Q lcl|Aclame:pro 100 NATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNI 179 (324) Q Consensus 100 ~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i 179 (324) ..++..++++..+.++++....+..++.+.+.++++.++++.+|+.++....+. .....+..++++.+ T Consensus 78 ~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a------------~~~~~~~~~~~d~i 145 (274) T protein:vir:96 78 KREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGA------------TLTVEADITKLDGL 145 (274) T ss_pred eeEEEEEeeeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcC------------CCCcCcccccHHHH Confidence 999999999999999999999888999999999999999999999888643211 11122345678999 Q ss_pred HHHHHHhhhhcCCCcEEEEcHHHHHHHHHhh------cc-CCceeeccCCcceeecceeEeecCCCCCCceeEEeecccE Q lcl|Aclame:pro 180 IDLEALLEDDELEANAFISKTQNRSLLRKIV------DP-ETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKL 252 (324) Q Consensus 180 ~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~------d~-~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~ 252 (324) .++..++.+.......++|||..+..|++.. +. .|..++..+..++++|++|+++.. ++.++.++.....+ T Consensus 146 ~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~g~ig~~~G~~Vi~s~~--~p~~t~~l~~~gA~ 223 (274) T protein:vir:96 146 QTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGAVIVRSNK--LNKGEALLAKKGAV 223 (274) T ss_pred HHHHHHhcccCCCceEEEeCHHHHHHHHhcccccccccccccccceeecccceecCeeEEEcCC--CCcceEEEEeCcce Confidence 9999999988778889999999999997753 11 234556667788999999998776 45566665555566 Q ss_pred EEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCC Q lcl|Aclame:pro 253 IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDS 319 (324) Q Consensus 253 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~ 319 (324) .++...++.++.+|+.. +....+++..+|+.++++|+++++++.+++-..- T Consensus 224 ~~~~~~~~~vE~~Rd~~----------------~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:96 224 KLITKRDFFLEKDRDAS----------------RKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) T ss_pred eeeecCCcccccccchh----------------hcccEEEEeeEEEEEEEcCccEEEEEcCcccccC Confidence 66677777777665432 2345688999999999999999999987765443 No 111 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=99.87 E-value=2.5e-23 Score=144.47 Aligned_cols=263 Identities=14% Similarity=0.084 Sum_probs=205.2 Q ss_pred cccccccccCccccchHHHHHHHHHHHhhhhhhhhcceee----cCCCceEEEEEeCCcceeeeccCccccccccceeeE Q lcl|Aclame:pro 26 PDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEP----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNA 101 (324) Q Consensus 26 ~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~----~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v 101 (324) ..+ ..+.-+.-++|+.|...+.+.+.+...+.+++.... ..+..++||++.....+..+.||+.++..+.+.+.. T Consensus 1 ma~-~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~lt~~~~ 79 (274) T protein:vir:97 1 MPQ-GLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKR 79 (274) T ss_pred CCc-cceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccccccccccee Confidence 111 122335568899999999999888877777775532 246779999998667888999999999999999999 Q ss_pred EeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhhhHHHH Q lcl|Aclame:pro 102 TMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIID 181 (324) Q Consensus 102 ~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~ 181 (324) ++..++.+..+.++++....+..++.+.+.++++.++++++|+.++.--.+.. ....+..++++.+.+ T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~------------~~~~~~~~~~d~i~d 147 (274) T protein:vir:97 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK------------LTVNADITKLNGLQS 147 (274) T ss_pred EEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccC------------ccccccccCHHHHHH Confidence 99999999999999999999889999999999999999999999885422211 111224567899999 Q ss_pred HHHHhhhhcCCCcEEEEcHHHHHHHHHhh------cc-CCceeeccCCcceeecceeEeecCCCCCCceeEEeecccEEE Q lcl|Aclame:pro 182 LEALLEDDELEANAFISKTQNRSLLRKIV------DP-ETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIY 254 (324) Q Consensus 182 ~~~~l~~~~~~~~~~v~~~~~~~~l~~~~------d~-~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~ 254 (324) +..++.+.......++|||..+..|++.. .+ .|..++..+..++++|++|++++.. +.++.++.....+.+ T Consensus 148 A~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~--p~~t~~l~~~gA~~~ 225 (274) T protein:vir:97 148 AIDKFNDEDLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKL--EAGTAILAKKGAVKL 225 (274) T ss_pred HHHHhhccCCCceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCeeEEEcCCC--CcceEEEEeCcceEe Confidence 99999988878889999999999997631 11 2455666777889999999998764 566776666666677 Q ss_pred EEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCC Q lcl|Aclame:pro 255 GIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDS 319 (324) Q Consensus 255 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~ 319 (324) +...++.++.+|+.. +....+++..+++.++.+|+.+++++.+.|..-- T Consensus 226 ~~~~~~~vE~~Rd~~----------------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:97 226 ILKRDFFLEVARDAS----------------TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred eecCCceeccccchh----------------hcccEEEEEEEEEEEEEcCCceEEEecCcccccC Confidence 777888888777642 2234688889999999999999999987776533 No 112 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=99.87 E-value=2.5e-23 Score=144.47 Aligned_cols=263 Identities=14% Similarity=0.084 Sum_probs=205.2 Q ss_pred cccccccccCccccchHHHHHHHHHHHhhhhhhhhcceee----cCCCceEEEEEeCCcceeeeccCccccccccceeeE Q lcl|Aclame:pro 26 PDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEP----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNA 101 (324) Q Consensus 26 ~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~----~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v 101 (324) ..+ ..+.-+.-++|+.|...+.+.+.+...+.+++.... ..+..++||++.....+..+.||+.++..+.+.+.. T Consensus 1 ma~-~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~~~~~g~~i~~~~lt~~~~ 79 (274) T protein:vir:94 1 MPQ-GLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETKKR 79 (274) T ss_pred CCc-cceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccccccccccee Confidence 111 122335568899999999999888877777775532 246779999998667888999999999999999999 Q ss_pred EeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhhhHHHH Q lcl|Aclame:pro 102 TMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIID 181 (324) Q Consensus 102 ~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~ 181 (324) ++..++.+..+.++++....+..++.+.+.++++.++++++|+.++.--.+.. ....+..++++.+.+ T Consensus 80 ~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~------------~~~~~~~~~~d~i~d 147 (274) T protein:vir:94 80 EAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK------------LTVNADITKLNGLQS 147 (274) T ss_pred EEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccC------------ccccccccCHHHHHH Confidence 99999999999999999999889999999999999999999999885422211 111224567899999 Q ss_pred HHHHhhhhcCCCcEEEEcHHHHHHHHHhh------cc-CCceeeccCCcceeecceeEeecCCCCCCceeEEeecccEEE Q lcl|Aclame:pro 182 LEALLEDDELEANAFISKTQNRSLLRKIV------DP-ETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIY 254 (324) Q Consensus 182 ~~~~l~~~~~~~~~~v~~~~~~~~l~~~~------d~-~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~ 254 (324) +..++.+.......++|||..+..|++.. .+ .|..++..+..++++|++|++++.. +.++.++.....+.+ T Consensus 148 A~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~--p~~t~~l~~~gA~~~ 225 (274) T protein:vir:94 148 AIDKFNDEDLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKL--EAGTAILAKKGAVKL 225 (274) T ss_pred HHHHhhccCCCceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCeeEEEcCCC--CcceEEEEeCcceEe Confidence 99999988878889999999999997631 11 2455666777889999999998764 566776666666677 Q ss_pred EEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCC Q lcl|Aclame:pro 255 GIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDS 319 (324) Q Consensus 255 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~ 319 (324) +...++.++.+|+.. +....+++..+++.++.+|+.+++++.+.|..-- T Consensus 226 ~~~~~~~vE~~Rd~~----------------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:94 226 ILKRDFFLEVARDAS----------------TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred eecCCceeccccchh----------------hcccEEEEEEEEEEEEEcCCceEEEecCcccccC Confidence 777888888777642 2234688889999999999999999987776533 No 113 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=99.87 E-value=6.5e-24 Score=147.70 Aligned_cols=311 Identities=13% Similarity=0.067 Sum_probs=208.7 Q ss_pred CchhHHHHHHHHHHHh---hhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEE Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFAS---NNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFW 76 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~---~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~-~~~~~ip~~ 76 (324) ++.++..-+.+++|+- ...-..+.+.....++.++..+||..+++-+.+...+.....+++..+... +.+..+|.. T Consensus 42 ~~~~~~e~el~E~f~Kmm~G~~p~~eV~~~e~mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~~Grsm~F~~~ 121 (393) T protein:vir:79 42 LALNEEETQILESFAKMMEGETPTNEVNLREFMATPSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRLKSGQSMIFPSI 121 (393) T ss_pred hhcchhHHHHHHHHHHHhcCCCchhheehhhhhcCCCcceechhhhhhhhhhcccchhHHHHHHHHHhhhcCcceeccch Confidence 3444444444555554 222223355545577888999999999999999999999999998888774 333444443 Q ss_pred eCCcceeeeccCccccccc---cceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCcc Q lcl|Aclame:pro 77 ADKPGAYWVGEGQKIETSK---ATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNN 153 (324) Q Consensus 77 ~~~~~a~~v~Eg~~~~~~~---~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~ 153 (324) +.-.+.-|+||+++|+.. .+++.++++..|+|..+.+|+|+++||.+++.+++.+.+.++++++.|..++++.-++ T Consensus 122 -g~~Ra~~IgEGgE~~~~sld~~T~dsv~~~~gK~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~ 200 (393) T protein:vir:79 122 -GIMRAYDVAEGQEIPEDSIDWQTHESPEIRVGKSGIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSH 200 (393) T ss_pred -heeeeccccccccccccchhhhcCCceeEEechhhhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhcc Confidence 355678899999999865 4578999999999999999999999999999999999999999999999999998665 Q ss_pred cc--ccccccccc-c-----ccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccC-------Cceee Q lcl|Aclame:pro 154 PF--GKSIAQSIE-K-----TNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPE-------TKERI 218 (324) Q Consensus 154 ~~--~~~~~~~~~-~-----~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~-------g~~~~ 218 (324) .+ ..++..+.. . .....+++++++|+.+++.++....+.+++++|||-.|..+.+-.--. |++-. T Consensus 201 ghtvfDa~st~t~ahptGr~~~~~qNGTlSleDllDm~~av~~~hyt~svi~MHPLAWnv~AKna~me~~~~na~gN~~~ 280 (393) T protein:vir:79 201 GHTVFDNYSTNKLAHTTGLDKNGVQNDTFSAEDFLDLIIAVMANEYTPSDLMMHPLAWTVFAKNELMGSLQANPYGNYPA 280 (393) T ss_pred cceeeeccccCccceeecCCccccccccccHHHHHHHHHHHhcccCCcceEEEcCchhhhhhhhhhhcceeeccccccCc Confidence 54 222222111 1 123467899999999999999999999999999999999987643222 21111 Q ss_pred ccCCcceeec-----------ceeEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcC Q lcl|Aclame:pro 219 YDRNSDSLDG-----------LPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQD 287 (324) Q Consensus 219 ~~~~~~~l~G-----------~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~ 287 (324) ..-...+.+| +.|+++|-.+.....--| +++..++....+..-++ .+++.+-.+ -..| T Consensus 281 ~~~~ts~algp~~i~~~~~~nlnv~~sPfvp~d~k~~rF----d~~~Vd~NnvgvlLV~D-~i~tdq~dd------k~rd 349 (393) T protein:vir:79 281 KGAPSSMALGPDSIQGRLPFNFNVNLSPFIPLDKKSRRF----DVYAVDRNNVGVLLVRD-DLKTDQWDE------KARG 349 (393) T ss_pred cccchhhhhchhhhccccccceeEEEeccccccccccee----eEEEeecCCceEEEEec-Ccceecccc------cccc Confidence 1111122333 567777765543331111 33444444433333222 111111111 2468 Q ss_pred cEEEEEEEEeccEEec-cCceEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 288 MVALRATMHVALHIAD-DKAFAKLVPADKRTDSVPGEV 324 (324) Q Consensus 288 ~v~~r~~~r~d~~v~~-~~A~~~l~~~~~~~~~~~~~~ 324 (324) .+.++...|+|+.|++ .+|++..+...-.-++ |.-| T Consensus 350 iq~iKl~ERYG~gvLn~gkaiavakNI~~~k~y-~~P~ 386 (393) T protein:vir:79 350 LQNIKMIERYGIGILNEGKAIAVAKNISMDKSY-AEPM 386 (393) T ss_pred ceeeeeeeeeceeeeeCCceEEEEecceeeccc-ccch Confidence 8889999999999888 4667766655444333 2223 No 114 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=99.86 E-value=2.5e-23 Score=144.45 Aligned_cols=296 Identities=12% Similarity=0.085 Sum_probs=209.6 Q ss_pred CchhHHHHHHHHHHHhhhhh---HHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEe Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVK---PQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWA 77 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~---~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~ 77 (324) |-.-- .-..+|.-..+. ++. +..+.+-..++.+.|..+...|++.+.+.|.+++..+...+.++.+++++.+ T Consensus 1 ~~~~~---~~~~~~~~~~~~~~~p~l--~m~alTLaea~~l~~d~~~~~VIE~l~~~s~iL~~lpf~~ve~~~~~~~r~~ 75 (330) T protein:vir:94 1 MVRIC---TPPLRGRWRTLTHQFPEL--KMPTVTLAESAKLSQDHLVSGLIETIVEVNPLYEMMPFTEIEGNALAYNREN 75 (330) T ss_pred Cceec---CCccccceeehhcccccc--chhhhhhhHHhhcCchhhHHHHHHhhhccchHHhhcccccccCCcceeeeee Confidence 10000 000011111111 111 1123344457788899999999999999999999998888888889999999 Q ss_pred CCcceeeeccCccccccc-cceeeEEeeheeeEEeeeehHHHhh--cChHHHHHHHHHHHHHHHHHHHHHHHHhccCccc Q lcl|Aclame:pro 78 DKPGAYWVGEGQKIETSK-ATWVNATMRAFKLGVILPVTKEFLN--YTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP 154 (324) Q Consensus 78 ~~~~a~~v~Eg~~~~~~~-~~~~~v~l~~~k~~~~~~iS~e~l~--ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~ 154 (324) ..+.+.|+..++.++++. .+|.+++...+.+++.+.|.+++.+ .+..+...+-.+...+++++++++++|+|+.++. T Consensus 76 ~lp~a~~r~~n~~~~~~~~~Tf~q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~~e~~linGDs~~~ 155 (330) T protein:vir:94 76 VLGDVQFLAVGGTITAKNPATFTKVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQYQASMITGDGTGN 155 (330) T ss_pred cCCcceeeeccccccccCcceeeeeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCc Confidence 999999999999888765 5789999999999999999999965 4567888999999999999999999999987755 Q ss_pred cccccccccccccc----cccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeec-------cCCc Q lcl|Aclame:pro 155 FGKSIAQSIEKTNK----VIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIY-------DRNS 223 (324) Q Consensus 155 ~~~~~~~~~~~~~~----~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~-------~~~~ 223 (324) ...|+.......+. ...+.++.|++-.|+..+......++.|+||.++..+++.+....|++... +... T Consensus 156 ~F~GL~~~~~~~q~i~tg~~gg~~T~d~LDeLl~~v~~~~g~~~~~l~n~a~~r~I~a~~R~~~~~~v~~~~~~~~G~~v 235 (330) T protein:vir:94 156 SFQGMMGLVAASQTISAGANGGTLTFELLDQLLDLVKDKDGQVDYLMSSFAMRRKYFSLLRALGGAAIGEVMTLPSGRQI 235 (330) T ss_pred cccchhhcCCcccEEecCCCCCCCCHHHHHHHHHHhcCCCCCCcEEEechhHHHHHHHHHHhccCCCCCCcccccCCCEE Confidence 55555443332222 234677889999999999777778899999999999999998877765532 2233 Q ss_pred ceeecceeEeecCCCCCC--------ceeEEeecc-----cEEEEEe----cceEEEEeeccceeccccccccchhhhhc Q lcl|Aclame:pro 224 DSLDGLPVVNLKSSNLKR--------GELITGDFD-----KLIYGIP----QLIEYKIDETAQLSTVKNEDGTPVNLFEQ 286 (324) Q Consensus 224 ~~l~G~pv~~~~~~~~~~--------~~~i~gd~s-----~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~f~~ 286 (324) .++.|+|++.+...+... ..|++..|- +.+.|.. .++.++... ..=.+ T Consensus 236 ~~~~GvPi~~~d~ip~~~~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~glsVr~~G---------------~~~~k 300 (330) T protein:vir:94 236 PTYRGVPWFVNDFIPSNMTQGTATNATAIFAGTFDDGSNKYGIAGLTARGSAGLRVQNVG---------------AKENA 300 (330) T ss_pred eeeCCeEEEecccccCCCCcccCCCceeEEEEeecccccccceEeecCCCCCcceeeeCC---------------Ccccc Confidence 568899988765444322 234443331 2334442 122222110 01134 Q ss_pred CcEEEEEEEEeccEEeccCceEEEEeecCC Q lcl|Aclame:pro 287 DMVALRATMHVALHIADDKAFAKLVPADKR 316 (324) Q Consensus 287 ~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~ 316 (324) +...+++++|++.++.+++|+++|++..-+ T Consensus 301 ~v~~~~v~~y~~~av~~~~a~~~L~~V~~g 330 (330) T protein:vir:94 301 DETITRVKMYCGFANFSQLGLAAIKGLIPG 330 (330) T ss_pred ceeeEEEEEeeeeEEechhheeeeccccCC Confidence 567789999999999999999999998777 No 115 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.86 E-value=7.8e-23 Score=141.80 Aligned_cols=266 Identities=15% Similarity=0.116 Sum_probs=196.2 Q ss_pred cccccccccCccccchHHHHHHHHHHHhhhhhhhhcceee----cCCCceEEEEEeCCcceeeeccCccccccccceeeE Q lcl|Aclame:pro 26 PDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEP----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNA 101 (324) Q Consensus 26 ~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~----~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v 101 (324) ..+ .++.-+.-++|+.|...+.+.+.+...+.+++.... -.+..++||++.....+.++.|++.++..+++.++. T Consensus 1 Ma~-~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~~~~~g~~i~~~~lt~~~~ 79 (278) T protein:vir:80 1 MAD-LTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQDVAEGAAIDYSALETESV 79 (278) T ss_pred CCC-cceehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcceeecCCCcCccccccccee Confidence 111 223345678999999999999998888888765432 235678999998777888999999999999999999 Q ss_pred EeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhhhHHHH Q lcl|Aclame:pro 102 TMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIID 181 (324) Q Consensus 102 ~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~ 181 (324) ++..++.+..+.++++....+..++.+.+.++++.++++.+|+.++..-.+... .. ............++.+.+ T Consensus 80 ~~~i~~~~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~----~~--~~~~t~~~~~~~~~~~~d 153 (278) T protein:vir:80 80 KHGIKKAGKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTL----EV--KGAINIGLIDKIENTFTD 153 (278) T ss_pred eEeeehhhccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccc----cc--ccccccchhhhHHHHHHH Confidence 999999999999999999999899999999999999999999988864321110 00 000111112234566777 Q ss_pred HHHHhhhhcCCC-cEEEEcHHHHHHHHHhhc-------cCCceeeccCCcceeecceeEeecCCCCCCceeEEeecccEE Q lcl|Aclame:pro 182 LEALLEDDELEA-NAFISKTQNRSLLRKIVD-------PETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLI 253 (324) Q Consensus 182 ~~~~l~~~~~~~-~~~v~~~~~~~~l~~~~d-------~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~ 253 (324) +..++....... ..++|||..+..|++... ..|..++..+..++++|++|+++... +.++.++.....+. T Consensus 154 a~~~l~~~~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~~--p~~t~~l~~~gAi~ 231 (278) T protein:vir:80 154 APDAIEDESITTTGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGELLGWEIVRTKKL--ADGNALAVKAGALK 231 (278) T ss_pred HHHhhcccCCCcccEEEECHHHHHHHHhhhhhhccccccccccceeeccceeecceeEEEcCCC--CcceEEEEecccee Confidence 777776655543 358899999999976531 12455566677889999999998765 45666555555555 Q ss_pred EEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCC Q lcl|Aclame:pro 254 YGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKR 316 (324) Q Consensus 254 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~ 316 (324) ++...++.++.+|+.. +....+++..+++.++.+|+++++++..+.. T Consensus 232 ~~~~~~~~vE~~Rd~~----------------~~~d~i~~~~~yg~~v~~~~~~v~it~~a~~ 278 (278) T protein:vir:80 232 TFLKRNLLAESGRDMD----------------HKLTKFNADQHYAVALVDETKAVKVVPVAGN 278 (278) T ss_pred eeecCCcccccccchh----------------hccceeeeeeEEEEEEEcCcceEEEeeccCC Confidence 6677777777766542 2334688899999999999999999987776 No 116 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.85 E-value=1.4e-22 Score=140.37 Aligned_cols=262 Identities=10% Similarity=0.068 Sum_probs=201.2 Q ss_pred ccccccCccccchHHHHHHHHHHHhhhhhhhhcceee----cCCCceEEEEEeCCcceeeeccCccccccccceeeEEee Q lcl|Aclame:pro 29 VMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEP----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMR 104 (324) Q Consensus 29 ~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~----~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~ 104 (324) ...+.-+.-++|+.+..-+.+.+.+...+.+++...+ ..+..+++|.++...++.-+.||+.++..+.+.++.... T Consensus 1 Ma~T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg~~i~~~~lt~~~~~a~ 80 (270) T protein:vir:95 1 MTQTKKANLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEGVAMDTTQMSMTTTKVT 80 (270) T ss_pred CCceehhhhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCCCccccccCCCccchhhcccchheee Confidence 1112234456788888888888888888888876532 257789999998777888999999999999999999999 Q ss_pred heeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhhhHHHHHHH Q lcl|Aclame:pro 105 AFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEA 184 (324) Q Consensus 105 ~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~ 184 (324) .++++..+.++++....+..|..+.+.++++..+++++|+.++.-... .....+..++++++.+++. T Consensus 81 i~~~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~-------------a~~~~~~~~t~~~~~dA~~ 147 (270) T protein:vir:95 81 VKETGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELNK-------------SKQTATVSADATGILDAIE 147 (270) T ss_pred eehhhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhcc-------------cccccccccCHHHHHHHHH Confidence 999999999999998887788999999999999999999988732111 1111234567889999999 Q ss_pred HhhhhcCCCcEEEEcHHHHHHHHHhhcc----CCceeeccCCcceeecceeEeecCCCCCCceeEEeecccEEEEEecce Q lcl|Aclame:pro 185 LLEDDELEANAFISKTQNRSLLRKIVDP----ETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLI 260 (324) Q Consensus 185 ~l~~~~~~~~~~v~~~~~~~~l~~~~d~----~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~ 260 (324) ++.+......+++|||.++..|++...- .+..+...+..++++|++|++....+ ++++.++.....+.++...++ T Consensus 148 ~lgd~~~~~~~i~vhs~~~~~Lrk~~~~~~~~~~~~~~~~G~ig~~~G~~Viv~s~~~-~~~~~~l~~~gAi~~~~~~~~ 226 (270) T protein:vir:95 148 VFNSENDEDYVLYVNPKDYNKLVKSLFKVGGNVQDRAISKGDLVEIVGVSDIVKSKRV-SENTAFLQRYGAMEIVNKKKP 226 (270) T ss_pred HhccccCCCcEEEEcHHHHHHHHhhhcccccccccchhcccccceecceeEEEeCCCC-CceeEEEEeccceeeeecCCc Confidence 9999888889999999999999864311 23334455678899999998865543 455555443444557777788 Q ss_pred EEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCC Q lcl|Aclame:pro 261 EYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSV 320 (324) Q Consensus 261 ~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~ 320 (324) .+|.+|+.. +....+++..++++.+.++..+++++-++++++.- T Consensus 227 ~vEtdRd~~----------------~~~d~i~~~~~y~v~~~~~skvv~~t~~~a~~~~~ 270 (270) T protein:vir:95 227 EAYTDFDIL----------------KRTHLLSTNYHYSVNLKDETGVVKVTFKPSGSLEM 270 (270) T ss_pred eeeeccchh----------------hcccEEEeeeEEEEEEEccceEEEEEecCCCCcCC Confidence 888887653 23346788899999999999999999988887765 No 117 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=99.84 E-value=4.9e-22 Score=137.42 Aligned_cols=263 Identities=14% Similarity=0.084 Sum_probs=199.8 Q ss_pred hccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceee----cCCCceEEEEEeCCcceeeeccCcccccccccee Q lcl|Aclame:pro 24 FNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEP----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWV 99 (324) Q Consensus 24 ~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~----~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~ 99 (324) +. + ..+.-+.-++|+.|...+.+.+.+...+.+++..-. ..+..++||++...+.+..+.||+.++..+.+.+ T Consensus 1 m~--~-~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~ 77 (274) T protein:vir:95 1 MA--Q-GMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDILETK 77 (274) T ss_pred CC--c-ceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccchhhcccc Confidence 11 1 122334557799999999998888877777764322 2467899999987677888999999999999999 Q ss_pred eEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhhhHH Q lcl|Aclame:pro 100 NATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNI 179 (324) Q Consensus 100 ~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i 179 (324) ..++..++++..+.++++....+..|+.+.+.++++.++++++|+.++.--.+. ......+.++++.+ T Consensus 78 ~~~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a------------~~~~~~~~~~~d~i 145 (274) T protein:vir:95 78 KREAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSA------------KLTVEADITKLTGL 145 (274) T ss_pred eeEEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcc------------cccccccccCHHHH Confidence 999999999999999999988888899999999999999999999888533221 11122345678999 Q ss_pred HHHHHHhhhhcCCCcEEEEcHHHHHHHHHhh------cc-CCceeeccCCcceeecceeEeecCCCCCCceeEEeecccE Q lcl|Aclame:pro 180 IDLEALLEDDELEANAFISKTQNRSLLRKIV------DP-ETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKL 252 (324) Q Consensus 180 ~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~------d~-~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~ 252 (324) .++..++.+.......++|||..+..|++.. ++ .|..++..+..++++|++|+++...+ .++.++.....+ T Consensus 146 ~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~--~~t~~l~~~gA~ 223 (274) T protein:vir:95 146 QTAIDKFNDEDLEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGAVIVRSNKLE--AGTAILAKKGAV 223 (274) T ss_pred HHHHHHhccccccccEEEeCHHHHHHHHhhccccccccccccccceeccccceecCeEEEEeCCCC--CceEEEEeccce Confidence 9999999887777788999999999998642 12 24556667788899999999987644 445443333445 Q ss_pred EEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 253 IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) Q Consensus 253 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~~~~~ 324 (324) .++...++.+|.+|+.. +....+++..++++++.+|+++++++..+|.= || T Consensus 224 ~~~~~~~~~vE~~Rd~~----------------~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~~~-----~~ 274 (274) T protein:vir:95 224 KLITKRDFFLETDRDPS----------------TKTTALYSDKHYVAYLYDESKAVKITKGSGSL-----EM 274 (274) T ss_pred eeeecCCcccccccccc----------------cccCEEEEeEEEEEEEEcCCcEEEEEcCCccc-----cC Confidence 56667778888777542 23456889999999999999999999777663 33 No 118 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=99.84 E-value=4.9e-22 Score=137.42 Aligned_cols=263 Identities=14% Similarity=0.084 Sum_probs=199.8 Q ss_pred hccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceee----cCCCceEEEEEeCCcceeeeccCcccccccccee Q lcl|Aclame:pro 24 FNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEP----MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWV 99 (324) Q Consensus 24 ~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~----~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~ 99 (324) +. + ..+.-+.-++|+.|...+.+.+.+...+.+++..-. ..+..++||++...+.+..+.||+.++..+.+.+ T Consensus 1 m~--~-~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~ 77 (274) T protein:vir:96 1 MA--Q-GMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAKVVAEGEKIPTDILETK 77 (274) T ss_pred CC--c-ceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecCCCccccccCCCccchhhcccc Confidence 11 1 122334557799999999998888877777764322 2467899999987677888999999999999999 Q ss_pred eEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhhhHH Q lcl|Aclame:pro 100 NATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNI 179 (324) Q Consensus 100 ~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i 179 (324) ..++..++++..+.++++....+..|+.+.+.++++.++++++|+.++.--.+. ......+.++++.+ T Consensus 78 ~~~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a------------~~~~~~~~~~~d~i 145 (274) T protein:vir:96 78 KREAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSA------------KLTVEADITKLTGL 145 (274) T ss_pred eeEEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcc------------cccccccccCHHHH Confidence 999999999999999999988888899999999999999999999888533221 11122345678999 Q ss_pred HHHHHHhhhhcCCCcEEEEcHHHHHHHHHhh------cc-CCceeeccCCcceeecceeEeecCCCCCCceeEEeecccE Q lcl|Aclame:pro 180 IDLEALLEDDELEANAFISKTQNRSLLRKIV------DP-ETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKL 252 (324) Q Consensus 180 ~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~------d~-~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~ 252 (324) .++..++.+.......++|||..+..|++.. ++ .|..++..+..++++|++|+++...+ .++.++.....+ T Consensus 146 ~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~--~~t~~l~~~gA~ 223 (274) T protein:vir:96 146 QTAIDKFNDEDLEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGAVIVRSNKLE--AGTAILAKKGAV 223 (274) T ss_pred HHHHHHhccccccccEEEeCHHHHHHHHhhccccccccccccccceeccccceecCeEEEEeCCCC--CceEEEEeccce Confidence 9999999887777788999999999998642 12 24556667788899999999987644 445443333445 Q ss_pred EEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 253 IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) Q Consensus 253 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~~~~~ 324 (324) .++...++.+|.+|+.. +....+++..++++++.+|+++++++..+|.= || T Consensus 224 ~~~~~~~~~vE~~Rd~~----------------~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~~~-----~~ 274 (274) T protein:vir:96 224 KLITKRDFFLETDRDPS----------------TKTTALYSDKHYVAYLYDESKAVKITKGSGSL-----EM 274 (274) T ss_pred eeeecCCcccccccccc----------------cccCEEEEeEEEEEEEEcCCcEEEEEcCCccc-----cC Confidence 56667778888777542 23456889999999999999999999777663 33 No 119 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=99.84 E-value=5.1e-22 Score=137.33 Aligned_cols=263 Identities=14% Similarity=0.093 Sum_probs=201.4 Q ss_pred hccccccccccCccccchHHHHHHHHHHHhhhhhhhhccee---e-cCCCceEEEEEeCCcceeeeccCcccccccccee Q lcl|Aclame:pro 24 FNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYE---P-MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWV 99 (324) Q Consensus 24 ~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~---~-~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~ 99 (324) +. +. .+.-+.-++|+.|...+.+.+.+...+.+++..- . ..+..++||.+...+.+..+.||+.++..+.+.+ T Consensus 1 ma--~~-~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~ 77 (274) T protein:vir:12 1 MA--QG-LTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTDILETK 77 (274) T ss_pred CC--cc-eeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccchhhcccc Confidence 11 11 2233556789999999999888887777776542 2 2467899999987678889999999999999999 Q ss_pred eEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhhhHH Q lcl|Aclame:pro 100 NATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNI 179 (324) Q Consensus 100 ~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i 179 (324) +.++..++.+..+.++++....+..|+.+.+.++++.++++++|+.++.--.+. ........++++.+ T Consensus 78 ~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a------------~~~~~~~a~~~d~i 145 (274) T protein:vir:12 78 KREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA------------KLTVNADITKLNGL 145 (274) T ss_pred eeeEEeeeecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhcc------------cccccccccCHHHH Confidence 999999999999999999888888899999999999999999999988543221 11122345678999 Q ss_pred HHHHHHhhhhcCCCcEEEEcHHHHHHHHHhh------ccC-CceeeccCCcceeecceeEeecCCCCCCceeEEeecccE Q lcl|Aclame:pro 180 IDLEALLEDDELEANAFISKTQNRSLLRKIV------DPE-TKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKL 252 (324) Q Consensus 180 ~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~------d~~-g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~ 252 (324) .++..++.+.......++|||..+..|++.. +++ |..++..+..++++|++|+++...+ .++.++.....+ T Consensus 146 ~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~p--~~t~~l~~~gA~ 223 (274) T protein:vir:12 146 QSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGAIIVRSNKLE--AGTAILAKKGAV 223 (274) T ss_pred HHHHHHhccccccccEEEeCHHHHHHHHhhhhhhccccccccccceecccceeecCeeEEEeCCCC--cceEEEEeccce Confidence 9999999887777788999999999987642 222 4456667778899999999987654 445444333445 Q ss_pred EEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCC Q lcl|Aclame:pro 253 IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDS 319 (324) Q Consensus 253 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~ 319 (324) .++...++.+|.+|+.. +....+++..++++++.+|+.+++++.+.|..-- T Consensus 224 ~~~~~~~~~vE~~Rd~~----------------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:12 224 KLILKRDFFLEVARDAS----------------TKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred eeeecCCceeccccchh----------------hcccEEEeeeEEEEEEEcCCceEEEEcCCccccC Confidence 56667788888877652 2234688999999999999999999987777533 No 120 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.75 E-value=7.5e-20 Score=125.43 Aligned_cols=223 Identities=13% Similarity=0.087 Sum_probs=173.0 Q ss_pred cceeecCCCceEEEEEeCCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 61 GKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYK 140 (324) Q Consensus 61 ~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~ 140 (324) -.-++ .+.++++|.+ ...+.-++||++++..+.++++.+.+.++++..+.|+++....+..|..+...++++.+|+. T Consensus 1 ~~~~~-~Gdtit~P~~--iGda~~v~eG~~i~~~~l~~t~~~atIk~~gk~~~itD~a~l~~~gDp~~ea~~Q~~~~iA~ 77 (231) T protein:vir:73 1 ENGIN-LANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLAN 77 (231) T ss_pred Ccccc-CCceEEeccc--ccchhhhcCCCcCChhhccccceeeeEeeeccceeeeHHHHhhccCchHHHHHHHHHHHHHH Confidence 11122 2466889976 55778999999999999999999999999999999999999888889999999999999999 Q ss_pred HHHHHHHhccCccccccccccccccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhc------cCC Q lcl|Aclame:pro 141 KFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVD------PET 214 (324) Q Consensus 141 ~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d------~~g 214 (324) ++|+.++.-... .....++.++++.|.+++..+.+....+..++|||+.+..|++..+ .-| T Consensus 78 kvD~di~~~~~~-------------a~l~~~~~~t~d~i~~A~~~fgde~~~~~vivv~p~~~~~Lrk~~~~~~~~~~~g 144 (231) T protein:vir:73 78 KVDDDLLKAAKT-------------TSQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVG 144 (231) T ss_pred hhhHHHHHhhcc-------------ccccccccccHHHHHHHHHHhccccccceEEEEcchHHHhhhhccchhhhhhhhc Confidence 999998842211 1112234578999999999999888778889999999999988443 235 Q ss_pred ceeeccCCcceeecceeEeecCCCCCCcee--EEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEE Q lcl|Aclame:pro 215 KERIYDRNSDSLDGLPVVNLKSSNLKRGEL--ITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALR 292 (324) Q Consensus 215 ~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~--i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r 292 (324) ..++..+..++++|++|+++...+...+.. +++-...+.+...+++.++.+|+.. .....++ T Consensus 145 ~~i~~~G~iG~i~G~~Vi~S~~~~~~~~~~~~~i~~~gAl~~~~k~~~~vEtdRd~~----------------~k~~~i~ 208 (231) T protein:vir:73 145 ANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIV----------------TKTTVIT 208 (231) T ss_pred cceeeecccceEcceEEEEcCCCCCCceeeeeEEeeccceeeeecccceeecccccc----------------ccccEEE Confidence 667778888999999999987665433321 1222234456778888888877642 3345688 Q ss_pred EEEEeccEEeccCceEEEEeecC Q lcl|Aclame:pro 293 ATMHVALHIADDKAFAKLVPADK 315 (324) Q Consensus 293 ~~~r~d~~v~~~~A~~~l~~~~~ 315 (324) +..+++..+.+++.+++++.+-. T Consensus 209 ~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 209 ADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred EeEEEEEEEEcCccEEEEEeecC Confidence 99999999999999999987554 No 121 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=99.72 E-value=1.9e-18 Score=117.69 Aligned_cols=274 Identities=11% Similarity=0.063 Sum_probs=189.5 Q ss_pred ccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccC-----ccccccccceeeE Q lcl|Aclame:pro 27 DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEG-----QKIETSKATWVNA 101 (324) Q Consensus 27 ~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg-----~~~~~~~~~~~~v 101 (324) ....+-..++.+.+..+...|||.+.+.|.|++..+..++.++.+.+.+....+.+.+.+.+ +..+++..+|+++ T Consensus 1 mpaltLaea~k~~~d~l~~~ViE~~~~~s~lL~~LpF~~veg~~~~ynR~~~~~~~~~~~v~~~~~~~g~~~~~~t~~~~ 80 (310) T protein:vir:97 1 MASVTLAESAKLAQDELVAGVIENIITVNRMFDVLPFDSIEGNSLAYNRENVLGDVIMAGVGTTFSGAGAGKAAATFTKV 80 (310) T ss_pred CcccchHHHhhcCcchHHHHHHHHHhccchHHHhCCcccccCCcceeeEeeccCCcccccccccccCCCcccccccccee Confidence 22223334556778899999999999999999999988888888999888766665554333 4456788899999 Q ss_pred EeeheeeEEeeeehHHHhhc--C-hHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccccccccccccc----ccchh Q lcl|Aclame:pro 102 TMRAFKLGVILPVTKEFLNY--T-YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKV----IKGDF 174 (324) Q Consensus 102 ~l~~~k~~~~~~iS~e~l~d--s-~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~----~~~~~ 174 (324) +...+-+++.+.|.+.+.+- + ..+...+-.+...+++.++.++.+|||+.++.+..|+.......+.. ..+.+ T Consensus 81 ~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n~F~GL~~~~~~~q~i~~~~~gg~~ 160 (310) T protein:vir:97 81 NSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGAGNEFAGLIQLCASGQKATTGATGSAI 160 (310) T ss_pred eeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccCCCcccchhhcCCccceeecCCCCCCC Confidence 99999999999999876542 3 44555666788889999999999999998665555665543322222 23667 Q ss_pred hhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhh-ccCCceee------ccCCcceeecceeEeecCCCCC------- Q lcl|Aclame:pro 175 TQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIV-DPETKERI------YDRNSDSLDGLPVVNLKSSNLK------- 240 (324) Q Consensus 175 ~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~-d~~g~~~~------~~~~~~~l~G~pv~~~~~~~~~------- 240 (324) +.+++-.++..+....+.++.|+|||+++.+++.+. ...++.++ .+....++.|+|++.+...+.. T Consensus 161 t~d~LDeLl~~v~~~~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~~~~G~~v~~~~GiPi~~~d~ip~~~~~~~~~ 240 (310) T protein:vir:97 161 SFAILDELMDLVVDKDGQVDYLTMHARTLRSYKALLRALGGASINEVVELPSGAEVPAYSGTPIFRNDYIPTNQTKGGTT 240 (310) T ss_pred CHHHHHHHHHHHhcCCCCCCEEEecHHHHHHHHHHHHHhcCCCCCCccccCCCCEEeeeCCeEEEEeCccCCCccccccC Confidence 889999999999877788899999999877776443 33333322 2334467899999887654432 Q ss_pred -CceeEEeeccc-----EEEEEe----cceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEE Q lcl|Aclame:pro 241 -RGELITGDFDK-----LIYGIP----QLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKL 310 (324) Q Consensus 241 -~~~~i~gd~s~-----~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l 310 (324) ...|++.-+.. -+.|.. .++.+....+ +=.++....|+++||+.++..|+|+++| T Consensus 241 gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~G~---------------~~~~~v~~~~V~~Y~~~av~~~~A~a~L 305 (310) T protein:vir:97 241 GCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDVGE---------------SEDSDEHIWRVKWYCGLALFSEKGLACA 305 (310) T ss_pred CceeEEEEeeCccccccceeccccCCccceeEEeCCc---------------ccCCcceeEEEEEeeeEEEecccceeee Confidence 22344332221 122221 1122221100 0134567789999999999999999999 Q ss_pred EeecC Q lcl|Aclame:pro 311 VPADK 315 (324) Q Consensus 311 ~~~~~ 315 (324) ++..- T Consensus 306 ~~V~~ 310 (310) T protein:vir:97 306 DGITN 310 (310) T ss_pred ccccC Confidence 98776 No 122 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=99.66 E-value=6.4e-17 Score=109.36 Aligned_cols=298 Identities=13% Similarity=0.084 Sum_probs=190.2 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCc Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~ 80 (324) |-.+-.+...+..+ ..++.+. +.+.+.-++++++++....+++.+.+.+++++.++++++.+...+|++..-+. T Consensus 1 ~~~~~~~~~~~n~~-~~~i~k~-----~it~~~l~~g~L~p~~a~~Fl~~v~~~t~iL~~~r~~~~~s~~~ei~kig~G~ 74 (360) T protein:vir:99 1 MSSNSTIDSVRNQN-MNSLSQK-----DIGLAELDGFQLPVDVTEEFLERMQKGVQILGMADTMTLARLEMEVPQFGVPR 74 (360) T ss_pred CcchhHHHHHhhhH-HHHHHhh-----hccccccCceeecHHHHHHHHHHHhhccchhhhcceeecccccccccccccce Confidence 54432222111111 1122221 22333346799999999999999999999999999999999988888765433 Q ss_pred ceee-eccCcccc-ccccceeeEEe-eheeeEEeeeehHHHhhcCh----HHHHHHHHHHHHHHHHHHHHHHHHhccCcc Q lcl|Aclame:pro 81 GAYW-VGEGQKIE-TSKATWVNATM-RAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNN 153 (324) Q Consensus 81 ~a~~-v~Eg~~~~-~~~~~~~~v~l-~~~k~~~~~~iS~e~l~ds~----~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~ 153 (324) ...- -.|+...+ ....+...+.+ ..+++-....++.+.++++. .++++.|++.++++++.-++.-.++|+..- T Consensus 75 r~~r~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~g~~ds 154 (360) T protein:vir:99 75 LSGHTRDEEGSRTENSEAESGSVKFNATDKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIRAGASS 154 (360) T ss_pred eeccccccCCCCCcCCcCccccCccccccceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhhccchh Confidence 2211 22322222 23344444555 34566666778888877753 356799999999999999999999987542 Q ss_pred c---------cc-----ccccccccccccc-------------------------c--cc-----hhhhhHHHHHHHHhh Q lcl|Aclame:pro 154 P---------FG-----KSIAQSIEKTNKV-------------------------I--KG-----DFTQDNIIDLEALLE 187 (324) Q Consensus 154 ~---------~~-----~~~~~~~~~~~~~-------------------------~--~~-----~~~~~~i~~~~~~l~ 187 (324) . +. +.+..+....+.. + .+ ..+-.-+.+++..|+ T Consensus 155 ~d~~~~~~~d~fl~~~dGwlKka~~~~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~~~~~~Lp 234 (360) T protein:vir:99 155 GNLQSIGGAAELDNTFKGWIARAEGDAQSVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLFNETIQTLD 234 (360) T ss_pred cccccCcccchhhhhhHHHHHHhhcccchhhccccccccccccccccccccchhhhccccccccccchHHHHHHHHHhcc Confidence 1 00 0111110000000 0 00 012234678999999 Q ss_pred hhcCCC----cEEEEcHHHHHHHHH-hhccC---CceeeccCCcceeecceeEeecCCCCCCceeEEeecccEEEEEecc Q lcl|Aclame:pro 188 DDELEA----NAFISKTQNRSLLRK-IVDPE---TKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL 259 (324) Q Consensus 188 ~~~~~~----~~~v~~~~~~~~l~~-~~d~~---g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~ 259 (324) +.|+.. -.|+||+......+. +.+-+ |...+.++..-...|+|++..+. .+++.+++.+++++++|.+.+ T Consensus 235 ~kyr~~~~~~~~~~~s~~~~~~yr~~L~~R~t~LGd~~l~g~~~~~~~Gipi~~v~~--~pd~~~mlT~p~NLi~g~~~~ 312 (360) T protein:vir:99 235 SRYRESDAYSPVLMTSPNQVQSYTMSLTEREDPLGSAVIFGDSDITPFSYDLVGVNG--FPDEYMMFTDPNNLAFGLYEE 312 (360) T ss_pred hhhhcCcccceEEEccCchHHHHHHHHhccCcccchhheecccccccceeeeEEcCC--CCCCceEEeccCceeEEeeee Confidence 998753 279999998666654 33333 44556666666789999998775 457789999999999999999 Q ss_pred eEEEEeeccceeccccccccchhhhhcC-cEEEEEEEEeccEEeccCceEEEEeecCCCC Q lcl|Aclame:pro 260 IEYKIDETAQLSTVKNEDGTPVNLFEQD-MVALRATMHVALHIADDKAFAKLVPADKRTD 318 (324) Q Consensus 260 ~~~~~~~~~~~~~~~~~~~~~~~~f~~~-~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~ 318 (324) ++++...+... +-++. .+..-...++|+.+.+++|++.+++...+.- T Consensus 313 iri~~~~e~~~------------~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~~~~~~~ 360 (360) T protein:vir:99 313 MELDQSTDTDK------------VHEQRLHSRNWLEGQFDFQIKEQQAGVLVTDLETPTA 360 (360) T ss_pred eEEeecccchh------------hhhhceeeeEEEEEEeeEEEEecccEEEEecCCCCCC Confidence 99987654311 01111 1333456789999999999999998766544 No 123 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=99.60 E-value=9.1e-17 Score=108.53 Aligned_cols=282 Identities=11% Similarity=0.021 Sum_probs=171.0 Q ss_pred hhccccccccccCccc-----c--chHHHHHHHHHHHhhhhhhhhcceeec-CCCceEEEEEeC---CcceeeeccCccc Q lcl|Aclame:pro 23 VFNPDNVMMHEKKDGT-----L--MNEFTTPILQEVMENSKIMQLGKYEPM-EGTEKKFTFWAD---KPGAYWVGEGQKI 91 (324) Q Consensus 23 ~~~~~~~~~~~~~~~~-----v--p~~~~~~i~~~~~~~s~l~~l~~~~~~-~~~~~~ip~~~~---~~~a~~v~Eg~~~ 91 (324) .-++.+..+..+++.+ + |+.+-+.+.+.+.+.-+.-.+++.+.. .++...+-.... ..++.-|+|++++ T Consensus 1 ~~~~~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~a~~~~~v~f~~~~p~~~~~d~e~VaEggEi 80 (318) T protein:vir:10 1 MTAPTGIVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNGGANPNGVVAYNEGNPSFLEDDVADVAEFGEI 80 (318) T ss_pred CCCCCcceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhhcccccccceeEEEecccccccCcHhhccCcccc Confidence 3344455555555442 2 555666677777666666667776643 344454433322 2467789999999 Q ss_pred cccccceeeEEe-eheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccc-ccc Q lcl|Aclame:pro 92 ETSKATWVNATM-RAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKT-NKV 169 (324) Q Consensus 92 ~~~~~~~~~v~l-~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~-~~~ 169 (324) |.+...++...+ ..+|.|..+.||+|+++.+..+..+-..++++..+.++.|+.++.---+...+.......... ... T Consensus 81 P~~~~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~~s~~w~~~~~~ 160 (318) T protein:vir:10 81 PVSAGARGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTLAVPTAWDNGGKV 160 (318) T ss_pred cccCCCCCchhhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCCcCCCCcccc Confidence 999999977766 558999999999999999999999999999999999999998775321111010000001000 000 Q ss_pred ccchhhh-hHHHHHHH--------Hh-hhhcCCCcEEEEcHHHHHHHHHh------hccCCceeec-----cCCcceeec Q lcl|Aclame:pro 170 IKGDFTQ-DNIIDLEA--------LL-EDDELEANAFISKTQNRSLLRKI------VDPETKERIY-----DRNSDSLDG 228 (324) Q Consensus 170 ~~~~~~~-~~i~~~~~--------~l-~~~~~~~~~~v~~~~~~~~l~~~------~d~~g~~~~~-----~~~~~~l~G 228 (324) ..+...+ +.+..+.. .. ...++.++.++|||..|..|++- ...++.+++. +.-+++++| T Consensus 161 ~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~~~~~tg~~~g~~lG 240 (318) T protein:vir:10 161 RTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENFMKVYERNANYVSTAPDWTGNFPGSVMG 240 (318) T ss_pred cccchhhhhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhhhhhhhccchhhhhcccccccccceeec Confidence 0111111 11111111 11 13356788999999999999543 3334444442 233567899 Q ss_pred ceeEeecCCCCCCceeEEeecccE-EEEEecceEEEEeeccceeccccccccchhhhh-cCcEEEEEEEEeccEEeccCc Q lcl|Aclame:pro 229 LPVVNLKSSNLKRGELITGDFDKL-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFE-QDMVALRATMHVALHIADDKA 306 (324) Q Consensus 229 ~pv~~~~~~~~~~~~~i~gd~s~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~-~~~v~~r~~~r~d~~v~~~~A 306 (324) +.|+.++. .+.+++++.+...+ ++++.++++.. .+..+++.+ .+. +....+|+.......|.+|+| T Consensus 241 l~vi~s~~--~p~~~alvlq~g~vG~~~d~~pl~~t--------~~~~egg~~--~g~~~~s~~~~~~~~~~~~V~~PkA 308 (318) T protein:vir:10 241 LNVIRSRT--FPIDRVLIMERGTVGFYSDTRPLQFT--------ALYPEGNGP--NGGPTESYRADASHKRALAVDQPKA 308 (318) T ss_pred eEEeecCc--cCCCeeEEEecCCcceeeccccceee--------ecccCCCCC--CCCcchhhheehheeeeeeeeCcce Confidence 99998775 45677777775433 23333333332 222222221 122 223457788888999999999 Q ss_pred eEEEEeecCC Q lcl|Aclame:pro 307 FAKLVPADKR 316 (324) Q Consensus 307 ~~~l~~~~~~ 316 (324) +++||+.-.+ T Consensus 309 ~~~itgi~~~ 318 (318) T protein:vir:10 309 ALWLTGIVTP 318 (318) T ss_pred eEEEeeccCC Confidence 9999986555 No 124 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=99.55 E-value=1.4e-15 Score=102.01 Aligned_cols=258 Identities=10% Similarity=-0.011 Sum_probs=167.7 Q ss_pred ccccCccccchHHHHHHHHHHHhhhhhhhhcce----eecCCCceEEEEEeCCcceeeeccCccccccccceeeEEeehe Q lcl|Aclame:pro 31 MHEKKDGTLMNEFTTPILQEVMENSKIMQLGKY----EPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAF 106 (324) Q Consensus 31 ~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~----~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~ 106 (324) ++. ..++|+.|...+++.+++.+++.+++.. +...+.+++||+......+....++..++....+.+.++++.. T Consensus 1 MA~--~~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid 78 (273) T protein:vir:79 1 MAF--NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLID 78 (273) T ss_pred Ccc--hhhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecCcccccccccCCCccCccccccceEEEEEe Confidence 222 2368999999999999999998888633 2234668999998766666788889888888888888888886 Q ss_pred e-eEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhhhHHHHHHHH Q lcl|Aclame:pro 107 K-LGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEAL 185 (324) Q Consensus 107 k-~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 185 (324) + .+..+.|++.-...+..++.+ +.+++..+++.++|+.++.--.... .........+....++.|.++..+ T Consensus 79 ~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~ala~~vD~~i~~~~~~a~-------~~~~~~~~~~~~~~~~~i~~a~~~ 150 (273) T protein:vir:79 79 QEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNG-------TALTGSAPSDADDAFDLIASALKE 150 (273) T ss_pred eecccceeeccHHHHhhcccHHH-HHHHHHHHHHHHHHHHHHHHHhhcc-------cccccccccchhhHHHHHHHHHHH Confidence 6 466677887444455678877 6677889999999987663111000 000111112223456788899999 Q ss_pred hhhhcCC--CcEEEEcHHHHHHHHHhh----c--cCC-ceeeccCCcceeecceeEeecCCCCCCce-eEEeecccEEEE Q lcl|Aclame:pro 186 LEDDELE--ANAFISKTQNRSLLRKIV----D--PET-KERIYDRNSDSLDGLPVVNLKSSNLKRGE-LITGDFDKLIYG 255 (324) Q Consensus 186 l~~~~~~--~~~~v~~~~~~~~l~~~~----d--~~g-~~~~~~~~~~~l~G~pv~~~~~~~~~~~~-~i~gd~s~~~~~ 255 (324) +.+...+ ...++++|..+..|.+.. . ..| ...+..+..++++|++|+.+...+...+. .+.+-.+.+... T Consensus 151 ld~~~vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~i~~s~~lp~~~~~~~~a~~~~A~~~a 230 (273) T protein:vir:79 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV 230 (273) T ss_pred hhhccCCccCcEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEeeEEeceEEEecccccccCceEEEEEeccceeee Confidence 9877763 347899999999886532 1 122 23455677789999999988766554432 333322222221 Q ss_pred EecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecC Q lcl|Aclame:pro 256 IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) Q Consensus 256 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~ 315 (324) . +...++..+ ++ ..| ...+++.+++|+.++||++++.|+.... T Consensus 231 ~-~~~~~e~~r--------~~-----~~~---~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 231 S-QIDTVEALR--------DQ-----DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred e-ehhhhhccc--------Cc-----ccc---eeeeeeeeeeeeEEecCceEEEEeccCC Confidence 1 111222221 11 112 3457899999999999999998875444 No 125 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=99.54 E-value=4.2e-16 Score=104.87 Aligned_cols=280 Identities=12% Similarity=0.124 Sum_probs=178.5 Q ss_pred Cchh---HHHH---HHHHHHHh------------------------------------hhhhHH---hh-cccccccccc Q lcl|Aclame:pro 1 MEQT---QKLK---LNLQHFAS------------------------------------NNVKPQ---VF-NPDNVMMHEK 34 (324) Q Consensus 1 ~~~~---~~~k---~~~~~~a~------------------------------------~~~~~~---~~-~~~~~~~~~~ 34 (324) ||++ .++. .+.|++.. +..+.+ ++ ++-....+.+ T Consensus 59 en~~e~~~~~~~~~~E~Rs~~~~i~~~~~~~r~~p~~~~veyRSaGE~lkal~~~~~Gd~~A~~~~e~~r~a~~~~~Tgd 138 (410) T protein:vir:83 59 KNQMEQAQEVNRIAFETRSKGQAVDAAISAMRGSPVGTEVEYRSAGEYMLDMWNSAQGNASAADRLEVYARAADHQKTGD 138 (410) T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHhhhccCcCCCCCCCcccccHHHHHHHHhccCCchHHHHHHHHHHHHhhccCcccc Confidence 2222 2211 11222210 111111 11 1112223334 Q ss_pred CccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcce-------eeeccCccccccccceeeEEeehee Q lcl|Aclame:pro 35 KDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGA-------YWVGEGQKIETSKATWVNATMRAFK 107 (324) Q Consensus 35 ~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a-------~~v~Eg~~~~~~~~~~~~v~l~~~k 107 (324) ..++||++++.+.++.+.+..++.+++...|..+.+++||+.+..+.. ..-.||...+..+.+|+..+...++ T Consensus 139 ~~~~i~~~~v~d~i~li~q~r~i~slf~tLP~~g~T~eY~v~t~~~tV~~q~~~~kqa~EGd~L~~gKl~~~t~tA~ikT 218 (410) T protein:vir:83 139 LQGVIPDPIVGPVIDFIDSARPLVSTLGTLPLNNATFYRPIVSQRPAVGLQGVAGGASDEKTELDSQKMVIDRLTVNAKT 218 (410) T ss_pred cccccchhHhhhHHHHHhhccchhhhhhhCCCCCCeeEEeeecccccccccccccccccccccccccceeeeeccceeeh Confidence 456789999999999999999999999999999999999988765543 3356899999999999999999999 Q ss_pred eEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHH---HHhccCccccccccccccccccccccchhhhhHH----H Q lcl|Aclame:pro 108 LGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEA---GILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNI----I 180 (324) Q Consensus 108 ~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~---~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i----~ 180 (324) +|++..+||+.|+.|.+.+.+...+.|..+++.+-+.+ +|+++-+. ....+..+.+++ . T Consensus 219 yGGyt~LSRQ~IERs~v~~L~~~lraL~~AYA~atea~vra~L~~t~t~--------------~~a~~~~Tad~~~~~i~ 284 (410) T protein:vir:83 219 LGGYVNVSRQAIDFSSPSALDLVVNGLGQQYAIETEALVGAALASTSTG--------------AVGYGNATADNVASAIW 284 (410) T ss_pred hcCcccccceeeecCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh--------------hhhhhhccHHHHHHHHH Confidence 99999999999999999999999999999988887754 34332111 011122233333 3 Q ss_pred HHHHHhhhh--cCCCcEEEEcHHHHHHHHHhh-c-------cCCceee--ccCCcceeecceeEeecCCCCCCceeEEee Q lcl|Aclame:pro 181 DLEALLEDD--ELEANAFISKTQNRSLLRKIV-D-------PETKERI--YDRNSDSLDGLPVVNLKSSNLKRGELITGD 248 (324) Q Consensus 181 ~~~~~l~~~--~~~~~~~v~~~~~~~~l~~~~-d-------~~g~~~~--~~~~~~~l~G~pv~~~~~~~~~~~~~i~gd 248 (324) ++...+.++ +..-..+.++|+++..+.++. + ..|..+. -.+-.+.+++.||+..+. .++++++|-| T Consensus 285 da~~~v~da~~~~~~~~i~vS~DVl~~~~~~f~~~~~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm~~~--a~AgTA~f~~ 362 (410) T protein:vir:83 285 QAAGAVYTAVKGMGRLVIAIAPDVLGDFGPLFAPVNPTNAHSTGFEAGRFGQGVMGSISGIPVVMSAA--LGSGDAYLFS 362 (410) T ss_pred HHHHHHhhhhccceeeeEEechhhhhhccceeeccCCCCcccccccccccccchhhhhcccceEEecC--CCcCeeeEec Confidence 444555555 444456899999976665432 2 2221111 123456789999998764 6677888888 Q ss_pred cccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEee Q lcl|Aclame:pro 249 FDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPA 313 (324) Q Consensus 249 ~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~ 313 (324) .+.+..-..++-.+...++.. .+| +.=.+ .|+.+.+..++++.=|.+. T Consensus 363 ~~Ai~~~eS~~gp~qL~d~~i-----------~nL-----t~~yS-gY~a~a~~~~~gliPv~g~ 410 (410) T protein:vir:83 363 TAAIECFEQRVGTLQVVEPSV-----------FGL-----QVAYA-GYFSTLVVNEDAIVPLVGS 410 (410) T ss_pred cceeeeeecCCceeEeeCCch-----------hhh-----hhhhe-eeeeeccccccceeeeccC Confidence 665544443322233322221 111 11111 6788889999999888776 No 126 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=99.53 E-value=4.4e-15 Score=99.30 Aligned_cols=258 Identities=10% Similarity=-0.017 Sum_probs=166.7 Q ss_pred ccccCccccchHHHHHHHHHHHhhhhhhhhcce-e---ecCCCceEEEEEeCCcceeeeccCccccccccceeeEEeehe Q lcl|Aclame:pro 31 MHEKKDGTLMNEFTTPILQEVMENSKIMQLGKY-E---PMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAF 106 (324) Q Consensus 31 ~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~-~---~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~ 106 (324) ++. ..++|+.|...+++.+.+.+++.+++.. . ...+.+++||+...........++..++....+.+.++++.. T Consensus 1 MA~--~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid 78 (273) T protein:vir:10 1 MAF--NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLID 78 (273) T ss_pred Ccc--hhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCccccccceEEEEEe Confidence 222 2467999999999999999988888643 1 234568999998766666677888877777777777777775 Q ss_pred e-eEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhhhHHHHHHHH Q lcl|Aclame:pro 107 K-LGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEAL 185 (324) Q Consensus 107 k-~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 185 (324) + .+..+.|++.-...+..++++ +.+++.++++.++|..++.=-..... ........+....++.|.++..+ T Consensus 79 ~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~-------~~~~~~~~~~~~~~~~i~~a~~~ 150 (273) T protein:vir:10 79 QEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGT-------ALTGSAPTDADDAFDLIAKALKE 150 (273) T ss_pred eeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhcccc-------ccccccccchhHHHHHHHHHHHH Confidence 5 356667777444445667877 67778999999999987631110000 00111112233457889999999 Q ss_pred hhhhcCC--CcEEEEcHHHHHHHHHhh----cc--CC-ceeeccCCcceeecceeEeecCCCCCCc-eeEEeecccEEEE Q lcl|Aclame:pro 186 LEDDELE--ANAFISKTQNRSLLRKIV----DP--ET-KERIYDRNSDSLDGLPVVNLKSSNLKRG-ELITGDFDKLIYG 255 (324) Q Consensus 186 l~~~~~~--~~~~v~~~~~~~~l~~~~----d~--~g-~~~~~~~~~~~l~G~pv~~~~~~~~~~~-~~i~gd~s~~~~~ 255 (324) |.+...+ +..++++|..+..|.+.. +. .| ...+..+..+++.|++|+.+...+...+ +.+++-.+.+... T Consensus 151 ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~~~A~~~a 230 (273) T protein:vir:10 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV 230 (273) T ss_pred hhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCCccEEEEEeccceeee Confidence 9887764 346899999999886532 21 12 2345567778999999999876654433 3444443333222 Q ss_pred EecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecC Q lcl|Aclame:pro 256 IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) Q Consensus 256 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~ 315 (324) . +...++..+ ++ ..| ...+++...+|+.++||++++.|+.... T Consensus 231 ~-q~~~~e~~r--------~~-----~~~---~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 231 S-QIDTVEALR--------DQ-----DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred e-eeehhhccc--------CC-----Ccc---eeeeeeeeeeeeeEeccceEEEEeccCC Confidence 1 111222211 11 113 2357889999999999999998875443 No 127 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=99.53 E-value=4.4e-15 Score=99.30 Aligned_cols=258 Identities=10% Similarity=-0.017 Sum_probs=166.7 Q ss_pred ccccCccccchHHHHHHHHHHHhhhhhhhhcce-e---ecCCCceEEEEEeCCcceeeeccCccccccccceeeEEeehe Q lcl|Aclame:pro 31 MHEKKDGTLMNEFTTPILQEVMENSKIMQLGKY-E---PMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAF 106 (324) Q Consensus 31 ~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~-~---~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~ 106 (324) ++. ..++|+.|...+++.+.+.+++.+++.. . ...+.+++||+...........++..++....+.+.++++.. T Consensus 1 MA~--~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~tid 78 (273) T protein:vir:10 1 MAF--NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDLLID 78 (273) T ss_pred Ccc--hhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecccccccccccCCCccCccccccceEEEEEe Confidence 222 2467999999999999999988888643 1 234568999998766666677888877777777777777775 Q ss_pred e-eEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhhhHHHHHHHH Q lcl|Aclame:pro 107 K-LGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEAL 185 (324) Q Consensus 107 k-~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 185 (324) + .+..+.|++.-...+..++++ +.+++.++++.++|..++.=-..... ........+....++.|.++..+ T Consensus 79 ~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~-------~~~~~~~~~~~~~~~~i~~a~~~ 150 (273) T protein:vir:10 79 QEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGT-------ALTGSAPTDADDAFDLIAKALKE 150 (273) T ss_pred eeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhcccc-------ccccccccchhHHHHHHHHHHHH Confidence 5 356667777444445667877 67778999999999987631110000 00111112233457889999999 Q ss_pred hhhhcCC--CcEEEEcHHHHHHHHHhh----cc--CC-ceeeccCCcceeecceeEeecCCCCCCc-eeEEeecccEEEE Q lcl|Aclame:pro 186 LEDDELE--ANAFISKTQNRSLLRKIV----DP--ET-KERIYDRNSDSLDGLPVVNLKSSNLKRG-ELITGDFDKLIYG 255 (324) Q Consensus 186 l~~~~~~--~~~~v~~~~~~~~l~~~~----d~--~g-~~~~~~~~~~~l~G~pv~~~~~~~~~~~-~~i~gd~s~~~~~ 255 (324) |.+...+ +..++++|..+..|.+.. +. .| ...+..+..+++.|++|+.+...+...+ +.+++-.+.+... T Consensus 151 ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~~~A~~~a 230 (273) T protein:vir:10 151 LTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPSAAAYV 230 (273) T ss_pred hhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCCccEEEEEeccceeee Confidence 9887764 346899999999886532 21 12 2345567778999999999876654433 3444443333222 Q ss_pred EecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecC Q lcl|Aclame:pro 256 IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) Q Consensus 256 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~ 315 (324) . +...++..+ ++ ..| ...+++...+|+.++||++++.|+.... T Consensus 231 ~-q~~~~e~~r--------~~-----~~~---~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 231 S-QIDTVEALR--------DQ-----DSF---SDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred e-eeehhhccc--------CC-----Ccc---eeeeeeeeeeeeeEeccceEEEEeccCC Confidence 1 111222211 11 113 2357889999999999999998875443 No 128 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=99.51 E-value=3.1e-15 Score=100.12 Aligned_cols=293 Identities=8% Similarity=-0.023 Sum_probs=171.2 Q ss_pred hccccccc-----cccCccccchHHHHHHHHHHHhhhhhhhhcceee---cCCCceEEEEEeCCcceeeeccCccccccc Q lcl|Aclame:pro 24 FNPDNVMM-----HEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEP---MEGTEKKFTFWADKPGAYWVGEGQKIETSK 95 (324) Q Consensus 24 ~~~~~~~~-----~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~---~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~ 95 (324) +.-.|+.+ +..-..+||+.|..++++.+.+..++.++++..+ ..+.+++||+.. .+.+.-..++..++... T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~g-~~~~~d~~~~~~i~~~~ 79 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPRIS-ELGVEDKATDVPVGVQP 79 (341) T ss_pred CcchhhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEeccC-cceeeeecCCCcccccc Confidence 22122221 1223347899999999999999988888876543 346789999874 55666778888888888 Q ss_pred cceeeEEeeh-eeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhcc--Cccccccccccccccccccccc Q lcl|Aclame:pro 96 ATWVNATMRA-FKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ--GNNPFGKSIAQSIEKTNKVIKG 172 (324) Q Consensus 96 ~~~~~v~l~~-~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~--g~~~~~~~~~~~~~~~~~~~~~ 172 (324) .+-.+++++. +..+..+.|+++-...+..++.+.+.++..+++++++|+.++.-- .+.................... T Consensus 80 ~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~~~~~~~t~~~~ 159 (341) T protein:vir:94 80 VNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFSSSNGAITGNGQ 159 (341) T ss_pred ccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCccccCccccccCchh Confidence 7777888887 445677888886666678899999999999999999999877421 1111111111111111111223 Q ss_pred hhhhhHHHHHHHHhhhhcCCC--cEEEEcHHHHHHHHHhh-----ccCCceeeccCCcceeecceeEeecCCCCCCceeE Q lcl|Aclame:pro 173 DFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIV-----DPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELI 245 (324) Q Consensus 173 ~~~~~~i~~~~~~l~~~~~~~--~~~v~~~~~~~~l~~~~-----d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~i 245 (324) ..+++.+.++...|.....+. -.++++|..+..|.+.. +..|...+..+..++++|++|+.++..+...+... T Consensus 160 ~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~~~l~~G~ig~i~G~~V~~Sn~lp~~~~~~~ 239 (341) T protein:vir:94 160 AFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQFISKDFINNAPIAQGQIGSLMGVRVIRTSLIGNNSATGW 239 (341) T ss_pred hhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhhhccccchhheeeeeeEeceEEEEeccccccccccc Confidence 456788899999998877643 46789999999986532 22344445667778999999998876654332211 Q ss_pred E-------------------------eecccE--EEEEecce-EEEEeeccceeccccccccchhhhh--cCcEEEEEEE Q lcl|Aclame:pro 246 T-------------------------GDFDKL--IYGIPQLI-EYKIDETAQLSTVKNEDGTPVNLFE--QDMVALRATM 295 (324) Q Consensus 246 ~-------------------------gd~s~~--~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~f~--~~~v~~r~~~ 295 (324) . +|+... +.+-+..+ .+...+-..+.............|. +-...+++.. T Consensus 240 ~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~ 319 (341) T protein:vir:94 240 RNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQSFENREQVWLMVGRQ 319 (341) T ss_pred cccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhccccccccccccchhhhhhhhhhhhh Confidence 0 011100 11101110 0000000000000000000000111 1112355667 Q ss_pred EeccEEeccCceEEEEeecCCCCC Q lcl|Aclame:pro 296 HVALHIADDKAFAKLVPADKRTDS 319 (324) Q Consensus 296 r~d~~v~~~~A~~~l~~~~~~~~~ 319 (324) -+|++++||+|.+.|.. .+.++ T Consensus 320 ~~G~~~lrp~~~v~~~~--~~~~~ 341 (341) T protein:vir:94 320 AYGARLYRPLHAVNIHT--TGDTV 341 (341) T ss_pred hhcccccCcceeEEEec--CcCCC Confidence 88999999999876554 33333 No 129 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=99.43 E-value=4e-14 Score=94.02 Aligned_cols=275 Identities=12% Similarity=0.047 Sum_probs=165.1 Q ss_pred ccccc-CccccchHHHHHHHHHHHhhhhhhhh---------ccee--ecCCCceEEEEEeCC-cceeeeccCcccccccc Q lcl|Aclame:pro 30 MMHEK-KDGTLMNEFTTPILQEVMENSKIMQL---------GKYE--PMEGTEKKFTFWADK-PGAYWVGEGQKIETSKA 96 (324) Q Consensus 30 ~~~~~-~~~~vp~~~~~~i~~~~~~~s~l~~l---------~~~~--~~~~~~~~ip~~~~~-~~a~~v~Eg~~~~~~~~ 96 (324) +..+. ..-.+|+.+..-+.+...+.+.+.+. .... ..++..+++|.+..- .++.-+.|+..++..+. T Consensus 1 MA~T~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~i~~~~l 80 (324) T protein:vir:59 1 MAYTKISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDDLVPQKI 80 (324) T ss_pred CCceeeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcccchhhc Confidence 22222 33445665555555555556555332 1222 235677899998763 67788899999999999 Q ss_pred ceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhh Q lcl|Aclame:pro 97 TWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQ 176 (324) Q Consensus 97 ~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~ 176 (324) +..+.....++.+..+.++++...-+..+....+.+++++.+.+..++.+|.-...-.............+......+++ T Consensus 81 ~t~~~~a~i~~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~g~~~~~~~~~~~~dvsa~~~~~~s~ 160 (324) T protein:vir:59 81 NAGQDKAVLILRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELAGVFSNDDMKDNKLDISGTADGIYSA 160 (324) T ss_pred ccceeeEEEEeecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccceeeeeccccceecH Confidence 99888888889999999999888778889999999999999999999877642110000000000001112222345678 Q ss_pred hHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCC-ceeeccCCcceeecceeEeecCCCCCCc--------eeEEe Q lcl|Aclame:pro 177 DNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPET-KERIYDRNSDSLDGLPVVNLKSSNLKRG--------ELITG 247 (324) Q Consensus 177 ~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g-~~~~~~~~~~~l~G~pv~~~~~~~~~~~--------~~i~g 247 (324) +.+.++..++.+....-.+|+||+.++..|++..-.+. ++--....-+.++|++|+++.+++.... ..+|+ T Consensus 161 ~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li~~~~~s~~~~~i~~~~G~~VivdD~~p~~~~~~~~~~y~s~l~~ 240 (324) T protein:vir:59 161 ETFVDASYKLGDHESLLTAIGMHSATMASAVKQDLIEFVKDSQSGIRFPTYMNKRVIVDDSMPVETLEDGTKVFTSYLFG 240 (324) T ss_pred HHHHHHHHHhCCcccCcEEEEEchHHHHHHHHhhhhhhccccccCceeeeecccEEEEeCCCCccccCCCCceEEEEEEe Confidence 89999999999988888899999999999987542211 1111123456789999999877654221 23343 Q ss_pred ecccEEEEE-ecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 248 DFDKLIYGI-PQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) Q Consensus 248 d~s~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~~~~~ 324 (324) .- .+.++. ...+.+|.+|+.. .+...+..+.++. +||..+.....+.++...+-.|. T Consensus 241 ~G-Ai~~~~~~~~v~vE~dRd~~----------------~g~~~l~~r~~~~---~~p~G~s~~~~~~~~~sPt~~~L 298 (324) T protein:vir:59 241 AG-ALGYAEGQPEVPTETARNAL----------------GSQDILINRKHFV---LHPRGVKFTENAMAGTTPTDEEL 298 (324) T ss_pred cC-eEEEeecCCCcceecccCcc----------------ccceEEEEeeEEE---eEeeeEEecccccCCCCCChhhh Confidence 21 122333 2234555555431 2233344445533 44555554433222333333333 No 130 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=99.40 E-value=4.5e-14 Score=93.76 Aligned_cols=293 Identities=9% Similarity=-0.044 Sum_probs=165.1 Q ss_pred HhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCcceeeeccCccccc Q lcl|Aclame:pro 15 ASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQKIET 93 (324) Q Consensus 15 a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~~~~ 93 (324) |.+.......++....+..+ -.+.-+++..+|+......+.++++.++.++. +++..||+. +...++...-|+++.. T Consensus 1 m~~~~~~~~t~~~~~~~~~~-~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~s~~~~~i-G~~~~~~~~~g~~l~~ 78 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSD-VSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTNQLRVDRV-GASTIAGRKAGEELVV 78 (334) T ss_pred CCCCcCCCccccccccccch-heehhhhhhhHHHHHHHHhhhhhccceeeeccccceEEEeee-cceeeeeecCCCCCCC Confidence 33222222222222111111 12333999999999999999999999887765 677889987 6677778888888888 Q ss_pred cccceeeEEeehee-eEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHh----ccCccc-------c-ccccc Q lcl|Aclame:pro 94 SKATWVNATMRAFK-LGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGNNP-------F-GKSIA 160 (324) Q Consensus 94 ~~~~~~~v~l~~~k-~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~----G~g~~~-------~-~~~~~ 160 (324) +.++.+++++.... +.....|-+----++..|+.+.+.++++++++++.|++++. +..... . +++.. T Consensus 79 ~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~~~G~~~ 158 (334) T protein:vir:80 79 QKNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAFHDGILL 158 (334) T ss_pred CCcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccCCcce Confidence 87777887777655 33344444422223567899999999999999999998762 221111 0 11111 Q ss_pred ccccc---ccccccchhhhhHHHHHHHHhhhhcCC-----CcEEEEcHHHHHHHHHhhccC--------CceeeccCCcc Q lcl|Aclame:pro 161 QSIEK---TNKVIKGDFTQDNIIDLEALLEDDELE-----ANAFISKTQNRSLLRKIVDPE--------TKERIYDRNSD 224 (324) Q Consensus 161 ~~~~~---~~~~~~~~~~~~~i~~~~~~l~~~~~~-----~~~~v~~~~~~~~l~~~~d~~--------g~~~~~~~~~~ 224 (324) ..... .....+.....+.+..+...|.....+ .-..+++|..+..|.+-..-. +.-.+..+... T Consensus 159 ~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~~~~~~~~g~i~ 238 (334) T protein:vir:80 159 PSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKEGGNSFVGGRIA 238 (334) T ss_pred eecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceeccccccccccceeEE Confidence 11111 111111122234455666667666655 246799999999986542211 11223445567 Q ss_pred eeecceeEeecCCCCCC---------ceeEEeecccEEEEEecceEEEEeecc--ceeccccccccchhhhhcCcEEEEE Q lcl|Aclame:pro 225 SLDGLPVVNLKSSNLKR---------GELITGDFDKLIYGIPQLIEYKIDETA--QLSTVKNEDGTPVNLFEQDMVALRA 293 (324) Q Consensus 225 ~l~G~pv~~~~~~~~~~---------~~~i~gd~s~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~f~~~~v~~r~ 293 (324) +++|++|+.+++.+... ...+.|||+...........+-.-+.. +...+.+.. ....| +.+ T Consensus 239 ~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~--~~~d~------i~~ 310 (334) T protein:vir:80 239 MLNGVRVVETPRFPQSAITANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEKK--DFGHY------LDT 310 (334) T ss_pred EEeceEEEeecCCCCccccccccccccccccccccceEEEEEeCceEEEEEEeecceeeeechh--hHHHH------HHH Confidence 89999999887665332 124566766543221111111111111 111111111 01111 234 Q ss_pred EEEeccEEeccCceEEEEeecCCC Q lcl|Aclame:pro 294 TMHVALHIADDKAFAKLVPADKRT 317 (324) Q Consensus 294 ~~r~d~~v~~~~A~~~l~~~~~~~ 317 (324) ..-+|..++||+|++.++..-.-| T Consensus 311 ~~a~G~g~lRPeaa~vv~~~~~~~ 334 (334) T protein:vir:80 311 FQSYNIGQRRPDAVAVHDITVTNP 334 (334) T ss_pred HHHcCCceeccceEEEEEEeeecC Confidence 456799999999988776654444 No 131 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=99.29 E-value=1e-12 Score=86.34 Aligned_cols=292 Identities=7% Similarity=-0.022 Sum_probs=167.1 Q ss_pred HHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceee---cCCCceEEEEEeCCcceee Q lcl|Aclame:pro 8 KLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEP---MEGTEKKFTFWADKPGAYW 84 (324) Q Consensus 8 k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~---~~~~~~~ip~~~~~~~a~~ 84 (324) +..+|. .....+ ....++....++|+.|..++++.+.+.+++.+++.... ..+.+++||+.. .+.+.. T Consensus 1 ~~~~~~-------~~~~~~-~~~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g-~~~a~d 71 (381) T protein:vir:80 1 MATIQG-------TGGYKG-SAVDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNIS-RAAVYD 71 (381) T ss_pred Cceecc-------cccccC-cccchhhHHhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccC-cceeee Confidence 111110 000110 11112223457899999999999999988888875532 246678899875 567788 Q ss_pred eccCccccccccceeeEEeehee-eEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccC---cccccc--- Q lcl|Aclame:pro 85 VGEGQKIETSKATWVNATMRAFK-LGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG---NNPFGK--- 157 (324) Q Consensus 85 v~Eg~~~~~~~~~~~~v~l~~~k-~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g---~~~~~~--- 157 (324) ..++.+++....+..+++++..+ ......|++.-...+..++.+.+.+.+..++++++|+.++.-.. ....+. T Consensus 72 ~~~g~~i~~~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t 151 (381) T protein:vir:80 72 KQPQTPVNLQARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYS 151 (381) T ss_pred ecCCCcccccccCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Confidence 89999888888887777777744 34557888866666778999999999999999999998874211 100000 Q ss_pred ---ccccc-cccccccccchhhhhHHHHHHHHhhhhcCCC--cEEEEcHHHHHHHHHhh-----ccCCceeeccCCccee Q lcl|Aclame:pro 158 ---SIAQS-IEKTNKVIKGDFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIV-----DPETKERIYDRNSDSL 226 (324) Q Consensus 158 ---~~~~~-~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~--~~~v~~~~~~~~l~~~~-----d~~g~~~~~~~~~~~l 226 (324) .+... ............+++.|+++...|.....+. -.++++|..+..|.+.. +..+...+..+..+++ T Consensus 152 ~~~~i~~~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~~~~l~~G~Ig~i 231 (381) T protein:vir:80 152 YDTTLGDGTVNAHLTGTPAPLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQVKPVTSGVVGTI 231 (381) T ss_pred ccccccccccccccccchhhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhccchhhhceeeeEE Confidence 00000 0011112223557889999999998887643 47899999999986532 2223334566777899 Q ss_pred ecceeEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEec-cC Q lcl|Aclame:pro 227 DGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIAD-DK 305 (324) Q Consensus 227 ~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~-~~ 305 (324) +|++|+.++..+..........+ -+. ........ ..... .-|..+..+++....+|..+.. .. T Consensus 232 ~G~~Vv~Sn~lp~~~~t~~~~~a----gap-----~~~~~~~~-~~~~~------g~~s~~a~av~~~k~yd~~~~~~~~ 295 (381) T protein:vir:80 232 LGMEVIVTTQIGINSLTGYVNGQ----GAP-----TQPTPGVL-GSPYL------PDQAGTANVVNTGSASDLAVSLSYF 295 (381) T ss_pred cceEEEeecccccccccceeeec----ccc-----cccccccc-ccccc------cccccceeeeeeeeeeceeeeeeec Confidence 99999998765543222111000 000 00000000 00000 0133444566666677777633 22 Q ss_pred ceEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 306 AFAKLVPADKRTDSVPGEV 324 (324) Q Consensus 306 A~~~l~~~~~~~~~~~~~~ 324 (324) .+-...++.+..+..-+.. T Consensus 296 ~~~~~~g~~~~~~~~~~~~ 314 (381) T protein:vir:80 296 GLPVFSGAGATAADGGQTL 314 (381) T ss_pred cceeeecceeeecCCCcee Confidence 2333333333333322222 No 132 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=99.25 E-value=1e-12 Score=86.33 Aligned_cols=281 Identities=11% Similarity=0.056 Sum_probs=160.0 Q ss_pred HhhhhhHHhhccccccccccCc-----cccchHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCcceeeeccC Q lcl|Aclame:pro 15 ASNNVKPQVFNPDNVMMHEKKD-----GTLMNEFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEG 88 (324) Q Consensus 15 a~~~~~~~~~~~~~~~~~~~~~-----~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg 88 (324) +......+ ++.+.+....++ .+.-+++..++++.....++++++.++.++. +++..||+. +..++.....| T Consensus 1 ~~~~~~~~--~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r~i~~gks~~~~~i-G~~~~~~~~~G 77 (345) T protein:vir:22 1 MASMTGGQ--QMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL-GRTQAAYLAPG 77 (345) T ss_pred Ccccccch--hcccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEEeee-cceEEEeeecC Confidence 11111111 111111111111 3445999999999999999999999877765 677889987 67777888888 Q ss_pred cccccc--ccceeeEEeeh--eeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHh----ccC-cc---ccc Q lcl|Aclame:pro 89 QKIETS--KATWVNATMRA--FKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQG-NN---PFG 156 (324) Q Consensus 89 ~~~~~~--~~~~~~v~l~~--~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~----G~g-~~---~~~ 156 (324) +++..+ .++..+.+|.. .++.. ..|-+----++..|+.+.+.++++.++++..|+.++. +.. .. ..+ T Consensus 78 ~~l~~~~~~~~~~e~~ltID~~~y~~-~~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~~ 156 (345) T protein:vir:22 78 ENLDDKRKDIKHTEKVITIDGLLTAD-VLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNENI 156 (345) T ss_pred CCCCCCCCCcccceEEEEecchhhhh-hhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Confidence 877654 35666644443 33332 2222211123567999999999999999999998873 111 00 001 Q ss_pred cccccc----ccc-----ccccccchhhhhHHHHHHHHhhhhcCCCc--EEEEcHHHHHHHHHhhccC-----Cceeecc Q lcl|Aclame:pro 157 KSIAQS----IEK-----TNKVIKGDFTQDNIIDLEALLEDDELEAN--AFISKTQNRSLLRKIVDPE-----TKERIYD 220 (324) Q Consensus 157 ~~~~~~----~~~-----~~~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~v~~~~~~~~l~~~~d~~-----g~~~~~~ 220 (324) .+.... ... ......+...++.|.++...|..+..+.. .++++|..+..|.+-+.-+ |...... T Consensus 157 ~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~ 236 (345) T protein:vir:22 157 EGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILAALMPNAANYAALIDPEK 236 (345) T ss_pred cccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhcccccccccccccccccc Confidence 111000 000 01111223457778888888888877653 5799999999886543221 1111223 Q ss_pred CCcceeecceeEeecCCCCCCc---------------------ee---------EEeecccEEEEEecceEEEEeeccce Q lcl|Aclame:pro 221 RNSDSLDGLPVVNLKSSNLKRG---------------------EL---------ITGDFDKLIYGIPQLIEYKIDETAQL 270 (324) Q Consensus 221 ~~~~~l~G~pv~~~~~~~~~~~---------------------~~---------i~gd~s~~~~~~~~~~~~~~~~~~~~ 270 (324) +....+.|.+|+.+++.+.... .. ++...+.+..+...+++++..++. T Consensus 237 G~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~-- 314 (345) T protein:vir:22 237 GSIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRSAVGTVKLRDLALERARRA-- 314 (345) T ss_pred ceEEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCceEEEEEehhheeeeeeecceeeeeech-- Confidence 5567899999998765431100 01 111112112222223333433321 Q ss_pred eccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecC Q lcl|Aclame:pro 271 STVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) Q Consensus 271 ~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~ 315 (324) ..| .| .+++..-+|..++||+|.+.|+-+-. T Consensus 315 -----------~~~-~d--~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 315 -----------NFQ-AD--QIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred -----------hHH-HH--HHHHHHhcCCcccccceeEEEEEeeC Confidence 112 22 35677789999999999998876655 No 133 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=99.22 E-value=7e-12 Score=81.74 Aligned_cols=278 Identities=12% Similarity=0.045 Sum_probs=164.1 Q ss_pred hccccccccccCccccchHHHHHHHHHHHhhhhhhhh---------cceeecCCCceEEEEEeCC-cceeeeccCc-ccc Q lcl|Aclame:pro 24 FNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQL---------GKYEPMEGTEKKFTFWADK-PGAYWVGEGQ-KIE 92 (324) Q Consensus 24 ~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l---------~~~~~~~~~~~~ip~~~~~-~~a~~v~Eg~-~~~ 92 (324) +. + +++.-..-.+|+.+..-+.+...+.+.+.+. ......++..+++|.+..- ..+.-+.|++ .++ T Consensus 1 Ma--~-~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~G~~~~~~dg~~~i~ 77 (330) T protein:vir:10 1 MA--N-ELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLTGDSEVLGNGDKALE 77 (330) T ss_pred CC--C-CceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCccccc Confidence 11 1 1122233455665544444555555555432 1122346788999999754 5777788986 699 Q ss_pred ccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHh---c---cCccccccccccccccc Q lcl|Aclame:pro 93 TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL---N---QGNNPFGKSIAQSIEKT 166 (324) Q Consensus 93 ~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~---G---~g~~~~~~~~~~~~~~~ 166 (324) ..+.+..+-....++++..+.++++...-+..|..+.+.+++++...+..++.++. | +........+....... T Consensus 78 ~~ki~t~~~~a~i~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l~gvf~~~~~~~~~~~~~~~~~~ 157 (330) T protein:vir:10 78 TGKITAGADIACVLYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIATLNGIFATGTAGEKGALEETHVSD 157 (330) T ss_pred hhhcccceeEEEEEeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhhhhcccchhhhhhheec Confidence 99999999999999999999999999888889999999999999999988877663 2 11111111111111111 Q ss_pred cccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCC-ceeeccCCcceeecceeEeecCCCCCCce-- Q lcl|Aclame:pro 167 NKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPET-KERIYDRNSDSLDGLPVVNLKSSNLKRGE-- 243 (324) Q Consensus 167 ~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g-~~~~~~~~~~~l~G~pv~~~~~~~~~~~~-- 243 (324) ....+..++++.+.++..++.+....-.+|+||+.++..|++..--+. ++.-.....++++|++|+++..++...+. T Consensus 158 ~~~~~a~~s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li~~~~~s~~~~~i~~~~G~~VivdD~~p~~~~~yt 237 (330) T protein:vir:10 158 QSKASTGIDAGMVLDAKQLLGDSADQVTAIAMHSAVYTKLQKDNLIQYIQPTTATINIPTYLGYRVIIDDGIAPTGDIYT 237 (330) T ss_pred ccccccccCHHHHHHHHHHhccccccceEEEEcHHHHHHHHHhhhhhhhcccccCcccccccceEEEEeCCCCCCCCcee Confidence 122344567788999999999888888899999999999987432111 11112334578999999998877654432 Q ss_pred -eEEeecccEEEEE---ecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeec--CCC Q lcl|Aclame:pro 244 -LITGDFDKLIYGI---PQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPAD--KRT 317 (324) Q Consensus 244 -~i~gd~s~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~--~~~ 317 (324) .+|+.- .+.++. .....+|.+|+.. .....+..+.+ .++||..+..-+... ++. T Consensus 238 ~yl~~~G-Ai~~~~~~~~~~v~~EtdRd~~----------------~g~~~l~~r~~---~~~hp~G~s~~~~~~~~~~~ 297 (330) T protein:vir:10 238 SYLFRTG-SIGLNTGNPSGLTTFETSREAA----------------KGNDMIYTRRA---LVMHPYGVKWTGAEVDAGNI 297 (330) T ss_pred EEEEecC-ceeeecccCCccccccccCCcc----------------ccceEEEEeeE---EEeeeeeeeecccccccCcC Confidence 233311 111221 1123444444321 12233334444 335566666544322 222 Q ss_pred CCCCCCC Q lcl|Aclame:pro 318 DSVPGEV 324 (324) Q Consensus 318 ~~~~~~~ 324 (324) ..+-.|. T Consensus 298 sPt~~~L 304 (330) T protein:vir:10 298 TPSNADL 304 (330) T ss_pred CcChHHh Confidence 2333333 No 134 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=99.22 E-value=1.2e-12 Score=86.01 Aligned_cols=284 Identities=13% Similarity=0.093 Sum_probs=161.4 Q ss_pred HhhhhhHHhh--ccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeec-CCCceEEEEEeCCcceeeeccCccc Q lcl|Aclame:pro 15 ASNNVKPQVF--NPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPM-EGTEKKFTFWADKPGAYWVGEGQKI 91 (324) Q Consensus 15 a~~~~~~~~~--~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~-~~~~~~ip~~~~~~~a~~v~Eg~~~ 91 (324) +.+....+.. ++.....+.+.-.+.-++|..++++.....+.++++.++..+ ++++..||+. +...+.....|.++ T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~rti~~G~sv~~~~i-G~~~~~~~~~G~~l 79 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLVRSIQSGKSAQFPVL-GRTKAAYLQPGENL 79 (347) T ss_pred CCccccccccccccccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhheeccccceEEeeec-cceeEeeeecCcCC Confidence 1101011111 111111111112244499999999999999999999887664 4677888976 56667778888877 Q ss_pred ccc--ccceeeEEeeheee-EEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHh----ccCcc----ccccccc Q lcl|Aclame:pro 92 ETS--KATWVNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGNN----PFGKSIA 160 (324) Q Consensus 92 ~~~--~~~~~~v~l~~~k~-~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~----G~g~~----~~~~~~~ 160 (324) ..+ .+..++.++...++ .....|-+-=--++..|+.+.+.++++.++++..|+.++. +.... ..+.+.. T Consensus 80 ~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~~~g~~ 159 (347) T protein:vir:94 80 DDKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANNENIAGLG 159 (347) T ss_pred CCCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccCC Confidence 543 56777766665443 3333343322223567899999999999999999998862 11111 1111100 Q ss_pred -----cc----cccccccccchhhhhHHHHHHHHhhhhcCCC--cEEEEcHHHHHHHHHhhccC-Cce----eeccCCcc Q lcl|Aclame:pro 161 -----QS----IEKTNKVIKGDFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIVDPE-TKE----RIYDRNSD 224 (324) Q Consensus 161 -----~~----~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~--~~~v~~~~~~~~l~~~~d~~-g~~----~~~~~~~~ 224 (324) .. .........+...++.|.++..+|.....+. -.++++|..+..|.+..+.+ +.+ .+..+..+ T Consensus 160 ~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~~~~~~~~~~~~G~V~ 239 (347) T protein:vir:94 160 KAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILAALMPNAANYQALIDPSTGSIR 239 (347) T ss_pred cceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHHhhcccccccccccccccceeE Confidence 00 0111111223445677888889988777753 34677899988876543322 211 22345667 Q ss_pred eeecceeEeecCCCCCCc-e----------------------eEEeecccEE----------EEEecceEEEEeecccee Q lcl|Aclame:pro 225 SLDGLPVVNLKSSNLKRG-E----------------------LITGDFDKLI----------YGIPQLIEYKIDETAQLS 271 (324) Q Consensus 225 ~l~G~pv~~~~~~~~~~~-~----------------------~i~gd~s~~~----------~~~~~~~~~~~~~~~~~~ 271 (324) ++.|++|+.+++.+.... . -+=+||++.. .+...++++++.++. T Consensus 240 ~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e~~~~~--- 316 (347) T protein:vir:94 240 NVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALERARRA--- 316 (347) T ss_pred EeeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEEEechhhhhhhhhcccceeeeech--- Confidence 899999998877653221 0 0112222211 111122222222111 Q ss_pred ccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecC Q lcl|Aclame:pro 272 TVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) Q Consensus 272 ~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~ 315 (324) . ++.+ .+.+..-+|..++||+|.+.++-..+ T Consensus 317 ----------~-~~~~--~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 317 ----------N-FQAD--QIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred ----------h-hhhh--hhhhhhhhcCcccccceeEEEEecCC Confidence 1 1222 35677788999999999998877666 No 135 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=99.21 E-value=4.3e-12 Score=82.87 Aligned_cols=275 Identities=12% Similarity=0.029 Sum_probs=158.1 Q ss_pred ccccc-CccccchHHHHHHHHHHHhhhhhhhh---------cceeecCCCceEEEEEeC-CcceeeeccCccccccccce Q lcl|Aclame:pro 30 MMHEK-KDGTLMNEFTTPILQEVMENSKIMQL---------GKYEPMEGTEKKFTFWAD-KPGAYWVGEGQKIETSKATW 98 (324) Q Consensus 30 ~~~~~-~~~~vp~~~~~~i~~~~~~~s~l~~l---------~~~~~~~~~~~~ip~~~~-~~~a~~v~Eg~~~~~~~~~~ 98 (324) +..+. +.-.+|+.+..-+.+...+.+.+.+. ......++..+++|.+.. +.++.-+.|+..++..+.+. T Consensus 1 MA~T~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~~~~~i~~~kitt 80 (351) T protein:vir:15 1 MAETHLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWTDSDDIDVNNLTS 80 (351) T ss_pred CCceeeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCcccchheecc Confidence 22222 33345555544444555555555442 112223578899999875 35788889999999999998 Q ss_pred eeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHh---cc-Cccccccccccccccccccccchh Q lcl|Aclame:pro 99 VNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL---NQ-GNNPFGKSIAQSIEKTNKVIKGDF 174 (324) Q Consensus 99 ~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~---G~-g~~~~~~~~~~~~~~~~~~~~~~~ 174 (324) .+-....++.+..+.++++...-+..+..+.+.+++++...+..++.+|. |. +............ +........+ T Consensus 81 ~~~~a~i~~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l~gv~~~~~~~~~~~~d~-t~~~~~~~~i 159 (351) T protein:vir:15 81 GKQQGIKFYQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSVLKGVMGVTKIANSKVYDQ-TKVSPSEPMF 159 (351) T ss_pred cceeEEEEeeccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhhcccceecc-cccccccccc Confidence 88888889999999999988887788999999999999999999987774 21 1111110000001 1112234467 Q ss_pred hhhHHHHHHHHhhhhcCC-CcEEEEcHHHHHHHHHhhccCC-ceeeccCCcceeecceeEeecCCCCCC----c----ee Q lcl|Aclame:pro 175 TQDNIIDLEALLEDDELE-ANAFISKTQNRSLLRKIVDPET-KERIYDRNSDSLDGLPVVNLKSSNLKR----G----EL 244 (324) Q Consensus 175 ~~~~i~~~~~~l~~~~~~-~~~~v~~~~~~~~l~~~~d~~g-~~~~~~~~~~~l~G~pv~~~~~~~~~~----~----~~ 244 (324) +++.+.++..++.+.... -.+|+||+.++..|++...-+. ++-.....-++++|++|+++..++... + .. T Consensus 160 s~~~l~~A~~~~GD~~~~~~~~ivmhS~v~~~L~~~~li~~~~~s~~~~~i~t~~G~~VivdD~~p~~~~~~~~~~ytsy 239 (351) T protein:vir:15 160 GAKGFTGAIGLMGDLQDTAFGAIAVNSATYSLMKVQGLIETIQPQNGATPFEAYNGLRIVLDDDIEIDLTDKTKPVSTSY 239 (351) T ss_pred CHHHHHHHHHHhccccccceEEEEEChHHHHHHHhhhhhhhccccccCcccceecceEEEEcCCCccccCCCCCceeEEE Confidence 889999999999886544 5889999999999986542111 111112345789999999987765421 1 22 Q ss_pred EEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEe--ecCCCCCCCC Q lcl|Aclame:pro 245 ITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVP--ADKRTDSVPG 322 (324) Q Consensus 245 i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~--~~~~~~~~~~ 322 (324) +|+.- .+.++. ++..++++|+.... .++-.+..+.+ .++||..+..-.. .+++...+-. T Consensus 240 l~~~G-Ai~~~~-~~~~ve~~rd~~~~--------------~g~d~l~~r~~---~~~hp~G~s~~~~~~~~~~~sPt~~ 300 (351) T protein:vir:15 240 IFAPG-AVRYST-NMRSTETKYDPLIN--------------GGQDVIVQKRV---GTIHVAGTSIKASFSPSKASFPTID 300 (351) T ss_pred EEecc-eeeeec-CCcCcceeecccCC--------------CCceEEEEeee---eeeeeeeeeecccccccCcCCcChH Confidence 33321 111222 22234444432211 11111222222 2244554443221 1222222223 Q ss_pred CC Q lcl|Aclame:pro 323 EV 324 (324) Q Consensus 323 ~~ 324 (324) |. T Consensus 301 ~L 302 (351) T protein:vir:15 301 EL 302 (351) T ss_pred Hh Confidence 32 No 136 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=99.19 E-value=1.3e-11 Score=80.31 Aligned_cols=291 Identities=8% Similarity=0.014 Sum_probs=159.6 Q ss_pred HhhhhhHHhhccccccccccCc-----cccchHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCcceeeeccC Q lcl|Aclame:pro 15 ASNNVKPQVFNPDNVMMHEKKD-----GTLMNEFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEG 88 (324) Q Consensus 15 a~~~~~~~~~~~~~~~~~~~~~-----~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg 88 (324) +.+.-..+.-+++..+....++ .+.-+++..+++......++++++.++.++. +++..||+. +...+....-| T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~rti~~Gksv~f~~i-G~~t~~~~t~G 79 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTKRTLKNGKSLQFIYT-GRMTSSFHTPG 79 (375) T ss_pred CccccccccCccccCCccccccccchHHHHHHHHhHHHHHHHHHHHhhhccccccccccCceEEEEee-eeeEEeeecCC Confidence 1111111111111111111111 2445999999999999999999998877664 677889987 55556555555 Q ss_pred cccc---ccccceeeEEeeh--eeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHh----ccCccc----- Q lcl|Aclame:pro 89 QKIE---TSKATWVNATMRA--FKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGNNP----- 154 (324) Q Consensus 89 ~~~~---~~~~~~~~v~l~~--~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~----G~g~~~----- 154 (324) +++. ..+....+.++.. .|+. ...|.+----++..++.+.+.++++.++++..|+.++. +..... T Consensus 80 ~~i~~~~~~d~~~te~~l~ID~~~y~-~~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~~~~ 158 (375) T protein:vir:10 80 TPILGNADKAPPVAEKTIVMDDLLIS-SAFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASPVSAT 158 (375) T ss_pred cCcCCccccCCCCCceEEEecchhhh-hhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc Confidence 5543 2244444444443 3333 23333322223567999999999999999999998872 211111 Q ss_pred ---cccccccc---cccccccccchhhhhHHHHHHHHhhhhcCCC--cEEEEcHHHHHHHHHhhccC--------Cceee Q lcl|Aclame:pro 155 ---FGKSIAQS---IEKTNKVIKGDFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIVDPE--------TKERI 218 (324) Q Consensus 155 ---~~~~~~~~---~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~--~~~v~~~~~~~~l~~~~d~~--------g~~~~ 218 (324) .+++.... ........+....++.|.++..+|..+..+. -.++++|..+..|.+-+|.+ +..+. T Consensus 159 ~~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~~~~n~d~~~~~~~ 238 (375) T protein:vir:10 159 NFVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSNGLVNRDVQGSALQ 238 (375) T ss_pred cccccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCccceeeeccccccee Confidence 11111111 1111222334556788889999998887764 35799999998886654432 23333 Q ss_pred ccCCcceeecceeEeecCCCCCCce-----------------------------------eEEeec---c---------- Q lcl|Aclame:pro 219 YDRNSDSLDGLPVVNLKSSNLKRGE-----------------------------------LITGDF---D---------- 250 (324) Q Consensus 219 ~~~~~~~l~G~pv~~~~~~~~~~~~-----------------------------------~i~gd~---s---------- 250 (324) ..+...++.|++|+.+...+...++ -+-+|| + T Consensus 239 ~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~~~~~ 318 (375) T protein:vir:10 239 SGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLIFQKE 318 (375) T ss_pred ccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeeccccccccccccccCceEEEEEchh Confidence 4444567899999887654432211 111232 1 Q ss_pred cEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCCC Q lcl|Aclame:pro 251 KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVP 321 (324) Q Consensus 251 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~~ 321 (324) .+..+.-.++++++.+... -..+|-+ .+.+..-+|..+.||+|.+.|+..+ +.++.+ T Consensus 319 A~g~v~~~~~~~~~~~~~~-----------~~~~q~~--~i~~~~a~G~~~lrp~~av~l~~~~-~~~~~~ 375 (375) T protein:vir:10 319 AAGVVEAIGPQVQVTNGDV-----------SVIYQGD--VILGRMAMGADYLNPAAAVELYIGA-TAPSAF 375 (375) T ss_pred heeeeeeeccccccccchh-----------hheeeee--eeeeeeeeccCccCceeEEEEecCc-CccccC Confidence 1111111122222211000 0012222 3567788899999999999997654 333333 No 137 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=99.18 E-value=1.6e-12 Score=85.24 Aligned_cols=282 Identities=12% Similarity=0.056 Sum_probs=157.8 Q ss_pred HhhhhhHHhhccccccccccCc-----cccchHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCcceeeeccC Q lcl|Aclame:pro 15 ASNNVKPQVFNPDNVMMHEKKD-----GTLMNEFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEG 88 (324) Q Consensus 15 a~~~~~~~~~~~~~~~~~~~~~-----~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg 88 (324) +.+....+ .+++.+-...++ .+.-++|..++++.....++++++.++.++. +++..||+. +...+.....| T Consensus 1 ma~~~~~~--~~n~~~~~~~~~~~~~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~g~s~~~~~i-G~~~~~~~~~G 77 (344) T protein:vir:10 1 MANMTGGQ--QLGTNQGKDVMAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL-GRTQAAYLAPG 77 (344) T ss_pred Cccccccc--cCCcccCCccCCccchhHHHHHHHHHHHHHHHHHHhhhcccceeeeecccceEEEEee-ceeEEEeeecC Confidence 10000000 011111111111 1334999999999999999999999877765 677889987 56667777778 Q ss_pred cccccc--ccceeeEEeehee-eEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHh----ccCcc----ccc- Q lcl|Aclame:pro 89 QKIETS--KATWVNATMRAFK-LGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGNN----PFG- 156 (324) Q Consensus 89 ~~~~~~--~~~~~~v~l~~~k-~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~----G~g~~----~~~- 156 (324) ++++.+ .+.-+++++...+ ......|-+----++..++.+.+.++++.++++..|+.++. +.... ..+ T Consensus 78 ~~l~~t~~~~~~~e~~l~ID~~~y~~~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~~~~ 157 (344) T protein:vir:10 78 ENLDDIRKDIKHTEKVITIDGLLTADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNENIT 157 (344) T ss_pred CCCCCCCCCcccceEEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Confidence 887654 4566676665544 22223333322223567999999999999999999998863 11110 000 Q ss_pred ---ccc-cccccc----ccccccchhhhhHHHHHHHHhhhhcCCCc--EEEEcHHHHHHHHHhhcc-----CCceeeccC Q lcl|Aclame:pro 157 ---KSI-AQSIEK----TNKVIKGDFTQDNIIDLEALLEDDELEAN--AFISKTQNRSLLRKIVDP-----ETKERIYDR 221 (324) Q Consensus 157 ---~~~-~~~~~~----~~~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~v~~~~~~~~l~~~~d~-----~g~~~~~~~ 221 (324) .+. ...... ......+...++.|.++...|.....+.. .++++|..+..|.+-+.- .+...+..+ T Consensus 158 g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G 237 (344) T protein:vir:10 158 GLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILAALMPNAANYAALIDPEKG 237 (344) T ss_pred cccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhhcccccccccccccceeee Confidence 110 100000 11111223346677888888887777543 568899999988653321 112223345 Q ss_pred CcceeecceeEeecCCCCCC----ceeEE---------------eecccE----------EEEEecceEEEEeeccceec Q lcl|Aclame:pro 222 NSDSLDGLPVVNLKSSNLKR----GELIT---------------GDFDKL----------IYGIPQLIEYKIDETAQLST 272 (324) Q Consensus 222 ~~~~l~G~pv~~~~~~~~~~----~~~i~---------------gd~s~~----------~~~~~~~~~~~~~~~~~~~~ 272 (324) ....+.|++|+.+++.+... ..... .+++.. ..+...+++++..++ T Consensus 238 ~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~r~----- 312 (344) T protein:vir:10 238 SIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERARR----- 312 (344) T ss_pred EEEEEeceEEEeccccccccCCcccccccCccccccCCcccceeeecceeEEEeechhhhhhhhhccceeecccc----- Confidence 56778999999887654210 01111 122221 011111222222221 Q ss_pred cccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecC Q lcl|Aclame:pro 273 VKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) Q Consensus 273 ~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~ 315 (324) ...|. + .+++..-+|.+++||+|.+.++-++. T Consensus 313 --------~~~~~-d--~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 313 --------ANFQA-D--QIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred --------hhHHH-H--HHHHHhhcccceecccceEEEEeecC Confidence 11222 2 35677889999999999987777776 No 138 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=99.17 E-value=5.4e-12 Score=82.35 Aligned_cols=296 Identities=13% Similarity=0.045 Sum_probs=159.5 Q ss_pred HHHHhhhhhHHhhccccccccccCc-cccchHHHHHHHHHHHhhhhhhhhcceeec-CCCceEEEEEeCCcceeeeccCc Q lcl|Aclame:pro 12 QHFASNNVKPQVFNPDNVMMHEKKD-GTLMNEFTTPILQEVMENSKIMQLGKYEPM-EGTEKKFTFWADKPGAYWVGEGQ 89 (324) Q Consensus 12 ~~~a~~~~~~~~~~~~~~~~~~~~~-~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~-~~~~~~ip~~~~~~~a~~v~Eg~ 89 (324) ..+-.+...+...|+.-...+.+.. .+.-+.|..++++.....|.++.+.+..++ ++++..||+. +...+.-...|. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~r~i~~G~tv~i~~i-g~~~~~~~~~g~ 79 (332) T protein:vir:78 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFT-GKLSAGYHTPGT 79 (332) T ss_pred CcccccccCCccccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhccccccccccceEEEEec-cceeEeeecCCC Confidence 0111111111112211111122211 244489999999999999999999876654 4677889987 455566666666 Q ss_pred ccccc-ccceeeEEeehee-eEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHh----ccCcccccccccccc Q lcl|Aclame:pro 90 KIETS-KATWVNATMRAFK-LGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGNNPFGKSIAQSI 163 (324) Q Consensus 90 ~~~~~-~~~~~~v~l~~~k-~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~----G~g~~~~~~~~~~~~ 163 (324) .+... .++-+++++...+ .+....|-+---.++..++.+.+.++.+.++++.+|+.++. +...+.+..+...+. T Consensus 80 ~l~~~~~~~~~~~~l~ID~~ky~~~~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~~~g~~ 159 (332) T protein:vir:78 80 PIVGDAGIKANEKTLVMDDLLVSSQFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGF 159 (332) T ss_pred CCCCCCCCCCceEEEEEehhhhhHHHHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCccccccccc Confidence 65433 3555555555543 22222332211123557899999999999999999988763 222221111111110 Q ss_pred ---ccccccccchhhhhHHHHHHHHhhhhcCCCc--EEEEcHHHHHHHHHhhccC-------C-ceeec-cCCcceeecc Q lcl|Aclame:pro 164 ---EKTNKVIKGDFTQDNIIDLEALLEDDELEAN--AFISKTQNRSLLRKIVDPE-------T-KERIY-DRNSDSLDGL 229 (324) Q Consensus 164 ---~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~v~~~~~~~~l~~~~d~~-------g-~~~~~-~~~~~~l~G~ 229 (324) .......+....++.|.++..+|..+..+.. .++++|..+..|.+.+|.. + ...+. +...+.++|+ T Consensus 160 ~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~~~~~~g~~i~~i~G~ 239 (332) T protein:vir:78 160 HVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIAGI 239 (332) T ss_pred ccccCCccccCHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCceeeeeeccccccceecceeeeEEeee Confidence 0111122233456778899999988887643 4677999998887644321 1 11222 2235789999 Q ss_pred eeEeecCCCCCCce------------eEEeecccE--EEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEE Q lcl|Aclame:pro 230 PVVNLKSSNLKRGE------------LITGDFDKL--IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATM 295 (324) Q Consensus 230 pv~~~~~~~~~~~~------------~i~gd~s~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~ 295 (324) +|+.+++.+...++ .+-|+|++. ++.-+..+....... ..............| .+ .+++.. T Consensus 240 ~V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~--~~~~~t~~~~~~~~~-~d--~i~~~~ 314 (332) T protein:vir:78 240 RILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVA--PTIQTTSGDFNVQYQ-GD--LIVGKL 314 (332) T ss_pred EEEecCccccCcccccccccccccccccccccccceEEeecccceeeeeeec--cchhhhhcccchhhh-Hh--hhhhhh Confidence 99988766533221 234444431 111111111110000 000000000111122 12 356777 Q ss_pred EeccEEeccCceEEEEee Q lcl|Aclame:pro 296 HVALHIADDKAFAKLVPA 313 (324) Q Consensus 296 r~d~~v~~~~A~~~l~~~ 313 (324) .+|+.++||++++.|+-+ T Consensus 315 ~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 315 AMGCGSLRTSVAGSFQAA 332 (332) T ss_pred hhcCceecccceEEEeeC Confidence 899999999999999887 No 139 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=99.16 E-value=4.4e-12 Score=82.84 Aligned_cols=285 Identities=12% Similarity=0.079 Sum_probs=160.7 Q ss_pred CchhHHHHHHHHHHHhhhhhHHh--hccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeec-CCCceEEEEEe Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQV--FNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPM-EGTEKKFTFWA 77 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~--~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~-~~~~~~ip~~~ 77 (324) |-.. -..+. .++..-..+.+.=.+.-++|..+++......|.++++.+..++ ++++..||+. T Consensus 1 ~a~~--------------~~~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~G~sv~~~~i- 65 (347) T protein:vir:88 1 MANA--------------TGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVM- 65 (347) T ss_pred CCCc--------------ccchhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhccccccccCcceEEEeee- Confidence 1111 01111 1111111111211344499999999999999999999887664 4677888976 Q ss_pred CCcceeeeccCccccc--cccceeeEEeeheee-EEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhc----c Q lcl|Aclame:pro 78 DKPGAYWVGEGQKIET--SKATWVNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILN----Q 150 (324) Q Consensus 78 ~~~~a~~v~Eg~~~~~--~~~~~~~v~l~~~k~-~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G----~ 150 (324) +...+.....|..+.. ..+..+++++...+. .....|.+--.-++..|+.+.+.+++++++++..|+.++.- . T Consensus 66 G~~~~~~~~~g~~l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a 145 (347) T protein:vir:88 66 GRTKGYYLAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLC 145 (347) T ss_pred cceeeeeeccccCCCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 4555666667766554 346667777766553 33334444322335678999999999999999999988731 1 Q ss_pred Cc----ccccccc----ccccccc----cccccchhhhhHHHHHHHHhhhhcCCC--cEEEEcHHHHHHHHHhhcc---- Q lcl|Aclame:pro 151 GN----NPFGKSI----AQSIEKT----NKVIKGDFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIVDP---- 212 (324) Q Consensus 151 g~----~~~~~~~----~~~~~~~----~~~~~~~~~~~~i~~~~~~l~~~~~~~--~~~v~~~~~~~~l~~~~d~---- 212 (324) .. +....++ .....+. .....+...++.|.++...|.....+. -.++++|..+..|.+.... T Consensus 146 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~ 225 (347) T protein:vir:88 146 NLPAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAAN 225 (347) T ss_pred ccccccccccCCccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhcchhhhhhh Confidence 10 0111111 1111100 111112334677888888888777653 4689999998887543221 Q ss_pred -CCceeeccCCcceeecceeEeecCCCCCCc-ee----------------------EEeecccEEE----------EEec Q lcl|Aclame:pro 213 -ETKERIYDRNSDSLDGLPVVNLKSSNLKRG-EL----------------------ITGDFDKLIY----------GIPQ 258 (324) Q Consensus 213 -~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~-~~----------------------i~gd~s~~~~----------~~~~ 258 (324) ++...+..+..+.+.|++|+.+++.+.... .. +.+|+++... +... T Consensus 226 ~~~~~~~~~G~vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~ 305 (347) T protein:vir:88 226 YAALIDPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLK 305 (347) T ss_pred hccccchhcceeeeeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechhhhhheecc Confidence 111223445667899999999887654221 11 1122322111 1111 Q ss_pred ceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCC Q lcl|Aclame:pro 259 LIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKR 316 (324) Q Consensus 259 ~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~ 316 (324) ++.++..++. ..| .| .+++...+|..++||+|.+.|+-.++. T Consensus 306 d~~~e~~r~~-------------~~~-~d--~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 306 DMALERARRP-------------EFQ-AD--QIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred cceeeeeech-------------hhH-HH--HhhhhhhhcCceeccceEEEEEeCCCC Confidence 2222222211 112 22 467888999999999999888765555 No 140 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=99.13 E-value=2.9e-11 Score=78.34 Aligned_cols=296 Identities=10% Similarity=-0.010 Sum_probs=159.7 Q ss_pred HhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCcceeeeccCccccc Q lcl|Aclame:pro 15 ASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQKIET 93 (324) Q Consensus 15 a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~~~~ 93 (324) |.. ..-..|+..-.+..+- .+.-+++..+|.+.....+.++++..+.++. ++++.+|+. +...++...-|+++.. T Consensus 1 ms~--~~~~tr~~~~~s~~d~-al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G~~~~~~~~pG~~l~~ 76 (335) T protein:vir:63 1 MSF--LNDLTRPNYAGKNADV-DIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-GNVEAKGRRAGEELER 76 (335) T ss_pred CCC--cccchhhhcccccchh-heehhhhhhhHHHHHHhhhhhccccceeeeccceeEEEeee-eeeeeecccCCcCcCC Confidence 110 0111122211222222 2334999999999999999999998887664 567888987 6677777777877777 Q ss_pred cccceeeEEeeheeeE-EeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHH----hccCccc--------cccccc Q lcl|Aclame:pro 94 SKATWVNATMRAFKLG-VILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGI----LNQGNNP--------FGKSIA 160 (324) Q Consensus 94 ~~~~~~~v~l~~~k~~-~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l----~G~g~~~--------~~~~~~ 160 (324) +.+..++.++....+= ....|-+----++..|+.+.+.+++++++++..|+.++ .+..... .+++.. T Consensus 77 ~~~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~G~~~ 156 (335) T protein:vir:63 77 SRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVLE 156 (335) T ss_pred CCccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCcCCCcce Confidence 7677777777665422 22223221112356899999999999999999999775 2222111 111111 Q ss_pred ccccccccc-ccchhhhhHHHHHHHHhhhhcCCC-----cEEEEcHHHHHHHHHhhccCCc--------eeeccCCccee Q lcl|Aclame:pro 161 QSIEKTNKV-IKGDFTQDNIIDLEALLEDDELEA-----NAFISKTQNRSLLRKIVDPETK--------ERIYDRNSDSL 226 (324) Q Consensus 161 ~~~~~~~~~-~~~~~~~~~i~~~~~~l~~~~~~~-----~~~v~~~~~~~~l~~~~d~~g~--------~~~~~~~~~~l 226 (324) ....+.... .+..-..+.+.++..+|..+..+. -..+++|..+..|.+.+.--.+ ..+..+....+ T Consensus 157 ~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~~v 236 (335) T protein:vir:63 157 KLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGATNDYVKSRVAIL 236 (335) T ss_pred eeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhccccccccccccccccccccCceeEEe Confidence 111111111 111222344557777777666552 4689999999998764322111 12334556789 Q ss_pred ecceeEeecCCCCCC---------ceeEEeecccEEEEEecceEEEEeecc--ceeccccccccchhhhhcCcEEEEEEE Q lcl|Aclame:pro 227 DGLPVVNLKSSNLKR---------GELITGDFDKLIYGIPQLIEYKIDETA--QLSTVKNEDGTPVNLFEQDMVALRATM 295 (324) Q Consensus 227 ~G~pv~~~~~~~~~~---------~~~i~gd~s~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~f~~~~v~~r~~~ 295 (324) .|+||+.++..+... ...+-+|++.........-.+-+-+.. +...+.+... | .+ .+.+.. T Consensus 237 ~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~~-----~-~~--~i~~~~ 308 (335) T protein:vir:63 237 NGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNEK-----F-SW--VLDTFQ 308 (335) T ss_pred eceEEEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEeecccceeeccch-----h-hH--HhHHHH Confidence 999999877654322 223445554322111111111111111 1111111111 1 11 133445 Q ss_pred EeccEEeccCceEEEEeecCCCCCCCC Q lcl|Aclame:pro 296 HVALHIADDKAFAKLVPADKRTDSVPG 322 (324) Q Consensus 296 r~d~~v~~~~A~~~l~~~~~~~~~~~~ 322 (324) -+|..++||+|.+.++-.-.+.-.--+ T Consensus 309 a~G~g~lRPe~a~~i~~tg~~~~~~~~ 335 (335) T protein:vir:63 309 MYNIGARRPDTAGAIELKGIGAFDITA 335 (335) T ss_pred HcCCcccccceEEEEEEcCCCceeecC Confidence 589999999999988753332111111 No 141 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=99.10 E-value=2e-11 Score=79.25 Aligned_cols=269 Identities=13% Similarity=0.090 Sum_probs=152.3 Q ss_pred hccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCcceeeeccCcccccccccee--- Q lcl|Aclame:pro 24 FNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQKIETSKATWV--- 99 (324) Q Consensus 24 ~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~--- 99 (324) ++..|.+++++-+.-.--.+.+++-+.+.+...+++..+.+|+. +..+++|++.-...+.-|+||+.+|.++.+.+ T Consensus 1 mAe~nlt~~~dL~~~~sidfv~~f~~~i~~L~~~Lgi~r~~p~a~G~tIt~pK~~~tgda~dVaEGe~Iplskvt~~~~~ 80 (295) T protein:vir:99 1 MAEKNLNTMADLGDIKSIDFVNKFSKNINDLLKLLGVTRRETLTNDLKIQTYKWEVTLDQTDPGEGETIPLSKVTRTKDK 80 (295) T ss_pred CCCcccccHhhccCceeehhhHHhhhhHHHHHHHhccccccccccCCeEEeeeeeeecccccccCCcccchhhheeeeee Confidence 33334444443221111234444444455555566666888886 56699999988888899999999999999875 Q ss_pred eEEeeheeeEEeeeehHHHhhc-ChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhhhH Q lcl|Aclame:pro 100 NATMRAFKLGVILPVTKEFLNY-TYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDN 178 (324) Q Consensus 100 ~v~l~~~k~~~~~~iS~e~l~d-s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (324) ..+++.+|++.. +|.|.++. ..-+....-.++|..+++.++|+.++.-..++. .+. ....-...++. T Consensus 81 t~t~kikK~rK~--tTdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat---------~t~-tg~~lq~a~a~ 148 (295) T protein:vir:99 81 DYTVKWFKKRRA--TTAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKP---------TKV-KGVGLQKALSA 148 (295) T ss_pred eeEEEeeeeccc--ccHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCc---------eee-ehhhHHHHHHH Confidence 477788888875 49999954 456778889999999999999999996332111 000 00000112333 Q ss_pred HHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccC-CceeeccCC-cceeecce-eEeecCCCCCCceeEEeecccEEEE Q lcl|Aclame:pro 179 IIDLEALLEDDELEANAFISKTQNRSLLRKIVDPE-TKERIYDRN-SDSLDGLP-VVNLKSSNLKRGELITGDFDKLIYG 255 (324) Q Consensus 179 i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~-g~~~~~~~~-~~~l~G~p-v~~~~~~~~~~~~~i~gd~s~~~~~ 255 (324) +.+.+..+.+.+..+.++++||.....|++-..-+ .+.-.++.. --.++|.- ++++. .+++|.++.--..++.+. T Consensus 149 ~~~al~~f~Ee~~~~~V~FVnP~D~a~yl~~A~~~~~~a~~fG~~~L~nfLG~q~II~S~--kv~~G~~~aT~~~Ni~~a 226 (295) T protein:vir:99 149 SWAKLATFNEFEGSPLVSFVSPLDVANYLGDTKVGADASNVFGMTLLKNFLGMQNVIVMP--SVPEGKIYSTAVENLVFA 226 (295) T ss_pred hhhhhhhcccccCCceEEEEehHHHHHHHhccccccchhhhhhhhhhhhhhccceEEEcc--cCCCceEEEeeccceEEE Confidence 33334444444445568999999998887544322 111101111 01388987 66654 466777765433333222 Q ss_pred E--ecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEE--------------ec--cEEeccCceEEEEeecCCC Q lcl|Aclame:pro 256 I--PQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMH--------------VA--LHIADDKAFAKLVPADKRT 317 (324) Q Consensus 256 ~--~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r--------------~d--~~v~~~~A~~~l~~~~~~~ 317 (324) + ..+-.+. + .. .|..|.+.|-+..+ .+ +-.-+.+++++.++.++.+ T Consensus 227 y~~~~~g~l~---~-~f------------~~~~D~tglIg~~h~~~~~~~t~et~~~~~~~lfpE~~dgiv~~tI~~~~~ 290 (295) T protein:vir:99 227 SLNVKGGDLG---G-LF------------ADFTDETGLIAAARNRQLSNLTYESVFFGANVLFAEIPEGVVEATIEAAAV 290 (295) T ss_pred EecCCchhhh---h-hh------------hhccCcccceEEEeccccceeeehhhhHhHHHhcccccceEEEEEEecCcC Confidence 1 1110000 0 00 01112222211111 01 1123478899999977777 Q ss_pred CCCCC Q lcl|Aclame:pro 318 DSVPG 322 (324) Q Consensus 318 ~~~~~ 322 (324) +..-| T Consensus 291 ~~~~~ 295 (295) T protein:vir:99 291 PGIGG 295 (295) T ss_pred CCCCC Confidence 66666 No 142 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=99.08 E-value=7.3e-11 Score=76.15 Aligned_cols=295 Identities=10% Similarity=0.003 Sum_probs=156.8 Q ss_pred HhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeec-CCCceEEEEEeCCcceeeeccCccccc Q lcl|Aclame:pro 15 ASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPM-EGTEKKFTFWADKPGAYWVGEGQKIET 93 (324) Q Consensus 15 a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~-~~~~~~ip~~~~~~~a~~v~Eg~~~~~ 93 (324) |..- .-..|+....+..+. .+.-+++..+|++.....+.++++..+.++ +++++.+|+. +...++...-|+++.. T Consensus 1 ms~~--~~~t~~~~~~s~~d~-al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~i-G~~~~~~~~pG~~l~~ 76 (335) T protein:vir:78 1 MSFL--NDLTRPNYAGKNADV-DIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRL-GNVEAKGRRAGEELER 76 (335) T ss_pred CCcc--ccccccccccccchh-hhhhhhhhhHHHHHHHHhhhhccccceeeeccceeEEEeee-eeeeecccccCcccCC Confidence 1100 111122221222222 244499999999999999999999887765 4577889976 5666777777777776 Q ss_pred cccceeeEEeeheeeE-EeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHH----hccCccc--------cccccc Q lcl|Aclame:pro 94 SKATWVNATMRAFKLG-VILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGI----LNQGNNP--------FGKSIA 160 (324) Q Consensus 94 ~~~~~~~v~l~~~k~~-~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l----~G~g~~~--------~~~~~~ 160 (324) +.+..++.++....+= ....|-+----++..|+.+.+.+++++++++..|+.++ .+..... .+++.. T Consensus 77 ~~~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~G~~~ 156 (335) T protein:vir:78 77 SRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVLE 156 (335) T ss_pred CCcccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCCCcce Confidence 6677777666664422 22222221112356899999999999999999999876 2222111 111111 Q ss_pred ccccc-ccccccchhhhhHHHHHHHHhhhhcCC-----CcEEEEcHHHHHHHHHhhccC--------CceeeccCCccee Q lcl|Aclame:pro 161 QSIEK-TNKVIKGDFTQDNIIDLEALLEDDELE-----ANAFISKTQNRSLLRKIVDPE--------TKERIYDRNSDSL 226 (324) Q Consensus 161 ~~~~~-~~~~~~~~~~~~~i~~~~~~l~~~~~~-----~~~~v~~~~~~~~l~~~~d~~--------g~~~~~~~~~~~l 226 (324) ....+ .....++....+.+.++...+.....+ .-+.+++|..+..|...+.-- |...+..+....+ T Consensus 157 ~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~~v 236 (335) T protein:vir:78 157 KLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQATGATNDYVKSRVAIL 236 (335) T ss_pred eeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhcccccccccccccccccccccceeEEe Confidence 11111 111112223334455566666655543 346899999999987643221 1222344556789 Q ss_pred ecceeEeecCCCCCCc---------eeEEeeccc-EEEEEecc-eEEEEeeccceeccccccccchhhhhcCcEEEEEEE Q lcl|Aclame:pro 227 DGLPVVNLKSSNLKRG---------ELITGDFDK-LIYGIPQL-IEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATM 295 (324) Q Consensus 227 ~G~pv~~~~~~~~~~~---------~~i~gd~s~-~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~ 295 (324) .|+||+.++..+...+ ...-+|++. +.+..... +..-.....+...+.+... |. + .+.+.. T Consensus 237 ~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~-----~~-~--~i~~~~ 308 (335) T protein:vir:78 237 NGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHDQ-----FS-W--VLDTFQ 308 (335) T ss_pred eceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEecccceeeccch-----hh-H--hhhHHH Confidence 9999998876653221 222234432 11111111 1100000001111111111 11 1 133445 Q ss_pred EeccEEeccCceEEEEeecCCC-CCCC Q lcl|Aclame:pro 296 HVALHIADDKAFAKLVPADKRT-DSVP 321 (324) Q Consensus 296 r~d~~v~~~~A~~~l~~~~~~~-~~~~ 321 (324) -+|..++||+|.+.++-.-.+. +-+. T Consensus 309 a~G~g~lRPe~a~~i~~tg~~~~~~~~ 335 (335) T protein:vir:78 309 MYNIGARRPDTAGAIELKGIEAFDITA 335 (335) T ss_pred HcCCcccCcceEEEEEecCCCcccccC Confidence 6899999999998776433221 1111 No 143 >protein:vir:93858 Length: 400 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764266;genbank:gi:115315579;genbank:GeneID:5141552 Probab=99.08 E-value=9.6e-12 Score=80.99 Aligned_cols=292 Identities=11% Similarity=0.106 Sum_probs=174.8 Q ss_pred CchhHHHHHHHHH-HHhhhh------------hHHhhccccc---cccccCccccchHHHHHHHHHHHhhhhhhhhccee Q lcl|Aclame:pro 1 MEQTQKLKLNLQH-FASNNV------------KPQVFNPDNV---MMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYE 64 (324) Q Consensus 1 ~~~~~~~k~~~~~-~a~~~~------------~~~~~~~~~~---~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~ 64 (324) ..-+.++.+|++. .|...+ .+.-|+++.. .+.++....+|.-+...|-..+....++++.+.+. T Consensus 77 ~Kgk~~mtefLkT~~A~~~fa~~l~~nsg~sd~knaW~A~l~E~gvt~td~n~iLP~~il~aIq~al~~~~~~~~f~~v~ 156 (400) T protein:vir:93 77 PKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVT 156 (400) T ss_pred cccchhHHHhhhhHHHHHHHHHHHHhhcCCcchhhhhhhhhhhcccccCCchhhcchHHHHHHHHhhhccCCcccceeee Confidence 1111111122211 122111 1112333322 22234445789999999999999999999987776 Q ss_pred ecCCCceEEEEEeCCcceee-eccCccccccccceeeEEeeheeeEEeeeehHHHhhc--ChHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 65 PMEGTEKKFTFWADKPGAYW-VGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNY--TYSQFFEEMKPMIAEAFYKK 141 (324) Q Consensus 65 ~~~~~~~~ip~~~~~~~a~~-v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~d--s~~~~~~~i~~~l~~ai~~~ 141 (324) ..+. +-+-+......-+| ..-|+.+.++..+|..-++.|+-++.+..+-+-..++ +.-.+.+|++.+|...+-.+ T Consensus 157 n~p~--l~V~~~~dt~~qa~gHk~G~~K~eq~~tl~~rtL~P~~VYk~~~la~~~~~~~~tygaL~nYVm~EL~q~vI~k 234 (400) T protein:vir:93 157 NVGA--LLVSRSFDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNK 234 (400) T ss_pred cCCc--eeeecchhhhcccceeccCCcccceeeeeeeeccCHHHHHHHhhhhhhhhhccccHHHHHHHHHHHHHHHHHHH Confidence 6532 22222222233455 6788999999999999999998888888874443332 24567999999999999964 Q ss_pred -HHHHHHhccCccccccccc--c----ccccccccccchhhhhHHHH-HHHHhhhhcCCCcEEEEcHHHHHHHHHhhccC Q lcl|Aclame:pro 142 -FDEAGILNQGNNPFGKSIA--Q----SIEKTNKVIKGDFTQDNIID-LEALLEDDELEANAFISKTQNRSLLRKIVDPE 213 (324) Q Consensus 142 -~d~~~l~G~g~~~~~~~~~--~----~~~~~~~~~~~~~~~~~i~~-~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~ 213 (324) ++.+++-|+|.++.-.+.. . ...+...-.++...+.+++. ++.-+.....+.-.++++|..|+.|+.++|++ T Consensus 235 ~Ve~Aii~GdG~Ngf~~~dk~t~Ik~I~~dt~kt~~a~~~~~qdl~E~~~d~~~~~aad~~~Iv~s~d~~A~L~~lk~a~ 314 (400) T protein:vir:93 235 IVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQAT 314 (400) T ss_pred HhhhheeecccccccCCCcchhhhhhhhhhhhhhhhcCCccHHHHHHHHHhhhhhccCCceeEEeccchHHHHHHhcCCc Confidence 7999999999876432210 0 00011112234444544443 34333344445567999999999999999999 Q ss_pred CceeeccCCcce----eecc-eeEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCc Q lcl|Aclame:pro 214 TKERIYDRNSDS----LDGL-PVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDM 288 (324) Q Consensus 214 g~~~~~~~~~~~----l~G~-pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~ 288 (324) |.+.|..+.... -.|. .+++......++..+++ |-.. ++... +++ ... + .-|.+|+ T Consensus 315 ~~a~f~~~n~d~~IA~~fGv~~Lv~~Tr~~~~kp~V~V-Dek~-~i~~~-~~~--t~~--------s------f~~~tNs 375 (400) T protein:vir:93 315 ANANVRIKNDDTEIASEVGVDEIIVYTGSKALKPTVLV-DQKY-HIDMQ-DLT--KVD--------A------FEWKTNS 375 (400) T ss_pred ceeeeeeccccchhhhhcccceeeeeccCCCCCceeee-ehhh-hcccc-Cce--ecc--------c------eeeeecc Confidence 999985443221 2333 23334444555544443 5322 22221 111 100 0 1145666 Q ss_pred EEEEEEEEeccEEeccCceEEEEee Q lcl|Aclame:pro 289 VALRATMHVALHIADDKAFAKLVPA 313 (324) Q Consensus 289 v~~r~~~r~d~~v~~~~A~~~l~~~ 313 (324) -.+.++.+.++.+.-+++-+.++.+ T Consensus 376 ~~ilvetlv~Gsi~~~N~~ay~~v~ 400 (400) T protein:vir:93 376 NMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred ceEEeeeeeccceecccceeeEeeC Confidence 7788899999999999999999988 No 144 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=99.07 E-value=2.6e-10 Score=73.12 Aligned_cols=290 Identities=9% Similarity=-0.033 Sum_probs=155.8 Q ss_pred hhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeec-CCCceEEEEEeCCcceeeeccCccccccc Q lcl|Aclame:pro 17 NNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPM-EGTEKKFTFWADKPGAYWVGEGQKIETSK 95 (324) Q Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~-~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~ 95 (324) ........++.... +.+.-.+.-+++..++.+.....+.++++..+.++ ++++..||+. +..+++...-|+++.... T Consensus 1 ms~~n~~t~~~~~~-~~~~~al~le~f~geV~taf~~~s~~~~~~~~rti~~gkS~q~~~i-G~~~~~~~~~G~~ld~~~ 78 (364) T protein:vir:10 1 MSNPNVLTQPAVSA-SGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVVGTNSVSNKYI-GETELQVLSPGKSPDASP 78 (364) T ss_pred CCCccccccccccc-ccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEeeee-eeeEEeeeccCcccCCCC Confidence 00111111111111 11111244589999999999999999999887765 4677889987 455556655555555555 Q ss_pred cceeeEEeeheee-EEeeeehHHHhhcChHH-HHHHHHHHHHHHHHHHHHHHHHh----ccCcc-----ccccccccc-- Q lcl|Aclame:pro 96 ATWVNATMRAFKL-GVILPVTKEFLNYTYSQ-FFEEMKPMIAEAFYKKFDEAGIL----NQGNN-----PFGKSIAQS-- 162 (324) Q Consensus 96 ~~~~~v~l~~~k~-~~~~~iS~e~l~ds~~~-~~~~i~~~l~~ai~~~~d~~~l~----G~g~~-----~~~~~~~~~-- 162 (324) +.-++.++....+ .....|-+=---++.++ +.+.+.+++++++++..|+.++. +.-++ ..+.+...+ T Consensus 79 ~~~~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~~~g~~ 158 (364) T protein:vir:10 79 TEFDKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRVAGHGFS 158 (364) T ss_pred cccCcEEEEecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCcccCCcce Confidence 6666666655432 22222222111134566 78899999999999999998852 11011 001111100 Q ss_pred ----cccccccccchhhhhHHHHHHHHhhhhcCCC--cEEEEcHHHHHHHHHhhcc-------CCceeeccCCcceeecc Q lcl|Aclame:pro 163 ----IEKTNKVIKGDFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIVDP-------ETKERIYDRNSDSLDGL 229 (324) Q Consensus 163 ----~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~--~~~v~~~~~~~~l~~~~d~-------~g~~~~~~~~~~~l~G~ 229 (324) .........+....+.+.++...|..+..+. -.++++|..+..|.+-..- .+...+..+....+.|+ T Consensus 159 i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~~lvn~d~~~~~~~~~~~G~v~~v~Gv 238 (364) T protein:vir:10 159 IHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDADRIVDKSYTIAASDNTVDGFVLKSWNT 238 (364) T ss_pred eeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCCccccccccccCCCccccceeEEEece Confidence 0001111112223445566777887777654 4689999999888654221 12223345566789999 Q ss_pred eeEeecCCCCCCc-------------------ee--EEeecccE--EEEEe--------cceEEEEeeccceeccccccc Q lcl|Aclame:pro 230 PVVNLKSSNLKRG-------------------EL--ITGDFDKL--IYGIP--------QLIEYKIDETAQLSTVKNEDG 278 (324) Q Consensus 230 pv~~~~~~~~~~~-------------------~~--i~gd~s~~--~~~~~--------~~~~~~~~~~~~~~~~~~~~~ 278 (324) ||+.+++.+...+ .- +.+|++.. .+..+ .++..++.++. . T Consensus 239 ~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~--------~- 309 (364) T protein:vir:10 239 PIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEK--------K- 309 (364) T ss_pred EEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeeecc--------c- Confidence 9988766542111 00 11333322 11112 22222222111 0 Q ss_pred cchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCCCCC-C Q lcl|Aclame:pro 279 TPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGE-V 324 (324) Q Consensus 279 ~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~~~~-~ 324 (324) .| .+ .+.+..-+|..++||+|.+.|+-+++++.++--. + T Consensus 310 ----~~-~~--~ida~~a~G~g~lRPeaa~~i~~~~~~~~~~~~~~~ 349 (364) T protein:vir:10 310 ----EK-TW--YIDTFLAEGAIPDRWEAVAVVTAADTAELATDHNAI 349 (364) T ss_pred ----ee-ee--eeeeehcccCcccCccceEEEEecCCCCCccchhhh Confidence 01 11 2345566899999999999998766665554332 2 No 145 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=99.06 E-value=3.8e-11 Score=77.70 Aligned_cols=285 Identities=13% Similarity=0.084 Sum_probs=151.9 Q ss_pred HhhhhhHHh--hccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeec-CCCceEEEEEeCCcceeeeccCccc Q lcl|Aclame:pro 15 ASNNVKPQV--FNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPM-EGTEKKFTFWADKPGAYWVGEGQKI 91 (324) Q Consensus 15 a~~~~~~~~--~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~-~~~~~~ip~~~~~~~a~~v~Eg~~~ 91 (324) +.+...... -++.....+.+.-.+.-+.|..+++......|.++.+.++.+. ++++..||+.. ...+.-...|..+ T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~~~G~sv~i~~ig-~~t~~~~~~g~~l 79 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIG-RTKAAYLKPGENL 79 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccccccccceeEeeecc-ceeeeeeccCCCC Confidence 111110110 0111100111111133488999999999999999999876654 46778889874 4666667777776 Q ss_pred cc--cccceeeEEeeh--eeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhc---c-----Cccc----c Q lcl|Aclame:pro 92 ET--SKATWVNATMRA--FKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILN---Q-----GNNP----F 155 (324) Q Consensus 92 ~~--~~~~~~~v~l~~--~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G---~-----g~~~----~ 155 (324) +. ...+..+.++.. .++.. ..|-+---.++..++.+.+.++.+.++++..|+.++.- . .+.. + T Consensus 80 ~~~~~~~~~~e~~ltID~~~~~~-~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~~ 158 (347) T protein:vir:15 80 DDKRKDIKHTEKVIHIDGLLTAD-VLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASNENIEGL 158 (347) T ss_pred CCCCCCCccceEEEEechhhhhh-HHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Confidence 54 335566655544 33332 22322222235678999999999999999999988721 0 0000 0 Q ss_pred ccccc-ccccccc-cc----ccchhhhhHHHHHHHHhhhhcCCC--cEEEEcHHHHHHHHHhhcc-----CCceeeccCC Q lcl|Aclame:pro 156 GKSIA-QSIEKTN-KV----IKGDFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIVDP-----ETKERIYDRN 222 (324) Q Consensus 156 ~~~~~-~~~~~~~-~~----~~~~~~~~~i~~~~~~l~~~~~~~--~~~v~~~~~~~~l~~~~d~-----~g~~~~~~~~ 222 (324) +.... ....... .. ......++.+.++..+|..+..+. -.++++|..+..|.+-.+. .|...+..+. T Consensus 159 g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~~~~~d~~~~~~~~~G~ 238 (347) T protein:vir:15 159 GKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALIDHERGT 238 (347) T ss_pred CccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhcccccccccccccccccceE Confidence 00000 0000000 00 111222455666677787777643 3568899999988654322 2233345566 Q ss_pred cceeecceeEeecCCCCCCce-----eE---------------Eeeccc--------EE--EEEecceEEEEeeccceec Q lcl|Aclame:pro 223 SDSLDGLPVVNLKSSNLKRGE-----LI---------------TGDFDK--------LI--YGIPQLIEYKIDETAQLST 272 (324) Q Consensus 223 ~~~l~G~pv~~~~~~~~~~~~-----~i---------------~gd~s~--------~~--~~~~~~~~~~~~~~~~~~~ 272 (324) .+.++|++|+.+++.+...+. .. -++|+. .. .+...++.++..++. T Consensus 239 Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~~~---- 314 (347) T protein:vir:15 239 IRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRA---- 314 (347) T ss_pred EEEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeEeeceeeeecccc---- Confidence 678999999988765532211 00 111111 11 111222233332211 Q ss_pred cccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCC Q lcl|Aclame:pro 273 VKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRT 317 (324) Q Consensus 273 ~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~ 317 (324) ..| - -.+++...+|.+++||++.+.|+..--.. T Consensus 315 ---------~~~-~--d~i~~~~~~G~~vlrP~~av~~~~~~~~~ 347 (347) T protein:vir:15 315 ---------NYQ-A--DQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred ---------hhh-h--hhhehhhhcCCceeccccEEEEecCCCCC Confidence 112 1 23567778899999999998775432222 No 146 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=99.02 E-value=5e-11 Score=77.07 Aligned_cols=285 Identities=13% Similarity=0.047 Sum_probs=155.2 Q ss_pred HhhhhhHHh--hccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeec-CCCceEEEEEeCCcceeeeccCccc Q lcl|Aclame:pro 15 ASNNVKPQV--FNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPM-EGTEKKFTFWADKPGAYWVGEGQKI 91 (324) Q Consensus 15 a~~~~~~~~--~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~-~~~~~~ip~~~~~~~a~~v~Eg~~~ 91 (324) +.+....+. -|+.....+.+.-.+.-++|..+++....+.|.++++.+..+. ++++..||+. +...+.-...|+.+ T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~ie~~~g~V~~~f~~~s~~~~~v~~r~~~~G~sv~i~~i-G~~t~~~~~~g~~l 79 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVI-GRTKAAYLKPGENL 79 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHHHHHHHHHHHHHHHHHhhhhhhccccccccceeEeeec-cceeeeeecCCCCC Confidence 110000010 1111111111111133399999999999999999999886554 4677888887 45555666667766 Q ss_pred ccc--ccceeeEEeehe--eeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHh-----ccCc------cccc Q lcl|Aclame:pro 92 ETS--KATWVNATMRAF--KLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL-----NQGN------NPFG 156 (324) Q Consensus 92 ~~~--~~~~~~v~l~~~--k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~-----G~g~------~~~~ 156 (324) +.+ ..+..+.++... ++.. ..|.+-=-.++..++.+.+.++.+.++++..|+.++. +... ...+ T Consensus 80 ~~~~~~~~~~e~~ltiD~~~y~~-~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~~ 158 (347) T protein:vir:33 80 DDKRKDIKHTEKVIHIDGLLTAD-VLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNENIEGL 158 (347) T ss_pred CCCCCCCccceEEEEechhhhhh-HHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccc Confidence 543 355566555433 3322 2233322223567899999999999999999998872 1111 1111 Q ss_pred -ccccc-----ccc-cccccccchhhhhHHHHHHHHhhhhcCCC--cEEEEcHHHHHHHHHhhc-----cCCceeeccCC Q lcl|Aclame:pro 157 -KSIAQ-----SIE-KTNKVIKGDFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIVD-----PETKERIYDRN 222 (324) Q Consensus 157 -~~~~~-----~~~-~~~~~~~~~~~~~~i~~~~~~l~~~~~~~--~~~v~~~~~~~~l~~~~d-----~~g~~~~~~~~ 222 (324) ..... +.. .......+...++.|.++..+|..+..+. -.++++|..+..|.+... ..|...+..+. T Consensus 159 ~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~d~~~~~~~~~G~ 238 (347) T protein:vir:33 159 GKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALLDPERGT 238 (347) T ss_pred cccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhccccccccccccccccccce Confidence 01000 000 00111122334677888888898777753 467999999998865332 22333345566 Q ss_pred cceeecceeEeecCCCCCCce------------e--------EEeecccE--------EEEE--ecceEEEEeeccceec Q lcl|Aclame:pro 223 SDSLDGLPVVNLKSSNLKRGE------------L--------ITGDFDKL--------IYGI--PQLIEYKIDETAQLST 272 (324) Q Consensus 223 ~~~l~G~pv~~~~~~~~~~~~------------~--------i~gd~s~~--------~~~~--~~~~~~~~~~~~~~~~ 272 (324) .++++|++|+.++..+...++ . +-++|+.. .++. ..++.++..++. T Consensus 239 V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~~---- 314 (347) T protein:vir:33 239 IRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRA---- 314 (347) T ss_pred eEEEeceeEEEecccccCccccccccccccccccccCCcccceeccccceeeeeecchhheeeeeeceeeeeccch---- Confidence 778999999988765432211 0 11122111 0111 112223322211 Q ss_pred cccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCC Q lcl|Aclame:pro 273 VKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRT 317 (324) Q Consensus 273 ~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~ 317 (324) ..|- -.+++...+|.+++||++.+.|+..--.. T Consensus 315 ---------~~~~---d~i~~~~~~G~~vlrP~~av~i~~~~~~~ 347 (347) T protein:vir:33 315 ---------NYQA---DQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred ---------hhhh---HhhhhhhhcCCceecccceEEEecCCCCC Confidence 1121 23567778899999999999776432222 No 147 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=99.01 E-value=1.5e-11 Score=79.89 Aligned_cols=284 Identities=12% Similarity=0.076 Sum_probs=152.2 Q ss_pred HhhhhhHHhh--ccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeec-CCCceEEEEEeCCcceeeeccCccc Q lcl|Aclame:pro 15 ASNNVKPQVF--NPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPM-EGTEKKFTFWADKPGAYWVGEGQKI 91 (324) Q Consensus 15 a~~~~~~~~~--~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~-~~~~~~ip~~~~~~~a~~v~Eg~~~ 91 (324) |.+.- .+.. ++.....+.+.-.+.-+++..+++......+.++++.+..++ ++++..||+. +...+.-...|+.+ T Consensus 1 m~~~~-~~~~~t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r~i~~G~sv~i~~i-G~~tv~~~t~G~~l 78 (347) T protein:vir:94 1 MANVP-GQKIGTDQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHIVRTIQNGKSAQFPVM-GRTSGVYLAPGERL 78 (347) T ss_pred CCCCC-ccccccccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccccceEEEecc-cceeeeeecCCCCc Confidence 11110 0001 110000011111233488999999998888999998877665 4677888987 56667777777776 Q ss_pred ccc--ccceeeEEeeheee-EEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHh--c--cC---c-ccccccc- Q lcl|Aclame:pro 92 ETS--KATWVNATMRAFKL-GVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL--N--QG---N-NPFGKSI- 159 (324) Q Consensus 92 ~~~--~~~~~~v~l~~~k~-~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~--G--~g---~-~~~~~~~- 159 (324) +.+ ..+-.++++...+. .....|-+-=--++..++.+.+.++.+.++++..|+.++. . .+ . .....+. T Consensus 79 ~~~~~~~~~~e~~itID~~~~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~~~~g~~ 158 (347) T protein:vir:94 79 SDKRKGIKHTEKVITIDGLLTADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASNENIAGLG 158 (347) T ss_pred CCCCCCCCcceEEEEecchhhhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCc Confidence 543 33444544443332 1111222211112457899999999999999999998762 1 11 0 0111111 Q ss_pred ccccc-------cccccccchhhhhHHHHHHHHhhhhcCCC--cEEEEcHHHHHHHHHhhccC-----CceeeccCCcce Q lcl|Aclame:pro 160 AQSIE-------KTNKVIKGDFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIVDPE-----TKERIYDRNSDS 225 (324) Q Consensus 160 ~~~~~-------~~~~~~~~~~~~~~i~~~~~~l~~~~~~~--~~~v~~~~~~~~l~~~~d~~-----g~~~~~~~~~~~ 225 (324) ..... ...........++.|.++...|.....+. -.++++|..+..|..-++.+ +...+..+..++ T Consensus 159 ~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg~ 238 (347) T protein:vir:94 159 TASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAANYAALIDPETGNIRN 238 (347) T ss_pred ccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhhhccccccccccceEE Confidence 00000 00001111223566777788887776653 36799999988775433211 122234456678 Q ss_pred eecceeEeecCCCCCCce---------e---------------EEeecccEE----------EEEecceEEEEeecccee Q lcl|Aclame:pro 226 LDGLPVVNLKSSNLKRGE---------L---------------ITGDFDKLI----------YGIPQLIEYKIDETAQLS 271 (324) Q Consensus 226 l~G~pv~~~~~~~~~~~~---------~---------------i~gd~s~~~----------~~~~~~~~~~~~~~~~~~ 271 (324) ++|++|+.+++.+..... + +-+||++.. .+...+++++..+ T Consensus 239 i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r----- 313 (347) T protein:vir:94 239 VMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALERDR----- 313 (347) T ss_pred EeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhcccccccchh----- Confidence 999999998766532111 1 111222110 0001111122111 Q ss_pred ccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCC Q lcl|Aclame:pro 272 TVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKR 316 (324) Q Consensus 272 ~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~ 316 (324) + ...|. | .+++..-+|.+++||+|.+.|+-.++. T Consensus 314 ---~-----~~~~~-d--~i~~~~~~G~~~~rP~~a~~~~~~~A~ 347 (347) T protein:vir:94 314 ---D-----VDAQG-D--LIVGKYAMGHGGLRPEAAGALVFSPAE 347 (347) T ss_pred ---c-----hhhHH-H--HhhhhhhhcCcccccceeEEEEecCCC Confidence 1 11221 2 477888999999999999999876655 No 148 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=99.00 E-value=5.2e-11 Score=76.95 Aligned_cols=279 Identities=12% Similarity=0.088 Sum_probs=156.5 Q ss_pred hhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCC-ce---EEEEEeCCcceeeeccCccccccccce Q lcl|Aclame:pro 23 VFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGT-EK---KFTFWADKPGAYWVGEGQKIETSKATW 98 (324) Q Consensus 23 ~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~-~~---~ip~~~~~~~a~~v~Eg~~~~~~~~~~ 98 (324) .....|.+++++-+....-++.+++-+.+.+..-+++..+.+|+..+ .+ ++|+++....++-|+||+.||.++.+. T Consensus 1 M~~e~nl~~~~dL~~a~siDF~~~f~~~i~~L~~~LGv~r~~pla~Gt~iktyK~~~~~y~gda~dVaEGe~Iplskvt~ 80 (303) T protein:vir:10 1 MSAENNLINVEALGKAKSIDFANKLGVGLNKLFEALAIQNKIPMNVGSALKQYRFKVEDSEKPNGDVAEGDVIPLTKVTR 80 (303) T ss_pred CCCCcCCcchhhcccceeehhhhhhhhhHHHHHHHhhhhccccccCCceeeeeeeeceeeccccccccCCcccchhhhee Confidence 12223444444444444556777777777777777777788888643 24 445555667788999999999999886 Q ss_pred e---eEEeeheeeEEeeeehHHHhhc-ChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchh Q lcl|Aclame:pro 99 V---NATMRAFKLGVILPVTKEFLNY-TYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDF 174 (324) Q Consensus 99 ~---~v~l~~~k~~~~~~iS~e~l~d-s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~ 174 (324) . ..+++.+|++..+ |.|.++. ..-+....-.++|..++..++|+.++.-..+ +..+......... T Consensus 81 ~~~~t~~~~~kK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lkt---------aT~t~~~t~~t~~ 149 (303) T protein:vir:10 81 EQVDITELQFAKYRKST--SAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKS---------AIENGKRTNKTKL 149 (303) T ss_pred eecceEEEEeecccccc--cHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhh---------cccccccccceee Confidence 4 5788889988855 9999954 4567788899999999999999999853211 1111112222334 Q ss_pred hhhHHHHHHHHhh------hhcCCCcEEEEcHHHHHHHHHhhccCCceeeccC-CcceeecceeEeecCCCCCCceeEEe Q lcl|Aclame:pro 175 TQDNIIDLEALLE------DDELEANAFISKTQNRSLLRKIVDPETKERIYDR-NSDSLDGLPVVNLKSSNLKRGELITG 247 (324) Q Consensus 175 ~~~~i~~~~~~l~------~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~-~~~~l~G~pv~~~~~~~~~~~~~i~g 247 (324) +.+.|-+++.... .......++++||.+...+++-..-..+.--++. .--.++|.-++++. .+++|.++.- T Consensus 150 s~~glq~Al~~~~~kl~~~~ed~~~~V~FvNP~Daa~yl~~A~i~~~~t~fG~n~L~nfLG~~II~S~--kv~~G~~~~T 227 (303) T protein:vir:10 150 SAENLQGALSKGRANLSVLLDDEITPIAFVNPNDTAEYLANGFINSTGAQFGVNLLTPYVGVKIVEFA--DVPQGEVWMT 227 (303) T ss_pred cHHHHHHHHHhhhhhccccccccccEEEEEchHHHHHHhhcCCcchhhhhhhhhhhhhhhcceEEEec--cCCCceEEEe Confidence 4555655555332 1112234899999999888643322111100010 01138888876654 5667777654 Q ss_pred ecccEEEEE---e----cceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEec--cEEeccCceEEEEeecCCCC Q lcl|Aclame:pro 248 DFDKLIYGI---P----QLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVA--LHIADDKAFAKLVPADKRTD 318 (324) Q Consensus 248 d~s~~~~~~---~----~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d--~~v~~~~A~~~l~~~~~~~~ 318 (324) -..++.+.+ . +.+.+.+|...-+..-++..... ..+......+ +-.-+.+++++.++.+.-.+ T Consensus 228 ~~~Ni~~ay~~~~g~l~~~f~~t~D~tglIGv~h~~~~~~--------~t~eT~~~~~~~lfpE~~dgiv~~ti~~~e~~ 299 (303) T protein:vir:10 228 VAENLNVAYANPRGELSRAFAFATDATGFVGVLHDIQPQR--------LTSDTIYASAISMFPENIDAVIKVTIKKDEAG 299 (303) T ss_pred eccceEEEEecCchhhhhhhhhccccccceEEEeccccce--------eeehhHhHhHHHhcccccceEEEEEEeccccC Confidence 333332221 1 11222222111111111111000 0000000001 11234788999999877777 Q ss_pred CCCC Q lcl|Aclame:pro 319 SVPG 322 (324) Q Consensus 319 ~~~~ 322 (324) .+|. T Consensus 300 ~~~~ 303 (303) T protein:vir:10 300 ELPS 303 (303) T ss_pred CCCC Confidence 7777 No 149 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=99.00 E-value=7.7e-11 Score=76.04 Aligned_cols=289 Identities=8% Similarity=-0.011 Sum_probs=165.1 Q ss_pred hccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceee-cCCCceEEEEEeCCcceeeeccCccccccccceeeEE Q lcl|Aclame:pro 24 FNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEP-MEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNAT 102 (324) Q Consensus 24 ~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~-~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~ 102 (324) +...| .++..-...+|++|+.+|+.-+.+..+...+.++.. -.+.++.||.. +.+...-..+++.+.....+..+++ T Consensus 1 ~~~~n-~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~g~GDtV~InsI-g~~tV~dY~~~~~i~~d~ltt~~~~ 78 (322) T protein:vir:31 1 MSTGN-NTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVDFPDGDKLTIPSV-GTPVVRSRPEQGDFTFDNLDTGEIS 78 (322) T ss_pred CCCCC-CcccceEEeehhhhHHHHHHHhhhhhhhhhhhcccccCCCCeEEeccc-cccccccccCCCCcccccCCCceEE Confidence 22222 111222334599999999988888777666665443 34677888877 3455555556666655555555444 Q ss_pred e--eheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHh--ccCccc---ccc-cccccc--ccccccccc Q lcl|Aclame:pro 103 M--RAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL--NQGNNP---FGK-SIAQSI--EKTNKVIKG 172 (324) Q Consensus 103 l--~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~--G~g~~~---~~~-~~~~~~--~~~~~~~~~ 172 (324) + ...|+.+. .|++...+ ...++.+...++.+.+++...|+.+.. -+|... .+. ...++. ......... T Consensus 79 l~IDq~KYfaf-~VdDD~~Q-a~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~vin~~~~~iv~~gt~~ 156 (322) T protein:vir:31 79 IILRDEVYAGN-AISKKLRQ-DSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNVINGVPHRFVGTGTDQ 156 (322) T ss_pred EEEehhhhhcc-ccchhHHH-hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceecCCccceeccCCCc Confidence 4 44555544 48886554 578999999999999999999987632 111110 000 001110 111122233 Q ss_pred hhhhhHHHHHHHHhhhhcCCC-c-EEEEcHHHHHHHHH-------hhccCCceeeccCC------cceeecceeEeecCC Q lcl|Aclame:pro 173 DFTQDNIIDLEALLEDDELEA-N-AFISKTQNRSLLRK-------IVDPETKERIYDRN------SDSLDGLPVVNLKSS 237 (324) Q Consensus 173 ~~~~~~i~~~~~~l~~~~~~~-~-~~v~~~~~~~~l~~-------~~d~~g~~~~~~~~------~~~l~G~pv~~~~~~ 237 (324) ...|+.|++|..+|..+..+. . .+|++|..+..|.. ++|..--.+...+. .++++|..|+.+... T Consensus 157 ~~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~a~g~~~Vg~~~GF~V~~SN~l 236 (322) T protein:vir:31 157 TMDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESGIAPDMQFVRSVYGIDLFVSNLL 236 (322) T ss_pred hhhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhccccccccccccchhhHHHHHHHhceeeeeeccc Confidence 557899999999998887764 3 45788998887744 33322111111111 578999999998876 Q ss_pred CCCCceeEEeecccE-EEEEecceEEEEeeccce------eccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEE Q lcl|Aclame:pro 238 NLKRGELITGDFDKL-IYGIPQLIEYKIDETAQL------STVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKL 310 (324) Q Consensus 238 ~~~~~~~i~gd~s~~-~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l 310 (324) +...-+++.|.-... ..|...++-...+.++.. .+...+..... .+.--.+|+.+|+|.++.+++..+.| T Consensus 237 ~~~~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~~~l~~~e~~r~~---~~~~d~~~~~~~~g~g~~r~e~l~~~ 313 (322) T protein:vir:31 237 ADANETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEMPTTKSFIDD---YNDDLNTATTARWGNGLVRDENLVCV 313 (322) T ss_pred cccccccccCcccccccceeecccccccchhhhhhhhHhhhhhhhhcccCc---cccccceeeeeeecceeecccceEEE Confidence 655545555442221 233333333222222200 01011111111 12233589999999999999999988 Q ss_pred EeecCCCCC Q lcl|Aclame:pro 311 VPADKRTDS 319 (324) Q Consensus 311 ~~~~~~~~~ 319 (324) .-.+...+- T Consensus 314 ~a~~~~~~~ 322 (322) T protein:vir:31 314 LANADKVTF 322 (322) T ss_pred EeccccccC Confidence 765555444 No 150 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=98.97 E-value=1.3e-10 Score=74.78 Aligned_cols=224 Identities=13% Similarity=0.055 Sum_probs=148.2 Q ss_pred hHHhhcccccccccc-CccccchHHHHHHHHHHHhhhhhhhhcceeecCC-CceEEEEEeCCcceeeeccCccccccccc Q lcl|Aclame:pro 20 KPQVFNPDNVMMHEK-KDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEG-TEKKFTFWADKPGAYWVGEGQKIETSKAT 97 (324) Q Consensus 20 ~~~~~~~~~~~~~~~-~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~-~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~ 97 (324) .+. .. ....+-.+ +.-+-|......|+|.+.+.++++...+.+.... ..+.+.+.++-|.+.|..=++.++.++.+ T Consensus 1 m~~-~~-~~~~TL~e~Akr~~~d~~~~~VIE~l~~~n~IL~~lpf~e~n~gt~~~~~v~~~LP~~~fR~lN~g~~~s~~t 78 (328) T protein:vir:95 1 MAV-KG-LTALTLADWGKRVDPNGKVDKIIELLGQTNPILQDMPFVEGNLPTGHRTTIRSGLPSATWRLLNYGVQPSKST 78 (328) T ss_pred CCc-cc-cccccHHHHHhhhCcchhHHHHHHHHhccchhHhhcceeecccCCcceeeEeeccCCceeeecCCccCcccce Confidence 000 00 00011111 2224466777889999999999999999988753 34778899999999999999999999999 Q ss_pred eeeEEeeheeeEEeeeehHHHhhcCh--HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccc--ccc------------- Q lcl|Aclame:pro 98 WVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGK--SIA------------- 160 (324) Q Consensus 98 ~~~v~l~~~k~~~~~~iS~e~l~ds~--~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~--~~~------------- 160 (324) +.+++-..+-+++.+.|.+.+.+.+. .++...-.+...+++++++...+|+|+.+..+.. |+. T Consensus 79 t~q~t~~l~ilgg~~eVDr~la~~~Gn~~~~ra~q~~~~~ka~~~~~~~~~iyGdsa~~p~~F~GL~~R~~~~s~~~a~q 158 (328) T protein:vir:95 79 TVQVTDSVGMLETYAEVDKSLADLNGNTAEFRLSEDRAFIEAMNQQMAQTLFYGDSSVNPQQFMGLSSRYSSLSAGNAQN 158 (328) T ss_pred eEEEEEEEEEEecceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCccCChhhhcchhhhcCccccccccc Confidence 99999999999999999998887653 3344445667889999999999999964321000 000 Q ss_pred --c--ccc---------------------------------------------------------------------ccc Q lcl|Aclame:pro 161 --Q--SIE---------------------------------------------------------------------KTN 167 (324) Q Consensus 161 --~--~~~---------------------------------------------------------------------~~~ 167 (324) . +.+ ..+ T Consensus 159 iidaGgtg~~~TSi~~v~~g~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~g~~y~~y~~~~~w~~Gl~i~d~r~vvrI~N 238 (328) T protein:vir:95 159 IIDAGGTGTDNTSIWLVVWGENTVHGIFPKGKKAGIQMEDKGQVTLEDANGGKYEGYRTHYKWDNGLALRDWRYVVRIAN 238 (328) T ss_pred eeecccCCCCceEEEEEEEcCCeEEEecccccccCceeeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEec Confidence 0 000 000 Q ss_pred -------ccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceee-----ccCCcceeecceeEeec Q lcl|Aclame:pro 168 -------KVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERI-----YDRNSDSLDGLPVVNLK 235 (324) Q Consensus 168 -------~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~-----~~~~~~~l~G~pv~~~~ 235 (324) ..+.+....+.+++++.+++.......+|+||......|++....-+...+ .+.....+.|+||..+. T Consensus 239 Id~~~l~~~~~~~~l~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~~~~~~~~g~~~t~~~gipir~~d 318 (328) T protein:vir:95 239 IDVSNLSEPSSAANIAKLMVKALHRIPNRGMGRPVFYMNRTVGQALDLQSLEKTSLAISVKETEGEWWTSFRGVPIRETD 318 (328) T ss_pred CcccccccccChhhHHHHHHHHHHHhccCCCCcceeehhHHHHHHHHHHHhcCcceeeeeeccCCcceeEECCeEEEEEe Confidence 011122233445666777777777788999999999999876443333222 23345678999998775 Q ss_pred CCCCCCceeE Q lcl|Aclame:pro 236 SSNLKRGELI 245 (324) Q Consensus 236 ~~~~~~~~~i 245 (324) +.--.+..++ T Consensus 319 ai~~tE~~vv 328 (328) T protein:vir:95 319 ALLETEARVV 328 (328) T ss_pred eeecCccccC Confidence 5433333222 No 151 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=98.97 E-value=1e-10 Score=75.35 Aligned_cols=275 Identities=10% Similarity=0.086 Sum_probs=151.8 Q ss_pred hhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCC-ce-EEEEEeCCcceeeeccCccccccc Q lcl|Aclame:pro 18 NVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGT-EK-KFTFWADKPGAYWVGEGQKIETSK 95 (324) Q Consensus 18 ~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~-~~-~ip~~~~~~~a~~v~Eg~~~~~~~ 95 (324) .+-.+.+-..|.+++++-+.-..-++.+++-+.+.+..-+++..+.+||..+ .+ .+|.++-...++-|+||+.+|.++ T Consensus 1 ~~~~~~~~e~nlt~~~dl~~~~siDf~~~f~~~i~~L~~~LGv~r~~pla~GstIkt~k~~~y~gda~dVaEGe~Iplsk 80 (296) T protein:vir:98 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGMTLKTYAGYDVTLAEGNVPEGEVIPLSK 80 (296) T ss_pred CCCccccCcCCCcchhhhhhhhhhhhHHHHhhhHHHHHHHhhhcccccccCCCEEeeccceeeeeccccccCCcccchhh Confidence 1122233334555555544444556777776667776777777788998754 45 346677788889999999999999 Q ss_pred cceee---EEeeheeeEEeeeehHHHhhc-ChHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccccccccccccccc Q lcl|Aclame:pro 96 ATWVN---ATMRAFKLGVILPVTKEFLNY-TYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIK 171 (324) Q Consensus 96 ~~~~~---v~l~~~k~~~~~~iS~e~l~d-s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~ 171 (324) .+.+. .+++.+|++..+ |.|+++. ..-+....-.++|..+++.++|+.++.-..+.. .+.. ++ T Consensus 81 vt~~~~~t~t~~ikK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~LktaT---------~t~~--~t 147 (296) T protein:vir:98 81 VERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGT---------GTQD--AL 147 (296) T ss_pred heeeecceEEEEeecccccc--CHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhccc---------ceee--ec Confidence 98764 777888888874 9999954 456778889999999999999999996432211 0100 11 Q ss_pred chhh----hhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeeccCCcc-eeecceeEeecCCCCCCceeEE Q lcl|Aclame:pro 172 GDFT----QDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSD-SLDGLPVVNLKSSNLKRGELIT 246 (324) Q Consensus 172 ~~~~----~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~~~~-~l~G~pv~~~~~~~~~~~~~i~ 246 (324) +... +.-+.++...+++......++++||.....+++-..- +..-.++.... .++|.-++.+. .+++|.++. T Consensus 148 ~~~lQ~Ala~~~~~l~~~feded~~~~V~FVnP~D~a~ylg~a~i-t~qt~fG~tyl~nfLG~~II~S~--kV~~G~~~~ 224 (296) T protein:vir:98 148 GAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIAKAGI-TTQTAFGLTYLVDFTGTVIISTN--DVTKGEIWA 224 (296) T ss_pred hhhHHHHHHHHhhhhhhhccccCCCceEEEEehHHHHHHhcCCcc-chhheechhhhhhccccEEEEcC--cCCCceEEE Confidence 1111 1122333344544433456789999998776532211 11111222222 28887666554 566888776 Q ss_pred eecccEEEEEec---c-eE----EEEeeccceeccccccccchhhhhcCcEEEEEEEEec--cEEeccCceEEEEeecCC Q lcl|Aclame:pro 247 GDFDKLIYGIPQ---L-IE----YKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVA--LHIADDKAFAKLVPADKR 316 (324) Q Consensus 247 gd~s~~~~~~~~---~-~~----~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d--~~v~~~~A~~~l~~~~~~ 316 (324) .-..++.+.+.. + +. +..|...-+..-++... +...+......+ +-.-+.+++++.++.++. T Consensus 225 T~~~Ni~~ay~~~~~~~l~~~f~~~~d~tglIGv~h~~~~--------~~~t~eT~~~~~~~lfpE~~dgiv~~tI~~~~ 296 (296) T protein:vir:98 225 TVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQEN--------TTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) T ss_pred eeecceEEEeecccccchhhhhccccccccceEEEecccc--------ceeeehhHhHhHHHhcccccceEEEEEecCCC Confidence 554443332211 1 11 11111100111111000 000000000001 112346788888886665 No 152 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=98.97 E-value=2.4e-10 Score=73.28 Aligned_cols=298 Identities=8% Similarity=-0.017 Sum_probs=158.2 Q ss_pred hhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeec-CCCceEEEEEeCCcceeeeccCccccccc Q lcl|Aclame:pro 17 NNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPM-EGTEKKFTFWADKPGAYWVGEGQKIETSK 95 (324) Q Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~-~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~ 95 (324) ........++.... +.+.-.+.-+++..++.+.....+.++++..+.++ +++++.+|+. +..+++...-|+....+. T Consensus 1 Ms~~n~~t~~~~~~-s~~~~al~le~f~geV~taF~~~si~~~~~~vrti~~GkS~qf~~i-G~~~a~y~~~G~~ldg~~ 78 (402) T protein:vir:97 1 MSTPNTLTNVAVSA-SGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPNATP 78 (402) T ss_pred CCCccccccccccc-ccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEEEEE-eeeEEeeeccccccCCCC Confidence 00111111111111 01111244589999999999999999999887765 4677889987 455556655555555455 Q ss_pred cceeeEEeeheeeE-EeeeehHHHhhcChHH-HHHHHHHHHHHHHHHHHHHHHHh-----ccCcccc----ccccccc-- Q lcl|Aclame:pro 96 ATWVNATMRAFKLG-VILPVTKEFLNYTYSQ-FFEEMKPMIAEAFYKKFDEAGIL-----NQGNNPF----GKSIAQS-- 162 (324) Q Consensus 96 ~~~~~v~l~~~k~~-~~~~iS~e~l~ds~~~-~~~~i~~~l~~ai~~~~d~~~l~-----G~g~~~~----~~~~~~~-- 162 (324) +.-++..+....+= ....|-+=---++.++ +.+.+.+++++++++..|+.+|. +-.+..+ +.....+ T Consensus 79 ~~~~k~~ItID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~~~~g~s 158 (402) T protein:vir:97 79 TQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKGHGFS 158 (402) T ss_pred cccccEEEEeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCcccccccc Confidence 66666656554322 2222222111124566 78899999999999999998763 1111111 1111110 Q ss_pred ccccccc----ccchhhhhHHHHHHHHhhhhcCCC--cEEEEcHHHHHHHHHhhcc-------CCceeeccCCcceeecc Q lcl|Aclame:pro 163 IEKTNKV----IKGDFTQDNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIVDP-------ETKERIYDRNSDSLDGL 229 (324) Q Consensus 163 ~~~~~~~----~~~~~~~~~i~~~~~~l~~~~~~~--~~~v~~~~~~~~l~~~~d~-------~g~~~~~~~~~~~l~G~ 229 (324) ...+.+. .++.-..+.+.++...|.....+. -+++++|..|..|.+-..- .+...+..+....+.|+ T Consensus 159 ~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl~n~d~~~~~~g~~~~G~v~~v~Gv 238 (402) T protein:vir:97 159 INVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLSSYNC 238 (402) T ss_pred cccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcccccchhhccccCCccccceeEEEece Confidence 0111111 111222345567777777766654 3689999999988754221 12222445666789999 Q ss_pred eeEeecCCCCCC-------------cee--EEeecccE--EEEEecceEEEEeeccceeccccccccchhhhhcCcEEEE Q lcl|Aclame:pro 230 PVVNLKSSNLKR-------------GEL--ITGDFDKL--IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALR 292 (324) Q Consensus 230 pv~~~~~~~~~~-------------~~~--i~gd~s~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r 292 (324) ||+.+++.+... |.. +-+|++.. ++..+..+..--....+...+.+.... ..| +- T Consensus 239 ~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r~~--~~~------id 310 (402) T protein:vir:97 239 PVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEK--TYY------ID 310 (402) T ss_pred EEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhchhHH--HHH------HH Confidence 999887654322 111 22455432 222222211111111111111111110 011 22 Q ss_pred EEEEeccEEeccCceEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 293 ATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) Q Consensus 293 ~~~r~d~~v~~~~A~~~l~~~~~~~~~~~~~~ 324 (324) +..-+|..+.||+|...++.+--.++.+.|++ T Consensus 311 ~~~a~G~g~~RPeaa~vv~~~~~~t~~~~~~~ 342 (402) T protein:vir:97 311 TFMAEGAIPDRWEAVSVVTTKRDATTGDAGGP 342 (402) T ss_pred HHHHhCCcccCccceEEEEEecccccccCCcc Confidence 33467889999999999987776666666666 No 153 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=98.95 E-value=3.1e-10 Score=72.69 Aligned_cols=293 Identities=10% Similarity=0.008 Sum_probs=159.3 Q ss_pred hhccccccccc-----cCccccchHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCcceeeeccCcccccccc Q lcl|Aclame:pro 23 VFNPDNVMMHE-----KKDGTLMNEFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQKIETSKA 96 (324) Q Consensus 23 ~~~~~~~~~~~-----~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~ 96 (324) .-.+++.+-.. +.-.+.-+++..++.......+.++++..+.++. ++++.+|+. +..+++...-|+++.-+.+ T Consensus 1 Ms~~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~vRtI~~gkS~qf~~l-G~s~a~y~~pG~~ldg~~~ 79 (400) T protein:vir:10 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPAATST 79 (400) T ss_pred CCCCccccccccccccchhhhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-eeeEEeeecCCCCcCCCCc Confidence 11111111111 1223566899999999999999999999887765 567888887 6777778877877766666 Q ss_pred ceeeEEeehee-eEEeeeehHHHhhcChHH-HHHHHHHHHHHHHHHHHHHHHHh----cc-C-cc---ccccccccccc- Q lcl|Aclame:pro 97 TWVNATMRAFK-LGVILPVTKEFLNYTYSQ-FFEEMKPMIAEAFYKKFDEAGIL----NQ-G-NN---PFGKSIAQSIE- 164 (324) Q Consensus 97 ~~~~v~l~~~k-~~~~~~iS~e~l~ds~~~-~~~~i~~~l~~ai~~~~d~~~l~----G~-g-~~---~~~~~~~~~~~- 164 (324) ..++..+.... +.....|-+----++..| +.+.+.+++++++++..|+.+|. +. . +. ..+.+...... T Consensus 80 ~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g~~~g~s~ 159 (400) T protein:vir:10 80 QADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRVKGHGFSV 159 (400) T ss_pred ccCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCccccccce Confidence 67766665543 233333332111234577 78999999999999999998762 21 1 10 01111111111 Q ss_pred ccccc-----ccchhhhhHHHHHHHHhhhhcCCCc--EEEEcHHHHHHHHHhh---ccC----CceeeccCCcceeecce Q lcl|Aclame:pro 165 KTNKV-----IKGDFTQDNIIDLEALLEDDELEAN--AFISKTQNRSLLRKIV---DPE----TKERIYDRNSDSLDGLP 230 (324) Q Consensus 165 ~~~~~-----~~~~~~~~~i~~~~~~l~~~~~~~~--~~v~~~~~~~~l~~~~---d~~----g~~~~~~~~~~~l~G~p 230 (324) ..... .++.-..+.+.++...+...+.+.. ++++.|..+..|.... +.+ +...+..+....+.|+| T Consensus 160 ~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~Ll~~dkLvnrdf~~s~~g~~~~g~v~~v~Gv~ 239 (400) T protein:vir:10 160 NVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKSYTISQSGATIQGFVLSSYNCP 239 (400) T ss_pred eecccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhCCcccchhccccCCCccccceEEEEeceE Confidence 11111 1111223345667777776665543 5677777777765321 111 11122334445789999 Q ss_pred eEeecCCCCCC-------------cee--EEeecccEE--EEEecceEEEEeeccceeccccccccchhhhhcCcEEEEE Q lcl|Aclame:pro 231 VVNLKSSNLKR-------------GEL--ITGDFDKLI--YGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRA 293 (324) Q Consensus 231 v~~~~~~~~~~-------------~~~--i~gd~s~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~ 293 (324) |+.++..+... |.. +-+|++... +..+..+-+--....+...+.+... |. -.+-+ T Consensus 240 Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~d~r~-----~~---~~id~ 311 (400) T protein:vir:10 240 VIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIFYEKKE-----KT---YYIDT 311 (400) T ss_pred EEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEeeccccccccchhh-----HH---HHHHH Confidence 99887654322 111 225665432 2222221111111111111111111 10 01234 Q ss_pred EEEeccEEeccCceEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 294 TMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) Q Consensus 294 ~~r~d~~v~~~~A~~~l~~~~~~~~~~~~~~ 324 (324) ..-+|..+.||+|+..++-+...+.++.|-- T Consensus 312 ~~a~G~g~~RPeaa~vv~~~~~~~~~~~~~~ 342 (400) T protein:vir:10 312 FMSEGAIPDRWEAVSVVTTKRQSTGAVDSGN 342 (400) T ss_pred HHHhCCcccchhheEEEEecCCcccccccCc Confidence 4567899999999999998777666655333 No 154 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=98.87 E-value=3.2e-10 Score=72.60 Aligned_cols=246 Identities=12% Similarity=0.060 Sum_probs=133.1 Q ss_pred hcceeecCCCceEEEEEeCCcceeeeccCccccc--cccceee--EEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHH Q lcl|Aclame:pro 60 LGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIET--SKATWVN--ATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIA 135 (324) Q Consensus 60 l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~--~~~~~~~--v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~ 135 (324) |++.+ .++++..||+. +...+....-|+++.. ..+.-.+ +++...++... .|-+----++..|+.+.+.++++ T Consensus 1 ~vr~i-~~g~s~~~~~i-G~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~-~VdDiD~~qa~~Dlr~e~s~~~G 77 (324) T protein:vir:99 1 MTRTI-TSGKSAQFPVM-GRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDV-LIYDIEDAMNHYDVRSEYSTQMG 77 (324) T ss_pred Ceeee-ecCceEEEeee-eeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhhh-hhhhHHHHhcCccchhHHHHHHH Confidence 55554 34677899997 5666666666666643 3344444 33344444332 22221112356799999999999 Q ss_pred HHHHHHHHHHHHhc----c--C------cccccccc---ccccccccccccchhhhhHHHHHHHHhhhhcCCC--cEEEE Q lcl|Aclame:pro 136 EAFYKKFDEAGILN----Q--G------NNPFGKSI---AQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEA--NAFIS 198 (324) Q Consensus 136 ~ai~~~~d~~~l~G----~--g------~~~~~~~~---~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~--~~~v~ 198 (324) .++++.+|+.++.- . . .....++. ................++.|.++..+|..+..+. -.+++ T Consensus 78 ~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~~gR~~vv 157 (324) T protein:vir:99 78 EALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIPAGDRTFYT 157 (324) T ss_pred HHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEe Confidence 99999999887621 0 0 00001110 1111111112223345677888888888777754 35799 Q ss_pred cHHHHHHHHHhhcc-----CCceeeccCCcceeecceeEeecCCCCCCce-----------------------eEEeecc Q lcl|Aclame:pro 199 KTQNRSLLRKIVDP-----ETKERIYDRNSDSLDGLPVVNLKSSNLKRGE-----------------------LITGDFD 250 (324) Q Consensus 199 ~~~~~~~l~~~~d~-----~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~-----------------------~i~gd~s 250 (324) +|..+..|..-+.. ++...+..+..+.++|++|+.+++.+...+. -+-+|++ T Consensus 158 ~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky~~d~~ 237 (324) T protein:vir:99 158 DPDTYSAILAALMPNAANYAALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKMTVGAD 237 (324) T ss_pred ChHHHHHHhhcccccccccccccceecceEEEEeceEEEecCCccccccccccccccccccccccccccccccccccccC Confidence 99999877543221 1223345566788999999988766543221 0223332 Q ss_pred cE--E--------EEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCC-CCC Q lcl|Aclame:pro 251 KL--I--------YGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKR-TDS 319 (324) Q Consensus 251 ~~--~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~-~~~ 319 (324) .. + .+...++..+..+ +. ..|. -.+++..-+|..++||+|.+.++-.+-. +.+ T Consensus 238 ~~~gl~~~~~a~~tv~~~~~~~e~~~--------~~-----~~~~---d~i~~~~a~G~~~lRPe~a~~v~l~~~~~~~~ 301 (324) T protein:vir:99 238 NVVGLFVHRSAVATLKLKDMALERAR--------RP-----EYQA---DQIIAKYAMGHGGLRPEAVGAIIFEDGETPAV 301 (324) T ss_pred ceeEEEEehhheEEEeeecceeccee--------ch-----hhHH---HhhhhhhhhcCcccccceEEEEEEccCccccc Confidence 21 1 1111111222211 11 1121 2356667789999999999877743333 233 Q ss_pred CCCCC Q lcl|Aclame:pro 320 VPGEV 324 (324) Q Consensus 320 ~~~~~ 324 (324) +|--+ T Consensus 302 ~~~~~ 306 (324) T protein:vir:99 302 APDVI 306 (324) T ss_pred cchhh Confidence 33222 No 155 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=98.87 E-value=1.4e-09 Score=69.14 Aligned_cols=286 Identities=17% Similarity=0.111 Sum_probs=149.4 Q ss_pred Cch-----hHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhh-hcc--eeecCCCceE Q lcl|Aclame:pro 1 MEQ-----TQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQ-LGK--YEPMEGTEKK 72 (324) Q Consensus 1 ~~~-----~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~-l~~--~~~~~~~~~~ 72 (324) ||| +.++|.+|++|+++.+.+ + ...+-+.+...+-+.+...+.... ++. ....++++++ T Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~-----n--------t~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~~g~tVk 78 (329) T protein:vir:10 12 MNKEIKNATGKLKLNLQHFANKSVEP-----G--------DTLLKNKHVGILEKVTAANSYSAPAVISNDAIFMQGRSFT 78 (329) T ss_pred hhhhhhcccceeEEehhhhcCCccCC-----c--------hhHHHHHHHHHHHHHHHhhceeeeeecccceeeccCcEEE Confidence 654 567788899999866543 1 112223343333333333222111 122 2345788999 Q ss_pred EEEEeCCcce-eeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcCh--HHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 73 FTFWADKPGA-YWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILN 149 (324) Q Consensus 73 ip~~~~~~~a-~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~--~~~~~~i~~~l~~ai~~~~d~~~l~G 149 (324) ||+.....-. +-.+.+-.....+.+....+++..|.-... |-+-=.+.+. ..+...+.+.....+.-.+|...+.- T Consensus 79 Ip~i~~~gl~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~-VD~~D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~~sk 157 (329) T protein:vir:10 79 VIKGDVTELKDYKRNATNEFDHPQIQETTYFLDQEKYWGRF-VDALDRRDTEGNIDINYVVAKQASEVVAPYLDNLRFAT 157 (329) T ss_pred EeeecccccccccCCCCccccccccceeEEEeecccceeee-cchhhHhhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHH Confidence 9998653322 222333222222334444555554433332 2221122222 23345556666677777788765532 Q ss_pred cCccccccccccccccccccccchhhhhHHHHHHHHhhhhcCCC-cEEEEcHHHHHHHHHhh----ccC-CceeeccCCc Q lcl|Aclame:pro 150 QGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEA-NAFISKTQNRSLLRKIV----DPE-TKERIYDRNS 223 (324) Q Consensus 150 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~-~~~v~~~~~~~~l~~~~----d~~-g~~~~~~~~~ 223 (324) --.+. ........+.+-.++.|.+++.+|.....+. -.++++|..+..|.+.. ... .......+.. T Consensus 158 la~~a--------~~~~~~~~t~~nay~~i~~a~~~Lde~~vp~~Rvl~VtP~~~~~Lk~~~~f~~~~~~~~~~~~~g~V 229 (329) T protein:vir:10 158 LARNK--------AKHLTVGSGADAQYDAVLDVSVELDEIGAGASRILFVTPKFYKGIKKFVIELPQGDNRQQVLGKGVQ 229 (329) T ss_pred HHhhc--------ccccccccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeeee Confidence 11100 0011111223445788999999998775443 35789999999887522 111 1223456677 Q ss_pred ceeecceeEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEec Q lcl|Aclame:pro 224 DSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIAD 303 (324) Q Consensus 224 ~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~ 303 (324) +++.|.+|+.+++.......+++|..+...... +--.++..+. ..+. | --.++.+.++|+.|++ T Consensus 230 g~idG~~Ii~vps~~~k~in~ii~~~~A~~~~~-K~~~~~~~~p-----~~~~-------~---a~~v~gr~yyd~~V~~ 293 (329) T protein:vir:10 230 GELDGFTIVKVPSKMLQGVEAMAVIGEVMASPI-QANEAKLNSN-----VPGM-------F---GTLAEQMLYTGAFVPE 293 (329) T ss_pred eeecCeEEEEecCCcccceeEEEEcCCceeeee-eeeeeeeeCC-----CCcc-------c---hheeeeeeeeeeEEEc Confidence 889999999888776666566776654433222 1112222110 0111 1 1257888999999999 Q ss_pred cCceEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 304 DKAFAKLVPADKRTDSVPGEV 324 (324) Q Consensus 304 ~~A~~~l~~~~~~~~~~~~~~ 324 (324) +++............+..+.. T Consensus 294 ~k~~~I~~~~~~a~~~~~~~~ 314 (329) T protein:vir:10 294 HLQKYIFTIGGKEVETNRDGV 314 (329) T ss_pred cccCEEEEecccCcccCCCCC Confidence 997665554443333333332 No 156 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=98.80 E-value=1.1e-09 Score=69.61 Aligned_cols=291 Identities=10% Similarity=0.011 Sum_probs=152.9 Q ss_pred hhccccccccc-----cCccccchHHHHHHHHHHHhhhhhhhhcceeecC-CCceEEEEEeCCcceeeeccCcccccccc Q lcl|Aclame:pro 23 VFNPDNVMMHE-----KKDGTLMNEFTTPILQEVMENSKIMQLGKYEPME-GTEKKFTFWADKPGAYWVGEGQKIETSKA 96 (324) Q Consensus 23 ~~~~~~~~~~~-----~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~ 96 (324) .-.+++.+-.. +.-.+.-+++..++.......+.++++..+.++. ++++.+|+. +..+++...-|+++..+.+ T Consensus 1 Ms~~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~vRti~~gkS~qf~~~-G~s~~~~~~pG~~ld~~~~ 79 (401) T protein:vir:70 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYL-GETELQVLAPGQSPAATST 79 (401) T ss_pred CCCCccccccccccccchhHhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEe-eeeEeeeecCCCCcCCCCc Confidence 11111111111 1123566899999999999999999999887764 567888987 5666777766666665666 Q ss_pred ceeeEEeehee-eEEeeeehHHHhhcChHH-HHHHHHHHHHHHHHHHHHHHHHh-----ccCc----cccccccccc--- Q lcl|Aclame:pro 97 TWVNATMRAFK-LGVILPVTKEFLNYTYSQ-FFEEMKPMIAEAFYKKFDEAGIL-----NQGN----NPFGKSIAQS--- 162 (324) Q Consensus 97 ~~~~v~l~~~k-~~~~~~iS~e~l~ds~~~-~~~~i~~~l~~ai~~~~d~~~l~-----G~g~----~~~~~~~~~~--- 162 (324) .-++..+.... +.....|-+=---++.++ +.+.+.+++++++++..|+.++. |-.. ...+.+...+ T Consensus 80 ~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~~~~~G~~i 159 (401) T protein:vir:70 80 QADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPRVKGHGFSI 159 (401) T ss_pred ccccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcCCCceEE Confidence 66666665543 222222222111134566 78899999999999999987742 2110 0111111111 Q ss_pred ---cccccccccchhhhhHHHHHHHHhhhhcCCCc--EEEEcHHHHHHHHHh---hccC----CceeeccCCcceeecce Q lcl|Aclame:pro 163 ---IEKTNKVIKGDFTQDNIIDLEALLEDDELEAN--AFISKTQNRSLLRKI---VDPE----TKERIYDRNSDSLDGLP 230 (324) Q Consensus 163 ---~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~--~~v~~~~~~~~l~~~---~d~~----g~~~~~~~~~~~l~G~p 230 (324) ........++.-..+.+.++...|...+.+.. ++++.|..+..|... -+.. +...+..+....+.|+| T Consensus 160 ~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~r~vvl~pp~~Ys~Ll~~d~L~nrd~~~s~~g~~~~G~v~~vaGv~ 239 (401) T protein:vir:70 160 NVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKTYTISQSGATIQGFTLSSYNCP 239 (401) T ss_pred eccccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhcCcccchhhccccCCccccceEEEEeceE Confidence 11111111222234557788888877776654 456667766666432 1111 11223445556799999 Q ss_pred eEeecCCCCCC-------------cee--EEeecccEE--EEEecceEEEEeeccceeccccccccchhhhhcCcEEEEE Q lcl|Aclame:pro 231 VVNLKSSNLKR-------------GEL--ITGDFDKLI--YGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRA 293 (324) Q Consensus 231 v~~~~~~~~~~-------------~~~--i~gd~s~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~ 293 (324) |+.+++.+... |.. +-+|++... +..+..+-+--....+...+.+... | .+ .+-+ T Consensus 240 Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~~~~~d~r~-----~-~~--~id~ 311 (401) T protein:vir:70 240 VIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVTGDIFYEKKE-----K-TY--YIDT 311 (401) T ss_pred EEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeeccccchhhhhhh-----h-HH--HHHH Confidence 99887665322 111 225555432 2222221111111111111111110 0 11 1224 Q ss_pred EEEeccEEeccCceEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 294 TMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) Q Consensus 294 ~~r~d~~v~~~~A~~~l~~~~~~~~~~~~~~ 324 (324) ..-+|..+.||+|.+.++.+-. ..++.++ T Consensus 312 ~~a~g~g~~RPeaa~vv~~k~~--~~~~~~~ 340 (401) T protein:vir:70 312 FMAEGAIPDRWEAVSVVTTKRN--TTTGAVE 340 (401) T ss_pred HHHhCCcccchhheEEEeecCc--ccccccc Confidence 4567899999999998865433 2223222 No 157 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=98.79 E-value=6e-09 Score=65.64 Aligned_cols=285 Identities=16% Similarity=0.073 Sum_probs=151.8 Q ss_pred Cchh-----HHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhh-h-cc--eeecCCCce Q lcl|Aclame:pro 1 MEQT-----QKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQ-L-GK--YEPMEGTEK 71 (324) Q Consensus 1 ~~~~-----~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~-l-~~--~~~~~~~~~ 71 (324) |||. .++|.+||+|+.+....- ...+ .+.-..+++.+.....+.. + +. ....+++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~n-------------t~~l-~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~gg~tV 66 (319) T protein:vir:97 1 MNKTIKNATGMLKLNLQHFANKSVEPG-------------QTLL-KNKHVGILERVTAVNAYSTPALISNDAIFMEGRSF 66 (319) T ss_pred CCcccccccceeEeehhhhhccCCCcc-------------hHHH-HHHHHHHHHHHHHHhhhhhhcccCcceEeccCcEE Confidence 7664 557788999987554431 1122 2233334555544444332 1 22 234578889 Q ss_pred EEEEEeCCcce-eeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChH--HHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 72 KFTFWADKPGA-YWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYS--QFFEEMKPMIAEAFYKKFDEAGIL 148 (324) Q Consensus 72 ~ip~~~~~~~a-~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~--~~~~~i~~~l~~ai~~~~d~~~l~ 148 (324) +||+.....-. +-.+.+-.....+.+....++...|.-... |-.-=.+.+.. .+...+.+.....+.-.+|...+. T Consensus 67 kIp~i~~~gl~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~-VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~s 145 (319) T protein:vir:97 67 TVMKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRF-VDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFA 145 (319) T ss_pred EEeeecccccccccCCCCcccCCcccceeEEEeecccccccc-cchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHH Confidence 99998753322 222333222222333444555544433322 22211222222 334455566666667677876553 Q ss_pred ccCccccccccccccccccccccchhhhhHHHHHHHHhhhhcCCC-cEEEEcHHHHHHHHHhhc-----cCCceeeccCC Q lcl|Aclame:pro 149 NQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEA-NAFISKTQNRSLLRKIVD-----PETKERIYDRN 222 (324) Q Consensus 149 G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~-~~~v~~~~~~~~l~~~~d-----~~g~~~~~~~~ 222 (324) -...+. ........+.+-.++.|.+++.+|.....+. -.++++|..+..|.+-.. ..+......+. T Consensus 146 kla~~a--------~~~~~~~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g~ 217 (319) T protein:vir:97 146 TLARNK--------AKHLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGV 217 (319) T ss_pred HHHhhc--------ccccccccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeee Confidence 221110 0011111223446888999999998877653 357999999998854321 12233445677 Q ss_pred cceeecceeEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEe Q lcl|Aclame:pro 223 SDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIA 302 (324) Q Consensus 223 ~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~ 302 (324) .+++.|.+|+.+++....+-.+++|..+..... .+--.+++.+. ..+. | --.++.+.++|..|+ T Consensus 218 Vg~idG~~Vi~vps~~~k~in~i~~h~~A~~~~-~k~~~~~~~~p-----~~~~-------~---a~~v~gr~y~d~~V~ 281 (319) T protein:vir:97 218 QGELDGFVIVKVPTKLLQGLQAIAVVGEVLASP-IQADLAKTNSN-----IPGM-------F---GTLAEQLLYTGAFVP 281 (319) T ss_pred ceeecCeEEEEecccccccceEEEEcCCeeeee-eeeeeeeccCC-----Cccc-------c---ceeeeeeeeeeeEEe Confidence 789999999988776666666777765443322 22111221110 0000 1 124788899999999 Q ss_pred ccCceEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 303 DDKAFAKLVPADKRTDSVPGEV 324 (324) Q Consensus 303 ~~~A~~~l~~~~~~~~~~~~~~ 324 (324) ++++......+...+...+... T Consensus 282 ~~k~~~Iy~~~~~~~~~~~~~~ 303 (319) T protein:vir:97 282 EHLQKYIFTIGGTEVATKRDGV 303 (319) T ss_pred ccccceEEEeecCCcccCCCcc Confidence 9997666665544444444333 No 158 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=98.79 E-value=6e-09 Score=65.64 Aligned_cols=285 Identities=16% Similarity=0.073 Sum_probs=151.8 Q ss_pred Cchh-----HHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhh-h-cc--eeecCCCce Q lcl|Aclame:pro 1 MEQT-----QKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQ-L-GK--YEPMEGTEK 71 (324) Q Consensus 1 ~~~~-----~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~-l-~~--~~~~~~~~~ 71 (324) |||. .++|.+||+|+.+....- ...+ .+.-..+++.+.....+.. + +. ....+++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~n-------------t~~l-~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~gg~tV 66 (319) T protein:vir:94 1 MNKTIKNATGMLKLNLQHFANKSVEPG-------------QTLL-KNKHVGILERVTAVNAYSTPALISNDAIFMEGRSF 66 (319) T ss_pred CCcccccccceeEeehhhhhccCCCcc-------------hHHH-HHHHHHHHHHHHHHhhhhhhcccCcceEeccCcEE Confidence 7664 557788999987554431 1122 2233334555544444332 1 22 234578889 Q ss_pred EEEEEeCCcce-eeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChH--HHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 72 KFTFWADKPGA-YWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYS--QFFEEMKPMIAEAFYKKFDEAGIL 148 (324) Q Consensus 72 ~ip~~~~~~~a-~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~--~~~~~i~~~l~~ai~~~~d~~~l~ 148 (324) +||+.....-. +-.+.+-.....+.+....++...|.-... |-.-=.+.+.. .+...+.+.....+.-.+|...+. T Consensus 67 kIp~i~~~gl~DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~-VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~s 145 (319) T protein:vir:94 67 TVMKGDTTELKDYKRNATNEFDHPKIEETTYFLDQEKYWGRF-VDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFA 145 (319) T ss_pred EEeeecccccccccCCCCcccCCcccceeEEEeecccccccc-cchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHH Confidence 99998753322 222333222222333444555544433322 22211222222 334455566666667677876553 Q ss_pred ccCccccccccccccccccccccchhhhhHHHHHHHHhhhhcCCC-cEEEEcHHHHHHHHHhhc-----cCCceeeccCC Q lcl|Aclame:pro 149 NQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEA-NAFISKTQNRSLLRKIVD-----PETKERIYDRN 222 (324) Q Consensus 149 G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~-~~~v~~~~~~~~l~~~~d-----~~g~~~~~~~~ 222 (324) -...+. ........+.+-.++.|.+++.+|.....+. -.++++|..+..|.+-.. ..+......+. T Consensus 146 kla~~a--------~~~~~~~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~~g~ 217 (319) T protein:vir:94 146 TLARNK--------AKHLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLGKGV 217 (319) T ss_pred HHHhhc--------ccccccccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhhccccccccceeeee Confidence 221110 0011111223446888999999998877653 357999999998854321 12233445677 Q ss_pred cceeecceeEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEe Q lcl|Aclame:pro 223 SDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIA 302 (324) Q Consensus 223 ~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~ 302 (324) .+++.|.+|+.+++....+-.+++|..+..... .+--.+++.+. ..+. | --.++.+.++|..|+ T Consensus 218 Vg~idG~~Vi~vps~~~k~in~i~~h~~A~~~~-~k~~~~~~~~p-----~~~~-------~---a~~v~gr~y~d~~V~ 281 (319) T protein:vir:94 218 QGELDGFVIVKVPTKLLQGLQAIAVVGEVLASP-IQADLAKTNSN-----IPGM-------F---GTLAEQLLYTGAFVP 281 (319) T ss_pred ceeecCeEEEEecccccccceEEEEcCCeeeee-eeeeeeeccCC-----Cccc-------c---ceeeeeeeeeeeEEe Confidence 789999999988776666666777765443322 22111221110 0000 1 124788899999999 Q ss_pred ccCceEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 303 DDKAFAKLVPADKRTDSVPGEV 324 (324) Q Consensus 303 ~~~A~~~l~~~~~~~~~~~~~~ 324 (324) ++++......+...+...+... T Consensus 282 ~~k~~~Iy~~~~~~~~~~~~~~ 303 (319) T protein:vir:94 282 EHLQKYIFTIGGTEVATKRDGV 303 (319) T ss_pred ccccceEEEeecCCcccCCCcc Confidence 9997666665544444444333 No 159 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=98.78 E-value=3.3e-09 Score=67.09 Aligned_cols=271 Identities=11% Similarity=0.053 Sum_probs=160.1 Q ss_pred hccccccccccCccccc---hHHHHHHHHHHHhhhhhhhhccee---ecCCCceEEEEEeCCcceeeeccCc-ccccccc Q lcl|Aclame:pro 24 FNPDNVMMHEKKDGTLM---NEFTTPILQEVMENSKIMQLGKYE---PMEGTEKKFTFWADKPGAYWVGEGQ-KIETSKA 96 (324) Q Consensus 24 ~~~~~~~~~~~~~~~vp---~~~~~~i~~~~~~~s~l~~l~~~~---~~~~~~~~ip~~~~~~~a~~v~Eg~-~~~~~~~ 96 (324) +. .--..+++.++- +.+...+++...+.-..++++... +....++.+++......+.|++.++ .+|..+. T Consensus 1 ~~---~~~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~ 77 (296) T protein:vir:10 1 MG---VDKADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQIVADYTDDLPLVDA 77 (296) T ss_pred Cc---ccchhhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCceeEeCCCccccceeec Confidence 11 111123344443 355566777777777777776643 2233455666766677788987754 5888889 Q ss_pred ceeeEEeeheeeEEeeeehHHHhhcC---hHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccc----- Q lcl|Aclame:pro 97 TWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNK----- 168 (324) Q Consensus 97 ~~~~v~l~~~k~~~~~~iS~e~l~ds---~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~----- 168 (324) ..+......+.++..+.++.+-++.+ ..++..--....+++++..+|+.+|+|+...... |+.+....... T Consensus 78 ~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g~~-GLlN~p~v~~~~~~~~ 156 (296) T protein:vir:10 78 LATERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHGIP-SVFDYPNINNVVSGGS 156 (296) T ss_pred cceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccce-eEeecCCCccccccCC Confidence 99999999999999999988666644 4678888888889999999999999997553222 22222111111 Q ss_pred cccchhhhhHHHHHHHHhhhh---cCCCcEEEEcHHHHHHHHHhhccCCceeec----cCCcceeecceeEeecCCCCCC Q lcl|Aclame:pro 169 VIKGDFTQDNIIDLEALLEDD---ELEANAFISKTQNRSLLRKIVDPETKERIY----DRNSDSLDGLPVVNLKSSNLKR 241 (324) Q Consensus 169 ~~~~~~~~~~i~~~~~~l~~~---~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~----~~~~~~l~G~pv~~~~~~~~~~ 241 (324) ..+.+-.++||.+++.++... ...+..++++|+.+..|.......|.-++. ...+.++...|..... ...++ T Consensus 157 W~~~t~i~~Di~~~~~~l~~~s~g~~~p~~l~L~p~~~~~L~~~~~~~~~t~l~~ik~~~~~l~i~~~~~l~~a-~~~g~ 235 (296) T protein:vir:10 157 WSQPTTAVSDITSLLDIIETSTNGQHRATHLLLPTTARRIMQNLVPGTSVSYGEFFRQNNSGVTVEFVQYLNDY-NGTGT 235 (296) T ss_pred ccCHHHHHHHHHHHHHHHHHhhCceecceeEEeCHHHHHHHhhccCCCCccHHHHHHHhcCCceEEEeeeeccC-CCCcc Confidence 112234578889998877543 345678999999999987554444433221 1223344444433211 11122 Q ss_pred ceeEEeecc--cEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEe-ccEEeccCceEEEEeecCC Q lcl|Aclame:pro 242 GELITGDFD--KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHV-ALHIADDKAFAKLVPADKR 316 (324) Q Consensus 242 ~~~i~gd~s--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~-d~~v~~~~A~~~l~~~~~~ 316 (324) ..+++.+.+ .+-+.....++..... ...=...++...|+ |..+.+|.|++++++.+-. T Consensus 236 ~~~v~~~~~~~~~~~~v~~~~~~~~~e-----------------~~~l~~~~~~~~~~~Gv~i~~P~ai~~~dGI~~~ 296 (296) T protein:vir:10 236 SAAIAYEKDPNNMAIEIPEATNALPAQ-----------------PKDLHFKIPVTSKATGLIVYRPLTMAVMKGITFA 296 (296) T ss_pred eEEEEEEcCCceEEEEcCcceeeeccc-----------------ccCceEEEeeEeeEEEEEEECCceeEEEeeeecC Confidence 222322211 2222222222221100 00111245667777 4899999999999988776 No 160 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=98.78 E-value=2.8e-09 Score=67.49 Aligned_cols=224 Identities=14% Similarity=0.142 Sum_probs=141.6 Q ss_pred hHHhhccccccccccCcccc-ch-HHHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCCcceeeeccCcccccccc Q lcl|Aclame:pro 20 KPQVFNPDNVMMHEKKDGTL-MN-EFTTPILQEVMENSKIMQLGKYEPMEGTE-KKFTFWADKPGAYWVGEGQKIETSKA 96 (324) Q Consensus 20 ~~~~~~~~~~~~~~~~~~~v-p~-~~~~~i~~~~~~~s~l~~l~~~~~~~~~~-~~ip~~~~~~~a~~v~Eg~~~~~~~~ 96 (324) .+.. ..+..+-.+....+ |. .+...|+|.+.+.++|+...+.+..+.+. ....+.++-|.+.|..=++.++.++. T Consensus 1 m~~~--~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~~s~~ 78 (331) T protein:vir:10 1 MPTL--STTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKS 78 (331) T ss_pred CCcc--ccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccCCccCcccc Confidence 0000 00111111111122 32 34567999999999999999988765444 44577888999999999999999999 Q ss_pred ceeeEEeeheeeEEeeeehHHHhhcCh--HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccc--c-------------- Q lcl|Aclame:pro 97 TWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGK--S-------------- 158 (324) Q Consensus 97 ~~~~v~l~~~k~~~~~~iS~e~l~ds~--~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~--~-------------- 158 (324) ++.+++-..+-+++.+.|.+.+.+.+. -++...-.+...+++.+.+...+|+|+.+..+.. | T Consensus 79 tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~ 158 (331) T protein:vir:10 79 RTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQ 158 (331) T ss_pred eeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhcccccccccc Confidence 999999999999999999999888643 3344556677889999999999999973311000 0 Q ss_pred -ccc--ccc---------------------------------------------------------------------cc Q lcl|Aclame:pro 159 -IAQ--SIE---------------------------------------------------------------------KT 166 (324) Q Consensus 159 -~~~--~~~---------------------------------------------------------------------~~ 166 (324) +.. +.+ .. T Consensus 159 q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~ 238 (331) T protein:vir:10 159 NIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) T ss_pred ceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEe Confidence 000 000 00 Q ss_pred cc--------cccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCc-eee-----ccCCcceeecceeE Q lcl|Aclame:pro 167 NK--------VIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETK-ERI-----YDRNSDSLDGLPVV 232 (324) Q Consensus 167 ~~--------~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~-~~~-----~~~~~~~l~G~pv~ 232 (324) +. +.++....+.++.+...++.......+|+||...+..|++....-+. ..+ .+.....+.|+||. T Consensus 239 NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir 318 (331) T protein:vir:10 239 NVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCR 318 (331) T ss_pred ccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEE Confidence 00 00111122334455566666677778999999999999876433322 222 23345678999998 Q ss_pred eecCCCCCCceeE Q lcl|Aclame:pro 233 NLKSSNLKRGELI 245 (324) Q Consensus 233 ~~~~~~~~~~~~i 245 (324) .+.+.--.+..++ T Consensus 319 ~~dai~~tE~~Vv 331 (331) T protein:vir:10 319 RTDALLLTEARVV 331 (331) T ss_pred EeeeeecCccccC Confidence 7765443333332 No 161 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=98.78 E-value=2.8e-09 Score=67.49 Aligned_cols=224 Identities=14% Similarity=0.142 Sum_probs=141.6 Q ss_pred hHHhhccccccccccCcccc-ch-HHHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCCcceeeeccCcccccccc Q lcl|Aclame:pro 20 KPQVFNPDNVMMHEKKDGTL-MN-EFTTPILQEVMENSKIMQLGKYEPMEGTE-KKFTFWADKPGAYWVGEGQKIETSKA 96 (324) Q Consensus 20 ~~~~~~~~~~~~~~~~~~~v-p~-~~~~~i~~~~~~~s~l~~l~~~~~~~~~~-~~ip~~~~~~~a~~v~Eg~~~~~~~~ 96 (324) .+.. ..+..+-.+....+ |. .+...|+|.+.+.++|+...+.+..+.+. ....+.++-|.+.|..=++.++.++. T Consensus 1 m~~~--~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~~s~~ 78 (331) T protein:vir:10 1 MPTL--STTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKS 78 (331) T ss_pred CCcc--ccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccCCccCcccc Confidence 0000 00111111111122 32 34567999999999999999988765444 44577888999999999999999999 Q ss_pred ceeeEEeeheeeEEeeeehHHHhhcCh--HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccc--c-------------- Q lcl|Aclame:pro 97 TWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGK--S-------------- 158 (324) Q Consensus 97 ~~~~v~l~~~k~~~~~~iS~e~l~ds~--~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~--~-------------- 158 (324) ++.+++-..+-+++.+.|.+.+.+.+. -++...-.+...+++.+.+...+|+|+.+..+.. | T Consensus 79 tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~ 158 (331) T protein:vir:10 79 RTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQ 158 (331) T ss_pred eeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhcccccccccc Confidence 999999999999999999999888643 3344556677889999999999999973311000 0 Q ss_pred -ccc--ccc---------------------------------------------------------------------cc Q lcl|Aclame:pro 159 -IAQ--SIE---------------------------------------------------------------------KT 166 (324) Q Consensus 159 -~~~--~~~---------------------------------------------------------------------~~ 166 (324) +.. +.+ .. T Consensus 159 q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~ 238 (331) T protein:vir:10 159 NIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) T ss_pred ceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEe Confidence 000 000 00 Q ss_pred cc--------cccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCc-eee-----ccCCcceeecceeE Q lcl|Aclame:pro 167 NK--------VIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETK-ERI-----YDRNSDSLDGLPVV 232 (324) Q Consensus 167 ~~--------~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~-~~~-----~~~~~~~l~G~pv~ 232 (324) +. +.++....+.++.+...++.......+|+||...+..|++....-+. ..+ .+.....+.|+||. T Consensus 239 NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir 318 (331) T protein:vir:10 239 NVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCR 318 (331) T ss_pred ccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEE Confidence 00 00111122334455566666677778999999999999876433322 222 23345678999998 Q ss_pred eecCCCCCCceeE Q lcl|Aclame:pro 233 NLKSSNLKRGELI 245 (324) Q Consensus 233 ~~~~~~~~~~~~i 245 (324) .+.+.--.+..++ T Consensus 319 ~~dai~~tE~~Vv 331 (331) T protein:vir:10 319 RTDALLLTEARVV 331 (331) T ss_pred EeeeeecCccccC Confidence 7765443333332 No 162 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=98.78 E-value=2.8e-09 Score=67.49 Aligned_cols=224 Identities=14% Similarity=0.142 Sum_probs=141.6 Q ss_pred hHHhhccccccccccCcccc-ch-HHHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCCcceeeeccCcccccccc Q lcl|Aclame:pro 20 KPQVFNPDNVMMHEKKDGTL-MN-EFTTPILQEVMENSKIMQLGKYEPMEGTE-KKFTFWADKPGAYWVGEGQKIETSKA 96 (324) Q Consensus 20 ~~~~~~~~~~~~~~~~~~~v-p~-~~~~~i~~~~~~~s~l~~l~~~~~~~~~~-~~ip~~~~~~~a~~v~Eg~~~~~~~~ 96 (324) .+.. ..+..+-.+....+ |. .+...|+|.+.+.++|+...+.+..+.+. ....+.++-|.+.|..=++.++.++. T Consensus 1 m~~~--~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~vrt~LP~~~fR~lN~g~~~s~~ 78 (331) T protein:vir:98 1 MPTL--STTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTTVRSGLPTGTWRKLNYGVQPEKS 78 (331) T ss_pred CCcc--ccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceeeEEeccCCchhhccCCccCcccc Confidence 0000 00111111111122 32 34567999999999999999988765444 44577888999999999999999999 Q ss_pred ceeeEEeeheeeEEeeeehHHHhhcCh--HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccc--c-------------- Q lcl|Aclame:pro 97 TWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGK--S-------------- 158 (324) Q Consensus 97 ~~~~v~l~~~k~~~~~~iS~e~l~ds~--~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~--~-------------- 158 (324) ++.+++-..+-+++.+.|.+.+.+.+. -++...-.+...+++.+.+...+|+|+.+..+.. | T Consensus 79 tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~ 158 (331) T protein:vir:98 79 RTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQ 158 (331) T ss_pred eeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhcccccccccc Confidence 999999999999999999999888643 3344556677889999999999999973311000 0 Q ss_pred -ccc--ccc---------------------------------------------------------------------cc Q lcl|Aclame:pro 159 -IAQ--SIE---------------------------------------------------------------------KT 166 (324) Q Consensus 159 -~~~--~~~---------------------------------------------------------------------~~ 166 (324) +.. +.+ .. T Consensus 159 q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~ 238 (331) T protein:vir:98 159 NIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) T ss_pred ceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEe Confidence 000 000 00 Q ss_pred cc--------cccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCc-eee-----ccCCcceeecceeE Q lcl|Aclame:pro 167 NK--------VIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETK-ERI-----YDRNSDSLDGLPVV 232 (324) Q Consensus 167 ~~--------~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~-~~~-----~~~~~~~l~G~pv~ 232 (324) +. +.++....+.++.+...++.......+|+||...+..|++....-+. ..+ .+.....+.|+||. T Consensus 239 NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir 318 (331) T protein:vir:98 239 NVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCR 318 (331) T ss_pred ccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEE Confidence 00 00111122334455566666677778999999999999876433322 222 23345678999998 Q ss_pred eecCCCCCCceeE Q lcl|Aclame:pro 233 NLKSSNLKRGELI 245 (324) Q Consensus 233 ~~~~~~~~~~~~i 245 (324) .+.+.--.+..++ T Consensus 319 ~~dai~~tE~~Vv 331 (331) T protein:vir:98 319 RTDALLLTEARVV 331 (331) T ss_pred EeeeeecCccccC Confidence 7765443333332 No 163 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=98.72 E-value=1.4e-08 Score=63.64 Aligned_cols=288 Identities=8% Similarity=0.021 Sum_probs=159.2 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccc---hHHHHHHHHHHHhhhhhhhhccee---ecCCCceEEE Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLM---NEFTTPILQEVMENSKIMQLGKYE---PMEGTEKKFT 74 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp---~~~~~~i~~~~~~~s~l~~l~~~~---~~~~~~~~ip 74 (324) |.+.+.....+...+. ......++.+ ...+.|.+.. +.+...+++...+.-..++++... +....++.+. T Consensus 1 ~~~~~~~~~~~~~~~~-~~~~~~~~~d---a~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~ 76 (319) T protein:vir:10 1 MTTKKFDEADKSNVEM-YLIQAGVKQD---AAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTDKTFEYM 76 (319) T ss_pred CCCcchhHHhhHHHHH-HHhhccchhh---hhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCCCceEEEEee Confidence 6554332222221110 0001111111 0112233433 345566777777777777777654 2233445566 Q ss_pred EEeCCcceeeeccCc-cccccccceeeEEeeheeeEEeeeehHHHhhcC---hHHHHHHHHHHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 75 FWADKPGAYWVGEGQ-KIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEAFYKKFDEAGILNQ 150 (324) Q Consensus 75 ~~~~~~~a~~v~Eg~-~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds---~~~~~~~i~~~l~~ai~~~~d~~~l~G~ 150 (324) +......+.|++.++ .+|..+..++......+.++..+.++..-++.+ ..++..--....+++++..+|+.+|+|+ T Consensus 77 ~~~~~G~a~~~~d~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~ 156 (319) T protein:vir:10 77 TFDKVGTAQIIADYTDDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLVFKGS 156 (319) T ss_pred eeccccceeeecCccccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeec Confidence 666667788987755 589888999999999999999999987656544 4777888888899999999999999997 Q ss_pred Ccccccccccccccccccc---------ccchhhhhHHHHHHHHhhhh---cCCCcEEEEcHHHHHHHHHhhccCCceee Q lcl|Aclame:pro 151 GNNPFGKSIAQSIEKTNKV---------IKGDFTQDNIIDLEALLEDD---ELEANAFISKTQNRSLLRKIVDPETKERI 218 (324) Q Consensus 151 g~~~~~~~~~~~~~~~~~~---------~~~~~~~~~i~~~~~~l~~~---~~~~~~~v~~~~~~~~l~~~~d~~g~~~~ 218 (324) ..... .|+.+........ .+..-.++|+.+++.++... ...+..++++|+.+..|.......|..++ T Consensus 157 ~~~g~-~GLlN~p~~~~~~~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~t~l 235 (319) T protein:vir:10 157 APHKI-VSVFNHPNITKITSGKWIDVSTMKPETAEAELTQAIETIETITRGQHRATNILIPPSMRKVLAIRMPETTMSYL 235 (319) T ss_pred ccccc-eeEEeCCCceeeecCCCCCccccCHHHHHHHHHHHHHHHHHhcCceeeceEEEecHHHHHhhhcccCCCCeeHH Confidence 64322 2222222111111 11123457788888887632 34567899999999999755544443332 Q ss_pred c----cCCcceeecceeEeecCCCCCCceeEEeecc-c-EEEEEecceEEEEeeccceeccccccccchhhhhcCc--EE Q lcl|Aclame:pro 219 Y----DRNSDSLDGLPVVNLKSSNLKRGELITGDFD-K-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDM--VA 290 (324) Q Consensus 219 ~----~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s-~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~--v~ 290 (324) . ...+.++.+.|..... ...+...+++...+ . +-+.....++.... +..- .. T Consensus 236 ~~lk~~~~~l~I~~~pel~~a-g~~g~~~~v~y~~~~~~~~~~v~~~~~~~~~-------------------e~~~l~~~ 295 (319) T protein:vir:10 236 DYFKSQNSGIEIDSIAELEDI-DGAGTKGVLVYEKNPMNMSIEIPEAFNMLPA-------------------QPKDLHFK 295 (319) T ss_pred HHHHHhcCCceEEEeeeeccc-CCCcceEEEEEecCCceEEEecCcceeeeee-------------------eecCceEE Confidence 1 1223345555543211 11111222222211 1 11222222221110 1111 12 Q ss_pred EEEEEEe-ccEEeccCceEEEEee Q lcl|Aclame:pro 291 LRATMHV-ALHIADDKAFAKLVPA 313 (324) Q Consensus 291 ~r~~~r~-d~~v~~~~A~~~l~~~ 313 (324) +....|+ |..+.+|.|++++++. T Consensus 296 ~~~~~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 296 VPCTSKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred EeeeeeeEEEEEEccceeEeeecC Confidence 3344554 4778899999999998 No 164 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=98.72 E-value=1.2e-08 Score=64.01 Aligned_cols=302 Identities=10% Similarity=0.044 Sum_probs=156.1 Q ss_pred CchhHHHHHHHHHHHhhhh--hH-HhhccccccccccCccccchHHHHHHHHHHHh-hhhhhhhcceeecC-CCceEEEE Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNV--KP-QVFNPDNVMMHEKKDGTLMNEFTTPILQEVME-NSKIMQLGKYEPME-GTEKKFTF 75 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~--~~-~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~-~s~l~~l~~~~~~~-~~~~~ip~ 75 (324) |.-.+..++.|........ .. +.....-+.++++.+.++-....+.+++.-.. ...+...++.-+.+ ....+... T Consensus 366 ~~L~elAr~~L~~rg~~~~~~~~~~~~~~a~~htTSDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~ 445 (693) T protein:vir:95 366 MTLRELARASLVDRGIGVASLNAPQMVGLAFTHTSSDFGLILLDVANKSVLAGWEEAEETFPLWTKSGILTDFKPARRVG 445 (693) T ss_pred CcHHHHHHHHHHhcCCccCCCCHHHHHHHHHhcCcchhHHHHHHHHHHHHHHHHHhhhhHHHHHhccCCCCcccccceee Confidence 4433333333332211000 00 11111112344455554444343444333222 23345555433322 12222333 Q ss_pred EeCCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHh---ccCc Q lcl|Aclame:pro 76 WADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL---NQGN 152 (324) Q Consensus 76 ~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~---G~g~ 152 (324) ..+-++..-|.|++++......=..-++...++|..+.||||++-.-+.++..-|-..++++.++.+++.++. +... T Consensus 446 lg~~~~L~~V~E~gEyk~~t~~e~~e~~~l~tyG~~~~iTRqaiINDDLga~~~ip~~~g~aA~~~~~~~vy~~L~~Np~ 525 (693) T protein:vir:95 446 LGEFSSLRQVREGAEYKYVTLGERGEQIILATYGELFSITRQAIINDDLQMLSDIPFKLGQAAKATIGDLVYAVLTGNPA 525 (693) T ss_pred cCCCCChhhcCCCCceeeeecCCccceeehhhcCCeeeecHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcc Confidence 4455666789999999876665556778899999999999999987788999999999999999999986653 2221 Q ss_pred cccccccccccccc-cccccchhhhhHHHHHHHHhhhh------------cCCCcEEEEcHHHHHHHHHhhccCCceee- Q lcl|Aclame:pro 153 NPFGKSIAQSIEKT-NKVIKGDFTQDNIIDLEALLEDD------------ELEANAFISKTQNRSLLRKIVDPETKERI- 218 (324) Q Consensus 153 ~~~~~~~~~~~~~~-~~~~~~~~~~~~i~~~~~~l~~~------------~~~~~~~v~~~~~~~~l~~~~d~~g~~~~- 218 (324) -..+..+..+.-.. ...+.+.++.+.+.++...+... ...+..|++.+.......++..+...+.- T Consensus 526 m~DGk~LFhadH~Nl~tga~sals~~sl~~a~~am~~qk~~~~~~~g~~L~i~P~~llvP~~le~~a~~l~~s~~~~~a~ 605 (693) T protein:vir:95 526 MSDGKTLFHADHSNLLTGAASALSIDSLSKAKTQMATQKAQVEKGKGRTLNIRPGFVLTPVALEDKANQIINSESVPGAD 605 (693) T ss_pred ccCCcceeeccccccccccccccChHHHHHHHHHHHHhhcchhccCCceeecccceEEecchHHHHHHHHhccccccccc Confidence 11222233322221 11122344555555544333211 23456788888877777666544332211 Q ss_pred -ccCCcceeecc-eeEeecCCCCCCcee--EEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEE Q lcl|Aclame:pro 219 -YDRNSDSLDGL-PVVNLKSSNLKRGEL--ITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRAT 294 (324) Q Consensus 219 -~~~~~~~l~G~-pv~~~~~~~~~~~~~--i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~ 294 (324) ..+....+.|+ .+++.+......++. ++.|... ..+++. ++.....+.-....-|..|-+.+|++ T Consensus 606 ~~~~~~NP~~~~~~vi~~prL~~~s~~~Wyl~a~~~~------dtie~~-----yL~G~~~P~ie~~~gf~~dG~~~kvr 674 (693) T protein:vir:95 606 VNSGIVNPIRAFAQVIGEPRLDDASATAWYMAAKKGS------DTIEVA-----YLDGVDTPYLEQQEGFTVDGVASKVR 674 (693) T ss_pred cccccccchhccccccccceecCCCCCceEEecCCCC------CeEEEE-----EecCCCCCeEeecCCCCcceEEEEEE Confidence 11112334443 344333332222222 2222210 111221 12111211111112389999999999 Q ss_pred EEeccEEeccCceEEEEee Q lcl|Aclame:pro 295 MHVALHIADDKAFAKLVPA 313 (324) Q Consensus 295 ~r~d~~v~~~~A~~~l~~~ 313 (324) ..+|.+++|-..+.|-.++ T Consensus 675 ~D~G~~~iD~Rg~~kn~GA 693 (693) T protein:vir:95 675 IDAGVAPLDFRGLQKSNGA 693 (693) T ss_pred EeccCceeeccccccCCCC Confidence 9999999999988877776 No 165 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=98.71 E-value=5.7e-09 Score=65.79 Aligned_cols=276 Identities=11% Similarity=0.027 Sum_probs=145.7 Q ss_pred hcccccccc-ccCccccc----hHHHHHHHHHHHh-hhhhhhhcceeecCCCce--EEEEEeC------CcceeeeccCc Q lcl|Aclame:pro 24 FNPDNVMMH-EKKDGTLM----NEFTTPILQEVME-NSKIMQLGKYEPMEGTEK--KFTFWAD------KPGAYWVGEGQ 89 (324) Q Consensus 24 ~~~~~~~~~-~~~~~~vp----~~~~~~i~~~~~~-~s~l~~l~~~~~~~~~~~--~ip~~~~------~~~a~~v~Eg~ 89 (324) +...+..+. -.-+..|+ ++|..++.-...+ .+.|++-++...-.++.- ..+.... ...-.-...+. T Consensus 1 ~~~~~~~~~~~~Ms~~i~~~fv~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~ 80 (322) T protein:vir:10 1 MKLNAIMSMLPLIAGDIDQAFVQTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLASMDPDAVKRKRSRQQSADGT 80 (322) T ss_pred CcccceeeeeeeeechhhhHHHHHHHHHHHHHHHHhhhhhhcccccccccccccceeecccccccccccccccccccCcc Confidence 211111111 00011234 4444444444443 344555444322222221 1221110 01111122222 Q ss_pred -ccccccccee--eEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhcc-Cccccccc-cccccc Q lcl|Aclame:pro 90 -KIETSKATWV--NATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQ-GNNPFGKS-IAQSIE 164 (324) Q Consensus 90 -~~~~~~~~~~--~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~-g~~~~~~~-~~~~~~ 164 (324) ..|....... .+.+..+ .....|.+.-......|..+...+..+.+++++.|+.++.+. |....... ...... T Consensus 81 ~dtp~~~~~~~~r~~~~~d~--~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~~~~gt~v~~~ 158 (322) T protein:vir:10 81 YPTPVNNKPFAKRRTNVDTY--DTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPASIKGTGQPVEFL 158 (322) T ss_pred cCCCccccccceEEEeeccc--ccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhccccccccccccccC Confidence 2333333333 4444444 344677776666677899999999999999999999888642 22111111 111111 Q ss_pred cc--cccccchhhhhHHHHHHHHhhhhcCCCc---EEEEcHHHHHHHHHhhc-----cC-CceeeccCCcceeecceeEe Q lcl|Aclame:pro 165 KT--NKVIKGDFTQDNIIDLEALLEDDELEAN---AFISKTQNRSLLRKIVD-----PE-TKERIYDRNSDSLDGLPVVN 233 (324) Q Consensus 165 ~~--~~~~~~~~~~~~i~~~~~~l~~~~~~~~---~~v~~~~~~~~l~~~~d-----~~-g~~~~~~~~~~~l~G~pv~~ 233 (324) .. .......++++.|+++...|..+..+.. .++++|..+..|..... -. ...++..+..++++|+.++. T Consensus 159 ss~~i~~g~~g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~~l~~~G~ig~~lGf~~i~ 238 (322) T protein:vir:10 159 ATQEIGDGTKPISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQITEATSADYTSAMDLQSKGIITNWMGYTWIV 238 (322) T ss_pred CCcccccCccchhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhcccchhhhhcCeeeeeeeEEEEE Confidence 11 1122346788999999999988877743 47889999988864332 12 23344557778999999988 Q ss_pred ecCCCCCCc----------------eeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEe Q lcl|Aclame:pro 234 LKSSNLKRG----------------ELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHV 297 (324) Q Consensus 234 ~~~~~~~~~----------------~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~ 297 (324) +...+.... ..+++..+.+.++.+.++..+++. +++- .....+++.+.+ T Consensus 239 s~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~dv~~~i~~--------~~~~-------~~a~~I~~~~~~ 303 (322) T protein:vir:10 239 STRLDKFDPTQWGMAAEDGPQGDEIWCIAMTDMALGYHSCKDIWTKVAE--------DPSA-------SFAWRIYSAFTA 303 (322) T ss_pred eccCCccccccccccccCCCCccceeEEEEecCceeEEEeeeeeEEeec--------cCCc-------chhhhhhhhhhh Confidence 765442111 123333344444444444444422 1111 112336677889 Q ss_pred ccEEeccCceEEEEeecCC Q lcl|Aclame:pro 298 ALHIADDKAFAKLVPADKR 316 (324) Q Consensus 298 d~~v~~~~A~~~l~~~~~~ 316 (324) |..+++|+.++.|.....- T Consensus 304 Ga~ri~~~gVv~i~~~e~~ 322 (322) T protein:vir:10 304 DCVRVEDEHIFKLRLKNSL 322 (322) T ss_pred CceEeccCcEEEEEEeccC Confidence 9999999999999986665 No 166 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=98.70 E-value=1e-08 Score=64.34 Aligned_cols=264 Identities=9% Similarity=0.014 Sum_probs=153.5 Q ss_pred cccccCccccc---hHHHHHHHHHHHhhhhhhhhccee---ecCCCceEEEEEeCCcceeeeccCc-cccccccceeeEE Q lcl|Aclame:pro 30 MMHEKKDGTLM---NEFTTPILQEVMENSKIMQLGKYE---PMEGTEKKFTFWADKPGAYWVGEGQ-KIETSKATWVNAT 102 (324) Q Consensus 30 ~~~~~~~~~vp---~~~~~~i~~~~~~~s~l~~l~~~~---~~~~~~~~ip~~~~~~~a~~v~Eg~-~~~~~~~~~~~v~ 102 (324) ....+.|.+.. +.+...+++.+.+.-..++++... +.....+.+++......+.|.+.++ .+|..+..++... T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~~~~~~~~~~ 80 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGADDLPLVDVDMVRKS 80 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCcccccccccccceeEE Confidence 33333333322 455667788888888888876553 3334445666666667788987765 5788888899999 Q ss_pred eeheeeEEeeeehHHHhhcC---hHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccc--ccccc-------- Q lcl|Aclame:pro 103 MRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIE--KTNKV-------- 169 (324) Q Consensus 103 l~~~k~~~~~~iS~e~l~ds---~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~--~~~~~-------- 169 (324) ...+.++..+.++..-++.+ ..++..--....+++++..+|+.+|+|+..... .|+.+... ..... T Consensus 81 ~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g~-~GLlN~p~~~~~~~~~~~~~~~~ 159 (301) T protein:vir:80 81 VPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYAI-KGAFEATGIQIDVSPTTGVGNVS 159 (301) T ss_pred EEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccccc-eeeecCCCcccccccCccccccc Confidence 99999999999998666654 477888888899999999999999999764322 22222111 11000 Q ss_pred ----ccchhhhhHHHHHHHHhhhh---cCCCcEEEEcHHHHHHHHHhh--ccCCceeec----cCCcceeecceeEeecC Q lcl|Aclame:pro 170 ----IKGDFTQDNIIDLEALLEDD---ELEANAFISKTQNRSLLRKIV--DPETKERIY----DRNSDSLDGLPVVNLKS 236 (324) Q Consensus 170 ----~~~~~~~~~i~~~~~~l~~~---~~~~~~~v~~~~~~~~l~~~~--d~~g~~~~~----~~~~~~l~G~pv~~~~~ 236 (324) .+..--++||.+++.++... ...+..++++|+.+..|.... +..|..++. .....++...|-..... T Consensus 160 ~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~~~tvl~~l~~~~~~~~I~~~p~L~~~g 239 (301) T protein:vir:80 160 KWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINKKRYSNEDSRSVLKVLQDNAWFSAIVRVPDLAGMG 239 (301) T ss_pred ccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhhccccCCCCeeHHHHHHHHcCcceEEEcceeccCC Confidence 01112367888888887542 235678999999999997544 333433321 11223444444332111 Q ss_pred CCCCCceeE-Eeeccc-EEEEEecceEEEEeeccceeccccccccchhhhhcCc-EEEEEEEEe-ccEEeccCceEEEEe Q lcl|Aclame:pro 237 SNLKRGELI-TGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDM-VALRATMHV-ALHIADDKAFAKLVP 312 (324) Q Consensus 237 ~~~~~~~~i-~gd~s~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~-v~~r~~~r~-d~~v~~~~A~~~l~~ 312 (324) ....+.++ +.+-.. +-+.....++...- -.+++ .......|+ |..+.+|.|++++++ T Consensus 240 -~~g~~~~v~~~~~~d~~~~~v~~~~~~~~~------------------e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~~G 300 (301) T protein:vir:80 240 -TAGSDSFAVIHDSNETAELIIPMDITRHPE------------------EYSFPRTKVPFEERTAGVVVRFPAAIVRVDG 300 (301) T ss_pred -CCcccEEEEEecCCcEEEEEecCceeeecc------------------eecCceeEeeeeeeeEEEEEEccceEEEEec Confidence 11111122 211111 11221122211100 01121 112334555 678999999999999 Q ss_pred e Q lcl|Aclame:pro 313 A 313 (324) Q Consensus 313 ~ 313 (324) . T Consensus 301 I 301 (301) T protein:vir:80 301 I 301 (301) T ss_pred C Confidence 8 No 167 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=98.66 E-value=4e-08 Score=61.13 Aligned_cols=276 Identities=9% Similarity=-0.019 Sum_probs=164.6 Q ss_pred hhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeC-CcceeeeccCccccccccceeeE Q lcl|Aclame:pro 23 VFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWAD-KPGAYWVGEGQKIETSKATWVNA 101 (324) Q Consensus 23 ~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~-~~~a~~v~Eg~~~~~~~~~~~~v 101 (324) ...+.+.-++. .....-+.+...|+..-....|+.+++......+..++|....- .+...-..||+..+......... T Consensus 1 ma~~~~~~~t~-~~~g~~~dl~~~I~~isp~dTPf~S~i~~~~a~~~~~~W~~d~l~~~~~~~~~EG~da~~~~~~~r~~ 79 (317) T protein:vir:88 1 MATPTNAVSTV-EINGKREDLIDIIYNIAPYDTPFMSAIGKGVATAITHEWQTDELRQPGKNTRVEGEDATIKAGSFTTM 79 (317) T ss_pred CCccccceEee-eeeeeeechhhhheecCCccCcceeeecCceecccEEEEEeeecCCccccccccCcccccccccCCEE Confidence 22333332222 22234567788888888889999998877666676777776432 23333456887666543322211 Q ss_pred Ee-eheeeEEeeeehHHHhhcCh---HHHHHHHHHHHHHHHHHHHHHHHHhccCc-----cccc---ccccccccc---- Q lcl|Aclame:pro 102 TM-RAFKLGVILPVTKEFLNYTY---SQFFEEMKPMIAEAFYKKFDEAGILNQGN-----NPFG---KSIAQSIEK---- 165 (324) Q Consensus 102 ~l-~~~k~~~~~~iS~e~l~ds~---~~~~~~i~~~l~~ai~~~~d~~~l~G~g~-----~~~~---~~~~~~~~~---- 165 (324) .- -.+-+...+.||.-+..-+. .+...+-...-..++.+.+|+++|+|.-. ...+ .|+...... T Consensus 80 ~~N~tQIf~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~~~i~t~~~~ 159 (317) T protein:vir:88 80 LNNYCQISDETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIFAYYKTNGSL 159 (317) T ss_pred eccEEEEEEeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHHHHhccCcee Confidence 11 11223444445543332222 34444555555677888999999998521 1111 122111000 Q ss_pred --------------ccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeeccCCccee----- Q lcl|Aclame:pro 166 --------------TNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRNSDSL----- 226 (324) Q Consensus 166 --------------~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~~~~~l----- 226 (324) ........++.++|.+++.++-+.+..+..++|++.....|.++...++.++..+....++ T Consensus 160 ~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~Gg~~~~i~v~a~~k~~i~~~~~~~~~~i~~~~~~~~~g~~v~ 239 (317) T protein:vir:88 160 GANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNGGQANSIQTSSSIKKAISKNMKGRATEITLDASDNRIAQTVD 239 (317) T ss_pred ccCccccccCCCccccccccccccHHHHHHHHHHHHhcCCCCCEEEeChHHHHHHHHHhcCCceeEEEcccCeEEEEEEE Confidence 0111222467889999999999999999999999999999988865455555433322211 Q ss_pred -----ecceeEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEE Q lcl|Aclame:pro 227 -----DGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHI 301 (324) Q Consensus 227 -----~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v 301 (324) .| .|-+..+..++++.+++.|++++-+..-+++..+.... .-|.........++..+ T Consensus 240 ~~~tdfG-~v~ii~~r~lp~~~~~~~D~~~~~l~~Lr~~~~e~laK-----------------tGd~~k~~i~~E~tLe~ 301 (317) T protein:vir:88 240 VYESDFG-KYTIRANRWFHENTLFVFDPKMHSLCYLRPFFQHELAK-----------------TGDSEKRQLLVEYTFRV 301 (317) T ss_pred EEEeCCe-EEEEEeCCCCCCCeEEEEcccccceeecccceeeccCC-----------------CcccceeEEEEEEEEEE Confidence 12 12334455678889999999888766655554442221 11334466778889999 Q ss_pred eccCceEEEEeecCCC Q lcl|Aclame:pro 302 ADDKAFAKLVPADKRT 317 (324) Q Consensus 302 ~~~~A~~~l~~~~~~~ 317 (324) .+++|.+++++.+++= T Consensus 302 ~N~~a~a~i~~l~~~~ 317 (317) T protein:vir:88 302 NNEKSGALIRDVVAQL 317 (317) T ss_pred cCccceeEEEEecccC Confidence 9999999999988776 No 168 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=98.66 E-value=3.2e-08 Score=61.66 Aligned_cols=301 Identities=11% Similarity=0.071 Sum_probs=154.5 Q ss_pred CchhHHHHHHHHHHHhhhh--hH-HhhccccccccccCccccchHHHHHHHHHHHhh-hhhhhhcceeecC-CCceEEEE Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNV--KP-QVFNPDNVMMHEKKDGTLMNEFTTPILQEVMEN-SKIMQLGKYEPME-GTEKKFTF 75 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~--~~-~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~-s~l~~l~~~~~~~-~~~~~ip~ 75 (324) |.--+..++.|.+...... .+ +.....-+.++++.+.++-....+.+++.-... ..+.+.+++-+.+ -...+... T Consensus 331 ~~L~elAr~~L~~~G~~~~~~~~~~~v~~A~~hsTsDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~ 410 (652) T protein:vir:79 331 MTLREYARMSLTERGIGVSSYNPMQMVGAAFTHSTSDFGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDFKIAHRVG 410 (652) T ss_pred ccHHHHHHHHHHhhccCCCCCCHHHHHHHHhhcCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHhccCCCccccccceee Confidence 3222222222222111000 00 111111112344555555444444444443332 2356666554432 22233344 Q ss_pred EeCCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHh---ccCc Q lcl|Aclame:pro 76 WADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL---NQGN 152 (324) Q Consensus 76 ~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~---G~g~ 152 (324) ..+-++..-|.|++++......=+..++...++|..+.||||++-.-+.+...-|-+.++++.++.+++.++. +... T Consensus 411 lg~~~~L~~V~E~gEyk~~t~~e~~e~~~l~tyG~~~~iTRqaiINDDL~a~~~ip~~~g~aA~~~~~~~vy~~l~~Np~ 490 (652) T protein:vir:79 411 MGGFSALRQVREGAEYKYVTTGDKQATIALATYGELFSITRQAIINDDLNMLTDVPMKLGRAAKSTIADLVYAILTSNPK 490 (652) T ss_pred cCCCCCccccCCCCccceeeecCccceeeeecccCeeeeehheeeccchhHHHHHHHHHHHHHHHHHHHHHHHHHhcCcc Confidence 4566777889999999887776677899999999999999999886678999999999999999999976553 2221 Q ss_pred cc-cccccc-cccccccccccchhhhhHHHHHHHHhh---hh----cCCCcEEEEcHHHHHHHHHhhccCCceee--ccC Q lcl|Aclame:pro 153 NP-FGKSIA-QSIEKTNKVIKGDFTQDNIIDLEALLE---DD----ELEANAFISKTQNRSLLRKIVDPETKERI--YDR 221 (324) Q Consensus 153 ~~-~~~~~~-~~~~~~~~~~~~~~~~~~i~~~~~~l~---~~----~~~~~~~v~~~~~~~~l~~~~d~~g~~~~--~~~ 221 (324) -. .+..+. .+... +...++.++.+.+..+...+. +. ...+..|++.+.......++..+...+-- ..+ T Consensus 491 ~~~DGk~LF~hA~H~-Nl~~~aa~~~~~l~~ar~aM~~Qk~g~~~l~i~P~~llvp~~le~~a~~ll~s~~v~~a~~~~~ 569 (652) T protein:vir:79 491 ISTDNVSLFDKAKHA-NVLESAAMDVASLDKARQLMRVQKEGERHLNIRPAFVLVPTAMESVANQVIRSSSVKGADINAG 569 (652) T ss_pred cccCCceeecccccc-cccccccCCHHHHHHHHHHHHHhccCCccccccccEEEecchhHHHHHHHhccCCCcccccccc Confidence 11 122222 11111 111122344444444433332 11 23456788888877666665433211110 111 Q ss_pred Ccceeecc-eeEeecCCCCCCcee-EEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEecc Q lcl|Aclame:pro 222 NSDSLDGL-PVVNLKSSNLKRGEL-ITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVAL 299 (324) Q Consensus 222 ~~~~l~G~-pv~~~~~~~~~~~~~-i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~ 299 (324) ....+.|. .+++.+.......+. ++.+... ...+++. ++.....+.-....-|..|-+.+|++..+|. T Consensus 570 ~~Np~~~~~~~i~eprL~~~s~~~wylaa~~~-----~dtiev~-----yL~G~~~P~ie~~~gf~~dG~~~kvrlD~G~ 639 (652) T protein:vir:79 570 IINPVKDFATVIAEPRLDDNSQTTFYLAASKG-----SDTIEVA-----YLNGVDTPYIDQMEGFSVDGVTTKVRIDAGV 639 (652) T ss_pred cccccccccccccccccCCCCcccEEEecCCC-----CCeEEEE-----EecCCCCCeeeecCCCCcceEEEEEEEeccC Confidence 22334443 334333222212111 1221110 0011221 1211111111111238999999999999999 Q ss_pred EEeccCceEEEEe Q lcl|Aclame:pro 300 HIADDKAFAKLVP 312 (324) Q Consensus 300 ~v~~~~A~~~l~~ 312 (324) +++|-..++|.|- T Consensus 640 ~~iD~RG~~k~t~ 652 (652) T protein:vir:79 640 APVDHRGLVKCTA 652 (652) T ss_pred ceeeccceeeecC Confidence 9999999998877 No 169 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=98.62 E-value=4.2e-09 Score=66.49 Aligned_cols=224 Identities=10% Similarity=0.031 Sum_probs=140.9 Q ss_pred hHHhhcccccccccc-CccccchHHHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCCcceeeeccCccccccccc Q lcl|Aclame:pro 20 KPQVFNPDNVMMHEK-KDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTE-KKFTFWADKPGAYWVGEGQKIETSKAT 97 (324) Q Consensus 20 ~~~~~~~~~~~~~~~-~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~-~~ip~~~~~~~a~~v~Eg~~~~~~~~~ 97 (324) .+. . +.+..+-.+ +--+-|......|+|.+.+.++|+...+.+..+... ....+.++-|.+.|..=++.++.++.+ T Consensus 1 m~~-~-~~~a~TL~e~AKr~~~d~~~~~IIE~l~~tn~IL~~lpf~e~N~~tg~~t~vrt~LP~~~fR~lN~g~~~s~~t 78 (330) T protein:vir:10 1 MAT-L-STNNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVRTGLPTPTWRKLYGGVLPNKSS 78 (330) T ss_pred CCc-C-CCCcccHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhhccCCcccceeEEeecCCchhhhcCCccccccce Confidence 000 0 011111111 222345667778999999999999998887554333 223566788999999999999999999 Q ss_pred eeeEEeeheeeEEeeeehHHHhhcCh--HHHHHHHHHHHHHHHHHHHHHHHHhccCccccc--ccc-------------- Q lcl|Aclame:pro 98 WVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFG--KSI-------------- 159 (324) Q Consensus 98 ~~~v~l~~~k~~~~~~iS~e~l~ds~--~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~--~~~-------------- 159 (324) +.+++-..+-+++.+.|-+.+.+.+. -++...-.+...+++.+++...+|+|+.+..+. .|+ T Consensus 79 t~qvt~~l~ilgg~~eVDr~la~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGD~a~~p~~F~GL~kR~~~~ta~~~~q 158 (330) T protein:vir:10 79 TAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAENKDN 158 (330) T ss_pred EEEEEEEeEEecchhhhhhHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCChhhccchhhhcCCCCCCchhh Confidence 99999999999999999998887643 344555777799999999999999996431110 000 Q ss_pred -cc--cc----------------------------------c--cc---------------------------------- Q lcl|Aclame:pro 160 -AQ--SI----------------------------------E--KT---------------------------------- 166 (324) Q Consensus 160 -~~--~~----------------------------------~--~~---------------------------------- 166 (324) .. +. . +. T Consensus 159 vIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~~~dg~gg~y~~~~~~~~w~~Gl~i~d~r~vvRI 238 (330) T protein:vir:10 159 VIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARV 238 (330) T ss_pred eeeccccccCceEEEEEEEcCCeEEEEcccCccccceeeeccceeeecccCCCCceeEEeeeeeeeeeeEEeCcccEEEE Confidence 00 00 0 00 Q ss_pred -cc-------cccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhh-ccCCcee-e---ccCCcceeecceeEe Q lcl|Aclame:pro 167 -NK-------VIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIV-DPETKER-I---YDRNSDSLDGLPVVN 233 (324) Q Consensus 167 -~~-------~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~-d~~g~~~-~---~~~~~~~l~G~pv~~ 233 (324) +. .+.+....+.++.+...++...+...+|+||...+.+|++.. +.+...+ . .+.....+.|+||.. T Consensus 239 ~NIdvs~l~~~~~~~~li~lm~~A~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~k~n~~l~~~~~~g~~~t~~~gipir~ 318 (330) T protein:vir:10 239 CNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQR 318 (330) T ss_pred eecccccCCCCccHHHHHHHHHHHHHhccCCCCCcceeeechHHHHHHHHHHhhcccceeeeeecCCeeeEEECCeEEEE Confidence 00 000001112334444677777778889999999999998753 3322111 1 122345689999987 Q ss_pred ecCCCCCCceeE Q lcl|Aclame:pro 234 LKSSNLKRGELI 245 (324) Q Consensus 234 ~~~~~~~~~~~i 245 (324) +.+.--.+..++ T Consensus 319 ~Dail~tE~~vv 330 (330) T protein:vir:10 319 TDALLNTESRVV 330 (330) T ss_pred EeeeecCccccC Confidence 754433333332 No 170 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=98.56 E-value=3.2e-08 Score=61.67 Aligned_cols=287 Identities=7% Similarity=0.007 Sum_probs=155.9 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccc--hHHHHHHHHHHHhhhhhhhhcceee---cCCCceEEEE Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLM--NEFTTPILQEVMENSKIMQLGKYEP---MEGTEKKFTF 75 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp--~~~~~~i~~~~~~~s~l~~l~~~~~---~~~~~~~ip~ 75 (324) ||=.- ++.. ......+.+.....+.+.+++. +.+...|++...+.-..++++.... ..-.++.+++ T Consensus 3 ~~~~~----~~~~-----~~~~~~~~~~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~~~~et~~~~~ 73 (314) T protein:vir:10 3 IKFDA----EQAK-----ITTHLEQMGVEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIPGHAKYFEYPE 73 (314) T ss_pred cchHH----HHHH-----HHHHHHhhcccchhhhHHHHHHHHHHHHHHHhhhhccccccceeeccccCCCCceeEEEeee Confidence 22110 0000 0011111111122222333433 3455667777776666677665432 2223556667 Q ss_pred EeCCcceeeeccCc-cccccccceeeEEeeheeeEEeeeehHHHhhcC---hHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|Aclame:pro 76 WADKPGAYWVGEGQ-KIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) Q Consensus 76 ~~~~~~a~~v~Eg~-~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds---~~~~~~~i~~~l~~ai~~~~d~~~l~G~g 151 (324) ......+.|++..+ .+|..+..++......+.++..+.++..-++.+ ..++..--....++++...+|+.+|+|+. T Consensus 74 ~e~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~ 153 (314) T protein:vir:10 74 FDGVGIAQIIADYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLVWSGSA 153 (314) T ss_pred eccccceeeeCCcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecc Confidence 66677788988765 589889999999999999999999987655544 46778888888999999999999999975 Q ss_pred ccccccccccccccc-----cccccchhhhhHHHHHHHHhhhh---cCCCcEEEEcHHHHHHHHHhhccCCceeec---- Q lcl|Aclame:pro 152 NNPFGKSIAQSIEKT-----NKVIKGDFTQDNIIDLEALLEDD---ELEANAFISKTQNRSLLRKIVDPETKERIY---- 219 (324) Q Consensus 152 ~~~~~~~~~~~~~~~-----~~~~~~~~~~~~i~~~~~~l~~~---~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~---- 219 (324) ..... |+.+..... ...++..--++||.+++.++... ...+..++++|+.+..|...-+..|.-++. T Consensus 154 ~~g~~-GLlN~p~v~~~~~~~~WaT~~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~~L~~~~~~~~~tvl~~l~~ 232 (314) T protein:vir:10 154 PHGIV-SVFDQPNINNVVATPNWSVPQNAIDDVTAMIDAVESSTQGLHHVTDILLPASARRVMQGLVPQTNLSYGELFTR 232 (314) T ss_pred cccce-eEeecCCCccccCCCCcccHHHHHHHHHHHHHHHHHhcCccccceeEEecHHHHHhhcccccCCCccHHHHHHH Confidence 44322 222221111 11112223367888888888643 245678999999998886544433433321 Q ss_pred cCCcceeecceeEeecCCCCC-CceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCc--EEEEEEEE Q lcl|Aclame:pro 220 DRNSDSLDGLPVVNLKSSNLK-RGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDM--VALRATMH 296 (324) Q Consensus 220 ~~~~~~l~G~pv~~~~~~~~~-~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~--v~~r~~~r 296 (324) ...+-++.+.|-.. ..... ...+++...+ .+.+.+.+......... +... ..+....| T Consensus 233 n~~~l~I~~~~el~--~ag~~g~~~~v~y~~~------~~~~~~~vp~~~~~l~~-----------e~~~~~~~~~~~~r 293 (314) T protein:vir:10 233 NNPGLTIRFLQFLD--NYDGAGGKAALAFEKS------PLNMSIEIPEVTNVLPA-----------QPKDLHFRYPVTSK 293 (314) T ss_pred hCCCcEEEEccccc--ccCCCcceEEEEEecC------CcEEEEecCccceeecc-----------eecCceEEEcceee Confidence 12233444444322 11111 1111111111 11122222111111100 1111 12334455 Q ss_pred e-ccEEeccCceEEEEeecCC Q lcl|Aclame:pro 297 V-ALHIADDKAFAKLVPADKR 316 (324) Q Consensus 297 ~-d~~v~~~~A~~~l~~~~~~ 316 (324) + |..+.+|.|++++++.+-. T Consensus 294 ~~Gv~i~~P~ai~~~dGI~~~ 314 (314) T protein:vir:10 294 ATGLIVYRPLTMAVIKGITFA 314 (314) T ss_pred eEEEEEECcceeEeeeeeecC Confidence 5 6888999999999988777 No 171 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=98.49 E-value=9.2e-08 Score=59.16 Aligned_cols=281 Identities=10% Similarity=0.017 Sum_probs=141.1 Q ss_pred ccccCccccchHHHHHHHHHHHhhhhhhhhcce-e--ec---CCCceEEEEEeCCcceee-----eccCcccccccccee Q lcl|Aclame:pro 31 MHEKKDGTLMNEFTTPILQEVMENSKIMQLGKY-E--PM---EGTEKKFTFWADKPGAYW-----VGEGQKIETSKATWV 99 (324) Q Consensus 31 ~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~-~--~~---~~~~~~ip~~~~~~~a~~-----v~Eg~~~~~~~~~~~ 99 (324) ++. ..++|+.|..++++.+++..++.+++.. + .. .+..++||+... ..+.+ .+++.++...+.+-+ T Consensus 1 Ma~--~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (392) T protein:vir:99 1 MAN--AFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP-SRGHTRKLRGAGAERNLTVSDFTED 77 (392) T ss_pred Ccc--ccccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeeccc-ccceeeeccccccCCcccccccccc Confidence 221 2478999999999999999998888732 2 22 255688887543 22322 234555666666666 Q ss_pred eEEeeh-eeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhhhH Q lcl|Aclame:pro 100 NATMRA-FKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDN 178 (324) Q Consensus 100 ~v~l~~-~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (324) .+++.. +..+..+.|+++-...+..++.+.+.++..++++.++|..++.--..... ..............++. T Consensus 78 ~~~~~id~~k~~~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~------~~~~~~~~~~~~~~~~~ 151 (392) T protein:vir:99 78 SFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPY------EAAGAVHEVAPDEFFKG 151 (392) T ss_pred eEEEEEeeeeecceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccc------cccccccccChhhhHHH Confidence 666666 44566677888766666788999999999999999999987742111100 00111122233456888 Q ss_pred HHHHHHHhhhhcCCCc-EEEEcHHHHHHHHHhh-----ccCC---ceeeccCCcceeecceeEeecCCCCCCceeEEeec Q lcl|Aclame:pro 179 IIDLEALLEDDELEAN-AFISKTQNRSLLRKIV-----DPET---KERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDF 249 (324) Q Consensus 179 i~~~~~~l~~~~~~~~-~~v~~~~~~~~l~~~~-----d~~g---~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~ 249 (324) +.++...|.....+.. .++++|..+..|.+.. +..| ...+..+..+++.|++|+.+...+ .+..+.+.. T Consensus 152 i~~a~~~L~~~~vP~~R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~~~~--~~t~~a~~~ 229 (392) T protein:vir:99 152 VNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIP--HGDAYLYHP 229 (392) T ss_pred HHHHHHHHhhcCCCCCCEEEEcHHHHHHHhcccceeecccccchhhhhhhcceeeeeeeeEEEeecccc--cccceeeec Confidence 9999999987766543 5789999988876431 1112 233556777899999998876543 333333333 Q ss_pred ccEEEEEecceEEEEeeccceeccccc-cccchh----hhhcCcEEEEEEEEeccEEeccC---ceE---EEEeecCCCC Q lcl|Aclame:pro 250 DKLIYGIPQLIEYKIDETAQLSTVKNE-DGTPVN----LFEQDMVALRATMHVALHIADDK---AFA---KLVPADKRTD 318 (324) Q Consensus 250 s~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~----~f~~~~v~~r~~~r~d~~v~~~~---A~~---~l~~~~~~~~ 318 (324) +.+..+........-............ ...... -+..+...+. ...+....... ++. .++..+..-. T Consensus 230 ~a~~~at~a~v~~~~~~~~~s~s~~~~v~~~~~~~~~~t~~s~~~~v~--~~~g~~~v~~~~~~~~~~~~~~~~~~~~v~ 307 (392) T protein:vir:99 230 TAFIMATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLID--TYFGLKVVEDPNGVGFVRARKIHLIPGSIE 307 (392) T ss_pred cccccccccccccccccceeEEecccceecceeecccceeeccccccc--eeEEEEEEeeccccceeeeeeeeeecceee Confidence 322222211111000000000000000 000000 0001111110 01111111100 111 0111100000 Q ss_pred CCC------------CCC Q lcl|Aclame:pro 319 SVP------------GEV 324 (324) Q Consensus 319 ~~~------------~~~ 324 (324) ..| |+- T Consensus 308 v~~v~~~~~~~~~~~~~~ 325 (392) T protein:vir:99 308 VAPEAGANATITAAAGED 325 (392) T ss_pred eeeeecccceeEeeeccc Confidence 001 111 No 172 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=98.45 E-value=2.5e-08 Score=62.28 Aligned_cols=225 Identities=13% Similarity=0.079 Sum_probs=136.7 Q ss_pred hHHhhcccccccccc-CccccchHHHHHHHHHHHhhhhhhhhcceeecCCCc-eEEEEEeCCcceeeeccCccccccccc Q lcl|Aclame:pro 20 KPQVFNPDNVMMHEK-KDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTE-KKFTFWADKPGAYWVGEGQKIETSKAT 97 (324) Q Consensus 20 ~~~~~~~~~~~~~~~-~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~-~~ip~~~~~~~a~~v~Eg~~~~~~~~~ 97 (324) .+. . +.+..+-.+ +--+-+......|+|.+.+.+.|+...+.+..+... ....+.++-|.+.|..=++.++.++.+ T Consensus 1 m~~-~-~~~a~TL~E~Akr~~~d~~~~~IIE~l~~tneIL~~lpf~e~N~~tg~~~~vrt~LP~~~fR~lN~g~~~s~~t 78 (335) T protein:vir:73 1 MAL-I-GQTLPSLLDIYNRTDKNGRIARIVEQLAKTNDILTDAIYVPCNDGSKHKTTIRAGIPEPVWRRYNQGVQPTKTQ 78 (335) T ss_pred CCc-C-CCCchhHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhcccCCcccceeEEEecCCchhhhcCCccccccce Confidence 000 0 000111111 111335566677999999999999998887654333 223566788999999999999999999 Q ss_pred eeeEEeeheeeEEeeeehHHHhhcCh--HHHHHHHHHHHHHHHHHHHHHHHHhccCccccc--ccc-------------- Q lcl|Aclame:pro 98 WVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFG--KSI-------------- 159 (324) Q Consensus 98 ~~~v~l~~~k~~~~~~iS~e~l~ds~--~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~--~~~-------------- 159 (324) +.+++-..+-+++.+.|-+.+.+.+. -++...-.+...+++.+++...+|+|+.+..+. .|+ T Consensus 79 t~qvt~~l~ilgg~~eVDr~La~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGDsa~~p~~FdGL~kR~~~~st~~a~~ 158 (335) T protein:vir:73 79 TVPVTDTTGMLYDLGFVDKALADRSNNAAAFRVSENMGKLQGFNNKVARYSIYGNTDAEPEAFMGLAPRFNTLSTSKAAS 158 (335) T ss_pred EEEEEEEEEEecchhhhhHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCcCCChhhccchhhhhcCccccccCc Confidence 99999999999999999998777543 344555666789999999999999996432110 000 Q ss_pred ----ccccc----------------------------------------------------------------------- Q lcl|Aclame:pro 160 ----AQSIE----------------------------------------------------------------------- 164 (324) Q Consensus 160 ----~~~~~----------------------------------------------------------------------- 164 (324) ..+.+ T Consensus 159 a~~iIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~d~~G~~y~~~~~~~~w~~Gl~i~d~r~vvR 238 (335) T protein:vir:73 159 AENVFSAGGSGSTNTSIWFMSWGENTAHMIYPEGMVAGFQHEDLGDDLVSDGNGGQFRAYRDEFKWDIGLSVRDWRSISR 238 (335) T ss_pred ccceeeccccccCceEEEEEEEcCCeeEEEcccCccccceeeeccceeeecCCCCEEeEEEeeeeeeeeeEEeCcccEEE Confidence 00000 Q ss_pred ---cc-cccccchhhhhHHHHHH-H-----HhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeec-----cCCcceeecc Q lcl|Aclame:pro 165 ---KT-NKVIKGDFTQDNIIDLE-A-----LLEDDELEANAFISKTQNRSLLRKIVDPETKERIY-----DRNSDSLDGL 229 (324) Q Consensus 165 ---~~-~~~~~~~~~~~~i~~~~-~-----~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~-----~~~~~~l~G~ 229 (324) .. +.......+..+|.+++ . .++.......+|+||...+.+|++.........+. +.....+.|+ T Consensus 239 I~NIdvs~l~~d~~~~~~l~~lmi~a~~~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~l~~~~~~g~~~t~~~gi 318 (335) T protein:vir:73 239 ICNIDVTTLTKDASTGADLISMMVDAYYARDVAMLGDGKEVIYANKTIHAWLHKQAMNAKNVNLTIEEYGGKKIVSFLGI 318 (335) T ss_pred EeecccccccccccchhhHHhhHHHHHHHHhccCCCCCceEEEechHHHHHHHHHHhccCceeeeeeccCCceeEEECCe Confidence 00 00000011123343333 1 33443444578999999999998765444433332 2233568899 Q ss_pred eeEeecCCCCCCceeEE Q lcl|Aclame:pro 230 PVVNLKSSNLKRGELIT 246 (324) Q Consensus 230 pv~~~~~~~~~~~~~i~ 246 (324) ||..+.+.--.+..+.. T Consensus 319 pir~~Dail~tE~~v~~ 335 (335) T protein:vir:73 319 PIRRVDAILNTESAVTA 335 (335) T ss_pred EEEEEeeeecCcccccC Confidence 99877543333332222 No 173 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=98.42 E-value=3.3e-07 Score=56.12 Aligned_cols=294 Identities=9% Similarity=0.033 Sum_probs=158.6 Q ss_pred CchhHHHHHHHHHHHh-hhhhHHhhcccccccc-ccCccccc---hHHHHHHHHHHHhhhhhhhhccee---ecCCCceE Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFAS-NNVKPQVFNPDNVMMH-EKKDGTLM---NEFTTPILQEVMENSKIMQLGKYE---PMEGTEKK 72 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~-~~~~~~~~~~~~~~~~-~~~~~~vp---~~~~~~i~~~~~~~s~l~~l~~~~---~~~~~~~~ 72 (324) |--+ =++++|+.-.. ......-.+....... +..+.++- +.+...|++...+.-..++++... +..-.++. T Consensus 1 ~~~~-~~~~~~~~d~~~~~~~a~~~~~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~~~~~~~t 79 (329) T protein:vir:79 1 MRGN-IMSKEMKYDEFEANVIANHMQLRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELSDTDKTFE 79 (329) T ss_pred Cccc-hhhhhhccchhhhhhHhhhcccccceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCCCceeEEE Confidence 2211 11112222110 0111111111111111 22233443 345677888777777777777643 33334566 Q ss_pred EEEEeCCcceeeeccC-ccccccccceeeEEeeheeeEEeeeehHHHhhcC---hHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 73 FTFWADKPGAYWVGEG-QKIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEAFYKKFDEAGIL 148 (324) Q Consensus 73 ip~~~~~~~a~~v~Eg-~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds---~~~~~~~i~~~l~~ai~~~~d~~~l~ 148 (324) +++......+.|++.+ ..+|..+..++......+.++..+.++..-++.+ ..++..--....++++...+|+-+|+ T Consensus 80 ~~~~~~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~ 159 (329) T protein:vir:79 80 YQTFDKVGHAKIIADYTDDLSTVDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVNHLVFK 159 (329) T ss_pred eeeeecceeeeeecCcccccceeecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEe Confidence 6777677778898775 4788888888888888889999999987655544 46788888888999999999999999 Q ss_pred ccCccccccccccc--cccccccc---------cchhhhhHHHHHHHHhhhh--c-CCCcEEEEcHHHHHHHHHhhccCC Q lcl|Aclame:pro 149 NQGNNPFGKSIAQS--IEKTNKVI---------KGDFTQDNIIDLEALLEDD--E-LEANAFISKTQNRSLLRKIVDPET 214 (324) Q Consensus 149 G~g~~~~~~~~~~~--~~~~~~~~---------~~~~~~~~i~~~~~~l~~~--~-~~~~~~v~~~~~~~~l~~~~d~~g 214 (324) |++..... |+++. ........ +..-.++||.+++.++... + ..+..++++|+.+..|.......| T Consensus 160 G~~~~g~~-GLlN~p~v~~~~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~Lpp~~~~~L~~~~~~~~ 238 (329) T protein:vir:79 160 GSKPHKII-SVFEHPNLTTINSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILIPPSMRKVLMVRMPETT 238 (329) T ss_pred ecccccce-eeecCCCccccccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHHhhcccCCCC Confidence 97643322 22221 11111111 1122357888888888643 2 345789999999998865444444 Q ss_pred ceeec----cCCcceeecceeEeecCCCCCCceeEEeecc-c-EEEEEecceEEEEeeccceeccccccccchhhhhcCc Q lcl|Aclame:pro 215 KERIY----DRNSDSLDGLPVVNLKSSNLKRGELITGDFD-K-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDM 288 (324) Q Consensus 215 ~~~~~----~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s-~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~ 288 (324) .-++. ...+-++.+.|-... ....+...+++.+.+ . +-+.....++.... +... T Consensus 239 ~tvl~~lk~~~~~l~I~~~~el~~-ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~~-------------------q~~~ 298 (329) T protein:vir:79 239 MSYLDYFKQQNGGITIESISELED-IDGAGTKAALVYEKDPMNMSIEIPEAFNMLTA-------------------QPKD 298 (329) T ss_pred ccHHHHHHHhCCCcEEEEcccccc-cCCCCceEEEEEecCCceEEEecCcceeeeec-------------------eecC Confidence 33321 112233444443221 111122222222221 1 11111122221110 1111 Q ss_pred --EEEEEEEEe-ccEEeccCceEEEEeecCC Q lcl|Aclame:pro 289 --VALRATMHV-ALHIADDKAFAKLVPADKR 316 (324) Q Consensus 289 --v~~r~~~r~-d~~v~~~~A~~~l~~~~~~ 316 (324) ..+....|+ |..+.+|.||+++++...+ T Consensus 299 ~~~~v~~~~r~~Gv~i~~P~ai~~~dGI~~~ 329 (329) T protein:vir:79 299 LHFKVPCTSKCTGLTIYRPLTLVLIKGLVVG 329 (329) T ss_pred ceEEEceeeeEEEEEEECcceeeeeeeeeeC Confidence 123334455 5788899999999988877 No 174 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=98.22 E-value=1.6e-06 Score=52.32 Aligned_cols=276 Identities=10% Similarity=-0.018 Sum_probs=137.5 Q ss_pred cccccCccccchHHHHHHHHHHHhhhhhhhhccee-----ecCCCceEEEEEeCCcceeeeccCccccccccceeeEEee Q lcl|Aclame:pro 30 MMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYE-----PMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMR 104 (324) Q Consensus 30 ~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~-----~~~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~ 104 (324) +.......+-|+.|..++++.+++..++.+++..- .-.+.+++||+.. ..-+.++..+...+.+-.++++. T Consensus 1 m~~~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~----~~~v~dg~~~~~~~~te~~v~l~ 76 (418) T protein:vir:10 1 MAVQDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPY----RVKSASGRTLVKQPMVDQTIPFK 76 (418) T ss_pred CCccccccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCC----ceeecccCCccccccccceEEEE Confidence 23333445569999999999999999998887441 1124678888733 22234455555555555565665 Q ss_pred h-eeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhhhHHHHHH Q lcl|Aclame:pro 105 A-FKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLE 183 (324) Q Consensus 105 ~-~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~ 183 (324) . +..+..+.++++-...+..++.+.+.+....+++..+|..++.---. ..............+++++++. T Consensus 77 id~~k~~~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~~~---------a~~~~gt~gt~~~~~~~i~~a~ 147 (418) T protein:vir:10 77 IAYQEHVGLEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALTLKK---------AFHSSGTPGVRPGAFIDFANAG 147 (418) T ss_pred EecccccceeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhh---------cccccccCCcCcchHHHHHHHH Confidence 5 33455667777665566789999999999999999999987631100 0001111111223589999999 Q ss_pred HHhhhhcCCC---cEEEEcHHHHHHHHHhhcc----C-CceeeccCCcceeecceeEeecCCCC-CCceeEEeecccEEE Q lcl|Aclame:pro 184 ALLEDDELEA---NAFISKTQNRSLLRKIVDP----E-TKERIYDRNSDSLDGLPVVNLKSSNL-KRGELITGDFDKLIY 254 (324) Q Consensus 184 ~~l~~~~~~~---~~~v~~~~~~~~l~~~~d~----~-g~~~~~~~~~~~l~G~pv~~~~~~~~-~~~~~i~gd~s~~~~ 254 (324) ..|.....+. -..+++|..+..|.+-... . ....+..+..+++.|+.|+.+.+.+. ..+.. .+ +..+. T Consensus 148 ~~Ld~~~VP~~G~R~lVv~P~~~~~L~~~~~~~~~~~~~~~~lr~G~IG~i~GF~V~~S~nip~~tag~~-~~--t~~v~ 224 (418) T protein:vir:10 148 AKQTTYAVPQDGMRHAVLDPFTCASLSDEVTKLFKESMVEQAYKMGYRGNVAAYEVYESQNLPKHTVGDH-GG--TPLVN 224 (418) T ss_pred HHHHhcCCCCCCceEEEeCHHHHHHHhhhccccccccccchhhheeeeeeeeceEEEEecCCCccccccc-cc--ceeee Confidence 9998888763 2458999988877532211 1 12234566778899999998766542 11110 00 01111 Q ss_pred EE-ecceEEEEeeccceecccccccc---chhhhhc---------CcEEEEEEEEeccEEeccCceEEEEeecC------ Q lcl|Aclame:pro 255 GI-PQLIEYKIDETAQLSTVKNEDGT---PVNLFEQ---------DMVALRATMHVALHIADDKAFAKLVPADK------ 315 (324) Q Consensus 255 ~~-~~~~~~~~~~~~~~~~~~~~~~~---~~~~f~~---------~~v~~r~~~r~d~~v~~~~A~~~l~~~~~------ 315 (324) +- ..+-.+.++-........-..|. +-..|.- +...|++..-..- ...+-..|++.++ T Consensus 225 ga~~~~~~~~~~~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~~~~~---~~~~~~tv~i~p~~~~~~~ 301 (418) T protein:vir:10 225 GTVVNGDTVGFDGGTASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLEDVDT---DAGGAGSIKISPSLNDGTA 301 (418) T ss_pred cccccceeEEEeecceeeccceeeccEEEECceeecccccccccccceEEEEEeeccc---cccCcceeEeccccccccc Confidence 11 11112211110000000000000 0000000 1111222211000 0000111222111 Q ss_pred ---------CCCCCCCCC Q lcl|Aclame:pro 316 ---------RTDSVPGEV 324 (324) Q Consensus 316 ---------~~~~~~~~~ 324 (324) .++.++.-| T Consensus 302 ~~~~~~~~~~~~~~~~~v 319 (418) T protein:vir:10 302 TINNENGDPVSLTAYQNV 319 (418) T ss_pred cccccccccccccCCCcc Confidence 111222233 No 175 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=98.21 E-value=5.6e-07 Score=54.87 Aligned_cols=281 Identities=10% Similarity=-0.007 Sum_probs=122.7 Q ss_pred hccccccccccCccccchHHHHHHHHHHHhhhhhhhhcc-------eeecCCCceEEEEEeCCc----ceeeeccCcccc Q lcl|Aclame:pro 24 FNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGK-------YEPMEGTEKKFTFWADKP----GAYWVGEGQKIE 92 (324) Q Consensus 24 ~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~-------~~~~~~~~~~ip~~~~~~----~a~~v~Eg~~~~ 92 (324) +.- .+ -...++.....++|.+.+...+.+... ..++.+.-+++|-+..-. +..-+.+.+.++ T Consensus 1 m~l------sD-~~vfN~~~~~a~~e~~~q~~~~fn~as~gai~l~~~~~~Gd~~~~pf~~~l~g~~~~~~~~~~~~~vt 73 (325) T protein:vir:95 1 MAL------SD-LAVYSEYAYSAFSETLRQQVDLFNTATGGAIMLQSAAHQGDFSDVAFFAKVTGGLVRRRNAYGSGTVA 73 (325) T ss_pred Cch------hh-hhhhhhhhhhhhhhhhhhhHhhhhhcccceeEeccccccCceeeccccccccccccccccCCCCceec Confidence 110 00 011244455555555554444333321 123445556677664321 222344445555 Q ss_pred ccccc-eeeEEeeheeeEEeeeehHHHhh---cChHHHHHHHHHHHHHHHHHHHHHHHHhccCcc-cccccccccccccc Q lcl|Aclame:pro 93 TSKAT-WVNATMRAFKLGVILPVTKEFLN---YTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNN-PFGKSIAQSIEKTN 167 (324) Q Consensus 93 ~~~~~-~~~v~l~~~k~~~~~~iS~e~l~---ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~-~~~~~~~~~~~~~~ 167 (324) ..+.+ ...+......-.+.+....+.+. +....+...|.+.++++..+.+-+.+|.+..+. ...........+.. T Consensus 74 ~~kitt~~~~av~~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a~~~~~~~v~dis~~~ 153 (325) T protein:vir:95 74 EKVLKHLVDTSVKVAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYSALSQVSDVVYDATANT 153 (325) T ss_pred cceeccccceeeEEecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccceeeeeccc Confidence 55544 33444443333333333333222 122333444444444444333333333222111 00001111111111 Q ss_pred ccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceeeccCC---cceeecceeEeecCCCCCCcee Q lcl|Aclame:pro 168 KVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDRN---SDSLDGLPVVNLKSSNLKRGEL 244 (324) Q Consensus 168 ~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~~~~~---~~~l~G~pv~~~~~~~~~~~~~ 244 (324) ......+++..+.++..++.+....-..|+||..++..|.+....+...++.... .++++|++|++..+++....-. T Consensus 154 ~~~~~~~s~~~l~~A~~klGD~~~~l~~~~MHS~v~~~L~~~~L~~~~~~~~~~g~~~i~t~~G~~VIVdD~~p~~~~g~ 233 (325) T protein:vir:95 154 DAADKLPTWNNLNNGQAKFGDQSSQIAAWIMHSTPMHKLYGSNLTNGERLFTYGTVNVVRDPFGKLLVMTDSPNLFAAGT 233 (325) T ss_pred CcccccccHHHHHHHHHHhcccccceeEEEEchHHHHHHHHhhccccccccccCCcccccccCCcEEEEeCCCCCCCccC Confidence 1122235778899999999998888899999999999998766555433332221 2478999999988776543210 Q ss_pred EEeecccEEEEE-ecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCCCCC Q lcl|Aclame:pro 245 ITGDFDKLIYGI-PQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGE 323 (324) Q Consensus 245 i~gd~s~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~~~~ 323 (324) +..+..++. .+.+.+..........+.+. + -..-...+|.+. --++||..+..-+ +..+...+-.| T Consensus 234 ---~~~ytty~lg~GAi~~~~~~~~~~~~~~~~-~-----~~~~~~~~~~~~---tf~lhp~G~sw~~-s~~g~sPt~ae 300 (325) T protein:vir:95 234 ---PNVYHILGLVPGGVLIGQNNDFDANEETKN-G-----DENIIRTYQAEW---SYNIGVKGFAWDK-ANGGKSPTDAA 300 (325) T ss_pred ---ceeEEEEEEecCeEEecCCCCccccccccC-c-----ccceeeeeeeee---eEEeecceeeeec-ccccCCcChHh Confidence 001111222 22333222111111111110 0 011122233221 1367888888732 22222222233 Q ss_pred C Q lcl|Aclame:pro 324 V 324 (324) Q Consensus 324 ~ 324 (324) . T Consensus 301 L 301 (325) T protein:vir:95 301 L 301 (325) T ss_pred h Confidence 3 No 176 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=98.06 E-value=3.7e-06 Score=50.34 Aligned_cols=279 Identities=10% Similarity=0.036 Sum_probs=139.9 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcce---------eecCCCce Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKY---------EPMEGTEK 71 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~---------~~~~~~~~ 71 (324) |-+- +..+. -....+|+.+..=+.+.-.+.+.|++..-. ...++... T Consensus 1 M~~~--------------------~~~T~----l~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v 56 (367) T protein:vir:80 1 MPDF--------------------NNQVR----LVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLI 56 (367) T ss_pred Ccch--------------------hhhhh----hhhccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEE Confidence 1100 00000 011234544433333333344444333211 23467778 Q ss_pred EEEEEeCC-cceeeeccCc---cccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 72 KFTFWADK-PGAYWVGEGQ---KIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGI 147 (324) Q Consensus 72 ~ip~~~~~-~~a~~v~Eg~---~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l 147 (324) ++|.+..- ....-+.+.. .++..+.+-.+..-..+.++.....++-.-.-+..|.++.|.+++++.-.+...+.+| T Consensus 57 ~iPf~~~L~g~~~n~~~d~~~~~~t~~kittg~~~a~v~~r~kaw~~~Dla~~lsG~dpm~~Ia~qva~yW~r~~q~~Ll 136 (367) T protein:vir:80 57 NIPFWRDLDSLEPNYGSDNPNVEAPIDGLGSGEMKTTKTWLNKAYGAMDLTAELAGSNPMTRIRNRFGVYWTRQWQRRII 136 (367) T ss_pred EeeeeccCCCCccccCCCCCcccccccccccchheeeeehhcccchhhhHHHHhhCchHHHHHHHHHHHHhhhhhHHHHH Confidence 89998643 3333333332 3444555555544555556666666665444456689999999999777766665544 Q ss_pred h---cc----Ccccc----------------ccccccccccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHH Q lcl|Aclame:pro 148 L---NQ----GNNPF----------------GKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRS 204 (324) Q Consensus 148 ~---G~----g~~~~----------------~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~ 204 (324) . |- +.+.. ..................++++.+.++..++.+....-+.++||+.++. T Consensus 137 a~L~Gvf~~~~a~~~~~~~~~~~~~a~~~~~~~~~~~Dis~~t~~~~~~~s~~~~~~A~~~lGD~~~~l~~i~mHS~V~~ 216 (367) T protein:vir:80 137 AMAVGVYKSNLAGNFATIKTRGRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYK 216 (367) T ss_pred HHHHHhhccccccchhhhhhhhccccccccccCceeeeeeccCCCccceecHHHHHHHHHHhccccccccEEEEchHHHH Confidence 2 21 11100 0000011111111123457788899999999998888899999999999 Q ss_pred HHHHhhc------cCCceeeccCCcceeecceeEeecCCCCCC----c---eeEEeecccEEEEEecc-eEEEEeeccce Q lcl|Aclame:pro 205 LLRKIVD------PETKERIYDRNSDSLDGLPVVNLKSSNLKR----G---ELITGDFDKLIYGIPQL-IEYKIDETAQL 270 (324) Q Consensus 205 ~l~~~~d------~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~----~---~~i~gd~s~~~~~~~~~-~~~~~~~~~~~ 270 (324) .|++++- ++| ...-++++|++|++..+++... + +.+||.- .+.++.... ..++++|+... T Consensus 217 ~L~~~~li~~i~~sd~-----~~~i~ty~G~~VIvDD~~Pv~~~~a~~~yttYlfg~G-Ai~~~~~~~~~~~E~~Rd~~~ 290 (367) T protein:vir:80 217 RMTNNDEIEFIPDSKG-----QLTIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGGA-AFGYADGAPQVPVAVGRRELR 290 (367) T ss_pred HHHhccccccccCCCC-----ccccceecceeEEEeCCCcccccCCCceEEEEEEecc-eeeecccCCccceecccchhh Confidence 9987652 222 2345789999999988877632 1 2244432 111222211 12344443310 Q ss_pred eccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecC-C-------------CCCCCCCC Q lcl|Aclame:pro 271 STVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK-R-------------TDSVPGEV 324 (324) Q Consensus 271 ~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~-~-------------~~~~~~~~ 324 (324) .+. .++--+..+.| .++||..|....-.-. + ...+..|. T Consensus 291 ---~~~---------gG~d~L~~Rr~---~~~hP~G~s~~~~~v~~~~~~~~~~~~~~~~~sPt~~eL 343 (367) T protein:vir:80 291 ---GNG---------SGLEYILERKE---WIVHPGGFNWLDADVTIPDNTGSPSGITSGPPAITLANL 343 (367) T ss_pred ---hcC---------CceEEEEeeee---EEeecceeeecccccccccccccccccccccCCCChHHh Confidence 000 11122333333 5678888776543211 1 11222222 No 177 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=97.57 E-value=4.4e-05 Score=44.49 Aligned_cols=279 Identities=10% Similarity=0.032 Sum_probs=133.2 Q ss_pred ccccc-CccccchH-HHHHHH-HHHHhhhhhhhhcce---------eecCCCceEEEEEeCC-c--ceeeeccC--cccc Q lcl|Aclame:pro 30 MMHEK-KDGTLMNE-FTTPIL-QEVMENSKIMQLGKY---------EPMEGTEKKFTFWADK-P--GAYWVGEG--QKIE 92 (324) Q Consensus 30 ~~~~~-~~~~vp~~-~~~~i~-~~~~~~s~l~~l~~~---------~~~~~~~~~ip~~~~~-~--~a~~v~Eg--~~~~ 92 (324) +..+. ....+|+. +...++ +.-.+.+.|.+..-. ...++..+++|.+..- . +..+-+.+ ...+ T Consensus 1 Ma~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~D~~~~~~t 80 (349) T protein:vir:78 1 MAITTIGDIVTGNIPVLASYMTEDPVEKTAFFDSGILTSTPYAAEIANGPSNIANLPFWKAIDTSIEPNYSNDVYQDIAT 80 (349) T ss_pred CCceEEeeeeccCHHHHHHHHHHhhHHhhhhhhccceeccHHHHHHhhcCCCEEEeeeeecCCCCcccccCCCCcccccc Confidence 22222 33345542 344433 333344444443211 2245677889988642 2 22222332 2344 Q ss_pred ccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHh---cc-Ccccccccccc--ccccc Q lcl|Aclame:pro 93 TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL---NQ-GNNPFGKSIAQ--SIEKT 166 (324) Q Consensus 93 ~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~---G~-g~~~~~~~~~~--~~~~~ 166 (324) ..+.+-.+..-..+..+.....++=.-.-+..|.++.|.+++++...+...+.+|. |- +.+........ ..... T Consensus 81 ~~kitt~~~~a~~~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~a~~~~~~~~~~t~ 160 (349) T protein:vir:78 81 PRAIQTGEMMARVAYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHEQNDMVV 160 (349) T ss_pred cccccccceeeeeeeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcccccccchhhhccccee Confidence 45555554444444455555555422233445889999999998877776665542 21 11110000000 00011 Q ss_pred cccccchhhhhHHHHHHHHhhhhc-----CCCcEEEEcHHHHHHHHHhhccCCc-eeeccCCcceeecceeEeecCCCCC Q lcl|Aclame:pro 167 NKVIKGDFTQDNIIDLEALLEDDE-----LEANAFISKTQNRSLLRKIVDPETK-ERIYDRNSDSLDGLPVVNLKSSNLK 240 (324) Q Consensus 167 ~~~~~~~~~~~~i~~~~~~l~~~~-----~~~~~~v~~~~~~~~l~~~~d~~g~-~~~~~~~~~~l~G~pv~~~~~~~~~ 240 (324) ...+...+++..+.++..++.+.. -.-+.++||+.++..|++.+--..- +.-......+++|++|++..++++. T Consensus 161 d~s~~a~~~~~~~~dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~li~~i~~s~~~~~i~ty~G~~VivDD~~Pv~ 240 (349) T protein:vir:78 161 DVSATLGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLIDFIRDAENNTMFATYQGYRVIVDDSMTVV 240 (349) T ss_pred eeccccCCChhhhhhhHHHHHHHhccccccceeEEEEchHHHHHHHhhhhhhhccCcccCcccceecCeEEEEeCCCccc Confidence 111223456777888888887753 3346899999999999865432211 1111234578999999998888764 Q ss_pred Cc-------eeEEeecccEEEEEecc-eEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEe Q lcl|Aclame:pro 241 RG-------ELITGDFDKLIYGIPQL-IEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVP 312 (324) Q Consensus 241 ~~-------~~i~gd~s~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~ 312 (324) .. +.+||.- .+.++.... ..+++.++..... ..++-.+..+.++ ++||..+...+. T Consensus 241 ~~g~~~~yttylfg~G-Ai~~~~~~~~~~~et~rd~~~g~------------~~G~d~l~~R~~~---~~hp~G~s~~~a 304 (349) T protein:vir:78 241 GQGAQRKFISIIFGQG-AIGYGEGNPVMPLEYEREASRAN------------GGGVETLWTRKTW---LLHPFGYRFTSA 304 (349) T ss_pred cCCCCceEEEEEeecc-eEEEccCCCccceeeecccccCC------------cceeEEEEEeeEE---Eeeeeeeeeccc Confidence 32 2245421 112232221 1244444331100 0112223333332 456777766543 Q ss_pred ecCC-------CCCCCCCC Q lcl|Aclame:pro 313 ADKR-------TDSVPGEV 324 (324) Q Consensus 313 ~~~~-------~~~~~~~~ 324 (324) .... ...+..|. T Consensus 305 ~v~~~~~~~~~~sPt~aeL 323 (349) T protein:vir:78 305 VITGNGTETIARSASWQDL 323 (349) T ss_pred cccCCccccccCCCChHHh Confidence 3221 12222333 No 178 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=97.52 E-value=1.7e-05 Score=46.74 Aligned_cols=289 Identities=11% Similarity=0.117 Sum_probs=151.2 Q ss_pred HHhhccccc--cccc--cCccccch-HHHHHHHHHHHhhhhhhhhcceeecC---CCceEEEEEeCCcce------eeec Q lcl|Aclame:pro 21 PQVFNPDNV--MMHE--KKDGTLMN-EFTTPILQEVMENSKIMQLGKYEPME---GTEKKFTFWADKPGA------YWVG 86 (324) Q Consensus 21 ~~~~~~~~~--~~~~--~~~~~vp~-~~~~~i~~~~~~~s~l~~l~~~~~~~---~~~~~ip~~~~~~~a------~~v~ 86 (324) --.+++... .++. +.+--+.. -+....+..+++.-.+.+++...|++ +.++++-+...-+.+ +..+ T Consensus 1 ~~~~~a~~~~~~~s~~g~~~~~~~t~y~~~k~L~~Aa~~lv~~~fA~~~piPkn~GkTIk~r~y~pl~~~~~pl~eGv~a 80 (401) T protein:vir:95 1 MLNYNAPTDGQKSSIDGANSDQMQTFFWLKKAIITARKEQYFMPLASVTNMPKHYGKTIKVYEYVPLLDDRNINDQGIDA 80 (401) T ss_pred CCccCCCcccccccccccccceeeehhhHHHHHhhhhhhhhhhhcccccccccccCCeEEEEecccccccccchhcCCCc Confidence 011222211 1111 12222333 23355566666677788888888775 344444333222221 1223 Q ss_pred cCccc-----------------------------cccccceeeEEeeheeeEEeeeehHHHhh-cChHHHHHHHHH-HHH Q lcl|Aclame:pro 87 EGQKI-----------------------------ETSKATWVNATMRAFKLGVILPVTKEFLN-YTYSQFFEEMKP-MIA 135 (324) Q Consensus 87 Eg~~~-----------------------------~~~~~~~~~v~l~~~k~~~~~~iS~e~l~-ds~~~~~~~i~~-~l~ 135 (324) +|.++ .....+-..+..+.+++|.+..+|+++.. ++++.+.+++.+ .|. T Consensus 81 ~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~~l~~h~s~ell~ 160 (401) T protein:vir:95 81 SGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDDGLMEHLSRELMN 160 (401) T ss_pred ccccccCccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcchHHHHHHHHHHhh Confidence 33322 01122334566678999999999998877 456777776533 333 Q ss_pred HHH---HHHHHHHHHhccCcccc-ccccccccccccccccchhhhhHHHHHHHHhhhhc-------------C-----CC Q lcl|Aclame:pro 136 EAF---YKKFDEAGILNQGNNPF-GKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDE-------------L-----EA 193 (324) Q Consensus 136 ~ai---~~~~d~~~l~G~g~~~~-~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~-------------~-----~~ 193 (324) -+. ...+-+.+|++.++.-. +....-+.......+.+.++++++..+...|..+. . .. T Consensus 161 g~~~~t~d~i~~dll~ag~~viyAg~ats~At~~~~~~~~t~vt~~~l~rl~~~L~~nRapk~t~~i~~s~~~dTk~i~~ 240 (401) T protein:vir:95 161 GATQITEAVLQKDLLAAAGTVLYAGAATSDATITGEGSTPSVVSYKNLMRLDQILTENRTPTQTTIITGSRMIDTKVIGA 240 (401) T ss_pred hhhhhHHHHHHHHHHhhcCeeecCCccceeeeccccccccceechhHHHHHHHHHHhcccccchhhhhhhhccCcccccc Confidence 333 33444567754433211 11122222333444556778888888877776311 1 11 Q ss_pred c-EEEEcHHHHHHHHHhhccCCceeec------------cCCcceeecceeEeecCCC--------CCC----------- Q lcl|Aclame:pro 194 N-AFISKTQNRSLLRKIVDPETKERIY------------DRNSDSLDGLPVVNLKSSN--------LKR----------- 241 (324) Q Consensus 194 ~-~~v~~~~~~~~l~~~~d~~g~~~~~------------~~~~~~l~G~pv~~~~~~~--------~~~----------- 241 (324) + .-+||+..-..|+.++|..|.+-|. .+..+.+.++.++.++.+- ... T Consensus 241 s~va~~h~~L~~di~a~~D~~~~~~fi~v~kYa~~~~i~~gEiG~i~~vR~i~~p~~~~w~~ag~~a~~~~~~y~~~~~~ 320 (401) T protein:vir:95 241 TRVMYVGSELVPELKAMKDLFGNKAFIETQHYADAGTIMNGEVGSIDKFRIIQVPEMLHWAGAGAQATGANPGYRTSMVS 320 (401) T ss_pred ceEEEEecCchhHHHHHHHhcCCCCceehhhcCCccccccccccccCceeEEecccceeecCCccccccccccccccccc Confidence 2 2588999999999898887776653 2345566777777654311 000 Q ss_pred --------ceeEEeecccEEEEEecce-----EEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceE Q lcl|Aclame:pro 242 --------GELITGDFDKLIYGIPQLI-----EYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFA 308 (324) Q Consensus 242 --------~~~i~gd~s~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~ 308 (324) ..+++|+-.+..++..++- .+.+.+-.. ..-..+.+ |-|...+.++ +++++.+++++-++ T Consensus 321 ~gg~~dVyp~lV~G~dAf~~~~l~g~g~~~~~~~ivk~pG~---~~ad~~DP--lgQ~g~vgwK--~~~a~~vL~~e~m~ 393 (401) T protein:vir:95 321 GQEHYDVYPMLVVGDDSFTSIGFQTDGKSLKFTVMTKMPGK---ETADRNDP--YGETGFSSIK--WYYGILVKRPERLA 393 (401) T ss_pred CCCcceeeeeeEEccccceecccccCCccccceeEeecCCc---CCCCCCCc--ccceehhhhh--hhhhhheeccceeE Confidence 1235666555444443321 222222110 00011222 2344555554 56688999999999 Q ss_pred EEEeecCC Q lcl|Aclame:pro 309 KLVPADKR 316 (324) Q Consensus 309 ~l~~~~~~ 316 (324) +|+-++.- T Consensus 394 ~ies~a~~ 401 (401) T protein:vir:95 394 LIKTVAPL 401 (401) T ss_pred EEEeecCC Confidence 99887766 No 179 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=97.46 E-value=6.2e-05 Score=43.64 Aligned_cols=278 Identities=9% Similarity=-0.037 Sum_probs=123.0 Q ss_pred ccccCccccchHHHHHHHHHHHhhhhhhhhcce-ee----c--CCCceEEEEEeCCcc---eeeeccCcccccccccee- Q lcl|Aclame:pro 31 MHEKKDGTLMNEFTTPILQEVMENSKIMQLGKY-EP----M--EGTEKKFTFWADKPG---AYWVGEGQKIETSKATWV- 99 (324) Q Consensus 31 ~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~-~~----~--~~~~~~ip~~~~~~~---a~~v~Eg~~~~~~~~~~~- 99 (324) +.++-..++|+.+.+++++.+++..++.+++.. .. . .+.+++||+-..... ..+-..+. +.++..-. T Consensus 1 MANsl~~l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~~~d~~~~~~t~~--~~~~l~e~~ 78 (423) T protein:vir:10 1 MANNLDANVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFKSERTMDGDITGK--SKNSLISAK 78 (423) T ss_pred CccccccccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCceeeecccCcccCcc--cccccccce Confidence 444455689999999999999999999998754 21 1 256677776432111 11111111 11222222 Q ss_pred -eEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhhhH Q lcl|Aclame:pro 100 -NATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDN 178 (324) Q Consensus 100 -~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (324) .+++..+|... +.++++-...+..++++++ +.-.++++..+|..+.......... ...... .....+++ T Consensus 79 v~l~id~~k~~a-~~v~d~E~~l~i~~~~~~l-~~A~~aLA~~vd~~ia~~~~~~~~~-----~vgt~~---t~~~a~~~ 148 (423) T protein:vir:10 79 ATGEVGNYITVA-VEYRQIEEALKLNQLDQIL-VPINERMVTDLETELALFMMKHGAL-----SLGSPN---TPIKKWSD 148 (423) T ss_pred EEEEecceeeee-eeeChHHHhcChhHHHHHH-HHHHHHHHHHHHHHHHHHhhhcccc-----cccccc---cccccHHH Confidence 44555555444 4565544445677887654 5557899999999886322111110 001100 11124788 Q ss_pred HHHHHHHhhhhcCCC--cEEEEcHHHHHHHHH----hhcc--CCceeeccCC-cceeecceeEeecCCCC-CCcee-EEe Q lcl|Aclame:pro 179 IIDLEALLEDDELEA--NAFISKTQNRSLLRK----IVDP--ETKERIYDRN-SDSLDGLPVVNLKSSNL-KRGEL-ITG 247 (324) Q Consensus 179 i~~~~~~l~~~~~~~--~~~v~~~~~~~~l~~----~~d~--~g~~~~~~~~-~~~l~G~pv~~~~~~~~-~~~~~-i~g 247 (324) ++++-..|.....+. ...+++|..+..|.+ +... .+..-+..+. .+++.|+.++.+...+. .+++. ..+ T Consensus 149 ~a~a~~~L~~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~~~i~G~~~GFdi~~Sn~vp~~T~g~~~ga~ 228 (423) T protein:vir:10 149 VAQTASFLKDLGINSGENYAVMDPWAAQRLADAQSGLHVSEQLVRTAWENAQISGNFGGIRALMSNGLASRTQGAFGGKL 228 (423) T ss_pred HHHHHHHHhhccCCcCCCEEEeCHHHHHHHhhhhhhhccccccchHHHHhcccceeecceEEEEecCCccccccccccee Confidence 999988888777654 467999999888742 2221 1222234443 47899999988765442 12211 000 Q ss_pred ecccEEEEEe--------cceEEEEeec---cceecccc---ccccchhhhhcC---------cEEEEEEEEeccEEecc Q lcl|Aclame:pro 248 DFDKLIYGIP--------QLIEYKIDET---AQLSTVKN---EDGTPVNLFEQD---------MVALRATMHVALHIADD 304 (324) Q Consensus 248 d~s~~~~~~~--------~~~~~~~~~~---~~~~~~~~---~~~~~~~~f~~~---------~v~~r~~~r~d~~v~~~ 304 (324) -.+....... .......... .++..+.. .+-..++...++ ...|++.. |....-+ T Consensus 229 ~~~~~~~vt~a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~aGv~~v~~~tk~~l~~~~~~~~~~~~V~~--~~~~~a~ 306 (423) T protein:vir:10 229 TVKGTPEVNYDSVKDSYAFTATLTGATASKKGFLKVGDQLQFDDTHWLNQQSKQTLYNGASALSFTATVME--DANAHSS 306 (423) T ss_pred eeeeeeEEEecccccccccccceeeccceeceeEEecceEeecceeeecccccceeecccCCcceEEEEEe--ccccccc Confidence 0000000000 0000000000 00000000 000000000000 01111111 1111111 Q ss_pred CceEEEEeecCCCCCCCCC-----C Q lcl|Aclame:pro 305 KAFAKLVPADKRTDSVPGE-----V 324 (324) Q Consensus 305 ~A~~~l~~~~~~~~~~~~~-----~ 324 (324) .++. |++..+. -.+.+. | T Consensus 307 ~~~t-v~i~p~~-~~~~~~~~~~~V 329 (423) T protein:vir:10 307 GDVT-VKISGVP-IFDAGYPQYNAV 329 (423) T ss_pred CceE-EEecccc-ccccCcccccce Confidence 2221 2221111 001111 1 No 180 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=97.43 E-value=7e-05 Score=43.38 Aligned_cols=279 Identities=10% Similarity=0.039 Sum_probs=131.3 Q ss_pred ccccc-CccccchH-HHHHHH-HHHHhhhhhhhhcce---------eecCCCceEEEEEeCC-ccee--eeccC--cccc Q lcl|Aclame:pro 30 MMHEK-KDGTLMNE-FTTPIL-QEVMENSKIMQLGKY---------EPMEGTEKKFTFWADK-PGAY--WVGEG--QKIE 92 (324) Q Consensus 30 ~~~~~-~~~~vp~~-~~~~i~-~~~~~~s~l~~l~~~---------~~~~~~~~~ip~~~~~-~~a~--~v~Eg--~~~~ 92 (324) +..+. ....+|+. +...++ +.-.+.+.|.+..-. ...++...++|.+..- .+.. +-+.. ...+ T Consensus 1 Ma~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~qSGii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~~~dt~~~~~t 80 (349) T protein:vir:94 1 MAITTIGNIVTGNIPVLASYMTEDPVEKTAFFNSGILTPTPYAAEIARGPSNIANLPFWKAIDTSIEPNYSNDVYQDIAT 80 (349) T ss_pred CCceEEeeeeccChHHHHHHHHHhHHHhhhhhhccceeccHHHHHHHhcCCCEEEeeeeecCCCCcccccCCCCcccccc Confidence 11222 33345542 344433 333344545443211 2245677889987642 2222 22222 2344 Q ss_pred ccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHh---cc-Cccccccc--cccccccc Q lcl|Aclame:pro 93 TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL---NQ-GNNPFGKS--IAQSIEKT 166 (324) Q Consensus 93 ~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~---G~-g~~~~~~~--~~~~~~~~ 166 (324) ..+.+-.+.....+-.+.....++=.-.-+..|.++.|.+++++...+...+.+|. |- +.+..... ........ T Consensus 81 ~~kit~~~~~a~~~~r~kaw~~~Dla~~lsG~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~~~~~~~~~~~~~~ 160 (349) T protein:vir:94 81 PRAIQTGEMMARVAYLNEGFGQADLTVELTSQNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHEQNDMVV 160 (349) T ss_pred cccccccceeeeeeeeccccchhHHHHHhhCchHHHHHHHHHHHHHhhHHHHHHHHHHHhhhcccccccccccccCceeE Confidence 45554444333333344444444322223445788999999998888877765553 21 11100000 00000011 Q ss_pred cccccchhhhhHHHHHHHHhhhhc-----CCCcEEEEcHHHHHHHHHhhccCCc-eeeccCCcceeecceeEeecCCCCC Q lcl|Aclame:pro 167 NKVIKGDFTQDNIIDLEALLEDDE-----LEANAFISKTQNRSLLRKIVDPETK-ERIYDRNSDSLDGLPVVNLKSSNLK 240 (324) Q Consensus 167 ~~~~~~~~~~~~i~~~~~~l~~~~-----~~~~~~v~~~~~~~~l~~~~d~~g~-~~~~~~~~~~l~G~pv~~~~~~~~~ 240 (324) ...+...++...+.++..++.+.. -.-+.++||+.++..|++++.-..- +.-....-.+++|++|++..++++. T Consensus 161 d~~~~a~~~~~~~~~A~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~~li~~i~~s~~~~~i~ty~G~~VivDD~~Pv~ 240 (349) T protein:vir:94 161 DVSATSGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLIDFIRDAENNTMFATYQGYRVIVDDSMTVV 240 (349) T ss_pred EecccCCCChhhHHHHHHHHHHHhccccccceeEEEEchHHHHHHHhcchhhhccCcccCcccceecCcEEEEeCCCccc Confidence 112223456677888888877653 2346799999999999876432211 1111224578999999998888763 Q ss_pred Cc-------eeEEeecccEEEEEec-ceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEe Q lcl|Aclame:pro 241 RG-------ELITGDFDKLIYGIPQ-LIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVP 312 (324) Q Consensus 241 ~~-------~~i~gd~s~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~ 312 (324) .. +.+||.- .+.++... ...+++.|+..... . .++-.+..+.+ .++||..+...+- T Consensus 241 ~~g~~~~yttylfg~G-Ai~~~~~~~~~~~E~~rd~~~g~---~---------~G~d~L~~R~~---~~~hp~G~s~~~a 304 (349) T protein:vir:94 241 GQDTSRKFISIIFGQG-AIGYGEGNPEMPLEYEREASRAN---G---------GGVETLWTRKT---WLLHPFGYSFTSA 304 (349) T ss_pred cCCCCceEEEEEeecc-eEEeecCCCCcceeeecccccCC---c---------ceeEEEEEeeE---EEeeeeeeeeccc Confidence 31 1245421 22233322 12344444331100 0 11122333333 2466777766543 Q ss_pred ecCC-------CCCCCCCC Q lcl|Aclame:pro 313 ADKR-------TDSVPGEV 324 (324) Q Consensus 313 ~~~~-------~~~~~~~~ 324 (324) .... ...+-.|. T Consensus 305 ~v~~~~~~~~~~sPt~aeL 323 (349) T protein:vir:94 305 VITGNGTETIARSASWQDL 323 (349) T ss_pred ccCCCccccccCCCChHHh Confidence 2221 12223333 No 181 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=97.43 E-value=3.7e-05 Score=44.87 Aligned_cols=313 Identities=11% Similarity=0.039 Sum_probs=151.8 Q ss_pred CchhHHHHHHHHHHHhhhhhHHh---hccccc---cccccCccccchHHHHHHHHHHHhhhh--hhhhcceeecCCCceE Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQV---FNPDNV---MMHEKKDGTLMNEFTTPILQEVMENSK--IMQLGKYEPMEGTEKK 72 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~---~~~~~~---~~~~~~~~~vp~~~~~~i~~~~~~~s~--l~~l~~~~~~~~~~~~ 72 (324) |.+++++.+- ++.--+.+.+.. +.+..- .+-++++++--+.+..+|......... +.+-..+.+..+-..+ T Consensus 1 ~~~~~~~~~~-~~~~~~~~~e~~~KS~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a~STV~~ 79 (463) T protein:vir:95 1 MTIEKNLSDV-QQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVK 79 (463) T ss_pred CCcccccchH-HHHHHhhhhHHHHHHhhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhh Confidence 6555554332 222223333332 222111 112235555555555554444332222 2333334455444444 Q ss_pred EEEEeC---CcceeeeccCccccccccceeeEEeeheeeEEeeeehHHH-hhcChHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 73 FTFWAD---KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEF-LNYTYSQFFEEMKPMIAEAFYKKFDEAGIL 148 (324) Q Consensus 73 ip~~~~---~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~-l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~ 148 (324) |-...+ ...+.+++|++..+.+++++...+..++=++....+|.-+ +.++..+.+....+.-.-.++..+|.++|+ T Consensus 80 y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~dai~~ia~tiE~a~Fy 159 (463) T protein:vir:95 80 YDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFY 159 (463) T ss_pred heeeeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHhh Confidence 444332 3457899999999999999999999999888888887733 345677899999999999999999999999 Q ss_pred ccCccc--------ccccccccccccccccc--chhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceee Q lcl|Aclame:pro 149 NQGNNP--------FGKSIAQSIEKTNKVIK--GDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERI 218 (324) Q Consensus 149 G~g~~~--------~~~~~~~~~~~~~~~~~--~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~ 218 (324) |+..=. ...|+.+.....+...+ ..++.+.|..+-..+...|..++-++|+..+.+.|..-.-..-|-+. T Consensus 160 Gds~l~~~~~~~gleFDGl~~lId~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~l~~qrv~~ 239 (463) T protein:vir:95 160 GDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLM 239 (463) T ss_pred hhhccCCCcCccccchhhhhhhcCCCCeeecCCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHHHHhcCceEEEE Confidence 974311 12233222222222222 34455666666677778888888899999999998755444444444 Q ss_pred ccCCcceeecceeEe--ecCCCC--C-----CceeEEeecccEEEEEecc--e--EEEEeeccceeccccccccchhhhh Q lcl|Aclame:pro 219 YDRNSDSLDGLPVVN--LKSSNL--K-----RGELITGDFDKLIYGIPQL--I--EYKIDETAQLSTVKNEDGTPVNLFE 285 (324) Q Consensus 219 ~~~~~~~l~G~pv~~--~~~~~~--~-----~~~~i~gd~s~~~~~~~~~--~--~~~~~~~~~~~~~~~~~~~~~~~f~ 285 (324) ....+....|+|+.- +....+ . .+..+++--....-.-... . ++...+..+. ....+ T Consensus 240 ~~N~~~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~il~~~~~~~p~ap~~~~~tatv~~~~~~~~--~~~~~-------- 309 (463) T protein:vir:95 240 QDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQKGAF--ENEED-------- 309 (463) T ss_pred cCCCCceeeeeeccceeeeeeeeeeCCceecCCcccccchhhcCCCCccCceeEEEEeeccCCCC--CCccc-------- Confidence 444444455665521 111000 0 0111111000000000000 0 1111111110 00001 Q ss_pred cCcEEEEEEEEeccEEeccCceEEEEeec------------CCCCCCCCCC Q lcl|Aclame:pro 286 QDMVALRATMHVALHIADDKAFAKLVPAD------------KRTDSVPGEV 324 (324) Q Consensus 286 ~~~v~~r~~~r~d~~v~~~~A~~~l~~~~------------~~~~~~~~~~ 324 (324) .....+++.+.-+..--.|+.++-.|.+. +....+|==+ T Consensus 310 ~a~~~Y~vv~~s~~geS~pS~ivtaT~a~~~~gv~l~It~~a~~~~~~~~v 360 (463) T protein:vir:95 310 RAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSINVNAMYQQQPQFV 360 (463) T ss_pred ccceEEEEEEECCCCCcccchheeeeeeeccceEEEEEEecCCcccceeEE Confidence 11112333333333333333333222221 1111111111 No 182 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=97.43 E-value=3.7e-05 Score=44.87 Aligned_cols=313 Identities=11% Similarity=0.039 Sum_probs=151.8 Q ss_pred CchhHHHHHHHHHHHhhhhhHHh---hccccc---cccccCccccchHHHHHHHHHHHhhhh--hhhhcceeecCCCceE Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQV---FNPDNV---MMHEKKDGTLMNEFTTPILQEVMENSK--IMQLGKYEPMEGTEKK 72 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~---~~~~~~---~~~~~~~~~vp~~~~~~i~~~~~~~s~--l~~l~~~~~~~~~~~~ 72 (324) |.+++++.+- ++.--+.+.+.. +.+..- .+-++++++--+.+..+|......... +.+-..+.+..+-..+ T Consensus 1 ~~~~~~~~~~-~~~~~~~~~e~~~KS~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a~STV~~ 79 (463) T protein:vir:99 1 MTIEKNLSDV-QQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVK 79 (463) T ss_pred CCcccccchH-HHHHHhhhhHHHHHHhhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhh Confidence 6555554332 222223333332 222111 112235555555555554444332222 2333334455444444 Q ss_pred EEEEeC---CcceeeeccCccccccccceeeEEeeheeeEEeeeehHHH-hhcChHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 73 FTFWAD---KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEF-LNYTYSQFFEEMKPMIAEAFYKKFDEAGIL 148 (324) Q Consensus 73 ip~~~~---~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~-l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~ 148 (324) |-...+ ...+.+++|++..+.+++++...+..++=++....+|.-+ +.++..+.+....+.-.-.++..+|.++|+ T Consensus 80 y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~dai~~ia~tiE~a~Fy 159 (463) T protein:vir:99 80 YDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFY 159 (463) T ss_pred heeeeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHhh Confidence 444332 3457899999999999999999999999888888887733 345677899999999999999999999999 Q ss_pred ccCccc--------ccccccccccccccccc--chhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceee Q lcl|Aclame:pro 149 NQGNNP--------FGKSIAQSIEKTNKVIK--GDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERI 218 (324) Q Consensus 149 G~g~~~--------~~~~~~~~~~~~~~~~~--~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~ 218 (324) |+..=. ...|+.+.....+...+ ..++.+.|..+-..+...|..++-++|+..+.+.|..-.-..-|-+. T Consensus 160 Gds~l~~~~~~~gleFDGl~~lId~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~l~~qrv~~ 239 (463) T protein:vir:99 160 GDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLM 239 (463) T ss_pred hhhccCCCcCccccchhhhhhhcCCCCeeecCCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHHHHhcCceEEEE Confidence 974311 12233222222222222 34455666666677778888888899999999998755444444444 Q ss_pred ccCCcceeecceeEe--ecCCCC--C-----CceeEEeecccEEEEEecc--e--EEEEeeccceeccccccccchhhhh Q lcl|Aclame:pro 219 YDRNSDSLDGLPVVN--LKSSNL--K-----RGELITGDFDKLIYGIPQL--I--EYKIDETAQLSTVKNEDGTPVNLFE 285 (324) Q Consensus 219 ~~~~~~~l~G~pv~~--~~~~~~--~-----~~~~i~gd~s~~~~~~~~~--~--~~~~~~~~~~~~~~~~~~~~~~~f~ 285 (324) ....+....|+|+.- +....+ . .+..+++--....-.-... . ++...+..+. ....+ T Consensus 240 ~~N~~~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~il~~~~~~~p~ap~~~~~tatv~~~~~~~~--~~~~~-------- 309 (463) T protein:vir:99 240 QDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQKGAF--ENEED-------- 309 (463) T ss_pred cCCCCceeeeeeccceeeeeeeeeeCCceecCCcccccchhhcCCCCccCceeEEEEeeccCCCC--CCccc-------- Confidence 444444455665521 111000 0 0111111000000000000 0 1111111110 00001 Q ss_pred cCcEEEEEEEEeccEEeccCceEEEEeec------------CCCCCCCCCC Q lcl|Aclame:pro 286 QDMVALRATMHVALHIADDKAFAKLVPAD------------KRTDSVPGEV 324 (324) Q Consensus 286 ~~~v~~r~~~r~d~~v~~~~A~~~l~~~~------------~~~~~~~~~~ 324 (324) .....+++.+.-+..--.|+.++-.|.+. +....+|==+ T Consensus 310 ~a~~~Y~vv~~s~~geS~pS~ivtaT~a~~~~gv~l~It~~a~~~~~~~~v 360 (463) T protein:vir:99 310 RAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSINVNAMYQQQPQFV 360 (463) T ss_pred ccceEEEEEEECCCCCcccchheeeeeeeccceEEEEEEecCCcccceeEE Confidence 11112333333333333333333222221 1111111111 No 183 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=97.36 E-value=8.4e-05 Score=42.93 Aligned_cols=272 Identities=12% Similarity=0.070 Sum_probs=126.0 Q ss_pred ccccCccccchHHHHHHHHHHHhhhhhhhhcce-ee--c----CCCceEEEEEeCCcceeee-ccCccccccccceeeEE Q lcl|Aclame:pro 31 MHEKKDGTLMNEFTTPILQEVMENSKIMQLGKY-EP--M----EGTEKKFTFWADKPGAYWV-GEGQKIETSKATWVNAT 102 (324) Q Consensus 31 ~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~-~~--~----~~~~~~ip~~~~~~~a~~v-~Eg~~~~~~~~~~~~v~ 102 (324) +.++-...||+.+..+.++.+++..++.+++.. .. . .+.+++||+.......... +.+..+..++..-.++. T Consensus 1 MAN~llT~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~~~~~~~~~~~~~e~~v~ 80 (423) T protein:vir:35 1 MANNLESNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSERTETGDITGKDKNGLFSAKAT 80 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCcceeecccCcCCCCccccccccceee Confidence 333323457999999999999999999988754 21 1 1556778875432222221 22233333444434444 Q ss_pred e--eheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhhhHHH Q lcl|Aclame:pro 103 M--RAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNII 180 (324) Q Consensus 103 l--~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 180 (324) + ..+|+ ..+.++++-...+..++++++... ..+++..+|..++..--.+.. +.... .......++++. T Consensus 81 l~id~~k~-~a~~v~d~e~~l~i~~~~~~l~~a-~~ala~~vd~~l~~~l~~~a~-----~~vgt---~~t~~~~~~~i~ 150 (423) T protein:vir:35 81 GKVGKYIT-VAVEWTQIEEALKLNQLDQILSPI-HERMVTDLETELAHFMMNNGA-----LSLGS---PNTAIKKWADVA 150 (423) T ss_pred EEecccee-ccceeCHHHHHhhHHHHHHHHHHH-HHHHHHHHHHHHHHHHhhccc-----ccccc---ccCCcchHHHHH Confidence 4 44443 455666665555677887766655 477899999888742111100 00011 111123578999 Q ss_pred HHHHHhhhhcCCC--cEEEEcHHHHHHHHH----hhccC--CceeeccCC-cceeecceeEeecCCCCCCceeEEeeccc Q lcl|Aclame:pro 181 DLEALLEDDELEA--NAFISKTQNRSLLRK----IVDPE--TKERIYDRN-SDSLDGLPVVNLKSSNLKRGELITGDFDK 251 (324) Q Consensus 181 ~~~~~l~~~~~~~--~~~v~~~~~~~~l~~----~~d~~--g~~~~~~~~-~~~l~G~pv~~~~~~~~~~~~~i~gd~s~ 251 (324) ++...|.....+. ...+++|..+..|.+ +...+ +...+..+. .+++.|+.|+.+.+.+.. +. |.+.. T Consensus 151 ~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnvp~~--T~--gt~~~ 226 (423) T protein:vir:35 151 QTASFIKDIGIKTGENYAIMDPWSAQRLADAQSGLHAADQLVRTAWENAQISGNFGGIRALMSNGLASR--KQ--GDFDG 226 (423) T ss_pred HHHHHHHHhcCCcCCCEEEeCHHHHHHHhccccceeccccchhHHHhhccceeeecceEEEEcCCCccc--cc--ccccc Confidence 9999998877764 356999999877642 11111 122233443 478999999887655421 11 11111 Q ss_pred EEEEEecceEEEE--ee---ccce--eccccccccchhhhhcCcEEEEEEEEeccEEeccCceEE-----------EEee Q lcl|Aclame:pro 252 LIYGIPQLIEYKI--DE---TAQL--STVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAK-----------LVPA 313 (324) Q Consensus 252 ~~~~~~~~~~~~~--~~---~~~~--~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~-----------l~~~ 313 (324) ...... +..+.. .. +.+. ...+-.+ +. .+-..|.+ ...|...+++..-.. .... T Consensus 227 ~~~v~~-a~~v~~~a~~~~~~~~~~~~~~~~~~-~g-~l~~GD~~-----t~aGv~~v~~~t~~~~~~~~t~~~~~~~V~ 298 (423) T protein:vir:35 227 AITVKT-APNVDYLSVKDSYQFTVALTGATPSK-TG-FLKAGDQL-----KFTSTHWLNQQSKQTLYNGSTAMSFTATVL 298 (423) T ss_pred ceeecc-ccccccccccccccceeeeeeeeecc-CC-cEEecceE-----EeeeeeeccccccceeecccCCceeEEEEe Confidence 111000 000000 00 0000 0000000 00 00011211 122222222111110 1111 Q ss_pred cCCCCCCCCCC Q lcl|Aclame:pro 314 DKRTDSVPGEV 324 (324) Q Consensus 314 ~~~~~~~~~~~ 324 (324) ++.....+|+. T Consensus 299 ~~~~~~a~g~~ 309 (423) T protein:vir:35 299 EETNSTASGDV 309 (423) T ss_pred ccccccccCce Confidence 11111122222 No 184 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=97.34 E-value=3.4e-05 Score=45.11 Aligned_cols=291 Identities=10% Similarity=0.020 Sum_probs=147.8 Q ss_pred CchhHHHHHHHHHHHhh-------hh---hHHhhccccc--cccccCccccchHHHH----HHHHHHHhhhhhhhhccee Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASN-------NV---KPQVFNPDNV--MMHEKKDGTLMNEFTT----PILQEVMENSKIMQLGKYE 64 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~-------~~---~~~~~~~~~~--~~~~~~~~~vp~~~~~----~i~~~~~~~s~l~~l~~~~ 64 (324) ||.-+.++. |+++-.. .. ..-.+.++.. ..++.+..-||..+.+ .+++.+.+......++... T Consensus 1 ~~~~~~~~~-l~~~gi~~~~~~~~~~~~~~~~~~da~d~~~~~~~~~~~~~~~~l~~~i~p~~~~~~~~~~~~~~l~pv~ 79 (336) T protein:vir:36 1 MRDAQRIQN-LARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGES 79 (336) T ss_pred CchHHHHHH-HhhcCeeecchhhhhhhHHHHhhhhhhhccCccccCCCcchHHHHHHhhccceEeeecchhhhhhhcccc Confidence 888777553 2322111 00 0000111111 1111122234443333 3445555555666666654 Q ss_pred ecCC---CceEEEEEeCCcceeeeccCccccccccceeeEEeeheeeEEeeeehH-HHhhcC--hHHHHHHHHHHHHHHH Q lcl|Aclame:pro 65 PMEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTK-EFLNYT--YSQFFEEMKPMIAEAF 138 (324) Q Consensus 65 ~~~~---~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~-e~l~ds--~~~~~~~i~~~l~~ai 138 (324) ..+. ....+++......+.+.+.+...|..+...+..+.+.+.++..+.++. |+.+.+ ..++.+--....++++ T Consensus 80 t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~E~~~Aa~~~~~l~~~Ka~aA~~al 159 (336) T protein:vir:36 80 KKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGL 159 (336) T ss_pred ccCCccceeEEEeeeeceeeEEEeeccCCCceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHHHHH Confidence 4322 334566666667788889999999999888888888999999999984 554432 3677777788888888 Q ss_pred HHHHHHHHHhccCccccccccccc--cc---cccc----cccchhhhhHHHHHHHHhhhhcC------CCcEEEEcHHHH Q lcl|Aclame:pro 139 YKKFDEAGILNQGNNPFGKSIAQS--IE---KTNK----VIKGDFTQDNIIDLEALLEDDEL------EANAFISKTQNR 203 (324) Q Consensus 139 ~~~~d~~~l~G~g~~~~~~~~~~~--~~---~~~~----~~~~~~~~~~i~~~~~~l~~~~~------~~~~~v~~~~~~ 203 (324) .+.+++-.++|+.....- |+.+. .. +..+ .++..-.++||.+++.+|...-. .+..++|.++.+ T Consensus 160 e~~~N~i~~~Gd~~~~~y-GllNdP~l~a~~t~~t~~~~~~t~~ei~~Di~~~~~~l~~qt~G~i~~~~~~tL~LP~~~~ 238 (336) T protein:vir:36 160 AKFLNGSYLFGVAGLENY-GLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAM 238 (336) T ss_pred HHhhCcEEEEeccccceE-EEEecCCCccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccEEEechHHH Confidence 888888888887543221 22221 11 0000 01113346888889988875432 356899999998 Q ss_pred HHHHHhhccCCceeec--cCC--cceeecceeEeecCCCCCCceeEEeecccEEEEEecc---eEEEEeeccceeccccc Q lcl|Aclame:pro 204 SLLRKIVDPETKERIY--DRN--SDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL---IEYKIDETAQLSTVKNE 276 (324) Q Consensus 204 ~~l~~~~d~~g~~~~~--~~~--~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~---~~~~~~~~~~~~~~~~~ 276 (324) ..|... ...|..++. ... .-++...|=. .. ++ |+...+++-...+ ..+.+......+..+ . T Consensus 239 ~~Ls~~-n~~g~Tvl~~lk~n~Pnl~i~t~pEl-----~~-a~----g~~~~l~~~~~~~~~t~~~~~p~~~~~l~vq-~ 306 (336) T protein:vir:36 239 SDLSKT-NQYGLAAAAKLKDIFPKLEFVTIPEY-----DT-AS----GRLVQLWAPRVEGKDTATCGFTEKMRAHSIE-R 306 (336) T ss_pred HhccCC-CccCccHHHHHHHhcCccEEEEcccc-----cc-CC----CceEEEEEEecCCCcceeeecchhhhcccee-e Confidence 888542 223332221 111 1122222211 11 11 1111222211111 111111111000000 0 Q ss_pred cccchhhhhcCcEEEEEEEEe-ccEEeccCceEEEEee Q lcl|Aclame:pro 277 DGTPVNLFEQDMVALRATMHV-ALHIADDKAFAKLVPA 313 (324) Q Consensus 277 ~~~~~~~f~~~~v~~r~~~r~-d~~v~~~~A~~~l~~~ 313 (324) ..-.....+..|+ |..+.+|.||+++++. T Consensus 307 --------~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:36 307 --------YSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred --------cCceeEeccccceeeeeeeccchheeeecC Confidence 0011222333444 5666779999999998 No 185 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=97.31 E-value=7.3e-05 Score=43.25 Aligned_cols=264 Identities=8% Similarity=-0.020 Sum_probs=135.8 Q ss_pred ccCc-cccc--hHHHHHHHHHHHhhhhhhhhcce---eecCCCceEEEEEeCCccee--eeccC-ccccccccceeeEEe Q lcl|Aclame:pro 33 EKKD-GTLM--NEFTTPILQEVMENSKIMQLGKY---EPMEGTEKKFTFWADKPGAY--WVGEG-QKIETSKATWVNATM 103 (324) Q Consensus 33 ~~~~-~~vp--~~~~~~i~~~~~~~s~l~~l~~~---~~~~~~~~~ip~~~~~~~a~--~v~Eg-~~~~~~~~~~~~v~l 103 (324) -++. +++. +.+.+.|.+...+.-..++++.+ .+..-.++.+...+....+. |++.+ ..+|..+..+++... T Consensus 1 ~~~lafl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~~d~~G~a~~~~i~~~a~dip~vd~~~~~~~~ 80 (304) T protein:vir:52 1 MSLLAYVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYGADEHGSLDDGLITVGTSTLDQVEVGFTPTRS 80 (304) T ss_pred CchHHHHHHHHHHHhhhhhccccccchhhhhccccCCCCcccceEEEeeeeccCcccccccCCcCCccceeecccceeEE Confidence 1222 3332 22334455544444455555543 33333456666665555666 87655 678988999999999 Q ss_pred eheeeEEeeeehHHHhhcCh---HHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccc--c--c------ Q lcl|Aclame:pro 104 RAFKLGVILPVTKEFLNYTY---SQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNK--V--I------ 170 (324) Q Consensus 104 ~~~k~~~~~~iS~e~l~ds~---~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~--~--~------ 170 (324) +.+.++..+.+|.+-++.+. .++..--.....+++...+|+.+++|+.......|+++....... . . T Consensus 81 ~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g~~GllN~p~v~~~~~~~~~a~~~w~ 160 (304) T protein:vir:52 81 YIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKDSRLTGLLNNKSVEVYAIKGAAQNTKVQ 160 (304) T ss_pred EEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeeccccceEEEEeCCCcceeeecCCccCCccc Confidence 99999999998876555432 456666666677788888899999997432222233332222111 0 0 Q ss_pred --cchhhhhHHHHHHHHhhhh--c-CCCcEEEEcHHHHHHHHHhhccCC-ceeec--cCCcceeecceeE--eec----- Q lcl|Aclame:pro 171 --KGDFTQDNIIDLEALLEDD--E-LEANAFISKTQNRSLLRKIVDPET-KERIY--DRNSDSLDGLPVV--NLK----- 235 (324) Q Consensus 171 --~~~~~~~~i~~~~~~l~~~--~-~~~~~~v~~~~~~~~l~~~~d~~g-~~~~~--~~~~~~l~G~pv~--~~~----- 235 (324) +..--.+||.+++.++... + ..+..+++.|+.+..|....-+++ .-++. ..+.....|.|+- ..+ T Consensus 161 ~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp~~~~~l~~~~~~~~~~Tvl~~l~~n~~~~~g~~l~I~~v~~~~~~ 240 (304) T protein:vir:52 161 AMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDSLDLAHLALVQRANTDTTALEFLTKHLSAAAGRQVAIKALPSNYGT 240 (304) T ss_pred cCCHHHHHHHHHHHHHHHHhccCceecCceEEeCHHHHHHHhhccCCCCCchHHHHHHHhcccccCCcceEEEecccccc Confidence 1111245666677776433 2 346689999999999865433332 22221 1111112333321 111 Q ss_pred CCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEE--EEEEEEe-ccEEeccCceEEEEe Q lcl|Aclame:pro 236 SSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVA--LRATMHV-ALHIADDKAFAKLVP 312 (324) Q Consensus 236 ~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~--~r~~~r~-d~~v~~~~A~~~l~~ 312 (324) ....+++.+++.+.+. +.+.+.+--..... ....++... +=++.|+ |..+.+|.|++++.- T Consensus 241 ~g~~g~~r~vvY~~d~------~~~~~~vP~p~~~l----------~~q~~~~~~~~vp~~~r~gGv~v~~P~a~~y~D~ 304 (304) T protein:vir:52 241 RVTDGKTRAMVYVNSK------EHVIFDVPMSPTVL----------DAQPKGLLAFESGLRMAFGGVTFMEPDSALYVDY 304 (304) T ss_pred cCCCCceEEEEEecCh------hheEEecCcccccc----------chhhcCCceEEecceeeeeeEEEEccceeeeecC Confidence 1111222233333322 11222210000000 012233322 2234444 566778999999988 No 186 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=97.28 E-value=3.5e-05 Score=45.02 Aligned_cols=290 Identities=8% Similarity=-0.022 Sum_probs=143.7 Q ss_pred CchhHHHHHHHHHHHhh-----------hhhHHhhcccc--ccccccCccccc----hHHHHHHHHHHHhhhhhhhhcce Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASN-----------NVKPQVFNPDN--VMMHEKKDGTLM----NEFTTPILQEVMENSKIMQLGKY 63 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~-----------~~~~~~~~~~~--~~~~~~~~~~vp----~~~~~~i~~~~~~~s~l~~l~~~ 63 (324) |.+..- ..|+++... ......+.+++ ...++.....|| +.+...|++...+.-..++++.. T Consensus 5 ~~~~~~--~~l~~~g~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~i~a~~~~~i~~~vy~~~~~~~~~~~l~pv 82 (339) T protein:vir:94 5 NDRTDI--KQLEKVGIIFDGYSPKSISSEVSAYAMDAVNLTPTLQTTANAGIPAWMTTFVDRRVIDIQLAPMAAAKIFPE 82 (339) T ss_pred chHHHH--HHHHhhceeeccchhhhcchhhHhhhccccccccccccccccchhhhhhhhhchhheeecccccchhhhccc Confidence 555433 234443220 11111111111 011111222233 33446667777777778888877 Q ss_pred eecCC---CceEEEEEeCCcceeeeccCccccccccc--eeeEEeeheeeEEeeeehH-HHhhc--ChHHHHHHHHHHHH Q lcl|Aclame:pro 64 EPMEG---TEKKFTFWADKPGAYWVGEGQKIETSKAT--WVNATMRAFKLGVILPVTK-EFLNY--TYSQFFEEMKPMIA 135 (324) Q Consensus 64 ~~~~~---~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~--~~~v~l~~~k~~~~~~iS~-e~l~d--s~~~~~~~i~~~l~ 135 (324) .+.+. .++++++.+....+.+.+.++..|..+.. +...++.....+- .++. |+-+. ...++.+--..... T Consensus 83 ~t~g~w~~~t~~y~~~e~~G~a~~ygd~ad~Pl~~~~v~~~~~~v~~~~~g~--~y~~~E~~~A~~~g~~l~~~Ka~aA~ 160 (339) T protein:vir:94 83 VKKGDWTTTYGVFIIAEPVGQVATYSDWSANGMSKANVNFESRQNYRYQTWT--EYGDLEMATYGEAGIDYVARQEISAS 160 (339) T ss_pred ccCCCCcccEEEEeeeecccceEEcccccCCCcccccceeeEEeEEEEEEEE--eecHHHHHHHHhhCCChHHHHHHHHH Confidence 65542 45778888888889999999988877654 4455544444444 4444 33332 23677888888888 Q ss_pred HHHHHHHHHHHHhccCcccccccccccccc--c----cccc--cchhhhhHHHHHHHHhhhhcC------CCcEEEEcHH Q lcl|Aclame:pro 136 EAFYKKFDEAGILNQGNNPFGKSIAQSIEK--T----NKVI--KGDFTQDNIIDLEALLEDDEL------EANAFISKTQ 201 (324) Q Consensus 136 ~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~--~----~~~~--~~~~~~~~i~~~~~~l~~~~~------~~~~~v~~~~ 201 (324) +++...+|+-.++|+..... .|+++.... . ...+ +..--++||.+++.++...-. .+..+++.++ T Consensus 161 ~al~~~~N~i~~~Gd~~~~~-~GLlN~P~l~~~v~~s~~Wa~kT~~eI~~Di~~~~~~l~~~s~g~~~~~~~~~L~LP~~ 239 (339) T protein:vir:94 161 LVMAKFANSSYLLGVAGIAN-YGLMNDPSLPAPVAATVNWATAAPEDIANDVVAMVGRLISQSGGLITGQERMVMALAPS 239 (339) T ss_pred HHHHHhhceEEeeeecccce-EEEEeCCCccccccCCCCcccCCHHHHHHHHHHHHHHHHHhcCCeeeeccCcEEEecHH Confidence 88999999999998754321 222222111 1 1111 112235788888888854432 2347999999 Q ss_pred HHHHHHHhhccCCceeec--cCCcceeecceeEeecCCCCCCceeEEeecccEEEEEec---ceEEEEeeccceeccccc Q lcl|Aclame:pro 202 NRSLLRKIVDPETKERIY--DRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQ---LIEYKIDETAQLSTVKNE 276 (324) Q Consensus 202 ~~~~l~~~~d~~g~~~~~--~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~---~~~~~~~~~~~~~~~~~~ 276 (324) .+..|... ...|.-++. ..+ +-++.++-.+...... |+....+..... ...+.+........ ... T Consensus 240 ~~~~L~~~-n~~~~Tvl~~lk~n---~pnl~i~~~~el~~a~-----g~~~~~~~~~~~~~~~~~~~~p~~~~~lp-vq~ 309 (339) T protein:vir:94 240 ALNNVNRT-NNFGLSAGAKIAQT---YPNIQFVAVPEFDTAS-----GRLVQLWVPEVNGQPTGEVAFAEKLRSHS-IER 309 (339) T ss_pred HHHhcccC-CcCCccHHHHHHHh---cCCcEEEEccccccCC-----CceEEEEEEeccCCcceEEEcchhhhccc-cEE Confidence 99988643 333332221 111 1112222211111101 111111111111 11111111100000 000 Q ss_pred cccchhhhhcCcEEEEEEEE-eccEEeccCceEEEEee Q lcl|Aclame:pro 277 DGTPVNLFEQDMVALRATMH-VALHIADDKAFAKLVPA 313 (324) Q Consensus 277 ~~~~~~~f~~~~v~~r~~~r-~d~~v~~~~A~~~l~~~ 313 (324) ..-.....+..| .|..+.+|.||+++++. T Consensus 310 --------~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 310 --------YSTTTRQKHSGATFGAVIYQPWAVTQELGV 339 (339) T ss_pred --------cCceEEecceeeeeeEEEEccceeeeeecC Confidence 001122344455 57778889999999998 No 187 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=97.18 E-value=0.00014 Score=41.74 Aligned_cols=266 Identities=10% Similarity=0.055 Sum_probs=126.2 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHh-hhhhhhhcceeecCCCceEEEEEeCC Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVME-NSKIMQLGKYEPMEGTEKKFTFWADK 79 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~-~s~l~~l~~~~~~~~~~~~ip~~~~~ 79 (324) |--++..=. .+-..+.+.+.+.... .....++++.++-+....++....+- T Consensus 1 m~it~~~l~----------------------------~l~~~~~~~~~~~y~~a~~~~~~~a~~~~sdf~~~~~~~lg~~ 52 (302) T protein:vir:10 1 MLINKQSLN----------------------------AAFVAIKTIFNNAFAAAPTTWQKIAMEVPSNTSSNDYKWLSTF 52 (302) T ss_pred CcccHHHHH----------------------------HHHHHHHHHHHHHHHhhhhhhhceeeecCCCcceeeceecCCC Confidence 322221111 1111122222222222 12345556555544444555555544 Q ss_pred cce-eeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHh----ccCcc- Q lcl|Aclame:pro 80 PGA-YWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGNN- 153 (324) Q Consensus 80 ~~a-~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~----G~g~~- 153 (324) +.. .|++| ++...+.-...+++.++++..+.|||+.|.+-...+..-+.+.++++.++..|+.++. |.+.. T Consensus 53 p~l~e~~Ge---~~~~~l~~~~~~i~~~~~g~~v~i~R~~i~nDdlg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~ 129 (302) T protein:vir:10 53 PKMRRWIGA---KVVKNLKAYKYVVENEDFEATVEVDRNDIEDDQIGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTKPC 129 (302) T ss_pred CCccccccc---eeeccccccceeEEeecccceecccHHhhcccccchhHHHHHHHHHHHHhhHHHHHHHHHhccCCCcc Confidence 443 56544 4455556566789999999999999999998888999999999999999999987764 21111 Q ss_pred ---------ccccccc---cccccc---cccccchhhhhHHHHHHHHhhhhc-----CCCcEEEEcHHHHHHHHHhhccC Q lcl|Aclame:pro 154 ---------PFGKSIA---QSIEKT---NKVIKGDFTQDNIIDLEALLEDDE-----LEANAFISKTQNRSLLRKIVDPE 213 (324) Q Consensus 154 ---------~~~~~~~---~~~~~~---~~~~~~~~~~~~i~~~~~~l~~~~-----~~~~~~v~~~~~~~~l~~~~d~~ 213 (324) .++.+.. +..... .........++..+.++.+..+.. ..+..+|+.|......+++-.. T Consensus 130 ~DG~~fF~~dH~~g~~~~~N~g~~~~~~~~~~l~~~~~~aa~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~ll~~- 208 (302) T protein:vir:10 130 FDGQYFIDTDHPVGDASVSNKGTAPLSNASQAAAKAGYGAARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKMLLTN- 208 (302) T ss_pred cCCcceecccccccccccccccchhhhhcccccchHHHHHHHHHHHHHhhhcccccccCCCEEEecchhHHHHHHHhhc- Confidence 0111100 000000 001112223334444444443332 3456788888877666655322 Q ss_pred CceeeccCCcceeecc-eeEeecCCCCCCceeEEeecccE---EEEEecceEEEEeeccceeccccccccchhhhhcCcE Q lcl|Aclame:pro 214 TKERIYDRNSDSLDGL-PVVNLKSSNLKRGELITGDFDKL---IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMV 289 (324) Q Consensus 214 g~~~~~~~~~~~l~G~-pv~~~~~~~~~~~~~i~gd~s~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v 289 (324) ++. ..+....+.|. .+++.+....+..=.++.|.+.+ ++..+++..++... -|..+.+ T Consensus 209 ~~~--~~g~~Np~~g~~~~vv~p~L~s~~aWyL~a~~~~i~~~~l~g~~~P~~~~~~----------------~~~~dgv 270 (302) T protein:vir:10 209 PKL--ADNTPNPYVGTAELVVDGRIESDTAWFLLDTTKPVKPFIFQPRKQPEFVSQV----------------NLDSDDV 270 (302) T ss_pred ccc--CCCCcceeccceEEEEeeccCCCCceEEEecCCccceEEEcCccccEEEecc----------------CCCCCce Confidence 111 12222333332 34444433322222344555432 22224444444322 1556666 Q ss_pred EEEEEEEeccEEeccCceE-----EEEeecCCCCC Q lcl|Aclame:pro 290 ALRATMHVALHIADDKAFA-----KLVPADKRTDS 319 (324) Q Consensus 290 ~~r~~~r~d~~v~~~~A~~-----~l~~~~~~~~~ 319 (324) .+|.+..+|. |..+.+ .+-.+..++.+ T Consensus 271 ~~k~~~d~Gv---d~R~~~G~~~wq~a~~s~g~~~ 302 (302) T protein:vir:10 271 FNLRKLKFGA---EARAAAGYGFWQLAYGSTGTGA 302 (302) T ss_pred EEEEEEEEee---eeeeecchhhhhhhhccCccCC Confidence 6666555553 222222 11112222222 No 188 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=97.16 E-value=0.00015 Score=41.61 Aligned_cols=273 Identities=9% Similarity=-0.001 Sum_probs=122.6 Q ss_pred ccccCccccchHHHHHHHHHHHhhhhhhhhcce-ee----c--CCCceEEEEEeCCcceeee-ccCcccccccccee--e Q lcl|Aclame:pro 31 MHEKKDGTLMNEFTTPILQEVMENSKIMQLGKY-EP----M--EGTEKKFTFWADKPGAYWV-GEGQKIETSKATWV--N 100 (324) Q Consensus 31 ~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~-~~----~--~~~~~~ip~~~~~~~a~~v-~Eg~~~~~~~~~~~--~ 100 (324) +.++--..+|+.+..+.++.+++..++.+++.. .. . .+.+++||+.......... ..+..+...+..-. . T Consensus 1 MaN~llT~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~~~~~d~~~~~~~~~~~~dl~e~~v~ 80 (423) T protein:vir:10 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLISGKAT 80 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCceeeeccCCccccccccCccccceeE Confidence 333323347999999999999999999888754 21 1 3566777765432222222 23333333333333 4 Q ss_pred EEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhc-cCccccccccccccccccccccchhhhhHH Q lcl|Aclame:pro 101 ATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILN-QGNNPFGKSIAQSIEKTNKVIKGDFTQDNI 179 (324) Q Consensus 101 v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G-~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i 179 (324) +++..+|...+ .++++-...+..++++++.. -.++++..+|..++.- .+...... .... .....++++ T Consensus 81 l~id~~k~va~-~v~d~E~~~~i~~~~~~l~~-A~~aLA~~vd~~ia~~~~~~~~~~~------gt~~---t~~~a~~~i 149 (423) T protein:vir:10 81 GRVGNYITVAV-EYQQLEEAIKLNQLEEILAP-VRQRIVTDLETELAHFMMNNGALSL------GSPN---TPITKWSDV 149 (423) T ss_pred EEeeceeeeee-eechHHHhcChhhHHHHHHH-HHHHHHHHHHHHHHHHHhhcccccc------ccCC---cccchHHHH Confidence 55555555544 45554444456678765544 4688999999988742 11111000 0000 111247889 Q ss_pred HHHHHHhhhhcCCC--cEEEEcHHHHHHHHHh----h--ccCCceeeccCCc-ceeecceeEeecCCCC-CCcee---EE Q lcl|Aclame:pro 180 IDLEALLEDDELEA--NAFISKTQNRSLLRKI----V--DPETKERIYDRNS-DSLDGLPVVNLKSSNL-KRGEL---IT 246 (324) Q Consensus 180 ~~~~~~l~~~~~~~--~~~v~~~~~~~~l~~~----~--d~~g~~~~~~~~~-~~l~G~pv~~~~~~~~-~~~~~---i~ 246 (324) .++...|.....+. ...+++|..+..|.+- . +..+...+..+.- +++.|+.++.+.+.+. .++.. .. T Consensus 150 ~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnip~~T~gt~~~t~~ 229 (423) T protein:vir:10 150 AQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLT 229 (423) T ss_pred HHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchhhhhhccceeeecceEEEEeCCCcccccccccccee Confidence 99999998777664 4679999998877531 1 1112233444443 7899999988765442 11110 00 Q ss_pred eecccEEEE----EecceEEEEee-----ccceeccccccccchhhhhcCcEEEEEEEEeccEEe------ccCceEE-- Q lcl|Aclame:pro 247 GDFDKLIYG----IPQLIEYKIDE-----TAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIA------DDKAFAK-- 309 (324) Q Consensus 247 gd~s~~~~~----~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~------~~~A~~~-- 309 (324) --+...+-+ ......+.+.. ..++... +.|.-+- +.+..+....++ +..-|+. T Consensus 230 ~~~~~~v~~~a~~~a~~~~~~~~~~~~~~~~~l~~G--------D~~t~aG--v~~v~~~tk~~~~~~~t~~~~~~~v~a 299 (423) T protein:vir:10 230 VKTQPTVTYNAVKDSYQFTVTLTGATASVTGFLKAG--------DQVKFTN--TYWLQQQTKQALYNGATPISFTATVTA 299 (423) T ss_pred eeecceeccccccccceeeeeeeeccccccCceeec--------ceEEecc--eeeecccccccccccccCcceEEEEEe Confidence 000000000 00011111110 0011000 0010000 000001111100 1111111 Q ss_pred -----------EEeecCC-CCCCC---CCC Q lcl|Aclame:pro 310 -----------LVPADKR-TDSVP---GEV 324 (324) Q Consensus 310 -----------l~~~~~~-~~~~~---~~~ 324 (324) |+..++. +.+.+ .-| T Consensus 300 ~~~~~~~g~~tv~i~p~~i~~~~~~~~~~v 329 (423) T protein:vir:10 300 DANSDSGGDVTVTLSGVPIYDTTNPQYNSV 329 (423) T ss_pred eeeeccCCceeeeccCccccccCCcccccc Confidence 1111110 00000 001 No 189 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=97.11 E-value=0.00017 Score=41.27 Aligned_cols=274 Identities=9% Similarity=0.003 Sum_probs=123.9 Q ss_pred ccccCccccchHHHHHHHHHHHhhhhhhhhcc-----ee-ecCCCceEEEEEeCCcceee-eccCcccc-ccccceeeEE Q lcl|Aclame:pro 31 MHEKKDGTLMNEFTTPILQEVMENSKIMQLGK-----YE-PMEGTEKKFTFWADKPGAYW-VGEGQKIE-TSKATWVNAT 102 (324) Q Consensus 31 ~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~-----~~-~~~~~~~~ip~~~~~~~a~~-v~Eg~~~~-~~~~~~~~v~ 102 (324) ++. .-..+.++..+.+.+...+....|.. .+ ..++++++||+.....-... .+-....+ ..+.+....+ T Consensus 1 MA~---~n~a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~gl~DY~R~~~g~~~g~~~~~~~t~~ 77 (299) T protein:vir:79 1 MAA---LNYAKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTTGRVDSNRDTIAVAQRNYDNAWEPKV 77 (299) T ss_pred Ccc---chhHHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEeccccccccccccCCCcccccccCcceeEEE Confidence 211 11347888888888888777655432 22 24567899999865433222 22212222 3344555666 Q ss_pred eeheeeEEeeeehHHHhhcC--hHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhhhHHH Q lcl|Aclame:pro 103 MRAFKLGVILPVTKEFLNYT--YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNII 180 (324) Q Consensus 103 l~~~k~~~~~~iS~e~l~ds--~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 180 (324) +...|.-... |-.-=++.+ ...+...+.+...+.++-.+|...+..--+..... +........+.+-.++.|. T Consensus 78 ldqdr~~~f~-vD~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~~----g~~~~~~~~T~~n~y~~i~ 152 (299) T protein:vir:79 78 LTNQRKWSTL-VHPADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWTAL----GNTADTTVLTTTNVLEVFD 152 (299) T ss_pred eeccccceec-cchhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhhc----CCcccccccCHHHHHHHHH Confidence 6665544332 221001111 12344444555555566667776554211110000 0011112223344578899 Q ss_pred HHHHHhhhhcCCC--cEEEEcHHHHHHHHHhh------ccCCceeeccCCcceeecceeEeecCCCCCCce-eEEe---- Q lcl|Aclame:pro 181 DLEALLEDDELEA--NAFISKTQNRSLLRKIV------DPETKERIYDRNSDSLDGLPVVNLKSSNLKRGE-LITG---- 247 (324) Q Consensus 181 ~~~~~l~~~~~~~--~~~v~~~~~~~~l~~~~------d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~-~i~g---- 247 (324) +++.+|.+...+. -.++++|.++..|.+.. +.........+..+++.|.||+..++.-+...- +.-| T Consensus 153 ~~~~~lde~~vP~~~rvl~vtp~~~~~L~~~~~f~k~~~~~~~~~~~~g~Vg~idG~~Ii~Vps~r~~t~~~~~~G~~~~ 232 (299) T protein:vir:79 153 KLMEKMTEARVPENGRILYVTPVVNTLIKNAKEIQRTVNIKDAGTSLNRQTTDIDTVKIIKVPSNLMKTAYDFTTGWKVG 232 (299) T ss_pred HHHHHHHhcCCCCCCeEEEeCHHHHHHHhhchhhhcccccccccceeeeeeeeecceEEEEechhhcCccceeccCcccc Confidence 9999998887754 45799999999887532 111122345667788999999876653332110 0000 Q ss_pred -ecccE-EEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCC Q lcl|Aclame:pro 248 -DFDKL-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDS 319 (324) Q Consensus 248 -d~s~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~ 319 (324) +-+++ ++.......+.+........ ..++ ..+++---+.-+.|.|.=|.+.+ ..-|-......++ T Consensus 233 ~~ak~in~ii~~~~a~~~~~K~~~~~~-~~P~-----~~~~~~~~~~~r~y~d~~v~~nk-~~~i~~~~~~a~~ 299 (299) T protein:vir:79 233 AGAKQIFMSLVHPSAIITPVSYQFSKL-DEPT-----AVTEGKYFYFEESFEDVFILNKK-ADAIQFVVEGAGA 299 (299) T ss_pred CcccccceEEEcCCeeeeeEeeeeEEe-ecCC-----CCCccceeeeeeeeeeeeeeccc-cCeEEEEeeecCC Confidence 00000 11111122222222111111 0011 12333213333333344444332 2223222222222 No 190 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=97.09 E-value=0.00017 Score=41.18 Aligned_cols=278 Identities=9% Similarity=0.001 Sum_probs=121.1 Q ss_pred ccccCccccchHHHHHHHHHHHhhhhhhhhcce-ee----c--CCCceEEEEEeCCcceeee-ccCcccccccccee--e Q lcl|Aclame:pro 31 MHEKKDGTLMNEFTTPILQEVMENSKIMQLGKY-EP----M--EGTEKKFTFWADKPGAYWV-GEGQKIETSKATWV--N 100 (324) Q Consensus 31 ~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~-~~----~--~~~~~~ip~~~~~~~a~~v-~Eg~~~~~~~~~~~--~ 100 (324) +.++--..+|+.+.++.++.+++..++.+++.. .. . .+.+++||+-........- ..+..+..++..-. . T Consensus 1 MaN~llT~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~~~~~~~~~~~~l~e~~v~ 80 (423) T protein:vir:17 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLISGKAT 80 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCcceeecccCcccCCcccCccccceeE Confidence 333323357999999999999999999888754 21 1 2557777763322211111 22222333333333 4 Q ss_pred EEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhc-cCccccccccccccccccccccchhhhhHH Q lcl|Aclame:pro 101 ATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILN-QGNNPFGKSIAQSIEKTNKVIKGDFTQDNI 179 (324) Q Consensus 101 v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G-~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i 179 (324) +++..+|...+ .++++-......++++++... .++++..+|..++.- .+..... .... ......++++ T Consensus 81 l~id~~k~va~-~v~d~E~~~~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~a~~~------~gt~---~t~~~a~~~i 149 (423) T protein:vir:17 81 GRVGNYITVAV-EYQQLEEAIKLNQLEEILAPV-RQRIVTDLETELAHFMMNNGALS------LGSP---NTPITKWSDV 149 (423) T ss_pred EEeeceeeeee-eecHHHHhcChhHHHHHHHHH-HHHHHHHHHHHHHHHHhhccccc------cccC---CcccccHHHH Confidence 55555555444 455544445567787655444 688999999987642 1111100 0000 0111247889 Q ss_pred HHHHHHhhhhcCCC--cEEEEcHHHHHHHHH----hhc--cCCceeeccCCc-ceeecceeEeecCCCC-CCcee---EE Q lcl|Aclame:pro 180 IDLEALLEDDELEA--NAFISKTQNRSLLRK----IVD--PETKERIYDRNS-DSLDGLPVVNLKSSNL-KRGEL---IT 246 (324) Q Consensus 180 ~~~~~~l~~~~~~~--~~~v~~~~~~~~l~~----~~d--~~g~~~~~~~~~-~~l~G~pv~~~~~~~~-~~~~~---i~ 246 (324) +++...|.....+. ...+++|..+..|.+ +.. ..+...+..+.- +++.|+.++.+.+.+. .++.. .+ T Consensus 150 ~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdvy~Snnip~~T~gt~~~t~~ 229 (423) T protein:vir:17 150 AQTASFLKDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLT 229 (423) T ss_pred HHHHHHHHhccCCcCCCEEEeChHHHHHHhccccceecccccchHHHhhccceeeecceEEEEeCCCccccccceeceee Confidence 99999998777664 467999999877753 111 122333444443 6899999988765442 11111 00 Q ss_pred eecccEE-EEEecceEEEEeeccceeccccccccchhhhhcCcEEE---EEEEEeccEEe------ccCceE-------- Q lcl|Aclame:pro 247 GDFDKLI-YGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVAL---RATMHVALHIA------DDKAFA-------- 308 (324) Q Consensus 247 gd~s~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~---r~~~r~d~~v~------~~~A~~-------- 308 (324) -.....+ .+-..+...... ++......+-.. +-.-|.+.| ++..+....++ +..-|. T Consensus 230 ~~~~~~v~~~a~~~~~~~~~---~~~~~~~~~~g~--l~~GD~~t~aGv~~v~~~tk~v~~~~~t~~~~~~~v~~~~~~~ 304 (423) T protein:vir:17 230 VKTQPTVTYNAVKDSYQFTV---TLTGATTSVTGF--LKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSD 304 (423) T ss_pred ecccccccccccccccceee---eeeeeeeeccCc--eeecceEEecceeeecccccccccccccccceEEEEEeccccc Confidence 0000000 000000000000 000000000000 000011100 00000000000 011111 Q ss_pred -----EEEeecCC-CCC---CCCCC Q lcl|Aclame:pro 309 -----KLVPADKR-TDS---VPGEV 324 (324) Q Consensus 309 -----~l~~~~~~-~~~---~~~~~ 324 (324) .|+..++. +.+ ...-| T Consensus 305 a~~~~tv~i~p~~i~~~~~~~~~~v 329 (423) T protein:vir:17 305 SSGDVTVTLSGVPIYDTTNPQYNSV 329 (423) T ss_pred ccCceEEEecCccccccCCcccccc Confidence 12111110 000 00011 No 191 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=97.08 E-value=9.2e-05 Score=42.72 Aligned_cols=292 Identities=10% Similarity=0.023 Sum_probs=147.1 Q ss_pred CchhHHHHHHHHHHHhh-------hhhH-Hhh---cccc-ccccccCccccchHHHHH-----HHHHHHhhhhhhhhcce Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASN-------NVKP-QVF---NPDN-VMMHEKKDGTLMNEFTTP-----ILQEVMENSKIMQLGKY 63 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~-------~~~~-~~~---~~~~-~~~~~~~~~~vp~~~~~~-----i~~~~~~~s~l~~l~~~ 63 (324) ||.-+.++. |+++-.. ...+ ... +.+. -..++.+...||. +.+. +++.+.+......++.. T Consensus 1 ~~~~~~~~~-l~~~gi~~~~~~~~~~~~~~~~~~da~d~~~~~~~~~~~~i~~-~l~~~i~p~~~~~~~~p~~a~~l~pv 78 (336) T protein:vir:10 1 MRDAQRIQN-LARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPN-YLTTYVDPAVIDILVAPMKAAELVGE 78 (336) T ss_pred CchHHHHHH-HhhcCeeecchhhhhhhhHHHhhhhhhhccCccccCCCchhHH-HHHhhcccceeeehhhhhhhhhhccc Confidence 887776543 2222111 0000 001 1110 0112222233443 3333 34445555556666665 Q ss_pred eecCC---CceEEEEEeCCcceeeeccCccccccccceeeEEeeheeeEEeeeehH-HHhhcC--hHHHHHHHHHHHHHH Q lcl|Aclame:pro 64 EPMEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTK-EFLNYT--YSQFFEEMKPMIAEA 137 (324) Q Consensus 64 ~~~~~---~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~-e~l~ds--~~~~~~~i~~~l~~a 137 (324) ...+. ....+++......+.+.+.+...|..+...+..+.+.+.++..+.++. |+-+.+ ..++.+--....+++ T Consensus 79 ~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~a 158 (336) T protein:vir:10 79 SKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALG 158 (336) T ss_pred cccCCccceeEEEeeeeceeeEEEeeccCCCceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHHHH Confidence 44322 334566666667788889999999999888888888999999999995 444432 367778888888888 Q ss_pred HHHHHHHHHHhccCcccccccccccccc---c---ccc---ccchhhhhHHHHHHHHhhhhcC------CCcEEEEcHHH Q lcl|Aclame:pro 138 FYKKFDEAGILNQGNNPFGKSIAQSIEK---T---NKV---IKGDFTQDNIIDLEALLEDDEL------EANAFISKTQN 202 (324) Q Consensus 138 i~~~~d~~~l~G~g~~~~~~~~~~~~~~---~---~~~---~~~~~~~~~i~~~~~~l~~~~~------~~~~~v~~~~~ 202 (324) +.+.+++-.++|+.....- |+.+.... . +.. ++..-.++||.+++.+|..... .+..++|.++. T Consensus 159 le~~~N~i~~~Gd~~~~~y-GllN~P~l~a~~t~~t~~~~~~t~eei~~Di~~~~~~l~~qs~G~i~~~~~~tL~LP~~~ 237 (336) T protein:vir:10 159 LAKFLNGSYLFGVAGLENY-GLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTA 237 (336) T ss_pred HHHhhCcEEEEeccccceE-EEEeCCCCccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeecccCcceEEecHHH Confidence 8888888888887543221 22221111 0 000 1113346888889988875432 36789999999 Q ss_pred HHHHHHhhccCCceeec--cCCcceeecceeEeecCCCCCCceeEEeecccEEEEEecc---eEEEEeeccceecccccc Q lcl|Aclame:pro 203 RSLLRKIVDPETKERIY--DRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL---IEYKIDETAQLSTVKNED 277 (324) Q Consensus 203 ~~~l~~~~d~~g~~~~~--~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~---~~~~~~~~~~~~~~~~~~ 277 (324) +..|... ...|..++. ... +-++.++..+.... ++ |+...+++-...+ ..+.+...-..+..+ . T Consensus 238 ~~~Ls~~-n~~g~Tvl~~lk~n---~Pnl~i~t~pEl~~-a~----G~~~~l~~~~~~~~~t~~~~~p~~~~~l~vq-~- 306 (336) T protein:vir:10 238 MSDLSKT-NQYGLAAAAKLKDI---FPKLEFVTIPEYDT-AS----GRLVQLWAPRVEGKDTATCGFTEKMRAHSIE-R- 306 (336) T ss_pred HHhccCC-CccCccHHHHHHHh---cCccEEEEcccccc-CC----CceEEEEEEecCCCcceeeecchhhhcccee-e- Confidence 8888542 222332221 111 11111221111111 11 1111122211111 111111111000000 0 Q ss_pred ccchhhhhcCcEEEEEEEEe-ccEEeccCceEEEEee Q lcl|Aclame:pro 278 GTPVNLFEQDMVALRATMHV-ALHIADDKAFAKLVPA 313 (324) Q Consensus 278 ~~~~~~f~~~~v~~r~~~r~-d~~v~~~~A~~~l~~~ 313 (324) ..-.....+..|+ |..+.+|.||+++++. T Consensus 307 -------~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:10 307 -------YSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred -------cCceeEeccccceeeeeeeccchheeeecC Confidence 0011222333444 5666779999999998 No 192 >protein:vir:99576 Length: 388 # NCBI annotation: hypothetical protein # Family: family:all:1653 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Probab=97.04 E-value=8.6e-05 Score=42.86 Aligned_cols=298 Identities=9% Similarity=-0.056 Sum_probs=137.9 Q ss_pred CchhHHHHH--HHH---------HHHhhhhhHHhhccccccccccCccccchHHHH----HHHHHHHhhhhhhhhcceee Q lcl|Aclame:pro 1 MEQTQKLKL--NLQ---------HFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTT----PILQEVMENSKIMQLGKYEP 65 (324) Q Consensus 1 ~~~~~~~k~--~~~---------~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~----~i~~~~~~~s~l~~l~~~~~ 65 (324) |+.++..|- .+- .+.....+..-+.++.....+.++.-||-++.+ .+++.+.......+++.+.. T Consensus 33 ~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~a~da~~~~~~t~~~~gip~~~~~~~~p~~~~~~~~p~~~~~l~pv~t 112 (388) T protein:vir:99 33 MAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKT 112 (388) T ss_pred hhhHhhhhcceeccCccchhhhhhhhhhhhhhcccCcccccccccCcccHHHHHhhhhccceeeeeechhhhhhhccccc Confidence 444432220 000 011111111111222222222333335655554 34444445555556665544 Q ss_pred cCC---CceEEEEEeCCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcC---hHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 66 MEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEAFY 139 (324) Q Consensus 66 ~~~---~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds---~~~~~~~i~~~l~~ai~ 139 (324) .+. ....+++......+.+.+.+...|..+...+..+-..+.+...+.++.+-+..+ ..++.+.-.....+++. T Consensus 113 ~g~W~~~~~~f~v~e~~G~A~~ygd~~D~Pl~d~~~~~~~r~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~AA~~ale 192 (388) T protein:vir:99 113 VGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLE 192 (388) T ss_pred cCCccceeEEEeeeecceeEEEeecccCCCceeccceeeeeeEEEEEeeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHH Confidence 322 345566766677788889999999888777777777777777788876544432 47788888888888888 Q ss_pred HHHHHHHHhccCcc--ccccccccccccc----ccc---------ccchhhhhHHHHHHHHhhhhcC---CC----cEEE Q lcl|Aclame:pro 140 KKFDEAGILNQGNN--PFGKSIAQSIEKT----NKV---------IKGDFTQDNIIDLEALLEDDEL---EA----NAFI 197 (324) Q Consensus 140 ~~~d~~~l~G~g~~--~~~~~~~~~~~~~----~~~---------~~~~~~~~~i~~~~~~l~~~~~---~~----~~~v 197 (324) +.+++-.|+|.... ....|+++..... ..+ .+..--++||..++.+|...-. .. -.++ T Consensus 193 ~~~N~i~f~G~~g~~~~~~yGllNdP~l~a~v~at~~~~~~~Wa~kT~~eI~~Di~~~~~~i~~qs~g~~~~~~~~~tL~ 272 (388) T protein:vir:99 193 IMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLV 272 (388) T ss_pred hhhceEEEEeecCCCccceEEEeeCCCcccccccccCCcCcccccCCHHHHHHHHHHHHHHHHHhcCCeeeecccceEEE Confidence 88999999985322 1222333221110 000 0112235778888888754432 11 2688 Q ss_pred EcHHHHHHHHHhhccCCceeec--cCCcceeecceeEeec---CC-CCCCcee-E-Eeeccc-EEEEEe-cceEEE--Ee Q lcl|Aclame:pro 198 SKTQNRSLLRKIVDPETKERIY--DRNSDSLDGLPVVNLK---SS-NLKRGEL-I-TGDFDK-LIYGIP-QLIEYK--ID 265 (324) Q Consensus 198 ~~~~~~~~l~~~~d~~g~~~~~--~~~~~~l~G~pv~~~~---~~-~~~~~~~-i-~gd~s~-~~~~~~-~~~~~~--~~ 265 (324) +.+..+..|... +..|.-++. ... +-++.++..+ .. ..+.+.. + +.+.-. ...+.. +..+.. +. T Consensus 273 LP~~~~~~Ls~~-n~~g~Tvl~~lk~n---~Pnl~i~t~pEl~~a~~tgg~~~~~~~~~~~~~~~~~~~~~~~t~~~~~p 348 (388) T protein:vir:99 273 LPMNKVDMLSVV-TDLGISVRDWLKQT---YPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQ 348 (388) T ss_pred echHHHHhcccc-CcCCccHHHHHHHh---cCCcEEEEecccccccccCCceeEEEEecccccccccCccCcceeEEecc Confidence 899988888533 222222211 111 1111222111 11 0111111 1 110000 000000 000000 00 Q ss_pred eccceeccccccccchhhhhcC--cEEEEEE-EEeccEEeccCceEEEEee Q lcl|Aclame:pro 266 ETAQLSTVKNEDGTPVNLFEQD--MVALRAT-MHVALHIADDKAFAKLVPA 313 (324) Q Consensus 266 ~~~~~~~~~~~~~~~~~~f~~~--~v~~r~~-~r~d~~v~~~~A~~~l~~~ 313 (324) ..-... +. +.. ....... ...|..+.+|.||+++++. T Consensus 349 ~~~~~l----~v-------q~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~GI 388 (388) T protein:vir:99 349 SKFVTL----GV-------EKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) T ss_pred cccccc----cc-------eecCceeEeccccceeeeEEeccchhheeccC Confidence 000000 00 000 0111122 2357778889999999998 No 193 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=96.94 E-value=0.00025 Score=40.35 Aligned_cols=298 Identities=12% Similarity=0.067 Sum_probs=153.5 Q ss_pred CchhHHHHHHHHHHHhhhhhH---Hhhccccc---cccccCccccchHHHHHHHHHHHhhhh--hhhhcceeecCCCceE Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKP---QVFNPDNV---MMHEKKDGTLMNEFTTPILQEVMENSK--IMQLGKYEPMEGTEKK 72 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~---~~~~~~~~---~~~~~~~~~vp~~~~~~i~~~~~~~s~--l~~l~~~~~~~~~~~~ 72 (324) |+++-.+... .+.+++.+ +.+.+..- .+-.+++++--+.+..+|......... +.+-..+.+..+-..+ T Consensus 3 ~~~~~~~~~~---~~~~~~~e~~~KS~~tg~g~~p~~q~~~gAlR~esL~~~i~~Lt~~~~~~~~~~~i~k~~a~sTv~~ 79 (462) T protein:vir:96 3 KDTNLTAEQN---KYADKFQEEVMKSYQTGYGITPDTQVDAGALRREILDDQITMLTWTQDDLIFYREISRRPAQSTVQK 79 (462) T ss_pred cccccchhhh---hhhchhhHHHHHHHhcCCCcCCccccccchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhh Confidence 5544333222 22222222 22222211 112235555445555554444333222 3333344455544444 Q ss_pred EEEEeC---CcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhh-cChHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 73 FTFWAD---KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLN-YTYSQFFEEMKPMIAEAFYKKFDEAGIL 148 (324) Q Consensus 73 ip~~~~---~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~-ds~~~~~~~i~~~l~~ai~~~~d~~~l~ 148 (324) +-...+ ...+.++.|++..+.+++++...+..++=++..-.+|...-. .+..+.++...+.-...++..+|.++|+ T Consensus 80 y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~R~~~~~k~l~~t~~vsi~~tl~n~~~d~~~~~~~dai~~~a~tiE~a~Fy 159 (462) T protein:vir:96 80 YDVYLRHGNVGHSRFVREVGVAPVSDPNIRQKTVEMKYVSDTKNLSIASTLVNNIQDPMQILTEDAIAVVAKTIEWASFY 159 (462) T ss_pred heeeeccCccccccccccccccccCCCceEEEEEEEEEEeeeeeechhhhhccchhhHHHHHHHHHHHHHHHHHHHHHhh Confidence 444332 345789999999999999999999999999988888775432 3567888999999999999999999999 Q ss_pred ccCc---ccc-----cccccccccccccc-cc-chhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCceee Q lcl|Aclame:pro 149 NQGN---NPF-----GKSIAQSIEKTNKV-IK-GDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERI 218 (324) Q Consensus 149 G~g~---~~~-----~~~~~~~~~~~~~~-~~-~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~~~~ 218 (324) |+.. +.. ..|+.......+.. +- ..++.+.|..+-..+...|..++-++|+..+.+.|..-.-..-|-+. T Consensus 160 gds~l~~~~~~~gleFDGl~~lI~~~NViDarG~~Ls~~~ln~aa~~i~~~fGt~TD~~~p~~v~a~f~~~~l~~qrv~~ 239 (462) T protein:vir:96 160 GDASLTADPTGQGLEFDGLAKLIDKDNVIDAKGESLTETLLNRSAVLIGKSFGTATDAYMPIGVHADFVNSVLGRQMQLM 239 (462) T ss_pred hhcccCCCccccccchhhhhhhcCCCceeecCCCCccHHHHhhhhhhcccccCChhheecchHHHHHHHHhhcCceEEEE Confidence 9753 222 22222222111111 11 23444555555566777888888899999999998765544445555 Q ss_pred ccCCcceeecceeEe--ecCCCCCCc-eeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEE Q lcl|Aclame:pro 219 YDRNSDSLDGLPVVN--LKSSNLKRG-ELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATM 295 (324) Q Consensus 219 ~~~~~~~l~G~pv~~--~~~~~~~~~-~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~ 295 (324) .+..+....|+||.- +....+.-. ..++.+ ... +.-+.+ ..+. .-....+.+.. T Consensus 240 ~~n~g~~~~G~~v~~f~s~~G~I~L~~s~~m~~--~~i--------~~~~~~------~~p~-------ap~~~~vsaTv 296 (462) T protein:vir:96 240 QDNSGNVNAGYNVQGFYSSRGFIKLHGSTVMEN--ELI--------LDESLQ------PLPN-------APQPATVKATV 296 (462) T ss_pred cCCCCceeeeeeccceeeeeeeeeeCCceecCc--ccc--------cccccc------cCCC-------CCCCCceeEEE Confidence 555555677887732 111111000 011110 000 000000 0000 00111233332 Q ss_pred Eec--cEEeccC-ceE---EEEeecCCCCCCCCCC Q lcl|Aclame:pro 296 HVA--LHIADDK-AFA---KLVPADKRTDSVPGEV 324 (324) Q Consensus 296 r~d--~~v~~~~-A~~---~l~~~~~~~~~~~~~~ 324 (324) ..+ +...++. +-. +++.....+...|-|. T Consensus 297 ~t~~~g~f~~~~d~~~y~Y~V~avs~dgeS~PS~~ 331 (462) T protein:vir:96 297 ETGKKGLFTDEHDRAELTYKVVVNSDDAQSAPSEA 331 (462) T ss_pred EeCCCCCCCCccCceeEEEEEEEECCCCcccccee Confidence 222 3334442 211 3444445555555555 No 194 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=96.87 E-value=0.00015 Score=41.58 Aligned_cols=184 Identities=13% Similarity=0.010 Sum_probs=96.6 Q ss_pred EEeeeehHHHhhc-----ChHHHHHHHHHHHHHHHHHHHHHHHHh----ccCcccc-ccccc--cccccccccccchhhh Q lcl|Aclame:pro 109 GVILPVTKEFLNY-----TYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGNNPF-GKSIA--QSIEKTNKVIKGDFTQ 176 (324) Q Consensus 109 ~~~~~iS~e~l~d-----s~~~~~~~i~~~l~~ai~~~~d~~~l~----G~g~~~~-~~~~~--~~~~~~~~~~~~~~~~ 176 (324) ---.-+|+-+++| +..++.+...+++++++++..|+.++. +..+..+ ..... ..........+....+ T Consensus 1 iD~lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~~~~~a~~t~~~~~l~ 80 (221) T protein:vir:17 1 MDDLLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGGFSVNIGAGNTNNAQAIV 80 (221) T ss_pred CCcchhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcceeccccccCCHHHHH Confidence 1122345555544 568899999999999999999998863 2211111 11110 0001111122234456 Q ss_pred hHHHHHHHHhhhhcCCCc--EEEEcHHHHHHHHHhhccC-------C-ceeec-cCCcceeecceeEeecCCCCCCceeE Q lcl|Aclame:pro 177 DNIIDLEALLEDDELEAN--AFISKTQNRSLLRKIVDPE-------T-KERIY-DRNSDSLDGLPVVNLKSSNLKRGELI 245 (324) Q Consensus 177 ~~i~~~~~~l~~~~~~~~--~~v~~~~~~~~l~~~~d~~-------g-~~~~~-~~~~~~l~G~pv~~~~~~~~~~~~~i 245 (324) +.|.++..+|..+..+.. .++++|..+..|-+..+.. + ..... +...+.+.|++|+.+++.+...++-+ T Consensus 81 dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~v~G~~V~~SnnlP~~~gt~~ 160 (221) T protein:vir:17 81 DGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYVNAGIRIYKSNVLASLYGTNL 160 (221) T ss_pred HHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeeeecCcEEEEeccCCccccccc Confidence 778889999988887753 4677998776665422211 1 11122 22356799999999887776555433 Q ss_pred EeecccEEEE--EecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCCCCC Q lcl|Aclame:pro 246 TGDFDKLIYG--IPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGE 323 (324) Q Consensus 246 ~gd~s~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~~~~ 323 (324) ..+...+... ..+.++.+ |. +. -..+.|++|+..++.-.+ ..-|-- T Consensus 161 ~~~ag~~~~~~~~~~~yr~~--------------------fs-~~---------~glv~~~~Avgtvkl~~~--~~~~~~ 208 (221) T protein:vir:17 161 VTDPGDATTSGENNGSYRPA--------------------IT-DR---------AGLVFHKEAADTVEVLLP--PSRPPL 208 (221) T ss_pred ccCCcccccccccccccccc--------------------cc-ce---------EEEEEcchheeeeeeecC--CCCCce Confidence 3222211100 00011111 11 11 145678899887665432 223333 Q ss_pred C Q lcl|Aclame:pro 324 V 324 (324) Q Consensus 324 ~ 324 (324) | T Consensus 209 ~ 209 (221) T protein:vir:17 209 V 209 (221) T ss_pred e Confidence 3 No 195 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=96.47 E-value=0.00059 Score=38.28 Aligned_cols=293 Identities=10% Similarity=0.022 Sum_probs=146.9 Q ss_pred CchhHHHHHHHHHHHhhh------hhHH--hh---cccccc-ccccCccccchHHH----HHHHHHHHhhhhhhhhccee Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNN------VKPQ--VF---NPDNVM-MHEKKDGTLMNEFT----TPILQEVMENSKIMQLGKYE 64 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~------~~~~--~~---~~~~~~-~~~~~~~~vp~~~~----~~i~~~~~~~s~l~~l~~~~ 64 (324) ||.-+.++. |+++-..- +... .+ +.+..- .++.+..-||..+. ..+++.+.+.....+++.+. T Consensus 1 ~~~~~~~~~-l~~~gi~~~~~~~~~~~~~~~~a~da~d~~~~~~t~~~~g~~~~l~~~i~p~~~~~~~~~~~~~~l~~v~ 79 (336) T protein:vir:78 1 MRDAQRIQN-LARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGES 79 (336) T ss_pred CchHHHHHH-HhccCeecchhhhhhhHHHHHHHHhhhhhccccccCCCcchHHHHHHhcccceeeehhhhhhhhhhcccc Confidence 877766443 33321100 0000 01 111110 11111112333222 23444555555566666654 Q ss_pred ecCC---CceEEEEEeCCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcC---hHHHHHHHHHHHHHHH Q lcl|Aclame:pro 65 PMEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEAF 138 (324) Q Consensus 65 ~~~~---~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds---~~~~~~~i~~~l~~ai 138 (324) ..+. ....+++......+.+.+.+...|..+...+..+-+.+.++..+.++.+-+..+ ..++.+--....++++ T Consensus 80 t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~vd~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~al 159 (336) T protein:vir:78 80 KKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGL 159 (336) T ss_pred cCCCccccEEEEeeeecceeeEEeecccCCCeeecceeeEEEEEEEEEeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHH Confidence 4322 345667766777888899999999999999999999999999999996544433 3677777788888888 Q ss_pred HHHHHHHHHhccCccccccccccc--ccccccc-------ccchhhhhHHHHHHHHhhhhcC------CCcEEEEcHHHH Q lcl|Aclame:pro 139 YKKFDEAGILNQGNNPFGKSIAQS--IEKTNKV-------IKGDFTQDNIIDLEALLEDDEL------EANAFISKTQNR 203 (324) Q Consensus 139 ~~~~d~~~l~G~g~~~~~~~~~~~--~~~~~~~-------~~~~~~~~~i~~~~~~l~~~~~------~~~~~v~~~~~~ 203 (324) .+.+++-.++|+.....- |+.+. ....... .+..--++||..++.+|...-. .+..+++.+..+ T Consensus 160 e~~~N~~~~~Gd~~~~~~-GllN~P~l~a~~t~~~~~w~~~T~~~I~~Di~~~~~~l~~qt~g~~~~~~~~tL~Lp~~~~ 238 (336) T protein:vir:78 160 AKFLNGSYLFGVAGLENY-GLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAM 238 (336) T ss_pred HHhhCeEEEEeccccceE-EEEeCCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHH Confidence 888888888887543222 22221 1111110 1112346788888888864432 244799999999 Q ss_pred HHHHHhhccCCceeec--cCCcceeecceeEeecCCCCCCceeEEeecccEEEEEecc---eEEEEeeccceeccccccc Q lcl|Aclame:pro 204 SLLRKIVDPETKERIY--DRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL---IEYKIDETAQLSTVKNEDG 278 (324) Q Consensus 204 ~~l~~~~d~~g~~~~~--~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~---~~~~~~~~~~~~~~~~~~~ 278 (324) ..|... ...|..++. ... +-++.++..+.... ++ |+...++.-...+ ..+.+...-..+..+ T Consensus 239 ~~L~~~-n~~g~tv~~~lk~n---~Pnl~i~t~pel~~-Ag----g~~~~~~~~~~~~~~t~~~~~p~~f~~lpvq---- 305 (336) T protein:vir:78 239 SDLSKT-NQYGLSAAAKLKEI---FPKLEFVTIPEYDT-AS----GRLVQLWAPRVEGKDTATCGFTEKMRAHSIE---- 305 (336) T ss_pred HhccCC-CccCccHHHHHHHh---cCccEEEEcccccc-cC----cceEEEEEeeccCCcceeeecchhhhcccee---- Confidence 998643 222322221 111 00112222211111 11 2211222222111 111111111000000 Q ss_pred cchhhhhcCcEEEEEEEEe-ccEEeccCceEEEEee Q lcl|Aclame:pro 279 TPVNLFEQDMVALRATMHV-ALHIADDKAFAKLVPA 313 (324) Q Consensus 279 ~~~~~f~~~~v~~r~~~r~-d~~v~~~~A~~~l~~~ 313 (324) . ..-.....+..|+ |..+.+|.||+++++. T Consensus 306 ~-----~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:78 306 R-----YSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred e-----cCceeEeccccceeeeeeeccchheeeccC Confidence 0 0011122333444 5566779999999998 No 196 >protein:vir:1829 Length: 355 # NCBI annotation: major capsid protein # Family: family:all:201 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052253;genbank:gi:9634060;genbank:GeneID:1262428 Probab=96.20 E-value=0.00088 Score=37.33 Aligned_cols=298 Identities=12% Similarity=0.122 Sum_probs=160.8 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccc-cccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceE-EEEEeC Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMM-HEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKK-FTFWAD 78 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~-~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~-ip~~~~ 78 (324) |++... ..+.+|.. +.....++.. ..+..+.|-|.+.+.+.+.+.+.|-+++.++.+++..-... +-.-.+ T Consensus 1 M~~~tr--~~~~~y~~-----~~A~~ngv~~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv~ 73 (355) T protein:vir:18 1 MRQETR--FKFNAYLT-----QLAKLNGISVDDVSKKFTVEPSVTQTLMNTVQASSAFLQMINILPVAEMKGEKIGVGVT 73 (355) T ss_pred CChHHH--HHHHHHHH-----HHHHHhCCChhHccceeccCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEEeeccC Confidence 776533 22233322 1111112211 12345678888999999999999999999999888654433 333334 Q ss_pred Ccceeeec--cC-ccccccccceeeEEeeheeeEEeeeehHHHhhc--ChHHHHHHHHHHHHHHHHHHHHHHHHhccCcc Q lcl|Aclame:pro 79 KPGAYWVG--EG-QKIETSKATWVNATMRAFKLGVILPVTKEFLNY--TYSQFFEEMKPMIAEAFYKKFDEAGILNQGNN 153 (324) Q Consensus 79 ~~~a~~v~--Eg-~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~d--s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~ 153 (324) ++-++-+. .+ ...|.....++...+..++.---..|+.+.|+. ..+++...+.+.+.+.++.-.-.--|+|+.-. T Consensus 74 g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A 153 (355) T protein:vir:18 74 GTIASTTDTSGDKERQTADFTALESNKYECNQINFDFHLTYKRLDLWARFQDFQRRIRDAIVQRQALDFIMAGFNGTTRA 153 (355) T ss_pred cceeeccccCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeee Confidence 44444321 11 223444455666677777766667778877764 34789999999999888877777777885411 Q ss_pred c------cccc-------c-----------ccccc-cccccc------cchhhhhHH----HHHHHH-hhhhcCCC--cE Q lcl|Aclame:pro 154 P------FGKS-------I-----------AQSIE-KTNKVI------KGDFTQDNI----IDLEAL-LEDDELEA--NA 195 (324) Q Consensus 154 ~------~~~~-------~-----------~~~~~-~~~~~~------~~~~~~~~i----~~~~~~-l~~~~~~~--~~ 195 (324) . .|.+ + ..... ...... ...-+|..| .+++.. |+..+++. -+ T Consensus 154 ~~Td~~~nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~d~~~~lI~~~~~~d~dLV 233 (355) T protein:vir:18 154 DTSDRVKNPMLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENLDALVMDGTNTLIDEIYQDDPKLV 233 (355) T ss_pred ccCChhhCcCccccchhHHHHHHhcchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEE Confidence 1 1111 0 00000 000000 011234333 345543 45555554 36 Q ss_pred EEEcHHHHH--HHHHhhccCCceeec--c---CCcceeecceeEeecCCCCCCceeEEeecccEEEEEecc-eEEEEeec Q lcl|Aclame:pro 196 FISKTQNRS--LLRKIVDPETKERIY--D---RNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL-IEYKIDET 267 (324) Q Consensus 196 ~v~~~~~~~--~l~~~~d~~g~~~~~--~---~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~-~~~~~~~~ 267 (324) .+|..+..+ ++..+. ....|.-. . ....++.|+|.+..|. .|.+.+++--++++-+-...+ .+-.+-+. T Consensus 234 vivG~dLla~k~~~l~n-~~~~ptE~~Aa~~i~s~k~iGGlpa~~~Pf--fP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~ 310 (355) T protein:vir:18 234 AIVGRKLLADKYFPLVN-KQQENTESLAADIIISQKRIGNLPAVRVPY--FPANAVFVTTLENLSIYFMDESHRRSIDEN 310 (355) T ss_pred EEEchhhhHHHHhHHhh-ccCChHHHHHHHHHHHHHhhCCceeEEccc--cCCCceEEeeccccEEEEecCcEEEEEEec Confidence 778877543 222222 22333211 1 1136899999998775 556677777777654333222 22222111 Q ss_pred cceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 268 AQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) Q Consensus 268 ~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~~~~~ 324 (324) . ++|.+.-.=....|+.|-+.++++.++.........|++. T Consensus 311 p----------------~r~rie~y~s~Ne~YvVEd~~~~a~ieni~~~~~~~~~~~ 351 (355) T protein:vir:18 311 P----------------KKDRVENYESMNIDYVVEAYAAGCLLENITLGDFTAPAAP 351 (355) T ss_pred c----------------ccccccchhhhcceeeeeccccEEEEeeeeecCCCCcccc Confidence 1 1233322233456788888888888887766665556555 No 197 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=95.98 E-value=0.00093 Score=37.21 Aligned_cols=295 Identities=7% Similarity=-0.063 Sum_probs=136.7 Q ss_pred CchhHHHH---HHHHHHHh-------hhhhHHhh--ccccccc--------cccCccccch---HHHHHHHHHHHhhhhh Q lcl|Aclame:pro 1 MEQTQKLK---LNLQHFAS-------NNVKPQVF--NPDNVMM--------HEKKDGTLMN---EFTTPILQEVMENSKI 57 (324) Q Consensus 1 ~~~~~~~k---~~~~~~a~-------~~~~~~~~--~~~~~~~--------~~~~~~~vp~---~~~~~i~~~~~~~s~l 57 (324) |...+-.. ..|+++-. +......+ .+++..- ++.+-.-+|. .+.-.+++.+-..... T Consensus 21 ~~~~~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~amd~~~~~~~~~~~~~l~~~~~~g~~~~l~~~~p~~i~~~tap~~a 100 (379) T protein:vir:10 21 MDSADVTLDNLKHLESYGIHLNGRKNKLFELMQFAMDSNDIGPIPTPLSPLSPVSIPGLIQFLQNWLPGHVRILTAVREA 100 (379) T ss_pred hccccccHHHHHHHHhcCccccchhhhhhhhhhhhhccccccccccccCccccccccchHHHHHhhcchHHHHHhhhhhh Confidence 22222100 11222111 00000011 1111110 0000011222 2223455666555666 Q ss_pred hhhcceeecCC---CceEEEEEeCCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcC---hHHHHHHHH Q lcl|Aclame:pro 58 MQLGKYEPMEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMK 131 (324) Q Consensus 58 ~~l~~~~~~~~---~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds---~~~~~~~i~ 131 (324) .+++.+...+. ....+++......+.+.+.++..|..+...+...-..+.+...+.++.+-+..+ ..++.+.-. T Consensus 101 ~~l~pv~t~g~W~~~~~~~~v~e~~G~A~~ygd~~d~pl~d~~~~~~~r~v~~~~~g~~yg~~El~~Aa~~g~~l~~~Ka 180 (379) T protein:vir:10 101 DEFLGLSTVGQWDDEQIVQRVLEGLGTAQPYTDGGNMALMSWTPTFETRTVVRFEAGLQVAPLEEARSSRVQVSSADEKR 180 (379) T ss_pred hhhcccccCCCceeeeEEEeeeeeeeeeEEeccccCCCeeeeeeeeeeeeeEEEEEEEeecHHHHHHHHHhCCChHHHHH Confidence 66666544322 344556666677888889888889888777777777788888888876444332 477888888 Q ss_pred HHHHHHHHHHHHHHHHhccCccc-ccccccccccc--c---cc-------cc--cchhhhhHHHHHHHHhhhhcC----- Q lcl|Aclame:pro 132 PMIAEAFYKKFDEAGILNQGNNP-FGKSIAQSIEK--T---NK-------VI--KGDFTQDNIIDLEALLEDDEL----- 191 (324) Q Consensus 132 ~~l~~ai~~~~d~~~l~G~g~~~-~~~~~~~~~~~--~---~~-------~~--~~~~~~~~i~~~~~~l~~~~~----- 191 (324) ....+++...+|+-.|+|.+... ...|+++.... . .. .+ +..--++||..++.++...-. T Consensus 181 ~aA~~ale~~~N~i~f~G~~d~~~~~yGllNdP~l~a~~t~atg~~~~t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~~ 260 (379) T protein:vir:10 181 AMVGEALEVQRNRVAFYGYNDGSGRTFGFLNDPNLPAYVAVPNGAGGSPLWAQKTTLEIIADLRNGLTALQVQSMGRIKS 260 (379) T ss_pred HHHHHHHHHhhceEEEEeecCCCcceEEEEeCCCCcccccccCCcccccccccCCHHHHHHHHHHHHHHHHHhhCCeecc Confidence 88999999999999999954322 21233221111 0 00 00 111235777888877654322 Q ss_pred --CCcEEEEcHHHHHHHHHhhccCCceeec--cCC--cceeecceeEeecCCCCCCceeEEeecccEEEEEecceEEE-- Q lcl|Aclame:pro 192 --EANAFISKTQNRSLLRKIVDPETKERIY--DRN--SDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYK-- 263 (324) Q Consensus 192 --~~~~~v~~~~~~~~l~~~~d~~g~~~~~--~~~--~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~-- 263 (324) .+..+++.+..+..|... +..|..++. ... .-++...|=. .... ..++. ..++.-...+...+ T Consensus 261 ~~~~~tL~LP~~~~~~L~~~-n~~g~Tvl~~lk~n~Pnl~i~t~pEL--~~ag-gg~~~-----~~~~~~~~~~~~t~~~ 331 (379) T protein:vir:10 261 NKTPITIGIPNAYENYITTP-TELGYSVAQYMRESYPNVTFVSAPEL--NDAN-GGSSA-----IYYYADAVENNGTDDG 331 (379) T ss_pred cccceeEEecHHHHHhhccc-cccCccHHHHHHHhcCCcEEEEcccc--cccC-CCccE-----EEEEeeccCCCccCCc Confidence 223789999999998643 222222221 111 1122222211 1111 11111 11111111111110 Q ss_pred ------EeeccceeccccccccchhhhhcCcEEEEEEEE-eccEEeccCceEEEEee Q lcl|Aclame:pro 264 ------IDETAQLSTVKNEDGTPVNLFEQDMVALRATMH-VALHIADDKAFAKLVPA 313 (324) Q Consensus 264 ------~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r-~d~~v~~~~A~~~l~~~ 313 (324) +..+...... .. ..-.....+..| .|..+.+|.||++++++ T Consensus 332 ~~~~~~~p~k~~~l~v----e~-----~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~G~ 379 (379) T protein:vir:10 332 RTWLQVVPTKMFTLGV----EK-----KIKGYAEGYTNATAGAMLKRPFATYRQTGA 379 (379) T ss_pred ceEEEecchhhhhccc----ee-----cCceeEeccccceeeeeeecchhhheecCC Confidence 0000000000 00 000111222233 56777789999999998 No 198 >protein:vir:96792 Length: 315 # NCBI annotation: major capsid protein # Family: family:all:47 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224246;genbank:gi:62362381;genbank:GeneID:3345731 Probab=95.57 E-value=0.0018 Score=35.61 Aligned_cols=268 Identities=10% Similarity=0.012 Sum_probs=108.4 Q ss_pred ccccccccCccccchHHHHHHHHHHHhhhhhhhhcce--e-----ecCCCceEEEEEe-CCcce-eeeccCccccccccc Q lcl|Aclame:pro 27 DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKY--E-----PMEGTEKKFTFWA-DKPGA-YWVGEGQKIETSKAT 97 (324) Q Consensus 27 ~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~--~-----~~~~~~~~ip~~~-~~~~a-~~v~Eg~~~~~~~~~ 97 (324) ..++.-+ --.+..+.+...++|.+.+...+.+.... + ++.+.-.+.|-.. ++... .-+...+.+...+++ T Consensus 1 ~~~t~~s-dl~vfn~~~~~a~~e~~~~~~~~Fnaas~Gai~l~~~~~~GDf~~~~ff~i~~~~~~rnv~~~~~~t~~kit 79 (315) T protein:vir:96 1 MATTVNS-DLVIYNDTAQTAYLERNMDNLAVFNENSRAAIGLNSELIEGDLKLRSFYKVGGAIADRDVNSTATVAGTKIA 79 (315) T ss_pred Cceeeec-ceeeehhhhhhhHHhhhHHHHHHhhhhcCCcccccccccccccccccccccccchhhcccCCCccccceecc Confidence 1111111 11234566667777776665555443211 1 1112111111111 11000 011112222223322 Q ss_pred -eeeEEeeheeeEEeeeehHHHhh---cChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccch Q lcl|Aclame:pro 98 -WVNATMRAFKLGVILPVTKEFLN---YTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGD 173 (324) Q Consensus 98 -~~~v~l~~~k~~~~~~iS~e~l~---ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~ 173 (324) ...+..+..--.+-+..+.+.+. +++......|.+.+..++.+.+=...+.+.-.. ..+ ..........+. T Consensus 80 ~~~dvaVk~~~~~~~~~~~~~~~a~~g~dp~~~~~~i~~~~~~~~l~~~l~~~l~~~~aa--i~~---~t~~~~~~~~a~ 154 (315) T protein:vir:96 80 ADEMVSVKVPWKYGPYETTEEAFKRRARSPEEFSMLIGQDMADATMAGWIGYALNALQGA--IGS---NAGMNVSGELAT 154 (315) T ss_pred cccceeEEEeecCCchhccHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh--hcc---cccccccccccc Confidence 12222222111122223444444 333344444555555555544433333332100 000 000111223355 Q ss_pred hhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhh-----ccCCceeeccCCcceeecceeEeecCCCCCCceeEEee Q lcl|Aclame:pro 174 FTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIV-----DPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGD 248 (324) Q Consensus 174 ~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~-----d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~i~gd 248 (324) ++...+.++..++.+....-+.|+||..++..|.+.. ...+..+..+..+. .+|+||++..+++.. . +|| T Consensus 155 ~~~~~l~dA~~klGD~~~~l~~~vMHS~v~~~L~~q~L~~~~~~~~~~~~~~~~~~-~lGkrViVdD~~P~~--~-~~g- 229 (315) T protein:vir:96 155 EGKKVLTKGLRTMGDKASSIAIWVMDSTSYFDIVDEAIDNKLYEEAGVVVYGGTPG-TLGKPVLVTDQCPAT--K-IFG- 229 (315) T ss_pred cCHHHHHHHHHHhcccccCeeEEEEchHHHHHHHHhhhhhhcccccceeEecCcCc-ccccEEEEECCCCcc--e-eee- Confidence 7788899999999999888899999999999987621 12222233333333 459999998766642 2 222 Q ss_pred cccEEEEEe-cceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEecc-EEeccCceEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 249 FDKLIYGIP-QLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVAL-HIADDKAFAKLVPADKRTDSVPGEV 324 (324) Q Consensus 249 ~s~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~-~v~~~~A~~~l~~~~~~~~~~~~~~ 324 (324) .. +.+.+.-........... .+ + + -+....|..| -.+||..|..-+. .+...+=.|. T Consensus 230 -------l~~GAi~~~~~~~~~~~~~~~-~g------~-e--~l~~~~r~e~tf~l~p~G~sw~~~--~~~sPt~aeL 288 (315) T protein:vir:96 230 -------LVAGAVMITESQAPGMRSYQI-DD------Q-E--NLAIGFRAEGTANVEVLGYKWKTK--TNVNPASATL 288 (315) T ss_pred -------eecceeeecCCCccccccccC-CC------c-c--eeEEEEeeeeEeeeeeeeEEeecC--CCcCCChHHh Confidence 11 112211111100000000 01 0 1 1222233333 3567777776322 1111111222 No 199 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=95.31 E-value=0.0023 Score=35.04 Aligned_cols=292 Identities=10% Similarity=0.023 Sum_probs=141.3 Q ss_pred CchhHHHHHHHHHHHhhh------hhHH--hh---cccccc-ccccCccccchHHHHHHH-----HHHHhhhhhhhhcce Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNN------VKPQ--VF---NPDNVM-MHEKKDGTLMNEFTTPIL-----QEVMENSKIMQLGKY 63 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~------~~~~--~~---~~~~~~-~~~~~~~~vp~~~~~~i~-----~~~~~~s~l~~l~~~ 63 (324) ||.-+.++. |+++-..- +... .+ +.+..- .++.+..-||. +.+..+ +.+.+......++.+ T Consensus 1 ~~~~~~~~~-l~~~gi~~~~~~~~~~~~~~~~a~da~d~~~~~~t~~~~g~~~-~l~~~i~p~~~~~~~~~~~~~~l~~v 78 (336) T protein:vir:10 1 MRDAQRIQN-LARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPN-YLTTYVDPSVIDILVAPMKAAELVGE 78 (336) T ss_pred CchHHHHHH-HhccCeecchhhhhhhHHHHHHHHhhhhhccccccCCCcchHH-HHHhhcCcceeeeeechhchhhhccc Confidence 877766443 33321100 0000 01 111111 11111122333 333333 334444445555554 Q ss_pred eecCC---CceEEEEEeCCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcC---hHHHHHHHHHHHHHH Q lcl|Aclame:pro 64 EPMEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEA 137 (324) Q Consensus 64 ~~~~~---~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds---~~~~~~~i~~~l~~a 137 (324) ...+. ....+++......+.+.+.....|..+...+...-+.+.++..+.++.+-+..+ ..++.+--....+++ T Consensus 79 ~t~g~w~~~~~~~~~~e~~G~a~~ygd~~d~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~a 158 (336) T protein:vir:10 79 SKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALG 158 (336) T ss_pred ccCCCcceeeEEEEeeeeeeeEEEccccCCCcceeeeeeeeeeeEEEEEEEEeeCHHHHHHHHHhCCCcHHHHHHHHHHH Confidence 33221 233445555556677888888999999888888888999999999996544433 367777777778888 Q ss_pred HHHHHHHHHHhccCcccccccccc--cccccccc-------ccchhhhhHHHHHHHHhhhhcC------CCcEEEEcHHH Q lcl|Aclame:pro 138 FYKKFDEAGILNQGNNPFGKSIAQ--SIEKTNKV-------IKGDFTQDNIIDLEALLEDDEL------EANAFISKTQN 202 (324) Q Consensus 138 i~~~~d~~~l~G~g~~~~~~~~~~--~~~~~~~~-------~~~~~~~~~i~~~~~~l~~~~~------~~~~~v~~~~~ 202 (324) +.+.+++-.++|+.....- |+.+ ........ .+..--++||..++.+|...-. .+..+++.+.. T Consensus 159 le~~~N~~~~~Gd~~~~~~-GllN~P~l~a~~t~~~~~w~~~T~~eI~~Di~~~~~~l~~qt~g~i~~~~~~tL~Lp~~~ 237 (336) T protein:vir:10 159 LAKFLNGSYLFGVAGLENY-GLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTA 237 (336) T ss_pred HHHhhCeEEEEeecccceE-EEeecCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHH Confidence 8888888888887543222 2222 11111110 1112346788888888864432 24479999999 Q ss_pred HHHHHHhhccCCceeec--cCCcceeecceeEeecCCCCCCceeEEeecccEEEEEecc---eEEEEeeccceecccccc Q lcl|Aclame:pro 203 RSLLRKIVDPETKERIY--DRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL---IEYKIDETAQLSTVKNED 277 (324) Q Consensus 203 ~~~l~~~~d~~g~~~~~--~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~---~~~~~~~~~~~~~~~~~~ 277 (324) +..|... ...|..++. ... +-++.++..+.... ++ |+...++.-...+ ..+.+...-..+..+ . T Consensus 238 ~~~L~~~-n~~g~tv~~~lk~n---~Pnl~i~t~pel~~-Ag----g~~~~~~~~~~~~~~t~~~~~P~~f~~lpvq-~- 306 (336) T protein:vir:10 238 MSDLSKT-NQYGLSAAAKLKEI---FPKLEFVTIPEYDT-AS----GRLVQLWAPRVEGKDTATCGFTEKMRAHSIE-R- 306 (336) T ss_pred HHhccCC-CccCccHHHHHHHh---CCccEEEEcccccc-cC----CceEEEEEecccCCcceeeecChhhhcccee-e- Confidence 9998643 222322221 111 01112222221111 11 2211222111111 111111111000000 0 Q ss_pred ccchhhhhcCcEEEEEEEEe-ccEEeccCceEEEEee Q lcl|Aclame:pro 278 GTPVNLFEQDMVALRATMHV-ALHIADDKAFAKLVPA 313 (324) Q Consensus 278 ~~~~~~f~~~~v~~r~~~r~-d~~v~~~~A~~~l~~~ 313 (324) ..-.....+..|+ |..+.+|-||+++++. T Consensus 307 -------~~~~~~v~~~~rt~Gv~i~rP~ai~~~~GI 336 (336) T protein:vir:10 307 -------YSSYFRQKKSAGTWGAVIFRPFAVAQMLGV 336 (336) T ss_pred -------cCceeEeccccceeeeeeeccchheeeccC Confidence 0011122333444 5566779999999998 No 200 >protein:vir:861 Length: 318 # NCBI annotation: putative minor structural protein # Family: family:all:2417 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047120;genbank:gi:9630573;genbank:GeneID:1261764 Probab=95.30 E-value=0.00074 Score=37.75 Aligned_cols=286 Identities=14% Similarity=0.121 Sum_probs=135.2 Q ss_pred CchhHHHHHHHHHHHh---hhhhHHhhccccccc---cccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEE Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFAS---NNVKPQVFNPDNVMM---HEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFT 74 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~---~~~~~~~~~~~~~~~---~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip 74 (324) ++.++..-+|.+-.+. +..++.-|++.-+.. -++-...+|..+...|-..+....++++.+-+...+. +-.- T Consensus 5 iesq~A~~eF~~vL~~N~G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~--~~V~ 82 (318) T protein:vir:86 5 IESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGA--LLVS 82 (318) T ss_pred hhhhHHHHHHHHHHhccCCchhhhhhhhhhhhhcCceeeccchhccHHHHHHHHHhhhccCcceeeeeeccchh--hhhh Confidence 3444444444333322 333334454433222 2345567899898888888888888888654433322 2122 Q ss_pred E-EeCCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhc---ChHHHHHHHHHHHHHHHH-HHHHHHHHhc Q lcl|Aclame:pro 75 F-WADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNY---TYSQFFEEMKPMIAEAFY-KKFDEAGILN 149 (324) Q Consensus 75 ~-~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~d---s~~~~~~~i~~~l~~ai~-~~~d~~~l~G 149 (324) + ..++..+...-.|..+.+....|..-++.+.-++....+ -|+.++ +...+..++..+|+.++. +.+|.+++-| T Consensus 83 ~s~~s~AeAq~HkdGqTK~eqa~~~~~~Tl~~~~VY~~~S~-Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~G 161 (318) T protein:vir:86 83 RSFDSSAEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG 161 (318) T ss_pred hhhhhhhhhhhhccCCccccceeeeeeechhHHHHHHHHHH-HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhheee Confidence 2 123455667788888888888888888877555444444 334443 445668999999999998 8899999999 Q ss_pred cCccccccccccc-----cc-cccccccchhhhhHHHHHHHHhhhhcCCCc---EEEEcHHHH-HHHHHhhccCCceeec Q lcl|Aclame:pro 150 QGNNPFGKSIAQS-----IE-KTNKVIKGDFTQDNIIDLEALLEDDELEAN---AFISKTQNR-SLLRKIVDPETKERIY 219 (324) Q Consensus 150 ~g~~~~~~~~~~~-----~~-~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~---~~v~~~~~~-~~l~~~~d~~g~~~~~ 219 (324) +|+++....-.-+ .. +.....+++..+.+ ++..-.+-.++.+ -+++..... +.|..++.+....-.. T Consensus 162 DG~N~f~~~DK~advK~I~k~Ttkaksagttpfan---aieeavdfvrptagrrylivkaedrkalldelrqatanahvr 238 (318) T protein:vir:86 162 DGSNGFKSIDKEADVKKIKKITTKAKSAGTTPFAN---AIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQATANAHVR 238 (318) T ss_pred cCCCCccchhhHHHHHHHHHHhhhhhccCCCchhh---HHHHHHhhhccCCCceEEEEeecchHHHHHHHHhhcccceeE Confidence 9988643221100 00 11111223333332 2222212222222 245554443 4445565443322211 Q ss_pred ---cCC-cceeecc-eeEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhh--hhcCcEEEE Q lcl|Aclame:pro 220 ---DRN-SDSLDGL-PVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNL--FEQDMVALR 292 (324) Q Consensus 220 ---~~~-~~~l~G~-pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--f~~~~v~~r 292 (324) +.. -..-.|. .+++......-+.+++ .|- ++.+-.. + -+-++. |..|.--+. T Consensus 239 iknddteiasevgvdeiivytgskalkptvl-vdq-kyhidmq-d------------------ltkvdafewktnsnmil 297 (318) T protein:vir:86 239 IKNDDTEIASEVGVDEIIVYTGSKALKPTVL-VDQ-KYHIDMQ-D------------------LTKVDAFEWKTNSNMIL 297 (318) T ss_pred EeccchhhhhhcCcceeeeeeccccccceee-ecc-ceecchh-h------------------hhhhhcceeccCCceEE Confidence 111 0011111 1122111111122222 221 1111100 0 011111 333333333 Q ss_pred EEEEeccEEeccCceEEEEee Q lcl|Aclame:pro 293 ATMHVALHIADDKAFAKLVPA 313 (324) Q Consensus 293 ~~~r~d~~v~~~~A~~~l~~~ 313 (324) ++.-..+.+---+|-+.++.. T Consensus 298 vetltsghvetynagavitvs 318 (318) T protein:vir:86 298 VETLTSGHVETYNAGAVITVS 318 (318) T ss_pred EeecccCcceeecCceeEEeC Confidence 444444444444555555554 No 201 >protein:vir:98566 Length: 355 # NCBI annotation: gp5 # Family: family:all:201 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958060;genbank:gi:41057357;genbank:GeneID:2744237 Probab=94.99 E-value=0.003 Score=34.40 Aligned_cols=298 Identities=11% Similarity=0.096 Sum_probs=157.5 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccc-cccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceE-EEEEeC Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMM-HEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKK-FTFWAD 78 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~-~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~-ip~~~~ 78 (324) |++... ..+.+|.. +.....++.. .....+.|-|.+.+.+.+.+.+.|-+++.++.+++..-... +-.-.+ T Consensus 1 M~~~tr--~~~~~y~~-----~~A~~ngv~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv~ 73 (355) T protein:vir:98 1 MRPETR--FKFNAYLT-----RVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVT 73 (355) T ss_pred CChHHH--HHHHHHHH-----HHHHHhCCChhHccceeecCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEeeeccC Confidence 765533 22333332 1111122211 12235668888889999999999999999999888654333 333334 Q ss_pred Ccceeeec--c-CccccccccceeeEEeeheeeEEeeeehHHHhhc--ChHHHHHHHHHHHHHHHHHHHHHHHHhccCcc Q lcl|Aclame:pro 79 KPGAYWVG--E-GQKIETSKATWVNATMRAFKLGVILPVTKEFLNY--TYSQFFEEMKPMIAEAFYKKFDEAGILNQGNN 153 (324) Q Consensus 79 ~~~a~~v~--E-g~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~d--s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~ 153 (324) ++-++-+. . ....|.....++...+..++.---..|+.+.|+. ..+++...+.+.+.+.++.-.-.-.|+|+.-. T Consensus 74 g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A 153 (355) T protein:vir:98 74 GTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRA 153 (355) T ss_pred ccccccccCCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeee Confidence 44443321 1 1223444455666667777766667778877764 34789999999999888877777777885411 Q ss_pred c------cccc-------c-----------ccccc-----cccc--cccchhhhhHH----HHHHHH-hhhhcCCC--cE Q lcl|Aclame:pro 154 P------FGKS-------I-----------AQSIE-----KTNK--VIKGDFTQDNI----IDLEAL-LEDDELEA--NA 195 (324) Q Consensus 154 ~------~~~~-------~-----------~~~~~-----~~~~--~~~~~~~~~~i----~~~~~~-l~~~~~~~--~~ 195 (324) . .|.+ + ..... .+.. .....-+|..| .+++.. |+..+++. -+ T Consensus 154 ~~Td~~~nPllqDVNkGWlQ~~Re~ap~~v~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~D~~~~lI~~~~~~d~dLV 233 (355) T protein:vir:98 154 DTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLV 233 (355) T ss_pred ccCChhhCcCccccchhHHHHHHhcchhhhhhhhcccCccccccceeeCCCCCcccHHHHHHHHHhccCChHHhcCCCEE Confidence 1 1111 0 00000 0000 00011234333 345543 45555553 36 Q ss_pred EEEcHHHHH--HHHHhhccCCceeec-----cCCcceeecceeEeecCCCCCCceeEEeecccEEEEEecc-eEEEEeec Q lcl|Aclame:pro 196 FISKTQNRS--LLRKIVDPETKERIY-----DRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL-IEYKIDET 267 (324) Q Consensus 196 ~v~~~~~~~--~l~~~~d~~g~~~~~-----~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~-~~~~~~~~ 267 (324) .+|..+..+ ++..+. ....|--. -....++.|+|.+..|. .|.+.+++--++++-+-...+ .+-.+.+. T Consensus 234 vivG~dLla~k~~~l~n-~~~~ptE~~Aa~~i~s~k~iGGlpa~~~Pf--fP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~ 310 (355) T protein:vir:98 234 AIVGRKLLADKYFPLVN-KQQENSESLAADIIISQKRIGNLPAVRVPY--FPANAVLVTTLENLSIYFMDESHRRSIDEN 310 (355) T ss_pred EEEchhhhHHHhhhHhh-ccCCcHHHHHHHHHHHhhhhCCceeEEccc--cCCCceEEeeccccEEEEecCcEEEEEEec Confidence 778877543 222222 22233110 11236899999988775 556677777777654333222 22222111 Q ss_pred cceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 268 AQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) Q Consensus 268 ~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~~~~~ 324 (324) . ++|.+.-.=....|+.|-+.++++.++.........|.+- T Consensus 311 p----------------~r~rie~y~s~Ne~YvVEd~~~~a~ienI~~~~~~~~~~~ 351 (355) T protein:vir:98 311 P----------------KKDRVENYESMNIDYVVEVYAAGCLLENITLGDFTAPAAP 351 (355) T ss_pred c----------------ccccccchhhhcceeeeeccccEEEeeceeeeCCCCCccc Confidence 1 1222222223355778888888888776655544444444 No 202 >protein:vir:78777 Length: 358 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285647;genbank:gi:148727153;genbank:GeneID:5220125 Probab=94.82 E-value=0.0034 Score=34.11 Aligned_cols=298 Identities=11% Similarity=0.019 Sum_probs=152.6 Q ss_pred Cchh--HHHHHHHHHHHhhhhhHHhhcccccc-ccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceE-EEEE Q lcl|Aclame:pro 1 MEQT--QKLKLNLQHFASNNVKPQVFNPDNVM-MHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKK-FTFW 76 (324) Q Consensus 1 ~~~~--~~~k~~~~~~a~~~~~~~~~~~~~~~-~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~-ip~~ 76 (324) |+++ ++.+..+.+|. .+.....++. ...+.-+.|.+.+.+.+.+.+.+.|-+++.++.+++..-... +-.- T Consensus 1 m~~~M~~~tr~~~~~y~-----~~~A~~ngv~~~~~~~~Fsv~p~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg 75 (358) T protein:vir:78 1 MSQTLTVQAEQRLNKYC-----DALAKAYGIDISKLDKQFSVTGPVETTLRSALLASVEFLGLITCLDVDQIKGQVVQVG 75 (358) T ss_pred CcccccHHHHHHHHHHH-----HHHHHHhCCChhHccceeeeChHHHHHHHHHHHHHHHHhhcCcccccccceeeEEeec Confidence 6664 12222222222 2222222222 122345778888999999999999999999998887654333 3333 Q ss_pred eCCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhc-----ChHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|Aclame:pro 77 ADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNY-----TYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) Q Consensus 77 ~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~d-----s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g 151 (324) .+++-++-+.- ..|.....++...+..++.---..|+.+.|+. +..++...+.+.+.+.++.-.-.-.|+|+. T Consensus 76 ~~g~iagrt~t--r~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~f~~~~dF~~r~~~~i~~~~ALD~i~IGfNGts 153 (358) T protein:vir:78 76 VGQLYTGRKKG--GRFKGKVGVDGNTYELTETDSCASLDWATLCTWANAGSEGEFIKLVGEFVNKAFALDMLRVGWNGVS 153 (358) T ss_pred CCcccceecCC--CccccccccCCCccEEEEeceeeeccHHHHHHHHhCCChhHHHHHHHHHHHHHHhhccceeccccee Confidence 33444443332 23444455666666666666666777777763 123688888888888887766666778753 Q ss_pred cccc------ccc------------------cccccccccc---cccchhhhhHH----HHHHH-HhhhhcCCC--cEEE Q lcl|Aclame:pro 152 NNPF------GKS------------------IAQSIEKTNK---VIKGDFTQDNI----IDLEA-LLEDDELEA--NAFI 197 (324) Q Consensus 152 ~~~~------~~~------------------~~~~~~~~~~---~~~~~~~~~~i----~~~~~-~l~~~~~~~--~~~v 197 (324) -... |.+ .......... ...+.-+|..| .+++. .|+..+++. -+.+ T Consensus 154 ~A~~Td~~~nPllqDVN~GWlQ~~Re~a~~~v~~~~~~~~~i~ig~g~~Gdy~NLDalV~D~~~~lI~~~~~~d~dLVvi 233 (358) T protein:vir:78 154 AADDTDPTANPLGQDVNKGWHQLAREWKGGSQIIKAAAGEKIYFDPDGKGEYKTLDEMASDLINTTIDPLFQQDPRLVVL 233 (358) T ss_pred eccCCChhhCcCccccchHHHHHHHhhchhhhhccccccCceeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEE Confidence 2110 100 0000000000 00111234333 34543 445555554 3677 Q ss_pred EcHHHHH--HHHHhhccCCcee---eccCCcceeecceeEeecCCCCCCceeEEeecccEEEEE-ecceEEEEeecccee Q lcl|Aclame:pro 198 SKTQNRS--LLRKIVDPETKER---IYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGI-PQLIEYKIDETAQLS 271 (324) Q Consensus 198 ~~~~~~~--~l~~~~d~~g~~~---~~~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~-~~~~~~~~~~~~~~~ 271 (324) |..+..+ ++..+. ....|- -..--..++-|+|.+..|. .|.+.+++--++++-+-. .+..+-.+-+.. T Consensus 234 vG~dLla~k~~~l~n-~~~~pTE~~Aa~~i~k~iGGlpa~~~Pf--FP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p--- 307 (358) T protein:vir:78 234 VGTDLVAAAQAKLYS-EATKPSEQIAAQQLAKSIAGRKAYIPPF--FPGKRMVVTTLDNLHCYTQRGTRKRKADDNQ--- 307 (358) T ss_pred EchhhhhHHhhhHhh-cCCCcHHHHHHHHHHHHhCCCeEEEccc--cCCCceEEeeccccEEEEecCcEEEEEEecc--- Confidence 7777654 222222 222321 1111125789999988775 556677777776653322 223222221111 Q ss_pred ccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecC---CCCCCCCC---C Q lcl|Aclame:pro 272 TVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADK---RTDSVPGE---V 324 (324) Q Consensus 272 ~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~---~~~~~~~~---~ 324 (324) .+|.+.-.=....|+.|-+.++++.++...- ...++|+. - T Consensus 308 -------------~r~riE~y~s~Ne~YvVEd~~~~a~iE~i~v~~~~~pa~~~~~~~~ 353 (358) T protein:vir:78 308 -------------DSKSFDNQYWRMEGYALGEHKAYGGFEEADIEIGADPAVLAVEAAA 353 (358) T ss_pred -------------ccccccchhhhcceeeeeccccEEEEeeeeeeeCCCCCccccCCcc Confidence 1222222223355788888888888765442 22222222 1 No 203 >protein:vir:63741 Length: 468 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547622;genbank:GeneID:3783474 Probab=94.79 E-value=0.0035 Score=34.06 Aligned_cols=314 Identities=10% Similarity=0.028 Sum_probs=140.1 Q ss_pred CchhHHHHHHHHHHHhhhh---hHHhhcccc---ccccccCccccchHHHHHHHHHHHhhhh--hhhhcceeecCCCceE Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNV---KPQVFNPDN---VMMHEKKDGTLMNEFTTPILQEVMENSK--IMQLGKYEPMEGTEKK 72 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~---~~~~~~~~~---~~~~~~~~~~vp~~~~~~i~~~~~~~s~--l~~l~~~~~~~~~~~~ 72 (324) |-|++|- +.+..---+.. .-+.+.+.. -.+-++++++--+.+..+|......... +.+-..+.+..+-..+ T Consensus 1 ~~~~~~~-~~~~~~~~~~~~e~~~Ks~~agy~~~p~~q~~~~AlR~EsL~~~i~~L~~~~~~f~~~~di~k~~a~stv~~ 79 (468) T protein:vir:63 1 MPKNNKE-EEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAK 79 (468) T ss_pred CCCCcch-hhccccChhHHHHHHHHHHHcCcccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcccchhhhhhhh Confidence 8777763 33222111111 111121111 1112345555555565555544333322 2222233344443344 Q ss_pred EEEEeC---CcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhh-cChHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 73 FTFWAD---KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLN-YTYSQFFEEMKPMIAEAFYKKFDEAGIL 148 (324) Q Consensus 73 ip~~~~---~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~-ds~~~~~~~i~~~l~~ai~~~~d~~~l~ 148 (324) |-...+ ...+.+++|++..+.+++++...+..++=++....+|.-+-. .+..+.+....+.-...++..+|.++|+ T Consensus 80 y~~~~~~G~~g~~~f~~E~g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~i~d~~~~~~~~ai~~~a~tiE~a~Fy 159 (468) T protein:vir:63 80 YDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFF 159 (468) T ss_pred heeeeccCccccccccccccccccCCCceEEEEEEeeeeeeeeeehhhhhhhcchhhHHHHHHHHHHHHHHHHHHHHhhh Confidence 444332 345789999999999999999999999999987777764433 2467888888899999999999999999 Q ss_pred ccCccc---------cccccccccccccc-cccc-hhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHH-HhhccCCce Q lcl|Aclame:pro 149 NQGNNP---------FGKSIAQSIEKTNK-VIKG-DFTQDNIIDLEALLEDDELEANAFISKTQNRSLLR-KIVDPETKE 216 (324) Q Consensus 149 G~g~~~---------~~~~~~~~~~~~~~-~~~~-~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~-~~~d~~g~~ 216 (324) |+..-. +..|+.......+- ...| .++-++|..+-..+...|..+.-++|+..+.+.|. .....+ +- T Consensus 160 Gds~l~~s~~~~~glqfDGi~~li~~enviDa~G~~ls~~~lneaa~~i~~gfG~~td~~~~~~v~a~~~~~~L~~q-~~ 238 (468) T protein:vir:63 160 GDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQ-TQ 238 (468) T ss_pred cccccccCCCccccccccceeEEecCCceeccCCCccCHHHHHHHhhhccccccChhhhhcchhHHhhhhhhhcCce-EE Confidence 986431 11222222111111 1112 23444555555556666777778999999887773 222211 22 Q ss_pred eeccCCcceeecceeE--eecCCCC-CCceeEEeecccEE---EEEe---cceEEEEeeccceeccccccccchhhhhcC Q lcl|Aclame:pro 217 RIYDRNSDSLDGLPVV--NLKSSNL-KRGELITGDFDKLI---YGIP---QLIEYKIDETAQLSTVKNEDGTPVNLFEQD 287 (324) Q Consensus 217 ~~~~~~~~~l~G~pv~--~~~~~~~-~~~~~i~gd~s~~~---~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~ 287 (324) +..+.......|.||- .+....+ -.+..+++|....- .+.. ....+..+....- ......+.+ . T Consensus 239 v~~~n~~~~~~G~~v~g~~sa~G~I~l~gs~il~~~~~l~~~~~~~~~Apsp~~vsaT~~~~~-~g~~~~~~~------a 311 (468) T protein:vir:63 239 LVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQPAKVTATQEAGK-KGQFRAEDL------A 311 (468) T ss_pred EEcCCCCceeeeecccceecceeeeeecCceeeccccCCCcccccccccccCCccceeeeccc-CCcccCCCc------c Confidence 2223333344555542 1110000 01112233321110 0000 0000100000000 000000000 0 Q ss_pred cEEEEEEEEeccEEeccCceE-----------EEEee-cCCCCCCCCC-C Q lcl|Aclame:pro 288 MVALRATMHVALHIADDKAFA-----------KLVPA-DKRTDSVPGE-V 324 (324) Q Consensus 288 ~v~~r~~~r~d~~v~~~~A~~-----------~l~~~-~~~~~~~~~~-~ 324 (324) ...||+...-+..--.|...+ .|+.. .+.+..+| + | T Consensus 312 ~y~Y~v~~vs~~GES~pS~~vtvTVaa~~dg~~ltIt~~~~~~~~p-~yv 360 (468) T protein:vir:63 312 AHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRP-QFV 360 (468) T ss_pred eEEEEEEEECCCCccccccceEEEecCcccceeEEEEecCCCCCcc-eEE Confidence 011222222222111222222 22222 12222222 1 1 No 204 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=94.58 E-value=0.004 Score=33.71 Aligned_cols=301 Identities=8% Similarity=-0.048 Sum_probs=133.6 Q ss_pred CchhHHHH-----------HHHHHHHhhhhhHHh--hccccccccccCccccchHHHH----HHHHHHHhhhhhhhhcce Q lcl|Aclame:pro 1 MEQTQKLK-----------LNLQHFASNNVKPQV--FNPDNVMMHEKKDGTLMNEFTT----PILQEVMENSKIMQLGKY 63 (324) Q Consensus 1 ~~~~~~~k-----------~~~~~~a~~~~~~~~--~~~~~~~~~~~~~~~vp~~~~~----~i~~~~~~~s~l~~l~~~ 63 (324) |+.+++-| ..++........... +.++...-.+.++.-||-++.+ .+++.+.+.....+++.+ T Consensus 27 ~~~~~l~~~gi~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~~~g~p~~~l~~~~p~~~~~~~~p~~~~~l~pv 106 (382) T protein:vir:96 27 EAVAALGRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGI 106 (382) T ss_pred HHHHHHhccccccCcccchhHhhhhhhhhhhhhhcccccccCCccccCCccHHHHHHhhhhhhhhhhhhhhhhhhhhccc Confidence 11111100 001111100000000 1112111112222334554443 455556666666777665 Q ss_pred eecCC---CceEEEEEeCCcceeeeccCccccccccceeeEEeeheeeEEeeeeh-HHHhhcC--hHHHHHHHHHHHHHH Q lcl|Aclame:pro 64 EPMEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVT-KEFLNYT--YSQFFEEMKPMIAEA 137 (324) Q Consensus 64 ~~~~~---~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS-~e~l~ds--~~~~~~~i~~~l~~a 137 (324) ...+. .++.+++......+.+.+.++..|..+...+..+-..+.+.....+. .|..+.+ ..++.+--.....++ T Consensus 107 ~t~g~W~~~t~ty~~~e~~G~A~~ygd~~D~Pl~d~~~~~~~r~v~~~~~g~~yg~lE~~rAa~~~~~l~~~Ka~aA~~a 186 (382) T protein:vir:96 107 DTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERRTIVRGELGLLVGTLEEGRASAIRLNSAETKRQQAAIG 186 (382) T ss_pred cccCCccceEEEEeeeecccceEEeecccCCCccccccceeEEEEEEEEEeeeecHHHHHHHHhhCCCcHHHHHHHHHHH Confidence 44322 34567777777888899999988887776665555566666666664 5655543 466777777788888 Q ss_pred HHHHHHHHHHhccCc--cccccccccccc--cccc-----ccc--chhhhhHHHHHHHHhhhhcC---C----CcEEEEc Q lcl|Aclame:pro 138 FYKKFDEAGILNQGN--NPFGKSIAQSIE--KTNK-----VIK--GDFTQDNIIDLEALLEDDEL---E----ANAFISK 199 (324) Q Consensus 138 i~~~~d~~~l~G~g~--~~~~~~~~~~~~--~~~~-----~~~--~~~~~~~i~~~~~~l~~~~~---~----~~~~v~~ 199 (324) +...+|+-+|+|+.. +...-|+.+... +... .+. ..--++||..++.+|...-. . +..+++. T Consensus 187 le~~~N~i~f~G~~~g~~~~~yGllNdP~l~a~~t~a~~~Wa~kT~~eI~~Di~~l~~~i~~qt~G~~~~~~~~~~L~LP 266 (382) T protein:vir:96 187 LEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPPFQTPPSQGWATADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITMALA 266 (382) T ss_pred HHHhhceEEEEeeecCcCcceEEEEeCCCcccccccCCCCcccccHHHHHHHHHHHHHHHHhccCCeeeecccceEEeec Confidence 888888899999632 211123333221 1111 111 12235778888888854432 1 2257889 Q ss_pred HHHHHHHHHhhccCCceeec--cCC--cceeecceeEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceecccc Q lcl|Aclame:pro 200 TQNRSLLRKIVDPETKERIY--DRN--SDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKN 275 (324) Q Consensus 200 ~~~~~~l~~~~d~~g~~~~~--~~~--~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~ 275 (324) ++.+..|... ...|.-++. ... +-++...|=. ...... +. | .+.+.+-....+....+-..+.-.... T Consensus 267 ~~~~~~Ls~~-n~~g~Tvl~~lk~n~Pnl~i~t~peL--~~a~~~-g~---g-~~~~~~~~~~e~~~~~~~s~~~p~~f~ 338 (382) T protein:vir:96 267 TSKVDYLSVT-TPYGISVSDWIEQTYPKMRIVSAPEL--SGVQMQ-GK---T-PEDALVLFVEEVDASVDGSTDGGSVFS 338 (382) T ss_pred hHHHhhcccc-CccCccHHHHHHHhcCCcEEEEcccc--ccccCC-Cc---c-ceeEEEEecchhhhhcccccccCccee Confidence 9988887532 222221211 011 1122222100 000000 00 0 011111111111100000000000000 Q ss_pred ----------ccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEee Q lcl|Aclame:pro 276 ----------EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPA 313 (324) Q Consensus 276 ----------~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~ 313 (324) +.......|..+ ......|..+.+|.||+++++. T Consensus 339 q~~p~~~~~l~ve~~~~~~~~~----~s~~t~Gv~i~~P~ai~~~~GI 382 (382) T protein:vir:96 339 QLVQSKFITLGVEKRAKSYVED----FSNGTAGALCKRPWAVVRYLGI 382 (382) T ss_pred ccccceeeeccceeecceeEec----cccceeeeEEEcchhhhhccCC Confidence 000000001000 0112467888899999999998 No 205 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=94.54 E-value=0.0041 Score=33.65 Aligned_cols=272 Identities=11% Similarity=0.051 Sum_probs=145.2 Q ss_pred cccccccCccccchHHHHHHHHHHHhhhhhhhhcc-eeec-CCCceEEEEEeCCcceeeeccCccccccccceeeEEeeh Q lcl|Aclame:pro 28 NVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGK-YEPM-EGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRA 105 (324) Q Consensus 28 ~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~-~~~~-~~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~ 105 (324) -..++.+-+..+.++++..|...+.+.-.=-...+ +... ++..+.||.. +.+...-..|.++........+++++-. T Consensus 1 ~~~TSNT~A~I~SE~~s~~I~~~LH~~LL~~~~~R~V~DF~~G~~L~I~ti-Gs~~~~~~~E~~~~~~~~i~TGEIt~~i 79 (313) T protein:vir:95 1 MQLTSNTRAFIESEQYSKFILLNLHDGLLPETFYRNVSDFGSGETLHIKTI-GSVTLQEAEEDTPLIYNPIETGEITFQI 79 (313) T ss_pred CcccccchheehhhhHHHHHHHHhhccccchhhhhhhccCCCCCEEEeccc-CceeeeccccCCCeeecccccceEEEEE Confidence 12344444455566676666655544322222223 2333 3455667665 4555666777778777888888999988 Q ss_pred eeeEEe-eeehHHHhhcCh--HHHHHHHHHHHHHHHHHHHHHHHHh-ccC---ccccccccccccc--cccccccchhhh Q lcl|Aclame:pro 106 FKLGVI-LPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGIL-NQG---NNPFGKSIAQSIE--KTNKVIKGDFTQ 176 (324) Q Consensus 106 ~k~~~~-~~iS~e~l~ds~--~~~~~~i~~~l~~ai~~~~d~~~l~-G~g---~~~~~~~~~~~~~--~~~~~~~~~~~~ 176 (324) ..+.+- ..||+.+-+|+. .++.+.+..+-+++|....+..++. |.. .++.|.. .++.. .+...+.+.... T Consensus 80 ~~Y~G~A~~vt~~LR~D~~~I~~~~A~~~AE~~RAI~E~~~TD~L~~G~~~FA~~~~P~~-vNG~PH~~V~~~T~~~~~~ 158 (313) T protein:vir:95 80 TEYKGDAWYVTDDLREDGTDIDRLMAERAAESTRAIQETFETDFLKTGAEYFAANPGPHN-VNGFPHVIVSAETNGVFAL 158 (313) T ss_pred EeecCChhhhhhhhhhcchhHHHHhhhcchhhHHHHHHHHhhHHHhhchhhhccCCCCcc-cccccceEEeccCCceehh Confidence 886644 469999999874 4556666667777888887777664 211 1111111 11111 123334456667 Q ss_pred hHHHHHHHHhhhhcCCC--cEEEEcHHHHHHHHHhhc------cCCceeeccCCc------ceeecceeEeecCCC---- Q lcl|Aclame:pro 177 DNIIDLEALLEDDELEA--NAFISKTQNRSLLRKIVD------PETKERIYDRNS------DSLDGLPVVNLKSSN---- 238 (324) Q Consensus 177 ~~i~~~~~~l~~~~~~~--~~~v~~~~~~~~l~~~~d------~~g~~~~~~~~~------~~l~G~pv~~~~~~~---- 238 (324) .++..+...+.....+. -.+|+.|.....|..+.. .+|+-++..+.. ..+.|+.+.++.-.. T Consensus 159 ~~~~~~~~~~~~a~~P~~G~v~IvDP~~~~~L~~l~~It~~vt~~~k~I~ESG~A~~~~Fi~~~YG~Di~~SN~L~~AN~ 238 (313) T protein:vir:95 159 KHLIAMRLAFDKANVPAEGRVFIVDPVAEATLNGLVTITHDVTDFGKMILESGMARGQRFIMNLYGWDILTSNRLHVANY 238 (313) T ss_pred hHHHHhhhhhhhccCCccceEEEEcchhhhhhhhhheeecccccccceeeeccCCchhHHHHHHhhhhhhhhhhhhhccc Confidence 78888877776665543 368999999999887652 345655543321 235666655432111 Q ss_pred -----CCCcee--EEeecc----cEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCce Q lcl|Aclame:pro 239 -----LKRGEL--ITGDFD----KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAF 307 (324) Q Consensus 239 -----~~~~~~--i~gd~s----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~ 307 (324) ...|.. +|-..+ .-+++-|+.+.- -....+ .+-..+..+ .++|+|+.+.+.+.. T Consensus 239 ~D~~tT~~G~~~NlFM~i~D~~~~P~~~AWr~MP~-------s~~~~~------~~~~~~~~~--~~~R~G~Gi~R~~~L 303 (313) T protein:vir:95 239 NDGTTTGNGYVGNLFMCILDDQTKPIMGAWRRMPK-------SEGERN------KDRARDEHV--VRCRYGFGIQRLDTL 303 (313) T ss_pred cccccccCceeeeeeeeeecccccceeeeeccccc-------cccccc------cccccccce--eeeeecccceeecce Confidence 111111 111111 112222222210 000011 111233344 446778888776665 Q ss_pred E-EEEeecCC Q lcl|Aclame:pro 308 A-KLVPADKR 316 (324) Q Consensus 308 ~-~l~~~~~~ 316 (324) . .++.+++. T Consensus 304 ~~~~~~A~~~ 313 (313) T protein:vir:95 304 GLLATSATAY 313 (313) T ss_pred eEEEeccccC Confidence 4 55666666 No 206 >protein:vir:1663 Length: 393 # NCBI annotation: unknown # Family: family:all:2417 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044952;genbank:gi:9629659;genbank:GeneID:1261309 Probab=94.41 E-value=0.0016 Score=35.87 Aligned_cols=287 Identities=13% Similarity=0.134 Sum_probs=129.1 Q ss_pred Cc-----------------hhHHHHHHHHHHHh---hhhhHHhhccccc---cccccCccccchHHHHHHHHHHHhhhhh Q lcl|Aclame:pro 1 ME-----------------QTQKLKLNLQHFAS---NNVKPQVFNPDNV---MMHEKKDGTLMNEFTTPILQEVMENSKI 57 (324) Q Consensus 1 ~~-----------------~~~~~k~~~~~~a~---~~~~~~~~~~~~~---~~~~~~~~~vp~~~~~~i~~~~~~~s~l 57 (324) +| .++..-+|.+-.+. +..++..|.+.-+ .+-++-...+|..+...|-..+....++ T Consensus 63 LN~~eE~~KGK~kMt~~iesq~A~~eF~~vL~~N~G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v 142 (393) T protein:vir:16 63 LNAQEEKPKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPV 142 (393) T ss_pred hhhhhhcchhhHHHHHHHhhHHHHHHHHHHHhccCCchhhhhhhhhhHhhcCcceeccchhccHHHHHHHHHhhhccCcc Confidence 11 11111112111111 1112222322211 1123344578888888888888888888 Q ss_pred hhhcceeecCCCceEEEE-EeCCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhc---ChHHHHHHHHHH Q lcl|Aclame:pro 58 MQLGKYEPMEGTEKKFTF-WADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNY---TYSQFFEEMKPM 133 (324) Q Consensus 58 ~~l~~~~~~~~~~~~ip~-~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~d---s~~~~~~~i~~~ 133 (324) ++.+-+...+. +-.-+ ..+...+...-.|..+.+....|..-++.+.-++....+ -|+..+ +...+..++..+ T Consensus 143 ~~vfHVT~~~~--~~V~~s~~s~~eAq~HkdGqTK~eqa~~~~~~Tl~~~~VY~~~S~-Ae~~K~~~~sYsel~N~i~~E 219 (393) T protein:vir:16 143 FKVFHVTNVGA--LLVSRSFDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSL-AERVKRLQMSYSELYNLIVAE 219 (393) T ss_pred eeeeeeccchh--hhHHhhhhhhhhhhhhccCCccccceeeeeeechhHHHHHHHHHH-HHHHHHhhhhHHHHHHHHHHH Confidence 88654433222 11112 122335666778888888888888878777555444444 334443 445568999999 Q ss_pred HHHHHH-HHHHHHHHhccCccccccccccc-----c-ccccccccchhhhhH-HHHHHHHhh-hhcCCCcEEEEcHHH-H Q lcl|Aclame:pro 134 IAEAFY-KKFDEAGILNQGNNPFGKSIAQS-----I-EKTNKVIKGDFTQDN-IIDLEALLE-DDELEANAFISKTQN-R 203 (324) Q Consensus 134 l~~ai~-~~~d~~~l~G~g~~~~~~~~~~~-----~-~~~~~~~~~~~~~~~-i~~~~~~l~-~~~~~~~~~v~~~~~-~ 203 (324) |+.++. +.+|.+++-|+|.++....-.-+ . .+.....++...+.| |-.+..-+. .+++ .-+++.... . T Consensus 220 LtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~k~Ttkaksagktpfadaieeavdfvrptagr--rylivktedrk 297 (393) T protein:vir:16 220 LTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGR--RYLIVKTEDRK 297 (393) T ss_pred HHHHHHHHHHHhhhheecCCCCccchhhHHHHHHHHHHhhhhhhcCCCchhHHHHHHHhhhccCCCc--eEEEEeccchH Confidence 999998 88999999999988643221100 0 011112234444433 333332221 1111 124555444 3 Q ss_pred HHHHHhhccCCceee---ccCC-cceeecc-eeEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccc Q lcl|Aclame:pro 204 SLLRKIVDPETKERI---YDRN-SDSLDGL-PVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDG 278 (324) Q Consensus 204 ~~l~~~~d~~g~~~~---~~~~-~~~l~G~-pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 278 (324) +.|..++.+....-. .+.. -..-.|. .+++......-+.+++ .|- ++.+-. ++ - T Consensus 298 alldelrqatananvriknddteiasevgvdeiivytgskalkptvl-vdq-kyhidm-qd------------------l 356 (393) T protein:vir:16 298 ALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKALKPTVL-VDQ-KYHIDM-QD------------------L 356 (393) T ss_pred HHHHHHHhhhccCceeeeccchhhhhhcCcceeeeeeccccccceee-ecc-ccccch-hh------------------h Confidence 445555544322211 1111 0011111 1122111111122222 221 111100 00 0 Q ss_pred cchhh--hhcCcEEEEEEEEeccEEeccCceEEEEee Q lcl|Aclame:pro 279 TPVNL--FEQDMVALRATMHVALHIADDKAFAKLVPA 313 (324) Q Consensus 279 ~~~~~--f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~ 313 (324) +-++. |..|.--+.++.-..+.|---+|-+.++.. T Consensus 357 tkvdafewktnsnmilvetltsghvetynagavitvs 393 (393) T protein:vir:16 357 TKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) T ss_pred hhhhhheeccCCceEEEeecccCcceeeccceeEeeC Confidence 11111 333333333444444444444555555554 No 207 >protein:vir:80835 Length: 464 # NCBI annotation: putative major capsid protein # Family: family:all:2450 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504125;genbank:gi:158079312;genbank:GeneID:5666484 Probab=94.26 E-value=0.0049 Score=33.24 Aligned_cols=314 Identities=11% Similarity=0.030 Sum_probs=133.8 Q ss_pred CchhHHHHHHHHH-HHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhh--hhhhcceeecCCCceEEEEEe Q lcl|Aclame:pro 1 MEQTQKLKLNLQH-FASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSK--IMQLGKYEPMEGTEKKFTFWA 77 (324) Q Consensus 1 ~~~~~~~k~~~~~-~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~--l~~l~~~~~~~~~~~~ip~~~ 77 (324) .+||.+.+..... ...|..... .. -+..+-++++++--+.+..+|......... +.+-..+.+..+-..+|-... T Consensus 3 ~~~n~~~~~~~~~e~~~Ks~ttg-y~-~~p~~q~~~~AlRrEsL~~~i~~Lt~~~~~f~f~~di~k~~a~STV~~y~~~~ 80 (464) T protein:vir:80 3 EKKNTERQLTSVQEEVIKGFTTG-YG-ITPESQTDAAALRREFLDDQITMLTWADGDLSFYRDITKRPATSTVAKYDVYL 80 (464) T ss_pred cchhhHhhcCcccHHHHHHHHhC-Cc-cCcccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhhheee Confidence 3333222222111 111111110 11 111222345556555555555444333222 333334445555444444433 Q ss_pred C---CcceeeeccCccccccccceeeEEeeheeeEEeeeeh--HHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCc Q lcl|Aclame:pro 78 D---KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVT--KEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGN 152 (324) Q Consensus 78 ~---~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS--~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~ 152 (324) + ...+.++.|++..+.+++++...+...+=+...-.+| ..+. ++..+.+....+.-...++..+|.++|+|+.. T Consensus 81 ~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~Kfl~~~r~vsia~~lv-n~~~d~~~~~~~dai~~va~tiE~a~FyGds~ 159 (464) T protein:vir:80 81 AHGRVGHTRFTREIGVAPISDPNLRQKTVNMKYVSDTKNMSIATGLV-NNIEDPMRILTDDAISVVAKTIEWASFYGDSD 159 (464) T ss_pred ccCccccccccccccccccCCCceEEEEEEeeeeecceeeeeehhhh-cchhhHHHHHHHHHHHHHHHHHHHHHhhhccc Confidence 2 3457899999999999999998888766444333333 3333 45678888888888888999999999999753 Q ss_pred cc---------ccccccccccccccc-c-cchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHH-HHhhccCCceeecc Q lcl|Aclame:pro 153 NP---------FGKSIAQSIEKTNKV-I-KGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLL-RKIVDPETKERIYD 220 (324) Q Consensus 153 ~~---------~~~~~~~~~~~~~~~-~-~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l-~~~~d~~g~~~~~~ 220 (324) =. ...|+.......+.. + ...++.+.|..+-..+...|..++-++|+..+.+.+ ....+.+-+-+ .+ T Consensus 160 l~~~~~~~~gleFDGl~~lI~~~NViDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~v~a~f~n~~l~~q~~~~-~~ 238 (464) T protein:vir:80 160 LSENPDAGSGLEFDGLAKLIDKHNVLDAKGASLTEALLNQASVLVGKGYGTPTDAYMPIGVQADFVNQQLDRQVQVI-SD 238 (464) T ss_pred cCCCCCCccccchhhhHhhcCCCceeecCCCCcCHHHHhhhhhhhhcccCChhhcccchhHHHHHHhhhcCceeEEE-cC Confidence 11 112222222111111 1 123455566666667778888888899999998775 44443332222 22 Q ss_pred CCcceeecceeEeecC--CCCC-CceeEEeecccE-----E-EEEecceEEEEeeccceeccccccccchhhhhcCcEEE Q lcl|Aclame:pro 221 RNSDSLDGLPVVNLKS--SNLK-RGELITGDFDKL-----I-YGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVAL 291 (324) Q Consensus 221 ~~~~~l~G~pv~~~~~--~~~~-~~~~i~gd~s~~-----~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ 291 (324) .......|+|+.-..+ .... .+..++.+.... . -+.....++...- ..+.++....-.......+ T Consensus 239 n~~~~~~G~~v~~f~sa~G~i~L~~s~~m~~~~~ld~~~~~~~~apaapsvt~tv------~~~~~g~f~~~~~~~~~~Y 312 (464) T protein:vir:80 239 NGQNATMGFNVKGFNSARGFIRLHGSTVMELEQILDENRMQLPNAPQKATVKATL------EAGTKGKFRDEDLTIDTEY 312 (464) T ss_pred CCCcceeeeecccccccccceeccCccccCcccccccccccCCCCcCCceeEEEe------cCCcccCCccccccceeEE Confidence 2333355555421100 0000 000011111000 0 0000011111100 0011111000000011122 Q ss_pred EEEEEe-----------ccEEeccCceEEEEeecCC-CCCCCCCC Q lcl|Aclame:pro 292 RATMHV-----------ALHIADDKAFAKLVPADKR-TDSVPGEV 324 (324) Q Consensus 292 r~~~r~-----------d~~v~~~~A~~~l~~~~~~-~~~~~~~~ 324 (324) ++...- +..+...+.-+.|+....+ ..+.|==+ T Consensus 313 kv~~vn~~GeS~ps~~~~~ti~~~~~~V~l~it~~~~~~~~p~yv 357 (464) T protein:vir:80 313 KVVVVSDDAESAPSDVASVVIDDKKKQVKLEITINNMYQARPQYV 357 (464) T ss_pred EEEEECCCCccccceeeeeeecCcccEEEEEEEeCCccccccceE Confidence 222221 2222222233333332111 11111000 No 208 >protein:vir:94870 Length: 318 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762518;genbank:gi:115304217;genbank:GeneID:5141183 Probab=94.21 E-value=0.0028 Score=34.62 Aligned_cols=288 Identities=14% Similarity=0.147 Sum_probs=139.3 Q ss_pred CchhHHHHHHH---HHHHhhhhhHHhhccc---cccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEE Q lcl|Aclame:pro 1 MEQTQKLKLNL---QHFASNNVKPQVFNPD---NVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFT 74 (324) Q Consensus 1 ~~~~~~~k~~~---~~~a~~~~~~~~~~~~---~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip 74 (324) +|..+...++. +.-..+......|++. +-.+-++..+.+|..++..|-..+...+++.+.+-+.+++. +-.. T Consensus 5 iesqnavteffdvlkknsgkseiknawnaklaengvtitdttfqlprklvesintallntnpvfkvfhvtnvga--llvs 82 (318) T protein:vir:94 5 IESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGA--LLVS 82 (318) T ss_pred hhhhhhHHHHHHHHhcccChhhhhhhhhhhhhhCCceeecchhhhHHHHHHhhhhhhccCCcceeeeeehhhhh--eeee Confidence 45554444442 2222222222233321 11222344567888888888888888888888765544433 2233 Q ss_pred E-EeCCcceeeeccCccccccccceeeEEeeheeeEEeeeehHH--HhhcChHHHHHHHHHHHHHHHHHH-HHHHHHhcc Q lcl|Aclame:pro 75 F-WADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKE--FLNYTYSQFFEEMKPMIAEAFYKK-FDEAGILNQ 150 (324) Q Consensus 75 ~-~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e--~l~ds~~~~~~~i~~~l~~ai~~~-~d~~~l~G~ 150 (324) + ..++.+++....|+.+.+...++.--++.|.-++....+... -++.|...+...|..++..+|..+ +|-+++.|+ T Consensus 83 rsfdssneaqvhkdgqtkteqaatltidtlepvmvyklqslaervkrlqmsyselynlivaeltqaivnkivdlalvegd 162 (318) T protein:vir:94 83 RSFDSSNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGD 162 (318) T ss_pred ccccccchhhhhcccccccccceeeeecccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHhhhhheeeeecC Confidence 3 234556777888999998888887777777666666555543 345677788889999999998887 567888899 Q ss_pred Cccccccccccccc------cccccccchhhhhH-HHHHHHHhh-hhcCCCcEEEEcHHH-HHHHHHhhccCCceee--- Q lcl|Aclame:pro 151 GNNPFGKSIAQSIE------KTNKVIKGDFTQDN-IIDLEALLE-DDELEANAFISKTQN-RSLLRKIVDPETKERI--- 218 (324) Q Consensus 151 g~~~~~~~~~~~~~------~~~~~~~~~~~~~~-i~~~~~~l~-~~~~~~~~~v~~~~~-~~~l~~~~d~~g~~~~--- 218 (324) |+++....-..+.. +.....++...+.| |..+..-+. .+++ .-+++.... .+.|..++.+....-. T Consensus 163 gtngfksidkeadvkkikkittkaksagktpfadaieeavdfvrptagr--rylivktedrkalldelrqatananvrik 240 (318) T protein:vir:94 163 GTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGR--RYLIVKTEDRKALLDELRQATANANVRIK 240 (318) T ss_pred CcchhhhhchhhhHHHHHHhhhhhhhcCCCchhHHHHHHHhhhccCCCc--eEEEEeccchHHHHHHHHhhhcccceEEe Confidence 98765433221110 11112234444433 333332221 1111 124555444 3444555544322211 Q ss_pred ccCC-cceeecc-eeEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhh--hhcCcEEEEEE Q lcl|Aclame:pro 219 YDRN-SDSLDGL-PVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNL--FEQDMVALRAT 294 (324) Q Consensus 219 ~~~~-~~~l~G~-pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--f~~~~v~~r~~ 294 (324) .+.. -..-.|. .+++......-+.+++ .|- ++.+-.. + -+-++. |..|.--+.++ T Consensus 241 nddteiasevgvdeiivytgskavkptvl-vdq-kyhidmq-d------------------ltkvdafewktnsnmilve 299 (318) T protein:vir:94 241 NDDTEIASEVGVDEIIVYTGSKAVKPTVL-VDQ-KYHIDMQ-D------------------LTKVDAFEWKTNSNMILVE 299 (318) T ss_pred ccchhhhhhcCcceeEEeeccccccceeE-ecc-ceecchh-h------------------hhhhhceeeccCCceEEEE Confidence 1111 0011121 1222211112122222 221 1111100 0 011111 33333333344 Q ss_pred EEeccEEeccCceEEEEee Q lcl|Aclame:pro 295 MHVALHIADDKAFAKLVPA 313 (324) Q Consensus 295 ~r~d~~v~~~~A~~~l~~~ 313 (324) .-..+.+---+|-+.++.. T Consensus 300 tltsghvetynagavitvs 318 (318) T protein:vir:94 300 TLTSGHVETYNAGAVITVS 318 (318) T ss_pred ecccCcceeecCceeEEeC Confidence 4444444444555555554 No 209 >protein:vir:6061 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878202;genbank:gi:33438901;genbank:GeneID:1457736 Probab=94.13 E-value=0.0053 Score=33.06 Aligned_cols=298 Identities=13% Similarity=0.100 Sum_probs=155.4 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccc-cccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceE-EEEEeC Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMM-HEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKK-FTFWAD 78 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~-~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~-ip~~~~ 78 (324) |++... ..+.+|.. +.....++.. ....-+.|-|.+.+.+...+.+.|-+++.++.+++..-... +-...+ T Consensus 1 M~~~tr--~~~~~y~~-----~~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~ 73 (357) T protein:vir:60 1 MRQETR--FKFNAYLS-----RVAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVT 73 (357) T ss_pred CChHHH--HHHHHHHH-----HHHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccC Confidence 765533 33333332 1111122221 11234668888999999999999999999998887654333 333333 Q ss_pred Ccceeeec--cC-ccccccccceeeEEeeheeeEEeeeehHHHhhc--ChHHHHHHHHHHHHHHHHHHHHHHHHhccCcc Q lcl|Aclame:pro 79 KPGAYWVG--EG-QKIETSKATWVNATMRAFKLGVILPVTKEFLNY--TYSQFFEEMKPMIAEAFYKKFDEAGILNQGNN 153 (324) Q Consensus 79 ~~~a~~v~--Eg-~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~d--s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~ 153 (324) ++-++-+. -+ ...|.+-..++...+..++.-.-..|+.+.|+. ..+++...+.+.+.+.++.-.-.-.|+|+--. T Consensus 74 g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A 153 (357) T protein:vir:60 74 GSIASTTDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDLIMAGFNGVRRA 153 (357) T ss_pred cccccccccCCCCCcccccccccCCCccEEEEeeeeccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeee Confidence 44343321 11 222333345566666666666666777777763 34788888888888888776666667775311 Q ss_pred cc------ccc-------c-----------ccc-----cccccc-c-ccchhhhhHH----HHHHHH-hhhhcCCC--cE Q lcl|Aclame:pro 154 PF------GKS-------I-----------AQS-----IEKTNK-V-IKGDFTQDNI----IDLEAL-LEDDELEA--NA 195 (324) Q Consensus 154 ~~------~~~-------~-----------~~~-----~~~~~~-~-~~~~~~~~~i----~~~~~~-l~~~~~~~--~~ 195 (324) .. |.+ + ... ...... . ....-+|..| .+++.. |+..+++. -+ T Consensus 154 ~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLV 233 (357) T protein:vir:60 154 ETSDRSSNQMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLV 233 (357) T ss_pred ccCChhhCcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEE Confidence 10 100 0 000 000000 0 0011234333 345543 46666653 36 Q ss_pred EEEcHHHHH--HHHHhhccCCceeec-----cCCcceeecceeEeecCCCCCCceeEEeecccEEEEE-ecceEEEEeec Q lcl|Aclame:pro 196 FISKTQNRS--LLRKIVDPETKERIY-----DRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGI-PQLIEYKIDET 267 (324) Q Consensus 196 ~v~~~~~~~--~l~~~~d~~g~~~~~-----~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~-~~~~~~~~~~~ 267 (324) .+|..+..+ ++..+. ..+.|--. -....++-|+|.+..|. .|.+.+++--++++-+-. .+..+-.+-+. T Consensus 234 vivG~dLla~k~~~l~n-~~~~pTE~~Aa~~i~s~k~iGGl~a~~~Pf--FP~~~llVT~L~NLsIY~Q~gs~RR~~~d~ 310 (357) T protein:vir:60 234 VIVGRQLLADKYFPIVN-REQDNSEMLAADVIISQKRIGNLPAVRVPY--FPADAMLITKLENLSIYYMDDSHRRVIEEN 310 (357) T ss_pred EEEchhhhhHHhhhHhh-cCCChHHHHHHHHHHHhhhhcCcceEEccc--cCCCceEEeeccccEEEEecCcEEEEEEec Confidence 777777654 222222 22233211 11235799999988775 556677777776653322 22322222111 Q ss_pred cceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 268 AQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) Q Consensus 268 ~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~~~~~ 324 (324) . .+|.+.-.=....|+.|-+.++++.++...-.....|++- T Consensus 311 p----------------~r~riE~y~s~Ne~YvVEd~~~~a~iE~i~~~~~~~pa~~ 351 (357) T protein:vir:60 311 P----------------KLDRVENYESMNIDYVVEDYAAGCLVEKIKVGDFSTPAKA 351 (357) T ss_pred c----------------ccccccchhhhcceeeeeccccEEEeeeeeeccCcccccC Confidence 1 1222222223355788888888888876555544445555 No 210 >protein:vir:2016 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046760;genbank:gi:9630331;genbank:GeneID:1261541 Probab=93.88 E-value=0.0061 Score=32.73 Aligned_cols=299 Identities=14% Similarity=0.111 Sum_probs=156.3 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccc-cccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceE-EEEEeC Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMM-HEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKK-FTFWAD 78 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~-~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~-ip~~~~ 78 (324) |++... ..+.+|... .....++.. ....-+.|-|.+.+.+...+.+.|-+++.++.+++..-... +-...+ T Consensus 1 M~~~tr--~~~~~y~~~-----~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~ 73 (357) T protein:vir:20 1 MRQETR--FKFNAYLSR-----VAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVT 73 (357) T ss_pred CChHHH--HHHHHHHHH-----HHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccC Confidence 765533 333333321 111122221 11234668888999999999999999999998887654333 333333 Q ss_pred Ccceeeec--cC-ccccccccceeeEEeeheeeEEeeeehHHHhhc--ChHHHHHHHHHHHHHHHHHHHHHHHHhccCcc Q lcl|Aclame:pro 79 KPGAYWVG--EG-QKIETSKATWVNATMRAFKLGVILPVTKEFLNY--TYSQFFEEMKPMIAEAFYKKFDEAGILNQGNN 153 (324) Q Consensus 79 ~~~a~~v~--Eg-~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~d--s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~ 153 (324) ++-++-+. -+ ...|.+-..++...+..++.-.-..|+.+.|+. ..+++...+.+.+.+.++.-.-.-.|+|+--. T Consensus 74 g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A 153 (357) T protein:vir:20 74 GSIASTTDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRIRNAIIKRQSLDFIMAGFNGVKRA 153 (357) T ss_pred ccccccccCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeee Confidence 44343321 11 122333345566666666666666777777763 34788888888888888776666667775311 Q ss_pred cc------ccc-------c-----------ccc-----cccccc-c-ccchhhhhHH----HHHHHH-hhhhcCCC--cE Q lcl|Aclame:pro 154 PF------GKS-------I-----------AQS-----IEKTNK-V-IKGDFTQDNI----IDLEAL-LEDDELEA--NA 195 (324) Q Consensus 154 ~~------~~~-------~-----------~~~-----~~~~~~-~-~~~~~~~~~i----~~~~~~-l~~~~~~~--~~ 195 (324) .. |.+ + ... ...... . ....-+|..| .+++.. |+..+++. -+ T Consensus 154 ~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLV 233 (357) T protein:vir:20 154 ETSDRSSNPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGRTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLV 233 (357) T ss_pred ccCChhhCcCccccchhHHHHHHhhchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEE Confidence 10 100 0 000 000000 0 0011234333 345543 46666653 36 Q ss_pred EEEcHHHHHH-HHHhhccCCceeec--c---CCcceeecceeEeecCCCCCCceeEEeecccEEEEE-ecceEEEEeecc Q lcl|Aclame:pro 196 FISKTQNRSL-LRKIVDPETKERIY--D---RNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGI-PQLIEYKIDETA 268 (324) Q Consensus 196 ~v~~~~~~~~-l~~~~d~~g~~~~~--~---~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~-~~~~~~~~~~~~ 268 (324) .+|..+..+. ...+-...+.|--. . ....++-|+|.+..|. .|.+.+++--++++-+-. .+..+-.+-+.. T Consensus 234 vivG~dLla~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~Pf--FP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p 311 (357) T protein:vir:20 234 VIVGRQLLADKYFPIVNKEQDNSEMLAADVIISQKRIGNLPAVRVPY--FPADAMLITKLENLSIYYMDDSHRRVIEENP 311 (357) T ss_pred EEEchhhhhhhhhhHhhccCChHHHHHHHHHHHhhhhCCceeEEccc--cCCCceEEeeccccEEEEecCcEEEEEEecc Confidence 7777776542 22222222233211 1 1235799999988775 556677777776653322 222222221111 Q ss_pred ceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 269 QLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) Q Consensus 269 ~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~~~~~ 324 (324) .+|.+.-.=....|+.|-+.++++.++...-.....|++. T Consensus 312 ----------------~r~riE~y~s~Ne~YvVEd~~~~a~iE~i~~~~~~~p~~~ 351 (357) T protein:vir:20 312 ----------------KLDRVENYESMNIDYVVEDYAAGCLVEKIKVGDFSTPAKA 351 (357) T ss_pred ----------------ccccccchhhhcceeeeeccccEEEeeeeeeccccCCccC Confidence 1222222223456788888888888876666555555555 No 211 >protein:vir:93966 Length: 400 # NCBI annotation: structural protein # Family: family:all:2417 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764320;genbank:gi:115315634;genbank:GeneID:5176553 Probab=93.84 E-value=0.0026 Score=34.74 Aligned_cols=287 Identities=14% Similarity=0.138 Sum_probs=129.7 Q ss_pred CchhHHHHHHHHHHH---hhhhhHHhhccccc---cccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEE Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFA---SNNVKPQVFNPDNV---MMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFT 74 (324) Q Consensus 1 ~~~~~~~k~~~~~~a---~~~~~~~~~~~~~~---~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip 74 (324) ++.++..-+|.+-.+ .+..++.-|.+.-+ .+-++-...+|..+...|-..+....++++.+-+...+. +-.- T Consensus 87 i~sq~A~~eF~~vL~~N~G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~--~~V~ 164 (400) T protein:vir:93 87 IESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGA--LLVS 164 (400) T ss_pred HhhHHHHHHHHHHHhccCCchhhhhhhhhhHhhcCcceeccchhccHHHHHHHHHhhhccCcceeeeeeccchh--hhHH Confidence 111111111211111 11112222322211 112334467888888888888888888888654433222 1111 Q ss_pred E-EeCCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhc---ChHHHHHHHHHHHHHHHH-HHHHHHHHhc Q lcl|Aclame:pro 75 F-WADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNY---TYSQFFEEMKPMIAEAFY-KKFDEAGILN 149 (324) Q Consensus 75 ~-~~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~d---s~~~~~~~i~~~l~~ai~-~~~d~~~l~G 149 (324) + ..+...+...-.|..+.+...+|..-++.+.-++....+ -|+..+ +...+..++..+|+.++. +.+|.+++-| T Consensus 165 ~s~~s~~~Aq~HkdGqTK~eqa~~~~~~Tl~~~~VY~~~S~-Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~G 243 (400) T protein:vir:93 165 RSFDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG 243 (400) T ss_pred hhhhhhhhhhhhccCCccccceeeeeeechhHHHHHHHHHH-HHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhhhee Confidence 2 122335667778888888888888888877555444444 233333 445668999999999998 8899999999 Q ss_pred cCccccccccccc-----c-ccccccccchhhhhH-HHHHHHHhh-hhcCCCcEEEEcHHH-HHHHHHhhccCCceee-- Q lcl|Aclame:pro 150 QGNNPFGKSIAQS-----I-EKTNKVIKGDFTQDN-IIDLEALLE-DDELEANAFISKTQN-RSLLRKIVDPETKERI-- 218 (324) Q Consensus 150 ~g~~~~~~~~~~~-----~-~~~~~~~~~~~~~~~-i~~~~~~l~-~~~~~~~~~v~~~~~-~~~l~~~~d~~g~~~~-- 218 (324) +|.++....-.-+ . .++....++...+.| +-.+..-+. .+++ ..+++.... .+.|..++.+....-. T Consensus 244 DG~N~f~~~DK~advK~I~~~Ttkaksagktpfadaieeavdfvrptagr--rylivktedrkalldelrqatanahvri 321 (400) T protein:vir:93 244 DGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGR--RYLIVKTEDRKALLDELRQATANAHVRI 321 (400) T ss_pred cCCCCccchhhHHHHHHHHHHhhhhhhcCCCchhHHHHHHHhhhccCCCc--eEEEEeccchHHHHHHHHhhccccceEe Confidence 9988643221110 0 011112234444433 333332221 1111 124555444 4445556544332211 Q ss_pred -ccC-Ccceeecc-eeEeecCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhh--hhcCcEEEEE Q lcl|Aclame:pro 219 -YDR-NSDSLDGL-PVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNL--FEQDMVALRA 293 (324) Q Consensus 219 -~~~-~~~~l~G~-pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--f~~~~v~~r~ 293 (324) .+. .-..-.|. .+++......-+.+++ .|- ++.+-. ++ -+-++. |..|.--+.+ T Consensus 322 knddaeiasevgvdeiivytgskalkptvl-vdq-kyhidm-qd------------------ltkvdafewktnsnmilv 380 (400) T protein:vir:93 322 KNDDAEIASEVGVDEIIVYTGSKALKPTVL-VDQ-KYHIDM-QD------------------LTKVDAFEWKTNSNMILV 380 (400) T ss_pred ecchhhhhhhcCcceeeeeeccccccceee-ecc-ccccch-hh------------------hhhhhhheeccCCceEEE Confidence 111 11111121 1122111111122222 221 111100 00 011111 3333333334 Q ss_pred EEEeccEEeccCceEEEEee Q lcl|Aclame:pro 294 TMHVALHIADDKAFAKLVPA 313 (324) Q Consensus 294 ~~r~d~~v~~~~A~~~l~~~ 313 (324) +.-..+.|---+|-+.++.. T Consensus 381 etltsghvetynagavitvs 400 (400) T protein:vir:93 381 ETLTSGHVETYNAGAVITVS 400 (400) T ss_pred eecccCcceeeccceeEeeC Confidence 44444444444555555554 No 212 >protein:vir:100851 Length: 514 # NCBI annotation: hypothetical protein # Family: family:all:2450 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164744;genbank:gi:56693157;genbank:GeneID:3197484 Probab=93.58 E-value=0.0071 Score=32.38 Aligned_cols=303 Identities=11% Similarity=0.036 Sum_probs=143.7 Q ss_pred CchhHHHHHHHHH-HHhh----------hhhHHhhcccccccc-------------ccCccccchHHHHHHHHHHHhhhh Q lcl|Aclame:pro 1 MEQTQKLKLNLQH-FASN----------NVKPQVFNPDNVMMH-------------EKKDGTLMNEFTTPILQEVMENSK 56 (324) Q Consensus 1 ~~~~~~~k~~~~~-~a~~----------~~~~~~~~~~~~~~~-------------~~~~~~vp~~~~~~i~~~~~~~s~ 56 (324) |-...|.|..++. |-.. +..-.+..++++..+ ++++++--+.+..++......... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~a~t~gy~~~~~~~t~gaAlR~EsLd~~l~~Lt~~~~~ 80 (514) T protein:vir:10 1 MYTQDKTKDIMKKSFFGGDRAVAFDTNKEDILNENLPENVKKSAFTAGHSITPDTQTDGAANRIESLNRDLKVTTWGERD 80 (514) T ss_pred CCccchhhHHHhhhhcccceeeeecCcHHHHHHHhcchhhhhhhhccccccCCccccCccchhhhhhccceeEeeecCcc Confidence 7666666666654 2111 111112233322222 122223222222222222111111 Q ss_pred --hhhhcceeecCCCceEEEEEe---CCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHh-hcChHHHHHHH Q lcl|Aclame:pro 57 --IMQLGKYEPMEGTEKKFTFWA---DKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFL-NYTYSQFFEEM 130 (324) Q Consensus 57 --l~~l~~~~~~~~~~~~ip~~~---~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l-~ds~~~~~~~i 130 (324) +.+-..+.+..+-..+|-... ....+.+++|++-.+.+++.+...++..+=++....+|.-+- .++..+.+... T Consensus 81 ftf~~~i~k~~a~STV~ey~~~~~~G~~G~~~f~~E~gi~~~~d~~~~rk~~~~k~l~~~~~vS~~~~l~n~i~d~~~~~ 160 (514) T protein:vir:10 81 FTLYNDIAKQPVDNTVLKYTQYYSHGRTGHSLFQPEIGIGDVNNPNERQRTINIKYIVDTHVTSIALQRANTIVDSLKVQ 160 (514) T ss_pred hhhhhhcCCchhhHHHhhhhhhcccCcccccccccccccCcCCCcceEEEEEeeeeeeeeeeeeehhhhccchhhHHHHH Confidence 222223334433333333332 234567999999999999999999988888887766665332 24678899999 Q ss_pred HHHHHHHHHHHHHHHHHhccCc---ccc-----cccccccccccccc-cc-chhhhhHHHHHHHHhhhhcCCCcEEEEcH Q lcl|Aclame:pro 131 KPMIAEAFYKKFDEAGILNQGN---NPF-----GKSIAQSIEKTNKV-IK-GDFTQDNIIDLEALLEDDELEANAFISKT 200 (324) Q Consensus 131 ~~~l~~ai~~~~d~~~l~G~g~---~~~-----~~~~~~~~~~~~~~-~~-~~~~~~~i~~~~~~l~~~~~~~~~~v~~~ 200 (324) .+.-...++..+|.++|+|+.. +.+ ..|+.......+.. +- ..++.+.|..+-..+...|..++-++|+. T Consensus 161 ~~dai~~ia~tiE~a~FyGDs~L~s~~~~~gleFDGl~~lI~~~NvIDarG~~Ls~~~ln~aA~~i~~gfGt~TD~ylp~ 240 (514) T protein:vir:10 161 EYAAISTVIKTDEWAMFYGDADLTSGQKGEGLQFDGLFKLIAPENHIDLRGGRLSPAALNMAARKIGEGFGTPTDAYMPI 240 (514) T ss_pred HHHHHHHHHHHHHHHHhhhcccCCCccccCcchhhhHHHhhcCCCeEecCCCCccHHHHhhhhhhhhcccCChhheeCch Confidence 9999999999999999999743 222 12222222211111 11 23444445555555666777888899999 Q ss_pred HHHHHHHHhhccCCceeeccCCcceeecceeEe--ecCCCCCC-ceeEEeecccEEEEEecceEEEEeeccceecccccc Q lcl|Aclame:pro 201 QNRSLLRKIVDPETKERIYDRNSDSLDGLPVVN--LKSSNLKR-GELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNED 277 (324) Q Consensus 201 ~~~~~l~~~~d~~g~~~~~~~~~~~l~G~pv~~--~~~~~~~~-~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~ 277 (324) .+.+.+..-.....|-++.........|.|+-- +....... +..+++....+ +....+...+ T Consensus 241 ~vka~f~~~~~~~qRV~~~~n~~~~~~G~~v~~f~s~~G~I~L~gs~im~~~n~L------~~~~~~~~~A--------- 305 (514) T protein:vir:10 241 GIKADFVNQHLNGQRVMLPGQTGGMTTGLDIDKFLSAHGSIRIQGSTIMDSDNKL------DFDRPVSPTA--------- 305 (514) T ss_pred HHHHHHhhcccCcceEEeecCccceeeeeeccceeEeccceeecCCeeecccccC------ccCCccCCcC--------- Confidence 999988766655555555544455566776631 11111100 11111111000 0000000000 Q ss_pred ccchhhhhcCcEEEEEEEEec-------cE------EeccCceE----EEEeecCCCCCCCCCC Q lcl|Aclame:pro 278 GTPVNLFEQDMVALRATMHVA-------LH------IADDKAFA----KLVPADKRTDSVPGEV 324 (324) Q Consensus 278 ~~~~~~f~~~~v~~r~~~r~d-------~~------v~~~~A~~----~l~~~~~~~~~~~~~~ 324 (324) =+...+++-++..-+ .. ..+.++-+ ++......+...|-++ T Consensus 306 ------p~~~~va~svT~~~~g~~~~ad~t~~~g~~~~~~~~g~~~sYaVv~~n~~GeS~ps~~ 363 (514) T protein:vir:10 306 ------PTAPQLSATVTPDGGGLWHEADKTDSKGEVILNKEVGVEQSYVAVMVSRHGDSRPSLV 363 (514) T ss_pred ------CCCCcceEEEecCcccccCcccccccccccccccccceeEEEEEEEECCCCcccccce Confidence 001111121111100 00 11122221 2333344444555555 No 213 >protein:vir:5694 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839853;genbank:gi:30065708;genbank:GeneID:1260602 Probab=93.56 E-value=0.0071 Score=32.35 Aligned_cols=299 Identities=14% Similarity=0.111 Sum_probs=155.5 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccc-cccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceE-EEEEeC Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMM-HEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKK-FTFWAD 78 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~-~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~-ip~~~~ 78 (324) |++... ..+.+|.. +.....++.. ....-+.|-|.+.+.+...+.+.|-+++.++.+++..-... +-.-.+ T Consensus 1 M~~~tr--~~~~~y~~-----~~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~ 73 (357) T protein:vir:56 1 MRQETR--FKFNAYLS-----RVAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVT 73 (357) T ss_pred CChHHH--HHHHHHHH-----HHHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccC Confidence 765533 33333332 1111122221 11234668888999999999999999999998887654333 333333 Q ss_pred Ccceeeec--cC-ccccccccceeeEEeeheeeEEeeeehHHHhhc--ChHHHHHHHHHHHHHHHHHHHHHHHHhccCcc Q lcl|Aclame:pro 79 KPGAYWVG--EG-QKIETSKATWVNATMRAFKLGVILPVTKEFLNY--TYSQFFEEMKPMIAEAFYKKFDEAGILNQGNN 153 (324) Q Consensus 79 ~~~a~~v~--Eg-~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~d--s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~ 153 (324) ++-++-+. -+ ...|.+-..++...+..++.-.-..|+.+.|+. ..+++...+.+.+.+.++.-.-.-.|+|+--. T Consensus 74 g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A 153 (357) T protein:vir:56 74 GSIASTTDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDFIMAGFNGVKRA 153 (357) T ss_pred ccccccccCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeee Confidence 44333321 11 222333345566666666666666777777763 24788888888888888776666667775311 Q ss_pred cc------ccc-------c-----------ccc-----cccccc-c-ccchhhhhHH----HHHHHH-hhhhcCCC--cE Q lcl|Aclame:pro 154 PF------GKS-------I-----------AQS-----IEKTNK-V-IKGDFTQDNI----IDLEAL-LEDDELEA--NA 195 (324) Q Consensus 154 ~~------~~~-------~-----------~~~-----~~~~~~-~-~~~~~~~~~i----~~~~~~-l~~~~~~~--~~ 195 (324) .. |.+ + ... ...... . ....-+|..| .+++.. |+..+++. -+ T Consensus 154 ~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLV 233 (357) T protein:vir:56 154 ETSDRSSNPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLV 233 (357) T ss_pred ccCChhhCcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEE Confidence 10 100 0 000 000000 0 0011234333 345543 46666653 36 Q ss_pred EEEcHHHHHH-HHHhhccCCceeec--c---CCcceeecceeEeecCCCCCCceeEEeecccEEEEE-ecceEEEEeecc Q lcl|Aclame:pro 196 FISKTQNRSL-LRKIVDPETKERIY--D---RNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGI-PQLIEYKIDETA 268 (324) Q Consensus 196 ~v~~~~~~~~-l~~~~d~~g~~~~~--~---~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~-~~~~~~~~~~~~ 268 (324) .+|..+..+. ...+-...+.|--. . ....++-|+|.+..|. .|.+.+++--++++-+-. .+..+-.+-+.. T Consensus 234 vivG~dLla~k~~~l~n~~~~pTE~~Aa~~i~s~k~iGGl~a~~~Pf--FP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p 311 (357) T protein:vir:56 234 VIVGRQLLADKYFPIVNKEQDNSEMLAADVIISQKRIGNLPAVRVPY--FPADAMLITKLENLSIYYMDDSHRRVIEENP 311 (357) T ss_pred EEEchhhhhhhhhhHhhccCChHHHHHHHHHHHhhhhCCceeEEccc--cCCCceEEeeccccEEEEecCcEEEEEEecc Confidence 7777776542 22222222233211 1 1235799999988775 556677777776653322 223222221111 Q ss_pred ceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 269 QLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) Q Consensus 269 ~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~~~~~ 324 (324) .+|.+.-.=....|+.|-+.++++.++...-.....|.+- T Consensus 312 ----------------~r~riE~y~s~Ne~YvVEd~~~~a~iE~i~i~~~~~~~~~ 351 (357) T protein:vir:56 312 ----------------KLDRVENYESMNIDYVVEDYAAGCLVEKIKVGDFSTPAKA 351 (357) T ss_pred ----------------ccccccchhhhcceeeeeccccEEEeeeeeeccCCCCccc Confidence 1222222223355788888888888876665544445444 No 214 >protein:vir:102823 Length: 470 # NCBI annotation: major structural protein # Family: family:all:2450 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874086;genbank:gi:118197693;genbank:GeneID:4496015 Probab=92.86 E-value=0.0021 Score=35.26 Aligned_cols=287 Identities=14% Similarity=0.101 Sum_probs=131.9 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhh--hhhhcceeecCCCceEEEEEe- Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSK--IMQLGKYEPMEGTEKKFTFWA- 77 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~--l~~l~~~~~~~~~~~~ip~~~- 77 (324) |-- +.+++ ..+....+.+. ....|+++--+.+..++......... +.+-..+.+..+-..+|-+.. T Consensus 1 ~~~-----~~~~~-----~~~a~~~al~~-a~~~g~AlR~EsLd~~l~~lt~~~~~ftf~~~i~k~~a~STV~ey~~~~~ 69 (470) T protein:vir:10 1 MPY-----EHLKH-----LDEATLKALNA-AGQVAESLEREDLEPEVTQLNVLDTPLTDLLSKNAVKAKAYEHEYNVVTA 69 (470) T ss_pred CCh-----hHhhh-----hhHHHHHHHHH-hhhcchhhhhhhhccceeEeeecCccchhhhhcCCchhhhHhhhhhhhcc Confidence 211 11222 11111111111 11123333222222222211111111 222223344444333443322 Q ss_pred --CCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHH---hhcChHHHHHHHHHHHHHHHHHHHHHHHHhccC- Q lcl|Aclame:pro 78 --DKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEF---LNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG- 151 (324) Q Consensus 78 --~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~---l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g- 151 (324) +........|++-.+.+++++...+..++=++....||.-+ ++....+++....+.---.++..+|.++|+|+. T Consensus 70 rhG~~g~s~~~E~~l~~~~d~~~~Rr~v~~K~l~~~~~VT~~a~~~~~n~v~d~~~~~~~dai~~ia~tiE~a~FyGDs~ 149 (470) T protein:vir:10 70 RHDKIGYAAFREGGLPRTVEVNVVRRRIRPMLVGHRITVTELATRTTQNGVMQIDELVKREKMIAVANEFEYLAFYGDNL 149 (470) T ss_pred ccccccceeecccccCccCCCceEEEEEEEEEEeecchhhhhhhhhhhccccchHHHHHHHHHHHHHHHHHhhhhhhccc Confidence 22223345899999999999999999999999998999764 333456899999999999999999999999965 Q ss_pred --ccc-------ccccccccccc--cccc--c-cchhhhhHHHHHHHHh--hhhcCCCcEEEEcHHHHHHHHHhhccCCc Q lcl|Aclame:pro 152 --NNP-------FGKSIAQSIEK--TNKV--I-KGDFTQDNIIDLEALL--EDDELEANAFISKTQNRSLLRKIVDPETK 215 (324) Q Consensus 152 --~~~-------~~~~~~~~~~~--~~~~--~-~~~~~~~~i~~~~~~l--~~~~~~~~~~v~~~~~~~~l~~~~d~~g~ 215 (324) ++. +..|+.+...- ...+ + ...++.+.|..+-..+ ...|..++-++|+..+.+.|..-.....| T Consensus 150 l~s~~~g~~~gleFDGl~~lId~~~~~NViDarG~~Ls~~~L~~aa~~I~~~~~fGt~TD~~lp~~vka~f~~~~~~~qR 229 (470) T protein:vir:10 150 LGDDVPGSPNNLQQDGIINIIKRGAPQNVLDAGGRPLSIDLLWEAESRVVSTQAFANPTAVFISYVDKLNLQASFYQISR 229 (470) T ss_pred cccccCcccCceeccchhhhccCCCCccccccCCCCccHHHHHHHHhhhcccccccChhhhccchhHHHHHHHhhcCceE Confidence 211 12233221110 0111 1 1234455566666666 46788888899999999999877777667 Q ss_pred eeeccCCcceeecceeEeecCCCCCCceeEEeecccEEEEE-ecceEEEEeeccceeccccccccchhhhhcCcEEEEEE Q lcl|Aclame:pro 216 ERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGI-PQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRAT 294 (324) Q Consensus 216 ~~~~~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~ 294 (324) -++....+....|+|+.-..+ ..|.+-+ +.+.+ +.. ..--...++++. . + +.-..+..-+ T Consensus 230 v~~~~N~~~~~~G~~v~~f~s---a~G~I~L-~~s~~-m~~~~k~~p~~l~~~v------~------~-~aAP~~~~tv- 290 (470) T protein:vir:10 230 VMTTADRRAGLLGADAQSYIG---VRGEHSL-YPSQF-LGDFHKFNPARFGAEV------G------D-FAAPSNSWTV- 290 (470) T ss_pred EEEecCCCceeeeeeccceee---eeeeeee-ccccc-ccchhhcCcccCCccc------C------C-cccCceeEEe- Confidence 666666555668888742111 0222211 11110 000 000000000000 0 0 0000000000 Q ss_pred EEeccEEeccCceEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 295 MHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) Q Consensus 295 ~r~d~~v~~~~A~~~l~~~~~~~~~~~~~~ 324 (324) .-+.-++.+......++-.+.+| T Consensus 291 -------~~t~~~~a~~~~sk~g~~~~~~v 313 (470) T protein:vir:10 291 -------STTDNFVTLPYNSGLGDPANTTV 313 (470) T ss_pred -------ecCCCceeecccCCCCcccCcce Confidence 00011111111111111112222 No 215 >protein:vir:1153 Length: 338 # NCBI annotation: predicted major capsid protein # Family: family:all:201 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490602;genbank:gi:17313222;genbank:GeneID:927319 Probab=91.79 E-value=0.014 Score=30.71 Aligned_cols=291 Identities=11% Similarity=0.043 Sum_probs=156.2 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceE-EEEEeCC Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKK-FTFWADK 79 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~-ip~~~~~ 79 (324) |++... ..+.+|.. +.....++. +.+..+.|.|.+.+.+.+.+.+.|-+++.++.+++..-... +-...++ T Consensus 1 M~~~tr--~~~~~y~~-----~~A~~ngv~-~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g 72 (338) T protein:vir:11 1 MRNETR--KQFDAYLA-----QLAKLNGVN-SAVQTFAVEPSVQQKLEQRIQESSEFLKQINVYGVDELQGEKIGIGVSG 72 (338) T ss_pred CCHHHH--HHHHHHHH-----HHHHHhCCC-cccceeeeCHHHHHHHHHHHHHHHHhhccCceecccceeeeEeeeccCc Confidence 665533 22333332 111112222 23456678899999999999999999999999888654432 3333344 Q ss_pred cceeeec--c-CccccccccceeeEEeeheeeEEeeeehHHHhhc--ChHHHHHHHHHHHHHHHHHHHHHHHHhccCccc Q lcl|Aclame:pro 80 PGAYWVG--E-GQKIETSKATWVNATMRAFKLGVILPVTKEFLNY--TYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP 154 (324) Q Consensus 80 ~~a~~v~--E-g~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~d--s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~ 154 (324) +-++-+. . ++..|.+-..++...+..++.---..|+.+.|+. ..+++...+.+.+.+.++.-.-.-.|+|+.-.. T Consensus 73 ~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfnG~s~A~ 152 (338) T protein:vir:11 73 TIASRTDTTGDGVRKPRDVSALDNQRYECKHTDFDTAITYAMLDAWAKFPEFQALLRDAILKRQALDRLMIGFNGTSAAA 152 (338) T ss_pred cccccccCCCCCccccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeecc Confidence 4443321 1 1233333335566667777766667788887774 347899999999999888877777778864111 Q ss_pred ------cccc-------c-----------cccccc---ccccccchhhhhHH----HHHHH-HhhhhcCCC--cEEEEcH Q lcl|Aclame:pro 155 ------FGKS-------I-----------AQSIEK---TNKVIKGDFTQDNI----IDLEA-LLEDDELEA--NAFISKT 200 (324) Q Consensus 155 ------~~~~-------~-----------~~~~~~---~~~~~~~~~~~~~i----~~~~~-~l~~~~~~~--~~~v~~~ 200 (324) .|.+ + ...... ........-+|..| .+++. .|+..+++. -+.+|.. T Consensus 153 ~Td~~~nPllqDVNkGWlQ~~Re~ap~rv~~~~~~~~~i~i~~g~~gdy~nLDalV~d~~~~lI~~~~~~d~dLVvivG~ 232 (338) T protein:vir:11 153 TTNRAANPLLQDVNIGWFQQYRNNAPARVLKEGKTTGKVVVGNGADADYKNLDALVFDVVSSLIDPWHRRDPGLVVILGR 232 (338) T ss_pred CCChhhCcCccccchhHHHHHHhhhhhhhhhcccccceeeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEEch Confidence 1111 0 000000 00000111224333 34554 345556544 3678887 Q ss_pred HHHHH-HHHhhccCCceee--c-c--CCcceeecceeEeecCCCCCCceeEEeecccEEEEEecc-eEEEEeeccceecc Q lcl|Aclame:pro 201 QNRSL-LRKIVDPETKERI--Y-D--RNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL-IEYKIDETAQLSTV 273 (324) Q Consensus 201 ~~~~~-l~~~~d~~g~~~~--~-~--~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~-~~~~~~~~~~~~~~ 273 (324) +..+. ...+-.....|.- . . ....++.|+|.+..|. .|.+.+++--++++-+-...| .+-.+-+.. T Consensus 233 dLladk~~~l~n~~~~ptE~~Aa~~~~s~k~iGGlpa~~~Pf--fP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p----- 305 (338) T protein:vir:11 233 ELVHDKYFPMVNKDQPATEKIATDLILSQKRMGGLPPVEVPY--VPEKGLMVTTLKNLSLYWQIGGRRRYLKEVP----- 305 (338) T ss_pred hhhHHHHhHHHhcCCChHHHHHHHHHHHhhhhCCceeEEccc--cCCCceEEeeccccEEEEecCcEEEEEEecc----- Confidence 76542 1122222222221 1 1 1245799999988775 556677777777654433222 222221111 Q ss_pred ccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCC Q lcl|Aclame:pro 274 KNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRT 317 (324) Q Consensus 274 ~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~ 317 (324) ++|.+.-.=....|+.|-+.++++.++...... T Consensus 306 -----------~r~rie~y~s~Ne~YvVEd~~~~a~ieni~~~~ 338 (338) T protein:vir:11 306 -----------EKNRIENYESSNDAYVVEDYGLGCLVENIEVAE 338 (338) T ss_pred -----------ccccccchhhhccceeeeccccEEEeecceecC Confidence 123332223345688888899999887655544 No 216 >protein:vir:100331 Length: 342 # NCBI annotation: major capsid protein N # Family: family:all:201 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655472;genbank:gi:109289940;genbank:GeneID:4157374 Probab=91.75 E-value=0.014 Score=30.68 Aligned_cols=293 Identities=10% Similarity=0.112 Sum_probs=155.1 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhcccccc---ccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceE-EEEE Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVM---MHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKK-FTFW 76 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~---~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~-ip~~ 76 (324) |++... ..+.+|.. +.....++. .+.+--+.|.|.+.+.+...+.+.|-+++.++.+++..-... +-.. T Consensus 1 M~~~tr--~~~~~y~~-----~~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg 73 (342) T protein:vir:10 1 MKDLTL--EKYNAYLA-----RQAELNNLPFNALATGIKFTVQPSVQQKLYEKVRESSDFLKSISFVFVDEQTGETLGLD 73 (342) T ss_pred CChHHH--HHHHHHHH-----HHHHHhCCChhHccccceeecChHHHHHHHHHHHHHHHHhccCcccccccceeeEEecc Confidence 776533 22333332 211122222 111223668888999999999999999999998887654333 3333 Q ss_pred eCCcceeeec---cCccccccccceeeEEeeheeeEEeeeehHHHhhc--ChHHHHHHHHHHHHHHHHHHHHHHHHhccC Q lcl|Aclame:pro 77 ADKPGAYWVG---EGQKIETSKATWVNATMRAFKLGVILPVTKEFLNY--TYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) Q Consensus 77 ~~~~~a~~v~---Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~d--s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g 151 (324) .+++-++-+. -+...|.+-..++...+..++.-.-..|+.+.|+. ..+++...+.+.+.+.++.-.-.-.|+|+. T Consensus 74 ~~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts 153 (342) T protein:vir:10 74 SAHTVASTTDTSGDGERKTTSIAKLVKQTYHCQQINFDTHINYKQLDMWAKFPDFQQKVANVAAKQRKRDLIMIGFNGTS 153 (342) T ss_pred cCcccccccccCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceeccccee Confidence 3444443321 12233344455666667777766667778877763 347888888888888887766666777753 Q ss_pred cccc------ccc------------------cccccccccc-cccchhhhhHH----HHHHHH-hhhhcCCC--cEEEEc Q lcl|Aclame:pro 152 NNPF------GKS------------------IAQSIEKTNK-VIKGDFTQDNI----IDLEAL-LEDDELEA--NAFISK 199 (324) Q Consensus 152 ~~~~------~~~------------------~~~~~~~~~~-~~~~~~~~~~i----~~~~~~-l~~~~~~~--~~~v~~ 199 (324) -... |.+ .......... .....-+|..| .+++.. |+..+++. -+.+|. T Consensus 154 ~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rv~~~~~~~~~i~iG~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG 233 (342) T protein:vir:10 154 RAATSDRNSNPLLQDVAKGWLQKMREDAKERVMNGESTDNQVLVGKGQEYANLDALVMDATEELIDEWHRDDTDLVVITG 233 (342) T ss_pred eccCCChhhCcCccccchHHHHHHHhhhhhhhcccceeccceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEc Confidence 2110 100 0000000000 00111134333 345543 46666653 367777 Q ss_pred HHHHHH--HHHhhccCCceeec---c--CCcceeecceeEeecCCCCCCceeEEeecccEEEE-EecceEEEEeecccee Q lcl|Aclame:pro 200 TQNRSL--LRKIVDPETKERIY---D--RNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYG-IPQLIEYKIDETAQLS 271 (324) Q Consensus 200 ~~~~~~--l~~~~d~~g~~~~~---~--~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~-~~~~~~~~~~~~~~~~ 271 (324) .+..+. +..+.. ...|--. . ....++-|+|.+..|. .|.+.+++--++++-+- ..+..+-.+-+.. T Consensus 234 ~dLladk~~~l~n~-~~~ptE~~Aa~~i~s~k~iGGl~a~~~Pf--FP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p--- 307 (342) T protein:vir:10 234 RKLLADKYFPIVNQ-QNAPTEELAADIVISQKRIGGLKAVRVPF--FPANAILITKLENLAIYVQEGTTRKHIENVP--- 307 (342) T ss_pred hhhhHHHHHHHHhc-CCChHHHHHHHHHHhhhhhcCceeEEccc--cCCCceEEeeccccEEEEecCcEEEEEEecc--- Confidence 776542 222222 2222110 1 1245799999988765 55667777777665332 2233222221111 Q ss_pred ccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCC Q lcl|Aclame:pro 272 TVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDS 319 (324) Q Consensus 272 ~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~ 319 (324) .+|.+.-.=....|+.|-+.++++.++........ T Consensus 308 -------------~r~rie~y~s~Ne~YvVEd~~~~a~iE~i~i~~~~ 342 (342) T protein:vir:10 308 -------------KKDRIETYESENIDYVVEDYGCAALIENITLKDKE 342 (342) T ss_pred -------------ccccccchhhhccceeeeccccEEEeecceecCCC Confidence 12223222334568888999999988866554222 No 217 >protein:vir:348 Length: 321 # NCBI annotation: major virion structural protein # Family: family:all:3198 # MgeID: mge:9 # MgeName: Mx8 # Cross-refs: genbank:acc:NP_203462;genbank:gi:15320618;genbank:GeneID:921734 Probab=91.70 E-value=0.015 Score=30.64 Aligned_cols=276 Identities=10% Similarity=-0.014 Sum_probs=142.5 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhc----ceeecC-CCceEEEE Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLG----KYEPME-GTEKKFTF 75 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~----~~~~~~-~~~~~ip~ 75 (324) |--- .+.+.+. . -=.+.++.+.+.+..+++|+... +..+.+ +.++..|. T Consensus 1 mp~~-~lsel~t--------------------~-----tl~~rs~~~~D~v~~~n~LL~~L~~kG~~~~~~gg~~I~~~l 54 (321) T protein:vir:34 1 MPFP-NISDIIT--------------------T-----TIESRSGVIADNVTKNNAILARLAKRGKPRLVSGGYTILEEL 54 (321) T ss_pred CCCc-hHHHHHH--------------------H-----HHHhhcchhhhhhhcccHHHHHHHhcCcccccCCCeeEEEEE Confidence 1110 0000000 0 01233344455566666655442 222333 35566677 Q ss_pred EeC-Ccceeee-ccCccccccccceeeEEeeheeeEEeeeehH-HHhhcC----hHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 76 WAD-KPGAYWV-GEGQKIETSKATWVNATMRAFKLGVILPVTK-EFLNYT----YSQFFEEMKPMIAEAFYKKFDEAGIL 148 (324) Q Consensus 76 ~~~-~~~a~~v-~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~-e~l~ds----~~~~~~~i~~~l~~ai~~~~d~~~l~ 148 (324) ... ..++.|- ++..-...-...|+..++..+.++..+.||- |.++.+ .+++...-.+...+.+...+|..+-. T Consensus 55 ~y~~~s~~~wy~Gyd~l~~~p~d~~~~Aef~wk~aa~~~~isg~e~l~n~g~~~~idll~~~~~~ae~t~~n~l~~~l~s 134 (321) T protein:vir:34 55 SFSGNSNGGWYSGYDVLPTAPQDVISSAEYALKQYAVPVVISGLEMLQNSGKEAQLDLLEARMNVAEATMANDISAALYG 134 (321) T ss_pred eeccCcceeEEEeeeeeccchhhhccccccchhheeEeeEEehhHHhhccchHHHHHHHHHHHHHHHHHHHhhhhHhhhc Confidence 665 7788885 5555555566789999999999999999987 444443 36777777777788888888887654 Q ss_pred -ccCc-cccccccccccccc-ccc-------------------ccchhhhhHHHHHHH----HhhhhcCCCcEEEEcHHH Q lcl|Aclame:pro 149 -NQGN-NPFGKSIAQSIEKT-NKV-------------------IKGDFTQDNIIDLEA----LLEDDELEANAFISKTQN 202 (324) Q Consensus 149 -G~g~-~~~~~~~~~~~~~~-~~~-------------------~~~~~~~~~i~~~~~----~l~~~~~~~~~~v~~~~~ 202 (324) |++. +.+..|+....... ... ..+..+...+..++. +.-.....++.|+++.+. T Consensus 135 dGTa~g~~~i~GL~~lv~~~p~tGtvGGIdra~~~~WRn~~~d~~~~~t~~tl~~~m~~~w~~~~Rg~~~PDlii~~~~~ 214 (321) T protein:vir:34 135 DGTAFGGRAINGLDGAVPVDPTVGTYGGINRALWPFWRSQVEDMAAVATINTIQPAMTKLWSRCVRGADMPDLIMSGNDA 214 (321) T ss_pred cccccccchhhhhhhhcccCCCCceeccccccchhhhhhhhhhhhhcccHHHHHHHHHHHHHhhccCCCCccEEEechHH Confidence 4432 22222222111110 000 011122333333333 334445578899999999 Q ss_pred HHHHHHhhccCCceeeccC-----CcceeecceeEeecC--CCCCCceeEEeecccEEEEEecceEEEEeeccceecccc Q lcl|Aclame:pro 203 RSLLRKIVDPETKERIYDR-----NSDSLDGLPVVNLKS--SNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKN 275 (324) Q Consensus 203 ~~~l~~~~d~~g~~~~~~~-----~~~~l~G~pv~~~~~--~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~ 275 (324) +...+.......|+.-... ..-.+.|..|+..++ ..+++++.+|-|.+++.+....+-.+...+..- ....+ T Consensus 215 y~~y~~s~q~~qR~~~~~~a~~Gf~~Lky~~~div~D~~~g~~~pan~~yfiNT~yl~~r~h~~~~~~pi~p~r-~~~~N 293 (321) T protein:vir:34 215 WTTYSNSLQVLQRFTSAEEANLGFRSLKFLSTDVVLDGGIGGFAGANTMYFLNTKYLHFRPHKDRNMVPLSPSR-RAAFN 293 (321) T ss_pred HHHHHHhhheeeeecccccccccceeeeeeeEEEEEeCCCCCCccccceeeeecceEEEEEcCCCceeecCccc-ccccc Confidence 9888765444333332211 122356667776553 346788889999888766653332222221110 00111 Q ss_pred ccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEee Q lcl|Aclame:pro 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPA 313 (324) Q Consensus 276 ~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~ 313 (324) . |.+.-....+....+.++.+=.+|... T Consensus 294 q----------dA~~q~I~~~GnL~~sn~~~~~vL~~~ 321 (321) T protein:vir:34 294 Q----------DAEAQILAWAGNLTCSGAQFQGRLIAE 321 (321) T ss_pred h----------hHHhhhhhhhheeeeecccceeEEeeC Confidence 1 111112223334444555555555554 No 218 >protein:vir:79157 Length: 339 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165257;genbank:gi:145708082;genbank:GeneID:5247168 Probab=89.08 E-value=0.028 Score=29.06 Aligned_cols=292 Identities=12% Similarity=0.070 Sum_probs=155.5 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceE-EEEEeCC Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKK-FTFWADK 79 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~-ip~~~~~ 79 (324) |++... ..+.+|... .....++. ..+..+.|.|.+.+.+...+.+.|-+++.++.+++..-... +-...++ T Consensus 1 M~~~tr--~~~~~y~~~-----~A~~ngv~-~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g 72 (339) T protein:vir:79 1 MRNDTR--RLFAAYKAA-----IAKLNGVE-RVDEKFSVAPSVQQKLETKVQESSDFLKSINFYGVPEQEGEKIGLGVSG 72 (339) T ss_pred CChHHH--HHHHHHHHH-----HHHHhCcc-cccceeeecHHHHHHHHHHHHHHHHHhccCcccccccceeeEEeeccCc Confidence 765533 233333321 11111221 23345678888999999999999999999998887654333 3333334 Q ss_pred cceeee--ccCccccccccceeeEEeeheeeEEeeeehHHHhhc--ChHHHHHHHHHHHHHHHHHHHHHHHHhccCcccc Q lcl|Aclame:pro 80 PGAYWV--GEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNY--TYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPF 155 (324) Q Consensus 80 ~~a~~v--~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~d--s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~ 155 (324) +-++-+ .-+...|.+-..++...+..++.-.-..|+.+.|+. ..+++...+.+.+.+.++.-.-.-.|+|+.-... T Consensus 73 ~iagrtdt~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~ 152 (339) T protein:vir:79 73 PVASTTDTTQQDRETSDISTMDGRRYRCEQTNSDTHITYQKLDAWAKFADFQTRIRDAIIKRQALDRIMIGFNGVSRAAT 152 (339) T ss_pred ceeecccCCCCCcccccccccCCCccEEEEeeeeceecHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeecC Confidence 433332 112233333345666667777766667777777763 3478888888888888877666666777531110 Q ss_pred ------ccc------------------ccc-ccccccc-cc-cchhhhhHH----HHHHH-HhhhhcCCC--cEEEEcHH Q lcl|Aclame:pro 156 ------GKS------------------IAQ-SIEKTNK-VI-KGDFTQDNI----IDLEA-LLEDDELEA--NAFISKTQ 201 (324) Q Consensus 156 ------~~~------------------~~~-~~~~~~~-~~-~~~~~~~~i----~~~~~-~l~~~~~~~--~~~v~~~~ 201 (324) |.+ +.. +...... .. ...-+|..| .+++. .|+..+++. -+.+|..+ T Consensus 153 Td~~~nPllqDVN~GWlQ~~Re~ap~rV~~~g~~~s~~i~~~G~ggdy~NLDalV~d~~~~lId~~~~~d~dLVvivG~d 232 (339) T protein:vir:79 153 SDRVANPMLQDVNKGWLQNLREQAPQRVMKEGKAAAGKITVGGAGADYGNLDALVYDITNHLVEPWYAEDPDLVVVCGRN 232 (339) T ss_pred CChhhCcCccccchhHHHHHHhhhhhhhhccceeccceeEeccCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchh Confidence 100 000 0000000 01 111134333 35554 345666653 36777777 Q ss_pred HHH--HHHHhhccCCceee--cc---CCcceeecceeEeecCCCCCCceeEEeecccEEEEE-ecceEEEEeeccceecc Q lcl|Aclame:pro 202 NRS--LLRKIVDPETKERI--YD---RNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGI-PQLIEYKIDETAQLSTV 273 (324) Q Consensus 202 ~~~--~l~~~~d~~g~~~~--~~---~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~-~~~~~~~~~~~~~~~~~ 273 (324) ..+ ++..+. ....|-- .. ....++-|+|.+..|. .|.+.+++--++++-+-. .+..+-.+-+.. T Consensus 233 Lla~k~~~l~n-~~~~ptE~~Aa~~i~s~k~iGGl~a~~~Pf--FP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p----- 304 (339) T protein:vir:79 233 LLSDKYFPLVN-RDRDPVQQIAADLIISQKRIGNLPAIRVPY--FPANGLLVTRLDNLSIYYQEGGRRRTILDNA----- 304 (339) T ss_pred hhhhHhhhHhh-cCCChHHHHHHHHHHHhhhhCCceeEEccc--cCCCceEEeechhcEEEEecCcEEEEEEecc----- Confidence 654 222232 2223311 11 1235799999988765 556677777776653322 223222221111 Q ss_pred ccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCC Q lcl|Aclame:pro 274 KNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDS 319 (324) Q Consensus 274 ~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~ 319 (324) .+|.+.-.=....|+.|-+.++++.++...-+..+ T Consensus 305 -----------~r~rie~y~s~Ne~YvVEd~~~~a~iEni~~~~aa 339 (339) T protein:vir:79 305 -----------KRDRIENYESSNDAYVIEDLACAAMAENIALAAAA 339 (339) T ss_pred -----------ccccccchhhccceeeeeccccEEEeeeeecccCC Confidence 12333222334558888899999988876655555 No 219 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=89.07 E-value=0.028 Score=29.06 Aligned_cols=268 Identities=10% Similarity=-0.010 Sum_probs=122.7 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhc--ceeecCCCceEEEEEeC Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLG--KYEPMEGTEKKFTFWAD 78 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~--~~~~~~~~~~~ip~~~~ 78 (324) |-.+ .-+.++..+.+.+...+....+. ...-.++++++||+.+. T Consensus 1 Main----------------------------------~a~~~~~~Ld~~~~~~~~t~~l~~~~~~~~ggktVkI~~i~~ 46 (290) T protein:vir:78 1 MAIN----------------------------------YVDKYGKELDQKLVFGTYTNELETPNLLWLDAKTFKIQTITT 46 (290) T ss_pred Cchh----------------------------------HHHHHHHHHHHHHHhhheeeeccccceeeccCCEEEEeeecc Confidence 1111 11445555555555554433332 22335678899999864 Q ss_pred Cc-ceeeeccCccccccccceeeEEeeheeeEEeee-ehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccc Q lcl|Aclame:pro 79 KP-GAYWVGEGQKIETSKATWVNATMRAFKLGVILP-VTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFG 156 (324) Q Consensus 79 ~~-~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~-iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~ 156 (324) .. ..+-.+.|-..+.-+.+.+..+++..+.-...- --+.=-......+...+.+...+.+.-.+|...+.---+.... T Consensus 47 ~gl~DY~R~~g~~~g~v~~~~et~tl~qdR~~~F~vD~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~~ 126 (290) T protein:vir:78 47 TGLKAHTRNKGYNEGSASNTNKSYTIDFDRDVEFFVDVMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAKT 126 (290) T ss_pred CcccccccCCCcccCccccceeeEEeeccccceeeccccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhhc Confidence 33 233334433333444455566666555433322 1111001123556777778888888888887765311100000 Q ss_pred cccccccccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccC-----C--ceeeccCCcceeecc Q lcl|Aclame:pro 157 KSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPE-----T--KERIYDRNSDSLDGL 229 (324) Q Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~-----g--~~~~~~~~~~~l~G~ 229 (324) .+ .......+.+-.++.|.+++.++......+-.++|+|..+..|.+.+.-. + ......+..+++.|. T Consensus 127 ~~-----~~~~~t~t~~n~~~~i~~~~~~ldevp~~~rvl~vtp~~~~lL~~~~~f~r~~~~~~~~~~~i~~~V~~idG~ 201 (290) T protein:vir:78 127 NS-----NSVAEEITKDNVFTKLKAAIRKVKKYGTQNLVMYVSPDVMAALELSDDFVRAINVQNIGPSSIETRITAIDGT 201 (290) T ss_pred cC-----cccccccCHHHHHHHHHHHHHHHHhcCCCCeEEEECHHHHHHHhhChhhhccccccccccccccceeeeecCc Confidence 00 01111122344567788888888765555567899999999886432111 1 111235567889999 Q ss_pred eeEeecCCC-------CCCceeEEeecccE-EEEEecceEEEEeecccee---ccccccccchhhhhcCcEEEEEEEEec Q lcl|Aclame:pro 230 PVVNLKSSN-------LKRGELITGDFDKL-IYGIPQLIEYKIDETAQLS---TVKNEDGTPVNLFEQDMVALRATMHVA 298 (324) Q Consensus 230 pv~~~~~~~-------~~~~~~i~gd~s~~-~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~f~~~~v~~r~~~r~d 298 (324) +|+..++.. ..+|..-..+.+++ ++.......+.+.+...++ ...++.+ |--.+.-+.|.| T Consensus 202 ~ii~vps~~r~~t~~~f~~G~~~~~~ak~in~ii~~~~a~i~~~K~~~~~~~~P~~~~~~--------d~~~~~~r~y~d 273 (290) T protein:vir:78 202 RIVEVEAEDRFYDTFDFTDGYKPAAGAKKLNFLLVNKGSVVGGAKHASIYLHAPGSVGQG--------DGWLYQYRVYHD 273 (290) T ss_pred EEEEecccchhhhhhhhcccccccCCccceeEEEEcCCceeeeeeeeEEEeeCCCCCcCc--------ceeeeeeeeeee Confidence 998766421 00110000001111 1111122222222222211 1111111 112234455666 Q ss_pred cEEeccCceEEEEeecC Q lcl|Aclame:pro 299 LHIADDKAFAKLVPADK 315 (324) Q Consensus 299 ~~v~~~~A~~~l~~~~~ 315 (324) .=|.+.+.=.....++- T Consensus 274 ~~v~~nk~~~i~~~~~~ 290 (290) T protein:vir:78 274 IFVLDQQKDGVIASTEV 290 (290) T ss_pred eeeeccccCeeEEEeeC Confidence 66666443222222222 No 220 >protein:vir:104011 Length: 337 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293748;genbank:gi:72537718;genbank:GeneID:3608142 Probab=88.22 E-value=0.034 Score=28.66 Aligned_cols=292 Identities=11% Similarity=0.069 Sum_probs=157.3 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceE-EEEEeCC Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKK-FTFWADK 79 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~-ip~~~~~ 79 (324) |++... ..+.+|.. +.....++. ..+..+.|.|.+.+.+.+.+.+.|-+++.++.+++..-... +-...++ T Consensus 1 M~~~tr--~~~~~y~~-----~~A~~ngv~-~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g 72 (337) T protein:vir:10 1 MRKETR--QAYEKYAA-----QIAKLNDTG-DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSG 72 (337) T ss_pred CChHHH--HHHHHHHH-----HHHHhcChh-hhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccCc Confidence 776433 22222221 111112221 22345667788889999999999999999998887654333 3333334 Q ss_pred cceeee--ccCccccccccceeeEEeeheeeEEeeeehHHHhhc--ChHHHHHHHHHHHHHHHHHHHHHHHHhccCccc- Q lcl|Aclame:pro 80 PGAYWV--GEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNY--TYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP- 154 (324) Q Consensus 80 ~~a~~v--~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~d--s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~- 154 (324) +-++-+ +.+...|.+...++...+..++.---..|+.+.|+. ..+++...+.+.+.+.++.-.-.-.|+|+.-.. T Consensus 73 ~iagrt~t~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~ 152 (337) T protein:vir:10 73 PIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAAT 152 (337) T ss_pred ceeeeecCCCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccC Confidence 433322 223334445556777777777777777788888874 347899999999999888877777778864111 Q ss_pred -----cccc------------------ccc-ccccccc-cccchhhhhHH----HHHHHH-hhhhcCCC--cEEEEcHHH Q lcl|Aclame:pro 155 -----FGKS------------------IAQ-SIEKTNK-VIKGDFTQDNI----IDLEAL-LEDDELEA--NAFISKTQN 202 (324) Q Consensus 155 -----~~~~------------------~~~-~~~~~~~-~~~~~~~~~~i----~~~~~~-l~~~~~~~--~~~v~~~~~ 202 (324) .|.+ +.. +...... .....-+|..| .+++.. |+..+++. -+.+|..+. T Consensus 153 Td~~~nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dL 232 (337) T protein:vir:10 153 TDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) T ss_pred CChhhCcCccccchhHHHHHHhcchhhhhccccccCcceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 0111 000 0000000 00111134333 345543 45656553 367777776 Q ss_pred HHH-HHHhhccCCceeec-----cCCcceeecceeEeecCCCCCCceeEEeecccEEEEEecc-eEEEEeeccceecccc Q lcl|Aclame:pro 203 RSL-LRKIVDPETKERIY-----DRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL-IEYKIDETAQLSTVKN 275 (324) Q Consensus 203 ~~~-l~~~~d~~g~~~~~-----~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~-~~~~~~~~~~~~~~~~ 275 (324) .+. -..+-.....|--. -....++.|+|.+..|. .|.+.+++--++++-+-...| .+-.+-+.. T Consensus 233 ladk~~~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~Pf--fP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p------- 303 (337) T protein:vir:10 233 LHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPF--FPKRALMVTKLSNLSIYYQEGARRRTLKEVP------- 303 (337) T ss_pred hhHHhhHHhccCCCcHHHHHHHHHHHhhhhCCceeEEccc--cCCCceEEeechhcEEEEecCcEEEEEEEcc------- Confidence 542 11222222222110 11235799999988775 556677777777654333222 222221111 Q ss_pred ccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCC Q lcl|Aclame:pro 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTD 318 (324) Q Consensus 276 ~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~ 318 (324) ++|.+.-.=....|+.|-+.++++.+++..-+.. T Consensus 304 ---------~r~rie~y~s~Ne~YvVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:10 304 ---------ERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred ---------ccccccchhhccceeeeeccccEEEEeceeecCC Confidence 1233332233455888888999988875544433 No 221 >protein:vir:79171 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111033;genbank:gi:134288740;genbank:GeneID:4960690 Probab=87.72 E-value=0.037 Score=28.44 Aligned_cols=292 Identities=11% Similarity=0.054 Sum_probs=156.9 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceE-EEEEeCC Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKK-FTFWADK 79 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~-ip~~~~~ 79 (324) |++... ..+.+|.. +.....++. ...-.+.|.|.+.+.+.+.+.+.|-+++.++.+++..-... +-...++ T Consensus 1 M~~~tr--~~~~~y~~-----~~A~~ngv~-~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g 72 (337) T protein:vir:79 1 MRKETR--QAYEKYAA-----QIAKLNDTG-DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSG 72 (337) T ss_pred CChHHH--HHHHHHHH-----HHHHhcChh-hhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccCc Confidence 776433 22222221 111112221 22334667788889999999999999999998887654333 3333334 Q ss_pred cceeee--ccCccccccccceeeEEeeheeeEEeeeehHHHhhc--ChHHHHHHHHHHHHHHHHHHHHHHHHhccCccc- Q lcl|Aclame:pro 80 PGAYWV--GEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNY--TYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP- 154 (324) Q Consensus 80 ~~a~~v--~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~d--s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~- 154 (324) +-++-+ +.+...|.+...++...+..++.---..|+.+.|+. ..+++...+.+.+.+.++.-.-.-.|+|+.-.. T Consensus 73 ~iagrt~t~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~ 152 (337) T protein:vir:79 73 PIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDAWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAAT 152 (337) T ss_pred ceeeeecCCCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccC Confidence 433322 223334445556677777777777777788888874 347899999999999888877777778864111 Q ss_pred -----cccc------------------ccccc-cccc-ccccchhhhhHH----HHHHHH-hhhhcCCC--cEEEEcHHH Q lcl|Aclame:pro 155 -----FGKS------------------IAQSI-EKTN-KVIKGDFTQDNI----IDLEAL-LEDDELEA--NAFISKTQN 202 (324) Q Consensus 155 -----~~~~------------------~~~~~-~~~~-~~~~~~~~~~~i----~~~~~~-l~~~~~~~--~~~v~~~~~ 202 (324) .|.+ +.... .... ......-+|..| .+++.. |+..+++. -+.+|..+. T Consensus 153 Td~~~nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dL 232 (337) T protein:vir:79 153 TDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVAICGREL 232 (337) T ss_pred CChhhCcCccccchhHHHHHHhcchhhhhccccccCcceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 0111 00000 0000 000111234333 345543 45656553 367777776 Q ss_pred HHH-HHHhhccCCceeec-----cCCcceeecceeEeecCCCCCCceeEEeecccEEEEEecc-eEEEEeeccceecccc Q lcl|Aclame:pro 203 RSL-LRKIVDPETKERIY-----DRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL-IEYKIDETAQLSTVKN 275 (324) Q Consensus 203 ~~~-l~~~~d~~g~~~~~-----~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~-~~~~~~~~~~~~~~~~ 275 (324) .+. -..+-.....|--. -....++.|+|.+..|. .|.+.+++--++++-+-...| .+-.+-+.. T Consensus 233 ladk~~~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~Pf--fP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p------- 303 (337) T protein:vir:79 233 LHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPF--FPKRALMVTKLSNLSIYYQEGARRRTLKEVP------- 303 (337) T ss_pred hhHHhhHHhccCCCcHHHHHHHHHHHhhhhCCceeEEccc--cCCCceEEeechhcEEEEecCcEEEEEEEcc------- Confidence 542 11222222222110 01235799999988775 556677777777654333222 222221111 Q ss_pred ccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCC Q lcl|Aclame:pro 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTD 318 (324) Q Consensus 276 ~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~ 318 (324) ++|.+.-.=....|+.|-+.++++.+++..-+.. T Consensus 304 ---------~r~rie~y~s~Ne~YvVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:79 304 ---------ERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred ---------ccccccchhhccceeeeeccccEEEEeceeecCC Confidence 1233332233455888888899888875544433 No 222 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=85.51 E-value=0.052 Score=27.61 Aligned_cols=277 Identities=12% Similarity=-0.015 Sum_probs=117.8 Q ss_pred cccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcce----eeeccCccccccccceeeEEeehee Q lcl|Aclame:pro 32 HEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGA----YWVGEGQKIETSKATWVNATMRAFK 107 (324) Q Consensus 32 ~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a----~~v~Eg~~~~~~~~~~~~v~l~~~k 107 (324) -+.+...+.+.+.+--+..-.+.-+--.+++.+|+.....+||+......+ .-++-++.....++..+..++..+. T Consensus 1 ~~~~~~~~dp~LT~~A~gy~n~~~Ia~~l~P~vpV~~~~~~~~~f~~~e~F~~~~t~r~~~~~~~~v~~~~~~~~~~~~~ 80 (309) T protein:vir:99 1 MSNAPFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDETGSTED 80 (309) T ss_pred CCCCCcCcCHhHHHHHhhccChhhhhhhcCCccccCccccceeeechhhcccccchhhccCCCcceEeecccCceeeecc Confidence 112233333333333222222223334567888888777888886432111 1223333333333444444555555 Q ss_pred eEEeeeehHHHhhcC--hHHHHHHHHHHHHHHHHHHHHHHHH---hccCccccc-cccccccccccccccchhhhhHHHH Q lcl|Aclame:pro 108 LGVILPVTKEFLNYT--YSQFFEEMKPMIAEAFYKKFDEAGI---LNQGNNPFG-KSIAQSIEKTNKVIKGDFTQDNIID 181 (324) Q Consensus 108 ~~~~~~iS~e~l~ds--~~~~~~~i~~~l~~ai~~~~d~~~l---~G~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~i~~ 181 (324) .+-..+|.++-+.++ ..+.++.-.+.+.+.|....|..+- ....+-... ...+. ++..-...+.....+|.+ T Consensus 81 ~~L~~~i~~~~~~~a~~~~d~~~~Av~~l~~~i~l~rE~~~A~lv~~~a~y~~~~k~~Ls--gt~~wsd~~SDPi~~i~~ 158 (309) T protein:vir:99 81 HGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLS--GADQWSDPTSNPLPVITD 158 (309) T ss_pred cceeecCCchhhhhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcChhhcCCCceEEec--CccccCCCCCCcHHHHHH Confidence 555566666655543 3677777777777777666654322 111111110 01111 111111223344555555 Q ss_pred HHHHhhhhcCCCcEEEEcHHHHHHHHH-------hhccCCce-eeccCCcceeecce-eEeecCC-----CCCCcee--E Q lcl|Aclame:pro 182 LEALLEDDELEANAFISKTQNRSLLRK-------IVDPETKE-RIYDRNSDSLDGLP-VVNLKSS-----NLKRGEL--I 245 (324) Q Consensus 182 ~~~~l~~~~~~~~~~v~~~~~~~~l~~-------~~d~~g~~-~~~~~~~~~l~G~p-v~~~~~~-----~~~~~~~--i 245 (324) .+.++ ++.++..+|..+.|.+|+. ++...+.. ++....-..++|+. |++-.+. ....+.+ + T Consensus 159 ~~~~~---g~~PN~~vlg~~~~~~l~~hp~i~~~ik~~~~~~g~it~~~la~l~~ve~V~vg~a~~n~a~~g~~~~~~~i 235 (309) T protein:vir:99 159 ALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPGQNPNLIRA 235 (309) T ss_pred HHHhh---CCCcceEEechHHHHHHhhCHHHHHHhcCCCccccccCHHHHHHHhCcceEEeecceeeccccccccccccc Confidence 55554 7788999999999988863 22222221 22222223345542 3321110 0001110 1 Q ss_pred EeecccEEEEEecceEEEEeeccceec--c--ccccccch-hhh-hcCcEEEEEEEEeccEEeccCceEEEEeecCC Q lcl|Aclame:pro 246 TGDFDKLIYGIPQLIEYKIDETAQLST--V--KNEDGTPV-NLF-EQDMVALRATMHVALHIADDKAFAKLVPADKR 316 (324) Q Consensus 246 ~gd~s~~~~~~~~~~~~~~~~~~~~~~--~--~~~~~~~~-~~f-~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~ 316 (324) -|+..-+.+....+-.++ +.+..+ . ....|.+. ..+ ..+--.+|+.....-.+.-+++-..|+++.++ T Consensus 236 wg~~~~L~y~~~~~~~~~---~ps~G~t~~~~~r~~g~~~d~~~~~~g~~~vr~~~~~k~~i~~~d~G~li~~~va~ 309 (309) T protein:vir:99 236 WGPHASFIYRDRLADTRN---GTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVAA 309 (309) T ss_pred cCCcEEEEEcCCCCCCcc---cccccceeecccccCCceeeeeeccCCceEEEEeccccchhcchhcchhhhhcccC Confidence 111111111000000000 000000 0 00000000 001 11122366666666566667777777777777 No 223 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=85.46 E-value=0.053 Score=27.59 Aligned_cols=280 Identities=8% Similarity=0.019 Sum_probs=119.9 Q ss_pred cccccCccccchHHHHHHHHHHHhhhhh---h-h---hcceeecCCCceEEEEEe-C-CcceeeeccCccc-ccccccee Q lcl|Aclame:pro 30 MMHEKKDGTLMNEFTTPILQEVMENSKI---M-Q---LGKYEPMEGTEKKFTFWA-D-KPGAYWVGEGQKI-ETSKATWV 99 (324) Q Consensus 30 ~~~~~~~~~vp~~~~~~i~~~~~~~s~l---~-~---l~~~~~~~~~~~~ip~~~-~-~~~a~~v~Eg~~~-~~~~~~~~ 99 (324) ++ .-.-+.+...+.+.+...+.- + . ...+.-.++.+++||+.+ . +...+-..-|-.. ..-+.+.+ T Consensus 1 Ma-----inya~~~~~~Ld~~~~~~~lts~~l~~~~~~~~v~~~ggktVkIp~is~tsGl~DY~R~~g~~~~g~v~~~~e 75 (346) T protein:vir:10 1 MT-----INYAEKYQAAVQQAFYDGHLYSAELWNSPSNSIIKFDGAKHIKVPRLEITSGRKDRQRRTITTPVANYSNDWD 75 (346) T ss_pred Cc-----chhHHHHHHHHHHHHHhhhccchhhcccccccceEecCCCEEEEEEeeeecccccccccCCccccccccccee Confidence 00 111244555555544443221 1 1 112233567889999985 2 2222222222211 12234445 Q ss_pred eEEeeheeeEEeeeehHHHhhcC--hHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhhh Q lcl|Aclame:pro 100 NATMRAFKLGVILPVTKEFLNYT--YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQD 177 (324) Q Consensus 100 ~v~l~~~k~~~~~~iS~e~l~ds--~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 177 (324) ..++...+.-.. .|-.-=++.+ ...+...+.+...+...-.+|...|.---+.... .........+.+..-.++ T Consensus 76 t~tl~qDR~~~F-~vD~mDvDETn~~~~~anv~~ef~r~~vvPEiDayrfskLa~~a~~---~~~~~~~~~a~T~~ni~~ 151 (346) T protein:vir:10 76 SYELKNERYWST-LVDPSDIDETNMVVSLANITKQFNLDSKMPEKDRYMFSHLYSGKEA---AHDGGITTNTLDEKNILP 151 (346) T ss_pred EEEeecccccee-cccccchHHHHHHhHHHHHHHHHHHHhhcchhhHHHHHHHHHhhhh---hccccccccccCHHHHHH Confidence 555554443222 1211001111 2334444444445555556676544211000000 000011111122334567 Q ss_pred HHHHHHHHhhhhcCC--CcEEEEcHHHHHHHHHhhcc-----CCceeeccCCcceeecceeEeecCCCCC------Ccee Q lcl|Aclame:pro 178 NIIDLEALLEDDELE--ANAFISKTQNRSLLRKIVDP-----ETKERIYDRNSDSLDGLPVVNLKSSNLK------RGEL 244 (324) Q Consensus 178 ~i~~~~~~l~~~~~~--~~~~v~~~~~~~~l~~~~d~-----~g~~~~~~~~~~~l~G~pv~~~~~~~~~------~~~~ 244 (324) .|.+++.+|.+...+ +..++|+|..+..|.+...- .+......+..+++.|+||+..|+.-+. +|.. T Consensus 152 ~i~~~~~~lde~~vp~~~rvl~vTp~~~~lLk~s~~f~k~~~v~~~~~i~~~V~siDGv~Ii~VPs~r~~t~~~f~~G~~ 231 (346) T protein:vir:10 152 AFDNMMLDFDEARIPSTNRILYVTPKTNAILKRAEAMNRALTLKDPNNIQRTVYSLDDVTIRVVPSDLMQTAYDFSDGSK 231 (346) T ss_pred HHHHHHHHHHHccCCCCCeEEEECHHHHHHHhhchhheeccccccccccceeeeeecCeEEEEcchhhcccchhhccCcc Confidence 888999999877664 34679999999988643311 1111223566788999999876543221 1110 Q ss_pred EEeecccE-EEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCc---eEEEEeecCCCCCC Q lcl|Aclame:pro 245 ITGDFDKL-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKA---FAKLVPADKRTDSV 320 (324) Q Consensus 245 i~gd~s~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A---~~~l~~~~~~~~~~ 320 (324) ...+-+++ ++.......+.+........... ... ..|.-.+.-+.|.|.=|.+.+. ++-++.+.+.+..+ T Consensus 232 ~~t~ak~INfiiv~~~A~ia~~K~~~~~if~P-~~~-----~~g~~l~~~R~Y~D~fv~~nk~~~Iyv~~~~a~~~~~~~ 305 (346) T protein:vir:10 232 IIDTAKQIEMFLIYNGVQIAPEKYSFVGFDQP-SAA-----TSGNYLYYEQSYDDVLLLNTKTKGIQFVVSDKPKKDQEQ 305 (346) T ss_pred ccCCccceeEEEECCceeeeeeeeeeeEeeCC-CCC-----cccceeeeeeeeeeeeeeccccceEEEeeecccccCccC Confidence 00111111 11112222222222222211111 110 1121223445666777766443 44556666665555 Q ss_pred CCC-C Q lcl|Aclame:pro 321 PGE-V 324 (324) Q Consensus 321 ~~~-~ 324 (324) +|. + T Consensus 306 ~~~~~ 310 (346) T protein:vir:10 306 SGQDA 310 (346) T ss_pred ccccc Confidence 554 4 No 224 >protein:vir:3746 Length: 336 # NCBI annotation: orf15 # Family: family:all:201 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043487;genbank:gi:9628622;genbank:GeneID:1261135 Probab=85.22 E-value=0.054 Score=27.51 Aligned_cols=291 Identities=9% Similarity=0.004 Sum_probs=146.3 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccc-cccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceE-EEEEeC Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPD-NVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKK-FTFWAD 78 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~-~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~-ip~~~~ 78 (324) |.+. ++..++.+.|. .++- ....+...-+.|.|.+.+.+.+.+.+.|-+++.++.+++..-... +-...+ T Consensus 1 mtr~-~~~~y~~~~A~-------~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~ 72 (336) T protein:vir:37 1 MNKQ-AYYALAAALAK-------HFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKQINMIQVAHTKGQKLFGATE 72 (336) T ss_pred CcHH-HHHHHHHHHHH-------HhCCChhhhccCceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEeeeccC Confidence 7763 44444444332 1111 111222234678889999999999999999999998887653332 333333 Q ss_pred CcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHH----HHHHHHHHHHHHHHhccCcc- Q lcl|Aclame:pro 79 KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPM----IAEAFYKKFDEAGILNQGNN- 153 (324) Q Consensus 79 ~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~----l~~ai~~~~d~~~l~G~g~~- 153 (324) ++-++-..- ...|.. ..++...+..++.---..|+.+.|+. ...+..+..+. +.+.++.-.-.-.|+|+.-. T Consensus 73 g~iagrtdt-~R~~~~-~~l~~~~Y~c~qTn~dt~i~y~~LD~-WA~~~df~~~~~~~~~~r~iALD~i~IGfnG~s~A~ 149 (336) T protein:vir:37 73 KGVTGRKQT-GRNLAN-LDHTQNGFELAETDSGIIVPWALFDS-FAIFKDRLVELYSEYFQNQVALDILQIGWNGQSVAD 149 (336) T ss_pred cccccccCC-Cccccc-cCcCCcccEEEEeeeeeeecHHHHHH-HhcChhHHHHHHHHHHHHHHhhchhhhcccceeecc Confidence 333332221 122222 35666667777766677788888864 32333333333 34444443334455674211 Q ss_pred --cccccc------------------cc-c---cccccccccchhhh---hH-HHHHHHHhhhhcCCC--cEEEEcHHHH Q lcl|Aclame:pro 154 --PFGKSI------------------AQ-S---IEKTNKVIKGDFTQ---DN-IIDLEALLEDDELEA--NAFISKTQNR 203 (324) Q Consensus 154 --~~~~~~------------------~~-~---~~~~~~~~~~~~~~---~~-i~~~~~~l~~~~~~~--~~~v~~~~~~ 203 (324) ..|.+- .. . ......... .-+| |. +.+++..|+..+++. -+.+|..+.. T Consensus 150 ~TdnPllqDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~-~gdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dLl 228 (336) T protein:vir:37 150 NTTKADLSDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGD-NADYANLDDLAFDLKQGLDFRHQNRNDLVFLVGADLV 228 (336) T ss_pred CCCCCcccccchhHHHHHHhccchhhcccccccCCceEEecC-CCCcccHHHHHHHHHhcCchHHhcCCCeEEEEchhhh Confidence 111110 00 0 000000011 1123 22 345666676666653 3667777654 Q ss_pred HH-HHHhhccCC-ceee--c---cCCcceeecceeEeecCCCCCCceeEEeecccEEEEEecc-eEEEEeeccceecccc Q lcl|Aclame:pro 204 SL-LRKIVDPET-KERI--Y---DRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL-IEYKIDETAQLSTVKN 275 (324) Q Consensus 204 ~~-l~~~~d~~g-~~~~--~---~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~-~~~~~~~~~~~~~~~~ 275 (324) +. ...+-..++ .|-- . .-...++.|+|.+..|. .|.+.+++--++++-+-...| .+-.+-+.. T Consensus 229 a~~~~~l~~~~~~~PtE~~Aa~~~~~~k~iGGlpa~~~Pf--fP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p------- 299 (336) T protein:vir:37 229 SKETKLIQQKHGLTPTEKAALGSHNLMGSFGGMNAITPPN--FPARAAAVTTLKNLSVYTEAESVRRSLRNDE------- 299 (336) T ss_pred hhhhhhhhhhcCCCHHHHHHHHHHHHHHhhCCceeEEccc--cCCCceEEeechhcEEEEecCcEEEEEEEcc------- Confidence 32 112222222 2211 0 11235799999988775 556677777777654333222 222221111 Q ss_pred ccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) Q Consensus 276 ~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~~~~~ 324 (324) .+|.+.-.=....|+.|-+.++++.++.... .-|+|| T Consensus 300 ---------~r~rie~y~s~Ne~YvVEd~~~~a~iE~i~v---~~~~e~ 336 (336) T protein:vir:37 300 ---------DKKGLVTSYYRQEGYVVEDLGLMTAIDHTKV---KLNGEV 336 (336) T ss_pred ---------ccccccchhhhcceeeeeccccEEEeeeeee---eecCcC Confidence 1222222223456888899999998887665 445666 No 225 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=84.94 E-value=0.057 Score=27.42 Aligned_cols=256 Identities=13% Similarity=0.064 Sum_probs=113.7 Q ss_pred CccccchHHHHHHHHHHHhhhhhhhhcc------eeecCCCceEEEEEeC--CcceeeeccCccccccccceeeEEeehe Q lcl|Aclame:pro 35 KDGTLMNEFTTPILQEVMENSKIMQLGK------YEPMEGTEKKFTFWAD--KPGAYWVGEGQKIETSKATWVNATMRAF 106 (324) Q Consensus 35 ~~~~vp~~~~~~i~~~~~~~s~l~~l~~------~~~~~~~~~~ip~~~~--~~~a~~v~Eg~~~~~~~~~~~~v~l~~~ 106 (324) -..-+-+.+...+.+.....+....+.. +...++++++||+... +...+-.+-|-.....+.+++..+++.. T Consensus 1 Main~~~k~~~~ld~~~~~~~~~~~l~~~~n~~~~~~~gak~VkIp~ist~~gl~dY~R~~g~~~g~v~~~~et~tl~~D 80 (285) T protein:vir:79 1 MTVVLDSKDLARIDEEYKADSQVWSYLTGGNGVTQRFRGHNEVRINKLSGFVDATAYKRGQDNARKTISVGKETVKLTHE 80 (285) T ss_pred CcchhhHHHHHHHHHHHHHhhhhhhhcccCCcceeEecCCCEEEEeeecccccccccccccCccccccceeeeEEEeecc Confidence 1111235566666666666555554422 2345678899999842 3333334444333333444555555554 Q ss_pred eeEEe-eeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhhhHHHHHHHH Q lcl|Aclame:pro 107 KLGVI-LPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEAL 185 (324) Q Consensus 107 k~~~~-~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~ 185 (324) +.-.. +.--+. -+.....+...+.+...+...-.+|...|.---+. +....+.+.+.+-.++.+.+++.+ T Consensus 81 R~~~f~iD~mDv-dEn~~~~~~ni~~ef~~~~vvPEiDayrfskla~~--------a~~~~~~~~T~~nv~~~i~~~~~~ 151 (285) T protein:vir:79 81 DWFGYDLDQFDM-DENGAYTVENVVREHNKMITIPHRDKVAVQKLFDS--------AAKKATDSITKDNALDAYDTAEAY 151 (285) T ss_pred ccceecccccch-hhhhhhhHHHHHHHHHhhhhcchhhHHHHHHHHhh--------cccccccccCHHHHHHHHHHHHHH Confidence 42222 211111 11112223333333334444445665544311110 000111122233457788889999 Q ss_pred hhhhcCC-CcEEEEcHHHHHHHHHhhccC-----Cceee---ccCCcceeec-ceeEeecCCCCCCc------eeEEeec Q lcl|Aclame:pro 186 LEDDELE-ANAFISKTQNRSLLRKIVDPE-----TKERI---YDRNSDSLDG-LPVVNLKSSNLKRG------ELITGDF 249 (324) Q Consensus 186 l~~~~~~-~~~~v~~~~~~~~l~~~~d~~-----g~~~~---~~~~~~~l~G-~pv~~~~~~~~~~~------~~i~gd~ 249 (324) +.+.+.+ +-.++|+|.++..|++.+.-. ..... .....+.|.| .|++..|+.-++.. ..++... T Consensus 152 lde~~vp~~rvl~vTp~~~~~Lk~s~~~~r~~~~~~~~~~~~i~~~V~~lDg~v~ii~Vps~r~kt~~~~k~Infiiv~~ 231 (285) T protein:vir:79 152 MFDNEVPGGFVMFVSSAYYTALKQSAAVTRTFSTDGTMVINGIDRRVAQLDGGVPIVRVSSDRLKGLGITNHVNFILTPL 231 (285) T ss_pred HHHcCCCCceEEEEChHHHHHHHhhhhhheecccccceeccceeeeeccccceeEEEEcchhhccCcCcchhccEEEecC Confidence 9887764 345789999999887544211 11111 2234578898 89987766444321 1222222 Q ss_pred ccEEEEEecceEEEEeeccce---eccccccccchhhhhcCcEEEEEEEEeccEEeccCce-EEEEeecCC Q lcl|Aclame:pro 250 DKLIYGIPQLIEYKIDETAQL---STVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAF-AKLVPADKR 316 (324) Q Consensus 250 s~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~-~~l~~~~~~ 316 (324) + ..+.+.+.... ....+.++.. -.+.-+.|.|.=|.+.+.= +.+..+++- T Consensus 232 ~---------a~i~~~K~~~~~~f~P~~~~~~d~--------~~~~~R~Y~d~fv~~nk~~~Iy~~~~a~~ 285 (285) T protein:vir:79 232 S---------AIAPIVKYDSVSVIDPSTDRSGNR--------WTIKGLSYYDAIVLDNAKKGIYVAATAGV 285 (285) T ss_pred c---------eeccceeeeeeEeECCCCCCCcce--------eeeeeeeeeeeeehhhccceeeeeecccC Confidence 2 12222221111 1112222211 1233344555555553321 122222222 No 226 >protein:vir:78186 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111152;genbank:gi:134288735;genbank:GeneID:4960646 Probab=84.57 E-value=0.059 Score=27.31 Aligned_cols=291 Identities=11% Similarity=0.061 Sum_probs=154.6 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceE-EEEEeCC Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKK-FTFWADK 79 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~-ip~~~~~ 79 (324) |++... ..+.+|.. +.....++. ..+..+.|.|.+.+.+...+.+.|-+++.++.+++..-... +-...++ T Consensus 1 M~~~tr--~~~~~y~~-----~~A~~ngv~-~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g 72 (337) T protein:vir:78 1 MRKETR--QAYEKYAA-----QIAKLNDTG-DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSG 72 (337) T ss_pred CChHHH--HHHHHHHH-----HHHHhcChh-hhcceeecChHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCc Confidence 776433 22222221 111111221 22345668888999999999999999999998887654333 3333334 Q ss_pred cceeee--ccCccccccccceeeEEeeheeeEEeeeehHHHhhc--ChHHHHHHHHHHHHHHHHHHHHHHHHhccCcccc Q lcl|Aclame:pro 80 PGAYWV--GEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNY--TYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPF 155 (324) Q Consensus 80 ~~a~~v--~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~d--s~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~ 155 (324) +-++-. +-+...|.+...++...+..++.---..|+.+.|+. ..+++...+.+.+.+.++.-.-.-.|+|+.-... T Consensus 73 ~iagrtdt~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~ 152 (337) T protein:vir:78 73 PIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAAT 152 (337) T ss_pred ceeeeecCCCcccccccccccCCCccEEEEeceecccCHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeccC Confidence 433322 222333444456666677777766667788887763 3478888888888888877666667777532110 Q ss_pred ------ccc------------------ccccc-ccccc-cccchhhhhHH----HHHHHH-hhhhcCCC--cEEEEcHHH Q lcl|Aclame:pro 156 ------GKS------------------IAQSI-EKTNK-VIKGDFTQDNI----IDLEAL-LEDDELEA--NAFISKTQN 202 (324) Q Consensus 156 ------~~~------------------~~~~~-~~~~~-~~~~~~~~~~i----~~~~~~-l~~~~~~~--~~~v~~~~~ 202 (324) |.+ +.... ..... .....-+|..| .+++.. |+..+++. -+.+|..+. T Consensus 153 Td~~~nPllqDVN~GWlQ~~Re~ap~rVl~~~~~~~~~i~iG~~gdy~NLDalV~d~~~~lI~~~~~~d~dLVvivG~dL 232 (337) T protein:vir:78 153 TDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLIGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) T ss_pred CChhhCcCccccchHHHHHHHhcchhhhhccccccCCceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 100 00000 00000 00111134333 355543 46666654 367777776 Q ss_pred HHH--HHHhhccCCceeec-----cCCcceeecceeEeecCCCCCCceeEEeecccEEEEE-ecceEEEEeeccceeccc Q lcl|Aclame:pro 203 RSL--LRKIVDPETKERIY-----DRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGI-PQLIEYKIDETAQLSTVK 274 (324) Q Consensus 203 ~~~--l~~~~d~~g~~~~~-----~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~-~~~~~~~~~~~~~~~~~~ 274 (324) .+. +..+. ....|--. -....++-|+|.+..|. .|.+.+++--++++-+-. .+..+-.+-+.. T Consensus 233 ladk~~~l~n-~~~~ptE~~Aa~~i~s~k~iGGl~a~~~Pf--FP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p------ 303 (337) T protein:vir:78 233 LHDKYFPIVN-ATQAPTERLAADLIVSQKRIGNLPAVRVPF--FPKRALMVTKLSNLSIYYQEGARRRTLKEVP------ 303 (337) T ss_pred hHHHHHHHHh-cCCCcHHHHHHHHHHHhhhhcCcceEEccc--cCCCceEEeechhcEEEEecCcEEEEEEecc------ Confidence 542 22222 22233211 11235799999988765 556677777776653322 233222221111 Q ss_pred cccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCC Q lcl|Aclame:pro 275 NEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTD 318 (324) Q Consensus 275 ~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~ 318 (324) .+|.+.-.=....|+.|-+.++++.+++..-+.. T Consensus 304 ----------~r~rie~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) T protein:vir:78 304 ----------ERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred ----------ccccccchhhccceeeeeccccEEEEeceeecCC Confidence 1233322233455888888999888875544433 No 227 >protein:vir:99523 Length: 311 # NCBI annotation: putative protein # Family: family:all:701 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958538;genbank:gi:41179320;genbank:GeneID:2717161 Probab=84.31 E-value=0.061 Score=27.23 Aligned_cols=253 Identities=10% Similarity=0.019 Sum_probs=105.0 Q ss_pred ccccCccc-c--chHHHHHHHHHHHhhhhhhhhc--c--eeecCCCceEEEEEeCCc-ceeeeccCccccccccceeeEE Q lcl|Aclame:pro 31 MHEKKDGT-L--MNEFTTPILQEVMENSKIMQLG--K--YEPMEGTEKKFTFWADKP-GAYWVGEGQKIETSKATWVNAT 102 (324) Q Consensus 31 ~~~~~~~~-v--p~~~~~~i~~~~~~~s~l~~l~--~--~~~~~~~~~~ip~~~~~~-~a~~v~Eg~~~~~~~~~~~~v~ 102 (324) +.+++..+ + -+.+...+-+.+...+ +.+.. + .+-.++++++||+.+... ..+-..-|-....-+..++..+ T Consensus 1 ~~~~an~mAlnya~~~~~~Ld~~~~~~~-~t~~l~~~~~~~~~Gak~VkIp~i~~~gl~dY~R~~g~~~g~v~~~~et~t 79 (311) T protein:vir:99 1 MPTDAETRGFNYVTKDGNLLDQKITAGL-FTAALGTPEVDLVNGGRSFTLKTISTSGLKDHTRGKGFNSGTISDEKTIYT 79 (311) T ss_pred CCCcchhhHHHHHHHHHHHHHHHHHhhh-cccceecCchheeecCCEEEEEeeeeccccccccccCccccceeeeeeEEE Confidence 22221110 1 2334444333333322 22111 1 122367889999986433 2233333322222233444555 Q ss_pred eeheeeEEeeeehHHHhhcC--hHHHHHHHHHHHHHHHHHHHHHHHHhc---cCcccc----ccccccccccccccccch Q lcl|Aclame:pro 103 MRAFKLGVILPVTKEFLNYT--YSQFFEEMKPMIAEAFYKKFDEAGILN---QGNNPF----GKSIAQSIEKTNKVIKGD 173 (324) Q Consensus 103 l~~~k~~~~~~iS~e~l~ds--~~~~~~~i~~~l~~ai~~~~d~~~l~G---~g~~~~----~~~~~~~~~~~~~~~~~~ 173 (324) ++..+.-. ..|-.-=++.+ ...+...+.+...+...=.+|...|.- ...... ...............+.+ T Consensus 80 l~~DR~~~-f~vD~mDvdETn~~~~~ani~~~f~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~~~~~~~~~lt~~ 158 (311) T protein:vir:99 80 MGQDRDVE-FYLDRQDVDETDNELAMANISNVFITEHVQPELDSYRFSKIATSFDNLDGTDTEGTLLAKTHKTEETLDET 158 (311) T ss_pred eeecccee-eecchhchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhcccccccchhhhccccccccccCHH Confidence 54433222 22221001211 133344444444444455566544421 111000 001111111122222233 Q ss_pred hhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCC-------ceeeccCCcceeecceeEee-cCCCCCCceeE Q lcl|Aclame:pro 174 FTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPET-------KERIYDRNSDSLDGLPVVNL-KSSNLKRGELI 245 (324) Q Consensus 174 ~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g-------~~~~~~~~~~~l~G~pv~~~-~~~~~~~~~~i 245 (324) --++.|...+..+.+....+-.++|+|..+..|...+.-.. ...-.....+.|.|.|++-. ++.-+... T Consensus 159 nvl~~l~~~~~~~~~v~~~~rvl~vTp~~~~lLk~~~~~~r~~~~~~~~~~~i~~~V~~lDgv~Ii~V~ps~r~~t~--- 235 (311) T protein:vir:99 159 NAYSQLKTGIGKVRKYGTQNLVGYVSSEVMDALERSKEFTRNITNQNVGTTALESRITSIDGVQLIEVYESNRFMTK--- 235 (311) T ss_pred HHHHHHHHHHHHHHhcCCCCeEEEEChHHHHHHhhchhhheeeecccccccccccccceecCeEEEEecCchhhcch--- Confidence 34566777777776655556678999999888764321110 11113455678999998755 44222211 Q ss_pred EeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 246 TGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) Q Consensus 246 ~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~~~~~ 324 (324) -||. .|... .. .-..+.|.+.++.|...+..-...---+||.- T Consensus 236 -~~ft-------~G~~~------------------------~~----~ak~INfiiv~~~a~i~~~K~~~v~~f~P~~~ 278 (311) T protein:vir:99 236 -YDFT-------DGAKP------------------------TE----DAKAINFLVVAKPAVISIVKENAVFLFAPGQH 278 (311) T ss_pred -hhhc-------CCccc------------------------cC----cccccceEEeCCCeeeeeeeeeeeeeeCCCCC Confidence 0110 01000 00 00234566666666655554444433344443 No 228 >protein:vir:80491 Length: 467 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468466;genbank:gi:157325041;genbank:GeneID:5601449 Probab=83.67 E-value=0.067 Score=27.03 Aligned_cols=313 Identities=10% Similarity=0.036 Sum_probs=139.8 Q ss_pred CchhHHHHHH---HHHHHhhhhhHHhhccc---cccccccCccccchHHHHHHHHHHHhhhhh--hhhcceeecCCCceE Q lcl|Aclame:pro 1 MEQTQKLKLN---LQHFASNNVKPQVFNPD---NVMMHEKKDGTLMNEFTTPILQEVMENSKI--MQLGKYEPMEGTEKK 72 (324) Q Consensus 1 ~~~~~~~k~~---~~~~a~~~~~~~~~~~~---~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l--~~l~~~~~~~~~~~~ 72 (324) |-+++|-.-. +++..+ .+. +.+.+. +-.+-++++++--+.+..+|.........+ .+-..+.+..+-..+ T Consensus 1 ~~~~~~~~~~~~n~~~~~e-~~~-Ks~~agy~~~p~tq~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~di~k~~a~stv~~ 78 (467) T protein:vir:80 1 MPKNNKEEVKEVNLNSVQE-DAL-KSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAK 78 (467) T ss_pred CCCcchhhhhhcccccCHH-HHH-HHHHcccccCCccccCcchhhhhhhhhhhheeeccccchhhhhhcccchhhhhhhh Confidence 7666553222 222121 111 112111 111123455565566666655444433322 222233344443344 Q ss_pred EEEEeC---CcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhh-cChHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 73 FTFWAD---KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLN-YTYSQFFEEMKPMIAEAFYKKFDEAGIL 148 (324) Q Consensus 73 ip~~~~---~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~-ds~~~~~~~i~~~l~~ai~~~~d~~~l~ 148 (324) |-...+ ...+.+++|++..+.+++++...+..++=++....+|.-+-. .+..+.+....+.-...++..+|.++|+ T Consensus 79 y~~~~~~G~~g~~~f~~E~g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~i~d~~~~~~~~ai~~~a~tiE~a~Fy 158 (467) T protein:vir:80 79 YDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFF 158 (467) T ss_pred heeeeccCccccccccccccccccCCCceEEEEEEeeeeeeeeeehhhhhhhcchhhHHHHHHHHHHHHHHHHHHHHhhh Confidence 444332 345789999999999999999999999999987777764433 3467888888899999999999999999 Q ss_pred ccCccc---------cccccccccccccc-cccc-hhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHH-HhhccCCce Q lcl|Aclame:pro 149 NQGNNP---------FGKSIAQSIEKTNK-VIKG-DFTQDNIIDLEALLEDDELEANAFISKTQNRSLLR-KIVDPETKE 216 (324) Q Consensus 149 G~g~~~---------~~~~~~~~~~~~~~-~~~~-~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~-~~~d~~g~~ 216 (324) |+..-. +..|+.......+- ...| .++-++|..+-..+...|..+.-++|+..+.+.|. .....+ +- T Consensus 159 Gds~l~~s~~~~~glqfDGi~~li~~enviDa~G~~ls~~~lneaa~~i~~gfG~~td~~~p~~v~a~~~~~~L~~q-~~ 237 (467) T protein:vir:80 159 GDSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQ-TQ 237 (467) T ss_pred cccccccCCCccccccccceeEEecCCceeccCCCccCHHHHHHHhhhccccccChhhhhcchhHHhhhhhhhcCce-EE Confidence 986431 11222222111111 1112 23444555555556666777778999999887773 222211 22 Q ss_pred eeccCCcceeecceeE--eecCCCC-CCceeEEeecccEE---EEEe---cceEEEEeeccceeccccccccchhhhhcC Q lcl|Aclame:pro 217 RIYDRNSDSLDGLPVV--NLKSSNL-KRGELITGDFDKLI---YGIP---QLIEYKIDETAQLSTVKNEDGTPVNLFEQD 287 (324) Q Consensus 217 ~~~~~~~~~l~G~pv~--~~~~~~~-~~~~~i~gd~s~~~---~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~ 287 (324) +..+.......|.||- .+....+ -.+..+++|....- .+.. ....+..+....- ......+.+ . T Consensus 238 v~~~n~~~~~~G~~v~g~~sa~G~I~l~gs~il~~~~~l~~~~~~~~~Apsp~~vsaT~~~~~-~g~~~~~~~------a 310 (467) T protein:vir:80 238 LVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQPAKVTATQEAGK-KGQFRAEDL------A 310 (467) T ss_pred EEcCCCCceeeeecccceecceeeeeecCceeeccccCCCcccccccccccCCccceeeeccc-CCcccCCCc------c Confidence 2223333344455542 1110000 01112233321110 0000 0000100000000 000000000 0 Q ss_pred cEEEEEEEEeccEEeccCceE-----------EEEee-cCCCCCCCCC-C Q lcl|Aclame:pro 288 MVALRATMHVALHIADDKAFA-----------KLVPA-DKRTDSVPGE-V 324 (324) Q Consensus 288 ~v~~r~~~r~d~~v~~~~A~~-----------~l~~~-~~~~~~~~~~-~ 324 (324) ...||+...-+..--.|...+ .|+.. .+.+..+| + | T Consensus 311 ~y~Y~v~~vs~~GES~pS~~vtvTVaa~~dg~~ltIt~~~~~~~~p-~yv 359 (467) T protein:vir:80 311 AHEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRP-QFV 359 (467) T ss_pred eEEEEEEEECCCCccccccceEEEecCcccceeEEEEecCCCCCcc-eEE Confidence 011222222222111222222 22222 12222222 1 1 No 229 >protein:vir:98856 Length: 343 # NCBI annotation: hypothetical protein # Family: family:all:201 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654732;genbank:gi:109302917;genbank:GeneID:4156061 Probab=81.39 E-value=0.086 Score=26.42 Aligned_cols=297 Identities=11% Similarity=0.012 Sum_probs=141.7 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhcccccc---ccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEE-EEE Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVM---MHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKF-TFW 76 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~---~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~i-p~~ 76 (324) |++... ..+.+|.. +.....++. ...+.-+.|.|.+.+.+.+.+.+.|-+++.++.+++..-...+ ... T Consensus 1 M~~~tr--~~~~~y~~-----~~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~q~~g~v~~~~ 73 (343) T protein:vir:98 1 MNKTAQ--ELFYSLIG-----DAAEYYGANPALALAGKQFSIEAPKESVLLGAIQQRSNFLEKINCVFSERYQRAIDLRS 73 (343) T ss_pred CChHHH--HHHHHHHH-----HHHHHhCCccchhccCceeeecHHHHHHHHHHHHHHHHHhhcCceecchhhcceEEEee Confidence 766432 22233322 111122222 1223346788889999999999999999999888775322222 222 Q ss_pred eCCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhc--ChHH-HHHHHHHHHHHHHHHHHHHHHHhccCcc Q lcl|Aclame:pro 77 ADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNY--TYSQ-FFEEMKPMIAEAFYKKFDEAGILNQGNN 153 (324) Q Consensus 77 ~~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~d--s~~~-~~~~i~~~l~~ai~~~~d~~~l~G~g~~ 153 (324) .++..+.-....+.... ....+...+..++.---..|+.+.|+. ..+| +...+.+.+.+.++.-.-.-.|+|+.-. T Consensus 74 ~sg~~t~r~~t~~~~~~-~~~~~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~deF~~r~~~~i~~~~ALD~i~IGfNGts~A 152 (343) T protein:vir:98 74 NRKRHYGAHDRRTPIQQ-RWTRQVMSMNVSRQIQACLIPWAKLDQWGHLKDKFASLYAEFVQNQIALDMIKIGFYGTSVG 152 (343) T ss_pred cCccccCccccCCCccc-cccCCCCccEEEEeeeeeeccHHHHHHhhcChhHHHHHHHHHHHHHHhhccceecccceeec Confidence 22221211111111100 011111234455544555567776653 2356 8888888888877766666667775311 Q ss_pred ---ccccc------------------ccccccc---ccccccchhhhhHH----HHHHHHhhhhcCCC--cEEEEcHHHH Q lcl|Aclame:pro 154 ---PFGKS------------------IAQSIEK---TNKVIKGDFTQDNI----IDLEALLEDDELEA--NAFISKTQNR 203 (324) Q Consensus 154 ---~~~~~------------------~~~~~~~---~~~~~~~~~~~~~i----~~~~~~l~~~~~~~--~~~v~~~~~~ 203 (324) ..|.+ +...... ......+ -+|..| .++...|+..+++. -+.+|..+.. T Consensus 153 ~~T~nPllqDVN~GWLQ~~Re~ap~rVm~~~~~~~~~~~~G~g-gdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dLl 231 (343) T protein:vir:98 153 TDTSDPNLADVNKGWIQFVRENKATQILTQGATSGEIRLFGEG-ADYVNLDELAYDLKQGLDARHRDAGDLVFLVGADLV 231 (343) T ss_pred cCCCCcchhhcchHHHHHHHhcchhhhhccceeccceeEecCC-CCcccHHHHHHHHHhcCchHHhcCCCEEEEEchhhh Confidence 11111 0000000 0000111 123332 34555566666554 3667777764 Q ss_pred HHH-HHhhccCCc-eeec-----cCCcceeecceeEeecCCCCCCceeEEeecccEEEE-EecceEEEEeeccceecccc Q lcl|Aclame:pro 204 SLL-RKIVDPETK-ERIY-----DRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYG-IPQLIEYKIDETAQLSTVKN 275 (324) Q Consensus 204 ~~l-~~~~d~~g~-~~~~-----~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~-~~~~~~~~~~~~~~~~~~~~ 275 (324) +.= ..+-...++ |--. -....++-|+|.+..|. .|.+.+++--++++-+- ..+..+-.+-+.. T Consensus 232 a~~~~~l~n~~~~~ptEk~Aa~~~~~~k~iGGl~a~~~Pf--FP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p------- 302 (343) T protein:vir:98 232 AKEASLVYKGNGLIATEKAALNTHDLMKSFGGMPAMIVPN--MPPRAAIVTSLSNLSIYTQEGSMRRGMKDDD------- 302 (343) T ss_pred hhhhhhhhhhcCCChHHHHHHHHHHHHHhhCCCeeEEccc--cCCCceEEeeccccEEEEecCcEEEEEEecc------- Confidence 322 122222222 2111 11235789999988775 55667777777665332 2333322222111 Q ss_pred ccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) Q Consensus 276 ~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~~~~~ 324 (324) ++|.+.-.=....|+.|-+.++++.++.....-...-|.- T Consensus 303 ---------~r~rie~y~s~Ne~YvVEd~~~~a~iE~i~v~~~~~~g~w 342 (343) T protein:vir:98 303 ---------DKKAVRDSYYRNEAYAVEDCGKFMAVDFTKVKLSSGKGTW 342 (343) T ss_pred ---------ccccccchhhhcceeeeeccccEEEeeeeeeeecCCCCCC Confidence 1222222222345777888888887766554433322311 No 230 >protein:vir:270 Length: 341 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536650;genbank:gi:17975128;genbank:GeneID:929084 Probab=79.04 E-value=0.11 Score=25.87 Aligned_cols=299 Identities=11% Similarity=0.045 Sum_probs=139.4 Q ss_pred Cchh--HHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceE-EEEEe Q lcl|Aclame:pro 1 MEQT--QKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKK-FTFWA 77 (324) Q Consensus 1 ~~~~--~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~-ip~~~ 77 (324) |++. ++.+..+..|.. +.....++. .....+.|-|.+.+.+.+.+.+.|-+++.++.+++..-... +-.-. T Consensus 1 m~~~m~~~tr~~~~~y~~-----~~A~~ngv~-~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~ 74 (341) T protein:vir:27 1 MSQILTQSAREYMDNFAQ-----QLAKSYGVS-NVAELFNVSPQLETKLRAAITESAEFLKMITVTTVDQIEGQVVDVGV 74 (341) T ss_pred CcccccHHHHHHHHHHHH-----HHHHHcCcc-cccceEeecHHHHHHHHHHHHhhHHhhhcCccccccceeeeEeeccc Confidence 7763 222333333332 211222221 22345668788889999999999999999988887654332 22222 Q ss_pred CCcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhc-C----hHHHHHHHHHHHHHHHHHHHHHHHHhccCc Q lcl|Aclame:pro 78 DKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNY-T----YSQFFEEMKPMIAEAFYKKFDEAGILNQGN 152 (324) Q Consensus 78 ~~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~d-s----~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~ 152 (324) +++-++-+.- +..|.. +.++...+..++.---..|+.+.|+. + .+++...+.+.+.+.++.-.-.-.|+|+-- T Consensus 75 ~g~iagrtdt-~R~~r~-~~l~~~~Y~c~qtn~dt~i~y~~lDaWA~~g~~~dF~~r~~~~i~~~~ALD~i~IGfnGts~ 152 (341) T protein:vir:27 75 SGLYTGRKAG-GRFTKQ-VGVGGHKYKLAETDSCAAITWAMLCQWANQGGRDQFMKHLTEFSNQMFALDIMRIGWNGVSA 152 (341) T ss_pred ccceeeccCC-Cceecc-cccCCcceEEEEeeeeeeecHHHHHHHHhcCCChHHHHHHHHHHHHHHhhhhhhhcccceee Confidence 3333333221 222222 35666666666666666677777753 1 478888999999998888777777788641 Q ss_pred cc------cccc-------cc----ccc---ccccccc-cchhhhhHH----HHHHHH-hhhhcCCC--cEEEEcHHHHH Q lcl|Aclame:pro 153 NP------FGKS-------IA----QSI---EKTNKVI-KGDFTQDNI----IDLEAL-LEDDELEA--NAFISKTQNRS 204 (324) Q Consensus 153 ~~------~~~~-------~~----~~~---~~~~~~~-~~~~~~~~i----~~~~~~-l~~~~~~~--~~~v~~~~~~~ 204 (324) .. .|.+ +. ... -+..... ...-+|..| .+++.. |+..+++. -+.||..+..+ T Consensus 153 A~~Td~~anPllqDVNkGWlQ~~Re~a~~rVl~~~~~~~g~~gdy~nLDAlV~D~~~~lI~~~~~~d~dLVvivG~dLla 232 (341) T protein:vir:27 153 EADTDPSANPLGQDVNEGWIAFVKNRKASQVVDVDVYFDETNGDYRTLDAMASDIINNQIHPMFRNDPRLTVFVGSGLIG 232 (341) T ss_pred ccCCChhhcccccccchhHHHHHHhhcccceeccceeeccCCCccccHHHHHHHHHhcccChHHhcCCCEEEEEchhhhh Confidence 11 0111 00 000 0000011 112234333 344443 45555554 36777776654 Q ss_pred -HHHHhhccCCcee---eccCCcceeecceeEeecCCCCCCceeEEeecccEEEEEecc-eEEEEeeccceecccccccc Q lcl|Aclame:pro 205 -LLRKIVDPETKER---IYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL-IEYKIDETAQLSTVKNEDGT 279 (324) Q Consensus 205 -~l~~~~d~~g~~~---~~~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 279 (324) .-..+-.....|- -..--..++.|+|.+..|. .|.+.+++--++++-+-...| .+-.+-+.. +-. T Consensus 233 ~k~~~l~n~~~~ptE~~Aa~~i~k~iGGlpa~~~Pf--fP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p--------~r~ 302 (341) T protein:vir:27 233 AAQAKLYDKADKPSEQIAAQKLDKTIAGRPAYVPPF--LPDNAMVVTIPENLQVLTQHGTAQRKAKHES--------DRK 302 (341) T ss_pred hhhhhhhccCCCCHHHHHHHHHHHhhCCCeEEEccc--cCCCceEEeeccceEEEEecCcEEEEEEecc--------ccc Confidence 2112222111121 0111135899999988765 556677777777654433333 222211111 100 Q ss_pred chhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 280 PVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) Q Consensus 280 ~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~~~~~ 324 (324) -+.-|++ +|.++-+--|+ .-.|..++..+.+ --.--|- T Consensus 303 rie~yes---~YvVEdyg~~~---~~~~~~vkl~~~~-~~~~~~~ 340 (341) T protein:vir:27 303 RSKTHTG---AWKVTQWVCWK---RSPLTTQKKSTSA-LNHRSER 340 (341) T ss_pred cccchhh---hheeehhhhhh---hccccccccCccc-ccccccc Confidence 0001222 22222221111 1112222221111 0000011 No 231 >protein:vir:3783 Length: 336 # NCBI annotation: capsid # Family: family:all:201 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536823;genbank:gi:17981832;genbank:GeneID:929211 Probab=77.61 E-value=0.12 Score=25.57 Aligned_cols=290 Identities=9% Similarity=0.012 Sum_probs=144.6 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhcc-ccccccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceE-EEEEeC Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNP-DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKK-FTFWAD 78 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~-~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~-ip~~~~ 78 (324) |.+. ++...+.+.|. .++ .....+...-+.|.|.+.+.+.+.+.+.|-+++.++.+++..-... +-...+ T Consensus 1 mtr~-~~~~y~~~~A~-------~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~ 72 (336) T protein:vir:37 1 MNKQ-AYYALAAALAK-------HFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKGINMVQVAHTKGTKLFGATE 72 (336) T ss_pred CcHH-HHHHHHHHHHH-------HhCCChhhhcccceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEEeeccC Confidence 7773 44444444332 111 1111222335678888999999999999999999998887653332 333333 Q ss_pred CcceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHH----HHHHHHHHHHHHhccC--- Q lcl|Aclame:pro 79 KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIA----EAFYKKFDEAGILNQG--- 151 (324) Q Consensus 79 ~~~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~----~ai~~~~d~~~l~G~g--- 151 (324) ++-++-..-+... ....++...+..++.---..|+.+.|+. ...+..+..+.+. +.++.-.-.-.|+|+. T Consensus 73 g~iagrtdt~r~r--~~~~l~~~~Y~c~qTn~dt~i~y~~LD~-WA~~~d~~~~~~~~~~~r~iALD~i~IGfnG~s~A~ 149 (336) T protein:vir:37 73 KGVTGRKQTGRNL--ATLDHSQNGYELSETDSGILVNWSLFDS-FAIFKDRLVELYSEYFQNQVALDILQIGWNGQSVAT 149 (336) T ss_pred cccccccCCCCCc--cccCCCCCccEEEEeeeeeeccHHHHHH-HhcChhHHHHHHHHHHHHHHhcchhhhcccceeecc Confidence 3333322222111 1123455556666666666778887764 3233333333333 3333333344456742 Q ss_pred -cccccccc------------------cc-c---cccccccccchhhhh---H-HHHHHHHhhhhcCCC--cEEEEcHHH Q lcl|Aclame:pro 152 -NNPFGKSI------------------AQ-S---IEKTNKVIKGDFTQD---N-IIDLEALLEDDELEA--NAFISKTQN 202 (324) Q Consensus 152 -~~~~~~~~------------------~~-~---~~~~~~~~~~~~~~~---~-i~~~~~~l~~~~~~~--~~~v~~~~~ 202 (324) ++ .|.+- .. . ......... .-+|. . +.+++..|+..+++. -+.+|..+. T Consensus 150 ~Td-nPllqDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~-~gdy~NLDalV~D~~~~I~~~~~~d~dLVvivG~dL 227 (336) T protein:vir:37 150 NTT-KTDLSDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGD-NADYANLDDLAFDLKQGLDFRHQNRNDLVFLVGADL 227 (336) T ss_pred CCC-CccccccchhHHHHHHhccchhhcccccccCCceEEecC-CCCcccHHHHHHHHHhccchHHhcCCCeEEEEchhh Confidence 12 12110 00 0 000000011 11232 2 345666676666653 366777765 Q ss_pred HHH-HHHhhccCC-ceee---c--cCCcceeecceeEeecCCCCCCceeEEeecccEEEEEecc-eEEEEeeccceeccc Q lcl|Aclame:pro 203 RSL-LRKIVDPET-KERI---Y--DRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL-IEYKIDETAQLSTVK 274 (324) Q Consensus 203 ~~~-l~~~~d~~g-~~~~---~--~~~~~~l~G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~-~~~~~~~~~~~~~~~ 274 (324) .+. ...+-..++ .|-- . .-...++.|+|.+..|. .|.+.+++--++++-+-...| .+-.+-+.. T Consensus 228 la~~~~~l~~~~~~~PtE~~Aa~~~~~~k~iGGlpa~~~Pf--fP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p------ 299 (336) T protein:vir:37 228 VSKETKLIQQKHGLTPTEKAALGSHNLMGSFGGMNAITPPN--FPARAAAVTTLKNLSVYTEAESVRRSLRNDE------ 299 (336) T ss_pred hhhhhhhhhhhcCCCHHHHHHHHHHHHHHhhCCceEEEccc--cCCCceEEeeccccEEEEecCcEEEEEEEcc------ Confidence 432 112222222 2211 0 11235789999988775 556677777777654333222 222221111 Q ss_pred cccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 275 NEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) Q Consensus 275 ~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~~~~~ 324 (324) .+|.+.-.=....|+.|-+.++++.++.... .-|+|| T Consensus 300 ----------~r~rie~y~s~Ne~YvVEd~~~~a~iE~i~v---~~~~e~ 336 (336) T protein:vir:37 300 ----------DKKGLVTSYYRQEGYVVEDLGLMTAIDHTKV---KLNGEV 336 (336) T ss_pred ----------ccccccchhhhcceeeeeccccEEEeeeeee---eccccC Confidence 1222322223456888999999999887665 345666 No 232 >protein:vir:103370 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1621 # MgeName: PaP2 # Cross-refs: genbank:acc:YP_024741;genbank:gi:48697083;genbank:GeneID:2846038 Probab=77.48 E-value=0.12 Score=25.55 Aligned_cols=307 Identities=11% Similarity=0.031 Sum_probs=124.5 Q ss_pred CchhH-HHHHHHHHHHhhhhhH------Hhhccccccccc--cCcc---ccchHHHHHHHHHHHhhhhh-----hhhcce Q lcl|Aclame:pro 1 MEQTQ-KLKLNLQHFASNNVKP------QVFNPDNVMMHE--KKDG---TLMNEFTTPILQEVMENSKI-----MQLGKY 63 (324) Q Consensus 1 ~~~~~-~~k~~~~~~a~~~~~~------~~~~~~~~~~~~--~~~~---~vp~~~~~~i~~~~~~~s~l-----~~l~~~ 63 (324) =|+.+ .+-...+.-+...-+. ..+..+.+.++. .++. .++.- +. +.+...+ ..+.++ T Consensus 31 PN~~~pll~li~~g~~~ta~ast~~w~~d~~~~~~~~~ta~a~a~~T~l~ve~~---~~---f~~~~l~~~~~~~Evirv 104 (418) T protein:vir:10 31 PNGSAPLLAMTSVVGSTTAKASTHGYFSKTMVFASAVVTAEAAADATVLTVENS---DG---LTKGMIFYNEATGENMRL 104 (418) T ss_pred CCcchhhhhhhhcccccccceeEEEEEEEEEeeeeEEEEEEEecCceEEEEcCc---ce---eccccEEEEccCCeEEEE Confidence 11111 1111111111000000 000111111110 1111 12211 11 2223322 123444 Q ss_pred eecCCCceEEEEEeCCcceeeeccC-------ccccccccceeeEEeeheeeEEe-------eeehHHHhhc----ChHH Q lcl|Aclame:pro 64 EPMEGTEKKFTFWADKPGAYWVGEG-------QKIETSKATWVNATMRAFKLGVI-------LPVTKEFLNY----TYSQ 125 (324) Q Consensus 64 ~~~~~~~~~ip~~~~~~~a~~v~Eg-------~~~~~~~~~~~~v~l~~~k~~~~-------~~iS~e~l~d----s~~~ 125 (324) ..+++..++.-+..++..++-+++| ..+++..-..+.....+..+... +.||.-+... ..-| T Consensus 105 ~sVng~~lTV~Rg~~~t~aaaia~n~~~~~Ig~~~eEGsd~~ta~~~k~~~vsNvtQIF~~avsvSgTaqAs~~q~Gvsn 184 (418) T protein:vir:10 105 ELVNGLNLTVKRQTGRISAAIIAANTKLIVIGTAFEEGSQRPTARSIQPVYVPNFTQIFRNAWALTDTARASYAEAGYSN 184 (418) T ss_pred EEEeCCEEEEEEecCCeeEEEEecCceEEEeccccccccccCCcceecceeccchhhhhhhhhhhhhhhhhccccccCch Confidence 5566777777776555544333332 23333332222223333333332 2333322110 0011 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHhcc----Cccccc----ccccccc------ccccccccchhhhhHHHHHHHHhhhhc Q lcl|Aclame:pro 126 -FFEEMKPMIAEAFYKKFDEAGILNQ----GNNPFG----KSIAQSI------EKTNKVIKGDFTQDNIIDLEALLEDDE 190 (324) Q Consensus 126 -~~~~i~~~l~~ai~~~~d~~~l~G~----g~~~~~----~~~~~~~------~~~~~~~~~~~~~~~i~~~~~~l~~~~ 190 (324) ++....+++.++ ..+|+++|+|. +++..+ .++.... ........+.++++++.+++....... T Consensus 185 ~~ese~drk~~~a--v~iEkalI~G~~~~~~~~~g~~R~m~GIl~~vr~~~~gnVv~a~~~t~~s~d~l~~a~~~af~~g 262 (418) T protein:vir:10 185 ITESRRDCMDFHA--TEQETAIFFGQAFMGTYNGQPLHTTQGIVDAVRQYAPDNVNAMPNPTAVTYDDVVDATIDAFKWS 262 (418) T ss_pred HHHHHHHHHHHHH--HHHHHHHhcccccCCCcCCcchhhHHHHHHHHhhhcccceeccCCCCccCHHHHHHHHHHHhhcc Confidence 334444444443 37899999995 333222 2222111 111111123567888888877653321 Q ss_pred ---CCC-----cEEEEcHHHHHHHHHhhccCCceeeccCCcc-------------eeecceeEeecCCCCCCceeEEeec Q lcl|Aclame:pro 191 ---LEA-----NAFISKTQNRSLLRKIVDPETKERIYDRNSD-------------SLDGLPVVNLKSSNLKRGELITGDF 249 (324) Q Consensus 191 ---~~~-----~~~v~~~~~~~~l~~~~d~~g~~~~~~~~~~-------------~l~G~pv~~~~~~~~~~~~~i~gd~ 249 (324) ..+ =.++++++....+.++-- +=++.-.....+ .|.-.|+ .+...++++.+++.|. T Consensus 263 ~~~G~~~q~~~f~~~V~~~~k~~I~k~~~-~I~~~~~e~~~G~vv~~~~~~~G~I~L~~~p~--~~~~~lp~g~mlVvD~ 339 (418) T protein:vir:10 263 VNVGDNTQRVMFCDTVGMRTMQDIGRFFG-EVTVTQRETSYGMVFTEWKFFKGRLILKEHPL--FSAIGISPGFAVVVDV 339 (418) T ss_pred CCCcccccceeEEEEeChHHHHHhhhhhh-heeecccceeeeEEEEEEEcceEEEEeecccc--cccccCCCceEEEEcc Confidence 111 136778888888876631 111111110000 0111122 2345688999999998 Q ss_pred ccEEEEEe--cceEEEEeeccce---ecccc-ccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEee----cCCCCC Q lcl|Aclame:pro 250 DKLIYGIP--QLIEYKIDETAQL---STVKN-EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPA----DKRTDS 319 (324) Q Consensus 250 s~~~~~~~--~~~~~~~~~~~~~---~~~~~-~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~----~~~~~~ 319 (324) .++-+..- +.+..+...+..- ....+ .++..++ .+++++ ...+...++++.|.+++++. +..+.. T Consensus 340 ~~vkL~~L~~R~~~~E~l~k~G~~~~~~~~~~~~~~~~D-~~kG~i----v~E~tLe~~N~~a~avitgl~~~~~~~~~t 414 (418) T protein:vir:10 340 PAVKLAYMDGRNAKVENYGQGGGENKSGATDYSYGHGVD-AQGGSL----TSEWALELLNPQGCAVITGLQKAKERVYLT 414 (418) T ss_pred ccceEEEeccccccchhcccCCCcccccccccccccccc-cccceE----EEEeeeeeecccceEEeeccceecccccCC Confidence 87766554 4555554432210 00000 0011111 233333 35667888999999999753 222344 Q ss_pred CCCC Q lcl|Aclame:pro 320 VPGE 323 (324) Q Consensus 320 ~~~~ 323 (324) +|+- T Consensus 415 ~p~~ 418 (418) T protein:vir:10 415 APAP 418 (418) T ss_pred CCCC Confidence 4444 No 233 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=72.07 E-value=0.19 Score=24.57 Aligned_cols=290 Identities=9% Similarity=-0.001 Sum_probs=116.3 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccc--hHHHHHHH----HHHH-----hhh----hhhhhcc--- Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLM--NEFTTPIL----QEVM-----ENS----KIMQLGK--- 62 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp--~~~~~~i~----~~~~-----~~s----~l~~l~~--- 62 (324) |..... ..+....-..+- ..+... . ..+.... .+..+... +.+. ..+ .+..... T Consensus 171 vTa~s~---agta~~~li~A~-~~q~it--g--~tga~fa~s~~~an~astAss~Al~gEA~t~~sTd~at~~~Gtt~t~ 242 (523) T protein:vir:59 171 VPVASL---PGVADVNTVRFW-QYDDAS--G--DPENTVAYPLPRYNRIVGAVGSALYARLFFVTGSDFATVAGGTPSTQ 242 (523) T ss_pred cccccc---cccccccccccc-cccccc--c--cccccccchhhccccccccccccccccccccccccccccCCCccccc Confidence 111100 000000000000 000000 0 0000000 00000000 0000 000 0000000 Q ss_pred ----eeecCCCceEEEEEeCCcc-eeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcC-----hHHHHHHHHH Q lcl|Aclame:pro 63 ----YEPMEGTEKKFTFWADKPG-AYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT-----YSQFFEEMKP 132 (324) Q Consensus 63 ----~~~~~~~~~~ip~~~~~~~-a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds-----~~~~~~~i~~ 132 (324) ......+. .......... ...-.++...++-..++++++++.+..+-.-..|-||.+|- ..|.++.|.+ T Consensus 243 ~~~~lyt~~~g~-~t~~~~~~~~~~~~~~~~~~~~eM~FsIeK~tVtAkSRaLKAeYT~ELAQDLKAiH~GLDAE~ELan 321 (523) T protein:vir:59 243 DLDLVYYIDARN-DFEDQSTDPDYPDPGFQSLDIPEINLELRSRPVATKTRKLRAAWTPEAMQDLAAYHKGVDLENEIVT 321 (523) T ss_pred cccccccccccc-chhhccccccccccccccccccceeeEEEeEEEeeecccccccccHHHHHHHHHHhcCCChhHHHHH Confidence 00000000 0000000000 11123456778888999999999999999999999999983 4668999999 Q ss_pred HHHHHHHHHHHHHHHhccCcc--------ccccccccccccccc-cccch---hhhhHHHHHHHH-------hhhhc--C Q lcl|Aclame:pro 133 MIAEAFYKKFDEAGILNQGNN--------PFGKSIAQSIEKTNK-VIKGD---FTQDNIIDLEAL-------LEDDE--L 191 (324) Q Consensus 133 ~l~~ai~~~~d~~~l~G~g~~--------~~~~~~~~~~~~~~~-~~~~~---~~~~~i~~~~~~-------l~~~~--~ 191 (324) .|.-.|...|++.+|.=--+. ....++......... ...+. ...+.++.|+-+ +...- . T Consensus 322 ILStEImlEINR~ii~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~~~~~~~n~i~~~t~~~ 401 (523) T protein:vir:59 322 LMSQYIAREIDLEILSTIMAHARRTDNYGFWSEVVGEYYDETSGNFVAGNFYGSKQEWLATLMIELNKVSNRIQQKTAVA 401 (523) T ss_pred HHHHHHHHHhhHHHHHhHhhhheeeeeccccccceeeecccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHhcccc Confidence 999999999999998532111 011111110000000 00000 012233344333 33222 2 Q ss_pred CCcEEEEcHHHHHHHHHhhccCCceeec-cCC----cceee-cceeEeecCCCCCCceeEEeecccEEEEEecce----- Q lcl|Aclame:pro 192 EANAFISKTQNRSLLRKIVDPETKERIY-DRN----SDSLD-GLPVVNLKSSNLKRGELITGDFDKLIYGIPQLI----- 260 (324) Q Consensus 192 ~~~~~v~~~~~~~~l~~~~d~~g~~~~~-~~~----~~~l~-G~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~----- 260 (324) ..+.+|+|+++...|...---+++.... ... .+.|. |++|+..+. .+..-+++| ..+.. T Consensus 402 ~~~~~~~s~~v~~~l~~~~~~~~~~~~~~~~~~~~~~g~l~~~~~vy~d~~--~~~dy~~~g--------~k~~~~~~~~ 471 (523) T protein:vir:59 402 GANFLVTSPQVAALLESMPGFTPGNDNRDGGTGIFYVGMVQGRYRLYKNIY--QNQPVIIMG--------NQDLNTPWQT 471 (523) T ss_pred cccEEEEchhHHHHHHhccccccCCccccccccceeEEEecCceEEEecCC--CCcceEEEE--------ecccCCcccc Confidence 4567899999988886322111111111 111 13343 346655443 222233333 22211 Q ss_pred EEEEeeccceecc---ccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCC Q lcl|Aclame:pro 261 EYKIDETAQLSTV---KNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRT 317 (324) Q Consensus 261 ~~~~~~~~~~~~~---~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~ 317 (324) .+-+...+.+..+ .|+. -||- .+-...|++..|.+|-+...|-.+--.| T Consensus 472 ~~~y~Py~~l~~~~~~~dp~-----s~qp---~~~~~tRY~l~v~nP~~~~~~~~~~~~~ 523 (523) T protein:vir:59 472 GAVYAPYVPLLFTPTIVDPV-----NFSY---RRGLMTRYALEVVRPEFYGLLYVKLLQP 523 (523) T ss_pred cceecccchhhcccccccCC-----cccc---eeeeeeehhheecchhHhhhhhhhhcCC Confidence 1112221111111 1111 1321 2445579999998998877664443333 No 234 >protein:vir:2736 Length: 348 # NCBI annotation: putative structural protein # Family: family:all:1083 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695109;genbank:gi:23455878;genbank:GeneID:955608 Probab=70.37 E-value=0.21 Score=24.30 Aligned_cols=285 Identities=10% Similarity=0.045 Sum_probs=120.1 Q ss_pred ccccCccccchHHHHHHHHHHH-hhhh-hh-hhcceeecCCCceEEEEEe-CCc-ceeeeccCccccc-cccceeeEEee Q lcl|Aclame:pro 31 MHEKKDGTLMNEFTTPILQEVM-ENSK-IM-QLGKYEPMEGTEKKFTFWA-DKP-GAYWVGEGQKIET-SKATWVNATMR 104 (324) Q Consensus 31 ~~~~~~~~vp~~~~~~i~~~~~-~~s~-l~-~l~~~~~~~~~~~~ip~~~-~~~-~a~~v~Eg~~~~~-~~~~~~~v~l~ 104 (324) +.+-.. ++.+.....+++.+. .... +. .+++..++.+-.+.+.... ... .+.++..+.+.+. ....++..++. T Consensus 1 M~~i~d-~f~~~~l~~~v~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~~~~~~~ 79 (348) T protein:vir:27 1 MGLIYD-KVTASNIAGYFNALQENVSSTLGESIFPARKQLGTKLSYIKGASGQSVALKAAAFDTNVTIRDRVSAEMHDEQ 79 (348) T ss_pred Ccchhh-hcCHHHHHHHHHhccchhhhhhHhhcCCCccccceeEEEEeeccCceeEeeeecCCCCcceecccceeeeeee Confidence 111111 222323333343333 2222 32 3455444444444433322 222 3567877766654 34557777777 Q ss_pred heeeEEeeeehHHHhhc------C-hHHHHHHH-------HHHHHHHHHHHHHHHHH----hcc----Cccccc--c-cc Q lcl|Aclame:pro 105 AFKLGVILPVTKEFLNY------T-YSQFFEEM-------KPMIAEAFYKKFDEAGI----LNQ----GNNPFG--K-SI 159 (324) Q Consensus 105 ~~k~~~~~~iS~e~l~d------s-~~~~~~~i-------~~~l~~ai~~~~d~~~l----~G~----g~~~~~--~-~~ 159 (324) +-.++-...++.+-++. + ..+....+ ...+.+.+...+|..+. +|. |.+..- . +. T Consensus 80 ~p~i~~~~~i~~~d~~~~~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~~al~~Gki~i~~~~~~~~vdfg~ 159 (348) T protein:vir:27 80 MPFFKEAMLVKENDRQQLNLVKDSGNAVLVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDGVNKDIDYGV 159 (348) T ss_pred cCccccccccCHHHHHHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEEecCCeeEEEeecC Confidence 77777666666443221 1 11111111 22334455555554333 331 111000 0 00 Q ss_pred --cccc-ccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHH---hhccC----Cce-eeccCC----cc Q lcl|Aclame:pro 160 --AQSI-EKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRK---IVDPE----TKE-RIYDRN----SD 224 (324) Q Consensus 160 --~~~~-~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~---~~d~~----g~~-~~~~~~----~~ 224 (324) .+.. .+..=..++....+||.++...+.+.+..+..++|++++|..|++ +++.- +.. .+.... -+ T Consensus 160 ~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~G~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~i~~~~~~~~~~ 239 (348) T protein:vir:27 160 KPDHKKQVSKSWAEPGATPLADLEDAIETARELGLNPERAVMNAKTFGLIRKAASTVKVIKPLAGDGSAVTKAELENYIA 239 (348) T ss_pred CcccceeeeeccCCCCCCHHHHHHHHHHHHHhcCCcccEEEECHHHHHHHhcCHHHHHHhcccCccccccCHHHHHHHHH Confidence 0000 000011223456688888888887778888899999999999864 33221 111 111111 12 Q ss_pred eeecceeEeecC----------CCCCCceeEEeecccE---EEEE-ecceEEEEeeccceeccccccccchhhhhc-C-- Q lcl|Aclame:pro 225 SLDGLPVVNLKS----------SNLKRGELITGDFDKL---IYGI-PQLIEYKIDETAQLSTVKNEDGTPVNLFEQ-D-- 287 (324) Q Consensus 225 ~l~G~pv~~~~~----------~~~~~~~~i~gd~s~~---~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~-~-- 287 (324) ++.|.++++... ...+++.+++.-.... .+|. .++..................+.....|.+ | T Consensus 240 ~~~g~~i~~yd~~y~d~~G~~~~~~p~~~vvl~~~~~~G~~~yG~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dP~ 319 (348) T protein:vir:27 240 DNFGVSIVLENGTYRNDKGEVSKFYPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNAEVEIVDNGIAVTTTKTTDPV 319 (348) T ss_pred hhcCceEEEEeeEEEcCCCcCcccccCCeEEEEcCCcceeEEeccCcchhhhhhccccccceeeeCCeeEEEeeecCCCc Confidence 345555543211 1123444444322111 1110 000000000000000000001111111111 1 Q ss_pred cEEEEEEEEeccEEeccCceEEEEeecCC Q lcl|Aclame:pro 288 MVALRATMHVALHIADDKAFAKLVPADKR 316 (324) Q Consensus 288 ~v~~r~~~r~d~~v~~~~A~~~l~~~~~~ 316 (324) ...+.+..+.=-.+.+++++..++..++. T Consensus 320 ~~~~~~~s~~lPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:27 320 NVQTKVSMVALPSFERLDDVYMLTVIPAV 348 (348) T ss_pred eEEEEEeeeeeccccCCCcEEEEEEecCC Confidence 23345555555666778999999888887 No 235 >protein:vir:96442 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218814;genbank:gi:147917331;genbank:GeneID:5142645 Probab=70.11 E-value=0.21 Score=24.26 Aligned_cols=308 Identities=11% Similarity=0.038 Sum_probs=134.4 Q ss_pred CchhHHHHHHHHHHHhhhhhHH------hh----cc-------------------ccccc--cccCcc---ccchHHHHH Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQ------VF----NP-------------------DNVMM--HEKKDG---TLMNEFTTP 46 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~------~~----~~-------------------~~~~~--~~~~~~---~vp~~~~~~ 46 (324) .|.. +.+++.|+.....+- .+ .. +.+++ ...+++ .++.. + T Consensus 12 ~~~~---~~~~~~~~~~~~~~~PN~~~p~l~~i~~g~~~~~~~~t~~w~~d~l~~~~~~~ta~~~a~~T~i~V~~~---~ 85 (418) T protein:vir:96 12 LNPQ---ELNMKSFAGTILRRVPNGSAPLLAMTSVVGSTTAKASTHGYFSKTMVFASAVVTAEALADATVLTVENS---D 85 (418) T ss_pred CChh---hhchhhhhhhhhhhcCCcccchhhhhcccCccccceeEEEEEeeEeeeeeEEEEEEEecCceEEEecCC---c Confidence 3333 334555554322210 00 00 00000 001111 12211 1 Q ss_pred HHHHHHhhhhh-----hhhcceeecCCCceEEEEEeCCcceeeeccC-------ccccccccceeeEEeeheeeEEeeee Q lcl|Aclame:pro 47 ILQEVMENSKI-----MQLGKYEPMEGTEKKFTFWADKPGAYWVGEG-------QKIETSKATWVNATMRAFKLGVILPV 114 (324) Q Consensus 47 i~~~~~~~s~l-----~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg-------~~~~~~~~~~~~v~l~~~k~~~~~~i 114 (324) . +.+...+ ..+.++..+++..++.-+..++..++-+..| ..+++..-..+.....+..+..+..| T Consensus 86 ~---f~~~~l~~~~~~~EvirVtsVng~~lTV~RG~~~t~aa~iaag~~~~~ig~~~eEGsd~~ta~~~k~~~vsN~tQI 162 (418) T protein:vir:96 86 G---LTKGMIFYNEATGENMRLELVNGLNLTVKRQTGRIAAAIIAANTKLIVIGTAFEEGSQRPTARSIQPVYVPNFTQI 162 (418) T ss_pred c---cccccEEEEecCCeEEEEEEEeCCEEEEEEccCCeeeeeeecCceEEEeecCcccccccCCcceecceeccchhhe Confidence 1 3333332 2234455567777777665555444333333 24444444444445555555566666 Q ss_pred hHHHhhcChHH-----------HHHHHHHHHHHHHHHHHHHHHHhccC----cccccc--------cccccccc--cccc Q lcl|Aclame:pro 115 TKEFLNYTYSQ-----------FFEEMKPMIAEAFYKKFDEAGILNQG----NNPFGK--------SIAQSIEK--TNKV 169 (324) Q Consensus 115 S~e~l~ds~~~-----------~~~~i~~~l~~ai~~~~d~~~l~G~g----~~~~~~--------~~~~~~~~--~~~~ 169 (324) -+|...-|... +.....+.|.+. ...+|.++++|.. .+..+. ++..-... .... T Consensus 163 f~e~vsVSgTAqA~v~qaGvsn~~~~e~d~l~~~-kv~iE~ali~g~~~~~~~ng~p~~~t~R~m~gI~~f~~~Nvi~ag 241 (418) T protein:vir:96 163 FRNAWALTDTARASYAEAGYSNITESRRDCMDFH-ATEQETAIFFGQAFMGTYNGQPLHTTQGIVDAIRQYAPDNVNAMP 241 (418) T ss_pred ehhhhhhhhhhhhhhhhcCcchhHHHHHHHHHHH-HHHHHHhhhccccccCCCCCcccccccchhHHHHhhccccccccC Confidence 66655544321 111122233333 4467888888862 222121 11111111 1111 Q ss_pred ccchhhhhHHHHHHHHhhhhc---CCC-----cEEEEcHHHHHHHHHhhccCCceeeccCCcc-------eeec-ceeEe Q lcl|Aclame:pro 170 IKGDFTQDNIIDLEALLEDDE---LEA-----NAFISKTQNRSLLRKIVDPETKERIYDRNSD-------SLDG-LPVVN 233 (324) Q Consensus 170 ~~~~~~~~~i~~~~~~l~~~~---~~~-----~~~v~~~~~~~~l~~~~d~~g~~~~~~~~~~-------~l~G-~pv~~ 233 (324) ....++++.+.++....-... ..+ =.++++.+....+.++-. ..+..-.+...+ +-+| ++++. T Consensus 242 ~~~~~t~d~L~~~~~~a~~~g~n~G~~~~~~~y~~~V~a~~k~~I~k~~~-~I~~~~~en~~G~vv~~~~Td~G~v~ii~ 320 (418) T protein:vir:96 242 NPTAVTYDDVVDATIDAFKWSVNVGDNTQRVMFCDTVGMRTMQDIGRFFG-EVTVTQRETSYGMVFTEWKFFKGRLIIKE 320 (418) T ss_pred CCCcCCHHHHHHHHHHHHhhcCCCCCcccceEEEEEeChHHHHHHhhhhc-eeEeccccceeceEEEEEEeeccEEEEEe Confidence 123456777777766543311 111 126888999999887642 222221111111 1123 24444 Q ss_pred ecC---CCCCCceeEEeecccEEEEEe--cceEEEEeeccce---ecccc-ccccchhhhhcCcEEEEEEEEeccEEecc Q lcl|Aclame:pro 234 LKS---SNLKRGELITGDFDKLIYGIP--QLIEYKIDETAQL---STVKN-EDGTPVNLFEQDMVALRATMHVALHIADD 304 (324) Q Consensus 234 ~~~---~~~~~~~~i~gd~s~~~~~~~--~~~~~~~~~~~~~---~~~~~-~~~~~~~~f~~~~v~~r~~~r~d~~v~~~ 304 (324) .+. +.++.+.+++.|.+.+-+..- +.+..+...+..- ....+ .++..++ .++++ ....+.+.++++ T Consensus 321 n~~~pad~I~~g~mlVvD~~~vkL~yL~~R~~~~E~l~k~G~~~~~~~~~~~~~~~~D-~~~G~----l~~Eltle~~N~ 395 (418) T protein:vir:96 321 HPLFSAIGISPGFAVVVDVPAVKLAYMDGRNAKVENYGQGGGENKSGATDYSYGHGVD-AQGGS----LTSEWALELLNP 395 (418) T ss_pred cCCCCccccCcceEEEEecCceEEEEecCCCccchhcccCCCcccccccccccccccc-cccCE----EEEEEEEEeecc Confidence 433 345667788888877655543 4444444322210 00000 0011111 12333 335667888999 Q ss_pred CceEEEEeecCC-CCCCCCCC Q lcl|Aclame:pro 305 KAFAKLVPADKR-TDSVPGEV 324 (324) Q Consensus 305 ~A~~~l~~~~~~-~~~~~~~~ 324 (324) ++.++|++.-.. +.++|-|- T Consensus 396 ~a~a~itgl~~~~~~~~~~~~ 416 (418) T protein:vir:96 396 QGCAVITGLQKAKERVYLTAP 416 (418) T ss_pred cccEEeecccccccccccCCC Confidence 999999854332 33333333 No 236 >protein:vir:106286 Length: 534 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944113;genbank:gi:38640157;genbank:GeneID:2658034 Probab=69.45 E-value=0.22 Score=24.16 Aligned_cols=303 Identities=11% Similarity=0.098 Sum_probs=122.4 Q ss_pred CchhHHHHHH----------HHHH-------HhhhhhHHhhcccc------ccccc-cCcc-ccchHHHHHHHHHHHhhh Q lcl|Aclame:pro 1 MEQTQKLKLN----------LQHF-------ASNNVKPQVFNPDN------VMMHE-KKDG-TLMNEFTTPILQEVMENS 55 (324) Q Consensus 1 ~~~~~~~k~~----------~~~~-------a~~~~~~~~~~~~~------~~~~~-~~~~-~vp~~~~~~i~~~~~~~s 55 (324) =|+.+.++++ .+.| +..+..+.+..+++ ...++ +++. -.-|.+. .+.+++-++- T Consensus 36 enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g~~~~~ia~s~~s~~v~~~~P~Li-~lvRra~p~L 114 (534) T protein:vir:10 36 ENQDEDIAHNEGGVYTDQVVVNSMVDVKGRIEEARLAEANIGGDHGYDATKIASGETSGSITNVGPAVM-GLVRRAIPQL 114 (534) T ss_pred hhHHHHHhhhcccccchhhhhhhhhccccchhhccccccccccccccccccccccccccccccccchhh-hHHHHHHHhh Confidence 1111111111 1111 12223333322221 11111 1111 1122222 2334445555 Q ss_pred hhhhhcceeecCCCceEEE--E--Ee-CC--------------cceee-------------------------------- Q lcl|Aclame:pro 56 KIMQLGKYEPMEGTEKKFT--F--WA-DK--------------PGAYW-------------------------------- 84 (324) Q Consensus 56 ~l~~l~~~~~~~~~~~~ip--~--~~-~~--------------~~a~~-------------------------------- 84 (324) +...++.+-||++++.-|- + .. .. +.+.| T Consensus 115 Ia~DIwGVQPMTgPTGLIFAMRsrY~n~~~~~s~~EAf~ne~~adt~fSG~~~a~~~~~~~~~~a~~~g~~~~~~~~~~t 194 (534) T protein:vir:10 115 IAFDICGVQPMTSSTGQVFTLRAIYGGNSQDANAREAFHPTYGPDADFSGRGAAQDIAVFVRGTAVASGAFAKLHIEAAT 194 (534) T ss_pred hhhhhheeccCCchhhhheeeeeeecCCCCCccccccccccccccccccccccccccccccccccccccccccccccccc Confidence 6666776667665443321 1 00 00 00000 Q ss_pred --------------------------------------------------ecc---------CccccccccceeeEEeeh Q lcl|Aclame:pro 85 --------------------------------------------------VGE---------GQKIETSKATWVNATMRA 105 (324) Q Consensus 85 --------------------------------------------------v~E---------g~~~~~~~~~~~~v~l~~ 105 (324) .+| +.+.++-..++++++... T Consensus 195 ~~~~Gt~~~~~~~~~~v~~~~~~~~~ag~~~~~~~~~~~~y~~~~gm~Ta~AE~lg~~ggs~~~~f~EMsFsIdKvtVtA 274 (534) T protein:vir:10 195 GVQAGTKTVQFIKDYAVDALPADQTEAGLAYKWLLANGYAVETSSAMATAFAELQQGFNGSADNEWNEMSFRIDKQVVEA 274 (534) T ss_pred cccccccccccccccccccccCCccccccccccccccccceecccccchhhHhhhccCCCCcccchhhcceEEEEEEEee Confidence 011 012445567778888888 Q ss_pred eeeEEeeeehHHHhhcC----hHHHHHHHHHHHHHHHHHHHHHHHHhccCccc------------ccccccccccccccc Q lcl|Aclame:pro 106 FKLGVILPVTKEFLNYT----YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP------------FGKSIAQSIEKTNKV 169 (324) Q Consensus 106 ~k~~~~~~iS~e~l~ds----~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~------------~~~~~~~~~~~~~~~ 169 (324) +..+-.-..|-||.+|- ..|.++.|...|+-.|...|++.+|.=--+-. ...++...... ... T Consensus 275 KSRaLKAEYTiELAQDLKAIHGLDAEtELsNILSTEImlEINReii~~l~~~a~~~k~~~~~~~~~~~G~~d~~~~-~~~ 353 (534) T protein:vir:10 275 KSRQLKAQYSIEMAQDLRAVHGLDADSELSSILANEIMHEINREMVLWINATAKVGKTGWTNMHGGKAGVFDFQDT-KDI 353 (534) T ss_pred eccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhhheeecccccccccccceeeeecc-ccc Confidence 88888999999999984 37889999999999999999999884211100 00111110000 000 Q ss_pred ccchhhhhHHHHHHHHhhhh---------cCCCcEEEEcHHHHHHHHHhh--cc---CC-ceee-ccCCc----ceee-c Q lcl|Aclame:pro 170 IKGDFTQDNIIDLEALLEDD---------ELEANAFISKTQNRSLLRKIV--DP---ET-KERI-YDRNS----DSLD-G 228 (324) Q Consensus 170 ~~~~~~~~~i~~~~~~l~~~---------~~~~~~~v~~~~~~~~l~~~~--d~---~g-~~~~-~~~~~----~~l~-G 228 (324) ..+--..+.++.|+.++... ....+.+++|+.+...|...- +. .| .... .+... +.|. | T Consensus 354 ~~~~~~~e~~~~L~~~i~~~an~i~~~T~rg~~n~~v~S~~Va~~L~~~g~l~~~~~~~~~~~~~~d~~~~~~~G~l~~~ 433 (534) T protein:vir:10 354 RGARWAGESYKALVVQIDKEANEIARQTGRGQGNFIICSRNVAAALGHTDMLMTPAVMGANTTMNTDTTSSLFAGVLAGK 433 (534) T ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchhHHHHHhhccchhccccccccccccccCCCceEEEEecCc Confidence 01111233445555544322 124567899999998885421 10 11 1111 12222 2333 4 Q ss_pred ceeEeecCCCCCCceeEEeecccEEEEEecceEEE----Eeeccceecc--ccccccchhhhhcCcEEEEEEEEeccEEe Q lcl|Aclame:pro 229 LPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYK----IDETAQLSTV--KNEDGTPVNLFEQDMVALRATMHVALHIA 302 (324) Q Consensus 229 ~pv~~~~~~~~~~~~~i~gd~s~~~~~~~~~~~~~----~~~~~~~~~~--~~~~~~~~~~f~~~~v~~r~~~r~d~~v~ 302 (324) ++|+..+..+ ..-++ +|..+...++ +...+-+... .|+. -||- .+-...|++..+ T Consensus 434 ~~vy~D~y~~--~dy~~--------vG~KG~~~~~~glfyaPYv~l~~~~~~dp~-----sfqP---~~g~~tRY~l~~- 494 (534) T protein:vir:10 434 YRVYIDQYAV--EDYFT--------VGYKGASEMDAGLYYCPYVALTPLRGTDPK-----NFQP---VLGFKTRYGVKL- 494 (534) T ss_pred eEEEecCCCC--cceEE--------EEEeCCcccccceeeccccccccccccCCc-----cccc---eeeeeeeeceee- Confidence 5666544332 22222 3333222221 1111111111 1111 1221 122335555543 Q ss_pred ccCc-------eEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 303 DDKA-------FAKLVPADKRTDSVPGEV 324 (324) Q Consensus 303 ~~~A-------~~~l~~~~~~~~~~~~~~ 324 (324) +|=+ +.++.+....-+-..|.= T Consensus 495 NP~~~~~~~~~~~~i~~g~~~~~~~ag~n 523 (534) T protein:vir:10 495 HPMADATQNKGFAKISNGMPQHTNMFGKN 523 (534) T ss_pred cCcccccCCccccccccCCcchhhhcccc Confidence 2210 111111110000111111 No 237 >protein:vir:96490 Length: 348 # NCBI annotation: head protein # Family: family:all:1083 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238492;genbank:gi:66391768;genbank:GeneID:5176912 Probab=64.03 E-value=0.31 Score=23.40 Aligned_cols=285 Identities=11% Similarity=0.031 Sum_probs=120.6 Q ss_pred ccccCccccchHHHHHHHHHHH-hhhh-h-hhhcceeecCCCceEEEEE-eCCc-ceeeeccCccccc-cccceeeEEee Q lcl|Aclame:pro 31 MHEKKDGTLMNEFTTPILQEVM-ENSK-I-MQLGKYEPMEGTEKKFTFW-ADKP-GAYWVGEGQKIET-SKATWVNATMR 104 (324) Q Consensus 31 ~~~~~~~~vp~~~~~~i~~~~~-~~s~-l-~~l~~~~~~~~~~~~ip~~-~~~~-~a~~v~Eg~~~~~-~~~~~~~v~l~ 104 (324) +..--. ++.+.-...+++.+. .... + ..+++..++..-.+.+... .... .+.++..+.+.+. ....++..++. T Consensus 1 M~~i~d-~f~~~~l~~~i~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~~~~~~~ 79 (348) T protein:vir:96 1 MGLIYD-KVTASNIAGYFNTLQENVDSTLGESIFPARKQLGTKLSYIKGASGQSVALKAAAFDTNVTIRDRVSAEIHDEQ 79 (348) T ss_pred Ccchhh-ccCHHHHHHHHHhcccchhhhhhhhcCCCccccceeEEEEeecCCceeEeeeecCCCCcceecccceeeeeee Confidence 111111 222222333444333 2222 3 2445655554444444332 2223 3678888876664 34567777777 Q ss_pred heeeEEeeeehHHHhh------cC-hHH----HHHHHH---HHHHHHHHHHHHHH----HHhcc----Ccccc--cc-cc Q lcl|Aclame:pro 105 AFKLGVILPVTKEFLN------YT-YSQ----FFEEMK---PMIAEAFYKKFDEA----GILNQ----GNNPF--GK-SI 159 (324) Q Consensus 105 ~~k~~~~~~iS~e~l~------ds-~~~----~~~~i~---~~l~~ai~~~~d~~----~l~G~----g~~~~--~~-~~ 159 (324) +-.++-...++.+-++ .+ ... +...+. ..+.+.+...+|.. +.+|. |.+.. -. +. T Consensus 80 ~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~~~~~~~~vdfg~ 159 (348) T protein:vir:96 80 MPFFKEALLVKENDRQQLNLVKDTGNEALINTIVAGIFNDDVTLINGARARLEAMRMQVLATGKIAFTSDGVNKDIDYGV 159 (348) T ss_pred cCccccccccCHHHHHHHHhhhccCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEeecCCeeEEEeccC Confidence 7777766666543221 11 111 122221 22334455555533 22331 11100 00 00 Q ss_pred c--cc-cccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHH---hhcc----CCcee-ecc----CCcc Q lcl|Aclame:pro 160 A--QS-IEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRK---IVDP----ETKER-IYD----RNSD 224 (324) Q Consensus 160 ~--~~-~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~---~~d~----~g~~~-~~~----~~~~ 224 (324) . +. +.+..=..++....+||.++...+.+.+..+..++|+++++..|+. +++. ++... +.. .--. T Consensus 160 ~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~G~~~~~~i~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~ 239 (348) T protein:vir:96 160 KADHKKQVSKSWAEPGATPLADLEDAIETARELGLNPERAIMNAKTFGLIRKAASTVKAIKPLAGDGSSVTKAELQNYVA 239 (348) T ss_pred CcccceeeccccCCCCCCHHHHHHHHHHHHHhcCCcccEEEeCHHHHHHHhcCHHHHHHHhccCCccccccHHHHHHHHh Confidence 0 00 0000111224456688888888887778888899999999999863 3321 11111 111 1112 Q ss_pred eeecceeEeecCC----------CCCCceeEEeecccE---EEEE-ecceEEEEeeccceeccccccccchhhhh-cC-- Q lcl|Aclame:pro 225 SLDGLPVVNLKSS----------NLKRGELITGDFDKL---IYGI-PQLIEYKIDETAQLSTVKNEDGTPVNLFE-QD-- 287 (324) Q Consensus 225 ~l~G~pv~~~~~~----------~~~~~~~i~gd~s~~---~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~f~-~~-- 287 (324) ...|+++++.... ..+++.+++.-.... .+|. .+................-..+.....|. .| T Consensus 240 ~~~g~~i~~y~~~y~d~~G~~~~~~p~~~v~l~~~~~~G~~~yg~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dP~ 319 (348) T protein:vir:96 240 DNYGVEIVLENGTYRNEKGEVSKFFPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNADVEIVDSGIAVTTTKTTDPV 319 (348) T ss_pred hhcCceEEEEccEEEecCCcEeccccCCeEEEEcCCCceeEEeccChhhhhhhhcccccccceecCCeeEEEeeecCCCc Confidence 3456666543211 123334443221111 1110 00000000000000000000111111121 12 Q ss_pred cEEEEEEEEeccEEeccCceEEEEeecCC Q lcl|Aclame:pro 288 MVALRATMHVALHIADDKAFAKLVPADKR 316 (324) Q Consensus 288 ~v~~r~~~r~d~~v~~~~A~~~l~~~~~~ 316 (324) ...+.+..+.=-.+.+|+++..++..++. T Consensus 320 ~~~~~~~s~plPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:96 320 NVQTKVSMVALPSFERLGDVYMLTVIPGV 348 (348) T ss_pred eEEEEEeeeeeccccCCCcEEEEEEecCC Confidence 23355556655566779999999988887 No 238 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=61.30 E-value=0.36 Score=23.04 Aligned_cols=275 Identities=11% Similarity=0.029 Sum_probs=119.4 Q ss_pred ccccCccccchHHHHHHHHHHHhhhhhhhh---cc-eeecCCCceEEEEEeCCcc-eeeeccCccccc--cccceeeEEe Q lcl|Aclame:pro 31 MHEKKDGTLMNEFTTPILQEVMENSKIMQL---GK-YEPMEGTEKKFTFWADKPG-AYWVGEGQKIET--SKATWVNATM 103 (324) Q Consensus 31 ~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l---~~-~~~~~~~~~~ip~~~~~~~-a~~v~Eg~~~~~--~~~~~~~v~l 103 (324) +.++ .-..+.+...+-+.+...+.-.-| .. +.-.++.+++||+.....- .+-..-+..... -+..++..+| T Consensus 1 Mant--l~ya~~~~~~LD~~~~~~~~s~~l~~~~~~v~~~ggktVkIp~i~~~gl~DY~R~~g~~~~~g~v~~~~et~tl 78 (312) T protein:vir:10 1 MANT--LAYGQVLQQGLDKQATQELLTGWMDSNAKQIKYEGGKEVKIGKLSTDGLGDYSRGSANAYVGGDVKFEYETKTM 78 (312) T ss_pred CCcc--hhHHHHHHHHHHHHHHhhhccccccCCCceEEEecCcEEEEEeeecccccccccccCCccccccccccceeEEe Confidence 1111 112355555544444433321111 11 2235678899999764332 233333322222 2334445555 Q ss_pred eheeeEEeeeehHHHhhcC--hHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhhhHHHH Q lcl|Aclame:pro 104 RAFKLGVILPVTKEFLNYT--YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIID 181 (324) Q Consensus 104 ~~~k~~~~~~iS~e~l~ds--~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~ 181 (324) +..+.-.. .|-.-=++.+ ...+...+.+...+...=.+|...|.---+.....+.. +..+...+.+.+-.++.|.+ T Consensus 79 ~qDR~~~F-~vD~mDvDETn~~~s~anv~~ef~r~~vvPEiDayrfskla~~a~~~~~~-~~~~~~~~~T~~ni~~~i~~ 156 (312) T protein:vir:10 79 TQDRGRKF-TLDAMDVDETNFLVTATTVMGEFQRLKVIPEIDAYRLSRLATIAIGIKGD-TNVEYSYSVNSSTIINKIKT 156 (312) T ss_pred eeccccee-eccccchhhHhhHHHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhccccc-cccccccccCHHHHHHHHHH Confidence 55443222 2221001211 24456666666667777778877664211100000000 00011112233445677888 Q ss_pred HHHHhhhhcCC-CcEEEEcHHHHHHHHHhh-----ccCCceeeccCCcceeecceeEeecCCCCCC------c------- Q lcl|Aclame:pro 182 LEALLEDDELE-ANAFISKTQNRSLLRKIV-----DPETKERIYDRNSDSLDGLPVVNLKSSNLKR------G------- 242 (324) Q Consensus 182 ~~~~l~~~~~~-~~~~v~~~~~~~~l~~~~-----d~~g~~~~~~~~~~~l~G~pv~~~~~~~~~~------~------- 242 (324) ++.+|.+.+.+ +..++|+|..+..|.+-. ..+..........+.|.|.||+..|+.-+.. | T Consensus 157 ~~~~lde~~vp~~rvl~vTp~~~~lLk~~~~~~~~~~~~~~~~i~~~V~~iDgv~Ii~VPs~r~~t~~~f~dG~t~~~~~ 236 (312) T protein:vir:10 157 GIKIIRENGYNGPLVCHLTYDSMFAIEEKVLEKLTAVTFAQGGIQTQVPSIDGCALIKTPQNRMYSSILLNDGTTSNQTA 236 (312) T ss_pred HHHHHHHccCCCceEEEeChHHHHHHhhhhhceecccccccceeeeeeeeecccEEEEchhhhccceeeeccCccccccc Confidence 89999887665 456799999887776421 0011111234556789999998766533211 0 Q ss_pred --eeEEeecccE-EEEEecceEEEEeeccceec---cccccccchhhhhcCcEEEEEEEEeccEEeccCc--e-EEEEee Q lcl|Aclame:pro 243 --ELITGDFDKL-IYGIPQLIEYKIDETAQLST---VKNEDGTPVNLFEQDMVALRATMHVALHIADDKA--F-AKLVPA 313 (324) Q Consensus 243 --~~i~gd~s~~-~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A--~-~~l~~~ 313 (324) -.-..+.+++ ++.......+.+.+...... ..++++ |--.+.-+.|.|.=|.+.+. + +-++.+ T Consensus 237 gg~~~~~~ak~INfiiv~~~a~i~~~K~~~~~if~P~~~~~~--------d~~~~~~R~Y~D~fv~~nk~~~Iyv~~k~a 308 (312) T protein:vir:10 237 GGYLKGTKALDTNFIIAPVDVPLAITKQDKMRIFDPETNQTA--------NAWSMDYRRYHDLWVTDNKANSVYANFKDA 308 (312) T ss_pred CceeecCcccccceEEeCCceeeceeeeeeeeeeCCCCCCCc--------ceeeeeeeeeeeeeeeccccCeEEEEeecc Confidence 0000000111 11112222333332222211 111111 11123444566766666433 2 444444 Q ss_pred cCCC Q lcl|Aclame:pro 314 DKRT 317 (324) Q Consensus 314 ~~~~ 317 (324) .+++ T Consensus 309 ~~~~ 312 (312) T protein:vir:10 309 KPVG 312 (312) T ss_pred cCCC Confidence 4444 No 239 >protein:vir:107882 Length: 307 # NCBI annotation: gp34 # Family: family:all:908 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024707;genbank:gi:48696944;genbank:GeneID:2845970 Probab=58.13 E-value=0.42 Score=22.65 Aligned_cols=264 Identities=9% Similarity=-0.059 Sum_probs=114.7 Q ss_pred cccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeeccC--ccccc-cccce---e--eE Q lcl|Aclame:pro 30 MMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGEG--QKIET-SKATW---V--NA 101 (324) Q Consensus 30 ~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~Eg--~~~~~-~~~~~---~--~v 101 (324) +++.....++.+.+.+--+..-.+.-+--.+++.+|+.....+|++... ++.-+.+. +.... ....+ + .. T Consensus 1 m~~~~~~~~~dp~LT~~A~gy~n~~~ia~~l~P~vpv~~~~~k~~~f~~--eaF~~~~t~r~~~~~~~~v~~~~~~~~~~ 78 (307) T protein:vir:10 1 MGRLSKLRIVDPVLTNLAIGYTNAEFIGQSLMPVVEVEKEGGKIPKFGK--ESFRLYKTERALRARSNRMNPEDLGSIDI 78 (307) T ss_pred CCCCCCCcccChhHHHHHHhhcchhhhhhhcCCcccccccccceeeECc--ccccchhhhcccCCCcceeeccccccccc Confidence 3333444555555555444444444445566788888877788888742 22111111 11111 11111 1 22 Q ss_pred EeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHH---HhccCccc-cccccccccccccccccchhhhh Q lcl|Aclame:pro 102 TMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAG---ILNQGNNP-FGKSIAQSIEKTNKVIKGDFTQD 177 (324) Q Consensus 102 ~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~---l~G~g~~~-~~~~~~~~~~~~~~~~~~~~~~~ 177 (324) .+..|-+. .++-.+.-..+..++++.-.+.+.+.|....|..+ +....+=. .....+. ++..-..++..... T Consensus 79 ~~~~~~L~--~~id~r~~~~~~~~~~~~av~~l~d~I~l~~E~~~A~l~~~~~~y~~~~k~tLs--Gt~~Wsd~~sDPi~ 154 (307) T protein:vir:10 79 VLDEHDLE--YPIDYREDQESAFPLEQAAVQTATEAIQLRREKMVADLAQNPNSYAGGNKKQLS--ATEKFTAAGSDPVG 154 (307) T ss_pred cccccccc--ccCChhhcCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCCceEEec--cccccCCCCCCcHH Confidence 23333333 33333433445567777777777777766665432 21111100 0111111 11111223455667 Q ss_pred HHHHHHHHhh-hhcCCCcEEEEcHHHHHHHHH---hh---ccCCceeeccCCcceeecceeEeecCCC--CCCce--eEE Q lcl|Aclame:pro 178 NIIDLEALLE-DDELEANAFISKTQNRSLLRK---IV---DPETKERIYDRNSDSLDGLPVVNLKSSN--LKRGE--LIT 246 (324) Q Consensus 178 ~i~~~~~~l~-~~~~~~~~~v~~~~~~~~l~~---~~---d~~g~~~~~~~~~~~l~G~pv~~~~~~~--~~~~~--~i~ 246 (324) +|.+.+.++. ..++.++..+|..+.|.+|+. +. +..+..++....-..++|+.-+...... ..++. -+. T Consensus 155 di~~~~~ai~~~~g~~Pn~~vlg~~a~~al~~hp~i~e~lk~~~~g~it~~~la~ll~v~~i~vg~a~~~~~~~~~~~iw 234 (307) T protein:vir:10 155 VIEDGKEAIRTKIGRRPNTMVIGASAYKTLKAHPQLIEKIKYSMKGIVTVDLLKEIFEVENIAVGEAIYADDKDRFTDIW 234 (307) T ss_pred HHHHHHHHHHhhhCCccceEEeCHHHHHHHhcCHHHHHHhCCccccccCHHHHHHHhCceeEEEeeeeeeccCCccceeC Confidence 7888777775 557889999999999998863 21 1222222222222234443322211100 00100 011 Q ss_pred ee--------------------cccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCc Q lcl|Aclame:pro 247 GD--------------------FDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKA 306 (324) Q Consensus 247 gd--------------------~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A 306 (324) |+ +|.-+...+++..+. +. .....+.--+|+.....-.+.-++| T Consensus 235 ~~~~vl~yv~~~~~~~~~~~~epsfGyT~~~~g~~~~-d~---------------~~~~~~~~~~r~~~~~~~~i~~~~~ 298 (307) T protein:vir:10 235 GANIVLAYVPLQRGGQQRTPYEPSYGYTLRKKGNPVV-DT---------------RIEDGKLELVRSTDIFRPYLLGADA 298 (307) T ss_pred CCceEEEecccccCCCCCcccccccceeEEEcCCeEe-ec---------------eecCCceeEEeccccccceeecccc Confidence 11 111111111221111 00 0011122224555555555556666 Q ss_pred eEEEEeecC Q lcl|Aclame:pro 307 FAKLVPADK 315 (324) Q Consensus 307 ~~~l~~~~~ 315 (324) -..|+++.- T Consensus 299 G~li~~~~~ 307 (307) T protein:vir:10 299 GYLISGING 307 (307) T ss_pred cceeccCCC Confidence 666666443 No 240 >protein:vir:106590 Length: 349 # NCBI annotation: putative major head protein # Family: family:all:1083 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958585;genbank:gi:41179245;genbank:GeneID:2717126 Probab=49.67 E-value=0.63 Score=21.68 Aligned_cols=289 Identities=12% Similarity=0.027 Sum_probs=113.5 Q ss_pred chhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHhhhhhhh-hcceeecCCCceEEEEEe-CC Q lcl|Aclame:pro 2 EQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQ-LGKYEPMEGTEKKFTFWA-DK 79 (324) Q Consensus 2 ~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~-l~~~~~~~~~~~~ip~~~-~~ 79 (324) -+|+++...+|.|+.-...- +.+.....+++.+...+-|.. +++..++..-.+.+.... .. T Consensus 1 ~~~~~~~~~~~~~~~~~~d~-----------------~~~~~l~~~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~ 63 (349) T protein:vir:10 1 MKNQKLQLDLQRFATPILDM-----------------FSQNTVLDYTRNRQYPEMLGDTLFPAVKVPTLEVDILKAGSRV 63 (349) T ss_pred CCcchhhHHHHHHHHHhhcc-----------------cCHHHHHHHHHhcCcchhhHhhcCCccccccceeEEEeeccCc Confidence 57888999999987532111 111222233333332222322 344444433333333322 12 Q ss_pred c-ceeeeccCccccccccceeeEEeeheeeEEeeeehHHHhhc----ChHH----HHHHH---HHHHHHHHHHHHHHHH- Q lcl|Aclame:pro 80 P-GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNY----TYSQ----FFEEM---KPMIAEAFYKKFDEAG- 146 (324) Q Consensus 80 ~-~a~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~d----s~~~----~~~~i---~~~l~~ai~~~~d~~~- 146 (324) + .+.+++.+++.+..+-.....+..+-.++-...++.+-+.. ...+ +...+ ...+.+.+...+|..+ T Consensus 64 ~~~a~~v~~~~~~~~~~r~~~~~~~~~p~ik~~~~i~e~dl~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~ 143 (349) T protein:vir:10 64 PTIASVSAFDAEAEIGTREASKMTAELAYVKRKMQITEEMLIKLQSPRNTAEENYLKQYVFDDIDAMVQAVKARGEKMTM 143 (349) T ss_pred ceeeeeecCCCCcceecccceeEEeeccccccccccCHHHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 24566666555543333334444544555555555433221 1111 22222 2233344555555333 Q ss_pred ---Hhcc----Ccccccc-cc--cc--ccccccc-cccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHH---hh Q lcl|Aclame:pro 147 ---ILNQ----GNNPFGK-SI--AQ--SIEKTNK-VIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRK---IV 210 (324) Q Consensus 147 ---l~G~----g~~~~~~-~~--~~--~~~~~~~-~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~---~~ 210 (324) .+|. +.+..-. +. .+ ....... ..++....+||.+.+..+ +..+..++|++++|..|+. ++ T Consensus 144 q~l~~Gki~~~~~g~~vD~g~~~~~~~~lt~~~~Ws~~~adpi~Di~~~~~~~---g~~p~~~vm~~~~~~~l~~~~~i~ 220 (349) T protein:vir:10 144 EMFATGKITDKKNGIAIDYGVPKKHQETLSGTKTWDKSDASIIDNLQDWSDSL---DVTPTRALTSKKVLRILMRSTEIK 220 (349) T ss_pred HHHhCCeeEEcCCcEEEecccCccceeEecCcccCCCCCCCHHHHHHHHHHHh---CCCccEEEeCHHHHHHHhcCHHHH Confidence 3341 1110000 00 00 0000000 112334456666665544 6677899999999999852 22 Q ss_pred ---ccCCce-eeccCC----cceeecceeEeecC--------------CCCCCceeEEeeccc---EEEEE-ecceEEEE Q lcl|Aclame:pro 211 ---DPETKE-RIYDRN----SDSLDGLPVVNLKS--------------SNLKRGELITGDFDK---LIYGI-PQLIEYKI 264 (324) Q Consensus 211 ---d~~g~~-~~~~~~----~~~l~G~pv~~~~~--------------~~~~~~~~i~gd~s~---~~~~~-~~~~~~~~ 264 (324) +..... +..... -..+.|.++.+... ...+++.+++.-... ..+|. .+...+.. T Consensus 221 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~i~~yd~~y~d~~~~~~~t~~~~~p~~~v~l~~~~~~G~~~yG~~~e~~~~~~ 300 (349) T protein:vir:10 221 EAIFGKDTGRVVGQADLDQWMTAQGLPIIRAYDGKYRDEDSRGNLTTNSYFPEDRIVLFNDEVPGQKIYGPTPEENRLIS 300 (349) T ss_pred HHhcccccccccCHHHHHHHHHhcCCceEEEEeeEEEeecCCCceeecccccCCeEEEecCCCceeEEeeccchhhhhcc Confidence 222211 111110 01233444443211 123444444432111 11111 11001000 Q ss_pred eeccceeccccccccchhhh-hcC--cEEEEEEEEeccEEeccCceEEEEee Q lcl|Aclame:pro 265 DETAQLSTVKNEDGTPVNLF-EQD--MVALRATMHVALHIADDKAFAKLVPA 313 (324) Q Consensus 265 ~~~~~~~~~~~~~~~~~~~f-~~~--~v~~r~~~r~d~~v~~~~A~~~l~~~ 313 (324) .. . .......+.....+ +.| ...+++..+.=-.+.+|+++..++.. T Consensus 301 g~-~--~~~~~~~~~~~~~~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl 349 (349) T protein:vir:10 301 SN-A--QVSNVGNIMAKIYETSEDPIGTWILASATMLPSFASADDVFQAKVL 349 (349) T ss_pred cc-c--ceeeccceEEEeeeecCCCceEEEEEeeeeeeeecCCCcEEEEEeC Confidence 00 0 00000000111111 112 23355555555566778888888887 No 241 >protein:vir:78090 Length: 302 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468790;genbank:gi:157325371;genbank:GeneID:5601852 Probab=48.60 E-value=0.66 Score=21.56 Aligned_cols=271 Identities=13% Similarity=0.050 Sum_probs=100.5 Q ss_pred ccccCccccchHHHHHHHHHHHhhhhhhhh---ccee-ecCCCceEEEEEeC------CcceeeeccCccccccccceee Q lcl|Aclame:pro 31 MHEKKDGTLMNEFTTPILQEVMENSKIMQL---GKYE-PMEGTEKKFTFWAD------KPGAYWVGEGQKIETSKATWVN 100 (324) Q Consensus 31 ~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l---~~~~-~~~~~~~~ip~~~~------~~~a~~v~Eg~~~~~~~~~~~~ 100 (324) +.++- -..+.+...+-+.+...+.-.-| ...+ -.++++++||+.+- +-..+-..-|-....-+..++. T Consensus 1 Mantl--~ya~~~~~~Ld~~~~~~~~t~~l~~~~~~v~~~Gak~vkIp~is~~~~~TsGl~dy~R~~g~~~g~v~~~~et 78 (302) T protein:vir:78 1 MANSL--ALAQIYQDNIDKAIAVNSKSAFLEANPNNVQYNGGNTIKIADISFGSGTTGDLKAYNRSTGFTQGSVTLAWSD 78 (302) T ss_pred CCchh--HHHHHHHHHHHHHHHhhhceeecccCCceEEEecCcEEEEEEEEeeccccccccccccccCccccceeeeeee Confidence 11110 11245555555555444332222 1223 34668899999852 2122233322222222334444 Q ss_pred EEeeheeeEE-eeeehHHHhhcC--hHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccccccccccccccchhhhh Q lcl|Aclame:pro 101 ATMRAFKLGV-ILPVTKEFLNYT--YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQD 177 (324) Q Consensus 101 v~l~~~k~~~-~~~iS~e~l~ds--~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 177 (324) .+|+..+--. .+.--+ ++.+ ...+...+.+...+...=.+|...|.---+.....+ ..........+..--++ T Consensus 79 ~tlt~DR~~~f~vD~mD--vdETn~~~~~ani~~ef~r~~vvPEiDayrfskla~~a~~~~--~~~~~~~~~~t~~nvl~ 154 (302) T protein:vir:78 79 YTLDYDLAQSFQIDAMD--VDETKNLATVGNVLSEYQRTKIVPAIDKYRFTKLANDGTGVG--GVIDLSKPDASAQALMG 154 (302) T ss_pred EEeeeccceeeeccccc--hhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHHhhhccC--ccccccccchhHHHHHH Confidence 5554433222 222111 1222 233445455555555555677665531100000000 00000111112233345 Q ss_pred HHHHHHHHhhhhcCCCcEEEEcHHHHHHHHHhhccCCc-------eeeccCCcceeecceeEeecCCCCCCceeEE---- Q lcl|Aclame:pro 178 NIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETK-------ERIYDRNSDSLDGLPVVNLKSSNLKRGELIT---- 246 (324) Q Consensus 178 ~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~~~d~~g~-------~~~~~~~~~~l~G~pv~~~~~~~~~~~~~i~---- 246 (324) .+..++..+.+. .+-.++|+|.++..|...+.-+.. ..-.......+.|.|++..|+.-+...--.- T Consensus 155 ~i~~~~~~~~e~--~~~vl~vtp~~~~~Lk~a~~~~~~~~~~~~~~~~i~~~V~~lDgv~Ii~VPs~r~~t~~~f~~G~~ 232 (302) T protein:vir:78 155 DIATAMELVDDS--NQLILVTSPTTLAGLLNTALIRESKNTQVLRRGEVDTKITFIQDVEVLQVPSEYLYDKVAPKVGVP 232 (302) T ss_pred HHHHHHHHhhcc--CCeEEEEChHHHHHHhcchhhccceeccccccccccceeeeecccEEEEchhhhcccceeccCCcc Confidence 666777777664 356789999999988653322211 1112445678999999876653332110000 Q ss_pred --eecccE-EEEEecceEEEEeeccceec---cccccccchhhhhcCcEEEEEE-EEeccEEeccCceEEEEeecCCCCC Q lcl|Aclame:pro 247 --GDFDKL-IYGIPQLIEYKIDETAQLST---VKNEDGTPVNLFEQDMVALRAT-MHVALHIADDKAFAKLVPADKRTDS 319 (324) Q Consensus 247 --gd~s~~-~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~f~~~~v~~r~~-~r~d~~v~~~~A~~~l~~~~~~~~~ 319 (324) .+-+++ ++.......+.+.+...... ..+.++. ++... .++.-..+-.+...-|-.....+-+ T Consensus 233 ~~~~ak~INfiiv~~~a~ia~~K~~~~~if~P~~~~~gd----------~~l~~~R~Y~D~fV~~nk~~gI~~~~~~~~~ 302 (302) T protein:vir:78 233 DYTGAKKIPYMIFKRDAPTGIVKTDKVRVFEPDTNQSAD----------AYKVDLRLYHDLIVPKNQRPGIIKASFGTIA 302 (302) T ss_pred ccCCccceeEEEECCCeeeeeeeeeeeEeeCCCCCCCcc----------eeeeeeeeEeeeeeeccccCeEEEeeccccC Confidence 000000 01111111111111111110 0000000 01111 1222222222222222222222222 No 242 >protein:vir:79078 Length: 307 # NCBI annotation: gp8 # Family: family:all:908 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111208;genbank:gi:134288798;genbank:GeneID:4960752 Probab=45.87 E-value=0.75 Score=21.25 Aligned_cols=270 Identities=9% Similarity=-0.103 Sum_probs=109.4 Q ss_pred cccccCccccchHHHHHHHHHHHhhhhhhhhcceeecCCCceEEEEEeCCcceeeecc---Ccccccccc-ceeeEEeeh Q lcl|Aclame:pro 30 MMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKPGAYWVGE---GQKIETSKA-TWVNATMRA 105 (324) Q Consensus 30 ~~~~~~~~~vp~~~~~~i~~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~E---g~~~~~~~~-~~~~v~l~~ 105 (324) +++-....++.+.+.+-.+..-.+.-+--.+++.+|+.....+|++.....-..|-.+ ++....... .++..++.. T Consensus 1 m~~~~~~~~~dp~LT~~A~gy~n~~~Iad~lfP~vpV~~~~~k~~~f~~e~f~~~~t~ra~~~~~~~v~~~~~~~~~~~~ 80 (307) T protein:vir:79 1 MGRLSKLRIVDPVLTNLAIGYTNAEFIGQTLMPVVEVEKEGGKIPKFGKESFRLYQTERALRAKSNRMNPEDIDSVDVNL 80 (307) T ss_pred CCCCCCCcccCHHHHHHHhhccchhhhhhhcCCcccccccccceeeeccccccccccccccCCCcceeeeeccccccccc Confidence 3333344455444444333332222223455677888777777777632110011111 111111111 122223323 Q ss_pred eeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHhc--cCccccccccccccccccccccchhhhhHHHHHH Q lcl|Aclame:pro 106 FKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGILN--QGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLE 183 (324) Q Consensus 106 ~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~G--~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~ 183 (324) ...+-..+|-...-..+..++++.-.+.+.+.|....|..+-.- +....+...-....++..-..++.....+|.+.+ T Consensus 81 ~~~~l~~~id~r~~~~~~~~~~~~Av~~l~d~I~l~~E~~~A~l~~~~~~y~~~~k~tLsgt~~Wsd~~sDPi~di~~~~ 160 (307) T protein:vir:79 81 DEHDLEYPIDYREDQESAFPLEQAAVQTATDAIQLRREKMIADLSQNPSSYAAGNKKQLSATEKFTAANSDPVGVIEDGK 160 (307) T ss_pred cccchhhcccchhcCCCCCCHHHHHHHHHHHHHHhHHHHHHHHHhccccccCCCceEEEccCcccCCCCCCcHHHHHHHH Confidence 33232333433333334556666666666666666555432210 1111111110111111111223455677788888 Q ss_pred HHhh-hhcCCCcEEEEcHHHHHHHHH---h-h--ccCCceeeccCCcceeecceeE-eecCC-CCCCce--eEEeec--- Q lcl|Aclame:pro 184 ALLE-DDELEANAFISKTQNRSLLRK---I-V--DPETKERIYDRNSDSLDGLPVV-NLKSS-NLKRGE--LITGDF--- 249 (324) Q Consensus 184 ~~l~-~~~~~~~~~v~~~~~~~~l~~---~-~--d~~g~~~~~~~~~~~l~G~pv~-~~~~~-~~~~~~--~i~gd~--- 249 (324) .++. ..++.++.++|..+.|.+|+. + + ...+..++....-..++|+.-+ +-.+. ....+. -+.|+. T Consensus 161 ~ai~~~~g~~Pn~~vlg~~a~~~l~~h~~i~~~lk~~~~g~it~~~la~l~~v~~V~vg~a~y~~~~~~~~~iw~~~~~l 240 (307) T protein:vir:79 161 EAIRTKIGRRPNTMVIGASAYKTLKAHPQLIEKIKYSMKGIVTVDLLKEIFEVENIAVGEAIYADDKDRFTDIWGANIVL 240 (307) T ss_pred HHHHHhhCCccceEEeCHHHHHHHhcCHHHHHHhcCccccccCHHHHHHHhCceeEEEeeeeeecccccchhcCCCceEE Confidence 7775 557889999999999998863 1 1 1122222222222335555422 21110 000110 111111 Q ss_pred -----------------ccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEe Q lcl|Aclame:pro 250 -----------------DKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVP 312 (324) Q Consensus 250 -----------------s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~ 312 (324) |.-+...++|..+. +. .......--+|+.....-.+.-++|-..|++ T Consensus 241 ~y~~~~~~~~~~~~~~ps~Gyt~~~~g~~~~-d~---------------~~~~~~~~~vrv~~~~~~~i~~~~~G~li~~ 304 (307) T protein:vir:79 241 AYVPLQRGGQQRTPYEPSYGYTLRKKGNPVV-DT---------------RIEDGKLELVRATDIFRPYLLGADAGYLISG 304 (307) T ss_pred EecccccCCCCCcccccccceeEEecCceEE-ec---------------ccCCCceeEEeecccccceeeccccchhhcc Confidence 11111111111100 00 0011111224555555555555666556665 Q ss_pred ecC Q lcl|Aclame:pro 313 ADK 315 (324) Q Consensus 313 ~~~ 315 (324) +-- T Consensus 305 ~v~ 307 (307) T protein:vir:79 305 ING 307 (307) T ss_pred CCC Confidence 433 No 243 >protein:vir:4902 Length: 348 # NCBI annotation: gp348 # Family: family:all:1083 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056680;genbank:gi:9635015;genbank:GeneID:1262657 Probab=45.47 E-value=0.77 Score=21.21 Aligned_cols=285 Identities=10% Similarity=0.053 Sum_probs=116.8 Q ss_pred ccccCccccchHHHHHHHHHHH-hhhh-h-hhhcceeecCCCceEEEE-EeCCc-ceeeeccCccccc-cccceeeEEee Q lcl|Aclame:pro 31 MHEKKDGTLMNEFTTPILQEVM-ENSK-I-MQLGKYEPMEGTEKKFTF-WADKP-GAYWVGEGQKIET-SKATWVNATMR 104 (324) Q Consensus 31 ~~~~~~~~vp~~~~~~i~~~~~-~~s~-l-~~l~~~~~~~~~~~~ip~-~~~~~-~a~~v~Eg~~~~~-~~~~~~~v~l~ 104 (324) +.+-.. ++.+.....+++.+. .... + ..+++..++....+.... ..+.. .+.++..+.+.+. ....++..++. T Consensus 1 M~~l~d-~f~~~~l~~~v~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~~~~~~~ 79 (348) T protein:vir:49 1 MGLIYD-KVTASNIAGYFNALQENVDSTLGESIFPARKQLGTKLSYITGASGQSVALKAAAFDTNVTVRDRVSAEMHDEQ 79 (348) T ss_pred Ccchhh-hcCHHHHHHHHHhccccchhhhHhhcCCCccccCceeEEEEeecCceeeeeeecCCCCcceecccceeeeeee Confidence 111111 122222223333322 2222 2 233454444333333333 22223 4667877766554 34567777777 Q ss_pred heeeEEeeeehHHHhh------cC-hHHHHHHHH-------HHHHHHHHHHHHHHHH----hcc----Cccccc--c-cc Q lcl|Aclame:pro 105 AFKLGVILPVTKEFLN------YT-YSQFFEEMK-------PMIAEAFYKKFDEAGI----LNQ----GNNPFG--K-SI 159 (324) Q Consensus 105 ~~k~~~~~~iS~e~l~------ds-~~~~~~~i~-------~~l~~ai~~~~d~~~l----~G~----g~~~~~--~-~~ 159 (324) +-.++-...++.+-++ ++ ..+....+. ..+.+.+...+|..+. +|. |.+..- . +. T Consensus 80 ~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~i~~~g~~~~vdyg~ 159 (348) T protein:vir:49 80 MPFFKEAMLVKENDRQQLNLVKDSGNAALVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDGVNKDIDYGV 159 (348) T ss_pred cCccccccccCHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecCCceEEEeecC Confidence 7777766666643321 11 111122222 2233445555554333 331 111100 0 00 Q ss_pred c--cc-cccccccccchhhhhHHHHHHHHhhhhcCCCcEEEEcHHHHHHHHH---hhc---c-CCcee-eccC----Ccc Q lcl|Aclame:pro 160 A--QS-IEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRK---IVD---P-ETKER-IYDR----NSD 224 (324) Q Consensus 160 ~--~~-~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~---~~d---~-~g~~~-~~~~----~~~ 224 (324) . +. +.+..=..++.....||.+++..+.+.+..+..++|+++++..|+. +++ . ++... +... .-. T Consensus 160 ~~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~G~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~i~~~~~~~~~~ 239 (348) T protein:vir:49 160 KPDHKKQVSKSWAEPGATPLADLEDAIETARELGLNPERAVMNAKTFGLIRKAASTVKVIKPLAGDGSSVTKAELDNYIA 239 (348) T ss_pred CcccceeeeeccCCCCCCHHHHHHHHHHHHHhcCCcccEEEeCHHHHHHHhcCHHHHHHhhccCcccccccHHHHHHHHH Confidence 0 00 0000111234456678888888888778888999999999998853 221 1 11111 1111 112 Q ss_pred eeecceeEeecCC----------CCCCceeEEeeccc---EEEEE-ecceEEEEeeccceeccccccccchhhhhc-C-- Q lcl|Aclame:pro 225 SLDGLPVVNLKSS----------NLKRGELITGDFDK---LIYGI-PQLIEYKIDETAQLSTVKNEDGTPVNLFEQ-D-- 287 (324) Q Consensus 225 ~l~G~pv~~~~~~----------~~~~~~~i~gd~s~---~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~-~-- 287 (324) .+.|.++++.... ..+++.++++-... ..+|. .+...................+.....|.+ | T Consensus 240 ~~~g~~i~~y~~~y~d~dG~~~~~~p~~~v~l~~~~~~G~~~yg~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dP~ 319 (348) T protein:vir:49 240 DNFGVTVVLENGTYRNEKGEVSKFFPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNADVEIVDNGIAVTTTKTTDPV 319 (348) T ss_pred hhcCceEEEEeeEEEecCCcEeeeecCCeEEEecCCCcceeEEecChhhhhhccccccccceeecCCeEEEeeeecCCCc Confidence 3445555432211 12333444332111 11111 000000000000000000011111111222 1 Q ss_pred cEEEEEEEEeccEEeccCceEEEEeecCC Q lcl|Aclame:pro 288 MVALRATMHVALHIADDKAFAKLVPADKR 316 (324) Q Consensus 288 ~v~~r~~~r~d~~v~~~~A~~~l~~~~~~ 316 (324) ...+.+....=-.+.+|+++..++..++. T Consensus 320 ~~~~~~~s~~lPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:49 320 NVQTKVSMVALPSFERLDDVYMLTVIPAV 348 (348) T ss_pred eEEEEEeeeccccccCCCcEEEEEEecCC Confidence 23344555544556779999999988887 No 244 >protein:vir:98480 Length: 348 # NCBI annotation: ORFp38 # Family: family:all:1083 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958280;genbank:gi:41057254;uniprot:Q38595;genbank:GeneID:2732864 Probab=37.12 E-value=1.1 Score=20.28 Aligned_cols=285 Identities=9% Similarity=-0.026 Sum_probs=114.9 Q ss_pred ccccccCccccchHHHHHHHHHHHh----hhhh-hhhcceeecCCCceEEEEEe-CCc-ceeeeccCccccccc-cceee Q lcl|Aclame:pro 29 VMMHEKKDGTLMNEFTTPILQEVME----NSKI-MQLGKYEPMEGTEKKFTFWA-DKP-GAYWVGEGQKIETSK-ATWVN 100 (324) Q Consensus 29 ~~~~~~~~~~vp~~~~~~i~~~~~~----~s~l-~~l~~~~~~~~~~~~ip~~~-~~~-~a~~v~Eg~~~~~~~-~~~~~ 100 (324) ...+-.-. ++.+.....+++.+.. .+-+ ..+++..++.+-.+++-+.. ..+ .+.+++.+.+.+..+ ..++. T Consensus 1 M~~~~~~d-~~~~~~l~~~i~~~~~~~~~~~~l~~~~fp~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~r~g~~~ 79 (348) T protein:vir:98 1 MSWTLDTE-FIEPTQLTGLIREALRDLQVNRFRLARWLPNVDVDDITFEFLRGGGGLAETASYRSWDTESKIGRREGLAK 79 (348) T ss_pred Ccchhhhh-ccCHHHHHHHHHHHhhccCcchhhHHhcCCCccccceEEEEEeccCCceeeeeeecCCCccceeeccccee Confidence 00000111 2333333444444321 1122 33444444433333322211 122 356778777666543 45777 Q ss_pred EEeeheeeEEeeeehHHHhhc---C-hHHHHHHHH---HHHHHHHHHHHH----HHHHhcc----Cccccccc-c--ccc Q lcl|Aclame:pro 101 ATMRAFKLGVILPVTKEFLNY---T-YSQFFEEMK---PMIAEAFYKKFD----EAGILNQ----GNNPFGKS-I--AQS 162 (324) Q Consensus 101 v~l~~~k~~~~~~iS~e~l~d---s-~~~~~~~i~---~~l~~ai~~~~d----~~~l~G~----g~~~~~~~-~--~~~ 162 (324) .+..+-.++-...++.+-+.. . ...+...+. ..+.+.+...+| +++.+|. |.+..-.. . .+. T Consensus 80 ~~~~~~~i~~~~~i~~~d~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~g~~~~vDyg~~~~~~ 159 (348) T protein:vir:98 80 VMGELPPISEKIPLNEYDRLRLRKLSRDEALPFIARDAQRLARNIGARFEVARGSALVNATVPVTELQQTVDFGRIGSHS 159 (348) T ss_pred eeeeccccccccccCHHHHHHhcCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecCceEEccccCcccc Confidence 788887777777776642221 1 112222222 233444444444 3444442 11111000 0 000 Q ss_pred cccccc--cccchhhhhHHHHHHHHhhhh-cCCCcEEEEcHHHHHHHHH---hhcc-C------CceeeccCCc---cee Q lcl|Aclame:pro 163 IEKTNK--VIKGDFTQDNIIDLEALLEDD-ELEANAFISKTQNRSLLRK---IVDP-E------TKERIYDRNS---DSL 226 (324) Q Consensus 163 ~~~~~~--~~~~~~~~~~i~~~~~~l~~~-~~~~~~~v~~~~~~~~l~~---~~d~-~------g~~~~~~~~~---~~l 226 (324) ...... ..++....+||.+++..+.+. +..+..++|+++++..|+. +++. . ..+++..... -.. T Consensus 160 ~t~~~~Ws~~~~adp~~di~~~~~~~~~~~G~~p~~~vm~~~~~~~l~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 239 (348) T protein:vir:98 160 VVAAVLWSVHATATPISDLESWVATYEDTNGQSPGVILMPKAAVSHMRQCEEVIRQVFPLAPSGTAPMVSVEQLNTVLSS 239 (348) T ss_pred cccccccCCCCCCCHHHHHHHHHHHHHHccCCcceEEEeCHHHHHHHhcCHHHHHHHhccCccccccccCHHHHHHHHHh Confidence 000111 122345668888888887654 7788899999999998852 3321 1 1122211111 112 Q ss_pred ecceeEeecCC----------CCCCceeEEeecc-cE------EEEEe-cceEEEEeeccceeccccccccchhhhhc-C Q lcl|Aclame:pro 227 DGLPVVNLKSS----------NLKRGELITGDFD-KL------IYGIP-QLIEYKIDETAQLSTVKNEDGTPVNLFEQ-D 287 (324) Q Consensus 227 ~G~pv~~~~~~----------~~~~~~~i~gd~s-~~------~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~f~~-~ 287 (324) +|.|.+..-+. ..+++.+++.-.. .. .+|.. =|...+..+...........+..+..|.+ | T Consensus 240 ~g~~~i~~~d~~~~~~g~~~~~~p~~~i~l~p~~~~~~~~~~~~~G~t~~G~~~e~~~~~~~~~~~~~~~i~~~~~~~~d 319 (348) T protein:vir:98 240 MGLPPIEVYDAKVAVDGVSTRITPANAIALLPEPGATDAAQPTELGATLLGTTAESLEDDYALAPGEQPGIVAATWKTKD 319 (348) T ss_pred hCCeEEEEeeeEEEcCCceeceecCCeEEEEecCCcccccccccccceecccchhhhccccccceeccCceeeeeeeecC Confidence 34543321110 1122233221100 00 00000 00000000000000000011111111211 1 Q ss_pred --cEEEEEEEEeccEEeccCceEEEEeec Q lcl|Aclame:pro 288 --MVALRATMHVALHIADDKAFAKLVPAD 314 (324) Q Consensus 288 --~v~~r~~~r~d~~v~~~~A~~~l~~~~ 314 (324) ...+++..+.=-.+.+|+++..++..+ T Consensus 320 P~~~~~~~~s~~lPv~~~~~~~~~a~Vl~ 348 (348) T protein:vir:98 320 PVRLWTHAAAVGIPVLREPNLTFKAQVLA 348 (348) T ss_pred CcEEEEEEeeeeeccccCCCcEEEEEEeC Confidence 233455555555566788888888877 No 245 >protein:vir:104549 Length: 462 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214669;genbank:gi:61806310;genbank:GeneID:3294604 Probab=29.28 E-value=1.7 Score=19.36 Aligned_cols=294 Identities=12% Similarity=0.069 Sum_probs=119.8 Q ss_pred CchhHHHHHH----------------HHHHHhhhhhHHhh-----------cc----ccccccccCc-cccchHHHHHHH Q lcl|Aclame:pro 1 MEQTQKLKLN----------------LQHFASNNVKPQVF-----------NP----DNVMMHEKKD-GTLMNEFTTPIL 48 (324) Q Consensus 1 ~~~~~~~k~~----------------~~~~a~~~~~~~~~-----------~~----~~~~~~~~~~-~~vp~~~~~~i~ 48 (324) |-|.+.+++- .+......+..|+. .+ -+..++.+++ .-.-|.+.. +. T Consensus 1 ms~~~l~~~w~~~l~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~l~ea~~~~g~~~~~~~t~~~~~~~P~Li~-l~ 79 (462) T protein:vir:10 1 MSIQQLQEKWAPVLNHESVPEIKDSYKKGVVAQLLENQENAIREEGQVLNETLQTTGYTTGDTATGPVAGFDPVLIS-LI 79 (462) T ss_pred CchHHHHHHhhhhhcccccchhhhhhHHHHHHHHhhhHHHHHHhcccchhccccccCCCcCcccccccccccchhhh-HH Confidence 3333222111 01111111111110 00 0111111111 011222222 33 Q ss_pred HHHHhhhhhhhhcceeecCCCceEEE----EEeC--------Ccce-------ee------------------------- Q lcl|Aclame:pro 49 QEVMENSKIMQLGKYEPMEGTEKKFT----FWAD--------KPGA-------YW------------------------- 84 (324) Q Consensus 49 ~~~~~~s~l~~l~~~~~~~~~~~~ip----~~~~--------~~~a-------~~------------------------- 84 (324) ++.-++-+...++.+-||++++.-|- +... +.++ .| T Consensus 80 Rra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~~~~~~~nq~gtEAlfnEadt~fSg~~~~~~~~~~~~~~~~~~~~~~g 159 (462) T protein:vir:10 80 RRSMPQLIAYDVAGVQPMTGPTGLIFAMRSFYGSERRPANSDFREALFNEPNAGFSGGAGTGLSNYDPTASSSAVNDAEG 159 (462) T ss_pred HHHHhhhhhhcceeeecCCcchhhhheeeeeccCCccccccccchhhhccCCcCcccccccccccccccccccccccccc Confidence 44445555666666666665433221 0000 0000 00 Q ss_pred --------------------------ecc-------CccccccccceeeEEeeheeeEEeeeehHHHhhcC----hHHHH Q lcl|Aclame:pro 85 --------------------------VGE-------GQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT----YSQFF 127 (324) Q Consensus 85 --------------------------v~E-------g~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds----~~~~~ 127 (324) .+| +...++-..++++++.+.+..+-.-..|-||.+|- ..|.+ T Consensus 160 ~~~~~~~~~~~g~~~~~~~~~GM~Ta~aE~lg~~s~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAE 239 (462) T protein:vir:10 160 ANPGLLNDSPAGTYEVTGDATGMATATAEALDDSSASTAFREMGFSIEKVTVTAKSRALKAEYSIEMAQDLKAIHGLDAE 239 (462) T ss_pred ccceeecCCCccceecccccccccchhccccCCccCCcchhhceeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChh Confidence 011 12345666777888888888888899999999984 37889 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCcc--------ccccccccccccccccccchhhhhHHHHHHHHhh---------hhc Q lcl|Aclame:pro 128 EEMKPMIAEAFYKKFDEAGILNQGNN--------PFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLE---------DDE 190 (324) Q Consensus 128 ~~i~~~l~~ai~~~~d~~~l~G~g~~--------~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~---------~~~ 190 (324) +.|.+.|.-.|...|++.+|.=--+. ....++..- .....+.-..+..+.|+.+++ ..- T Consensus 240 tELaNILSTEImlEINReii~~l~~~a~~~k~~~~~~~Gv~dl----~~~~~gr~~~e~~k~l~~qi~~ean~i~~~t~r 315 (462) T protein:vir:10 240 SELANILSTEILAEINREVVRTIYVNAVKGAIANTATDGIFDL----DVDSNGRWSVEKFKGLLFQIERDSNAIGQETRR 315 (462) T ss_pred HHHHHHHHHHHHHHhhHHHHhhhhhhheeeecccccccceeee----ccccchHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 99999999999999999988532111 011111110 001112333455566666663 112 Q ss_pred CCCcEEEEcHHHHHHHHHhh--c-c---CCceee--ccCCc----cee-ecceeEeecCCC--CCCceeEEeecccEEEE Q lcl|Aclame:pro 191 LEANAFISKTQNRSLLRKIV--D-P---ETKERI--YDRNS----DSL-DGLPVVNLKSSN--LKRGELITGDFDKLIYG 255 (324) Q Consensus 191 ~~~~~~v~~~~~~~~l~~~~--d-~---~g~~~~--~~~~~----~~l-~G~pv~~~~~~~--~~~~~~i~gd~s~~~~~ 255 (324) ...+.+++|+.+...|...- + . +++.-+ .+..+ +.| .|++|+..+... .+.. ++.+| T Consensus 316 ~~~n~~i~S~~Va~~La~sG~l~~~p~~~~~~~~~~~d~~~~~~~G~l~~r~~vy~D~Y~~~ns~~d--------y~~vG 387 (462) T protein:vir:10 316 GKGNILICSADVASALGMAGVLDYAPGLQGNSALTGVDDTSSTLVGTLNGRIKVYVDPYSSNVADKH--------FYVAG 387 (462) T ss_pred ccceEEEEchhHHHHhhhccchhccccccccccccccccccceeEEEecCceEEEEecccCCCcccc--------eEEEE Confidence 34457899999998884221 1 0 111111 12222 233 345665543321 1222 23344 Q ss_pred EecceEEE----Eeeccceecc--ccccccchhhhhcCcEEEEEEEEeccEEeccCceEEEEeecCCCCCCCCCC Q lcl|Aclame:pro 256 IPQLIEYK----IDETAQLSTV--KNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) Q Consensus 256 ~~~~~~~~----~~~~~~~~~~--~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~~l~~~~~~~~~~~~~~ 324 (324) ..+...++ +...+-+... .|+. -||- .+-...|++..+ +|= +.+.+-.++-. T Consensus 388 ~KG~~~~~~glfy~PYv~l~~~~~~dp~-----sfqP---~~g~~tRY~l~~-NP~--------t~~~~~~~~~~ 445 (462) T protein:vir:10 388 YKGTSPYDAGLFYCPYVPLQQVRAINPN-----TFQP---KIGFKTRYGMVS-NPF--------SGGLTQGSGAL 445 (462) T ss_pred EeCCcccccceeeccccccccccccCCc-----cccc---eeeeeeeeeeee-cCC--------CCCcCCccccc Confidence 44333221 1111100000 0110 0211 112223433332 111 11111111222 No 246 >protein:vir:100603 Length: 529 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656387;genbank:gi:109290138;genbank:GeneID:4156581 Probab=27.40 E-value=1.8 Score=19.12 Aligned_cols=290 Identities=12% Similarity=0.044 Sum_probs=115.1 Q ss_pred CchhHHHHHHH-------HHHHhhhhhHHhhccc------c-ccccccCcc-ccchHHHHHHHHHHHhhhhhhhhcceee Q lcl|Aclame:pro 1 MEQTQKLKLNL-------QHFASNNVKPQVFNPD------N-VMMHEKKDG-TLMNEFTTPILQEVMENSKIMQLGKYEP 65 (324) Q Consensus 1 ~~~~~~~k~~~-------~~~a~~~~~~~~~~~~------~-~~~~~~~~~-~vp~~~~~~i~~~~~~~s~l~~l~~~~~ 65 (324) =|+.+.+++.. -.-..+...+.+.+++ + ..++++++. -.-|.+. .+.+++-++-+...++.+-| T Consensus 38 enq~~~~~~~~~~~~~~~~e~~~~~l~e~~~~~~~~~~~~~ia~s~~t~~v~~~~P~Li-~lvRra~p~LIa~DIwGVQP 116 (529) T protein:vir:10 38 EAQEKDSKTDPVYRDDKLIEAFGQSLMEAEVAGDHGYDPTNIAAGQSSGAITNIGPAVI-GMVRRAIPSLIAFDIAGVQP 116 (529) T ss_pred hhHHHHhhcccccchhhhhhhhhhccchhhcccccccccccccccccccccccccchhh-hhHHHHHHhHHhhhhheecc Confidence 11111111110 0000112222222211 1 111111111 1112222 12333444555555655555 Q ss_pred cCCCceEEE----EE-eC-------------------------------------------------------------- Q lcl|Aclame:pro 66 MEGTEKKFT----FW-AD-------------------------------------------------------------- 78 (324) Q Consensus 66 ~~~~~~~ip----~~-~~-------------------------------------------------------------- 78 (324) |++++.-|- +. +. T Consensus 117 MTgPTGLIFAMRsrY~~~~~~~~g~eaf~~~~e~dt~~SG~~~~~~~~~~~~~~~~~~t~~~a~~~~~~~~~~~nea~t~ 196 (529) T protein:vir:10 117 MTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSGLAAKGATTSSDGTPFAALTAGQAVATGDIVYHFFYESGSA 196 (529) T ss_pred CCchhhhhhhheeeecCCcCCCcccccccccccccccccccccccccccccccccccccccceeeccccceeeecccccc Confidence 554332220 00 00 Q ss_pred --------Cc------------------------------ceeeecc---------CccccccccceeeEEeeheeeEEe Q lcl|Aclame:pro 79 --------KP------------------------------GAYWVGE---------GQKIETSKATWVNATMRAFKLGVI 111 (324) Q Consensus 79 --------~~------------------------------~a~~v~E---------g~~~~~~~~~~~~v~l~~~k~~~~ 111 (324) .+ -..-.+| +...++-..++++++.+.+..+-. T Consensus 197 ~s~~~tg~~~~~g~~~tg~~~~~~~~~~~a~~~~~~~~~gmsTa~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLK 276 (529) T protein:vir:10 197 YLQNVTGGNVTVGTNETGAALDALVSAKIAAGELAEIAEGMATSIAELRQGFNGTTDNPWNEMSFRIDKQTVEAKSRQLK 276 (529) T ss_pred cccccccccccccccccCCccccccccccccccccccccccchhhhhccccCCCCccccccceeeEEEEEEEeeecccee Confidence 00 0000111 123455567778888889988889 Q ss_pred eeehHHHhhcC----hHHHHHHHHHHHHHHHHHHHHHHHHhccCccc------------cccccccccccccccccchhh Q lcl|Aclame:pro 112 LPVTKEFLNYT----YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNP------------FGKSIAQSIEKTNKVIKGDFT 175 (324) Q Consensus 112 ~~iS~e~l~ds----~~~~~~~i~~~l~~ai~~~~d~~~l~G~g~~~------------~~~~~~~~~~~~~~~~~~~~~ 175 (324) -..|-||.+|- ..|.++.|.+.|...|...|++.+|.=-.... ...++...... .....+--. T Consensus 277 AEYTiELAQDLKAvHGLDAEtELsNILStEImlEINReii~~i~~~a~~~~~g~~~~~~~~~gv~d~~~~-~d~~~~~~~ 355 (529) T protein:vir:10 277 AQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWINYTAQVGKSGWTQTVGSAAGVFDFQDP-IDVRGARWA 355 (529) T ss_pred ccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHhhhhceeeeeeeeccccccccceecccc-ccccccchh Confidence 99999999984 37899999999999999999999996100000 00111110000 000000001 Q ss_pred hhHHHHHH-------HHhhhhc--CCCcEEEEcHHHHHHHHHh--hccCC----ce-eeccCCc----ceee-cceeEee Q lcl|Aclame:pro 176 QDNIIDLE-------ALLEDDE--LEANAFISKTQNRSLLRKI--VDPET----KE-RIYDRNS----DSLD-GLPVVNL 234 (324) Q Consensus 176 ~~~i~~~~-------~~l~~~~--~~~~~~v~~~~~~~~l~~~--~d~~g----~~-~~~~~~~----~~l~-G~pv~~~ 234 (324) .+.++.|+ ..+...- ...+.+++++.+...|... .+.-+ .. ...+... +.|. |++|+.. T Consensus 356 ~e~~~~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~~~~~~~~~~~sg~~~d~~~~~~~G~l~~~~~vy~D 435 (529) T protein:vir:10 356 GESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALVDAGITPAAQGMASGLNADTTKGVFAGVLGGRYKVYID 435 (529) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHhhhccccccccccccccceeecCCceEEEEecCceEEEec Confidence 12233333 3343322 2456789999999888632 21111 11 1122222 2332 3455554 Q ss_pred cCCCCCCceeEEeecccEEEEEecceEEEEeeccceeccccccccchhhhhcCcEEEEEEEEeccEEeccCceE-----E Q lcl|Aclame:pro 235 KSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFA-----K 309 (324) Q Consensus 235 ~~~~~~~~~~i~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~r~~~r~d~~v~~~~A~~-----~ 309 (324) +.. +.. ++.+|..+...++ . . +|-.-.|.+.. ....||+.|- + T Consensus 436 ~y~--~~d--------y~~vG~KG~~~~~--~--g-------------lfy~PYv~l~~-----~~~~dp~sfqP~~g~~ 483 (529) T protein:vir:10 436 QYA--RQD--------YFTMGYRGANNLD--A--G-------------IYYCPYVALTP-----LRGSDPKNFQPVMGFK 483 (529) T ss_pred CCC--Ccc--------eEEEEEeCCcccc--c--c-------------eeecccccccc-----ccccCCCcccceeeee Confidence 432 222 3334444433322 1 0 11111111110 0012444432 0 Q ss_pred ----EEeecCCC---CCCCCCC Q lcl|Aclame:pro 310 ----LVPADKRT---DSVPGEV 324 (324) Q Consensus 310 ----l~~~~~~~---~~~~~~~ 324 (324) |..-+-.. +.+.+-+ T Consensus 484 tRY~l~~NP~~~~~~~~~~~r~ 505 (529) T protein:vir:10 484 TRYAIGVNPFAESRTQAPTSRI 505 (529) T ss_pred eeeceeecCccccccccccccc Confidence 11111111 1222233 No 247 >protein:vir:1991 Length: 305 # NCBI annotation: major head subunit # Family: family:all:776 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050638;genbank:gi:9633525;genbank:GeneID:2636267 Probab=23.33 E-value=2.3 Score=18.58 Aligned_cols=228 Identities=10% Similarity=0.034 Sum_probs=99.3 Q ss_pred CchhHHHHHHHHHHHhhhhhHHhhccccccccccCccccchHHHHHHHHHHHh-hhhhhhhcceeecCCCceEEEEEeCC Q lcl|Aclame:pro 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVME-NSKIMQLGKYEPMEGTEKKFTFWADK 79 (324) Q Consensus 1 ~~~~~~~k~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~~~-~s~l~~l~~~~~~~~~~~~ip~~~~~ 79 (324) |--++..-..+. ..+.+.+.+.+.. .+-..+++.++|-++..-++..+..- T Consensus 1 M~i~~~~l~~l~----------------------------~~~~~~f~~~~~~a~~~~~~iA~~vpSt~~~~tY~wLg~f 52 (305) T protein:vir:19 1 MIVTPASIKALM----------------------------TSWRKDFQGGLEDAPSQYNKIAMVVNSSTRSNTYGWLGKF 52 (305) T ss_pred CccCHHHHHHHH----------------------------HHHHHHHHHHHhhcCcccceEEeEecCCCCcccccccccC Confidence 222211110000 1111111112221 22245566667766666667777666 Q ss_pred cce-eeeccCccccccccceeeEEeeheeeEEeeeehHHHhhcChHHHHHHHHHHHHHHHHHHHHHHHHh----ccCc-- Q lcl|Aclame:pro 80 PGA-YWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL----NQGN-- 152 (324) Q Consensus 80 ~~a-~~v~Eg~~~~~~~~~~~~v~l~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~ai~~~~d~~~l~----G~g~-- 152 (324) |.. .|+|| .....++-..-+++-+++...+.|.|+.|+|-...+-+-+.++++++.+.--|+.++. |..+ T Consensus 53 P~lrewiGe---r~i~~l~~~~y~i~Nk~fe~tV~V~R~dIeDD~lG~y~p~~~~~G~~aa~~pd~lv~~lL~~Gf~~~c 129 (305) T protein:vir:19 53 PTLKEWVGK---RTIQQMEAHGYSIANKTFEGTVGISRDDFEDDNLGIYAPIFQEMGRSAAVQPDELIFKLLKDGFTQPC 129 (305) T ss_pred Cccchhhcc---eeeeeccccceeEeeccccceeccchhhccccccCchHHHHHHHHHHHhhchhhHHHHHHHhcCCccC Confidence 654 68854 4444455556677888999999999999999889999999999999999988886663 3211 Q ss_pred --------ccccccc-cccccc---ccc-----cccchhh--hhH---HHHHHH-------------------------H Q lcl|Aclame:pro 153 --------NPFGKSI-AQSIEK---TNK-----VIKGDFT--QDN---IIDLEA-------------------------L 185 (324) Q Consensus 153 --------~~~~~~~-~~~~~~---~~~-----~~~~~~~--~~~---i~~~~~-------------------------~ 185 (324) ..|+..- ..+.+. ++. ..++... .|. ++=+|. . T Consensus 130 yDGq~FFdtDHpv~~~~~~tg~~~~vsn~~~~~~~~g~~w~Lld~~~~ikP~I~Q~Rk~~~~~~~~~~~d~~vf~~~e~~ 209 (305) T protein:vir:19 130 YDGQNFFDKEHPVYPNVDGTGSAVNTSNIVEQDSFSGLPFYLLDCSRAVKPLIFQERRKPELVARTRIDDDHVFMDNEFL 209 (305) T ss_pred CCCCcccCCCCCcccCCcccccccchhhhhcCCCCCCceeeeeecCCcceeEEEecccccceeeccCCCchhhhhhceee Confidence 1121100 000000 000 0011100 000 000000 0 Q ss_pred hhhhcCCCcE---E--------EEc----HHHHHHHHHhhccCCceeeccCCcceeecceeE-------eecCCCCCCce Q lcl|Aclame:pro 186 LEDDELEANA---F--------ISK----TQNRSLLRKIVDPETKERIYDRNSDSLDGLPVV-------NLKSSNLKRGE 243 (324) Q Consensus 186 l~~~~~~~~~---~--------v~~----~~~~~~l~~~~d~~g~~~~~~~~~~~l~G~pv~-------~~~~~~~~~~~ 243 (324) .....+.++. | -++ ...+.+++++|+..|+++-..+. ++=.|.- +..+...+.+. T Consensus 210 ygvd~R~n~Gygfwq~a~gS~~~Ls~~nl~aar~aM~~qk~d~G~pL~I~P~---~LvVPp~LE~~A~qll~s~~i~~g~ 286 (305) T protein:vir:19 210 FGASTRRAAGYGFWQMAVAVKGDLTLDNLWKGWQLMRSFEGDGGKKLGLKPT---HIVVPVGLEKAAEQLLNRELFADGN 286 (305) T ss_pred eeeeeeeeccccchhheecCCCCCCHHHHHHHHHHHHhhcCCCCceeeeecC---eEEeCchhHHHHHHHHhhcccCCcc Confidence 0000111111 1 111 12356677888888887643221 2211110 00010011110 Q ss_pred eEEeecccEEEEEecceEEEEeecc Q lcl|Aclame:pro 244 LITGDFDKLIYGIPQLIEYKIDETA 268 (324) Q Consensus 244 ~i~gd~s~~~~~~~~~~~~~~~~~~ 268 (324) . ++. +. .++-.++.++..- T Consensus 287 ~--~~~-Np---~~g~~eliV~P~L 305 (305) T protein:vir:19 287 T--TVS-NE---MKGKLQLVVADYL 305 (305) T ss_pred c--ccc-ce---ecceEEEEecccC Confidence 0 000 00 1122333333332 Done!