Query lcl|Aclame:protein:vir:99888|NCBI_annot:capsid protein|genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Match_columns 309 No_of_seqs 140 out of 186 Neff 7.3 Searched_HMMs 1612 Date Sun Dec 1 13:46:21 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_56 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_56_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:99888 Length: 309 100.0 4E-114 2E-117 642.4 25.2 309 1-309 1-309 (309) 2 protein:vir:107882 Length: 307 100.0 5.2E-98 3E-101 553.9 23.5 296 1-308 1-307 (307) 3 protein:vir:79078 Length: 307 100.0 1.5E-96 1E-99 545.9 23.3 296 1-308 1-307 (307) 4 protein:vir:106590 Length: 349 100.0 1.6E-40 1E-43 238.7 19.2 302 1-307 1-349 (349) 5 protein:vir:98480 Length: 348 100.0 6.4E-36 4E-39 213.5 20.7 298 1-308 1-348 (348) 6 protein:vir:2736 Length: 348 # 100.0 6.7E-36 4.1E-39 213.4 20.4 299 1-309 1-347 (348) 7 protein:vir:96490 Length: 348 100.0 6.5E-36 4E-39 213.5 20.1 299 1-309 1-347 (348) 8 protein:vir:4902 Length: 348 # 100.0 2.9E-35 1.8E-38 209.9 20.6 298 1-309 1-347 (348) 9 protein:vir:79503 Length: 409 100.0 1.1E-31 7E-35 190.2 16.9 303 1-309 7-393 (409) 10 protein:vir:78006 Length: 409 100.0 1.1E-31 7E-35 190.2 16.9 303 1-309 7-393 (409) 11 protein:vir:3424 Length: 341 # 99.9 1.2E-27 7.2E-31 168.3 16.5 292 1-308 1-341 (341) 12 protein:vir:393 Length: 341 # 99.9 1.5E-26 9E-30 162.2 17.4 292 1-308 1-341 (341) 13 protein:vir:6378 Length: 346 # 99.9 5.4E-26 3.4E-29 159.1 16.7 294 3-307 1-346 (346) 14 protein:vir:108211 Length: 318 99.4 5.2E-15 3.2E-18 98.9 11.7 267 1-309 1-318 (318) 15 protein:vir:10324 Length: 320 98.8 2.8E-11 1.7E-14 78.5 9.2 280 17-309 1-316 (320) 16 protein:vir:95258 Length: 368 97.9 1.5E-06 9.5E-10 52.5 13.6 295 1-309 1-365 (368) 17 protein:vir:94771 Length: 298 97.7 4.1E-06 2.5E-09 50.1 14.0 281 1-307 1-298 (298) 18 protein:vir:1638 Length: 298 # 97.6 1.1E-05 6.9E-09 47.8 14.1 279 1-307 1-298 (298) 19 protein:vir:96392 Length: 324 97.5 7E-06 4.4E-09 48.8 12.8 276 1-309 30-316 (324) 20 protein:vir:78830 Length: 324 97.5 7E-06 4.4E-09 48.8 12.8 276 1-309 30-316 (324) 21 protein:vir:8187 Length: 311 # 97.3 1.5E-05 9.5E-09 47.0 12.4 288 1-309 1-311 (311) 22 protein:vir:96223 Length: 324 97.3 2.3E-05 1.4E-08 46.0 13.3 275 1-309 30-316 (324) 23 protein:vir:7771 Length: 330 # 97.2 4E-05 2.5E-08 44.7 13.6 284 1-309 1-324 (330) 24 protein:vir:9309 Length: 324 # 97.2 5.7E-05 3.5E-08 43.9 13.7 276 1-309 30-316 (324) 25 protein:vir:99749 Length: 324 97.1 6.3E-05 3.9E-08 43.6 13.6 274 1-309 30-316 (324) 26 protein:vir:9759 Length: 303 # 97.1 4.6E-05 2.9E-08 44.3 12.6 283 1-308 1-303 (303) 27 protein:vir:103955 Length: 324 97.0 0.00011 7E-08 42.2 14.0 274 1-309 30-316 (324) 28 protein:vir:80684 Length: 315 96.9 0.00013 8.3E-08 41.8 13.8 286 1-309 1-307 (315) 29 protein:vir:97148 Length: 324 96.9 0.00015 9.5E-08 41.5 13.7 273 1-309 31-316 (324) 30 protein:vir:9820 Length: 272 # 96.9 0.00029 1.8E-07 40.0 15.2 253 1-309 1-270 (272) 31 protein:vir:3033 Length: 272 # 96.9 0.00029 1.8E-07 40.0 15.2 253 1-309 1-270 (272) 32 protein:vir:93742 Length: 274 96.7 0.00037 2.3E-07 39.4 16.1 259 1-309 1-271 (274) 33 protein:vir:9574 Length: 300 # 96.7 0.00031 1.9E-07 39.8 14.3 282 1-309 1-300 (300) 34 protein:vir:1239 Length: 274 # 96.5 0.0006 3.7E-07 38.3 15.8 259 1-309 1-271 (274) 35 protein:vir:105905 Length: 304 96.3 0.00037 2.3E-07 39.4 12.5 272 1-307 1-304 (304) 36 protein:vir:94142 Length: 304 96.3 0.00037 2.3E-07 39.4 12.5 272 1-307 1-304 (304) 37 protein:vir:41 Length: 299 # N 96.3 0.00067 4.2E-07 38.0 13.6 273 1-309 6-299 (299) 38 protein:vir:4226 Length: 326 # 96.1 0.00041 2.5E-07 39.2 11.8 279 1-309 22-324 (326) 39 protein:vir:94494 Length: 274 96.1 0.001 6.3E-07 37.0 15.7 259 1-309 1-271 (274) 40 protein:vir:97433 Length: 274 96.1 0.001 6.3E-07 37.0 15.7 259 1-309 1-271 (274) 41 protein:vir:78523 Length: 338 96.1 0.00069 4.3E-07 37.9 12.8 288 1-309 10-336 (338) 42 protein:vir:96123 Length: 274 95.9 0.0012 7.7E-07 36.5 15.4 255 1-309 1-271 (274) 43 protein:vir:80930 Length: 278 95.7 0.0017 1E-06 35.8 13.6 262 1-309 1-277 (278) 44 protein:vir:191 Length: 385 # 95.6 0.0017 1.1E-06 35.7 14.4 263 1-309 105-385 (385) 45 protein:vir:1886 Length: 385 # 95.6 0.0017 1.1E-06 35.7 14.4 263 1-309 105-385 (385) 46 protein:vir:2430 Length: 318 # 95.6 0.0017 1.1E-06 35.7 14.6 281 1-309 14-314 (318) 47 protein:vir:99920 Length: 311 95.5 0.0019 1.2E-06 35.5 13.1 286 1-308 1-311 (311) 48 protein:vir:100135 Length: 418 95.5 0.002 1.3E-06 35.3 14.8 264 1-309 136-416 (418) 49 protein:vir:4600 Length: 415 # 95.3 0.0023 1.4E-06 35.1 16.4 269 1-309 120-405 (415) 50 protein:vir:4700 Length: 415 # 95.3 0.0023 1.4E-06 35.1 16.4 269 1-309 120-405 (415) 51 protein:vir:485 Length: 407 # 95.2 0.0026 1.6E-06 34.8 12.7 266 1-309 106-401 (407) 52 protein:vir:9410 Length: 415 # 94.9 0.0033 2E-06 34.2 16.6 269 1-309 127-405 (415) 53 protein:vir:81070 Length: 390 94.7 0.0036 2.3E-06 34.0 13.3 260 1-306 114-390 (390) 54 protein:vir:80068 Length: 301 94.7 0.0014 8.5E-07 36.3 10.0 267 1-306 1-301 (301) 55 protein:vir:3613 Length: 272 # 94.7 0.0038 2.3E-06 33.9 14.6 259 1-308 1-272 (272) 56 protein:vir:78223 Length: 333 94.7 0.0037 2.3E-06 33.9 12.3 286 1-309 20-333 (333) 57 protein:vir:104342 Length: 314 94.6 0.0011 6.6E-07 36.9 9.1 258 1-292 17-314 (314) 58 protein:vir:80376 Length: 435 94.5 0.0023 1.4E-06 35.0 10.8 280 1-309 135-434 (435) 59 protein:vir:81100 Length: 415 94.4 0.0046 2.9E-06 33.4 16.4 269 1-309 124-405 (415) 60 protein:vir:79987 Length: 415 94.4 0.0046 2.9E-06 33.4 16.4 269 1-309 124-405 (415) 61 protein:vir:98339 Length: 415 94.4 0.0046 2.9E-06 33.4 16.4 269 1-309 124-405 (415) 62 protein:vir:1433 Length: 435 # 94.3 0.0048 3E-06 33.3 12.3 279 1-309 132-434 (435) 63 protein:vir:96833 Length: 275 94.2 0.0051 3.2E-06 33.1 15.5 255 1-309 3-272 (275) 64 protein:vir:3158 Length: 321 # 93.8 0.0062 3.9E-06 32.7 13.5 267 1-309 1-312 (321) 65 protein:vir:79928 Length: 393 93.6 0.0071 4.4E-06 32.4 14.3 278 1-308 74-393 (393) 66 protein:vir:104085 Length: 320 93.5 0.0073 4.5E-06 32.3 13.1 279 1-309 1-318 (320) 67 protein:vir:107687 Length: 319 93.4 0.0076 4.7E-06 32.2 11.5 261 1-306 18-319 (319) 68 protein:vir:103285 Length: 296 93.3 0.00077 4.7E-07 37.7 5.8 259 1-292 1-296 (296) 69 protein:vir:104256 Length: 458 93.1 0.0088 5.5E-06 31.8 16.2 269 1-309 165-457 (458) 70 protein:vir:81227 Length: 413 93.1 0.0089 5.5E-06 31.8 16.9 267 1-309 122-411 (413) 71 protein:vir:96262 Length: 274 93.0 0.0092 5.7E-06 31.8 15.0 254 1-309 1-270 (274) 72 protein:vir:95898 Length: 274 93.0 0.0092 5.7E-06 31.8 15.0 254 1-309 1-270 (274) 73 protein:vir:5255 Length: 304 # 92.6 0.011 6.6E-06 31.4 11.6 265 1-292 1-304 (304) 74 protein:vir:4456 Length: 401 # 92.0 0.013 8.4E-06 30.8 13.9 265 1-308 107-401 (401) 75 protein:vir:94673 Length: 419 91.7 0.014 9E-06 30.7 14.6 269 1-309 123-418 (419) 76 protein:vir:4856 Length: 293 # 91.6 0.015 9.4E-06 30.6 12.3 262 1-309 5-282 (293) 77 protein:vir:4830 Length: 397 # 91.4 0.016 1E-05 30.4 12.5 265 1-309 114-386 (397) 78 protein:vir:8102 Length: 543 # 90.7 0.02 1.2E-05 29.9 14.3 275 1-309 255-543 (543) 79 protein:vir:105334 Length: 276 90.5 0.021 1.3E-05 29.8 15.4 255 1-306 1-276 (276) 80 protein:vir:4997 Length: 397 # 90.0 0.023 1.4E-05 29.6 13.3 265 1-309 109-386 (397) 81 protein:vir:4953 Length: 397 # 89.7 0.025 1.5E-05 29.4 13.3 265 1-309 109-386 (397) 82 protein:vir:1084 Length: 437 # 89.1 0.029 1.8E-05 29.0 12.9 256 1-309 161-428 (437) 83 protein:vir:107593 Length: 392 88.9 0.029 1.8E-05 29.0 13.4 263 1-309 106-385 (392) 84 protein:vir:102873 Length: 392 88.9 0.029 1.8E-05 29.0 13.4 263 1-309 106-385 (392) 85 protein:vir:102082 Length: 392 88.9 0.029 1.8E-05 29.0 13.4 263 1-309 106-385 (392) 86 protein:vir:105004 Length: 392 88.9 0.029 1.8E-05 29.0 13.4 263 1-309 106-385 (392) 87 protein:vir:97053 Length: 390 88.4 0.032 2E-05 28.8 15.9 260 1-306 114-390 (390) 88 protein:vir:94070 Length: 339 88.3 0.022 1.3E-05 29.7 8.9 261 1-301 46-339 (339) 89 protein:vir:78090 Length: 302 88.0 0.035 2.2E-05 28.5 10.0 270 1-308 1-302 (302) 90 protein:vir:1025 Length: 408 # 87.7 0.037 2.3E-05 28.4 12.0 263 1-309 121-394 (408) 91 protein:vir:78920 Length: 290 87.6 0.037 2.3E-05 28.4 12.1 268 1-290 1-290 (290) 92 protein:vir:739 Length: 231 # 87.5 0.038 2.4E-05 28.3 13.4 225 20-301 1-231 (231) 93 protein:vir:79642 Length: 329 87.3 0.04 2.5E-05 28.3 14.4 266 1-309 26-329 (329) 94 protein:vir:80180 Length: 381 87.1 0.041 2.6E-05 28.2 12.7 289 1-309 1-326 (381) 95 protein:vir:80213 Length: 334 86.4 0.046 2.9E-05 27.9 11.2 282 1-297 1-334 (334) 96 protein:vir:1328 Length: 392 # 85.6 0.052 3.2E-05 27.6 13.0 265 1-309 114-392 (392) 97 protein:vir:102119 Length: 404 85.4 0.053 3.3E-05 27.6 13.4 270 1-309 110-401 (404) 98 protein:vir:10364 Length: 390 84.6 0.059 3.7E-05 27.3 16.3 260 1-306 114-390 (390) 99 protein:vir:7990 Length: 273 # 84.4 0.06 3.8E-05 27.3 14.4 261 1-308 1-273 (273) 100 protein:vir:3643 Length: 336 # 83.8 0.065 4.1E-05 27.1 10.5 265 1-301 34-336 (336) 101 protein:vir:3845 Length: 395 # 82.3 0.078 4.8E-05 26.7 12.0 262 1-309 105-384 (395) 102 protein:vir:100247 Length: 425 82.2 0.079 4.9E-05 26.6 12.6 266 1-309 130-425 (425) 103 protein:vir:2504 Length: 305 # 82.2 0.079 4.9E-05 26.6 14.0 275 1-309 1-299 (305) 104 protein:vir:99075 Length: 392 82.0 0.081 5E-05 26.6 11.5 279 1-309 1-314 (392) 105 protein:vir:1383 Length: 421 # 81.8 0.083 5.1E-05 26.5 13.4 257 1-309 116-395 (421) 106 protein:vir:3991 Length: 404 # 81.6 0.085 5.2E-05 26.5 12.5 262 1-309 116-394 (404) 107 protein:vir:4159 Length: 315 # 81.1 0.089 5.5E-05 26.3 9.7 270 1-309 8-310 (315) 108 protein:vir:94622 Length: 341 81.0 0.09 5.6E-05 26.3 12.0 285 1-309 1-340 (341) 109 protein:vir:101557 Length: 336 80.8 0.091 5.6E-05 26.3 10.8 265 1-301 34-336 (336) 110 protein:vir:95107 Length: 270 79.7 0.1 6.3E-05 26.0 15.0 257 1-304 1-270 (270) 111 protein:vir:100884 Length: 389 77.8 0.12 7.5E-05 25.6 11.7 258 1-309 109-385 (389) 112 protein:vir:95763 Length: 297 77.2 0.13 7.9E-05 25.5 11.1 269 1-309 1-297 (297) 113 protein:vir:2344 Length: 397 # 76.7 0.13 8.2E-05 25.4 14.9 279 1-309 10-307 (397) 114 protein:vir:102335 Length: 312 75.8 0.14 8.9E-05 25.2 10.7 272 1-300 1-312 (312) 115 protein:vir:105822 Length: 273 74.8 0.15 9.5E-05 25.0 14.3 262 1-308 1-273 (273) 116 protein:vir:102605 Length: 273 74.8 0.15 9.5E-05 25.0 14.3 262 1-308 1-273 (273) 117 protein:vir:99576 Length: 388 73.9 0.17 0.0001 24.9 10.6 266 1-301 65-388 (388) 118 protein:vir:107732 Length: 379 71.8 0.19 0.00012 24.5 9.7 265 1-301 56-379 (379) 119 protein:vir:105038 Length: 428 71.8 0.19 0.00012 24.5 12.5 279 1-308 127-428 (428) 120 protein:vir:6242 Length: 390 # 69.7 0.22 0.00014 24.2 14.9 264 1-309 116-390 (390) 121 protein:vir:78558 Length: 336 67.8 0.25 0.00015 23.9 10.4 265 1-301 34-336 (336) 122 protein:vir:4339 Length: 395 # 66.8 0.26 0.00016 23.8 16.9 263 1-309 117-395 (395) 123 protein:vir:6212 Length: 434 # 63.9 0.31 0.00019 23.4 14.5 272 1-309 143-434 (434) 124 protein:vir:79712 Length: 285 62.5 0.33 0.00021 23.2 9.6 268 1-309 1-284 (285) 125 protein:vir:4197 Length: 314 # 61.4 0.35 0.00022 23.1 13.9 276 1-309 1-314 (314) 126 protein:vir:102655 Length: 322 60.9 0.36 0.00022 23.0 12.6 278 1-290 13-322 (322) 127 protein:vir:6324 Length: 335 # 55.9 0.47 0.00029 22.4 11.6 289 1-306 1-335 (335) 128 protein:vir:81160 Length: 371 53.2 0.54 0.00033 22.1 11.8 261 1-308 91-371 (371) 129 protein:vir:7409 Length: 408 # 52.4 0.55 0.00034 22.0 12.4 261 1-309 116-394 (408) 130 protein:vir:99523 Length: 311 46.4 0.74 0.00046 21.3 11.5 275 1-289 1-311 (311) 131 protein:vir:101607 Length: 379 46.1 0.75 0.00046 21.3 16.2 254 1-309 109-378 (379) 132 protein:vir:96079 Length: 382 46.0 0.75 0.00046 21.3 13.5 269 1-301 63-382 (382) 133 protein:vir:106734 Length: 336 43.0 0.86 0.00053 20.9 10.2 262 1-301 34-336 (336) 134 protein:vir:78935 Length: 335 40.4 0.97 0.0006 20.7 11.5 287 1-306 1-335 (335) 135 protein:vir:101650 Length: 497 37.6 1.1 0.00069 20.3 11.3 269 1-309 151-494 (497) 136 protein:vir:7855 Length: 497 # 37.6 1.1 0.00069 20.3 11.3 269 1-309 151-494 (497) 137 protein:vir:5739 Length: 366 # 34.5 1.3 0.0008 20.0 12.4 280 1-308 66-366 (366) 138 protein:vir:78739 Length: 332 30.6 1.6 0.00097 19.5 8.9 285 1-306 7-332 (332) 139 protein:vir:97255 Length: 310 29.5 1.7 0.001 19.4 14.3 279 1-308 1-310 (310) 140 protein:vir:100172 Length: 394 29.2 1.7 0.001 19.4 14.9 255 1-309 111-385 (394) 141 protein:vir:94711 Length: 347 27.7 1.8 0.0011 19.2 9.7 284 1-309 1-347 (347) 142 protein:vir:97031 Length: 402 24.5 2.2 0.0013 18.7 11.0 294 1-309 1-349 (402) 143 protein:vir:93881 Length: 387 24.2 2.2 0.0014 18.7 14.1 256 1-309 118-382 (387) 144 protein:vir:105464 Length: 346 20.4 2.8 0.0017 18.2 10.4 274 1-309 1-301 (346) No 1 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=100.00 E-value=3.8e-114 Score=642.36 Aligned_cols=309 Identities=100% Similarity=1.480 Sum_probs=306.4 Q ss_pred CCCCCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCccceeeec Q lcl|Aclame:pro 1 MSNAPFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDETGSTED 80 (309) Q Consensus 1 m~~~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~~~~~e 80 (309) |+|++|++||+||++|+||+|++|||++|||+|||.+++|+|++|++.++|++++|+|+|++++++++++.++++++|++ T Consensus 1 ~~~~~~~~dp~LT~~A~gy~n~~~Ia~~l~P~vpV~~~~~~~~~f~~~e~F~~~~t~r~~~~~~~~v~~~~~~~~~~~~~ 80 (309) T protein:vir:99 1 MSNAPFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDETGSTED 80 (309) T ss_pred CCCCCcCcCHhHHHHHhhccChhhhhhhcCCccccCccccceeeechhhcccccchhhccCCCcceEeecccCceeeecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCChHHHHHHHH Q lcl|Aclame:pro 81 HGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPLPVITDAL 160 (309) Q Consensus 81 ~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sdPi~di~~~~ 160 (309) |+|+.+||++++++++.+|||+++|+++|+++|.+++|+++|++++++++|+++||++|+||++|||++||||+||++|+ T Consensus 81 ~~L~~~i~~~~~~~a~~~~d~~~~Av~~l~~~i~l~rE~~~A~lv~~~a~y~~~~k~~Lsgt~~wsd~~SDPi~~i~~~~ 160 (309) T protein:vir:99 81 HGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPLPVITDAL 160 (309) T ss_pred cceeecCCchhhhhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcChhhcCCCceEEecCccccCCCCCCcHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhCCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeeccccCCCcccceecCCcE Q lcl|Aclame:pro 161 DSVILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPGQNPNLIRAWGPHA 240 (309) Q Consensus 161 ~~~g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~~~~g~~~~~~~v~~~~~ 240 (309) +++|++||+|+||.++|++|++||+|++++++++.+.|+||+++|+++||+++|+||+++||++++||++++++||++++ T Consensus 161 ~~~g~~PN~~vlg~~~~~~l~~hp~i~~~ik~~~~~~g~it~~~la~l~~ve~V~vg~a~~n~a~~g~~~~~~~iwg~~~ 240 (309) T protein:vir:99 161 DSVILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPGQNPNLIRAWGPHA 240 (309) T ss_pred HhhCCCcceEEechHHHHHHhhCHHHHHHhcCCCccccccCHHHHHHHhCcceEEeecceeeccccccccccccccCCcE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhccccC Q lcl|Aclame:pro 241 SFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVAA 309 (309) Q Consensus 241 ~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va~ 309 (309) +|+|+++.+.+.++|||||||+|+.+..|+++++++|++|+++||++++++|+|+|+||||||+||||| T Consensus 241 ~L~y~~~~~~~~~~ps~G~t~~~~~r~~g~~~d~~~~~~g~~~vr~~~~~k~~i~~~d~G~li~~~va~ 309 (309) T protein:vir:99 241 SFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVAA 309 (309) T ss_pred EEEEcCCCCCCcccccccceeecccccCCceeeeeeccCCceEEEEeccccchhcchhcchhhhhcccC Confidence 999999999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:107882 Length: 307 # NCBI annotation: gp34 # Family: family:all:908 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024707;genbank:gi:48696944;genbank:GeneID:2845970 Probab=100.00 E-value=5.2e-98 Score=553.91 Aligned_cols=296 Identities=25% Similarity=0.414 Sum_probs=276.4 Q ss_pred CC--CCCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcC-cccee Q lcl|Aclame:pro 1 MS--NAPFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSAT-DETGS 77 (309) Q Consensus 1 m~--~~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~-~~~~~ 77 (309) |. +.+|++||+||++|+||+|++|||++|||+|||.+++|||++|++ ++|++++|+|++++.++++++... ..++. T Consensus 1 m~~~~~~~~~dp~LT~~A~gy~n~~~ia~~l~P~vpv~~~~~k~~~f~~-eaF~~~~t~r~~~~~~~~v~~~~~~~~~~~ 79 (307) T protein:vir:10 1 MGRLSKLRIVDPVLTNLAIGYTNAEFIGQSLMPVVEVEKEGGKIPKFGK-ESFRLYKTERALRARSNRMNPEDLGSIDIV 79 (307) T ss_pred CCCCCCCcccChhHHHHHHhhcchhhhhhhcCCcccccccccceeeECc-ccccchhhhcccCCCcceeecccccccccc Confidence 54 469999999999999999999999999999999999999999997 799999999999999999999875 56899 Q ss_pred eeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCChHHHHH Q lcl|Aclame:pro 78 TEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPLPVIT 157 (309) Q Consensus 78 ~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sdPi~di~ 157 (309) |.+|||+.++|+++ ++...|||+++|++.++++|.+++|+++|++++++.+|+++||++|||+++|||++|||++||+ T Consensus 80 ~~~~~L~~~id~r~--~~~~~~~~~~~av~~l~d~I~l~~E~~~A~l~~~~~~y~~~~k~tLsGt~~Wsd~~sDPi~di~ 157 (307) T protein:vir:10 80 LDEHDLEYPIDYRE--DQESAFPLEQAAVQTATEAIQLRREKMVADLAQNPNSYAGGNKKQLSATEKFTAAGSDPVGVIE 157 (307) T ss_pred cccccccccCChhh--cCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCCceEEeccccccCCCCCCcHHHHH Confidence 99999999999875 4566899999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHh----CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeeccccCCCcccc Q lcl|Aclame:pro 158 DALDSV----ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPGQNPNLI 233 (309) Q Consensus 158 ~~~~~~----g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~~~~g~~~~~~ 233 (309) +|++++ |++||+|+||+++|++|++||+|++++++++ .|+||+++|+++||+++|+||+++|++++ ++++ T Consensus 158 ~~~~ai~~~~g~~Pn~~vlg~~a~~al~~hp~i~e~lk~~~--~g~it~~~la~ll~v~~i~vg~a~~~~~~----~~~~ 231 (307) T protein:vir:10 158 DGKEAIRTKIGRRPNTMVIGASAYKTLKAHPQLIEKIKYSM--KGIVTVDLLKEIFEVENIAVGEAIYADDK----DRFT 231 (307) T ss_pred HHHHHHHhhhCCccceEEeCHHHHHHHhcCHHHHHHhCCcc--ccccCHHHHHHHhCceeEEEeeeeeeccC----Cccc Confidence 999875 8999999999999999999999999999875 58999999999999999999999998763 5799 Q ss_pred eecCCcEEEEecCCCCC----CcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhcccc Q lcl|Aclame:pro 234 RAWGPHASFIYRDRLAD----TRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVA 308 (309) Q Consensus 234 ~v~~~~~~L~~~~~~~~----~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va 308 (309) ++|+++++|+|+++.+. ++++|||||||+ +.++.+.|.+++..|+++|||+++++|+|++++|||||+|||- T Consensus 232 ~iw~~~~vl~yv~~~~~~~~~~~~epsfGyT~~---~~g~~~~d~~~~~~~~~~~r~~~~~~~~i~~~~~G~li~~~~~ 307 (307) T protein:vir:10 232 DIWGANIVLAYVPLQRGGQQRTPYEPSYGYTLR---KKGNPVVDTRIEDGKLELVRSTDIFRPYLLGADAGYLISGING 307 (307) T ss_pred eeCCCceEEEecccccCCCCCcccccccceeEE---EcCCeEeeceecCCceeEEeccccccceeecccccceeccCCC Confidence 99999999999987543 567799999998 4568889999998889999999999999999999999999999 No 3 >protein:vir:79078 Length: 307 # NCBI annotation: gp8 # Family: family:all:908 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111208;genbank:gi:134288798;genbank:GeneID:4960752 Probab=100.00 E-value=1.5e-96 Score=545.87 Aligned_cols=296 Identities=25% Similarity=0.415 Sum_probs=275.1 Q ss_pred CC--CCCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccC-cCcccee Q lcl|Aclame:pro 1 MS--NAPFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFS-ATDETGS 77 (309) Q Consensus 1 m~--~~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~-~~~~~~~ 77 (309) |. +.+|++||+||++|+||+|++||||.|||+|||.+++|+|++|++ ++|++++|+|++++.++++++. ....++. T Consensus 1 m~~~~~~~~~dp~LT~~A~gy~n~~~Iad~lfP~vpV~~~~~k~~~f~~-e~f~~~~t~ra~~~~~~~v~~~~~~~~~~~ 79 (307) T protein:vir:79 1 MGRLSKLRIVDPVLTNLAIGYTNAEFIGQTLMPVVEVEKEGGKIPKFGK-ESFRLYQTERALRAKSNRMNPEDIDSVDVN 79 (307) T ss_pred CCCCCCCcccCHHHHHHHhhccchhhhhhhcCCcccccccccceeeecc-ccccccccccccCCCcceeeeecccccccc Confidence 55 469999999999999999999999999999999999999999997 7999999999999999999985 5678999 Q ss_pred eeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCChHHHHH Q lcl|Aclame:pro 78 TEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPLPVIT 157 (309) Q Consensus 78 ~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sdPi~di~ 157 (309) |.+|+|+.++++++ .+...|||+++|++.|+++|.+++|++||+++++..+|+++||++|||+++|||++|||++||+ T Consensus 80 ~~~~~l~~~id~r~--~~~~~~~~~~~Av~~l~d~I~l~~E~~~A~l~~~~~~y~~~~k~tLsgt~~Wsd~~sDPi~di~ 157 (307) T protein:vir:79 80 LDEHDLEYPIDYRE--DQESAFPLEQAAVQTATDAIQLRREKMIADLSQNPSSYAAGNKKQLSATEKFTAANSDPVGVIE 157 (307) T ss_pred ccccchhhcccchh--cCCCCCCHHHHHHHHHHHHHHhHHHHHHHHHhccccccCCCceEEEccCcccCCCCCCcHHHHH Confidence 99999999999875 5567899999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHh----CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeeccccCCCcccc Q lcl|Aclame:pro 158 DALDSV----ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPGQNPNLI 233 (309) Q Consensus 158 ~~~~~~----g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~~~~g~~~~~~ 233 (309) +|++++ |++||+|+||+++|++|++||+|++++++++ .|+||+++|+++||+++|+||+++|++++ ++++ T Consensus 158 ~~~~ai~~~~g~~Pn~~vlg~~a~~~l~~h~~i~~~lk~~~--~g~it~~~la~l~~v~~V~vg~a~y~~~~----~~~~ 231 (307) T protein:vir:79 158 DGKEAIRTKIGRRPNTMVIGASAYKTLKAHPQLIEKIKYSM--KGIVTVDLLKEIFEVENIAVGEAIYADDK----DRFT 231 (307) T ss_pred HHHHHHHHhhCCccceEEeCHHHHHHHhcCHHHHHHhcCcc--ccccCHHHHHHHhCceeEEEeeeeeeccc----ccch Confidence 999876 8999999999999999999999999999876 58999999999999999999999998763 5789 Q ss_pred eecCCcEEEEecCCCCC----CcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhcccc Q lcl|Aclame:pro 234 RAWGPHASFIYRDRLAD----TRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVA 308 (309) Q Consensus 234 ~v~~~~~~L~~~~~~~~----~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va 308 (309) ++|+++++|+|+++.+. +.++|||||||++.+ +...|.+++..++++|||+++++|+|++++|||||+|||- T Consensus 232 ~iw~~~~~l~y~~~~~~~~~~~~~~ps~Gyt~~~~g---~~~~d~~~~~~~~~~vrv~~~~~~~i~~~~~G~li~~~v~ 307 (307) T protein:vir:79 232 DIWGANIVLAYVPLQRGGQQRTPYEPSYGYTLRKKG---NPVVDTRIEDGKLELVRATDIFRPYLLGADAGYLISGING 307 (307) T ss_pred hcCCCceEEEecccccCCCCCcccccccceeEEecC---ceEEecccCCCceeEEeecccccceeeccccchhhccCCC Confidence 99999999999987543 467899999998654 5567888888889999999999999999999999999999 No 4 >protein:vir:106590 Length: 349 # NCBI annotation: putative major head protein # Family: family:all:1083 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958585;genbank:gi:41179245;genbank:GeneID:2717126 Probab=100.00 E-value=1.6e-40 Score=238.70 Aligned_cols=302 Identities=15% Similarity=0.144 Sum_probs=206.5 Q ss_pred CCCCC----------CCcchhhHHHHHhhcc----hhhhhhhhCCccccccccceeEEec-------hhHhhhchhHhhc Q lcl|Aclame:pro 1 MSNAP----------FPIDPELTAIAIAYRN----GRMISDEVLPRVPVGKQEFKFWKYD-------LAQGFTVPETLVG 59 (309) Q Consensus 1 m~~~~----------f~~dp~LT~~a~~y~n----~~~ig~~lfP~v~v~~~~~k~~~~~-------~~~~f~~~~t~~~ 59 (309) |.|.- +..|...+....+|.| +.|+++.+||.+++....+++.+.. +-.+|..+...+. T Consensus 1 ~~~~~~~~~~~~~~~~~~d~~~~~~l~~~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~ 80 (349) T protein:vir:10 1 MKNQKLQLDLQRFATPILDMFSQNTVLDYTRNRQYPEMLGDTLFPAVKVPTLEVDILKAGSRVPTIASVSAFDAEAEIGT 80 (349) T ss_pred CCcchhhHHHHHHHHHhhcccCHHHHHHHHHhcCcchhhHhhcCCccccccceeEEEeeccCcceeeeeecCCCCcceec Confidence 66532 1223333444445543 4599999999888776555544443 3345666666677 Q ss_pred ccccccccccCcCccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhccc---------- Q lcl|Aclame:pro 60 RKSKPNEVEFSATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPN---------- 129 (309) Q Consensus 60 ~~~~~~~ve~~~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~---------- 129 (309) |++.....+.++.+.++.+.+++|.......+..+.+...+...+.+..+.+.|..+.|+||+++++++. T Consensus 81 r~~~~~~~~~p~ik~~~~i~e~dl~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~q~l~~Gki~~~~~g~~v 160 (349) T protein:vir:10 81 REASKMTAELAYVKRKMQITEEMLIKLQSPRNTAEENYLKQYVFDDIDAMVQAVKARGEKMTMEMFATGKITDKKNGIAI 160 (349) T ss_pred ccceeEEeeccccccccccCHHHHHHHhhccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeeEEcCCcEEE Confidence 7777777788888889999998876543322222222222334555677888999999999999998864 Q ss_pred --ccCcccceecccccccCCCCCChHHHHHHHHHHhCCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHH Q lcl|Aclame:pro 130 --SYAAGNKTTLSGADQWSDPTSNPLPVITDALDSVILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQE 207 (309) Q Consensus 130 --~y~~~~~~~lsgt~~Wsd~~sdPi~di~~~~~~~g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~ 207 (309) .++.+|+++|||+++||+++|||++||++|++++|.+|++++||+++|++|++|++|++++++++. ...++.+++.. T Consensus 161 D~g~~~~~~~~lt~~~~Ws~~~adpi~Di~~~~~~~g~~p~~~vm~~~~~~~l~~~~~i~~~~~~~~~-~~~~~~~~~~~ 239 (349) T protein:vir:10 161 DYGVPKKHQETLSGTKTWDKSDASIIDNLQDWSDSLDVTPTRALTSKKVLRILMRSTEIKEAIFGKDT-GRVVGQADLDQ 239 (349) T ss_pred ecccCccceeEecCcccCCCCCCCHHHHHHHHHHHhCCCccEEEeCHHHHHHHhcCHHHHHHhccccc-ccccCHHHHHH Confidence 467899999999999999999999999999999999999999999999999999999999998654 34788888888 Q ss_pred Hh---CCCeEEeecceeeccccCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCcc---------ccc Q lcl|Aclame:pro 208 LL---ELDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIA---------DPN 275 (309) Q Consensus 208 l~---gl~~I~v~~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~---------d~~ 275 (309) +| ++++|++|+++|......+..+.+++||++.+++..+ +.++...||.|++..+...+... ..+ T Consensus 240 ~l~~~~~~~i~~yd~~y~d~~~~~~~t~~~~~p~~~v~l~~~---~~~G~~~yG~~~e~~~~~~g~~~~~~~~~~~~~~~ 316 (349) T protein:vir:10 240 WMTAQGLPIIRAYDGKYRDEDSRGNLTTNSYFPEDRIVLFND---EVPGQKIYGPTPEENRLISSNAQVSNVGNIMAKIY 316 (349) T ss_pred HHHhcCCceEEEEeeEEEeecCCCceeecccccCCeEEEecC---CCceeEEeeccchhhhhcccccceeeccceEEEee Confidence 87 6678999999998654444556788999887654322 23445678888876554433221 111 Q ss_pred cc--cCCceEEEeecccceeeecchhhhhhhccc Q lcl|Aclame:pro 276 IG--LRGGQRVRVGESVKELVTAPDLGFFFENAV 307 (309) Q Consensus 276 ~g--~~g~~~v~v~~~~~~~v~~~~~G~l~~~~v 307 (309) +. .+.++.+.+....-|++.-+++=|. -.+| T Consensus 317 ~~~~dP~~~~~~~~s~~lPv~~~~~~~~~-a~Vl 349 (349) T protein:vir:10 317 ETSEDPIGTWILASATMLPSFASADDVFQ-AKVL 349 (349) T ss_pred eecCCCceEEEEEeeeeeeeecCCCcEEE-EEeC Confidence 11 1123334444444455544443332 2344 No 5 >protein:vir:98480 Length: 348 # NCBI annotation: ORFp38 # Family: family:all:1083 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958280;genbank:gi:41057254;uniprot:Q38595;genbank:GeneID:2732864 Probab=100.00 E-value=6.4e-36 Score=213.53 Aligned_cols=298 Identities=14% Similarity=0.113 Sum_probs=200.9 Q ss_pred CCCCC---CCcchhhHHHHHhhcc----hhhhhhhhCCccccccccceeEEe-------chhHhhhchhHhhccc-cccc Q lcl|Aclame:pro 1 MSNAP---FPIDPELTAIAIAYRN----GRMISDEVLPRVPVGKQEFKFWKY-------DLAQGFTVPETLVGRK-SKPN 65 (309) Q Consensus 1 m~~~~---f~~dp~LT~~a~~y~n----~~~ig~~lfP~v~v~~~~~k~~~~-------~~~~~f~~~~t~~~~~-~~~~ 65 (309) |++.. |..-+.|+.+...+-+ +.|+.+.+||.+++....+++.+. .+-.+|..+...+.|. .... T Consensus 1 M~~~~~~d~~~~~~l~~~i~~~~~~~~~~~~l~~~~fp~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~r~g~~~~ 80 (348) T protein:vir:98 1 MSWTLDTEFIEPTQLTGLIREALRDLQVNRFRLARWLPNVDVDDITFEFLRGGGGLAETASYRSWDTESKIGRREGLAKV 80 (348) T ss_pred CcchhhhhccCHHHHHHHHHHHhhccCcchhhHHhcCCCccccceEEEEEeccCCceeeeeeecCCCccceeecccceee Confidence 99753 5455669999988732 249999999988877666655443 3334455555555554 4667 Q ss_pred ccccCcCccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhccc----------ccCccc Q lcl|Aclame:pro 66 EVEFSATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPN----------SYAAGN 135 (309) Q Consensus 66 ~ve~~~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~----------~y~~~~ 135 (309) ..++.+.+.++.+.++++....--.+..... ...+.++.+.+.|..+.|+||+++++++. +|+..+ T Consensus 81 ~~~~~~i~~~~~i~~~d~~~~~~~~~~~~~~----~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~g~~~~vDyg~~~ 156 (348) T protein:vir:98 81 MGELPPISEKIPLNEYDRLRLRKLSRDEALP----FIARDAQRLARNIGARFEVARGSALVNATVPVTELQQTVDFGRIG 156 (348) T ss_pred eeeccccccccccCHHHHHHhcCChHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecCceEEccccCc Confidence 7888889999999888875422111111111 12333566788999999999999999854 244444 Q ss_pred ceecccccccCC-CCCChHHHHHHHHHHh----CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCc--ccccCHHHHHHH Q lcl|Aclame:pro 136 KTTLSGADQWSD-PTSNPLPVITDALDSV----ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGD--EGMVPMAFLQEL 208 (309) Q Consensus 136 ~~~lsgt~~Wsd-~~sdPi~di~~~~~~~----g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~--~~~vt~~~l~~l 208 (309) ..+++++++||+ ++|||++||++|++++ |.+|++|+||+++|++|++|++|++++++.+.. .++++++++.++ T Consensus 157 ~~~~t~~~~Ws~~~~adp~~di~~~~~~~~~~~G~~p~~~vm~~~~~~~l~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 236 (348) T protein:vir:98 157 SHSVVAAVLWSVHATATPISDLESWVATYEDTNGQSPGVILMPKAAVSHMRQCEEVIRQVFPLAPSGTAPMVSVEQLNTV 236 (348) T ss_pred ccccccccccCCCCCCCHHHHHHHHHHHHHHccCCcceEEEeCHHHHHHHhcCHHHHHHHhccCccccccccCHHHHHHH Confidence 446778899975 7899999999998764 999999999999999999999999999986543 468999998877 Q ss_pred h---CCCeEEeecceeeccccCCCcccceecCCcEEEEecCCC------CCCcCcceecccccccccccCCcc------- Q lcl|Aclame:pro 209 L---ELDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRL------ADTRNGTTFGLTAQWGDRVSGSIA------- 272 (309) Q Consensus 209 ~---gl~~I~v~~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~------~~~~~~~t~G~T~~~~~~~~~~~~------- 272 (309) + |++.|++++++|... ++.+++||++.+++..... +..++.+.||.|++......+... T Consensus 237 ~~~~g~~~i~~~d~~~~~~-----g~~~~~~p~~~i~l~p~~~~~~~~~~~~~G~t~~G~~~e~~~~~~~~~~~~~~~i~ 311 (348) T protein:vir:98 237 LSSMGLPPIEVYDAKVAVD-----GVSTRITPANAIALLPEPGATDAAQPTELGATLLGTTAESLEDDYALAPGEQPGIV 311 (348) T ss_pred HHhhCCeEEEEeeeEEEcC-----CceeceecCCeEEEEecCCcccccccccccceecccchhhhccccccceeccCcee Confidence 5 899999999988642 3457889988776644322 123445567777765443222111 Q ss_pred cccc-c-cCCceEEEeecccceeeecchhhhhhhcccc Q lcl|Aclame:pro 273 DPNI-G-LRGGQRVRVGESVKELVTAPDLGFFFENAVA 308 (309) Q Consensus 273 d~~~-g-~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va 308 (309) ...+ . .+.++++.+...--|++..+++ +++-.|+| T Consensus 312 ~~~~~~~dP~~~~~~~~s~~lPv~~~~~~-~~~a~Vl~ 348 (348) T protein:vir:98 312 AATWKTKDPVRLWTHAAAVGIPVLREPNL-TFKAQVLA 348 (348) T ss_pred eeeeeecCCcEEEEEEeeeeeccccCCCc-EEEEEEeC Confidence 0000 1 1224445555555566655554 34446777 No 6 >protein:vir:2736 Length: 348 # NCBI annotation: putative structural protein # Family: family:all:1083 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695109;genbank:gi:23455878;genbank:GeneID:955608 Probab=100.00 E-value=6.7e-36 Score=213.43 Aligned_cols=299 Identities=14% Similarity=0.146 Sum_probs=202.9 Q ss_pred CCCC-CCCcchhhHHHHHhhcc--hhhhhhhhCCccccccccceeEEec-------hhHhhhchhHhhcccc-ccccccc Q lcl|Aclame:pro 1 MSNA-PFPIDPELTAIAIAYRN--GRMISDEVLPRVPVGKQEFKFWKYD-------LAQGFTVPETLVGRKS-KPNEVEF 69 (309) Q Consensus 1 m~~~-~f~~dp~LT~~a~~y~n--~~~ig~~lfP~v~v~~~~~k~~~~~-------~~~~f~~~~t~~~~~~-~~~~ve~ 69 (309) |++- -+.....|+.+...-.+ ..|+.+.+||..++....+++.+.. +-.+|..+...+.|.+ +....+. T Consensus 1 M~~i~d~f~~~~l~~~v~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~~~~~~~~ 80 (348) T protein:vir:27 1 MGLIYDKVTASNIAGYFNALQENVSSTLGESIFPARKQLGTKLSYIKGASGQSVALKAAAFDTNVTIRDRVSAEMHDEQM 80 (348) T ss_pred CcchhhhcCHHHHHHHHHhccchhhhhhHhhcCCCccccceeEEEEeeccCceeEeeeecCCCCcceecccceeeeeeec Confidence 9973 34444557776544322 3499999999887766656554433 3344555555554444 5566788 Q ss_pred CcCccceeeeccchhhcCCHH---HHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhccc--------------ccC Q lcl|Aclame:pro 70 SATDETGSTEDHGLDAPVPQA---DIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPN--------------SYA 132 (309) Q Consensus 70 ~~~~~~~~~~e~~L~~~v~~~---~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~--------------~y~ 132 (309) .+.+.++.+.++++.....-. .....+...+...+.++.+.+.|..+.|+||+++++++. .++ T Consensus 81 p~i~~~~~i~~~d~~~~~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~~al~~Gki~i~~~~~~~~vdfg~~ 160 (348) T protein:vir:27 81 PFFKEAMLVKENDRQQLNLVKDSGNAVLVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDGVNKDIDYGVK 160 (348) T ss_pred CccccccccCHHHHHHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEEecCCeeEEEeecCC Confidence 888888888887753311110 000111112223344566788999999999999988643 236 Q ss_pred cccceecccccccCCCCCChHHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHh Q lcl|Aclame:pro 133 AGNKTTLSGADQWSDPTSNPLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELL 209 (309) Q Consensus 133 ~~~~~~lsgt~~Wsd~~sdPi~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~ 209 (309) .+|+.++++ .||+++|||++||++|++.+ |.+|++++||+++|++|++|++|++++++.+...+.++++++.++| T Consensus 161 ~~~~~t~~~--~W~~~~adp~~di~~~~~~~~~~G~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~i~~~~~~~~~ 238 (348) T protein:vir:27 161 PDHKKQVSK--SWAEPGATPLADLEDAIETARELGLNPERAVMNAKTFGLIRKAASTVKVIKPLAGDGSAVTKAELENYI 238 (348) T ss_pred cccceeeee--ccCCCCCCHHHHHHHHHHHHHhcCCcccEEEECHHHHHHHhcCHHHHHHhcccCccccccCHHHHHHHH Confidence 789999876 59999999999999998755 9999999999999999999999999999888878899999999986 Q ss_pred ---CCCeEEeecceeeccccCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccc--------- Q lcl|Aclame:pro 210 ---ELDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIG--------- 277 (309) Q Consensus 210 ---gl~~I~v~~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g--------- 277 (309) +...|++|+++|.. ++++.+++||++.+++..+ +.++...||.|++..+...+......+. T Consensus 239 ~~~~g~~i~~yd~~y~d----~~G~~~~~~p~~~vvl~~~---~~~G~~~yG~~~e~~~~~~~~~~~~~~~~~~~~~~~~ 311 (348) T protein:vir:27 239 ADNFGVSIVLENGTYRN----DKGEVSKFYPDGHLTLIPN---GPLGNTVFGTTPEESDLFADNTVNAEVEIVDNGIAVT 311 (348) T ss_pred HhhcCceEEEEeeEEEc----CCCcCcccccCCeEEEEcC---CcceeEEeccCcchhhhhhccccccceeeeCCeeEEE Confidence 45689999999853 4567789999886655442 3345678999987655544332211111 Q ss_pred -----cCCceEEEeecccceeeecchhhhhhhccccC Q lcl|Aclame:pro 278 -----LRGGQRVRVGESVKELVTAPDLGFFFENAVAA 309 (309) Q Consensus 278 -----~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va~ 309 (309) ...++.+.+....-|++.-+++ +++-.|++| T Consensus 312 ~~~~~dP~~~~~~~~s~~lPv~~~~~~-~~~a~Vl~~ 347 (348) T protein:vir:27 312 TTKTTDPVNVQTKVSMVALPSFERLDD-VYMLTVIPA 347 (348) T ss_pred eeecCCCceEEEEEeeeeeccccCCCc-EEEEEEecC Confidence 1123344444444555555554 444477778 No 7 >protein:vir:96490 Length: 348 # NCBI annotation: head protein # Family: family:all:1083 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238492;genbank:gi:66391768;genbank:GeneID:5176912 Probab=100.00 E-value=6.5e-36 Score=213.50 Aligned_cols=299 Identities=14% Similarity=0.132 Sum_probs=203.6 Q ss_pred CCCC-CCCcchhhHHHHHhhcch--hhhhhhhCCccccccccceeEEechh-------HhhhchhHhhccc-cccccccc Q lcl|Aclame:pro 1 MSNA-PFPIDPELTAIAIAYRNG--RMISDEVLPRVPVGKQEFKFWKYDLA-------QGFTVPETLVGRK-SKPNEVEF 69 (309) Q Consensus 1 m~~~-~f~~dp~LT~~a~~y~n~--~~ig~~lfP~v~v~~~~~k~~~~~~~-------~~f~~~~t~~~~~-~~~~~ve~ 69 (309) |++- -+.....|+.+.....++ .|+.+.+||..++....+++.+..+. .++..+...+.+. .+....+. T Consensus 1 M~~i~d~f~~~~l~~~i~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~~~~~~~~ 80 (348) T protein:vir:96 1 MGLIYDKVTASNIAGYFNTLQENVDSTLGESIFPARKQLGTKLSYIKGASGQSVALKAAAFDTNVTIRDRVSAEIHDEQM 80 (348) T ss_pred CcchhhccCHHHHHHHHHhcccchhhhhhhhcCCCccccceeEEEEeecCCceeEeeeecCCCCcceecccceeeeeeec Confidence 9963 244445677776655443 49999999988877666665554332 2233344444443 46667788 Q ss_pred CcCccceeeeccchhhcCCHHHH---HHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhccc--------------ccC Q lcl|Aclame:pro 70 SATDETGSTEDHGLDAPVPQADI---DNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPN--------------SYA 132 (309) Q Consensus 70 ~~~~~~~~~~e~~L~~~v~~~~~---~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~--------------~y~ 132 (309) .+.+.+..+.++++......... ...+...+...+.+..+.+.|..+.|+||+++++++. .++ T Consensus 81 p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~~~~~~~~vdfg~~ 160 (348) T protein:vir:96 81 PFFKEALLVKENDRQQLNLVKDTGNEALINTIVAGIFNDDVTLINGARARLEAMRMQVLATGKIAFTSDGVNKDIDYGVK 160 (348) T ss_pred CccccccccCHHHHHHHHhhhccCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEeecCCeeEEEeccCC Confidence 88888888887775432111110 0011112222334556778999999999999998743 346 Q ss_pred cccceecccccccCCCCCChHHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHh Q lcl|Aclame:pro 133 AGNKTTLSGADQWSDPTSNPLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELL 209 (309) Q Consensus 133 ~~~~~~lsgt~~Wsd~~sdPi~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~ 209 (309) ++|+.++++ +||+++|||++||++|++.+ |++|++++||+++|++|++|++|++++++.+.+.+.++++++.++| T Consensus 161 ~~~~~t~~~--~W~~~~adp~~di~~~~~~~~~~G~~~~~~i~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~~~~~~~ 238 (348) T protein:vir:96 161 ADHKKQVSK--SWAEPGATPLADLEDAIETARELGLNPERAIMNAKTFGLIRKAASTVKAIKPLAGDGSSVTKAELQNYV 238 (348) T ss_pred cccceeecc--ccCCCCCCHHHHHHHHHHHHHhcCCcccEEEeCHHHHHHHhcCHHHHHHHhccCCccccccHHHHHHHH Confidence 789998875 69999999999999998765 9999999999999999999999999999988888899999999887 Q ss_pred ---CCCeEEeecceeeccccCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCcc------------cc Q lcl|Aclame:pro 210 ---ELDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIA------------DP 274 (309) Q Consensus 210 ---gl~~I~v~~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~------------d~ 274 (309) +..+|++|+++|.. ++++.+++|+++.+++..+ +.++...||.|++......+.-. .. T Consensus 239 ~~~~g~~i~~y~~~y~d----~~G~~~~~~p~~~v~l~~~---~~~G~~~yg~~~e~~~~~~~~~~~~~~~~~~~~~~~~ 311 (348) T protein:vir:96 239 ADNYGVEIVLENGTYRN----EKGEVSKFFPDGHLTLIPN---GPLGNTVFGTTPEESDLFADNTVNADVEIVDSGIAVT 311 (348) T ss_pred hhhcCceEEEEccEEEe----cCCcEeccccCCeEEEEcC---CCceeEEeccChhhhhhhhcccccccceecCCeeEEE Confidence 33479999999854 3566788999887666432 33456788988865443332111 11 Q ss_pred cc--ccCCceEEEeecccceeeecchhhhhhhccccC Q lcl|Aclame:pro 275 NI--GLRGGQRVRVGESVKELVTAPDLGFFFENAVAA 309 (309) Q Consensus 275 ~~--g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va~ 309 (309) .. ....++.+.+...--|++..+++-|.. .++|| T Consensus 312 ~~~~~dP~~~~~~~~s~plPv~~~~~~~~~a-~Vl~~ 347 (348) T protein:vir:96 312 TTKTTDPVNVQTKVSMVALPSFERLGDVYML-TVIPG 347 (348) T ss_pred eeecCCCceEEEEEeeeeeccccCCCcEEEE-EEecC Confidence 11 112244555555556777777665444 78888 No 8 >protein:vir:4902 Length: 348 # NCBI annotation: gp348 # Family: family:all:1083 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056680;genbank:gi:9635015;genbank:GeneID:1262657 Probab=100.00 E-value=2.9e-35 Score=209.95 Aligned_cols=298 Identities=14% Similarity=0.148 Sum_probs=200.1 Q ss_pred CCCC--CCCcchhhHHHHHhhc--chhhhhhhhCCccccccccceeEEech-------hHhhhchhHhhcccc-cccccc Q lcl|Aclame:pro 1 MSNA--PFPIDPELTAIAIAYR--NGRMISDEVLPRVPVGKQEFKFWKYDL-------AQGFTVPETLVGRKS-KPNEVE 68 (309) Q Consensus 1 m~~~--~f~~dp~LT~~a~~y~--n~~~ig~~lfP~v~v~~~~~k~~~~~~-------~~~f~~~~t~~~~~~-~~~~ve 68 (309) |++- .|. -..|+.+..... +..|+.+.+||..++....+.+.+..+ -.+|..+...+.+.+ +....+ T Consensus 1 M~~l~d~f~-~~~l~~~v~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~~~~~~~ 79 (348) T protein:vir:49 1 MGLIYDKVT-ASNIAGYFNALQENVDSTLGESIFPARKQLGTKLSYITGASGQSVALKAAAFDTNVTVRDRVSAEMHDEQ 79 (348) T ss_pred CcchhhhcC-HHHHHHHHHhccccchhhhHhhcCCCccccCceeEEEEeecCceeeeeeecCCCCcceecccceeeeeee Confidence 9963 443 345666655433 345999999998777666555544433 234555545444444 666788 Q ss_pred cCcCccceeeeccchhhcCCHHHH---HHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhccc--------------cc Q lcl|Aclame:pro 69 FSATDETGSTEDHGLDAPVPQADI---DNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPN--------------SY 131 (309) Q Consensus 69 ~~~~~~~~~~~e~~L~~~v~~~~~---~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~--------------~y 131 (309) ..+.+.++.+.+.++.....-... .+.+...+...+.++.+.+.|..+.|+||+++++++. .+ T Consensus 80 ~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~i~~~g~~~~vdyg~ 159 (348) T protein:vir:49 80 MPFFKEAMLVKENDRQQLNLVKDSGNAALVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDGVNKDIDYGV 159 (348) T ss_pred cCccccccccCHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecCCceEEEeecC Confidence 889998888888774321110000 0001111222333556778899999999999998643 23 Q ss_pred CcccceecccccccCCCCCChHHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHH Q lcl|Aclame:pro 132 AAGNKTTLSGADQWSDPTSNPLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQEL 208 (309) Q Consensus 132 ~~~~~~~lsgt~~Wsd~~sdPi~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l 208 (309) +.+|+.++++ .||+++|||++||++|++.+ |.+|++++||+++|++|++|++|++++++.+.+.+.++++++.++ T Consensus 160 ~~~~~~t~~~--~W~~~~adp~~di~~~~~~~~~~G~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~i~~~~~~~~ 237 (348) T protein:vir:49 160 KPDHKKQVSK--SWAEPGATPLADLEDAIETARELGLNPERAVMNAKTFGLIRKAASTVKVIKPLAGDGSSVTKAELDNY 237 (348) T ss_pred Ccccceeeee--ccCCCCCCHHHHHHHHHHHHHhcCCcccEEEeCHHHHHHHhcCHHHHHHhhccCcccccccHHHHHHH Confidence 6789999876 59999999999999998755 999999999999999999999999999998888889999999888 Q ss_pred h---CCCeEEeecceeeccccCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccc-------- Q lcl|Aclame:pro 209 L---ELDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIG-------- 277 (309) Q Consensus 209 ~---gl~~I~v~~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g-------- 277 (309) + +..+|++|+++|.. ++++.+++||++.+++..+ +.++...||.|++......+.-....+. T Consensus 238 ~~~~~g~~i~~y~~~y~d----~dG~~~~~~p~~~v~l~~~---~~~G~~~yg~~~e~~~~~~~~~~~~~~~~~~~~~~~ 310 (348) T protein:vir:49 238 IADNFGVTVVLENGTYRN----EKGEVSKFFPDGHLTLIPN---GPLGNTVFGTTPEESDLFADNTVNADVEIVDNGIAV 310 (348) T ss_pred HHhhcCceEEEEeeEEEe----cCCcEeeeecCCeEEEecC---CCcceeEEecChhhhhhccccccccceeecCCeEEE Confidence 6 55689999999854 3567789999887766443 3345678999887544433221111111 Q ss_pred ------cCCceEEEeecccceeeecchhhhhhhccccC Q lcl|Aclame:pro 278 ------LRGGQRVRVGESVKELVTAPDLGFFFENAVAA 309 (309) Q Consensus 278 ------~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va~ 309 (309) ...++.+.+....-|++.-+++ +++-.++|| T Consensus 311 ~~~~~~dP~~~~~~~~s~~lPv~~~~~~-~~~a~Vl~~ 347 (348) T protein:vir:49 311 TTTKTTDPVNVQTKVSMVALPSFERLDD-VYMLTVIPA 347 (348) T ss_pred eeeecCCCceEEEEEeeeccccccCCCc-EEEEEEecC Confidence 1123344444444555555554 444578888 No 9 >protein:vir:79503 Length: 409 # NCBI annotation: major head protein # Family: family:all:11999 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468058;genbank:gi:157265500;genbank:GeneID:5600620 Probab=99.96 E-value=1.1e-31 Score=190.25 Aligned_cols=303 Identities=11% Similarity=-0.024 Sum_probs=184.9 Q ss_pred CCCC-CCCcchh----------------hHHHHHhhcchhhhhhhhCCccccccc-----------cceeEEechhHhhh Q lcl|Aclame:pro 1 MSNA-PFPIDPE----------------LTAIAIAYRNGRMISDEVLPRVPVGKQ-----------EFKFWKYDLAQGFT 52 (309) Q Consensus 1 m~~~-~f~~dp~----------------LT~~a~~y~n~~~ig~~lfP~v~v~~~-----------~~k~~~~~~~~~f~ 52 (309) +.|+ .|+-||- ++.++..+....+|.+.+||..++-.. +-+.+.+.+...|+ T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ia~~~~~~p~~~~L~d~~FP~~~~f~t~l~~~~~~~kg~kk~~~~~~~~~~d 86 (409) T protein:vir:79 7 INNALARVRDPLSIGGLKFPTTKEIQEAVAAIADKFNQENDLVDRFFPEDSTFASELELYLLRTQDAEQTGMTFVHQVGS 86 (409) T ss_pred cchhhhhhcCcchhcceecCchHHHHHHHHHHHHhcCCccchhhccCCCCccccceEEEEeeeccCcccccceEeeecCC Confidence 3333 3444442 334444444456899999996433211 11344444446677 Q ss_pred chhHhhccccc----ccccccCcCccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 53 VPETLVGRKSK----PNEVEFSATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSP 128 (309) Q Consensus 53 ~~~t~~~~~~~----~~~ve~~~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~ 128 (309) .....+++.++ ....++.+.+.+..+++++|......+...+.....+...+-+..+.++|..+.||||+++++++ T Consensus 87 ~~~pv~~r~~~~~~~~~t~epp~iK~k~~i~e~dl~~~~~~~n~~~~~~i~~~i~~D~~~L~~~I~~R~E~Ma~q~L~tG 166 (409) T protein:vir:79 87 TSLPVEARVAKVDLAKATWSPLAFKESRVWDEKEILYLGRLADEVQAGVINEQIAESLTWLMARMRNRRRWLTWQVMRTG 166 (409) T ss_pred ccccccccceeeeeeeecccccccccccccCHHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC Confidence 66666666553 34556778899999999888665554433332222222334456688899999999999987654 Q ss_pred c------------------cc--CcccceecccccccCCCCCChHHHHHHHHHHh----CC--CCcEEEeCHHHHHHHh- Q lcl|Aclame:pro 129 N------------------SY--AAGNKTTLSGADQWSDPTSNPLPVITDALDSV----IL--RPNIGVLGRRTATILR- 181 (309) Q Consensus 129 ~------------------~y--~~~~~~~lsgt~~Wsd~~sdPi~di~~~~~~~----g~--~Pn~~v~~~~~~~~l~- 181 (309) . +| +++|+++|||+++|++++|||++||++|++.+ |. +|+.++|+.++|++|+ T Consensus 167 ki~i~g~~~~~~~g~~~~vDyg~pa~hkvtlTgt~~W~~~~AdPi~DIe~w~~~i~~~~g~~~t~~~~imt~~~~~~l~~ 246 (409) T protein:vir:79 167 RITIQPNDPYNPNGLKYVIDYGVTDIELPLPQKFDAKDGNGNSAVDPIQYFRDLIKAATYFPDRRPVAIIVGPGFDEVLA 246 (409) T ss_pred eEEEEecCCCccccceEEEecCCCcccceeecccccCCCCCCChHHHHHHHHHHHHHhcCCCCCccEEEEcHHHHHHHHh Confidence 2 23 67899999999999999999999999998765 33 5678999999998865 Q ss_pred cCHHHHHHhccCCCccc----ccCHH---------HHHHHhCCCeEEeecceeeccccCCCcccceecCCcEEEEecCCC Q lcl|Aclame:pro 182 RHPKIVKAYNGSLGDEG----MVPMA---------FLQELLELDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRL 248 (309) Q Consensus 182 ~~~~i~~~~~~~~~~~~----~vt~~---------~l~~l~gl~~I~v~~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~ 248 (309) +|+.|+++++..++..+ .+++. .+...+|| +|++|+.+|.. ++++.++++|++.+++...+. T Consensus 247 ~n~~ik~~l~~~~~~~~~~~~~~~~~~l~~~~~ln~~~~~~GL-~I~vYd~~Y~d----edGt~k~~~Pd~~vvLl~ap~ 321 (409) T protein:vir:79 247 DNTFVQKYVEYEKGWVVGQNTVQPPREVYRQAALDIFKRYTGL-EVMVYDKTYRD----QDGSVKYWIPVGELIVLNQST 321 (409) T ss_pred CcHHHHHhhhcccccccccccccchhhhcchhHhHhhhhhcCc-eEEEEeeEEEe----cCCcccceecCCeEEEEcCCc Confidence 66778888876544322 22332 23345588 59999999854 356778899988654333222 Q ss_pred CCCcCcceecccccccccccC------Ccc-cccccc-CCceEEEeecccceeeecchh-hhhhhccc---cC Q lcl|Aclame:pro 249 ADTRNGTTFGLTAQWGDRVSG------SIA-DPNIGL-RGGQRVRVGESVKELVTAPDL-GFFFENAV---AA 309 (309) Q Consensus 249 ~~~~~~~t~G~T~~~~~~~~~------~~~-d~~~g~-~g~~~v~v~~~~~~~v~~~~~-G~l~~~~v---a~ 309 (309) +.++.+.||.|.+-...... .+. .....+ .+..-++.....-|+++++.. -|+|.++= ++ T Consensus 322 -g~LG~T~yGa~~~~~~~~~~v~~~g~~i~~~~~~~~dP~~~~~~~~~~~~p~l~~~~~~~~~~~~~~~~~~~ 393 (409) T protein:vir:79 322 -GPVGRFVYTAHVAGQRNGKVVYATGPYLTVKDHLQDDPPYYAIIAGFHGLPQLSGYNTEDFSFHRFKWLKYA 393 (409) T ss_pred -ccccceecccccccccchhhhccccceeEecccccCCcceeeeecceEEeeeeecCCccceeehhhhhhhhh Confidence 34566778876532111100 000 011111 122334555677788886543 34444432 22 No 10 >protein:vir:78006 Length: 409 # NCBI annotation: major head protein # Family: family:all:11999 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467942;genbank:gi:157265383;genbank:GeneID:5600496 Probab=99.96 E-value=1.1e-31 Score=190.25 Aligned_cols=303 Identities=11% Similarity=-0.024 Sum_probs=184.9 Q ss_pred CCCC-CCCcchh----------------hHHHHHhhcchhhhhhhhCCccccccc-----------cceeEEechhHhhh Q lcl|Aclame:pro 1 MSNA-PFPIDPE----------------LTAIAIAYRNGRMISDEVLPRVPVGKQ-----------EFKFWKYDLAQGFT 52 (309) Q Consensus 1 m~~~-~f~~dp~----------------LT~~a~~y~n~~~ig~~lfP~v~v~~~-----------~~k~~~~~~~~~f~ 52 (309) +.|+ .|+-||- ++.++..+....+|.+.+||..++-.. +-+.+.+.+...|+ T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ia~~~~~~p~~~~L~d~~FP~~~~f~t~l~~~~~~~kg~kk~~~~~~~~~~d 86 (409) T protein:vir:78 7 INNALARVRDPLSIGGLKFPTTKEIQEAVAAIADKFNQENDLVDRFFPEDSTFASELELYLLRTQDAEQTGMTFVHQVGS 86 (409) T ss_pred cchhhhhhcCcchhcceecCchHHHHHHHHHHHHhcCCccchhhccCCCCccccceEEEEeeeccCcccccceEeeecCC Confidence 3333 3444442 334444444456899999996433211 11344444446677 Q ss_pred chhHhhccccc----ccccccCcCccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 53 VPETLVGRKSK----PNEVEFSATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSP 128 (309) Q Consensus 53 ~~~t~~~~~~~----~~~ve~~~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~ 128 (309) .....+++.++ ....++.+.+.+..+++++|......+...+.....+...+-+..+.++|..+.||||+++++++ T Consensus 87 ~~~pv~~r~~~~~~~~~t~epp~iK~k~~i~e~dl~~~~~~~n~~~~~~i~~~i~~D~~~L~~~I~~R~E~Ma~q~L~tG 166 (409) T protein:vir:78 87 TSLPVEARVAKVDLAKATWSPLAFKESRVWDEKEILYLGRLADEVQAGVINEQIAESLTWLMARMRNRRRWLTWQVMRTG 166 (409) T ss_pred ccccccccceeeeeeeecccccccccccccCHHHHHHHhCCCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC Confidence 66666666553 34556778899999999888665554433332222222334456688899999999999987654 Q ss_pred c------------------cc--CcccceecccccccCCCCCChHHHHHHHHHHh----CC--CCcEEEeCHHHHHHHh- Q lcl|Aclame:pro 129 N------------------SY--AAGNKTTLSGADQWSDPTSNPLPVITDALDSV----IL--RPNIGVLGRRTATILR- 181 (309) Q Consensus 129 ~------------------~y--~~~~~~~lsgt~~Wsd~~sdPi~di~~~~~~~----g~--~Pn~~v~~~~~~~~l~- 181 (309) . +| +++|+++|||+++|++++|||++||++|++.+ |. +|+.++|+.++|++|+ T Consensus 167 ki~i~g~~~~~~~g~~~~vDyg~pa~hkvtlTgt~~W~~~~AdPi~DIe~w~~~i~~~~g~~~t~~~~imt~~~~~~l~~ 246 (409) T protein:vir:78 167 RITIQPNDPYNPNGLKYVIDYGVTDIELPLPQKFDAKDGNGNSAVDPIQYFRDLIKAATYFPDRRPVAIIVGPGFDEVLA 246 (409) T ss_pred eEEEEecCCCccccceEEEecCCCcccceeecccccCCCCCCChHHHHHHHHHHHHHhcCCCCCccEEEEcHHHHHHHHh Confidence 2 23 67899999999999999999999999998765 33 5678999999998865 Q ss_pred cCHHHHHHhccCCCccc----ccCHH---------HHHHHhCCCeEEeecceeeccccCCCcccceecCCcEEEEecCCC Q lcl|Aclame:pro 182 RHPKIVKAYNGSLGDEG----MVPMA---------FLQELLELDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRL 248 (309) Q Consensus 182 ~~~~i~~~~~~~~~~~~----~vt~~---------~l~~l~gl~~I~v~~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~ 248 (309) +|+.|+++++..++..+ .+++. .+...+|| +|++|+.+|.. ++++.++++|++.+++...+. T Consensus 247 ~n~~ik~~l~~~~~~~~~~~~~~~~~~l~~~~~ln~~~~~~GL-~I~vYd~~Y~d----edGt~k~~~Pd~~vvLl~ap~ 321 (409) T protein:vir:78 247 DNTFVQKYVEYEKGWVVGQNTVQPPREVYRQAALDIFKRYTGL-EVMVYDKTYRD----QDGSVKYWIPVGELIVLNQST 321 (409) T ss_pred CcHHHHHhhhcccccccccccccchhhhcchhHhHhhhhhcCc-eEEEEeeEEEe----cCCcccceecCCeEEEEcCCc Confidence 66778888876544322 22332 23345588 59999999854 356778899988654333222 Q ss_pred CCCcCcceecccccccccccC------Ccc-cccccc-CCceEEEeecccceeeecchh-hhhhhccc---cC Q lcl|Aclame:pro 249 ADTRNGTTFGLTAQWGDRVSG------SIA-DPNIGL-RGGQRVRVGESVKELVTAPDL-GFFFENAV---AA 309 (309) Q Consensus 249 ~~~~~~~t~G~T~~~~~~~~~------~~~-d~~~g~-~g~~~v~v~~~~~~~v~~~~~-G~l~~~~v---a~ 309 (309) +.++.+.||.|.+-...... .+. .....+ .+..-++.....-|+++++.. -|+|.++= ++ T Consensus 322 -g~LG~T~yGa~~~~~~~~~~v~~~g~~i~~~~~~~~dP~~~~~~~~~~~~p~l~~~~~~~~~~~~~~~~~~~ 393 (409) T protein:vir:78 322 -GPVGRFVYTAHVAGQRNGKVVYATGPYLTVKDHLQDDPPYYAIIAGFHGLPQLSGYNTEDFSFHRFKWLKYA 393 (409) T ss_pred -ccccceecccccccccchhhhccccceeEecccccCCcceeeeecceEEeeeeecCCccceeehhhhhhhhh Confidence 34566778876532111100 000 011111 122334555677788886543 34444432 22 No 11 >protein:vir:3424 Length: 341 # NCBI annotation: capsid component # Family: family:all:1021 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040587;genbank:gi:9626251;genbank:GeneID:2703482 Probab=99.92 E-value=1.2e-27 Score=168.26 Aligned_cols=292 Identities=13% Similarity=0.051 Sum_probs=178.4 Q ss_pred CCCCCCCcchhhHHHHHhhcch-hhhhhhhCCccc-ccccccee-------EEechhHhhhchhHhhcccccccccccCc Q lcl|Aclame:pro 1 MSNAPFPIDPELTAIAIAYRNG-RMISDEVLPRVP-VGKQEFKF-------WKYDLAQGFTVPETLVGRKSKPNEVEFSA 71 (309) Q Consensus 1 m~~~~f~~dp~LT~~a~~y~n~-~~ig~~lfP~v~-v~~~~~k~-------~~~~~~~~f~~~~t~~~~~~~~~~ve~~~ 71 (309) |- .|... .|+.+-....++ .+|++.+||... +......+ ..-.-.+.+......++++.+....+.++ T Consensus 1 ~d--~f~~~-~L~~~i~~~~~~~~~l~d~~fp~~~~~~t~~v~~~~~~~~~~lap~v~~~~~~~~~~~~~~~~~~~~~p~ 77 (341) T protein:vir:34 1 MS--MYTTA-QLLAANEQKFKFDPLFLRLFFRESYPFTTEKVYLSQIPGLVNMALYVSPIVSGEVIRSRGGSTSEFTPGY 77 (341) T ss_pred CC--CcCHH-HHHHHHHhccCccchhHHhcCCcccccccceEEEEEeeCCeeEEEeecCCCCcceeccCceeeeEEecCc Confidence 22 23332 366666666554 599999999632 22111111 11000112222233445555667778888 Q ss_pred CccceeeeccchhhcCCHHHHHHHhhcCCHHHHH-------HHHHHHHHHHHHHHHHHHHhhccc-------------cc Q lcl|Aclame:pro 72 TDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHA-------TEQTTNLILLDREARTSKLVFSPN-------------SY 131 (309) Q Consensus 72 ~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~a-------v~~l~~~i~~~~E~~~a~~~~~~~-------------~y 131 (309) .+.+..+.++++......+.. ....++.++. +..+.+.|..+.|+||+++++++. +| T Consensus 78 i~~~~~i~~~d~~~r~~g~~~---~~~~~~~~~~~~~i~~~l~~l~~~i~~~~E~m~~qaL~~Gki~~~~~g~~~~~vDf 154 (341) T protein:vir:34 78 VKPKHEVNPQMTLRRLPDEDP---QNLADPAYRRRRIIMQNMRDEELAIAQVEEMQAVSAVLKGKYTMTGEAFDPVEVDM 154 (341) T ss_pred cCccceeCHHHHHHHhhcccc---ccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcEEEecCCccEEEEEe Confidence 888888888776432211111 1122333333 344566899999999999987542 12 Q ss_pred --CcccceecccccccCCCC---CChHHHHHHHHHHhCCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCH--HH Q lcl|Aclame:pro 132 --AAGNKTTLSGADQWSDPT---SNPLPVITDALDSVILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPM--AF 204 (309) Q Consensus 132 --~~~~~~~lsgt~~Wsd~~---sdPi~di~~~~~~~g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~--~~ 204 (309) +.+|+++++|+++|++++ +||++||++|.+..|..|++++||+++|++|++|++|+++++....+.+.+.. .. T Consensus 155 g~~~~~~~~~t~~~~W~~~~~~~~d~l~di~~~~~~~g~~~~~~i~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~~~~ 234 (341) T protein:vir:34 155 GRSEENNITQSGGTEWSKRDKSTYDPTDDIEAYALNASGVVNIIVFDPKGWALFRSFKAVKEKLDTRRGSNSELETAVKD 234 (341) T ss_pred CCCCccceEecCCccCCcCCCchHHHHHHHHHHHHhcCCceEEEEeCHHHHHHHhcCHHHHHHHhhcccccccccccccc Confidence 578999999999999864 69999999999999999999999999999999999999999876655554433 22 Q ss_pred H---HHH---hCCCeEEeecceeeccccCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCc----ccc Q lcl|Aclame:pro 205 L---QEL---LELDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSI----ADP 274 (309) Q Consensus 205 l---~~l---~gl~~I~v~~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~----~d~ 274 (309) + ..+ ++...|.+|+++|.. +++.+++||++.+++..+ +..+.+.||.+.+......+.. +.. T Consensus 235 ~~~~~~~~~~~~g~~i~~y~~~y~d-----dG~~~~~ip~~~v~l~p~---g~~g~~~yg~~~d~~~~~~~~~~~~~~~~ 306 (341) T protein:vir:34 235 LGKAVSYKGMYGDVAIVVYSGQYVE-----NGVKKNFLPDNTMVLGNT---QARGLRTYGCIQDADAQREGINASARYPK 306 (341) T ss_pred cccceeeeeecCCceEEEEcCEEEE-----CCcEEeeecCCeEEEeeC---CCcceEEEeecccccccccceeeeeEeee Confidence 2 222 345569999999953 245678899887766543 2334567888876433222111 111 Q ss_pred c-c-c-cCCceEEEeecccceeeecchhhhhhhcccc Q lcl|Aclame:pro 275 N-I-G-LRGGQRVRVGESVKELVTAPDLGFFFENAVA 308 (309) Q Consensus 275 ~-~-g-~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va 308 (309) . . . ...+..+.+...--|++..+++=+..+ || T Consensus 307 ~~~~~~dp~~~~~~~~s~pLPv~~~pd~~~~a~--V~ 341 (341) T protein:vir:34 307 NWVTTGDPAREFTMIQSAPLMLLADPDEFVSVQ--LA 341 (341) T ss_pred eeeecCCCcEEEEEEcccceeeeeCCCcEEEEE--eC Confidence 0 0 1 123445555555556655555433332 33 No 12 >protein:vir:393 Length: 341 # NCBI annotation: gp8 # Family: family:all:1021 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046903;genbank:gi:9630472;genbank:GeneID:1261647 Probab=99.91 E-value=1.5e-26 Score=162.23 Aligned_cols=292 Identities=14% Similarity=0.079 Sum_probs=173.1 Q ss_pred CCCCCCCcchhhHHHHHhhcch-hhhhhhhCCcccc-ccccceeEEechh---Hhhhch----hHhhcccccccccccCc Q lcl|Aclame:pro 1 MSNAPFPIDPELTAIAIAYRNG-RMISDEVLPRVPV-GKQEFKFWKYDLA---QGFTVP----ETLVGRKSKPNEVEFSA 71 (309) Q Consensus 1 m~~~~f~~dp~LT~~a~~y~n~-~~ig~~lfP~v~v-~~~~~k~~~~~~~---~~f~~~----~t~~~~~~~~~~ve~~~ 71 (309) |- .|-+ +.|+.+-....++ .++.+.+||..+. ......+-..... ..|..+ ...++++.+....+.++ T Consensus 1 ~d--~f~~-~~L~~~i~~~~~~~~~l~~~~Fp~~~~~~t~~v~~~~~~~~~~lap~v~~~~~~~~~~~~~~~~~~~~~p~ 77 (341) T protein:vir:39 1 MS--VYTT-AQLLAVNEKKFKFDPLFLRIFFRETYPFSTEKVYLSQIPGLVNMALYVSPIVSGKVIRSRGGSTSEFTPGY 77 (341) T ss_pred CC--ccCH-HHHHHHHHhhcCccchhHhhcCCcccccCcceEEEEEecCCceeeEEecCCCCcceecccceeeeeEeccc Confidence 21 2332 2355555554443 5999999996432 2221111111110 112222 22334444556667777 Q ss_pred CccceeeeccchhhcCCHHHHHHHhhcCCHH-------HHHHHHHHHHHHHHHHHHHHHHhhccc--------------- Q lcl|Aclame:pro 72 TDETGSTEDHGLDAPVPQADIDNAPTNYNPL-------GHATEQTTNLILLDREARTSKLVFSPN--------------- 129 (309) Q Consensus 72 ~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~-------~~av~~l~~~i~~~~E~~~a~~~~~~~--------------- 129 (309) .+.+..+.+.++......+... ...++. .+.+..+.+.|..+.|+||+++++++. T Consensus 78 i~~~~~i~~~d~~~r~~g~~~~---~~~~~~~~~~~~i~~~~~~l~~~i~~r~E~m~~qaL~~Gki~i~~~g~~~~~vDf 154 (341) T protein:vir:39 78 VKPKHEVNPLMTLRRLPDEDPQ---NLADPVYRRRRIILQNMKDEELAIAQVEEKQAVAAVLSGKYTMTGEAFEPVEVDM 154 (341) T ss_pred cCcccccCHHHHHHHhhccccc---ccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCceEEEcCCCcEEEEec Confidence 7777777666653211111100 112222 233455778899999999999887542 Q ss_pred ccCcccceecccccccCCCC---CChHHHHHHHHHHhCCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCH--HH Q lcl|Aclame:pro 130 SYAAGNKTTLSGADQWSDPT---SNPLPVITDALDSVILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPM--AF 204 (309) Q Consensus 130 ~y~~~~~~~lsgt~~Wsd~~---sdPi~di~~~~~~~g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~--~~ 204 (309) ..+.+|+++|+|+++|++++ +||+.||++|.+..|..|++++||+++|++|++|++|+++++....+.+.+.. .+ T Consensus 155 g~~~~~~~~lt~~~~W~~~~~~~~d~l~di~~~~~~~g~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~~~~ 234 (341) T protein:vir:39 155 GRSAGNNIVQAGAAAWSSRDKETYDPTDDIEAYALNASGVVNIIVFDPKGWALFRSFKAVKEKLDTRRGSNSELETALKD 234 (341) T ss_pred cCCccceeEecCCccCCCCCCchHHHHHHHHHHHHhcCCceEEEEeChHHHHHHhcCHHHHHHHhhcccccccccchhhh Confidence 12568999999999999975 58999999999999999999999999999999999999999876555544432 22 Q ss_pred H---HHH---hCCCeEEeecceeeccccCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccc--- Q lcl|Aclame:pro 205 L---QEL---LELDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPN--- 275 (309) Q Consensus 205 l---~~l---~gl~~I~v~~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~--- 275 (309) + ..+ ++...|++|+++|.. +++.+++|+++.+++... +.++.+.||.|.+......+....+. T Consensus 235 ~~~~~~~~~~~~g~~i~~y~~~y~d-----~g~~~~~ip~~~~~l~p~---~~~g~~~yg~~~d~~~~~~~~~~~~~~~~ 306 (341) T protein:vir:39 235 LGKAVSYKGMYGDVAIVVYSGQYIE-----NDVKKNYLPDLTMVLGNT---QARGLRTYGCILDADAQREGINASTRYPK 306 (341) T ss_pred hhhHhhhhhhhcCceEEEEccEEEe-----cCcEEeeecCCeEEEeeC---CCcceEEEecccchhhcccceeeeeeeee Confidence 2 222 455679999999853 234568888876655433 22335678888754332221111111 Q ss_pred ----cccCCceEEEeecccceeeecchhhhhhhcccc Q lcl|Aclame:pro 276 ----IGLRGGQRVRVGESVKELVTAPDLGFFFENAVA 308 (309) Q Consensus 276 ----~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va 308 (309) .+...+..+.+...--|++.-+++=+.. -|| T Consensus 307 ~~~~~~dp~~~~~~~~s~plPv~~~p~~~~~a--~V~ 341 (341) T protein:vir:39 307 NWVQTGDPAREFTMIQSAPLMLLADPDEFVSV--KLA 341 (341) T ss_pred eeeecCCCcEEEEEEeccccceeeCCCcEEEE--EeC Confidence 1122345555555556666656554443 233 No 13 >protein:vir:6378 Length: 346 # NCBI annotation: capsid protein E # Family: family:all:1021 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918991;genbank:gi:34610166;genbank:GeneID:2559600 Probab=99.90 E-value=5.4e-26 Score=159.09 Aligned_cols=294 Identities=10% Similarity=-0.000 Sum_probs=165.8 Q ss_pred CCCCCcchhhHHHHHhhcchhhhhhhhCCccc-ccccccee--EEech-----hHhhhchhHhhcccccccccccCcCcc Q lcl|Aclame:pro 3 NAPFPIDPELTAIAIAYRNGRMISDEVLPRVP-VGKQEFKF--WKYDL-----AQGFTVPETLVGRKSKPNEVEFSATDE 74 (309) Q Consensus 3 ~~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~-v~~~~~k~--~~~~~-----~~~f~~~~t~~~~~~~~~~ve~~~~~~ 74 (309) =..|.+ ..||.+....-+..++.+.+||..+ +......+ ....+ .+.+......+.++.+....+.++.+. T Consensus 1 ~d~f~~-~~l~~~i~~~p~~~~l~~~~fp~~~~~~t~~i~i~~~~g~~~la~~v~~~~~~~~~~~~g~~~~~~~~p~i~~ 79 (346) T protein:vir:63 1 MEIFDT-LTLAGVIQSGPALSMYWQGFYPNEITFDTDEILFDLVFKDKKLAPFVAPNVQGRVIAARGYTTKTFRPAYVKP 79 (346) T ss_pred CCccCH-HHHHHHHHhcCCccchhhhcCccccccccceEEEEEecCceeeeeeecCCCCcceecccceeeeEeecCccCc Confidence 112322 2355554444455689999999543 22222111 11110 011222233455556666777788888 Q ss_pred ceeeeccchhhcCCHHHHHHHhhcCCHHH-------HHHHHHHHHHHHHHHHHHHHHhhccc---------------ccC Q lcl|Aclame:pro 75 TGSTEDHGLDAPVPQADIDNAPTNYNPLG-------HATEQTTNLILLDREARTSKLVFSPN---------------SYA 132 (309) Q Consensus 75 ~~~~~e~~L~~~v~~~~~~~a~~~~d~~~-------~av~~l~~~i~~~~E~~~a~~~~~~~---------------~y~ 132 (309) +..+...++......+... -...++.+ +.+..+.+.|..+.|+||+++++++. ..+ T Consensus 80 ~~~i~~~d~~~~~~~~~~~--~~~~~~~~~~~~~i~~~~~~l~~~i~~~~E~m~~~al~~gki~~~g~~~~~~~vdfg~~ 157 (346) T protein:vir:63 80 KDVINPNRTLKRRAGEQPI--IGGMSLQERFQAVVADSQLEQRQRIENRIEWMCAMATIYGYVDVVGEAFPMQRVDFGRD 157 (346) T ss_pred cceeCHHHHHHHhhhhhhc--cCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCEEEeeCCceeEEEEeeCCC Confidence 8888777765422211110 11223332 23455678899999999999876531 226 Q ss_pred cccceecccccccCCCCCChHHHHHHHHHHh----CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCC-cccccCHHH--- Q lcl|Aclame:pro 133 AGNKTTLSGADQWSDPTSNPLPVITDALDSV----ILRPNIGVLGRRTATILRRHPKIVKAYNGSLG-DEGMVPMAF--- 204 (309) Q Consensus 133 ~~~~~~lsgt~~Wsd~~sdPi~di~~~~~~~----g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~-~~~~vt~~~--- 204 (309) .+|+.+|+++++|++++|||++||++|++.+ |.+|++++||+++|++|++|++|+++++..+. ..+.+++.+ T Consensus 158 ~~~~~~lt~~~~W~~~~adp~~di~~~~~~~~~~~g~~~~~~i~~~~~~~~l~~~~~v~~~~~~~~~~~~~~~~~~~l~~ 237 (346) T protein:vir:63 158 PALTVQLTGGAAWDQATSDPLGNIQTMRTTAWKKSNSTITRLTMGLDAWSLFSQKPAVVELLNLFYKGSTSDFNRSRLDD 237 (346) T ss_pred ccceeeecccccCCCCCCCHHHHHHHHHHHHHHccCCceEEEEECHHHHHHHhcCHHHHHHHhhhccccccccchhhccc Confidence 6899999999999999999999999998764 89999999999999999999999999975432 223333332 Q ss_pred ---------HHHHh--CCCeEEeecceeeccccCCCcccceecCCcEEEEecCCCCCCcCcceeccccccccc-ccCCcc Q lcl|Aclame:pro 205 ---------LQELL--ELDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDR-VSGSIA 272 (309) Q Consensus 205 ---------l~~l~--gl~~I~v~~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~-~~~~~~ 272 (309) +..++ +--+|++|+++|.. .+++.+++|+++.++++.+ +..+...||.+.+...- .+.... T Consensus 238 ~~~~~~~~~~~~~~~~~gi~i~~y~~~y~d----~~G~~~~~ip~~~v~~~p~---~~~g~~~yg~~~d~~~~~~~~~~~ 310 (346) T protein:vir:63 238 GSPVQYQGTIGGYNGMGTLELYTYHDTYTG----DDNTEQEILGSYDVVGTGP---GLQGTQCFGAIMDFKNGLVPTRMF 310 (346) T ss_pred chhhhhhhhHhhhhccCCeEEEEeccEEEc----CCCceeccccCCeEEEEec---CCcceEEEeeccccccCcccceee Confidence 22222 22248889998843 3456778899876655432 22334567777643221 111111 Q ss_pred cc-cc-ccCCceEEEeecccceeeecchhhhhhhccc Q lcl|Aclame:pro 273 DP-NI-GLRGGQRVRVGESVKELVTAPDLGFFFENAV 307 (309) Q Consensus 273 d~-~~-g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~v 307 (309) .. .. ....+..+.+...--|++.-+++=+.++ += T Consensus 311 ~~~~~~~dp~~~~~~~~s~plPv~~~p~~~~~~~-V~ 346 (346) T protein:vir:63 311 PKMWEEEDPSVAMLMTQSAPLMVPAQPNASFRMT-VK 346 (346) T ss_pred eEEEEecCCCEEEEEEeeeccceecCCCcEEEEE-eC Confidence 00 00 0112233333322233333333222111 00 No 14 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=99.38 E-value=5.2e-15 Score=98.88 Aligned_cols=267 Identities=14% Similarity=0.131 Sum_probs=160.8 Q ss_pred CC---------------------CCCCCcchhhHHHHHhhcchhhhhhhhCCcccc-ccccceeEEechhHhhhch-hHh Q lcl|Aclame:pro 1 MS---------------------NAPFPIDPELTAIAIAYRNGRMISDEVLPRVPV-GKQEFKFWKYDLAQGFTVP-ETL 57 (309) Q Consensus 1 m~---------------------~~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v-~~~~~k~~~~~~~~~f~~~-~t~ 57 (309) |+ +-.|++. .+-.++ .+.||+|.||=.+.- .....+|.+..+ +|... ... T Consensus 1 ~~~~~~i~s~~~~~~itv~~ll~~P~~I~~-~i~e~~----~~~~iad~lf~~~~a~~~~~v~f~~~~p--~~~~~d~e~ 73 (318) T protein:vir:10 1 MTAPTGIVSVSDGPAITVRELVGNPLWIPT-ALKKMM----VNQFISESLFRNGGANPNGVVAYNEGNP--SFLEDDVAD 73 (318) T ss_pred CCCCCcceeeecCCceehHHhhCCchhHHH-HHHHHH----hccchhhhhhhcccccccceeEEEeccc--ccccCcHhh Confidence 32 2222222 222222 456899999976532 333445544433 45332 345 Q ss_pred hcccccccccccCcCc-cceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccc Q lcl|Aclame:pro 58 VGRKSKPNEVEFSATD-ETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNK 136 (309) Q Consensus 58 ~~~~~~~~~ve~~~~~-~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~ 136 (309) |+++|+...+...... +....+.++|...|++|.... ...++.+++.+.+.+-+.+..+.++-+.+.++.. ++ T Consensus 74 VaEggEiP~~~~~~G~~~ia~~~K~G~~~~vS~Em~~~--n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t----~~ 147 (318) T protein:vir:10 74 VAEFGEIPVSAGARGLPRTAFAVKKALGVRVSKEMIDE--NRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIV----PT 147 (318) T ss_pred ccCcccccccCCCCCchhhhhhehhccceeccHHHHhh--cChhHHHHHHHHHHHHHHHHHHHHHHHHHhcccc----cc Confidence 7999998887766644 444567889999999987653 3578889999999999988888888887766543 22 Q ss_pred eecccccccCCCCCChHHHHHHHHH-------------------HhCCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcc Q lcl|Aclame:pro 137 TTLSGADQWSDPTSNPLPVITDALD-------------------SVILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDE 197 (309) Q Consensus 137 ~~lsgt~~Wsd~~sdPi~di~~~~~-------------------~~g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~ 197 (309) ... +..|++ .++|..|+-++.+ ..|+.||+|+|++..|..|++|+.+++++..++. . T Consensus 148 ~~~--s~~w~~-~~~~~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~~~~y~~~a~-~ 223 (318) T protein:vir:10 148 LAV--PTAWDN-GGKVRTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENFMKVYERNAN-Y 223 (318) T ss_pred ccC--CcCCCC-cccccccchhhhhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhhhhhhhccch-h Confidence 333 334765 3445544444432 2389999999999999999999999999876542 1 Q ss_pred cccCHH---HH-HHHhCCCeEEeecceeeccccCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccc Q lcl|Aclame:pro 198 GMVPMA---FL-QELLELDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIAD 273 (309) Q Consensus 198 ~~vt~~---~l-~~l~gl~~I~v~~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d 273 (309) ...... .+ -++|||+-|. .+.||.+-+|+...... |+..........++.+ T Consensus 224 ~~~~~~~tg~~~g~~lGl~vi~-----------------s~~~p~~~alvlq~g~v--------G~~~d~~pl~~t~~~~ 278 (318) T protein:vir:10 224 VSTAPDWTGNFPGSVMGLNVIR-----------------SRTFPIDRVLIMERGTV--------GFYSDTRPLQFTALYP 278 (318) T ss_pred hhhcccccccccceeeceEEee-----------------cCccCCCeeEEEecCCc--------ceeeccccceeeeccc Confidence 111111 11 1335655222 23345554555443222 2221111111111111 Q ss_pred ----cccccCCceEEEeecccceeeecchhhhhhhccccC Q lcl|Aclame:pro 274 ----PNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVAA 309 (309) Q Consensus 274 ----~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va~ 309 (309) ++-|...++++++....-.-|+-|-+.++|++.+.- T Consensus 279 egg~~~g~~~~s~~~~~~~~~~~~V~~PkA~~~itgi~~~ 318 (318) T protein:vir:10 279 EGNGPNGGPTESYRADASHKRALAVDQPKAALWLTGIVTP 318 (318) T ss_pred CCCCCCCCcchhhheehheeeeeeeeCcceeEEEeeccCC Confidence 122233456788888888999999999999999999 No 15 >protein:vir:10324 Length: 320 # NCBI annotation: ORF26 # Family: family:all:570 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758919;genbank:gi:27311193;genbank:GeneID:956155 Probab=98.85 E-value=2.8e-11 Score=78.48 Aligned_cols=280 Identities=9% Similarity=-0.056 Sum_probs=115.5 Q ss_pred Hhhcchhhhh---hhhCCccccccccceeEEechhHhhh-chhHhhcccccccccccCc-CccceeeeccchhhcCCHHH Q lcl|Aclame:pro 17 IAYRNGRMIS---DEVLPRVPVGKQEFKFWKYDLAQGFT-VPETLVGRKSKPNEVEFSA-TDETGSTEDHGLDAPVPQAD 91 (309) Q Consensus 17 ~~y~n~~~ig---~~lfP~v~v~~~~~k~~~~~~~~~f~-~~~t~~~~~~~~~~ve~~~-~~~~~~~~e~~L~~~v~~~~ 91 (309) +.+.- ..+| ..+||..+|......+-..+ .... +|. ++|++....+..+. .-+.+.+--...++.|..+| T Consensus 1 i~~~P-~~~g~~~glff~~~~v~T~~V~ie~~~--~~l~lip~--v~rg~~g~~~~~~~~~~~~f~~p~~~~~d~i~a~e 75 (320) T protein:vir:10 1 MNLLP-VNYGDSRALFAREKKVRTRTILVEEKN--GVLTLIQS--REPGSTENVAKRGKRKVRSFVIPHLPLEDVILPDE 75 (320) T ss_pred CCcCC-chhhhhhhhccCCCCcccceEEEEEec--Cceeeeec--cCCCCCceeecCCcceEEEEecceeccCCccCHHH Confidence 33332 2233 23456655543332211111 1111 222 23333322222211 11222222222334444444 Q ss_pred HHHHhh--------cCCHHHHHHHHHHHHHHHHHHHHHHHHhhcc-----c----c----cCcccc---eecc-cccccC Q lcl|Aclame:pro 92 IDNAPT--------NYNPLGHATEQTTNLILLDREARTSKLVFSP-----N----S----YAAGNK---TTLS-GADQWS 146 (309) Q Consensus 92 ~~~a~~--------~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~-----~----~----y~~~~~---~~ls-gt~~Ws 146 (309) +++... .-+...+-+..+.+.+++.+|+++++.++-. + + |+-+.+ .+|+ ++..|. T Consensus 76 iq~~Ra~G~~~~~~~~~~v~~~l~~lr~~~~~T~E~m~~~AL~G~ildadGtv~~d~y~~fGi~~~~i~~~l~~a~~dv~ 155 (320) T protein:vir:10 76 YEGLRGFGTTALAAKSELVKERXETMKSSHDITHEHLRMGAKKGQILDADGTVLYDLYAEFGITKKTIYFGLDNKDANVA 155 (320) T ss_pred HcCcccCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCeEEcCCCcEEEechhhhCCccceeEEecCCCCccHH Confidence 432211 1111233345567889999999999987521 0 1 111111 1221 111222 Q ss_pred CCCCChHHHHHHHHHHhCCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHH--hCCCeEEeecceeecc Q lcl|Aclame:pro 147 DPTSNPLPVITDALDSVILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQEL--LELDAIYIGEARLNIA 224 (309) Q Consensus 147 d~~sdPi~di~~~~~~~g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l--~gl~~I~v~~a~~~~~ 224 (309) ....+++..|++|....++..-.+++|+++|++|..||+|++++..... .+..-...+..- ||-=.+..|+++|.. T Consensus 156 ~~~~~~~~~i~~~l~g~~~t~v~al~g~~f~~al~~h~~Vke~y~~~~~-~~~~l~~~~~~~f~~gGi~~~~Y~g~~~d- 233 (320) T protein:vir:10 156 ESCRQVLRHVEDNLRGDVMKDVSVDVSEEFFDKFIKHASVKEVFLNHEA-AVNRLGGDTRKGFKFGGLIFNENRARHVD- 233 (320) T ss_pred HHHHHHHHHHHHHhccCCCCceEEEEChHHHHHHhcCHHHHHHHHhhhh-hhhhccccccceEEecCEEEEEcccEEEc- Confidence 2233444444444443455556899999999999999999999875432 111111111111 221235667777642 Q ss_pred ccCCCcccceecCCcE-EEEecCCCCCCcCcceeccccccccccc---CCccccccccCCceEEEeecccceeeecchhh Q lcl|Aclame:pro 225 RPGQNPNLIRAWGPHA-SFIYRDRLADTRNGTTFGLTAQWGDRVS---GSIADPNIGLRGGQRVRVGESVKELVTAPDLG 300 (309) Q Consensus 225 ~~g~~~~~~~v~~~~~-~L~~~~~~~~~~~~~t~G~T~~~~~~~~---~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G 300 (309) .+++.+++.+++. .+++.+..+ ..- +|+-.+.+-...+ -.......-++...-+.+-..-.|.-+..-=+ T Consensus 234 ---~~g~~~~~I~~~~~~~~p~g~~~--~f~-~~~apad~~e~vnt~g~p~y~k~~~~~~~~g~~l~~qS~PLpi~~rP~ 307 (320) T protein:vir:10 234 ---EEGKETRFIKAGKGHAFPTGTTN--TFF-TALAPADFNETAGTLGKRYYAKMEPRRMGRGFDLHSQSNVLPMCCRPG 307 (320) T ss_pred ---CCCCeeEeecCCeeEEEEecCch--hhe-eeecccCcHhhcCCcccccccccccccCCCeEEEEeeecccccccCcc Confidence 3344556666654 444333222 111 2222222111111 11111111112222233333333443444444 Q ss_pred hhhhccccC Q lcl|Aclame:pro 301 FFFENAVAA 309 (309) Q Consensus 301 ~l~~~~va~ 309 (309) .|++-.++| T Consensus 308 ~lv~~~~~a 316 (320) T protein:vir:10 308 VLVELDAAA 316 (320) T ss_pred eEEEEEecC Confidence 455444444 No 16 >protein:vir:95258 Length: 368 # NCBI annotation: Phage conserved protein # Family: family:all:570 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944891;genbank:gi:38707831;genbank:GeneID:2744044 Probab=97.88 E-value=1.5e-06 Score=52.47 Aligned_cols=295 Identities=12% Similarity=0.029 Sum_probs=132.1 Q ss_pred CCC----CCCCcchhhHHHHHhhc-chhhhhhh-hCCccccccccceeEEechhHhhh-chhHhhccccccccc-ccCc- Q lcl|Aclame:pro 1 MSN----APFPIDPELTAIAIAYR-NGRMISDE-VLPRVPVGKQEFKFWKYDLAQGFT-VPETLVGRKSKPNEV-EFSA- 71 (309) Q Consensus 1 m~~----~~f~~dp~LT~~a~~y~-n~~~ig~~-lfP~v~v~~~~~k~~~~~~~~~f~-~~~t~~~~~~~~~~v-e~~~- 71 (309) |-+ ..|.+-. ||.-..-.- ++.+|++. ||+..+|......+-.-+ ..+. ++. +.|++++.++ ..+. T Consensus 1 ~~d~f~~d~Fs~~~-LT~ain~~p~~p~~l~~lglF~~~~v~t~~v~iE~~~--~~l~Lvp~--~~rg~~~~~~~~~~~r 75 (368) T protein:vir:95 1 MLTNSEKSRFFLAD-LTGEVQSIPNTYGYISNLGLFRSAPITQTTFLMDLTD--WDVSLLDA--VDRDSRKAETSAPERV 75 (368) T ss_pred CcccccCCcccHHH-HHHHHHhcCCCcceecccccccCCCccceEEEEEEEc--CeEEEccc--cCCCCCCcccccCCce Confidence 443 3454333 666554443 35688876 888777664433321111 1122 222 3344433322 1221 Q ss_pred CccceeeeccchhhcCCHHHHHHHhh---------cCCHHHHHHHHHHHHHHHHHHHHHHHHhhcc------c---cc-- Q lcl|Aclame:pro 72 TDETGSTEDHGLDAPVPQADIDNAPT---------NYNPLGHATEQTTNLILLDREARTSKLVFSP------N---SY-- 131 (309) Q Consensus 72 ~~~~~~~~e~~L~~~v~~~~~~~a~~---------~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~------~---~y-- 131 (309) .-+...+--...++.|..+|+++-.. +.+...+-.+.+.+.+.+.+|+++++.++.. . +| T Consensus 76 ~~~~f~~ph~~~~d~I~a~eiQg~RafG~~~~l~~v~~~v~~kl~~~r~~~d~T~E~~r~gAL~G~ilDadGtvl~dly~ 155 (368) T protein:vir:95 76 RQISFPMMYFKEVESITPDEIQGVRQPGTANELTTEAVVRAKKLMKIRTKFDITREFLFMQALKGKVVDARGTLYADLYK 155 (368) T ss_pred eEEEEecceeccccccchHHHccccCCCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCeeECCCCcEEecchh Confidence 22344444445555666666554211 1112223335567788899999999876521 0 11 Q ss_pred --CcccceecccccccCCCCCChHHHHHHHHHHh-------CCC---CcEEEeCHHHHHHHhcCHHHHHHhccCCCcc-c Q lcl|Aclame:pro 132 --AAGNKTTLSGADQWSDPTSNPLPVITDALDSV-------ILR---PNIGVLGRRTATILRRHPKIVKAYNGSLGDE-G 198 (309) Q Consensus 132 --~~~~~~~lsgt~~Wsd~~sdPi~di~~~~~~~-------g~~---Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~-~ 198 (309) +-+.+ +. .-..++++.|+-+.+.+|+..+ ++. .-.++.|+..|++|..||+|++++++..... . T Consensus 156 eFGit~~-~v--~f~l~~~~tdv~~~~~~~~~~i~d~l~g~~~~~~~~v~alcg~~Ffd~L~~h~~Vkeay~~~~~a~~~ 232 (368) T protein:vir:95 156 QFDVEKK-TI--YFDLDNPNADIDASIEELRMHMEDEAKTGTVINGEEIHVVVDRVFFSKLTKHPKIRDAYLAQQTPLAW 232 (368) T ss_pred hhCCccc-eE--EEEeCCCCcCHHHHHHHHHHHHHHhhcccccccccceEEEEChHHHHHhhcChhHHHHHHHHHhhhhh Confidence 11111 11 1123468889999999996543 223 3578889999999999999999987532111 0 Q ss_pred ccCHHHHH-----------HHh---CCCeEEeecceeeccccCCCcccceecCCcEE--------EEecCCC---CCCcC Q lcl|Aclame:pro 199 MVPMAFLQ-----------ELL---ELDAIYIGEARLNIARPGQNPNLIRAWGPHAS--------FIYRDRL---ADTRN 253 (309) Q Consensus 199 ~vt~~~l~-----------~l~---gl~~I~v~~a~~~~~~~g~~~~~~~v~~~~~~--------L~~~~~~---~~~~~ 253 (309) .-.+..+. .-| |+ ...-|.+++ .+..++..++++++.+ +++.... ..+.. T Consensus 233 ~~lr~~~r~g~~~~~~~~~~~F~fgGi-~f~eYrg~~----~~~~g~~~~~v~~d~v~I~~gea~~~P~G~~~~~~~~~F 307 (368) T protein:vir:95 233 QQITGSLRTGGADGVQAHMNTFYYGGV-KFVQYNGKF----KDKRGKVHTLVSIDSVADTVGVGHAFPNVAMLGEANNIF 307 (368) T ss_pred hhhccccccccccccccccceeEecCE-EEEEcceee----cCCCcceeeeecCCceeeccCceEEEeecccccccCcce Confidence 00011110 002 22 122244433 4445555566665432 2332211 11222 Q ss_pred cceecccccccccccCCccccc---cccCCceEEEeecccceeeecchhhhhhhccccC Q lcl|Aclame:pro 254 GTTFGLTAQWGDRVSGSIADPN---IGLRGGQRVRVGESVKELVTAPDLGFFFENAVAA 309 (309) Q Consensus 254 ~~t~G~T~~~~~~~~~~~~d~~---~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va~ 309 (309) -+.|++- .+-...+..-.+-| +-+..++.+.+.-...|.-+..-=+.|++...+| T Consensus 308 ~~~~aPa-d~~e~vNt~g~p~Ya~~~~~~~~~g~~le~qSnpLpic~RP~~lv~~~~~a 365 (368) T protein:vir:95 308 EVAYGPC-PKMGYANTLGQELYVFEYEKDRDEGIDFEAHSYMLPYCTRPQLLVDVRADA 365 (368) T ss_pred EEEecCC-CcHhhcCCCcccccceeeeccCCCeeEEEEeecccchhcccceeEEEEecC Confidence 2333332 11111111000000 0011222232222223333333333455444444 No 17 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=97.74 E-value=4.1e-06 Score=50.12 Aligned_cols=281 Identities=11% Similarity=-0.014 Sum_probs=135.5 Q ss_pred CCC-CCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCccceeee Q lcl|Aclame:pro 1 MSN-APFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDETGSTE 79 (309) Q Consensus 1 m~~-~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~~~~~ 79 (309) |+- +-..+.+.+.+-.+..-.+..+=..+++.+++.....++|++..... ..-++.++........+.+.+...+ T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~----a~~v~Eg~~~~~~~~~f~~v~l~~~ 76 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSE----IDVVAESGKKTHGGVTLAPQTMVPI 76 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcc----eEEeeCCccccccccceeEEEEeee Confidence 983 44444444433223322233344556788888877788888743211 1223444544444555555555554 Q ss_pred ccchhhcCCHHHHHHH-hhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccce---eccccc---ccCCCCCCh Q lcl|Aclame:pro 80 DHGLDAPVPQADIDNA-PTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKT---TLSGAD---QWSDPTSNP 152 (309) Q Consensus 80 e~~L~~~v~~~~~~~a-~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~---~lsgt~---~Wsd~~sdP 152 (309) ..+-..++++|-.++. ....+..+...+.+.+.|.+..|..+-........-+..... ..+.+. ....+..++ T Consensus 77 k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (298) T protein:vir:94 77 KVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADP 156 (298) T ss_pred EEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccccccccccccccccccccH Confidence 4454556666654322 122334444555566666655554443322111110000000 011111 123356678 Q ss_pred HHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCccc---ccCHHHHHHHhCCCeEEeecceeecccc Q lcl|Aclame:pro 153 LPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEG---MVPMAFLQELLELDAIYIGEARLNIARP 226 (309) Q Consensus 153 i~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~---~vt~~~l~~l~gl~~I~v~~a~~~~~~~ 226 (309) +.||.+....+ +..++..+|+++.|.+|+. ++..++..- ......-..++|+| |++-+..-.. . T Consensus 157 ~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~-------lkd~~G~~l~~~~~~~~~~~tl~G~P-V~~~~~v~~~--~ 226 (298) T protein:vir:94 157 NGAIENAVELLTGVDADVTGIAINPSFRSALAK-------QKDLQGNALFPELKWGATPDTINGLP-VDVNKTVSDM--S 226 (298) T ss_pred HHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHH-------hhccCCCeeecCcccCCCCceeccee-eEEecccccc--c Confidence 88999988766 6789999999999998754 222222210 11111123577887 4444432211 1 Q ss_pred CCCcccceecCCcEE-EEecCCCCCCcCcceeccccccccc--ccCCccccccccCCceEEEeecccceeeecchhhhhh Q lcl|Aclame:pro 227 GQNPNLIRAWGPHAS-FIYRDRLADTRNGTTFGLTAQWGDR--VSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFF 303 (309) Q Consensus 227 g~~~~~~~v~~~~~~-L~~~~~~~~~~~~~t~G~T~~~~~~--~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~ 303 (309) + .....-++++-.- +.|.... +.+.+.... ..++ . .++...+...+|+....+-.+.-+.+-..+ T Consensus 227 ~-~~~~~~~~Gdfs~~~~~~~~~---------~~~~~~~~~~~~d~~-~-~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l 294 (298) T protein:vir:94 227 L-TQRDRAIIGDFANGFKWGYAK---------EVPLEVIQYGDPDNS-G-LDLKGYNQVYIRAELFLGWGILDATKFARV 294 (298) T ss_pred C-CCccEEEEeeccceEEEEEec---------CceEEEeecCCCcCc-c-hhhhhcCcEEEEEEEEeccEeecccceEEE Confidence 1 1111122233211 1111100 111111100 0011 0 012235556688888888888888888888 Q ss_pred hccc Q lcl|Aclame:pro 304 ENAV 307 (309) Q Consensus 304 ~~~v 307 (309) +++. T Consensus 295 ~~~t 298 (298) T protein:vir:94 295 TEAN 298 (298) T ss_pred EecC Confidence 8888 No 18 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=97.57 E-value=1.1e-05 Score=47.76 Aligned_cols=279 Identities=9% Similarity=-0.007 Sum_probs=136.9 Q ss_pred CCC--CCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCccceee Q lcl|Aclame:pro 1 MSN--APFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDETGST 78 (309) Q Consensus 1 m~~--~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~~~~ 78 (309) |+. ...++....+.|- ..-.+..+=..+++.+|+.....++|+...... ..-++.+++.......+...++.. T Consensus 1 ma~~gG~lvp~~~~~~ii-~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~----a~~v~E~~~~~~~~~~f~~v~l~~ 75 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLI-SKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSE----IDVVAESGKKTHGGVTLAPQTMVP 75 (298) T ss_pred CcccCcceechhHHHHHH-HHHHhhhhhhhhcceeeccCCceEEEEEecCcc----eEEecCCccccccccceeEEEEee Confidence 984 3444444444443 332333444566788888877778888654211 122344444444444454444444 Q ss_pred eccchhhcCCHHHHHHH-hhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccC-----cccceec-ccccccCCCCCC Q lcl|Aclame:pro 79 EDHGLDAPVPQADIDNA-PTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYA-----AGNKTTL-SGADQWSDPTSN 151 (309) Q Consensus 79 ~e~~L~~~v~~~~~~~a-~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~-----~~~~~~l-sgt~~Wsd~~sd 151 (309) +..+-..++++|-.... .+.++..+...+.+.+.+.+..|..+-...-....-+ ..+.... +..........+ T Consensus 76 ~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 155 (298) T protein:vir:16 76 IKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIAD 155 (298) T ss_pred eeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCccccccccccccccccccccccccccc Confidence 44444455666654332 2234445555666777776666655443321111100 0011110 111123345677 Q ss_pred hHHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCH----HHHHHHhCCCeEEeecceeecc Q lcl|Aclame:pro 152 PLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPM----AFLQELLELDAIYIGEARLNIA 224 (309) Q Consensus 152 Pi~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~----~~l~~l~gl~~I~v~~a~~~~~ 224 (309) +..+|.+....+ +..+...+|+++.|.+|+. + +..++.+ +..+ ..-..++|+| |++.+..-... T Consensus 156 ~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~---l----kd~~G~~-i~~~~~~~~~~~~l~G~P-V~~~~~v~~~~ 226 (298) T protein:vir:16 156 PNGAIENAVELLTGVDADVTGIAINPSFRSALAK---Q----KDLQDNA-LFPELKWGATPDTINGLP-VDVNKTVSDMS 226 (298) T ss_pred HHHHHHHHHHHhhhcCCCccEEEEcHHHHHHHHH---h----hccCCCe-eecCcccCCCCceeccee-eEEeccccccc Confidence 888999987765 6778899999999998755 2 2222222 1111 1124577887 55544322111 Q ss_pred ccCCCcccceecCCcE-EEEecCCCCCCcCcceeccccccccc--ccCCccccccccCCceEEEeecccceeeecchhhh Q lcl|Aclame:pro 225 RPGQNPNLIRAWGPHA-SFIYRDRLADTRNGTTFGLTAQWGDR--VSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGF 301 (309) Q Consensus 225 ~~g~~~~~~~v~~~~~-~L~~~~~~~~~~~~~t~G~T~~~~~~--~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~ 301 (309) ......-++++.. .+.+.. .-+.+.+.... ..+. . .++-..+..-+|+.+..+-.+.-+++=. T Consensus 227 ---~~~~~~~~~GDfs~~~~~~~---------~~~~~~~~~~~~~~~~~-~-~~~f~~~~v~~ra~~r~d~~v~~~~a~~ 292 (298) T protein:vir:16 227 ---LTQRDRAIIGDFANGFKWGY---------AKEVPLEVIQYGDPDNS-G-LDLKGYNQVYIRAELFLGWGILDATKFA 292 (298) T ss_pred ---CCCccEEEEeeccceEEEEE---------ecCceEEEeeccCCcCc-c-hhhhhcCcEEEEEEEEEccEeecccceE Confidence 1111122333321 111110 11112221110 0000 0 0112345556888999999999998888 Q ss_pred hhhccc Q lcl|Aclame:pro 302 FFENAV 307 (309) Q Consensus 302 l~~~~v 307 (309) .+++|. T Consensus 293 ~l~~at 298 (298) T protein:vir:16 293 RVTEAN 298 (298) T ss_pred EEeecC Confidence 888888 No 19 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=97.55 E-value=7e-06 Score=48.84 Aligned_cols=276 Identities=12% Similarity=-0.021 Sum_probs=139.1 Q ss_pred CC-C--CCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCcccee Q lcl|Aclame:pro 1 MS-N--APFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDETGS 77 (309) Q Consensus 1 m~-~--~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~~~ 77 (309) |. . ...++..+.+.|-..-++..-|. .+++.+|+.....++|+..... ...-++.++.......++.+.++. T Consensus 30 ~~~~~~~~~iP~~~~~~ii~~~~~~s~l~-~l~~~~~~~~~~~~~p~~~~~~----~a~~v~Eg~~~~~~~~~~~~v~~~ 104 (324) T protein:vir:96 30 MMHEKKDGTLMNEFTTPILQEVMENSKIM-QLGKYEPMEGTEKKFTFWADKP----GAYWVGEGQKIETSKATWVNATMR 104 (324) T ss_pred cccCcCccccchhHHHHHHHHHHhhchhh-hhcceeeccCCceEEEEEecCc----ceeEecCCccccccccceeEEEEe Confidence 32 1 13555665555544333333333 3578889888788999885421 122355566666666777777777 Q ss_pred eeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceec--ccccccCCCCCChHHH Q lcl|Aclame:pro 78 TEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTL--SGADQWSDPTSNPLPV 155 (309) Q Consensus 78 ~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~l--sgt~~Wsd~~sdPi~d 155 (309) .+..+...+++++-.++. .++....-.+.+.+.+.+..|..+ +++..-.......+ .+.......+.....+ T Consensus 105 ~~k~~~~~~is~ell~ds--~~~l~~~i~~~la~ai~~~~d~a~----l~G~g~~~~~~gi~~~~~~~~~~~~~~~t~~~ 178 (324) T protein:vir:96 105 AFKLGVILPVTKEFLNYT--YSQFFEEMKPMIAEAFYKKFDEAG----ILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDN 178 (324) T ss_pred eEEEEEeehhhHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHH----hccCCCCCcCccccccccccceeccccccHHH Confidence 777776677777755543 244455555566666655555433 22211001111111 1122233345556777 Q ss_pred HHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeeccccCCCccc Q lcl|Aclame:pro 156 ITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPGQNPNL 232 (309) Q Consensus 156 i~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~~~~g~~~~~ 232 (309) |.+....+ +..++..+|++++|.+|+. + +..++.. ++....-..++|+|- ++..+. ...++. T Consensus 179 i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~---l----~d~~G~~-~~~~~~~~~l~G~PV-~~~~~~-----~~~~~~- 243 (324) T protein:vir:96 179 IIDLEALLEDDELEANAFISKTQNRSLLRK---I----VDPETKE-RIYDRNSDSLDGLPV-VNLKSS-----NLKRGE- 243 (324) T ss_pred HHHHHHhhhhccCCCCEEEEcHHHHHHHHH---h----hccCCCe-eecCCCCCcccceee-EeeCCC-----CCCcce- Confidence 77776555 6789999999999998754 2 2222221 221112234778873 332111 111111 Q ss_pred ceecCCcEEEEecCCCCCCc---CcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhccccC Q lcl|Aclame:pro 233 IRAWGPHASFIYRDRLADTR---NGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVAA 309 (309) Q Consensus 233 ~~v~~~~~~L~~~~~~~~~~---~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va~ 309 (309) -++++..-+++....+-.. .+.+++.... ..+... ..-..+...+|+.+.++-.+.-+.+-..|+++.++ T Consensus 244 -~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~----~~~~~~--~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~ 316 (324) T protein:vir:96 244 -LITGDFDKLIYGIPQLIEYKIDETAQLSTVKN----EDGTPV--NLFEQDMVALRATMHVALHIADDKAFAKLVPADKR 316 (324) T ss_pred -EEEEecceEEEEEecCcEEEEeeccccccccc----ccccch--hhhhcCcEEEEEEEEEccEEecccceEEEeccccc Confidence 1222211111111001000 0000000000 000000 11123456789999999999999998899988888 No 20 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=97.55 E-value=7e-06 Score=48.84 Aligned_cols=276 Identities=12% Similarity=-0.021 Sum_probs=139.1 Q ss_pred CC-C--CCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCcccee Q lcl|Aclame:pro 1 MS-N--APFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDETGS 77 (309) Q Consensus 1 m~-~--~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~~~ 77 (309) |. . ...++..+.+.|-..-++..-|. .+++.+|+.....++|+..... ...-++.++.......++.+.++. T Consensus 30 ~~~~~~~~~iP~~~~~~ii~~~~~~s~l~-~l~~~~~~~~~~~~~p~~~~~~----~a~~v~Eg~~~~~~~~~~~~v~~~ 104 (324) T protein:vir:78 30 MMHEKKDGTLMNEFTTPILQEVMENSKIM-QLGKYEPMEGTEKKFTFWADKP----GAYWVGEGQKIETSKATWVNATMR 104 (324) T ss_pred cccCcCccccchhHHHHHHHHHHhhchhh-hhcceeeccCCceEEEEEecCc----ceeEecCCccccccccceeEEEEe Confidence 32 1 13555665555544333333333 3578889888788999885421 122355566666666777777777 Q ss_pred eeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceec--ccccccCCCCCChHHH Q lcl|Aclame:pro 78 TEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTL--SGADQWSDPTSNPLPV 155 (309) Q Consensus 78 ~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~l--sgt~~Wsd~~sdPi~d 155 (309) .+..+...+++++-.++. .++....-.+.+.+.+.+..|..+ +++..-.......+ .+.......+.....+ T Consensus 105 ~~k~~~~~~is~ell~ds--~~~l~~~i~~~la~ai~~~~d~a~----l~G~g~~~~~~gi~~~~~~~~~~~~~~~t~~~ 178 (324) T protein:vir:78 105 AFKLGVILPVTKEFLNYT--YSQFFEEMKPMIAEAFYKKFDEAG----ILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDN 178 (324) T ss_pred eEEEEEeehhhHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHH----hccCCCCCcCccccccccccceeccccccHHH Confidence 777776677777755543 244455555566666655555433 22211001111111 1122233345556777 Q ss_pred HHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeeccccCCCccc Q lcl|Aclame:pro 156 ITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPGQNPNL 232 (309) Q Consensus 156 i~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~~~~g~~~~~ 232 (309) |.+....+ +..++..+|++++|.+|+. + +..++.. ++....-..++|+|- ++..+. ...++. T Consensus 179 i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~---l----~d~~G~~-~~~~~~~~~l~G~PV-~~~~~~-----~~~~~~- 243 (324) T protein:vir:78 179 IIDLEALLEDDELEANAFISKTQNRSLLRK---I----VDPETKE-RIYDRNSDSLDGLPV-VNLKSS-----NLKRGE- 243 (324) T ss_pred HHHHHHhhhhccCCCCEEEEcHHHHHHHHH---h----hccCCCe-eecCCCCCcccceee-EeeCCC-----CCCcce- Confidence 77776555 6789999999999998754 2 2222221 221112234778873 332111 111111 Q ss_pred ceecCCcEEEEecCCCCCCc---CcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhccccC Q lcl|Aclame:pro 233 IRAWGPHASFIYRDRLADTR---NGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVAA 309 (309) Q Consensus 233 ~~v~~~~~~L~~~~~~~~~~---~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va~ 309 (309) -++++..-+++....+-.. .+.+++.... ..+... ..-..+...+|+.+.++-.+.-+.+-..|+++.++ T Consensus 244 -~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~----~~~~~~--~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~ 316 (324) T protein:vir:78 244 -LITGDFDKLIYGIPQLIEYKIDETAQLSTVKN----EDGTPV--NLFEQDMVALRATMHVALHIADDKAFAKLVPADKR 316 (324) T ss_pred -EEEEecceEEEEEecCcEEEEeeccccccccc----ccccch--hhhhcCcEEEEEEEEEccEEecccceEEEeccccc Confidence 1222211111111001000 0000000000 000000 11123456789999999999999998899988888 No 21 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=97.34 E-value=1.5e-05 Score=46.98 Aligned_cols=288 Identities=11% Similarity=0.015 Sum_probs=133.9 Q ss_pred CCC---CCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCcccee Q lcl|Aclame:pro 1 MSN---APFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDETGS 77 (309) Q Consensus 1 m~~---~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~~~ 77 (309) |+. .-+.+-+.+.+--+..-.+.-+-..+++.+|+.....++|++..... ..-++.++.....+.++.+.++. T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~~~~p~~~~~~~----a~wv~Eg~~~~~~~~~f~~v~l~ 76 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPR----GEVVGEGAQKSESTATFAPVTAI 76 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhcceeecCCCceEEEEEeCCce----eEEeecCcccccccceeeEEEEe Confidence 663 33443343433333333333344567788888877788988854211 12244555555556666666666 Q ss_pred eeccchhhcCCHHHHHHH-hhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhccc--c---c-----CcccceecccccccC Q lcl|Aclame:pro 78 TEDHGLDAPVPQADIDNA-PTNYNPLGHATEQTTNLILLDREARTSKLVFSPN--S---Y-----AAGNKTTLSGADQWS 146 (309) Q Consensus 78 ~~e~~L~~~v~~~~~~~a-~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~--~---y-----~~~~~~~lsgt~~Ws 146 (309) .+.-+-..++++|-.++. ...++.++...+.+.+.+....|..+-...-+.. . . ...+..+.++ T Consensus 77 ~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~~----- 151 (311) T protein:vir:81 77 PRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTT----- 151 (311) T ss_pred eEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeeeecc----- Confidence 655555556666654322 2233444555555666665555544433221111 0 0 0111111111 Q ss_pred CCCCChHHHHHHHHHH---hCCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCccc---ccCHHHHHHHhCCCeEEeecce Q lcl|Aclame:pro 147 DPTSNPLPVITDALDS---VILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEG---MVPMAFLQELLELDAIYIGEAR 220 (309) Q Consensus 147 d~~sdPi~di~~~~~~---~g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~---~vt~~~l~~l~gl~~I~v~~a~ 220 (309) .....+..+|...... .+..|+..+|++.+|.+|+. ++..+++.- ..+...-..++|+| |++-+.. T Consensus 152 ~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~-------lkd~~G~~l~~~~~~~~~~~tl~G~P-v~~~~~i 223 (311) T protein:vir:81 152 GTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLAT-------QRDSQGRKLYPELGFGTDVASFAGLN-AAVSDTV 223 (311) T ss_pred cccchHHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHHh-------hhccCCCeeecCccccCCCceeccee-EEecccc Confidence 1223455567666544 37889999999999988754 222222221 11111234567877 4432221 Q ss_pred eeccccCCCcccceecCCcE-EEEecCCCCCCcCcceeccccccccc--ccCCccccccccCCceEEEeecccceeeecc Q lcl|Aclame:pro 221 LNIARPGQNPNLIRAWGPHA-SFIYRDRLADTRNGTTFGLTAQWGDR--VSGSIADPNIGLRGGQRVRVGESVKELVTAP 297 (309) Q Consensus 221 ~~~~~~g~~~~~~~v~~~~~-~L~~~~~~~~~~~~~t~G~T~~~~~~--~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~ 297 (309) -..-...........+.... .+++.+-.. -..+..-+.+.+-... ..++ ..+...+...+|+.+.++-.+.-+ T Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~-~~i~~~~~~~~~~~~~~~~~~~---~~~~~~~~v~~r~~~r~d~~v~~~ 299 (311) T protein:vir:81 224 RGGPEAVTASTGVYRTTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGL---GDLKRQNQIAIRAEVVYGIGIMST 299 (311) T ss_pred cccccccccccchhcccCCccEEEEEeccc-EEEEEeccceEEEeccCCCCcc---hhhhhcCcEEEEEEEEeccEeecc Confidence 10000000000000111110 001110000 0000000111111000 0000 012334566789999999999999 Q ss_pred hhhhhhhccccC Q lcl|Aclame:pro 298 DLGFFFENAVAA 309 (309) Q Consensus 298 ~~G~l~~~~va~ 309 (309) ++-..++.++-| T Consensus 300 ~a~~~l~~a~~~ 311 (311) T protein:vir:81 300 DAFAVVRDADES 311 (311) T ss_pred cceEEEEeeccC Confidence 999999999999 No 22 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=97.34 E-value=2.3e-05 Score=45.99 Aligned_cols=275 Identities=13% Similarity=0.008 Sum_probs=138.3 Q ss_pred CC-C--CCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCcccee Q lcl|Aclame:pro 1 MS-N--APFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDETGS 77 (309) Q Consensus 1 m~-~--~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~~~ 77 (309) |+ + ...++..+.+.|-..-++..-| -.+++.+|+.....+||++..... ...++.++.......++.+.++. T Consensus 30 ~~~~~~~~lip~~~~~~ii~~~~~~s~l-~~l~~~~~~~~~~~~~p~~~~~~~----a~~v~Eg~~~~~~~~~f~~v~~~ 104 (324) T protein:vir:96 30 MMHEKKDGTLLNDFTTPILQEVMENSKI-MQLGKYEPMEGTEKKFTFWADKPG----AYWVGEGQKIETSKATWVNATMR 104 (324) T ss_pred cccCCCcceechhHHHHHHHHHHhhchh-hhhcceeeccCCceEEEEEecCcc----eeeecCCccccccccceeEEEEE Confidence 22 1 2245555444444332322223 235788888887788998754211 22356666666666777777888 Q ss_pred eeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccc---cCcccceecccccccCCCCCChHH Q lcl|Aclame:pro 78 TEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNS---YAAGNKTTLSGADQWSDPTSNPLP 154 (309) Q Consensus 78 ~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~---y~~~~~~~lsgt~~Wsd~~sdPi~ 154 (309) .++.+-..+|+++-.++. .++....-.+.+.+.|....|..+ +++.. .+.........+ .....+..... T Consensus 105 ~~k~~~~~~is~ell~ds--~~~l~~~i~~~l~~aia~~~d~~~----l~G~g~~~~~~~~~~~~~~~-~~~~~~~~~~~ 177 (324) T protein:vir:96 105 AFKLGVILPVTKEFLNYT--YSQFFEEMKPMIAEAFYKKFDEAG----ILNQGNNPFGKSIAQSIKKT-NKVIKGDFTQD 177 (324) T ss_pred eEEEEEeehhhHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHh----hhcCCCCCcCcccccccccc-ceecccccchH Confidence 877777777887765543 244455555666666655555432 22211 111111111111 12223455677 Q ss_pred HHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeeccccCCCcc Q lcl|Aclame:pro 155 VITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPGQNPN 231 (309) Q Consensus 155 di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~~~~g~~~~ 231 (309) +|.+....+ +..|+.++|+++.|..|+. ++. .++.. ++....-..++|+|-+ +..+. ..... T Consensus 178 ~i~~~~~~i~~~~~~~~~~i~n~~~~~~L~~---lkd----~~G~~-~~~~~~~~~l~G~PV~-~~~~~-----~~~~~- 242 (324) T protein:vir:96 178 NIIDLEALLEDDELEANAFISKTQNRSLLRK---IVD----PETKE-RIYDRNSDSLDGLPVV-NLKSS-----NLKRG- 242 (324) T ss_pred HHHHHHHhhhhccCCCCEEEEcHHHHHHHHH---hhC----CCCCe-eecCCCCCcccceeeE-eecCC-----CCCcc- Confidence 777776655 6789999999999988764 222 22221 1111122346788733 22111 11111 Q ss_pred cceecCCcEEEEecCCCCCCc---CcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhcccc Q lcl|Aclame:pro 232 LIRAWGPHASFIYRDRLADTR---NGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVA 308 (309) Q Consensus 232 ~~~v~~~~~~L~~~~~~~~~~---~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va 308 (309) .-++++..-+++....+-.. .+.+++ . +.. ..+.. ...-..+...+|+.+.++-.+..+++-..++.|.+ T Consensus 243 -~~~~gd~s~~~~~~~~~~~i~~~~~~~~~--~-~~~-~~~~~--~~~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a~~ 315 (324) T protein:vir:96 243 -ELITGDFDKLIYGIPQLIEYKIDETAQLS--T-VKN-EDGTP--VNLFEQDMVALRATMHVALHIADDKAFAKLVPADK 315 (324) T ss_pred -eEEEEecceEEEEEecCcEEEEeeccccc--c-ccc-ccccc--hhhhhcCcEEEEEEEEeccEEecccceEEEecccc Confidence 12223221111111111000 000000 0 000 00000 01123455678999999999999999999998888 Q ss_pred C Q lcl|Aclame:pro 309 A 309 (309) Q Consensus 309 ~ 309 (309) + T Consensus 316 ~ 316 (324) T protein:vir:96 316 R 316 (324) T ss_pred c Confidence 8 No 23 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=97.24 E-value=4e-05 Score=44.72 Aligned_cols=284 Identities=14% Similarity=0.087 Sum_probs=133.8 Q ss_pred CCCCCCCcchh--------------hHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccc Q lcl|Aclame:pro 1 MSNAPFPIDPE--------------LTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNE 66 (309) Q Consensus 1 m~~~~f~~dp~--------------LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ 66 (309) |+..-+...-. ...+...-++..-| -.+++.+++.....+||++...-.. .-++.++.... T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l-~~~~~~~~~~~~~~~~p~~~~~~~a----~~v~Eg~~~~~ 75 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTPEQSQDYFAEIEKTSIV-QRIARKVPMGPTGISIPHWTGAVSA----SWTGEAERKPI 75 (330) T ss_pred CcccccchhhccccCCCcceechhHHHHHHHHHHhccch-hhhcceeeccCCceEEEEEcCCcce----eEecCCCcccc Confidence 65443322211 11111111111112 2345677887777888887542111 12344555555 Q ss_pred cccCcCccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcc-----------c Q lcl|Aclame:pro 67 VEFSATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAG-----------N 135 (309) Q Consensus 67 ve~~~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~-----------~ 135 (309) ...++.+.++..+..+-..+++++-..+ +.++..+...+.+.+.+....|. .++++..-+.. + T Consensus 76 ~~~~f~~i~~~~~k~~~~~~is~ell~d--s~~~~~~~i~~~l~~ai~~~~~~----~~l~G~g~~~~~~g~~~~~~~~~ 149 (330) T protein:vir:77 76 TKGSFGKQELEPVKITTIFAESAEVVRL--NPLNYLNTMRTKIAEAIALKFDA----AAIHGIDKPSAFKGYLAETTKVV 149 (330) T ss_pred ccceeeEEEEeEEEEEEeehhhHHHHhc--chHHHHHHHHHHHHHHHHHHHHH----HhhcccCCCCccccccccccccc Confidence 5556666666666555555666665443 33455555556666666555553 33332211111 1 Q ss_pred ceecccccccCCCCCChHHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCH-------H-- Q lcl|Aclame:pro 136 KTTLSGADQWSDPTSNPLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPM-------A-- 203 (309) Q Consensus 136 ~~~lsgt~~Wsd~~sdPi~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~-------~-- 203 (309) ...-+.....+..+.+.+.+|.+.+..+ +..++..+|++++|..|+. ++ ..++.. +..+ . T Consensus 150 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~---lk----d~~G~~-l~~~~~~~~~~~~~ 221 (330) T protein:vir:77 150 SLADTNLTTASGPQGNAYLAVNNALSLLVNSGKKWTGTLLDNVTEPILNT---AV----DGNGRP-LFVESTYTEQVGAI 221 (330) T ss_pred eeecccccccccccchhHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHH---Hh----ccCCce-eecCcccccccccc Confidence 1111111223445678888998887765 6778899999999988765 22 222211 1110 0 Q ss_pred HHHHHhCCCeEEeecceeeccccCCCccccee-cCCcEEEEecCCCCCCcCcceecccccccccccCC--ccccccccCC Q lcl|Aclame:pro 204 FLQELLELDAIYIGEARLNIARPGQNPNLIRA-WGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGS--IADPNIGLRG 280 (309) Q Consensus 204 ~l~~l~gl~~I~v~~a~~~~~~~g~~~~~~~v-~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~--~~d~~~g~~g 280 (309) .=..++|+| |++-+..- .+...+...+ +++-.-+.+....+-.. ..+=..+++.+...... ......-.++ T Consensus 222 ~~~~l~G~P-V~~~~~~p----~~~~~~~~~~~~gd~s~~~i~~~~~~~i-~~~~e~~~~~~~~~~~~~~~~~~~~f~~~ 295 (330) T protein:vir:77 222 REGRILGRP-TYVADNVV----NGTVGNRVVGVMGDFSQVIWGQIGGLSF-DVTDQATLDFGEEQGGVWVPKLISLWQHN 295 (330) T ss_pred CCceeccee-eEEecccc----CCCCCCccEEEEEecceEEEEEecCcEE-EEeecceeeecccccccccccccchhhcC Confidence 112467877 44433321 1111111111 12211111111111000 00001111111111110 0111112345 Q ss_pred ceEEEeecccceeeecchhhhhhhccccC Q lcl|Aclame:pro 281 GQRVRVGESVKELVTAPDLGFFFENAVAA 309 (309) Q Consensus 281 ~~~v~v~~~~~~~v~~~~~G~l~~~~va~ 309 (309) ...+|+...++-.+.-+.+-..++.+.|+ T Consensus 296 ~~~~r~~~r~d~~v~~~~a~~~i~~~~~~ 324 (330) T protein:vir:77 296 MVAVRCEAEFAFMVNDKDAFVKLTDQVAG 324 (330) T ss_pred cEEEEEEEEeccEEecccceEEEEeccCC Confidence 56789999999999999999999999999 No 24 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=97.16 E-value=5.7e-05 Score=43.86 Aligned_cols=276 Identities=12% Similarity=-0.015 Sum_probs=137.4 Q ss_pred CC-C--CCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCcccee Q lcl|Aclame:pro 1 MS-N--APFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDETGS 77 (309) Q Consensus 1 m~-~--~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~~~ 77 (309) ++ + ...+++...+.+-..-++. -+-..+++.+|+.....+||++...-. ..-++.++.......++.+.++. T Consensus 30 ~~~~~~~~liP~~~~~~ii~~~~~~-s~l~~l~~~~~~~~~~~~ip~~~~~~~----a~~v~Eg~~~~~~~~~f~~i~~~ 104 (324) T protein:vir:93 30 MMHEKKDGTLLNDFTTPILQEVMEN-SKIMQLGKYEPMEGTEKKFTFWADKPG----AYWVGEGQKIETSKATWVNATMR 104 (324) T ss_pred cccCCCcceechhHHHHHHHHHHhh-chhhhhcceeeccCCceEEEEEecCcc----eeeecCCccccccccceeEEEEE Confidence 21 1 1255555555444322221 222345677888887788888753211 12345556555556667777777 Q ss_pred eeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecc--cccccCCCCCChHHH Q lcl|Aclame:pro 78 TEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLS--GADQWSDPTSNPLPV 155 (309) Q Consensus 78 ~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~ls--gt~~Wsd~~sdPi~d 155 (309) .+..+-..+++++-.++. .++......+.+.+.|....|..+ +++..=.......+. +.......+.....+ T Consensus 105 ~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~aia~~~d~a~----l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (324) T protein:vir:93 105 AFKLGVILPVTKEFLNYT--YSQFFEEMKPMIAEAFYKKFDEAG----ILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDN 178 (324) T ss_pred eEEEEEeehhhHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHH----hcCCCCCCcCccccccccccceeccccccHHH Confidence 766666667777665543 234455555666666655555433 222110111111111 112233345667888 Q ss_pred HHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeeccccCCCccc Q lcl|Aclame:pro 156 ITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPGQNPNL 232 (309) Q Consensus 156 i~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~~~~g~~~~~ 232 (309) |.+....+ +..++.++|+++.|..|+. + +..++.. ++....-..++|+|-+ +..+ ....... T Consensus 179 i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~---l----~d~~G~~-~~~~~~~~~l~G~PVv-~~~~-----~~~~~~~- 243 (324) T protein:vir:93 179 IIDLEALLEDDELEANAFISKTQNRSLLRK---I----VDPETKE-RIYDRNSDSLDGLPVV-NLKS-----SNLKRGE- 243 (324) T ss_pred HHHHHHhhhhccCCCCEEEEcHHHHHHHHH---h----hCCCCCe-eecCCCCCcccceeeE-eecC-----CCCCcce- Confidence 88877665 6788999999999998764 2 2222222 1211112346788733 2211 1111111 Q ss_pred ceecCCcEEEEecCCCCCCc---CcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhccccC Q lcl|Aclame:pro 233 IRAWGPHASFIYRDRLADTR---NGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVAA 309 (309) Q Consensus 233 ~~v~~~~~~L~~~~~~~~~~---~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va~ 309 (309) -++++..-+++....+-.. .+.++.... ...+... ..-..+...+|+.+.++-.+.-+++-..|++|.++ T Consensus 244 -i~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~----~~~~~~~--~~f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a~~~ 316 (324) T protein:vir:93 244 -LITGDFDKLIYGIPQLIEYKIDETAQLSTVK----NEDGTPV--NLFEQDMVALRATMHVALHIADDKAFAKLVPADKR 316 (324) T ss_pred -EEEEecceEEEEEecCcEEEEeecccccccc----cccccch--hhhhcCcEEEEEEEEeccEEecccceEEEeccccc Confidence 1223221111111111000 000000000 0001100 11234556789999999999999999999999888 No 25 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=97.12 E-value=6.3e-05 Score=43.63 Aligned_cols=274 Identities=12% Similarity=0.003 Sum_probs=139.1 Q ss_pred CCC---CCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCcccee Q lcl|Aclame:pro 1 MSN---APFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDETGS 77 (309) Q Consensus 1 m~~---~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~~~ 77 (309) |+. ...++..+.+.|-..-+...-| -.+++.+|+.....++|++... .....++.++........+.+.+.. T Consensus 30 ~~~~~~~~lip~~~~~~ii~~~~~~s~l-~~~~~~~~~~~~~~~~p~~~~~----~~a~~v~Eg~~~~~~~~~~~~v~~~ 104 (324) T protein:vir:99 30 MMHEKKDGTLLNDFTTPILQEVMENSKI-MRLGKYEPMEGTEKKFTFWADK----PGAYWVGEGQKIETSKATWVNATMR 104 (324) T ss_pred eccCCCcceechhHHHHHHHHHHhhchh-hhhcceeeccCCceEEEEEecC----cceeEeccCccccccccceeEEEEe Confidence 221 1244555545544333322223 2357778888777888887432 1223455666666666677777777 Q ss_pred eeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-cCcccceecccccccCCCCCChHHHH Q lcl|Aclame:pro 78 TEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNS-YAAGNKTTLSGADQWSDPTSNPLPVI 156 (309) Q Consensus 78 ~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~-y~~~~~~~lsgt~~Wsd~~sdPi~di 156 (309) .+..+-..+++++..++. .++......+.+.+.|....|..+-.. ++.+ .+........ ......++.....+| T Consensus 105 ~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~ai~~~~d~~~l~G--~g~~~~~~~~~~~~~-~~~~~~~~~~~~~~i 179 (324) T protein:vir:99 105 AFKLGVILPVTKEFLNYT--YSQFFEEMKPMIAEAFYKKFDEAGILN--QGNNPFGKSIAQSIE-KTNKVIKGDFTQDNI 179 (324) T ss_pred eEEEEEeehhhHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHhhhc--CCCCccCcccccccc-ccceeccccCCHHHH Confidence 777776677777765544 244556666667776666555433211 1111 1111111111 112233455667777 Q ss_pred HHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeeccccCCCcccc Q lcl|Aclame:pro 157 TDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPGQNPNLI 233 (309) Q Consensus 157 ~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~~~~g~~~~~~ 233 (309) .+....+ ...++..+|+++.|..|+. +++ .++.. ++....-..++|+| |++..+ .....+. T Consensus 180 ~~~~~~l~~~~~~~~~~v~n~~~~~~L~~---l~d----~~g~~-~~~~~~~~~l~G~P-Vv~~~~-----~~~~~~~-- 243 (324) T protein:vir:99 180 IDLEALLEDDELEANAFISKTQNRSLLRK---IVD----PETKE-RIYDRNSDTLDGLP-VVNLKS-----SNLKRGE-- 243 (324) T ss_pred HHHHHhhhhccCCCCEEEEcHHHHHHHHH---hhc----CCCce-eecCCCCcccccee-EEeecC-----CCCCcce-- Confidence 7776655 6789999999999998763 222 22221 12111123467877 333221 1111111 Q ss_pred eecCCcEEEEecCCCCCCcCcceeccccccccccc-CCccc-----cccccCCceEEEeecccceeeecchhhhhhhccc Q lcl|Aclame:pro 234 RAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVS-GSIAD-----PNIGLRGGQRVRVGESVKELVTAPDLGFFFENAV 307 (309) Q Consensus 234 ~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~-~~~~d-----~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~v 307 (309) -++++..-+++.... |.+.+...... ....+ ...-..+...+|+.+.++-.+.-+.+-..|+++. T Consensus 244 ~i~gd~~~~~~~~~~---------~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~ 314 (324) T protein:vir:99 244 LITGDFDKLIYGIPQ---------LIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPAD 314 (324) T ss_pred EEEEecccEEEEEec---------CcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEecc Confidence 122221111111100 11221111100 00000 0112345567899999999999999999999988 Q ss_pred cC Q lcl|Aclame:pro 308 AA 309 (309) Q Consensus 308 a~ 309 (309) ++ T Consensus 315 ~~ 316 (324) T protein:vir:99 315 KK 316 (324) T ss_pred CC Confidence 88 No 26 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=97.08 E-value=4.6e-05 Score=44.34 Aligned_cols=283 Identities=9% Similarity=-0.040 Sum_probs=133.2 Q ss_pred CCC---CCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCcccee Q lcl|Aclame:pro 1 MSN---APFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDETGS 77 (309) Q Consensus 1 m~~---~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~~~ 77 (309) |+. .-+.+.+.+.+--+..-.+..+=..+++.+|+.....++|++...-. ..-++.++........+...+.. T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~----a~wv~E~~~~~~s~~~f~~v~l~ 76 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEFTFTLDSD----IDVVAENGKKTHGGLSLEPVTIV 76 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEecCcc----eEEeecCccccccccceeeEEee Confidence 874 23555555544445554444445566788888877788998854311 12344455544445555544444 Q ss_pred eeccchhhcCCHHHHHH-HhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhccc-----ccCcccceecccccccCCCCCC Q lcl|Aclame:pro 78 TEDHGLDAPVPQADIDN-APTNYNPLGHATEQTTNLILLDREARTSKLVFSPN-----SYAAGNKTTLSGADQWSDPTSN 151 (309) Q Consensus 78 ~~e~~L~~~v~~~~~~~-a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~-----~y~~~~~~~lsgt~~Wsd~~sd 151 (309) .+.-+-..+++.|-.++ ....++..+...+.+.+.+....|..+-...-... ..+..+...+++.......+.+ T Consensus 77 ~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (303) T protein:vir:97 77 PIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVTQVVKFTESED 156 (303) T ss_pred eEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccccccccccccccccccccccccc Confidence 44444444555554432 12334455556666677666666554433221100 0111122222222222234556 Q ss_pred hHHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHH-----HHHHHhCCCeEEeecceeec Q lcl|Aclame:pro 152 PLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMA-----FLQELLELDAIYIGEARLNI 223 (309) Q Consensus 152 Pi~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~-----~l~~l~gl~~I~v~~a~~~~ 223 (309) +..||.+....+ +..|+..+|+++.+.+|+. +++. ++.. +..++ .-..++|+| |++-+..-.. T Consensus 157 ~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~---lkd~----~g~~-~~~~~~~~~~~~~~l~G~P-v~~s~~v~~~ 227 (303) T protein:vir:97 157 ADANIEAAVNLIQGAEGVVTGLAMDTEFSTALAK---VTNG----EMGP-KMYPELAWGANPDSINGLK-SSVNTTVGAG 227 (303) T ss_pred hHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHH---hhcc----CCCe-EEecCccCCCCCceeccee-eEEecccCCc Confidence 788998887765 7889999999999988863 2222 1111 11111 112477877 4433221100 Q ss_pred cccCCCcccceecCCcEE-EEecCCCCCCcCcceeccccccccc--ccCCccccccccCCceEEEeecccceeeecchhh Q lcl|Aclame:pro 224 ARPGQNPNLIRAWGPHAS-FIYRDRLADTRNGTTFGLTAQWGDR--VSGSIADPNIGLRGGQRVRVGESVKELVTAPDLG 300 (309) Q Consensus 224 ~~~g~~~~~~~v~~~~~~-L~~~~~~~~~~~~~t~G~T~~~~~~--~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G 300 (309) ... ......-+++|..- +.+... . +.+.+.... .-+. .. .+-..+...+|+.+.++-.+.-+.+= T Consensus 228 ~~~-~~~~~~~~~Gdf~~~~~~~~~-----~----~~~~~~~~~~~~d~~-~~-~~~~~n~~~~r~~~r~~~~v~~p~af 295 (303) T protein:vir:97 228 ADE-AESKDLVIIGDFESMFKWGYA-----K----QIPMEIIKYGDPDNS-GK-DLKGYNQIYLRAEAYIGWGILDAKSF 295 (303) T ss_pred ccc-CCCccEEEEeeccccEEEEEe-----c----CcEEEEeeccCCCCc-ch-hhhhcCcEEEEEEEEeccEeecccce Confidence 000 01111123333211 111100 0 111111110 0000 00 11223444577777777777777666 Q ss_pred hhhhcccc Q lcl|Aclame:pro 301 FFFENAVA 308 (309) Q Consensus 301 ~l~~~~va 308 (309) ..++++=- T Consensus 296 ~~l~~~~~ 303 (303) T protein:vir:97 296 ARVTKGEV 303 (303) T ss_pred EEeeCCCC Confidence 66665544 No 27 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=97.00 E-value=0.00011 Score=42.24 Aligned_cols=274 Identities=12% Similarity=0.009 Sum_probs=137.8 Q ss_pred CCC---CCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCcccee Q lcl|Aclame:pro 1 MSN---APFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDETGS 77 (309) Q Consensus 1 m~~---~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~~~ 77 (309) |+. ...++..+.+.|...-++..-| -.+++.+|+.....+||++... .....++.++........+.+.+.. T Consensus 30 ~~~~~~~~liP~~~~~~ii~~~~~~s~l-~~~~~~~~~~~~~~~~p~~~~~----~~a~~v~Eg~~~~~~~~~~~~v~~~ 104 (324) T protein:vir:10 30 MMHEKKDGTLLNDFTTPILQEVMENSKI-MQLGKYEPMEGTEKKFTFWADK----PGAYWVGEGQKIETSKATWVNATMR 104 (324) T ss_pred eccCCCcceechhHHHHHHHHHHhhchh-hhhcceeeccCCceEEEEEeCC----cceeEeccCccccccccceeEEEEe Confidence 221 1245555555544333332223 3357888888777888887532 1123355566666666677777777 Q ss_pred eeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-cCcccceecccccccCCCCCChHHHH Q lcl|Aclame:pro 78 TEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNS-YAAGNKTTLSGADQWSDPTSNPLPVI 156 (309) Q Consensus 78 ~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~-y~~~~~~~lsgt~~Wsd~~sdPi~di 156 (309) .+..+...+++++-.++. .++......+.+.+.+....|..+-.. ++.+ .+........ .......+.....+| T Consensus 105 ~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~ai~~~~d~a~l~G--~g~~~~~~~i~~~~~-~~~~~~~~~~t~~~i 179 (324) T protein:vir:10 105 AFKLGVILPVTKEFLNYT--YSQFFEEMKPMIAEAFYKKFDEAGILN--QGNNPFGKSIAQSIE-KTNKVIKGDFTQDNI 179 (324) T ss_pred eEEEEEeehhhHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHhhhc--CCCCccCcccccccc-ccceeccccCCHHHH Confidence 777776677777765543 244555555666666655554433211 1111 1111111111 112233455677888 Q ss_pred HHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeeccccCCCcccc Q lcl|Aclame:pro 157 TDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPGQNPNLI 233 (309) Q Consensus 157 ~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~~~~g~~~~~~ 233 (309) .+....+ +..++..+|+++.|..|+. +++ .++.. +.....-..++|+|-+ +..+. ...++. T Consensus 180 ~~~~~~l~~~~~~~~~~v~n~~~~~~L~~---l~d----~~g~~-~~~~~~~~~l~G~PV~-~~~~~-----~~~~~~-- 243 (324) T protein:vir:10 180 IDLEALLEDDELEANAFISKTQNRSLLRK---IVD----PETKE-RIYDRNSDTLDGLPVV-NLKSS-----NLKRGE-- 243 (324) T ss_pred HHHHHhhhhccCCCCEEEEcHHHHHHHHH---hhc----cCCce-eecCCCCccccceeEE-eecCC-----CCCcce-- Confidence 8877665 6789999999999998763 222 22221 1111122347788733 32111 111111 Q ss_pred eecCCcEEEEecCCCCCCcCcceeccccccccccc-CCccc-----cccccCCceEEEeecccceeeecchhhhhhhccc Q lcl|Aclame:pro 234 RAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVS-GSIAD-----PNIGLRGGQRVRVGESVKELVTAPDLGFFFENAV 307 (309) Q Consensus 234 ~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~-~~~~d-----~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~v 307 (309) -++++..-+++... . |.+.+..+... ....+ ...-..+...+|+.+.++-.+.-+.+=..|+++. T Consensus 244 ~~~gd~~~~~~~~~-----~----~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~ 314 (324) T protein:vir:10 244 LITGDFDKLIYGIP-----Q----LIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPAD 314 (324) T ss_pred EEEEecccEEEEEe-----c----CcEEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEecc Confidence 12222111111110 0 11111111000 00000 0112345567889999999999899888899988 Q ss_pred cC Q lcl|Aclame:pro 308 AA 309 (309) Q Consensus 308 a~ 309 (309) ++ T Consensus 315 ~~ 316 (324) T protein:vir:10 315 KK 316 (324) T ss_pred CC Confidence 88 No 28 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=96.92 E-value=0.00013 Score=41.81 Aligned_cols=286 Identities=11% Similarity=0.008 Sum_probs=129.0 Q ss_pred CCCCC-----CCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCccc Q lcl|Aclame:pro 1 MSNAP-----FPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDET 75 (309) Q Consensus 1 m~~~~-----f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~ 75 (309) |+... +.+-+.+.+--+-.-...-+=.++++.+|+.....++|++...-.. .-++.++.....+..+.+.+ T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~~~~ip~~~~~~~a----~wv~Eg~~~~~s~~~f~~v~ 76 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRA----KIVGEGEVKPSASVDVSAFT 76 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcce----EEeeCCccccccccceeeeE Confidence 88532 4444444332232222223335567888888877888887542111 22344555555555666666 Q ss_pred eeeeccchhhcCCHHHHHHHhhcCC----HHHHHHHHHHHHHHHHHHHHHHHHhhcccccCc--ccceeccccccc---C Q lcl|Aclame:pro 76 GSTEDHGLDAPVPQADIDNAPTNYN----PLGHATEQTTNLILLDREARTSKLVFSPNSYAA--GNKTTLSGADQW---S 146 (309) Q Consensus 76 ~~~~e~~L~~~v~~~~~~~a~~~~d----~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~--~~~~~lsgt~~W---s 146 (309) ...+.-+-..++++|-.++. ..+ .+..-.+.+.+.|....| ..++++.+-.. ......+..... . T Consensus 77 l~~~kl~~~~~iS~ell~~s--~~~~~~~l~~~i~~~la~ai~~~~d----~a~~~G~~~~~~~~~~~~~~~~~~~~~~~ 150 (315) T protein:vir:80 77 AQPIKVVTQQRVSDEFMWAD--ADYRLGVLQDLISPALGASIGRAVD----LIAFHGIDPATGKAASAVHTSLNKTKNIV 150 (315) T ss_pred eeeeeEEeeehhhHHHhhcC--chhHHHHHHHHHHHHHHHHHHHHHh----hheeeccCCCCCcccccccccccccccee Confidence 55555554556666644321 112 122222333333333332 23333321111 111111111111 1 Q ss_pred CCCCChHHHHHHHHHHh----CCCCcEEEeCHHHHHHHhcCHHHHHHh-ccCCCccc--ccCHHHHHHHhCCCeEEeecc Q lcl|Aclame:pro 147 DPTSNPLPVITDALDSV----ILRPNIGVLGRRTATILRRHPKIVKAY-NGSLGDEG--MVPMAFLQELLELDAIYIGEA 219 (309) Q Consensus 147 d~~sdPi~di~~~~~~~----g~~Pn~~v~~~~~~~~l~~~~~i~~~~-~~~~~~~~--~vt~~~l~~l~gl~~I~v~~a 219 (309) +.+.+...||.+....+ ...++..+|+++++.+|+. ++..- +..+...- .+....-..++|+| |++.+. T Consensus 151 ~~~~~~~~d~~~~~~~~~~~~~~~~~~~imn~~~~~~L~~---l~~~~g~~~~g~~~~~~~~~g~~~tl~G~P-V~~~~~ 226 (315) T protein:vir:80 151 DATDSATADLVKAVGLIAGAGLQVPNGVALDPAFSFALST---EVYPKGSPLAGQPMYPAAGFAGLDNWRGLN-VGASST 226 (315) T ss_pred eccccchHHHHHHHHHHhhccCccceEEEEcHHHHHHHHH---HhhccCCcccccccccccccCCCceeccee-eEecCc Confidence 23455667777776554 3456789999999988864 22110 01111110 01111123578887 554443 Q ss_pred eeeccccCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchh Q lcl|Aclame:pro 220 RLNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDL 299 (309) Q Consensus 220 ~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~ 299 (309) .-.....+......-+++|-.-+.+.... +.++-..- +.+.. +. . .++...+...+|+...++-.|.-+++ T Consensus 227 ~~~~~~~~~~~~~~~~~GDfs~~~~g~~~-----~~~i~i~~-~~~~~-~~-~-~~~~~~~~v~~r~~~r~~~~v~~~~a 297 (315) T protein:vir:80 227 VSGAPEMSPASGVKAIVGDFSRVHWGFQR-----NFPIELIE-YGDPD-QT-G-RDLKGHNEVMVRAEAVLYVAIESLDS 297 (315) T ss_pred CCcccccccccccEEEEeecccEEEEEec-----CeeEEEec-ccccc-Cc-c-cchhhcCcEEEEEEEEecceeecccc Confidence 21111111111111122322111111100 11111000 00000 00 0 12334566778999999999999999 Q ss_pred hhhhhccccC Q lcl|Aclame:pro 300 GFFFENAVAA 309 (309) Q Consensus 300 G~l~~~~va~ 309 (309) -..++++.|. T Consensus 298 ~~~l~~~~a~ 307 (315) T protein:vir:80 298 FAVVKEKAAP 307 (315) T ss_pred eEEEeeccCC Confidence 9999988888 No 29 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=96.86 E-value=0.00015 Score=41.50 Aligned_cols=273 Identities=13% Similarity=0.004 Sum_probs=138.2 Q ss_pred CCC--CCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCccceee Q lcl|Aclame:pro 1 MSN--APFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDETGST 78 (309) Q Consensus 1 m~~--~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~~~~ 78 (309) ++. ...++..+...|-..-++..-| ..+++.+|+.....++|++..... ..-++.++.....+..+.+.++.+ T Consensus 31 ~~~~~~~~iP~~~~~~ii~~~~~~s~l-~~~~~~~~~~~~~~~ip~~~~~~~----a~~v~Eg~~~~~~~~~f~~v~~~~ 105 (324) T protein:vir:97 31 MHEKKDGTLMNEFTTPILQEVMENSKI-MQLGKYEPMEGTEKKFTFWADKPG----AYWVGEGQKIETSKATWVNATMRA 105 (324) T ss_pred ccCCCcceechhHHHHHHHHHHhhcch-hhhcceeeccCCceEEEEEecCcc----eeEeccCccccccccceeEEEEee Confidence 222 2355555544443332222223 345788888877788888753211 223455666666667777777777 Q ss_pred eccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecc--cccccCCCCCChHHHH Q lcl|Aclame:pro 79 EDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLS--GADQWSDPTSNPLPVI 156 (309) Q Consensus 79 ~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~ls--gt~~Wsd~~sdPi~di 156 (309) +..+...+++++..++. .++......+.+.+.|....|..+ +++..-.......++ +......++.....+| T Consensus 106 ~k~~~~~~is~ell~ds--~~~l~~~i~~~l~~aia~~~d~a~----l~G~g~~~~~~gi~~~~~~~~~~~~~~~~~~~i 179 (324) T protein:vir:97 106 FKLGVILPVTKEFLNYT--YSQFFEEMKPMIAEAFYKKFDEAG----ILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNI 179 (324) T ss_pred EEEEEeehhhHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHh----hccCCCCccCccccccccccceeccccCCHHHH Confidence 77777777777755543 345556666667777666555433 222211111111111 1112223344556667 Q ss_pred HHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeeccccCCCcccc Q lcl|Aclame:pro 157 TDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPGQNPNLI 233 (309) Q Consensus 157 ~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~~~~g~~~~~~ 233 (309) .+...++ ++.+...+|++..|..|+. + +..++.. +.....-..++|+| |++..+. ....+. T Consensus 180 ~~~~~~l~~~~~~~~~~v~n~~~~~~L~~---l----kd~~g~~-~~~~~~~~tl~G~P-V~~~~~~-----~~~~~~-- 243 (324) T protein:vir:97 180 IDLEALLEDDELEANAFISKTQNRSLLRK---I----VDPETKE-RIYDRNSDTLDGLP-VVNLKSS-----NLKRGE-- 243 (324) T ss_pred HHHHHhhhhccCCCCEEEEcHHHHHHHHH---h----hcCCCce-eecCCCCcccccee-eEeecCC-----CCCcce-- Confidence 6665554 7789999999999988763 2 2122222 12111123467887 4332221 111111 Q ss_pred eecCCcEEEEecCCCCCCcCcceeccccccccccc-CCc--ccc---ccccCCceEEEeecccceeeecchhhhhhhccc Q lcl|Aclame:pro 234 RAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVS-GSI--ADP---NIGLRGGQRVRVGESVKELVTAPDLGFFFENAV 307 (309) Q Consensus 234 ~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~-~~~--~d~---~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~v 307 (309) -++++..-+++.... |.+.+...... ... .+. ..-..+...+|+.+.++-.+.-+++-..|+++. T Consensus 244 ~~~gd~~~~~i~~~~---------~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~ 314 (324) T protein:vir:97 244 LITGDFDKLIYGIPQ---------LIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPAD 314 (324) T ss_pred EEEEecccEEEEEec---------CcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEecc Confidence 122322111111100 11221111000 000 000 112244567899999999999999999999988 Q ss_pred cC Q lcl|Aclame:pro 308 AA 309 (309) Q Consensus 308 a~ 309 (309) ++ T Consensus 315 ~~ 316 (324) T protein:vir:97 315 KK 316 (324) T ss_pred CC Confidence 87 No 30 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=96.85 E-value=0.00029 Score=39.98 Aligned_cols=253 Identities=13% Similarity=0.065 Sum_probs=129.8 Q ss_pred CCCCC------CCcchhhHHHHHhhcchhhhhhhhC-Ccccc-------ccccceeEEechhHhhhchhHhhcccccccc Q lcl|Aclame:pro 1 MSNAP------FPIDPELTAIAIAYRNGRMISDEVL-PRVPV-------GKQEFKFWKYDLAQGFTVPETLVGRKSKPNE 66 (309) Q Consensus 1 m~~~~------f~~dp~LT~~a~~y~n~~~ig~~lf-P~v~v-------~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ 66 (309) ||+.. +.+. ++.++.+ ..+....+| +.+.+ .....++|++.... .....+.+..... T Consensus 1 MA~~~T~~~~~~iPe-v~s~~v~----~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~----~a~~v~eg~~i~~ 71 (272) T protein:vir:98 1 MAVGTTKMAQMLDPE-VLADMID----AEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIG----DAEDVAEGEAIPM 71 (272) T ss_pred CCCccccchheechH-HHHHHHH----HHHHHHhhhhccccccccccCCCCCEEEEEEecCCC----CcccccCCCcccc Confidence 99642 3222 2233332 222222222 22211 22234567764311 1222344444444 Q ss_pred cccCcCccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccC Q lcl|Aclame:pro 67 VEFSATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWS 146 (309) Q Consensus 67 ve~~~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Ws 146 (309) .+++....+..++..+-.-.++++... ++..|+.....+.+...+....|..+...+.... ...+++ T Consensus 72 ~~~~~~~~~~~~~~~~~~~~itd~~~~--~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~-------~~~~~~---- 138 (272) T protein:vir:98 72 TQLGFKKTTMTIKKAGKGVEITDEAIL--SGYGDPVGQAAKQIVEAIDHKVDADVLDALSKST-------QTVEAT---- 138 (272) T ss_pred cccccceEEEEeeeeeeeeeecHHHHh--hccccHHHHHHHHHHHHHHHHHHHHHHHHhcccc-------cccccc---- Confidence 455666666667666655566666543 3456788888888888887777766655432211 111111 Q ss_pred CCCCChHHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeec Q lcl|Aclame:pro 147 DPTSNPLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNI 223 (309) Q Consensus 147 d~~sdPi~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~ 223 (309) .+ ..+|.++...+ +..+..++|+++++..|+.+..+ ...+.+....+.+..-++..++|++ |++-+.. T Consensus 139 -~t---~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~-~~~~~~~~~~~~~~~g~ig~i~G~~-Vi~s~~~--- 209 (272) T protein:vir:98 139 -AT---VDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAK-EWLGATEVGANRVVSGVYGEVLGVQ-IVRSRKC--- 209 (272) T ss_pred -cC---HHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhccc-cccccccccccccccccchhhcCee-EEEcCCC--- Confidence 12 34455554444 66789999999999999876433 2222222222334344566788986 5554331 Q ss_pred cccCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhh Q lcl|Aclame:pro 224 ARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFF 303 (309) Q Consensus 224 ~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~ 303 (309) + ....++|+...+.++. +. +. .++...-...+...+++...+--+++-++....+ T Consensus 210 --p---~~t~~~~~~~a~~~~~----------------~~-~~---~ve~~r~~~~~~~~i~~~~~~~~~v~~~~~vv~~ 264 (272) T protein:vir:98 210 --P---KGTAYMVRKGALRIML----------------KR-NT---MVETDRDITKAINQIVANKHYGVYLYKAEKAVKI 264 (272) T ss_pred --C---cceEEEEcCCeEEEEe----------------cC-Cc---eeeeccccccceeEEEEEEEEEEEEEcCCceEEE Confidence 1 0112333333322211 10 00 1111111123556677777777788888888888 Q ss_pred hccccC Q lcl|Aclame:pro 304 ENAVAA 309 (309) Q Consensus 304 ~~~va~ 309 (309) +-+-|| T Consensus 265 t~~~a~ 270 (272) T protein:vir:98 265 TLKDAA 270 (272) T ss_pred Eecccc Confidence 888777 No 31 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=96.85 E-value=0.00029 Score=39.98 Aligned_cols=253 Identities=13% Similarity=0.065 Sum_probs=129.8 Q ss_pred CCCCC------CCcchhhHHHHHhhcchhhhhhhhC-Ccccc-------ccccceeEEechhHhhhchhHhhcccccccc Q lcl|Aclame:pro 1 MSNAP------FPIDPELTAIAIAYRNGRMISDEVL-PRVPV-------GKQEFKFWKYDLAQGFTVPETLVGRKSKPNE 66 (309) Q Consensus 1 m~~~~------f~~dp~LT~~a~~y~n~~~ig~~lf-P~v~v-------~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ 66 (309) ||+.. +.+. ++.++.+ ..+....+| +.+.+ .....++|++.... .....+.+..... T Consensus 1 MA~~~T~~~~~~iPe-v~s~~v~----~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~----~a~~v~eg~~i~~ 71 (272) T protein:vir:30 1 MAVGTTKMAQMLDPE-VLADMID----AEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIG----DAEDVAEGEAIPM 71 (272) T ss_pred CCCccccchheechH-HHHHHHH----HHHHHHhhhhccccccccccCCCCCEEEEEEecCCC----CcccccCCCcccc Confidence 99642 3222 2233332 222222222 22211 22234567764311 1222344444444 Q ss_pred cccCcCccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccC Q lcl|Aclame:pro 67 VEFSATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWS 146 (309) Q Consensus 67 ve~~~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Ws 146 (309) .+++....+..++..+-.-.++++... ++..|+.....+.+...+....|..+...+.... ...+++ T Consensus 72 ~~~~~~~~~~~~~~~~~~~~itd~~~~--~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~-------~~~~~~---- 138 (272) T protein:vir:30 72 TQLGFKKTTMTIKKAGKGVEITDEAIL--SGYGDPVGQAAKQIVEAIDHKVDADVLDALSKST-------QTVEAT---- 138 (272) T ss_pred cccccceEEEEeeeeeeeeeecHHHHh--hccccHHHHHHHHHHHHHHHHHHHHHHHHhcccc-------cccccc---- Confidence 455666666667666655566666543 3456788888888888887777766655432211 111111 Q ss_pred CCCCChHHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeec Q lcl|Aclame:pro 147 DPTSNPLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNI 223 (309) Q Consensus 147 d~~sdPi~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~ 223 (309) .+ ..+|.++...+ +..+..++|+++++..|+.+..+ ...+.+....+.+..-++..++|++ |++-+.. T Consensus 139 -~t---~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~-~~~~~~~~~~~~~~~g~ig~i~G~~-Vi~s~~~--- 209 (272) T protein:vir:30 139 -AT---VDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAK-EWLGATEVGANRVVSGVYGEVLGVQ-IVRSRKC--- 209 (272) T ss_pred -cC---HHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhccc-cccccccccccccccccchhhcCee-EEEcCCC--- Confidence 12 34455554444 66789999999999999876433 2222222222334344566788986 5554331 Q ss_pred cccCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhh Q lcl|Aclame:pro 224 ARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFF 303 (309) Q Consensus 224 ~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~ 303 (309) + ....++|+...+.++. +. +. .++...-...+...+++...+--+++-++....+ T Consensus 210 --p---~~t~~~~~~~a~~~~~----------------~~-~~---~ve~~r~~~~~~~~i~~~~~~~~~v~~~~~vv~~ 264 (272) T protein:vir:30 210 --P---KGTAYMVRKGALRIML----------------KR-NT---MVETDRDITKAINQIVANKHYGVYLYKAEKAVKI 264 (272) T ss_pred --C---cceEEEEcCCeEEEEe----------------cC-Cc---eeeeccccccceeEEEEEEEEEEEEEcCCceEEE Confidence 1 0112333333322211 10 00 1111111123556677777777788888888888 Q ss_pred hccccC Q lcl|Aclame:pro 304 ENAVAA 309 (309) Q Consensus 304 ~~~va~ 309 (309) +-+-|| T Consensus 265 t~~~a~ 270 (272) T protein:vir:30 265 TLKDAA 270 (272) T ss_pred Eecccc Confidence 888777 No 32 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=96.74 E-value=0.00037 Score=39.37 Aligned_cols=259 Identities=10% Similarity=0.012 Sum_probs=128.9 Q ss_pred CCCCC-CCcc---h-hhHHHHHhhcchhhhhhhhCCcc-cccccc---ceeEEechhHhhhchhHhhcccccccccccCc Q lcl|Aclame:pro 1 MSNAP-FPID---P-ELTAIAIAYRNGRMISDEVLPRV-PVGKQE---FKFWKYDLAQGFTVPETLVGRKSKPNEVEFSA 71 (309) Q Consensus 1 m~~~~-f~~d---p-~LT~~a~~y~n~~~ig~~lfP~v-~v~~~~---~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~ 71 (309) |+++. ...| | +++++.+.--...++--.+++.. ....+. .++|+|... -.......+.....-+++. T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~----g~~~~~~eg~~i~~~~it~ 76 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYS----GDAQVVAEGEKIPTDILET 76 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccC----CCcccccCCCccccccccc Confidence 99853 2222 3 33343322111111111122111 111222 345555321 1112233344444445556 Q ss_pred CccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCC Q lcl|Aclame:pro 72 TDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSN 151 (309) Q Consensus 72 ~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sd 151 (309) .+.+..++..+-...+++++ .++...||...+.+.+...+....+..+...+.... .+. +.+.. T Consensus 77 ~~~~~~i~~~~~~~~i~D~~--~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~-------~~~-------~~~~~ 140 (274) T protein:vir:93 77 KKREAKIRKIAKGTSITDEA--LLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK-------LTV-------NADIT 140 (274) T ss_pred ceeEEEeeeecccccccHHH--HHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccc-------ccc-------ccccc Confidence 66666666666545555554 444567888888888888887777766655442211 111 11222 Q ss_pred hHHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeeccccCC Q lcl|Aclame:pro 152 PLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPGQ 228 (309) Q Consensus 152 Pi~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~~~~g~ 228 (309) ....|.+++..+ +..+..+++++.++..|++++.+. .+..+....+.+..-++..++|++ |++-+.. + T Consensus 141 ~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~-f~~~s~~g~~~~~~G~ig~~~G~~-Vi~s~~~-----p-- 211 (274) T protein:vir:93 141 KLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTN-FTRATELGDDIIVKGAFGEALGAI-IVRTNKL-----E-- 211 (274) T ss_pred CHHHHHHHHHHhhhccCCccEEEeCHHHHHHHHhhhhhc-ccccccccccceeecccceecCee-EEEcCCC-----C-- Confidence 355666776666 457899999999999998876433 222232223445555677888876 4442211 0 Q ss_pred CcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhcccc Q lcl|Aclame:pro 229 NPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVA 308 (309) Q Consensus 229 ~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va 308 (309) ....++|+..+ +|+-.+. . -.++...-...+...+++...+.-.++-+..-..++.+-| T Consensus 212 -~~t~~l~~~ga----------------i~~~~~~-~---~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~t~~~~ 270 (274) T protein:vir:93 212 -AGTAILAKKGA----------------VKLILKR-D---FFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred -cceEEEEeCCe----------------EEEEecC-C---cccccccchhhcccEEEEEEEEEEEEEcCCceEEEeeCcc Confidence 01112333222 2221110 0 0111111112345567777777777777777777776666 Q ss_pred C Q lcl|Aclame:pro 309 A 309 (309) Q Consensus 309 ~ 309 (309) + T Consensus 271 s 271 (274) T protein:vir:93 271 S 271 (274) T ss_pred c Confidence 6 No 33 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=96.70 E-value=0.00031 Score=39.82 Aligned_cols=282 Identities=11% Similarity=0.041 Sum_probs=131.1 Q ss_pred CCCCC-----CCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCccc Q lcl|Aclame:pro 1 MSNAP-----FPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDET 75 (309) Q Consensus 1 m~~~~-----f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~ 75 (309) ||... .++...+..+-..-++..-| .++++.+|+...+.++|++..... ..-++.++.......++.+.. T Consensus 1 ma~~t~~~G~lip~~~~~~ii~~l~~~s~i-~~l~~~~~~~~~~~~~p~~~~~~~----a~wv~Eg~~~~~s~~~f~~v~ 75 (300) T protein:vir:95 1 MSEAQLSKGNLFNPELVTKVINKVKGHSSI-AKLSPQKPIPFNGQREFVFDFDSD----IDIVAENGKKTHGGVSLDPVT 75 (300) T ss_pred CcccccCCcceechhhHHHHHHHHHhhhhh-hhhcceeeccCCceEEEEEecCcc----eEEeeCCcccccccccceeeE Confidence 88542 44444444443322333333 357888888888888998764311 123444555555556666666 Q ss_pred eeeeccchhhcCCHHHHHHH-hhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCc--ccceeccc-cc-ccCCCCC Q lcl|Aclame:pro 76 GSTEDHGLDAPVPQADIDNA-PTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAA--GNKTTLSG-AD-QWSDPTS 150 (309) Q Consensus 76 ~~~~e~~L~~~v~~~~~~~a-~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~--~~~~~lsg-t~-~Wsd~~s 150 (309) ...+.-+-..+++++-.+.. .+.++..+.-.+.+.+.+....+..+-...-....-+. .+.....+ +. .-..... T Consensus 76 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~ 155 (300) T protein:vir:95 76 IVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQTVPFKDT 155 (300) T ss_pred eeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccceeeccccc Confidence 65555555566666654321 22344455555566666666555444322100000000 00000000 00 0111245 Q ss_pred ChHHHHHHHHHH---hCCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcc---cccCHHHHHHHhCCCeEEeecceeecc Q lcl|Aclame:pro 151 NPLPVITDALDS---VILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDE---GMVPMAFLQELLELDAIYIGEARLNIA 224 (309) Q Consensus 151 dPi~di~~~~~~---~g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~---~~vt~~~l~~l~gl~~I~v~~a~~~~~ 224 (309) ++..+|...... .+..|+..+|+++.+.+|+. ++..++.. ...+-..-..++|+| |++.+..- T Consensus 156 ~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~-------lkd~~G~~i~~~~~~~~~~~~l~G~P-v~~s~~v~--- 224 (300) T protein:vir:95 156 NPDESMEDAVGMIDGSERDITGAILDPIFTTALSK-------MKNAEGGKLYPELAWGGVPDAINGLA-VDKNRTVS--- 224 (300) T ss_pred chHHHHHHHHHHhhhcCCCccEEEECHHHHHHHHH-------hhccCCCeeccCccccCCCceeccee-eEEecCCC--- Confidence 666777777654 47889999999999988754 22222211 000111224578887 55444321 Q ss_pred ccCCCcc-cceecCCcE-EEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhh Q lcl|Aclame:pro 225 RPGQNPN-LIRAWGPHA-SFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFF 302 (309) Q Consensus 225 ~~g~~~~-~~~v~~~~~-~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l 302 (309) .+.+.. ..-++++-. .+.+... .+.++-.. ++.+.-+.. .++-..+...+|+.+.++=.|..+.+-.. T Consensus 225 -~~~~~~~~~~~~GDf~~~~~~~~~-----~~~~~~v~-~~~~~d~~~---~~~f~~~~v~~r~~~r~d~~v~~~~a~~~ 294 (300) T protein:vir:95 225 -YSQTDPKNTAIVGDFETMFKWGYA-----KEVPMEII-KYGDPDNSG---RDLKGYNQIYIRCEAYIGWGIMDAASFAR 294 (300) T ss_pred -CCCCCCccEEEEeeccceEEEEEe-----cccEEEEe-eccCCCCcc---hhhhhcCcEEEEEEEeecceeecccceEE Confidence 111111 111223311 1111100 01111100 011100000 01123444567888888877777777777 Q ss_pred hhccccC Q lcl|Aclame:pro 303 FENAVAA 309 (309) Q Consensus 303 ~~~~va~ 309 (309) |+++ || T Consensus 295 l~~~-~g 300 (300) T protein:vir:95 295 IVKT-GG 300 (300) T ss_pred EecC-CC Confidence 6554 33 No 34 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=96.47 E-value=0.0006 Score=38.26 Aligned_cols=259 Identities=8% Similarity=-0.007 Sum_probs=126.3 Q ss_pred CCCCC-CCcchhhHHHHHhhcchhhhhhhhC-Ccccc----ccccc---eeEEechhHhhhchhHhhcccccccccccCc Q lcl|Aclame:pro 1 MSNAP-FPIDPELTAIAIAYRNGRMISDEVL-PRVPV----GKQEF---KFWKYDLAQGFTVPETLVGRKSKPNEVEFSA 71 (309) Q Consensus 1 m~~~~-f~~dp~LT~~a~~y~n~~~ig~~lf-P~v~v----~~~~~---k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~ 71 (309) |+|.. ...|-+-=.+--.|...++....+| |.+.+ ..+.+ ++|.|... -.......+.....-+++. T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~i----g~a~~~~~g~~i~~~~lt~ 76 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYS----GDAQVVAEGEKIPTDILET 76 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCC----CccccccCCCccchhhccc Confidence 99853 2222211112222333333333222 22222 12223 34544321 1122233344443445555 Q ss_pred CccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCC Q lcl|Aclame:pro 72 TDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSN 151 (309) Q Consensus 72 ~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sd 151 (309) ...+..++..+-...+++++ ..+..-||.+.+.+.+...+....+..+...+... +.+-+.+.. T Consensus 77 ~~~~~~i~~~~~~~~i~D~~--~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a--------------~~~~~~~a~ 140 (274) T protein:vir:12 77 KKREAKIRKIAKGTSITDEA--LLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGA--------------KLTVNADIT 140 (274) T ss_pred ceeeEEeeeecceeeecHHH--HHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhcc--------------ccccccccc Confidence 66666666665555555544 34456688888888777777666666555444221 112223334 Q ss_pred hHHHHHHHHHHhC---CCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeeccccCC Q lcl|Aclame:pro 152 PLPVITDALDSVI---LRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPGQ 228 (309) Q Consensus 152 Pi~di~~~~~~~g---~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~~~~g~ 228 (309) ....|.++...+| ..+..+++++.++..|++++.+. .+..+....+++..-.+..++|++ |++-+.. + T Consensus 141 ~~d~i~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~-fv~~s~~g~~~~~~G~ig~~~G~~-Vi~s~~~-----p-- 211 (274) T protein:vir:12 141 KLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTN-FTRATELGDDIIVKGAFGEALGAI-IVRSNKL-----E-- 211 (274) T ss_pred CHHHHHHHHHHhccccccccEEEeCHHHHHHHHhhhhhh-ccccccccccceecccceeecCee-EEEeCCC-----C-- Confidence 4667777777775 47899999999999999876432 333333333455556777888875 4443211 0 Q ss_pred CcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhcccc Q lcl|Aclame:pro 229 NPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVA 308 (309) Q Consensus 229 ~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va 308 (309) ....++++..++-.+... +. .++...-...+...+.....+--+++-+..-..++-+-| T Consensus 212 -~~t~~l~~~gA~~~~~~~----------~~----------~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 270 (274) T protein:vir:12 212 -AGTAILAKKGAVKLILKR----------DF----------FLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred -cceEEEEeccceeeeecC----------Cc----------eeccccchhhcccEEEeeeEEEEEEEcCCceEEEEcCCc Confidence 011233333322221100 00 111111112344455566666556666655556655555 Q ss_pred C Q lcl|Aclame:pro 309 A 309 (309) Q Consensus 309 ~ 309 (309) + T Consensus 271 ~ 271 (274) T protein:vir:12 271 S 271 (274) T ss_pred c Confidence 5 No 35 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=96.32 E-value=0.00037 Score=39.38 Aligned_cols=272 Identities=11% Similarity=0.052 Sum_probs=130.8 Q ss_pred CCCC--------------CCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccc Q lcl|Aclame:pro 1 MSNA--------------PFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNE 66 (309) Q Consensus 1 m~~~--------------~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ 66 (309) |+.. ..++......+-..-++..-| ..+++.+|+.....++|++.....+ ..++.++.... T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l-~~~~~~~~~~~~~~~ip~~~~~~~a----~~v~E~~~~~~ 75 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAI-MKLAKNEPMTAQKKKFTYLAKGVGA----YWVSETERIQT 75 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccch-hhhcceeeccCCceEEEEEeCCcce----EEeecCccccc Confidence 4422 234444333443222222222 2356778887777788888543222 22344444444 Q ss_pred cccCcCccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-------cCcccceec Q lcl|Aclame:pro 67 VEFSATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNS-------YAAGNKTTL 139 (309) Q Consensus 67 ve~~~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~-------y~~~~~~~l 139 (309) ....+.+.+...+..+-..+++++-..+ +.++....-.+.+.+.+....|..+ +++.. .+....... T Consensus 76 ~~~~~~~i~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~l~~~ia~~~d~~~----l~G~g~~~~~~~~~~~~~~~~ 149 (304) T protein:vir:10 76 SKPEYAQAEMEAKKIGVIIPLSKEFLKW--TAKDFFNEVKPLIAEAFYKAFDQAV----IFGTKSPYNTSTSGKPLVEGA 149 (304) T ss_pred ccceeeEEEEEEEEEEEeehhhHHHHhc--chHHHHHHHHHHHHHHHHHHHHhhh----eeccCCCcccccccccccccc Confidence 4555555566666665556666665443 3455556555666666665555433 32211 111111111 Q ss_pred ccccccCCCCCChHHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEe Q lcl|Aclame:pro 140 SGADQWSDPTSNPLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYI 216 (309) Q Consensus 140 sgt~~Wsd~~sdPi~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v 216 (309) .........+.+.+.+|.+....+ +..+...+|+++.|.+|+. +++ .++.+ +..+ ....++|+| |++ T Consensus 150 ~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~---lkd----~~G~~-l~~~-~~~~l~G~P-V~~ 219 (304) T protein:vir:10 150 EEKGNVVTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRN---ALD----ANDRP-LFDA-NGNEIMGLP-LSY 219 (304) T ss_pred cccccccccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHH---hhc----cCCcE-eecC-CCcccccee-eEE Confidence 112223345566788888887666 6788999999999999874 222 22221 1211 113578888 444 Q ss_pred ecceeeccccCCCcccceecCCc--EEEEecCCCCCCcCcceecccccc------cccccCCccccccccCCceEEEeec Q lcl|Aclame:pro 217 GEARLNIARPGQNPNLIRAWGPH--ASFIYRDRLADTRNGTTFGLTAQW------GDRVSGSIADPNIGLRGGQRVRVGE 288 (309) Q Consensus 217 ~~a~~~~~~~g~~~~~~~v~~~~--~~L~~~~~~~~~~~~~t~G~T~~~------~~~~~~~~~d~~~g~~g~~~v~v~~ 288 (309) .+..-... .+..-++++- +++... .+.++-..-+- ..+..|... ..-..+...+|+.+ T Consensus 220 ~~~~~~~~-----~~~~~~~gd~~~~~~~~~-------~~~~i~~~~e~~~~~~~~~~~~g~~~--~~f~~~~~~~r~~~ 285 (304) T protein:vir:10 220 TGADVYDK-----KKSLALMGDWDYARYGIL-------QGIEYAISEDATLTTLQASDASGQPV--SLFERDMFALRATM 285 (304) T ss_pred ecccccCC-----CCcEEEEEehhhEEEEEe-------cceEEEEeecceeeeecccccCccch--hhhhcCcEEEEEEE Confidence 33321111 1111222221 111110 11111110000 000000000 11234455688888 Q ss_pred ccceeeecchhhhhhhccc Q lcl|Aclame:pro 289 SVKELVTAPDLGFFFENAV 307 (309) Q Consensus 289 ~~~~~v~~~~~G~l~~~~v 307 (309) .++-.+.-+++-..++.+= T Consensus 286 r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 286 HIAYMNVKPEAFATLKPTE 304 (304) T ss_pred EeccEeecccceEEEEecC Confidence 8888888888877777766 No 36 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=96.32 E-value=0.00037 Score=39.38 Aligned_cols=272 Identities=11% Similarity=0.052 Sum_probs=130.8 Q ss_pred CCCC--------------CCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccc Q lcl|Aclame:pro 1 MSNA--------------PFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNE 66 (309) Q Consensus 1 m~~~--------------~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ 66 (309) |+.. ..++......+-..-++..-| ..+++.+|+.....++|++.....+ ..++.++.... T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l-~~~~~~~~~~~~~~~ip~~~~~~~a----~~v~E~~~~~~ 75 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAI-MKLAKNEPMTAQKKKFTYLAKGVGA----YWVSETERIQT 75 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccch-hhhcceeeccCCceEEEEEeCCcce----EEeecCccccc Confidence 4422 234444333443222222222 2356778887777788888543222 22344444444 Q ss_pred cccCcCccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-------cCcccceec Q lcl|Aclame:pro 67 VEFSATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNS-------YAAGNKTTL 139 (309) Q Consensus 67 ve~~~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~-------y~~~~~~~l 139 (309) ....+.+.+...+..+-..+++++-..+ +.++....-.+.+.+.+....|..+ +++.. .+....... T Consensus 76 ~~~~~~~i~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~l~~~ia~~~d~~~----l~G~g~~~~~~~~~~~~~~~~ 149 (304) T protein:vir:94 76 SKPEYAQAEMEAKKIGVIIPLSKEFLKW--TAKDFFNEVKPLIAEAFYKAFDQAV----IFGTKSPYNTSTSGKPLVEGA 149 (304) T ss_pred ccceeeEEEEEEEEEEEeehhhHHHHhc--chHHHHHHHHHHHHHHHHHHHHhhh----eeccCCCcccccccccccccc Confidence 4555555566666665556666665443 3455556555666666665555433 32211 111111111 Q ss_pred ccccccCCCCCChHHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEe Q lcl|Aclame:pro 140 SGADQWSDPTSNPLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYI 216 (309) Q Consensus 140 sgt~~Wsd~~sdPi~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v 216 (309) .........+.+.+.+|.+....+ +..+...+|+++.|.+|+. +++ .++.+ +..+ ....++|+| |++ T Consensus 150 ~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~---lkd----~~G~~-l~~~-~~~~l~G~P-V~~ 219 (304) T protein:vir:94 150 EEKGNVVTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRN---ALD----ANDRP-LFDA-NGNEIMGLP-LSY 219 (304) T ss_pred cccccccccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHH---hhc----cCCcE-eecC-CCcccccee-eEE Confidence 112223345566788888887666 6788999999999999874 222 22221 1211 113578888 444 Q ss_pred ecceeeccccCCCcccceecCCc--EEEEecCCCCCCcCcceecccccc------cccccCCccccccccCCceEEEeec Q lcl|Aclame:pro 217 GEARLNIARPGQNPNLIRAWGPH--ASFIYRDRLADTRNGTTFGLTAQW------GDRVSGSIADPNIGLRGGQRVRVGE 288 (309) Q Consensus 217 ~~a~~~~~~~g~~~~~~~v~~~~--~~L~~~~~~~~~~~~~t~G~T~~~------~~~~~~~~~d~~~g~~g~~~v~v~~ 288 (309) .+..-... .+..-++++- +++... .+.++-..-+- ..+..|... ..-..+...+|+.+ T Consensus 220 ~~~~~~~~-----~~~~~~~gd~~~~~~~~~-------~~~~i~~~~e~~~~~~~~~~~~g~~~--~~f~~~~~~~r~~~ 285 (304) T protein:vir:94 220 TGADVYDK-----KKSLALMGDWDYARYGIL-------QGIEYAISEDATLTTLQASDASGQPV--SLFERDMFALRATM 285 (304) T ss_pred ecccccCC-----CCcEEEEEehhhEEEEEe-------cceEEEEeecceeeeecccccCccch--hhhhcCcEEEEEEE Confidence 33321111 1111222221 111110 11111110000 000000000 11234455688888 Q ss_pred ccceeeecchhhhhhhccc Q lcl|Aclame:pro 289 SVKELVTAPDLGFFFENAV 307 (309) Q Consensus 289 ~~~~~v~~~~~G~l~~~~v 307 (309) .++-.+.-+++-..++.+= T Consensus 286 r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 286 HIAYMNVKPEAFATLKPTE 304 (304) T ss_pred EeccEeecccceEEEEecC Confidence 8888888888877777766 No 37 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=96.27 E-value=0.00067 Score=37.98 Aligned_cols=273 Identities=13% Similarity=0.058 Sum_probs=133.3 Q ss_pred CCC-----C-CCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCcc Q lcl|Aclame:pro 1 MSN-----A-PFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDE 74 (309) Q Consensus 1 m~~-----~-~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~ 74 (309) |+. . ..++......|-..-++. -+-..+++.+|+.....++++.....+ .-++.+++......++.+. T Consensus 6 ~~~~~~~~~~~~iP~~~~~~ii~~~~~~-s~l~~~~~~~~~~~~~~~~~~~~~~~a-----~~v~E~~~~~~~~~~f~~v 79 (299) T protein:vir:41 6 DTTTMQSAKTGSIPINISEQIITGVKNG-SAAMKLAKAVPMTKPEEEFTFMSGVGA-----FWVDEAERIQTSKPTFTKA 79 (299) T ss_pred CcccccCCCceecchhHHHHHHHHHHhc-chhhhhceeeecCCCcEEEEEEcCCce-----eeeecCccccccccceeEE Confidence 221 1 234444444443322222 233344677888777777777654321 2234455555555566666 Q ss_pred ceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceeccc---ccccCCCCCC Q lcl|Aclame:pro 75 TGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSG---ADQWSDPTSN 151 (309) Q Consensus 75 ~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsg---t~~Wsd~~sd 151 (309) .+..+..+-..+++++-..+. .++......+.+.+.+.+..|..+ +++..-+ .+...++. .......+.+ T Consensus 80 ~l~~~k~~~~~~is~ell~ds--~~~~~~~i~~~l~~a~~~~~d~a~----l~G~g~~-~~~gil~~~~~~~~~~~~~~~ 152 (299) T protein:vir:41 80 KMRSKKMGVIIPTTKENLNYS--VTNFFSLMQAEIVEAFYKKFDQAV----FTGVESP-YNWNILKSATDASNLVEETAN 152 (299) T ss_pred EEeeEEEEEeehhhHHHHhcC--HHHHHHHHHHHHHHHHHHHHHHHH----hhcccCc-ccccccccccccceeeccccc Confidence 666666666667777655532 344555556666666666555433 2222111 11112211 1122234566 Q ss_pred hHHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCH---HHHHHHhCCCeEEeecceeeccc Q lcl|Aclame:pro 152 PLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPM---AFLQELLELDAIYIGEARLNIAR 225 (309) Q Consensus 152 Pi~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~---~~l~~l~gl~~I~v~~a~~~~~~ 225 (309) .+.||.++...+ ++.++..+|+++.|.+|+. ++ ..++.+ +..+ .....+||+| |++-+.. T Consensus 153 ~~~~l~~~~~~l~~~~~~~~~~v~n~~~~~~L~~---lk----d~~G~~-l~~~~~~~~~~~l~G~P-V~~~~~~----- 218 (299) T protein:vir:41 153 KYDDLNEAIGLIEAEDLEPNGIATIRKQRVKYRS---TK----DGNGMP-IFNTATSNGVDDVLGLP-IAYTPKY----- 218 (299) T ss_pred cHHHHHHHHHhhhcccCCcCEEEEcHHHHHHHHH---hh----ccCCce-eecCCcCCCCceeccee-eEEeccc----- Confidence 788888887665 6789999999999999874 22 222221 1100 0112567877 4433322 Q ss_pred cCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccc---cCCcccc---ccccCCceEEEeecccceeeecchh Q lcl|Aclame:pro 226 PGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRV---SGSIADP---NIGLRGGQRVRVGESVKELVTAPDL 299 (309) Q Consensus 226 ~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~---~~~~~d~---~~g~~g~~~v~v~~~~~~~v~~~~~ 299 (309) +..+....-++++..-++... .. +.+.+..+.. .+...+. .....+..-+|+.+.++-.+.-+.| T Consensus 219 ~~~~~~~~~~~gdfs~~~i~~-----~~----~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A 289 (299) T protein:vir:41 219 TFGDKDISELVGDWNQAYYGI-----LR----GVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEA 289 (299) T ss_pred CCCCCceEEEEEecccEEEEE-----ec----CcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccc Confidence 111111112222221111110 00 1111110000 0000010 1112344568899999999999988 Q ss_pred hhhhhccccC Q lcl|Aclame:pro 300 GFFFENAVAA 309 (309) Q Consensus 300 G~l~~~~va~ 309 (309) -..++...|- T Consensus 290 ~~~l~~~aa~ 299 (299) T protein:vir:41 290 FSAVQPKAGN 299 (299) T ss_pred eEEEEeccCC Confidence 8888777666 No 38 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=96.13 E-value=0.00041 Score=39.17 Aligned_cols=279 Identities=11% Similarity=0.037 Sum_probs=126.1 Q ss_pred CCC---CCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCcccee Q lcl|Aclame:pro 1 MSN---APFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDETGS 77 (309) Q Consensus 1 m~~---~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~~~ 77 (309) .+. .-.++.+....+...-++.. +--.+++.+|+.....++|++.....+ .-++.++.....+.++.+.++. T Consensus 22 ~~~~~~g~~ip~~~~~~ii~~~~~~s-~i~~~~~~~~~~~~~~~~p~~~~~~~a----~~v~Eg~~~~~~~~~f~~i~~~ 96 (326) T protein:vir:42 22 TGDSMFEGYLEPEQAQDYFAEAEKIS-IVQQFAQKIPMGTTGQKIPHWTGDVSA----SWIGEGDMKPITKGNMTSQTIA 96 (326) T ss_pred ccccCCcceechhhHHHHHHHHHhcc-hhhhhcceeeccCCceEEEEEeCCcce----EEecCCccccccccceeEEEEe Confidence 111 11344444333322222222 223467888988878888887653222 1244555555666666666666 Q ss_pred eeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCc--------ccceecccccccCCCC Q lcl|Aclame:pro 78 TEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAA--------GNKTTLSGADQWSDPT 149 (309) Q Consensus 78 ~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~--------~~~~~lsgt~~Wsd~~ 149 (309) .+..+-..+++++-..+ +.++......+.+.+.+....|..+ +++..-+. ...............+ T Consensus 97 ~~k~~~~v~iS~ell~~--s~~~~~~~i~~~l~~a~~~~~d~a~----l~G~gs~~p~gi~~~~~~~~~~~~~~~~~~~~ 170 (326) T protein:vir:42 97 PHKIATIFVASAETVRA--NPANYLGTMRTKVATAFAMAFDNAA----INGTDSPFPTFLAQTTKEVSLVDPDGTGSNAD 170 (326) T ss_pred eEEEEEeehhhHHHHhc--CHHHHHHHHHHHHHHHHHHHHHHHh----hcccCCCccccccccccccceeeccccccccc Confidence 66666666666665443 3455566666677777666655433 22211000 0000111111111111 Q ss_pred CChHH-HHHHHHH---HhCCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHH---------HHHHHhCCCeEEe Q lcl|Aclame:pro 150 SNPLP-VITDALD---SVILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMA---------FLQELLELDAIYI 216 (309) Q Consensus 150 sdPi~-di~~~~~---~~g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~---------~l~~l~gl~~I~v 216 (309) ...-. ++..... ......+..+|++++|.+|+. +++ .++.. +..+. ....++|+| |.+ T Consensus 171 ~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~---lkd----~~G~~-l~~~~~~~~~~~~~~~~~l~G~p-v~~ 241 (326) T protein:vir:42 171 LTVYDAVAVNALSLLVNAGKKWTHTLLDDITEPILNG---AKD----KSGRP-LFIESTYTEENSPFRLGRIVARP-TIL 241 (326) T ss_pred chhHHHHHHHHHhhhhhhccCccEEEEeHHHHHHHHH---hhc----cCCce-eeccccccCccccccCceeeeee-EEE Confidence 11111 1222222 235677889999999998874 222 12111 11111 112356666 333 Q ss_pred ecceeeccccCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeec Q lcl|Aclame:pro 217 GEARLNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTA 296 (309) Q Consensus 217 ~~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~ 296 (309) .+.. +. .+..-+++|-.-+++....+-.. ..+-..+.+.+....+... ..-..+...+|+...++-.+.- T Consensus 242 ~~~~-----~~--~~~~~~~Gd~s~~~~~~~~~~~v-~~~~e~~~~~~~~~~~~~~--~~~~~d~~~~r~~~~~d~~v~~ 311 (326) T protein:vir:42 242 SDHV-----AS--GTVVGYQGDFRQLVWGQVGGLSF-DVTDQATLNLGTPQAPNFV--SLWQHNLVAVRVEAEYAFHCND 311 (326) T ss_pred cCCC-----CC--CceEEEEeecceEEEEEecceEE-EEeecceeeecccccccch--hhhhcCcEEEEEEEEeccEEec Confidence 2211 11 11111223321111111100000 0000001111110000000 0112345668999999999999 Q ss_pred chhhhhhhccccC Q lcl|Aclame:pro 297 PDLGFFFENAVAA 309 (309) Q Consensus 297 ~~~G~l~~~~va~ 309 (309) +++-..|+++.|+ T Consensus 312 ~~a~~~l~~~~~~ 324 (326) T protein:vir:42 312 KDAFVKLTNVDAT 324 (326) T ss_pred ccceEEEeecccc Confidence 9998899999999 No 39 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=96.09 E-value=0.001 Score=36.98 Aligned_cols=259 Identities=10% Similarity=0.005 Sum_probs=128.1 Q ss_pred CCCCC-C---Ccch-hhHHHHHhhcchhhhhhhhCCcc-ccc---cccceeEEechhHhhhchhHhhcccccccccccCc Q lcl|Aclame:pro 1 MSNAP-F---PIDP-ELTAIAIAYRNGRMISDEVLPRV-PVG---KQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSA 71 (309) Q Consensus 1 m~~~~-f---~~dp-~LT~~a~~y~n~~~ig~~lfP~v-~v~---~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~ 71 (309) |+|.. . .+.| +++++.+.--...++--.++..- ... ....++|+|... -..+....+.....-+++. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~----g~a~~~~~g~~i~~~~lt~ 76 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYS----GDAQVVAEGEKIPTDILET 76 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCC----CccccccCCCccccccccc Confidence 99853 2 2233 34444432111222111111110 111 222345555421 1122334444444445555 Q ss_pred CccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCC Q lcl|Aclame:pro 72 TDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSN 151 (309) Q Consensus 72 ~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sd 151 (309) ...+..++..+-...++.++ .++..-||.+.+.+.+...+....+..+...+.... .+. +.+.- T Consensus 77 ~~~~~~i~~~~~~~~i~D~~--~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~-------~~~-------~~~~~ 140 (274) T protein:vir:94 77 KKREAKIRKIAKGTSITDEA--LLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK-------LTV-------NADIT 140 (274) T ss_pred ceeEEEeeeecceecccHHH--HHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccC-------ccc-------ccccc Confidence 66666666666545555554 444566888888888887777777766665543211 111 11222 Q ss_pred hHHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeeccccCC Q lcl|Aclame:pro 152 PLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPGQ 228 (309) Q Consensus 152 Pi~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~~~~g~ 228 (309) ....|.+++..+ +..+..+++++.++..|++++.+. .++.+....+++..-++..++|++ |++-+.. + T Consensus 141 ~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~-f~~~s~~g~~~~~~G~ig~~~G~~-Vi~s~~~-----p-- 211 (274) T protein:vir:94 141 KLNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRGDASTN-FTRATELGDDIIVKGAFGEALGAI-IVRTNKL-----E-- 211 (274) T ss_pred CHHHHHHHHHHhhccCCCceEEEeCHHHHHHHHhhhhhh-ccccCcccccceeccccceecCee-EEEcCCC-----C-- Confidence 356677777666 456889999999999998876432 233333323455556677788875 4443221 0 Q ss_pred CcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhcccc Q lcl|Aclame:pro 229 NPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVA 308 (309) Q Consensus 229 ~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va 308 (309) ....++++..+ +|+ +...+. .++.+.-...+...+.....+--.++-+..-..++.+.| T Consensus 212 -~~t~~l~~~gA----------------~~~-~~~~~~---~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 270 (274) T protein:vir:94 212 -AGTAILAKKGA----------------VKL-ILKRDF---FLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred -cceEEEEeCcc----------------eEe-eecCCc---eeccccchhhcccEEEEEEEEEEEEEcCCceEEEecCcc Confidence 01112333222 222 111000 111111112234556666666667777766667777777 Q ss_pred C Q lcl|Aclame:pro 309 A 309 (309) Q Consensus 309 ~ 309 (309) + T Consensus 271 ~ 271 (274) T protein:vir:94 271 S 271 (274) T ss_pred c Confidence 7 No 40 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=96.09 E-value=0.001 Score=36.98 Aligned_cols=259 Identities=10% Similarity=0.005 Sum_probs=128.1 Q ss_pred CCCCC-C---Ccch-hhHHHHHhhcchhhhhhhhCCcc-ccc---cccceeEEechhHhhhchhHhhcccccccccccCc Q lcl|Aclame:pro 1 MSNAP-F---PIDP-ELTAIAIAYRNGRMISDEVLPRV-PVG---KQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSA 71 (309) Q Consensus 1 m~~~~-f---~~dp-~LT~~a~~y~n~~~ig~~lfP~v-~v~---~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~ 71 (309) |+|.. . .+.| +++++.+.--...++--.++..- ... ....++|+|... -..+....+.....-+++. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~~----g~a~~~~~g~~i~~~~lt~ 76 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYS----GDAQVVAEGEKIPTDILET 76 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecCC----CccccccCCCccccccccc Confidence 99853 2 2233 34444432111222111111110 111 222345555421 1122334444444445555 Q ss_pred CccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCC Q lcl|Aclame:pro 72 TDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSN 151 (309) Q Consensus 72 ~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sd 151 (309) ...+..++..+-...++.++ .++..-||.+.+.+.+...+....+..+...+.... .+. +.+.- T Consensus 77 ~~~~~~i~~~~~~~~i~D~~--~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~-------~~~-------~~~~~ 140 (274) T protein:vir:97 77 KKREAKIRKIAKGTSITDEA--LLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK-------LTV-------NADIT 140 (274) T ss_pred ceeEEEeeeecceecccHHH--HHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccC-------ccc-------ccccc Confidence 66666666666545555554 444566888888888887777777766665543211 111 11222 Q ss_pred hHHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeeccccCC Q lcl|Aclame:pro 152 PLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPGQ 228 (309) Q Consensus 152 Pi~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~~~~g~ 228 (309) ....|.+++..+ +..+..+++++.++..|++++.+. .++.+....+++..-++..++|++ |++-+.. + T Consensus 141 ~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~-f~~~s~~g~~~~~~G~ig~~~G~~-Vi~s~~~-----p-- 211 (274) T protein:vir:97 141 KLNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRGDASTN-FTRATELGDDIIVKGAFGEALGAI-IVRTNKL-----E-- 211 (274) T ss_pred CHHHHHHHHHHhhccCCCceEEEeCHHHHHHHHhhhhhh-ccccCcccccceeccccceecCee-EEEcCCC-----C-- Confidence 356677777666 456889999999999998876432 233333323455556677788875 4443221 0 Q ss_pred CcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhcccc Q lcl|Aclame:pro 229 NPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVA 308 (309) Q Consensus 229 ~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va 308 (309) ....++++..+ +|+ +...+. .++.+.-...+...+.....+--.++-+..-..++.+.| T Consensus 212 -~~t~~l~~~gA----------------~~~-~~~~~~---~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 270 (274) T protein:vir:97 212 -AGTAILAKKGA----------------VKL-ILKRDF---FLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred -cceEEEEeCcc----------------eEe-eecCCc---eeccccchhhcccEEEEEEEEEEEEEcCCceEEEecCcc Confidence 01112333222 222 111000 111111112234556666666667777766667777777 Q ss_pred C Q lcl|Aclame:pro 309 A 309 (309) Q Consensus 309 ~ 309 (309) + T Consensus 271 ~ 271 (274) T protein:vir:97 271 S 271 (274) T ss_pred c Confidence 7 No 41 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=96.06 E-value=0.00069 Score=37.91 Aligned_cols=288 Identities=10% Similarity=-0.044 Sum_probs=127.8 Q ss_pred CCC------------CCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhh--hch--hHhhcccccc Q lcl|Aclame:pro 1 MSN------------APFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGF--TVP--ETLVGRKSKP 64 (309) Q Consensus 1 m~~------------~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f--~~~--~t~~~~~~~~ 64 (309) |+. ...++....+.|-. .-....+=-.+++.+|+...+.++|++...... ... ...++.++.. T Consensus 10 ~~~~~~~~~~~~~~~~~liP~~~~~~ii~-~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~~~~~~~~~Eg~~~ 88 (338) T protein:vir:78 10 NTAGSNHQGRLAHVPSDLLPKEIVGPIFD-KAQESSLVLRLGENIPISYGETIIPTTVKRPEVGQVGVGTSNEQREGGTK 88 (338) T ss_pred hhcccccccceecccccccchHHHHHHHH-HHHhhchhhhhcceeeccCCceEEEEEecCccceeecccccccccccccc Confidence 111 11344444443332 222222234567888888888899987543111 100 1122334444 Q ss_pred cccccCcCccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------Cccc Q lcl|Aclame:pro 65 NEVEFSATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSY---------AAGN 135 (309) Q Consensus 65 ~~ve~~~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y---------~~~~ 135 (309) ......+....+..+..+-..+++++-.++ +.++....-.+.+.+.+....|.. ++++..- .... T Consensus 89 ~~~~~~f~~v~l~~~k~~~~~~is~ell~d--s~~~~~~~i~~~la~a~~~~~d~~----~l~G~g~~~~~~~~gi~~~~ 162 (338) T protein:vir:78 89 PLSGTAWDTRSVAPIKLATIVTVSEEFARM--NPSGLYTKLQADLAYAIGRGIDLA----VFHGKSPLTGSALQGIDTNN 162 (338) T ss_pred cccccceeEEEEEEEEEEEeehhhHHHHhc--CHHHHHHHHHHHHHHHHHHHHHHH----hhcccCCCcccccccccccc Confidence 444445555555555555555666654443 234445555556666666655543 3332211 1111 Q ss_pred ceec-ccccccCCCCCChHHHHHHHHHHh----CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcc---cccCHHHHHH Q lcl|Aclame:pro 136 KTTL-SGADQWSDPTSNPLPVITDALDSV----ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDE---GMVPMAFLQE 207 (309) Q Consensus 136 ~~~l-sgt~~Wsd~~sdPi~di~~~~~~~----g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~---~~vt~~~l~~ 207 (309) ...- +..+.........+.+|.+....+ ...++..+|+++.+..|+.-.++++ .++.. ....-..-.. T Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d----~~g~~l~~~~~~~~~~~~ 238 (338) T protein:vir:78 163 VIVNTTNVDYLQTGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRD----ANGNVDPTRINLAASAGD 238 (338) T ss_pred ccccccccccccccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhcc----CCCceeecccccCCCCce Confidence 1100 011111112234566666665544 4568899999999998865433222 11111 0011111134 Q ss_pred HhCCCeEEeecceeeccccCCCcccceecCCcEEEEecCCCCCCcCcceeccccccc---ccccCCccc---cccccCCc Q lcl|Aclame:pro 208 LLELDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWG---DRVSGSIAD---PNIGLRGG 281 (309) Q Consensus 208 l~gl~~I~v~~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~---~~~~~~~~d---~~~g~~g~ 281 (309) ++|+| |++.+..-............-++++..-+++.... |.+.+.. +...+..++ ...-..+. T Consensus 239 l~G~P-V~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~---------~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 308 (338) T protein:vir:78 239 LLGLP-VQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGFAD---------EIRVKMSDTATLTDNTSPTPQTVSMWQTNQ 308 (338) T ss_pred eeeee-EEEccccCccccccCCcccEEEEEecceEEEEeec---------ccEEEEeecccccccccccccchhhhhcCc Confidence 67877 55443322111011111111122322111111100 1111110 000000000 01112344 Q ss_pred eEEEeecccceeeecchhhhhhhccccC Q lcl|Aclame:pro 282 QRVRVGESVKELVTAPDLGFFFENAVAA 309 (309) Q Consensus 282 ~~v~v~~~~~~~v~~~~~G~l~~~~va~ 309 (309) ..+|+.+.++-.+.-+++-..+.++-|+ T Consensus 309 ~~~r~~~r~d~~v~~~~a~~~l~~~~~~ 336 (338) T protein:vir:78 309 IAILIEVTFGWLLGDKQAFVKFVDDEDP 336 (338) T ss_pred EEEEEEEEeccEeecccceEEEecccCC Confidence 5688888889888888888888888887 No 42 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=95.93 E-value=0.0012 Score=36.52 Aligned_cols=255 Identities=11% Similarity=0.039 Sum_probs=124.7 Q ss_pred CCCC-CC---Ccch-hhHHHHHhhcchhhhhhhhCCc-ccc----c---cccceeEEechhHhhhchhHhhccccccccc Q lcl|Aclame:pro 1 MSNA-PF---PIDP-ELTAIAIAYRNGRMISDEVLPR-VPV----G---KQEFKFWKYDLAQGFTVPETLVGRKSKPNEV 67 (309) Q Consensus 1 m~~~-~f---~~dp-~LT~~a~~y~n~~~ig~~lfP~-v~v----~---~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~v 67 (309) ||+. +. .+.| +++++.+. ++-...+|.. +.+ . ....++|+|.. .-.......+.....- T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~----~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~----~g~~~~~~~g~~i~~~ 72 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQA----ELDKKLRFAQFADIDSTLVGQPGDTLTFPAFTY----SGDAQVIAEGEKIPVD 72 (274) T ss_pred CCccccchhhhhhhHHHHHHHHH----HHHhhhhhcccccccccccCCCCCEEEEEeecc----CCCccccCCCCcCchh Confidence 9974 22 2233 34444432 2222222221 111 1 22234555532 1112223444444444 Q ss_pred ccCcCccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCC Q lcl|Aclame:pro 68 EFSATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSD 147 (309) Q Consensus 68 e~~~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd 147 (309) +.+....+..++.++-...++.++ ..+...||...+.+.+...+....+..++..+.... ++ .+. T Consensus 73 ~it~~~~~~~i~~~~~~~~i~D~~--~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~---------~~----~~~ 137 (274) T protein:vir:96 73 QIGTSKREAKVRKIGKGTELTDEA--VLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT---------LT----VEA 137 (274) T ss_pred hcccceeEEEEEeeeceeeecHHH--HHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCC---------CC----cCc Confidence 555566666666655544555554 344567888888888888877777777666553221 11 111 Q ss_pred CCCChHHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeecc Q lcl|Aclame:pro 148 PTSNPLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIA 224 (309) Q Consensus 148 ~~sdPi~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~~ 224 (309) +..-...|.++...+ ...+..+++++.++..|++++.+ +.+..+....+.+..-++..++|++ |++-+.. T Consensus 138 -~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~-~f~~~~~~g~~~~~~g~ig~~~G~~-Vi~s~~~---- 210 (274) T protein:vir:96 138 -DITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASD-NFTRPTQLGDNIIVKGAFGEALGAV-IVRSNKL---- 210 (274) T ss_pred -ccccHHHHHHHHHHhcccCCCceEEEeCHHHHHHHHhcccc-cccccccccccceeecccceecCee-EEEcCCC---- Confidence 111145566666665 45789999999999999887642 2222222222444455677788876 5443221 Q ss_pred ccCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhh Q lcl|Aclame:pro 225 RPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFE 304 (309) Q Consensus 225 ~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~ 304 (309) + ....++|+.++ +|+-.+. + -.++...-...++..++....+--.++-++.-..++ T Consensus 211 -p---~~t~~l~~~gA----------------~~~~~~~-~---~~vE~~Rd~~~~~d~i~~~~~yg~~~~~~~~vv~~t 266 (274) T protein:vir:96 211 -N---KGEALLAKKGA----------------VKLITKR-D---FFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKIT 266 (274) T ss_pred -C---cceEEEEeCcc----------------eeeeecC-C---cccccccchhhcccEEEEeeEEEEEEEcCccEEEEE Confidence 0 01123333332 2221110 0 011111112234455666666666666666655555 Q ss_pred ccccC Q lcl|Aclame:pro 305 NAVAA 309 (309) Q Consensus 305 ~~va~ 309 (309) .+.|- T Consensus 267 ~~~~~ 271 (274) T protein:vir:96 267 KGAGD 271 (274) T ss_pred cCccc Confidence 55444 No 43 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=95.66 E-value=0.0017 Score=35.83 Aligned_cols=262 Identities=11% Similarity=0.012 Sum_probs=116.7 Q ss_pred CCCC-C-----CCcchhhHHHHHhhcchhhhhhhhCCcc-ccccccc---eeEEechhHhhhchhHhhcccccccccccC Q lcl|Aclame:pro 1 MSNA-P-----FPIDPELTAIAIAYRNGRMISDEVLPRV-PVGKQEF---KFWKYDLAQGFTVPETLVGRKSKPNEVEFS 70 (309) Q Consensus 1 m~~~-~-----f~~dp~LT~~a~~y~n~~~ig~~lfP~v-~v~~~~~---k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~ 70 (309) ||+. + |+|+ +++++.+.--+..++--.++... ....+.+ ++|+|.. .. ..+....+.....-+++ T Consensus 1 Ma~~~T~~~~~iiPe-v~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~---~g-~a~~~~~g~~i~~~~lt 75 (278) T protein:vir:80 1 MADLTTKLANLIDPE-VMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKY---IG-DAQDVAEGAAIDYSALE 75 (278) T ss_pred CCCcceehhheecHH-HHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeecc---CC-cceeecCCCcCcccccc Confidence 9873 2 4332 24444432111112212222211 1112223 3455432 11 11223333333333455 Q ss_pred cCccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCC Q lcl|Aclame:pro 71 ATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTS 150 (309) Q Consensus 71 ~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~s 150 (309) ..+.+..++..+-.-.++++ ...+...|+.+.+.+.+...+....+..+...+.... ++ .++ ..+.... T Consensus 76 ~~~~~~~i~~~~~a~~v~D~--~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~-----~~--~~~--~~t~~~~ 144 (278) T protein:vir:80 76 TESVKHGIKKAGKGVKLTDE--SVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTT-----LE--VKG--AINIGLI 144 (278) T ss_pred cceeeEeeehhhccccccHH--HHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhccc-----cc--ccc--ccccchh Confidence 55555666555544444444 3455677899999998888888888877665543221 11 111 1111122 Q ss_pred C-hHHHHHHHHHHhC----CCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeeccc Q lcl|Aclame:pro 151 N-PLPVITDALDSVI----LRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIAR 225 (309) Q Consensus 151 d-Pi~di~~~~~~~g----~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~~~ 225 (309) | -+..+-++.++++ -.+.++++++.++..|++++.+.- +..+....+++..-++..++|++ |++-+.. T Consensus 145 ~~~~~~~~da~~~l~~~~~~~~~~ivv~p~~~~~L~k~~~~~~-~~~~~~g~~~~~~G~ig~~~G~~-Vi~s~~~----- 217 (278) T protein:vir:80 145 DKIENTFTDAPDAIEDESITTTGVLFLNYKDTAKLREEAAGSW-TKASQLGDDLLVKGAFGELLGWE-IVRTKKL----- 217 (278) T ss_pred hhHHHHHHHHHHhhcccCCCcccEEEECHHHHHHHHhhhhhhc-cccccccccceeeccceeeccee-EEEcCCC----- Confidence 2 2334455555553 235679999999999988764332 22222222344445666778875 4443221 Q ss_pred cCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhc Q lcl|Aclame:pro 226 PGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFEN 305 (309) Q Consensus 226 ~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~ 305 (309) + ....+++..++ +|+-.. .. -.++...-...++..++....+--+++-+++-..+ . T Consensus 218 p---~~t~~l~~~gA----------------i~~~~~-~~---~~vE~~Rd~~~~~d~i~~~~~yg~~v~~~~~~v~i-t 273 (278) T protein:vir:80 218 A---DGNALAVKAGA----------------LKTFLK-RN---LLAESGRDMDHKLTKFNADQHYAVALVDETKAVKV-V 273 (278) T ss_pred C---cceEEEEeccc----------------eeeeec-CC---cccccccchhhccceeeeeeEEEEEEEcCcceEEE-e Confidence 0 01122333222 221111 00 01111111223344455555555555555444444 2 Q ss_pred cccC Q lcl|Aclame:pro 306 AVAA 309 (309) Q Consensus 306 ~va~ 309 (309) ..|| T Consensus 274 ~~a~ 277 (278) T protein:vir:80 274 PVAG 277 (278) T ss_pred eccC Confidence 3333 No 44 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=95.63 E-value=0.0017 Score=35.75 Aligned_cols=263 Identities=11% Similarity=0.061 Sum_probs=122.8 Q ss_pred CCC-----CCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechh-HhhhchhHhhcccccccccccCcCcc Q lcl|Aclame:pro 1 MSN-----APFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLA-QGFTVPETLVGRKSKPNEVEFSATDE 74 (309) Q Consensus 1 m~~-----~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~-~~f~~~~t~~~~~~~~~~ve~~~~~~ 74 (309) |.. ..+++......+-.-.+...-|. .++|.+|+.....++++.... ... .-++.++.......++.+. T Consensus 105 ~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~-~~~~~~~~~~~~~~~~~~~~~~~~a----~~v~E~~~~~~~~~~~~~~ 179 (385) T protein:vir:19 105 LGSDADSAGSLIQPMQIPGIIMPGLRRLTIR-DLLAQGRTSSNALEYVREEVFTNNA----DVVAEKALKPESDITFSKQ 179 (385) T ss_pred hccccccCCceecchhhhHHHHHhhhccchh-hhcceecccCcceEEEEEecCCcce----eeeccCccccccccceeEE Confidence 221 11333333333332222222232 357888888777888876421 111 1234455555555666666 Q ss_pred ceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecc--c--ccccCCCCC Q lcl|Aclame:pro 75 TGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLS--G--ADQWSDPTS 150 (309) Q Consensus 75 ~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~ls--g--t~~Wsd~~s 150 (309) .+..+..+-..+++++-.++.. +......+.+.+.+....|.. ++++..-+......++ + +..++..+. T Consensus 180 ~~~~~k~~~~~~is~ell~d~~---~l~~~i~~~la~a~~~~~d~~----~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~ 252 (385) T protein:vir:19 180 TANVKTIAHWVQASRQVMDDAP---MLQSYINNRLMYGLALKEEGQ----LLNGDGTGDNLEGLNKVATAYDTSLNATGD 252 (385) T ss_pred EEeeeeEEEeehhhHHHHhhHH---HHHHHHHHHHHHHHHHHHHHH----HHhccCCCCccccccccccccccccccccc Confidence 7777777666678877544332 234444455555555554433 3332211111111111 1 111223344 Q ss_pred ChHHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCH---HHHHHHhCCCeEEeecceeecc Q lcl|Aclame:pro 151 NPLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPM---AFLQELLELDAIYIGEARLNIA 224 (309) Q Consensus 151 dPi~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~---~~l~~l~gl~~I~v~~a~~~~~ 224 (309) +.+.+|.+....+ +..++.++|+++.|.+|+. ++ ..++.. +... ..-..++|+| |++.+.. T Consensus 253 ~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~---lk----d~~G~~-l~~~~~~~~~~~l~G~p-V~~~~~~---- 319 (385) T protein:vir:19 253 TRADIIAHAIYQVTESEFSASGIVLNPRDWHNIAL---LK----DNEGRY-IFGGPQAFTSNIMWGLP-VVPTKAQ---- 319 (385) T ss_pred chHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHH---hh----cCCCce-eccCcccCCCceeccee-eEEcCcC---- Confidence 5667777776544 7788999999999998764 22 222211 1100 1113356776 4332211 Q ss_pred ccCCCcccceecCC--cEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhh Q lcl|Aclame:pro 225 RPGQNPNLIRAWGP--HASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFF 302 (309) Q Consensus 225 ~~g~~~~~~~v~~~--~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l 302 (309) +.. .-++++ ..++++.. -|.+.++.. ........+...+++...++-.+.-+.+-.. T Consensus 320 -p~~----~~~~gd~~~~~~~~~~----------~~~~v~~~~------~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~ 378 (385) T protein:vir:19 320 -AAG----TFTVGGFDMASQVWDR----------MDATVEVSR------EDRDNFVKNMLTILCEERLALAHYRPTAIIK 378 (385) T ss_pred -CCC----cEEEeecccEEEEEEe----------cceEEEEec------cccchhhcCcEEEEEEEeeccEEecccceEE Confidence 100 011121 11111110 122222211 0001112444567788888877777777777 Q ss_pred hhccccC Q lcl|Aclame:pro 303 FENAVAA 309 (309) Q Consensus 303 ~~~~va~ 309 (309) ++-+.|+ T Consensus 379 ~~~~aa~ 385 (385) T protein:vir:19 379 GTFSSGS 385 (385) T ss_pred EEeccCC Confidence 7766666 No 45 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=95.63 E-value=0.0017 Score=35.75 Aligned_cols=263 Identities=11% Similarity=0.061 Sum_probs=122.8 Q ss_pred CCC-----CCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechh-HhhhchhHhhcccccccccccCcCcc Q lcl|Aclame:pro 1 MSN-----APFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLA-QGFTVPETLVGRKSKPNEVEFSATDE 74 (309) Q Consensus 1 m~~-----~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~-~~f~~~~t~~~~~~~~~~ve~~~~~~ 74 (309) |.. ..+++......+-.-.+...-|. .++|.+|+.....++++.... ... .-++.++.......++.+. T Consensus 105 ~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~-~~~~~~~~~~~~~~~~~~~~~~~~a----~~v~E~~~~~~~~~~~~~~ 179 (385) T protein:vir:18 105 LGSDADSAGSLIQPMQIPGIIMPGLRRLTIR-DLLAQGRTSSNALEYVREEVFTNNA----DVVAEKALKPESDITFSKQ 179 (385) T ss_pred hccccccCCceecchhhhHHHHHhhhccchh-hhcceecccCcceEEEEEecCCcce----eeeccCccccccccceeEE Confidence 221 11333333333332222222232 357888888777888876421 111 1234455555555666666 Q ss_pred ceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecc--c--ccccCCCCC Q lcl|Aclame:pro 75 TGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLS--G--ADQWSDPTS 150 (309) Q Consensus 75 ~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~ls--g--t~~Wsd~~s 150 (309) .+..+..+-..+++++-.++.. +......+.+.+.+....|.. ++++..-+......++ + +..++..+. T Consensus 180 ~~~~~k~~~~~~is~ell~d~~---~l~~~i~~~la~a~~~~~d~~----~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~ 252 (385) T protein:vir:18 180 TANVKTIAHWVQASRQVMDDAP---MLQSYINNRLMYGLALKEEGQ----LLNGDGTGDNLEGLNKVATAYDTSLNATGD 252 (385) T ss_pred EEeeeeEEEeehhhHHHHhhHH---HHHHHHHHHHHHHHHHHHHHH----HHhccCCCCccccccccccccccccccccc Confidence 7777777666678877544332 234444455555555554433 3332211111111111 1 111223344 Q ss_pred ChHHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCH---HHHHHHhCCCeEEeecceeecc Q lcl|Aclame:pro 151 NPLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPM---AFLQELLELDAIYIGEARLNIA 224 (309) Q Consensus 151 dPi~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~---~~l~~l~gl~~I~v~~a~~~~~ 224 (309) +.+.+|.+....+ +..++.++|+++.|.+|+. ++ ..++.. +... ..-..++|+| |++.+.. T Consensus 253 ~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~---lk----d~~G~~-l~~~~~~~~~~~l~G~p-V~~~~~~---- 319 (385) T protein:vir:18 253 TRADIIAHAIYQVTESEFSASGIVLNPRDWHNIAL---LK----DNEGRY-IFGGPQAFTSNIMWGLP-VVPTKAQ---- 319 (385) T ss_pred chHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHH---hh----cCCCce-eccCcccCCCceeccee-eEEcCcC---- Confidence 5667777776544 7788999999999998764 22 222211 1100 1113356776 4332211 Q ss_pred ccCCCcccceecCC--cEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhh Q lcl|Aclame:pro 225 RPGQNPNLIRAWGP--HASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFF 302 (309) Q Consensus 225 ~~g~~~~~~~v~~~--~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l 302 (309) +.. .-++++ ..++++.. -|.+.++.. ........+...+++...++-.+.-+.+-.. T Consensus 320 -p~~----~~~~gd~~~~~~~~~~----------~~~~v~~~~------~~~~~~~~~~~~~~~~~r~~~~v~~~~a~~~ 378 (385) T protein:vir:18 320 -AAG----TFTVGGFDMASQVWDR----------MDATVEVSR------EDRDNFVKNMLTILCEERLALAHYRPTAIIK 378 (385) T ss_pred -CCC----cEEEeecccEEEEEEe----------cceEEEEec------cccchhhcCcEEEEEEEeeccEEecccceEE Confidence 100 011121 11111110 122222211 0001112444567788888877777777777 Q ss_pred hhccccC Q lcl|Aclame:pro 303 FENAVAA 309 (309) Q Consensus 303 ~~~~va~ 309 (309) ++-+.|+ T Consensus 379 ~~~~aa~ 385 (385) T protein:vir:18 379 GTFSSGS 385 (385) T ss_pred EEeccCC Confidence 7766666 No 46 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=95.63 E-value=0.0017 Score=35.74 Aligned_cols=281 Identities=11% Similarity=0.027 Sum_probs=127.1 Q ss_pred CCC------CCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCcc Q lcl|Aclame:pro 1 MSN------APFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDE 74 (309) Q Consensus 1 m~~------~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~ 74 (309) |++ ...++....+.+-..-+ +.-+-..+++.+|+.....+||+...... ..-++.++.....+..+.+. T Consensus 14 ~~~~~~~~~~~~ip~~~~~~ii~~~~-~~~~l~~~~~~~~~~~~~~~ip~~~~~~~----a~~v~Eg~~~~~~~~~f~~i 88 (318) T protein:vir:24 14 IAQTGDTMFKGYLEPEQAKDYFAEAE-KTSIVQQFAQKVPMGTTGQKIPHWVGDVS----AQWIGEGDMKPITKGNMTSQ 88 (318) T ss_pred hhcccCcccceeechhHHHHHHHHHH-hhchhhhhcceeeccCCceEEEEEeCCcc----eEEecCCccccccccceeEE Confidence 221 12444444444432222 22233455788888877788888754211 12234445545555666666 Q ss_pred ceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceeccc--ccccCCCCCCh Q lcl|Aclame:pro 75 TGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSG--ADQWSDPTSNP 152 (309) Q Consensus 75 ~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsg--t~~Wsd~~sdP 152 (309) ++.++..+...+++++-.++ +.++..+...+.+.+.+....|..+-...-+ ..+........+ .......++.. T Consensus 89 ~~~~~k~~~~~~iS~e~l~d--s~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~--~~~~~~~~~~~~~~~~~~~~~~~~~ 164 (318) T protein:vir:24 89 TIAPHKIATIFVASAETVRA--NPANYLGTMRTKVATAFAMAFDGAAMHGTDS--PFPTYIGQTTKAISIADTTGATTVY 164 (318) T ss_pred EEeeEEEEEeehhhHHHhhc--ChHHHHHHHHHHHHHHHHHHHHHhhhcccCC--CCCcccccccccccccccccccchH Confidence 66666666666777665443 3345556556666776666665544322111 111111111100 01112223333 Q ss_pred HHHHHHHHHH---hCCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCccccc-------CHHHH--HHHhCCCeEEeecce Q lcl|Aclame:pro 153 LPVITDALDS---VILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMV-------PMAFL--QELLELDAIYIGEAR 220 (309) Q Consensus 153 i~di~~~~~~---~g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~v-------t~~~l--~~l~gl~~I~v~~a~ 220 (309) ..++...+.. .+..+...+|+++.|.+|+. +++. ++.. +. +...+ ..++|+|- .+.++ T Consensus 165 ~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~---lkd~----~G~~-l~~~~~~~~~~~~~~~~~i~g~pv-~~~~~- 234 (318) T protein:vir:24 165 DQVAVNGLSLLVNDGKKWTHTLLDDITEPILNG---AKDQ----NGRP-LFIESTYGEAASPFRSGRIVARPT-ILSDH- 234 (318) T ss_pred HHHHHHHHHhhccccCCCCEEEEcHHHHHHHHH---hhcc----CCce-eecCccccCccccccCceEEEEee-EEeCC- Confidence 3444444333 36778899999999998864 2221 2111 00 00111 12334432 22111 Q ss_pred eeccccCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhh Q lcl|Aclame:pro 221 LNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLG 300 (309) Q Consensus 221 ~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G 300 (309) .+.++ ..-++++-.-+++....+-.. ..+=-.+.+.+....+... ..-..+...+|+.+.++-.+.-+.+- T Consensus 235 ----~~~~~--~~~~~gdfs~~~~~~~~~l~i-~~~~~~~~~~~~~~~~~~~--~~f~~~~~~~r~~~r~d~~v~~~~a~ 305 (318) T protein:vir:24 235 ----VVEGT--TVGFMGDFSQLIWGQIGGLSF-DVTDQATLNLGTVESPNFV--SLWQHNLVAVRVEAEYAFHCNDAEAF 305 (318) T ss_pred ----CCCCc--cEEEEeecceEEEEEecCeEE-EEeeccceeccccccccch--hhhhcCcEEEEEEEEEccEEecccce Confidence 11111 111222221111111000000 0000001111110011000 01224556789999999999999998 Q ss_pred hhhhccccC Q lcl|Aclame:pro 301 FFFENAVAA 309 (309) Q Consensus 301 ~l~~~~va~ 309 (309) ..|+++.|| T Consensus 306 ~~i~~~~a~ 314 (318) T protein:vir:24 306 VALTNVVSG 314 (318) T ss_pred EEEEeeccC Confidence 899999999 No 47 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=95.53 E-value=0.0019 Score=35.54 Aligned_cols=286 Identities=12% Similarity=0.002 Sum_probs=127.4 Q ss_pred CCCC-----CCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCccc Q lcl|Aclame:pro 1 MSNA-----PFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDET 75 (309) Q Consensus 1 m~~~-----~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~ 75 (309) |+.. ..++..+.+.|..--+ +..+=..+++.+|+.....+||+...... ..-++.+++......++.+.+ T Consensus 1 Mat~tt~~g~~vP~~~~~~ii~~~~-~~s~l~~~~~~i~~~~~~~~~p~~~~~~~----a~wv~Eg~~~~~~~~~f~~v~ 75 (311) T protein:vir:99 1 MATFGTGNLKNLPRNIADGMVKDVV-QGSTVAVLSARKPQRFGNEDIITFNGRPK----AEFVGEGQQKSSTTGEFDFVT 75 (311) T ss_pred CceecCCCceeccHHHHHHHHHHHH-hhchhhhhcceeeccCCceEEEEEeCCce----eEEeecCcccccccceeeEEE Confidence 8752 2344444343332222 22334566788888877788888743211 123455555555555666656 Q ss_pred eeeeccchhhcCCHHHHHHH-hhcCCHHHHHHHHHHHHHHHHHHHHHHHHhh--ccccc-CcccceecccccccCC---C Q lcl|Aclame:pro 76 GSTEDHGLDAPVPQADIDNA-PTNYNPLGHATEQTTNLILLDREARTSKLVF--SPNSY-AAGNKTTLSGADQWSD---P 148 (309) Q Consensus 76 ~~~~e~~L~~~v~~~~~~~a-~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~--~~~~y-~~~~~~~lsgt~~Wsd---~ 148 (309) +..+..+-..++++|-.++. ++.++..+...+.+.+.|....|..+-...- .+... ...+....+ +....- . T Consensus 76 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~-~~~~~~~~~~ 154 (311) T protein:vir:99 76 STPKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAA-SKRVELTADT 154 (311) T ss_pred EeeEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccc-cceeeccccc Confidence 65555555567777654432 2234445555556666666655544332211 01111 011111111 111211 2 Q ss_pred CCChHHHHHHHHHHh-----CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCH----HHHHHHhCCCeEEeecc Q lcl|Aclame:pro 149 TSNPLPVITDALDSV-----ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPM----AFLQELLELDAIYIGEA 219 (309) Q Consensus 149 ~sdPi~di~~~~~~~-----g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~----~~l~~l~gl~~I~v~~a 219 (309) ..++..||......+ ...+|..+|++++|..|+. + +..++.+ +..+ ..-..++|+| |++-+. T Consensus 155 ~~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~---l----kd~~G~~-l~~~~~~~~~~~~l~G~P-v~~s~~ 225 (311) T protein:vir:99 155 IANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLST---A----RYTDGRK-KFPELGLGIGVSSFEGID-ASVSDT 225 (311) T ss_pred cchhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHh---h----hccCCCe-eecCcccCCCCceeccee-eEeecc Confidence 234556777665533 4567889999999988854 2 2222221 1111 1123577887 333222 Q ss_pred eeeccccCCCc-ccceecCCcEEEEecCCCCCCcCcceeccccc---ccccccCCccccccccCCceEEEeecccceeee Q lcl|Aclame:pro 220 RLNIARPGQNP-NLIRAWGPHASFIYRDRLADTRNGTTFGLTAQ---WGDRVSGSIADPNIGLRGGQRVRVGESVKELVT 295 (309) Q Consensus 220 ~~~~~~~g~~~-~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~---~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~ 295 (309) .- ...+-.. ....+..+...+++.+-......+..-+.+.+ +++. .++ -.....+..-+|+.+.++=.|. T Consensus 226 i~--~~~~~~~~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~-~~~---~~~~~~d~~~~r~~~r~d~~v~ 299 (311) T protein:vir:99 226 VN--GGDEADPDDEDLDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDP-DGQ---GDLKRHNQIALRLEIVYGWYVF 299 (311) T ss_pred cc--cccccccccchhhccCcceEEEeeccccEEEEEecCceEEEeecCCC-Ccc---hhhhhcCcEEEEEEEeecceec Confidence 11 0000000 11111222222222211000000011111111 1000 000 0112334455777777776655 Q ss_pred cchhhhhhhcccc Q lcl|Aclame:pro 296 APDLGFFFENAVA 308 (309) Q Consensus 296 ~~~~G~l~~~~va 308 (309) - ++..-+++++| T Consensus 300 ~-~~~v~~~~~~A 311 (311) T protein:vir:99 300 T-DRFVVIENAVA 311 (311) T ss_pred C-hhHeeeecccC Confidence 4 57778889988 No 48 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=95.46 E-value=0.002 Score=35.34 Aligned_cols=264 Identities=13% Similarity=0.026 Sum_probs=120.8 Q ss_pred CC-----CCCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCccc Q lcl|Aclame:pro 1 MS-----NAPFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDET 75 (309) Q Consensus 1 m~-----~~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~ 75 (309) +. ....++..+.+.|...-+...-|.. +++.+|+.....+++..... .....-++.++........+...+ T Consensus 136 ~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~-~~~~~~~~~~~~~~~~~~~~---~~~a~~v~E~~~~~~~~~~f~~v~ 211 (418) T protein:vir:10 136 VGSGVSGSNSLVVADRQAGIIAPPQRKMTIRD-LLMPGQTSSSSIEYTVETGF---TNNAAAVAEGAQKPTSDLKFNLKN 211 (418) T ss_pred ccCCCCCCccccchhHHHHHHHHHhhhhhHHh-hcceeeccCCceeEEEEecC---CCceeeeccCccccccccceeeEE Confidence 11 1124444544444433333333333 46788887777778775331 111122344555445555666666 Q ss_pred eeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecc--c--ccccCCCCCC Q lcl|Aclame:pro 76 GSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLS--G--ADQWSDPTSN 151 (309) Q Consensus 76 ~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~ls--g--t~~Wsd~~sd 151 (309) ..++..+-..+++++-.++.. +....-.+.+.+.+....|. .++++..-+..-...++ + +...+..+.+ T Consensus 212 ~~~~k~~~~~~is~ell~ds~---~l~~~i~~~l~~a~~~~~d~----a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~ 284 (418) T protein:vir:10 212 QPVRTIAHLFKASRQILDDAP---ALQSYIDGRARYGLQLTEEG----QILKGDGTGANILGILPQASAFMPSITLANAT 284 (418) T ss_pred EeeeeEEEeehhhHHHHHhHH---HHHHHHHHHHHHHHHHHHHH----HHhccCCCCccccccccccccccccccccccc Confidence 666666655677777655432 33444444555555544443 33332211000011111 1 1123345566 Q ss_pred hHHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccC---HHHHHHHhCCCeEEeecceeeccc Q lcl|Aclame:pro 152 PLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVP---MAFLQELLELDAIYIGEARLNIAR 225 (309) Q Consensus 152 Pi~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt---~~~l~~l~gl~~I~v~~a~~~~~~ 225 (309) .+.+|.+++..+ +..++.++|++.+|..|.. ++ ..++.. +.. ...-..++|+| |++.+.. T Consensus 285 ~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~---lk----d~~G~~-i~~~~~~~~~~~l~G~p-V~~~~~~----- 350 (418) T protein:vir:10 285 PIDKIRLALLQAVLAEFPATGIVLNPIDWASIEL---TK----DSQGRY-IVGNPVNGTTPRLWNLP-VVETQAM----- 350 (418) T ss_pred cHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHH---hh----cCCCce-eccccccCCCceeccee-eEEcCCC----- Confidence 788888876554 6778899999999988753 22 222211 110 01113466776 4432211 Q ss_pred cCCCcccceecCC--cEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhh Q lcl|Aclame:pro 226 PGQNPNLIRAWGP--HASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFF 303 (309) Q Consensus 226 ~g~~~~~~~v~~~--~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~ 303 (309) +... -++++ ..++++.. . |.+.++.... ...-..+...+|+...++-.+.-+++-.++ T Consensus 351 p~~~----~~~gd~s~~~~~~~~------~----~~~i~~~~~~------~~~f~~~~~~~r~~~~~d~~~~~~~a~~~~ 410 (418) T protein:vir:10 351 TANE----FLVGAFSMAAQIFDR------M----EIEVLLSTEN------VDDFEKNMVSIRAEERLALAVYRPESFVTG 410 (418) T ss_pred CCCc----EEEeeccceEEEEEe------c----ceEEEEeccc------chhhhcCceEEEEEEeeccEEecccceEEE Confidence 1111 12222 11222110 0 1111111100 001123445567767777666666665555 Q ss_pred hccccC Q lcl|Aclame:pro 304 ENAVAA 309 (309) Q Consensus 304 ~~~va~ 309 (309) +-..++ T Consensus 411 ~~~~~~ 416 (418) T protein:vir:10 411 ALVEQA 416 (418) T ss_pred EeccCC Confidence 444444 No 49 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=95.33 E-value=0.0023 Score=35.07 Aligned_cols=269 Identities=13% Similarity=0.073 Sum_probs=124.5 Q ss_pred CC------CC-CCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhccccccccc-ccCcC Q lcl|Aclame:pro 1 MS------NA-PFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEV-EFSAT 72 (309) Q Consensus 1 m~------~~-~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~v-e~~~~ 72 (309) ++ +. .-++......|...-+...-|. .++..+|+....+++++..... .....-++.++..... ...+. T Consensus 120 ~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~-~~~~~~~~~~~~~~~~~~~~~~--~~~~~~v~Eg~~~~~~~~~~~~ 196 (415) T protein:vir:46 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLD-KYVTVKRVTNGSGKYPVVRQSE--VAALEKVEELEENPELAVKPFF 196 (415) T ss_pred hhccccccCCcccccHHHHHHHHHHHHhhhhhh-hhcceeeccCCceeEEEEEecC--Ccceeeccccccccccccccee Confidence 10 11 1233333333322212222222 2345667777777776653211 1111223444444433 23455 Q ss_pred ccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCCh Q lcl|Aclame:pro 73 DETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNP 152 (309) Q Consensus 73 ~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sdP 152 (309) ..++..+..+-..+|+++-..+ +.++......+.+.+.+....+..+-...-++....... ........+...+.+. T Consensus 197 ~v~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~-~~~~~~~~~~~~~~~~ 273 (415) T protein:vir:46 197 QLAYDINTHRGYFRISREAIED--AKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSS-GFEKEGKKLEVKKAKS 273 (415) T ss_pred eEEeeeeeeEeeehhhHHHHhh--chHHHHHHHHHHHHHHHHHHHHHHHhhccccCCcccccc-ccccccceeccccccc Confidence 5555555555555666665543 335556666677777776666655544332222211111 1111122344556677 Q ss_pred HHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCH----HHHHHHhCCCeEEeecceeeccc Q lcl|Aclame:pro 153 LPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPM----AFLQELLELDAIYIGEARLNIAR 225 (309) Q Consensus 153 i~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~----~~l~~l~gl~~I~v~~a~~~~~~ 225 (309) +.+|.+.+..+ +..++..+|+++.|.+|+. + +..++.. +..+ ..-..++|+| |++.+... T Consensus 274 ~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~---l----kd~~G~~-i~~~~~~~~~~~~l~G~p-V~~~~~~~---- 340 (415) T protein:vir:46 274 LDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDK---M----KDKLGNY-LIQPDVKEKTQQRLLGAK-IEILPDEV---- 340 (415) T ss_pred hHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHH---h----hccCCCe-eeccCcCCCCCcccccee-eEEecccc---- Confidence 77777776554 6788999999999998853 2 2222222 1111 1123567877 44433221 Q ss_pred cCCCcccceecCCc--EEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhh Q lcl|Aclame:pro 226 PGQNPNLIRAWGPH--ASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFF 303 (309) Q Consensus 226 ~g~~~~~~~v~~~~--~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~ 303 (309) .+..++..-++++- .++++.. =|.+.++.. + ......+|+...++-.+.-+++-.++ T Consensus 341 ~~~~~~~~~~~gd~~~~~~~~~~----------~~~~v~~~~------~-----~~~~~~~~~~~r~d~~v~~~~a~~~~ 399 (415) T protein:vir:46 341 LGQKGNNTLIIGNLKDAIVLFDR----------SQYQASWTD------Y-----MHFGECLMIAVRQDCRILDYKSAIVI 399 (415) T ss_pred ccCCCccEEEEEehhccEEEEee----------cceEEEeec------c-----ccCceEEEEEEEeccEEeccccEEEE Confidence 11111112233321 1111110 012222211 1 11223456677777777777777776 Q ss_pred hccccC Q lcl|Aclame:pro 304 ENAVAA 309 (309) Q Consensus 304 ~~~va~ 309 (309) +-.-++ T Consensus 400 ~~~~~~ 405 (415) T protein:vir:46 400 EYDDSE 405 (415) T ss_pred EeeccC Confidence 654444 No 50 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=95.33 E-value=0.0023 Score=35.07 Aligned_cols=269 Identities=13% Similarity=0.073 Sum_probs=124.5 Q ss_pred CC------CC-CCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhccccccccc-ccCcC Q lcl|Aclame:pro 1 MS------NA-PFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEV-EFSAT 72 (309) Q Consensus 1 m~------~~-~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~v-e~~~~ 72 (309) ++ +. .-++......|...-+...-|. .++..+|+....+++++..... .....-++.++..... ...+. T Consensus 120 ~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~-~~~~~~~~~~~~~~~~~~~~~~--~~~~~~v~Eg~~~~~~~~~~~~ 196 (415) T protein:vir:47 120 QGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLD-KYVTVKRVTNGSGKYPVVRQSE--VAALEKVEELEENPELAVKPFF 196 (415) T ss_pred hhccccccCCcccccHHHHHHHHHHHHhhhhhh-hhcceeeccCCceeEEEEEecC--Ccceeeccccccccccccccee Confidence 10 11 1233333333322212222222 2345667777777776653211 1111223444444433 23455 Q ss_pred ccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCCh Q lcl|Aclame:pro 73 DETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNP 152 (309) Q Consensus 73 ~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sdP 152 (309) ..++..+..+-..+|+++-..+ +.++......+.+.+.+....+..+-...-++....... ........+...+.+. T Consensus 197 ~v~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~-~~~~~~~~~~~~~~~~ 273 (415) T protein:vir:47 197 QLAYDINTHRGYFRISREAIED--AKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSS-GFEKEGKKLEVKKAKS 273 (415) T ss_pred eEEeeeeeeEeeehhhHHHHhh--chHHHHHHHHHHHHHHHHHHHHHHHhhccccCCcccccc-ccccccceeccccccc Confidence 5555555555555666665543 335556666677777776666655544332222211111 1111122344556677 Q ss_pred HHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCH----HHHHHHhCCCeEEeecceeeccc Q lcl|Aclame:pro 153 LPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPM----AFLQELLELDAIYIGEARLNIAR 225 (309) Q Consensus 153 i~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~----~~l~~l~gl~~I~v~~a~~~~~~ 225 (309) +.+|.+.+..+ +..++..+|+++.|.+|+. + +..++.. +..+ ..-..++|+| |++.+... T Consensus 274 ~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~---l----kd~~G~~-i~~~~~~~~~~~~l~G~p-V~~~~~~~---- 340 (415) T protein:vir:47 274 LDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDK---M----KDKLGNY-LIQPDVKEKTQQRLLGAK-IEILPDEV---- 340 (415) T ss_pred hHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHH---h----hccCCCe-eeccCcCCCCCcccccee-eEEecccc---- Confidence 77777776554 6788999999999998853 2 2222222 1111 1123567877 44433221 Q ss_pred cCCCcccceecCCc--EEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhh Q lcl|Aclame:pro 226 PGQNPNLIRAWGPH--ASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFF 303 (309) Q Consensus 226 ~g~~~~~~~v~~~~--~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~ 303 (309) .+..++..-++++- .++++.. =|.+.++.. + ......+|+...++-.+.-+++-.++ T Consensus 341 ~~~~~~~~~~~gd~~~~~~~~~~----------~~~~v~~~~------~-----~~~~~~~~~~~r~d~~v~~~~a~~~~ 399 (415) T protein:vir:47 341 LGQKGNNTLIIGNLKDAIVLFDR----------SQYQASWTD------Y-----MHFGECLMIAVRQDCRILDYKSAIVI 399 (415) T ss_pred ccCCCccEEEEEehhccEEEEee----------cceEEEeec------c-----ccCceEEEEEEEeccEEeccccEEEE Confidence 11111112233321 1111110 012222211 1 11223456677777777777777776 Q ss_pred hccccC Q lcl|Aclame:pro 304 ENAVAA 309 (309) Q Consensus 304 ~~~va~ 309 (309) +-.-++ T Consensus 400 ~~~~~~ 405 (415) T protein:vir:47 400 EYDDSE 405 (415) T ss_pred EeeccC Confidence 654444 No 51 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=95.18 E-value=0.0026 Score=34.79 Aligned_cols=266 Identities=10% Similarity=0.009 Sum_probs=119.4 Q ss_pred CCC------CCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccc-cCcCc Q lcl|Aclame:pro 1 MSN------APFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVE-FSATD 73 (309) Q Consensus 1 m~~------~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve-~~~~~ 73 (309) |+. ...+|..+.+.|-..-++..-| ..+++.+|+.....++|+...... ..-++.++...... ..+.. T Consensus 106 ~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l-~~~~~~~~~~~~~~~~~~~~~~~~----a~~v~E~~~~~~~~~~~f~~ 180 (407) T protein:vir:48 106 LQVGNDEDGGYAIPEELDRTILTLLKDEVVM-RQEATVITLGGSDYKKLVNLGGTT----SGWVGETDARPETATSKLGL 180 (407) T ss_pred hhcccCCCCcccccHhHHHHHHHHHHhhhhh-hhhceeeecCCCceEEEEecCCcc----eeeeccccccccccccccee Confidence 221 1234444333333222222222 234566777777777776432111 11123333332222 23444 Q ss_pred cceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceeccccc--------cc Q lcl|Aclame:pro 74 ETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGAD--------QW 145 (309) Q Consensus 74 ~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~--------~W 145 (309) .++..+..+-..+++++-..+ +.+|....-.+.+.+.+....|..+ +++..- ......|+... .| T Consensus 181 i~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~l~~~i~~~~~~a~----l~G~G~-~~p~Gil~~~~~~~~~~~~~~ 253 (407) T protein:vir:48 181 IEPFMGEIYGNPQATQKMLDD--AFFNVEDWINSELALEFAEQEEIAF----TSGDGS-KKPKGFLAYESTDEDDKTRAF 253 (407) T ss_pred EEeeeeeeEeehhhHHHHHhc--chHHHHHHHHHHHHHHHHHHHHhhh----hccCCC-Cccceeeeccccccccccccc Confidence 455555555455666665543 3455556666667776655555432 222100 01111111111 11 Q ss_pred CC--------CCCChHHHHHHHHHHh--CCCCc-EEEeCHHHHHHHhcCHHHHHHhccCCCcccccCH----HHHHHHhC Q lcl|Aclame:pro 146 SD--------PTSNPLPVITDALDSV--ILRPN-IGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPM----AFLQELLE 210 (309) Q Consensus 146 sd--------~~sdPi~di~~~~~~~--g~~Pn-~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~----~~l~~l~g 210 (309) .. ++.--..+|.+....+ ..++| ..+|++.+|..|.. ++..++.+ ++.+ ..-..+|| T Consensus 254 ~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a~~v~n~~~~~~L~~-------lkD~~Gr~-l~~~~~~~g~~~~l~G 325 (407) T protein:vir:48 254 GKLQHIASGAASGVTADAIIKLIYTLRKAHRSGAKFMMNNSSLFAIRL-------LKDNDGNY-LWRPGIELGQPSSLAG 325 (407) T ss_pred ccccccccccccccChHHHHHHHHhhchhhhcCCEEEEcHHHHHHHHH-------hhccCCce-eeccCcCCCCCceecc Confidence 11 1111234455554444 33444 57999999987654 22222222 1111 12234677 Q ss_pred CCeEEeecceeeccccCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeeccc Q lcl|Aclame:pro 211 LDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESV 290 (309) Q Consensus 211 l~~I~v~~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~ 290 (309) .| |++-+.. +. +-.++..++|.+- +.+|. ..++.+..+..+.+...+...+++.+.+ T Consensus 326 ~P-V~~~~~~-----p~-------~~~~~~~i~~Gd~--------~~~~~--i~~~~~~~i~~d~~~~~~~~~~~~~~r~ 382 (407) T protein:vir:48 326 YG-IVENEQM-----PD-------IAADAKAIAFGNF--------KRGYT--IVDRIGTRILRDPYTNKPFVGFYTTKRT 382 (407) T ss_pred ee-eEEecCc-----CC-------ccCCccEEEEEec--------cccEE--EEEeeceEEEeeccccCCcEEEEEEEEe Confidence 76 4433221 10 0011111222210 11111 1112222222233444666778999999 Q ss_pred ceeeecchhhhhhhccccC Q lcl|Aclame:pro 291 KELVTAPDLGFFFENAVAA 309 (309) Q Consensus 291 ~~~v~~~~~G~l~~~~va~ 309 (309) +-.++-+++-.+++.+.|+ T Consensus 383 d~~v~~~~a~~~l~~~aa~ 401 (407) T protein:vir:48 383 GGMLVDSQAIKLMKIGAAT 401 (407) T ss_pred ccEEecccceEEEEeeccC Confidence 9999999999999888888 No 52 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=94.88 E-value=0.0033 Score=34.21 Aligned_cols=269 Identities=14% Similarity=0.075 Sum_probs=119.7 Q ss_pred CCCCCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhccccccccc-ccCcCccceeee Q lcl|Aclame:pro 1 MSNAPFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEV-EFSATDETGSTE 79 (309) Q Consensus 1 m~~~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~v-e~~~~~~~~~~~ 79 (309) -.....++..+...+-..-++..-| ..+++.++|....+++++..... .....-++.++..... ...+...++..+ T Consensus 127 ~~g~~~iP~~~~~~ii~~~~~~~~l-~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~ 203 (415) T protein:vir:94 127 DSGFVVIPEEIVTDILKLKEVEFNL-DKYVTVKRVTNGSGKYPVVRQSE--VAALEKVEELEENPELAVKPFFQLAYDIN 203 (415) T ss_pred ccccccCcHHHHHHHHHHHHhhhhh-hhhcceeeccCCceeEEEEeecC--CccceeccccccccccccccceeeEeehe Confidence 0111234433444433332222223 23356667776667665542211 1111123334443322 233444455554 Q ss_pred ccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCChHHHHHHH Q lcl|Aclame:pro 80 DHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPLPVITDA 159 (309) Q Consensus 80 e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sdPi~di~~~ 159 (309) ..+-..+++++-..+ +.++......+.+.+.+....+..+-...-++..-+ ........+..++..+...+.+|.+. T Consensus 204 k~~~~~~is~ell~d--s~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~-~~~~~~~~~~~~~~~~~~~~~~i~~~ 280 (415) T protein:vir:94 204 THRGYFRISREAIED--AKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGS-TSSGFEKEGKKLEVKKAKSLDDIKDA 280 (415) T ss_pred eeeeechhhHHHHhh--chHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccc-ccccccccccccccccccchHHHHHH Confidence 444444566654443 335555666666777766666554443322221111 01111112223444455667777777 Q ss_pred HHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHH----HHHHHhCCCeEEeecceeeccccCCCccc Q lcl|Aclame:pro 160 LDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMA----FLQELLELDAIYIGEARLNIARPGQNPNL 232 (309) Q Consensus 160 ~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~----~l~~l~gl~~I~v~~a~~~~~~~g~~~~~ 232 (309) +..+ ++.++..+|+++.|.+|+. + +..++++ ++.+. .-..++|+| |++.+... .+..++. T Consensus 281 ~~~~~~~~~~~~~~vmn~~~~~~l~~---l----kd~~G~~-l~~~~~~~~~~~~l~G~p-V~~~~~~~----~~~~~~~ 347 (415) T protein:vir:94 281 INLNVKPNYEHNVAIVSQTMFAKLDK---M----KDKLGNY-LIQPDVKEKTQQRLLGAK-IEILPDEV----LGQKGNN 347 (415) T ss_pred HHhhhhhccCCCEEEEcHHHHHHHHH---h----hccCCCe-eeccCcCCCCCceeccee-eEEecccc----cCCCCcc Confidence 6554 6789999999999998854 2 2222222 11111 123466777 44433321 1111111 Q ss_pred ceecCC--cEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhccccC Q lcl|Aclame:pro 233 IRAWGP--HASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVAA 309 (309) Q Consensus 233 ~~v~~~--~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va~ 309 (309) .-++++ +.++++.. =|.+.++... ......+|+...++-.+.-+++-..++-.-++ T Consensus 348 ~i~~gd~~~~~~~~~~----------~~~~v~~~~~-----------~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~ 405 (415) T protein:vir:94 348 TLIIGNLKDAIVLFDR----------SQYQASWTDY-----------MHFGECLMIAVRQDCRILDYKSAIVIEYDDSE 405 (415) T ss_pred EEEEEehhccEEEEee----------cceEEEEecc-----------ccCceEEEEEEEeccEEeccccEEEEEEeccC Confidence 122232 11111110 1222222211 11123356666667777777777776554444 No 53 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=94.73 E-value=0.0036 Score=33.96 Aligned_cols=260 Identities=10% Similarity=0.021 Sum_probs=120.5 Q ss_pred CCC-----CCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCccc Q lcl|Aclame:pro 1 MSN-----APFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDET 75 (309) Q Consensus 1 m~~-----~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~ 75 (309) +.. ..+++......+...-+...-|.+ +++.+|+.....++++..... ....-++.++........+...+ T Consensus 114 ~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~-~~~~~~~~~~~~~~~~~~~~~---~~a~~v~Eg~~~~~~~~~~~~i~ 189 (390) T protein:vir:81 114 STDAAGSAGALTTPNRLPGFITPPDARLTVRD-LIGSGRTDSALIEYVQETGFV---NNAAIVAEGALKPESSLKFAKKT 189 (390) T ss_pred ccccccCCcceechhhhHHHHHHHhhhhhhhh-hcceeeccCCceEEEEEecCC---cceeeecCCcccccccceeeEEE Confidence 111 123333344444433333333433 467788887777888874321 11123455565555666677777 Q ss_pred eeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceec--ccccccC--CCCCC Q lcl|Aclame:pro 76 GSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTL--SGADQWS--DPTSN 151 (309) Q Consensus 76 ~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~l--sgt~~Ws--d~~sd 151 (309) +..+..+-..+++++-..++. +....-.+.+.+.+....+. .++++..-+...+..+ ++...+. ..+.+ T Consensus 190 ~~~~k~~~~~~is~ell~d~~---~~~~~i~~~l~~~~~~~~d~----a~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~ 262 (390) T protein:vir:81 190 DTTHVIAHTMKATRQILSDAP---QLASYMNNRLIRGLKVKEDA----EILRGTGANDGLLGLIPQATTYAAPTTIAGAT 262 (390) T ss_pred EeeeEEEEeehhhHHHHHhHH---HHHHHHHHHHHHHHHHHHHH----HHHhcCCCCCcccceeecccccccccccccch Confidence 777777766777777555432 23444444555555554443 2333221111111111 1122222 22345 Q ss_pred hHHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCH---HHHHHHhCCCeEEeecceeeccc Q lcl|Aclame:pro 152 PLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPM---AFLQELLELDAIYIGEARLNIAR 225 (309) Q Consensus 152 Pi~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~---~~l~~l~gl~~I~v~~a~~~~~~ 225 (309) ++.+|.+.+..+ +..++..+|++++|.+|+. ++ ..++.. +..+ ..-..+||+| |++.+.. T Consensus 263 ~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~---lk----d~~G~~-l~~~~~~~~~~~l~G~p-v~~~~~~----- 328 (390) T protein:vir:81 263 RVDQLRLAMLQASLAEYNPSGIVINPIDWAAIEL---AK----DANNQY-LIGNARGTLTPTLWGLP-VVATQAM----- 328 (390) T ss_pred hHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHH---hh----cCCCce-eecCcccccCceeccee-eEEcCCC----- Confidence 667777765544 7788999999999988764 22 111111 1100 0112467777 4432221 Q ss_pred cCCCcccceecCCc--EEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhh Q lcl|Aclame:pro 226 PGQNPNLIRAWGPH--ASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFF 303 (309) Q Consensus 226 ~g~~~~~~~v~~~~--~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~ 303 (309) +.. + -++++. .++++.. -|.+.++.. ....-..+...+|+...++-.+.-+++-..+ T Consensus 329 p~~--~--~~~gd~~~~~~~~~~----------~~~~v~~~~-------~~~~~~~~~v~~r~~~r~d~~v~~~~a~v~~ 387 (390) T protein:vir:81 329 APG--E--FLVGAFDLAAQIFDQ----------WDARVEIGY-------VGEDFQRNMITVLAEERLALVVYRPEALISG 387 (390) T ss_pred CCC--c--EEEEehhceEEEEEe----------cceEEEEec-------ccchhhcCcEEEEEEEeeccEEecccceEEE Confidence 100 0 111211 1111110 011111110 0001123444577777777777777666555 Q ss_pred hcc Q lcl|Aclame:pro 304 ENA 306 (309) Q Consensus 304 ~~~ 306 (309) +=+ T Consensus 388 t~a 390 (390) T protein:vir:81 388 SFA 390 (390) T ss_pred EeC Confidence 444 No 54 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=94.71 E-value=0.0014 Score=36.29 Aligned_cols=267 Identities=14% Similarity=0.123 Sum_probs=109.5 Q ss_pred CCC---CCCCcchhhHHHHHh---hcchhhhhhhhCCcccccccc---ceeEEechhHhhhchhHhhcccc-cccccccC Q lcl|Aclame:pro 1 MSN---APFPIDPELTAIAIA---YRNGRMISDEVLPRVPVGKQE---FKFWKYDLAQGFTVPETLVGRKS-KPNEVEFS 70 (309) Q Consensus 1 m~~---~~f~~dp~LT~~a~~---y~n~~~ig~~lfP~v~v~~~~---~k~~~~~~~~~f~~~~t~~~~~~-~~~~ve~~ 70 (309) |-+ ++|.... |+.+=-. -..+.+.+..|||.......+ ..|.+++..-... ..+-++ ....+... T Consensus 1 ~~~~~~g~f~~~~-l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~----~~~~~~~dip~~~~~ 75 (301) T protein:vir:80 1 MQGKITATIEARD-LQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAK----IIANGADDLPLVDVD 75 (301) T ss_pred CCccccchhhHHH-HHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEE----EecCccccccccccc Confidence 543 4554422 3333222 123568899999965333333 3444443321111 111111 12334444 Q ss_pred cCccceeeeccchhhcCCHHHHHHHh-hcCCHHHHHHHHHHHHHHHHHHHHHHHHhhccc----ccCc---c------cc Q lcl|Aclame:pro 71 ATDETGSTEDHGLDAPVPQADIDNAP-TNYNPLGHATEQTTNLILLDREARTSKLVFSPN----SYAA---G------NK 136 (309) Q Consensus 71 ~~~~~~~~~e~~L~~~v~~~~~~~a~-~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~----~y~~---~------~~ 136 (309) ...+...+...+....+..+|...++ .+++...+......+.+...+. ++++.+. .|+- . +. T Consensus 76 ~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n----~~~f~G~~~~g~~GLlN~p~~~~~~~~ 151 (301) T protein:vir:80 76 MVRKSVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKEN----SIAFRGEKKYAIKGAFEATGIQIDVSP 151 (301) T ss_pred ceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhc----eEEeeecccccceeeecCCCccccccc Confidence 55555566666666666677776654 4566555544444444433322 3333331 1110 0 11 Q ss_pred ee-cccccccCCCCCC-hHHHHHHHHHHh-----C-CCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHH Q lcl|Aclame:pro 137 TT-LSGADQWSDPTSN-PLPVITDALDSV-----I-LRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQEL 208 (309) Q Consensus 137 ~~-lsgt~~Wsd~~sd-Pi~di~~~~~~~-----g-~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l 208 (309) .+ ..+..+|...+.+ .+.||..+..++ | ..|++++|+++.+..|.+- ++. .+ .+.--.+.|++- T Consensus 152 ~~~~~~~~~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~-----~~~-~~--~~~tvl~~l~~~ 223 (301) T protein:vir:80 152 TTGVGNVSKWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINKK-----RYS-NE--DSRSVLKVLQDN 223 (301) T ss_pred CcccccccccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhhc-----ccc-CC--CCeeHHHHHHHH Confidence 11 2234578887765 688999887765 4 3799999999999988521 110 01 122223444432 Q ss_pred hCCCeEEeecceeeccccCCCcccceecCCcEEEEecCCCCCCcCcceecccccc--cccccCCccccccccCCceEEEe Q lcl|Aclame:pro 209 LELDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQW--GDRVSGSIADPNIGLRGGQRVRV 286 (309) Q Consensus 209 ~gl~~I~v~~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~--~~~~~~~~~d~~~g~~g~~~v~v 286 (309) +..-+|+- -..+..+ |.. +.+.+++|..... +. ..-++--++. -.+.+.....+++..-+|-.|+ T Consensus 224 ~~~~~I~~-~p~L~~~--g~~-------g~~~~v~~~~~~d-~~-~~~v~~~~~~~~~e~~~~~~~~~~~~r~~Gv~i~- 290 (301) T protein:vir:80 224 AWFSAIVR-VPDLAGM--GTA-------GSDSFAVIHDSNE-TA-ELIIPMDITRHPEEYSFPRTKVPFEERTAGVVVR- 290 (301) T ss_pred cCcceEEE-cceeccC--CCC-------cccEEEEEecCCc-EE-EEEecCceeeecceecCceeEeeeeeeeEEEEEE- Confidence 21111211 0111111 110 2233444432111 00 0001100000 0011112222232323332222 Q ss_pred ecccceeeecchhhhhhhcc Q lcl|Aclame:pro 287 GESVKELVTAPDLGFFFENA 306 (309) Q Consensus 287 ~~~~~~~v~~~~~G~l~~~~ 306 (309) -|.+-+.+.+. T Consensus 291 ---------~P~ai~~~~GI 301 (301) T protein:vir:80 291 ---------FPAAIVRVDGI 301 (301) T ss_pred ---------ccceEEEEecC Confidence 12222222222 No 55 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=94.68 E-value=0.0038 Score=33.88 Aligned_cols=259 Identities=11% Similarity=0.026 Sum_probs=121.5 Q ss_pred CCCC-CCCcc---hh-hHHHHHhhcchhhhhhhh-CCccc----cccccceeEEechhHhhhchhHhhcccccccccccC Q lcl|Aclame:pro 1 MSNA-PFPID---PE-LTAIAIAYRNGRMISDEV-LPRVP----VGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFS 70 (309) Q Consensus 1 m~~~-~f~~d---p~-LT~~a~~y~n~~~ig~~l-fP~v~----v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~ 70 (309) |++. +...| |. ++++ .+.++..-.+ .+.+. ...+.+....|++-... -+.+....+.....-+++ T Consensus 1 ma~~~T~~~d~iiPev~~~~----v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~-gda~~~~eg~~i~~~~lt 75 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPI----VSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYI-GDAADVAEGGEISLDKIG 75 (272) T ss_pred CCCcceehhhhhchHHHHHH----HHHHHHhhhhhccccccccccccCCCCEEEEeeeccC-ccccccCCCCccChhhcC Confidence 9973 44333 32 3333 3333322211 12111 12223333333321122 223345555555555667 Q ss_pred cCccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCC Q lcl|Aclame:pro 71 ATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTS 150 (309) Q Consensus 71 ~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~s 150 (309) ..+.+..++.++-...+++++ ..+...||...+.+.+...+....+..+...+... ..+.++. .+ T Consensus 76 ~~~~~~~i~~~~k~~~vtD~~--~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~-------~~~~~~~-----~~- 140 (272) T protein:vir:36 76 TTTKSVTIKKAAKGTEITDEA--ALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTT-------SQTVSTK-----AN- 140 (272) T ss_pred CcceeEeeehhhccccccHHH--HhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccc-------ccccccc-----cc- Confidence 777777777766555555544 44567789888888888888777776655443221 1111111 12 Q ss_pred ChHHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeeccccC Q lcl|Aclame:pro 151 NPLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPG 227 (309) Q Consensus 151 dPi~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~~~~g 227 (309) ...|.++...+ +..++.++++++++..|++++.+...-... ++ +++-.-++..++|++ |++-+..= .+ T Consensus 141 --~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~-~~-~~~~~G~ig~~~G~~-Vv~s~~~p----~~ 211 (272) T protein:vir:36 141 --VDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEV-GA-NALINGTYADVLGAQ-IVRSKKLA----EG 211 (272) T ss_pred --HHHHHHHHHHhhhcCCCceEEEEcHHHHHHHhcccccccccccc-cc-cceeeeccceecCee-EEEeCCCC----CC Confidence 33566666665 456889999999999999887655543221 11 122223456778876 55543311 11 Q ss_pred CCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhccc Q lcl|Aclame:pro 228 QNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAV 307 (309) Q Consensus 228 ~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~v 307 (309) .......++..+++ |+ +.- ..-.++...-...+...++....+--.|+-+++-..++++- T Consensus 212 ~~~~~~~~~~~gA~----------------~~-~~~---~~~~vE~~R~~~~~~d~i~~~~~y~~~v~~~~~vv~~t~~g 271 (272) T protein:vir:36 212 SALMFKIVSNSPAL----------------KL-VLK---RGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTG 271 (272) T ss_pred ceeEEEEEecccce----------------ee-eec---CCcccccccchhhcCcEEEEEEEEEEEEEcCccEEEEeecC Confidence 11111122222222 21 100 00011111111233445555555544444444433332221 Q ss_pred c Q lcl|Aclame:pro 308 A 308 (309) Q Consensus 308 a 308 (309) - T Consensus 272 ~ 272 (272) T protein:vir:36 272 V 272 (272) T ss_pred C Confidence 1 No 56 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=94.67 E-value=0.0037 Score=33.89 Aligned_cols=286 Identities=10% Similarity=-0.048 Sum_probs=118.2 Q ss_pred CCCC--CCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHh--hhch--hHhhcccccccccccCcCcc Q lcl|Aclame:pro 1 MSNA--PFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQG--FTVP--ETLVGRKSKPNEVEFSATDE 74 (309) Q Consensus 1 m~~~--~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~--f~~~--~t~~~~~~~~~~ve~~~~~~ 74 (309) |... .-++..+.+.|-. .-...-+-.++++.+|+.....++|+...... +..+ ....+.++........+.+. T Consensus 20 ~~~~~~~liP~~~~~~ii~-~l~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~eg~~~~~~e~~~~~~~~~~f~~i 98 (333) T protein:vir:78 20 LAHVPSDLLPKEIVGPIFD-KAQESSLVLRMGEQIPISYGETIIPTTVKRPEVGQVGVGTSNEQREGGLKPLSGTAWDTR 98 (333) T ss_pred eecCCccccchhHHHHHHH-HHHhhchhhhhcceeeccCCceEEEEEeCCceeEeecCcccccccccccccccccceeEE Confidence 1111 1223333333322 21222233456778888877778887744211 1100 00111112222223333333 Q ss_pred ceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccC--c-----ccce---ecccccc Q lcl|Aclame:pro 75 TGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYA--A-----GNKT---TLSGADQ 144 (309) Q Consensus 75 ~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~--~-----~~~~---~lsgt~~ 144 (309) +...+.-+-..+++++-.++ +.++..+...+.+.+.+.+..|..+ +++..-. . .+.. +.++... T Consensus 99 ~l~~~kl~~~~~is~ell~~--s~~~~~~~i~~~la~ai~~~~d~~~----l~G~g~~~~~~~~g~~~~~~~~~~~~~~~ 172 (333) T protein:vir:78 99 SVSPIKLATIVTVSEEFARM--NPSGLYTKLQGDLAYAIGRGIDLAV----FHGKSPLTGSALQGIDTDNVIANTTNVDY 172 (333) T ss_pred EEeeEEEEEeehhhHHHHhc--CHHHHHHHHHHHHHHHHHHHHHHHH----hcccCCCCCcccccccccccccccccccc Confidence 43333333344555554432 2344455555566666666655443 2221110 0 0000 1111111 Q ss_pred cCCCCCChHHHHHHHHHHh----CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcc---cccCHHHHHHHhCCCeEEee Q lcl|Aclame:pro 145 WSDPTSNPLPVITDALDSV----ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDE---GMVPMAFLQELLELDAIYIG 217 (309) Q Consensus 145 Wsd~~sdPi~di~~~~~~~----g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~---~~vt~~~l~~l~gl~~I~v~ 217 (309) .......-+.+|.+....+ ...++..+|+++.|..|+.....++. ++.. ..+....-..++|+| |++. T Consensus 173 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~----~G~~i~~~~~~~~~~~~l~G~P-v~~~ 247 (333) T protein:vir:78 173 LQETGDPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRDA----NGNVDPSRINLAAQTGDVLGLP-AQFG 247 (333) T ss_pred cccccchhHHHHHHHHHhhccccccCceEEEEcchHHHHHHHHhhhcCC----CCceeecCccccCCCceeecee-eEEc Confidence 1222333466777776654 46688999999999998764433321 1111 001111124567876 4444 Q ss_pred cceeeccccCCCcccceecCCcEEEEecCCCCCCcCcceeccccccccc-----ccCCccccccccCCceEEEeecccce Q lcl|Aclame:pro 218 EARLNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDR-----VSGSIADPNIGLRGGQRVRVGESVKE 292 (309) Q Consensus 218 ~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~-----~~~~~~d~~~g~~g~~~v~v~~~~~~ 292 (309) +..-.........+..-++++-.-+++.... |.+.+.... ..+... .....+...+|+.+.++- T Consensus 248 ~~i~~~~~~~~~~~~~~~~gD~~~~~~g~~~---------~~~i~~~~~~~~~~~~~~~~--~~~~~~~v~~r~~~r~d~ 316 (333) T protein:vir:78 248 RAVGGDLGAAVDSKTRIIGGDFSQLKFGFAD---------EIRIKMSDTATLTDSGSATV--SMWQTNQIAILIEVTFGW 316 (333) T ss_pred cccCCCccccCCCccEEEEEecccEEEEEee---------ccEEEEecccccccccccee--ehhhcCcEEEEEEEEEcc Confidence 3321100000011111122211111111000 111111000 000000 111234455778888888 Q ss_pred eeecchhhhhhhccccC Q lcl|Aclame:pro 293 LVTAPDLGFFFENAVAA 309 (309) Q Consensus 293 ~v~~~~~G~l~~~~va~ 309 (309) .+.-+++-..|+++-|= T Consensus 317 ~v~~~~a~~~l~~~~a~ 333 (333) T protein:vir:78 317 LLGDKQAFVKFVDDEQP 333 (333) T ss_pred EEecccceEEEeccCCC Confidence 88888887777777666 No 57 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=94.60 E-value=0.0011 Score=36.88 Aligned_cols=258 Identities=13% Similarity=0.063 Sum_probs=97.6 Q ss_pred CCCC--------CCCcchhhHHHH---HhhcchhhhhhhhCCcc---ccccccceeEEechhHhhhchhHhhcccc-ccc Q lcl|Aclame:pro 1 MSNA--------PFPIDPELTAIA---IAYRNGRMISDEVLPRV---PVGKQEFKFWKYDLAQGFTVPETLVGRKS-KPN 65 (309) Q Consensus 1 m~~~--------~f~~dp~LT~~a---~~y~n~~~ig~~lfP~v---~v~~~~~k~~~~~~~~~f~~~~t~~~~~~-~~~ 65 (309) |.-. .|.+. .|+.|= +--.-+++.+.++||.. +-.-+++.|..|+..-... ..+-.+ ... T Consensus 17 ~~~~~~~~d~~~~fl~~-ql~~id~~v~e~~~~~~~~~~~i~v~~~~~~~~et~~~~~~e~~G~a~----~~~d~~~dip 91 (314) T protein:vir:10 17 EQMGVEKADAAGIWAVS-QLTAALNRAYEKEYAENSVVNIFPVTNEIPGHAKYFEYPEFDGVGIAQ----IIADYSDDLP 91 (314) T ss_pred HhhcccchhhhHHHHHH-HHHHHHHHHhhhhccccccceeeccccCCCCceeEEEeeeecccccee----eeCCcccccc Confidence 2111 23332 133221 11123457777888843 2223334444443211000 111111 123 Q ss_pred ccccCcCccceeeeccchhhcCCHHHHHHHh-hcCCHHHHHHHHHHHHHHHHHHHHHHHHhhccc----ccCc---ccce Q lcl|Aclame:pro 66 EVEFSATDETGSTEDHGLDAPVPQADIDNAP-TNYNPLGHATEQTTNLILLDREARTSKLVFSPN----SYAA---GNKT 137 (309) Q Consensus 66 ~ve~~~~~~~~~~~e~~L~~~v~~~~~~~a~-~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~----~y~~---~~~~ 137 (309) .++.....+...+...+.......+|...++ .+.+...+........++..+ =++++.+. .|+- .+.. T Consensus 92 ~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~----n~i~f~G~~~~g~~GLlN~p~v~ 167 (314) T protein:vir:10 92 LVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLL----DKLVWSGSAPHGIVSVFDQPNIN 167 (314) T ss_pred eeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhh----ceEEEeecccccceeEeecCCCc Confidence 4455556666666666666666666666553 355544443333333332222 22333321 1111 1122 Q ss_pred ecccccccCCCCCChHHHHHHHHHHh-----C-CCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHh-C Q lcl|Aclame:pro 138 TLSGADQWSDPTSNPLPVITDALDSV-----I-LRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELL-E 210 (309) Q Consensus 138 ~lsgt~~Wsd~~sdPi~di~~~~~~~-----g-~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~-g 210 (309) ..+++..|+.+ ...+.||.+...++ | ..|++++|++..+..|.+ ... + .+.-=.+.|++-. . T Consensus 168 ~~~~~~~WaT~-~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~~L~~------~~~--~--~~~tvl~~l~~n~~~ 236 (314) T protein:vir:10 168 NVVATPNWSVP-QNAIDDVTAMIDAVESSTQGLHHVTDILLPASARRVMQG------LVP--Q--TNLSYGELFTRNNPG 236 (314) T ss_pred cccCCCCcccH-HHHHHHHHHHHHHHHHhcCccccceeEEecHHHHHhhcc------ccc--C--CCccHHHHHHHhCCC Confidence 33455679633 24689999887665 3 579999999999988742 000 1 1111234444421 1 Q ss_pred CCeEEeecceeeccccCCC----------cccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCC Q lcl|Aclame:pro 211 LDAIYIGEARLNIARPGQN----------PNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRG 280 (309) Q Consensus 211 l~~I~v~~a~~~~~~~g~~----------~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g 280 (309) |+ |.- -..+.++..++. ..+.-..|-....++..+.+ . .+..-+..+.+|...- .. T Consensus 237 l~-I~~-~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~~e~~~-----~--~~~~~~~~r~~Gv~i~-----~P 302 (314) T protein:vir:10 237 LT-IRF-LQFLDNYDGAGGKAALAFEKSPLNMSIEIPEVTNVLPAQPKD-----L--HFRYPVTSKATGLIVY-----RP 302 (314) T ss_pred cE-EEE-cccccccCCCcceEEEEEecCCcEEEEecCccceeecceecC-----c--eEEEcceeeeEEEEEE-----Cc Confidence 11 110 001111111110 11111112221111111111 1 1111111122222110 11 Q ss_pred ceEEEeecccce Q lcl|Aclame:pro 281 GQRVRVGESVKE 292 (309) Q Consensus 281 ~~~v~v~~~~~~ 292 (309) .-+++..-++-. T Consensus 303 ~ai~~~dGI~~~ 314 (314) T protein:vir:10 303 LTMAVIKGITFA 314 (314) T ss_pred ceeEeeeeeecC Confidence 112222111111 No 58 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=94.51 E-value=0.0023 Score=35.03 Aligned_cols=280 Identities=10% Similarity=0.000 Sum_probs=122.1 Q ss_pred CC---CCCCCcchhhHHHHHhhcchhhhhhhh-CCccccccccceeEEechhHhhhchhHhhcccccccccccCcCccce Q lcl|Aclame:pro 1 MS---NAPFPIDPELTAIAIAYRNGRMISDEV-LPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDETG 76 (309) Q Consensus 1 m~---~~~f~~dp~LT~~a~~y~n~~~ig~~l-fP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~~ 76 (309) ++ ....+++...+.|-.--++..-| ..+ +-.+|+.....+||++...... .-++-++.......++...++ T Consensus 135 ~~~~~gg~lvP~~~~~~ii~~l~~~~~i-~~~~~~~v~~~~~~~~~p~~~~~~~a----~~v~E~~~~~~~~~~f~~i~~ 209 (435) T protein:vir:80 135 LSPGAGGVLVPENLSSEVIELLRPKSVV-RKLGARTLPLSNGNITIPRLKGGAIV----GYIGADTDIPTTQQQFDDLKL 209 (435) T ss_pred cCCCCCccccchhHHHHHHHHHhhhchh-hhccceeeecCCCceEEEEEeCCcce----eeeccCccccccccceeeEEE Confidence 11 11234444444443222222222 233 2235666666788888543211 123334444445556666666 Q ss_pred eeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcc-cccCcc--cceeccccccc-CCCC-CC Q lcl|Aclame:pro 77 STEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSP-NSYAAG--NKTTLSGADQW-SDPT-SN 151 (309) Q Consensus 77 ~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~-~~y~~~--~~~~lsgt~~W-sd~~-sd 151 (309) ..+..+-..+++.+-..++...++.++.-.+.+.+.+....|..+-.. ++ ++-+.+ |....+....= +..+ .+ T Consensus 210 ~~~k~~~~~~is~ell~ds~~~~~l~~~i~~~l~~a~~~~~d~a~l~G--~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~ 287 (435) T protein:vir:80 210 TAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRD--DGTANTPKGLRFWALPGNVITASDGSTLQK 287 (435) T ss_pred eeEEEEEeehhhHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHhhcc--CCCCCcccceeecccccceeecccccchhh Confidence 666666566677766554432334455556667777766666544321 00 010111 00000000000 0111 23 Q ss_pred hHHHHHHHHHHh-----CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeecccc Q lcl|Aclame:pro 152 PLPVITDALDSV-----ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARP 226 (309) Q Consensus 152 Pi~di~~~~~~~-----g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~~~~ 226 (309) +..|+..+...+ ...+...+|++.+|.+|.. ++..++.. +.....=..++|+| |++.+..-..... T Consensus 288 ~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~-------lkd~~G~~-l~~~~~~~~l~G~p-v~~~~~~p~~~~~ 358 (435) T protein:vir:80 288 IETDLGKAILALENADANLTQPGWIMAPRTFRFLEG-------LRDGNGNK-VYPELANGMLKGYP-VGKTTQVPINLGE 358 (435) T ss_pred HHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHh-------hhccCCce-eccCCCCCeEeeee-eEEeccccccccC Confidence 445666665443 4567789999999988744 22222222 11100012467877 4333322110000 Q ss_pred CCCcccceecCCcE-EEEecCCCCCCcCcceeccccccccc-----ccCCccccccccCCceEEEeecccceeeecchhh Q lcl|Aclame:pro 227 GQNPNLIRAWGPHA-SFIYRDRLADTRNGTTFGLTAQWGDR-----VSGSIADPNIGLRGGQRVRVGESVKELVTAPDLG 300 (309) Q Consensus 227 g~~~~~~~v~~~~~-~L~~~~~~~~~~~~~t~G~T~~~~~~-----~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G 300 (309) ..+ ...-++++-. +++. . .. |++.+.... ..+... .+ -..+...+|+.+..+-.+.-+++- T Consensus 359 ~~~-~~~i~~gd~s~~~i~-~-----~~----~~~i~~~~~~~~~~~~~~~~-~~-f~~n~~~~r~~~r~d~~~~~~~a~ 425 (435) T protein:vir:80 359 AGK-ESEIYFTDFGDVFIG-E-----EE----TLEIDYSKEATYKDADGHMV-SA-FQRDQTLIRVIAKNDFGPRHVESI 425 (435) T ss_pred CCC-cceEEEEEcccEEEE-e-----ec----ceEEEEeccccccccccchh-hh-hhcCcceeeeeeeeCcEeecccce Confidence 000 0111222211 1110 0 01 111111000 001110 11 123455789999999899888888 Q ss_pred hhhhccccC Q lcl|Aclame:pro 301 FFFENAVAA 309 (309) Q Consensus 301 ~l~~~~va~ 309 (309) ..++++-=| T Consensus 426 ~~l~~~~~~ 434 (435) T protein:vir:80 426 AVLSGVAWG 434 (435) T ss_pred EEEeccCCC Confidence 888877666 No 59 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=94.36 E-value=0.0046 Score=33.38 Aligned_cols=269 Identities=13% Similarity=0.070 Sum_probs=121.3 Q ss_pred CC---CCCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhccccccccc-ccCcCccce Q lcl|Aclame:pro 1 MS---NAPFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEV-EFSATDETG 76 (309) Q Consensus 1 m~---~~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~v-e~~~~~~~~ 76 (309) .. ....++..+...|-..-++..-|. .++..++|....+++++...... ....-++.++..... ...+...++ T Consensus 124 ~~~~~gg~~iP~~~~~~ii~~~~~~~~l~-~~~~~~~~~~~~~~~~~~~~~~~--~~~~~v~E~~~~~~~~~~~~~~v~~ 200 (415) T protein:vir:81 124 LKTDSGFVVIPEEIVTDILKLKEVEFNLD-KYVTVKRVTNGSGKYPVVRQSEV--AALEKVEELEENPELAVKPFFQLAY 200 (415) T ss_pred ccccccccccchHHHHHHHHHHHhhhhhh-hheeeeeccCCceeEEEEeecCC--ccceeeccccccCcccccceeeEEe Confidence 00 112333333333322212211122 22445566666666655422111 111123344444332 234555566 Q ss_pred eeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCChHHHH Q lcl|Aclame:pro 77 STEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPLPVI 156 (309) Q Consensus 77 ~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sdPi~di 156 (309) ..+..+-..+++++-..+ +.++......+.+.+.+....|..+....-.+..-.. -......+...+..+...+.+| T Consensus 201 ~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~-~~~~~~~~~~~~~~~~~~~~~i 277 (415) T protein:vir:81 201 DINTHRGYFRISREAIED--AKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGST-SSGFEKEGKKLEVKKAKSLDDI 277 (415) T ss_pred eeeeeEeeehhhHHHHhh--chHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccc-cccccccccccccccccchhHH Confidence 666666556677665443 3455566666677777766666554443322211110 0001111223344556667777 Q ss_pred HHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHH----HHHHHhCCCeEEeecceeeccccCCC Q lcl|Aclame:pro 157 TDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMA----FLQELLELDAIYIGEARLNIARPGQN 229 (309) Q Consensus 157 ~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~----~l~~l~gl~~I~v~~a~~~~~~~g~~ 229 (309) .+.+..+ .+.++..+|+++.|.+|+. + +..++++ +..+. .-..++|.| |++.+... .+.. T Consensus 278 ~~~~~~~~~~~~~~~~~v~n~~~~~~l~~---l----kd~~G~~-l~~~~~~~~~~~~l~G~p-V~~~~~~~----~~~~ 344 (415) T protein:vir:81 278 KDAINLNVKPNYEHNVAIVSQTMFAKLDK---M----KDKLGNY-LIQPDVKEKTQQRLLGAK-IEILPDEV----LGQK 344 (415) T ss_pred HHHHHhhhhhccCCCEEEEcHHHHHHHHH---h----hccCCce-eeccCcCCCCCceeccee-eEEecccc----cCCC Confidence 7776554 6778999999999998764 2 3223222 11111 112456766 44333211 1111 Q ss_pred cccceecCCc--EEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhccc Q lcl|Aclame:pro 230 PNLIRAWGPH--ASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAV 307 (309) Q Consensus 230 ~~~~~v~~~~--~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~v 307 (309) ++..-++++- .++++.. =|.+.++... ......+|+...++-.+.-+++-.+++-.- T Consensus 345 ~~~~~~~Gd~~~~~~~~~~----------~~~~v~~~~~-----------~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~ 403 (415) T protein:vir:81 345 GNNTLIIGNLKDAIVLFDR----------SQYQASWTDY-----------MHFGECLMIAVRQDCRILDYKSAIVIEYDD 403 (415) T ss_pred CccEEEEEehhccEEEEee----------cceEEEEecc-----------ccCceEEEEEEEeccEEeccccEEEEEEec Confidence 1111222221 1111110 0122222110 111234566677777777788877776655 Q ss_pred cC Q lcl|Aclame:pro 308 AA 309 (309) Q Consensus 308 a~ 309 (309) ++ T Consensus 404 ~~ 405 (415) T protein:vir:81 404 SE 405 (415) T ss_pred cC Confidence 55 No 60 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=94.36 E-value=0.0046 Score=33.38 Aligned_cols=269 Identities=13% Similarity=0.070 Sum_probs=121.3 Q ss_pred CC---CCCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhccccccccc-ccCcCccce Q lcl|Aclame:pro 1 MS---NAPFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEV-EFSATDETG 76 (309) Q Consensus 1 m~---~~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~v-e~~~~~~~~ 76 (309) .. ....++..+...|-..-++..-|. .++..++|....+++++...... ....-++.++..... ...+...++ T Consensus 124 ~~~~~gg~~iP~~~~~~ii~~~~~~~~l~-~~~~~~~~~~~~~~~~~~~~~~~--~~~~~v~E~~~~~~~~~~~~~~v~~ 200 (415) T protein:vir:79 124 LKTDSGFVVIPEEIVTDILKLKEVEFNLD-KYVTVKRVTNGSGKYPVVRQSEV--AALEKVEELEENPELAVKPFFQLAY 200 (415) T ss_pred ccccccccccchHHHHHHHHHHHhhhhhh-hheeeeeccCCceeEEEEeecCC--ccceeeccccccCcccccceeeEEe Confidence 00 112333333333322212211122 22445566666666655422111 111123344444332 234555566 Q ss_pred eeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCChHHHH Q lcl|Aclame:pro 77 STEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPLPVI 156 (309) Q Consensus 77 ~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sdPi~di 156 (309) ..+..+-..+++++-..+ +.++......+.+.+.+....|..+....-.+..-.. -......+...+..+...+.+| T Consensus 201 ~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~-~~~~~~~~~~~~~~~~~~~~~i 277 (415) T protein:vir:79 201 DINTHRGYFRISREAIED--AKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGST-SSGFEKEGKKLEVKKAKSLDDI 277 (415) T ss_pred eeeeeEeeehhhHHHHhh--chHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccc-cccccccccccccccccchhHH Confidence 666666556677665443 3455566666677777766666554443322211110 0001111223344556667777 Q ss_pred HHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHH----HHHHHhCCCeEEeecceeeccccCCC Q lcl|Aclame:pro 157 TDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMA----FLQELLELDAIYIGEARLNIARPGQN 229 (309) Q Consensus 157 ~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~----~l~~l~gl~~I~v~~a~~~~~~~g~~ 229 (309) .+.+..+ .+.++..+|+++.|.+|+. + +..++++ +..+. .-..++|.| |++.+... .+.. T Consensus 278 ~~~~~~~~~~~~~~~~~v~n~~~~~~l~~---l----kd~~G~~-l~~~~~~~~~~~~l~G~p-V~~~~~~~----~~~~ 344 (415) T protein:vir:79 278 KDAINLNVKPNYEHNVAIVSQTMFAKLDK---M----KDKLGNY-LIQPDVKEKTQQRLLGAK-IEILPDEV----LGQK 344 (415) T ss_pred HHHHHhhhhhccCCCEEEEcHHHHHHHHH---h----hccCCce-eeccCcCCCCCceeccee-eEEecccc----cCCC Confidence 7776554 6778999999999998764 2 3223222 11111 112456766 44333211 1111 Q ss_pred cccceecCCc--EEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhccc Q lcl|Aclame:pro 230 PNLIRAWGPH--ASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAV 307 (309) Q Consensus 230 ~~~~~v~~~~--~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~v 307 (309) ++..-++++- .++++.. =|.+.++... ......+|+...++-.+.-+++-.+++-.- T Consensus 345 ~~~~~~~Gd~~~~~~~~~~----------~~~~v~~~~~-----------~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~ 403 (415) T protein:vir:79 345 GNNTLIIGNLKDAIVLFDR----------SQYQASWTDY-----------MHFGECLMIAVRQDCRILDYKSAIVIEYDD 403 (415) T ss_pred CccEEEEEehhccEEEEee----------cceEEEEecc-----------ccCceEEEEEEEeccEEeccccEEEEEEec Confidence 1111222221 1111110 0122222110 111234566677777777788877776655 Q ss_pred cC Q lcl|Aclame:pro 308 AA 309 (309) Q Consensus 308 a~ 309 (309) ++ T Consensus 404 ~~ 405 (415) T protein:vir:79 404 SE 405 (415) T ss_pred cC Confidence 55 No 61 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=94.36 E-value=0.0046 Score=33.38 Aligned_cols=269 Identities=13% Similarity=0.070 Sum_probs=121.3 Q ss_pred CC---CCCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhccccccccc-ccCcCccce Q lcl|Aclame:pro 1 MS---NAPFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEV-EFSATDETG 76 (309) Q Consensus 1 m~---~~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~v-e~~~~~~~~ 76 (309) .. ....++..+...|-..-++..-|. .++..++|....+++++...... ....-++.++..... ...+...++ T Consensus 124 ~~~~~gg~~iP~~~~~~ii~~~~~~~~l~-~~~~~~~~~~~~~~~~~~~~~~~--~~~~~v~E~~~~~~~~~~~~~~v~~ 200 (415) T protein:vir:98 124 LKTDSGFVVIPEEIVTDILKLKEVEFNLD-KYVTVKRVTNGSGKYPVVRQSEV--AALEKVEELEENPELAVKPFFQLAY 200 (415) T ss_pred ccccccccccchHHHHHHHHHHHhhhhhh-hheeeeeccCCceeEEEEeecCC--ccceeeccccccCcccccceeeEEe Confidence 00 112333333333322212211122 22445566666666655422111 111123344444332 234555566 Q ss_pred eeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCChHHHH Q lcl|Aclame:pro 77 STEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPLPVI 156 (309) Q Consensus 77 ~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sdPi~di 156 (309) ..+..+-..+++++-..+ +.++......+.+.+.+....|..+....-.+..-.. -......+...+..+...+.+| T Consensus 201 ~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~-~~~~~~~~~~~~~~~~~~~~~i 277 (415) T protein:vir:98 201 DINTHRGYFRISREAIED--AKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGST-SSGFEKEGKKLEVKKAKSLDDI 277 (415) T ss_pred eeeeeEeeehhhHHHHhh--chHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccc-cccccccccccccccccchhHH Confidence 666666556677665443 3455566666677777766666554443322211110 0001111223344556667777 Q ss_pred HHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHH----HHHHHhCCCeEEeecceeeccccCCC Q lcl|Aclame:pro 157 TDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMA----FLQELLELDAIYIGEARLNIARPGQN 229 (309) Q Consensus 157 ~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~----~l~~l~gl~~I~v~~a~~~~~~~g~~ 229 (309) .+.+..+ .+.++..+|+++.|.+|+. + +..++++ +..+. .-..++|.| |++.+... .+.. T Consensus 278 ~~~~~~~~~~~~~~~~~v~n~~~~~~l~~---l----kd~~G~~-l~~~~~~~~~~~~l~G~p-V~~~~~~~----~~~~ 344 (415) T protein:vir:98 278 KDAINLNVKPNYEHNVAIVSQTMFAKLDK---M----KDKLGNY-LIQPDVKEKTQQRLLGAK-IEILPDEV----LGQK 344 (415) T ss_pred HHHHHhhhhhccCCCEEEEcHHHHHHHHH---h----hccCCce-eeccCcCCCCCceeccee-eEEecccc----cCCC Confidence 7776554 6778999999999998764 2 3223222 11111 112456766 44333211 1111 Q ss_pred cccceecCCc--EEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhccc Q lcl|Aclame:pro 230 PNLIRAWGPH--ASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAV 307 (309) Q Consensus 230 ~~~~~v~~~~--~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~v 307 (309) ++..-++++- .++++.. =|.+.++... ......+|+...++-.+.-+++-.+++-.- T Consensus 345 ~~~~~~~Gd~~~~~~~~~~----------~~~~v~~~~~-----------~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~ 403 (415) T protein:vir:98 345 GNNTLIIGNLKDAIVLFDR----------SQYQASWTDY-----------MHFGECLMIAVRQDCRILDYKSAIVIEYDD 403 (415) T ss_pred CccEEEEEehhccEEEEee----------cceEEEEecc-----------ccCceEEEEEEEeccEEeccccEEEEEEec Confidence 1111222221 1111110 0122222110 111234566677777777788877776655 Q ss_pred cC Q lcl|Aclame:pro 308 AA 309 (309) Q Consensus 308 a~ 309 (309) ++ T Consensus 404 ~~ 405 (415) T protein:vir:98 404 SE 405 (415) T ss_pred cC Confidence 55 No 62 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=94.29 E-value=0.0048 Score=33.28 Aligned_cols=279 Identities=10% Similarity=-0.008 Sum_probs=121.6 Q ss_pred CCC------CCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCcc Q lcl|Aclame:pro 1 MSN------APFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDE 74 (309) Q Consensus 1 m~~------~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~ 74 (309) |+. ...+++.+.+.|-..-++..-|.......+|+.....+||++.....+ .-++-++........+... T Consensus 132 ~~~~t~~~gg~~vP~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~p~~~~~~~a----~~v~E~~~~~~~~~~f~~i 207 (435) T protein:vir:14 132 LNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGNITIPRLKGGAIV----GYIGADTDIPTTQQQFDDL 207 (435) T ss_pred cccCCcCCCccccchhHHHHHHHHHhhhchhhhhcceeeecCCCceEEEEEeCCcce----eeeccCccccccccceeEE Confidence 211 124455444444432233233333223345666656788887542211 1234444444555556666 Q ss_pred ceeeeccchhhcCCHHHHHHHhhcCC--HHHHHHHHHHHHHHHHHHHHHHHHhhccc-c--cCcc--cceecc---cccc Q lcl|Aclame:pro 75 TGSTEDHGLDAPVPQADIDNAPTNYN--PLGHATEQTTNLILLDREARTSKLVFSPN-S--YAAG--NKTTLS---GADQ 144 (309) Q Consensus 75 ~~~~~e~~L~~~v~~~~~~~a~~~~d--~~~~av~~l~~~i~~~~E~~~a~~~~~~~-~--y~~~--~~~~ls---gt~~ 144 (309) ++.....+-..+++++-..++ .++ .+..-.+.+.+.|....|..+- ++. + -+.+ |....+ .... T Consensus 208 ~~~~~k~~~~~~iS~ell~ds--~~~~~l~~~i~~~l~~ai~~~~d~a~l----~G~G~~~~p~Gi~~~~~~~~~~~~~~ 281 (435) T protein:vir:14 208 KLTAKKMAALVPIANDLIKYA--GVNPNVDQIVVGDLTAAIGAREDKAFI----RDDGTANTPKGLRFWALPSNVITASD 281 (435) T ss_pred EeeeEEEEEeehhhHHHHHhh--ccCHHHHHHHHHHHHHHHHHHHHHHhh----ccCCCCccccceeecccccceecccc Confidence 666666665566776655543 344 3344455666666665554432 221 1 1111 100000 0001 Q ss_pred cCCCCCChHHHHHHHHHHh-----CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecc Q lcl|Aclame:pro 145 WSDPTSNPLPVITDALDSV-----ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEA 219 (309) Q Consensus 145 Wsd~~sdPi~di~~~~~~~-----g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a 219 (309) + ....+...+|......+ ++.+...+|++..|.+|+. ++ ..++.. +.....=..++|+| |++.+. T Consensus 282 ~-~~~~~~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~~---lk----d~~G~~-l~~~~~~g~l~G~P-v~~~~~ 351 (435) T protein:vir:14 282 A-STLQKIETDLGKVILALENADANLTQPGWIMAPRTFRFLEG---LR----DGNGNK-VYPELANGMLKGYP-VGKTTQ 351 (435) T ss_pred c-cchhhHHHHHHHHHHHhhhccccccCCEEEEcHHHHHHHHH---hh----ccCCce-eccCCCCCeeecce-eEeecc Confidence 1 11223344555554433 4567789999999988754 22 222221 11100011367877 433222 Q ss_pred eeeccccCCCccc-ceecCCcE-EEEecCCCCCCcCcceeccccccc-ccccCCccccccccCCceEEEeecccceeeec Q lcl|Aclame:pro 220 RLNIARPGQNPNL-IRAWGPHA-SFIYRDRLADTRNGTTFGLTAQWG-DRVSGSIADPNIGLRGGQRVRVGESVKELVTA 296 (309) Q Consensus 220 ~~~~~~~g~~~~~-~~v~~~~~-~L~~~~~~~~~~~~~t~G~T~~~~-~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~ 296 (309) .-. ..+...+. .-++++-. +++.. ..+.++-.+-+-. ....+... ..-..+...+|+.+..+-.+.- T Consensus 352 ~p~--~~~~~~~~~~i~~gd~s~~~i~~------~~~~~~~~~~~~~~~~~~~~~~--~~f~~~~~~~r~~~r~d~~~~~ 421 (435) T protein:vir:14 352 VPI--NLGETGKESEIYFTDFGDVFIGE------EETLEIDYSKEATYKDADGHMV--SAFQRDQTLIRVIAKNDFGPRH 421 (435) T ss_pred ccc--cccCCCccceEEEeecccEEEEE------ecccEEEEeccccccccccchh--hhhhcChhheeeeeeeCceeec Confidence 100 00011110 11222211 11110 0011111100000 00011111 1112344568899999989999 Q ss_pred chhhhhhhccccC Q lcl|Aclame:pro 297 PDLGFFFENAVAA 309 (309) Q Consensus 297 ~~~G~l~~~~va~ 309 (309) +.+-..++++-.| T Consensus 422 ~~a~~~l~~~~~~ 434 (435) T protein:vir:14 422 VESIAVLAGVAWG 434 (435) T ss_pred ccceEEEecCCCC Confidence 9988888888777 No 63 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=94.19 E-value=0.0051 Score=33.14 Aligned_cols=255 Identities=11% Similarity=0.014 Sum_probs=113.3 Q ss_pred CCCCCCCcc---h-hhHHHHHhhcchhhhhhhhCCc-ccc----ccccc---eeEEechhHhhhchhHhhcccccccccc Q lcl|Aclame:pro 1 MSNAPFPID---P-ELTAIAIAYRNGRMISDEVLPR-VPV----GKQEF---KFWKYDLAQGFTVPETLVGRKSKPNEVE 68 (309) Q Consensus 1 m~~~~f~~d---p-~LT~~a~~y~n~~~ig~~lfP~-v~v----~~~~~---k~~~~~~~~~f~~~~t~~~~~~~~~~ve 68 (309) |++.+...| | +++++. ..++..-.+|.. +.+ ..+.+ ++|+|.. . -..+....+.....-+ T Consensus 3 ~~~~T~l~d~i~PEv~~~~v----~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~---i-g~a~~~~~g~~i~~~~ 74 (275) T protein:vir:96 3 LENMTKLANMVNPEVLAPMM----QAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVY---S-GDAKVVPEGEEIPIDL 74 (275) T ss_pred CcccchhhhhhchHHHHHHH----HHHHHHhhhhcccceecccccCCCCCEEEeeeecc---C-CccccccCCCCcchhh Confidence 545443333 3 333433 333333333322 111 12223 3455432 1 1222344444444445 Q ss_pred cCcCccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCC Q lcl|Aclame:pro 69 FSATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDP 148 (309) Q Consensus 69 ~~~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~ 148 (309) ++....+..++.++-...+++++ ..+...||...+.+.+...+....+..+...+.... ++ .+.+ T Consensus 75 lt~~~~~~~i~~~~~~~~i~D~~--~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~---------~~----~~~~ 139 (275) T protein:vir:96 75 IETKKRQATIRKIGKGTVLTDEA--LLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGAT---------LK----VEAD 139 (275) T ss_pred cccceeeEEeehhcccccccHHH--HHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccc---------cc----cccc Confidence 66666677776666665666654 344566888888888777776666655544332211 11 1111 Q ss_pred CCChHHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeeccc Q lcl|Aclame:pro 149 TSNPLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIAR 225 (309) Q Consensus 149 ~sdPi~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~~~ 225 (309) . --...|.++...+ ...++.++++++++..|++.+.+. .+.......+.+..-++..++|++ |++-+.. T Consensus 140 ~-~~~d~i~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~-f~~~~~~g~~~~~~G~ig~~~G~~-Vi~s~~~----- 211 (275) T protein:vir:96 140 I-TKLAGLQTAIDKFNDEDLEPMVLFVNPLDAGKLRASATDN-FTRATLLGDNVIVKGAFGEALGAI-IVRSNKI----- 211 (275) T ss_pred c-cCHHHHHHHHHHhccccCCccEEEeCHHHHHHHHhccccc-ccccccccccceeccccceecCee-EEEeCCC----- Confidence 1 1144566666666 357899999999999998875322 222222222345555677788876 4443211 Q ss_pred cCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhc Q lcl|Aclame:pro 226 PGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFEN 305 (309) Q Consensus 226 ~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~ 305 (309) + ....++++.+++-++.. . +.+.| ..|- ...+...+++...+--+++-++.-.-++. T Consensus 212 p---~~t~~i~~~gA~~~~~~-~---------~~~vE-~~Rd---------~~~~~d~i~~~~~y~~~~~~~~~vv~~t~ 268 (275) T protein:vir:96 212 K---EGEAILAKRGAVKLITK-R---------DFFLE-TERH---------ASHKSTALFSDKHYVAYLYDESKVVKITK 268 (275) T ss_pred C---cceEEEEeccceeeeec-C---------Ccccc-cccc---------hhhcCcEEEEeEEEEEEEEcCccEEEEEe Confidence 0 01123333332221110 0 01111 1111 11222333333333222222222222211 Q ss_pred cccC Q lcl|Aclame:pro 306 AVAA 309 (309) Q Consensus 306 ~va~ 309 (309) .=|+ T Consensus 269 ~~~~ 272 (275) T protein:vir:96 269 SASG 272 (275) T ss_pred cccc Confidence 1111 No 64 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=93.84 E-value=0.0062 Score=32.68 Aligned_cols=267 Identities=10% Similarity=0.061 Sum_probs=111.8 Q ss_pred CCCCCCCcchhhHHHHH-----------hhc-chh----hhhh-----hh---CCccccccccceeEEechhH-hhhchh Q lcl|Aclame:pro 1 MSNAPFPIDPELTAIAI-----------AYR-NGR----MISD-----EV---LPRVPVGKQEFKFWKYDLAQ-GFTVPE 55 (309) Q Consensus 1 m~~~~f~~dp~LT~~a~-----------~y~-n~~----~ig~-----~l---fP~v~v~~~~~k~~~~~~~~-~f~~~~ 55 (309) |+...| |..|..++. ||. .++ +|-. .+ ...++|....++++..+-.. ....-. T Consensus 1 ~~~k~~--~~~l~~~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~~~~~~~~~~~~~ 78 (321) T protein:vir:31 1 MASRTI--NNDLSRITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKTRIPTLNIGERHRRPQD 78 (321) T ss_pred CchHHH--HHHHHHHHHhccccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcceeeeeeccCCccccccc Confidence 655555 223333332 110 111 1110 11 22356666666666654211 111100 Q ss_pred HhhcccccccccccCcCccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccC--- Q lcl|Aclame:pro 56 TLVGRKSKPNEVEFSATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYA--- 132 (309) Q Consensus 56 t~~~~~~~~~~ve~~~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~--- 132 (309) + -.+..++.++++.+.+|.|++......|+++..++.....|-++...+.+.+++.+..+... +++..-. T Consensus 79 e---~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~----~nGd~~~~~~ 151 (321) T protein:vir:31 79 E---GEWNENESDVSTGTIDISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLA----ANGDEDAEDS 151 (321) T ss_pred c---cccccccccceeeeeeeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhhe----eeccccCCCc Confidence 0 11223344566777899999999999999998776444556677777777777777666443 2221110 Q ss_pred --cccceecc------cccccCCCCCChHHHHHHHHHHh----CCCCc-EEEeCHHHHHHHhcCHHHHHHhccCCCccc- Q lcl|Aclame:pro 133 --AGNKTTLS------GADQWSDPTSNPLPVITDALDSV----ILRPN-IGVLGRRTATILRRHPKIVKAYNGSLGDEG- 198 (309) Q Consensus 133 --~~~~~~ls------gt~~Wsd~~sdPi~di~~~~~~~----g~~Pn-~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~- 198 (309) .-|+.-|. .+..+..+ .-....+.+....+ .-.|+ +.+|+++++.+++ +.++..++..+ T Consensus 152 ~~~~n~G~l~~a~~~~~~~~~~~~-~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~------~~l~~~~~~~~~ 224 (321) T protein:vir:31 152 FENQNDGFITVAEGDVETIDAADD-ILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYH------YTLTDRDTPLGD 224 (321) T ss_pred ccccchhhhhhhcccccccccccc-ccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHH------HHHhcCCCcccc Confidence 00111110 00111111 12233444444443 22466 5689999876533 33333333221 Q ss_pred -ccCHHHHHHHhCCCeEEeecceeeccccCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccc Q lcl|Aclame:pro 199 -MVPMAFLQELLELDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIG 277 (309) Q Consensus 199 -~vt~~~l~~l~gl~~I~v~~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g 277 (309) .++-..-..++|+|-+.+ ..+|++.+++- ....-+||+...- +........... T Consensus 225 ~~l~~~~~~tl~G~pvv~~-----------------~~mP~~~il~t------~~~nl~~~~~~~~--~~~~~~~~~~~~ 279 (321) T protein:vir:31 225 NVIMGEADVNPFSFPIIGS-----------------GLWPDDKAMFT------DPQNLIYALYRDL--EIDVLTESDKVS 279 (321) T ss_pred chhhccccccccceeEEEc-----------------CCCCCCcEEEe------ccccEEEEEeecc--EEEEeecCcccc Confidence 122222234667773331 23566655551 2222345543321 111000000000 Q ss_pred cCCceEEEee--cccceeeecchhhhhhhccccC Q lcl|Aclame:pro 278 LRGGQRVRVG--ESVKELVTAPDLGFFFENAVAA 309 (309) Q Consensus 278 ~~g~~~v~v~--~~~~~~v~~~~~G~l~~~~va~ 309 (309) ....+++.. ...+-+|--.++..+++|.-=. T Consensus 280 -~~~~~~~~~~~~~~~~~ve~~~a~a~~~~i~~~ 312 (321) T protein:vir:31 280 -ERDLHARYFMRGDDDFAIENTEAVVLAEGLGDP 312 (321) T ss_pred -ccceeeEeeeeeecceeEeccccEEEEecCCcc Confidence 011112211 1122233333333333332211 No 65 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=93.56 E-value=0.0071 Score=32.35 Aligned_cols=278 Identities=12% Similarity=0.152 Sum_probs=134.5 Q ss_pred CCCC--CCCcchhhHHHHHhhcchhhhhhhhCCcccc-ccccceeEEechhHhhhchhHhhcccccccccccC-cC--cc Q lcl|Aclame:pro 1 MSNA--PFPIDPELTAIAIAYRNGRMISDEVLPRVPV-GKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFS-AT--DE 74 (309) Q Consensus 1 m~~~--~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v-~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~-~~--~~ 74 (309) |++. .-.+-.+++++.+---.|=+|+..||--+.. ..++-.++-|+-..+| .++-+++.....++ .+ +- T Consensus 74 mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~~Grsm~F~~~g~~Ra~-----~IgEGgE~~~~sld~~T~dsv 148 (393) T protein:vir:79 74 MATPSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRLKSGQSMIFPSIGIMRAY-----DVAEGQEIPEDSIDWQTHESP 148 (393) T ss_pred hcCCCcceechhhhhhhhhhcccchhHHHHHHHHHhhhcCcceeccchheeeec-----cccccccccccchhhhcCCce Confidence 6643 2333445666665555666899999876644 3444445555533334 35667777666555 33 34 Q ss_pred ceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhccc-----ccCcccceecccccccCCCC Q lcl|Aclame:pro 75 TGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPN-----SYAAGNKTTLSGADQWSDPT 149 (309) Q Consensus 75 ~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~-----~y~~~~~~~lsgt~~Wsd~~ 149 (309) +.....+++.-.++++-+. .++.|....++....+.+.+.+|..|-+..-..+ .|.++-+...+|-++=..-+ T Consensus 149 ~~~~gK~G~~Ia~SqEmIs--DSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~ghtvfDa~st~t~ahptGr~~~~~qN 226 (393) T protein:vir:79 149 EIRVGKSGIRLRFTDEMIS--DSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSHGHTVFDNYSTNKLAHTTGLDKNGVQN 226 (393) T ss_pred eEEechhhhhhhhHHHHhh--cchHHHHHHHHHHHHHHHHhhhHHHHHhhhhcccceeeeccccCccceeecCCcccccc Confidence 5555556666555555433 3456777777777777777777776655432221 25566666666644333323 Q ss_pred CC-hHHHHHHHHHH---hCCCCcEEEeCHHHHHHHhcCHHHHHHhcc--CCCcc-c-----ccCHHHHHHH--hCCCeEE Q lcl|Aclame:pro 150 SN-PLPVITDALDS---VILRPNIGVLGRRTATILRRHPKIVKAYNG--SLGDE-G-----MVPMAFLQEL--LELDAIY 215 (309) Q Consensus 150 sd-Pi~di~~~~~~---~g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~--~~~~~-~-----~vt~~~l~~l--~gl~~I~ 215 (309) .. -++||.+..=+ -++.|++++|++-+|+.+-+|.++-...-. .|.++ + ..-|+-++.- |.++ |. T Consensus 227 GTlSleDllDm~~av~~~hyt~svi~MHPLAWnv~AKna~me~~~~na~gN~~~~~~~ts~algp~~i~~~~~~nln-v~ 305 (393) T protein:vir:79 227 DTFSAEDFLDLIIAVMANEYTPSDLMMHPLAWTVFAKNELMGSLQANPYGNYPAKGAPSSMALGPDSIQGRLPFNFN-VN 305 (393) T ss_pred ccccHHHHHHHHHHHhcccCCcceEEEcCchhhhhhhhhhhcceeeccccccCccccchhhhhchhhhcccccccee-EE Confidence 22 35666666433 499999999999999999988543221100 11111 1 1112222222 2233 22 Q ss_pred eecc-eeeccccCCCccccee--cCCcE-EEEecCCCCCCcCcceecccccccccccC---Ccccccccc---CCceE-- Q lcl|Aclame:pro 216 IGEA-RLNIARPGQNPNLIRA--WGPHA-SFIYRDRLADTRNGTTFGLTAQWGDRVSG---SIADPNIGL---RGGQR-- 283 (309) Q Consensus 216 v~~a-~~~~~~~g~~~~~~~v--~~~~~-~L~~~~~~~~~~~~~t~G~T~~~~~~~~~---~~~d~~~g~---~g~~~-- 283 (309) +.-= -|.+ +...++++ =-|++ +|+-.+.. + |-||.++.-+ ....+.||- .+++- T Consensus 306 ~sPfvp~d~----k~~rFd~~~Vd~NnvgvlLV~D~i-------~---tdq~ddk~rdiq~iKl~ERYG~gvLn~gkaia 371 (393) T protein:vir:79 306 LSPFIPLDK----KSRRFDVYAVDRNNVGVLLVRDDL-------K---TDQWDEKARGLQNIKMIERYGIGILNEGKAIA 371 (393) T ss_pred Eeccccccc----ccceeeEEEeecCCceEEEEecCc-------c---eeccccccccceeeeeeeeeceeeeeCCceEE Confidence 2110 0111 12223222 12333 33322211 1 2244443322 223333333 23322 Q ss_pred ----EEeeccc-ceeeecchhhhhhhcccc Q lcl|Aclame:pro 284 ----VRVGESV-KELVTAPDLGFFFENAVA 308 (309) Q Consensus 284 ----v~v~~~~-~~~v~~~~~G~l~~~~va 308 (309) |.+...| +|. ||.|+-- T Consensus 372 vakNI~~~k~y~~P~--------~~~~~~~ 393 (393) T protein:vir:79 372 VAKNISMDKSYAEPM--------LIKNVGN 393 (393) T ss_pred EEecceeecccccch--------hhhccCC Confidence 2333333 332 3333333 No 66 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=93.52 E-value=0.0073 Score=32.31 Aligned_cols=279 Identities=10% Similarity=0.056 Sum_probs=125.4 Q ss_pred CC-CCCC------------------CcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhccc Q lcl|Aclame:pro 1 MS-NAPF------------------PIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRK 61 (309) Q Consensus 1 m~-~~~f------------------~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~ 61 (309) |+ ...| ++......+-..-++..-| ..+++.+|+.....++|++...... .-++.+ T Consensus 1 ~~~~~~~~~~~~~~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l-~~~~~~~~~~~~~~~~p~~~~~~~a----~~v~E~ 75 (320) T protein:vir:10 1 MAAGTAFQVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIV-QQFAQKVPMGTTGQKIPHWIGDVSA----QWIGEG 75 (320) T ss_pred CCCCccCCHHHHHhhccccccccccccHHHHHHHHHHHHhccch-hhhcceeeccCCceEEEEEeCCcce----EEecCC Confidence 33 2334 2222222222222221112 3456788888777888887542211 234445 Q ss_pred ccccccccCcCccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccc--cCc-----c Q lcl|Aclame:pro 62 SKPNEVEFSATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNS--YAA-----G 134 (309) Q Consensus 62 ~~~~~ve~~~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~--y~~-----~ 134 (309) +.....+.++.+.++.++..+...+++++-.++ +.++......+.+.+.+....|..+ +++.. .+. . T Consensus 76 ~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~d--s~~~l~~~i~~~l~~a~a~~~d~a~----l~G~g~~~~~~~~~~~ 149 (320) T protein:vir:10 76 DMKPITKGNMTSQNIAPHKIATIFVASAETVRA--NPANYLGTMRTKVATAFAMAFDSAA----LNGTDSPFPTYLAQTT 149 (320) T ss_pred ccccccccceeEEEEeeEEEEEeehhhHHHHhc--ChHHHHHHHHHHHHHHHHHHHHHHh----hcccCCCCCccccccc Confidence 555555666666677776666666777765543 2345556656666666666555443 32211 110 0 Q ss_pred cceecccccccCC-CCCChHHHHHHHHHH---hCCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCccccc-------CHH Q lcl|Aclame:pro 135 NKTTLSGADQWSD-PTSNPLPVITDALDS---VILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMV-------PMA 203 (309) Q Consensus 135 ~~~~lsgt~~Wsd-~~sdPi~di~~~~~~---~g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~v-------t~~ 203 (309) +....+.+..... .......++.+.... ....+...+|+++.|.+|+. +++. ++.. +. .+. T Consensus 150 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~---lkd~----~G~~-l~~~~~~~~~~~ 221 (320) T protein:vir:10 150 KSVSLADPGGATASDLTAYDAVAVNGLSLLVNAKKKWTHTLLDDIVEPILNG---AKDK----NGRP-LFIESTYTDENS 221 (320) T ss_pred ccccceecccccccccccHHHHHHHHHhhhhcccCCCcEEEEcHHHHHHHHH---hhcc----CCce-eeccccccCccc Confidence 1111111111111 112222334333332 36678999999999999864 3332 1111 11 011 Q ss_pred HH--HHHhCCCeEEeecceeeccccCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCc Q lcl|Aclame:pro 204 FL--QELLELDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGG 281 (309) Q Consensus 204 ~l--~~l~gl~~I~v~~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~ 281 (309) .+ ..++|+|. ++.+. -+..+ ..-++++-.-+++....+-.+ ..+=..+.+++....+... +.-..+. T Consensus 222 ~~~~~~i~g~pv-~~~~~-----~~~~~--~~~~~gd~~~~~~~~~~~~~i-~~~~~~~~~~~~~~~~~~~--~~f~~~~ 290 (320) T protein:vir:10 222 PFRAGRIVSRPT-ILSDH-----VADGT--TVGYMGDFRNVIWGQVGGLSF-DVTDQATLNLGTPTEPNFV--SLWQHNL 290 (320) T ss_pred cccCceeeeeee-EecCC-----CCCCc--eEEEEeecceEEEEEecCeEE-EEeecceeeeccccccccc--hhhhcCc Confidence 11 13455552 22211 11111 101122211011110000000 0000111222211111110 1112355 Q ss_pred eEEEeecccceeeecchhhhhhhccccC Q lcl|Aclame:pro 282 QRVRVGESVKELVTAPDLGFFFENAVAA 309 (309) Q Consensus 282 ~~v~v~~~~~~~v~~~~~G~l~~~~va~ 309 (309) ..+|+.+.++-.+.-+++-..+++++|- T Consensus 291 ~~~r~~~~~d~~v~~~~a~~~l~~~~ap 318 (320) T protein:vir:10 291 VAVRVEAEYAFHNNDKDAFVKLTNVVTP 318 (320) T ss_pred EEEEEEEeeccEEecccceEEEEeccCC Confidence 6688999999999999998899888877 No 67 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=93.42 E-value=0.0076 Score=32.20 Aligned_cols=261 Identities=14% Similarity=0.137 Sum_probs=104.4 Q ss_pred CCCCCCCcch-----hh-----HHHHHh-h--cchhhhhhhhCCcc---ccccccceeEEechhHhhhchhHhhcccc-c Q lcl|Aclame:pro 1 MSNAPFPIDP-----EL-----TAIAIA-Y--RNGRMISDEVLPRV---PVGKQEFKFWKYDLAQGFTVPETLVGRKS-K 63 (309) Q Consensus 1 m~~~~f~~dp-----~L-----T~~a~~-y--~n~~~ig~~lfP~v---~v~~~~~k~~~~~~~~~f~~~~t~~~~~~-~ 63 (309) |.+..+..|. .+ +.|=-. | .-+++.+.+++|.. +-..+++.|.+++..-... ..+-++ . T Consensus 18 ~~~~~~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~----~~~d~~~d 93 (319) T protein:vir:10 18 LIQAGVKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTDKTFEYMTFDKVGTAQ----IIADYTDD 93 (319) T ss_pred HhhccchhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCCCceEEEEeeeecccccee----eecCcccc Confidence 1111111111 12 222211 1 22457888888854 3334445555554321111 111111 1 Q ss_pred ccccccCcCccceeeeccchhhcCCHHHHHHHh-hcCCHHHHHHHHHHHHHHHHHHHHHHHHhhccc----ccCc---cc Q lcl|Aclame:pro 64 PNEVEFSATDETGSTEDHGLDAPVPQADIDNAP-TNYNPLGHATEQTTNLILLDREARTSKLVFSPN----SYAA---GN 135 (309) Q Consensus 64 ~~~ve~~~~~~~~~~~e~~L~~~v~~~~~~~a~-~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~----~y~~---~~ 135 (309) ...++.....+...+...+....+..+|...++ .+++...+......+.+...+ -++++.+. .|+- .+ T Consensus 94 ip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~----n~i~f~G~~~~g~~GLlN~p~ 169 (319) T protein:vir:10 94 LPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLV----NRLVFKGSAPHKIVSVFNHPN 169 (319) T ss_pred ccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhh----ceEEEeecccccceeEEeCCC Confidence 234455555555556655555566667766654 456555544443443333222 22333321 1111 11 Q ss_pred ceecccccccCC-CCC---ChHHHHHHHHHHh-----C-CCCcEEEeCHHHHHHHhc-CHHHHHHhccCCCcccccCHHH Q lcl|Aclame:pro 136 KTTLSGADQWSD-PTS---NPLPVITDALDSV-----I-LRPNIGVLGRRTATILRR-HPKIVKAYNGSLGDEGMVPMAF 204 (309) Q Consensus 136 ~~~lsgt~~Wsd-~~s---dPi~di~~~~~~~-----g-~~Pn~~v~~~~~~~~l~~-~~~i~~~~~~~~~~~~~vt~~~ 204 (309) ....+ ...|++ ++. .++.||.....++ | ..|++++|+++.|..|.+ ++ + .+..-.+. T Consensus 170 ~~~~~-~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~---------~--~~~t~l~~ 237 (319) T protein:vir:10 170 ITKIT-SGKWIDVSTMKPETAEAELTQAIETIETITRGQHRATNILIPPSMRKVLAIRMP---------E--TTMSYLDY 237 (319) T ss_pred ceeee-cCCCCCccccCHHHHHHHHHHHHHHHHHhcCceeeceEEEecHHHHHhhhcccC---------C--CCeeHHHH Confidence 11122 233443 233 4678888887654 3 479999999999998843 21 1 23344567 Q ss_pred HHHHh-CCCeEEeecceeeccccCCCcccceecCCcEEEEecCCCCCCcCcceecc--cccc--cccccCCccccccccC Q lcl|Aclame:pro 205 LQELL-ELDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGL--TAQW--GDRVSGSIADPNIGLR 279 (309) Q Consensus 205 l~~l~-gl~~I~v~~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~--T~~~--~~~~~~~~~d~~~g~~ 279 (309) |++.+ +++ |+- -..+..+ ++. +.+.+++|...... ..+.. -++. -......+..++...- T Consensus 238 lk~~~~~l~-I~~-~pel~~a--g~~-------g~~~~v~y~~~~~~----~~~~v~~~~~~~~~e~~~l~~~~~~~~r~ 302 (319) T protein:vir:10 238 FKSQNSGIE-IDS-IAELEDI--DGA-------GTKGVLVYEKNPMN----MSIEIPEAFNMLPAQPKDLHFKVPCTSKC 302 (319) T ss_pred HHHhcCCce-EEE-eeeeccc--CCC-------cceEEEEEecCCce----EEEecCcceeeeeeeecCceEEEeeeeee Confidence 77654 222 221 1112111 111 12233344322110 01110 0000 0011112222222222 Q ss_pred CceEEEeecccceeeecchhhhhhhcc Q lcl|Aclame:pro 280 GGQRVRVGESVKELVTAPDLGFFFENA 306 (309) Q Consensus 280 g~~~v~v~~~~~~~v~~~~~G~l~~~~ 306 (309) +|..|+ -|.+-+.+.+. T Consensus 303 ~Gv~i~----------~P~ai~~~dGI 319 (319) T protein:vir:10 303 TGLTIY----------RPMTIVLITGV 319 (319) T ss_pred EEEEEE----------ccceeEeeecC Confidence 322221 22222222222 No 68 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=93.25 E-value=0.00077 Score=37.67 Aligned_cols=259 Identities=16% Similarity=0.083 Sum_probs=106.6 Q ss_pred CC-C-C----CCCcchhhHHHHHh-h--cchhhhhhhhCCccc---cccccceeEEechhHhhhchhHhhcccc-ccccc Q lcl|Aclame:pro 1 MS-N-A----PFPIDPELTAIAIA-Y--RNGRMISDEVLPRVP---VGKQEFKFWKYDLAQGFTVPETLVGRKS-KPNEV 67 (309) Q Consensus 1 m~-~-~----~f~~dp~LT~~a~~-y--~n~~~ig~~lfP~v~---v~~~~~k~~~~~~~~~f~~~~t~~~~~~-~~~~v 67 (309) |. + + +|.+. .|+.|=.- | .-+++.+.++||... -.-+++.|.+++..-... ..+-++ +...+ T Consensus 1 ~~~~~a~~~~~f~~~-ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~----~~~~~~~dip~v 75 (296) T protein:vir:10 1 MGVDKADAAGIWTVK-QLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQ----IVADYTDDLPLV 75 (296) T ss_pred CcccchhhhHHHHHH-HHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCcee----EeCCCcccccee Confidence 44 1 1 23332 23333222 1 234578888888443 223334444443211111 111111 12244 Q ss_pred ccCcCccceeeeccchhhcCCHHHHHHHh-hcCCHHHHHHHHHHHHHHHHHHHHHHHHhhccc----ccCc---ccceec Q lcl|Aclame:pro 68 EFSATDETGSTEDHGLDAPVPQADIDNAP-TNYNPLGHATEQTTNLILLDREARTSKLVFSPN----SYAA---GNKTTL 139 (309) Q Consensus 68 e~~~~~~~~~~~e~~L~~~v~~~~~~~a~-~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~----~y~~---~~~~~l 139 (309) +.....+...+...+....+..+|.+.++ .+.+...+......+.+...+ -++++.+. .|+- .+.... T Consensus 76 ~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~----n~~~f~G~~~~g~~GLlN~p~v~~~ 151 (296) T protein:vir:10 76 DALATERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLL----DKLVWSGSTAHGIPSVFDYPNINNV 151 (296) T ss_pred eccceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhh----ceEEEeecccccceeEeecCCCccc Confidence 55555566666666666666667776654 356655554444443333322 22333321 1111 111122 Q ss_pred ccccccCCCCCChHHHHHHHHHHh-----C-CCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCe Q lcl|Aclame:pro 140 SGADQWSDPTSNPLPVITDALDSV-----I-LRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDA 213 (309) Q Consensus 140 sgt~~Wsd~~sdPi~di~~~~~~~-----g-~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~ 213 (309) +.+..|++++ ..+.||..+...+ | ..|++++|+++.+..|.+- . +++ +..-.+.|++.+.--+ T Consensus 152 ~~~~~W~~~t-~i~~Di~~~~~~l~~~s~g~~~p~~l~L~p~~~~~L~~~------~--~~~--~~t~l~~ik~~~~~l~ 220 (296) T protein:vir:10 152 VSGGSWSQPT-TAVSDITSLLDIIETSTNGQHRATHLLLPTTARRIMQNL------V--PGT--SVSYGEFFRQNNSGVT 220 (296) T ss_pred cccCCccCHH-HHHHHHHHHHHHHHHhhCceecceeEEeCHHHHHHHhhc------c--CCC--CccHHHHHHHhcCCce Confidence 3344699887 8899999987654 3 4699999999999887531 1 122 3333566776542111 Q ss_pred EEeecceeeccccCCC----------cccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceE Q lcl|Aclame:pro 214 IYIGEARLNIARPGQN----------PNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQR 283 (309) Q Consensus 214 I~v~~a~~~~~~~g~~----------~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~ 283 (309) |+- -..+..+..++. ..+.-..|-....++..+.. ..|-. .+..+.+|...- ...-+ T Consensus 221 i~~-~~~l~~a~~~g~~~~v~~~~~~~~~~~~v~~~~~~~~~e~~~-----l~~~~--~~~~~~~Gv~i~-----~P~ai 287 (296) T protein:vir:10 221 VEF-VQYLNDYNGTGTSAAIAYEKDPNNMAIEIPEATNALPAQPKD-----LHFKI--PVTSKATGLIVY-----RPLTM 287 (296) T ss_pred EEE-eeeeccCCCCcceEEEEEEcCCceEEEEcCcceeeecccccC-----ceEEE--eeEeeEEEEEEE-----CCcee Confidence 221 111211111111 11111122222222222111 11111 111122222110 11111 Q ss_pred EEeecccce Q lcl|Aclame:pro 284 VRVGESVKE 292 (309) Q Consensus 284 v~v~~~~~~ 292 (309) ++..-++-. T Consensus 288 ~~~dGI~~~ 296 (296) T protein:vir:10 288 AVMKGITFA 296 (296) T ss_pred EEEeeeecC Confidence 111111111 No 69 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=93.09 E-value=0.0088 Score=31.85 Aligned_cols=269 Identities=10% Similarity=0.009 Sum_probs=98.3 Q ss_pred CC--CCCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhh--hchhHhhcccccccccccCcCccce Q lcl|Aclame:pro 1 MS--NAPFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGF--TVPETLVGRKSKPNEVEFSATDETG 76 (309) Q Consensus 1 m~--~~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f--~~~~t~~~~~~~~~~ve~~~~~~~~ 76 (309) ++ +....+-+.+.+--+---....+-..+++.+|+.....++++......+ ..+................+...++ T Consensus 165 ~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~v~e~~~~~~~~~~~~~~~~~~~i~~ 244 (458) T protein:vir:10 165 SSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSKILTMLVEPDAGKATWVAASTYGTDTTTGEEVKGALKEIHF 244 (458) T ss_pred ccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCcceEEEEecCCcceeecccccccccccccccccccceeeEe Confidence 11 1111111111111111111112223456677777666666654322111 1111111111111122333444455 Q ss_pred eeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceeccc----------ccccC Q lcl|Aclame:pro 77 STEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSG----------ADQWS 146 (309) Q Consensus 77 ~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsg----------t~~Ws 146 (309) .....+...+|+++-..++ .++....-...+.+.|....+.. ++++..- ......++. ..... T Consensus 245 ~~~k~~~~v~is~ell~ds--~~~~~~~i~~~l~~~i~~~~d~~----~l~G~G~-~~p~Gi~~~~~~~~~~~~~~~~~~ 317 (458) T protein:vir:10 245 STYKLAAKSFITDETEEDA--IFSLLPLLRKRLIEAHAVSIEEA----FMTGDGS-GKPKGLLTLASEDSAKVVTEAKAD 317 (458) T ss_pred eeeeEEeeehhhHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHH----hhcCCCC-Cccceeeecccccccceeeccccc Confidence 5555555556666654433 23334444445555555544433 2322110 001111110 11112 Q ss_pred CC---CCChHHHHHHHHHHhCCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCccc-------ccCHHHHHHHhCCCeEEe Q lcl|Aclame:pro 147 DP---TSNPLPVITDALDSVILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEG-------MVPMAFLQELLELDAIYI 216 (309) Q Consensus 147 d~---~sdPi~di~~~~~~~g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~-------~vt~~~l~~l~gl~~I~v 216 (309) .. +.|.|-++..-...-+..+...+|++..|.+|+. ++..++.+. ......-..++|+| |++ T Consensus 318 ~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~-------lkd~~G~~i~~~~~~~~~~~~~~~~l~G~p-v~~ 389 (458) T protein:vir:10 318 GSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDL-------LEDEEWQDVAQVGNDSVKLQGQVGRIYGLP-VVV 389 (458) T ss_pred ccccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHh-------hcccCCceeeccccccccccCcCceeccee-eEE Confidence 22 2334444443344446677889999999998764 221121110 00000112466766 333 Q ss_pred ecceeeccccCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeec Q lcl|Aclame:pro 217 GEARLNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTA 296 (309) Q Consensus 217 ~~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~ 296 (309) -+.. +........++++ |+--|.+.++.+..+..+.+...+...++....+.-.+.- T Consensus 390 ~~~~-----p~~~~~~~~~~~~------------------f~~~~~~~~~~~~~v~~d~~~~~~~~~~~~~~r~~~~v~~ 446 (458) T protein:vir:10 390 SEYF-----PAKANSAEFAVIV------------------YKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYF 446 (458) T ss_pred cccc-----ccccCCcceEEEE------------------ecccEEEEEeeceEEEeecccCCCceEEEEEEEecceEec Confidence 2221 1111111111111 0000111112222222233333444444444444444444 Q ss_pred chhhhhhhccccC Q lcl|Aclame:pro 297 PDLGFFFENAVAA 309 (309) Q Consensus 297 ~~~G~l~~~~va~ 309 (309) |+ || +...+|| T Consensus 447 ~~-a~-v~~~~aa 457 (458) T protein:vir:10 447 AN-GV-VSGTYAA 457 (458) T ss_pred cc-ce-EEEeecc Confidence 43 33 3455555 No 70 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=93.07 E-value=0.0089 Score=31.83 Aligned_cols=267 Identities=12% Similarity=0.033 Sum_probs=121.1 Q ss_pred CC-CCCCCcchhhHHHHHh-hcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhccccccccccc-CcCcccee Q lcl|Aclame:pro 1 MS-NAPFPIDPELTAIAIA-YRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEF-SATDETGS 77 (309) Q Consensus 1 m~-~~~f~~dp~LT~~a~~-y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~-~~~~~~~~ 77 (309) .+ +....+-+.+.+-.+. -+...-|. .+++.+|+...+.+|++......+.....-++.++.....+. .+...++. T Consensus 122 ~~~~~~~~vp~~~~~~ii~~~~~~~~l~-~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~ 200 (413) T protein:vir:81 122 LTDEFQGGYGTTWNRNIIYRRREKLVVA-DLMDNLTMTNTTIKYLMEKANRVVEGGFKTVAEGGKKPYMRFADFDIVTES 200 (413) T ss_pred cccccccccchhhHHHHHHHHhhhhhHH-hhcceeeccCCceeEEEeccccccccccceecCcccccccCcccceeeEee Confidence 00 1111121222211111 12222222 346778888888888877554333333334455555555554 35666666 Q ss_pred eeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccC---CCCCChHH Q lcl|Aclame:pro 78 TEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWS---DPTSNPLP 154 (309) Q Consensus 78 ~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Ws---d~~sdPi~ 154 (309) .+..+-..+|+++-.++.. + ....-.+.+.+.+....|. .++++..-....+..+..+.... ..+.+.+. T Consensus 201 ~~k~~~~~~iS~ell~ds~--~-l~~~i~~~la~~~~~~~d~----~~l~G~G~~~~~~Gi~~~~~~~~~~~~~~~~~~~ 273 (413) T protein:vir:81 201 LSKIAGLTKITDEMIEDYD--F-LVSYINARLLEELAIEEER----QLLLGDGTGNNLTGLLKRDGIQTLAVSNKDELAD 273 (413) T ss_pred eeeEEEeehhhHHHHHHHH--H-HHHHHHHHHHHHHHHHHHH----HHhccCCCCCcccccccccccccccccccchhHH Confidence 6666655678877655442 1 3343344455555554443 33333211111111221111111 12234555 Q ss_pred HHHHHHHH----hCCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCH-----------HHHHHHhCCCeEEeecc Q lcl|Aclame:pro 155 VITDALDS----VILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPM-----------AFLQELLELDAIYIGEA 219 (309) Q Consensus 155 di~~~~~~----~g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~-----------~~l~~l~gl~~I~v~~a 219 (309) +|...+.. .+.+|+.++|++..|.+|+. +++ .++.. +..+ .--..+||+| |++.+. T Consensus 274 ~i~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~---lkd----~~G~~-l~~~~~~~~~~~~~~~~~~~l~G~p-v~~s~~ 344 (413) T protein:vir:81 274 SIYKAMTNISLATPFQADALVINPLDYQELRL---AKD----ANGQY-YGGGVFQGQYGSGGIMLDPAPWGLR-TVQSQV 344 (413) T ss_pred HHHHHHHHhhhhccCCCcEEEEcHHHHHHHHH---hhc----cCCce-eccccccccccccccccCceeccee-eEEcCC Confidence 66555433 37889999999999998753 221 11111 0000 0012366776 443322 Q ss_pred eeeccccCCCcccceecCCc--EEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecc Q lcl|Aclame:pro 220 RLNIARPGQNPNLIRAWGPH--ASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAP 297 (309) Q Consensus 220 ~~~~~~~g~~~~~~~v~~~~--~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~ 297 (309) . +.. .-++++. .++++.. -|.+.++... ...+-..+...+|+...++-.+.-+ T Consensus 345 ~-----~~~----~~~~gd~~~~~~~~~~----------~~~~v~~~~~------~~~~~~~~~~~~r~~~r~d~~~~~~ 399 (413) T protein:vir:81 345 V-----PVG----KPVVGAFRSAASVLRK----------GGVRIDSTNT------NVDDFENNLITVRAEERVGLMVTFP 399 (413) T ss_pred C-----Ccc----cEEEEecccEEEEEEe----------cceEEEEecc------ccchhhcCcEEEEEEEeeccEEecc Confidence 1 100 1122221 1121110 0122222110 0011234556788888888888888 Q ss_pred hhhhhhhccccC Q lcl|Aclame:pro 298 DLGFFFENAVAA 309 (309) Q Consensus 298 ~~G~l~~~~va~ 309 (309) ++-..++-+-|. T Consensus 400 ~a~~~l~~~~~~ 411 (413) T protein:vir:81 400 EAIVQLDVAEVV 411 (413) T ss_pred cceEEEEecCCC Confidence 887777655555 No 71 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=92.99 E-value=0.0092 Score=31.75 Aligned_cols=254 Identities=11% Similarity=0.008 Sum_probs=116.5 Q ss_pred CCCC-CC---Ccch-hhHHHHHhhcchhhhhhhhCC-ccccc----cc---cceeEEechhHhhhchhHhhccccccccc Q lcl|Aclame:pro 1 MSNA-PF---PIDP-ELTAIAIAYRNGRMISDEVLP-RVPVG----KQ---EFKFWKYDLAQGFTVPETLVGRKSKPNEV 67 (309) Q Consensus 1 m~~~-~f---~~dp-~LT~~a~~y~n~~~ig~~lfP-~v~v~----~~---~~k~~~~~~~~~f~~~~t~~~~~~~~~~v 67 (309) |++. +. .+.| +++++.+ .++....+|- .+.+. .+ ..++|+|... -..+....+.....- T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~----~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~i----g~a~~~~~g~~i~~~ 72 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQ----AELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYS----GDAKVVAEGEKIPTD 72 (274) T ss_pred CCcceeehhheechHHHHHHHH----HHHHhhhhccccceecccccCCCCCEEEeeeecCC----CccccccCCCccchh Confidence 9984 22 2233 3444433 2333322222 11111 12 2345555321 112223334443334 Q ss_pred ccCcCccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCC Q lcl|Aclame:pro 68 EFSATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSD 147 (309) Q Consensus 68 e~~~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd 147 (309) +++....+..++..+-.-.++++ ...+..-||.+.+.+.+...+....+..+...+.... ..++. T Consensus 73 ~lt~~~~~~~i~~~~~a~~i~D~--~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~-------------~~~~~ 137 (274) T protein:vir:96 73 ILETKKREAKIRKIAKGTSISDE--ALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAK-------------LTVEA 137 (274) T ss_pred hcccceeEEEeeeeecceeehHH--HHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccc-------------ccccc Confidence 55555666666665544455544 3445566888888888777776666665554442211 11211 Q ss_pred CCCChHHHHHHHHHHhC---CCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeecc Q lcl|Aclame:pro 148 PTSNPLPVITDALDSVI---LRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIA 224 (309) Q Consensus 148 ~~sdPi~di~~~~~~~g---~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~~ 224 (309) .. --...|.++...+| ..+..++++++++..|++++.+. .+..+....+++..-++..++|++ |++-+. . T Consensus 138 ~~-~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~-f~~~s~~g~~~~~~G~ig~~~G~~-Vi~s~~-~--- 210 (274) T protein:vir:96 138 DI-TKLTGLQTAIDKFNDEDLEPMVLFISPLDAGKLRGDATTN-FTRATELGDDVIVKGAFGEALGAV-IVRSNK-L--- 210 (274) T ss_pred cc-cCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhhcccc-ccccccccccceeccccceecCeE-EEEeCC-C--- Confidence 11 11455666766664 56889999999999999876432 222222223345555677788877 444321 1 Q ss_pred ccCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhh Q lcl|Aclame:pro 225 RPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFE 304 (309) Q Consensus 225 ~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~ 304 (309) + ....++++..+ +|+-. ... -.++...-...+...+...+.+--+++-++.-..++ T Consensus 211 -~---~~t~~l~~~gA----------------~~~~~-~~~---~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~t 266 (274) T protein:vir:96 211 -E---AGTAILAKKGA----------------VKLIT-KRD---FFLETDRDPSTKTTALYSDKHYVAYLYDESKAVKIT 266 (274) T ss_pred -C---CceEEEEeccc----------------eeeee-cCC---cccccccccccccCEEEEeEEEEEEEEcCCcEEEEE Confidence 0 11123333332 22211 000 011111111123444555444444444444444443 Q ss_pred ccccC Q lcl|Aclame:pro 305 NAVAA 309 (309) Q Consensus 305 ~~va~ 309 (309) -.++ T Consensus 267 -k~~~ 270 (274) T protein:vir:96 267 -KGSG 270 (274) T ss_pred -cCCc Confidence 1122 No 72 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=92.99 E-value=0.0092 Score=31.75 Aligned_cols=254 Identities=11% Similarity=0.008 Sum_probs=116.5 Q ss_pred CCCC-CC---Ccch-hhHHHHHhhcchhhhhhhhCC-ccccc----cc---cceeEEechhHhhhchhHhhccccccccc Q lcl|Aclame:pro 1 MSNA-PF---PIDP-ELTAIAIAYRNGRMISDEVLP-RVPVG----KQ---EFKFWKYDLAQGFTVPETLVGRKSKPNEV 67 (309) Q Consensus 1 m~~~-~f---~~dp-~LT~~a~~y~n~~~ig~~lfP-~v~v~----~~---~~k~~~~~~~~~f~~~~t~~~~~~~~~~v 67 (309) |++. +. .+.| +++++.+ .++....+|- .+.+. .+ ..++|+|... -..+....+.....- T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~----~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~i----g~a~~~~~g~~i~~~ 72 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQ----AELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIYS----GDAKVVAEGEKIPTD 72 (274) T ss_pred CCcceeehhheechHHHHHHHH----HHHHhhhhccccceecccccCCCCCEEEeeeecCC----CccccccCCCccchh Confidence 9984 22 2233 3444433 2333322222 11111 12 2345555321 112223334443334 Q ss_pred ccCcCccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCC Q lcl|Aclame:pro 68 EFSATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSD 147 (309) Q Consensus 68 e~~~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd 147 (309) +++....+..++..+-.-.++++ ...+..-||.+.+.+.+...+....+..+...+.... ..++. T Consensus 73 ~lt~~~~~~~i~~~~~a~~i~D~--~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~-------------~~~~~ 137 (274) T protein:vir:95 73 ILETKKREAKIRKIAKGTSISDE--ALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAK-------------LTVEA 137 (274) T ss_pred hcccceeEEEeeeeecceeehHH--HHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccc-------------ccccc Confidence 55555666666665544455544 3445566888888888777776666665554442211 11211 Q ss_pred CCCChHHHHHHHHHHhC---CCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeecc Q lcl|Aclame:pro 148 PTSNPLPVITDALDSVI---LRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIA 224 (309) Q Consensus 148 ~~sdPi~di~~~~~~~g---~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~~ 224 (309) .. --...|.++...+| ..+..++++++++..|++++.+. .+..+....+++..-++..++|++ |++-+. . T Consensus 138 ~~-~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~-f~~~s~~g~~~~~~G~ig~~~G~~-Vi~s~~-~--- 210 (274) T protein:vir:95 138 DI-TKLTGLQTAIDKFNDEDLEPMVLFISPLDAGKLRGDATTN-FTRATELGDDVIVKGAFGEALGAV-IVRSNK-L--- 210 (274) T ss_pred cc-cCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhhcccc-ccccccccccceeccccceecCeE-EEEeCC-C--- Confidence 11 11455666766664 56889999999999999876432 222222223345555677788877 444321 1 Q ss_pred ccCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhh Q lcl|Aclame:pro 225 RPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFE 304 (309) Q Consensus 225 ~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~ 304 (309) + ....++++..+ +|+-. ... -.++...-...+...+...+.+--+++-++.-..++ T Consensus 211 -~---~~t~~l~~~gA----------------~~~~~-~~~---~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~t 266 (274) T protein:vir:95 211 -E---AGTAILAKKGA----------------VKLIT-KRD---FFLETDRDPSTKTTALYSDKHYVAYLYDESKAVKIT 266 (274) T ss_pred -C---CceEEEEeccc----------------eeeee-cCC---cccccccccccccCEEEEeEEEEEEEEcCCcEEEEE Confidence 0 11123333332 22211 000 011111111123444555444444444444444443 Q ss_pred ccccC Q lcl|Aclame:pro 305 NAVAA 309 (309) Q Consensus 305 ~~va~ 309 (309) -.++ T Consensus 267 -k~~~ 270 (274) T protein:vir:95 267 -KGSG 270 (274) T ss_pred -cCCc Confidence 1122 No 73 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=92.62 E-value=0.011 Score=31.40 Aligned_cols=265 Identities=12% Similarity=0.039 Sum_probs=117.4 Q ss_pred CCCCCCCcchhhHHHHHh-h--cchhhhhhhhCCcc---ccccccceeEEechhHhhhchhHhhcccccccccccCcCcc Q lcl|Aclame:pro 1 MSNAPFPIDPELTAIAIA-Y--RNGRMISDEVLPRV---PVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDE 74 (309) Q Consensus 1 m~~~~f~~dp~LT~~a~~-y--~n~~~ig~~lfP~v---~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~ 74 (309) |+--.|.+. .|+.+=.= | +.+++.+.+|||.. +-..+++.|..++..=... +.-.-....+...++....++ T Consensus 1 ~~~lafl~~-qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~~d~~G~a~-~~~i~~~a~dip~vd~~~~~~ 78 (304) T protein:vir:52 1 MSLLAYVKN-GLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYGADEHGSLD-DGLITVGTSTLDQVEVGFTPT 78 (304) T ss_pred CchHHHHHH-HHHHHhhhhhccccccchhhhhccccCCCCcccceEEEeeeeccCccc-ccccCCcCCccceeeccccee Confidence 988888877 47665422 4 35679999999954 3334445555553321111 000111223445667777777 Q ss_pred ceeeeccchhhcCCHHHHHHHhh-cCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-----c-----Ccccceecc--- Q lcl|Aclame:pro 75 TGSTEDHGLDAPVPQADIDNAPT-NYNPLGHATEQTTNLILLDREARTSKLVFSPNS-----Y-----AAGNKTTLS--- 140 (309) Q Consensus 75 ~~~~~e~~L~~~v~~~~~~~a~~-~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~-----y-----~~~~~~~ls--- 140 (309) ...+...+........|...++. +.+...+-.+... +..|...-++++-+.. + +.-...+.+ T Consensus 79 ~~~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~----~a~~~~~n~v~~~Gd~~~~g~~GllN~p~v~~~~~~~~~ 154 (304) T protein:vir:52 79 RSYIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALN----KNAQQTLQKVAFLGHAKDSRLTGLLNNKSVEVYAIKGAA 154 (304) T ss_pred EEEEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHH----HHHHhhhceEEEEeeccccceEEEEeCCCcceeeecCCc Confidence 77777776666666677766653 4554433332222 2223333333333311 1 111111111 Q ss_pred cccccCCCCCC-hHHHHHHHHHHh-----C-CCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHh---- Q lcl|Aclame:pro 141 GADQWSDPTSN-PLPVITDALDSV-----I-LRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELL---- 209 (309) Q Consensus 141 gt~~Wsd~~sd-Pi~di~~~~~~~-----g-~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~---- 209 (309) .+.+|-..+.| ++.||.+...++ + ..|++++|.+..+..|.. ... ++++..+ .+.|++-. T Consensus 155 a~~~w~~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp~~~~~l~~-----~~~--~~~~~Tv--l~~l~~n~~~~~ 225 (304) T protein:vir:52 155 QNTKVQAMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDSLDLAHLAL-----VQR--ANTDTTA--LEFLTKHLSAAA 225 (304) T ss_pred cCCccccCCHHHHHHHHHHHHHHHHhccCceecCceEEeCHHHHHHHhh-----ccC--CCCCchH--HHHHHHhccccc Confidence 12458777665 888888887765 2 469999999999998842 011 1111111 23333321 Q ss_pred CCC-eEEeecceeeccccCCCc---ccceecCCcEEEEecCCCCCCcCcceecccccccccccC-CccccccccCCceEE Q lcl|Aclame:pro 210 ELD-AIYIGEARLNIARPGQNP---NLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSG-SIADPNIGLRGGQRV 284 (309) Q Consensus 210 gl~-~I~v~~a~~~~~~~g~~~---~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~-~~~d~~~g~~g~~~v 284 (309) |.+ +|+.-.. .....|..+ -+-+--+...+-+..+ ..-+ .++ .| ..+. .+..+++..-||..+ T Consensus 226 g~~l~I~~v~~--~~~~~g~~g~~r~vvY~~d~~~~~~~vP-~p~~----~l~--~q---~~~~~~~~vp~~~r~gGv~v 293 (304) T protein:vir:52 226 GRQVAIKALPS--NYGTRVTDGKTRAMVYVNSKEHVIFDVP-MSPT----VLD--AQ---PKGLLAFESGLRMAFGGVTF 293 (304) T ss_pred CCcceEEEecc--cccccCCCCceEEEEEecChhheEEecC-cccc----ccc--hh---hcCCceEEecceeeeeeEEE Confidence 111 1221100 111111111 0111111112212111 1000 001 11 1111 233445555566444 Q ss_pred Eeecc---cce Q lcl|Aclame:pro 285 RVGES---VKE 292 (309) Q Consensus 285 ~v~~~---~~~ 292 (309) +.-+. +|. T Consensus 294 ~~P~a~~y~D~ 304 (304) T protein:vir:52 294 MEPDSALYVDY 304 (304) T ss_pred EccceeeeecC Confidence 33211 122 No 74 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=91.95 E-value=0.013 Score=30.84 Aligned_cols=265 Identities=10% Similarity=0.009 Sum_probs=120.6 Q ss_pred CCC------CCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhccccccccc-ccCcCc Q lcl|Aclame:pro 1 MSN------APFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEV-EFSATD 73 (309) Q Consensus 1 m~~------~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~v-e~~~~~ 73 (309) |+. ...+|..+...|...-+...-| ..+++.+|+.....++++....... .-++.++..... ...+.+ T Consensus 107 ~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l-~~~~~~~~~~~~~~~~~~~~~~~~a----~wv~E~~~~~~~~~~~~~~ 181 (401) T protein:vir:44 107 LQVGTDEDGGYAVPEELDRSILSLLKDEVVM-RQEATVITVGGSDYKKLVNLGGTAS----GWVGETDTRSQTATSRLGL 181 (401) T ss_pred hhcCCCCCCceeccHhHHHHHHHHHHhhhhh-hhhceeeecCCCceEEEEecCCccc----eeeccccccCcccccccee Confidence 332 2245555444444332322222 3456778888777788876443211 112333322221 123444 Q ss_pred cceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecc--------ccccc Q lcl|Aclame:pro 74 ETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLS--------GADQW 145 (309) Q Consensus 74 ~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~ls--------gt~~W 145 (309) .++..+..+-..+++.+-..+ +.+|....-.+.+.+.+....|.. ++++..-+ ..+..|+ ++..| T Consensus 182 v~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~la~ai~~~~~~~----~l~G~G~~-~p~Gil~~~~~~~~~~~~~~ 254 (401) T protein:vir:44 182 IEPFMGEIYGNPQATQKMLDD--AFFNVEAWINSELATEFAEQEEIA----FTTGDGTK-KPKGFLAYESTEESDKARAF 254 (401) T ss_pred eeeehhheeeehhhhHHHHhc--chHHHHHHHHHHHHHHHHHHHHhh----hhccCCCC-ccceeecccccccccccccc Confidence 344443333344555554443 345556666666676665544432 22221100 1111111 01111 Q ss_pred C--------CCCCChHHHHHHHHHHh--C-CCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCH----HHHHHHhC Q lcl|Aclame:pro 146 S--------DPTSNPLPVITDALDSV--I-LRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPM----AFLQELLE 210 (309) Q Consensus 146 s--------d~~sdPi~di~~~~~~~--g-~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~----~~l~~l~g 210 (309) . ..+.--..+|.+.+..+ . ....+.+|+++.|.+|+. ++ ..++.+ ++.+ ..-..++| T Consensus 255 ~~~~~~~t~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~---lk----d~~G~~-l~~~~~~~g~~~~l~G 326 (401) T protein:vir:44 255 GKLQHIVSGEATAVTADAIIKLIYTLRKAHRTGAKFMMNNNSLFAIRL---LK----DTEGNY-LWRPGLELGQPSSLAG 326 (401) T ss_pred ccccccccccccccCHHHHHHHHHhcchhhhcCCEEEEcHHHHHHHHH---hh----ccCCce-eecCCcCCCCCceecc Confidence 1 11111244555555444 2 234478999999988763 22 222222 2211 11234678 Q ss_pred CCeEEeecceeeccccCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeeccc Q lcl|Aclame:pro 211 LDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESV 290 (309) Q Consensus 211 l~~I~v~~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~ 290 (309) +| |++-+..-. .+ . ++..++|.+- +.+| ....+.+..+..+.+...+...+++...+ T Consensus 327 ~P-Vv~~~~~p~---~~-~--------~~~~i~~Gd~--------~~~~--~i~~~~~~~~~~~~~~~~~~v~~~a~~r~ 383 (401) T protein:vir:44 327 YG-IAENEQMPD---IA-A--------DAKAIAFGNF--------KRGY--TIVDRIGTRILRDPYTNKPFVGFYTTKRT 383 (401) T ss_pred ee-eEEecCcCC---cc-C--------CccEEEEeeh--------hccE--EEEEecceEEeeeccccCCcEEEEEEEEe Confidence 77 443332110 00 0 1111122110 1111 11222222333333444666778999999 Q ss_pred ceeeecchhhhhhhcccc Q lcl|Aclame:pro 291 KELVTAPDLGFFFENAVA 308 (309) Q Consensus 291 ~~~v~~~~~G~l~~~~va 308 (309) +-.++-+++..+++-+.| T Consensus 384 d~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 384 GGMLVDSQAIKLLKIAAA 401 (401) T ss_pred ccEEecccceEEEEeecC Confidence 999999999999988888 No 75 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=91.74 E-value=0.014 Score=30.68 Aligned_cols=269 Identities=12% Similarity=0.027 Sum_probs=118.1 Q ss_pred CC-----CCC-CCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHh----hhchhHhhcccccccccccC Q lcl|Aclame:pro 1 MS-----NAP-FPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQG----FTVPETLVGRKSKPNEVEFS 70 (309) Q Consensus 1 m~-----~~~-f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~----f~~~~t~~~~~~~~~~ve~~ 70 (309) +. ... .++-..+..+...-.....+-..+++.+|+.....+|++...... ......-++.++........ T Consensus 123 ~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~ 202 (419) T protein:vir:94 123 APAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLS 202 (419) T ss_pred cccccccCCcccccchhhhHHHHHHHhhhhhhhhcceeeeccCCceeeeeeccccccccccCcccceecCCccccccccc Confidence 11 111 111111222222211112222345667777766677776543210 01112233445555555666 Q ss_pred cCccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccc--cCcc-----cceeccccc Q lcl|Aclame:pro 71 ATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNS--YAAG-----NKTTLSGAD 143 (309) Q Consensus 71 ~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~--y~~~-----~~~~lsgt~ 143 (309) +.+.++..+..+-..+|+++-..+.. +....-.+.+.+.+....+..+ +++.. -+.+ ...+..... T Consensus 203 ~~~i~~~~~k~~~~~~is~ell~d~~---~l~~~i~~~la~a~~~~~d~ai----i~G~G~~~p~Gi~~~~~~~~~~~~~ 275 (419) T protein:vir:94 203 FDTITTTLKTVAHWLPITRQAADDNS---QLMGYIQGRLTYGLRFLRDRQL----LNGNGSTEMQGILTTPGIGTYQQPK 275 (419) T ss_pred eeeEEeeeeeEEEeehhhHHHHHhHH---HHHHHHHHHHHHHHHHHHHHHH----HhccCcccccceecccccccccccc Confidence 77777777777666678877665432 2333334445555555554333 22211 0100 000011111 Q ss_pred ccCCC-CCChHHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccc----cCHHHHHHHhCCCeEE Q lcl|Aclame:pro 144 QWSDP-TSNPLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGM----VPMAFLQELLELDAIY 215 (309) Q Consensus 144 ~Wsd~-~sdPi~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~----vt~~~l~~l~gl~~I~ 215 (309) .+... ....+.+|..++..+ +..|+..+|+++.|..|+. ++.. ++..-. +....-..++|+| |+ T Consensus 276 ~~~~~t~~~~~~~l~~~~~~~~~~~~~~~~~v~n~~~~~~l~~---~k~~----~~~~~~~~~~~~~~~~~~l~G~p-V~ 347 (419) T protein:vir:94 276 PTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWESIEL---DQAP----GSGVFRVIANVQGEATPRIWGLN-VV 347 (419) T ss_pred cccccccchhHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHH---Hhhc----CCCceeecCCcccCCCcccccee-eE Confidence 22233 345677888877654 6788999999999988753 2211 111101 1111123567876 43 Q ss_pred eecceeeccccCCCcccceecCCc--EEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeeccccee Q lcl|Aclame:pro 216 IGEARLNIARPGQNPNLIRAWGPH--ASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKEL 293 (309) Q Consensus 216 v~~a~~~~~~~g~~~~~~~v~~~~--~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~ 293 (309) +-+.. +..+ -++++- .++++. .-|.+.++..- ....-..+...+|+...++-. T Consensus 348 ~~~~~-----~~~~----~~~gd~~~~~~~~~----------~~~~~v~~~~~------~~~~~~~~~~~~r~~~r~d~~ 402 (419) T protein:vir:94 348 STVAI-----AQGT----ALVGGFRQGATLWS----------RQGITVLMTDS------HADFFTANTLVILAEFRANLA 402 (419) T ss_pred EcCCC-----CCcc----EEEeeccceEEEEE----------ecceEEEEecc------ccchhhcCcEEEEEEEeeccE Confidence 32221 1110 111211 111100 00112211110 001112455668888888888 Q ss_pred eecchhhhhhhccccC Q lcl|Aclame:pro 294 VTAPDLGFFFENAVAA 309 (309) Q Consensus 294 v~~~~~G~l~~~~va~ 309 (309) +.-+++-..++-+-|= T Consensus 403 v~~~~a~~~~~~~aa~ 418 (419) T protein:vir:94 403 VYQPKAFVRVTFAAAT 418 (419) T ss_pred EeccccEEEEEeccCC Confidence 8887766665433332 No 76 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=91.60 E-value=0.015 Score=30.57 Aligned_cols=262 Identities=12% Similarity=0.000 Sum_probs=117.9 Q ss_pred CCCC------CCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhccccccccc-ccCcCc Q lcl|Aclame:pro 1 MSNA------PFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEV-EFSATD 73 (309) Q Consensus 1 m~~~------~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~v-e~~~~~ 73 (309) |+.. ..++..+...|-..-++..-| ..++..+|+....++++.... ........-++.++..... ..++.+ T Consensus 5 ~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l-~~~~~~~~~~~~~g~~~~~~~-~~~~~~a~~v~Eg~~~~~~~~~~~~~ 82 (293) T protein:vir:48 5 KTDHSGSDAGLTIPQDIRTAINTLVRQYDSL-QEYVNVENVTTLTGSRVYEKW-TDITGLANIDDEAGKIADIDDPKLSL 82 (293) T ss_pred ecccccCcCceEechhHHHHHHHHHHhhhhh-hhhceeeeccCCcceEEEEee-cCCCcceeeecCCcccccccccceeE Confidence 4421 245555555554332222222 233556677666665443211 1111112234445554432 345666 Q ss_pred cceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCChH Q lcl|Aclame:pro 74 ETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPL 153 (309) Q Consensus 74 ~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sdPi 153 (309) .+..++..+-..++++|-.++. .+|.+..-.+.+.+.+....+.. +++...- ++. ..+.... T Consensus 83 i~l~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~la~~~~~~~~~~----i~~g~~~---------~~~---~~~~~~~ 144 (293) T protein:vir:48 83 IKYTIKRYAGISTVTNSLLADS--AENILAWLSGWIAKKVVVTRNKA----ILGVVDK---------LPT---KPTLTKW 144 (293) T ss_pred EEEeeeEEEEeehhhHHHHhhh--hHHHHHHHHHHHHHHHHHHHHhH----Hhhcccc---------ccc---cccccCH Confidence 6777766666667777765543 35555555555666655544432 2222110 000 1222334 Q ss_pred HHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccC----HHHHHHHhCCCeEEeecceeecccc Q lcl|Aclame:pro 154 PVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVP----MAFLQELLELDAIYIGEARLNIARP 226 (309) Q Consensus 154 ~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt----~~~l~~l~gl~~I~v~~a~~~~~~~ 226 (309) .||.+...++ .......+|++.+|..|+. +++ .++.. +.. ...-..++|.|- ++.+..+... T Consensus 145 d~i~~~~~~l~~~~~~~a~~vmn~~~~~~L~~---lkd----~~g~~-l~~~~~~~~~~~~l~G~Pv-~~~~~~~~~~-- 213 (293) T protein:vir:48 145 DDIIDLEAKVDPAIKQTSFFLTNTSGFTALKK---VKN----ALGDY-LMERDVKSPTGYSIAGFAV-KEISDRWLPN-- 213 (293) T ss_pred HHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHH---hhc----cCCce-EeecCcCCCCCceecceee-EEecccccCC-- Confidence 4555554443 4455678999999988765 221 22211 111 111234678773 3332222111 Q ss_pred CCCcccceecCCc--EEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhh Q lcl|Aclame:pro 227 GQNPNLIRAWGPH--ASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFE 304 (309) Q Consensus 227 g~~~~~~~v~~~~--~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~ 304 (309) +...+..-++++. .+++... . |.+.+... .....-..+...+|+.+.++-++.-+++-.+++ T Consensus 214 ~~~~~~~~~~gd~~~~~~~~~~------~----~~~i~~~~------~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~ 277 (293) T protein:vir:48 214 ASSGVMPLYFGDLKQAVTLFDR------Q----QMSLLSTN------IGGGAFETDTTKVRVIDRFDVVATDTEAFVPAS 277 (293) T ss_pred ccCCceEEEEEeccceEEEEEe------c----ceEEEEec------ccchhhhcCeEEEEEEEeeCcEEecccceEEEE Confidence 1111111233321 1111110 0 11111100 000111244556788888888888888888777 Q ss_pred ccccC Q lcl|Aclame:pro 305 NAVAA 309 (309) Q Consensus 305 ~~va~ 309 (309) -..++ T Consensus 278 ~~~~~ 282 (293) T protein:vir:48 278 FKAIA 282 (293) T ss_pred eeccc Confidence 44444 No 77 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=91.36 E-value=0.016 Score=30.40 Aligned_cols=265 Identities=11% Similarity=0.011 Sum_probs=116.0 Q ss_pred CCCC-CCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhccccccccc-ccCcCccceee Q lcl|Aclame:pro 1 MSNA-PFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEV-EFSATDETGST 78 (309) Q Consensus 1 m~~~-~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~v-e~~~~~~~~~~ 78 (309) -++. ..++..+.+.|-..-++..-|. .+++.+|+....++++.+... ........++.++..... ...+.+.+... T Consensus 114 ~~~gg~~iP~~~~~~ii~~~~~~~~l~-~~~~~~~~~~~~~~~~~~~~~-~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~ 191 (397) T protein:vir:48 114 GSDAGLTIPQDIQTAIHTLVRQYDSLQ-EYVNVENVTTLTGSRVYEKWA-DITGLAKLDDEAGSIGTNDDPKLYPIRYAI 191 (397) T ss_pred CccccccccHHHHHHHHHHHHHHHHHH-hhhceeeccCCcceEEEEeec-CCCcceeeeccccccccccccceeeEEeeh Confidence 1111 3445555555543333322232 335667777777776654321 111111223333433332 23445555555 Q ss_pred eccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCChHHHHHH Q lcl|Aclame:pro 79 EDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPLPVITD 158 (309) Q Consensus 79 ~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sdPi~di~~ 158 (309) +..+-..+++++-..+ +.++......+.+.+.+....+..+.. +. +.....++...| |-|.++.. T Consensus 192 ~k~~~~~~iS~ell~d--s~~~l~~~v~~~l~~~~~~~~d~~il~----G~----g~~~~~~~~~~~-----d~i~~~~~ 256 (397) T protein:vir:48 192 KRYAGISTVTNSLLAD--SAENILAWLSGWIAKKVVVTRNKAILE----AI----ATLPTKPTLTKW-----DDIIDLQA 256 (397) T ss_pred eeeeeehhhHHHHHhh--chHHHHHHHHHHHHHHHHHHHHHHHhh----cc----cccccccccccH-----HHHHHHHH Confidence 5555445666665443 345566666667777776665544332 21 111112222222 33444444 Q ss_pred HHHHhCCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCH----HHHHHHhCCCeEEeecceeeccccCCCcccce Q lcl|Aclame:pro 159 ALDSVILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPM----AFLQELLELDAIYIGEARLNIARPGQNPNLIR 234 (309) Q Consensus 159 ~~~~~g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~----~~l~~l~gl~~I~v~~a~~~~~~~g~~~~~~~ 234 (309) -++.....+...+|+++.|.+|+. ++..++.+ +..+ ..-..++|.|-+.+.+..... +..+...- T Consensus 257 ~l~~~~~~~a~~v~n~~~~~~L~~-------lkd~~G~~-i~~~~~~~~~~~~l~G~PV~~~~~~~~~~---~~~~~~~~ 325 (397) T protein:vir:48 257 KVDPAIKQTSFFLTNTSGFTALKK-------VKNAFGDY-LMERDVKSPTGYSIDGFAVKEVADRWLAN---ASSGAMPL 325 (397) T ss_pred HhhhhhcCCCEEEECHHHHHHHHH-------hhcCCCce-eeccCcCCCCCceeccceeEEecccccCC---cCCCceEE Confidence 444456678899999999998874 33333322 2211 122346777744433222211 11122222 Q ss_pred ecCCc--EEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhccccC Q lcl|Aclame:pro 235 AWGPH--ASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVAA 309 (309) Q Consensus 235 v~~~~--~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va~ 309 (309) ++++- .++++.. . |.+.+... ....+-..+...+|+...++-.+.-+++-..++=.-++ T Consensus 326 ~~gd~~~~~~~~~~------~----~~~i~~~~------~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~ 386 (397) T protein:vir:48 326 YFGDLKQAVTLFDR------Q----QMSLLSTN------IGGGAFETDTTKIRVIDRFDVVATDTESFVPASFKAIA 386 (397) T ss_pred EEEeccceEEEEee------c----ceEEEEec------cchhhhhcCceeEEEEeeeccEEecccceEEEEecccc Confidence 33321 1111110 0 11111000 00011123444566666666666666655555422222 No 78 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=90.66 E-value=0.02 Score=29.94 Aligned_cols=275 Identities=8% Similarity=-0.081 Sum_probs=121.5 Q ss_pred CCC-CCCCcchhhHHHH-HhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCccceee Q lcl|Aclame:pro 1 MSN-APFPIDPELTAIA-IAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDETGST 78 (309) Q Consensus 1 m~~-~~f~~dp~LT~~a-~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~~~~ 78 (309) -++ ...++..++..+. ...+...-| ..+...+++ .....+|+...... ..-++-++.......++...++.. T Consensus 255 ~~~gg~lip~~~~~~ii~~~~~~~~~l-~~~~~~~~~-~g~~~~~~~~~~~~----a~~v~Eg~~~~~~~~~~~~i~~~~ 328 (543) T protein:vir:81 255 KADGGYLVPFQLDPTVIITSNGSLNDI-RRFARQVVA-TGDVWHGVSSAAVQ----WSWDAEFEEVSDDSPEFGQPEIPV 328 (543) T ss_pred cccCcccCchhhhhHHHHHHHhhhchh-hhhcccccC-CcceEEEEecCCcc----eeecccCccccccccccceeeeee Confidence 111 1234444444432 222221112 222322222 22344555432211 122445555555566666666666 Q ss_pred eccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecc------cccccCCCCCCh Q lcl|Aclame:pro 79 EDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLS------GADQWSDPTSNP 152 (309) Q Consensus 79 ~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~ls------gt~~Wsd~~sdP 152 (309) +..+-..+++.+-..+. .+....-.+.+.+.+....+ ..++++..-+..-...++ .+..+...++.+ T Consensus 329 ~k~~~~~~is~ell~d~---~~~~~~i~~~l~~~~~~~~d----~ail~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~~~~~ 401 (543) T protein:vir:81 329 KKAQGFVPISIEALQDE---ANVTETVALLFAEGKDELEA----VTLTTGTGQGNQPTGIVTALAGTAAEIAPVTAETFA 401 (543) T ss_pred eeeEeeehhhHHHHhcc---HHHHHHHHHHHHHHHHHHHH----HHHhccCCCCcccccchhhccccccccccccccccc Confidence 66666667777655432 34444444555555544443 334443211111111111 111223355667 Q ss_pred HHHHHHHHHHh--CCCCc-EEEeCHHHHHHHhcCHHHHHHhccCCCcccccCH---HHHHHHhCCCeEEeecceeecccc Q lcl|Aclame:pro 153 LPVITDALDSV--ILRPN-IGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPM---AFLQELLELDAIYIGEARLNIARP 226 (309) Q Consensus 153 i~di~~~~~~~--g~~Pn-~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~---~~l~~l~gl~~I~v~~a~~~~~~~ 226 (309) ..++.+....+ .+.++ ..+|++++|..|+. +++ .++.. +..+ ..-..++|+| |++.+..-..... T Consensus 402 ~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~l~~---lkd----~~G~~-l~~~~~~g~~~~l~G~p-v~~~~~~~~~~~~ 472 (543) T protein:vir:81 402 LADVYAVYEQLAARHRRQGAWLANNLIYNKIRQ---FDT----QGGAG-LWTTIGNGEPSQLLGRP-VGEAEAMDANWNT 472 (543) T ss_pred HHHHHHHHHhhhccccCCcEEEEcHHHHHHHHH---hhc----CCCce-eccCcCCCCCcccccee-eEEeccccccccc Confidence 88888777665 44444 68999999998764 221 12111 1100 0112467876 4443332100000 Q ss_pred CCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhcc Q lcl|Aclame:pro 227 GQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENA 306 (309) Q Consensus 227 g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~ 306 (309) ....++..++|.+-..- .-+..-|.+.++..... ..+.-..+...+++...++-.+.-+++-.+++-+ T Consensus 473 -------~~~~~~~~i~~gd~~~~-~i~~~~~~~i~~~~~~~----~~~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~ 540 (543) T protein:vir:81 473 -------SASADNFVLLYGNFQNY-VIADRIGMTVEFIPHLF----GTNRRPNGSRGWFAYYRMGADVVNPNAFRLLNVE 540 (543) T ss_pred -------cccCCcceEEEeeccce-eEEeecccEEEEecccc----ccchhhcCceEEEEEEeeccEeecccceEEEEec Confidence 01112222333221110 00000122222111000 0011123555688888889999999999899888 Q ss_pred ccC Q lcl|Aclame:pro 307 VAA 309 (309) Q Consensus 307 va~ 309 (309) .|| T Consensus 541 ~~a 543 (543) T protein:vir:81 541 TAS 543 (543) T ss_pred ccC Confidence 888 No 79 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=90.48 E-value=0.021 Score=29.83 Aligned_cols=255 Identities=13% Similarity=0.026 Sum_probs=116.8 Q ss_pred CCCC-CCC---cchh-hHHHHHhhcchhhhhhhhCC-ccc----cccccceeEEechhHhhhchhHhhcccccccccccC Q lcl|Aclame:pro 1 MSNA-PFP---IDPE-LTAIAIAYRNGRMISDEVLP-RVP----VGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFS 70 (309) Q Consensus 1 m~~~-~f~---~dp~-LT~~a~~y~n~~~ig~~lfP-~v~----v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~ 70 (309) ||+. +.. +.|. +++ |...++-.-..|. .+. ...+.++...|++-... -+.+....+.....-+++ T Consensus 1 Ma~~~T~l~d~i~Pev~~~----~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~i-gda~~~~eg~~i~~~~lt 75 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAP----MMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYS-GDATVVPEGQKIPVDKIE 75 (276) T ss_pred CCcceeehhhhhchHHHHH----HHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCC-CccccccCCCccCccccc Confidence 9974 221 2222 233 3333333333332 121 22234444444332222 122335555555555566 Q ss_pred cCccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCC Q lcl|Aclame:pro 71 ATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTS 150 (309) Q Consensus 71 ~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~s 150 (309) ..+....++.++-...++++. .....-||...+.+.+...+.......+...+.... ..++.... T Consensus 76 ~~~~~a~i~~~~k~~~~tD~a--~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~-------------~~~~~~~~ 140 (276) T protein:vir:10 76 TNRREAKIHKIGKGTDITDEA--LLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTK-------------LTVSADIG 140 (276) T ss_pred cceeeEEeehccccccccHHH--HHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccc-------------cccccccc Confidence 666777777666555666554 344567888888888888777766665555443211 11222211 Q ss_pred ChHHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeeccccC Q lcl|Aclame:pro 151 NPLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPG 227 (309) Q Consensus 151 dPi~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~~~~g 227 (309) + ...|.++...+ ...+++++++++++..|++... .+.+..+....+.+..-++..++|++ |++-+.. T Consensus 141 t-~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~-~~f~~~s~~g~~~~~~G~ig~~~G~~-Vi~s~~~------- 210 (276) T protein:vir:10 141 T-LAGLEAAIDTFDDEDLEPMVLFINPKDAGKLRSSAS-DNFTRATELGDNIIVKGAFGEALGAV-IVRSKKL------- 210 (276) T ss_pred C-HHHHHHHHHHhccccCcccEEEEcHHHHHHHHHhcc-ccccccccccccceeccccceeccee-EEEcCCC------- Confidence 2 34455555555 4678999999999999986421 12222232223455566777888876 4443211 Q ss_pred CCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccc--------eeeecchh Q lcl|Aclame:pro 228 QNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVK--------ELVTAPDL 299 (309) Q Consensus 228 ~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~--------~~v~~~~~ 299 (309) +....++++..++-++.. . +.+.| .+|-.. .+...+.+...+- -+++-... T Consensus 211 -p~~t~~l~~~gAi~~~~~-~---------~~~vE-~dRd~~---------~~~d~i~~~~~y~~~~~~~~~vv~~t~~~ 269 (276) T protein:vir:10 211 -DEGEAILAKRGAVKLITK-R---------DFFLE-TDRDPS---------TKTTALYSDKHYVAYLYDESKAVKVTKGA 269 (276) T ss_pred -CcceEEEEeccceeeeec-C---------Cceee-cccchh---------hcccEEEEeeEEEEEEEcCcceEEEecCC Confidence 011123444333322111 0 11111 111111 1222333333332 22332333 Q ss_pred hhhhhcc Q lcl|Aclame:pro 300 GFFFENA 306 (309) Q Consensus 300 G~l~~~~ 306 (309) |-.=.+| T Consensus 270 ~~~~~~~ 276 (276) T protein:vir:10 270 GTTDSGA 276 (276) T ss_pred cCCcCCC Confidence 3333333 No 80 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=90.01 E-value=0.023 Score=29.55 Aligned_cols=265 Identities=12% Similarity=-0.002 Sum_probs=110.8 Q ss_pred CCC-----C-CCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccc-cCcCc Q lcl|Aclame:pro 1 MSN-----A-PFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVE-FSATD 73 (309) Q Consensus 1 m~~-----~-~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve-~~~~~ 73 (309) |+. . ..++..+...|-..-++..-|. .+.+.+|+....++++.... ........-++.++...... ..+.. T Consensus 109 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~-~~~~~~~~~~~~~~~~~~~~-~~~~~~a~~v~E~~~~~~~~~~~~~~ 186 (397) T protein:vir:49 109 KTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQ-EYVNVENVTTLTGSRVYEKW-ADITGLAKLDDEGGQIGQNDDPKLSL 186 (397) T ss_pred hhccCCccCcceecHHHHHHHHHHHHhhhhHh-hhcceeeccCCcceEEEEee-ccCCcceeeeccccccccccccceee Confidence 221 1 2333333333321111111122 23455666666665443211 11111112334444443333 24455 Q ss_pred cceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCChH Q lcl|Aclame:pro 74 ETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPL 153 (309) Q Consensus 74 ~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sdPi 153 (309) .++.++..+-..+++++-..++ .+|......+.+.+.+....+..+ +++.. .....++. .+.|.+ T Consensus 187 v~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~~~~~~d~ai----l~G~g----~~~~~~~~-----~~~d~i 251 (397) T protein:vir:49 187 IRYAIKRYAGISTVTNSLLADS--AENILAWLSGWIAKKVVVTRNKAI----LEAIG----TLPNKPTL-----AKWDDI 251 (397) T ss_pred eEeeeeeeEeehhhHHHHHhhh--hHHHHHHHHHHHHHHHHHHHHHHH----Hhccc----cccccccc-----cCHHHH Confidence 5555555555556666655433 355566666667777766555433 22221 11111111 122334 Q ss_pred HHHHHHHHHhCCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHH----HHHHHhCCCeEEeecceeeccccCCC Q lcl|Aclame:pro 154 PVITDALDSVILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMA----FLQELLELDAIYIGEARLNIARPGQN 229 (309) Q Consensus 154 ~di~~~~~~~g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~----~l~~l~gl~~I~v~~a~~~~~~~g~~ 229 (309) .++..-++.....+...+|++..|.+|+. ++..++.. ++.+. .=..++|+|-+++.+..... +.. T Consensus 252 ~~~~~~l~~~~~~~a~~v~n~~~~~~l~~-------lkd~~g~~-l~~~~~~~g~~~~l~G~pV~~~~~~~~~~---~~~ 320 (397) T protein:vir:49 252 IDLQAKVDPAIKQTSLFLTNTSGFTALKK-------VKNAMGDY-LMERDVKSPTGYSIDGFVVKEISDRFLPN---GTG 320 (397) T ss_pred HHHHHhhhhhhcCCCEEEEcHHHHHHHHH-------hhccCCce-eecccccCCCCceecceeeEEeccccccc---ccC Confidence 44444445557778899999999988765 22222222 11111 11236677633332222211 111 Q ss_pred cccceecCCc--EEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhccc Q lcl|Aclame:pro 230 PNLIRAWGPH--ASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAV 307 (309) Q Consensus 230 ~~~~~v~~~~--~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~v 307 (309) ....-++++. .++++.. -|.+.+.... .+ ..-..+...+|+...++-.+.-+++-.+++-.- T Consensus 321 ~~~~~~~gd~~~~~~~~~~----------~~~~i~~~~~-~~-----~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~ 384 (397) T protein:vir:49 321 GAMPLYFGDLKQAVTLFDR----------QHLSLLSTNI-GG-----GAFETDTTKVRVIDRFDVVSTDTEAFVPASFKA 384 (397) T ss_pred CceeEEEeeccceEEEEee----------cccEEEEecc-cc-----chhhcCeeeEEEEEeeccEEecccceEEEEecc Confidence 2222233321 1111110 0111111000 00 011234445666677777777776666664222 Q ss_pred cC Q lcl|Aclame:pro 308 AA 309 (309) Q Consensus 308 a~ 309 (309) .+ T Consensus 385 ~~ 386 (397) T protein:vir:49 385 IA 386 (397) T ss_pred cc Confidence 22 No 81 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=89.69 E-value=0.025 Score=29.37 Aligned_cols=265 Identities=11% Similarity=0.020 Sum_probs=110.9 Q ss_pred CCC------CCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccc-cccCcCc Q lcl|Aclame:pro 1 MSN------APFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNE-VEFSATD 73 (309) Q Consensus 1 m~~------~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~-ve~~~~~ 73 (309) |+. ...++..+...|...-+...-|. .++..+|+...+++++...- ........-++-++.... ....+.. T Consensus 109 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~-~~~~~~~~~~~~~~~~~~~~-~~~~~~a~~v~E~~~~~~~~~~~~~~ 186 (397) T protein:vir:49 109 KTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQ-EYVNVENVTTLTGSRVYEKW-TDITGLANIDDEAGKIADVDDPKLSL 186 (397) T ss_pred hhccccccCcccccHhHHHHHHHHHHhhhhHH-hhhceeecccCccceEEEee-ccCCcceeeecCccccccccccceee Confidence 221 12233333333322212211122 23556667666665443211 112111223444444433 3345555 Q ss_pred cceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCChH Q lcl|Aclame:pro 74 ETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPL 153 (309) Q Consensus 74 ~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sdPi 153 (309) .+...+..+-..+++++-..+. .++....-.+.+.+.+....+..+ +++. +.....++...| |-+ T Consensus 187 i~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~~~~~~d~ai----~~G~----g~~~~~~~~~~~-----d~i 251 (397) T protein:vir:49 187 IKYTIKRYAGISTVTNSLLADS--AENILAWLSGWIAKKVVVTRNKAI----LEAI----AALPTKPTLTKW-----DDI 251 (397) T ss_pred EEeeeeeEEeeehhHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHH----Hhhc----cccccccccccH-----HHH Confidence 5555555555556666654433 345555556666666665555433 2221 111112222222 223 Q ss_pred HHHHHHHHHhCCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCH----HHHHHHhCCCeEEeecceeeccccCCC Q lcl|Aclame:pro 154 PVITDALDSVILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPM----AFLQELLELDAIYIGEARLNIARPGQN 229 (309) Q Consensus 154 ~di~~~~~~~g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~----~~l~~l~gl~~I~v~~a~~~~~~~g~~ 229 (309) .++...++.........+|+++.|..|+. ++..++.+ +..+ ..-..++|.|-+++.+.... .+.. T Consensus 252 ~~~~~~l~~~~~~~a~~vmn~~~~~~l~~-------lkd~~G~~-l~~~~~~~~~~~~l~G~PV~~~~~~~~~---~~~~ 320 (397) T protein:vir:49 252 IDLEAKVDPAIKQTSFFLTNTSGFTALKK-------VKNALGDY-LMERDVKSPTGYSIDGFAVKEVADRWLA---NGTG 320 (397) T ss_pred HHHHHhhhhhhcCCCEEEEcHHHHHHHHH-------hhcCCCce-eeccCcCCCCCceecceeeEEecccccc---cccC Confidence 33333333345666789999999988764 22233222 1111 11234677774333222211 1111 Q ss_pred cccceecCCc--EEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhccc Q lcl|Aclame:pro 230 PNLIRAWGPH--ASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAV 307 (309) Q Consensus 230 ~~~~~v~~~~--~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~v 307 (309) +...-++++- .++++.. . |++.+.... ....-..+...+|+...++-.+.-+.+-.+++=.- T Consensus 321 ~~~~i~~gd~~~~~~~~~~------~----~~~i~~~~~------~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~ 384 (397) T protein:vir:49 321 GAMPLYFGDLKQAVTLFDR------Q----HMSLLSTNI------GGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKA 384 (397) T ss_pred CceeEEEeeccceEEEEee------c----ceEEEEecc------ccchhhcCceeEEEEeeeCcEEecccceEEEEeec Confidence 2222232321 1111100 0 111111100 00111234455677777777777776666665333 Q ss_pred cC Q lcl|Aclame:pro 308 AA 309 (309) Q Consensus 308 a~ 309 (309) ++ T Consensus 385 ~~ 386 (397) T protein:vir:49 385 IA 386 (397) T ss_pred cc Confidence 33 No 82 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=89.06 E-value=0.029 Score=29.05 Aligned_cols=256 Identities=11% Similarity=0.030 Sum_probs=100.2 Q ss_pred CCCC-CCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccc-cccCcCccceee Q lcl|Aclame:pro 1 MSNA-PFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNE-VEFSATDETGST 78 (309) Q Consensus 1 m~~~-~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~-ve~~~~~~~~~~ 78 (309) .++. ..++..+.+.+-.. +...-|. .++..+++....+++++......-. ..++-++.... ....+...++.. T Consensus 161 ~~~~g~lvp~~~~~~i~~~-~~~~~l~-~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~e~~~~~e~~~~~~~~v~~~~ 235 (437) T protein:vir:10 161 LKDGKVIIPETILTPEKEV-HQFPRLG-SLVRTESVTTTTGKLPIFNNSTDLL---TAHTEYGQTTKNATPVITPILWDL 235 (437) T ss_pred cccccccchHHHHHHHHHh-hhhhhhh-hcceeEeeccCceeeEEeecccccc---ccccccccccccccccceeeeeeh Confidence 1111 13333333322221 1111111 1223445666667777653321111 11222222222 223344444444 Q ss_pred eccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCChHHHHHH Q lcl|Aclame:pro 79 EDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPLPVITD 158 (309) Q Consensus 79 ~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sdPi~di~~ 158 (309) +..+-..+++++-..+ +.+|....-...+.+.+....+.. ++++.. +.. .. ...++...+|.+ T Consensus 236 ~k~~~~~~is~ell~d--s~~~~~~~i~~~l~~~~~~~~~~~----i~~g~g--~~~-~~--------~~~~~~~~~~~~ 298 (437) T protein:vir:10 236 KTYTGGYVFSQELISD--SSYDWQAELQSRLIELRDNTDDSL----IITALT--DGI-KK--------TTSTYLLGDLKK 298 (437) T ss_pred hheeeehhhhHHHHhh--hHHHHHHHHHHHHHHHHHHHHHHH----Hhhhhc--ccc-cc--------cccccchhhHHH Confidence 3334344666654443 234444444455555555544433 222211 000 00 112233445555 Q ss_pred HHH-Hh--CCCCc-EEEeCHHHHHHHhcCHHHHHHhccCCCcccccCH----HHHHHHhCCCeEEeecceeeccccCCCc Q lcl|Aclame:pro 159 ALD-SV--ILRPN-IGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPM----AFLQELLELDAIYIGEARLNIARPGQNP 230 (309) Q Consensus 159 ~~~-~~--g~~Pn-~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~----~~l~~l~gl~~I~v~~a~~~~~~~g~~~ 230 (309) ++. .+ .+++| +.+|++++|.+|+. ++..++.+ +..+ ..-..|||.| |++.+..... .+..+ T Consensus 299 ~~~~~l~~~~~~~~~~~~~~~~~~~l~~-------lkd~~g~~-~~~~~~~~~~~~~l~G~p-v~~~~~~~~~--~~~~~ 367 (437) T protein:vir:10 299 VLNVTLKPQDSAAASIVMSQSAYNLFDM-------ATDAMGRP-LLQPNVTAATGYTLLGKT-VVIVDDKLFP--SASAG 367 (437) T ss_pred HHHhhhhhhhhcCCEEEEcHHHHHHHHH-------hhccCCCe-eeccCccCCCCcccccce-eEEecccccC--CcCCC Confidence 543 22 45565 57999999998754 22223222 1111 1123577877 4443322110 11111 Q ss_pred ccceecCCc--EEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhcccc Q lcl|Aclame:pro 231 NLIRAWGPH--ASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVA 308 (309) Q Consensus 231 ~~~~v~~~~--~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va 308 (309) +..-++++- .++++.. -|.+.++... .......+++....+-.++-+++..+++.-+. T Consensus 368 ~~~~~~gd~~~~~~~~~r----------~~~~~~~~~~----------~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~ 427 (437) T protein:vir:10 368 DVNIVVAPLKKAVINFKL----------TEITGQFQDT----------YDIWYKQLGIFLRQNVVQASKDLIVNLTGKLK 427 (437) T ss_pred ceEEEEeeccccEEEEee----------eceEEEEecc----------cccccceeeEEEEEccEEecccceEEEEeecc Confidence 111233321 1222111 1222222111 11112234555556666777777766664433 Q ss_pred C Q lcl|Aclame:pro 309 A 309 (309) Q Consensus 309 ~ 309 (309) + T Consensus 428 ~ 428 (437) T protein:vir:10 428 A 428 (437) T ss_pred c Confidence 3 No 83 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=88.94 E-value=0.029 Score=28.99 Aligned_cols=263 Identities=13% Similarity=0.045 Sum_probs=112.9 Q ss_pred CC-----CCC-CCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEe--chhHhhhchhHhhcccccccccc-cCc Q lcl|Aclame:pro 1 MS-----NAP-FPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKY--DLAQGFTVPETLVGRKSKPNEVE-FSA 71 (309) Q Consensus 1 m~-----~~~-f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~--~~~~~f~~~~t~~~~~~~~~~ve-~~~ 71 (309) |+ +.. .++....+.|...-++..-|. .+++.++|...+++++.+ ...-.+ .-++.++...... ..+ T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~-~~~~~~~~~~~~~~~~~~~~~~~~~a----~~v~E~~~~~~~~~~~~ 180 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALE-QYVTVEPVRTRSGSRVLEKNSDMIPF----AEITEMGEIPETDNPKF 180 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhh-hhceeeeccCCceeEEEEeecCCccc----eeecccccccccccccc Confidence 22 122 343333334432222222222 245666777666665443 221111 1233344443332 344 Q ss_pred CccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCC Q lcl|Aclame:pro 72 TDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSN 151 (309) Q Consensus 72 ~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sd 151 (309) .+.+...+..+-..+++++-..+ +.+|....-.+.+.+.|....+..+....-+ .. ..+.. T Consensus 181 ~~v~l~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~------~~-----------~~~~~ 241 (392) T protein:vir:10 181 SNVQYAVKDRAGILPLSRSLLQD--SDQNILKYVTKWLGKKSKVTRNVLILGVIEK------LT-----------KQAIK 241 (392) T ss_pred eeEEeeeeeEEEeehhhHHHHhh--hHHHHHHHHHHHHHHHHHHHHHHHHhhcccc------cc-----------ccCcc Confidence 55455554444445666665443 3355566666667777666555444322111 01 11222 Q ss_pred hHHHHHHHHHH-h--CCCCc-EEEeCHHHHHHHhcCHHHHHHhccCCCccccc----CHHHHHHHhCCCeEEeecceeec Q lcl|Aclame:pro 152 PLPVITDALDS-V--ILRPN-IGVLGRRTATILRRHPKIVKAYNGSLGDEGMV----PMAFLQELLELDAIYIGEARLNI 223 (309) Q Consensus 152 Pi~di~~~~~~-~--g~~Pn-~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~v----t~~~l~~l~gl~~I~v~~a~~~~ 223 (309) ...+|.+++.. + ..++| ..+|+++.|.+|+. + +..++.. +. +...-..+||++.|++-+.-... T Consensus 242 ~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~---l----kd~~G~~-l~~~~~~~~~~~tllG~~~v~~~~~~~~~ 313 (392) T protein:vir:10 242 SLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDK---L----KDKDGKY-ILQSDPTQKNKKLFAGTNPVVVVSNRFLK 313 (392) T ss_pred CHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHH---h----hccCCCe-EeecCccCCccccccCcccEEEecccccC Confidence 23455555432 2 45554 58999999998854 2 2222222 11 11112346788766653332211 Q ss_pred cccCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhh Q lcl|Aclame:pro 224 ARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFF 303 (309) Q Consensus 224 ~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~ 303 (309) .... ..+...+++.+-...-.-...-|.+.++... .+ ..-..+...+|+...++-.+.-+++-..+ T Consensus 314 ~~~~--------~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~-~~-----~~f~~~~~~~r~~~r~d~~v~~~~a~~~l 379 (392) T protein:vir:10 314 SKGT--------TAKKAPLIIGDLKEAIVLFKREDMELASTDV-GG-----KAFTRNTLDLRAIQRDDVQMWDNEAAVYG 379 (392) T ss_pred CCcc--------cCCceEEEEEehhceEEEEeecceEEEEecc-cc-----chhhcCceEEEEEEeeccEEecccceEEE Confidence 1111 1111212221100000000001112221110 00 00123444577777777777777777776 Q ss_pred hccccC Q lcl|Aclame:pro 304 ENAVAA 309 (309) Q Consensus 304 ~~~va~ 309 (309) +-..++ T Consensus 380 ~~~~~a 385 (392) T protein:vir:10 380 EIDLSA 385 (392) T ss_pred Eecccc Confidence 665555 No 84 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=88.94 E-value=0.029 Score=28.99 Aligned_cols=263 Identities=13% Similarity=0.045 Sum_probs=112.9 Q ss_pred CC-----CCC-CCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEe--chhHhhhchhHhhcccccccccc-cCc Q lcl|Aclame:pro 1 MS-----NAP-FPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKY--DLAQGFTVPETLVGRKSKPNEVE-FSA 71 (309) Q Consensus 1 m~-----~~~-f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~--~~~~~f~~~~t~~~~~~~~~~ve-~~~ 71 (309) |+ +.. .++....+.|...-++..-|. .+++.++|...+++++.+ ...-.+ .-++.++...... ..+ T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~-~~~~~~~~~~~~~~~~~~~~~~~~~a----~~v~E~~~~~~~~~~~~ 180 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALE-QYVTVEPVRTRSGSRVLEKNSDMIPF----AEITEMGEIPETDNPKF 180 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhh-hhceeeeccCCceeEEEEeecCCccc----eeecccccccccccccc Confidence 22 122 343333334432222222222 245666777666665443 221111 1233344443332 344 Q ss_pred CccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCC Q lcl|Aclame:pro 72 TDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSN 151 (309) Q Consensus 72 ~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sd 151 (309) .+.+...+..+-..+++++-..+ +.+|....-.+.+.+.|....+..+....-+ .. ..+.. T Consensus 181 ~~v~l~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~------~~-----------~~~~~ 241 (392) T protein:vir:10 181 SNVQYAVKDRAGILPLSRSLLQD--SDQNILKYVTKWLGKKSKVTRNVLILGVIEK------LT-----------KQAIK 241 (392) T ss_pred eeEEeeeeeEEEeehhhHHHHhh--hHHHHHHHHHHHHHHHHHHHHHHHHhhcccc------cc-----------ccCcc Confidence 55455554444445666665443 3355566666667777666555444322111 01 11222 Q ss_pred hHHHHHHHHHH-h--CCCCc-EEEeCHHHHHHHhcCHHHHHHhccCCCccccc----CHHHHHHHhCCCeEEeecceeec Q lcl|Aclame:pro 152 PLPVITDALDS-V--ILRPN-IGVLGRRTATILRRHPKIVKAYNGSLGDEGMV----PMAFLQELLELDAIYIGEARLNI 223 (309) Q Consensus 152 Pi~di~~~~~~-~--g~~Pn-~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~v----t~~~l~~l~gl~~I~v~~a~~~~ 223 (309) ...+|.+++.. + ..++| ..+|+++.|.+|+. + +..++.. +. +...-..+||++.|++-+.-... T Consensus 242 ~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~---l----kd~~G~~-l~~~~~~~~~~~tllG~~~v~~~~~~~~~ 313 (392) T protein:vir:10 242 SLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDK---L----KDKDGKY-ILQSDPTQKNKKLFAGTNPVVVVSNRFLK 313 (392) T ss_pred CHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHH---h----hccCCCe-EeecCccCCccccccCcccEEEecccccC Confidence 23455555432 2 45554 58999999998854 2 2222222 11 11112346788766653332211 Q ss_pred cccCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhh Q lcl|Aclame:pro 224 ARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFF 303 (309) Q Consensus 224 ~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~ 303 (309) .... ..+...+++.+-...-.-...-|.+.++... .+ ..-..+...+|+...++-.+.-+++-..+ T Consensus 314 ~~~~--------~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~-~~-----~~f~~~~~~~r~~~r~d~~v~~~~a~~~l 379 (392) T protein:vir:10 314 SKGT--------TAKKAPLIIGDLKEAIVLFKREDMELASTDV-GG-----KAFTRNTLDLRAIQRDDVQMWDNEAAVYG 379 (392) T ss_pred CCcc--------cCCceEEEEEehhceEEEEeecceEEEEecc-cc-----chhhcCceEEEEEEeeccEEecccceEEE Confidence 1111 1111212221100000000001112221110 00 00123444577777777777777777776 Q ss_pred hccccC Q lcl|Aclame:pro 304 ENAVAA 309 (309) Q Consensus 304 ~~~va~ 309 (309) +-..++ T Consensus 380 ~~~~~a 385 (392) T protein:vir:10 380 EIDLSA 385 (392) T ss_pred Eecccc Confidence 665555 No 85 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=88.94 E-value=0.029 Score=28.99 Aligned_cols=263 Identities=13% Similarity=0.045 Sum_probs=112.9 Q ss_pred CC-----CCC-CCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEe--chhHhhhchhHhhcccccccccc-cCc Q lcl|Aclame:pro 1 MS-----NAP-FPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKY--DLAQGFTVPETLVGRKSKPNEVE-FSA 71 (309) Q Consensus 1 m~-----~~~-f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~--~~~~~f~~~~t~~~~~~~~~~ve-~~~ 71 (309) |+ +.. .++....+.|...-++..-|. .+++.++|...+++++.+ ...-.+ .-++.++...... ..+ T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~-~~~~~~~~~~~~~~~~~~~~~~~~~a----~~v~E~~~~~~~~~~~~ 180 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALE-QYVTVEPVRTRSGSRVLEKNSDMIPF----AEITEMGEIPETDNPKF 180 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhh-hhceeeeccCCceeEEEEeecCCccc----eeecccccccccccccc Confidence 22 122 343333334432222222222 245666777666665443 221111 1233344443332 344 Q ss_pred CccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCC Q lcl|Aclame:pro 72 TDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSN 151 (309) Q Consensus 72 ~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sd 151 (309) .+.+...+..+-..+++++-..+ +.+|....-.+.+.+.|....+..+....-+ .. ..+.. T Consensus 181 ~~v~l~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~------~~-----------~~~~~ 241 (392) T protein:vir:10 181 SNVQYAVKDRAGILPLSRSLLQD--SDQNILKYVTKWLGKKSKVTRNVLILGVIEK------LT-----------KQAIK 241 (392) T ss_pred eeEEeeeeeEEEeehhhHHHHhh--hHHHHHHHHHHHHHHHHHHHHHHHHhhcccc------cc-----------ccCcc Confidence 55455554444445666665443 3355566666667777666555444322111 01 11222 Q ss_pred hHHHHHHHHHH-h--CCCCc-EEEeCHHHHHHHhcCHHHHHHhccCCCccccc----CHHHHHHHhCCCeEEeecceeec Q lcl|Aclame:pro 152 PLPVITDALDS-V--ILRPN-IGVLGRRTATILRRHPKIVKAYNGSLGDEGMV----PMAFLQELLELDAIYIGEARLNI 223 (309) Q Consensus 152 Pi~di~~~~~~-~--g~~Pn-~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~v----t~~~l~~l~gl~~I~v~~a~~~~ 223 (309) ...+|.+++.. + ..++| ..+|+++.|.+|+. + +..++.. +. +...-..+||++.|++-+.-... T Consensus 242 ~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~---l----kd~~G~~-l~~~~~~~~~~~tllG~~~v~~~~~~~~~ 313 (392) T protein:vir:10 242 SLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDK---L----KDKDGKY-ILQSDPTQKNKKLFAGTNPVVVVSNRFLK 313 (392) T ss_pred CHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHH---h----hccCCCe-EeecCccCCccccccCcccEEEecccccC Confidence 23455555432 2 45554 58999999998854 2 2222222 11 11112346788766653332211 Q ss_pred cccCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhh Q lcl|Aclame:pro 224 ARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFF 303 (309) Q Consensus 224 ~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~ 303 (309) .... ..+...+++.+-...-.-...-|.+.++... .+ ..-..+...+|+...++-.+.-+++-..+ T Consensus 314 ~~~~--------~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~-~~-----~~f~~~~~~~r~~~r~d~~v~~~~a~~~l 379 (392) T protein:vir:10 314 SKGT--------TAKKAPLIIGDLKEAIVLFKREDMELASTDV-GG-----KAFTRNTLDLRAIQRDDVQMWDNEAAVYG 379 (392) T ss_pred CCcc--------cCCceEEEEEehhceEEEEeecceEEEEecc-cc-----chhhcCceEEEEEEeeccEEecccceEEE Confidence 1111 1111212221100000000001112221110 00 00123444577777777777777777776 Q ss_pred hccccC Q lcl|Aclame:pro 304 ENAVAA 309 (309) Q Consensus 304 ~~~va~ 309 (309) +-..++ T Consensus 380 ~~~~~a 385 (392) T protein:vir:10 380 EIDLSA 385 (392) T ss_pred Eecccc Confidence 665555 No 86 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=88.94 E-value=0.029 Score=28.99 Aligned_cols=263 Identities=13% Similarity=0.045 Sum_probs=112.9 Q ss_pred CC-----CCC-CCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEe--chhHhhhchhHhhcccccccccc-cCc Q lcl|Aclame:pro 1 MS-----NAP-FPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKY--DLAQGFTVPETLVGRKSKPNEVE-FSA 71 (309) Q Consensus 1 m~-----~~~-f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~--~~~~~f~~~~t~~~~~~~~~~ve-~~~ 71 (309) |+ +.. .++....+.|...-++..-|. .+++.++|...+++++.+ ...-.+ .-++.++...... ..+ T Consensus 106 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~-~~~~~~~~~~~~~~~~~~~~~~~~~a----~~v~E~~~~~~~~~~~~ 180 (392) T protein:vir:10 106 MSGLTGEDGGLVIPQDIQTQINELARSFDALE-QYVTVEPVRTRSGSRVLEKNSDMIPF----AEITEMGEIPETDNPKF 180 (392) T ss_pred ccccccCCCceecchhHHHHHHHHHHhhhhhh-hhceeeeccCCceeEEEEeecCCccc----eeecccccccccccccc Confidence 22 122 343333334432222222222 245666777666665443 221111 1233344443332 344 Q ss_pred CccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCC Q lcl|Aclame:pro 72 TDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSN 151 (309) Q Consensus 72 ~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sd 151 (309) .+.+...+..+-..+++++-..+ +.+|....-.+.+.+.|....+..+....-+ .. ..+.. T Consensus 181 ~~v~l~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~------~~-----------~~~~~ 241 (392) T protein:vir:10 181 SNVQYAVKDRAGILPLSRSLLQD--SDQNILKYVTKWLGKKSKVTRNVLILGVIEK------LT-----------KQAIK 241 (392) T ss_pred eeEEeeeeeEEEeehhhHHHHhh--hHHHHHHHHHHHHHHHHHHHHHHHHhhcccc------cc-----------ccCcc Confidence 55455554444445666665443 3355566666667777666555444322111 01 11222 Q ss_pred hHHHHHHHHHH-h--CCCCc-EEEeCHHHHHHHhcCHHHHHHhccCCCccccc----CHHHHHHHhCCCeEEeecceeec Q lcl|Aclame:pro 152 PLPVITDALDS-V--ILRPN-IGVLGRRTATILRRHPKIVKAYNGSLGDEGMV----PMAFLQELLELDAIYIGEARLNI 223 (309) Q Consensus 152 Pi~di~~~~~~-~--g~~Pn-~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~v----t~~~l~~l~gl~~I~v~~a~~~~ 223 (309) ...+|.+++.. + ..++| ..+|+++.|.+|+. + +..++.. +. +...-..+||++.|++-+.-... T Consensus 242 ~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~---l----kd~~G~~-l~~~~~~~~~~~tllG~~~v~~~~~~~~~ 313 (392) T protein:vir:10 242 SLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDK---L----KDKDGKY-ILQSDPTQKNKKLFAGTNPVVVVSNRFLK 313 (392) T ss_pred CHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHH---h----hccCCCe-EeecCccCCccccccCcccEEEecccccC Confidence 23455555432 2 45554 58999999998854 2 2222222 11 11112346788766653332211 Q ss_pred cccCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhh Q lcl|Aclame:pro 224 ARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFF 303 (309) Q Consensus 224 ~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~ 303 (309) .... ..+...+++.+-...-.-...-|.+.++... .+ ..-..+...+|+...++-.+.-+++-..+ T Consensus 314 ~~~~--------~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~-~~-----~~f~~~~~~~r~~~r~d~~v~~~~a~~~l 379 (392) T protein:vir:10 314 SKGT--------TAKKAPLIIGDLKEAIVLFKREDMELASTDV-GG-----KAFTRNTLDLRAIQRDDVQMWDNEAAVYG 379 (392) T ss_pred CCcc--------cCCceEEEEEehhceEEEEeecceEEEEecc-cc-----chhhcCceEEEEEEeeccEEecccceEEE Confidence 1111 1111212221100000000001112221110 00 00123444577777777777777777776 Q ss_pred hccccC Q lcl|Aclame:pro 304 ENAVAA 309 (309) Q Consensus 304 ~~~va~ 309 (309) +-..++ T Consensus 380 ~~~~~a 385 (392) T protein:vir:10 380 EIDLSA 385 (392) T ss_pred Eecccc Confidence 665555 No 87 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=88.43 E-value=0.032 Score=28.75 Aligned_cols=260 Identities=10% Similarity=0.013 Sum_probs=114.3 Q ss_pred CCC-----CCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCccc Q lcl|Aclame:pro 1 MSN-----APFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDET 75 (309) Q Consensus 1 m~~-----~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~ 75 (309) ++. ...++...+..+....+...-|.+ +++.+|+.....+|+++.... ....-++.++.......++...+ T Consensus 114 ~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~-~~~~~~~~~~~~~~~~~~~~~---~~a~~v~Eg~~~~~~~~~~~~i~ 189 (390) T protein:vir:97 114 STDAAGSAGALTTPNRLPGFITPPDARLTVRD-LIGSGRTDSALIEYVQETGFV---NNAAIVAEGALKPESSLKFAKKT 189 (390) T ss_pred hcccccccccccchhhhHHHHHHHhhhhhhHh-hcceeeccCCceEEEEEecCC---cceeeecCCccccccccceeEEE Confidence 111 123444445555444444444444 467788887777888874321 11122344555555555666666 Q ss_pred eeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecc--cccccC--CCCCC Q lcl|Aclame:pro 76 GSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLS--GADQWS--DPTSN 151 (309) Q Consensus 76 ~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~ls--gt~~Ws--d~~sd 151 (309) +..+..+-..+++++-..+.. +....-.+.+.+.+....+. .++++..-...-...++ +...+. ..+.+ T Consensus 190 ~~~~k~~~~~~is~ell~ds~---~l~~~i~~~la~a~~~~~d~----a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~ 262 (390) T protein:vir:97 190 DTTHVIAHTMKATRQILSDAP---QLASYMNNRLIRGLKVKEDA----EILRGTGANDGLLGLIPQATTYAAPTTIAGAT 262 (390) T ss_pred EeeeeEEEeehhhHHHHHhHH---HHHHHHHHHHHHHHHHHHHH----HHhhcCCCCccccceeeccccccccccccccc Confidence 666666656677776544321 23333334455554444432 33333211111111111 111121 12345 Q ss_pred hHHHHHHHHHH---hCCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCH---HHHHHHhCCCeEEeecceeeccc Q lcl|Aclame:pro 152 PLPVITDALDS---VILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPM---AFLQELLELDAIYIGEARLNIAR 225 (309) Q Consensus 152 Pi~di~~~~~~---~g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~---~~l~~l~gl~~I~v~~a~~~~~~ 225 (309) .+.+|.+.+.. .+..++..+|+++.|.+|+. +++ .++.. +..+ ..-..++|+| |++.+.. T Consensus 263 ~~d~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~---lkd----~~G~~-l~~~~~~~~~~~l~G~p-V~~~~~~----- 328 (390) T protein:vir:97 263 RVDQLRLAMLQASLAEYPASGIVINPIDWAAIEL---AKD----ANNQY-LIGNARGTLTPTLWGLP-VVATQAM----- 328 (390) T ss_pred hHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHH---hhc----CCCce-eecCccCCCCceeccee-eEEcCCC----- Confidence 56666666544 36788999999999999874 332 22211 1111 0112467877 4433221 Q ss_pred cCCCcccceecCCc--EEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhh Q lcl|Aclame:pro 226 PGQNPNLIRAWGPH--ASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFF 303 (309) Q Consensus 226 ~g~~~~~~~v~~~~--~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~ 303 (309) +.. + -++++. .++++.. -|++.++.. +...-..+...+|+...++-.+.-+++-..+ T Consensus 329 ~~~--~--~~~gd~~~~~~~~~~----------~~~~i~~~~-------~~~~f~~~~~~~r~~~r~d~~v~~~~a~v~~ 387 (390) T protein:vir:97 329 APG--E--FLVGAFDLAAQIFDQ----------WDARVEIGY-------VNDDFQRNMVTVLAEERLALVVYRPEALITG 387 (390) T ss_pred CCC--c--EEEEeccceEEEEEe----------cceEEEEee-------cccccccCcEEEEEEEeeccEEeccccEEEE Confidence 100 0 111211 1111110 011111100 0011123444456666665555555444333 Q ss_pred hcc Q lcl|Aclame:pro 304 ENA 306 (309) Q Consensus 304 ~~~ 306 (309) +=+ T Consensus 388 ~~a 390 (390) T protein:vir:97 388 SFA 390 (390) T ss_pred EeC Confidence 322 No 88 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=88.32 E-value=0.022 Score=29.70 Aligned_cols=261 Identities=8% Similarity=0.050 Sum_probs=105.2 Q ss_pred CCCC--CCCcchhhHHHHH--hh--cchhhhhhhhCCccccccccceeEEechhHhhhchhHhhccccccccc--ccCcC Q lcl|Aclame:pro 1 MSNA--PFPIDPELTAIAI--AY--RNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEV--EFSAT 72 (309) Q Consensus 1 m~~~--~f~~dp~LT~~a~--~y--~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~v--e~~~~ 72 (309) |... .+.+ ..|+++.. =| ..+++.++.|||...+..-..+..+|...+..=. ...-+-+++...+ ...+. T Consensus 46 ~~~~~~~~i~-a~~~~~i~~~vy~~~~~~~~~~~l~pv~t~g~w~~~t~~y~~~e~~G~-a~~ygd~ad~Pl~~~~v~~~ 123 (339) T protein:vir:94 46 LQTTANAGIP-AWMTTFVDRRVIDIQLAPMAAAKIFPEVKKGDWTTTYGVFIIAEPVGQ-VATYSDWSANGMSKANVNFE 123 (339) T ss_pred cccccccchh-hhhhhhhchhheeecccccchhhhcccccCCCCcccEEEEeeeecccc-eEEcccccCCCcccccceee Confidence 1111 1211 11222221 12 3456899999998877654334444433211100 0111223332222 23444 Q ss_pred ccceeeeccchhhcCCHHHHHHH-hhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhccc----ccCccc---c-eeccccc Q lcl|Aclame:pro 73 DETGSTEDHGLDAPVPQADIDNA-PTNYNPLGHATEQTTNLILLDREARTSKLVFSPN----SYAAGN---K-TTLSGAD 143 (309) Q Consensus 73 ~~~~~~~e~~L~~~v~~~~~~~a-~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~----~y~~~~---~-~~lsgt~ 143 (309) .++....+-+.. +...|...| ..+++...+-.+..++.++ +..-+.++.+. .|+.-| - ...+++. T Consensus 124 ~~~v~~~~~g~~--y~~~E~~~A~~~g~~l~~~Ka~aA~~al~----~~~N~i~~~Gd~~~~~~GLlN~P~l~~~v~~s~ 197 (339) T protein:vir:94 124 SRQNYRYQTWTE--YGDLEMATYGEAGIDYVARQEISASLVMA----KFANSSYLLGVAGIANYGLMNDPSLPAPVAATV 197 (339) T ss_pred EEeEEEEEEEEe--ecHHHHHHHHhhCCChHHHHHHHHHHHHH----HhhceEEeeeecccceEEEEeCCCccccccCCC Confidence 555555555553 344555444 3456654433333333332 22222222221 121111 1 2235667 Q ss_pred ccCCCCCC-hHHHHHHHHHHh-----CC----CCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHh-CCC Q lcl|Aclame:pro 144 QWSDPTSN-PLPVITDALDSV-----IL----RPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELL-ELD 212 (309) Q Consensus 144 ~Wsd~~sd-Pi~di~~~~~~~-----g~----~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~-gl~ 212 (309) +|...+.+ .+.||.+...++ |. .|++++|.+..+..|.+- +. .+.-=.+.|++-+ +|. T Consensus 198 ~Wa~kT~~eI~~Di~~~~~~l~~~s~g~~~~~~~~~L~LP~~~~~~L~~~---------n~--~~~Tvl~~lk~n~pnl~ 266 (339) T protein:vir:94 198 NWATAAPEDIANDVVAMVGRLISQSGGLITGQERMVMALAPSALNNVNRT---------NN--FGLSAGAKIAQTYPNIQ 266 (339) T ss_pred CcccCCHHHHHHHHHHHHHHHHHhcCCeeeeccCcEEEecHHHHHhcccC---------Cc--CCccHHHHHHHhcCCcE Confidence 89887765 488999887665 32 467999999999988531 11 1211134566654 232 Q ss_pred eEEeecceeeccccCCCcccceecCCcEEEEecCCCCC-CcC--cc--eecccccccccccCCccccccccCCceEEEee Q lcl|Aclame:pro 213 AIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLAD-TRN--GT--TFGLTAQWGDRVSGSIADPNIGLRGGQRVRVG 287 (309) Q Consensus 213 ~I~v~~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~-~~~--~~--t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~ 287 (309) |+- -..+.++. + +...++...-.+. +.. -| ---...|. .+..+..++.+.-||..|+ T Consensus 267 -i~~-~~el~~a~--g---------~~~~~~~~~~~~~~~~~~~~p~~~~~lpvq~---~~~~~~v~~~~rt~Gv~i~-- 328 (339) T protein:vir:94 267 -FVA-VPEFDTAS--G---------RLVQLWVPEVNGQPTGEVAFAEKLRSHSIER---YSTTTRQKHSGATFGAVIY-- 328 (339) T ss_pred -EEE-ccccccCC--C---------ceEEEEEEeccCCcceEEEcchhhhccccEE---cCceEEecceeeeeeEEEE-- Confidence 221 11111111 1 1111221110000 000 00 00001121 1223333444444443333 Q ss_pred cccceeeecchhhh Q lcl|Aclame:pro 288 ESVKELVTAPDLGF 301 (309) Q Consensus 288 ~~~~~~v~~~~~G~ 301 (309) -|.-++.-.|. T Consensus 329 ---~P~ai~~~~GI 339 (339) T protein:vir:94 329 ---QPWAVTQELGV 339 (339) T ss_pred ---ccceeeeeecC Confidence 23333333333 No 89 >protein:vir:78090 Length: 302 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468790;genbank:gi:157325371;genbank:GeneID:5601852 Probab=87.95 E-value=0.035 Score=28.54 Aligned_cols=270 Identities=14% Similarity=0.054 Sum_probs=107.0 Q ss_pred CCCCCCCcchhhHHHHHhhcch---hhhhhhhCCcc-----ccccc---cceeEEechh----HhhhchhHhhccccccc Q lcl|Aclame:pro 1 MSNAPFPIDPELTAIAIAYRNG---RMISDEVLPRV-----PVGKQ---EFKFWKYDLA----QGFTVPETLVGRKSKPN 65 (309) Q Consensus 1 m~~~~f~~dp~LT~~a~~y~n~---~~ig~~lfP~v-----~v~~~---~~k~~~~~~~----~~f~~~~t~~~~~~~~~ 65 (309) |+|+ | ++|.-|.+. .|....++-.. .|.-. +.|+|++.-. ..+..++ | .+..+ T Consensus 1 Mant-------l-~ya~~~~~~Ld~~~~~~~~t~~l~~~~~~v~~~Gak~vkIp~is~~~~~TsGl~dy~--R--~~g~~ 68 (302) T protein:vir:78 1 MANS-------L-ALAQIYQDNIDKAIAVNSKSAFLEANPNNVQYNGGNTIKIADISFGSGTTGDLKAYN--R--STGFT 68 (302) T ss_pred CCch-------h-HHHHHHHHHHHHHHHhhhceeecccCCceEEEecCcEEEEEEEEeeccccccccccc--c--ccCcc Confidence 8865 2 444444431 23222222111 12222 3344444210 0111111 2 22223 Q ss_pred ccccCcCccceee-eccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHH-HHHHHHHhhcccccCcccceeccccc Q lcl|Aclame:pro 66 EVEFSATDETGST-EDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDR-EARTSKLVFSPNSYAAGNKTTLSGAD 143 (309) Q Consensus 66 ~ve~~~~~~~~~~-~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~-E~~~a~~~~~~~~y~~~~~~~lsgt~ 143 (309) ....+...+++.+ .|++....|+.-+.++.+....-....-+...+++-=.. -.++++++.....- ++....+.+ T Consensus 69 ~g~v~~~~et~tlt~DR~~~f~vD~mDvdETn~~~~~ani~~ef~r~~vvPEiDayrfskla~~a~~~--~~~~~~~~~- 145 (302) T protein:vir:78 69 QGSVTLAWSDYTLDYDLAQSFQIDAMDVDETKNLATVGNVLSEYQRTKIVPAIDKYRFTKLANDGTGV--GGVIDLSKP- 145 (302) T ss_pred ccceeeeeeeEEeeeccceeeeccccchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHHhhhcc--Ccccccccc- Confidence 3334444444443 344556666655555433221111111111111111111 12456665544321 121222211 Q ss_pred ccCCCCCChHHHHHHHHHHhCC-CCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceee Q lcl|Aclame:pro 144 QWSDPTSNPLPVITDALDSVIL-RPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLN 222 (309) Q Consensus 144 ~Wsd~~sdPi~di~~~~~~~g~-~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~ 222 (309) +....+.+.+|+++++.++- -+-+|.+++.++.+|++.+.+.+.+.......+.| ...+.++=|++-|.|-+++.. T Consensus 146 --~~t~~nvl~~i~~~~~~~~e~~~~vl~vtp~~~~~Lk~a~~~~~~~~~~~~~~~~i-~~~V~~lDgv~Ii~VPs~r~~ 222 (302) T protein:vir:78 146 --DASAQALMGDIATAMELVDDSNQLILVTSPTTLAGLLNTALIRESKNTQVLRRGEV-DTKITFIQDVEVLQVPSEYLY 222 (302) T ss_pred --chhHHHHHHHHHHHHHHhhccCCeEEEEChHHHHHHhcchhhccceeccccccccc-cceeeeecccEEEEchhhhcc Confidence 11335788899999887733 36779999999999999999988876654444444 233555557777777776665 Q ss_pred ccccCCCcccceecCCcEEEEecCCCCC---CcCcce-----------ecccccccccccCCccccccccCCceEEEeec Q lcl|Aclame:pro 223 IARPGQNPNLIRAWGPHASFIYRDRLAD---TRNGTT-----------FGLTAQWGDRVSGSIADPNIGLRGGQRVRVGE 288 (309) Q Consensus 223 ~~~~g~~~~~~~v~~~~~~L~~~~~~~~---~~~~~t-----------~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~ 288 (309) +...-.++-...-=+.++=++-+++.+. ...... -+|.++ .|...-.+...-. ..+.++-. T Consensus 223 t~~~f~~G~~~~~~ak~INfiiv~~~a~ia~~K~~~~~if~P~~~~~gd~~l~~--~R~Y~D~fV~~nk-~~gI~~~~-- 297 (302) T protein:vir:78 223 DKVAPKVGVPDYTGAKKIPYMIFKRDAPTGIVKTDKVRVFEPDTNQSADAYKVD--LRLYHDLIVPKNQ-RPGIIKAS-- 297 (302) T ss_pred cceeccCCccccCCccceeEEEECCCeeeeeeeeeeeEeeCCCCCCCcceeeee--eeeEeeeeeeccc-cCeEEEee-- Confidence 4321111100000011111111111111 111100 111111 1221111110000 01111111 Q ss_pred ccceeeecchhhhhhhcccc Q lcl|Aclame:pro 289 SVKELVTAPDLGFFFENAVA 308 (309) Q Consensus 289 ~~~~~v~~~~~G~l~~~~va 308 (309) ..+|| T Consensus 298 ---------------~~~~~ 302 (302) T protein:vir:78 298 ---------------FGTIA 302 (302) T ss_pred ---------------ccccC Confidence 12222 No 90 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=87.71 E-value=0.037 Score=28.44 Aligned_cols=263 Identities=10% Similarity=-0.023 Sum_probs=108.0 Q ss_pred CCCCCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhccccccccc-ccCcCccceeee Q lcl|Aclame:pro 1 MSNAPFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEV-EFSATDETGSTE 79 (309) Q Consensus 1 m~~~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~v-e~~~~~~~~~~~ 79 (309) .++.-+.+-+.+.+-.+........=-.++..+|+....++++.... ........-++.++..... ...+...++..+ T Consensus 121 ~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~-~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~ 199 (408) T protein:vir:10 121 DSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKW-TDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIK 199 (408) T ss_pred ccCCceeccHhHHHHHHHHHHhhchhhhhcceeeccCCcceEEEeec-cccccceeeecCccccccccCcceeeEEeeee Confidence 22222222222222222211111111223445566666666543311 1111112233444444332 234556666666 Q ss_pred ccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCChHHHHHHH Q lcl|Aclame:pro 80 DHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPLPVITDA 159 (309) Q Consensus 80 e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sdPi~di~~~ 159 (309) ..+-..+++++-..+ +.+|....-.+.+.+.+....+..+. ++.. . ..... +..-+.+|.+. T Consensus 200 k~~~~~~iS~ell~d--s~~~l~~~i~~~l~~~~~~~~~~~il----~g~g--~--~~~~~--------~~~~~~~l~~~ 261 (408) T protein:vir:10 200 RYAGIITATNTSLKD--TAENILAWLSSWIAKKVVVTRNQAII----EVMK--A--APKKP--------TIAKFDDVITM 261 (408) T ss_pred eEEeeehhHHHHHhh--chHHHHHHHHHHHHHHHHHHHHHHHh----hccc--c--ccccc--------ccccHHHHHHH Confidence 666556677665443 34555565566666666665554332 2211 0 00111 12223455554 Q ss_pred HHH-h--CCCCc-EEEeCHHHHHHHhcCHHHHHHhccCCCcccccCH----HHHHHHhCCCeEEeecceeeccccCCCcc Q lcl|Aclame:pro 160 LDS-V--ILRPN-IGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPM----AFLQELLELDAIYIGEARLNIARPGQNPN 231 (309) Q Consensus 160 ~~~-~--g~~Pn-~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~----~~l~~l~gl~~I~v~~a~~~~~~~g~~~~ 231 (309) +.. + ++++| ..+|+++.|..|+. ++ ..++.. +..+ ..-..+||.|-+++-+..... ..... T Consensus 262 ~~~~~~~~~~~~a~~v~n~~~~~~l~~---lk----d~~G~~-i~~~~~~~~~~~~l~G~PV~~~~~~~~~~---~~~~~ 330 (408) T protein:vir:10 262 INTAVDPAIIATSSLLTNQSGLNKLAL---VK----TAEGKY-LLEPDPTKPNSYLIKGKQVIVVADRWLPN---TGSTV 330 (408) T ss_pred HHHhhhhhhccCCEEEEcHHHHHHHHH---hh----ccCCce-EeccCcCCCCCceecceeeEEecccccCc---cCCCc Confidence 432 2 55555 58899999999875 22 222221 1111 112356787743322221111 01111 Q ss_pred cceecCCc--EEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhccccC Q lcl|Aclame:pro 232 LIRAWGPH--ASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVAA 309 (309) Q Consensus 232 ~~~v~~~~--~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va~ 309 (309) ..-++++- .++++. --|.+.++....... -..+...+|....++-++.-+++-.+++-+-++ T Consensus 331 ~~i~~gd~~~~~~~~~----------~~~~~v~~~~~~~~~------f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~~~~ 394 (408) T protein:vir:10 331 YPLYYGDMSQAITLFD----------RENMSLLPTNIGAGA------FETDTTKIRVIDRFDVKATDSEALVAGSFSAIA 394 (408) T ss_pred eEEEEEehhccEEEEE----------ecceEEEEcccccch------hhcCceEEEEEEeeccEEeccccEEEEEeeccc Confidence 11223321 111111 012222221111100 123444566666677677766666665533332 No 91 >protein:vir:78920 Length: 290 # NCBI annotation: Cps # Family: family:all:701 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468846;genbank:gi:157325479;genbank:GeneID:5601917 Probab=87.63 E-value=0.037 Score=28.40 Aligned_cols=268 Identities=13% Similarity=0.071 Sum_probs=104.0 Q ss_pred CCCCCCCcchhhHHHHHhhcchhhhhhhhCCccc-cccccceeEEechhHhhhchhHhhcccccccccccCcCccceee- Q lcl|Aclame:pro 1 MSNAPFPIDPELTAIAIAYRNGRMISDEVLPRVP-VGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDETGST- 78 (309) Q Consensus 1 m~~~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~-v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~~~~- 78 (309) |+-..+. -....+-..|...-+-|...-+.+. ....+.++|+.+- ..+..++ | .+..+.-+.+.+..++.+ T Consensus 1 Main~a~--~~~~~Ld~~~~~~~~t~~l~~~~~~~~ggktVkI~~i~~-~gl~DY~--R--~~g~~~g~v~~~~et~tl~ 73 (290) T protein:vir:78 1 MAINYVD--KYGKELDQKLVFGTYTNELETPNLLWLDAKTFKIQTITT-TGLKAHT--R--NKGYNEGSASNTNKSYTID 73 (290) T ss_pred CchhHHH--HHHHHHHHHHHhhheeeeccccceeeccCCEEEEeeecc-Ccccccc--c--CCCcccCccccceeeEEee Confidence 7754431 1222222333333333333333321 1122234444432 2222222 2 222222233333344443 Q ss_pred eccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHH-HHHHHHhhcccccCcccceecccccccCCCC-CChHHHH Q lcl|Aclame:pro 79 EDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDRE-ARTSKLVFSPNSYAAGNKTTLSGADQWSDPT-SNPLPVI 156 (309) Q Consensus 79 ~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E-~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~-sdPi~di 156 (309) .+++....|+.-+.++++...+......+...+.+.-... .+++.++..... .++.. +. ..+ .+.+.-| T Consensus 74 qdR~~~F~vD~~DvDEt~~~~~~~nv~~ef~~~~v~PEiDayr~skla~~a~~---~~~~~-~~-----t~t~~n~~~~i 144 (290) T protein:vir:78 74 FDRDVEFFVDVMDVDETGQALSAANVTKEFNSRHAGPEMDAYRFSKLATAAKT---NSNSV-AE-----EITKDNVFTKL 144 (290) T ss_pred ccccceeeccccchhHHhhhhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhhhc---cCccc-cc-----ccCHHHHHHHH Confidence 4445666676555555543333333333333333222222 234444444321 11111 10 122 3577777 Q ss_pred HHHHHH---hCCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeec-ceeeccccCCCccc Q lcl|Aclame:pro 157 TDALDS---VILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGE-ARLNIARPGQNPNL 232 (309) Q Consensus 157 ~~~~~~---~g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~-a~~~~~~~g~~~~~ 232 (309) .+.+.+ ++..+-.|++++.++.+|++++.+.+.+.......+.+ .-.+.++=|++-|.|-. .+..+...-.++-. T Consensus 145 ~~~~~~ldevp~~~rvl~vtp~~~~lL~~~~~f~r~~~~~~~~~~~i-~~~V~~idG~~ii~vps~~r~~t~~~f~~G~~ 223 (290) T protein:vir:78 145 KAAIRKVKKYGTQNLVMYVSPDVMAALELSDDFVRAINVQNIGPSSI-ETRITAIDGTRIVEVEAEDRFYDTFDFTDGYK 223 (290) T ss_pred HHHHHHHHhcCCCCeEEEECHHHHHHHhhChhhhccccccccccccc-cceeeeecCcEEEEecccchhhhhhhhccccc Confidence 777544 45556789999999999999999998876544334444 33556666777665543 23221110000000 Q ss_pred ceecCCcEEEEecCCCCC---CcCcc-----------eecccccccccccCCccccccccCCceEEEeeccc Q lcl|Aclame:pro 233 IRAWGPHASFIYRDRLAD---TRNGT-----------TFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESV 290 (309) Q Consensus 233 ~~v~~~~~~L~~~~~~~~---~~~~~-----------t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~ 290 (309) .---+.++=++-.++.+. ..... .-||.+++ |...-.+... ....-|.+.-.+ T Consensus 224 ~~~~ak~in~ii~~~~a~i~~~K~~~~~~~~P~~~~~~d~~~~~~--r~y~d~~v~~---nk~~~i~~~~~~ 290 (290) T protein:vir:78 224 PAAGAKKLNFLLVNKGSVVGGAKHASIYLHAPGSVGQGDGWLYQY--RVYHDIFVLD---QQKDGVIASTEV 290 (290) T ss_pred ccCCccceeEEEEcCCceeeeeeeeEEEeeCCCCCcCcceeeeee--eeeeeeeeec---cccCeeEEEeeC Confidence 000011111111111111 11110 12333322 2222111111 111111111111 No 92 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=87.48 E-value=0.038 Score=28.34 Aligned_cols=225 Identities=11% Similarity=0.037 Sum_probs=108.8 Q ss_pred cchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCccceeeeccchhhcCCHHHHHHHhhcC Q lcl|Aclame:pro 20 RNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDETGSTEDHGLDAPVPQADIDNAPTNY 99 (309) Q Consensus 20 ~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~~~~~e~~L~~~v~~~~~~~a~~~~ 99 (309) .|+-..|+.+ .|++ +.-+.+..+-+.....-+++.+..+..+++.+--..|.+++... ..- T Consensus 1 ~~~~~~Gdti--------------t~P~---~iGda~~v~eG~~i~~~~l~~t~~~atIk~~gk~~~itD~a~l~--~~g 61 (231) T protein:vir:73 1 ENGINLANLC--------------EYPN---DIGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALS--GYG 61 (231) T ss_pred CccccCCceE--------------Eecc---cccchhhhcCCCcCChhhccccceeeeEeeeccceeeeHHHHhh--ccC Confidence 2222333333 3332 12234566667766666788888888888877666666665433 355 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCChHHHHHHHHHHhC---CCCcEEEeCHHH Q lcl|Aclame:pro 100 NPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPLPVITDALDSVI---LRPNIGVLGRRT 176 (309) Q Consensus 100 d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sdPi~di~~~~~~~g---~~Pn~~v~~~~~ 176 (309) ||...+.+.+...|.......+...+. ++.|+-+++.-...|.++.+.+| -.|..++++++. T Consensus 62 Dp~~ea~~Q~~~~iA~kvD~di~~~~~---------------~a~l~~~~~~t~d~i~~A~~~fgde~~~~~vivv~p~~ 126 (231) T protein:vir:73 62 DPIGESNKQLGLSLANKVDDDLLKAAK---------------TTSQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKD 126 (231) T ss_pred chHHHHHHHHHHHHHHhhhHHHHHhhc---------------cccccccccccHHHHHHHHHHhccccccceEEEEcchH Confidence 888888887777665554443332221 12344556667888888888774 578999999999 Q ss_pred HHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeeccccCCCcccceecCCcEEEEecCCCCCCcCcce Q lcl|Aclame:pro 177 ATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTT 256 (309) Q Consensus 177 ~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t 256 (309) +..|+..+...+.- +....+++---.+..++|++ |++-+.. ..+.......++...++.++.- .++ T Consensus 127 ~~~Lrk~~~~~~~~--~~~g~~i~~~G~iG~i~G~~-Vi~S~~~----~~~~~~~~~~i~~~gAl~~~~k------~~~- 192 (231) T protein:vir:73 127 AAKIRKDANAKNIG--SEVGANALINGTYADVLGAQ-IVRSKKL----AEGSALMFKIVSNSPALKLVLK------RGV- 192 (231) T ss_pred HHhhhhccchhhhh--hhhccceeeecccceEcceE-EEEcCCC----CCCceeeeeEEeeccceeeeec------ccc- Confidence 99999877654432 11222344444666778875 4442111 1111111112222222211110 011 Q ss_pred ecccccccccccCCccccccccCCceEEEe---ecccceeeecchhhh Q lcl|Aclame:pro 257 FGLTAQWGDRVSGSIADPNIGLRGGQRVRV---GESVKELVTAPDLGF 301 (309) Q Consensus 257 ~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v---~~~~~~~v~~~~~G~ 301 (309) +.| .+|-.....+.-++ -+++-+ .++---+++-+ |. T Consensus 193 ---~vE-tdRd~~~k~~~i~~---~~~y~v~l~~~~~vv~~t~~--g~ 231 (231) T protein:vir:73 193 ---QVE-TDRDIVTKTTVITA---DEHYAAYLYDLTKVVNITFT--GV 231 (231) T ss_pred ---eee-ccccccccccEEEE---eEEEEEEEEcCccEEEEEee--cC Confidence 111 11111111111111 111111 11111112222 22 No 93 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=87.30 E-value=0.04 Score=28.27 Aligned_cols=266 Identities=15% Similarity=0.111 Sum_probs=107.4 Q ss_pred CCCCCCCcc--hh-----hHHHHHh-h--cchhhhhhhhCCcc---ccccccceeEEechhHhhhchhHhhccc-ccccc Q lcl|Aclame:pro 1 MSNAPFPID--PE-----LTAIAIA-Y--RNGRMISDEVLPRV---PVGKQEFKFWKYDLAQGFTVPETLVGRK-SKPNE 66 (309) Q Consensus 1 m~~~~f~~d--p~-----LT~~a~~-y--~n~~~ig~~lfP~v---~v~~~~~k~~~~~~~~~f~~~~t~~~~~-~~~~~ 66 (309) |+...+..+ +. |+.+-.. | .-+++.+.++||.. +-..+++.|.+++..-... ..+-+ ..... T Consensus 26 ~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~~~~~~~t~~~~~~~G~a~----~~~d~~~dip~ 101 (329) T protein:vir:79 26 LRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELSDTDKTFEYQTFDKVGHAK----IIADYTDDLST 101 (329) T ss_pred cccceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCCCceeEEEeeeeecceeee----eecCcccccce Confidence 211111111 11 2222111 2 23458889999954 3334445555554321000 01111 12223 Q ss_pred cccCcCccceeeeccchhhcCCHHHHHHHh-hcCCHHHHHHHHHHHHHHHHHHHHHHHHhhccc----ccCc---cccee Q lcl|Aclame:pro 67 VEFSATDETGSTEDHGLDAPVPQADIDNAP-TNYNPLGHATEQTTNLILLDREARTSKLVFSPN----SYAA---GNKTT 138 (309) Q Consensus 67 ve~~~~~~~~~~~e~~L~~~v~~~~~~~a~-~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~----~y~~---~~~~~ 138 (309) ++....++...+...+....+..+|...++ .+++...+......+.+...+ -++++.+. .|+- .+..+ T Consensus 102 vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~----n~i~f~G~~~~g~~GLlN~p~v~~ 177 (329) T protein:vir:79 102 VDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLV----NHLVFKGSKPHKIISVFEHPNLTT 177 (329) T ss_pred eecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhh----ccEEEeecccccceeeecCCCccc Confidence 444455545555555555556666666553 456655544443333332222 23344331 1111 11111 Q ss_pred c-cc---ccccCCCCCC-hHHHHHHHHHHh-----C-CCCcEEEeCHHHHHHHhc-CHHHHHHhccCCCcccccCHHHHH Q lcl|Aclame:pro 139 L-SG---ADQWSDPTSN-PLPVITDALDSV-----I-LRPNIGVLGRRTATILRR-HPKIVKAYNGSLGDEGMVPMAFLQ 206 (309) Q Consensus 139 l-sg---t~~Wsd~~sd-Pi~di~~~~~~~-----g-~~Pn~~v~~~~~~~~l~~-~~~i~~~~~~~~~~~~~vt~~~l~ 206 (309) . +| +..|...+.+ .+.||.+...++ | ..|++++|+++.+..|.+ ++ ++ +..-.+.|+ T Consensus 178 ~~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~Lpp~~~~~L~~~~~---------~~--~~tvl~~lk 246 (329) T protein:vir:79 178 INSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILIPPSMRKVLMVRMP---------ET--TMSYLDYFK 246 (329) T ss_pred cccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHHhhcccC---------CC--CccHHHHHH Confidence 1 11 2357666554 678898886655 3 469999999999988843 21 11 333355666 Q ss_pred HHhCCCeEEeecceeeccccCCCcccceecCCcEEEEecCCCCCC-cC---cceecccccccccccCCccccccccCCce Q lcl|Aclame:pro 207 ELLELDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLADT-RN---GTTFGLTAQWGDRVSGSIADPNIGLRGGQ 282 (309) Q Consensus 207 ~l~gl~~I~v~~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~-~~---~~t~G~T~~~~~~~~~~~~d~~~g~~g~~ 282 (309) +.+---+|+ .-..+..+ +.. +.+.+++|......- +. ..++-+ .| +....+..++++.-||. T Consensus 247 ~~~~~l~I~-~~~el~~a--g~~-------g~~~~v~y~~~~~~~~~~vp~~~~~l~-~q---~~~~~~~v~~~~r~~Gv 312 (329) T protein:vir:79 247 QQNGGITIE-SISELEDI--DGA-------GTKAALVYEKDPMNMSIEIPEAFNMLT-AQ---PKDLHFKVPCTSKCTGL 312 (329) T ss_pred HhCCCcEEE-Eccccccc--CCC-------CceEEEEEecCCceEEEecCcceeeee-ce---ecCceEEEceeeeEEEE Confidence 654211222 11112221 111 122233333211110 00 001100 11 22223333444444444 Q ss_pred EEEeecccceeeecchhhhhhhccccC Q lcl|Aclame:pro 283 RVRVGESVKELVTAPDLGFFFENAVAA 309 (309) Q Consensus 283 ~v~v~~~~~~~v~~~~~G~l~~~~va~ 309 (309) .|+. |.+-+.+.+.|-| T Consensus 313 ~i~~----------P~ai~~~dGI~~~ 329 (329) T protein:vir:79 313 TIYR----------PLTLVLIKGLVVG 329 (329) T ss_pred EEEC----------cceeeeeeeeeeC Confidence 3332 2222222233333 No 94 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=87.07 E-value=0.041 Score=28.18 Aligned_cols=289 Identities=12% Similarity=0.004 Sum_probs=106.9 Q ss_pred CC--------------C---CCCCcchhhHHHHH-hhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccc Q lcl|Aclame:pro 1 MS--------------N---APFPIDPELTAIAI-AYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKS 62 (309) Q Consensus 1 m~--------------~---~~f~~dp~LT~~a~-~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~ 62 (309) |+ . ..|++. +.....+ .+++ .++-..+.-+-+.....++...|.+.-.... ..+.+++ T Consensus 1 ~~~~~~~~~~~~~~~~~t~~~~fiPe-v~s~~v~~~l~~-~lv~~~l~~~~~~~~~~GdTV~ip~~g~~~a--~d~~~g~ 76 (381) T protein:vir:80 1 MATIQGTGGYKGSAVDLSNVQVFIPE-VWSSEVRMFRDQ-KFAALEATKKIPFEGKKGDLIHIPNISRAAV--YDKQPQT 76 (381) T ss_pred CceecccccccCcccchhhHHhhhhH-HHHHHHHHHHHH-hhhhhhccccccceeecCceEEeeccCccee--eeecCCC Confidence 32 1 135443 2222222 2221 2221122111111112222223322111111 1123333 Q ss_pred cccccccCcCccceeeeccc-hhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccC----cccce Q lcl|Aclame:pro 63 KPNEVEFSATDETGSTEDHG-LDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYA----AGNKT 137 (309) Q Consensus 63 ~~~~ve~~~~~~~~~~~e~~-L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~----~~~~~ 137 (309) ....-+....+..+.+.++- -...|+..| +.+..+|+....++.+...|....+..+...+....... ..... T Consensus 77 ~i~~~~~~~~~~~itID~~~~~~~~Idd~D--~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t~~~ 154 (381) T protein:vir:80 77 PVNLQARTDSEFTFTVTKYKESSFMIEDIV--NTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYSYDT 154 (381) T ss_pred cccccccCCceEEEEEeeeeecceeechHH--HHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccccc Confidence 33333444455556554432 234555444 556678999888888777776555544443332211111 11122 Q ss_pred ecccccccCCCC----CChHHHHHHHHHHhCC--CC---cEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHH Q lcl|Aclame:pro 138 TLSGADQWSDPT----SNPLPVITDALDSVIL--RP---NIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQEL 208 (309) Q Consensus 138 ~lsgt~~Wsd~~----sdPi~di~~~~~~~g~--~P---n~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l 208 (309) .+++.......+ ..-+..|.++++.+.. -| -.++++++.+..|+.++++.++-.++. ..+-.-++..+ T Consensus 155 ~i~~~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~~---~~l~~G~Ig~i 231 (381) T protein:vir:80 155 TLGDGTVNAHLTGTPAPLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQV---KPVTSGVVGTI 231 (381) T ss_pred cccccccccccccchhhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhccc---hhhhceeeeEE Confidence 222222222222 2245566666655521 24 379999999999999998887654321 12323345567 Q ss_pred hCCCeEEeecceeeccccCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCcc--cccc-ccCCceEEE Q lcl|Aclame:pro 209 LELDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIA--DPNI-GLRGGQRVR 285 (309) Q Consensus 209 ~gl~~I~v~~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~--d~~~-g~~g~~~v~ 285 (309) +|++-+. .... ..+........++-...+. +...+..+.+.|-+++ ..-+++. |-.. ..+.+..+- T Consensus 232 ~G~~Vv~-Sn~l----p~~~~t~~~~~agap~~~~--~~~~~~~~~g~~s~~a----~av~~~k~yd~~~~~~~~~~~~~ 300 (381) T protein:vir:80 232 LGMEVIV-TTQI----GINSLTGYVNGQGAPTQPT--PGVLGSPYLPDQAGTA----NVVNTGSASDLAVSLSYFGLPVF 300 (381) T ss_pred cceEEEe-eccc----ccccccceeeecccccccc--ccccccccccccccce----eeeeeeeeeceeeeeeeccceee Confidence 7776332 2111 0000000000000000000 0000111111111111 0101110 0000 001111111 Q ss_pred eecccceeeecchhhhh--hhccccC Q lcl|Aclame:pro 286 VGESVKELVTAPDLGFF--FENAVAA 309 (309) Q Consensus 286 v~~~~~~~v~~~~~G~l--~~~~va~ 309 (309) .+...+.-.-.+.+|-+ ++.-++| T Consensus 301 ~g~~~~~~~~~~~~~~~~~~~~~~~~ 326 (381) T protein:vir:80 301 SGAGATAADGGQTLGSFGGANRWATA 326 (381) T ss_pred ecceeeecCCCceeeeehhhhhhhhh Confidence 22222222333334432 2222233 No 95 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=86.37 E-value=0.046 Score=27.91 Aligned_cols=282 Identities=9% Similarity=0.000 Sum_probs=105.4 Q ss_pred CCCCCCCcchhhHHHHHh---------------hcchhhhhh-hhCCccccc----cccceeEEechhHhhhchhHhhcc Q lcl|Aclame:pro 1 MSNAPFPIDPELTAIAIA---------------YRNGRMISD-EVLPRVPVG----KQEFKFWKYDLAQGFTVPETLVGR 60 (309) Q Consensus 1 m~~~~f~~dp~LT~~a~~---------------y~n~~~ig~-~lfP~v~v~----~~~~k~~~~~~~~~f~~~~t~~~~ 60 (309) |++-. ...||.-+.+ ....+|-.. .+.+.+.+. ..+.+++..++..+ .-+.+ T Consensus 1 m~~~~---~~~~t~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~s~~~~~iG~~~~-----~~~~~ 72 (334) T protein:vir:80 1 MTYPA---ANTHTRPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTNQLRVDRVGASTI-----AGRKA 72 (334) T ss_pred CCCCc---CCCccccccccccchheehhhhhhhHHHHHHHHhhhhhccceeeeccccceEEEeeecceee-----eeecC Confidence 88641 0112221111 111222222 223333322 11122333333211 11223 Q ss_pred cccccccccCcCccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccc---------- Q lcl|Aclame:pro 61 KSKPNEVEFSATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNS---------- 130 (309) Q Consensus 61 ~~~~~~ve~~~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~---------- 130 (309) +.....-.....+..+.+.+ .|-....-.++++++..+|.+....+..-..+...-...+...+..+.. T Consensus 73 g~~l~~~~~~~~~~~l~ID~-~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~ 151 (334) T protein:vir:80 73 GEELVVQKNVSDKLNLTVDT-VLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPA 151 (334) T ss_pred CCCCCCCCcccCceEEEEee-eeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc Confidence 33322222223333433322 2333334455667777888887777666655555444444332222111 Q ss_pred c--CcccceecccccccCCCCCChHHHHHHHH---HHhC--------CCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcc Q lcl|Aclame:pro 131 Y--AAGNKTTLSGADQWSDPTSNPLPVITDAL---DSVI--------LRPNIGVLGRRTATILRRHPKIVKAYNGSLGDE 197 (309) Q Consensus 131 y--~~~~~~~lsgt~~Wsd~~sdPi~di~~~~---~~~g--------~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~ 197 (309) + +......++|+.. +...+|-.-+.++. +.+. ..+-.++++++.|.+|+.|+++.++-++...+. T Consensus 152 ~~~G~~~~~~~~g~~~--~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~~~ 229 (334) T protein:vir:80 152 FHDGILLPSTISGLAA--DAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKEGG 229 (334) T ss_pred ccCCcceeeccccccc--chhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceecccccc Confidence 1 1122344455442 44556655555543 3331 223689999999999999999998865543322 Q ss_pred cccCHHHHHHHhCCCeEEeecceeecc---c-cCCCcccce-ecCCcEEEEecCCCCCCcCcceecccc--cccccccCC Q lcl|Aclame:pro 198 GMVPMAFLQELLELDAIYIGEARLNIA---R-PGQNPNLIR-AWGPHASFIYRDRLADTRNGTTFGLTA--QWGDRVSGS 270 (309) Q Consensus 198 ~~vt~~~l~~l~gl~~I~v~~a~~~~~---~-~g~~~~~~~-v~~~~~~L~~~~~~~~~~~~~t~G~T~--~~~~~~~~~ 270 (309) ..+....+..+.|++-+.. ...=... . .|....... =+...+.|++...--.+.. .-..|- .+..+..++ T Consensus 230 ~~~~~g~i~~v~G~~V~~S-n~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~--~~~~~~e~~~~~~~~~d 306 (334) T protein:vir:80 230 NSFVGGRIAMLNGVRVVET-PRFPQSAITANALGADFNVTDAEVRRKMITFIPSMALISAQ--VHPVSAQFWEEKKDFGH 306 (334) T ss_pred ccccceeEEEEeceEEEee-cCCCCccccccccccccccccccccceEEEEEeCceEEEEE--EeecceeeeechhhHHH Confidence 2233445666677652221 1100000 0 000000000 0011222332221000000 001110 011111222 Q ss_pred ccccccccCCceEEEee--cccceeeecc Q lcl|Aclame:pro 271 IADPNIGLRGGQRVRVG--ESVKELVTAP 297 (309) Q Consensus 271 ~~d~~~g~~g~~~v~v~--~~~~~~v~~~ 297 (309) +.+.. -.-|...+|=. -.++-.++.| T Consensus 307 ~i~~~-~a~G~g~lRPeaa~vv~~~~~~~ 334 (334) T protein:vir:80 307 YLDTF-QSYNIGQRRPDAVAVHDITVTNP 334 (334) T ss_pred HHHHH-HHcCCceeccceEEEEEEeeecC Confidence 22211 11122111111 1111112222 No 96 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=85.61 E-value=0.052 Score=27.65 Aligned_cols=265 Identities=10% Similarity=-0.000 Sum_probs=117.1 Q ss_pred CCC--CCCCcchhhHHHHHhhcchhhhhhhhCCccccccc-cceeEEechhHhhhchhHhhcccccccccccCcCcccee Q lcl|Aclame:pro 1 MSN--APFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQ-EFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDETGS 77 (309) Q Consensus 1 m~~--~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~-~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~~~ 77 (309) ++. ...++.++...+-.-.....-+-..+...+++... ..++|+...... ..-++.++........+....+. T Consensus 114 t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~----a~~v~E~~~~~~~~~~f~~v~~~ 189 (392) T protein:vir:13 114 TKAGNPNVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTVITGRAT----AGIVGETAEIPESYPATTQRSMG 189 (392) T ss_pred cccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcc----eeeecccccccccccceeeEEee Confidence 111 12333333333222211111121222223333322 234555433211 11234455555555566666666 Q ss_pred eeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhccc--ccCcc--ccee-cccccccCCCCCCh Q lcl|Aclame:pro 78 TEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPN--SYAAG--NKTT-LSGADQWSDPTSNP 152 (309) Q Consensus 78 ~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~--~y~~~--~~~~-lsgt~~Wsd~~sdP 152 (309) .+..+-..+++++-..+ +.+|....-.+.+.+.|....+.. ++++. +-|.+ +..+ .+....|..++.-. T Consensus 190 ~~k~~~~~~iS~ell~d--s~~~l~~~i~~~l~~~i~~~~d~~----~l~G~Gt~~p~Gil~~~~~~~~~~~~~~~~~~~ 263 (392) T protein:vir:13 190 GFKYGFASVVSYEFATD--QVLDLVGFLVSDAGPAIGDAMGRH----FLTGTGTGQPRGILTDATGANAAFGEADADSKV 263 (392) T ss_pred eeeEEeeehhHHHHHhc--chHHHHHHHHHHHHHHHHHHHHHH----HhcccCCcccccccccccccccccccccccccc Confidence 66666566677665553 334445555556666665544433 33221 11111 0000 01111233344445 Q ss_pred HHHHHHHHHHh--CC-CCcEEEeCHHHHHHHhcCHHHHHHhccCCCccc---ccCHHHHHHHhCCCeEEeecceeecccc Q lcl|Aclame:pro 153 LPVITDALDSV--IL-RPNIGVLGRRTATILRRHPKIVKAYNGSLGDEG---MVPMAFLQELLELDAIYIGEARLNIARP 226 (309) Q Consensus 153 i~di~~~~~~~--g~-~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~---~vt~~~l~~l~gl~~I~v~~a~~~~~~~ 226 (309) ..+|.+....+ .+ .+...+|++..+.+|+. + +..++..- .++...-..|+|.| |++-+.. T Consensus 264 ~d~l~~~~~~l~~~~~~~a~~v~n~~~~~~l~~---l----kd~~G~~l~~~~~~~g~~~~l~G~P-v~~~~~~------ 329 (392) T protein:vir:13 264 SDALIDLFHEVPSAYRKNAKFVVNDLRAAQMRK---L----KDANGQYLWQSALTVGAPDTFNGKV-VETDDGM------ 329 (392) T ss_pred HHHHHHHHHhhhhhhhcCCEEEEcHHHHHHHHH---h----hccCCceeecCCcCCCCCceeccee-eEEcCCC------ Confidence 66666655544 23 34568999999988763 2 22222210 01111123467776 3332111 Q ss_pred CCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhcc Q lcl|Aclame:pro 227 GQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENA 306 (309) Q Consensus 227 g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~ 306 (309) |.+.+++ .+ ... | ....+++..-.... +.+-..+...+|.....+-+++-+++..+++-. T Consensus 330 ----------~~~~i~~-Gd-----f~~--~-~i~~~~~~~i~~~~-~~~~~~~~~~~r~~~r~d~~~~~~~A~~~~~~~ 389 (392) T protein:vir:13 330 ----------PADKVLF-AD-----LSK--Y-RVRFAGSLRVDRSV-DAKFSTDQIVYRFLQRADGLLVDARGAKVLTVT 389 (392) T ss_pred ----------CCCcEEE-ee-----ccc--e-eEEeecceEEEeec-cccccCCcEEEEEEEEeccEEecccceEEEEee Confidence 1111221 11 000 0 01111111111111 222345666789999999999999998888888 Q ss_pred ccC Q lcl|Aclame:pro 307 VAA 309 (309) Q Consensus 307 va~ 309 (309) .|| T Consensus 390 ~aa 392 (392) T protein:vir:13 390 PAA 392 (392) T ss_pred ccC Confidence 888 No 97 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=85.38 E-value=0.053 Score=27.57 Aligned_cols=270 Identities=9% Similarity=-0.008 Sum_probs=115.4 Q ss_pred CC-----CCCC-CcchhhHHHHHhhcchhhhhhhhCCcccccccccee--EEechhHhhhchhHhhcccccccc--cccC Q lcl|Aclame:pro 1 MS-----NAPF-PIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKF--WKYDLAQGFTVPETLVGRKSKPNE--VEFS 70 (309) Q Consensus 1 m~-----~~~f-~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~--~~~~~~~~f~~~~t~~~~~~~~~~--ve~~ 70 (309) |+ +.-+ ++..+.+.|...-++..-|. .+++.+||...++++ ++......+. -++.++.... .... T Consensus 110 ~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~-~l~~~~~~~~~~g~~~~~~~~~~~~~~----~v~e~~~~~~~~~~~~ 184 (404) T protein:vir:10 110 ISENIDEDGGYAVPEDIQTKINTRLKDTTDLY-NMVDYEPVFTRSGSRTYEKRSKQKPMK----PLSENQQIPTNGDNGK 184 (404) T ss_pred hccccCCCCceeechhHHHHHHHHHhhhhhHh-hhhceeeccCCccceEEEEecCCccee----eccccccccccccccc Confidence 22 1112 22333333322212211122 345667777666553 4332211111 1222222211 1233 Q ss_pred cCccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceec--ccccccCCC Q lcl|Aclame:pro 71 ATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTL--SGADQWSDP 148 (309) Q Consensus 71 ~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~l--sgt~~Wsd~ 148 (309) +...+...+..+-..+++++-..++ .++......+.+.+.+....|..+ +++..-.......+ .+....... T Consensus 185 f~~i~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~la~~~~~~~~~~i----l~G~g~~~~~~gi~~~~~~~~~~~~ 258 (404) T protein:vir:10 185 LERFNFKLKDLADFMSIPNDLLKFA--DKSLEDWIINWFVDKVRITRNAEI----LYGAGGDEHATGIMTANKFKKITLP 258 (404) T ss_pred eeeeEeeheeeEeeehhhHHHHhhc--HHHHHHHHHHHHHHHHHHHHHHHH----hhcCCCCCcccceeeccccceeecc Confidence 4444444444444456666554432 234455555556666655554433 33221111111111 111223334 Q ss_pred CCChHHHHHHHHHH-h--CCCCc-EEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHH----HHHHHhCCCeEEeecce Q lcl|Aclame:pro 149 TSNPLPVITDALDS-V--ILRPN-IGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMA----FLQELLELDAIYIGEAR 220 (309) Q Consensus 149 ~sdPi~di~~~~~~-~--g~~Pn-~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~----~l~~l~gl~~I~v~~a~ 220 (309) +...+.++...+.. + ++.+| ..+|+++.|.+|+. ++..+++. ++.+. .-..++|.| |++.... T Consensus 259 ~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~-------lkd~~G~~-l~~~~~~~~~~~~l~G~P-V~~~~~~ 329 (404) T protein:vir:10 259 KSPALKDFKKCKNVELLNVFKATSSWIVNQDGFNYLDS-------LEDKTGRP-YLQPDPKDPTQYRFLGLP-VIELPND 329 (404) T ss_pred ccccHHHHHHHHHhhhhccccCCCEEEEcHHHHHHHHH-------hhccCCce-eeccCcCCCCCcccccee-eEEeccc Confidence 55667788777653 2 66665 46999999998765 22223222 22111 112456766 3322111 Q ss_pred eeccccCCCcccceecCCc--EEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecch Q lcl|Aclame:pro 221 LNIARPGQNPNLIRAWGPH--ASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPD 298 (309) Q Consensus 221 ~~~~~~g~~~~~~~v~~~~--~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~ 298 (309) .- .+......-++++. .++++. .-|++.+..... ...-..+...+|+...++-.+.-++ T Consensus 330 ~~---~~~~~~~~~~~gd~s~~~~~~~----------~~~~~i~~~~~~------~~~~~~~~~~~~~~~r~d~~v~~~~ 390 (404) T protein:vir:10 330 LL---LSTESAIPVLLGDTKEAYKYVS----------DGAYELATTNIG------AGAFETNTTKARIIMRIDGNVKDSE 390 (404) T ss_pred cc---CCCCCccEEEEEeccccEEEEE----------ecceEEEEeccc------cchhhcCceEEEEEEeeccEEeccc Confidence 10 01111111122211 011100 001222111100 0101245567888888898999998 Q ss_pred hhhhhhccccC Q lcl|Aclame:pro 299 LGFFFENAVAA 309 (309) Q Consensus 299 ~G~l~~~~va~ 309 (309) +-..++=+.|+ T Consensus 391 a~~~~~~~~aa 401 (404) T protein:vir:10 391 ALLIAEIPVES 401 (404) T ss_pred ceEEEEeeccc Confidence 88888888888 No 98 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=84.58 E-value=0.059 Score=27.31 Aligned_cols=260 Identities=10% Similarity=0.009 Sum_probs=119.0 Q ss_pred CCC-----CCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCccc Q lcl|Aclame:pro 1 MSN-----APFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDET 75 (309) Q Consensus 1 m~~-----~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~ 75 (309) ++. ...++-..+..+-.--+...-|.. +++.+|+.....+|+++.... ....-++.++.......++.+.+ T Consensus 114 ~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~-~~~~~~~~~~~~~~~~~~~~~---~~a~~v~Eg~~~~~~~~~~~~i~ 189 (390) T protein:vir:10 114 STDAAGSAGALTTPNRLPGFITQPDARLTVRD-LIGSGRTDSALIEYVQETGFV---NNAAIVAEGALKPESSLKFAKKT 189 (390) T ss_pred hcccccccccccchhHHHHHHHHHHhhchhhh-hcceeeccCCceEEEEEecCC---cceeeecCCccccccccceeEEE Confidence 111 112222233333322222222333 467788887778888875421 11122445555555666677777 Q ss_pred eeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceec--ccccccC--CCCCC Q lcl|Aclame:pro 76 GSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTL--SGADQWS--DPTSN 151 (309) Q Consensus 76 ~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~l--sgt~~Ws--d~~sd 151 (309) +..+..+...+++++-.++.. +......+.+.+.+....+. .++++..-...-...+ ++...+. ..+.+ T Consensus 190 ~~~~k~~~~~~is~ell~d~~---~l~~~i~~~l~~~~~~~~~~----~il~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~ 262 (390) T protein:vir:10 190 DTTHVIAHTMKATRQILSDAP---QLASYMNNRLIRGLKVKEDA----EILRGTGANDGLLGLIPQATTYAAPTTIAGAT 262 (390) T ss_pred EeeEEEEEeehhhHHHHHhHH---HHHHHHHHHHHHHHHHHHHH----HHhhcCCCCccccccccccccccccccccccc Confidence 777777777778877554432 33444444555555444443 3333221001011111 1111121 23456 Q ss_pred hHHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHH---HHHHHhCCCeEEeecceeeccc Q lcl|Aclame:pro 152 PLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMA---FLQELLELDAIYIGEARLNIAR 225 (309) Q Consensus 152 Pi~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~---~l~~l~gl~~I~v~~a~~~~~~ 225 (309) ++.+|.+++..+ +..++.++|+++.|.+|+. ++ ..++.. +..+. .-..++|+| |++.+.. T Consensus 263 ~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~---lk----d~~g~~-l~~~~~~~~~~~l~G~p-v~~~~~~----- 328 (390) T protein:vir:10 263 RVDQLRLAMLQASLAEYPASGIVINPIDWAAIEL---AK----DANNQY-LIGNARGTLTPTLWGLP-VVATQAM----- 328 (390) T ss_pred hHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHH---hh----cCCCce-eecCCcCcCCceeccee-eEEcCCC----- Confidence 777777776554 6778899999999988773 22 222211 11100 012357776 3332211 Q ss_pred cCCCcccceecCCc--EEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhh Q lcl|Aclame:pro 226 PGQNPNLIRAWGPH--ASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFF 303 (309) Q Consensus 226 ~g~~~~~~~v~~~~--~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~ 303 (309) +.. .-++++. .++++. .-|.+.++.. +..+-..+...+|+...++-.+.-+.+-..+ T Consensus 329 p~~----~~~~gdf~~~~~~~~----------~~~~~i~~~~-------~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~ 387 (390) T protein:vir:10 329 APG----EFLVGAFDLAAQIFD----------QWDARVEIGY-------VNDDFQRNMVTVLAEERLALVVYRPEALISG 387 (390) T ss_pred CCC----cEEEEeccceEEEEE----------ecceEEEEee-------cccccccCcEEEEEEEeeccEEeccccEEEE Confidence 100 0111211 111110 0011111110 0011123445677777777777777665555 Q ss_pred hcc Q lcl|Aclame:pro 304 ENA 306 (309) Q Consensus 304 ~~~ 306 (309) +=| T Consensus 388 ~~a 390 (390) T protein:vir:10 388 SFA 390 (390) T ss_pred EeC Confidence 433 No 99 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=84.44 E-value=0.06 Score=27.26 Aligned_cols=261 Identities=13% Similarity=0.100 Sum_probs=109.8 Q ss_pred CCCCCCCcchhhHHHHH-hhcchhhhhhhhCCcc--ccccc--cceeEEechhHhhhchhHhhcccccccccccCcCccc Q lcl|Aclame:pro 1 MSNAPFPIDPELTAIAI-AYRNGRMISDEVLPRV--PVGKQ--EFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDET 75 (309) Q Consensus 1 m~~~~f~~dp~LT~~a~-~y~n~~~ig~~lfP~v--~v~~~--~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~ 75 (309) ||+..|.+.- .++.++ .+++.-.++. |..+- ...+. +.++|+.+. ....+ -.+.++....-++...+.+ T Consensus 1 MA~~~~~pei-~~~~v~~~~~~~lv~~~-l~~~~~~~~~~~GdTv~ip~~~~---~~~~d-~~~~~~~~~~~~~~~~~~~ 74 (273) T protein:vir:79 1 MAFNNFIPEL-WSDMLLEEWTAQTVFAN-LVNREYEGIASKGNVVHIAGVVA---PTVKD-YKAAGRQTSADAISDTGVD 74 (273) T ss_pred CcchhhhHHH-HHHHHHHHHHhhccchh-hhhccccccccCCcEEEEeecCc---ccccc-cccCCCccCccccccceEE Confidence 9998887664 444443 4444322222 22221 11222 233344332 22111 1222333333345555566 Q ss_pred eeeecc-chhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCChHH Q lcl|Aclame:pro 76 GSTEDH-GLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPLP 154 (309) Q Consensus 76 ~~~~e~-~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sdPi~ 154 (309) +.+..+ .....|+..|.. +..+|.+. .++.....|....+..++..+..... +. +++. .. ..++.+. T Consensus 75 ~tid~~~~~~~~i~d~d~~--~~~~~~~~-~~~~~~~ala~~vD~~i~~~~~~a~~-----~~--~~~~-~~-~~~~~~~ 142 (273) T protein:vir:79 75 LLIDQEKSIDFLVDDIDRV--QVAGSLEA-YTRAGATALATDTDKFIADMLVDNGT-----AL--TGSA-PS-DADDAFD 142 (273) T ss_pred EEEeeecccceeeccHHHH--hhcccHHH-HHHHHHHHHHHHHHHHHHHHHhhccc-----cc--cccc-cc-chhhHHH Confidence 666543 445566655543 44555443 44444444544444444444433221 11 1110 00 1123445 Q ss_pred HHHHHHHHh---CCCCc---EEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeeccccCC Q lcl|Aclame:pro 155 VITDALDSV---ILRPN---IGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPGQ 228 (309) Q Consensus 155 di~~~~~~~---g~~Pn---~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~~~~g~ 228 (309) .|.++..++ ++ |. .++++++.+..|+..+.........+. .+.+-.-++..++|++ |+..... ..+. T Consensus 143 ~i~~a~~~ld~~~v-P~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~-~~~l~~G~ig~~~G~~-i~~s~~l----p~~~ 215 (273) T protein:vir:79 143 LIASALKELTKANV-PNVGRVVVVNAEMAFWLRSSGSKLTSADTSGD-AAGLRAGTIGNLLGAR-IVESNNL----RDTD 215 (273) T ss_pred HHHHHHHHhhhccC-CccCcEEEECHHHHHHHhhchhhhhhhhhccc-ccceeeeEeeEEeceE-EEecccc----cccC Confidence 555555444 33 54 799999999999998863333332222 1222233455667765 3321111 0010 Q ss_pred CcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhcccc Q lcl|Aclame:pro 229 NPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVA 308 (309) Q Consensus 229 ~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va 308 (309) +..++.+... .+++--+.. ..+.......-+..|+..+.+-..++-|++=..|...-+ T Consensus 216 ---------~~~~~a~~~~--------A~~~a~~~~-----~~e~~r~~~~~~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 216 ---------DEQFVAFHPS--------AAAYVSQID-----TVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred ---------ceEEEEEecc--------ceeeeeehh-----hhhcccCcccceeeeeeeeeeeeEEecCceEEEEeccCC Confidence 0011222110 111100100 000000011124456666666666666664444443333 No 100 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=83.82 E-value=0.065 Score=27.08 Aligned_cols=265 Identities=10% Similarity=0.031 Sum_probs=103.7 Q ss_pred CCC--CCC---CcchhhHHHHHhhcc--------hhhhhhhhCCccccccccceeEEechhHhhhchhHhhccccccccc Q lcl|Aclame:pro 1 MSN--APF---PIDPELTAIAIAYRN--------GRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEV 67 (309) Q Consensus 1 m~~--~~f---~~dp~LT~~a~~y~n--------~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~v 67 (309) .|+ .|- +.++-.-++...|-. +.+.++.|||...++.=..++.+|.-.+. .-....-+-.++...+ T Consensus 34 da~d~~~~~~~~~~~~~~~~l~~~i~p~~~~~~~~~~~~~~l~pv~t~g~W~~~~~~~~~~e~-~G~a~~ygd~~D~P~~ 112 (336) T protein:vir:36 34 DAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTKVATYGDYSSDGDS 112 (336) T ss_pred hhhhccCccccCCCcchHHHHHHhhccceEeeecchhhhhhhccccccCCccceeEEEeeeec-eeeEEEeeccCCCcee Confidence 011 110 112222233334443 45789999997665433233444432110 0000111223333344 Q ss_pred ccCcCccceeeeccchhhcCCHHHHHHH-hhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcc----ccc-----Ccccce Q lcl|Aclame:pro 68 EFSATDETGSTEDHGLDAPVPQADIDNA-PTNYNPLGHATEQTTNLILLDREARTSKLVFSP----NSY-----AAGNKT 137 (309) Q Consensus 68 e~~~~~~~~~~~e~~L~~~v~~~~~~~a-~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~----~~y-----~~~~~~ 137 (309) ..........+.-.+.-......|...| ...+|...+-.+..++.++..+ -+.++-+ ..| |+-... T Consensus 113 d~~~~~~~~~v~~~~~g~~yg~~E~~~Aa~~~~~l~~~Ka~aA~~ale~~~----N~i~~~Gd~~~~~yGllNdP~l~a~ 188 (336) T protein:vir:36 113 GANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFL----NGSYLFGVAGLENYGLINDPSLSAP 188 (336) T ss_pred ecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhh----CcEEEEeccccceEEEEecCCCccc Confidence 4333333334444444444554554444 3456654443333333332211 1222211 112 111111 Q ss_pred ecccccccCCCCCC-hHHHHHHHHHHh-----C----CCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHH Q lcl|Aclame:pro 138 TLSGADQWSDPTSN-PLPVITDALDSV-----I----LRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQE 207 (309) Q Consensus 138 ~lsgt~~Wsd~~sd-Pi~di~~~~~~~-----g----~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~ 207 (309) +-+.++.|+..+.+ .++||.+...++ | -.|++++|....+..|.+- + ..|.-=.+.|++ T Consensus 189 ~t~~t~~~~~~t~~ei~~Di~~~~~~l~~qt~G~i~~~~~~tL~LP~~~~~~Ls~~---------n--~~g~Tvl~~lk~ 257 (336) T protein:vir:36 189 ITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKT---------N--QYGLAAAAKLKD 257 (336) T ss_pred cccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccEEEechHHHHhccCC---------C--ccCccHHHHHHH Confidence 22233345555544 899999887665 4 2599999999999888431 1 112212456666 Q ss_pred HhCCCeEEeecc-eeeccccCCCcccceecCCcEEEEecCCCCCCcCcceeccccccc----ccccCCccccccccCCce Q lcl|Aclame:pro 208 LLELDAIYIGEA-RLNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWG----DRVSGSIADPNIGLRGGQ 282 (309) Q Consensus 208 l~gl~~I~v~~a-~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~----~~~~~~~~d~~~g~~g~~ 282 (309) -| |.|.+-.+ .+.++ . ++.+.+++....+.. .-..+++-.+. ......+..+....-+|. T Consensus 258 n~--Pnl~i~t~pEl~~a----~-------g~~~~l~~~~~~~~~--t~~~~~p~~~~~l~vq~~~~~~~v~~~~rt~Gv 322 (336) T protein:vir:36 258 IF--PKLEFVTIPEYDTA----S-------GRLVQLWAPRVEGKD--TATCGFTEKMRAHSIERYSSYFRQKKSAGTWGA 322 (336) T ss_pred hc--CccEEEEccccccC----C-------CceEEEEEEecCCCc--ceeeecchhhhccceeecCceeEeccccceeee Confidence 54 22322111 12111 1 122223322211110 00112221110 011222333333333443 Q ss_pred EEEeecccceeeecchhhh Q lcl|Aclame:pro 283 RVRVGESVKELVTAPDLGF 301 (309) Q Consensus 283 ~v~v~~~~~~~v~~~~~G~ 301 (309) .|+ .|.-+..-.|. T Consensus 323 ~i~-----~P~ai~~~~GI 336 (336) T protein:vir:36 323 VIF-----RPFAVAQMIGV 336 (336) T ss_pred eee-----ccchheeeecC Confidence 332 33333333333 No 101 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=82.34 E-value=0.078 Score=26.67 Aligned_cols=262 Identities=8% Similarity=-0.087 Sum_probs=106.8 Q ss_pred CC-------CC-CCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhccccccccc-ccCc Q lcl|Aclame:pro 1 MS-------NA-PFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEV-EFSA 71 (309) Q Consensus 1 m~-------~~-~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~v-e~~~ 71 (309) |+ +. ..+|......|- ......-.-..+++.+||....++++.... ..+.....-++.++..... ..++ T Consensus 105 ~~~~~~~~~~gg~~vP~~~~~~ii-~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~-~~~~~~a~~v~E~~~~~~~~~~~f 182 (395) T protein:vir:38 105 VTSGTTGTGNAGLTIPEDIQLQIR-TLTRSFTSLESLANVENVTTSHGSRVYEKL-ADITPLKDLDDESALIGDNDDPEL 182 (395) T ss_pred HhhccCccCCCceecchhHhhHHH-HHHHhhcchhhhcceeeccCCcceEEEEee-ccCCccccccccccccccccccce Confidence 11 11 122322222221 111111122233455666666565543221 1111111123333333322 2334 Q ss_pred CccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCC Q lcl|Aclame:pro 72 TDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSN 151 (309) Q Consensus 72 ~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sd 151 (309) ...++..+..+-..+++.+-..+ +.++....-++.+.+.+....|..+.. +.. ......+.. + T Consensus 183 ~~v~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~la~~~~~~~~~~il~----g~g----~~~~~~~~~-----~-- 245 (395) T protein:vir:38 183 TVVKYLIHRYAGITTVTNTLLKD--TVDNIIQWLVNWAAKKDVVTRNAKILE----VMG----KAPKKPTIS-----Q-- 245 (395) T ss_pred eeEEeeeeeeEeehhhHHHHHhh--hHHHHHHHHHHHHHHHHHHHHHHHHhh----ccc----ccccccccc-----c-- Confidence 44444444444444566554443 334555666666777776665544332 211 111111211 1 Q ss_pred hHHHHHHHHH-Hh--CCCCc-EEEeCHHHHHHHhcCHHHHHHhccCCCccc---ccCHHHHHHHhCCCeEEeecceeecc Q lcl|Aclame:pro 152 PLPVITDALD-SV--ILRPN-IGVLGRRTATILRRHPKIVKAYNGSLGDEG---MVPMAFLQELLELDAIYIGEARLNIA 224 (309) Q Consensus 152 Pi~di~~~~~-~~--g~~Pn-~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~---~vt~~~l~~l~gl~~I~v~~a~~~~~ 224 (309) ..+|.++.. .+ ..++| ..+|+++.|.+|+. ++..++..- .++...-..++|.| |++.+..... T Consensus 246 -~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~L~~-------lkd~~G~~l~~~~~~~~~~~~l~G~p-V~~~~~~~~~- 315 (395) T protein:vir:38 246 -FDNIKDLENNTLDPAIESTSSFITNQSGYNILSK-------VKDADGRYLMQPDVTSPDKYLIDGKP-VIRIADKWLP- 315 (395) T ss_pred -HHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHH-------hhccCCceeeccCcCCCCcceeccce-eEEecccccC- Confidence 223444332 22 34444 58999999999864 222222210 01111223456766 4443322111 Q ss_pred ccCCCcccceecCCc--EEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhh Q lcl|Aclame:pro 225 RPGQNPNLIRAWGPH--ASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFF 302 (309) Q Consensus 225 ~~g~~~~~~~v~~~~--~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l 302 (309) +......-++++- .++++. .-|.+.++.... ...-..+...+|+...++-.+.-+++... T Consensus 316 --~~~~~~~i~~gd~~~~~~i~~----------~~~~~i~~~~~~------~~~~~~~~~~~r~~~r~d~~~~~~~a~~~ 377 (395) T protein:vir:38 316 --DVSGSHPLYFGDLKQGITLFD----------RQQMQIDTTNVG------AGSFEHDTTKLRFIDRFDVQLIDDGAFAA 377 (395) T ss_pred --cCCCcceEEEEeccccEEEEE----------ecceEEEEeccc------cchhhcCceEEEEEEeeccEEecccceEE Confidence 1111111233321 111111 012222221110 01112444567788888888888888888 Q ss_pred hhccccC Q lcl|Aclame:pro 303 FENAVAA 309 (309) Q Consensus 303 ~~~~va~ 309 (309) ++-..++ T Consensus 378 ~~~~~~~ 384 (395) T protein:vir:38 378 ASFKTVA 384 (395) T ss_pred EEeeccc Confidence 7766555 No 102 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=82.22 E-value=0.079 Score=26.63 Aligned_cols=266 Identities=9% Similarity=-0.021 Sum_probs=118.5 Q ss_pred CCCC------CCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccc-cCcCc Q lcl|Aclame:pro 1 MSNA------PFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVE-FSATD 73 (309) Q Consensus 1 m~~~------~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve-~~~~~ 73 (309) |+.. ..++..+.+.|-.--++..-|. .+++.+|+.....++|+......+ .-++.++...... ..+.+ T Consensus 130 l~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~-~l~~~~~~~~~~~~~~~~~~~~~a----~wv~E~~~~~~~~~~~f~~ 204 (425) T protein:vir:10 130 LNKGEDSEGGYLTPIEWDRTITNKLVLISPMR-QLCRVQPVSKAGFSKLFNMGGTTS----GWVGEASQRPQTNAATFQP 204 (425) T ss_pred hhcCcCCCCceeccHhHHHHHHHHHHhhhhhh-hhceeeeccCCceEEEEEcCCcce----eeeccccccccccccccce Confidence 3322 1334333333332222222233 356777888777888775332111 1233344333333 24555 Q ss_pred cceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceeccc--------cccc Q lcl|Aclame:pro 74 ETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSG--------ADQW 145 (309) Q Consensus 74 ~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsg--------t~~W 145 (309) .++..+..+-..+++++-..++ .++....-.+.+.+.|....+..+ +++..- ......|+. +..| T Consensus 205 v~~~~~k~~~~i~iS~ell~ds--~~~l~~~i~~~la~ai~~~~d~~~----l~G~G~-~~p~Gil~~~~~~~~~~~~~~ 277 (425) T protein:vir:10 205 LSFASGEIYANPAATQQILDDA--EIDLESWLATEVQTEFAKQEGKAF----LAGDGT-NKPNGLLTYIAGGANAAKHPF 277 (425) T ss_pred eeeeheeeEeehHhHHHHHhcc--hhHHHHHHHHHHHHHHHHHHHhhh----hcccCC-CCcceeeeccccccccccccc Confidence 5666655555566666655433 355556656666676665555433 222110 000011110 0001 Q ss_pred C---C-----CCCChHHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCH----HHHHHHhC Q lcl|Aclame:pro 146 S---D-----PTSNPLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPM----AFLQELLE 210 (309) Q Consensus 146 s---d-----~~sdPi~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~----~~l~~l~g 210 (309) . . .+..-..+|.+....+ .....+.+|+++.|.+|+. + +..++.+ +..+ ..-..+|| T Consensus 278 ~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~~~a~~vmn~~~~~~L~~---l----kD~~G~~-l~~~~~~~g~~~~l~G 349 (425) T protein:vir:10 278 GAIEVVNSGAAADITSDGIIDLVYDLPSAFTGNARFAMNRNTQRQVRK---L----KDGQGNY-LWQPSYVAGQPATLAG 349 (425) T ss_pred cccccccccccccccHHHHHHHHhhhhhhhccCCEEEEchHHHHHHHH---h----hcCCCce-eeccCccCCCCceecc Confidence 1 0 1122233444444333 2344578999999988764 2 2222222 1111 11124667 Q ss_pred CCeEEeecceeeccccCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeeccc Q lcl|Aclame:pro 211 LDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESV 290 (309) Q Consensus 211 l~~I~v~~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~ 290 (309) .| |++-+..-. +..+...++|.+- +.+|. ..++.+..+..+.+-..+...++....+ T Consensus 350 ~P-V~~~~~~p~------------~~~~~~~i~~Gd~--------~~~~~--i~~~~~~~v~~d~~~~~~~~~~~~~~r~ 406 (425) T protein:vir:10 350 YP-VTEVPDMPD------------VAANSTPILFGDF--------QQTYL--IIDRIGVRVLRDPYTAKPYVLFYTTKRV 406 (425) T ss_pred ee-eEEecCcCC------------ccCCccEEEEEeh--------hccEE--EEEecceEEEecccccCCcEEEEEEEEe Confidence 76 444322110 0111111111110 00111 1111222222333344566678888889 Q ss_pred ceeeecchhhhhhhccccC Q lcl|Aclame:pro 291 KELVTAPDLGFFFENAVAA 309 (309) Q Consensus 291 ~~~v~~~~~G~l~~~~va~ 309 (309) +-.++-+++..+++.+.|= T Consensus 407 d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 407 GGGLLNPEPMRAMKVAASE 425 (425) T ss_pred ccEeecccceEEEEeeccC Confidence 9999999988777665555 No 103 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=82.18 E-value=0.079 Score=26.62 Aligned_cols=275 Identities=12% Similarity=0.074 Sum_probs=120.8 Q ss_pred CCCC------CCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccc-----ccccc Q lcl|Aclame:pro 1 MSNA------PFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKP-----NEVEF 69 (309) Q Consensus 1 m~~~------~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~-----~~ve~ 69 (309) ||.. ..++..+.+.|-.--++..-| -.+++.+++.....++|++...... .-++.++.. ...+. T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l-~~l~~~~~~~~~~~~~p~~~~~~~a----~wv~E~~~~~~~~~~~s~~ 75 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTV-LSAFQNVNMGTKTTHLPVLATLPEA----DWVGESATDPKGVKPTSKV 75 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchh-hhhcceeeccCCcEEEEEEeCCcce----EEeeccccccccccccccc Confidence 7753 255555545544322222223 3456888888777888887643211 112222222 12233 Q ss_pred CcCccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccc---cCcccceecc--cccc Q lcl|Aclame:pro 70 SATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNS---YAAGNKTTLS--GADQ 144 (309) Q Consensus 70 ~~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~---y~~~~~~~ls--gt~~ 144 (309) ++.+..+..+..+-..++++|-..+ +.++....-.+.+.+.+....|..+-...-.+.. ....+....+ .... T Consensus 76 ~f~~i~~~~~k~~~~~~is~ell~d--s~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~ 153 (305) T protein:vir:25 76 TWANRTLVAEEIAVIIPVHENVIDD--ATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEV 153 (305) T ss_pred ceeeEEeeeEEEEEeehhhHHHHhc--chHHHHHHHHHHHHHHHHHHHhhhheeccCCCCCccccccccccccccccccc Confidence 3444444444444444566655443 3344455555666666666555444321100000 0000011111 1111 Q ss_pred cC--CCCCChHHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecc Q lcl|Aclame:pro 145 WS--DPTSNPLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEA 219 (309) Q Consensus 145 Ws--d~~sdPi~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a 219 (309) +. +...|++.++......+ +..++..+|++..|..|+. + +..++.. +..+. .++|+|-++ -+. T Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~---l----kd~~G~~-i~~~~---~l~G~Pv~~-~~~ 221 (305) T protein:vir:25 154 VGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVAN---I----RDANGNP-VFRDD---SFAGFRTFF-NRN 221 (305) T ss_pred cccchhhhHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHH---h----hccCCce-eecCC---cccccceEE-cCc Confidence 21 12345555555554433 6778899999999988753 2 2223322 22222 477888443 222 Q ss_pred eeeccccCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccc---cCCccccccccCCceEEEeecccceeeec Q lcl|Aclame:pro 220 RLNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRV---SGSIADPNIGLRGGQRVRVGESVKELVTA 296 (309) Q Consensus 220 ~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~---~~~~~d~~~g~~g~~~v~v~~~~~~~v~~ 296 (309) .-. ... +..-++++-.-+.+... . |.+.+..... .+... ......+...+|+...++-.+.- T Consensus 222 ~~~---~~~--~~~~~~gd~s~~~i~~~-----~----~~~i~~~~~~~~~~~~~~-~~~~~~~~~~~R~~~r~~~~v~~ 286 (305) T protein:vir:25 222 GAW---DAD--AAIEVIADSSRVKIGVR-----Q----DITVKFLDQATLGTGENQ-INLAERDMVALRLKARFAYVLGV 286 (305) T ss_pred cCC---CCC--ccEEEEEecceEEEEEe-----c----CeEEEEeeeeeeecCCce-eeeeecCcEEEEEEEeecceeeC Confidence 110 011 11112222111111100 0 1111111000 00000 01112344567888888777777 Q ss_pred chhhhhhhccccC Q lcl|Aclame:pro 297 PDLGFFFENAVAA 309 (309) Q Consensus 297 ~~~G~l~~~~va~ 309 (309) +.+-..++++-+| T Consensus 287 p~a~v~~~~~~~~ 299 (305) T protein:vir:25 287 SATAQGANKTPVA 299 (305) T ss_pred cccEEEEcccccc Confidence 8887777777554 No 104 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=81.99 E-value=0.081 Score=26.57 Aligned_cols=279 Identities=11% Similarity=0.003 Sum_probs=93.5 Q ss_pred CCCCCCCcchhhHHHHHh-hcchhhhhhhhCCcc-------ccc-cccc--eeEEechhHhhhchhHhh---cccccccc Q lcl|Aclame:pro 1 MSNAPFPIDPELTAIAIA-YRNGRMISDEVLPRV-------PVG-KQEF--KFWKYDLAQGFTVPETLV---GRKSKPNE 66 (309) Q Consensus 1 m~~~~f~~dp~LT~~a~~-y~n~~~ig~~lfP~v-------~v~-~~~~--k~~~~~~~~~f~~~~t~~---~~~~~~~~ 66 (309) ||+..|.++.. ...++. +++ ..+|+.+ +.. +.+. ++++.+ .+..-+-.. ++++.... T Consensus 1 Ma~~~~~p~~~-a~~~l~~l~~-----~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~---~~~~~~~~~~~~~~~~~~~~ 71 (392) T protein:vir:99 1 MANAFSKPTAV-VDTAIQMLQN-----ELILTNLVWLNGIGDFAHKFNDTITVRVPA---PSRGHTRKLRGAGAERNLTV 71 (392) T ss_pred CccccccHHHH-HHHHHHHHHh-----hccchhhhccccccccccCCCCeEEEeecc---cccceeeeccccccCCcccc Confidence 99999977754 444443 332 2223322 111 1121 222222 232222111 11222222 Q ss_pred cccCcCccceeeecc-chhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceeccccccc Q lcl|Aclame:pro 67 VEFSATDETGSTEDH-GLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQW 145 (309) Q Consensus 67 ve~~~~~~~~~~~e~-~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~W 145 (309) -++.-.+..+.+.++ .....++.+| .++...|...+..+.....|....|..++..+.... +. ... ++.-. T Consensus 72 ~~~~~~~~~~~id~~k~~~~~i~d~e--~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~-~~----~~~-~~~~~ 143 (392) T protein:vir:99 72 SDFTEDSFPVTLTDVAYHLGVLTDEE--LTFDLESFATQILPRQVRGVADILEEGVRDMIVGAP-YE----AAG-AVHEV 143 (392) T ss_pred cccccceEEEEEeeeeecceeechHH--HhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccc-cc----ccc-ccccc Confidence 233334444444222 3333455554 344445555555555555555544555555443221 11 111 11111 Q ss_pred CCCCCChHHHHHHHHHHhCC--CC--cEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeeccee Q lcl|Aclame:pro 146 SDPTSNPLPVITDALDSVIL--RP--NIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARL 221 (309) Q Consensus 146 sd~~sdPi~di~~~~~~~g~--~P--n~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~ 221 (309) ..++.+.+|.++..++.. -| .++++++..+..|+..+.+.+.-.........+..-++..++|.+ |+.-...- T Consensus 144 --~~~~~~~~i~~a~~~L~~~~vP~~R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~~G~vg~i~G~~-v~~s~~~~ 220 (392) T protein:vir:99 144 --APDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYE-IVESTLIP 220 (392) T ss_pred --ChhhhHHHHHHHHHHHhhcCCCCCCEEEEcHHHHHHHhcccceeecccccchhhhhhhcceeeeeeeeE-EEeecccc Confidence 123345556665554421 24 379999999999999988766543321111112223445555654 32211110 Q ss_pred eccccCCCcccceecCCc-EEEEecCCCCCCcCcceecccccccccccCCcccccccc-----------CCceEEEeecc Q lcl|Aclame:pro 222 NIARPGQNPNLIRAWGPH-ASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGL-----------RGGQRVRVGES 289 (309) Q Consensus 222 ~~~~~g~~~~~~~v~~~~-~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~-----------~g~~~v~v~~~ 289 (309) .+. . . .|-.. ..+....+. ...+.+++....-.+...+.+...+-+- .|...+..... T Consensus 221 ----~~t-~-~--a~~~~a~~~at~a~v--~~~~~~~~~s~s~~~~v~~~~~~~~~~t~~s~~~~v~~~~g~~~v~~~~~ 290 (392) T protein:vir:99 221 ----HGD-A-Y--LYHPTAFIMATRAPA--PPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNG 290 (392) T ss_pred ----ccc-c-e--eeecccccccccccc--ccccccceeEEecccceecceeecccceeeccccccceeEEEEEEeeccc Confidence 000 0 0 01111 111111100 0001111111100000011110000000 00000000000 Q ss_pred c----ceeeecchhhhhhhccccC Q lcl|Aclame:pro 290 V----KELVTAPDLGFFFENAVAA 309 (309) Q Consensus 290 ~----~~~v~~~~~G~l~~~~va~ 309 (309) . .-.+......+-+..+.-+ T Consensus 291 ~~~~~~~~~~~~~~~v~v~~v~~~ 314 (392) T protein:vir:99 291 VGFVRARKIHLIPGSIEVAPEAGA 314 (392) T ss_pred cceeeeeeeeeecceeeeeeeecc Confidence 0 0000000000000000000 No 105 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=81.75 E-value=0.083 Score=26.51 Aligned_cols=257 Identities=11% Similarity=0.028 Sum_probs=106.6 Q ss_pred CCC---C-CCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCccce Q lcl|Aclame:pro 1 MSN---A-PFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDETG 76 (309) Q Consensus 1 m~~---~-~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~~ 76 (309) +.. . .-++..+.+.|-.--++..-| -.++..+||...+++|++..... ...-..++.++......+.+...++ T Consensus 116 ~~t~~~gg~liP~~~~~~Ii~~~~~~~~l-~~l~~~~~~~~~~~~~~~~~~~~--~~~~~~~~E~~~~~~s~~~f~~i~~ 192 (421) T protein:vir:13 116 IMSSTNNGAVIPQEFVNEFEKLKEGYPSL-KEHCHVIPVNRNAGKMPVRAGAS--VDKLANLAKDTELVKAMLKTQPMAY 192 (421) T ss_pred ccccCCcceecchhhHHHHHHHHHhhhhh-hhhceeeeccCCceEEEEeecCC--ccceeeccccccccccccceeEEEe Confidence 111 1 233444444432211221222 23456778887788888765421 1111234445555555566666666 Q ss_pred eeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCChHHHH Q lcl|Aclame:pro 77 STEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPLPVI 156 (309) Q Consensus 77 ~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sdPi~di 156 (309) ..+..+-..+++.+-..++ .++......+.+.+.+....+-...+. + +..++ .++..-..+| T Consensus 193 ~~~k~~~~v~iS~ell~ds--~~~l~~~i~~~la~~~~~~~~~~i~~~-------~---~g~~~------~~~~~~~d~i 254 (421) T protein:vir:13 193 DIDDYGLLAPIDNSLLEDS--EINFLEFVNEEFAEFAVNTENAEIVKQ-------A---KAVLA------EETINDYAGL 254 (421) T ss_pred eeeeeEeehhhhHHHHhhh--HHHHHHHHHHHHHHHHHHHhhhhHhhh-------h---hhccc------cccccchHHH Confidence 6666665566776655443 234444444445554443333221110 0 01111 1122223445 Q ss_pred HHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCH---HHHHHHhCCCeEEeecceeeccccCCCc Q lcl|Aclame:pro 157 TDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPM---AFLQELLELDAIYIGEARLNIARPGQNP 230 (309) Q Consensus 157 ~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~---~~l~~l~gl~~I~v~~a~~~~~~~g~~~ 230 (309) .+.+..+ ...+...+|+++.|..|+. + +..++.. +..+ ..-..++|+| |++-+.... +... T Consensus 255 ~~~~~~l~~~~~~~a~~v~n~~~~~~l~~---l----kd~~G~~-i~~~~~~~~~~tl~G~p-V~~~~~~~~----~~~~ 321 (421) T protein:vir:13 255 VKTINSLVPNARKRAIIVTNSDGRAYLDG---L----MDKQGRP-LLKELSDGGDLVFKGRP-VIELEESIF----DVGD 321 (421) T ss_pred HHHHHHhhhhhcCCCEEEEcHHHHHHHHH---h----hcCCCce-eecCcCCCCCceeccee-eEEeccccc----cCCC Confidence 4444443 5667889999999988753 2 2222222 1111 1113467877 333332211 1111 Q ss_pred ccceecCCc--EEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchh--------- Q lcl|Aclame:pro 231 NLIRAWGPH--ASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDL--------- 299 (309) Q Consensus 231 ~~~~v~~~~--~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~--------- 299 (309) ...-++++- .++++.. -|.+.++.. ...-..+...+|+...++-++.-+.+ T Consensus 322 ~~~~~~gd~~~~~~~~~~----------~~~~v~~~~--------~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~ 383 (421) T protein:vir:13 322 ETKFIVSDFKTLIKFMDR----------KQYLIDQSK--------EAGYTKNETIARIIERFDVNSPLDKSSDAEKIRKF 383 (421) T ss_pred ceEEEEEeccccEEEEEe----------cceEEEeec--------ccccccCeeEEEEEeeecceeecchhhheeeeccc Confidence 122222321 1111110 022222111 01112333445555555444433322 Q ss_pred hhhhh--ccccC Q lcl|Aclame:pro 300 GFFFE--NAVAA 309 (309) Q Consensus 300 G~l~~--~~va~ 309 (309) |.|.. ++.++ T Consensus 384 ~a~v~~~~~~~~ 395 (421) T protein:vir:13 384 GVIVKLQEVLKS 395 (421) T ss_pred ceeeccccccCC Confidence 33332 33333 No 106 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=81.56 E-value=0.085 Score=26.46 Aligned_cols=262 Identities=8% Similarity=-0.044 Sum_probs=110.5 Q ss_pred CCC------CCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccc-cccCcCc Q lcl|Aclame:pro 1 MSN------APFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNE-VEFSATD 73 (309) Q Consensus 1 m~~------~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~-ve~~~~~ 73 (309) |+. ...++..+...|-..-+...-|.+ ++..+|+....++++.... ........-++.++.... ....+.. T Consensus 116 ~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~-~~~~~~~~~~~~~~~~~~~-~~~~~~a~~v~Eg~~~~~~~~~~f~~ 193 (404) T protein:vir:39 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQ-YVRVESVSTSNGSRVYEKW-TDVTPLTVMDAEDGKIPDLDNPRLTI 193 (404) T ss_pred hhcccccCCceeccHHHHHHHHHHHHhhhhHHh-hcceeeccCCcceEEEEee-cCCccceeeecCccccccccccceee Confidence 211 112233333333211111111222 2345566665566554321 111111122444444433 2345555 Q ss_pred cceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCChH Q lcl|Aclame:pro 74 ETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPL 153 (309) Q Consensus 74 ~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sdPi 153 (309) .++..+..+-..+++++-..++ .+|....-.+.+.+.+....|..+ +++. +.....++.. .. T Consensus 194 i~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~~~~~~d~~i----l~g~----g~~~~~~~~~--------~~ 255 (404) T protein:vir:39 194 IKYLIKRYAGIITATNTLLKDT--AENILAWLSSWIAKKVVVTRNQAI----IAAM----GTVPKKPTIA--------KF 255 (404) T ss_pred EEeeeeeEEeeehhHHHHHhhc--hHHHHHHHHHHHHHHHHHHHHHHH----Hhcc----cccccccccc--------cH Confidence 5656655555556666655433 355566666667777666555433 2221 1111112221 12 Q ss_pred HHHHHHHHH-h--CCCC-cEEEeCHHHHHHHhcCHHHHHHhccCCCcccccC----HHHHHHHhCCCeEEeecceeeccc Q lcl|Aclame:pro 154 PVITDALDS-V--ILRP-NIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVP----MAFLQELLELDAIYIGEARLNIAR 225 (309) Q Consensus 154 ~di~~~~~~-~--g~~P-n~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt----~~~l~~l~gl~~I~v~~a~~~~~~ 225 (309) .+|.+.+.. + .+++ ...+|+++.|.+|+. + +..++.. +.. ...-..++|.| |++.+...... T Consensus 256 ~~i~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~---l----kd~~G~~-l~~~~~~~~~~~~l~G~p-V~~~~~~~~~~- 325 (404) T protein:vir:39 256 DDVITMINTSVDPAIIATSSLLTNQSGLNKLAL---V----KTAEGKY-LLEPDPTKPNSYLIKGKK-VIVVADRWLPN- 325 (404) T ss_pred HHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHH---h----hccCCce-eeccCcCCCCcceeccee-EEEecccccCc- Confidence 334333321 1 3333 458999999988874 2 2222221 111 11123566776 33332211111 Q ss_pred cCCCcccceecCCc--EEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhh Q lcl|Aclame:pro 226 PGQNPNLIRAWGPH--ASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFF 303 (309) Q Consensus 226 ~g~~~~~~~v~~~~--~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~ 303 (309) .......-++++- .++++. .. |.+.++.... ...-..+...+|+...++-.+.-+.+...+ T Consensus 326 -~~~~~~~~~~gd~~~~~~~~~------~~----~~~i~~~~~~------~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~ 388 (404) T protein:vir:39 326 -SGSTVYPLYYGDMSQAITLFD------RE----NMSLLPTNIG------AGAFETDTTKIRVIDRFDVKTTDSEALVAG 388 (404) T ss_pred -cCCCccEEEEEeccccEEEEe------ec----ceEEEEeccc------hhhhhhceeeEEEEeeeccEEecccceEEE Confidence 1111111222221 111111 00 1122221110 011224555688888888888888888888 Q ss_pred hccccC Q lcl|Aclame:pro 304 ENAVAA 309 (309) Q Consensus 304 ~~~va~ 309 (309) +-..++ T Consensus 389 ~~~~~a 394 (404) T protein:vir:39 389 SFTAIA 394 (404) T ss_pred Eeeccc Confidence 744444 No 107 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=81.06 E-value=0.089 Score=26.34 Aligned_cols=270 Identities=12% Similarity=0.021 Sum_probs=108.9 Q ss_pred CCCCCCCcchhhH--HHHHhhcchhhhh---hh------hCCccccc----cccceeEEechhHhhhchhHhhccccccc Q lcl|Aclame:pro 1 MSNAPFPIDPELT--AIAIAYRNGRMIS---DE------VLPRVPVG----KQEFKFWKYDLAQGFTVPETLVGRKSKPN 65 (309) Q Consensus 1 m~~~~f~~dp~LT--~~a~~y~n~~~ig---~~------lfP~v~v~----~~~~k~~~~~~~~~f~~~~t~~~~~~~~~ 65 (309) +.+.++.+-.-+| +..-||-.|+... +. +...+.|. ....++...+-...-....+..+...... T Consensus 8 ~~~~~~~~~k~~t~~d~~Gg~l~P~~~~~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~~~~~g~~~~~~~~~~~ 87 (315) T protein:vir:41 8 RGGKPFEIVPKIDVPDLGRGVLSVDRFGEFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVLDVGPGRDETGQKLAPP 87 (315) T ss_pred hcCChhhhhhhcCCcCCCCceechHHHHHHHHHHHhhhhhhhhceeeeccccccccccccccCcccccccccccCcCCCC Confidence 3334444333322 2234455554222 12 22222221 11111111110000000011222223334 Q ss_pred ccccCcCccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-cCcccceecc---- Q lcl|Aclame:pro 66 EVEFSATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNS-YAAGNKTTLS---- 140 (309) Q Consensus 66 ~ve~~~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~-y~~~~~~~ls---- 140 (309) +.+.++...++.+++....-.++++..++.....|-++.-+..+.+++...+|..+-+.-....+ +...++.-|+ T Consensus 88 ~~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s~~p~~~~~~G~l~~a~~ 167 (315) T protein:vir:41 88 ESTAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDTSSSDPLLRMSDGWLKLASE 167 (315) T ss_pred CCccccceeeeceeeeeeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccCCcCcCccccccccceecccc Confidence 44566777788888888777888888776544446777777778887777666544332111000 0011111111 Q ss_pred ---cc-cccCCCCCChHHHHHHHHHHh--CCC---Cc-EEEeCHHHHHHHhcCHHHHHHhccCCCccc--ccCHHHHHHH Q lcl|Aclame:pro 141 ---GA-DQWSDPTSNPLPVITDALDSV--ILR---PN-IGVLGRRTATILRRHPKIVKAYNGSLGDEG--MVPMAFLQEL 208 (309) Q Consensus 141 ---gt-~~Wsd~~sdPi~di~~~~~~~--g~~---Pn-~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~--~vt~~~l~~l 208 (309) ++ ..|+ +.+.|...+.+...++ .++ +| +.+|+++++.++ ++.....+...+ .+...+-..+ T Consensus 168 ~~~~~~~~~~-a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~------rklk~~~g~~lw~~~~~~g~~~tl 240 (315) T protein:vir:41 168 KLTESDVDPE-AEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAY------RDALKGRETGLGDQALTGANSILY 240 (315) T ss_pred cccccccccc-cccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHH------HHHhccCCCccccchhhcCCCcee Confidence 10 1111 2234555555555554 222 33 689999998774 344443322111 1112222345 Q ss_pred hCCCeEEeecceeeccccCCCccccee-cCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEee Q lcl|Aclame:pro 209 LELDAIYIGEARLNIARPGQNPNLIRA-WGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVG 287 (309) Q Consensus 209 ~gl~~I~v~~a~~~~~~~g~~~~~~~v-~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~ 287 (309) +|.| |+.- ..+... +|+..+++ .+. ..-.+|.. ....+...+....+...+-.+ T Consensus 241 ~G~P-V~~~------------~~m~~~~~~~~~ilf-~d~-----~nl~~~~~------~~i~i~~~~~a~~~~~~~~~~ 295 (315) T protein:vir:41 241 DGRP-VQYV------------PALEALNDGKSRALF-VVP-----TQLVYGFW------RNIKVVPDYDAEMRLTKYVAS 295 (315) T ss_pred cccc-eEec------------ccccccCCCCccEEE-ecc-----cceEEEec------cccEEEeeecCCCCceEEEEE Confidence 5655 2211 111111 23333332 221 11123321 111121111111111111111 Q ss_pred cccceeeecchhhhhhhccccC Q lcl|Aclame:pro 288 ESVKELVTAPDLGFFFENAVAA 309 (309) Q Consensus 288 ~~~~~~v~~~~~G~l~~~~va~ 309 (309) .-.|++|.++|+.|. T Consensus 296 -------~r~d~~~~~~~~~a~ 310 (315) T protein:vir:41 296 -------LRTDNHYEDEEGAVS 310 (315) T ss_pred -------EEeceeEEeccceeE Confidence 123667777777555 No 108 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=81.02 E-value=0.09 Score=26.33 Aligned_cols=285 Identities=8% Similarity=-0.010 Sum_probs=103.7 Q ss_pred CC--CCC------------CCcchhhHHHHHhhcchhhhhhhhCCcc----ccc---cccceeEEechhHhhhchhHhhc Q lcl|Aclame:pro 1 MS--NAP------------FPIDPELTAIAIAYRNGRMISDEVLPRV----PVG---KQEFKFWKYDLAQGFTVPETLVG 59 (309) Q Consensus 1 m~--~~~------------f~~dp~LT~~a~~y~n~~~ig~~lfP~v----~v~---~~~~k~~~~~~~~~f~~~~t~~~ 59 (309) |+ |+. |.+. +.....+.- |-...+|... +.. ....++|+.+... . .... T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipe-i~s~~i~~~----l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~g~~~---~--~d~~ 70 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPE-QWLSEVQMF----RKAKMLDTSVVKTWGAQVKKGDTFHVPRISELG---V--EDKA 70 (341) T ss_pred CcchhhhccccccchhHHHHHHH-HHHHHHHHH----HHhhcchhhccccccccccCCceEEEeccCcce---e--eeec Confidence 44 432 2211 111111111 1111222221 111 1112233333211 1 1122 Q ss_pred ccccccccccCcCccceeeecc-chhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCccccee Q lcl|Aclame:pro 60 RKSKPNEVEFSATDETGSTEDH-GLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTT 138 (309) Q Consensus 60 ~~~~~~~ve~~~~~~~~~~~e~-~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~ 138 (309) +++....-+....+.++.+.++ .....|+..+ +.+..+|+....++.....|....+..++..+-.....+..+... T Consensus 71 ~~~~i~~~~~~~~~~~itiD~~~~~~~~i~d~d--~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~ 148 (341) T protein:vir:94 71 TDVPVGVQPVNDTDFVITVDTDRTTAVALDDLL--EIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFS 148 (341) T ss_pred CCCccccccccCceEEEEEeeeeecceeechHH--HHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCcccc Confidence 3333333344445556666333 3344555544 445678888888877777776666666555443222211111111 Q ss_pred cccccccCCCCCChHHHHHHHHHHh---CC--CCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCe Q lcl|Aclame:pro 139 LSGADQWSDPTSNPLPVITDALDSV---IL--RPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDA 213 (309) Q Consensus 139 lsgt~~Wsd~~sdPi~di~~~~~~~---g~--~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~ 213 (309) -+....=.++...-...|.+++..+ .+ ..-.++++++.+..|+.++++.++-+... +.+..-++..++|++ T Consensus 149 ~~~~~~t~~~~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~---~~l~~G~ig~i~G~~- 224 (341) T protein:vir:94 149 SSNGAITGNGQAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQFISKDFINN---APIAQGQIGSLMGVR- 224 (341) T ss_pred CccccccCchhhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhhhcccc---chhheeeeeeEeceE- Confidence 1100000011111223344444443 33 22469999999999999999888754432 234344566677776 Q ss_pred EEeecceeeccccCCCcccceecCCcE-EEEecCCCCCCcC-----------cceeccccc--cccccc------C-Ccc Q lcl|Aclame:pro 214 IYIGEARLNIARPGQNPNLIRAWGPHA-SFIYRDRLADTRN-----------GTTFGLTAQ--WGDRVS------G-SIA 272 (309) Q Consensus 214 I~v~~a~~~~~~~g~~~~~~~v~~~~~-~L~~~~~~~~~~~-----------~~t~G~T~~--~~~~~~------~-~~~ 272 (309) |+..... +..... -|.... ..++........+ ..+-|..+. +-+... . ... T Consensus 225 V~~Sn~l-----p~~~~~---~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~ 296 (341) T protein:vir:94 225 VIRTSLI-----GNNSAT---GWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVV 296 (341) T ss_pred EEEeccc-----cccccc---cccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhccc Confidence 3221111 000000 011110 0010100000000 111111100 000000 0 000 Q ss_pred ccccccCC-------ceEEEeecccceeeecchhhhhhhccccC Q lcl|Aclame:pro 273 DPNIGLRG-------GQRVRVGESVKELVTAPDLGFFFENAVAA 309 (309) Q Consensus 273 d~~~g~~g-------~~~v~v~~~~~~~v~~~~~G~l~~~~va~ 309 (309) .+.....+ ++.++....+=..+.-|+|...|.-.-+. T Consensus 297 ~~~~~~~~~~~~~~~~~~i~~~~~~G~~~lrp~~~v~~~~~~~~ 340 (341) T protein:vir:94 297 SKAPRVTQSFENREQVWLMVGRQAYGARLYRPLHAVNIHTTGDT 340 (341) T ss_pred cccccccccchhhhhhhhhhhhhhhcccccCcceeEEEecCcCC Confidence 00000011 11222223333444455554433222122 No 109 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=80.85 E-value=0.091 Score=26.29 Aligned_cols=265 Identities=11% Similarity=0.076 Sum_probs=104.1 Q ss_pred CCC--CCC---CcchhhHHHHHhhcc--------hhhhhhhhCCccccccccceeEEechhHhhhchhHhhccccccccc Q lcl|Aclame:pro 1 MSN--APF---PIDPELTAIAIAYRN--------GRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEV 67 (309) Q Consensus 1 m~~--~~f---~~dp~LT~~a~~y~n--------~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~v 67 (309) .|. .|= ..++-.-++-..|-. +.+.++.|||...++.=..++.+|.-.+. .-....-+-+++...+ T Consensus 34 da~d~~~~~~~~~~~~i~~~l~~~i~p~~~~~~~~p~~a~~l~pv~t~g~W~~~~~~~~~~e~-~G~a~~ygd~~D~P~~ 112 (336) T protein:vir:10 34 DAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTKVATYGDYSSDGDS 112 (336) T ss_pred hhhhccCccccCCCchhHHHHHhhcccceeeehhhhhhhhhhccccccCCccceeEEEeeeec-eeeEEEeeccCCCcee Confidence 011 110 112222334444543 44788899997665433223444432110 0000111223333344 Q ss_pred ccCcCccceeeeccchhhcCCHHHHHHH-hhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcc----cccCcccc--e--e Q lcl|Aclame:pro 68 EFSATDETGSTEDHGLDAPVPQADIDNA-PTNYNPLGHATEQTTNLILLDREARTSKLVFSP----NSYAAGNK--T--T 138 (309) Q Consensus 68 e~~~~~~~~~~~e~~L~~~v~~~~~~~a-~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~----~~y~~~~~--~--~ 138 (309) ..........+.-.+.-......|...| ...+|...+-.+..++.++..+ -+.++-+ ..|+.-|- . . T Consensus 113 d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~----N~i~~~Gd~~~~~yGllN~P~l~a~ 188 (336) T protein:vir:10 113 GANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFL----NGSYLFGVAGLENYGLINDPSLSAP 188 (336) T ss_pred ecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhh----CcEEEEeccccceEEEEeCCCCccc Confidence 4333333334444444444555555544 3456654443333333332211 1222211 11211111 1 1 Q ss_pred ccccccc-CCCCCC-hHHHHHHHHHHh-----C----CCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHH Q lcl|Aclame:pro 139 LSGADQW-SDPTSN-PLPVITDALDSV-----I----LRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQE 207 (309) Q Consensus 139 lsgt~~W-sd~~sd-Pi~di~~~~~~~-----g----~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~ 207 (309) .+++++| +..+.+ .++||.+....+ | -.|++++|....+..|.+- + ..|.-=.+.|++ T Consensus 189 ~t~~t~~~~~~t~eei~~Di~~~~~~l~~qs~G~i~~~~~~tL~LP~~~~~~Ls~~---------n--~~g~Tvl~~lk~ 257 (336) T protein:vir:10 189 ITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKT---------N--QYGLAAAAKLKD 257 (336) T ss_pred cccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeecccCcceEEecHHHHHhccCC---------C--ccCccHHHHHHH Confidence 2334445 455544 899999886654 4 3699999999999888431 1 112222456666 Q ss_pred Hh-CCCeEEeecceeeccccCCCcccceecCCcEEEEecCCCCCCcCcceeccccccc----ccccCCccccccccCCce Q lcl|Aclame:pro 208 LL-ELDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWG----DRVSGSIADPNIGLRGGQ 282 (309) Q Consensus 208 l~-gl~~I~v~~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~----~~~~~~~~d~~~g~~g~~ 282 (309) -| +|. |+ ....+.++ . ++.+.+++....+.. .-..+++-.+. ......+..+....-+|. T Consensus 258 n~Pnl~-i~-t~pEl~~a----~-------G~~~~l~~~~~~~~~--t~~~~~p~~~~~l~vq~~~~~~~v~~~~rt~Gv 322 (336) T protein:vir:10 258 IFPKLE-FV-TIPEYDTA----S-------GRLVQLWAPRVEGKD--TATCGFTEKMRAHSIERYSSYFRQKKSAGTWGA 322 (336) T ss_pred hcCccE-EE-EccccccC----C-------CceEEEEEEecCCCc--ceeeecchhhhccceeecCceeEeccccceeee Confidence 54 232 21 11122111 1 112223332211110 00112221110 011222333333333443 Q ss_pred EEEeecccceeeecchhhh Q lcl|Aclame:pro 283 RVRVGESVKELVTAPDLGF 301 (309) Q Consensus 283 ~v~v~~~~~~~v~~~~~G~ 301 (309) .|+ .|.-+..-.|. T Consensus 323 ~i~-----~P~ai~~~~GI 336 (336) T protein:vir:10 323 VIF-----RPFAVAQMIGV 336 (336) T ss_pred eee-----ccchheeeecC Confidence 332 23333333333 No 110 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=79.67 E-value=0.1 Score=26.01 Aligned_cols=257 Identities=14% Similarity=-0.014 Sum_probs=114.6 Q ss_pred CCCCCC--Ccch-hhHHHHHhhcchhhhhhhhCCcccc----ccccceeEEechhHhhhchhHhhcccccccccccCcCc Q lcl|Aclame:pro 1 MSNAPF--PIDP-ELTAIAIAYRNGRMISDEVLPRVPV----GKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATD 73 (309) Q Consensus 1 m~~~~f--~~dp-~LT~~a~~y~n~~~ig~~lfP~v~v----~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~ 73 (309) |+-+.+ .++| +++++...--+.. ..+.|-+.+ ..+.+....|++-. ..-+.....-+...-.-+++..+ T Consensus 1 Ma~T~~~d~I~Pev~~~~V~e~~~~~---~~~~~~~~~d~~L~g~~G~ti~~P~~~-~igdae~~~eg~~i~~~~lt~~~ 76 (270) T protein:vir:95 1 MTQTKKANLINPEVLANVVSAQMQNA---IRFTPYAVTDDTLVGQPGDTITRPKYA-YIGAAEDLQEGVAMDTTQMSMTT 76 (270) T ss_pred CCceehhhhcchHHHHHHHHHHHHhH---HhhccccccccccCCCCCCEEEeeeec-CCCccccccCCCccchhhcccch Confidence 997654 4455 5666664322111 122332222 22344444443311 11123334445544445666667 Q ss_pred cceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCChH Q lcl|Aclame:pro 74 ETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPL 153 (309) Q Consensus 74 ~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sdPi 153 (309) ....++.++-.-.++++... ...-||...+.+.+...+....+..+...+. |+ .|+...+.-. T Consensus 77 ~~a~i~~~gk~~~itD~a~~--~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~--------------~a-~~~~~~~~t~ 139 (270) T protein:vir:95 77 TKVTVKETGKAVEVTQTAII--TNVNGTLQEASRQLAMSLADKVEIDYIAELN--------------KS-KQTATVSADA 139 (270) T ss_pred heeeeehhhCcceecHHHHh--hhccchHHHHHHHHHHHHHHHHHHHHHHHhc--------------cc-ccccccccCH Confidence 77777777655555555433 3345888888887777776655544432221 11 1222233345 Q ss_pred HHHHHHHHHhC---CCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeeccccCCCc Q lcl|Aclame:pro 154 PVITDALDSVI---LRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPGQNP 230 (309) Q Consensus 154 ~di~~~~~~~g---~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~~~~g~~~ 230 (309) .+|.++...+| -.++.++|+++++..|+.+..+ .......+.+.--.+..++|++-| |-+... .++ T Consensus 140 ~~~~dA~~~lgd~~~~~~~i~vhs~~~~~Lrk~~~~----~~~~~~~~~~~~G~ig~~~G~~Vi-v~s~~~------~~~ 208 (270) T protein:vir:95 140 TGILDAIEVFNSENDEDYVLYVNPKDYNKLVKSLFK----VGGNVQDRAISKGDLVEIVGVSDI-VKSKRV------SEN 208 (270) T ss_pred HHHHHHHHHhccccCCCcEEEEcHHHHHHHHhhhcc----cccccccchhcccccceecceeEE-EeCCCC------Cce Confidence 67777777774 4689999999999999887532 222222233333456677887643 322211 111 Q ss_pred ccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccc---eeeecchhhhhhh Q lcl|Aclame:pro 231 NLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVK---ELVTAPDLGFFFE 304 (309) Q Consensus 231 ~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~---~~v~~~~~G~l~~ 304 (309) ..+++...++-++.. .++. .| .+|-.....+.-++. +++-+.-..+ .+++...+|-.=- T Consensus 209 -~~~l~~~gAi~~~~~------~~~~----vE-tdRd~~~~~d~i~~~---~~y~v~~~~~skvv~~t~~~a~~~~~ 270 (270) T protein:vir:95 209 -TAFLQRYGAMEIVNK------KKPE----AY-TDFDILKRTHLLSTN---YHYSVNLKDETGVVKVTFKPSGSLEM 270 (270) T ss_pred -eEEEEeccceeeeec------CCce----ee-eccchhhcccEEEee---eEEEEEEEccceEEEEEecCCCCcCC Confidence 123444443322111 0111 11 111110111111111 1111111111 1222222222111 No 111 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=77.84 E-value=0.12 Score=25.62 Aligned_cols=258 Identities=11% Similarity=0.050 Sum_probs=112.9 Q ss_pred CCC-----C-CCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccc-cccCcCc Q lcl|Aclame:pro 1 MSN-----A-PFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNE-VEFSATD 73 (309) Q Consensus 1 m~~-----~-~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~-ve~~~~~ 73 (309) |+. . ..++..+.+.|-..-++..-| -.+++.+||....++|+...... ......+.++.... ....+.. T Consensus 109 ~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l-~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~E~~~~~~~~~~~~~~ 184 (389) T protein:vir:10 109 TSKVTSTEAGVLIPEEIIYDPTAEVNSVVDL-STLVTKTPVTTPKGTYPILKRAT---DRFSSVAELAENPKLAEPEFNK 184 (389) T ss_pred hcccccCCcceeehHHHHHHHHHHHHhhhhH-HhhcceeeccCCeeEEEEEecCC---Ccccccccccccccccccccee Confidence 221 1 234444444443222222222 24467778887778887764311 11122333443332 3445555 Q ss_pred cceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCChH Q lcl|Aclame:pro 74 ETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPL 153 (309) Q Consensus 74 ~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sdPi 153 (309) .++..+..+-..+++++-..+ +.+|....-.+.+.+.+....+..+....-... ..++. ...+.|-+ T Consensus 185 i~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~---------~~~~~--~~~~~d~l 251 (389) T protein:vir:10 185 VDWSVATYRGAIPLSEEAIAD--SAVDLTALVGQSIKEKSVNTYNAMIAPVLQSFT---------AKKTT--TDTLVDSL 251 (389) T ss_pred eeeeheeeEeeehhhHHHHhh--hhHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc---------ccccc--ccccHHHH Confidence 566665555555666665443 334555555566666666665554433321110 01110 11233333 Q ss_pred HHHHHHHHHhCCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHH--------HHHHHhCCCeEEeecceeeccc Q lcl|Aclame:pro 154 PVITDALDSVILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMA--------FLQELLELDAIYIGEARLNIAR 225 (309) Q Consensus 154 ~di~~~~~~~g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~--------~l~~l~gl~~I~v~~a~~~~~~ 225 (309) .++....-.... ....+|++..|..|+. ++..++.+ +..+. .-..++|+| |++-++.+... T Consensus 252 ~~~~~~~~~~~~-~a~~~~n~~~~~~L~~-------lkd~~G~~-i~~~~~~~~~~~~~~~~l~G~p-V~~~~~~~~~~- 320 (389) T protein:vir:10 252 KHILNVDLDPAY-SRALVVTQSLFNTLDT-------LKDKNGRY-LLHDASDSITDGTAKGTILGVP-VYVVGDTLLGS- 320 (389) T ss_pred HHHHHhhhhhhh-CcEEEecHHHHHHHHH-------hhccCCCe-eeecCcccccccccccccccce-eEEecccccCC- Confidence 333222111122 3678999999988874 22222222 11111 112478888 44433322111 Q ss_pred cCCCcccceecCCc--EEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhh Q lcl|Aclame:pro 226 PGQNPNLIRAWGPH--ASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFF 303 (309) Q Consensus 226 ~g~~~~~~~v~~~~--~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~ 303 (309) ..++..-++++- .++++.. . |.+.++.. . .+ -...+|+.+..+-.+.-+++..++ T Consensus 321 --~~~~~~~~~gd~~~~~~~~~~------~----~~~i~~~~------~-~~----~~~~~~~~~r~d~~~~~~~a~~~~ 377 (389) T protein:vir:10 321 --LAGDQKAFVGDLKRGVLFTDR------Q----QVTLAWED------S-KI----YGKYLGAAFRFGVQKADSKAGYFV 377 (389) T ss_pred --CCCceEEEEeeccccEEEEee------c----ceEEEeec------c-cc----ccceEEEEEEeccEEecccceEEE Confidence 112222344431 1222111 1 12222111 0 00 012356666777778888887777 Q ss_pred h--ccccC Q lcl|Aclame:pro 304 E--NAVAA 309 (309) Q Consensus 304 ~--~~va~ 309 (309) + .+.++ T Consensus 378 ~~~~~~~~ 385 (389) T protein:vir:10 378 TNTDVPGS 385 (389) T ss_pred EeeccCCC Confidence 6 33444 No 112 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=77.24 E-value=0.13 Score=25.50 Aligned_cols=269 Identities=10% Similarity=-0.001 Sum_probs=115.3 Q ss_pred CCCCCCCcchhh--------------HHHHHhhcchhhhhhhhCCccccccccce-eEEechhHhhhchhHhhccccccc Q lcl|Aclame:pro 1 MSNAPFPIDPEL--------------TAIAIAYRNGRMISDEVLPRVPVGKQEFK-FWKYDLAQGFTVPETLVGRKSKPN 65 (309) Q Consensus 1 m~~~~f~~dp~L--------------T~~a~~y~n~~~ig~~lfP~v~v~~~~~k-~~~~~~~~~f~~~~t~~~~~~~~~ 65 (309) |+-.-|..+.++ ..+...-++ ..+=..+++.+|+...+.. +++...... ..-++.++... T Consensus 1 m~~~~~~~~~~~~t~~~~~lvP~~~~~~ii~~~~~-~s~l~~~~~~~~~~~~~~~~~~~~~~~~~----a~~v~Eg~~~~ 75 (297) T protein:vir:95 1 MTVQTFNPENVLVSQKKDGTLHKEFTDIIMKEVAQ-NSLVMQLGQYQEMEGEQEKTVYVQTDGIS----AYWVNETEKIK 75 (297) T ss_pred CCccccccccccccCCCcceechhHHHHHHHHHHh-hchhhhhcceeecCCCccEEEEEEcCCce----eEEeecCcccc Confidence 554333222222 222111111 1122344666777655443 344332111 12234455555 Q ss_pred ccccCcCccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceeccccccc Q lcl|Aclame:pro 66 EVEFSATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQW 145 (309) Q Consensus 66 ~ve~~~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~W 145 (309) ..+..+.+.++..++.+-..+++++..++. .++....-.+.+.+.+....|..+- ++..- ......++....- T Consensus 76 ~~~~~f~~v~l~~~k~~~~~~is~ell~ds--~~~l~~~i~~~la~ai~~~~d~a~l----~G~g~-~~~~gi~~~~~~~ 148 (297) T protein:vir:95 76 TDKPEVVPVTLKAHKLGIILVTSREALNYT--WKKFFEDMKPQIVEAFYKKIDEAGL----LGHDT-PFANSVAKAAKDA 148 (297) T ss_pred ccccceeEEEEeeEEEEEeehhhHHHHhcC--HHHHHHHHHHHHHHHHHHHHHHHHh----cccCC-ccccccccccccc Confidence 555666666666666666667777655543 2445555556666666665554442 22111 0111111111111 Q ss_pred CC--CCCChHHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecce Q lcl|Aclame:pro 146 SD--PTSNPLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEAR 220 (309) Q Consensus 146 sd--~~sdPi~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~ 220 (309) .. ++..-..+|.+...++ +..++..+|+++.|.+|+. ++. .++.. +..+ .-..++|+|-+ +.... T Consensus 149 ~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~---l~d----~~G~~-i~~~-~~~~l~G~Pv~-~~~~~ 218 (297) T protein:vir:95 149 NKVIGGPINYDNILKLQDALYDADVEPNAFVSKIQNRSALRE---ARD----GNKVS-IYDK-AANTIDGITTV-DLKSA 218 (297) T ss_pred ceecccccCHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHH---hhc----cCCce-eecC-CCCcccceeeE-eecCC Confidence 11 1222355666666554 7788999999999998864 222 12111 1111 11346677733 22110 Q ss_pred eeccccCCCcccceecCCcEEEEecCCCCCCcCcceeccccccccc--------ccCCccccccccCCceEEEeecccce Q lcl|Aclame:pro 221 LNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDR--------VSGSIADPNIGLRGGQRVRVGESVKE 292 (309) Q Consensus 221 ~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~--------~~~~~~d~~~g~~g~~~v~v~~~~~~ 292 (309) ....+ .-++++..-+++.... +.+.+.... ..+... ..-..+...+|+.+.++- T Consensus 219 -----~~~~~--~~~~gd~s~~~~~~~~---------~~~i~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~r~~~~~d~ 280 (297) T protein:vir:95 219 -----RFEKG--DLLAGDFDNLIYGVPY---------NITYKISEEGQISTITNADGTPI--NLFEQEMIAIRATMDIAV 280 (297) T ss_pred -----CCCCc--eEEEEecccEEEEEec---------CeEEEEeeccccccccccCccch--hhhhcCcEEEEEEEEecc Confidence 00111 1122322111111111 111111000 001100 111234556777777777 Q ss_pred eeecchhhhhhhccccC Q lcl|Aclame:pro 293 LVTAPDLGFFFENAVAA 309 (309) Q Consensus 293 ~v~~~~~G~l~~~~va~ 309 (309) .+.-+++=..++.|.=- T Consensus 281 ~v~~~~a~~~l~~at~~ 297 (297) T protein:vir:95 281 MITKTDAFAKLTPAERV 297 (297) T ss_pred EeecccceEEEeecCCC Confidence 77777665555322111 No 113 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=76.72 E-value=0.13 Score=25.40 Aligned_cols=279 Identities=10% Similarity=0.025 Sum_probs=121.5 Q ss_pred CCCC------CCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCcc Q lcl|Aclame:pro 1 MSNA------PFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDE 74 (309) Q Consensus 1 m~~~------~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~ 74 (309) |+.. -..+..+.+.+-..-++..-| ..+++.+++.....++|++.....+ .-++.++.....+..+.+. T Consensus 10 ~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i-~~l~~~~~~~~~~~~ip~~~~~~~a----~wv~Eg~~~~~s~~~f~~v 84 (397) T protein:vir:23 10 IAQTKDTMFTGYLDPVQAKDYFAEAEKTSIV-QRVAQKIPMGATGIVIPHWTGDVSA----QWIGEGDMKPITKGNMTKR 84 (397) T ss_pred HhhccCCCCccccchhHHHHHHHHHHhccch-hhhcceeeccCCceEEEEEcCCcce----EEecCCccccccccceeEE Confidence 2211 133333334433222222223 3467888888877888888543221 2345556666666777777 Q ss_pred ceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceeccccc-ccCCCCCChH Q lcl|Aclame:pro 75 TGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGAD-QWSDPTSNPL 153 (309) Q Consensus 75 ~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~-~Wsd~~sdPi 153 (309) ++..+..+-..+++++-.++. .++.+....+.+.+.+....|..+ +++..-+......+..+. ...-.+.... T Consensus 85 ~l~~~k~~~~v~iS~ell~ds--~~~l~~~i~~~l~~aia~~~d~a~----l~G~gt~~~~~~~~~~~~~~~~~~~~~~~ 158 (397) T protein:vir:23 85 DVHPAKIATIFVASAETVRAN--PANYLGTMRTKVATAIAMAFDNAA----LHGTNAPSAFQGYLDQSNKTQSISPNAYQ 158 (397) T ss_pred EEeeEEEEEeehhhHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHH----hhcccCCcccccccccccceeeecccchh Confidence 777777776677777755533 355666666677777766665543 222211111111111110 1111222233 Q ss_pred HHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCH---------HHHHHHhCCCeEEeeccee Q lcl|Aclame:pro 154 PVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPM---------AFLQELLELDAIYIGEARL 221 (309) Q Consensus 154 ~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~---------~~l~~l~gl~~I~v~~a~~ 221 (309) .++.+....+ +..++..+|+++.+.+|+. +++ .++.. +..+ -.-..++|+| |++.+.. T Consensus 159 ~~~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~---lkd----~~G~~-i~~~~~~~~~~~~~~~~tl~G~P-v~~s~~~- 228 (397) T protein:vir:23 159 GLGVSGLTKLVTDGKKWTHTLLDDTVEPVLNG---SVD----ANGRP-LFVESTYESLTTPFREGRILGRP-TILSDHV- 228 (397) T ss_pred HHHHHHHHhhhhcccCCCEEEEcHHHHHHHHH---hhc----cCCce-eecccccccccccccCceeeeee-EEEeCCC- Confidence 4444444333 6678899999999988775 222 12211 1100 0112356766 3332211 Q ss_pred eccccCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhh Q lcl|Aclame:pro 222 NIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGF 301 (309) Q Consensus 222 ~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~ 301 (309) +. .+..-++++-.-+++....+-.. ..+=-.+.+.+....+.. .+.-..+...+|+.+.++-.+.-+++-. T Consensus 229 ----~~--g~~~~~~gDfs~~~i~~~~~i~i-~~~~e~~~~~~~~~~~~~--~~lf~~d~v~~ra~~r~d~~v~~~~a~~ 299 (397) T protein:vir:23 229 ----AE--GDVVGYAGDFSQIIWGQVGGLSF-DVTDQATLNLGSQESPNF--VSLWQHNLVAVRVEAEYGLLINDVNAFV 299 (397) T ss_pred ----CC--CceEEEEeecceEEEEEEeceEE-EEeeeeeeeeccccccce--eeeeeccceeEEEEeeeccceecccceE Confidence 10 11111222211111111000000 000000111000000000 0011223345666666666666666655 Q ss_pred hhhccccC Q lcl|Aclame:pro 302 FFENAVAA 309 (309) Q Consensus 302 l~~~~va~ 309 (309) .++....+ T Consensus 300 ~~~~~~~~ 307 (397) T protein:vir:23 300 KLTFDPVL 307 (397) T ss_pred EEeecccc Confidence 55544433 No 114 >protein:vir:102335 Length: 312 # NCBI annotation: putative capsid protein # Family: family:all:701 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529560;genbank:gi:90592716;genbank:GeneID:3974467 Probab=75.79 E-value=0.14 Score=25.22 Aligned_cols=272 Identities=14% Similarity=0.081 Sum_probs=105.3 Q ss_pred CCCCCCCcchhhHHHHHhhcch---hhhhhhhCCcc-----cc---ccccceeEEechhHhhhchhHhhccccccccccc Q lcl|Aclame:pro 1 MSNAPFPIDPELTAIAIAYRNG---RMISDEVLPRV-----PV---GKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEF 69 (309) Q Consensus 1 m~~~~f~~dp~LT~~a~~y~n~---~~ig~~lfP~v-----~v---~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~ 69 (309) |||+. ++|.-|.+. .+....++... .| ...+.|+|+..- ..+.. =.|.-+...++.+. T Consensus 1 Mantl--------~ya~~~~~~LD~~~~~~~~s~~l~~~~~~v~~~ggktVkIp~i~~-~gl~D--Y~R~~g~~~~~g~v 69 (312) T protein:vir:10 1 MANTL--------AYGQVLQQGLDKQATQELLTGWMDSNAKQIKYEGGKEVKIGKLST-DGLGD--YSRGSANAYVGGDV 69 (312) T ss_pred CCcch--------hHHHHHHHHHHHHHHhhhccccccCCCceEEEecCcEEEEEeeec-ccccc--cccccCCccccccc Confidence 88652 444444332 23333333321 12 223344444432 22322 12222233444455 Q ss_pred CcCccceeee-ccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHH-HHHHHHHhhcccccCcccceecccccccCC Q lcl|Aclame:pro 70 SATDETGSTE-DHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDR-EARTSKLVFSPNSYAAGNKTTLSGADQWSD 147 (309) Q Consensus 70 ~~~~~~~~~~-e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~-E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd 147 (309) +...+++.+. +++....|+.-+.++.+...+-....-+...+.+-=.. -.++++++.....-......+.+.+ - T Consensus 70 ~~~~et~tl~qDR~~~F~vD~mDvDETn~~~s~anv~~ef~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~----~ 145 (312) T protein:vir:10 70 KFEYETKTMTQDRGRKFTLDAMDVDETNFLVTATTVMGEFQRLKVIPEIDAYRLSRLATIAIGIKGDTNVEYSYS----V 145 (312) T ss_pred cccceeEEeeecccceeeccccchhhHhhHHHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhccccccccccccc----c Confidence 5555555543 44566667655555543222111111111111111111 1245565554433222222221110 1 Q ss_pred CCCChHHHHHHHHHHh---CCC-CcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeec Q lcl|Aclame:pro 148 PTSNPLPVITDALDSV---ILR-PNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNI 223 (309) Q Consensus 148 ~~sdPi~di~~~~~~~---g~~-Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~ 223 (309) ...+.+..|++++..+ |+. +-+|.+++.++.+|++. ....+.......+.| .-.+.++=|++-|.|-+++..+ T Consensus 146 T~~ni~~~i~~~~~~lde~~vp~~rvl~vTp~~~~lLk~~--~~~~~~~~~~~~~~i-~~~V~~iDgv~Ii~VPs~r~~t 222 (312) T protein:vir:10 146 NSSTIINKIKTGIKIIRENGYNGPLVCHLTYDSMFAIEEK--VLEKLTAVTFAQGGI-QTQVPSIDGCALIKTPQNRMYS 222 (312) T ss_pred CHHHHHHHHHHHHHHHHHccCCCceEEEeChHHHHHHhhh--hhceeccccccccee-eeeeeeecccEEEEchhhhccc Confidence 2356888888887665 543 56799999999888753 334443333334444 3455666677777776666543 Q ss_pred cccCCCcc--------cce-ecCCc--EEEEecC-CCCCC-----------cCcceecccccccccccCCccccccccCC Q lcl|Aclame:pro 224 ARPGQNPN--------LIR-AWGPH--ASFIYRD-RLADT-----------RNGTTFGLTAQWGDRVSGSIADPNIGLRG 280 (309) Q Consensus 224 ~~~g~~~~--------~~~-v~~~~--~~L~~~~-~~~~~-----------~~~~t~G~T~~~~~~~~~~~~d~~~g~~g 280 (309) +..=.+++ ... -=+.+ .++.+.. +.+.. ......||.+++ |...-.+...-. .. T Consensus 223 ~~~f~dG~t~~~~~gg~~~~~~ak~INfiiv~~~a~i~~~K~~~~~if~P~~~~~~d~~~~~~--R~Y~D~fv~~nk-~~ 299 (312) T protein:vir:10 223 SILLNDGTTSNQTAGGYLKGTKALDTNFIIAPVDVPLAITKQDKMRIFDPETNQTANAWSMDY--RRYHDLWVTDNK-AN 299 (312) T ss_pred eeeeccCcccccccCceeecCcccccceEEeCCceeeceeeeeeeeeeCCCCCCCcceeeeee--eeeeeeeeeccc-cC Confidence 32100000 000 00111 1111111 01110 011123344332 222211111000 11 Q ss_pred ceEEEeecccceeeecchhh Q lcl|Aclame:pro 281 GQRVRVGESVKELVTAPDLG 300 (309) Q Consensus 281 ~~~v~v~~~~~~~v~~~~~G 300 (309) +.++-+ -.++..| T Consensus 300 ~Iyv~~-------k~a~~~~ 312 (312) T protein:vir:10 300 SVYANF-------KDAKPVG 312 (312) T ss_pred eEEEEe-------ecccCCC Confidence 111111 1122222 No 115 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=74.82 E-value=0.15 Score=25.04 Aligned_cols=262 Identities=13% Similarity=0.087 Sum_probs=108.5 Q ss_pred CCCCCCCcchhhHHHHH-hhcchhhhhhhhCCc-c-ccccc--cceeEEechhHhhhchhHhhcccccccccccCcCccc Q lcl|Aclame:pro 1 MSNAPFPIDPELTAIAI-AYRNGRMISDEVLPR-V-PVGKQ--EFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDET 75 (309) Q Consensus 1 m~~~~f~~dp~LT~~a~-~y~n~~~ig~~lfP~-v-~v~~~--~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~ 75 (309) ||+..|.+.-. ++.++ .+++.-.++. ++.+ . ...+. +.++|+.+. ....+- .+.++....-++...+.+ T Consensus 1 MA~~~~~pe~~-~~~v~~~~~~~lv~~~-l~~~~~~~~~~~Gdtv~ip~~~~---~~~~d~-~~~~~~~~~~~~~~~~~~ 74 (273) T protein:vir:10 1 MAFNNFIPELW-SDMLLEEWTAQTVFAN-LVNREYEGTASKGNVVHIAGVVA---PTVKDY-KAAGRQTSADAISDTGVD 74 (273) T ss_pred CcchhhhHHHH-HHHHHHHHHhhhccch-hhccccccccccCceEEEeeccc---cccccc-ccCCCccCccccccceEE Confidence 99998876644 44443 4544322332 2222 1 11122 223344332 222111 112222222234444455 Q ss_pred eeeecc-chhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCChHH Q lcl|Aclame:pro 76 GSTEDH-GLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPLP 154 (309) Q Consensus 76 ~~~~e~-~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sdPi~ 154 (309) +.++.+ .....|+..|.. +..++.+. .++.....|....+..++..+..... + .+++. .. ..++.+. T Consensus 75 ~tid~~~~~~~~i~d~d~~--~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~-----~--~~~~~-~~-~~~~~~~ 142 (273) T protein:vir:10 75 LLIDQEKSIDFLVDDIDRV--QVAGSLEA-YTRAGATALATDTDKFIADMLVDNGT-----A--LTGSA-PT-DADDAFD 142 (273) T ss_pred EEEeeeeecceEeecHHHh--hhhccHHH-HHHHHHHHHHHHHHHHHHHHHhcccc-----c--ccccc-cc-chhHHHH Confidence 555443 444556654433 33455432 44444444443334444443332211 1 11110 00 1134556 Q ss_pred HHHHHHHHhCC--CC---cEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeeccccCCC Q lcl|Aclame:pro 155 VITDALDSVIL--RP---NIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPGQN 229 (309) Q Consensus 155 di~~~~~~~g~--~P---n~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~~~~g~~ 229 (309) .|.++..++.- -| -.++++++.+..|+..+.........+. .+.+-.-++..++|++ |+..... ..+. T Consensus 143 ~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~-~~~l~~G~ig~i~G~~-v~~s~~l----p~~~- 215 (273) T protein:vir:10 143 LIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGD-AAGLRAGTIGNLLGAR-IVESNNL----RDTD- 215 (273) T ss_pred HHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhcccc-ccceeeeeeeEEeceE-EEEeccc----ccCC- Confidence 66666555521 14 3699999999999998864443332222 2223233455667765 3321111 0110 Q ss_pred cccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhcccc Q lcl|Aclame:pro 230 PNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVA 308 (309) Q Consensus 230 ~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va 308 (309) +...+++.. ..+|+--++.. .+.......-+..|+....+...|+-+++=..|+..-+ T Consensus 216 --------~~~~~~~~~--------~A~~~a~q~~~-----~e~~r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 216 --------DEQFVAFHP--------SAAAYVSQIDT-----VEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred --------ccEEEEEec--------cceeeeeeeeh-----hhcccCCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 111222221 01222112111 00000001113456666666666666665555544333 No 116 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=74.82 E-value=0.15 Score=25.04 Aligned_cols=262 Identities=13% Similarity=0.087 Sum_probs=108.5 Q ss_pred CCCCCCCcchhhHHHHH-hhcchhhhhhhhCCc-c-ccccc--cceeEEechhHhhhchhHhhcccccccccccCcCccc Q lcl|Aclame:pro 1 MSNAPFPIDPELTAIAI-AYRNGRMISDEVLPR-V-PVGKQ--EFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDET 75 (309) Q Consensus 1 m~~~~f~~dp~LT~~a~-~y~n~~~ig~~lfP~-v-~v~~~--~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~ 75 (309) ||+..|.+.-. ++.++ .+++.-.++. ++.+ . ...+. +.++|+.+. ....+- .+.++....-++...+.+ T Consensus 1 MA~~~~~pe~~-~~~v~~~~~~~lv~~~-l~~~~~~~~~~~Gdtv~ip~~~~---~~~~d~-~~~~~~~~~~~~~~~~~~ 74 (273) T protein:vir:10 1 MAFNNFIPELW-SDMLLEEWTAQTVFAN-LVNREYEGTASKGNVVHIAGVVA---PTVKDY-KAAGRQTSADAISDTGVD 74 (273) T ss_pred CcchhhhHHHH-HHHHHHHHHhhhccch-hhccccccccccCceEEEeeccc---cccccc-ccCCCccCccccccceEE Confidence 99998876644 44443 4544322332 2222 1 11122 223344332 222111 112222222234444455 Q ss_pred eeeecc-chhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCChHH Q lcl|Aclame:pro 76 GSTEDH-GLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPLP 154 (309) Q Consensus 76 ~~~~e~-~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sdPi~ 154 (309) +.++.+ .....|+..|.. +..++.+. .++.....|....+..++..+..... + .+++. .. ..++.+. T Consensus 75 ~tid~~~~~~~~i~d~d~~--~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~-----~--~~~~~-~~-~~~~~~~ 142 (273) T protein:vir:10 75 LLIDQEKSIDFLVDDIDRV--QVAGSLEA-YTRAGATALATDTDKFIADMLVDNGT-----A--LTGSA-PT-DADDAFD 142 (273) T ss_pred EEEeeeeecceEeecHHHh--hhhccHHH-HHHHHHHHHHHHHHHHHHHHHhcccc-----c--ccccc-cc-chhHHHH Confidence 555443 444556654433 33455432 44444444443334444443332211 1 11110 00 1134556 Q ss_pred HHHHHHHHhCC--CC---cEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeeccccCCC Q lcl|Aclame:pro 155 VITDALDSVIL--RP---NIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPGQN 229 (309) Q Consensus 155 di~~~~~~~g~--~P---n~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~~~~g~~ 229 (309) .|.++..++.- -| -.++++++.+..|+..+.........+. .+.+-.-++..++|++ |+..... ..+. T Consensus 143 ~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~-~~~l~~G~ig~i~G~~-v~~s~~l----p~~~- 215 (273) T protein:vir:10 143 LIAKALKELTKANVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGD-AAGLRAGTIGNLLGAR-IVESNNL----RDTD- 215 (273) T ss_pred HHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhcchhhhhhhhcccc-ccceeeeeeeEEeceE-EEEeccc----ccCC- Confidence 66666555521 14 3699999999999998864443332222 2223233455667765 3321111 0110 Q ss_pred cccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhcccc Q lcl|Aclame:pro 230 PNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVA 308 (309) Q Consensus 230 ~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va 308 (309) +...+++.. ..+|+--++.. .+.......-+..|+....+...|+-+++=..|+..-+ T Consensus 216 --------~~~~~~~~~--------~A~~~a~q~~~-----~e~~r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 216 --------DEQFVAFHP--------SAAAYVSQIDT-----VEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred --------ccEEEEEec--------cceeeeeeeeh-----hhcccCCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 111222221 01222112111 00000001113456666666666666665555544333 No 117 >protein:vir:99576 Length: 388 # NCBI annotation: hypothetical protein # Family: family:all:1653 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Probab=73.85 E-value=0.17 Score=24.87 Aligned_cols=266 Identities=12% Similarity=0.072 Sum_probs=100.0 Q ss_pred CCCCC------CCcc-hhhHHHHHhhcch--------hhhhhhhCCccccccccceeEEechhHhhhchhHhhccccccc Q lcl|Aclame:pro 1 MSNAP------FPID-PELTAIAIAYRNG--------RMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPN 65 (309) Q Consensus 1 m~~~~------f~~d-p~LT~~a~~y~n~--------~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~ 65 (309) ||.-+ +..+ +++ ...+.|..| .+.++.|||...++.=..++.+|.-.+.- -....-+-+++.. T Consensus 65 ~a~da~~~~~~t~~~~gip-~~~~~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~~~~f~v~e~~-G~A~~ygd~~D~P 142 (388) T protein:vir:99 65 QAFDSAYVAPTTQASIPTP-IQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPA-GTAMEYGDLTNIP 142 (388) T ss_pred cccCcccccccccCcccHH-HHHhhhhccceeeeeechhhhhhhccccccCCccceeEEEeeeecc-eeEEEeecccCCC Confidence 12111 1111 111 112334433 36788888876653322334444321100 0000111122222 Q ss_pred ccc--cCcCccceeeeccchhhcCCHHHHHHHh-hcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcc-------cccCccc Q lcl|Aclame:pro 66 EVE--FSATDETGSTEDHGLDAPVPQADIDNAP-TNYNPLGHATEQTTNLILLDREARTSKLVFSP-------NSYAAGN 135 (309) Q Consensus 66 ~ve--~~~~~~~~~~~e~~L~~~v~~~~~~~a~-~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~-------~~y~~~~ 135 (309) .+. .....++...-+-+. .+..+|...|+ .++|...+-.+..++.++.. .=++++.+ ..|+.-| T Consensus 143 l~d~~~~~~~r~v~~~~~g~--~yg~~El~~A~~~g~~l~~~Ka~AA~~ale~~----~N~i~f~G~~g~~~~~~yGllN 216 (388) T protein:vir:99 143 LSSWNVNFERRTIVRGEMGI--QVGLLEEGRASAMRINSAEVKRQGAAVQLEIM----RNAIGFYGWEGKNGNRTFGFLN 216 (388) T ss_pred ceeccceeeeeeEEEEEeee--eecHHHHHHHHhhCCCcHHHHHHHHHHHHHhh----hceEEEEeecCCCccceEEEee Confidence 222 223333433333333 34455555543 45665554443333333222 22333322 1221111 Q ss_pred ------ceec---ccccccCCCCCC-hHHHHHHHHHHh-----CC-----CCcEEEeCHHHHHHHhcCHHHHHHhccCCC Q lcl|Aclame:pro 136 ------KTTL---SGADQWSDPTSN-PLPVITDALDSV-----IL-----RPNIGVLGRRTATILRRHPKIVKAYNGSLG 195 (309) Q Consensus 136 ------~~~l---sgt~~Wsd~~sd-Pi~di~~~~~~~-----g~-----~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~ 195 (309) .+.. .+..+|.+.+.+ .+.||.++...+ |. .|.+++|.+..+..|.+- +. T Consensus 217 dP~l~a~v~at~~~~~~~Wa~kT~~eI~~Di~~~~~~i~~qs~g~~~~~~~~~tL~LP~~~~~~Ls~~---------n~- 286 (388) T protein:vir:99 217 DPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV---------TD- 286 (388) T ss_pred CCCcccccccccCCcCcccccCCHHHHHHHHHHHHHHHHHhcCCeeeecccceEEEechHHHHhcccc---------Cc- Confidence 1222 233468887765 789999998765 22 355899999999998521 11 Q ss_pred cccccC-HHHHHHHh-CCCeEEeecceeeccccCCCcccceecCCcEEEEecCCCCC-----CcCcceecc--ccccc-- Q lcl|Aclame:pro 196 DEGMVP-MAFLQELL-ELDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLAD-----TRNGTTFGL--TAQWG-- 264 (309) Q Consensus 196 ~~~~vt-~~~l~~l~-gl~~I~v~~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~-----~~~~~t~G~--T~~~~-- 264 (309) .+ +| .+.|++-| +|.-+-+ .-+..+...+ +.+++.+|...... .....++-. +..+. T Consensus 287 -~g-~Tvl~~lk~n~Pnl~i~t~--pEl~~a~~tg--------g~~~~~~~~~~~~~~~~~~~~~~~t~~~~~p~~~~~l 354 (388) T protein:vir:99 287 -LG-ISVRDWLKQTYPRVRVMSA--PELQGGNPDD--------GKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTL 354 (388) T ss_pred -CC-ccHHHHHHHhcCCcEEEEe--cccccccccC--------CceeEEEEecccccccccCccCcceeEEecccccccc Confidence 12 22 35566654 2321111 1111111111 12233334332211 111122111 11110 Q ss_pred --ccccCCccccccccCCceEEEeecccceeeecchhhh Q lcl|Aclame:pro 265 --DRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGF 301 (309) Q Consensus 265 --~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~ 301 (309) .+....+..++...-+|..|+ .|.-+..-.|. T Consensus 355 ~vq~~~~~~~~~~~~rt~Gv~ir-----~P~Ai~~~~GI 388 (388) T protein:vir:99 355 GVEKRVKNYVEAYSNATAGVMLK-----RPWAVVRLIGL 388 (388) T ss_pred cceecCceeEeccccceeeeEEe-----ccchhheeccC Confidence 011112222222222332222 23333333333 No 118 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=71.84 E-value=0.19 Score=24.53 Aligned_cols=265 Identities=13% Similarity=0.120 Sum_probs=100.9 Q ss_pred CCCC--C---C-------CcchhhHHHHHhhcchh--------hhhhhhCCccccccccceeEEechhHhhhchhHhhcc Q lcl|Aclame:pro 1 MSNA--P---F-------PIDPELTAIAIAYRNGR--------MISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGR 60 (309) Q Consensus 1 m~~~--~---f-------~~dp~LT~~a~~y~n~~--------~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~ 60 (309) |-.. . + ..+|-+-++...|. |. +.++.|||...++.-..+..+|.-.+.- =....-+- T Consensus 56 md~~~~~~~~~~~~~l~~~~~~g~~~~l~~~~-p~~i~~~tap~~a~~l~pv~t~g~W~~~~~~~~v~e~~-G~A~~ygd 133 (379) T protein:vir:10 56 MDSNDIGPIPTPLSPLSPVSIPGLIQFLQNWL-PGHVRILTAVREADEFLGLSTVGQWDDEQIVQRVLEGL-GTAQPYTD 133 (379) T ss_pred hccccccccccccCccccccccchHHHHHhhc-chHHHHHhhhhhhhhhcccccCCCceeeeEEEeeeeee-eeeEEecc Confidence 3311 1 1 11222224445554 33 5678888866654443344444321100 00001111 Q ss_pred cccccccc--cCcCccceeeeccchhhcCCHHHHHHH-hhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhccc---c---c Q lcl|Aclame:pro 61 KSKPNEVE--FSATDETGSTEDHGLDAPVPQADIDNA-PTNYNPLGHATEQTTNLILLDREARTSKLVFSPN---S---Y 131 (309) Q Consensus 61 ~~~~~~ve--~~~~~~~~~~~e~~L~~~v~~~~~~~a-~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~---~---y 131 (309) +++...+. .....++...-+-+. .+...|...+ ..+++...+-.+..++ ..|+..-++++.+. . | T Consensus 134 ~~d~pl~d~~~~~~~r~v~~~~~g~--~yg~~El~~Aa~~g~~l~~~Ka~aA~~----ale~~~N~i~f~G~~d~~~~~y 207 (379) T protein:vir:10 134 GGNMALMSWTPTFETRTVVRFEAGL--QVAPLEEARSSRVQVSSADEKRAMVGE----ALEVQRNRVAFYGYNDGSGRTF 207 (379) T ss_pred ccCCCeeeeeeeeeeeeeEEEEEEE--eecHHHHHHHHHhCCChHHHHHHHHHH----HHHHhhceEEEEeecCCCcceE Confidence 22221222 222223332222222 2333444433 3455554443333332 22333333444331 1 1 Q ss_pred Cccc------ceec----ccccccCCCCCC-hHHHHHHHHHHh-----CC-----CCcEEEeCHHHHHHHhcCHHHHHHh Q lcl|Aclame:pro 132 AAGN------KTTL----SGADQWSDPTSN-PLPVITDALDSV-----IL-----RPNIGVLGRRTATILRRHPKIVKAY 190 (309) Q Consensus 132 ~~~~------~~~l----sgt~~Wsd~~sd-Pi~di~~~~~~~-----g~-----~Pn~~v~~~~~~~~l~~~~~i~~~~ 190 (309) +.-| -.+. ++..+|.+.+.+ .+.||.++...+ |. .|.+++|.+..+..|.+- T Consensus 208 GllNdP~l~a~~t~atg~~~~t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~~~~~~~tL~LP~~~~~~L~~~------- 280 (379) T protein:vir:10 208 GFLNDPNLPAYVAVPNGAGGSPLWAQKTTLEIIADLRNGLTALQVQSMGRIKSNKTPITIGIPNAYENYITTP------- 280 (379) T ss_pred EEEeCCCCcccccccCCcccccccccCCHHHHHHHHHHHHHHHHHhhCCeecccccceeEEecHHHHHhhccc------- Confidence 1111 0111 133569888765 789999986654 32 355999999999998531 Q ss_pred ccCCCcccccCHHHHHHHh-CCCeEEeecceeeccccCCCcccceecCCcEEEEecCCCCC-CcCc-ce--ecccccccc Q lcl|Aclame:pro 191 NGSLGDEGMVPMAFLQELL-ELDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLAD-TRNG-TT--FGLTAQWGD 265 (309) Q Consensus 191 ~~~~~~~~~vt~~~l~~l~-gl~~I~v~~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~-~~~~-~t--~G~T~~~~~ 265 (309) +. .+.-=.+.|++-| +|. |+. ..-+..+ ++.+ +..++|.+...+ ..+. .+ +.++-++.. T Consensus 281 --n~--~g~Tvl~~lk~n~Pnl~-i~t-~pEL~~a--ggg~--------~~~~~~~~~~~~~~t~~~~~~~~~~p~k~~~ 344 (379) T protein:vir:10 281 --TE--LGYSVAQYMRESYPNVT-FVS-APELNDA--NGGS--------SAIYYYADAVENNGTDDGRTWLQVVPTKMFT 344 (379) T ss_pred --cc--cCccHHHHHHHhcCCcE-EEE-ccccccc--CCCc--------cEEEEEeeccCCCccCCcceEEEecchhhhh Confidence 11 1211134555543 332 211 1112221 1111 223333332211 1111 11 222221111 Q ss_pred ----cccCCccccccccCCceEEEeecccceeeecchhhh Q lcl|Aclame:pro 266 ----RVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGF 301 (309) Q Consensus 266 ----~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~ 301 (309) +....+..++...-+|..|+ .|.-++.-.|- T Consensus 345 l~ve~~~~~~~~~~~~rt~Gv~ir-----~P~Ai~~~~G~ 379 (379) T protein:vir:10 345 LGVEKKIKGYAEGYTNATAGAMLK-----RPFATYRQTGA 379 (379) T ss_pred ccceecCceeEeccccceeeeeee-----cchhhheecCC Confidence 11112222333333443333 34433443444 No 119 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=71.79 E-value=0.19 Score=24.52 Aligned_cols=279 Identities=13% Similarity=0.079 Sum_probs=105.1 Q ss_pred CC-----CCCCCcchhhHHHHHhhcchhhhhhhhCC-ccccccccceeEEechhHhhhchhHhhcccccccccccCcCcc Q lcl|Aclame:pro 1 MS-----NAPFPIDPELTAIAIAYRNGRMISDEVLP-RVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDE 74 (309) Q Consensus 1 m~-----~~~f~~dp~LT~~a~~y~n~~~ig~~lfP-~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~ 74 (309) +. ....++....+.|-. .-.+..+-..+.. .+|+.....++|++.....+ .-++-++........+... T Consensus 127 ~~~~~~~gg~liP~~~~~~ii~-~l~~~~~l~~~~~~~~~~~~g~~~~p~~~~~~~a----~~v~Eg~~~~~~~~~f~~i 201 (428) T protein:vir:10 127 ISTAAGSGGVLIPQNIHSEVIE-LLRDRTIVRKLGARSIPLPNGNMSLPRLAGGATA----SYTGENQDAKVSEARFDDV 201 (428) T ss_pred hcccccCCccccchhHHHHHHH-HHhhhchhhhhcceeeecCCcceEEEEEeCCcce----eeeccCccccccccceeeE Confidence 11 112334333333321 1111111112212 23444444567776432111 1234455555555666666 Q ss_pred ceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceeccccc------ccCCC Q lcl|Aclame:pro 75 TGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGAD------QWSDP 148 (309) Q Consensus 75 ~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~------~Wsd~ 148 (309) ++..+..+-..+++++-..+. .++....-.+.+.+.|....|..+ +++..-+......++.+. .+... T Consensus 202 ~~~~~k~~~~v~is~ell~ds--~~~l~~~i~~~l~~ai~~~~d~~~----l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~ 275 (428) T protein:vir:10 202 KLTAKTMIAMVPISNALIGRA--GFNVEQLVLQDILTAISVREDKAF----MRDDGTGDTPIGMKARATQWNRLLPWAAD 275 (428) T ss_pred EeeeEEEEEeehhhHHHHhhh--hHHHHHHHHHHHHHHHHHHHHHHH----hccCCCCcccccccccccccccccccccc Confidence 666666665667777755543 234455555666666665555433 332211111111111111 12223 Q ss_pred CCChHHHHHHHHHHhC---------CCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecc Q lcl|Aclame:pro 149 TSNPLPVITDALDSVI---------LRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEA 219 (309) Q Consensus 149 ~sdPi~di~~~~~~~g---------~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a 219 (309) .+..+..++.+.+.+. ......+|++..|..|.. ++ ..++.+ +..+..=..++|+| |++-+. T Consensus 276 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~---lk----d~~G~~-i~~~~~~g~l~G~p-v~~~~~ 346 (428) T protein:vir:10 276 AAVNLDTIDTYLDSIILMSMDGNSNMISSGWGMSNRTYMKLFG---LR----DGNGNK-VYPEMAQGMLKGYP-IQRTSA 346 (428) T ss_pred ccccHHHHHHHHHHHHHhhhccccccccCEEEEcHHHHHHHHH---hh----ccCCce-eccCCCCCeeecee-eEEecc Confidence 3444566666666541 234567999999988754 22 122222 11110001367887 444332 Q ss_pred ee-eccccCCCcccceecCCcEEEEecCCCCCCcCcceeccccccc-ccccCCccccccccCCceEEEeecccceeeecc Q lcl|Aclame:pro 220 RL-NIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWG-DRVSGSIADPNIGLRGGQRVRVGESVKELVTAP 297 (309) Q Consensus 220 ~~-~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~-~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~ 297 (309) .- +.. .+. ....-++++-.-++... . .+...-.+-+-. ....+... ..-..+...+|+.+.++-.+.-| T Consensus 347 ~p~~~~-~~~-~~~~i~~gd~s~~~i~~-~----~~i~i~~~~~~~~~~~~~~~~--~~f~~~~~~~R~~~r~d~~v~~p 417 (428) T protein:vir:10 347 IPANLG-EGG-KESEIYFADFNDVVIGE-D----GNMKVDFSKEASYIDTDGKLV--SAFSRNQSLIRVVTEHDIGFRHP 417 (428) T ss_pred cccccc-CCC-ccceEEEEecceEEEEE-e----cceEEEeeccccccccccccc--chhhcchhheeeeeeeCceeecc Confidence 21 111 000 01111222211111000 0 011110000000 00000000 01112233456655555444444 Q ss_pred hhhhhhhcccc Q lcl|Aclame:pro 298 DLGFFFENAVA 308 (309) Q Consensus 298 ~~G~l~~~~va 308 (309) ++-.+++++-= T Consensus 418 ~a~~~~t~~~~ 428 (428) T protein:vir:10 418 EGLVLGTGVLF 428 (428) T ss_pred ceEEEEeccCC Confidence 33333322222 No 120 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=69.74 E-value=0.22 Score=24.20 Aligned_cols=264 Identities=10% Similarity=-0.010 Sum_probs=116.6 Q ss_pred CCCCCCCcchhhHHHHHhhcchhhhhhhhCCcccccccc-ceeEEechhHhhhchhHhhcccccccccccCcCccceeee Q lcl|Aclame:pro 1 MSNAPFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQE-FKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDETGSTE 79 (309) Q Consensus 1 m~~~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~-~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~~~~~ 79 (309) -++...++-+....+-........+=..+...+++...+ .++|+...... ..-++.++........+.+.++..+ T Consensus 116 ~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~~~~~~~----a~wv~E~~~~~~~~~~f~~i~~~~~ 191 (390) T protein:vir:62 116 AGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDFTVITGRSS----ASIVGETAEIPESYPATAQRSMGGF 191 (390) T ss_pred cCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcCCcc----eeeecccccccccccceeeeEeeee Confidence 111223333333333322222222223344444554432 46676643211 1223445555555666666666666 Q ss_pred ccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCccc--cee-cccccccCCCCCChHHHH Q lcl|Aclame:pro 80 DHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGN--KTT-LSGADQWSDPTSNPLPVI 156 (309) Q Consensus 80 e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~--~~~-lsgt~~Wsd~~sdPi~di 156 (309) ..+-..+++++-.++ +.+|....-.+.+.+.|....+.. ++++..=|.+- ... .+++.....+++....+| T Consensus 192 k~~~~~~iS~ell~d--s~~~l~~~i~~~l~~~i~~~~d~~----~l~G~G~p~Gi~~~~~~~~~~~~~~~~~~~~~~~l 265 (390) T protein:vir:62 192 KYGFASVVSYEFATD--QVLDLVGFLVSDAGPAIGDAMGRH----FITGTGQPRGILTDASPATATFLATDTDSKVSDAL 265 (390) T ss_pred eEEeehHHHHHHHhh--hhHHHHHHHHHHHHHHHHHHHHhh----hhccCCccccccccccccccceecccccccchHHH Confidence 666666677666554 334555555555666665555443 23322111110 000 011111122333445555 Q ss_pred HHHHHHh--CCCCc-EEEeCHHHHHHHhcCHHHHHHhccCCCcccccCH----HHHHHHhCCCeEEeecceeeccccCCC Q lcl|Aclame:pro 157 TDALDSV--ILRPN-IGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPM----AFLQELLELDAIYIGEARLNIARPGQN 229 (309) Q Consensus 157 ~~~~~~~--g~~Pn-~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~----~~l~~l~gl~~I~v~~a~~~~~~~g~~ 229 (309) .+...++ ..+.| ..+|++..|..|.. ++..++.. +..+ ..-..++|.| |++-+. T Consensus 266 ~~~~~~l~~~~~~~a~~vmn~~~~~~L~~-------lkd~~g~~-l~~~~~~~g~~~~l~G~P-v~~~~~---------- 326 (390) T protein:vir:62 266 IDLFHEVPSAYRANAKYVVNDLRAAQMRK-------LKDANGQY-LWQSGLTVGAPSLFNGKV-VETDDG---------- 326 (390) T ss_pred HHHHHhhhhhhhcCCEEEEchHHHHHHHH-------hhccCCCe-eecCCcCCCccceecccc-eEEecC---------- Confidence 5554444 34455 57999999988753 22223222 1111 1112456766 333211 Q ss_pred cccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhccccC Q lcl|Aclame:pro 230 PNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVAA 309 (309) Q Consensus 230 ~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va~ 309 (309) .|.+.+++ .+- +.-....+++..-.... +.+-..+...++....++-.++-+++-.+++-.-|| T Consensus 327 ------~p~~~i~~-gd~--------s~~~i~~~~~~~v~~~~-~~~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~~~~a 390 (390) T protein:vir:62 327 ------MPADKILF-ADL--------SKYRVRFAGSLRVDRSV-DAKFSTDQIVYRFLQRADGLLVDARGAKVLTVTPGA 390 (390) T ss_pred ------CCCccEEE-eec--------cceeEEeecceEEEeec-cccccCCcEEEEEEEEeCcEeechhheEEEEeecCC Confidence 01111211 110 10001111111100111 222345666788999999999999998888755555 No 121 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=67.81 E-value=0.25 Score=23.92 Aligned_cols=265 Identities=11% Similarity=0.047 Sum_probs=105.0 Q ss_pred CCC--CC--C-CcchhhHHHHHhhcc--------hhhhhhhhCCccccccccceeEEechhHhhhchhHhhccccccccc Q lcl|Aclame:pro 1 MSN--AP--F-PIDPELTAIAIAYRN--------GRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEV 67 (309) Q Consensus 1 m~~--~~--f-~~dp~LT~~a~~y~n--------~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~v 67 (309) .|+ .| . ..++-.-++...|-. +.+.++.|||...++.=..++.+|.-.+. .-....-+-+++...+ T Consensus 34 da~d~~~~~~t~~~~g~~~~l~~~i~p~~~~~~~~~~~~~~l~~v~t~g~W~~~~~~~~~~e~-~G~a~~ygd~~D~P~v 112 (336) T protein:vir:78 34 DAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTTVATYGDYSSDGDS 112 (336) T ss_pred hhhhhccccccCCCcchHHHHHHhcccceeeehhhhhhhhhhcccccCCCccccEEEEeeeec-ceeeEEeecccCCCee Confidence 111 11 1 223322233334543 44788889987665433233444432110 0000111223333344 Q ss_pred ccCcCccceeeeccchhhcCCHHHHHHHh-hcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcc----ccc-----Ccccce Q lcl|Aclame:pro 68 EFSATDETGSTEDHGLDAPVPQADIDNAP-TNYNPLGHATEQTTNLILLDREARTSKLVFSP----NSY-----AAGNKT 137 (309) Q Consensus 68 e~~~~~~~~~~~e~~L~~~v~~~~~~~a~-~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~----~~y-----~~~~~~ 137 (309) ..........+...+.-......|...|+ .+++...+-.+..++.++ +..-+.++-+ ..| |.-... T Consensus 113 d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale----~~~N~~~~~Gd~~~~~~GllN~P~l~a~ 188 (336) T protein:vir:78 113 GTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLA----KFLNGSYLFGVAGLENYGLINDPSLSAP 188 (336) T ss_pred ecceeeEEEEEEEEEeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHH----HhhCeEEEEeccccceEEEEeCCCCCcc Confidence 44444444444445544455556665553 355554433333333332 2111222221 112 211111 Q ss_pred ecccccccCCCCCC-hHHHHHHHHHHh-----CC----CCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHH Q lcl|Aclame:pro 138 TLSGADQWSDPTSN-PLPVITDALDSV-----IL----RPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQE 207 (309) Q Consensus 138 ~lsgt~~Wsd~~sd-Pi~di~~~~~~~-----g~----~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~ 207 (309) +-+.+..|...+.+ .+.||......+ |. .|.+++|....+..|.+- +. .+.-=.+.|++ T Consensus 189 ~t~~~~~w~~~T~~~I~~Di~~~~~~l~~qt~g~~~~~~~~tL~Lp~~~~~~L~~~---------n~--~g~tv~~~lk~ 257 (336) T protein:vir:78 189 ITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKT---------NQ--YGLSAAAKLKE 257 (336) T ss_pred cccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccCC---------Cc--cCccHHHHHHH Confidence 22233446666655 899999887765 32 477999999999998531 11 12211345665 Q ss_pred HhCCCeEEeecc-eeeccccCCCcccceecCCcEEEEecCCCCCCcCcceeccccccc----ccccCCccccccccCCce Q lcl|Aclame:pro 208 LLELDAIYIGEA-RLNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWG----DRVSGSIADPNIGLRGGQ 282 (309) Q Consensus 208 l~gl~~I~v~~a-~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~----~~~~~~~~d~~~g~~g~~ 282 (309) -| |.+.+-.. -+.++ ++ +.+.+|.....+.. .-...++-.+. ......+..++...-+|. T Consensus 258 n~--Pnl~i~t~pel~~A--gg----------~~~~~~~~~~~~~~-t~~~~~p~~f~~lpvq~~~~~~~v~~~~rt~Gv 322 (336) T protein:vir:78 258 IF--PKLEFVTIPEYDTA--SG----------RLVQLWAPRVEGKD-TATCGFTEKMRAHSIERYSSYFRQKKSAGTWGA 322 (336) T ss_pred hc--CccEEEEccccccc--Cc----------ceEEEEEeeccCCc-ceeeecchhhhccceeecCceeEeccccceeee Confidence 43 22222111 11111 11 12223322211100 00111111110 011222333333334443 Q ss_pred EEEeecccceeeecchhhh Q lcl|Aclame:pro 283 RVRVGESVKELVTAPDLGF 301 (309) Q Consensus 283 ~v~v~~~~~~~v~~~~~G~ 301 (309) .|+ .|.-+..-.|. T Consensus 323 ~i~-----~P~ai~~~~GI 336 (336) T protein:vir:78 323 VIF-----RPFAVAQMIGV 336 (336) T ss_pred eee-----ccchheeeccC Confidence 332 23333333333 No 122 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=66.79 E-value=0.26 Score=23.77 Aligned_cols=263 Identities=10% Similarity=-0.003 Sum_probs=119.7 Q ss_pred CC-CC-CCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCccceee Q lcl|Aclame:pro 1 MS-NA-PFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDETGST 78 (309) Q Consensus 1 m~-~~-~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~~~~ 78 (309) .+ +. ..++....+.|-.--++..-|. .+++.+|+.....+|++..... ....-++.++.......++.+.++.+ T Consensus 117 ~~~~~g~~vp~~~~~~ii~~~~~~~~l~-~l~~~~~~~~~~~~~~~~~~~~---~~a~~v~E~~~~~~~~~~~~~i~~~~ 192 (395) T protein:vir:43 117 IDGSGGALVAPDRRPGVVAAPQRRLTIR-DLVAPGTTESNSVEYVRETGFV---NNAAPVSEGTQKPYSDLTFELENAPV 192 (395) T ss_pred cCCCCccccchhhHHHHHHHHHhhhhHH-hhccceecCCCceEEEEEecCC---CceeeecCCccccccccceeEEEEee Confidence 11 11 1222223333332222222233 3467788877777888763321 11122344555555566677777777 Q ss_pred eccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccc------cCCCCCCh Q lcl|Aclame:pro 79 EDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQ------WSDPTSNP 152 (309) Q Consensus 79 ~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~------Wsd~~sdP 152 (309) +..+-..+++++-..+.. +....-.+.+.+.+....+. .++++..-...-+..++.... ....+.+. T Consensus 193 ~k~~~~~~is~ell~d~~---~l~~~v~~~la~a~~~~~d~----~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~ 265 (395) T protein:vir:43 193 RTIAHLFKASRQILDDAS---ALQSYIDARARYGLMLVEEC----QLLYGNGTGANLHGIIPQAQAYAPPSGVVVTAEQR 265 (395) T ss_pred eeEEEeehhhHHHHHhHH---HHHHHHHHHHHHHHHHHHHH----HHHhccCCCCccccccccccccccccccccccchh Confidence 777767778877655432 23333344455555444443 233322100000111221111 12234456 Q ss_pred HHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccC---HHHHHHHhCCCeEEeecceeecccc Q lcl|Aclame:pro 153 LPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVP---MAFLQELLELDAIYIGEARLNIARP 226 (309) Q Consensus 153 i~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt---~~~l~~l~gl~~I~v~~a~~~~~~~ 226 (309) +.+|.+.+..+ +..+..++|+++.|.+|+. ++ ..++.. +.. ...-..+||+| |++.+.. + T Consensus 266 ~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~---lk----d~~G~~-i~~~~~~~~~~~l~G~p-Vv~~~~~-----~ 331 (395) T protein:vir:43 266 IDRIRLAILQAQLAEFPASGIVLNPIDWALIEL---NK----DAENRY-IIGSPQNGTTPTLWRLP-VVETQAI-----T 331 (395) T ss_pred HHHHHHHHHhhccccCCCcEEEEcHHHHHHHHH---hh----ccCCce-eccccccCCCceeccee-eEEcCCC-----C Confidence 77887776555 5678899999999998753 22 122211 110 01123467776 4432211 1 Q ss_pred CCCcccceecCCc--EEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhh Q lcl|Aclame:pro 227 GQNPNLIRAWGPH--ASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFE 304 (309) Q Consensus 227 g~~~~~~~v~~~~--~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~ 304 (309) ... -++++- .++++. .-|.+.++.... ...-..+...+|+...++-.+.-+++-..++ T Consensus 332 ~~~----~~~gd~~~~~~~~~----------~~~~~i~~~~~~------~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~ 391 (395) T protein:vir:43 332 QDE----FLTGAFSLGAQIFD----------RMDIEVLVSTEN------DKDFENNMVTIRAEERLAFAVYRPEAFVTGS 391 (395) T ss_pred CCc----EEEEeccceEEEEE----------ecceEEEEeccc------cchhhcCcEEEEEEEeeccEEecccceEEEE Confidence 110 122211 111111 012222221110 0111245556788888888888777755553 Q ss_pred ccccC Q lcl|Aclame:pro 305 NAVAA 309 (309) Q Consensus 305 ~~va~ 309 (309) +-|| T Consensus 392 -~taa 395 (395) T protein:vir:43 392 -LTAS 395 (395) T ss_pred -eccC Confidence 3333 No 123 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=63.85 E-value=0.31 Score=23.37 Aligned_cols=272 Identities=10% Similarity=-0.008 Sum_probs=106.7 Q ss_pred CC----C-CCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCccc Q lcl|Aclame:pro 1 MS----N-APFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDET 75 (309) Q Consensus 1 m~----~-~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~ 75 (309) ++ + ...++..+.+.|..--+...-| ..++..+++. ...++|++... .-..+....+.++.....+..+...+ T Consensus 143 ~~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i-~~~~~~~~~~-~~~~~p~~~~~-~~a~~~~~~~e~~~~~~~~~~f~~v~ 219 (434) T protein:vir:62 143 LGLVTGNGSVTIPDFLSKEIITYAQEENFL-RRLGTGVKTK-ENIKYPVLVKK-AEAQGHKNERTNNEMPETDIEFDEIE 219 (434) T ss_pred hcccccccceecchhhHHHHHHhhhhhhhh-hhhcceeccC-CceEEEEEecC-CcccceecccccccccccccceeeEE Confidence 11 1 1123333222222211111111 2233334443 24567776432 11111122233344444445555555 Q ss_pred eeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccc-cccCCCCCChHH Q lcl|Aclame:pro 76 GSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGA-DQWSDPTSNPLP 154 (309) Q Consensus 76 ~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt-~~Wsd~~sdPi~ 154 (309) +..+..+-..+++++-..+ +.+|......+.+.+.+....|.. ++++..-.......++.+ ..-...+++... T Consensus 220 ~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~la~~~~~~~d~~----~l~G~G~~~~~~g~~~~~~~~~~~~~~~~~d 293 (434) T protein:vir:62 220 LSPTEFDALATVTKKLLAR--TGLPIEQIVMDELKKAYVRKETQY----MVNGDEANNINDGALAKKAVEFKTDEKNLYD 293 (434) T ss_pred eeheeeEeehhhHHHHHhc--chHHHHHHHHHHHHHHHHHHHHHH----HhccCCCCccccceeecccccccccccchhh Confidence 5554444445555554443 345556665666666665555533 233221111111111110 011223445566 Q ss_pred HHHHHHHHh--CCCCc-EEEeCHHHHHHHhcCHHHHHHhccCCCcccccCH-HH-----HHHHhCCCeEEeecceeeccc Q lcl|Aclame:pro 155 VITDALDSV--ILRPN-IGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPM-AF-----LQELLELDAIYIGEARLNIAR 225 (309) Q Consensus 155 di~~~~~~~--g~~Pn-~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~-~~-----l~~l~gl~~I~v~~a~~~~~~ 225 (309) +|.+....+ ..++| ..+|++.+|.+|+. + +..++.+ +..+ .+ -..++|.| |++-+..- T Consensus 294 ~l~~l~~~l~~~~~~~a~~v~n~~~~~~L~~---l----kd~~G~~-l~~~~~~~~~g~~~tl~G~p-V~~~~~~~---- 360 (434) T protein:vir:62 294 ALVKMKNTPVKEVRKKARWVLNTAALTKIET---M----KTDDGFP-LLRPFNQAEGGIGYTLLGFP-VEEEDAID---- 360 (434) T ss_pred HHHHHHhhcchhhhcCCEEEEcHHHHHHHHH---h----hccCCCE-eeccCCCccCCCCceeccee-eEEecCcc---- Confidence 776666655 34555 46999999988765 2 2222222 1111 01 12366777 33322211 Q ss_pred cCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeee-cchhhhhhh Q lcl|Aclame:pro 226 PGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVT-APDLGFFFE 304 (309) Q Consensus 226 ~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~-~~~~G~l~~ 304 (309) .+..+ +..+++|.+- +..+.....+...-....+.+-..+...+++.+..|-+++ +|.+.-+|+ T Consensus 361 ~~~~~-------~~~~i~~Gdf--------s~~~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~ 425 (434) T protein:vir:62 361 IPDSP-------DTPVFYFGDF--------SKFYIQDVIGSLEVQKLVELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYK 425 (434) T ss_pred CccCC-------CceEEEEeec--------cceEEEEeeceeEEEeehhhhcccCceEEEEEeeecceeecCcccceEEE Confidence 01111 1111222211 1112221111000001111222334444666666665544 466666663 Q ss_pred cc----ccC Q lcl|Aclame:pro 305 NA----VAA 309 (309) Q Consensus 305 ~~----va~ 309 (309) -. .+| T Consensus 426 ~~~~~~~~~ 434 (434) T protein:vir:62 426 YVLKAPTGA 434 (434) T ss_pred EEeccCCCC Confidence 33 333 No 124 >protein:vir:79712 Length: 285 # NCBI annotation: major capsid protein gp34 # Family: family:all:701 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285883;genbank:gi:148750840;genbank:GeneID:5220414 Probab=62.51 E-value=0.33 Score=23.20 Aligned_cols=268 Identities=13% Similarity=0.082 Sum_probs=107.1 Q ss_pred CCCCCCCcchhhHHHHHhhcchhhhhhhhCCcc--cc---ccccceeEEechhHhhhchhHhhcccccccccccCcCccc Q lcl|Aclame:pro 1 MSNAPFPIDPELTAIAIAYRNGRMISDEVLPRV--PV---GKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDET 75 (309) Q Consensus 1 m~~~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v--~v---~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~~ 75 (309) |+.... .-....+-.-|....+.++.+-+.- .+ +..+.|+|+..-...+.. -.|..+.... +.+...++ T Consensus 1 Main~~--~k~~~~ld~~~~~~~~~~~l~~~~n~~~~~~~gak~VkIp~ist~~gl~d--Y~R~~g~~~g--~v~~~~et 74 (285) T protein:vir:79 1 MTVVLD--SKDLARIDEEYKADSQVWSYLTGGNGVTQRFRGHNEVRINKLSGFVDATA--YKRGQDNARK--TISVGKET 74 (285) T ss_pred Ccchhh--HHHHHHHHHHHHHhhhhhhhcccCCcceeEecCCCEEEEeeecccccccc--cccccCcccc--ccceeeeE Confidence 775542 1223334444444455555554421 12 233456666531122332 2334333333 33444444 Q ss_pred eee-eccchhhcCCHHHHHHHhhcCCHHHHHH-HHHHHHHHHHH-HHHHHHHhhcccccCcccceecccccccCCCCCCh Q lcl|Aclame:pro 76 GST-EDHGLDAPVPQADIDNAPTNYNPLGHAT-EQTTNLILLDR-EARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNP 152 (309) Q Consensus 76 ~~~-~e~~L~~~v~~~~~~~a~~~~d~~~~av-~~l~~~i~~~~-E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sdP 152 (309) +.+ .+++....|+.-+.++.. ... .+..+ +...+++-=.. -.++++++...... ...++ ..++. T Consensus 75 ~tl~~DR~~~f~iD~mDvdEn~-~~~-~~ni~~ef~~~~vvPEiDayrfskla~~a~~~---~~~~~--------T~~nv 141 (285) T protein:vir:79 75 VKLTHEDWFGYDLDQFDMDENG-AYT-VENVVREHNKMITIPHRDKVAVQKLFDSAAKK---ATDSI--------TKDNA 141 (285) T ss_pred EEeeccccceecccccchhhhh-hhh-HHHHHHHHHhhhhcchhhHHHHHHHHhhcccc---ccccc--------CHHHH Confidence 443 344555555554444311 111 11111 11111110001 12355555443221 11111 23567 Q ss_pred HHHHHHHHHHh---CC-CCcEEEeCHHHHHHHhcCHHHHHHhccCCC-cccccCHHHHHHHhC-CCeEEeecceeecccc Q lcl|Aclame:pro 153 LPVITDALDSV---IL-RPNIGVLGRRTATILRRHPKIVKAYNGSLG-DEGMVPMAFLQELLE-LDAIYIGEARLNIARP 226 (309) Q Consensus 153 i~di~~~~~~~---g~-~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~-~~~~vt~~~l~~l~g-l~~I~v~~a~~~~~~~ 226 (309) +..|++++.++ |+ .+-.|++++.++..|++.+.+.+.+..+.. ..+-+.. .+.++=| ++-|.|..++..+. T Consensus 142 ~~~i~~~~~~lde~~vp~~rvl~vTp~~~~~Lk~s~~~~r~~~~~~~~~~~~i~~-~V~~lDg~v~ii~Vps~r~kt~-- 218 (285) T protein:vir:79 142 LDAYDTAEAYMFDNEVPGGFVMFVSSAYYTALKQSAAVTRTFSTDGTMVINGIDR-RVAQLDGGVPIVRVSSDRLKGL-- 218 (285) T ss_pred HHHHHHHHHHHHHcCCCCceEEEEChHHHHHHHhhhhhheecccccceeccceee-eeccccceeEEEEcchhhccCc-- Confidence 88888887655 55 345699999999999999999888754321 1122211 2333335 67777777766432 Q ss_pred CCCcccceecCCcEEEEecC-CCCCCcCcceecccccccccccCCccccccccCC-ceEEEeecccceeeecchhhhhhh Q lcl|Aclame:pro 227 GQNPNLIRAWGPHASFIYRD-RLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRG-GQRVRVGESVKELVTAPDLGFFFE 304 (309) Q Consensus 227 g~~~~~~~v~~~~~~L~~~~-~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g-~~~v~v~~~~~~~v~~~~~G~l~~ 304 (309) +....+.++ +.+.. +.+........=+++ .---.| ++.+....-+|-.|.-.-.-.++- T Consensus 219 ~~~k~Infi------iv~~~a~i~~~K~~~~~~f~P-------------~~~~~~d~~~~~~R~Y~d~fv~~nk~~~Iy~ 279 (285) T protein:vir:79 219 GITNHVNFI------LTPLSAIAPIVKYDSVSVIDP-------------STDRSGNRWTIKGLSYYDAIVLDNAKKGIYV 279 (285) T ss_pred CcchhccEE------EecCceeccceeeeeeEeECC-------------CCCCCcceeeeeeeeeeeeeehhhccceeee Confidence 112222222 22111 111111111111111 000000 112222222222222222222222 Q ss_pred ccccC Q lcl|Aclame:pro 305 NAVAA 309 (309) Q Consensus 305 ~~va~ 309 (309) ++-|| T Consensus 280 ~~~a~ 284 (285) T protein:vir:79 280 AATAG 284 (285) T ss_pred eeccc Confidence 33333 No 125 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=61.44 E-value=0.35 Score=23.06 Aligned_cols=276 Identities=13% Similarity=0.063 Sum_probs=112.3 Q ss_pred CC--CCCCCcchhh--HHHHHhhcchh----hhhhhhCC---------ccc-cccccceeEEechhHhhhchhHhhcccc Q lcl|Aclame:pro 1 MS--NAPFPIDPEL--TAIAIAYRNGR----MISDEVLP---------RVP-VGKQEFKFWKYDLAQGFTVPETLVGRKS 62 (309) Q Consensus 1 m~--~~~f~~dp~L--T~~a~~y~n~~----~ig~~lfP---------~v~-v~~~~~k~~~~~~~~~f~~~~t~~~~~~ 62 (309) |- +.+|..-+-+ ++..-||-+|+ || +.|.. .++ +.....+++..+-...........+... T Consensus 1 ~~~~~~~~~~~k~it~~d~~gG~L~P~~~~~~i-~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~~~~~~~~~~ 79 (314) T protein:vir:41 1 MDFLNKPFQITPKIDVPDLGKGILAVQRFGEFV-REVRENSAIIKDARVLNALKSYEVDISRISLGVELEPGRNTSGTKV 79 (314) T ss_pred CchhhhHHHhhcccccccCCCceeChHHHHHHH-HHHHhccchhhheeeecccCccceeecccccCcccccccccccCCc Confidence 32 2232222211 22234565554 33 22222 111 1222234444432211111111112222 Q ss_pred cccccccCcCccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhc---cc---ccCcccc Q lcl|Aclame:pro 63 KPNEVEFSATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFS---PN---SYAAGNK 136 (309) Q Consensus 63 ~~~~ve~~~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~---~~---~y~~~~~ 136 (309) .....+.++...++.+++......|+++..++....-|.++.-+..+.+++...+|...-+.--+ +. +-+.+.- T Consensus 80 ~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~~p~G~l 159 (314) T protein:vir:41 80 APTADEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELYRINDGWM 159 (314) T ss_pred cCCcccccccceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccchhcchhhh Confidence 33445667777888888888878888888776554447788888888888888777654332110 00 0111110 Q ss_pred eeccccc--ccCC-CCCChHHHHHHHHHHhC--C-C--Cc-EEEeCHHHHHHHhcCHHHHHHhccCCCcccc--cCHHHH Q lcl|Aclame:pro 137 TTLSGAD--QWSD-PTSNPLPVITDALDSVI--L-R--PN-IGVLGRRTATILRRHPKIVKAYNGSLGDEGM--VPMAFL 205 (309) Q Consensus 137 ~~lsgt~--~Wsd-~~sdPi~di~~~~~~~g--~-~--Pn-~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~--vt~~~l 205 (309) +.++.. ..+. ..++|...+.+...++. + + +| +.+|+.+++.+++ +.+..+....+. +...+- T Consensus 160 -~~a~~~~~~~~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r------~~l~~~~~~l~~~~~~~~~~ 232 (314) T protein:vir:41 160 -KLAGNQYTDAEPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYR------KQLLVRETGLGDSALIGATG 232 (314) T ss_pred -hhcccceeecCccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHH------HHHhccCCcccchhhhCCCC Confidence 001111 1222 34567777888777762 2 1 33 5789998887744 444443322211 111122 Q ss_pred HHHhCCCeEEeecceeeccccCCCccccee-cCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEE Q lcl|Aclame:pro 206 QELLELDAIYIGEARLNIARPGQNPNLIRA-WGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRV 284 (309) Q Consensus 206 ~~l~gl~~I~v~~a~~~~~~~g~~~~~~~v-~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v 284 (309) ..++|.|-+.+ .+ + ..+ .|+..+ ++.+.. .-.||...+. .+..+++...+...+ T Consensus 233 ~~l~G~PV~~~-~~-~-----------~~~~~~~~~i-~fgd~~-----nlv~~~~~~i------r~~~~~~a~~~~~~~ 287 (314) T protein:vir:41 233 LQYDGIPIQYV-PA-L-----------DALGDDKARA-LLTVPT-----NLVYGFWRNI------RIEPKRDAAMRRTEY 287 (314) T ss_pred ceecceeeEec-cc-c-----------cccCCCCceE-EEechh-----heEEEeecee------EEeecccCcCCeEEE Confidence 22556663322 11 0 011 122333 333311 1123332221 112222222332222 Q ss_pred Eeecccceee--ecchhhhhhhccccC Q lcl|Aclame:pro 285 RVGESVKELV--TAPDLGFFFENAVAA 309 (309) Q Consensus 285 ~v~~~~~~~v--~~~~~G~l~~~~va~ 309 (309) -.....+-.+ ....+=.++.++=|| T Consensus 288 ~~~~r~d~~~~~~~aa~~~~~~~~~~~ 314 (314) T protein:vir:41 288 IASLRADCNYEDENAAVAAVIDMSSGG 314 (314) T ss_pred EEEEEeceEEEEcCcEEEEEeeccCCC Confidence 2222111111 111111222233333 No 126 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=60.95 E-value=0.36 Score=23.00 Aligned_cols=278 Identities=9% Similarity=0.034 Sum_probs=111.7 Q ss_pred CCCC---CCCcchhhHHHHHhhcchhhhhhhhCCccccccccc---eeEEechhHhhhchhHhhccc---cc--cccccc Q lcl|Aclame:pro 1 MSNA---PFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEF---KFWKYDLAQGFTVPETLVGRK---SK--PNEVEF 69 (309) Q Consensus 1 m~~~---~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~---k~~~~~~~~~f~~~~t~~~~~---~~--~~~ve~ 69 (309) |+.. .|+ .....++-+.|++ -+.+|-|.|.....+. ++..|+..+.-.+..++.+.. +. ...... T Consensus 13 Ms~~i~~~fv-~qy~~~v~~~~qq---~~s~L~~tV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~dtp~~~~ 88 (322) T protein:vir:10 13 IAGDIDQAFV-QTYETTLRILSQQ---KSAKLKQYCQHKNESSESHNWETLASMDPDAVKRKRSRQQSADGTYPTPVNNK 88 (322) T ss_pred eechhhhHHH-HHHHHHHHHHHHH---hhhhhhcccccccccccccceeecccccccccccccccccccCcccCCCcccc Confidence 5542 233 3333344344443 3456677665443332 344444322111111111111 11 111233 Q ss_pred CcCccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCccc-ceecccccccCCC Q lcl|Aclame:pro 70 SATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGN-KTTLSGADQWSDP 148 (309) Q Consensus 70 ~~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~-~~~lsgt~~Wsd~ 148 (309) +...+...+.++....+|+..+ +.+..+||....++.....+.+.....+...+...++-+... .+.+..+..=.+. T Consensus 89 ~~~~r~~~~~d~~~~~~VDd~D--~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~~~~gt~v~~~ss~~i~~g 166 (322) T protein:vir:10 89 PFAKRRTNVDTYDTGHVVEQED--ISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPASIKGTGQPVEFLATQEIGDG 166 (322) T ss_pred ccceEEEeecccccceecchHH--HHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhccccccccccccccCCCcccccC Confidence 4555666677776666665554 556678999888888777776655444443333322211110 0111111111111 Q ss_pred -CCChHHHHHHHHHHh---CCCC---cEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeeccee Q lcl|Aclame:pro 149 -TSNPLPVITDALDSV---ILRP---NIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARL 221 (309) Q Consensus 149 -~sdPi~di~~~~~~~---g~~P---n~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~ 221 (309) ...=.+.|.++++.+ .+-+ -.++++++-|..|+..+.+...-+. +++. ....-.+..+||++-|. ....= T Consensus 167 ~~g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~-~~~~-l~~~G~ig~~lGf~~i~-s~~lp 243 (322) T protein:vir:10 167 TKPISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQITEATSADYT-SAMD-LQSKGIITNWMGYTWIV-STRLD 243 (322) T ss_pred ccchhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhcc-cchh-hhhcCeeeeeeeEEEEE-eccCC Confidence 123355566665554 3322 2599999999999999888765433 2211 11112234457766332 21110 Q ss_pred ecc----------ccCCCcccceecCCcEEEEecCCC--CCCcCcceecccc-cccccccCCccccccccCCceEEEeec Q lcl|Aclame:pro 222 NIA----------RPGQNPNLIRAWGPHASFIYRDRL--ADTRNGTTFGLTA-QWGDRVSGSIADPNIGLRGGQRVRVGE 288 (309) Q Consensus 222 ~~~----------~~g~~~~~~~v~~~~~~L~~~~~~--~~~~~~~t~G~T~-~~~~~~~~~~~d~~~g~~g~~~v~v~~ 288 (309) +.. ..+.+...-..|-.+++.+-...- ..-.+-|...+.. -|.....|... +.++|-..+++.| T Consensus 244 ~~~~t~~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~dv~~~i~~~~~~~~a~~I~~~~~~Ga~r---i~~~gVv~i~~~e 320 (322) T protein:vir:10 244 KFDPTQWGMAAEDGPQGDEIWCIAMTDMALGYHSCKDIWTKVAEDPSASFAWRIYSAFTADCVR---VEDEHIFKLRLKN 320 (322) T ss_pred ccccccccccccCCCCccceeEEEEecCceeEEEeeeeeEEeeccCCcchhhhhhhhhhhCceE---eccCcEEEEEEec Confidence 000 011111112234333332211100 0000001111000 00000001111 0112323345555 Q ss_pred cc Q lcl|Aclame:pro 289 SV 290 (309) Q Consensus 289 ~~ 290 (309) +. T Consensus 321 ~~ 322 (322) T protein:vir:10 321 SL 322 (322) T ss_pred cC Confidence 54 No 127 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=55.87 E-value=0.47 Score=22.38 Aligned_cols=289 Identities=11% Similarity=0.031 Sum_probs=102.4 Q ss_pred CCCCCCCcchhhHHHHHhhcchh-------hhhh---------hhCCccccc----cccceeEEechhHhhhchhHhhcc Q lcl|Aclame:pro 1 MSNAPFPIDPELTAIAIAYRNGR-------MISD---------EVLPRVPVG----KQEFKFWKYDLAQGFTVPETLVGR 60 (309) Q Consensus 1 m~~~~f~~dp~LT~~a~~y~n~~-------~ig~---------~lfP~v~v~----~~~~k~~~~~~~~~f~~~~t~~~~ 60 (309) |++-. .||.-..|=.+.+ |-|+ .+.|.+.+. ..+.+++..++.++- -+.+ T Consensus 1 ms~~~-----~~tr~~~~~s~~d~al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~iG~~~~~-----~~~p 70 (335) T protein:vir:63 1 MSFLN-----DLTRPNYAGKNADVDIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRLGNVEAK-----GRRA 70 (335) T ss_pred CCCcc-----cchhhhcccccchhheehhhhhhhHHHHHHhhhhhccccceeeeccceeEEEeeeeeeeee-----cccC Confidence 87642 1233221111111 2222 223333222 112233444432210 1122 Q ss_pred cccccccccCcCccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--------- Q lcl|Aclame:pro 61 KSKPNEVEFSATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSY--------- 131 (309) Q Consensus 61 ~~~~~~ve~~~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y--------- 131 (309) |.... .......+..-+.+.-|.....-.++++++..||-+....+..-..+.......+...+...... T Consensus 71 G~~l~-~~~~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~ 149 (335) T protein:vir:63 71 GEELE-RSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDA 149 (335) T ss_pred CcCcC-CCCccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCC Confidence 22221 11112222222333334444555566677778888777666655555555544444333332221 Q ss_pred ---CcccceecccccccCCCCCC-hHHHHHHHHHHh--------CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccc Q lcl|Aclame:pro 132 ---AAGNKTTLSGADQWSDPTSN-PLPVITDALDSV--------ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGM 199 (309) Q Consensus 132 ---~~~~~~~lsgt~~Wsd~~sd-Pi~di~~~~~~~--------g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~ 199 (309) +......++|++.=+ +.+ ...-+.++.+++ |..+-.++++++.|.+|+.|+++++.-++....... T Consensus 150 ~~~G~~~~~~~tg~~~~~--~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~ 227 (335) T protein:vir:63 150 FSPGVLEKLDLTGLTAKQ--AADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGATND 227 (335) T ss_pred cCCCcceeeeeccCcccc--cHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhcccccccccccccccccc Confidence 111233444432111 111 223344444444 233468999999999999999998875443211112 Q ss_pred cCHHHHHHHhCCCeEEeecceeecccc----CCCccccee-cCCcEEEEecCCCCCCcCcceecccccccccccCCcccc Q lcl|Aclame:pro 200 VPMAFLQELLELDAIYIGEARLNIARP----GQNPNLIRA-WGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADP 274 (309) Q Consensus 200 vt~~~l~~l~gl~~I~v~~a~~~~~~~----g~~~~~~~v-~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~ 274 (309) .....+..+.|++ |+.....-..+.. +...+..+. +...+.+++.+.--.+..-........+..+..+++.+. T Consensus 228 ~~~g~v~~v~Gv~-V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~~~~~~i~~ 306 (335) T protein:vir:63 228 YVKSRVAILNGVK-VLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNEKFSWVLDT 306 (335) T ss_pred ccCceeEEeeceE-EEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEeecccceeeccchhhHHhHH Confidence 3344555666665 2221110000000 000000000 000122222221100000000011111222222222222 Q ss_pred ccccCCceEEEeecccceeeecchhhhhhhcc Q lcl|Aclame:pro 275 NIGLRGGQRVRVGESVKELVTAPDLGFFFENA 306 (309) Q Consensus 275 ~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~ 306 (309) .+-+..+ ..| .+.--+|.=+.+|-+=.-| T Consensus 307 ~~a~G~g-~lR--Pe~a~~i~~tg~~~~~~~~ 335 (335) T protein:vir:63 307 FQMYNIG-ARR--PDTAGAIELKGIGAFDITA 335 (335) T ss_pred HHHcCCc-ccc--cceEEEEEEcCCCceeecC Confidence 2111111 111 1111111113333332222 No 128 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=53.18 E-value=0.54 Score=22.07 Aligned_cols=261 Identities=10% Similarity=-0.027 Sum_probs=109.9 Q ss_pred CCCC------CCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccc-cccCcCc Q lcl|Aclame:pro 1 MSNA------PFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNE-VEFSATD 73 (309) Q Consensus 1 m~~~------~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~-ve~~~~~ 73 (309) |+.. ..++..+...|...-++..-| -.+++.+||...+++++..... -.....-++.++.... ....+.+ T Consensus 91 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~i-~~~~~~~~~~~~~~~~~~~~~~--~~~~a~~v~Eg~~~~~~~~~~f~~ 167 (371) T protein:vir:81 91 MSEGSNQDGGYTVPQDIQTRINELRESKDAL-QNLITVEPVTTLSGSRVFKKRS--QQTGFVEVAEGAAIGEKATPQFTL 167 (371) T ss_pred hccCCCccCceeecHhHHHHHHHHHHhhhhh-hhhceeeeccCCceeEEEEeec--CCcceeeeccccccccccccceee Confidence 3321 122222222222111121112 2335667777666665443211 0111123444554433 3455666 Q ss_pred cceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCChH Q lcl|Aclame:pro 74 ETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPL 153 (309) Q Consensus 74 ~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sdPi 153 (309) .+..++..+-..+++++-..+. .++....-.+.+.+.+....+..+....-. ...+|.. -. T Consensus 168 i~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~a~~~~~~~~i~~g~g~---------~~~~~~~--------~~ 228 (371) T protein:vir:81 168 LQYQVKKYAGFFRVTNELLNDS--TEAIVNTLVRWIGDESRVTRNGLIINVLNT---------KAKTAIA--------DL 228 (371) T ss_pred EEeeeeEEEEeehhhHHHHhhh--hHHHHHHHHHHHHHHHHHHHHHHHHhhccc---------ccccccc--------cH Confidence 6666666655567777755543 244455555666666655555433332110 0111111 12 Q ss_pred HHHHHHHHH-h--CCC-CcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCH----HHHHHHhCCCeEEeecceeeccc Q lcl|Aclame:pro 154 PVITDALDS-V--ILR-PNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPM----AFLQELLELDAIYIGEARLNIAR 225 (309) Q Consensus 154 ~di~~~~~~-~--g~~-Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~----~~l~~l~gl~~I~v~~a~~~~~~ 225 (309) .+|...... + .++ ....+|++..|..|+. ++..++.. +..+ ..-..++|.| |++.+....... T Consensus 229 ~~i~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~-------lkd~~g~~-l~~~~~~~~~~~~l~G~p-V~~~~~~~~~~~ 299 (371) T protein:vir:81 229 DGLKQIINVQLDPVFRSTSSVIVNQDAFNWLDT-------LKDQNGQY-LLQPSISSPTGRQLLGLP-VVIVSNKVLANR 299 (371) T ss_pred HHHHHHHHhhcchhhhcCCEEEEcHHHHHHHHH-------hhccCCCe-eeecccCCCCCceeccee-EEEecccccCcc Confidence 233333321 1 233 4578999999988764 22222221 1111 1112345665 333222110000 Q ss_pred cCC---CcccceecCCcE--EEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhh Q lcl|Aclame:pro 226 PGQ---NPNLIRAWGPHA--SFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLG 300 (309) Q Consensus 226 ~g~---~~~~~~v~~~~~--~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G 300 (309) ... .....-+|++-. ++++. .-|.+.++... ....-..+...+|+...++-.+.-+++. T Consensus 300 ~~~~~~~~~~~i~~Gd~~~~~~~~~----------~~~~~i~~~~~------~~~~f~~~~v~~~~~~r~d~~~~~~~a~ 363 (371) T protein:vir:81 300 VDGGTGAQFAPIIVGDLKEAVVMFD----------RQRTEIMSSNV------AMDAFETDATLWRAIERMDVKMRDDEAF 363 (371) T ss_pred ccccccCCcceEEEEehhceEEEEe----------ecceEEEEecc------ccchhhcCceEEEEEEeeccEEecccce Confidence 000 000011112100 00000 00111111110 0011124556788889999999999998 Q ss_pred hhhhcccc Q lcl|Aclame:pro 301 FFFENAVA 308 (309) Q Consensus 301 ~l~~~~va 308 (309) ..++=+.| T Consensus 364 ~~~~~~~A 371 (371) T protein:vir:81 364 VFGEVQLA 371 (371) T ss_pred EEEEEecC Confidence 88887777 No 129 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=52.45 E-value=0.55 Score=21.99 Aligned_cols=261 Identities=9% Similarity=-0.049 Sum_probs=103.6 Q ss_pred CCC-----C-CCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccc-cccCcCc Q lcl|Aclame:pro 1 MSN-----A-PFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNE-VEFSATD 73 (309) Q Consensus 1 m~~-----~-~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~-ve~~~~~ 73 (309) |+. . ..++..+.+.|-..-++..-|.+ +++.+|+....++++...- .........++-++.... ....+.. T Consensus 116 ~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~-~~~~~~~~~~~~~~~~~~~-~~~~~~~~~v~E~~~~~~~~~~~~~~ 193 (408) T protein:vir:74 116 ETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQ-YVRVESVSTSSGSRVYEKW-TDVTPLKAMDEEDGKIPDLDNPRLTI 193 (408) T ss_pred hcccccCCCceeechhHhhHHHHHHhhhcchhh-hcceeeccCCcceEEEEee-cCCcccccccccccccccccccceee Confidence 211 1 12333333333222122112222 2345566655555433211 111111223444444443 2345555 Q ss_pred cceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCChH Q lcl|Aclame:pro 74 ETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPL 153 (309) Q Consensus 74 ~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sdPi 153 (309) .++..+..+-..+++++-..+ +.+|......+.+.+.+....+..+. ++. +......+...| T Consensus 194 i~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~l~~~~~~~~d~~il----~G~----G~~~~~~~~~~~-------- 255 (408) T protein:vir:74 194 IKYLIKRYAGIITATNTLLKD--TAENILAWLSSWIAKKVVVTRNQAII----AAM----GTVPKKPTIANF-------- 255 (408) T ss_pred EEeeeeeEEeeehhHHHHHhh--chHHHHHHHHHHHHHHHHHHHHHHHh----hcc----cccccccccccH-------- Confidence 566666665556666665543 34555666666677777666554332 221 111112222112 Q ss_pred HHHHHHHH-Hh--CCCC-cEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHH----HHHHHhCCCeEEeecc-eeecc Q lcl|Aclame:pro 154 PVITDALD-SV--ILRP-NIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMA----FLQELLELDAIYIGEA-RLNIA 224 (309) Q Consensus 154 ~di~~~~~-~~--g~~P-n~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~----~l~~l~gl~~I~v~~a-~~~~~ 224 (309) .+|...+. .+ ..++ -..+|++..|.+|+. + +..++.. ++.++ .=..++|.| |++.+. ..... T Consensus 256 ~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~l~~---l----kd~~G~~-l~~~~~~~~~~~~l~G~p-V~~~~~~~~~~~ 326 (408) T protein:vir:74 256 DDVITMINTSVDPAIIATSSLLTNQSGLNKLAL---V----KTAEGKY-LLEPDPTKPNSYLIKGKQ-VIVVADRWLPNS 326 (408) T ss_pred HHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHH---h----hcCCCce-EeccCcCCCCCceeccee-eEEecCcccccc Confidence 22332221 22 3333 468999999999874 2 2222222 22211 113456766 443322 21111 Q ss_pred ccCCCcccceecCCc--EEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhh Q lcl|Aclame:pro 225 RPGQNPNLIRAWGPH--ASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFF 302 (309) Q Consensus 225 ~~g~~~~~~~v~~~~--~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l 302 (309) ......-++++. .++++.. -|.+.++..... ..-..+...+|+...++-.+.-+++-.+ T Consensus 327 ---~~~~~~i~~gd~~~~~~~~~~----------~~~~i~~~~~~~------~~f~~~~~~~r~~~r~d~~~~~~~a~~~ 387 (408) T protein:vir:74 327 ---GSTVYPLYYGDMSQAITLFDR----------ENMSLLPTNIGA------GAFETDTTKIRVIDRFDVKATDSEALVA 387 (408) T ss_pred ---cCCcceEEEEehhccEEEEEe----------cceEEEEecccc------chhhcceeeEEEEEeeCcEEecccceEE Confidence 111111222221 1111110 112222111000 0001334456666666666666665554 Q ss_pred hhccccC Q lcl|Aclame:pro 303 FENAVAA 309 (309) Q Consensus 303 ~~~~va~ 309 (309) ++=.-.+ T Consensus 388 ~~~~~~~ 394 (408) T protein:vir:74 388 GSFTAIA 394 (408) T ss_pred EEeeccc Confidence 4321111 No 130 >protein:vir:99523 Length: 311 # NCBI annotation: putative protein # Family: family:all:701 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958538;genbank:gi:41179320;genbank:GeneID:2717161 Probab=46.36 E-value=0.74 Score=21.31 Aligned_cols=275 Identities=13% Similarity=0.101 Sum_probs=107.8 Q ss_pred CCCCCCCcchhhHHHHHhhcch--------hhhhhhhCCcccc--ccccceeEEechhHhhhchhHhhcccccccccccC Q lcl|Aclame:pro 1 MSNAPFPIDPELTAIAIAYRNG--------RMISDEVLPRVPV--GKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFS 70 (309) Q Consensus 1 m~~~~f~~dp~LT~~a~~y~n~--------~~ig~~lfP~v~v--~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~ 70 (309) |... .|..=.++|.-|.+. -+-|..--|...+ +..+.|+|+..- ..+..++ |..+. +..+.+ T Consensus 1 ~~~~---an~mAlnya~~~~~~Ld~~~~~~~~t~~l~~~~~~~~~Gak~VkIp~i~~-~gl~dY~--R~~g~--~~g~v~ 72 (311) T protein:vir:99 1 MPTD---AETRGFNYVTKDGNLLDQKITAGLFTAALGTPEVDLVNGGRSFTLKTIST-SGLKDHT--RGKGF--NSGTIS 72 (311) T ss_pred CCCc---chhhHHHHHHHHHHHHHHHHHhhhcccceecCchheeecCCEEEEEeeee-ccccccc--cccCc--ccccee Confidence 4321 121112333333321 1122222333322 233455666543 2333322 33333 334444 Q ss_pred cCccceee-eccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHH-HHHHHHhhcccccCccc-ceeccc-ccccC Q lcl|Aclame:pro 71 ATDETGST-EDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDRE-ARTSKLVFSPNSYAAGN-KTTLSG-ADQWS 146 (309) Q Consensus 71 ~~~~~~~~-~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E-~~~a~~~~~~~~y~~~~-~~~lsg-t~~Ws 146 (309) ...+++.+ .+++....|+.-+.++.........-.-+...+.+-=... .++++++.........+ ..++.. +.+=+ T Consensus 73 ~~~et~tl~~DR~~~f~vD~mDvdETn~~~~~ani~~~f~r~~vvPEiDayrfskla~~a~~~~~~~~~~~~~~~~~~~~ 152 (311) T protein:vir:99 73 DEKTIYTMGQDRDVEFYLDRQDVDETDNELAMANISNVFITEHVQPELDSYRFSKIATSFDNLDGTDTEGTLLAKTHKTE 152 (311) T ss_pred eeeeEEEeeeccceeeecchhchhhhhhhhHHHHHHHHHHHhhhcchhhHHHHHHHHhhhhcccccccchhhhccccccc Confidence 44445543 4445666666555554332222111111111111111111 23444443332211111 111111 11111 Q ss_pred C--CCCChHHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEe-ecce Q lcl|Aclame:pro 147 D--PTSNPLPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYI-GEAR 220 (309) Q Consensus 147 d--~~sdPi~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v-~~a~ 220 (309) . ..++.+..|.+.+.++ +..+-.|.+++.++..|++.+.+.+.+.......+.|.. .+.++=|++-|.| -.++ T Consensus 153 ~~lt~~nvl~~l~~~~~~~~~v~~~~rvl~vTp~~~~lLk~~~~~~r~~~~~~~~~~~i~~-~V~~lDgv~Ii~V~ps~r 231 (311) T protein:vir:99 153 ETLDETNAYSQLKTGIGKVRKYGTQNLVGYVSSEVMDALERSKEFTRNITNQNVGTTALES-RITSIDGVQLIEVYESNR 231 (311) T ss_pred cccCHHHHHHHHHHHHHHHHhcCCCCeEEEEChHHHHHHhhchhhheeeeccccccccccc-ccceecCeEEEEecCchh Confidence 1 2345778888887665 455668999999999999999998877654433444533 3677778877777 3433 Q ss_pred eeccc---cCCCc-----ccceec-CCcEEEEe-------cCCCCCCcCcceecccccccccccCCccccccccCCceEE Q lcl|Aclame:pro 221 LNIAR---PGQNP-----NLIRAW-GPHASFIY-------RDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRV 284 (309) Q Consensus 221 ~~~~~---~g~~~-----~~~~v~-~~~~~L~~-------~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v 284 (309) +.+.. .|... .+.++. +..+++.+ ..+++. .....||.+++ |...-.+...- ...+.++ T Consensus 232 ~~t~~~ft~G~~~~~~ak~INfiiv~~~a~i~~~K~~~v~~f~P~~--~~~gd~~l~~~--R~Y~D~fv~~n-k~~~Iyv 306 (311) T protein:vir:99 232 FMTKYDFTDGAKPTEDAKAINFLVVAKPAVISIVKENAVFLFAPGQ--HTDGDGYLYQN--RLYHDLFIKKH-KRDGIFV 306 (311) T ss_pred hcchhhhcCCccccCcccccceEEeCCCeeeeeeeeeeeeeeCCCC--CCCcceeeeee--eeeeeeeeecc-ccCeEEE Confidence 32221 12111 111110 11111110 000010 01112444332 22222211100 0122233 Q ss_pred Eeecc Q lcl|Aclame:pro 285 RVGES 289 (309) Q Consensus 285 ~v~~~ 289 (309) -+... T Consensus 307 ~~k~A 311 (311) T protein:vir:99 307 SVKKA 311 (311) T ss_pred eeecC Confidence 33332 No 131 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=46.09 E-value=0.75 Score=21.28 Aligned_cols=254 Identities=10% Similarity=-0.005 Sum_probs=111.4 Q ss_pred CCCCC----CCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhc-hhHhhcccccccccccCcCccc Q lcl|Aclame:pro 1 MSNAP----FPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTV-PETLVGRKSKPNEVEFSATDET 75 (309) Q Consensus 1 m~~~~----f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~-~~t~~~~~~~~~~ve~~~~~~~ 75 (309) |.... .++....+.|..--++..-|. .+++.+++.....+||+... +.. ...-++.++........+...+ T Consensus 109 ~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~-~~~~~~~~~~~~~~~~~~~~---~~~~~~~~v~Eg~~~~~~~~~f~~i~ 184 (379) T protein:vir:10 109 MTLPVNLTGAQPKDYNFDVVLNPSQMLNVS-DIVGAVSISGGTYTFVRENG---AGEGAIGAQVEGATKGQKDYDISMID 184 (379) T ss_pred cccCCCCccccchhhhhHHHHhHHhhhhHH-hhceeeeccCCceEEEEeec---CCCcccccccCCccccccccceeeeE Confidence 22111 111222222222212222233 34566777776777876532 111 1112344555555566677777 Q ss_pred eeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCChHHH Q lcl|Aclame:pro 76 GSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPLPV 155 (309) Q Consensus 76 ~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sdPi~d 155 (309) +..+.++-..+|+++-..+++. ....-...+.+.+....+..+ .+..+ .......++ ..+++.+.+ T Consensus 185 ~~~~k~~~~~~iS~ell~D~~~---l~~~i~~~la~~~~~~~~~~~----~~g~~-~~~~~~~~~------~~~~~~~d~ 250 (379) T protein:vir:10 185 VNTDFIAGFTRYSKKMANNLPF---LTSFIPNALRRDYAKAENAAF----NAVLA-ANATASTEI------ITNKNKVEM 250 (379) T ss_pred eeeeeEEeeehhhHHHHhhHHH---HHHHHHHHHHHHHHHHHHHHH----hcccc-ccccccccc------ccCcccHHH Confidence 7777777666788776554421 222222333344433333222 11111 111111111 234455677 Q ss_pred HHHHHHH---hCCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCH------HHHHHHhCCCeEEeecceeecccc Q lcl|Aclame:pro 156 ITDALDS---VILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPM------AFLQELLELDAIYIGEARLNIARP 226 (309) Q Consensus 156 i~~~~~~---~g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~------~~l~~l~gl~~I~v~~a~~~~~~~ 226 (309) |...+.. .++.++.++|++..|.+|+. ++..++.. +..+ ..-..++|+|- ++-... + T Consensus 251 i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~-------lkd~~G~~-l~~~~~~~~~~~~~~l~G~pv-v~s~~~-----~ 316 (379) T protein:vir:10 251 LINEIAKQENLDFPVTAIVLRPTDYYDILV-------TQKSVGAG-YGLPGVVTQDNGVLRINGIPL-FRATWL-----A 316 (379) T ss_pred HHHHHHhhhhccCCCCEEEEcHHHHHHHHH-------hhccCCce-eccCCccCCCCCcceecceee-EecCCC-----C Confidence 7776543 37889999999999988753 22222221 1111 11125678763 322111 1 Q ss_pred CCCcccceecCCcE--EEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhh Q lcl|Aclame:pro 227 GQNPNLIRAWGPHA--SFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFE 304 (309) Q Consensus 227 g~~~~~~~v~~~~~--~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~ 304 (309) . ++ -++++.. .+.+.. |.+.+. +.....+-..+...+|+.+.++-.|.-|++...+ T Consensus 317 a--g~--~~~gdf~~~~~~~~~-----------~~~i~~------~~~~~~~f~~~~~~~r~~~R~~~~v~~p~a~v~~- 374 (379) T protein:vir:10 317 A--NK--YYVGDWTRVTKVTTE-----------GLSLEF------SEVEGTNFVKNNITARIEAQVALAVEQPAALIFG- 374 (379) T ss_pred C--Cc--eEEeecccEEEEEEe-----------ceEEEE------eecccccccCCcEEEEEEEEeccEEecCccEEEE- Confidence 0 11 1222211 111100 111110 0011111224455678888888888877775542 Q ss_pred ccccC Q lcl|Aclame:pro 305 NAVAA 309 (309) Q Consensus 305 ~~va~ 309 (309) -+++ T Consensus 375 -~~~~ 378 (379) T protein:vir:10 375 -DFTA 378 (379) T ss_pred -EecC Confidence 2333 No 132 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=46.05 E-value=0.75 Score=21.27 Aligned_cols=269 Identities=14% Similarity=0.067 Sum_probs=102.7 Q ss_pred CC-C--CC--CCcchhhHHHHHhhcch--------hhhhhhhCCccccccccceeEEechhHhhhchhHhhccccccccc Q lcl|Aclame:pro 1 MS-N--AP--FPIDPELTAIAIAYRNG--------RMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEV 67 (309) Q Consensus 1 m~-~--~~--f~~dp~LT~~a~~y~n~--------~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~v 67 (309) |= + .+ ..-.+++ ...+.|.-| .+.++.|||...++.=..+..+|.-.+.- =....-+-+++...+ T Consensus 63 mDa~~~~~~t~~~~g~p-~~~l~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~t~ty~~~e~~-G~A~~ygd~~D~Pl~ 140 (382) T protein:vir:96 63 MDSNFTAPVTTPSIPTP-IQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPA-GTAVEYGDHTNIPLT 140 (382) T ss_pred cccccCCccccCCccHH-HHHHhhhhhhhhhhhhhhhhhhhhccccccCCccceEEEEeeeecc-cceEEeecccCCCcc Confidence 22 1 01 1112222 333345444 36788888876544322233444221100 000011222222222 Q ss_pred c--cCcCccceeeeccchhhcCCHHHH-HHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccc-------- Q lcl|Aclame:pro 68 E--FSATDETGSTEDHGLDAPVPQADI-DNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNK-------- 136 (309) Q Consensus 68 e--~~~~~~~~~~~e~~L~~~v~~~~~-~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~-------- 136 (309) . .....++....+-+. .+...|. ..++..+|...+-.+..++. .|+..-++++-+.+-+.+|+ T Consensus 141 d~~~~~~~r~v~~~~~g~--~yg~lE~~rAa~~~~~l~~~Ka~aA~~a----le~~~N~i~f~G~~~g~~~~~yGllNdP 214 (382) T protein:vir:96 141 SWNANFERRTIVRGELGL--LVGTLEEGRASAIRLNSAETKRQQAAIG----LEIFRNAIGFYGWQSGLGNRTYGFLNDP 214 (382) T ss_pred ccccceeEEEEEEEEEee--eecHHHHHHHHhhCCCcHHHHHHHHHHH----HHHhhceEEEEeeecCcCcceEEEEeCC Confidence 2 233444444444443 3333333 33344566554433323322 23333344443321111121 Q ss_pred ----eecccccccCCCCCC-hHHHHHHHHHHh-----C-CC----CcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccC Q lcl|Aclame:pro 137 ----TTLSGADQWSDPTSN-PLPVITDALDSV-----I-LR----PNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVP 201 (309) Q Consensus 137 ----~~lsgt~~Wsd~~sd-Pi~di~~~~~~~-----g-~~----Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt 201 (309) ...+.+..|.+.+.+ .+.||.+...++ | .. |.+++|.+..+..|.+. +. .+.-= T Consensus 215 ~l~a~~t~a~~~Wa~kT~~eI~~Di~~l~~~i~~qt~G~~~~~~~~~~L~LP~~~~~~Ls~~---------n~--~g~Tv 283 (382) T protein:vir:96 215 NLPPFQTPPSQGWATADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITMALATSKVDYLSVT---------TP--YGISV 283 (382) T ss_pred CcccccccCCCCcccccHHHHHHHHHHHHHHHHhccCCeeeecccceEEeechHHHhhcccc---------Cc--cCccH Confidence 112234569988876 689999887765 3 22 45799999999888542 11 12111 Q ss_pred HHHHHHHh-CCCeEEeecceeeccccCCCcccceecCCcEEEEecCCCCCCc-Ccceeccccccccc----------ccC Q lcl|Aclame:pro 202 MAFLQELL-ELDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLADTR-NGTTFGLTAQWGDR----------VSG 269 (309) Q Consensus 202 ~~~l~~l~-gl~~I~v~~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~-~~~t~G~T~~~~~~----------~~~ 269 (309) .+.|++-| ++. |+. ..-+..+..++.+ +.++..+|.+...... ..++.+.+|....+ ... T Consensus 284 l~~lk~n~Pnl~-i~t-~peL~~a~~~g~g------~~~~~~~~~~e~~~~~~~s~~~p~~f~q~~p~~~~~l~ve~~~~ 355 (382) T protein:vir:96 284 SDWIEQTYPKMR-IVS-APELSGVQMQGKT------PEDALVLFVEEVDASVDGSTDGGSVFSQLVQSKFITLGVEKRAK 355 (382) T ss_pred HHHHHHhcCCcE-EEE-ccccccccCCCcc------ceeEEEEecchhhhhcccccccCcceeccccceeeeccceeecc Confidence 34566543 232 111 1112222112211 1223334444321110 01111222211111 111 Q ss_pred CccccccccCCceEEEeecccceeeecchhhh Q lcl|Aclame:pro 270 SIADPNIGLRGGQRVRVGESVKELVTAPDLGF 301 (309) Q Consensus 270 ~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~ 301 (309) .+..++.+.-+|..| +.|..++.-.|. T Consensus 356 ~~~~~~s~~t~Gv~i-----~~P~ai~~~~GI 382 (382) T protein:vir:96 356 SYVEDFSNGTAGALC-----KRPWAVVRYLGI 382 (382) T ss_pred eeEeccccceeeeEE-----EcchhhhhccCC Confidence 112222222233222 233333333333 No 133 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=43.04 E-value=0.86 Score=20.94 Aligned_cols=262 Identities=11% Similarity=0.039 Sum_probs=100.4 Q ss_pred CCC--CC--C-CcchhhHHHHHhhcch--------hhhhhhhCCccccccccceeEEechhHhhhchhHhhcccc---cc Q lcl|Aclame:pro 1 MSN--AP--F-PIDPELTAIAIAYRNG--------RMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKS---KP 64 (309) Q Consensus 1 m~~--~~--f-~~dp~LT~~a~~y~n~--------~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~---~~ 64 (309) .|+ .| . ..++-.-++...|-.| .+-++.|||...++.-..+..+|...|.- -+....| +. T Consensus 34 da~d~~~~~~t~~~~g~~~~l~~~i~p~~~~~~~~~~~~~~l~~v~t~g~w~~~~~~~~~~e~~----G~a~~ygd~~d~ 109 (336) T protein:vir:10 34 DAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPT----TKVATYGDYSSD 109 (336) T ss_pred hhhhhccccccCCCcchHHHHHhhcCcceeeeeechhchhhhcccccCCCcceeeEEEEeeeee----eeEEEccccCCC Confidence 111 11 1 2233222333445543 46788888866655444455555332200 0011112 22 Q ss_pred cccccCcCccceeeeccchhhcCCHHHHHHHh-hcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcc----cccCc-----c Q lcl|Aclame:pro 65 NEVEFSATDETGSTEDHGLDAPVPQADIDNAP-TNYNPLGHATEQTTNLILLDREARTSKLVFSP----NSYAA-----G 134 (309) Q Consensus 65 ~~ve~~~~~~~~~~~e~~L~~~v~~~~~~~a~-~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~----~~y~~-----~ 134 (309) ..+..........+.-.+.-......|...|+ ..++...+-.+..++.++ +..-+.++-+ ..|+. - T Consensus 110 P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale----~~~N~~~~~Gd~~~~~~GllN~P~l 185 (336) T protein:vir:10 110 GDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLA----KFLNGSYLFGVAGLENYGLINDPSL 185 (336) T ss_pred cceeeeeeeeeeeEEEEEEEEeeCHHHHHHHHHhCCCcHHHHHHHHHHHHH----HhhCeEEEEeecccceEEEeecCCC Confidence 22333222223333333333445555555443 355544433333333222 1111222211 11221 1 Q ss_pred cceecccccccCCCCCC-hHHHHHHHHHHh-----CC----CCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHH Q lcl|Aclame:pro 135 NKTTLSGADQWSDPTSN-PLPVITDALDSV-----IL----RPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAF 204 (309) Q Consensus 135 ~~~~lsgt~~Wsd~~sd-Pi~di~~~~~~~-----g~----~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~ 204 (309) ...+-+.+..|...+.+ .+.||.+....+ |. .|++++|....+..|.+- + ..|.-=.+. T Consensus 186 ~a~~t~~~~~w~~~T~~eI~~Di~~~~~~l~~qt~g~i~~~~~~tL~Lp~~~~~~L~~~---------n--~~g~tv~~~ 254 (336) T protein:vir:10 186 SAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKT---------N--QYGLSAAAK 254 (336) T ss_pred CcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccCC---------C--ccCccHHHH Confidence 11122233346656655 899999887765 32 477999999999998531 1 112111344 Q ss_pred HHHHhCCCeEEeecc-eeeccccCCCcccceecCCcEEEEecCCCCCCcCcceeccccccc----ccccCCccccccccC Q lcl|Aclame:pro 205 LQELLELDAIYIGEA-RLNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWG----DRVSGSIADPNIGLR 279 (309) Q Consensus 205 l~~l~gl~~I~v~~a-~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~----~~~~~~~~d~~~g~~ 279 (309) |++-+ |.+.+-.+ -+.++ + .+.+.+|.....+.. .-...++-.+. ......+..+....- T Consensus 255 lk~n~--Pnl~i~t~pel~~A--g----------g~~~~~~~~~~~~~~-t~~~~~P~~f~~lpvq~~~~~~~v~~~~rt 319 (336) T protein:vir:10 255 LKEIF--PKLEFVTIPEYDTA--S----------GRLVQLWAPRVEGKD-TATCGFTEKMRAHSIERYSSYFRQKKSAGT 319 (336) T ss_pred HHHhC--CccEEEEccccccc--C----------CceEEEEEecccCCc-ceeeecChhhhccceeecCceeEeccccce Confidence 55532 22222111 11111 1 112223332211100 00111111110 011122233333333 Q ss_pred CceEEEeecccceeeecchhhh Q lcl|Aclame:pro 280 GGQRVRVGESVKELVTAPDLGF 301 (309) Q Consensus 280 g~~~v~v~~~~~~~v~~~~~G~ 301 (309) +|..|+ .|.-++.-.|. T Consensus 320 ~Gv~i~-----rP~ai~~~~GI 336 (336) T protein:vir:10 320 WGAVIF-----RPFAVAQMLGV 336 (336) T ss_pred eeeeee-----ccchheeeccC Confidence 333332 23333333333 No 134 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=40.42 E-value=0.97 Score=20.65 Aligned_cols=287 Identities=11% Similarity=0.023 Sum_probs=102.2 Q ss_pred CCCCCCCcchhhHHHHHh---------------hcchhhhhh-hhCCccccc----cccceeEEechhHhhhchhHhhcc Q lcl|Aclame:pro 1 MSNAPFPIDPELTAIAIA---------------YRNGRMISD-EVLPRVPVG----KQEFKFWKYDLAQGFTVPETLVGR 60 (309) Q Consensus 1 m~~~~f~~dp~LT~~a~~---------------y~n~~~ig~-~lfP~v~v~----~~~~k~~~~~~~~~f~~~~t~~~~ 60 (309) |++-. .||....| ....+|--. .+.+.+.+. ..+.+++..++.++ .-+.+ T Consensus 1 ms~~~-----~~t~~~~~~s~~d~al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~iG~~~~-----~~~~p 70 (335) T protein:vir:78 1 MSFLN-----DLTRPNYAGKNADVDIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRLGNVEA-----KGRRA 70 (335) T ss_pred CCccc-----cccccccccccchhhhhhhhhhhHHHHHHHHhhhhccccceeeeccceeEEEeeeeeeee-----ccccc Confidence 87642 12221111 111122211 223333322 11223344343221 11122 Q ss_pred cccccccccCcCccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhccccc------C-- Q lcl|Aclame:pro 61 KSKPNEVEFSATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSY------A-- 132 (309) Q Consensus 61 ~~~~~~ve~~~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y------~-- 132 (309) |....---....+..+ +.+.-|.....-.++++++..||.+....+..-..+...-...+...+...... + T Consensus 71 G~~l~~~~~~~~k~~i-tID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~ 149 (335) T protein:vir:78 71 GEELERSRVVNDKWNL-TVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDA 149 (335) T ss_pred CcccCCCCcccCCeEE-EecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCC Confidence 2222111112222233 333334444455566677888898877766666555555555554333333221 1 Q ss_pred --cc--cceecccccccCCCCCCh---HHHHHHHHHHhC--CCC------cEEEeCHHHHHHHhcCHHHHHHhccCCCcc Q lcl|Aclame:pro 133 --AG--NKTTLSGADQWSDPTSNP---LPVITDALDSVI--LRP------NIGVLGRRTATILRRHPKIVKAYNGSLGDE 197 (309) Q Consensus 133 --~~--~~~~lsgt~~Wsd~~sdP---i~di~~~~~~~g--~~P------n~~v~~~~~~~~l~~~~~i~~~~~~~~~~~ 197 (309) .+ -...++|+ ....++ ..-+..+.+++- -.| -+++++++.|.+|+.|+++++.-++..... T Consensus 150 ~~~G~~~~~~~tg~----~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~~s~~~ 225 (335) T protein:vir:78 150 FSPGVLEKLDLTGL----TAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQATGAT 225 (335) T ss_pred cCCCcceeeeeccc----cccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhcccccccccccccccc Confidence 11 12233432 223344 333444444432 224 358999999999999999988755432211 Q ss_pred cccCHHHHHHHhCCCeEEeecceeecccc----CCCccccee-cCCcEEEEecCCCCCCcCcceecccccccccccCCcc Q lcl|Aclame:pro 198 GMVPMAFLQELLELDAIYIGEARLNIARP----GQNPNLIRA-WGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIA 272 (309) Q Consensus 198 ~~vt~~~l~~l~gl~~I~v~~a~~~~~~~----g~~~~~~~v-~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~ 272 (309) .......+..+.|++ |+.....-..+-. +...+..+. +...+.+++...--.+..--.......+..+..+++. T Consensus 226 ~~~~~g~v~~v~Gv~-V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~~~i 304 (335) T protein:vir:78 226 NDYVKSRVAILNGVK-VLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHDQFSWVL 304 (335) T ss_pred cccccceeEEeeceE-EEeeccCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEecccceeeccchhhHhh Confidence 123344555666765 3221111000000 000111111 0111223322211000000001111111122222222 Q ss_pred ccccccCCceEEEeecccceeeecchhhhhhhcc Q lcl|Aclame:pro 273 DPNIGLRGGQRVRVGESVKELVTAPDLGFFFENA 306 (309) Q Consensus 273 d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~ 306 (309) +..+-+..+ ..| .+.--+|.=+.+|-+=.-| T Consensus 305 ~~~~a~G~g-~lR--Pe~a~~i~~tg~~~~~~~~ 335 (335) T protein:vir:78 305 DTFQMYNIG-ARR--PDTAGAIELKGIEAFDITA 335 (335) T ss_pred hHHHHcCCc-ccC--cceEEEEEecCCCcccccC Confidence 222111111 111 1111111111122111111 No 135 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=37.57 E-value=1.1 Score=20.33 Aligned_cols=269 Identities=14% Similarity=0.085 Sum_probs=109.3 Q ss_pred CC-----C-CCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCcc Q lcl|Aclame:pro 1 MS-----N-APFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDE 74 (309) Q Consensus 1 m~-----~-~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~ 74 (309) |+ . ...++..+.+.|..--++..-| ..|++.+|+.....+||+..... ....-++.++...+.++.+.+. T Consensus 151 ~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i-~~l~~~~~~~~~~~~~~~~~~~~---~~a~wv~E~~~~~~s~~~f~~i 226 (497) T protein:vir:10 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSL-ADLISSRPVTSPNLSYLTESAAH---NNAAAVAEAGTYPFSSEEFARV 226 (497) T ss_pred hhcccCcccccccchhhhHHHHHHHHhhhhH-HhhccccccCCCceEEEEEcCCC---CcceeeccCcccccccccceee Confidence 21 1 1233333333433222222223 35567888887777777643211 1111234455555556666665 Q ss_pred ceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHH--hhccccc---Cccccee----------- Q lcl|Aclame:pro 75 TGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKL--VFSPNSY---AAGNKTT----------- 138 (309) Q Consensus 75 ~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~--~~~~~~y---~~~~~~~----------- 138 (309) ++..+..+-..+++++-.++++ +....-.+.+.+.|....+..+-.. .-+...+ +...... T Consensus 227 ~~~~~k~a~~~~iS~ell~d~~---~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~ 303 (497) T protein:vir:10 227 YEQVGKVANALTITDEGLRDAP---ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSA 303 (497) T ss_pred EeeeeeeEeecHhHHHHHHhHH---HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhh Confidence 6655555545567776655443 1344444556666665555443211 0000000 0000000 Q ss_pred ------c--cccc----------------------ccC-------CCCCChHHHHHHHHHHh----CCCCcEEEeCHHHH Q lcl|Aclame:pro 139 ------L--SGAD----------------------QWS-------DPTSNPLPVITDALDSV----ILRPNIGVLGRRTA 177 (309) Q Consensus 139 ------l--sgt~----------------------~Ws-------d~~sdPi~di~~~~~~~----g~~Pn~~v~~~~~~ 177 (309) + .++. .|. ....|-+.++..+...+ ++.|+..+|++..| T Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~ 383 (497) T protein:vir:10 304 TVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDW 383 (497) T ss_pred hhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHH Confidence 0 0000 000 01123333444443322 56788999999999 Q ss_pred HHHhcCHHHHHHhccCCCcc-------cccC--HHHHHHHhCCCeEEeecceeeccccCCCcccceecCC---cEEEEec Q lcl|Aclame:pro 178 TILRRHPKIVKAYNGSLGDE-------GMVP--MAFLQELLELDAIYIGEARLNIARPGQNPNLIRAWGP---HASFIYR 245 (309) Q Consensus 178 ~~l~~~~~i~~~~~~~~~~~-------~~vt--~~~l~~l~gl~~I~v~~a~~~~~~~g~~~~~~~v~~~---~~~L~~~ 245 (309) .+|+. ++..++.. +... ...-..+||+| |++-++. +.++ .++++ ..++++. T Consensus 384 ~~l~~-------lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~p-V~~t~~~-----~~~~----~~~Gd~~~~~~~i~~ 446 (497) T protein:vir:10 384 ELLRL-------TKDANGQYMGGNFFGNAYGNPVNGGKNIWGVP-VVTTPLI-----PLGT----ILVGHFAPSVIQTAR 446 (497) T ss_pred HHHHH-------hhcCCCceeccCcccccccccccCCceeecee-eEecCCC-----CCCc----eEEeecccceEEEEE Confidence 88753 22222211 0000 00112456766 3322221 1010 12121 1111111 Q ss_pred CCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhccccC Q lcl|Aclame:pro 246 DRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVAA 309 (309) Q Consensus 246 ~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va~ 309 (309) . . |.+..+.. .....-..+...+|+...++-.|.-|++-..++-.-++ T Consensus 447 r-~---------~~~v~~~~------~~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~ 494 (497) T protein:vir:10 447 R-E---------GVTMQMTN------SNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) T ss_pred e-c---------ccEEEeec------ccchhhhcCcEEEEEEEeecceeeccccEEEEEecCCc Confidence 0 0 11111110 00011123444567777777777777665555443333 No 136 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=37.57 E-value=1.1 Score=20.33 Aligned_cols=269 Identities=14% Similarity=0.085 Sum_probs=109.3 Q ss_pred CC-----C-CCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCcc Q lcl|Aclame:pro 1 MS-----N-APFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDE 74 (309) Q Consensus 1 m~-----~-~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~ 74 (309) |+ . ...++..+.+.|..--++..-| ..|++.+|+.....+||+..... ....-++.++...+.++.+.+. T Consensus 151 ~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i-~~l~~~~~~~~~~~~~~~~~~~~---~~a~wv~E~~~~~~s~~~f~~i 226 (497) T protein:vir:78 151 NPFGSTGTFAPGILPTFLPGIVEQLFYELSL-ADLISSRPVTSPNLSYLTESAAH---NNAAAVAEAGTYPFSSEEFARV 226 (497) T ss_pred hhcccCcccccccchhhhHHHHHHHHhhhhH-HhhccccccCCCceEEEEEcCCC---CcceeeccCcccccccccceee Confidence 21 1 1233333333433222222223 35567888887777777643211 1111234455555556666665 Q ss_pred ceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHH--hhccccc---Cccccee----------- Q lcl|Aclame:pro 75 TGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKL--VFSPNSY---AAGNKTT----------- 138 (309) Q Consensus 75 ~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~--~~~~~~y---~~~~~~~----------- 138 (309) ++..+..+-..+++++-.++++ +....-.+.+.+.|....+..+-.. .-+...+ +...... T Consensus 227 ~~~~~k~a~~~~iS~ell~d~~---~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~ 303 (497) T protein:vir:78 227 YEQVGKVANALTITDEGLRDAP---ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSA 303 (497) T ss_pred EeeeeeeEeecHhHHHHHHhHH---HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhh Confidence 6655555545567776655443 1344444556666665555443211 0000000 0000000 Q ss_pred ------c--cccc----------------------ccC-------CCCCChHHHHHHHHHHh----CCCCcEEEeCHHHH Q lcl|Aclame:pro 139 ------L--SGAD----------------------QWS-------DPTSNPLPVITDALDSV----ILRPNIGVLGRRTA 177 (309) Q Consensus 139 ------l--sgt~----------------------~Ws-------d~~sdPi~di~~~~~~~----g~~Pn~~v~~~~~~ 177 (309) + .++. .|. ....|-+.++..+...+ ++.|+..+|++..| T Consensus 304 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~ 383 (497) T protein:vir:78 304 TVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDW 383 (497) T ss_pred hhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHH Confidence 0 0000 000 01123333444443322 56788999999999 Q ss_pred HHHhcCHHHHHHhccCCCcc-------cccC--HHHHHHHhCCCeEEeecceeeccccCCCcccceecCC---cEEEEec Q lcl|Aclame:pro 178 TILRRHPKIVKAYNGSLGDE-------GMVP--MAFLQELLELDAIYIGEARLNIARPGQNPNLIRAWGP---HASFIYR 245 (309) Q Consensus 178 ~~l~~~~~i~~~~~~~~~~~-------~~vt--~~~l~~l~gl~~I~v~~a~~~~~~~g~~~~~~~v~~~---~~~L~~~ 245 (309) .+|+. ++..++.. +... ...-..+||+| |++-++. +.++ .++++ ..++++. T Consensus 384 ~~l~~-------lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~p-V~~t~~~-----~~~~----~~~Gd~~~~~~~i~~ 446 (497) T protein:vir:78 384 ELLRL-------TKDANGQYMGGNFFGNAYGNPVNGGKNIWGVP-VVTTPLI-----PLGT----ILVGHFAPSVIQTAR 446 (497) T ss_pred HHHHH-------hhcCCCceeccCcccccccccccCCceeecee-eEecCCC-----CCCc----eEEeecccceEEEEE Confidence 88753 22222211 0000 00112456766 3322221 1010 12121 1111111 Q ss_pred CCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhccccC Q lcl|Aclame:pro 246 DRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVAA 309 (309) Q Consensus 246 ~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va~ 309 (309) . . |.+..+.. .....-..+...+|+...++-.|.-|++-..++-.-++ T Consensus 447 r-~---------~~~v~~~~------~~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~ 494 (497) T protein:vir:78 447 R-E---------GVTMQMTN------SNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGA 494 (497) T ss_pred e-c---------ccEEEeec------ccchhhhcCcEEEEEEEeecceeeccccEEEEEecCCc Confidence 0 0 11111110 00011123444567777777777777665555443333 No 137 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=34.46 E-value=1.3 Score=19.98 Aligned_cols=280 Identities=13% Similarity=0.027 Sum_probs=97.6 Q ss_pred CCCC-----CCCcchhhHHHHHhhcchhhhhhhh-CCccccccccceeEEechhHhhhchhHhhcccccccccccCcCcc Q lcl|Aclame:pro 1 MSNA-----PFPIDPELTAIAIAYRNGRMISDEV-LPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDE 74 (309) Q Consensus 1 m~~~-----~f~~dp~LT~~a~~y~n~~~ig~~l-fP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~ 74 (309) ++.+ ..++...-+.|..--++..-+ ..+ .-.+|+.....++|++...... .-++.++.....+.++.+. T Consensus 66 ~~~~~~~Gg~lvP~~~~~~ii~~l~~~s~l-~~lg~~~v~~~~g~~~~p~~t~~~~a----~wv~E~~~~~~s~~~f~~i 140 (366) T protein:vir:57 66 ISTAAGSGGALIPQNMQNEVIELLRDRTVV-RILGARSIPLPNGNLSMPRLSGGATA----GYVGEGKDVVATGATFDDV 140 (366) T ss_pred ccccccCCccccchhHHHHHHHHHhhhcch-hhhceeeeecCCCceEEEEEeCCcce----eeeccCccccccccceeEE Confidence 1111 123333222222111111111 122 1123444445677776432111 1234444444445555555 Q ss_pred ceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecc------cccccCCC Q lcl|Aclame:pro 75 TGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLS------GADQWSDP 148 (309) Q Consensus 75 ~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~ls------gt~~Wsd~ 148 (309) ++..+..+-..+++++-..+. .++.+..-.+.+.+.+.+..+.. ++++..-+...+..+. .+..++. T Consensus 141 ~~~~~k~~~~~~iS~ell~ds--~~~~~~~i~~~l~~a~~~~~d~a----~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~- 213 (366) T protein:vir:57 141 KLSAKTMIALVPVSNQLIGRA--GFNVEQLLLGDILSAIATREDKA----FLRDDGTGDTPKGMKAVATAANRLVAWTG- 213 (366) T ss_pred EEeeEEEEEeehhhHHHHhhh--hHHHHHHHHHHHHHHHHHHHHHH----hhccCCCCccccceeeccccccceeeccc- Confidence 555555555556666655433 23444444455666655544432 2222110101111110 0111211 Q ss_pred CCChHHHHHHHHHHh---------CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecc Q lcl|Aclame:pro 149 TSNPLPVITDALDSV---------ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEA 219 (309) Q Consensus 149 ~sdPi~di~~~~~~~---------g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a 219 (309) ++.-..+++.+.+.+ .+.....+|++..|.+|+. ++ ..++.. +.....-..|+|+| |++-+. T Consensus 214 t~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~---lk----d~~G~~-l~~~~~~g~l~G~P-vv~s~~ 284 (366) T protein:vir:57 214 TAINLTTIDEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLFG---LR----DGNGNK-VYPEMSQGILKGYP-IQRTSA 284 (366) T ss_pred cccchhhHHHHHHHHHHhhhccccccccCEEEecHHHHHHHHh---hh----ccCCce-eccCCCCCeeccee-eEEccc Confidence 111122222222211 2345567999999988765 22 122211 11111112467887 444332 Q ss_pred eeeccccCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchh Q lcl|Aclame:pro 220 RLNIARPGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDL 299 (309) Q Consensus 220 ~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~ 299 (309) .-.....+. ....-++++-.-++.....+-.. ..+--.+|.- ..+... ..-..+...+|+.+.++-.+.-+.+ T Consensus 285 ip~~~~~~~-~~~~i~~gdfs~~~i~~~~~i~i-~~~~ea~~~~---~~g~~~--~~f~~~~~~iR~~~~~d~~v~~~~a 357 (366) T protein:vir:57 285 IPANLGDDG-NESEIYFCDFNDVVIGEDGMMKV-DFSTEATYKD---ADGQLV--SAFARNQSLIRVVTEHDIGFRHPEG 357 (366) T ss_pred cccccccCC-CccEEEEEecceEEEEEecceEE-EEeecccccc---ccccch--hhhhcCceeEEeeeeeCcEeecccc Confidence 211100011 11112233321111111111000 0000001110 001111 0112334456666655544443333 Q ss_pred hhhhhcccc Q lcl|Aclame:pro 300 GFFFENAVA 308 (309) Q Consensus 300 G~l~~~~va 308 (309) -.+++++.= T Consensus 358 ~~~lt~~~~ 366 (366) T protein:vir:57 358 LVLGTGVIW 366 (366) T ss_pred EEEEecccC Confidence 333333222 No 138 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=30.63 E-value=1.6 Score=19.52 Aligned_cols=285 Identities=13% Similarity=0.064 Sum_probs=106.6 Q ss_pred CCCC-CC-------Ccch---hhHHHHHhhcchhhhhhhhC-Ccccc----ccccceeEEechhHhhhchhHhhcccccc Q lcl|Aclame:pro 1 MSNA-PF-------PIDP---ELTAIAIAYRNGRMISDEVL-PRVPV----GKQEFKFWKYDLAQGFTVPETLVGRKSKP 64 (309) Q Consensus 1 m~~~-~f-------~~dp---~LT~~a~~y~n~~~ig~~lf-P~v~v----~~~~~k~~~~~~~~~f~~~~t~~~~~~~~ 64 (309) |++. +. ..|. .+-.+-.|.....|--.++| |.+.+ ...+.+++..+...... ..++... T Consensus 7 ~~~~~~~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~r~i~~G~tv~i~~ig~~~~~~-----~~~g~~l 81 (332) T protein:vir:78 7 FSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTGKLSAGY-----HTPGTPI 81 (332) T ss_pred ccCCccccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhccccccccccceEEEEeccceeEee-----ecCCCCC Confidence 4431 22 1110 00011111111222222222 32221 12223344444332111 1112111 Q ss_pred cc-cccCcCccceeeecc-chhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCc-------cc Q lcl|Aclame:pro 65 NE-VEFSATDETGSTEDH-GLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAA-------GN 135 (309) Q Consensus 65 ~~-ve~~~~~~~~~~~e~-~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~-------~~ 135 (309) .. -..+..+..+.+.+. -....| .+.++++..+|......+.....+....+..++..+........ +. T Consensus 82 ~~~~~~~~~~~~l~ID~~ky~~~~V--ddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~~~g~~ 159 (332) T protein:vir:78 82 VGDAGIKANEKTLVMDDLLVSSQFV--YSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPGGF 159 (332) T ss_pred CCCCCCCCceEEEEEehhhhhHHHH--HhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCccccccccc Confidence 11 123334444444332 122233 35567777888888777777777777777777666555433211 12 Q ss_pred ceecccccccCCCCCChHHHHHHHHHHh---CCCC--c-EEEeCHHHHHHHhc--CHHHHHHhccCCCcccccCHHHHHH Q lcl|Aclame:pro 136 KTTLSGADQWSDPTSNPLPVITDALDSV---ILRP--N-IGVLGRRTATILRR--HPKIVKAYNGSLGDEGMVPMAFLQE 207 (309) Q Consensus 136 ~~~lsgt~~Wsd~~sdPi~di~~~~~~~---g~~P--n-~~v~~~~~~~~l~~--~~~i~~~~~~~~~~~~~vt~~~l~~ 207 (309) ...++++...+. .+-+.-|.++...+ .+ | + .+++++..|..|+. |+++.++..+... .....-..+.. T Consensus 160 ~~~~~~~~~~~~--~~~~~~i~~a~~~Lde~~V-P~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~-~~~~~g~~i~~ 235 (332) T protein:vir:78 160 HVNIGAGNTNDA--QAIVDGFFEAAAVLDERSA-PQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQ-GDMNSGKGLYS 235 (332) T ss_pred ccccCCccccCH--HHHHHHHHHHHHHHhhcCC-CccCCEEEeCHHHHHHHHhhcCceeeeeeccccc-cceecceeeeE Confidence 233443322210 11223344444433 33 4 2 58999999999987 7877776554322 12232233556 Q ss_pred HhCCCeEEeecceeeccc--------cCCCcccceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccC Q lcl|Aclame:pro 208 LLELDAIYIGEARLNIAR--------PGQNPNLIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLR 279 (309) Q Consensus 208 l~gl~~I~v~~a~~~~~~--------~g~~~~~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~ 279 (309) +.|++ |+.-...-++.. .+....+.--+.+.+.|+|.+.-.+... ..+++-+-. .+-+.+.+. T Consensus 236 i~G~~-V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~--~~~~~~~~t---~~~~~~~~~--- 306 (332) T protein:vir:78 236 IAGIR-ILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQ--SVAPTIQTT---SGDFNVQYQ--- 306 (332) T ss_pred EeeeE-EEecCccccCcccccccccccccccccccccccceEEeecccceeeee--eeccchhhh---hcccchhhh--- Confidence 66655 222111111100 0000000000123334444332111000 011111100 000011110 Q ss_pred CceEEEeecccceeeecchhhhhhhcc Q lcl|Aclame:pro 280 GGQRVRVGESVKELVTAPDLGFFFENA 306 (309) Q Consensus 280 g~~~v~v~~~~~~~v~~~~~G~l~~~~ 306 (309) +..|+....+=..+.-|++.-.|.-+ T Consensus 307 -~d~i~~~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 307 -GDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred -HhhhhhhhhhcCceecccceEEEeeC Confidence 12233333333344444444444433 No 139 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=29.45 E-value=1.7 Score=19.38 Aligned_cols=279 Identities=13% Similarity=0.065 Sum_probs=108.7 Q ss_pred CCC-----C-CCCcchhhHHHHHhh--cchhhhhhhhCCccccccccceeEEec--hhHhhhchhHhhcccccccccccC Q lcl|Aclame:pro 1 MSN-----A-PFPIDPELTAIAIAY--RNGRMISDEVLPRVPVGKQEFKFWKYD--LAQGFTVPETLVGRKSKPNEVEFS 70 (309) Q Consensus 1 m~~-----~-~f~~dp~LT~~a~~y--~n~~~ig~~lfP~v~v~~~~~k~~~~~--~~~~f~~~~t~~~~~~~~~~ve~~ 70 (309) |+. . ....|.. ..-.+-. +..+++ .++|..+|....+.|..-. ....|+.-+++... ....+.... T Consensus 1 mpaltLaea~k~~~d~l-~~~ViE~~~~~s~lL--~~LpF~~veg~~~~ynR~~~~~~~~~~~v~~~~~~-~g~~~~~~t 76 (310) T protein:vir:97 1 MASVTLAESAKLAQDEL-VAGVIENIITVNRMF--DVLPFDSIEGNSLAYNRENVLGDVIMAGVGTTFSG-AGAGKAAAT 76 (310) T ss_pred CcccchHHHhhcCcchH-HHHHHHHHhccchHH--HhCCcccccCCcceeeEeeccCCcccccccccccC-CCccccccc Confidence 551 1 1222222 1111111 112223 5578777776555544332 12234321211111 111222333 Q ss_pred cCccceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhccc----ccCcccceecccccccC Q lcl|Aclame:pro 71 ATDETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPN----SYAAGNKTTLSGADQWS 146 (309) Q Consensus 71 ~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~----~y~~~~~~~lsgt~~Ws 146 (309) +..+++.|.--+=+..|+.+-.+.... ++.....+.+...++..+|..-..++-.+. -++-.|..+. +..-+ T Consensus 77 ~~~~~~~L~i~~g~~~Vd~~i~dl~~~--~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n~F~GL~~~~~~--~q~i~ 152 (310) T protein:vir:97 77 FTKVNSNLTTIMGDAEVNGLIQATRSG--DGNDQTAVQIASKAKSAGRKYQDQLINGNGAGNEFAGLIQLCAS--GQKAT 152 (310) T ss_pred cceeeeeeeeeeehhhhhhHHHhhhcC--ChHHHHHHHHHHHHHHHHHHHHHHhhccccCCCcccchhhcCCc--cceee Confidence 455566555444333444332221101 121111222333333444433333333111 1122233222 22221 Q ss_pred C-CCCCh--HHHHHHHHHHh---CCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCH--H----HHHHHhCCCeE Q lcl|Aclame:pro 147 D-PTSNP--LPVITDALDSV---ILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPM--A----FLQELLELDAI 214 (309) Q Consensus 147 d-~~sdP--i~di~~~~~~~---g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~--~----~l~~l~gl~~I 214 (309) - .++-| +.|++++++.+ +-.|..++|+++..++++.- .....+ .|+... + ++-.+-|+| | T Consensus 153 ~~~~gg~~t~d~LDeLl~~v~~~~g~p~~~l~~~~~~r~i~A~---~R~~~~----~g~~~~~~~~~G~~v~~~~GiP-i 224 (310) T protein:vir:97 153 TGATGSAISFAILDELMDLVVDKDGQVDYLTMHARTLRSYKAL---LRALGG----ASINEVVELPSGAEVPAYSGTP-I 224 (310) T ss_pred cCCCCCCCCHHHHHHHHHHHhcCCCCCCEEEecHHHHHHHHHH---HHHhcC----CCCCCccccCCCCEEeeeCCeE-E Confidence 1 11222 48999999987 34799999999865443321 111111 121111 1 111222444 2 Q ss_pred EeecceeeccccCCCcccceecCCcEEEEecCCCCCC--cCcceecccccccccccCCccccccc---cCCceEEEeecc Q lcl|Aclame:pro 215 YIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLADT--RNGTTFGLTAQWGDRVSGSIADPNIG---LRGGQRVRVGES 289 (309) Q Consensus 215 ~v~~a~~~~~~~g~~~~~~~v~~~~~~L~~~~~~~~~--~~~~t~G~T~~~~~~~~~~~~d~~~g---~~g~~~v~v~~~ 289 (309) +..+-.-....++.. .+..-+|.-..|.. ..|.+ |.+-. + ...+.....| +.+...++|.+. T Consensus 225 ~~~d~ip~~~~~~~~--------~gtTsIya~r~Ge~~~~~Gv~-Gl~~~-~---~~glsVr~~G~~~~~~v~~~~V~~Y 291 (310) T protein:vir:97 225 FRNDYIPTNQTKGGT--------TGCTTIFAGTLDDGSRTHGIA-GLTAT-Q---AAGIQVVDVGESEDSDEHIWRVKWY 291 (310) T ss_pred EEeCccCCCcccccc--------CCceeEEEEeeCcccccccee-ccccC-C---ccceeEEeCCcccCCcceeEEEEEe Confidence 222111000000000 11111111112211 01111 22110 0 1123333334 445678999999 Q ss_pred cceeeecchhhhhhhcccc Q lcl|Aclame:pro 290 VKELVTAPDLGFFFENAVA 308 (309) Q Consensus 290 ~~~~v~~~~~G~l~~~~va 308 (309) +.-.+..+++.-.|+|+.= T Consensus 292 ~~~av~~~~A~a~L~~V~~ 310 (310) T protein:vir:97 292 CGLALFSEKGLACADGITN 310 (310) T ss_pred eeEEEecccceeeeccccC Confidence 9988988988888888888 No 140 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=29.22 E-value=1.7 Score=19.35 Aligned_cols=255 Identities=11% Similarity=0.037 Sum_probs=109.6 Q ss_pred CC------CCCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccc-cccCcCc Q lcl|Aclame:pro 1 MS------NAPFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNE-VEFSATD 73 (309) Q Consensus 1 m~------~~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~-ve~~~~~ 73 (309) ++ ....++..+.+.|-..-++..-| -.+++.+||...+++|+...... ....-++-++.... -...+.. T Consensus 111 ~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l-~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~E~~~~~~~~~~~~~~ 186 (394) T protein:vir:10 111 AGHVTSTEAGVLIPEEIIYDPTAEVNSVVDL-STLVTKTPVTTPKGTYPILKRAT---DRFSSVAELAENPALAEPEFEQ 186 (394) T ss_pred hcccccccCceeccHHHHHHHHHHHHhhhhh-hhhceeeeccCCceEEEEEecCC---Ccccccccccccccccccccee Confidence 11 12345555555543222222212 34566778877777877654321 11112333443332 2345555 Q ss_pred cceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCChH Q lcl|Aclame:pro 74 ETGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPL 153 (309) Q Consensus 74 ~~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sdPi 153 (309) .++..+..+-..+++++-.+++ .++....-.+.+.+.+....+..+....-+ ...... .+...+ T Consensus 187 v~l~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~la~~~~~~~~~~il~g~g~------~~~~~~--------~~~~~~ 250 (394) T protein:vir:10 187 VDWSVSTYRGAIPLSEEAIADS--AVDLTSLVGQSINEKSVNTYNAMIAPVLQS------FTAKAT--------TTDTLV 250 (394) T ss_pred EEeeeeeeEeeehhHHHHHhhh--hHHHHHHHHHHHHHHHHHHHHHHHhhcccc------cccccc--------cccccH Confidence 5555555555556666655543 344455555566666666555443322211 000111 112223 Q ss_pred HHHHHHHHH---hCCCCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHH--------HHHHHhCCCeEEeecceee Q lcl|Aclame:pro 154 PVITDALDS---VILRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMA--------FLQELLELDAIYIGEARLN 222 (309) Q Consensus 154 ~di~~~~~~---~g~~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~--------~l~~l~gl~~I~v~~a~~~ 222 (309) .+|...... ... -...+|+++.|..|+. ++..++.+ +..+. .=..|||+|-+++ +..+- T Consensus 251 d~l~~~~~~~~~~~~-~a~~vmn~~~~~~l~~-------lkd~~G~~-i~~~~~~~~~~~~~~~~L~G~PV~~~-~~~~~ 320 (394) T protein:vir:10 251 DSLKHILNVDLDPAY-SRALVVTQSLFNTLDT-------LKDKNGRY-LLHDASDSITDGTAKGTVLGVPVYVV-GDALL 320 (394) T ss_pred HHHHHHHHhhhhhhc-cCEEEecHHHHHHHHH-------hhccCCCe-eeeccccccccCCcccccccceeEEe-ccccc Confidence 334333321 123 3579999999999774 22222221 11110 1134788884433 22211 Q ss_pred ccccCCCcccceecCC--cEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhh Q lcl|Aclame:pro 223 IARPGQNPNLIRAWGP--HASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLG 300 (309) Q Consensus 223 ~~~~g~~~~~~~v~~~--~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G 300 (309) . ....+..-++++ +.++++.. =|.+.++... . ...+.+++...++-.+.-+++- T Consensus 321 ~---~~~~~~~i~~gd~s~~~~~~~~----------~~~~v~~~~~-------~----~~~~~~~~~~r~d~~~~~~~ai 376 (394) T protein:vir:10 321 G---SAAGDQKAFVGDLKRGVLFADR----------QQVTLAWEDS-------K----IYGRYLGAAFRFGVKQADSNAG 376 (394) T ss_pred C---CCCCceEEEEeeccccEEEEee----------cceEEEEecc-------c----ccceeEEEEEEeccEEeccccE Confidence 1 111111112222 11112110 0112221110 0 0112355666667777777777 Q ss_pred hhhhccccC Q lcl|Aclame:pro 301 FFFENAVAA 309 (309) Q Consensus 301 ~l~~~~va~ 309 (309) .+++-.-++ T Consensus 377 ~~~~~~~~~ 385 (394) T protein:vir:10 377 YFVTNTDAA 385 (394) T ss_pred EEEEeeccc Confidence 776544333 No 141 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=27.75 E-value=1.8 Score=19.17 Aligned_cols=284 Identities=12% Similarity=0.013 Sum_probs=102.2 Q ss_pred CCCCCCCcchhhHHHHHhhcch--------hhhhhhh---------CCcccccc-ccce---eEEechhHhhhchhHhhc Q lcl|Aclame:pro 1 MSNAPFPIDPELTAIAIAYRNG--------RMISDEV---------LPRVPVGK-QEFK---FWKYDLAQGFTVPETLVG 59 (309) Q Consensus 1 m~~~~f~~dp~LT~~a~~y~n~--------~~ig~~l---------fP~v~v~~-~~~k---~~~~~~~~~f~~~~t~~~ 59 (309) |++.+ +.-..|+-..|..+. .|+|+.+ .+.+.+.. .+++ ++..++... .-+. T Consensus 1 m~~~~--~~~~~t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r~i~~G~sv~i~~iG~~tv-----~~~t 73 (347) T protein:vir:94 1 MANVP--GQKIGTDQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHIVRTIQNGKSAQFPVMGRTSG-----VYLA 73 (347) T ss_pred CCCCC--ccccccccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccccceEEEecccceee-----eeec Confidence 88764 222234433332221 1343322 22222221 1222 333333221 1122 Q ss_pred ccccc--cccccCcCccceeeeccch-hhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-Cccc Q lcl|Aclame:pro 60 RKSKP--NEVEFSATDETGSTEDHGL-DAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSY-AAGN 135 (309) Q Consensus 60 ~~~~~--~~ve~~~~~~~~~~~e~~L-~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y-~~~~ 135 (309) ++.+. +.-.....+..+.+.+.-. ...|+ +.++++..+|......+.....|.......++..+....+. +..+ T Consensus 74 ~G~~l~~~~~~~~~~e~~itID~~~~~~~~Vd--diD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~ 151 (347) T protein:vir:94 74 PGERLSDKRKGIKHTEKVITIDGLLTADVMIF--DIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASN 151 (347) T ss_pred CCCCcCCCCCCCCcceEEEEecchhhhhHHhh--hHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 33322 1112333444444332211 22343 45566777888877777666666666655554322111111 1111 Q ss_pred ceec--ccccccC----CCCCChH-------HHHHHHHHHh---CC--CCcEEEeCHHHHHHHhcCHHHHHHhccCCCcc Q lcl|Aclame:pro 136 KTTL--SGADQWS----DPTSNPL-------PVITDALDSV---IL--RPNIGVLGRRTATILRRHPKIVKAYNGSLGDE 197 (309) Q Consensus 136 ~~~l--sgt~~Ws----d~~sdPi-------~di~~~~~~~---g~--~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~ 197 (309) .... .+...-+ ..+.+|. ..|.+++..+ .+ ..-.+|+++..|..|+.|+.+...-..... T Consensus 152 ~~~~g~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~~~~~~~-- 229 (347) T protein:vir:94 152 ENIAGLGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAANYAALI-- 229 (347) T ss_pred cccCCCcccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhhhccccc-- Confidence 1110 0011110 1122332 2333333333 22 234899999999999999887776543321 Q ss_pred cccCHHHHHHHhCCCeEEeecceee-c---cccCCCccc----ce---------e---cCCcEEEEecCCCCCCcCccee Q lcl|Aclame:pro 198 GMVPMAFLQELLELDAIYIGEARLN-I---ARPGQNPNL----IR---------A---WGPHASFIYRDRLADTRNGTTF 257 (309) Q Consensus 198 ~~vt~~~l~~l~gl~~I~v~~a~~~-~---~~~g~~~~~----~~---------v---~~~~~~L~~~~~~~~~~~~~t~ 257 (309) .+..-.+..++|++-+. ....-. . ...++.... .. + +.+.+.|++.+.-.+ T Consensus 230 -~~~~G~Vg~i~G~~V~~-Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~------- 300 (347) T protein:vir:94 230 -DPETGNIRNVMGFVVVE-VPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVG------- 300 (347) T ss_pred -cccccceEEEeceEEEe-cCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhh------- Confidence 12223455667765222 111000 0 000000000 00 0 111222332221111 Q ss_pred cccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhccccC Q lcl|Aclame:pro 258 GLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVAA 309 (309) Q Consensus 258 G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va~ 309 (309) |.+.-........++ ..=++.|+....+=..+.-|+|.-.|+=..|= T Consensus 301 --~v~~~~~~~e~~r~~---~~~~d~i~~~~~~G~~~~rP~~a~~~~~~~A~ 347 (347) T protein:vir:94 301 --TVKLRDLALERDRDV---DAQGDLIVGKYAMGHGGLRPEAAGALVFSPAE 347 (347) T ss_pred --hhhcccccccchhch---hhHHHHhhhhhhhcCcccccceeEEEEecCCC Confidence 111100000000100 00122333333333444444443333222111 No 142 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=24.46 E-value=2.2 Score=18.73 Aligned_cols=294 Identities=12% Similarity=0.039 Sum_probs=103.6 Q ss_pred CCCC--CCCcchhhH----HHHH----hhcchhhh-hhhhCCccccc----cccceeEEechhHhhhchhHhhccccccc Q lcl|Aclame:pro 1 MSNA--PFPIDPELT----AIAI----AYRNGRMI-SDEVLPRVPVG----KQEFKFWKYDLAQGFTVPETLVGRKSKPN 65 (309) Q Consensus 1 m~~~--~f~~dp~LT----~~a~----~y~n~~~i-g~~lfP~v~v~----~~~~k~~~~~~~~~f~~~~t~~~~~~~~~ 65 (309) |+.. ...+.---+ ++-+ |-...+|. +..+.|.+.+. ..+.+++..++.++ .-+.+|.... T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~~si~~~~~~vrti~~GkS~qf~~iG~~~a-----~y~~~G~~ld 75 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETEL-----QVLAPGQSPN 75 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEEEEEeeeEE-----eeeccccccC Confidence 7753 111110000 1111 11122232 23444554432 22234444444321 0112222222 Q ss_pred ccccCcCccceeeeccchhhcCCHHHHHHHhhcCC-HHHHHHHHHHHHHHHHHHHHHHHHhhccc---------ccCccc Q lcl|Aclame:pro 66 EVEFSATDETGSTEDHGLDAPVPQADIDNAPTNYN-PLGHATEQTTNLILLDREARTSKLVFSPN---------SYAAGN 135 (309) Q Consensus 66 ~ve~~~~~~~~~~~e~~L~~~v~~~~~~~a~~~~d-~~~~av~~l~~~i~~~~E~~~a~~~~~~~---------~y~~~~ 135 (309) ..... ..+..-+.+.-|-....--++++++.-|| .+....+..-..+.......+.+.+...+ +.+..+ T Consensus 76 g~~~~-~~k~~ItID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~~~ 154 (402) T protein:vir:97 76 ATPTQ-ADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKG 154 (402) T ss_pred CCCcc-cccEEEEeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCcccc Confidence 21112 22332333333444444456667788888 56555444444444444444444332211 111111 Q ss_pred ---ceecccccccCCCCCChHHHH---HHHHHHh--CCCC---cEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHH Q lcl|Aclame:pro 136 ---KTTLSGADQWSDPTSNPLPVI---TDALDSV--ILRP---NIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAF 204 (309) Q Consensus 136 ---~~~lsgt~~Wsd~~sdPi~di---~~~~~~~--g~~P---n~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~ 204 (309) ...++++ -+++.-||..-+ .++...+ .--| -++++++..|.+|+.|++++++-++..+ .+...... T Consensus 155 ~g~s~~~~~t--~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl~n~d~~~~~-~g~~~~G~ 231 (402) T protein:vir:97 155 HGFSINVNVT--ESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQ-SGATINGF 231 (402) T ss_pred cccccccccc--cchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcccccchhhcccc-CCccccce Confidence 1111111 122223454333 2333333 1112 3799999999999999999887654222 23333444 Q ss_pred HHHHhCCCeEEeecceeeccc--------cCCCcccceecCC---cEEEEecCCCCCCcCcceecccc--cccccccCCc Q lcl|Aclame:pro 205 LQELLELDAIYIGEARLNIAR--------PGQNPNLIRAWGP---HASFIYRDRLADTRNGTTFGLTA--QWGDRVSGSI 271 (309) Q Consensus 205 l~~l~gl~~I~v~~a~~~~~~--------~g~~~~~~~v~~~---~~~L~~~~~~~~~~~~~t~G~T~--~~~~~~~~~~ 271 (309) +..+.|++ |+.-...-+.+. ....+.-..+-++ .+.++|.+.-..+.. +-..|- .|..+..++. T Consensus 232 v~~v~Gv~-Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk--~~~vT~~~~~d~r~~~~~ 308 (402) T protein:vir:97 232 VLSSYNCP-VIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGR--TIEVTGDIFYEKKEKTYY 308 (402) T ss_pred eEEEeceE-EEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEE--eeccccchhhchhHHHHH Confidence 44455655 222111111000 0000111111111 133344332111110 001110 1122222222 Q ss_pred cccccccCCceEEEeecccceee------ecchhhhhhhccccC Q lcl|Aclame:pro 272 ADPNIGLRGGQRVRVGESVKELV------TAPDLGFFFENAVAA 309 (309) Q Consensus 272 ~d~~~g~~g~~~v~v~~~~~~~v------~~~~~G~l~~~~va~ 309 (309) .+.++-...+- -..+.--+| +..|+|=+.++.++- T Consensus 309 id~~~a~G~g~---~RPeaa~vv~~~~~~t~~~~~~~~~~~~~~ 349 (402) T protein:vir:97 309 IDTFMAEGAIP---DRWEAVSVVTTKRDATTGDAGGPGDDHATV 349 (402) T ss_pred HHHHHHhCCcc---cCccceEEEEEecccccccCCccccchhhh Confidence 22221111100 011111112 333444454444332 No 143 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=24.21 E-value=2.2 Score=18.70 Aligned_cols=256 Identities=6% Similarity=-0.023 Sum_probs=102.3 Q ss_pred CCC------CCCCcchhhHHHHHhhcchhhhhhhhCCccccccccceeEEechhHhhhchhHhhcccccccccccCcCcc Q lcl|Aclame:pro 1 MSN------APFPIDPELTAIAIAYRNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEFSATDE 74 (309) Q Consensus 1 m~~------~~f~~dp~LT~~a~~y~n~~~ig~~lfP~v~v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~~~~~~ 74 (309) |+. ...+|..+.+.|-..-++..-|- .+...+++. +.++|...-. .. ...-++.++........+... T Consensus 118 l~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~-~~~~v~~~~--~~~~p~~~~~--~~-~a~~v~E~~~~~~~~~~f~~v 191 (387) T protein:vir:93 118 LPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLR-EKARLTNIK--GLEIPRVSYT--LD-DDDFITDVETAKELKLKGDTV 191 (387) T ss_pred hccCcCCCCceeechhHHHHHHHHHHhhchhh-hheeeeecC--CceEEEEeec--CC-ccccccCccccccccccccee Confidence 221 12344433333332211111111 122333333 2334432110 00 011234444445555666666 Q ss_pred ceeeeccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccCcccceecccccccCCCCCChHH Q lcl|Aclame:pro 75 TGSTEDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDREARTSKLVFSPNSYAAGNKTTLSGADQWSDPTSNPLP 154 (309) Q Consensus 75 ~~~~~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd~~sdPi~ 154 (309) ++..+..+-..+++++-..+ +.+|.+..-.+.+.+.+....+. . ++.++..-+ .....+..+..-.-...+.+. T Consensus 192 ~~~~~k~~~~~~iS~ell~D--s~~~l~~~i~~~la~~~~~~e~~-~--~~~~g~g~g-~p~g~l~~~~~~~v~~~~~~d 265 (387) T protein:vir:93 192 KFTTNKFKVFAAISDTVIHG--SDVDLVNWVENALQSGLAAKERK-D--ALAVSPKSG-LDHMSFYNGSVKEVEGADMYD 265 (387) T ss_pred eeeheeeeeechhhHHHHhh--hHHHHHHHHHHHHHHHHHHHHHH-h--HhhcCCCcc-ccceeeeccccccccccchHH Confidence 66666666666777665443 33455555555666655443221 1 111211100 011111111110112233455 Q ss_pred HHHHHHHHh--CCCCc-EEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceeeccccCCCcc Q lcl|Aclame:pro 155 VITDALDSV--ILRPN-IGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPGQNPN 231 (309) Q Consensus 155 di~~~~~~~--g~~Pn-~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~~~~~g~~~~ 231 (309) +|.+....+ .++.| ..+|++.+|..+++ .++..+.. .+ ...-..+||.| |++-++. T Consensus 266 ~i~~~~~~l~~~~~~~a~~~mn~~t~~~~~~------~~~d~~~~--~~-~~~~~~llG~P-V~~~~~~----------- 324 (387) T protein:vir:93 266 AIINALADLHEDYRDNATIYMRYADYVKIIS------VLSNGTTN--FF-DTPAEKVFGKP-VVFTDAA----------- 324 (387) T ss_pred HHHHHHhccChhhhcCCEEEEechHHHHHHH------HHhcCCCc--cc-ccCCccccccc-eEEecCC----------- Confidence 566655544 23333 57888887755432 22222211 11 01113467876 4432210 Q ss_pred cceecCCcEEEEecCCCCCCcCcceecccccccccccCCccccccccCCceEEEeecccceeeecchhhhhhhccccC Q lcl|Aclame:pro 232 LIRAWGPHASFIYRDRLADTRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTAPDLGFFFENAVAA 309 (309) Q Consensus 232 ~~~v~~~~~~L~~~~~~~~~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~~~~G~l~~~~va~ 309 (309) ..-++|+- +++|.-.. ...+.....-..|...++....+|-.++-+.+-.+++=..|+ T Consensus 325 ~~~~~GDf----------------~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~eA~~~l~~k~~~ 382 (387) T protein:vir:93 325 VKPIVGDF----------------NYFGINYD----GTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENT 382 (387) T ss_pred Cceeeeeh----------------hhhheehh----hheeeecccccCCceeEEEEeeeCceeechhheEEEEeecCC Confidence 01122221 11111111 111111111123444466677788888888888777554444 No 144 >protein:vir:105464 Length: 346 # NCBI annotation: putative phage major capsid protein # Family: family:all:701 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529874;genbank:gi:90592614;genbank:GeneID:3974528 Probab=20.43 E-value=2.8 Score=18.16 Aligned_cols=274 Identities=11% Similarity=0.041 Sum_probs=95.9 Q ss_pred CCCCCCCcchhhHHHHHhhcchhhhhhhhC-Cc------c-c---cccccceeEEechhHhhhchhHhhccccccccccc Q lcl|Aclame:pro 1 MSNAPFPIDPELTAIAIAYRNGRMISDEVL-PR------V-P---VGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEF 69 (309) Q Consensus 1 m~~~~f~~dp~LT~~a~~y~n~~~ig~~lf-P~------v-~---v~~~~~k~~~~~~~~~f~~~~t~~~~~~~~~~ve~ 69 (309) |+-...-. ....+- ..|....|. +. . . .+..+.++|+..-...+..++ |+ ++-..+.+. T Consensus 1 Mainya~~--~~~~Ld-----~~~~~~~lts~~l~~~~~~~~v~~~ggktVkIp~is~tsGl~DY~--R~-~g~~~~g~v 70 (346) T protein:vir:10 1 MTINYAEK--YQAAVQ-----QAFYDGHLYSAELWNSPSNSIIKFDGAKHIKVPRLEITSGRKDRQ--RR-TITTPVANY 70 (346) T ss_pred CcchhHHH--HHHHHH-----HHHHhhhccchhhcccccccceEecCCCEEEEEEeeeeccccccc--cc-CCccccccc Confidence 55322100 000110 112222221 21 1 1 123345556553211233222 11 111112233 Q ss_pred CcCccceee-eccchhhcCCHHHHHHHhhcCCHHHHHHHHHHHHHHHHHH-HHHHHHhhcccccCcccceecccccccCC Q lcl|Aclame:pro 70 SATDETGST-EDHGLDAPVPQADIDNAPTNYNPLGHATEQTTNLILLDRE-ARTSKLVFSPNSYAAGNKTTLSGADQWSD 147 (309) Q Consensus 70 ~~~~~~~~~-~e~~L~~~v~~~~~~~a~~~~d~~~~av~~l~~~i~~~~E-~~~a~~~~~~~~y~~~~~~~lsgt~~Wsd 147 (309) +....++.+ .+++....|+.-+.++.+.......-.-+...+++-=... .+++.++.....-....+.+.+ = T Consensus 71 ~~~~et~tl~qDR~~~F~vD~mDvDETn~~~~~anv~~ef~r~~vvPEiDayrfskLa~~a~~~~~~~~~~~a------~ 144 (346) T protein:vir:10 71 SNDWDSYELKNERYWSTLVDPSDIDETNMVVSLANITKQFNLDSKMPEKDRYMFSHLYSGKEAAHDGGITTNT------L 144 (346) T ss_pred ccceeEEEeeccccceecccccchHHHHHHhHHHHHHHHHHHHhhcchhhHHHHHHHHHhhhhhccccccccc------c Confidence 334344443 3445556666555544432221111111222222211111 2344444332211111111110 0 Q ss_pred CCCChHHHHHHHHHHh---CC--CCcEEEeCHHHHHHHhcCHHHHHHhccCCCcccccCHHHHHHHhCCCeEEeecceee Q lcl|Aclame:pro 148 PTSNPLPVITDALDSV---IL--RPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQELLELDAIYIGEARLN 222 (309) Q Consensus 148 ~~sdPi~di~~~~~~~---g~--~Pn~~v~~~~~~~~l~~~~~i~~~~~~~~~~~~~vt~~~l~~l~gl~~I~v~~a~~~ 222 (309) ...+.+.-|.+.+.++ ++ .+-.|++++.++..|++.+.+.+.+.-. +.+.+ .-.+.++=|++-|.|-..+.. T Consensus 145 T~~ni~~~i~~~~~~lde~~vp~~~rvl~vTp~~~~lLk~s~~f~k~~~v~--~~~~i-~~~V~siDGv~Ii~VPs~r~~ 221 (346) T protein:vir:10 145 DEKNILPAFDNMMLDFDEARIPSTNRILYVTPKTNAILKRAEAMNRALTLK--DPNNI-QRTVYSLDDVTIRVVPSDLMQ 221 (346) T ss_pred CHHHHHHHHHHHHHHHHHccCCCCCeEEEECHHHHHHHhhchhheeccccc--ccccc-ceeeeeecCeEEEEcchhhcc Confidence 1245777788877665 33 3467999999999999999887766532 23334 334566667776666555543 Q ss_pred cccc---CCCcccceecCCcEEEEecCCCCC---CcCcceecccccccccccCCccccccccCCceEEEeecccceeeec Q lcl|Aclame:pro 223 IARP---GQNPNLIRAWGPHASFIYRDRLAD---TRNGTTFGLTAQWGDRVSGSIADPNIGLRGGQRVRVGESVKELVTA 296 (309) Q Consensus 223 ~~~~---g~~~~~~~v~~~~~~L~~~~~~~~---~~~~~t~G~T~~~~~~~~~~~~d~~~g~~g~~~v~v~~~~~~~v~~ 296 (309) +... |..... -+.++=++-+++.+. ....-.. ++++.-+..|.+.+....-+|-.|.- T Consensus 222 t~~~f~~G~~~~t---~ak~INfiiv~~~A~ia~~K~~~~~-------------if~P~~~~~g~~l~~~R~Y~D~fv~~ 285 (346) T protein:vir:10 222 TAYDFSDGSKIID---TAKQIEMFLIYNGVQIAPEKYSFVG-------------FDQPSAATSGNYLYYEQSYDDVLLLN 285 (346) T ss_pred cchhhccCccccC---CccceeEEEECCceeeeeeeeeeeE-------------eeCCCCCcccceeeeeeeeeeeeeec Confidence 2211 110000 011111111111111 1000000 01110011111111111111211111 Q ss_pred chhhhhh---hccccC Q lcl|Aclame:pro 297 PDLGFFF---ENAVAA 309 (309) Q Consensus 297 ~~~G~l~---~~~va~ 309 (309) .-.-.++ ..+.|+ T Consensus 286 nk~~~Iyv~~~~a~~~ 301 (346) T protein:vir:10 286 TKTKGIQFVVSDKPKK 301 (346) T ss_pred cccceEEEeeeccccc Confidence 1111111 111221 Done!