Query lcl|Aclame:protein:vir:98566|NCBI_annot:gp5|genbank:acc:NP_958060;genbank:gi:41057357;genbank:GeneID:2744237 Match_columns 355 No_of_seqs 131 out of 270 Neff 5.1 Searched_HMMs 1612 Date Sat Nov 30 19:47:28 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_37 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_37_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:98566 Length: 355 100.0 2E-195 1E-198 1088.1 29.7 355 1-355 1-355 (355) 2 protein:vir:1829 Length: 355 # 100.0 4E-194 3E-197 1080.8 29.8 355 1-355 1-355 (355) 3 protein:vir:5694 Length: 357 # 100.0 6E-194 4E-197 1080.0 29.4 355 1-355 1-357 (357) 4 protein:vir:6061 Length: 357 # 100.0 8E-194 5E-197 1079.1 29.6 355 1-355 1-357 (357) 5 protein:vir:2016 Length: 357 # 100.0 9E-194 6E-197 1078.9 29.4 355 1-355 1-357 (357) 6 protein:vir:100331 Length: 342 100.0 6E-189 4E-192 1052.6 28.4 340 1-346 1-342 (342) 7 protein:vir:78186 Length: 337 100.0 4E-188 2E-191 1048.2 27.8 337 1-345 1-337 (337) 8 protein:vir:79157 Length: 339 100.0 5E-188 3E-191 1047.6 27.6 338 1-346 1-339 (339) 9 protein:vir:79171 Length: 337 100.0 4E-187 3E-190 1042.3 27.9 337 1-345 1-337 (337) 10 protein:vir:104011 Length: 337 100.0 5E-187 3E-190 1041.8 27.9 337 1-345 1-337 (337) 11 protein:vir:1153 Length: 338 # 100.0 2E-185 1E-188 1033.2 28.3 336 1-344 1-338 (338) 12 protein:vir:78777 Length: 358 100.0 4E-184 2E-187 1026.1 28.6 344 1-355 5-353 (358) 13 protein:vir:98856 Length: 343 100.0 1E-174 7E-178 974.4 27.9 336 1-355 1-341 (343) 14 protein:vir:3746 Length: 336 # 100.0 2E-174 1E-177 973.0 27.4 332 4-348 1-336 (336) 15 protein:vir:3783 Length: 336 # 100.0 2E-174 1E-177 972.8 27.3 332 4-348 1-336 (336) 16 protein:vir:270 Length: 341 # 100.0 2E-171 1E-174 956.3 25.6 333 1-355 5-340 (341) 17 protein:vir:3158 Length: 321 # 100.0 5.2E-80 3.2E-83 455.2 20.9 317 4-354 1-321 (321) 18 protein:vir:99424 Length: 360 100.0 3.2E-47 2E-50 275.5 18.4 331 1-345 1-360 (360) 19 protein:vir:4197 Length: 314 # 100.0 1.1E-42 7E-46 250.6 17.3 305 1-343 1-314 (314) 20 protein:vir:4159 Length: 315 # 100.0 7.1E-37 4.4E-40 218.8 16.0 306 1-340 1-315 (315) 21 protein:vir:4092 Length: 390 # 98.8 6.3E-10 3.9E-13 71.0 14.7 304 1-355 72-379 (390) 22 protein:vir:100247 Length: 425 98.6 4.4E-09 2.7E-12 66.4 14.6 307 1-349 108-425 (425) 23 protein:vir:94771 Length: 298 98.6 9.5E-09 5.9E-12 64.6 15.4 281 16-341 1-298 (298) 24 protein:vir:98339 Length: 415 98.5 2.2E-08 1.4E-11 62.6 16.7 299 1-355 101-411 (415) 25 protein:vir:81100 Length: 415 98.5 2.2E-08 1.4E-11 62.6 16.7 299 1-355 101-411 (415) 26 protein:vir:79987 Length: 415 98.5 2.2E-08 1.4E-11 62.6 16.7 299 1-355 101-411 (415) 27 protein:vir:4600 Length: 415 # 98.5 3.2E-08 2E-11 61.7 17.1 298 1-355 101-411 (415) 28 protein:vir:4700 Length: 415 # 98.5 3.2E-08 2E-11 61.7 17.1 298 1-355 101-411 (415) 29 protein:vir:9410 Length: 415 # 98.5 2.9E-08 1.8E-11 61.9 16.7 302 1-355 101-411 (415) 30 protein:vir:95376 Length: 425 98.5 4.7E-08 2.9E-11 60.8 17.9 302 1-355 111-425 (425) 31 protein:vir:4339 Length: 395 # 98.5 4.1E-08 2.6E-11 61.1 16.0 300 1-348 89-395 (395) 32 protein:vir:4511 Length: 409 # 98.5 5.8E-08 3.6E-11 60.3 16.8 299 1-355 84-409 (409) 33 protein:vir:100135 Length: 418 98.5 9.5E-08 5.9E-11 59.1 17.9 298 1-354 104-418 (418) 34 protein:vir:7771 Length: 330 # 98.4 7E-09 4.4E-12 65.3 11.5 300 1-349 1-330 (330) 35 protein:vir:103955 Length: 324 98.3 1.9E-07 1.2E-10 57.5 16.3 300 1-351 1-324 (324) 36 protein:vir:1638 Length: 298 # 98.3 1.7E-07 1.1E-10 57.7 15.8 280 16-341 1-298 (298) 37 protein:vir:104085 Length: 320 98.2 8.5E-08 5.3E-11 59.3 13.2 297 1-350 1-320 (320) 38 protein:vir:94142 Length: 304 98.2 8.6E-08 5.3E-11 59.3 13.1 282 1-341 1-304 (304) 39 protein:vir:105905 Length: 304 98.2 8.6E-08 5.3E-11 59.3 13.1 282 1-341 1-304 (304) 40 protein:vir:96223 Length: 324 98.2 9E-07 5.6E-10 53.7 18.6 300 1-351 1-324 (324) 41 protein:vir:7409 Length: 408 # 98.2 4.9E-07 3.1E-10 55.2 17.1 295 1-355 82-405 (408) 42 protein:vir:191 Length: 385 # 98.2 2.5E-07 1.5E-10 56.8 15.3 297 1-349 70-385 (385) 43 protein:vir:1886 Length: 385 # 98.2 2.5E-07 1.5E-10 56.8 15.3 297 1-349 70-385 (385) 44 protein:vir:1025 Length: 408 # 98.2 4E-07 2.5E-10 55.7 16.2 296 1-355 89-405 (408) 45 protein:vir:485 Length: 407 # 98.2 5.2E-07 3.2E-10 55.0 16.6 311 1-355 78-407 (407) 46 protein:vir:94673 Length: 419 98.2 4.8E-07 3E-10 55.2 16.1 305 1-350 98-419 (419) 47 protein:vir:3991 Length: 404 # 98.2 8.3E-07 5.2E-10 53.9 17.0 289 1-355 89-403 (404) 48 protein:vir:2504 Length: 305 # 98.2 4.7E-07 2.9E-10 55.3 15.7 287 16-350 1-305 (305) 49 protein:vir:80376 Length: 435 98.2 1.2E-06 7.4E-10 53.0 17.8 302 1-344 88-435 (435) 50 protein:vir:99749 Length: 324 98.1 6.3E-07 3.9E-10 54.6 16.1 300 1-351 1-324 (324) 51 protein:vir:97053 Length: 390 98.1 8E-07 5E-10 54.0 16.2 289 1-343 83-390 (390) 52 protein:vir:97148 Length: 324 98.1 1.7E-06 1.1E-09 52.2 17.5 300 1-351 1-324 (324) 53 protein:vir:41 Length: 299 # N 98.1 3.5E-07 2.2E-10 55.9 13.5 275 20-350 1-299 (299) 54 protein:vir:100172 Length: 394 98.1 2E-06 1.3E-09 51.8 17.6 287 1-355 88-391 (394) 55 protein:vir:96392 Length: 324 98.1 1.7E-06 1E-09 52.3 17.1 300 1-351 1-324 (324) 56 protein:vir:78830 Length: 324 98.1 1.7E-06 1E-09 52.3 17.1 300 1-351 1-324 (324) 57 protein:vir:4456 Length: 401 # 98.1 9E-07 5.6E-10 53.7 15.5 306 1-348 79-401 (401) 58 protein:vir:1328 Length: 392 # 98.1 6.5E-07 4E-10 54.5 14.5 294 1-349 85-392 (392) 59 protein:vir:4953 Length: 397 # 98.1 2.7E-06 1.7E-09 51.1 17.8 289 1-355 86-396 (397) 60 protein:vir:95963 Length: 395 98.0 4E-07 2.5E-10 55.7 13.0 303 1-355 75-388 (395) 61 protein:vir:4997 Length: 397 # 98.0 2.7E-06 1.7E-09 51.1 17.6 286 1-355 86-397 (397) 62 protein:vir:81227 Length: 413 98.0 2.1E-06 1.3E-09 51.7 16.7 298 1-348 102-413 (413) 63 protein:vir:81070 Length: 390 98.0 2.5E-06 1.6E-09 51.3 17.1 295 1-346 80-390 (390) 64 protein:vir:10364 Length: 390 98.0 2.9E-06 1.8E-09 50.9 17.1 292 1-343 83-390 (390) 65 protein:vir:101291 Length: 381 98.0 6.6E-07 4.1E-10 54.5 13.2 308 1-355 65-380 (381) 66 protein:vir:9509 Length: 381 # 98.0 6.6E-07 4.1E-10 54.5 13.2 308 1-355 65-380 (381) 67 protein:vir:6242 Length: 390 # 98.0 1.8E-06 1.1E-09 52.0 15.5 291 1-349 81-390 (390) 68 protein:vir:1433 Length: 435 # 98.0 5E-06 3.1E-09 49.7 17.8 300 1-344 91-435 (435) 69 protein:vir:81160 Length: 371 98.0 2.1E-06 1.3E-09 51.7 15.8 282 1-348 71-371 (371) 70 protein:vir:3845 Length: 395 # 97.9 4.2E-06 2.6E-09 50.0 16.9 287 1-355 86-394 (395) 71 protein:vir:9309 Length: 324 # 97.9 7.8E-06 4.9E-09 48.6 18.3 299 1-351 1-324 (324) 72 protein:vir:8102 Length: 543 # 97.9 3.5E-06 2.1E-09 50.5 16.3 295 1-349 217-543 (543) 73 protein:vir:4830 Length: 397 # 97.9 5.7E-06 3.6E-09 49.3 17.5 287 1-355 86-395 (397) 74 protein:vir:78523 Length: 338 97.9 1.5E-06 9.2E-10 52.5 14.1 300 1-350 1-338 (338) 75 protein:vir:6212 Length: 434 # 97.9 4.2E-06 2.6E-09 50.0 16.6 299 1-355 119-434 (434) 76 protein:vir:80684 Length: 315 97.9 3.1E-06 1.9E-09 50.8 15.5 290 16-355 1-314 (315) 77 protein:vir:4226 Length: 326 # 97.8 1.5E-06 9.5E-10 52.5 13.1 299 1-345 1-326 (326) 78 protein:vir:9574 Length: 300 # 97.8 6.9E-06 4.3E-09 48.9 16.5 283 16-354 1-300 (300) 79 protein:vir:101650 Length: 497 97.8 7.4E-06 4.6E-09 48.7 16.4 327 1-352 130-497 (497) 80 protein:vir:7855 Length: 497 # 97.8 7.4E-06 4.6E-09 48.7 16.4 327 1-352 130-497 (497) 81 protein:vir:95763 Length: 297 97.8 3.7E-06 2.3E-09 50.3 14.0 279 1-349 1-297 (297) 82 protein:vir:80128 Length: 466 97.7 5.2E-06 3.2E-09 49.5 14.6 316 1-355 123-461 (466) 83 protein:vir:1268 Length: 397 # 97.7 1E-05 6.3E-09 47.9 16.1 282 1-348 87-397 (397) 84 protein:vir:9759 Length: 303 # 97.7 5E-06 3.1E-09 49.6 13.5 283 20-342 1-303 (303) 85 protein:vir:78223 Length: 333 97.7 9.3E-06 5.8E-09 48.2 14.9 296 13-348 1-333 (333) 86 protein:vir:5739 Length: 366 # 97.7 1.9E-05 1.2E-08 46.5 16.5 299 1-342 20-366 (366) 87 protein:vir:99920 Length: 311 97.6 1.6E-05 1E-08 46.8 15.4 291 16-347 1-311 (311) 88 protein:vir:100884 Length: 389 97.6 4E-05 2.5E-08 44.7 18.6 283 1-352 83-389 (389) 89 protein:vir:2344 Length: 397 # 97.6 1.7E-05 1.1E-08 46.7 15.0 297 1-355 1-320 (397) 90 protein:vir:1084 Length: 437 # 97.5 2.4E-05 1.5E-08 46.0 15.6 287 1-355 136-436 (437) 91 protein:vir:4856 Length: 293 # 97.5 1.3E-05 8.1E-09 47.3 14.0 273 16-355 1-292 (293) 92 protein:vir:102873 Length: 392 97.5 5.6E-05 3.5E-08 43.9 17.9 285 1-354 89-392 (392) 93 protein:vir:105004 Length: 392 97.5 5.6E-05 3.5E-08 43.9 17.9 285 1-354 89-392 (392) 94 protein:vir:107593 Length: 392 97.5 5.6E-05 3.5E-08 43.9 17.9 285 1-354 89-392 (392) 95 protein:vir:102082 Length: 392 97.5 5.6E-05 3.5E-08 43.9 17.9 285 1-354 89-392 (392) 96 protein:vir:78350 Length: 383 97.5 4.5E-06 2.8E-09 49.9 10.9 295 1-353 72-383 (383) 97 protein:vir:102119 Length: 404 97.5 5.3E-05 3.3E-08 44.0 16.7 302 1-352 80-404 (404) 98 protein:vir:9704 Length: 394 # 97.5 2.2E-05 1.4E-08 46.1 14.4 277 1-349 103-394 (394) 99 protein:vir:2430 Length: 318 # 97.4 2.5E-05 1.5E-08 45.8 14.5 294 1-347 1-318 (318) 100 protein:vir:9643 Length: 377 # 97.4 2.5E-05 1.5E-08 45.9 13.8 297 1-354 67-377 (377) 101 protein:vir:105038 Length: 428 97.3 9E-05 5.6E-08 42.8 16.5 299 1-342 83-428 (428) 102 protein:vir:3870 Length: 400 # 97.3 4.4E-05 2.7E-08 44.5 14.6 277 1-349 101-400 (400) 103 protein:vir:8187 Length: 311 # 97.3 3.5E-05 2.2E-08 45.0 14.0 289 16-349 1-311 (311) 104 protein:vir:100632 Length: 381 97.2 2.4E-05 1.5E-08 45.9 12.5 305 1-355 65-381 (381) 105 protein:vir:8420 Length: 477 # 97.2 0.00013 8.3E-08 41.8 16.9 311 1-354 115-477 (477) 106 protein:vir:104256 Length: 458 97.2 0.00013 8.3E-08 41.8 15.8 303 1-348 126-458 (458) 107 protein:vir:1383 Length: 421 # 97.1 0.00019 1.2E-07 41.0 16.1 285 1-355 92-400 (421) 108 protein:vir:9361 Length: 402 # 97.1 8.5E-05 5.3E-08 42.9 13.8 286 1-354 98-402 (402) 109 protein:vir:962 Length: 397 # 97.0 3.5E-05 2.2E-08 45.0 11.5 277 1-348 112-397 (397) 110 protein:vir:78640 Length: 352 97.0 0.00012 7.6E-08 42.0 14.3 285 1-354 46-352 (352) 111 protein:vir:93881 Length: 387 96.8 0.00031 1.9E-07 39.8 14.9 287 1-354 83-387 (387) 112 protein:vir:101607 Length: 379 96.7 0.00038 2.4E-07 39.3 15.1 282 1-348 77-379 (379) 113 protein:vir:96978 Length: 387 96.7 0.00021 1.3E-07 40.7 13.2 286 1-354 83-387 (387) 114 protein:vir:2685 Length: 387 # 96.7 0.00021 1.3E-07 40.7 13.2 286 1-354 83-387 (387) 115 protein:vir:94424 Length: 387 96.7 0.00021 1.3E-07 40.7 13.2 286 1-354 83-387 (387) 116 protein:vir:96762 Length: 632 96.5 0.00053 3.3E-07 38.6 15.4 292 1-347 304-632 (632) 117 protein:vir:93616 Length: 645 96.5 0.00059 3.7E-07 38.3 16.6 300 1-348 286-645 (645) 118 protein:vir:98635 Length: 377 95.8 0.001 6.3E-07 37.0 12.7 285 1-354 67-377 (377) 119 protein:vir:103285 Length: 296 93.8 0.003 1.9E-06 34.4 10.0 272 20-343 1-296 (296) 120 protein:vir:78935 Length: 335 89.3 0.027 1.7E-05 29.2 12.3 289 1-347 1-335 (335) 121 protein:vir:9820 Length: 272 # 88.8 0.03 1.9E-05 28.9 14.4 260 16-348 1-272 (272) 122 protein:vir:3033 Length: 272 # 88.8 0.03 1.9E-05 28.9 14.4 260 16-348 1-272 (272) 123 protein:vir:107687 Length: 319 82.0 0.08 5E-05 26.6 12.8 297 1-340 1-319 (319) 124 protein:vir:78739 Length: 332 81.7 0.056 3.5E-05 27.4 7.9 284 13-345 1-332 (332) 125 protein:vir:104342 Length: 314 73.2 0.17 0.00011 24.8 12.4 286 1-343 3-314 (314) 126 protein:vir:79642 Length: 329 72.3 0.19 0.00011 24.6 10.1 298 1-343 6-329 (329) 127 protein:vir:8885 Length: 347 # 66.2 0.27 0.00017 23.7 13.3 287 1-346 1-347 (347) 128 protein:vir:3364 Length: 347 # 64.1 0.31 0.00019 23.4 8.3 303 1-347 1-347 (347) 129 protein:vir:94933 Length: 330 60.3 0.28 0.00018 23.6 6.4 290 1-343 5-330 (330) 130 protein:vir:6324 Length: 335 # 53.2 0.53 0.00033 22.1 12.6 294 1-349 1-335 (335) 131 protein:vir:80068 Length: 301 39.5 1 0.00063 20.5 11.4 279 1-340 1-301 (301) 132 protein:vir:100057 Length: 375 36.6 0.83 0.00052 21.0 5.0 304 1-355 1-374 (375) 133 protein:vir:80213 Length: 334 31.2 1.5 0.00094 19.6 12.1 292 1-345 1-334 (334) 134 protein:vir:80180 Length: 381 22.3 2.5 0.0015 18.4 11.3 298 13-355 1-312 (381) 135 protein:vir:105334 Length: 276 21.9 2.5 0.0016 18.4 12.4 267 1-355 1-276 (276) 136 protein:vir:1541 Length: 347 # 21.2 2.6 0.0016 18.3 14.2 297 1-347 1-347 (347) No 1 >protein:vir:98566 Length: 355 # NCBI annotation: gp5 # Family: family:all:201 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958060;genbank:gi:41057357;genbank:GeneID:2744237 Probab=100.00 E-value=1.9e-195 Score=1088.10 Aligned_cols=355 Identities=100% Similarity=1.382 Sum_probs=353.1 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt 80 (355) |+++||++|++|++++|++|||++++++++|+|+|++||+|+++|||||+||++|||++|+|++||+|++|++||||||| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv~g~iagrt 80 (355) T protein:vir:98 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) T ss_pred CChHHHHHHHHHHHHHHHHhCCChhHccceeecCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEeeeccCccccccc Confidence 99999999999999999999999989999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhh Q lcl|Aclame:pro 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) Q Consensus 81 ~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~a 160 (355) ||+++++|+|++++++++++|+|+|||||+||+|++||+|||||||++||++++.+|+|||||||||||+|+|++|||++ T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~Td~~~ 160 (355) T protein:vir:98 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) T ss_pred cCCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeeccCChhh Confidence 99988999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHH Q lcl|Aclame:pro 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKL 240 (355) Q Consensus 161 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dL 240 (355) |||||||||||||++||++|+|||++++..+|++++++|++|+||||+||||||+|++++|||||||++||||||||||| T Consensus 161 nPllqDVNkGWlQ~~Re~ap~~v~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~D~~~~lI~~~~~~d~dLVvivG~dL 240 (355) T protein:vir:98 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKL 240 (355) T ss_pred CcCccccchhHHHHHHhcchhhhhhhhcccCccccccceeeCCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEEccchhhhhhhh Q lcl|Aclame:pro 241 LADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPKKDRVENYE 320 (355) Q Consensus 241 l~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~rve~y~ 320 (355) |++|||||||+.++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||| T Consensus 241 la~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~ 320 (355) T protein:vir:98 241 LADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPKKDRVENYE 320 (355) T ss_pred hHHHhhhHhhccCCcHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhccccccEEEEecceecCccCCCCcCCCC Q lcl|Aclame:pro 321 SMNIDYVVEVYAAGCLLENITLGDFTAPAAPESGA 355 (355) Q Consensus 321 s~Ne~YvVEd~~~~a~ienI~~~~~~~~~~~~~~a 355 (355) |+|||||||||+++|+||||+|+++++|+++++|| T Consensus 321 s~Ne~YvVEd~~~~a~ienI~~~~~~~~~~~~~~a 355 (355) T protein:vir:98 321 SMNIDYVVEVYAAGCLLENITLGDFTAPAAPESGA 355 (355) T ss_pred hhcceeeeeccccEEEeeceeeeCCCCCcccccCC Confidence 99999999999999999999999999999999999 No 2 >protein:vir:1829 Length: 355 # NCBI annotation: major capsid protein # Family: family:all:201 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052253;genbank:gi:9634060;genbank:GeneID:1262428 Probab=100.00 E-value=4.1e-194 Score=1080.82 Aligned_cols=355 Identities=93% Similarity=1.323 Sum_probs=352.4 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt 80 (355) |+++||++|++|++++|++|||+.++++++|+|+|++||+|+++|||||+||++|||++|+|++||+|++|++||||||| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv~g~iagrt 80 (355) T protein:vir:18 1 MRQETRFKFNAYLTQLAKLNGISVDDVSKKFTVEPSVTQTLMNTVQASSAFLQMINILPVAEMKGEKIGVGVTGTIASTT 80 (355) T ss_pred CChHHHHHHHHHHHHHHHHhCCChhHccceeccCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEEeeccCcceeecc Confidence 99999999999999999999999889999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhh Q lcl|Aclame:pro 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) Q Consensus 81 ~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~a 160 (355) ||+++++|+|++++++++++|+|+|||||+||+|++||+|||||||++||++++.+|+|||||||||||+|+|++|||++ T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~Td~~~ 160 (355) T protein:vir:18 81 DTSGDKERQTADFTALESNKYECNQINFDFHLTYKRLDLWARFQDFQRRIRDAIVQRQALDFIMAGFNGTTRADTSDRVK 160 (355) T ss_pred ccCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeeccCChhh Confidence 99989999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHH Q lcl|Aclame:pro 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKL 240 (355) Q Consensus 161 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dL 240 (355) |||||||||||||++||++|+|||++++..+|++++++|++|+||||+||||||+|++++|||||||++||||||||||| T Consensus 161 nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~d~~~~lI~~~~~~d~dLVvivG~dL 240 (355) T protein:vir:18 161 NPMLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENLDALVMDGTNTLIDEIYQDDPKLVAIVGRKL 240 (355) T ss_pred CcCccccchhHHHHHHhcchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEEccchhhhhhhh Q lcl|Aclame:pro 241 LADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPKKDRVENYE 320 (355) Q Consensus 241 l~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~rve~y~ 320 (355) |++|||||||+.++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||| T Consensus 241 la~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~ 320 (355) T protein:vir:18 241 LADKYFPLVNKQQENTESLAADIIISQKRIGNLPAVRVPYFPANAVFVTTLENLSIYFMDESHRRSIDENPKKDRVENYE 320 (355) T ss_pred hHHHHhHHhhccCChHHHHHHHHHHHHHhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhccccccEEEEecceecCccCCCCcCCCC Q lcl|Aclame:pro 321 SMNIDYVVEVYAAGCLLENITLGDFTAPAAPESGA 355 (355) Q Consensus 321 s~Ne~YvVEd~~~~a~ienI~~~~~~~~~~~~~~a 355 (355) |+|||||||||+++|+||||+|+++++|+++++|- T Consensus 321 s~Ne~YvVEd~~~~a~ieni~~~~~~~~~~~~~g~ 355 (355) T protein:vir:18 321 SMNIDYVVEAYAAGCLLENITLGDFTAPAAPEGGE 355 (355) T ss_pred hhcceeeeeccccEEEEeeeeecCCCCcccccCCC Confidence 99999999999999999999999999999999888 No 3 >protein:vir:5694 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839853;genbank:gi:30065708;genbank:GeneID:1260602 Probab=100.00 E-value=5.8e-194 Score=1079.99 Aligned_cols=355 Identities=76% Similarity=1.184 Sum_probs=348.0 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt 80 (355) |+++||++|++|++++|++|||++++++++|+|+|++||+|+++|||||+||++|||++|+|++||+|++|++||||||| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrt 80 (357) T protein:vir:56 1 MRQETRFKFNAYLSRVAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTT 80 (357) T ss_pred CChHHHHHHHHHHHHHHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhh Q lcl|Aclame:pro 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) Q Consensus 81 ~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~a 160 (355) ||+++++|+|++++++++++|+|+|||||+||+|++||+|||||||++||++++.+|+|||||||||||+|+|++|||++ T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~ 160 (357) T protein:vir:56 81 DTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDFIMAGFNGVKRAETSDRSS 160 (357) T ss_pred cCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChhh Confidence 99988899999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHH Q lcl|Aclame:pro 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKL 240 (355) Q Consensus 161 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dL 240 (355) |||||||||||||++||++|+|||++++..+|++++++|++|+||||+||||||+|++++|||||||++||||||||||| T Consensus 161 nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dL 240 (357) T protein:vir:56 161 NPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQL 240 (357) T ss_pred CcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEEccchhhhhhhh Q lcl|Aclame:pro 241 LADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPKKDRVENYE 320 (355) Q Consensus 241 l~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~rve~y~ 320 (355) |++|||||+|+.++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+||+||+++|||+|||||||| T Consensus 241 la~k~~~l~n~~~~pTE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~riE~y~ 320 (357) T protein:vir:56 241 LADKYFPIVNKEQDNSEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMDDSHRRVIEENPKLDRVENYE 320 (357) T ss_pred hhhhhhhHhhccCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhccccccEEEEecceecCccCC--CCcCCCC Q lcl|Aclame:pro 321 SMNIDYVVEVYAAGCLLENITLGDFTAP--AAPESGA 355 (355) Q Consensus 321 s~Ne~YvVEd~~~~a~ienI~~~~~~~~--~~~~~~a 355 (355) |+|||||||||+++|+||||+++++++| .+++++| T Consensus 321 s~Ne~YvVEd~~~~a~iE~i~i~~~~~~~~~~~~~~a 357 (357) T protein:vir:56 321 SMNIDYVVEDYAAGCLVEKIKVGDFSTPAKATEEPGA 357 (357) T ss_pred hhcceeeeeccccEEEeeeeeeccCCCCcccCCCCCC Confidence 9999999999999999999999876555 4555566 No 4 >protein:vir:6061 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878202;genbank:gi:33438901;genbank:GeneID:1457736 Probab=100.00 E-value=8.5e-194 Score=1079.06 Aligned_cols=355 Identities=76% Similarity=1.186 Sum_probs=347.6 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt 80 (355) |+++||++|++|++++|++|||++++++++|+|+|++||+|+++|||||+||++|||++|+|++||+|++|++||||||| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrt 80 (357) T protein:vir:60 1 MRQETRFKFNAYLSRVAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTT 80 (357) T ss_pred CChHHHHHHHHHHHHHHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhh Q lcl|Aclame:pro 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) Q Consensus 81 ~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~a 160 (355) ||+++++|+|++++++++++|+|+|||||+||+|++||+|||||||++||++++.+|+|||||||||||+|+|++|||++ T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~ 160 (357) T protein:vir:60 81 DTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDLIMAGFNGVRRAETSDRSS 160 (357) T ss_pred ccCCCCCcccccccccCCCccEEEEeeeeccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChhh Confidence 99988899999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHH Q lcl|Aclame:pro 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKL 240 (355) Q Consensus 161 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dL 240 (355) |||||||||||||++||++|+|||++++..+|++++++|++|+||||+||||||+|++++|||||||++||||||||||| T Consensus 161 nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dL 240 (357) T protein:vir:60 161 NQMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQL 240 (357) T ss_pred CcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEEccchhhhhhhh Q lcl|Aclame:pro 241 LADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPKKDRVENYE 320 (355) Q Consensus 241 l~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~rve~y~ 320 (355) |++|||||+|+.++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+||+||+++|||+|||||||| T Consensus 241 la~k~~~l~n~~~~pTE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~riE~y~ 320 (357) T protein:vir:60 241 LADKYFPIVNREQDNSEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMDDSHRRVIEENPKLDRVENYE 320 (357) T ss_pred hhHHhhhHhhcCCChHHHHHHHHHHHhhhhcCcceEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhccccccEEEEecceecCccCC--CCcCCCC Q lcl|Aclame:pro 321 SMNIDYVVEVYAAGCLLENITLGDFTAP--AAPESGA 355 (355) Q Consensus 321 s~Ne~YvVEd~~~~a~ienI~~~~~~~~--~~~~~~a 355 (355) |+|||||||||+++|+||||+++++++| .+++++| T Consensus 321 s~Ne~YvVEd~~~~a~iE~i~~~~~~~pa~~~~~~~a 357 (357) T protein:vir:60 321 SMNIDYVVEDYAAGCLVEKIKVGDFSTPAKATAEPGA 357 (357) T ss_pred hhcceeeeeccccEEEeeeeeeccCcccccCCCCCCC Confidence 9999999999999999999999876544 4555555 No 5 >protein:vir:2016 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046760;genbank:gi:9630331;genbank:GeneID:1261541 Probab=100.00 E-value=9.3e-194 Score=1078.85 Aligned_cols=355 Identities=76% Similarity=1.182 Sum_probs=348.5 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt 80 (355) |+++||++|++|++++|++|||++++++++|+|+|++||+|+++|||||+||++|||++|+|++||+|++|++||||||| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrt 80 (357) T protein:vir:20 1 MRQETRFKFNAYLSRVAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTT 80 (357) T ss_pred CChHHHHHHHHHHHHHHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhh Q lcl|Aclame:pro 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) Q Consensus 81 ~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~a 160 (355) ||+++++|+|++++++++++|+|+|||||+||+|++||+|||||||++||++++.+|+|||||||||||+|+|++|||++ T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~ 160 (357) T protein:vir:20 81 DTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRIRNAIIKRQSLDFIMAGFNGVKRAETSDRSS 160 (357) T ss_pred cCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChhh Confidence 99988899999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHH Q lcl|Aclame:pro 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKL 240 (355) Q Consensus 161 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dL 240 (355) |||||||||||||++||++|+|||++++..+|++.+++|++|+||||+||||||+|++++|||||||++||||||||||| T Consensus 161 nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dL 240 (357) T protein:vir:20 161 NPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGRTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQL 240 (357) T ss_pred CcCccccchhHHHHHHhhchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEEccchhhhhhhh Q lcl|Aclame:pro 241 LADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPKKDRVENYE 320 (355) Q Consensus 241 l~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~rve~y~ 320 (355) |++|||||+|+.++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+||+||+++|||+|||||||| T Consensus 241 la~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~riE~y~ 320 (357) T protein:vir:20 241 LADKYFPIVNKEQDNSEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMDDSHRRVIEENPKLDRVENYE 320 (357) T ss_pred hhhhhhhHhhccCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhccccccEEEEecceecCcc--CCCCcCCCC Q lcl|Aclame:pro 321 SMNIDYVVEVYAAGCLLENITLGDFT--APAAPESGA 355 (355) Q Consensus 321 s~Ne~YvVEd~~~~a~ienI~~~~~~--~~~~~~~~a 355 (355) |+|||||||||+++|+||||++++++ ++.+++++| T Consensus 321 s~Ne~YvVEd~~~~a~iE~i~~~~~~~p~~~~~~~~a 357 (357) T protein:vir:20 321 SMNIDYVVEDYAAGCLVEKIKVGDFSTPAKATAEPGA 357 (357) T ss_pred hhcceeeeeccccEEEeeeeeeccccCCccCCCCCCC Confidence 99999999999999999999998754 446667777 No 6 >protein:vir:100331 Length: 342 # NCBI annotation: major capsid protein N # Family: family:all:201 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655472;genbank:gi:109289940;genbank:GeneID:4157374 Probab=100.00 E-value=5.8e-189 Score=1052.55 Aligned_cols=340 Identities=57% Similarity=0.875 Sum_probs=332.1 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHH--cceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDD--VSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIAS 78 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~--v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~ 78 (355) |+++||++|++|++++|++|||+.+. ++++|+|+|++||+|+++|||||+||++|||++|+|++||+|++|++||||| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iag 80 (342) T protein:vir:10 1 MKDLTLEKYNAYLARQAELNNLPFNALATGIKFTVQPSVQQKLYEKVRESSDFLKSISFVFVDEQTGETLGLDSAHTVAS 80 (342) T ss_pred CChHHHHHHHHHHHHHHHHhCCChhHccccceeecChHHHHHHHHHHHHHHHHhccCcccccccceeeEEecccCccccc Confidence 99999999999999999999998654 4689999999999999999999999999999999999999999999999999 Q ss_pred cccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCCh Q lcl|Aclame:pro 79 TTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDR 158 (355) Q Consensus 79 Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~ 158 (355) ||||+++++|+|++++++++++|+|+|||||+||+|++||+|||||||++||++++.+|+|||||||||||+|+|++||| T Consensus 81 rtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~ 160 (342) T protein:vir:10 81 TTDTSGDGERKTTSIAKLVKQTYHCQQINFDTHINYKQLDMWAKFPDFQQKVANVAAKQRKRDLIMIGFNGTSRAATSDR 160 (342) T ss_pred ccccCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeccCCCh Confidence 99999888999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcH Q lcl|Aclame:pro 159 TKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGR 238 (355) Q Consensus 159 ~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~ 238 (355) ++|||||||||||||++||++|+|||++++ +.++|++|+||||+||||||+|++++|||||||++||||||||| T Consensus 161 ~~nPllqDVN~GWlQ~~Re~ap~rv~~~~~------~~~~i~iG~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~ 234 (342) T protein:vir:10 161 NSNPLLQDVAKGWLQKMREDAKERVMNGES------TDNQVLVGKGQEYANLDALVMDATEELIDEWHRDDTDLVVITGR 234 (342) T ss_pred hhCcCccccchHHHHHHHhhhhhhhcccce------eccceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEch Confidence 999999999999999999999999998753 45679999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEEccchhhhhh Q lcl|Aclame:pro 239 KLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPKKDRVEN 318 (355) Q Consensus 239 dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~rve~ 318 (355) |||++|||||+|+.++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+||+||+++|||+|||||| T Consensus 235 dLladk~~~l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~ 314 (342) T protein:vir:10 235 KLLADKYFPIVNQQNAPTEELAADIVISQKRIGGLKAVRVPFFPANAILITKLENLAIYVQEGTTRKHIENVPKKDRIET 314 (342) T ss_pred hhhHHHHHHHHhcCCChHHHHHHHHHHhhhhhcCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhhhccccccEEEEecceecCcc Q lcl|Aclame:pro 319 YESMNIDYVVEVYAAGCLLENITLGDFT 346 (355) Q Consensus 319 y~s~Ne~YvVEd~~~~a~ienI~~~~~~ 346 (355) |||+||||||||||++|+||||+|+++. T Consensus 315 y~s~Ne~YvVEd~~~~a~iE~i~i~~~~ 342 (342) T protein:vir:10 315 YESENIDYVVEDYGCAALIENITLKDKE 342 (342) T ss_pred hhhhccceeeeccccEEEeecceecCCC Confidence 9999999999999999999999998866 No 7 >protein:vir:78186 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111152;genbank:gi:134288735;genbank:GeneID:4960646 Probab=100.00 E-value=3.6e-188 Score=1048.20 Aligned_cols=337 Identities=54% Similarity=0.910 Sum_probs=330.5 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt 80 (355) |+++||++|++|++++|++|||+ +++++|+|+|++||+|+++|||||+||++|||++|+|++||+|++|++||||||| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~--~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrt 78 (337) T protein:vir:78 1 MRKETRQAYEKYAAQIAKLNDTG--DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRT 78 (337) T ss_pred CChHHHHHHHHHHHHHHHhcChh--hhcceeecChHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCcceeeee Confidence 99999999999999999999995 5899999999999999999999999999999999999999999999999999999 Q ss_pred cCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhh Q lcl|Aclame:pro 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) Q Consensus 81 ~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~a 160 (355) +|++ .+|+|++++++++++|+|+|||||+||+|++||+|||||||++||++++.+|+|||||||||||+|+|++|||++ T Consensus 79 dt~~-~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~ 157 (337) T protein:vir:78 79 DTTK-AARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQA 157 (337) T ss_pred cCCC-cccccccccccCCCccEEEEeceecccCHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeccCCChhh Confidence 9985 589999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHH Q lcl|Aclame:pro 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKL 240 (355) Q Consensus 161 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dL 240 (355) |||||||||||||++||++|+|||++++..+| +|++|+||||+||||||+|++++|||||||++||||||||||| T Consensus 158 nPllqDVN~GWlQ~~Re~ap~rVl~~~~~~~~-----~i~iG~~gdy~NLDalV~d~~~~lI~~~~~~d~dLVvivG~dL 232 (337) T protein:vir:78 158 NPLLQDVNIGWLQQYRERAAQRVLHEGAKQAG-----KVLIGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) T ss_pred CcCccccchHHHHHHHhcchhhhhccccccCC-----ceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 99999999999999999999999998876655 4889999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEEccchhhhhhhh Q lcl|Aclame:pro 241 LADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPKKDRVENYE 320 (355) Q Consensus 241 l~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~rve~y~ 320 (355) |++|||||+|+.++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+||+||+++|||+|||||||| T Consensus 233 ladk~~~l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~ 312 (337) T protein:vir:78 233 LHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYE 312 (337) T ss_pred hHHHHHHHHhcCCCcHHHHHHHHHHHhhhhcCcceEEccccCCCceEEeechhcEEEEecCcEEEEEEeccccccccchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhccccccEEEEecceecCc Q lcl|Aclame:pro 321 SMNIDYVVEVYAAGCLLENITLGDF 345 (355) Q Consensus 321 s~Ne~YvVEd~~~~a~ienI~~~~~ 345 (355) |+|||||||||+++|+||||+|+++ T Consensus 313 s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) T protein:vir:78 313 SSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred hccceeeeeccccEEEEeceeecCC Confidence 9999999999999999999999997 No 8 >protein:vir:79157 Length: 339 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165257;genbank:gi:145708082;genbank:GeneID:5247168 Probab=100.00 E-value=4.7e-188 Score=1047.57 Aligned_cols=338 Identities=56% Similarity=0.905 Sum_probs=329.4 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt 80 (355) |+++||++|++|++++|++|||+ +++++|+|+|++||+|+++|||||+||++|||++|+|++||+|++|++||||||| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~--~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrt 78 (339) T protein:vir:79 1 MRNDTRRLFAAYKAAIAKLNGVE--RVDEKFSVAPSVQQKLETKVQESSDFLKSINFYGVPEQEGEKIGLGVSGPVASTT 78 (339) T ss_pred CChHHHHHHHHHHHHHHHHhCcc--cccceeeecHHHHHHHHHHHHHHHHHhccCcccccccceeeEEeeccCcceeecc Confidence 99999999999999999999996 5899999999999999999999999999999999999999999999999999999 Q ss_pred cCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhh Q lcl|Aclame:pro 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) Q Consensus 81 ~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~a 160 (355) ||++ ++|+|++++++++++|+|+|||||+||+|++||+|||||||++||++++.+|+|||||||||||+|+|++|||++ T Consensus 79 dt~~-~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~ 157 (339) T protein:vir:79 79 DTTQ-QDRETSDISTMDGRRYRCEQTNSDTHITYQKLDAWAKFADFQTRIRDAIIKRQALDRIMIGFNGVSRAATSDRVA 157 (339) T ss_pred cCCC-CCcccccccccCCCccEEEEeeeeceecHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeecCCChhh Confidence 9985 699999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhccchhHHHHHHhhccccccccccccCCccccceeee-CCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHH Q lcl|Aclame:pro 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRV-GKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRK 239 (355) Q Consensus 161 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~-G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~d 239 (355) |||||||||||||++||++|+|||++++..+++ |.+ |+||||+||||||+|++++|||||||++|||||||||| T Consensus 158 nPllqDVN~GWlQ~~Re~ap~rV~~~g~~~s~~-----i~~~G~ggdy~NLDalV~d~~~~lId~~~~~d~dLVvivG~d 232 (339) T protein:vir:79 158 NPMLQDVNKGWLQNLREQAPQRVMKEGKAAAGK-----ITVGGAGADYGNLDALVYDITNHLVEPWYAEDPDLVVVCGRN 232 (339) T ss_pred CcCccccchhHHHHHHhhhhhhhhccceeccce-----eEeccCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchh Confidence 999999999999999999999999987665554 444 99999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEEccchhhhhhh Q lcl|Aclame:pro 240 LLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPKKDRVENY 319 (355) Q Consensus 240 Ll~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~rve~y 319 (355) ||++|||||+|+.++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+||+||+++|||+||||||| T Consensus 233 Lla~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y 312 (339) T protein:vir:79 233 LLSDKYFPLVNRDRDPVQQIAADLIISQKRIGNLPAIRVPYFPANGLLVTRLDNLSIYYQEGGRRRTILDNAKRDRIENY 312 (339) T ss_pred hhhhHhhhHhhcCCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEeccccccccch Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhhccccccEEEEecceecCcc Q lcl|Aclame:pro 320 ESMNIDYVVEVYAAGCLLENITLGDFT 346 (355) Q Consensus 320 ~s~Ne~YvVEd~~~~a~ienI~~~~~~ 346 (355) ||+|||||||||+++|+||||+|++++ T Consensus 313 ~s~Ne~YvVEd~~~~a~iEni~~~~aa 339 (339) T protein:vir:79 313 ESSNDAYVIEDLACAAMAENIALAAAA 339 (339) T ss_pred hhccceeeeeccccEEEeeeeecccCC Confidence 999999999999999999999998866 No 9 >protein:vir:79171 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111033;genbank:gi:134288740;genbank:GeneID:4960690 Probab=100.00 E-value=4.3e-187 Score=1042.29 Aligned_cols=337 Identities=55% Similarity=0.909 Sum_probs=330.4 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt 80 (355) |+++||++|++|++++|++|||+ +++++|+|+|++||+|+++|||||+||++||+++|+|++||+|++|++||||||| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~--~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrt 78 (337) T protein:vir:79 1 MRKETRQAYEKYAAQIAKLNDTG--DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRT 78 (337) T ss_pred CChHHHHHHHHHHHHHHHhcChh--hhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccCcceeeee Confidence 99999999999999999999995 5899999999999999999999999999999999999999999999999999999 Q ss_pred cCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhh Q lcl|Aclame:pro 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) Q Consensus 81 ~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~a 160 (355) +|++ .+|+|++++++++++|+|+|||||+||+|++||+|||||||++|+++++.+|+|||||||||||+|+|++|||++ T Consensus 79 ~t~~-~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~ 157 (337) T protein:vir:79 79 DTTK-AARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDAWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQA 157 (337) T ss_pred cCCC-CccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhh Confidence 9985 589999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHH Q lcl|Aclame:pro 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKL 240 (355) Q Consensus 161 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dL 240 (355) |||||||||||||++||++|+|||++++..+| +|++|+||||+||||||+|++++|||||||++||||||||||| T Consensus 158 nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~-----~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dL 232 (337) T protein:vir:79 158 NPLLQDVNIGWLQQYRERAAQRVLHEGAKQAG-----KVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVAICGREL 232 (337) T ss_pred CcCccccchhHHHHHHhcchhhhhccccccCc-----ceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 99999999999999999999999998876655 4889999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEEccchhhhhhhh Q lcl|Aclame:pro 241 LADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPKKDRVENYE 320 (355) Q Consensus 241 l~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~rve~y~ 320 (355) |++|||||+|+.++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||| T Consensus 233 ladk~~~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~ 312 (337) T protein:vir:79 233 LHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYE 312 (337) T ss_pred hhHHhhHHhccCCCcHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEEccccccccchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhccccccEEEEecceecCc Q lcl|Aclame:pro 321 SMNIDYVVEVYAAGCLLENITLGDF 345 (355) Q Consensus 321 s~Ne~YvVEd~~~~a~ienI~~~~~ 345 (355) |+|||||||||+++|+||||+|+++ T Consensus 313 s~Ne~YvVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:79 313 SSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred hccceeeeeccccEEEEeceeecCC Confidence 9999999999999999999999997 No 10 >protein:vir:104011 Length: 337 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293748;genbank:gi:72537718;genbank:GeneID:3608142 Probab=100.00 E-value=5.4e-187 Score=1041.77 Aligned_cols=337 Identities=54% Similarity=0.911 Sum_probs=330.4 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt 80 (355) |+++||++|++|++++|++|||+ +++++|+|+|++||+|+++|||||+||++|||++|+|++||+|++|++||||||| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~--~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrt 78 (337) T protein:vir:10 1 MRKETRQAYEKYAAQIAKLNDTG--DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRT 78 (337) T ss_pred CChHHHHHHHHHHHHHHHhcChh--hhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccCcceeeee Confidence 99999999999999999999995 6899999999999999999999999999999999999999999999999999999 Q ss_pred cCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhh Q lcl|Aclame:pro 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) Q Consensus 81 ~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~a 160 (355) +|++ .+|+|++++++++++|+|+|||||+||+|++||+|||||||++|+++++.+|+|||||||||||+|+|++|||++ T Consensus 79 ~t~~-~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~ 157 (337) T protein:vir:10 79 DTTK-AARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQA 157 (337) T ss_pred cCCC-CccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhh Confidence 9985 589999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHH Q lcl|Aclame:pro 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKL 240 (355) Q Consensus 161 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dL 240 (355) |||||||||||||++||++|+|||++++..+| +|++|+||||+||||||+|++++|||||||++||||||||||| T Consensus 158 nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~-----~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dL 232 (337) T protein:vir:10 158 NPLLQDVNIGWLQQYRERAAQRVLHEGAKQAG-----KVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) T ss_pred CcCccccchhHHHHHHhcchhhhhccccccCc-----ceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 99999999999999999999999998876655 4889999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEEccchhhhhhhh Q lcl|Aclame:pro 241 LADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPKKDRVENYE 320 (355) Q Consensus 241 l~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~rve~y~ 320 (355) |++|||||+|+.++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||| T Consensus 233 ladk~~~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~ 312 (337) T protein:vir:10 233 LHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYE 312 (337) T ss_pred hhHHhhHHhccCCCcHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEEccccccccchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhccccccEEEEecceecCc Q lcl|Aclame:pro 321 SMNIDYVVEVYAAGCLLENITLGDF 345 (355) Q Consensus 321 s~Ne~YvVEd~~~~a~ienI~~~~~ 345 (355) |+|||||||||+++|+||||+|+++ T Consensus 313 s~Ne~YvVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:10 313 SSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred hccceeeeeccccEEEEeceeecCC Confidence 9999999999999999999999997 No 11 >protein:vir:1153 Length: 338 # NCBI annotation: predicted major capsid protein # Family: family:all:201 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490602;genbank:gi:17313222;genbank:GeneID:927319 Probab=100.00 E-value=1.9e-185 Score=1033.24 Aligned_cols=336 Identities=53% Similarity=0.901 Sum_probs=326.8 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt 80 (355) |+++||++|++|++++|++|||+ +++++|+|+|++||+|+++|||||+||++||+++|+|++||+|++|++||||||| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~--~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrt 78 (338) T protein:vir:11 1 MRNETRKQFDAYLAQLAKLNGVN--SAVQTFAVEPSVQQKLEQRIQESSEFLKQINVYGVDELQGEKIGIGVSGTIASRT 78 (338) T ss_pred CCHHHHHHHHHHHHHHHHHhCCC--cccceeeeCHHHHHHHHHHHHHHHHhhccCceecccceeeeEeeeccCccccccc Confidence 99999999999999999999996 4899999999999999999999999999999999999999999999999999999 Q ss_pred cCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhh Q lcl|Aclame:pro 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) Q Consensus 81 ~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~a 160 (355) +|+.+.+|.|++++++++++|+|+|||||+||+|++||+|||||||++|+++++.+|+|||||||||||+|+|++|||++ T Consensus 79 dT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfnG~s~A~~Td~~~ 158 (338) T protein:vir:11 79 DTTGDGVRKPRDVSALDNQRYECKHTDFDTAITYAMLDAWAKFPEFQALLRDAILKRQALDRLMIGFNGTSAAATTNRAA 158 (338) T ss_pred cCCCCCccccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhh Confidence 99987789999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhccchhHHHHHHhhccccccccccccCCccccceeee--CCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcH Q lcl|Aclame:pro 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRV--GKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGR 238 (355) Q Consensus 161 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~--G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~ 238 (355) |||||||||||||++||++|+|||++++. ++ +|.+ |++|||+||||||+|++++|||||||++||||||||| T Consensus 159 nPllqDVNkGWlQ~~Re~ap~rv~~~~~~-~~-----~i~i~~g~~gdy~nLDalV~d~~~~lI~~~~~~d~dLVvivG~ 232 (338) T protein:vir:11 159 NPLLQDVNIGWFQQYRNNAPARVLKEGKT-TG-----KVVVGNGADADYKNLDALVFDVVSSLIDPWHRRDPGLVVILGR 232 (338) T ss_pred CcCccccchhHHHHHHhhhhhhhhhcccc-cc-----eeeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEEch Confidence 99999999999999999999999998743 33 3445 5669999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEEccchhhhhh Q lcl|Aclame:pro 239 KLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPKKDRVEN 318 (355) Q Consensus 239 dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~rve~ 318 (355) |||++|||||+|+.++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||| T Consensus 233 dLladk~~~l~n~~~~ptE~~Aa~~~~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~ 312 (338) T protein:vir:11 233 ELVHDKYFPMVNKDQPATEKIATDLILSQKRMGGLPPVEVPYVPEKGLMVTTLKNLSLYWQIGGRRRYLKEVPEKNRIEN 312 (338) T ss_pred hhhHHHHhHHHhcCCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhhhccccccEEEEecceecC Q lcl|Aclame:pro 319 YESMNIDYVVEVYAAGCLLENITLGD 344 (355) Q Consensus 319 y~s~Ne~YvVEd~~~~a~ienI~~~~ 344 (355) |||+|||||||||+++|+||||+|++ T Consensus 313 y~s~Ne~YvVEd~~~~a~ieni~~~~ 338 (338) T protein:vir:11 313 YESSNDAYVVEDYGLGCLVENIEVAE 338 (338) T ss_pred hhhhccceeeeccccEEEeecceecC Confidence 99999999999999999999999999 No 12 >protein:vir:78777 Length: 358 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285647;genbank:gi:148727153;genbank:GeneID:5220125 Probab=100.00 E-value=3.9e-184 Score=1026.09 Aligned_cols=344 Identities=27% Similarity=0.391 Sum_probs=325.1 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt 80 (355) |+++||++|++|++++|++|||+.++++++|+|+|++||+|+++|||||+||++|||++|+|++||+|++|++|+||||| T Consensus 5 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~Fsv~p~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrt 84 (358) T protein:vir:78 5 LTVQAEQRLNKYCDALAKAYGIDISKLDKQFSVTGPVETTLRSALLASVEFLGLITCLDVDQIKGQVVQVGVGQLYTGRK 84 (358) T ss_pred ccHHHHHHHHHHHHHHHHHhCCChhHccceeeeChHHHHHHHHHHHHHHHHhhcCcccccccceeeEEeecCCcccceec Confidence 99999999999999999999999989999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccc---hHHHHHHHHHHHHhhhhHHHHhhcccccccCCC Q lcl|Aclame:pro 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQ---DFQRRIRDAIVKRQALDLIMAGFNGTTRADTSD 157 (355) Q Consensus 81 ~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~---dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td 157 (355) +| |+|++++++++++|+|+|||||+||+|++||+||||| ||++||++++.+|+|||||||||||+|+|++|| T Consensus 85 ~t-----r~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~f~~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td 159 (358) T protein:vir:78 85 KG-----GRFKGKVGVDGNTYELTETDSCASLDWATLCTWANAGSEGEFIKLVGEFVNKAFALDMLRVGWNGVSAADDTD 159 (358) T ss_pred CC-----CccccccccCCCccEEEEeceeeeccHHHHHHHHhCCChhHHHHHHHHHHHHHHhhccceecccceeeccCCC Confidence 87 6899999999999999999999999999999999998 899999999999999999999999999999999 Q ss_pred hhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEc Q lcl|Aclame:pro 158 RTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVG 237 (355) Q Consensus 158 ~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG 237 (355) |++|||||||||||||++||++|+|||++++.+++.. +..|++|||+||||||+|++++|||||||++|||||||| T Consensus 160 ~~~nPllqDVN~GWlQ~~Re~a~~~v~~~~~~~~~i~----ig~g~~Gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG 235 (358) T protein:vir:78 160 PTANPLGQDVNKGWHQLAREWKGGSQIIKAAAGEKIY----FDPDGKGEYKTLDEMASDLINTTIDPLFQQDPRLVVLVG 235 (358) T ss_pred hhhCcCccccchHHHHHHHhhchhhhhccccccCcee----ecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEEc Confidence 9999999999999999999999999999887554321 223456999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEEccchhhhh Q lcl|Aclame:pro 238 RKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPKKDRVE 317 (355) Q Consensus 238 ~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~rve 317 (355) ||||++|||||||+.++|||++|+|+++ |+||||||++|||||+++||||+|||||||||+||+||+++|||+||||| T Consensus 236 ~dLla~k~~~l~n~~~~pTE~~Aa~~i~--k~iGGlpa~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~riE 313 (358) T protein:vir:78 236 TDLVAAAQAKLYSEATKPSEQIAAQQLA--KSIAGRKAYIPPFFPGKRMVVTTLDNLHCYTQRGTRKRKADDNQDSKSFD 313 (358) T ss_pred hhhhhHHhhhHhhcCCCcHHHHHHHHHH--HHhCCCeEEEccccCCCceEEeeccccEEEEecCcEEEEEEecccccccc Confidence 9999999999999999999999999986 89999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhhhhccccccEEEEecceecCccCCCCcCC--CC Q lcl|Aclame:pro 318 NYESMNIDYVVEVYAAGCLLENITLGDFTAPAAPES--GA 355 (355) Q Consensus 318 ~y~s~Ne~YvVEd~~~~a~ienI~~~~~~~~~~~~~--~a 355 (355) ||||||||||||||+++|+||||++...+.|+++++ ++ T Consensus 314 ~y~s~Ne~YvVEd~~~~a~iE~i~v~~~~~pa~~~~~~~~ 353 (358) T protein:vir:78 314 NQYWRMEGYALGEHKAYGGFEEADIEIGADPAVLAVEAAA 353 (358) T ss_pred chhhhcceeeeeccccEEEEeeeeeeeCCCCCccccCCcc Confidence 999999999999999999999998865444444433 22 No 13 >protein:vir:98856 Length: 343 # NCBI annotation: hypothetical protein # Family: family:all:201 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654732;genbank:gi:109302917;genbank:GeneID:4156061 Probab=100.00 E-value=1.1e-174 Score=974.37 Aligned_cols=336 Identities=24% Similarity=0.328 Sum_probs=313.7 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChH--HcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTD--DVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIAS 78 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~--~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~ 78 (355) |+++||++|++|++++|++|||+.+ +++++|+|+|++||+|+++|||||+||++|||++|+|++|+++.+|.+|+++| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~q~~g~v~~~~~sg~~t~ 80 (343) T protein:vir:98 1 MNKTAQELFYSLIGDAAEYYGANPALALAGKQFSIEAPKESVLLGAIQQRSNFLEKINCVFSERYQRAIDLRSNRKRHYG 80 (343) T ss_pred CChHHHHHHHHHHHHHHHHhCCccchhccCceeeecHHHHHHHHHHHHHHHHHhhcCceecchhhcceEEEeecCccccC Confidence 9999999999999999999999853 67899999999999999999999999999999999999999999999999999 Q ss_pred cccCC-CCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccch-HHHHHHHHHHHHhhhhHHHHhhcccccccCC Q lcl|Aclame:pro 79 TTDTS-GDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQD-FQRRIRDAIVKRQALDLIMAGFNGTTRADTS 156 (355) Q Consensus 79 Rt~T~-~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~d-F~~~i~~~i~~~~alD~i~IGfnG~s~A~~T 156 (355) ||+|+ ++++|.+. ++++|+|+|||||+||+|++||+|||||| |++||++++.+|+|||||||||||+|+|++| T Consensus 81 r~~t~~~~~~~~~~-----~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~deF~~r~~~~i~~~~ALD~i~IGfNGts~A~~T 155 (343) T protein:vir:98 81 AHDRRTPIQQRWTR-----QVMSMNVSRQIQACLIPWAKLDQWGHLKDKFASLYAEFVQNQIALDMIKIGFYGTSVGTDT 155 (343) T ss_pred ccccCCCccccccC-----CCCccEEEEeeeeeeccHHHHHHhhcChhHHHHHHHHHHHHHHhhccceecccceeeccCC Confidence 99985 34455444 46789999999999999999999999998 9999999999999999999999999999998 Q ss_pred ChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEE Q lcl|Aclame:pro 157 DRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIV 236 (355) Q Consensus 157 d~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVviv 236 (355) +|||||||||||||++||++|+|||++++.+ .+++.+|+||||+||||||+|+++ |||||||++||||||| T Consensus 156 ---~nPllqDVN~GWLQ~~Re~ap~rVm~~~~~~-----~~~~~~G~ggdy~NLDalV~D~~~-~I~~~~~~d~dLVviv 226 (343) T protein:vir:98 156 ---SDPNLADVNKGWIQFVRENKATQILTQGATS-----GEIRLFGEGADYVNLDELAYDLKQ-GLDARHRDAGDLVFLV 226 (343) T ss_pred ---CCcchhhcchHHHHHHHhcchhhhhccceec-----cceeEecCCCCcccHHHHHHHHHh-cCchHHhcCCCEEEEE Confidence 6999999999999999999999999987543 345778999999999999999985 9999999999999999 Q ss_pred cHHHHHHHHHHHHhh-ccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEEccchhh Q lcl|Aclame:pro 237 GRKLLADKYFPLVNK-QQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPKKDR 315 (355) Q Consensus 237 G~dLl~~k~~~l~n~-~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~r 315 (355) |||||++|||||+|+ .++|||++|+++++++|+||||||++|||||+++||||+|||||||||+||+||+++|||+||| T Consensus 227 G~dLla~~~~~l~n~~~~~ptEk~Aa~~~~~~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~r 306 (343) T protein:vir:98 227 GADLVAKEASLVYKGNGLIATEKAALNTHDLMKSFGGMPAMIVPNMPPRAAIVTSLSNLSIYTQEGSMRRGMKDDDDKKA 306 (343) T ss_pred chhhhhhhhhhhhhhcCCChHHHHHHHHHHHHHhhCCCeeEEccccCCCceEEeeccccEEEEecCcEEEEEEecccccc Confidence 999999999999997 5789999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhhhhhhccccccEEEEecceecCccCCCCcCCCC Q lcl|Aclame:pro 316 VENYESMNIDYVVEVYAAGCLLENITLGDFTAPAAPESGA 355 (355) Q Consensus 316 ve~y~s~Ne~YvVEd~~~~a~ienI~~~~~~~~~~~~~~a 355 (355) |||||||||||||||||++|+||||+++-+.. +++ T Consensus 307 ie~y~s~Ne~YvVEd~~~~a~iE~i~v~~~~~-----~g~ 341 (343) T protein:vir:98 307 VRDSYYRNEAYAVEDCGKFMAVDFTKVKLSSG-----KGT 341 (343) T ss_pred ccchhhhcceeeeeccccEEEeeeeeeeecCC-----CCC Confidence 99999999999999999999999999965421 223 No 14 >protein:vir:3746 Length: 336 # NCBI annotation: orf15 # Family: family:all:201 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043487;genbank:gi:9628622;genbank:GeneID:1261135 Probab=100.00 E-value=1.9e-174 Score=972.95 Aligned_cols=332 Identities=27% Similarity=0.364 Sum_probs=314.1 Q ss_pred HHHHHHHHHHHHHHHHhCCChHH--cceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhccccccccccccc Q lcl|Aclame:pro 4 ETRFKFNAYLTRVAELNNISTDD--VSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTTD 81 (355) Q Consensus 4 ~tr~~f~~y~~~~A~~ngv~~~~--v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt~ 81 (355) -||++|++|++++|++|||+.+. ++++|+|+|++||+|+++|||||+||++||+++|+|++||+|++|++|||||||+ T Consensus 1 mtr~~~~~y~~~~A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtd 80 (336) T protein:vir:37 1 MNKQAYYALAAALAKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKQINMIQVAHTKGQKLFGATEKGVTGRKQ 80 (336) T ss_pred CcHHHHHHHHHHHHHHhCCChhhhccCceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEeeeccCcccccccC Confidence 57789999999999999998754 4689999999999999999999999999999999999999999999999999999 Q ss_pred CCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHH-HHHHHHHHHHhhhhHHHHhhcccccccCCChhh Q lcl|Aclame:pro 82 TSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQ-RRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) Q Consensus 82 T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~-~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~a 160 (355) |+ |.+++. ++++++|+|+|||||+||+|++||+|||||||+ .+++.++.+|+|||||||||||+|+|++|| T Consensus 81 t~----R~~~~~-~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~df~~~~~~~~~~r~iALD~i~IGfnG~s~A~~Td--- 152 (336) T protein:vir:37 81 TG----RNLANL-DHTQNGFELAETDSGIIVPWALFDSFAIFKDRLVELYSEYFQNQVALDILQIGWNGQSVADNTT--- 152 (336) T ss_pred CC----cccccc-CcCCcccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHHhhchhhhcccceeeccCCC--- Confidence 85 667775 899999999999999999999999999999966 567778888899999999999999999998 Q ss_pred hhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHH Q lcl|Aclame:pro 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKL 240 (355) Q Consensus 161 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dL 240 (355) |||||||||||||++||++|+|||++++..+|++ +.+|+||||+||||||+|+++ +||||||++||||||||||| T Consensus 153 nPllqDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i----~~~G~~gdy~NLDalV~D~~~-~I~~~~~~d~dLVvivG~dL 227 (336) T protein:vir:37 153 KADLSDVNKGWLKLLQEQRAANFMTESTKSSGKI----TIFGDNADYANLDDLAFDLKQ-GLDFRHQNRNDLVFLVGADL 227 (336) T ss_pred CCcccccchhHHHHHHhccchhhcccccccCCce----EEecCCCCcccHHHHHHHHHh-cCchHHhcCCCeEEEEchhh Confidence 9999999999999999999999999998777664 567999999999999999997 68999999999999999999 Q ss_pred HHHHHHHHHhhc-cccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEEccchhhhhhh Q lcl|Aclame:pro 241 LADKYFPLVNKQ-QENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPKKDRVENY 319 (355) Q Consensus 241 l~~k~~~l~n~~-~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~rve~y 319 (355) |++||+||+|+. +.|||++|+++++++|+||||||++|||||+++||||+|||||||||+|++||+++|||+||||||| T Consensus 228 la~~~~~l~~~~~~~PtE~~Aa~~~~~~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y 307 (336) T protein:vir:37 228 VSKETKLIQQKHGLTPTEKAALGSHNLMGSFGGMNAITPPNFPARAAAVTTLKNLSVYTEAESVRRSLRNDEDKKGLVTS 307 (336) T ss_pred hhhhhhhhhhhcCCCHHHHHHHHHHHHHHhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEEccccccccch Confidence 999999999984 7899999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhhccccccEEEEecceecCccCC Q lcl|Aclame:pro 320 ESMNIDYVVEVYAAGCLLENITLGDFTAP 348 (355) Q Consensus 320 ~s~Ne~YvVEd~~~~a~ienI~~~~~~~~ 348 (355) ||+||||||||||++|+||||++....+- T Consensus 308 ~s~Ne~YvVEd~~~~a~iE~i~v~~~~e~ 336 (336) T protein:vir:37 308 YYRQEGYVVEDLGLMTAIDHTKVKLNGEV 336 (336) T ss_pred hhhcceeeeeccccEEEeeeeeeeecCcC Confidence 99999999999999999999999775544 No 15 >protein:vir:3783 Length: 336 # NCBI annotation: capsid # Family: family:all:201 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536823;genbank:gi:17981832;genbank:GeneID:929211 Probab=100.00 E-value=2e-174 Score=972.84 Aligned_cols=332 Identities=26% Similarity=0.347 Sum_probs=312.4 Q ss_pred HHHHHHHHHHHHHHHHhCCChHH--cceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhccccccccccccc Q lcl|Aclame:pro 4 ETRFKFNAYLTRVAELNNISTDD--VSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTTD 81 (355) Q Consensus 4 ~tr~~f~~y~~~~A~~ngv~~~~--v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt~ 81 (355) -||++|++|++++|++|||+.+. ++++|+|+|++||+|+++|||||+||++||+++|+|++||+|++|++|||||||+ T Consensus 1 mtr~~~~~y~~~~A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtd 80 (336) T protein:vir:37 1 MNKQAYYALAAALAKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKGINMVQVAHTKGTKLFGATEKGVTGRKQ 80 (336) T ss_pred CcHHHHHHHHHHHHHHhCCChhhhcccceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEEeeccCcccccccC Confidence 57789999999999999998654 5689999999999999999999999999999999999999999999999999999 Q ss_pred CCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHH-HHHHHHHHHHhhhhHHHHhhcccccccCCChhh Q lcl|Aclame:pro 82 TSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQ-RRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) Q Consensus 82 T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~-~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~a 160 (355) |++ ++...++++++|+|+|||||+||+|++||+|||||||+ .+++.++.+|+|||||||||||+|+|++|| T Consensus 81 t~r-----~r~~~~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~d~~~~~~~~~~~r~iALD~i~IGfnG~s~A~~Td--- 152 (336) T protein:vir:37 81 TGR-----NLATLDHSQNGYELSETDSGILVNWSLFDSFAIFKDRLVELYSEYFQNQVALDILQIGWNGQSVATNTT--- 152 (336) T ss_pred CCC-----CccccCCCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHHhcchhhhcccceeeccCCC--- Confidence 962 33446799999999999999999999999999999955 567788888899999999999999999999 Q ss_pred hhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHH Q lcl|Aclame:pro 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKL 240 (355) Q Consensus 161 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dL 240 (355) |||||||||||||++||++|+|||++++..+|++ +.+|+||||+||||||+|+++ +||||||++||||||||||| T Consensus 153 nPllqDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i----~~~G~~gdy~NLDalV~D~~~-~I~~~~~~d~dLVvivG~dL 227 (336) T protein:vir:37 153 KTDLSDVNKGWLKLLQEQRAANFMTESTKSSGKI----TIFGDNADYANLDDLAFDLKQ-GLDFRHQNRNDLVFLVGADL 227 (336) T ss_pred CccccccchhHHHHHHhccchhhcccccccCCce----EEecCCCCcccHHHHHHHHHh-ccchHHhcCCCeEEEEchhh Confidence 9999999999999999999999999998777664 557999999999999999997 79999999999999999999 Q ss_pred HHHHHHHHHhhc-cccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEEccchhhhhhh Q lcl|Aclame:pro 241 LADKYFPLVNKQ-QENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPKKDRVENY 319 (355) Q Consensus 241 l~~k~~~l~n~~-~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~rve~y 319 (355) |++|||||+|+. +.|||++|+++++++|+||||||++|||||+++||||+|||||||||+|++||+++|||+||||||| T Consensus 228 la~~~~~l~~~~~~~PtE~~Aa~~~~~~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y 307 (336) T protein:vir:37 228 VSKETKLIQQKHGLTPTEKAALGSHNLMGSFGGMNAITPPNFPARAAAVTTLKNLSVYTEAESVRRSLRNDEDKKGLVTS 307 (336) T ss_pred hhhhhhhhhhhcCCCHHHHHHHHHHHHHHhhCCceEEEccccCCCceEEeeccccEEEEecCcEEEEEEEccccccccch Confidence 999999999984 7899999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhhccccccEEEEecceecCccCC Q lcl|Aclame:pro 320 ESMNIDYVVEVYAAGCLLENITLGDFTAP 348 (355) Q Consensus 320 ~s~Ne~YvVEd~~~~a~ienI~~~~~~~~ 348 (355) ||+||||||||||++|+||||++....+- T Consensus 308 ~s~Ne~YvVEd~~~~a~iE~i~v~~~~e~ 336 (336) T protein:vir:37 308 YYRQEGYVVEDLGLMTAIDHTKVKLNGEV 336 (336) T ss_pred hhhcceeeeeccccEEEeeeeeeeccccC Confidence 99999999999999999999999775554 No 16 >protein:vir:270 Length: 341 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536650;genbank:gi:17975128;genbank:GeneID:929084 Probab=100.00 E-value=2.1e-171 Score=956.30 Aligned_cols=333 Identities=26% Similarity=0.406 Sum_probs=312.4 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt 80 (355) |+++||++|++|++++|++|||+ +++++|+|+|++||+|+++|||||+||++||+++|+|++||+|++|++|+||||| T Consensus 5 m~~~tr~~~~~y~~~~A~~ngv~--~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrt 82 (341) T protein:vir:27 5 LTQSAREYMDNFAQQLAKSYGVS--NVAELFNVSPQLETKLRAAITESAEFLKMITVTTVDQIEGQVVDVGVSGLYTGRK 82 (341) T ss_pred ccHHHHHHHHHHHHHHHHHcCcc--cccceEeecHHHHHHHHHHHHhhHHhhhcCccccccceeeeEeecccccceeecc Confidence 99999999999999999999996 5899999999999999999999999999999999999999999999999999999 Q ss_pred cCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcc---cchHHHHHHHHHHHHhhhhHHHHhhcccccccCCC Q lcl|Aclame:pro 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWAR---FQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSD 157 (355) Q Consensus 81 ~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~---~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td 157 (355) +|+ |.++++ ++++++|+|+|||||+||+|++||+||| ||||++|+++++++|+|||||||||||+|+|++|| T Consensus 83 dt~----R~~r~~-~l~~~~Y~c~qtn~dt~i~y~~lDaWA~~g~~~dF~~r~~~~i~~~~ALD~i~IGfnGts~A~~Td 157 (341) T protein:vir:27 83 AGG----RFTKQV-GVGGHKYKLAETDSCAAITWAMLCQWANQGGRDQFMKHLTEFSNQMFALDIMRIGWNGVSAEADTD 157 (341) T ss_pred CCC----ceeccc-ccCCcceEEEEeeeeeeecHHHHHHHHhcCCChHHHHHHHHHHHHHHhhhhhhhcccceeeccCCC Confidence 985 556665 8999999999999999999999999999 99999999999999999999999999999999999 Q ss_pred hhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEc Q lcl|Aclame:pro 158 RTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVG 237 (355) Q Consensus 158 ~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG 237 (355) |++|||||||||||||++||++|+|||+++ ++..|+||||+||||||+|++++|||||||++|||||||| T Consensus 158 ~~anPllqDVNkGWlQ~~Re~a~~rVl~~~----------~~~~g~~gdy~nLDAlV~D~~~~lI~~~~~~d~dLVvivG 227 (341) T protein:vir:27 158 PSANPLGQDVNEGWIAFVKNRKASQVVDVD----------VYFDETNGDYRTLDAMASDIINNQIHPMFRNDPRLTVFVG 227 (341) T ss_pred hhhcccccccchhHHHHHHhhcccceeccc----------eeeccCCCccccHHHHHHHHHhcccChHHhcCCCEEEEEc Confidence 999999999999999999999999999864 3667999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEEccchhhhh Q lcl|Aclame:pro 238 RKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPKKDRVE 317 (355) Q Consensus 238 ~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~rve 317 (355) ||||++|||||||+.++|||++|++++ +|+||||||++|||||+++||||+|||||||||+|++||+++|||+||||| T Consensus 228 ~dLla~k~~~l~n~~~~ptE~~Aa~~i--~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie 305 (341) T protein:vir:27 228 SGLIGAAQAKLYDKADKPSEQIAAQKL--DKTIAGRPAYVPPFLPDNAMVVTIPENLQVLTQHGTAQRKAKHESDRKRSK 305 (341) T ss_pred hhhhhhhhhhhhccCCCCHHHHHHHHH--HHhhCCCeEEEccccCCCceEEeeccceEEEEecCcEEEEEEecccccccc Confidence 999999999999999999999999988 689999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhhhhccccccEEEEecceecCccCCCCcCCCC Q lcl|Aclame:pro 318 NYESMNIDYVVEVYAAGCLLENITLGDFTAPAAPESGA 355 (355) Q Consensus 318 ~y~s~Ne~YvVEd~~~~a~ienI~~~~~~~~~~~~~~a 355 (355) +|+| +||||||||++++|-..++.+.......+.- T Consensus 306 ~yes---~YvVEdyg~~~~~~~~~vkl~~~~~~~~~~~ 340 (341) T protein:vir:27 306 THTG---AWKVTQWVCWKRSPLTTQKKSTSALNHRSER 340 (341) T ss_pred chhh---hheeehhhhhhhccccccccCcccccccccc Confidence 9977 8999999999999944444433333322222 No 17 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=100.00 E-value=5.2e-80 Score=455.25 Aligned_cols=317 Identities=14% Similarity=0.170 Sum_probs=263.6 Q ss_pred HHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccccCC Q lcl|Aclame:pro 4 ETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTTDTS 83 (355) Q Consensus 4 ~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt~T~ 83 (355) -+++.|++|++++++.+++..+++...|+|.|+++|+|+++++++|.||++||+++|++.+|+++.+|+++++. |+.+. T Consensus 1 ~~~k~~~~~l~~~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~~~~~~~~~-~~~~e 79 (321) T protein:vir:31 1 MASRTINNDLSRITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKTRIPTLNIGERHR-RPQDE 79 (321) T ss_pred CchHHHHHHHHHHHHhccccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcceeeeeeccCCccc-ccccc Confidence 56788999999999999999888999999999999999999999999999999999999999999999988764 66555 Q ss_pred CCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhhhhh Q lcl|Aclame:pro 84 GDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTKNTL 163 (355) Q Consensus 84 ~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~anPl 163 (355) +..++.+.++ .+++.+|.|++++++++|+|++||+||++|||++++++.+++++|+|++++||||++++.++ T Consensus 80 ~~~~~~~~~~-~~~~~~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~------- 151 (321) T protein:vir:31 80 GEWNENESDV-STGTIDISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDS------- 151 (321) T ss_pred cccccccccc-eeeeeeeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCc------- Confidence 4445555554 68889999999999999999999999999999999999999999999999999998876443 Q ss_pred hhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHHHHH Q lcl|Aclame:pro 164 LQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKLLAD 243 (355) Q Consensus 164 lqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dLl~~ 243 (355) +++||+||||++|++.+ .++.++++.++|. +.+++. +|+++||+++++|||||++++.+ T Consensus 152 ~~~~n~G~l~~a~~~~~-------------------~~~~~~~~~~~d~-l~~l~~-~l~~~yr~~~~~v~im~~~~~~~ 210 (321) T protein:vir:31 152 FENQNDGFITVAEGDVE-------------------TIDAADDILDNDL-VIRTIA-GLDSKYRARMNPALIVSEDQLLS 210 (321) T ss_pred ccccchhhhhhhccccc-------------------cccccccccCHHH-HHHHHH-hccHhHhcCCCeEEEechHHHHH Confidence 78999999999887532 1234455566774 456665 57999999999999999999998 Q ss_pred HHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEE-cc---chhhhhhh Q lcl|Aclame:pro 244 KYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDE-NP---KKDRVENY 319 (355) Q Consensus 244 k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d-~p---~r~rve~y 319 (355) .+.+|.++.. +....+. .-..+++|+|+|++++||||++.+++|+|+||++|++++.++|+..+ .+ +++|+++| T Consensus 211 ~~~~l~~~~~-~~~~~~l-~~~~~~tl~G~pvv~~~~mP~~~il~t~~~nl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 288 (321) T protein:vir:31 211 YHYTLTDRDT-PLGDNVI-MGEADVNPFSFPIIGSGLWPDDKAMFTDPQNLIYALYRDLEIDVLTESDKVSERDLHARYF 288 (321) T ss_pred HHHHHhcCCC-ccccchh-hccccccccceeEEEcCCCCCCcEEEeccccEEEEEeeccEEEEeecCccccccceeeEee Confidence 8888876543 4332221 11245689999999999999999999999999999888866655444 33 57899999 Q ss_pred hhhhhhhhccccccEEEEecceecCccCCCCcCCC Q lcl|Aclame:pro 320 ESMNIDYVVEVYAAGCLLENITLGDFTAPAAPESG 354 (355) Q Consensus 320 ~s~Ne~YvVEd~~~~a~ienI~~~~~~~~~~~~~~ 354 (355) +++|++||||||+++|++|||+... ++.++++. T Consensus 289 ~~~~~~~~ve~~~a~a~~~~i~~~~--~~~~~~~~ 321 (321) T protein:vir:31 289 MRGDDDFAIENTEAVVLAEGLGDPL--EHLEEETS 321 (321) T ss_pred eeeecceeEeccccEEEEecCCcch--hcccCCCC Confidence 9999999999999999999998644 33333333 No 18 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=100.00 E-value=3.2e-47 Score=275.47 Aligned_cols=331 Identities=15% Similarity=0.178 Sum_probs=228.7 Q ss_pred CCHHH--HHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhh--hhcccccccc Q lcl|Aclame:pro 1 MRPET--RFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGE--KIGVGVTGTI 76 (355) Q Consensus 1 M~~~t--r~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge--~v~lgv~~~i 76 (355) |.+++ -+.-|+++..+++++ +..++++ +|.+.|+++++|..++|+++.||++|+++++...+|+ +|++|.-... T Consensus 1 ~~~~~~~~~~~n~~~~~i~k~~-it~~~l~-~g~L~p~~a~~Fl~~v~~~t~iL~~~r~~~~~s~~~ei~kig~G~r~~r 78 (360) T protein:vir:99 1 MSSNSTIDSVRNQNMNSLSQKD-IGLAELD-GFQLPVDVTEEFLERMQKGVQILGMADTMTLARLEMEVPQFGVPRLSGH 78 (360) T ss_pred CcchhHHHHHhhhHHHHHHhhh-ccccccC-ceeecHHHHHHHHHHHhhccchhhhcceeecccccccccccccceeecc Confidence 88654 355699999999997 6666775 8999999999999999999999999999999999999 6666553333 Q ss_pred cccccC--CCCcCccccccccccCcceeEEeeeecceeCHHHHHh--hcccchHHHHHHHHHHHHhhhhHHHHhhccccc Q lcl|Aclame:pro 77 ASTTDT--SGDKERQTADFTALESSKYECNQINFDFHLKYKTLDL--WARFQDFQRRIRDAIVKRQALDLIMAGFNGTTR 152 (355) Q Consensus 77 a~Rt~T--~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~--WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~ 152 (355) ++-.+. ....++....+.-....+|.| -++|.++.+.. |....+|++.+.+++.++++.|+.++||||.+. T Consensus 79 ~~~e~~~~~~~~~~~~~~v~~~~~~~~~~-----~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~g~~d 153 (360) T protein:vir:99 79 TRDEEGSRTENSEAESGSVKFNATDKSYY-----ILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIRAGAS 153 (360) T ss_pred ccccCCCCCcCCcCccccCccccccceee-----EeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhhccch Confidence 221111 111222222222111223333 33556666555 555668999999999999999999999999987 Q ss_pred ccCC--ChhhhhhhhccchhHHHHHHhhccccc----cccccccCC-------ccccceeeeCCCcchhhHHHHHHHHHh Q lcl|Aclame:pro 153 ADTS--DRTKNTLLQDVAVGWLQKYRNEAPARV----MSNITDADG-------KVVSAVIRVGKNGDYENIDALVMDATN 219 (355) Q Consensus 153 A~~T--d~~anPllqDVNkGWlq~~Re~a~~~v----~~~~~~~~g-------~~~~~~i~~G~ggdy~nLDaLv~d~~~ 219 (355) ..++ |-+.+|++ ++|+||||+++.+ ++.+ .+.....+. ......-+.|.|+-|....+|+.+++. T Consensus 154 s~d~~~~~~~d~fl-~~~dGwlKka~~~-~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~~~~~ 231 (360) T protein:vir:99 154 SGNLQSIGGAAELD-NTFKGWIARAEGD-AQSVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLFNETIQ 231 (360) T ss_pred hcccccCcccchhh-hhhHHHHHHhhcc-cchhhccccccccccccccccccccchhhhccccccccccchHHHHHHHHH Confidence 7644 34456766 9999999999876 3332 111111111 111223356888889999999999999 Q ss_pred cccchhhhCCC--CeEEEEcHHHHHHHHHHHHhhccccchhhHHHHH-HhhhhhcccccccCCccCCCcEEEecCCCcEE Q lcl|Aclame:pro 220 NLIDEVYQDDP--NLVAIVGRKLLADKYFPLVNKQQENSESLAADII-ISQKRIGNLPAVRVPYFPANAVLVTTLENLSI 296 (355) Q Consensus 220 ~lid~~~~~~~--~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~-~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsI 296 (355) +|++. |++++ .++++++.+......-.|-++..+.... .++ ....++-|.|++.||+||++.+|+|+++|| | T Consensus 232 ~Lp~k-yr~~~~~~~~~~~s~~~~~~yr~~L~~R~t~LGd~---~l~g~~~~~~~Gipi~~v~~~pd~~~mlT~p~NL-i 306 (360) T protein:vir:99 232 TLDSR-YRESDAYSPVLMTSPNQVQSYTMSLTEREDPLGSA---VIFGDSDITPFSYDLVGVNGFPDEYMMFTDPNNL-A 306 (360) T ss_pred hcchh-hhcCcccceEEEccCchHHHHHHHHhccCcccchh---heecccccccceeeeEEcCCCCCCceEEeccCce-e Confidence 88666 88877 4489999887665444444443322111 111 123467799999999999999999999999 5 Q ss_pred EEeeCcEEEEEEEccch---hh--hhhhhhhhhhhhccccccEEEEecceecCc Q lcl|Aclame:pro 297 YFMDESHRRSIDENPKK---DR--VENYESMNIDYVVEVYAAGCLLENITLGDF 345 (355) Q Consensus 297 Y~Q~gs~RR~~~d~p~r---~r--ve~y~s~Ne~YvVEd~~~~a~ienI~~~~~ 345 (355) |..-..+|.....+|+| +| +..|.+...+|++||++++|+++||+-.++ T Consensus 307 ~g~~~~iri~~~~e~~~~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~~~~~~~ 360 (360) T protein:vir:99 307 FGLYEEMELDQSTDTDKVHEQRLHSRNWLEGQFDFQIKEQQAGVLVTDLETPTA 360 (360) T ss_pred EEeeeeeEEeecccchhhhhhceeeeEEEEEEeeEEEEecccEEEEecCCCCCC Confidence 54444444444334444 22 444556679999999999999999987665 No 19 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=100.00 E-value=1.1e-42 Score=250.57 Aligned_cols=305 Identities=18% Similarity=0.188 Sum_probs=225.3 Q ss_pred CCHHHHHHHHHHHHHHHHHh-CCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCcccc-chhhhhhhhcccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELN-NISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILP-VAEMKGEKIGVGVTGTIAS 78 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~n-gv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~-V~e~~Ge~v~lgv~~~ia~ 78 (355) |. ++.++.+.. +++..+.+ .+.+.|.+.++|+++|+|+|.||++++++. +...+++.-.+|+.+.+++ T Consensus 1 ~~---------~~~~~~~~~k~it~~d~~-gG~L~P~~~~~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~ 70 (314) T protein:vir:41 1 MD---------FLNKPFQITPKIDVPDLG-KGILAVQRFGEFVREVRENSAIIKDARVLNALKSYEVDISRISLGVELEP 70 (314) T ss_pred Cc---------hhhhHHHhhcccccccCC-CceeChHHHHHHHHHHHhccchhhheeeecccCccceeecccccCccccc Confidence 32 222222221 22333443 677999999999999999999999999984 5666666666777777766 Q ss_pred cccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCCh Q lcl|Aclame:pro 79 TTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDR 158 (355) Q Consensus 79 Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~ 158 (355) ..+.++.....+......+..+|.|++....++|+|+.|+.|+..|+|++.+.+.+++++|.|+.+++|||.... . T Consensus 71 ~~~~~~~~~~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~----~ 146 (314) T protein:vir:41 71 GRNTSGTKVAPTADEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSL----T 146 (314) T ss_pred ccccccCCccCCcccccccceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCC----c Confidence 555544434445556678889999999999999999999999999999999999999999999999999996543 3 Q ss_pred hhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcH Q lcl|Aclame:pro 159 TKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGR 238 (355) Q Consensus 159 ~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~ 238 (355) +++|+++ +++|||++... + +....+++|.+.+.++.+++.+|.++++++.+++||||++ T Consensus 147 s~~~~~~-~p~G~l~~a~~---------------~-----~~~~~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~ 205 (314) T protein:vir:41 147 TGRELYR-INDGWMKLAGN---------------Q-----YTDAEPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSN 205 (314) T ss_pred Ccccchh-cchhhhhhccc---------------c-----eeecCccccccHHHHHHHHHHhcCchhhcCCCceEEEecH Confidence 4567777 99999996421 1 1122456789999999999999988777788999999999 Q ss_pred HHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCcc-----CCCcEEEecCCCcEEEEeeCcEEEEEEEccch Q lcl|Aclame:pro 239 KLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYF-----PANAVLVTTLENLSIYFMDESHRRSIDENPKK 313 (355) Q Consensus 239 dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~Pff-----P~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r 313 (355) +.+ .++.+++.....+--..+ ..-....+|.|.|++.+|+| |++.|++|.++|| ||.-.-.+||..+-++++ T Consensus 206 ~t~-~~~r~~l~~~~~~l~~~~-~~~~~~~~l~G~PV~~~~~~~~~~~~~~~i~fgd~~nl-v~~~~~~ir~~~~~~a~~ 282 (314) T protein:vir:41 206 EIY-NGYRKQLLVRETGLGDSA-LIGATGLQYDGIPIQYVPALDALGDDKARALLTVPTNL-VYGFWRNIRIEPKRDAAM 282 (314) T ss_pred HHH-HHHHHHHhccCCcccchh-hhCCCCceecceeeEecccccccCCCCceEEEechhhe-EEEeeceeEEeecccCcC Confidence 977 467777654433321111 11122457999999999997 6799999999999 776666677777777777 Q ss_pred hhhhhhhhhhhhhhccccccEEE--Eecceec Q lcl|Aclame:pro 314 DRVENYESMNIDYVVEVYAAGCL--LENITLG 343 (355) Q Consensus 314 ~rve~y~s~Ne~YvVEd~~~~a~--ienI~~~ 343 (355) +++..|....-++.+|+.+.+|. +++..-+ T Consensus 283 ~~~~~~~~~r~d~~~~~~~aa~~~~~~~~~~~ 314 (314) T protein:vir:41 283 RRTEYIASLRADCNYEDENAAVAAVIDMSSGG 314 (314) T ss_pred CeEEEEEEEEeceEEEEcCcEEEEEeeccCCC Confidence 89988888888877776655544 3332222 No 20 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=100.00 E-value=7.1e-37 Score=218.76 Aligned_cols=306 Identities=11% Similarity=0.086 Sum_probs=198.9 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCcccc-chhhhhhhhccccccccc-c Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILP-VAEMKGEKIGVGVTGTIA-S 78 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~-V~e~~Ge~v~lgv~~~ia-~ 78 (355) |-----.+.+....-+ +..+++ +. ..|.+.|++.++|+++++|+|.||++||++. ....+++.-.+|+.+++. + T Consensus 1 ~~~~~~~~~~~~~~~~-k~~t~~--d~-~Gg~l~P~~~~~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~~~~~g 76 (315) T protein:vir:41 1 MLTIEDIRGGKPFEIV-PKIDVP--DL-GRGVLSVDRFGEFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVLDVGPG 76 (315) T ss_pred CcccchhhcCChhhhh-hhcCCc--CC-CCceechHHHHHHHHHHHhhhhhhhhceeeeccccccccccccccCcccccc Confidence 2111111111111111 233442 34 4788999999999999999999999999864 455666655566655544 3 Q ss_pred cccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCCh Q lcl|Aclame:pro 79 TTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDR 158 (355) Q Consensus 79 Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~ 158 (355) ++.++. .++.+.....+...+|.|++..+.++|+|+.|+.|+..|||++.+.+.+++++|.|+.+++|||.+.+.++ T Consensus 77 ~~~~~~-~~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s~~p-- 153 (315) T protein:vir:41 77 RDETGQ-KLAPPESTAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDTSSSDP-- 153 (315) T ss_pred cccccC-cCCCCCCccccceeeeceeeeeeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccCCcCcCc-- Confidence 444432 34445555678889999999999999999999999999999999999999999999999999997765443 Q ss_pred hhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcH Q lcl|Aclame:pro 159 TKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGR 238 (355) Q Consensus 159 ~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~ 238 (355) + -..|+|||++.+........ .++.++. ..| ++.|++.+|..+++++.+++|+||++ T Consensus 154 ----~-~~~~~G~l~~a~~~~~~~~~----------------~~~a~~~-~~d-~l~~l~~sl~~~yr~~~~~~~~imn~ 210 (315) T protein:vir:41 154 ----L-LRMSDGWLKLASEKLTESDV----------------DPEAEDW-PMN-LFDTMIESLPTPYRNNLPNMKFYVTW 210 (315) T ss_pred ----c-ccccccceeccccccccccc----------------ccccccc-cHH-HHHHHHHhcChHHhhcCCceEEEEcH Confidence 2 23689999976543211100 0111111 123 45567777766665667899999999 Q ss_pred HHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCcc-----CCCcEEEecCCCcEEEEeeCcEEEEEEEccch Q lcl|Aclame:pro 239 KLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYF-----PANAVLVTTLENLSIYFMDESHRRSIDENPKK 313 (355) Q Consensus 239 dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~Pff-----P~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r 313 (355) +.+. ++-++......+--.-. ..-....+|.|.|++.+|.| |++.|++|.++||.+...++ +|+....+++. T Consensus 211 ~t~~-~~rklk~~~g~~lw~~~-~~~g~~~tl~G~PV~~~~~m~~~~~~~~~ilf~d~~nl~~~~~~~-i~i~~~~~a~~ 287 (315) T protein:vir:41 211 DIYR-AYRDALKGRETGLGDQA-LTGANSILYDGRPVQYVPALEALNDGKSRALFVVPTQLVYGFWRN-IKVVPDYDAEM 287 (315) T ss_pred HHHH-HHHHHhccCCCccccch-hhcCCCceecccceEecccccccCCCCccEEEecccceEEEeccc-cEEEeeecCCC Confidence 9885 56666544322221100 01112358999999888777 67889999999995544333 44433334444 Q ss_pred hhhhhhh--hhhhhhhccccccEEEEecc Q lcl|Aclame:pro 314 DRVENYE--SMNIDYVVEVYAAGCLLENI 340 (355) Q Consensus 314 ~rve~y~--s~Ne~YvVEd~~~~a~ienI 340 (355) .++..|. ...-+|++|++ +++.+.+| T Consensus 288 ~~~~~~~~~r~d~~~~~~~~-~a~~~~~v 315 (315) T protein:vir:41 288 RLTKYVASLRTDNHYEDEEG-AVSATITV 315 (315) T ss_pred CceEEEEEEEeceeEEeccc-eeEeeeeC Confidence 4443332 22456788885 66667777 No 21 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=98.78 E-value=6.3e-10 Score=71.02 Aligned_cols=304 Identities=12% Similarity=0.081 Sum_probs=158.6 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt 80 (355) |+++-|+.|+++++. .+. ....+.|-+.+...+++.+.+.|.++++++++++.-..+...... +++-+.-+ T Consensus 72 l~~~~r~~~~~~~~~----~~~----~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~i~~~~-~~~~a~~~ 142 (390) T protein:vir:40 72 LTSDESKYYNEVIAG----NGF----AGVTALLPPTVFERVFEDLTVEHPLLSKINFVNTTATTEWIISVG-DVATAWWG 142 (390) T ss_pred ccHHHHHHHHHHHhc----cCc----ccCcccccHHHHHHHHHHHHhhhhhhhhceeeecCCceeEEEEEc-CCcceeee Confidence 556666656554432 222 123567777888999999999999999999999865333322222 22222222 Q ss_pred cCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhh Q lcl|Aclame:pro 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) Q Consensus 81 ~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~a 160 (355) . ......+.....++...|.+++.--...|+.+.|+... .+|+..+++.+.++++.-.-.--++|+-. . T Consensus 143 ~--E~~~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~ds~--~~l~~~i~~~la~~i~~~~~~a~l~G~G~-------~ 211 (390) T protein:vir:40 143 P--LCAEIKEVLDNGFDKIQTGMYKLSAYIPVCNAMLDLGP--SWLDQYVRTILGEAMALGLEAGIVNGSGK-------D 211 (390) T ss_pred c--cccccCccccccceeeEeeeeeEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHhhhhcccCC-------C Confidence 2 11222223334567788888888888899999999653 37999999999999988877777888531 1 Q ss_pred hhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHH Q lcl|Aclame:pro 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKL 240 (355) Q Consensus 161 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dL 240 (355) -| .|+|... .-+ + .+... ......-.+.+...++..+...+.+...+.....++||.+.- T Consensus 212 ~P------~Gil~~~-----~~~----~--~~~~~---~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t 271 (390) T protein:vir:40 212 QP------IGMMRDL-----NNV----T--AGEHP---VKTATPLTDLTPATLATKVMLPLTDNGKKSVSDAILVINPAD 271 (390) T ss_pred cc------ceeeecc-----ccc----c--ccccc---cccccccchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchh Confidence 12 4665321 000 0 00000 000111233344444444444333322233467899998653 Q ss_pred HHHHHHHHHhh-ccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEEcc--chhhhh Q lcl|Aclame:pro 241 LADKYFPLVNK-QQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENP--KKDRVE 317 (355) Q Consensus 241 l~~k~~~l~n~-~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p--~r~rve 317 (355) ..+ ++..+.. .+....-+ ......|+|++..+++|++.+++-.+++.-|+ .++.++=..-++. .++.+. T Consensus 272 ~~~-~l~~~~~~~d~~G~~v------~~~~~~g~pvv~~~~~p~~~i~~Gd~s~~~i~-~~~~~~v~~~~~~~f~~~~~~ 343 (390) T protein:vir:40 272 YWS-KIYAATSYMTPQGVWV------TGILPVPLEIVQSVAVPVGKAVAGRAKDYFMG-IGSEQVIRTSTEYRLLDDETL 343 (390) T ss_pred HHH-HHHHHhhccCCCCccc------cccCCCceeEEEcCCCCCCcEEEEeeceEEEE-eecceEEEecchhhhhcCcEE Confidence 221 1111111 11111111 12235699999999999999999999986444 4444442222211 112222 Q ss_pred hhhhhhhhhhccccccEEEEecceecCcc-CCCCcCCCC Q lcl|Aclame:pro 318 NYESMNIDYVVEVYAAGCLLENITLGDFT-APAAPESGA 355 (355) Q Consensus 318 ~y~s~Ne~YvVEd~~~~a~ienI~~~~~~-~~~~~~~~a 355 (355) -+-..-.+..|-|.++++.++ +.... .++.+..+- T Consensus 344 ~r~~~r~dg~v~~~~A~~~l~---~~~~~~~~~~~~~~~ 379 (390) T protein:vir:40 344 YYAKQYANGRPKDNSSFLVFD---ITGLEGSPAIDVNVV 379 (390) T ss_pred EEEEEEeCCEEecccceEEEE---eeccCCCCCCCccee Confidence 222222333444444444432 21111 111111111 No 22 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=98.62 E-value=4.4e-09 Score=66.40 Aligned_cols=307 Identities=15% Similarity=0.147 Sum_probs=165.6 Q ss_pred CC-HHHHHHHHHHHHHHHHHhCCChH-HcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccc Q lcl|Aclame:pro 1 MR-PETRFKFNAYLTRVAELNNISTD-DVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIAS 78 (355) Q Consensus 1 M~-~~tr~~f~~y~~~~A~~ngv~~~-~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~ 78 (355) +. .+.+..|..|+.+-.....+... .....|.|-+.+.+.+++.+++.+.+++.++++++.-..+. +-...+++-+. T Consensus 108 ~~~~~~~~af~~~l~~~e~~~al~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~-~~~~~~~~~a~ 186 (425) T protein:vir:10 108 LRDPEYTEAFKAHVKRGDVQAALNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQLCRVQPVSKAGFS-KLFNMGGTTSG 186 (425) T ss_pred cccHHHHHHHHHHhhhhhhHHHhhcCcCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeeccCCceE-EEEEcCCccee Confidence 22 34466788887543222211110 11235677777788899999999999999999998765443 33444555554 Q ss_pred cccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCCh Q lcl|Aclame:pro 79 TTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDR 158 (355) Q Consensus 79 Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~ 158 (355) -+. ........+...++...|.+++.---+.|+.+.|+... ++|+..+.+.+.+.++.=.-.--+||+-. T Consensus 187 wv~--E~~~~~~~~~~~f~~v~~~~~k~~~~i~iS~ell~ds~--~~l~~~i~~~la~ai~~~~d~~~l~G~G~------ 256 (425) T protein:vir:10 187 WVG--EASQRPQTNAATFQPLSFASGEIYANPAATQQILDDAE--IDLESWLATEVQTEFAKQEGKAFLAGDGT------ 256 (425) T ss_pred eec--cccccccccccccceeeeeheeeEeehHhHHHHHhcch--hHHHHHHHHHHHHHHHHHHHhhhhcccCC------ Confidence 332 22222223333567788999999889999999998654 68999999999999988777777888431 Q ss_pred hhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcH Q lcl|Aclame:pro 159 TKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGR 238 (355) Q Consensus 159 ~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~ 238 (355) .+ ..|+|...-....... .+... ...+..+..+. .+.|.|+ |++.+| ++.|+. .-+++|.+ T Consensus 257 -~~------p~Gil~~~~~~~~~~~------~~~~~-~~~~~~~~~~~-~~~d~l~-~l~~~l-~~~~~~--~a~~vmn~ 317 (425) T protein:vir:10 257 -NK------PNGLLTYIAGGANAAK------HPFGA-IEVVNSGAAAD-ITSDGII-DLVYDL-PSAFTG--NARFAMNR 317 (425) T ss_pred -CC------cceeeecccccccccc------ccccc-ccccccccccc-ccHHHHH-HHHhhh-hhhhcc--CCEEEEch Confidence 12 3366643311100000 00000 00111111111 2345544 577765 666775 44888998 Q ss_pred HHHHHHHHHHH-hhccccchhhHHHHH-HhhhhhcccccccCCccCC-----CcEEEecCCCcEEEEeeCcEEEEEEEcc Q lcl|Aclame:pro 239 KLLADKYFPLV-NKQQENSESLAADII-ISQKRIGNLPAVRVPYFPA-----NAVLVTTLENLSIYFMDESHRRSIDENP 311 (355) Q Consensus 239 dLl~~k~~~l~-n~~~~~te~~aa~~~-~~~k~iGGlpa~~~PffP~-----~~ilIT~l~NLsIY~Q~gs~RR~~~d~p 311 (355) .... ++..+ +....|--. ..+. ....+|-|+|++..++||+ ..+++=.+++.-..+.+...+.....-- T Consensus 318 ~~~~--~L~~lkD~~G~~l~~--~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~v~~d~~~ 393 (425) T protein:vir:10 318 NTQR--QVRKLKDGQGNYLWQ--PSYVAGQPATLAGYPVTEVPDMPDVAANSTPILFGDFQQTYLIIDRIGVRVLRDPYT 393 (425) T ss_pred HHHH--HHHHhhcCCCceeec--cCccCCCCceecceeeEEecCcCCccCCccEEEEEehhccEEEEEecceEEEecccc Confidence 7665 23322 222222100 0000 1124788999999999995 3477777877544455666654322111 Q ss_pred chhhhhhhhhhhhhhhccccccEEEEe--cceecCccCCC Q lcl|Aclame:pro 312 KKDRVENYESMNIDYVVEVYAAGCLLE--NITLGDFTAPA 349 (355) Q Consensus 312 ~r~rve~y~s~Ne~YvVEd~~~~a~ie--nI~~~~~~~~~ 349 (355) .++ ..+|..+.+--++.++ .+.+...+++. T Consensus 394 ~~~--------~~~~~~~~r~d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 394 AKP--------YVLFYTTKRVGGGLLNPEPMRAMKVAASE 425 (425) T ss_pred cCC--------cEEEEEEEEeccEeecccceEEEEeeccC Confidence 111 1233333222222221 22222211111 No 23 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=98.58 E-value=9.5e-09 Score=64.56 Aligned_cols=281 Identities=10% Similarity=0.047 Sum_probs=166.4 Q ss_pred HHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccccCCCCcCcccccccc Q lcl|Aclame:pro 16 VAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTTDTSGDKERQTADFTA 95 (355) Q Consensus 16 ~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt~T~~~~~r~~~~~~~ 95 (355) +| ++..+.|-|...+.+.+.++++|.+++..+++++.--.. ++-.-.+++-|+-+.- +.+. |..... T Consensus 1 ma---------~~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~-~~p~~~~~~~a~~v~E--g~~~-~~~~~~ 67 (298) T protein:vir:94 1 MV---------LNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGE-KVFTFTMDSEIDVVAE--SGKK-THGGVT 67 (298) T ss_pred Ce---------eccccccChhHHHHHHHHHHhhchhhhhcceeeccCCce-EEEEEecCcceEEeeC--Cccc-cccccc Confidence 21 344667888889999999999999999999998765222 3333234444544432 2223 333345 Q ss_pred ccCcceeEEeeeecceeCHHHHHhhc-ccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhhhhhhhccchhHHHH Q lcl|Aclame:pro 96 LESSKYECNQINFDFHLKYKTLDLWA-RFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTKNTLLQDVAVGWLQK 174 (355) Q Consensus 96 l~~~~Y~c~qTn~d~~i~y~~LD~WA-~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~anPllqDVNkGWlq~ 174 (355) ++.....+++.---+.|+.+.|.+.. ...+|.+.+.+.+.++++.....--+||+....-++.. +. ..-++... T Consensus 68 f~~v~l~~~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~----~~-~~~~~~~~ 142 (298) T protein:vir:94 68 LAPQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASA----VI-GTNHFDSK 142 (298) T ss_pred eeEEEEeeeEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccc----cc-cccccccc Confidence 67788888888889999999997765 45789999999999999988888888985322111110 00 00011111 Q ss_pred HHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHHHHHHHHHHHhhccc Q lcl|Aclame:pro 175 YRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKLLADKYFPLVNKQQE 254 (355) Q Consensus 175 ~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dLl~~k~~~l~n~~~~ 254 (355) +......++.+ ...+..+.+++..+ ...+++ +. +++|.+..... -..|...... T Consensus 143 --------------------~~~~~~~~~~~--~~~~~~i~~~~~~~-~~~~~~-~~-~~vmn~~~~~~-l~~lkd~~G~ 196 (298) T protein:vir:94 143 --------------------VTQKVEAPRGI--ADPNGAIENAVELL-TGVDAD-VT-GIAINPSFRSA-LAKQKDLQGN 196 (298) T ss_pred --------------------ccccccccccc--ccHHHHHHHHHHhh-hhcCCC-cc-EEEEcHHHHHH-HHHhhccCCC Confidence 11111112221 23333455555544 332332 22 68888876652 2223222221 Q ss_pred cchhhHHHHHHhhhhhcccccccCCccCCC------cEEEecCCCcEEEEeeCcEEEEEEEccchhhh-hhhhhhh---- Q lcl|Aclame:pro 255 NSESLAADIIISQKRIGNLPAVRVPYFPAN------AVLVTTLENLSIYFMDESHRRSIDENPKKDRV-ENYESMN---- 323 (355) Q Consensus 255 ~te~~aa~~~~~~k~iGGlpa~~~PffP~~------~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~rv-e~y~s~N---- 323 (355) +-=.. ...-....++-|+|++..+++|++ .+++-.++++-.|..++..+-.+.+..+-|+. .+|..+| T Consensus 197 ~l~~~-~~~~~~~~tl~G~PV~~~~~v~~~~~~~~~~~~~Gdfs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~ 275 (298) T protein:vir:94 197 ALFPE-LKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYI 275 (298) T ss_pred eeecC-cccCCCCceecceeeEEecccccccCCCccEEEEeeccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEE Confidence 11000 000012347889999999999975 47888999987787777777766554332222 2344444 Q ss_pred -----hhhhccccccEEEEecce Q lcl|Aclame:pro 324 -----IDYVVEVYAAGCLLENIT 341 (355) Q Consensus 324 -----e~YvVEd~~~~a~ienI~ 341 (355) -++.|.+.++++.+.+++ T Consensus 276 r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 276 RAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred EEEEEeccEeecccceEEEEecC Confidence 355777888888888777 No 24 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=98.55 E-value=2.2e-08 Score=62.58 Aligned_cols=299 Identities=13% Similarity=0.144 Sum_probs=154.6 Q ss_pred CCHHHHHHHHHHHHHHHHH--hCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccc-ccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAEL--NNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGV-TGTIA 77 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~--ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv-~~~ia 77 (355) +....+..|..++...... ..+.. ....+.|-..+...+.+.+.+.+.+++++++++++...|.....-. ++.-+ T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (415) T protein:vir:98 101 VTSQEVRDFTEYLETRNDIQGGSLKT--DSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAAL 178 (415) T ss_pred hHHHHHHHHHHHHhhhhhhhhccccc--cccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccc Confidence 2233333344443332221 11111 1123444446788999999999999999999999888777544322 22222 Q ss_pred ccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCC Q lcl|Aclame:pro 78 STTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSD 157 (355) Q Consensus 78 ~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td 157 (355) .-+.. .......+...++...+..++.---+.|+.+.|+.. ..+|+..+.+.+.++++.-.-.--++|.-..... T Consensus 179 ~~v~E--~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~- 253 (415) T protein:vir:98 179 EKVEE--LEENPELAVKPFFQLAYDINTHRGYFRISREAIEDA--KVNVLQELKLWMARTIAATRNKAIIDVITKGSTG- 253 (415) T ss_pred eeecc--ccccCcccccceeeEEeeeeeeEeeehhhHHHHhhc--hHHHHHHHHHHHHHHHHHHHHHHHhhccccCccc- Confidence 22221 122222233346667777777777788999998864 2478888999988888776666666664322111 Q ss_pred hhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEc Q lcl|Aclame:pro 158 RTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVG 237 (355) Q Consensus 158 ~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG 237 (355) ..- .++. . ........+.. +.|.+ .+++..+.++.+. .-+++|. T Consensus 254 ------~~~--~~~~-----------------~----~~~~~~~~~~~---~~~~i-~~~~~~~~~~~~~---~~~~v~n 297 (415) T protein:vir:98 254 ------STS--SGFE-----------------K----EGKKLEVKKAK---SLDDI-KDAINLNVKPNYE---HNVAIVS 297 (415) T ss_pred ------ccc--cccc-----------------c----ccccccccccc---chhHH-HHHHHhhhhhccC---CCEEEEc Confidence 100 0000 0 00011112222 34543 4677666555443 3478899 Q ss_pred HHHHHHHHHHHH-hhccccchhhHHHHH-HhhhhhcccccccCCccCCCc-----EEEecCCCcEEEEeeCcEEEEEEEc Q lcl|Aclame:pro 238 RKLLADKYFPLV-NKQQENSESLAADII-ISQKRIGNLPAVRVPYFPANA-----VLVTTLENLSIYFMDESHRRSIDEN 310 (355) Q Consensus 238 ~dLl~~k~~~l~-n~~~~~te~~aa~~~-~~~k~iGGlpa~~~PffP~~~-----ilIT~l~NLsIY~Q~gs~RR~~~d~ 310 (355) ++.+.. +..+ .....+- ...... ....+|-|+|++..|++|... +++-.|+++-+.+.++..+=...+. T Consensus 298 ~~~~~~--l~~lkd~~G~~l--~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~ 373 (415) T protein:vir:98 298 QTMFAK--LDKMKDKLGNYL--IQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY 373 (415) T ss_pred HHHHHH--HHHhhccCCcee--eccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEecc Confidence 887652 3333 1111111 000000 123489999999999999765 8888899876666666555443321 Q ss_pred cchhhhhhhhhhhhhhhcc-ccccEEEE-ecceecCccCCCCcCCCC Q lcl|Aclame:pro 311 PKKDRVENYESMNIDYVVE-VYAAGCLL-ENITLGDFTAPAAPESGA 355 (355) Q Consensus 311 p~r~rve~y~s~Ne~YvVE-d~~~~a~i-enI~~~~~~~~~~~~~~a 355 (355) . .| ..+|.++ .++....- +.+.+.+.++++.+++-- T Consensus 374 ~------~~---~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~ 411 (415) T protein:vir:98 374 M------HF---GECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDL 411 (415) T ss_pred c------cC---ceEEEEEEEeccEEeccccEEEEEEeccCCCCCcc Confidence 1 11 1122222 22221111 144444434444333332 No 25 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=98.55 E-value=2.2e-08 Score=62.58 Aligned_cols=299 Identities=13% Similarity=0.144 Sum_probs=154.6 Q ss_pred CCHHHHHHHHHHHHHHHHH--hCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccc-ccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAEL--NNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGV-TGTIA 77 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~--ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv-~~~ia 77 (355) +....+..|..++...... ..+.. ....+.|-..+...+.+.+.+.+.+++++++++++...|.....-. ++.-+ T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (415) T protein:vir:81 101 VTSQEVRDFTEYLETRNDIQGGSLKT--DSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAAL 178 (415) T ss_pred hHHHHHHHHHHHHhhhhhhhhccccc--cccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccc Confidence 2233333344443332221 11111 1123444446788999999999999999999999888777544322 22222 Q ss_pred ccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCC Q lcl|Aclame:pro 78 STTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSD 157 (355) Q Consensus 78 ~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td 157 (355) .-+.. .......+...++...+..++.---+.|+.+.|+.. ..+|+..+.+.+.++++.-.-.--++|.-..... T Consensus 179 ~~v~E--~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~- 253 (415) T protein:vir:81 179 EKVEE--LEENPELAVKPFFQLAYDINTHRGYFRISREAIEDA--KVNVLQELKLWMARTIAATRNKAIIDVITKGSTG- 253 (415) T ss_pred eeecc--ccccCcccccceeeEEeeeeeeEeeehhhHHHHhhc--hHHHHHHHHHHHHHHHHHHHHHHHhhccccCccc- Confidence 22221 122222233346667777777777788999998864 2478888999988888776666666664322111 Q ss_pred hhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEc Q lcl|Aclame:pro 158 RTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVG 237 (355) Q Consensus 158 ~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG 237 (355) ..- .++. . ........+.. +.|.+ .+++..+.++.+. .-+++|. T Consensus 254 ------~~~--~~~~-----------------~----~~~~~~~~~~~---~~~~i-~~~~~~~~~~~~~---~~~~v~n 297 (415) T protein:vir:81 254 ------STS--SGFE-----------------K----EGKKLEVKKAK---SLDDI-KDAINLNVKPNYE---HNVAIVS 297 (415) T ss_pred ------ccc--cccc-----------------c----ccccccccccc---chhHH-HHHHHhhhhhccC---CCEEEEc Confidence 100 0000 0 00011112222 34543 4677666555443 3478899 Q ss_pred HHHHHHHHHHHH-hhccccchhhHHHHH-HhhhhhcccccccCCccCCCc-----EEEecCCCcEEEEeeCcEEEEEEEc Q lcl|Aclame:pro 238 RKLLADKYFPLV-NKQQENSESLAADII-ISQKRIGNLPAVRVPYFPANA-----VLVTTLENLSIYFMDESHRRSIDEN 310 (355) Q Consensus 238 ~dLl~~k~~~l~-n~~~~~te~~aa~~~-~~~k~iGGlpa~~~PffP~~~-----ilIT~l~NLsIY~Q~gs~RR~~~d~ 310 (355) ++.+.. +..+ .....+- ...... ....+|-|+|++..|++|... +++-.|+++-+.+.++..+=...+. T Consensus 298 ~~~~~~--l~~lkd~~G~~l--~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~ 373 (415) T protein:vir:81 298 QTMFAK--LDKMKDKLGNYL--IQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY 373 (415) T ss_pred HHHHHH--HHHhhccCCcee--eccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEecc Confidence 887652 3333 1111111 000000 123489999999999999765 8888899876666666555443321 Q ss_pred cchhhhhhhhhhhhhhhcc-ccccEEEE-ecceecCccCCCCcCCCC Q lcl|Aclame:pro 311 PKKDRVENYESMNIDYVVE-VYAAGCLL-ENITLGDFTAPAAPESGA 355 (355) Q Consensus 311 p~r~rve~y~s~Ne~YvVE-d~~~~a~i-enI~~~~~~~~~~~~~~a 355 (355) . .| ..+|.++ .++....- +.+.+.+.++++.+++-- T Consensus 374 ~------~~---~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~ 411 (415) T protein:vir:81 374 M------HF---GECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDL 411 (415) T ss_pred c------cC---ceEEEEEEEeccEEeccccEEEEEEeccCCCCCcc Confidence 1 11 1122222 22221111 144444434444333332 No 26 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=98.55 E-value=2.2e-08 Score=62.58 Aligned_cols=299 Identities=13% Similarity=0.144 Sum_probs=154.6 Q ss_pred CCHHHHHHHHHHHHHHHHH--hCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccc-ccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAEL--NNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGV-TGTIA 77 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~--ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv-~~~ia 77 (355) +....+..|..++...... ..+.. ....+.|-..+...+.+.+.+.+.+++++++++++...|.....-. ++.-+ T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (415) T protein:vir:79 101 VTSQEVRDFTEYLETRNDIQGGSLKT--DSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAAL 178 (415) T ss_pred hHHHHHHHHHHHHhhhhhhhhccccc--cccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCccc Confidence 2233333344443332221 11111 1123444446788999999999999999999999888777544322 22222 Q ss_pred ccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCC Q lcl|Aclame:pro 78 STTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSD 157 (355) Q Consensus 78 ~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td 157 (355) .-+.. .......+...++...+..++.---+.|+.+.|+.. ..+|+..+.+.+.++++.-.-.--++|.-..... T Consensus 179 ~~v~E--~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~- 253 (415) T protein:vir:79 179 EKVEE--LEENPELAVKPFFQLAYDINTHRGYFRISREAIEDA--KVNVLQELKLWMARTIAATRNKAIIDVITKGSTG- 253 (415) T ss_pred eeecc--ccccCcccccceeeEEeeeeeeEeeehhhHHHHhhc--hHHHHHHHHHHHHHHHHHHHHHHHhhccccCccc- Confidence 22221 122222233346667777777777788999998864 2478888999988888776666666664322111 Q ss_pred hhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEc Q lcl|Aclame:pro 158 RTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVG 237 (355) Q Consensus 158 ~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG 237 (355) ..- .++. . ........+.. +.|.+ .+++..+.++.+. .-+++|. T Consensus 254 ------~~~--~~~~-----------------~----~~~~~~~~~~~---~~~~i-~~~~~~~~~~~~~---~~~~v~n 297 (415) T protein:vir:79 254 ------STS--SGFE-----------------K----EGKKLEVKKAK---SLDDI-KDAINLNVKPNYE---HNVAIVS 297 (415) T ss_pred ------ccc--cccc-----------------c----ccccccccccc---chhHH-HHHHHhhhhhccC---CCEEEEc Confidence 100 0000 0 00011112222 34543 4677666555443 3478899 Q ss_pred HHHHHHHHHHHH-hhccccchhhHHHHH-HhhhhhcccccccCCccCCCc-----EEEecCCCcEEEEeeCcEEEEEEEc Q lcl|Aclame:pro 238 RKLLADKYFPLV-NKQQENSESLAADII-ISQKRIGNLPAVRVPYFPANA-----VLVTTLENLSIYFMDESHRRSIDEN 310 (355) Q Consensus 238 ~dLl~~k~~~l~-n~~~~~te~~aa~~~-~~~k~iGGlpa~~~PffP~~~-----ilIT~l~NLsIY~Q~gs~RR~~~d~ 310 (355) ++.+.. +..+ .....+- ...... ....+|-|+|++..|++|... +++-.|+++-+.+.++..+=...+. T Consensus 298 ~~~~~~--l~~lkd~~G~~l--~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~ 373 (415) T protein:vir:79 298 QTMFAK--LDKMKDKLGNYL--IQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDY 373 (415) T ss_pred HHHHHH--HHHhhccCCcee--eccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEecc Confidence 887652 3333 1111111 000000 123489999999999999765 8888899876666666555443321 Q ss_pred cchhhhhhhhhhhhhhhcc-ccccEEEE-ecceecCccCCCCcCCCC Q lcl|Aclame:pro 311 PKKDRVENYESMNIDYVVE-VYAAGCLL-ENITLGDFTAPAAPESGA 355 (355) Q Consensus 311 p~r~rve~y~s~Ne~YvVE-d~~~~a~i-enI~~~~~~~~~~~~~~a 355 (355) . .| ..+|.++ .++....- +.+.+.+.++++.+++-- T Consensus 374 ~------~~---~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~ 411 (415) T protein:vir:79 374 M------HF---GECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDL 411 (415) T ss_pred c------cC---ceEEEEEEEeccEEeccccEEEEEEeccCCCCCcc Confidence 1 11 1122222 22221111 144444434444333332 No 27 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=98.53 E-value=3.2e-08 Score=61.67 Aligned_cols=298 Identities=13% Similarity=0.131 Sum_probs=154.5 Q ss_pred CCHHHHHHHHHHHHHHHHHh--CCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcc-ccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELN--NISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGV-GVTGTIA 77 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~n--gv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~l-gv~~~ia 77 (355) +....+..|..+.......- ++.. ....+.|-..+...+.+.+.+.+.+++.+++++++...|..... ..+++-+ T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~t--~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (415) T protein:vir:46 101 VTSQEVRDFTEYLETRNDIQGGSLKT--DSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAAL 178 (415) T ss_pred hhHHHHHHHHHHHhhhhhhhhccccc--cCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcce Confidence 23333344444443322111 1111 11234445566788999999999999999999998877754322 2233333 Q ss_pred ccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCC Q lcl|Aclame:pro 78 STTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSD 157 (355) Q Consensus 78 ~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td 157 (355) +-+..+ ......+...++...+..++.---+.|+++.|+... .+|+..+.+.+.++++.-.-.--++|.-...+. T Consensus 179 ~~v~Eg--~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~- 253 (415) T protein:vir:46 179 EKVEEL--EENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMARTIAATRNKAIIDVITKGSTG- 253 (415) T ss_pred eecccc--cccccccccceeeEEeeeeeeEeeehhhHHHHhhch--HHHHHHHHHHHHHHHHHHHHHHHhhccccCCcc- Confidence 333222 122222333566777777777777889999997644 488999999999998887777777774321111 Q ss_pred hhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEc Q lcl|Aclame:pro 158 RTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVG 237 (355) Q Consensus 158 ~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG 237 (355) .. ..++.. ........+...| |.+ .+++..+.++.+.. -+++|. T Consensus 254 ------~~--~~~~~~---------------------~~~~~~~~~~~~~---~~i-~~~~~~~~~~~~~~---~~~v~n 297 (415) T protein:vir:46 254 ------ST--SSGFEK---------------------EGKKLEVKKAKSL---DDI-KDAINLNVKPNYEH---NVAIVS 297 (415) T ss_pred ------cc--cccccc---------------------ccceeccccccch---HHH-HHHHHhhhhhccCC---CEEEEc Confidence 00 000000 0001111122234 433 35676666665442 378888 Q ss_pred HHHHHHHHHHHHhhcc-ccc--hhhHHHHHHhhhhhcccccccCCccCCCc-----EEEecCCCcEEEEeeCcEEEEEEE Q lcl|Aclame:pro 238 RKLLADKYFPLVNKQQ-ENS--ESLAADIIISQKRIGNLPAVRVPYFPANA-----VLVTTLENLSIYFMDESHRRSIDE 309 (355) Q Consensus 238 ~dLl~~k~~~l~n~~~-~~t--e~~aa~~~~~~k~iGGlpa~~~PffP~~~-----ilIT~l~NLsIY~Q~gs~RR~~~d 309 (355) ++.+. .+..+-..+ .+- +.... ....+|-|+|++..+++|... +++=.|+++.+.+.+....=...+ T Consensus 298 ~~~~~--~L~~lkd~~G~~i~~~~~~~---~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~ 372 (415) T protein:vir:46 298 QTMFA--KLDKMKDKLGNYLIQPDVKE---KTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD 372 (415) T ss_pred HHHHH--HHHHhhccCCCeeeccCcCC---CCCccccceeeEEeccccccCCCccEEEEEehhccEEEEeecceEEEeec Confidence 88765 233332111 111 00000 123589999999999999654 788888887555554444333222 Q ss_pred ccchhhhhhhhhhhhhhh-ccccccEEEE-ecceecCccCCCCcCCCC Q lcl|Aclame:pro 310 NPKKDRVENYESMNIDYV-VEVYAAGCLL-ENITLGDFTAPAAPESGA 355 (355) Q Consensus 310 ~p~r~rve~y~s~Ne~Yv-VEd~~~~a~i-enI~~~~~~~~~~~~~~a 355 (355) . ..+ ...|. .+.+++...- +.+.+....+++++++-- T Consensus 373 ~------~~~---~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~ 411 (415) T protein:vir:46 373 Y------MHF---GECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDL 411 (415) T ss_pred c------ccC---ceEEEEEEEeccEEeccccEEEEEeeccCCCCCCc Confidence 1 111 11222 2222222221 244444433333333322 No 28 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=98.53 E-value=3.2e-08 Score=61.67 Aligned_cols=298 Identities=13% Similarity=0.131 Sum_probs=154.5 Q ss_pred CCHHHHHHHHHHHHHHHHHh--CCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcc-ccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELN--NISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGV-GVTGTIA 77 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~n--gv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~l-gv~~~ia 77 (355) +....+..|..+.......- ++.. ....+.|-..+...+.+.+.+.+.+++.+++++++...|..... ..+++-+ T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~t--~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (415) T protein:vir:47 101 VTSQEVRDFTEYLETRNDIQGGSLKT--DSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAAL 178 (415) T ss_pred hhHHHHHHHHHHHhhhhhhhhccccc--cCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcce Confidence 23333344444443322111 1111 11234445566788999999999999999999998877754322 2233333 Q ss_pred ccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCC Q lcl|Aclame:pro 78 STTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSD 157 (355) Q Consensus 78 ~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td 157 (355) +-+..+ ......+...++...+..++.---+.|+++.|+... .+|+..+.+.+.++++.-.-.--++|.-...+. T Consensus 179 ~~v~Eg--~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~- 253 (415) T protein:vir:47 179 EKVEEL--EENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMARTIAATRNKAIIDVITKGSTG- 253 (415) T ss_pred eecccc--cccccccccceeeEEeeeeeeEeeehhhHHHHhhch--HHHHHHHHHHHHHHHHHHHHHHHhhccccCCcc- Confidence 333222 122222333566777777777777889999997644 488999999999998887777777774321111 Q ss_pred hhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEc Q lcl|Aclame:pro 158 RTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVG 237 (355) Q Consensus 158 ~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG 237 (355) .. ..++.. ........+...| |.+ .+++..+.++.+.. -+++|. T Consensus 254 ------~~--~~~~~~---------------------~~~~~~~~~~~~~---~~i-~~~~~~~~~~~~~~---~~~v~n 297 (415) T protein:vir:47 254 ------ST--SSGFEK---------------------EGKKLEVKKAKSL---DDI-KDAINLNVKPNYEH---NVAIVS 297 (415) T ss_pred ------cc--cccccc---------------------ccceeccccccch---HHH-HHHHHhhhhhccCC---CEEEEc Confidence 00 000000 0001111122234 433 35676666665442 378888 Q ss_pred HHHHHHHHHHHHhhcc-ccc--hhhHHHHHHhhhhhcccccccCCccCCCc-----EEEecCCCcEEEEeeCcEEEEEEE Q lcl|Aclame:pro 238 RKLLADKYFPLVNKQQ-ENS--ESLAADIIISQKRIGNLPAVRVPYFPANA-----VLVTTLENLSIYFMDESHRRSIDE 309 (355) Q Consensus 238 ~dLl~~k~~~l~n~~~-~~t--e~~aa~~~~~~k~iGGlpa~~~PffP~~~-----ilIT~l~NLsIY~Q~gs~RR~~~d 309 (355) ++.+. .+..+-..+ .+- +.... ....+|-|+|++..+++|... +++=.|+++.+.+.+....=...+ T Consensus 298 ~~~~~--~L~~lkd~~G~~i~~~~~~~---~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~ 372 (415) T protein:vir:47 298 QTMFA--KLDKMKDKLGNYLIQPDVKE---KTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD 372 (415) T ss_pred HHHHH--HHHHhhccCCCeeeccCcCC---CCCccccceeeEEeccccccCCCccEEEEEehhccEEEEeecceEEEeec Confidence 88765 233332111 111 00000 123589999999999999654 788888887555554444333222 Q ss_pred ccchhhhhhhhhhhhhhh-ccccccEEEE-ecceecCccCCCCcCCCC Q lcl|Aclame:pro 310 NPKKDRVENYESMNIDYV-VEVYAAGCLL-ENITLGDFTAPAAPESGA 355 (355) Q Consensus 310 ~p~r~rve~y~s~Ne~Yv-VEd~~~~a~i-enI~~~~~~~~~~~~~~a 355 (355) . ..+ ...|. .+.+++...- +.+.+....+++++++-- T Consensus 373 ~------~~~---~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~ 411 (415) T protein:vir:47 373 Y------MHF---GECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDL 411 (415) T ss_pred c------ccC---ceEEEEEEEeccEEeccccEEEEEeeccCCCCCCc Confidence 1 111 11222 2222222221 244444433333333322 No 29 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=98.52 E-value=2.9e-08 Score=61.93 Aligned_cols=302 Identities=14% Similarity=0.140 Sum_probs=153.4 Q ss_pred CCHHHHHHHHHHHHHHHHH--hCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcc-ccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAEL--NNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGV-GVTGTIA 77 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~--ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~l-gv~~~ia 77 (355) +...-+..|..++...... ++... ....+.|-..+...+++.+.+.+.+++++++++++...|..... ..+++-+ T Consensus 101 ~~~~e~~~~~~~~~~~~~~~~~~~~~--~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 178 (415) T protein:vir:94 101 VTSQEVRDFTEYLETRNDIQGGSLKT--DSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAAL 178 (415) T ss_pred hhHHHHHHHHHHhhhhhhhhhhcccc--ccccccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEeecCCccc Confidence 2223333444443332211 11111 12245555567789999999999999999999998777764333 2233333 Q ss_pred ccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCC Q lcl|Aclame:pro 78 STTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSD 157 (355) Q Consensus 78 ~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td 157 (355) .-+..+ ......+...++...+..++.---+.|+.+.|+... .+|+..+.+.+.++++.-.-.--++|.-...+.. T Consensus 179 ~~v~Eg--~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~--~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~ 254 (415) T protein:vir:94 179 EKVEEL--EENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMARTIAATRNKAIIDVITKGSTGS 254 (415) T ss_pred eecccc--ccccccccccceeeEeeheeeeeechhhHHHHhhch--HHHHHHHHHHHHHHHHHHHHHHHhhccccCcccc Confidence 322222 122222333466777777777777889999888543 5899999999999888776666667644222110 Q ss_pred hhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEc Q lcl|Aclame:pro 158 RTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVG 237 (355) Q Consensus 158 ~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG 237 (355) ...++.. ........+..+|..| .+++..+.++.+. .-+++|. T Consensus 255 ---------~~~~~~~---------------------~~~~~~~~~~~~~~~i----~~~~~~~~~~~~~---~~~~vmn 297 (415) T protein:vir:94 255 ---------TSSGFEK---------------------EGKKLEVKKAKSLDDI----KDAINLNVKPNYE---HNVAIVS 297 (415) T ss_pred ---------ccccccc---------------------cccccccccccchHHH----HHHHHhhhhhccC---CCEEEEc Confidence 0111100 0001112223344443 3466666565544 3378888 Q ss_pred HHHHHHHHHHHHhhccccchhhHHHHH-HhhhhhcccccccCCccCCCc-----EEEecCCCcEEEEeeCcEEEEEEEcc Q lcl|Aclame:pro 238 RKLLADKYFPLVNKQQENSESLAADII-ISQKRIGNLPAVRVPYFPANA-----VLVTTLENLSIYFMDESHRRSIDENP 311 (355) Q Consensus 238 ~dLl~~k~~~l~n~~~~~te~~aa~~~-~~~k~iGGlpa~~~PffP~~~-----ilIT~l~NLsIY~Q~gs~RR~~~d~p 311 (355) +.... .+..+--.++.. ....... ....+|-|+|++..|++|... +++-.|+++-+.+.++..+=...+.. T Consensus 298 ~~~~~--~l~~lkd~~G~~-l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~ 374 (415) T protein:vir:94 298 QTMFA--KLDKMKDKLGNY-LIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM 374 (415) T ss_pred HHHHH--HHHHhhccCCCe-eeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEeccc Confidence 87654 233331111110 0000000 123578999999999999776 88889998765555554443332211 Q ss_pred chhhhhhhhhhhhhhhccccccEEEEecceecCccCCCCcCCCC Q lcl|Aclame:pro 312 KKDRVENYESMNIDYVVEVYAAGCLLENITLGDFTAPAAPESGA 355 (355) Q Consensus 312 ~r~rve~y~s~Ne~YvVEd~~~~a~ienI~~~~~~~~~~~~~~a 355 (355) .+.....-..--+..|-+.++ +.+.+..+++.+++-- T Consensus 375 -~~~~~~r~~~r~d~~~~~~~a------~~~~~~~~~~~~~~~~ 411 (415) T protein:vir:94 375 -HFGECLMIAVRQDCRILDYKS------AIVIEYDDSERGEGDL 411 (415) T ss_pred -cCceEEEEEEEeccEEecccc------EEEEEEeccCCCCCcc Confidence 110000000001223333333 3333333333333222 No 30 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=98.52 E-value=4.7e-08 Score=60.77 Aligned_cols=302 Identities=15% Similarity=0.160 Sum_probs=154.8 Q ss_pred CCHHHHHHH-----------HHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhc Q lcl|Aclame:pro 1 MRPETRFKF-----------NAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIG 69 (355) Q Consensus 1 M~~~tr~~f-----------~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~ 69 (355) .+.+.+..+ ..+...+...-++ ....+.|-+.+...+.+.+++.+.+++.++++++.- +.++- T Consensus 111 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~gg~~vP~~~~~~Ii~~l~~~~~i~~~~~~~~~~g--~~~ip 184 (425) T protein:vir:95 111 NRLQVREMLKTGEYYKRSEVVEFYEKFRNLRAV----AGGELTIPEVVVNRIMDIMGDYTTLYPLVDKIRVKG--TTRIL 184 (425) T ss_pred HHHHHHHHHhhhhhhhhhHHHHHHHHHHhhccc----ccCceeccHHHHHHHHHHHHhhhhHHHhhceeecCc--eeEEE Confidence 001111111 0111111111011 123455555578889999999999999999988742 22222 Q ss_pred ccccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcc Q lcl|Aclame:pro 70 VGVTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNG 149 (355) Q Consensus 70 lgv~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG 149 (355) .-.+++-++=+.- ..+....+...++...+..++.---+.|+.+.|+.+.- +|+..+++.+.+.++.-.-.--++| T Consensus 185 ~~~~~~~a~~v~E--~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~--~l~~~i~~~l~~~i~~~~d~~il~G 260 (425) T protein:vir:95 185 VDTDTSPATWIEQ--SGALPTGDVGTIASIDFDGFKVGKVTFVDNYLLQDSII--NLDDYVTKKIARAIAKALDLAIVKG 260 (425) T ss_pred EecCCcccccccc--ccccccccccccceeeeeheeeeeeehhhHHHHhccHH--HHHHHHHHHHHHHHHHHHHHHhhcc Confidence 2222222222221 11222222223455666666666677889998988754 7999999999999988888888898 Q ss_pred cccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCC Q lcl|Aclame:pro 150 TTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDD 229 (355) Q Consensus 150 ~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~ 229 (355) +-.. ..-| +|+|..+-. .. .....+....|.+|..++. ++..-++.. T Consensus 261 ~G~~-----~~~p------~Gil~~~~~------------~~-----~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~ 307 (425) T protein:vir:95 261 TGAA-----NKQP------LGIIPSLPP------------EN-----QVTVEADNNLLKNLVKQIG-----LIDTGDDSV 307 (425) T ss_pred CCCC-----cccc------ceeeccccc------------cc-----ccccccccchHHHHHHHHH-----hhhhhcccc Confidence 5321 1112 255532110 00 0011123345666665433 345556666 Q ss_pred CCeEEEEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEE Q lcl|Aclame:pro 230 PNLVAIVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDE 309 (355) Q Consensus 230 ~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d 309 (355) ..++++|-+.-+-..-+.|-...+.+..-+...-.....++-|+|++..+++|++.+++=.+++. ++..++.+.-..-+ T Consensus 308 ~~~~~v~~~~~~~~~l~~l~~~kd~~g~~i~~~~~~~~~~l~G~pvv~~~~~~~~~i~~Gd~~~~-~~~~~~~~~i~~~~ 386 (425) T protein:vir:95 308 GEIVAVMKRSTYYNRLVEFSIQVDSNGNVVGKLPNLRTPDLLGLRVVFNNFLDDDTVLFGEFEQY-TLVERENITIDSST 386 (425) T ss_pred CceEEEEeChHHHHHHHHHHhhcCCCCceeeccCCCCCccccceeeEEcCcCCCccEEEEecccE-EEEeecceEEEeec Confidence 78888887542211111121111111110000000123467899999999999999999999884 44445544444332 Q ss_pred ccchhhhhhhhhhhhhhhccccccEEEEe--cceecCccCCCCcCCCC Q lcl|Aclame:pro 310 NPKKDRVENYESMNIDYVVEVYAAGCLLE--NITLGDFTAPAAPESGA 355 (355) Q Consensus 310 ~p~r~rve~y~s~Ne~YvVEd~~~~a~ie--nI~~~~~~~~~~~~~~a 355 (355) +. .|..-..+|.++.+--++.++ .+.+.+-..|.. || T Consensus 387 ~~------~f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~~~~~---g~ 425 (425) T protein:vir:95 387 HV------KFTEDQTAFRGKGRFDGKPVKPEAFVLVTITDPVQ---GA 425 (425) T ss_pred cc------ccccCceEEEEEEeeCcEeecccceEEEEecCcCC---CC Confidence 21 123333566665555555553 555544444333 33 No 31 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=98.46 E-value=4.1e-08 Score=61.07 Aligned_cols=300 Identities=12% Similarity=-0.022 Sum_probs=153.4 Q ss_pred CCHHHHHHHHHHHHHHHHH----hCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAEL----NNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTI 76 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~----ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~i 76 (355) .....+..|..++...... ..+........+.|-|...+.+++.+.+.+.++++++++++.--.+.........+- T Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~ 168 (395) T protein:vir:43 89 AESLKEQGVTSSLRGSHRVSMPRSAITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTESNSVEYVRETGFVNN 168 (395) T ss_pred HHHHHHHHHHHHhhhhhhhhhhhhhhcccCCCCccccchhhHHHHHHHHHhhhhHHhhccceecCCCceEEEEEecCCCc Confidence 1122222232222211110 011111123356788889999999999999999999999986544433332211122 Q ss_pred cccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCC Q lcl|Aclame:pro 77 ASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTS 156 (355) Q Consensus 77 a~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~T 156 (355) ++-+ +.+... |..-..++...+.+++.--.+.|+.+.|+.. ++++..+.+.+.++++.-.-.--+||+-. T Consensus 169 a~~v--~E~~~~-~~~~~~~~~i~~~~~k~~~~~~is~ell~d~---~~l~~~v~~~la~a~~~~~d~~~l~G~g~---- 238 (395) T protein:vir:43 169 AAPV--SEGTQK-PYSDLTFELENAPVRTIAHLFKASRQILDDA---SALQSYIDARARYGLMLVEECQLLYGNGT---- 238 (395) T ss_pred eeee--cCCccc-cccccceeEEEEeeeeEEEeehhhHHHHHhH---HHHHHHHHHHHHHHHHHHHHHHHHhccCC---- Confidence 2211 222222 2233357778899999888899999998863 57889999999999888666677788431 Q ss_pred ChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEE Q lcl|Aclame:pro 157 DRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIV 236 (355) Q Consensus 157 d~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVviv 236 (355) .+|. +| +++..... .. .. .+.......+|. +.+++..+ ++.++. .-+++| T Consensus 239 ---~~~~-----~G------------i~~~~~~~----~~-~~-~~~~~~~~~~~~-i~~~~~~~-~~~~~~--~~~~vm 288 (395) T protein:vir:43 239 ---GANL-----HG------------IIPQAQAY----AP-PS-GVVVTAEQRIDR-IRLAILQA-QLAEFP--ASGIVL 288 (395) T ss_pred ---CCcc-----cc------------cccccccc----cc-cc-ccccccchhHHH-HHHHHHhh-ccccCC--CcEEEE Confidence 1221 11 11111000 00 00 112222334443 33445444 444443 337888 Q ss_pred cHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEEccchhhh Q lcl|Aclame:pro 237 GRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPKKDRV 316 (355) Q Consensus 237 G~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~rv 316 (355) .+..... ...+-.....+--..... ....++-|+|++..+++|++.+++-.+++...++.++...=.+.+... T Consensus 289 n~~~~~~-l~~lkd~~G~~i~~~~~~--~~~~~l~G~pVv~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~---- 361 (395) T protein:vir:43 289 NPIDWAL-IELNKDAENRYIIGSPQN--GTTPTLWRLPVVETQAITQDEFLTGAFSLGAQIFDRMDIEVLVSTEND---- 361 (395) T ss_pred cHHHHHH-HHHhhccCCceecccccc--CCCceecceeeEEcCCCCCCcEEEEeccceEEEEEecceEEEEecccc---- Confidence 8876542 222222211111100000 123578899999999999999999999996544443333222222211 Q ss_pred hhhhhhh-hhhhccccccEEEEe--cceecCccCC Q lcl|Aclame:pro 317 ENYESMN-IDYVVEVYAAGCLLE--NITLGDFTAP 348 (355) Q Consensus 317 e~y~s~N-e~YvVEd~~~~a~ie--nI~~~~~~~~ 348 (355) ++..+| -+|.++-+--++... .+...+.++. T Consensus 362 -~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 362 -KDFENNMVTIRAEERLAFAVYRPEAFVTGSLTAS 395 (395) T ss_pred -chhhcCcEEEEEEEeeccEEecccceEEEEeccC Confidence 122233 344444333333332 3333322222 No 32 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=98.46 E-value=5.8e-08 Score=60.26 Aligned_cols=299 Identities=10% Similarity=0.101 Sum_probs=149.8 Q ss_pred CCHHHHHHHHHHHHHH--------------HHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhh Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRV--------------AELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGE 66 (355) Q Consensus 1 M~~~tr~~f~~y~~~~--------------A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge 66 (355) ...+.+..|.+|+... ++..++.. +..-.|.|-......+.+.+++.+.+++.++++++..-... T Consensus 84 ~~~~~~~a~~~~l~~~~~~~~~~e~~~~~~~~a~~~~~-~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 162 (409) T protein:vir:45 84 QDEKRAQVFDKWMRHGASELTSEERKALRELRAQGVAQ-DEKGGYTVPETFLAKVVEKMKSYGGIASVAQILTTSDGRTM 162 (409) T ss_pred hhHHHHHHHHHHHHhhhhhccHHHHHHHHHHhhccCcc-CcCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceE Confidence 2223334455555331 11222221 12235677777778899999999999999999998643221 Q ss_pred hh-cccccccccccccCCCCcCccccccccccCcceeEEeeee-cceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHH Q lcl|Aclame:pro 67 KI-GVGVTGTIASTTDTSGDKERQTADFTALESSKYECNQINF-DFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIM 144 (355) Q Consensus 67 ~v-~lgv~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~-d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~ 144 (355) .+ ..+..+..+.- ++.+... +......+.....-++.-. -+.|+.+.|+.. .++|+..+.+.+.+++++-.-. T Consensus 163 ~~~~~~~~~~~~~~--v~E~~~~-~~~~~~f~~~~l~~~k~~~~~i~is~ell~ds--~~~l~~~i~~~la~a~~~~~~~ 237 (409) T protein:vir:45 163 EWATADGTSEVGVL--LGENEEA-GEEDTDFGMGSLGALKMTSKIIRVSNELLQDS--AIDMEAYLARRIAERIGRGEAR 237 (409) T ss_pred EEEeeccCcccccc--ccccccc-cccccccceeeeeeeeeeeeehhhhHHHHhcc--HHHHHHHHHHHHHHHHHHHHHH Confidence 11 11111122211 1222122 2222223333333333322 245899999885 3699999999999999988877 Q ss_pred HhhcccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccch Q lcl|Aclame:pro 145 AGFNGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDE 224 (355) Q Consensus 145 IGfnG~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~ 224 (355) --+||+-...+. .| .-+++... +. ...+..+ -.+.|.| .+++.. +++ T Consensus 238 a~l~G~G~~~~~----~p------------------~Gil~~~~---~~-----~~~~~~~-~~~~d~i-~~l~~~-l~~ 284 (409) T protein:vir:45 238 YLIQGTGAGTPK----QP------------------KGLAASVT---GT-----TQTAAAN-AVKWQEI-LALKHS-IDP 284 (409) T ss_pred HhhccCCCCCcc----cc------------------ceeeeccc---cc-----ccccccc-ccchHHH-HHHHHh-hhh Confidence 788886533221 12 11221111 00 0011111 1234543 456665 477 Q ss_pred hhhCCCCeEEEEcHHHHHHHHHHHH-hhccccc--hhhHHHHHHhhhhhcccccccCCccCC-----CcEEEecCCCcEE Q lcl|Aclame:pro 225 VYQDDPNLVAIVGRKLLADKYFPLV-NKQQENS--ESLAADIIISQKRIGNLPAVRVPYFPA-----NAVLVTTLENLSI 296 (355) Q Consensus 225 ~~~~~~~LVvivG~dLl~~k~~~l~-n~~~~~t--e~~aa~~~~~~k~iGGlpa~~~PffP~-----~~ilIT~l~NLsI 296 (355) .|+..+..+++|.+..++ ++..+ +....+- ....+ ....++-|+|++...++|. ..|++=.+++.-| T Consensus 285 ~~~~~a~~~~~~n~~~~~--~l~~lkd~~G~~i~~~~~~~---~~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~i 359 (409) T protein:vir:45 285 AYRRGPKFRLAFNDNTLK--LISEMEDGQGRPLWLPDIVG---VAPASVLNVPYVIDQEIDDIGAGKKFMFCGDFDRFII 359 (409) T ss_pred hhccCCeEEEEECHHHHH--HHHHhhcCCCceeeccCcCC---CCCceecceeeEEecCcCCccCCccEEEEeehhhhhe Confidence 788888899999988765 34434 2222211 11111 1234788999999999996 3466667777644 Q ss_pred EEeeCcEEEEEEEccchhhhhhhhhhh-hhhhccccccEEEE--ecceecCccCCCCcCCCC Q lcl|Aclame:pro 297 YFMDESHRRSIDENPKKDRVENYESMN-IDYVVEVYAAGCLL--ENITLGDFTAPAAPESGA 355 (355) Q Consensus 297 Y~Q~gs~RR~~~d~p~r~rve~y~s~N-e~YvVEd~~~~a~i--enI~~~~~~~~~~~~~~a 355 (355) ..+ +...-...+++ |-..| .+|.++.+--+..+ +.+.+... +++.|| T Consensus 360 ~~~-~~~~~~~~~d~-------~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~----k~s~~~ 409 (409) T protein:vir:45 360 RRV-RYMILKRLVER-------YAEYDQTGFLAFHRFDCILEDTSAIKALVG----KGSVGG 409 (409) T ss_pred eec-cceEEEEeecc-------cccCCcEEEEEEEEeccEeechhheEEEEe----ccCCCC Confidence 433 33322222222 21112 23433322222222 23333222 122222 No 33 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=98.45 E-value=9.5e-08 Score=59.07 Aligned_cols=298 Identities=13% Similarity=0.015 Sum_probs=158.1 Q ss_pred CCHHHHHHHHHHHHHH-------------HHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhh Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRV-------------AELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEK 67 (355) Q Consensus 1 M~~~tr~~f~~y~~~~-------------A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~ 67 (355) .+..-...|..++... ....+... ....+.|-+.+.+.+.+.+.+.+.++++++++++.--.+.. T Consensus 104 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 181 (418) T protein:vir:10 104 TESEEMKGMDGSARKSVRVRVDRKSIMNVPATVGSGV--SGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTSSSSIEY 181 (418) T ss_pred hhHHHHHHHHHHHhhhhhhhhHHHHHHHhhhhccCCC--CCCccccchhHHHHHHHHHhhhhhHHhhcceeeccCCceeE Confidence 1122222232222221 11111111 23456788888889999999999999999999987655555 Q ss_pred hcccccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhh Q lcl|Aclame:pro 68 IGVGVTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGF 147 (355) Q Consensus 68 v~lgv~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGf 147 (355) +.....++-++=+ +.+... +.....++...+.+++.---+.|+.+.|+.. ++|+..+++.+.++++.-.-.--| T Consensus 182 ~~~~~~~~~a~~v--~E~~~~-~~~~~~f~~v~~~~~k~~~~~~is~ell~ds---~~l~~~i~~~l~~a~~~~~d~a~l 255 (418) T protein:vir:10 182 TVETGFTNNAAAV--AEGAQK-PTSDLKFNLKNQPVRTIAHLFKASRQILDDA---PALQSYIDGRARYGLQLTEEGQIL 255 (418) T ss_pred EEEecCCCceeee--ccCccc-cccccceeeEEEeeeeEEEeehhhHHHHHhH---HHHHHHHHHHHHHHHHHHHHHHHh Confidence 5543333333222 222222 2333457778888888888888999999864 589999999999999888888888 Q ss_pred cccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhh Q lcl|Aclame:pro 148 NGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQ 227 (355) Q Consensus 148 nG~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~ 227 (355) ||+-.. .+|. |.+... +... . .....+..++|.++ +++..+ .+.++ T Consensus 256 ~G~g~~------~~p~------Gi~~~~----------------~~~~---~-~~~~~~~~~~~~i~-~~~~~~-~~~~~ 301 (418) T protein:vir:10 256 KGDGTG------ANIL------GILPQA----------------SAFM---P-SITLANATPIDKIR-LALLQA-VLAEF 301 (418) T ss_pred ccCCCC------cccc------cccccc----------------cccc---c-cccccccccHHHHH-HHHHhh-ccccC Confidence 984311 1232 443321 0000 0 11122233455433 345444 34334 Q ss_pred CCCCeEEEEcHHHHHHHHHHHHhhccccchhhHHHHH-HhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEE Q lcl|Aclame:pro 228 DDPNLVAIVGRKLLADKYFPLVNKQQENSESLAADII-ISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRS 306 (355) Q Consensus 228 ~~~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~-~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~ 306 (355) .. -+++|.+..... -..+-.....+ +-.+.. ....+|-|+|++..+++|++.+++-.+++....+.++...=. T Consensus 302 ~~--~~~v~n~~~~~~-L~~lkd~~G~~---i~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~s~~~~~~~~~~~~i~ 375 (418) T protein:vir:10 302 PA--TGIVLNPIDWAS-IELTKDSQGRY---IVGNPVNGTTPRLWNLPVVETQAMTANEFLVGAFSMAAQIFDRMEIEVL 375 (418) T ss_pred CC--CEEEEcHHHHHH-HHHhhcCCCce---eccccccCCCceecceeeEEcCCCCCCcEEEeeccceEEEEEecceEEE Confidence 32 268888886542 22222221111 111110 124589999999999999999999999874333333333222 Q ss_pred EEEccchhhhhhhhhhh-hhhhccccccEEEE--ecceecCccCCCCcCCC Q lcl|Aclame:pro 307 IDENPKKDRVENYESMN-IDYVVEVYAAGCLL--ENITLGDFTAPAAPESG 354 (355) Q Consensus 307 ~~d~p~r~rve~y~s~N-e~YvVEd~~~~a~i--enI~~~~~~~~~~~~~~ 354 (355) +..+. .++...| ..|.++.+--++.. +.+...+..+++ +| T Consensus 376 ~~~~~-----~~~f~~~~~~~r~~~~~d~~~~~~~a~~~~~~~~~~---~g 418 (418) T protein:vir:10 376 LSTEN-----VDDFEKNMVSIRAEERLALAVYRPESFVTGALVEQA---GG 418 (418) T ss_pred Eeccc-----chhhhcCceEEEEEEeeccEEecccceEEEEeccCC---CC Confidence 21111 1122233 34434333333333 244443333222 22 No 34 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=98.44 E-value=7e-09 Score=65.28 Aligned_cols=300 Identities=12% Similarity=0.071 Sum_probs=162.7 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt 80 (355) |.-++. +...+... ......|-|.+.+.+.+.+++.|.+++.++++++..-... +-.-.+++-++.+ T Consensus 1 m~~~~~-----------~a~~~~~t-~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~-~p~~~~~~~a~~v 67 (330) T protein:vir:77 1 MAGSTV-----------PSTQVALT-GDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGPTGIS-IPHWTGAVSASWT 67 (330) T ss_pred Cccccc-----------chhhcccc-CCCcceechhHHHHHHHHHHhccchhhhcceeeccCCceE-EEEEcCCcceeEe Confidence 332221 11111111 1234567788999999999999999999999887643322 2222344444443 Q ss_pred cCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhh Q lcl|Aclame:pro 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) Q Consensus 81 ~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~a 160 (355) . .+... +.....++...+.+++.--...|+.+.|+. ..++|+..+.+.+.++++.-.-.--|||+-.. T Consensus 68 ~--Eg~~~-~~~~~~f~~i~~~~~k~~~~~~is~ell~d--s~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~------- 135 (330) T protein:vir:77 68 G--EAERK-PITKGSFGKQELEPVKITTIFAESAEVVRL--NPLNYLNTMRTKIAEAIALKFDAAAIHGIDKP------- 135 (330) T ss_pred c--CCCcc-ccccceeeEEEEeEEEEEEeehhhHHHHhc--chHHHHHHHHHHHHHHHHHHHHHHhhcccCCC------- Confidence 3 22233 333344677889999999889999998875 34689999999999999998888889995421 Q ss_pred hhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHH Q lcl|Aclame:pro 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKL 240 (355) Q Consensus 161 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dL 240 (355) +|. .|++..... ....... . ...+.+..-..+|.|+ +++..+ ...++ ..-+++|.+.. T Consensus 136 ~~~-----~g~~~~~~~---~~~~~~~---------~-~~~~~~~~~~~~~~l~-~~~~~~-~~~~~--~~~~~vmn~~~ 193 (330) T protein:vir:77 136 SAF-----KGYLAETTK---VVSLADT---------N-LTTASGPQGNAYLAVN-NALSLL-VNSGK--KWTGTLLDNVT 193 (330) T ss_pred Ccc-----ccccccccc---cceeecc---------c-ccccccccchhHHHHH-HHHHhh-hhcCC--CccEEEEcHHH Confidence 111 344443211 1111000 0 0111222222233322 344333 33233 34478999888 Q ss_pred HHHHHHHHHhhccccc--hhhHH-H-HHHhhhhhcccccccCCccCCCc------EEEecCCCcEEEEeeCcEEEEEEEc Q lcl|Aclame:pro 241 LADKYFPLVNKQQENS--ESLAA-D-IIISQKRIGNLPAVRVPYFPANA------VLVTTLENLSIYFMDESHRRSIDEN 310 (355) Q Consensus 241 l~~k~~~l~n~~~~~t--e~~aa-~-~~~~~k~iGGlpa~~~PffP~~~------ilIT~l~NLsIY~Q~gs~RR~~~d~ 310 (355) +.. ...+-.....+- +.... . ......++-|+|++..+++|++. +++..+++.-|-.+ +.+.=.+.++ T Consensus 194 ~~~-l~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~PV~~~~~~p~~~~~~~~~~~~gd~s~~~i~~~-~~~~i~~~~e 271 (330) T protein:vir:77 194 EPI-LNTAVDGNGRPLFVESTYTEQVGAIREGRILGRPTYVADNVVNGTVGNRVVGVMGDFSQVIWGQI-GGLSFDVTDQ 271 (330) T ss_pred HHH-HHHHhccCCceeecCccccccccccCCceecceeeEEeccccCCCCCCccEEEEEecceEEEEEe-cCcEEEEeec Confidence 762 222322221111 11100 0 11234578899999999999875 88899998744333 3333333222 Q ss_pred cc--------------------hhhhhhhhhhhhhhhccccccEEEEecceecCccCCC Q lcl|Aclame:pro 311 PK--------------------KDRVENYESMNIDYVVEVYAAGCLLENITLGDFTAPA 349 (355) Q Consensus 311 p~--------------------r~rve~y~s~Ne~YvVEd~~~~a~ienI~~~~~~~~~ 349 (355) .. +|++.-.-..-.++.|-+.++++.|....-+..|+.. T Consensus 272 ~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~~~~~~~~~ 330 (330) T protein:vir:77 272 ATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDKDAFVKLTDQVAGTDPEEE 330 (330) T ss_pred ceeeecccccccccccccchhhcCcEEEEEEEEeccEEecccceEEEEeccCCcCCCCC Confidence 21 1111111111236778888888777655543333222 No 35 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=98.31 E-value=1.9e-07 Score=57.49 Aligned_cols=300 Identities=12% Similarity=0.063 Sum_probs=164.9 Q ss_pred CCHH-----HHHHHHHHHHHHHHHhCCChH-HcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccc Q lcl|Aclame:pro 1 MRPE-----TRFKFNAYLTRVAELNNISTD-DVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTG 74 (355) Q Consensus 1 M~~~-----tr~~f~~y~~~~A~~ngv~~~-~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~ 74 (355) |++. ..++|..++.+.+..+-.... .......|-+.+...+.+.+.+.|.++++++++++.-.... +-.-.++ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~-~p~~~~~ 79 (324) T protein:vir:10 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKK-FTFWADK 79 (324) T ss_pred CCCchHHHHHHHHHHHHhhccceecccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceE-EEEEeCC Confidence 6644 344455554444433221111 01234567778889999999999999999999987743222 2222234 Q ss_pred cccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhccccccc Q lcl|Aclame:pro 75 TIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRAD 154 (355) Q Consensus 75 ~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~ 154 (355) +.+.-+. .+... |.....++...+.+++.---..|+++.|+... .+|+..+.+.+.++++.-.-.-.++|.-.. T Consensus 80 ~~a~~v~--Eg~~~-~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~ai~~~~d~a~l~G~g~~- 153 (324) T protein:vir:10 80 PGAYWVG--EGQKI-ETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNN- 153 (324) T ss_pred cceeEec--cCccc-cccccceeEEEEeeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhhcCCCC- Confidence 4444332 22222 33445677888899998888999999998664 689999999999988876666677774211 Q ss_pred CCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEE Q lcl|Aclame:pro 155 TSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVA 234 (355) Q Consensus 155 ~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVv 234 (355) .. |. | ++.... . +. ....+.-.|..|-. ++..+ ++.++... ++ T Consensus 154 -~~----~~------~------------i~~~~~-~-~~-----~~~~~~~t~~~i~~----~~~~l-~~~~~~~~--~~ 196 (324) T protein:vir:10 154 -PF----GK------S------------IAQSIE-K-TN-----KVIKGDFTQDNIID----LEALL-EDDELEAN--AF 196 (324) T ss_pred -cc----Cc------c------------cccccc-c-cc-----eeccccCCHHHHHH----HHHhh-hhccCCCC--EE Confidence 11 11 1 111000 0 00 00112233554444 44433 34344332 67 Q ss_pred EEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCC--cEEEecCCCcEEEEeeCcEEEEEEEccc Q lcl|Aclame:pro 235 IVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPAN--AVLVTTLENLSIYFMDESHRRSIDENPK 312 (355) Q Consensus 235 ivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~--~ilIT~l~NLsIY~Q~gs~RR~~~d~p~ 312 (355) +|.+..+.. -..+-.....+ .... ....++-|+|++..|..+.+ .+++..++++ +|..++..+=.+.++.. T Consensus 197 v~n~~~~~~-L~~l~d~~g~~--~~~~---~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~-~~~~~~~~~i~~~~~~~ 269 (324) T protein:vir:10 197 ISKTQNRSL-LRKIVDPETKE--RIYD---RNSDTLDGLPVVNLKSSNLKRGELITGDFDKL-IYGIPQLIEYKIDETAQ 269 (324) T ss_pred EEcHHHHHH-HHHhhccCCce--eecC---CCCccccceeEEeecCCCCCcceEEEEecccE-EEEEecCcEEEEeeccc Confidence 888887662 12222222111 1110 11357899999999986655 5889999997 45455555554444432 Q ss_pred h--------hhhhhhhhhh--------hhhhccccccEEEEecceecCccCCCCc Q lcl|Aclame:pro 313 K--------DRVENYESMN--------IDYVVEVYAAGCLLENITLGDFTAPAAP 351 (355) Q Consensus 313 r--------~rve~y~s~N--------e~YvVEd~~~~a~ienI~~~~~~~~~~~ 351 (355) - ..+.-|++-. -++.|-+.++++.+.+.+.+....|++- T Consensus 270 ~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:10 270 LSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSVPGEV 324 (324) T ss_pred ccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCCCCCCCC Confidence 1 1121222222 3456666677777665555444333333 No 36 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=98.29 E-value=1.7e-07 Score=57.69 Aligned_cols=280 Identities=10% Similarity=0.056 Sum_probs=160.0 Q ss_pred HHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccccCCCCcCcccccccc Q lcl|Aclame:pro 16 VAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTTDTSGDKERQTADFTA 95 (355) Q Consensus 16 ~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt~T~~~~~r~~~~~~~ 95 (355) +| .+..+.|-|...+.+++.++++|.++++..++++.--. .++-.-.+++-|+-+. .+.+. |..-.. T Consensus 1 ma---------~~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~-~~ip~~~~~~~a~~v~--E~~~~-~~~~~~ 67 (298) T protein:vir:16 1 MV---------LNKGTLFDPTLVTDLISKVAGKSSIARLSAQKPIPFNG-EKVFTFTMDSEIDVVA--ESGKK-THGGVT 67 (298) T ss_pred Cc---------ccCcceechhHHHHHHHHHHhhhhhhhhcceeeccCCc-eEEEEEecCcceEEec--CCccc-cccccc Confidence 22 23356788899999999999999999999998876321 2343434455554443 22233 333345 Q ss_pred ccCcceeEEeeeecceeCHHHHHh-hcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhhhhhhhccchhHHHH Q lcl|Aclame:pro 96 LESSKYECNQINFDFHLKYKTLDL-WARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTKNTLLQDVAVGWLQK 174 (355) Q Consensus 96 l~~~~Y~c~qTn~d~~i~y~~LD~-WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~anPllqDVNkGWlq~ 174 (355) ++...+..++.---..|+.+.|-+ +-...+|++.+.+.+.++++.-...-.+||+-...-+. T Consensus 68 f~~v~l~~~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~----------------- 130 (298) T protein:vir:16 68 LAPQTMVPIKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTA----------------- 130 (298) T ss_pred eeEEEEeeeeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcc----------------- Confidence 777888888888889999998843 44567899999999999988888888889843221111 Q ss_pred HHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHHHHHHHHHHH-hhcc Q lcl|Aclame:pro 175 YRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKLLADKYFPLV-NKQQ 253 (355) Q Consensus 175 ~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dLl~~k~~~l~-n~~~ 253 (355) ..+.... ............+..+ .++++.+.+++..+ ...+++ +. +++|.+.... .+..+ .... T Consensus 131 ------~~~~~~~--~~~~~~~~~~~~~~~~--~~~~~~i~~~~~~~-~~~~~~-~~-~~vmn~~~~~--~l~~lkd~~G 195 (298) T protein:vir:16 131 ------SAVIGTN--HFDSKVTQKVEAPRGI--ADPNGAIENAVELL-TGVDAD-VT-GIAINPSFRS--ALAKQKDLQD 195 (298) T ss_pred ------ccccccc--cccccccccccccccc--ccHHHHHHHHHHHh-hhcCCC-cc-EEEEcHHHHH--HHHHhhccCC Confidence 1110000 0000000011111111 22333334444332 333332 22 5888877665 23322 2221 Q ss_pred ccchhhHHHHHHhhhhhcccccccCCccCCC------cEEEecCCCcEEEEeeCcEEEEEEEccchhh-hhhhhhhh--- Q lcl|Aclame:pro 254 ENSESLAADIIISQKRIGNLPAVRVPYFPAN------AVLVTTLENLSIYFMDESHRRSIDENPKKDR-VENYESMN--- 323 (355) Q Consensus 254 ~~te~~aa~~~~~~k~iGGlpa~~~PffP~~------~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~r-ve~y~s~N--- 323 (355) .+-= .....-....+|-|+|++..+++|+. .+++-.+++.-.|..++..+-.+.+.-+-+. -.+|...| T Consensus 196 ~~i~-~~~~~~~~~~~l~G~PV~~~~~v~~~~~~~~~~~~~GDfs~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~ 274 (298) T protein:vir:16 196 NALF-PELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVY 274 (298) T ss_pred Ceee-cCcccCCCCceecceeeEEecccccccCCCccEEEEeeccceEEEEEecCceEEEeeccCCcCcchhhhhcCcEE Confidence 1110 00001112358999999999999975 4677889998778777776666654322222 22333333 Q ss_pred ------hhhhccccccEEEEecce Q lcl|Aclame:pro 324 ------IDYVVEVYAAGCLLENIT 341 (355) Q Consensus 324 ------e~YvVEd~~~~a~ienI~ 341 (355) -++.|-+..++|.+++++ T Consensus 275 ~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 275 IRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred EEEEEEEccEeecccceEEEeecC Confidence 345667777777777666 No 37 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=98.25 E-value=8.5e-08 Score=59.34 Aligned_cols=297 Identities=13% Similarity=0.089 Sum_probs=162.6 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt 80 (355) |...+. |+.=...+++... ......|.|.+.+.+++.+.+.|.++++++++++.-... ++-.-.+++-++-+ T Consensus 1 ~~~~~~--~~~~~~~~~~t~~-----~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~-~~p~~~~~~~a~~v 72 (320) T protein:vir:10 1 MAAGTA--FQVDHAQIAQTGD-----TMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQ-KIPHWIGDVSAQWI 72 (320) T ss_pred CCCCcc--CCHHHHHhhcccc-----ccccccccHHHHHHHHHHHHhccchhhhcceeeccCCce-EEEEEeCCcceEEe Confidence 332222 1111111221111 112335889999999999999999999999998763222 22222233444333 Q ss_pred cCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhh Q lcl|Aclame:pro 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) Q Consensus 81 ~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~a 160 (355) . ...+. |..-...+...+.+++.---..|+.+.|+.= .++++..+.+.+.++++...-.--++|+-....+ T Consensus 73 ~--E~~~~-~~~~~~f~~v~~~~~k~~~~~~is~ell~ds--~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~---- 143 (320) T protein:vir:10 73 G--EGDMK-PITKGNMTSQNIAPHKIATIFVASAETVRAN--PANYLGTMRTKVATAFAMAFDSAALNGTDSPFPT---- 143 (320) T ss_pred c--CCccc-cccccceeEEEEeeEEEEEeehhhHHHHhcC--hHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCc---- Confidence 3 22223 3344557788899999999999999999842 3689999999999999988888889996421111 Q ss_pred hhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHH Q lcl|Aclame:pro 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKL 240 (355) Q Consensus 161 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dL 240 (355) .+....+... ..........+-..+|.+..++.. +++..+++ ..+++|.+.. T Consensus 144 ------------------~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~--~~~~v~n~~~ 195 (320) T protein:vir:10 144 ------------------YLAQTTKSVS-------LADPGGATASDLTAYDAVAVNGLS-LLVNAKKK--WTHTLLDDIV 195 (320) T ss_pred ------------------cccccccccc-------ceecccccccccccHHHHHHHHHh-hhhcccCC--CcEEEEcHHH Confidence 0111111110 000111122333445655666554 44554443 5588999887 Q ss_pred HHHHHHHHH-hhccccch----hhHHHHHHhhhhhcccccccCCccCCCc--EEEecCCCcEEEEeeCcEEEEEEEccch Q lcl|Aclame:pro 241 LADKYFPLV-NKQQENSE----SLAADIIISQKRIGNLPAVRVPYFPANA--VLVTTLENLSIYFMDESHRRSIDENPKK 313 (355) Q Consensus 241 l~~k~~~l~-n~~~~~te----~~aa~~~~~~k~iGGlpa~~~PffP~~~--ilIT~l~NLsIY~Q~gs~RR~~~d~p~r 313 (355) ... ++.+ .....+.- ..-........++-|+|.+..+++|++. +++..++++ ++..++..+=.+.++... T Consensus 196 ~~~--L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~~~~~~~~~gd~~~~-~~~~~~~~~i~~~~~~~~ 272 (320) T protein:vir:10 196 EPI--LNGAKDKNGRPLFIESTYTDENSPFRAGRIVSRPTILSDHVADGTTVGYMGDFRNV-IWGQVGGLSFDVTDQATL 272 (320) T ss_pred HHH--HHHhhccCCceeeccccccCccccccCceeeeeeeEecCCCCCCceEEEEeecceE-EEEEecCeEEEEeeccee Confidence 652 3322 21111100 0000111223578999999999999997 456788887 455555555444333321 Q ss_pred h-------hhhhhhhhh---------hhhhccccccEEEEecceecCccCCCC Q lcl|Aclame:pro 314 D-------RVENYESMN---------IDYVVEVYAAGCLLENITLGDFTAPAA 350 (355) Q Consensus 314 ~-------rve~y~s~N---------e~YvVEd~~~~a~ienI~~~~~~~~~~ 350 (355) . .--+...+| -++.|.+.++++.|.++.- |.+ T Consensus 273 ~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~~~~a-----p~~ 320 (320) T protein:vir:10 273 NLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDAFVKLTNVVT-----PDA 320 (320) T ss_pred eeccccccccchhhhcCcEEEEEEEeeccEEecccceEEEEeccC-----CCC Confidence 1 111112222 3556667777777655442 222 No 38 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=98.24 E-value=8.6e-08 Score=59.32 Aligned_cols=282 Identities=15% Similarity=0.100 Sum_probs=158.6 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt 80 (355) |--.+. ..-++... -...+.|-++..+.+.+.+.+.+.+++.++++++.--.. ++-.-.+++.+.-+ T Consensus 1 ma~~~~-----------~~~~~~~t-~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-~ip~~~~~~~a~~v 67 (304) T protein:vir:94 1 MATPTY-----------TPGNVILS-DFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKK-KFTYLAKGVGAYWV 67 (304) T ss_pred Cccccc-----------cccccccc-CCCceecchhHHHHHHHHHHhccchhhhcceeeccCCce-EEEEEeCCcceEEe Confidence 332222 12222221 123577888888999999999999999999998764222 22222234444433 Q ss_pred cCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhh Q lcl|Aclame:pro 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) Q Consensus 81 ~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~a 160 (355) .- ... .|.....++...+..++.---+.|+.+.|..= ..+|+..+.+.+.++++.-.-.-.+||.-....+.... T Consensus 68 ~E--~~~-~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~ 142 (304) T protein:vir:94 68 SE--TER-IQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWT--AKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSG 142 (304) T ss_pred ec--Ccc-cccccceeeEEEEEEEEEEEeehhhHHHHhcc--hHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccc Confidence 22 222 23333456777788888777888888887733 36899999999999999999999999954322221111 Q ss_pred hhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHH Q lcl|Aclame:pro 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKL 240 (355) Q Consensus 161 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dL 240 (355) +..+..+ +... ....++.-.|.+|-.|+ ..+ .+.++... +++|.+.. T Consensus 143 ~~~~~~~------------------------~~~~--~~~~~~~~~~~~i~~~~----~~l-~~~~~~~~--~~v~~~~~ 189 (304) T protein:vir:94 143 KPLVEGA------------------------EEKG--NVVTDTNNLYVDLSALM----ATI-EDEELDPN--GVLTTRSF 189 (304) T ss_pred ccccccc------------------------cccc--cccccccchHHHHHHHH----HHh-hhccCCcC--EEEEcHHH Confidence 1111100 0000 00011222366655544 333 33333322 78899887 Q ss_pred HHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCc----EEEecCCCcEEEEeeCcEEEEEEEccch--- Q lcl|Aclame:pro 241 LADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANA----VLVTTLENLSIYFMDESHRRSIDENPKK--- 313 (355) Q Consensus 241 l~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~----ilIT~l~NLsIY~Q~gs~RR~~~d~p~r--- 313 (355) +.. -..+-.....|- ......++-|+|++..+++|... +++..++++ ++..++..+-.+.+++.- T Consensus 190 ~~~-L~~lkd~~G~~l------~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~gd~~~~-~~~~~~~~~i~~~~e~~~~~~ 261 (304) T protein:vir:94 190 RSK-MRNALDANDRPL------FDANGNEIMGLPLSYTGADVYDKKKSLALMGDWDYA-RYGILQGIEYAISEDATLTTL 261 (304) T ss_pred HHH-HHHhhccCCcEe------ecCCCccccceeeEEecccccCCCCcEEEEEehhhE-EEEEecceEEEEeecceeeee Confidence 763 223322222211 01123578899999999999665 889999997 566666665555444321 Q ss_pred ------hhhhhhhhhh---------hhhhccccccEEEEecce Q lcl|Aclame:pro 314 ------DRVENYESMN---------IDYVVEVYAAGCLLENIT 341 (355) Q Consensus 314 ------~rve~y~s~N---------e~YvVEd~~~~a~ienI~ 341 (355) -...++...| .++.|.+.++++.+...+ T Consensus 262 ~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 262 QASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred cccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 1122334444 344555666666655444 No 39 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=98.24 E-value=8.6e-08 Score=59.32 Aligned_cols=282 Identities=15% Similarity=0.100 Sum_probs=158.6 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt 80 (355) |--.+. ..-++... -...+.|-++..+.+.+.+.+.+.+++.++++++.--.. ++-.-.+++.+.-+ T Consensus 1 ma~~~~-----------~~~~~~~t-~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-~ip~~~~~~~a~~v 67 (304) T protein:vir:10 1 MATPTY-----------TPGNVILS-DFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKK-KFTYLAKGVGAYWV 67 (304) T ss_pred Cccccc-----------cccccccc-CCCceecchhHHHHHHHHHHhccchhhhcceeeccCCce-EEEEEeCCcceEEe Confidence 332222 12222221 123577888888999999999999999999998764222 22222234444433 Q ss_pred cCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhh Q lcl|Aclame:pro 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) Q Consensus 81 ~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~a 160 (355) .- ... .|.....++...+..++.---+.|+.+.|..= ..+|+..+.+.+.++++.-.-.-.+||.-....+.... T Consensus 68 ~E--~~~-~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~ 142 (304) T protein:vir:10 68 SE--TER-IQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWT--AKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSG 142 (304) T ss_pred ec--Ccc-cccccceeeEEEEEEEEEEEeehhhHHHHhcc--hHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccc Confidence 22 222 23333456777788888777888888887733 36899999999999999999999999954322221111 Q ss_pred hhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHH Q lcl|Aclame:pro 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKL 240 (355) Q Consensus 161 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dL 240 (355) +..+..+ +... ....++.-.|.+|-.|+ ..+ .+.++... +++|.+.. T Consensus 143 ~~~~~~~------------------------~~~~--~~~~~~~~~~~~i~~~~----~~l-~~~~~~~~--~~v~~~~~ 189 (304) T protein:vir:10 143 KPLVEGA------------------------EEKG--NVVTDTNNLYVDLSALM----ATI-EDEELDPN--GVLTTRSF 189 (304) T ss_pred ccccccc------------------------cccc--cccccccchHHHHHHHH----HHh-hhccCCcC--EEEEcHHH Confidence 1111100 0000 00011222366655544 333 33333322 78899887 Q ss_pred HHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCc----EEEecCCCcEEEEeeCcEEEEEEEccch--- Q lcl|Aclame:pro 241 LADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANA----VLVTTLENLSIYFMDESHRRSIDENPKK--- 313 (355) Q Consensus 241 l~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~----ilIT~l~NLsIY~Q~gs~RR~~~d~p~r--- 313 (355) +.. -..+-.....|- ......++-|+|++..+++|... +++..++++ ++..++..+-.+.+++.- T Consensus 190 ~~~-L~~lkd~~G~~l------~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~gd~~~~-~~~~~~~~~i~~~~e~~~~~~ 261 (304) T protein:vir:10 190 RSK-MRNALDANDRPL------FDANGNEIMGLPLSYTGADVYDKKKSLALMGDWDYA-RYGILQGIEYAISEDATLTTL 261 (304) T ss_pred HHH-HHHhhccCCcEe------ecCCCccccceeeEEecccccCCCCcEEEEEehhhE-EEEEecceEEEEeecceeeee Confidence 763 223322222211 01123578899999999999665 889999997 566666665555444321 Q ss_pred ------hhhhhhhhhh---------hhhhccccccEEEEecce Q lcl|Aclame:pro 314 ------DRVENYESMN---------IDYVVEVYAAGCLLENIT 341 (355) Q Consensus 314 ------~rve~y~s~N---------e~YvVEd~~~~a~ienI~ 341 (355) -...++...| .++.|.+.++++.+...+ T Consensus 262 ~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 262 QASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred cccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 1122334444 344555666666655444 No 40 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=98.24 E-value=9e-07 Score=53.72 Aligned_cols=300 Identities=12% Similarity=0.055 Sum_probs=167.7 Q ss_pred CCH-----HHHHHHHHHHHHHHHHhCCChHH-cceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccc Q lcl|Aclame:pro 1 MRP-----ETRFKFNAYLTRVAELNNISTDD-VSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTG 74 (355) Q Consensus 1 M~~-----~tr~~f~~y~~~~A~~ngv~~~~-v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~ 74 (355) |++ ...++|..++.+.+..+-..... ......|-+.+...+.+.+++.|.++++++++++.-... ++-.-.++ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~-~~p~~~~~ 79 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEK-KFTFWADK 79 (324) T ss_pred CCcchhhhHHHHHHHHhhhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCce-EEEEEecC Confidence 553 23444555555444433222111 123456777888999999999999999999998763221 12221233 Q ss_pred cccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhccccccc Q lcl|Aclame:pro 75 TIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRAD 154 (355) Q Consensus 75 ~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~ 154 (355) +-+.-+. .+... |..-..++...+..++.--...|+.+.|++.. ++|...+.+.+.++++.-.-.--|+|.-. T Consensus 80 ~~a~~v~--Eg~~~-~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~--~~l~~~i~~~l~~aia~~~d~~~l~G~g~-- 152 (324) T protein:vir:96 80 PGAYWVG--EGQKI-ETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGN-- 152 (324) T ss_pred cceeeec--CCccc-cccccceeEEEEEeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhhcCCC-- Confidence 3343332 22222 33345677888999999999999999999764 68999999999999888777777888431 Q ss_pred CCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEE Q lcl|Aclame:pro 155 TSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVA 234 (355) Q Consensus 155 ~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVv 234 (355) .. .|. -++.... .. .......-.|.+|-. ++..+ ++.+++. + ++ T Consensus 153 ~~----~~~------------------~~~~~~~----~~---~~~~~~~~~~~~i~~----~~~~i-~~~~~~~-~-~~ 196 (324) T protein:vir:96 153 NP----FGK------------------SIAQSIK----KT---NKVIKGDFTQDNIID----LEALL-EDDELEA-N-AF 196 (324) T ss_pred CC----cCc------------------ccccccc----cc---ceecccccchHHHHH----HHHhh-hhccCCC-C-EE Confidence 11 111 1111000 00 011112334555544 44333 4444432 2 67 Q ss_pred EEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCC--cEEEecCCCcEEEEeeCcEEEEEEEccc Q lcl|Aclame:pro 235 IVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPAN--AVLVTTLENLSIYFMDESHRRSIDENPK 312 (355) Q Consensus 235 ivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~--~ilIT~l~NLsIY~Q~gs~RR~~~d~p~ 312 (355) +|.+..+.. +..+--.++.. .... ....++-|+|++..|..+.+ .+++-.++++ +|...+..+=.+.++.. T Consensus 197 i~n~~~~~~--L~~lkd~~G~~-~~~~---~~~~~l~G~PV~~~~~~~~~~~~~~~gd~s~~-~~~~~~~~~i~~~~~~~ 269 (324) T protein:vir:96 197 ISKTQNRSL--LRKIVDPETKE-RIYD---RNSDSLDGLPVVNLKSSNLKRGELITGDFDKL-IYGIPQLIEYKIDETAQ 269 (324) T ss_pred EEcHHHHHH--HHHhhCCCCCe-eecC---CCCCcccceeeEeecCCCCCcceEEEEecceE-EEEEecCcEEEEeeccc Confidence 888877652 32221111111 1111 12457999999987775544 5888899986 45455555555544432 Q ss_pred hh-------hhhhhhhhhh---------hhhccccccEEEEecceecCccCCCCc Q lcl|Aclame:pro 313 KD-------RVENYESMNI---------DYVVEVYAAGCLLENITLGDFTAPAAP 351 (355) Q Consensus 313 r~-------rve~y~s~Ne---------~YvVEd~~~~a~ienI~~~~~~~~~~~ 351 (355) .. ..-++..+|. ++.|-+.++++.+...+.+....|.+. T Consensus 270 ~~~~~~~~~~~~~~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:96 270 LSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) T ss_pred ccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccccCCCCCCCC Confidence 21 1123334443 666777777777665555544444333 No 41 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=98.23 E-value=4.9e-07 Score=55.16 Aligned_cols=295 Identities=11% Similarity=0.059 Sum_probs=155.2 Q ss_pred CCHHHHHHHHHHHHHHHHH--------hCCC------hHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhh Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAEL--------NNIS------TDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGE 66 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~--------ngv~------~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge 66 (355) +....+.....|...+.+. +.+. .......+.|-+.+...+.+.+.+.+.+++.+++++++...|. T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~ 161 (408) T protein:vir:74 82 LNKSENELKDKFVKDFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSSGS 161 (408) T ss_pred ccchhhhhHHHHHHHHHHHHhcchhhhhhhhhhhhcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeeccCCcce Confidence 2222222222322222211 1110 0112235778778888999999999999999999999887776 Q ss_pred hhccc--ccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHH Q lcl|Aclame:pro 67 KIGVG--VTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIM 144 (355) Q Consensus 67 ~v~lg--v~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~ 144 (355) ....- -.++.+..+.-+ ......+...++...+.+++.---+.|+.+.|+. ...+|+..+.+.+.+.++.=.-. T Consensus 162 ~~~~~~~~~~~~~~~v~E~--~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~l~~~~~~~~d~ 237 (408) T protein:vir:74 162 RVYEKWTDVTPLKAMDEED--GKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKD--TAENILAWLSSWIAKKVVVTRNQ 237 (408) T ss_pred EEEEeecCCcccccccccc--cccccccccceeeEEeeeeeEEeeehhHHHHHhh--chHHHHHHHHHHHHHHHHHHHHH Confidence 44332 223333333222 1222223344667777777777778899999875 23478888888888888765555 Q ss_pred HhhcccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccch Q lcl|Aclame:pro 145 AGFNGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDE 224 (355) Q Consensus 145 IGfnG~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~ 224 (355) --++|+... +. .|.. .+.|.++. ++...+++ T Consensus 238 ~il~G~G~~---------------------------------------~~------~~~~---~~~~~i~~-~~~~~l~~ 268 (408) T protein:vir:74 238 AIIAAMGTV---------------------------------------PK------KPTI---ANFDDVIT-MINTSVDP 268 (408) T ss_pred HHhhccccc---------------------------------------cc------cccc---ccHHHHHH-HHHHhhhh Confidence 555663211 00 0111 13454433 45456688 Q ss_pred hhhCCCCeEEEEcHHHHHHHHHHHHhhccccchhhHHHHH-HhhhhhcccccccCC--ccCCCc-----EEEecCCCcEE Q lcl|Aclame:pro 225 VYQDDPNLVAIVGRKLLADKYFPLVNKQQENSESLAADII-ISQKRIGNLPAVRVP--YFPANA-----VLVTTLENLSI 296 (355) Q Consensus 225 ~~~~~~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~-~~~k~iGGlpa~~~P--ffP~~~-----ilIT~l~NLsI 296 (355) .|+. .-+++|.+..+. .+..+-..++ ......++. ....+|-|+|++..+ ++|..+ +++=.++..-. T Consensus 269 ~~~~--~a~~v~n~~~~~--~l~~lkd~~G-~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~i~~gd~~~~~~ 343 (408) T protein:vir:74 269 AIIA--TSSLLTNQSGLN--KLALVKTAEG-KYLLEPDPTKPNSYLIKGKQVIVVADRWLPNSGSTVYPLYYGDMSQAIT 343 (408) T ss_pred hhcC--CCEEEEcHHHHH--HHHHhhcCCC-ceEeccCcCCCCCceecceeeEEecCcccccccCCcceEEEEehhccEE Confidence 7775 468899988765 3333321111 111111111 113489999999877 477543 67777777655 Q ss_pred EEeeCcEEEEEEEccchhhhhhh-----hhhhhhhhccccccEEEEecceecCccCCCCcCCCC Q lcl|Aclame:pro 297 YFMDESHRRSIDENPKKDRVENY-----ESMNIDYVVEVYAAGCLLENITLGDFTAPAAPESGA 355 (355) Q Consensus 297 Y~Q~gs~RR~~~d~p~r~rve~y-----~s~Ne~YvVEd~~~~a~ienI~~~~~~~~~~~~~~a 355 (355) ++.++..+=.+-+.. .+++..+ -..--++.|-+..+++.++=-.. ..+.++.+.++| T Consensus 344 ~~~~~~~~i~~~~~~-~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~-~~~~~~~~~~~~ 405 (408) T protein:vir:74 344 LFDRENMSLLPTNIG-AGAFETDTTKIRVIDRFDVKATDSEALVAGSFTAI-ADQVGNFKTTTS 405 (408) T ss_pred EEEecceEEEEeccc-cchhhcceeeEEEEEeeCcEEecccceEEEEeecc-cCCCCCCCCCcc Confidence 665555553332211 1111111 11112445555555555532111 123334444444 No 42 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=98.22 E-value=2.5e-07 Score=56.78 Aligned_cols=297 Identities=11% Similarity=0.041 Sum_probs=151.6 Q ss_pred CCHH----HHHHHHHHHHHHHHH---h-------CCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhh Q lcl|Aclame:pro 1 MRPE----TRFKFNAYLTRVAEL---N-------NISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGE 66 (355) Q Consensus 1 M~~~----tr~~f~~y~~~~A~~---n-------gv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge 66 (355) +... .+...+.+....... . ............|-|.+...+.+.+.+.+.+++.++++++.--.++ T Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 149 (385) T protein:vir:19 70 NPGEKKSFSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALE 149 (385) T ss_pred ccchhhhhHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceE Confidence 1111 111112222211100 0 0010001124457788899999999999999999999998755555 Q ss_pred hhcccccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHh Q lcl|Aclame:pro 67 KIGVGVTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAG 146 (355) Q Consensus 67 ~v~lgv~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IG 146 (355) .......++-++-+. .+......+ ..++...+..++.--.+.|+.+.|+.. ++++..+.+.+.++++.-.-.-- T Consensus 150 ~~~~~~~~~~a~~v~--E~~~~~~~~-~~~~~~~~~~~k~~~~~~is~ell~d~---~~l~~~i~~~la~a~~~~~d~~~ 223 (385) T protein:vir:19 150 YVREEVFTNNADVVA--EKALKPESD-ITFSKQTANVKTIAHWVQASRQVMDDA---PMLQSYINNRLMYGLALKEEGQL 223 (385) T ss_pred EEEEecCCcceeeec--cCccccccc-cceeEEEEeeeeEEEeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHH Confidence 444432222222221 122222333 357778888888888888999988864 57899999999999888665556 Q ss_pred hcccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhh Q lcl|Aclame:pro 147 FNGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVY 226 (355) Q Consensus 147 fnG~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~ 226 (355) ++|.-. .+|. .-+++... +. ....+. .....+|.|+ +++..+ .+.+ T Consensus 224 l~G~g~-------~~~~-----------------~Gi~~~~~---~~----~~~~~~-~~~~~~d~i~-~~~~~l-~~~~ 269 (385) T protein:vir:19 224 LNGDGT-------GDNL-----------------EGLNKVAT---AY----DTSLNA-TGDTRADIIA-HAIYQV-TESE 269 (385) T ss_pred HhccCC-------CCcc-----------------cccccccc---cc----cccccc-cccchHHHHH-HHHHhh-cccc Confidence 677211 1111 11111110 01 011111 2223566543 455544 4545 Q ss_pred hCCCCeEEEEcHHHHHHHHHHHHhhccccchhhHHHH-HHhhhhhcccccccCCccCCCcEEEecCCC-cEEEEeeCcEE Q lcl|Aclame:pro 227 QDDPNLVAIVGRKLLADKYFPLVNKQQENSESLAADI-IISQKRIGNLPAVRVPYFPANAVLVTTLEN-LSIYFMDESHR 304 (355) Q Consensus 227 ~~~~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~-~~~~k~iGGlpa~~~PffP~~~ilIT~l~N-LsIY~Q~gs~R 304 (355) ++. -+++|.+.... .+..+--.++ .-+-... -..+.++-|+|++..+++|++.+++-.+++ +-|+.+.+ .. T Consensus 270 ~~~--~~~~~~~~~~~--~l~~lkd~~G--~~l~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~~~~~~~~~~~~-~~ 342 (385) T protein:vir:19 270 FSA--SGIVLNPRDWH--NIALLKDNEG--RYIFGGPQAFTSNIMWGLPVVPTKAQAAGTFTVGGFDMASQVWDRMD-AT 342 (385) T ss_pred CCC--CEEEEcHHHHH--HHHHhhcCCC--ceeccCcccCCCceecceeeEEcCcCCCCcEEEeecccEEEEEEecc-eE Confidence 443 27888887655 2332211111 1111000 122467889999999999999999998886 43443333 22 Q ss_pred EEEEEccchhhhhhhhhhh-hhhhccccccEEEEe--cceecCccCCC Q lcl|Aclame:pro 305 RSIDENPKKDRVENYESMN-IDYVVEVYAAGCLLE--NITLGDFTAPA 349 (355) Q Consensus 305 R~~~d~p~r~rve~y~s~N-e~YvVEd~~~~a~ie--nI~~~~~~~~~ 349 (355) =.+-++. .+|+.+| -+|.++-+-.++... .|...+..+.+ T Consensus 343 v~~~~~~-----~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:19 343 VEVSRED-----RDNFVKNMLTILCEERLALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred EEEeccc-----cchhhcCcEEEEEEEeeccEEecccceEEEEeccCC Confidence 2221111 1344455 344443333333332 34333322222 No 43 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=98.22 E-value=2.5e-07 Score=56.78 Aligned_cols=297 Identities=11% Similarity=0.041 Sum_probs=151.6 Q ss_pred CCHH----HHHHHHHHHHHHHHH---h-------CCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhh Q lcl|Aclame:pro 1 MRPE----TRFKFNAYLTRVAEL---N-------NISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGE 66 (355) Q Consensus 1 M~~~----tr~~f~~y~~~~A~~---n-------gv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge 66 (355) +... .+...+.+....... . ............|-|.+...+.+.+.+.+.+++.++++++.--.++ T Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 149 (385) T protein:vir:18 70 NPGEKKSFSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALE 149 (385) T ss_pred ccchhhhhHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceE Confidence 1111 111112222211100 0 0010001124457788899999999999999999999998755555 Q ss_pred hhcccccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHh Q lcl|Aclame:pro 67 KIGVGVTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAG 146 (355) Q Consensus 67 ~v~lgv~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IG 146 (355) .......++-++-+. .+......+ ..++...+..++.--.+.|+.+.|+.. ++++..+.+.+.++++.-.-.-- T Consensus 150 ~~~~~~~~~~a~~v~--E~~~~~~~~-~~~~~~~~~~~k~~~~~~is~ell~d~---~~l~~~i~~~la~a~~~~~d~~~ 223 (385) T protein:vir:18 150 YVREEVFTNNADVVA--EKALKPESD-ITFSKQTANVKTIAHWVQASRQVMDDA---PMLQSYINNRLMYGLALKEEGQL 223 (385) T ss_pred EEEEecCCcceeeec--cCccccccc-cceeEEEEeeeeEEEeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHH Confidence 444432222222221 122222333 357778888888888888999988864 57899999999999888665556 Q ss_pred hcccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhh Q lcl|Aclame:pro 147 FNGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVY 226 (355) Q Consensus 147 fnG~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~ 226 (355) ++|.-. .+|. .-+++... +. ....+. .....+|.|+ +++..+ .+.+ T Consensus 224 l~G~g~-------~~~~-----------------~Gi~~~~~---~~----~~~~~~-~~~~~~d~i~-~~~~~l-~~~~ 269 (385) T protein:vir:18 224 LNGDGT-------GDNL-----------------EGLNKVAT---AY----DTSLNA-TGDTRADIIA-HAIYQV-TESE 269 (385) T ss_pred HhccCC-------CCcc-----------------cccccccc---cc----cccccc-cccchHHHHH-HHHHhh-cccc Confidence 677211 1111 11111110 01 011111 2223566543 455544 4545 Q ss_pred hCCCCeEEEEcHHHHHHHHHHHHhhccccchhhHHHH-HHhhhhhcccccccCCccCCCcEEEecCCC-cEEEEeeCcEE Q lcl|Aclame:pro 227 QDDPNLVAIVGRKLLADKYFPLVNKQQENSESLAADI-IISQKRIGNLPAVRVPYFPANAVLVTTLEN-LSIYFMDESHR 304 (355) Q Consensus 227 ~~~~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~-~~~~k~iGGlpa~~~PffP~~~ilIT~l~N-LsIY~Q~gs~R 304 (355) ++. -+++|.+.... .+..+--.++ .-+-... -..+.++-|+|++..+++|++.+++-.+++ +-|+.+.+ .. T Consensus 270 ~~~--~~~~~~~~~~~--~l~~lkd~~G--~~l~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~~~~~~~~~~~~-~~ 342 (385) T protein:vir:18 270 FSA--SGIVLNPRDWH--NIALLKDNEG--RYIFGGPQAFTSNIMWGLPVVPTKAQAAGTFTVGGFDMASQVWDRMD-AT 342 (385) T ss_pred CCC--CEEEEcHHHHH--HHHHhhcCCC--ceeccCcccCCCceecceeeEEcCcCCCCcEEEeecccEEEEEEecc-eE Confidence 443 27888887655 2332211111 1111000 122467889999999999999999998886 43443333 22 Q ss_pred EEEEEccchhhhhhhhhhh-hhhhccccccEEEEe--cceecCccCCC Q lcl|Aclame:pro 305 RSIDENPKKDRVENYESMN-IDYVVEVYAAGCLLE--NITLGDFTAPA 349 (355) Q Consensus 305 R~~~d~p~r~rve~y~s~N-e~YvVEd~~~~a~ie--nI~~~~~~~~~ 349 (355) =.+-++. .+|+.+| -+|.++-+-.++... .|...+..+.+ T Consensus 343 v~~~~~~-----~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:18 343 VEVSRED-----RDNFVKNMLTILCEERLALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred EEEeccc-----cchhhcCcEEEEEEEeeccEEecccceEEEEeccCC Confidence 2221111 1344455 344443333333332 34333322222 No 44 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=98.21 E-value=4e-07 Score=55.67 Aligned_cols=296 Identities=11% Similarity=0.052 Sum_probs=154.6 Q ss_pred CCHHHHHHHHHHHHHHHHHhCC-Ch------HHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccc- Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNI-ST------DDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGV- 72 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv-~~------~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv- 72 (355) .+...+..|.+|+......... .. ......+.|-+.+.+.+.+.+.+.+.+++.++++++.-..|.....-. T Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 168 (408) T protein:vir:10 89 LKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWT 168 (408) T ss_pred hHHHHHHHHHHHhhcchhhhhhhhhhhhhcccccCCceeccHhHHHHHHHHHHhhchhhhhcceeeccCCcceEEEeecc Confidence 2233333444443322111000 00 011235777667788899999999999999999999988887654322 Q ss_pred -cccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccc Q lcl|Aclame:pro 73 -TGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTT 151 (355) Q Consensus 73 -~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s 151 (355) .++.+.-+..+ ......+...++...+.+++.---+.|+.+.|+.. ..+|+..+.+.+.++++.-.-.--++|+. T Consensus 169 ~~~~~a~~v~E~--~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~~~~~~~~~il~g~g 244 (408) T protein:vir:10 169 DVTPLTVMDAED--GKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDT--AENILAWLSSWIAKKVVVTRNQAIIEVMK 244 (408) T ss_pred ccccceeeecCc--cccccccCcceeeEEeeeeeEEeeehhHHHHHhhc--hHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 11223222222 12211233446777788888877788999988863 44888889999988888655544444432 Q ss_pred cccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCC Q lcl|Aclame:pro 152 RADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPN 231 (355) Q Consensus 152 ~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~ 231 (355) .. ... +.. .+.|.|+ +++...+++.|+. . T Consensus 245 ~~---------------------------------------~~~------~~~---~~~~~l~-~~~~~~~~~~~~~--~ 273 (408) T protein:vir:10 245 AA---------------------------------------PKK------PTI---AKFDDVI-TMINTAVDPAIIA--T 273 (408) T ss_pred cc---------------------------------------ccc------ccc---ccHHHHH-HHHHHhhhhhhcc--C Confidence 11 000 001 1455544 3444456777774 4 Q ss_pred eEEEEcHHHHHHHHHHHHhhccccchhhHHHHH-HhhhhhcccccccCC--ccCCCc-----EEEecCCCcEEEEeeCcE Q lcl|Aclame:pro 232 LVAIVGRKLLADKYFPLVNKQQENSESLAADII-ISQKRIGNLPAVRVP--YFPANA-----VLVTTLENLSIYFMDESH 303 (355) Q Consensus 232 LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~-~~~k~iGGlpa~~~P--ffP~~~-----ilIT~l~NLsIY~Q~gs~ 303 (355) -+++|.+..+.. +..+.-.++.. .....+. ....++-|+|++.++ .+|..+ +++-.+++.-..+.++.. T Consensus 274 a~~v~n~~~~~~--l~~lkd~~G~~-i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~ 350 (408) T protein:vir:10 274 SSLLTNQSGLNK--LALVKTAEGKY-LLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENM 350 (408) T ss_pred CEEEEcHHHHHH--HHHhhccCCce-EeccCcCCCCCceecceeeEEecccccCccCCCceEEEEEehhccEEEEEecce Confidence 588999887663 33332111110 0101111 123489999999876 577654 788888876544444444 Q ss_pred EEEEEEccc----hhhhhhhhhhhhhhhccccccEEEEecceecCccCCCCcCCCC Q lcl|Aclame:pro 304 RRSIDENPK----KDRVENYESMNIDYVVEVYAAGCLLENITLGDFTAPAAPESGA 355 (355) Q Consensus 304 RR~~~d~p~----r~rve~y~s~Ne~YvVEd~~~~a~ienI~~~~~~~~~~~~~~a 355 (355) .=.+.+.+. ++.+.-+-..--+..|-+..+++.++=-... +..+..+.+++ T Consensus 351 ~v~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~~~~-~~~~~~~~~~~ 405 (408) T protein:vir:10 351 SLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIA-DQVGNFKTTTS 405 (408) T ss_pred EEEEcccccchhhcCceEEEEEEeeccEEeccccEEEEEeeccc-cCCCCCCCCCc Confidence 433333221 1111111112233444455555544311111 12223333333 No 45 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=98.20 E-value=5.2e-07 Score=55.02 Aligned_cols=311 Identities=14% Similarity=0.146 Sum_probs=156.8 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChH---H------cceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTD---D------VSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVG 71 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~---~------v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lg 71 (355) ...+.+..|..|+.+-.. .++... . ....+.|-+.+...+.+.+++.+.+++.++++++.-... ++..- T Consensus 78 ~~~e~~~a~~~~l~~g~~-~~~~~~e~~a~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~-~~~~~ 155 (407) T protein:vir:48 78 VASEHKEAFIGFMRKGRE-DGLRELERKALQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATVITLGGSDY-KKLVN 155 (407) T ss_pred hhhHHHHHHHHHHhccch-hhhhHHHHHhhhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhceeeecCCCce-EEEEe Confidence 666777788888653210 111000 0 112456655668889999999999999999988865432 22233 Q ss_pred ccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccc Q lcl|Aclame:pro 72 VTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTT 151 (355) Q Consensus 72 v~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s 151 (355) .+++-++-+.- +......+....+...|..++.---+.|+.+.|+. ...+|+..+.+.+.+.++.=.-.--+||+- T Consensus 156 ~~~~~a~~v~E--~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~l~~~i~~~~~~a~l~G~G 231 (407) T protein:vir:48 156 LGGTTSGWVGE--TDARPETATSKLGLIEPFMGEIYGNPQATQKMLDD--AFFNVEDWINSELALEFAEQEEIAFTSGDG 231 (407) T ss_pred cCCcceeeecc--cccccccccccceeEEeeeeeeEeehhhHHHHHhc--chHHHHHHHHHHHHHHHHHHHHhhhhccCC Confidence 34444433221 12221123334566778888877788999999985 234788888888888777655444466732 Q ss_pred cccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCC Q lcl|Aclame:pro 152 RADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPN 231 (355) Q Consensus 152 ~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~ 231 (355) +..|. |=|....... .......+.. ..+..+.. +-.+.|.| .+++..| ++.|+... T Consensus 232 -------~~~p~------Gil~~~~~~~-----~~~~~~~~~~--~~~~~~~~-~~~~~d~i-~~l~~~l-~~~~~~~a- 287 (407) T protein:vir:48 232 -------SKKPK------GFLAYESTDE-----DDKTRAFGKL--QHIASGAA-SGVTADAI-IKLIYTL-RKAHRSGA- 287 (407) T ss_pred -------CCccc------eeeecccccc-----cccccccccc--cccccccc-cccChHHH-HHHHHhh-chhhhcCC- Confidence 11232 2221110000 0000011111 11122222 22345654 4677665 67777654 Q ss_pred eEEEEcHHHHHHHHHHHH-hhccccc--hhhHHHHHHhhhhhcccccccCCccCCCc-----EEEecCCCcEEEEeeCcE Q lcl|Aclame:pro 232 LVAIVGRKLLADKYFPLV-NKQQENS--ESLAADIIISQKRIGNLPAVRVPYFPANA-----VLVTTLENLSIYFMDESH 303 (355) Q Consensus 232 LVvivG~dLl~~k~~~l~-n~~~~~t--e~~aa~~~~~~k~iGGlpa~~~PffP~~~-----ilIT~l~NLsIY~Q~gs~ 303 (355) +++|.+..++ .+..+ .....|- +.... ....+|-|+|++..+++|..+ |++=.|+..-..+.+... T Consensus 288 -~~v~n~~~~~--~L~~lkD~~Gr~l~~~~~~~---g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~ 361 (407) T protein:vir:48 288 -KFMMNNSSLF--AIRLLKDNDGNYLWRPGIEL---GQPSSLAGYGIVENEQMPDIAADAKAIAFGNFKRGYTIVDRIGT 361 (407) T ss_pred -EEEEcHHHHH--HHHHhhccCCceeeccCcCC---CCCceecceeeEEecCcCCccCCccEEEEEeccccEEEEEeece Confidence 6888887664 23333 2221110 11110 113479999999999999733 666677643222333334 Q ss_pred EEEEEEccchhhhhhhhhhhhhhhccccccEEEEe--cceecCccCCCCcCCCC Q lcl|Aclame:pro 304 RRSIDENPKKDRVENYESMNIDYVVEVYAAGCLLE--NITLGDFTAPAAPESGA 355 (355) Q Consensus 304 RR~~~d~p~r~rve~y~s~Ne~YvVEd~~~~a~ie--nI~~~~~~~~~~~~~~a 355 (355) +-.. ++ +. ..-..+|.++..--++.++ .|.+...++++....+| T Consensus 362 ~i~~--d~----~~--~~~~~~~~~~~r~d~~v~~~~a~~~l~~~aa~~~~~~~ 407 (407) T protein:vir:48 362 RILR--DP----YT--NKPFVGFYTTKRTGGMLVDSQAIKLMKIGAATRQKAAA 407 (407) T ss_pred EEEe--ec----cc--cCCcEEEEEEEEeccEEecccceEEEEeeccCCCCCCC Confidence 3221 11 11 1112344333322233332 44444444444444444 No 46 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=98.18 E-value=4.8e-07 Score=55.21 Aligned_cols=305 Identities=10% Similarity=0.031 Sum_probs=147.9 Q ss_pred CCHHHHHHHHHHHHHHHHHh---CCChH--HcceeeecCcHHHHHHHHHHHh-hHHHhCcCccccchhhhhh-------h Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELN---NISTD--DVSKKFTVEPSVTQTLMNTVQA-SSAFLKTINILPVAEMKGE-------K 67 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~n---gv~~~--~v~~~Fsv~P~~~q~L~~~iqe-ss~FL~~INv~~V~e~~Ge-------~ 67 (355) |....+..+..+........ ..... .....+.+.|......+....+ ++.+.+.++++++.--... . T Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 177 (419) T protein:vir:94 98 RARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGT 177 (419) T ss_pred HHhhhhhhhhHHHHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHhhhhhhhhcceeeeccCCceeeeeecccc Confidence 22222222222222221111 10000 1223556777777777665544 4456678888876532221 1 Q ss_pred hcccccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhh Q lcl|Aclame:pro 68 IGVGVTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGF 147 (355) Q Consensus 68 v~lgv~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGf 147 (355) +.+...++-++= ++.+... +.....++...+..++.---+.|+.+.|+.. ++|+..+.+.+.++++.=.-.-.+ T Consensus 178 ~~~~~~~~~a~~--v~Eg~~~-~~~~~~~~~i~~~~~k~~~~~~is~ell~d~---~~l~~~i~~~la~a~~~~~d~aii 251 (419) T protein:vir:94 178 AGAGSTWNKAAV--VPEGTAK-PQSTLSFDTITTTLKTVAHWLPITRQAADDN---SQLMGYIQGRLTYGLRFLRDRQLL 251 (419) T ss_pred ccccccCcccce--ecCCccc-cccccceeeEEeeeeeEEEeehhhHHHHHhH---HHHHHHHHHHHHHHHHHHHHHHHH Confidence 111111211211 1112222 2222346667777777777788999999864 579999999999999887777778 Q ss_pred cccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhh Q lcl|Aclame:pro 148 NGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQ 227 (355) Q Consensus 148 nG~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~ 227 (355) ||.- +.+|. |++..-. +.+.. .... .....+...+|. +.+++..+....++ T Consensus 252 ~G~G-------~~~p~------Gi~~~~~------~~~~~------~~~~---~~~~t~~~~~~~-l~~~~~~~~~~~~~ 302 (419) T protein:vir:94 252 NGNG-------STEMQ------GILTTPG------IGTYQ------QPKP---TAPATDEPPLVD-IRRAKTVAEIAGFP 302 (419) T ss_pred hccC-------ccccc------ceecccc------ccccc------cccc---ccccccchhHHH-HHHHHHhhhhccCC Confidence 8833 22343 7765211 00000 0000 001111122332 33345444444333 Q ss_pred CCCCeEEEEcHHHHHHHHHHHHhhccccchhhHHHHH-HhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEE Q lcl|Aclame:pro 228 DDPNLVAIVGRKLLADKYFPLVNKQQENSESLAADII-ISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRS 306 (355) Q Consensus 228 ~~~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~-~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~ 306 (355) . . +++|.+..+.. ...+......+-- .-.... ....+|-|+|++..+++|++.+++-.+++...++.+....=. T Consensus 303 ~--~-~~v~n~~~~~~-l~~~k~~~~~~~~-~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~ 377 (419) T protein:vir:94 303 P--D-GVVVHPQDWES-IELDQAPGSGVFR-VIANVQGEATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVL 377 (419) T ss_pred C--C-EEEEcHHHHHH-HHHHhhcCCCcee-ecCCcccCCCccccceeeEEcCCCCCccEEEeeccceEEEEEecceEEE Confidence 2 2 78888776542 2222222122100 000000 124589999999999999999999999987655655554433 Q ss_pred EEEccchhhhhhhhhhh-hhhhccccccEEEEe--cceecCccCCCC Q lcl|Aclame:pro 307 IDENPKKDRVENYESMN-IDYVVEVYAAGCLLE--NITLGDFTAPAA 350 (355) Q Consensus 307 ~~d~p~r~rve~y~s~N-e~YvVEd~~~~a~ie--nI~~~~~~~~~~ 350 (355) +.+... ++...| .+|.++.+--++.+. .|...+.++..+ T Consensus 378 ~~~~~~-----~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~~aa~~ 419 (419) T protein:vir:94 378 MTDSHA-----DFFTANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) T ss_pred Eecccc-----chhhcCcEEEEEEEeeccEEeccccEEEEEeccCCC Confidence 322221 222233 445444444444443 333333222222 No 47 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=98.16 E-value=8.3e-07 Score=53.91 Aligned_cols=289 Identities=11% Similarity=0.068 Sum_probs=156.9 Q ss_pred CCHHHHHHHHHHHHHHHHHhC-CCh------HHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhccc-- Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNN-IST------DDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVG-- 71 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ng-v~~------~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lg-- 71 (355) .+...+..|..|+........ ... ......+.|-+.+.+.+.+.+.+.+.+++.++++++....|....+- T Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 168 (404) T protein:vir:39 89 LKDKFVKEFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWT 168 (404) T ss_pred hHHHHHHHHHHHHhcchhhhhhhhhhhhhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcceeeccCCcceEEEEeec Confidence 344455556666543221111 000 01123466777888999999999999999999999998888766442 Q ss_pred ccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccc Q lcl|Aclame:pro 72 VTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTT 151 (355) Q Consensus 72 v~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s 151 (355) ..++.+.-+..+. .....+...++...+.+++.-=-+.|+.+.|+.. .++|+..+.+.+.+.++.=.-.--++|+. T Consensus 169 ~~~~~a~~v~Eg~--~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~~~~~~d~~il~g~g 244 (404) T protein:vir:39 169 DVTPLTVMDAEDG--KIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDT--AENILAWLSSWIAKKVVVTRNQAIIAAMG 244 (404) T ss_pred CCccceeeecCcc--ccccccccceeeEEeeeeeEEeeehhHHHHHhhc--hHHHHHHHHHHHHHHHHHHHHHHHHhccc Confidence 2223333333321 1111233446667777777776778999988864 35788888888888887655555556632 Q ss_pred cccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCC Q lcl|Aclame:pro 152 RADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPN 231 (355) Q Consensus 152 ~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~ 231 (355) .. .. .+..- +.|.+ .+++...+++.|+. . T Consensus 245 ~~---------------------------------------~~------~~~~~---~~~~i-~~~~~~~~~~~~~~--~ 273 (404) T protein:vir:39 245 TV---------------------------------------PK------KPTIA---KFDDV-ITMINTSVDPAIIA--T 273 (404) T ss_pred cc---------------------------------------cc------ccccc---cHHHH-HHHHHHhhhhhhcc--C Confidence 11 00 01111 34543 34555567787765 4 Q ss_pred eEEEEcHHHHHHHHHHHHhhccccchhhHHHHH-HhhhhhcccccccCC--ccCCCc-----EEEecCCCcEEEEeeCcE Q lcl|Aclame:pro 232 LVAIVGRKLLADKYFPLVNKQQENSESLAADII-ISQKRIGNLPAVRVP--YFPANA-----VLVTTLENLSIYFMDESH 303 (355) Q Consensus 232 LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~-~~~k~iGGlpa~~~P--ffP~~~-----ilIT~l~NLsIY~Q~gs~ 303 (355) -+++|.+..+. .+..+-..++..- ....+. ....+|-|+|++.+. .+|..+ +++-.|++.-+.+.++.. T Consensus 274 a~~v~n~~~~~--~L~~lkd~~G~~l-~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~ 350 (404) T protein:vir:39 274 SSLLTNQSGLN--KLALVKTAEGKYL-LEPDPTKPNSYLIKGKKVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENM 350 (404) T ss_pred CEEEEcHHHHH--HHHHhhccCCcee-eccCcCCCCcceecceeEEEecccccCccCCCccEEEEEeccccEEEEeecce Confidence 58899988764 3333321221110 000111 123488899999865 456543 777788876666665555 Q ss_pred EEEEEEccchhhhhhhhhhh---------hhhhccccccEEEEecceecCccCCCCcCCCC Q lcl|Aclame:pro 304 RRSIDENPKKDRVENYESMN---------IDYVVEVYAAGCLLENITLGDFTAPAAPESGA 355 (355) Q Consensus 304 RR~~~d~p~r~rve~y~s~N---------e~YvVEd~~~~a~ienI~~~~~~~~~~~~~~a 355 (355) +=.+.+... ++...| -++.|-+..+++.+. +...+.+....+++ T Consensus 351 ~i~~~~~~~-----~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~---~~~~a~~~~~~~~~ 403 (404) T protein:vir:39 351 SLLPTNIGA-----GAFETDTTKIRVIDRFDVKTTDSEALVAGS---FTAIADQVGNFTAG 403 (404) T ss_pred EEEEeccch-----hhhhhceeeEEEEeeeccEEecccceEEEE---eeccccCCCCCCCC Confidence 543333221 222222 233444444444433 33322223333333 No 48 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=98.16 E-value=4.7e-07 Score=55.29 Aligned_cols=287 Identities=13% Similarity=0.100 Sum_probs=156.6 Q ss_pred HHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccccCCCCc-C-cccccc Q lcl|Aclame:pro 16 VAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTTDTSGDK-E-RQTADF 93 (355) Q Consensus 16 ~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt~T~~~~-~-r~~~~~ 93 (355) +| .... ....+.|-+.+.+.+.+.+++.+.++++++++++.--. .++-.-.+++-|+-+.-+... + -.+... T Consensus 1 ma---~~t~--~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~wv~E~~~~~~~~~~~s~ 74 (305) T protein:vir:25 1 MA---DISR--AEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKT-THLPVLATLPEADWVGESATDPKGVKPTSK 74 (305) T ss_pred CC---CccC--CccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCc-EEEEEEeCCcceEEeecccccccccccccc Confidence 32 2222 12357788888899999999999999999999876322 122222233333322221110 0 012223 Q ss_pred ccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhhhhhhhccchhHHH Q lcl|Aclame:pro 94 TALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTKNTLLQDVAVGWLQ 173 (355) Q Consensus 94 ~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~anPllqDVNkGWlq 173 (355) ..++...+..++.---..|+.+.|+... ++|+..+++.+.++++.-.-.-.|||+-.. +.. T Consensus 75 ~~f~~i~~~~~k~~~~~~is~ell~ds~--~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~--~~~--------------- 135 (305) T protein:vir:25 75 VTWANRTLVAEEIAVIIPVHENVIDDAT--VAVLTEVAELGGQAIGKKLDQAVIFGTDKP--ASW--------------- 135 (305) T ss_pred cceeeEEeeeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHhhhheeccCCC--CCc--------------- Confidence 4466677888888888899999997643 589999999999999999999999996411 110 Q ss_pred HHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 174 KYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKLLADKYFPLVNKQQ 253 (355) Q Consensus 174 ~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dLl~~k~~~l~n~~~ 253 (355) .+..+....... +. .....+..-.+.++..++..+...+.+.-+... .++|.+.....- ..+-+. + T Consensus 136 -----~~~~~~~~~~~~-~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~v~~~~~~~~l-~~lkd~-~ 201 (305) T protein:vir:25 136 -----VSPALIPAAVTA-GQ---AVEVVGGVANESDIVGATNRAAKAVASAGWAPD---TLLSSLALRYEV-ANIRDA-N 201 (305) T ss_pred -----cccccccccccc-cc---cccccccchhhhHHHHHHHHHHHhhhhcccccc---eeEecHHHHHHH-HHhhcc-C Confidence 011111111000 01 111122223344444445554433322222211 267777655521 122111 1 Q ss_pred ccchhhHHHHHHhhhhhcccccccCCccCCC----cEEEecCCCcEEEEeeCcEEEEEEEc----cchhhhhhhhhhh-- Q lcl|Aclame:pro 254 ENSESLAADIIISQKRIGNLPAVRVPYFPAN----AVLVTTLENLSIYFMDESHRRSIDEN----PKKDRVENYESMN-- 323 (355) Q Consensus 254 ~~te~~aa~~~~~~k~iGGlpa~~~PffP~~----~ilIT~l~NLsIY~Q~gs~RR~~~d~----p~r~rve~y~s~N-- 323 (355) + +-+....++-|+|++..+++|.. .+++-.++++.|..+.| .+=.+.++ ....++.-|++-. T Consensus 202 G-------~~i~~~~~l~G~Pv~~~~~~~~~~~~~~~~~gd~s~~~i~~~~~-~~i~~~~~~~~~~~~~~~~~~~~~~~~ 273 (305) T protein:vir:25 202 G-------NPVFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQD-ITVKFLDQATLGTGENQINLAERDMVA 273 (305) T ss_pred C-------ceeecCCcccccceEEcCccCCCCCccEEEEEecceEEEEEecC-eEEEEeeeeeeecCCceeeeeecCcEE Confidence 1 11223458999999999998854 57888999875544443 33222222 1122222222211 Q ss_pred ------hhhhccccccEEEEecceecCccCCCC Q lcl|Aclame:pro 324 ------IDYVVEVYAAGCLLENITLGDFTAPAA 350 (355) Q Consensus 324 ------e~YvVEd~~~~a~ienI~~~~~~~~~~ 350 (355) -|+.|-+..+++.+.+++++. .+|++ T Consensus 274 ~R~~~r~~~~v~~p~a~v~~~~~~~~~-~~pa~ 305 (305) T protein:vir:25 274 LRLKARFAYVLGVSATAQGANKTPVAV-VAPAA 305 (305) T ss_pred EEEEEeecceeeCcccEEEEccccccc-cCCCC Confidence 256788888888888887653 23333 No 49 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=98.16 E-value=1.2e-06 Score=53.04 Aligned_cols=302 Identities=10% Similarity=0.061 Sum_probs=156.7 Q ss_pred CC--HHHHHHHHHHHHHHHHHhC---------------------CChH-HcceeeecCcHHHHHHHHHHHhhHHHhCc-C Q lcl|Aclame:pro 1 MR--PETRFKFNAYLTRVAELNN---------------------ISTD-DVSKKFTVEPSVTQTLMNTVQASSAFLKT-I 55 (355) Q Consensus 1 M~--~~tr~~f~~y~~~~A~~ng---------------------v~~~-~v~~~Fsv~P~~~q~L~~~iqess~FL~~-I 55 (355) .+ ......|..++..++..-+ +... .....+.|-..+.+.+.+.+++.+.+++. . T Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~ 167 (435) T protein:vir:80 88 PKAPEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGA 167 (435) T ss_pred cchhhhhHHHHHHHHHHHHhccchhHHHHHHHHhhhhhhhhhhhhcccCCCCCccccchhHHHHHHHHHhhhchhhhccc Confidence 00 0011122333322221110 1000 01123445556677899999988877664 3 Q ss_pred ccccchhhhhhhhcccccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHH Q lcl|Aclame:pro 56 NILPVAEMKGEKIGVGVTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIV 135 (355) Q Consensus 56 Nv~~V~e~~Ge~v~lgv~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~ 135 (355) ++++...-. .++-.-.+++-++-+.-+ .. .|.....++...+..++.---..|+.+.|+..+-.|+++..+.+.+. T Consensus 168 ~~v~~~~~~-~~~p~~~~~~~a~~v~E~--~~-~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~l~~~i~~~l~ 243 (435) T protein:vir:80 168 RTLPLSNGN-ITIPRLKGGAIVGYIGAD--TD-IPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLT 243 (435) T ss_pred eeeecCCCc-eEEEEEeCCcceeeeccC--cc-ccccccceeeEEEeeEEEEEeehhhHHHHHhhcccHHHHHHHHHHHH Confidence 454443211 122122233334333322 22 23333457788899999999999999999998888999999999999 Q ss_pred HHhhhhHHHHhhcccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHH Q lcl|Aclame:pro 136 KRQALDLIMAGFNGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVM 215 (355) Q Consensus 136 ~~~alD~i~IGfnG~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~ 215 (355) ++++.-.-.--+||+..+. .| +|++... . .........+..+..++..+. T Consensus 244 ~a~~~~~d~a~l~G~G~~~------~p------~Gi~~~~---~---------------~~~~~~~~~~~~~~~~~~d~~ 293 (435) T protein:vir:80 244 AAIGAREDKAFIRDDGTAN------TP------KGLRFWA---L---------------PGNVITASDGSTLQKIETDLG 293 (435) T ss_pred HHHHHHHHHHhhccCCCCC------cc------cceeecc---c---------------ccceeecccccchhhHHHHHH Confidence 9999887777788843221 12 2443211 0 001111223344444444444 Q ss_pred HHHhcccchhhhCCCCeEEEEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCC--------cEE Q lcl|Aclame:pro 216 DATNNLIDEVYQDDPNLVAIVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPAN--------AVL 287 (355) Q Consensus 216 d~~~~lid~~~~~~~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~--------~il 287 (355) +++..+... .......+++|.+.... ++..+--.++..- --+ ....++-|+|++..+++|.+ .++ T Consensus 294 ~~~~~~~~~-~~~~~~~~~vmn~~~~~--~L~~lkd~~G~~l--~~~--~~~~~l~G~pv~~~~~~p~~~~~~~~~~~i~ 366 (435) T protein:vir:80 294 KAILALENA-DANLTQPGWIMAPRTFR--FLEGLRDGNGNKV--YPE--LANGMLKGYPVGKTTQVPINLGEAGKESEIY 366 (435) T ss_pred HHHHHhhcc-ccccccCEEEEcHHHHH--HHHhhhccCCcee--ccC--CCCCeEeeeeeEEeccccccccCCCCcceEE Confidence 444333221 12234568899888664 2332221221111 001 12458999999999999985 578 Q ss_pred EecCCCcEEEEeeCcEEEEEEEccch----hhhhhhhhhhh-hhhcc--------ccccEEEEecceecC Q lcl|Aclame:pro 288 VTTLENLSIYFMDESHRRSIDENPKK----DRVENYESMNI-DYVVE--------VYAAGCLLENITLGD 344 (355) Q Consensus 288 IT~l~NLsIY~Q~gs~RR~~~d~p~r----~rve~y~s~Ne-~YvVE--------d~~~~a~ienI~~~~ 344 (355) +..+++.- ...++..+=.+.+.... -.+.+++.+|. +|-+| +.++++.+.++.++- T Consensus 367 ~gd~s~~~-i~~~~~~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:80 367 FTDFGDVF-IGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVAWGA 435 (435) T ss_pred EEEcccEE-EEeecceEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEeccCCCC Confidence 88887753 34455555444333321 12234444453 33333 445555555665543 No 50 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=98.15 E-value=6.3e-07 Score=54.57 Aligned_cols=300 Identities=13% Similarity=0.101 Sum_probs=166.7 Q ss_pred CCHH-----HHHHHHHHHHHHHHHhCCChH-HcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccc Q lcl|Aclame:pro 1 MRPE-----TRFKFNAYLTRVAELNNISTD-DVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTG 74 (355) Q Consensus 1 M~~~-----tr~~f~~y~~~~A~~ngv~~~-~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~ 74 (355) |++. ..++|..++.+.+...-.... .......|-+.+...+.+.+.+.|.+++..+++++.-... ++-.-.++ T Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~-~~p~~~~~ 79 (324) T protein:vir:99 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEPMEGTEK-KFTFWADK 79 (324) T ss_pred CCCchHhhHHHHHHHHHhhhhhhccccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCce-EEEEEecC Confidence 6643 344555555555443322111 0123456777889999999999999999999998773321 22221223 Q ss_pred cccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhccccccc Q lcl|Aclame:pro 75 TIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRAD 154 (355) Q Consensus 75 ~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~ 154 (355) +-+.-+. .+... |.....++...+.+++.---..|+++.|+... ++|+..+.+.+.++++.-.-.--++|.-. T Consensus 80 ~~a~~v~--Eg~~~-~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~ai~~~~d~~~l~G~g~-- 152 (324) T protein:vir:99 80 PGAYWVG--EGQKI-ETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGN-- 152 (324) T ss_pred cceeEec--cCccc-cccccceeEEEEeeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhhcCCC-- Confidence 3333222 22222 33445677888899998888999999999774 68999999999998877666666777421 Q ss_pred CCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEE Q lcl|Aclame:pro 155 TSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVA 234 (355) Q Consensus 155 ~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVv 234 (355) +|... | ++.... . +. .. ..+.-.|.. +.+++..| ++.+++.. ++ T Consensus 153 ------~~~~~----~------------~~~~~~-~-~~----~~-~~~~~~~~~----i~~~~~~l-~~~~~~~~--~~ 196 (324) T protein:vir:99 153 ------NPFGK----S------------IAQSIE-K-TN----KV-IKGDFTQDN----IIDLEALL-EDDELEAN--AF 196 (324) T ss_pred ------CccCc----c------------cccccc-c-cc----ee-ccccCCHHH----HHHHHHhh-hhccCCCC--EE Confidence 11110 1 111000 0 00 01 111122333 33455544 45444433 68 Q ss_pred EEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCC--cEEEecCCCcEEEEeeCcEEEEEEEccc Q lcl|Aclame:pro 235 IVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPAN--AVLVTTLENLSIYFMDESHRRSIDENPK 312 (355) Q Consensus 235 ivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~--~ilIT~l~NLsIY~Q~gs~RR~~~d~p~ 312 (355) +|.+..+.. -..+-.....+ .... ....++-|+|++..|..|.+ .+++..++++ +|..++..+=.+.++.. T Consensus 197 v~n~~~~~~-L~~l~d~~g~~--~~~~---~~~~~l~G~PVv~~~~~~~~~~~~i~gd~~~~-~~~~~~~~~i~~~~~~~ 269 (324) T protein:vir:99 197 ISKTQNRSL-LRKIVDPETKE--RIYD---RNSDTLDGLPVVNLKSSNLKRGELITGDFDKL-IYGIPQLIEYKIDETAQ 269 (324) T ss_pred EEcHHHHHH-HHHhhcCCCce--eecC---CCCccccceeEEeecCCCCCcceEEEEecccE-EEEEecCcEEEEeeccc Confidence 888887662 12222221111 1100 11347899999999997755 5888999986 55555555554444432 Q ss_pred h----------------hhhhhhhhhhhhhhccccccEEEEecceecCccCCCCc Q lcl|Aclame:pro 313 K----------------DRVENYESMNIDYVVEVYAAGCLLENITLGDFTAPAAP 351 (355) Q Consensus 313 r----------------~rve~y~s~Ne~YvVEd~~~~a~ienI~~~~~~~~~~~ 351 (355) . |.+.---..--++.|.+.++++.+...+.+..+.|++- T Consensus 270 ~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~~~~~~~~~~ 324 (324) T protein:vir:99 270 LSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSVPGEV 324 (324) T ss_pred ccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCCCCCCCC Confidence 1 11111111223557777777777776666555444444 No 51 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=98.12 E-value=8e-07 Score=54.01 Aligned_cols=289 Identities=11% Similarity=0.022 Sum_probs=150.2 Q ss_pred CCHHHHHHHHHHHHHHHH------------Hh-CCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhh Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAE------------LN-NISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEK 67 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~------------~n-gv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~ 67 (355) .+.. .+.++...... .+ +.........+.|.|...+.+.+.+++.+.++++++++++..-.... T Consensus 83 ~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~ 159 (390) T protein:vir:97 83 VASE---QFQASTGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEY 159 (390) T ss_pred hhhH---HHHHHHHHhhhhhhhhhhHHHHHHHhhhcccccccccccchhhhHHHHHHHhhhhhhHhhcceeeccCCceEE Confidence 1111 11222211111 11 11111123456788889999999999999999999999987544444 Q ss_pred hcccccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhh Q lcl|Aclame:pro 68 IGVGVTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGF 147 (355) Q Consensus 68 v~lgv~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGf 147 (355) .......+-++-+. .+......+ ..++...+..++.--.+.|+.+.|+.. ++|+..+.+.+.+.++.-.-.--+ T Consensus 160 ~~~~~~~~~a~~v~--Eg~~~~~~~-~~~~~i~~~~~k~~~~~~is~ell~ds---~~l~~~i~~~la~a~~~~~d~a~l 233 (390) T protein:vir:97 160 VQETGFVNNAAIVA--EGALKPESS-LKFAKKTDTTHVIAHTMKATRQILSDA---PQLASYMNNRLIRGLKVKEDAEIL 233 (390) T ss_pred EEEecCCcceeeec--CCccccccc-cceeEEEEeeeeEEEeehhhHHHHHhH---HHHHHHHHHHHHHHHHHHHHHHHh Confidence 44332222232222 122222233 346677777777777788899888764 579999999999998887777778 Q ss_pred cccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhh Q lcl|Aclame:pro 148 NGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQ 227 (355) Q Consensus 148 nG~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~ 227 (355) +|+- |+ .+| +|.+.. . +.. . . .+....-..+|. +.+++..+ ++.++ T Consensus 234 ~G~g----~~--~~p------~Gi~~~------------~----~~~--~-~-~~~~~~~~~~d~-~~~~~~~~-~~~~~ 279 (390) T protein:vir:97 234 RGTG----AN--DGL------LGLIPQ------------A----TTY--A-A-PTTIAGATRVDQ-LRLAMLQA-SLAEY 279 (390) T ss_pred hcCC----CC--ccc------cceeec------------c----ccc--c-c-cccccccchHHH-HHHHHHhh-ccccC Confidence 8832 11 112 233321 0 000 0 0 111112223453 44466554 44444 Q ss_pred CCCCeEEEEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEE Q lcl|Aclame:pro 228 DDPNLVAIVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSI 307 (355) Q Consensus 228 ~~~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~ 307 (355) ... +++|.+..+. +-..|-.....+--....+ ..+.++-|+|++..+++|++.+++-.+++--.++.+....=.. T Consensus 280 ~~~--~~v~n~~~~~-~L~~lkd~~G~~l~~~~~~--~~~~~l~G~pV~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~ 354 (390) T protein:vir:97 280 PAS--GIVINPIDWA-AIELAKDANNQYLIGNARG--TLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEI 354 (390) T ss_pred CCC--EEEEcHHHHH-HHHHhhcCCCceeecCccC--CCCceecceeeEEcCCCCCCcEEEEeccceEEEEEecceEEEE Confidence 332 6888887654 1122222221111011111 1245899999999999999999999998743344444444333 Q ss_pred EEccchhhhhhhhhhh-hhhhccccccEEEEe-----cceec Q lcl|Aclame:pro 308 DENPKKDRVENYESMN-IDYVVEVYAAGCLLE-----NITLG 343 (355) Q Consensus 308 ~d~p~r~rve~y~s~N-e~YvVEd~~~~a~ie-----nI~~~ 343 (355) .+.+. +..+| .+|.++-+-..+... .|+|+ T Consensus 355 ~~~~~------~f~~~~~~~r~~~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 355 GYVND------DFQRNMVTVLAEERLALVVYRPEALITGSFA 390 (390) T ss_pred eeccc------ccccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 33221 12223 234333332222221 23333 No 52 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=98.09 E-value=1.7e-06 Score=52.16 Aligned_cols=300 Identities=12% Similarity=0.070 Sum_probs=166.7 Q ss_pred CCHH-----HHHHHHHHHHHHHHHhCCChHH-cceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccc Q lcl|Aclame:pro 1 MRPE-----TRFKFNAYLTRVAELNNISTDD-VSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTG 74 (355) Q Consensus 1 M~~~-----tr~~f~~y~~~~A~~ngv~~~~-v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~ 74 (355) |++. ....|..+....+...-..... ......|-+.+.+.+++.+.+.|.++++.+++++.-... ++-.-.++ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~-~ip~~~~~ 79 (324) T protein:vir:97 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEK-KFTFWADK 79 (324) T ss_pred CccchhHHHHHHHHHHhhhhhhhhccccccccCCCcceechhHHHHHHHHHHhhcchhhhcceeeccCCce-EEEEEecC Confidence 6643 4445666666555433222111 124566767788999999999999999999998763221 22222233 Q ss_pred cccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhccccccc Q lcl|Aclame:pro 75 TIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRAD 154 (355) Q Consensus 75 ~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~ 154 (355) +-|.-+ +.+... |......+...+.+++.---..|+.+.|+... ++|+..+.+.+.++++.-.-..-++|+-.. T Consensus 80 ~~a~~v--~Eg~~~-~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~--~~l~~~i~~~l~~aia~~~d~a~l~G~g~~- 153 (324) T protein:vir:97 80 PGAYWV--GEGQKI-ETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNN- 153 (324) T ss_pred cceeEe--ccCccc-cccccceeEEEEeeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhccCCCC- Confidence 333322 222233 33445677888999999989999999998664 689999999999998888888888885311 Q ss_pred CCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEE Q lcl|Aclame:pro 155 TSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVA 234 (355) Q Consensus 155 ~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVv 234 (355) . .|. | ++.... .. .....+...|.+|-. ++..+ .+-++.. + ++ T Consensus 154 -~----~~~------g------------i~~~~~-----~~--~~~~~~~~~~~~i~~----~~~~l-~~~~~~~-~-~~ 196 (324) T protein:vir:97 154 -P----FGK------S------------IAQSIE-----KT--NKVIKGDFTQDNIID----LEALL-EDDELEA-N-AF 196 (324) T ss_pred -c----cCc------c------------cccccc-----cc--ceeccccCCHHHHHH----HHHhh-hhccCCC-C-EE Confidence 1 111 1 111000 00 011123344665554 44333 3333332 2 67 Q ss_pred EEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccC--CCcEEEecCCCcEEEEeeCcEEEEEEEccc Q lcl|Aclame:pro 235 IVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFP--ANAVLVTTLENLSIYFMDESHRRSIDENPK 312 (355) Q Consensus 235 ivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP--~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p~ 312 (355) +|.+..+.. +..+--.++.. ... . ....++-|+|++..|..| ...+++-.++++ +|..++..+=.+.++.. T Consensus 197 v~n~~~~~~--L~~lkd~~g~~-~~~-~--~~~~tl~G~PV~~~~~~~~~~~~~~~gd~~~~-~i~~~~~~~i~~~~~~~ 269 (324) T protein:vir:97 197 ISKTQNRSL--LRKIVDPETKE-RIY-D--RNSDTLDGLPVVNLKSSNLKRGELITGDFDKL-IYGIPQLIEYKIDETAQ 269 (324) T ss_pred EEcHHHHHH--HHHhhcCCCce-eec-C--CCCccccceeeEeecCCCCCcceEEEEecccE-EEEEecCcEEEEeeccc Confidence 888877652 33221112211 110 0 113578999999988855 446888899987 45555555544444432 Q ss_pred hh--------hhhhhhhh--------hhhhhccccccEEEEecceecCccCCCCc Q lcl|Aclame:pro 313 KD--------RVENYESM--------NIDYVVEVYAAGCLLENITLGDFTAPAAP 351 (355) Q Consensus 313 r~--------rve~y~s~--------Ne~YvVEd~~~~a~ienI~~~~~~~~~~~ 351 (355) .. .+.-|+.- --++.|-+.++++.+...+.+....|++. T Consensus 270 ~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~ 324 (324) T protein:vir:97 270 LSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSVPGEV 324 (324) T ss_pred ccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCCCCCCCCCC Confidence 11 11112221 12445666677776665554443333333 No 53 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=98.08 E-value=3.5e-07 Score=55.95 Aligned_cols=275 Identities=13% Similarity=0.091 Sum_probs=157.0 Q ss_pred hCCChHHcc----eeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccccCCCCcCcccccccc Q lcl|Aclame:pro 20 NNISTDDVS----KKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTTDTSGDKERQTADFTA 95 (355) Q Consensus 20 ngv~~~~v~----~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt~T~~~~~r~~~~~~~ 95 (355) .|.++...+ ....|-+.+.+.+++.+++.|.++++.+++++.-....... .+++-++=+ +.+.+. +..-.. T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~--~~~~~a~~v--~E~~~~-~~~~~~ 75 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKPEEEFTF--MSGVGAFWV--DEAERI-QTSKPT 75 (299) T ss_pred CCcCCCcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecCCCcEEEEE--EcCCceeee--ecCccc-cccccc Confidence 666543321 23568788889999999999999999999998744333222 334444322 333333 333355 Q ss_pred ccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhhhhhhhccchhHHHHH Q lcl|Aclame:pro 96 LESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTKNTLLQDVAVGWLQKY 175 (355) Q Consensus 96 l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~anPllqDVNkGWlq~~ 175 (355) ++...+..++.--.+.|+.+.|+.= .++|+..+.+.+.+.++.-.-.--+||+- +..| .|.|+.. T Consensus 76 f~~v~l~~~k~~~~~~is~ell~ds--~~~~~~~i~~~l~~a~~~~~d~a~l~G~g-------~~~~------~gil~~~ 140 (299) T protein:vir:41 76 FTKAKMRSKKMGVIIPTTKENLNYS--VTNFFSLMQAEIVEAFYKKFDQAVFTGVE-------SPYN------WNILKSA 140 (299) T ss_pred eeEEEEeeEEEEEeehhhHHHHhcC--HHHHHHHHHHHHHHHHHHHHHHHHhhccc-------Cccc------ccccccc Confidence 7788899999888899999999842 26899999999999988866666678842 1122 2555422 Q ss_pred HhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHHHHHHHHHHHhhcccc Q lcl|Aclame:pro 176 RNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKLLADKYFPLVNKQQEN 255 (355) Q Consensus 176 Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dLl~~k~~~l~n~~~~~ 255 (355) .. . . ..... .-.++|. +.+++..+ .+.++... +++|.++.... ...+-.....+ T Consensus 141 ~~------------~----~---~~~~~--~~~~~~~-l~~~~~~l-~~~~~~~~--~~v~n~~~~~~-L~~lkd~~G~~ 194 (299) T protein:vir:41 141 TD------------A----S---NLVEE--TANKYDD-LNEAIGLI-EAEDLEPN--GIATIRKQRVK-YRSTKDGNGMP 194 (299) T ss_pred cc------------c----c---eeecc--ccccHHH-HHHHHHhh-hcccCCcC--EEEEcHHHHHH-HHHhhccCCce Confidence 11 0 0 00111 1123443 44566554 55555432 68899877552 23333322221 Q ss_pred chhhHHHHHHhhhhhcccccccCCccCCCc----EEEecCCCcEEEEeeCcEEEEEEEccchhhh-------hhhhhhh- Q lcl|Aclame:pro 256 SESLAADIIISQKRIGNLPAVRVPYFPANA----VLVTTLENLSIYFMDESHRRSIDENPKKDRV-------ENYESMN- 323 (355) Q Consensus 256 te~~aa~~~~~~k~iGGlpa~~~PffP~~~----ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~rv-------e~y~s~N- 323 (355) --. ......+.++-|+|++..+++|.+. +++-.++++.| ..++..+-.+.++...... .+++.+| T Consensus 195 l~~--~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~gdfs~~~i-~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 271 (299) T protein:vir:41 195 IFN--TATSNGVDDVLGLPIAYTPKYTFGDKDISELVGDWNQAYY-GILRGVEYEILTEATLTTVADETGKPLNLAERDM 271 (299) T ss_pred eec--CCcCCCCceecceeeEEecccCCCCCceEEEEEecccEEE-EEecCcEEEEeecccccccccccccchhhhhcCc Confidence 110 0111123478899999999999998 99999999754 4444454444444332211 1222222 Q ss_pred --------hhhhccccccEEEEecceecCccCCCC Q lcl|Aclame:pro 324 --------IDYVVEVYAAGCLLENITLGDFTAPAA 350 (355) Q Consensus 324 --------e~YvVEd~~~~a~ienI~~~~~~~~~~ 350 (355) .++.|.+.++++.+. . .+ +. T Consensus 272 ~~~r~~~~~d~~v~~~~A~~~l~---~---~a-a~ 299 (299) T protein:vir:41 272 AAIKATFEVGFMVVKDEAFSAVQ---P---KA-GN 299 (299) T ss_pred EEEEEEEEeccEEecccceEEEE---e---cc-CC Confidence 233444555555442 1 00 00 No 54 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=98.08 E-value=2e-06 Score=51.78 Aligned_cols=287 Identities=9% Similarity=0.031 Sum_probs=148.2 Q ss_pred CCHHHHHHHHHHHHHHHH-----HhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAE-----LNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGT 75 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~-----~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ 75 (355) .....+..|..|+...-. .++... ....+.|-+.+.+.+.+.+.+.+.+++.+++++|.-.+|.......++. T Consensus 88 ~~~~~~~~~~~~l~~~~~~~~~~~~~~t~--~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 165 (394) T protein:vir:10 88 PIDAKKKAINDFIHSHGKVIDNAAGHVTS--TEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKRATD 165 (394) T ss_pred HHHHHHHHHHHHHhccchhhhhhhccccc--ccCceeccHHHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEecCCC Confidence 233445567776543211 111111 1235778777789999999999999999999999877766554432222 Q ss_pred ccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccC Q lcl|Aclame:pro 76 IASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADT 155 (355) Q Consensus 76 ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~ 155 (355) -+.- .+...+....+...++...+..++.---+.|+.+.|+. ..++|+..+.+.+.+.++.-.-.--.+|..- T Consensus 166 ~~~~--~~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~la~~~~~~~~~~il~g~g~--- 238 (394) T protein:vir:10 166 RFSS--VAELAENPALAEPEFEQVDWSVSTYRGAIPLSEEAIAD--SAVDLTSLVGQSINEKSVNTYNAMIAPVLQS--- 238 (394) T ss_pred cccc--ccccccccccccccceeEEeeeeeeEeeehhHHHHHhh--hhHHHHHHHHHHHHHHHHHHHHHHHhhcccc--- Confidence 1111 11111221123334555556666655557788888885 3468899888888887775322222222110 Q ss_pred CChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEE Q lcl|Aclame:pro 156 SDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAI 235 (355) Q Consensus 156 Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvi 235 (355) |. .+........|.|+ +++...+++.|. =+++ T Consensus 239 ------------------------------------~~-------~~~~~~~~~~d~l~-~~~~~~~~~~~~----a~~v 270 (394) T protein:vir:10 239 ------------------------------------FT-------AKATTTDTLVDSLK-HILNVDLDPAYS----RALV 270 (394) T ss_pred ------------------------------------cc-------cccccccccHHHHH-HHHHhhhhhhcc----CEEE Confidence 00 01112345667654 456667788764 2799 Q ss_pred EcHHHHHHHHHHHHhhccc-cc--hhhHHHHH-HhhhhhcccccccCCc--cCCC----cEEEecCCCcEEEEeeCcEEE Q lcl|Aclame:pro 236 VGRKLLADKYFPLVNKQQE-NS--ESLAADII-ISQKRIGNLPAVRVPY--FPAN----AVLVTTLENLSIYFMDESHRR 305 (355) Q Consensus 236 vG~dLl~~k~~~l~n~~~~-~t--e~~aa~~~-~~~k~iGGlpa~~~Pf--fP~~----~ilIT~l~NLsIY~Q~gs~RR 305 (355) |.+..+.. +..+--.++ |- ...-.... ....++-|+|++.++. +|.. .+++-.|++.-+.+-++..+- T Consensus 271 mn~~~~~~--l~~lkd~~G~~i~~~~~~~~~~~~~~~~L~G~PV~~~~~~~~~~~~~~~~i~~gd~s~~~~~~~~~~~~v 348 (394) T protein:vir:10 271 VTQSLFNT--LDTLKDKNGRYLLHDASDSITDGTAKGTVLGVPVYVVGDALLGSAAGDQKAFVGDLKRGVLFADRQQVTL 348 (394) T ss_pred ecHHHHHH--HHHhhccCCCeeeeccccccccCCcccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEE Confidence 99887662 333322221 10 00000000 1124788999988774 3322 278888887433333333444 Q ss_pred EEEEccchhhhhhhhhhhhhh-hccccccEEEE-ecceecCccCCCCcCCCC Q lcl|Aclame:pro 306 SIDENPKKDRVENYESMNIDY-VVEVYAAGCLL-ENITLGDFTAPAAPESGA 355 (355) Q Consensus 306 ~~~d~p~r~rve~y~s~Ne~Y-vVEd~~~~a~i-enI~~~~~~~~~~~~~~a 355 (355) ...++....+ +| +++.+++...- +.|.+.....+++++.++ T Consensus 349 ~~~~~~~~~~---------~~~~~~r~d~~~~~~~ai~~~~~~~~~~~~~~~ 391 (394) T protein:vir:10 349 AWEDSKIYGR---------YLGAAFRFGVKQADSNAGYFVTNTDAASGSTSG 391 (394) T ss_pred EEecccccce---------eEEEEEEeccEEeccccEEEEEeecccCCCCCC Confidence 4444333222 22 22333322222 255554444444444444 No 55 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=98.07 E-value=1.7e-06 Score=52.26 Aligned_cols=300 Identities=12% Similarity=0.069 Sum_probs=166.2 Q ss_pred CC-----HHHHHHHHHHHHHHHHHhCCChHH-cceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccc Q lcl|Aclame:pro 1 MR-----PETRFKFNAYLTRVAELNNISTDD-VSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTG 74 (355) Q Consensus 1 M~-----~~tr~~f~~y~~~~A~~ngv~~~~-v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~ 74 (355) |+ +.+++.|..+....+..+-..... ....+.|-+.+...+.+.+.+.|.++++++++++.-... ++-.-.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~-~~p~~~~~ 79 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEK-KFTFWADK 79 (324) T ss_pred CCcchhhhHHHHHHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCce-EEEEEecC Confidence 54 455666666666655544222211 123567777788999999999999999999988763221 22222233 Q ss_pred cccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhccccccc Q lcl|Aclame:pro 75 TIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRAD 154 (355) Q Consensus 75 ~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~ 154 (355) +-+.=+ +.+... |..-..++...+..++.---..|+.+.|+... ++|+..+.+.+.+.++.-.-.-.|+|+-.. T Consensus 80 ~~a~~v--~Eg~~~-~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~--~~l~~~i~~~la~ai~~~~d~a~l~G~g~~- 153 (324) T protein:vir:96 80 PGAYWV--GEGQKI-ETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNN- 153 (324) T ss_pred cceeEe--cCCccc-cccccceeEEEEeeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhccCCCC- Confidence 333322 222233 33334577788888888888889988888553 689999999999999988877888884311 Q ss_pred CCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEE Q lcl|Aclame:pro 155 TSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVA 234 (355) Q Consensus 155 ~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVv 234 (355) .. | .-++.... . .. ....+...|.+|-.+.. . +++.+++.. ++ T Consensus 154 -~~----~------------------~gi~~~~~----~-~~--~~~~~~~t~~~i~~~~~----~-l~~~~~~~~--~~ 196 (324) T protein:vir:96 154 -PF----G------------------KSIAQSIE----K-TN--KVIKGDFTQDNIIDLEA----L-LEDDELEAN--AF 196 (324) T ss_pred -Cc----C------------------cccccccc----c-cc--eeccccccHHHHHHHHH----h-hhhccCCCC--EE Confidence 10 1 11111000 0 00 01122234555554433 2 344444332 68 Q ss_pred EEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccC--CCcEEEecCCCcEEEEeeCcEEEEEEEccc Q lcl|Aclame:pro 235 IVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFP--ANAVLVTTLENLSIYFMDESHRRSIDENPK 312 (355) Q Consensus 235 ivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP--~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p~ 312 (355) +|.+..... -..+-.....+. +.. ....++-|+|++..|..+ ++.+++-.++++ ++..++..+=.+.+++. T Consensus 197 vmn~~~~~~-L~~l~d~~G~~~--~~~---~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~-~~g~~~~~~i~~~~~~~ 269 (324) T protein:vir:96 197 ISKTQNRSL-LRKIVDPETKER--IYD---RNSDSLDGLPVVNLKSSNLKRGELITGDFDKL-IYGIPQLIEYKIDETAQ 269 (324) T ss_pred EEcHHHHHH-HHHhhccCCCee--ecC---CCCCcccceeeEeeCCCCCCcceEEEEecceE-EEEEecCcEEEEeeccc Confidence 888776552 112222211111 111 123579999999888744 556888899886 45555555555544432 Q ss_pred --------hhhhhhhhhhh--------hhhhccccccEEEEecceecCccCCCCc Q lcl|Aclame:pro 313 --------KDRVENYESMN--------IDYVVEVYAAGCLLENITLGDFTAPAAP 351 (355) Q Consensus 313 --------r~rve~y~s~N--------e~YvVEd~~~~a~ienI~~~~~~~~~~~ 351 (355) -..+..|++-. -++.|-+.+++|.+...+.+....|.+. T Consensus 270 ~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:96 270 LSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) T ss_pred ccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccccCCCCCCCC Confidence 22222222211 2556666666666665555443333333 No 56 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=98.07 E-value=1.7e-06 Score=52.26 Aligned_cols=300 Identities=12% Similarity=0.069 Sum_probs=166.2 Q ss_pred CC-----HHHHHHHHHHHHHHHHHhCCChHH-cceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccc Q lcl|Aclame:pro 1 MR-----PETRFKFNAYLTRVAELNNISTDD-VSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTG 74 (355) Q Consensus 1 M~-----~~tr~~f~~y~~~~A~~ngv~~~~-v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~ 74 (355) |+ +.+++.|..+....+..+-..... ....+.|-+.+...+.+.+.+.|.++++++++++.-... ++-.-.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~-~~p~~~~~ 79 (324) T protein:vir:78 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEK-KFTFWADK 79 (324) T ss_pred CCcchhhhHHHHHHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCce-EEEEEecC Confidence 54 455666666666655544222211 123567777788999999999999999999988763221 22222233 Q ss_pred cccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhccccccc Q lcl|Aclame:pro 75 TIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRAD 154 (355) Q Consensus 75 ~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~ 154 (355) +-+.=+ +.+... |..-..++...+..++.---..|+.+.|+... ++|+..+.+.+.+.++.-.-.-.|+|+-.. T Consensus 80 ~~a~~v--~Eg~~~-~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~--~~l~~~i~~~la~ai~~~~d~a~l~G~g~~- 153 (324) T protein:vir:78 80 PGAYWV--GEGQKI-ETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNN- 153 (324) T ss_pred cceeEe--cCCccc-cccccceeEEEEeeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhccCCCC- Confidence 333322 222233 33334577788888888888889988888553 689999999999999988877888884311 Q ss_pred CCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEE Q lcl|Aclame:pro 155 TSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVA 234 (355) Q Consensus 155 ~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVv 234 (355) .. | .-++.... . .. ....+...|.+|-.+.. . +++.+++.. ++ T Consensus 154 -~~----~------------------~gi~~~~~----~-~~--~~~~~~~t~~~i~~~~~----~-l~~~~~~~~--~~ 196 (324) T protein:vir:78 154 -PF----G------------------KSIAQSIE----K-TN--KVIKGDFTQDNIIDLEA----L-LEDDELEAN--AF 196 (324) T ss_pred -Cc----C------------------cccccccc----c-cc--eeccccccHHHHHHHHH----h-hhhccCCCC--EE Confidence 10 1 11111000 0 00 01122234555554433 2 344444332 68 Q ss_pred EEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccC--CCcEEEecCCCcEEEEeeCcEEEEEEEccc Q lcl|Aclame:pro 235 IVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFP--ANAVLVTTLENLSIYFMDESHRRSIDENPK 312 (355) Q Consensus 235 ivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP--~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p~ 312 (355) +|.+..... -..+-.....+. +.. ....++-|+|++..|..+ ++.+++-.++++ ++..++..+=.+.+++. T Consensus 197 vmn~~~~~~-L~~l~d~~G~~~--~~~---~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~-~~g~~~~~~i~~~~~~~ 269 (324) T protein:vir:78 197 ISKTQNRSL-LRKIVDPETKER--IYD---RNSDSLDGLPVVNLKSSNLKRGELITGDFDKL-IYGIPQLIEYKIDETAQ 269 (324) T ss_pred EEcHHHHHH-HHHhhccCCCee--ecC---CCCCcccceeeEeeCCCCCCcceEEEEecceE-EEEEecCcEEEEeeccc Confidence 888776552 112222211111 111 123579999999888744 556888899886 45555555555544432 Q ss_pred --------hhhhhhhhhhh--------hhhhccccccEEEEecceecCccCCCCc Q lcl|Aclame:pro 313 --------KDRVENYESMN--------IDYVVEVYAAGCLLENITLGDFTAPAAP 351 (355) Q Consensus 313 --------r~rve~y~s~N--------e~YvVEd~~~~a~ienI~~~~~~~~~~~ 351 (355) -..+..|++-. -++.|-+.+++|.+...+.+....|.+. T Consensus 270 ~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:78 270 LSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) T ss_pred ccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccccCCCCCCCC Confidence 22222222211 2556666666666665555443333333 No 57 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=98.07 E-value=9e-07 Score=53.74 Aligned_cols=306 Identities=14% Similarity=0.134 Sum_probs=152.9 Q ss_pred CCHHHHHHHHHHHHHHHHH--hCCChHH------cceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAEL--NNISTDD------VSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGV 72 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~--ngv~~~~------v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv 72 (355) +..+.|..|..|+...... ....... ....+.|-+.+.+.+.+.+++.+.+++.++++++.-... ++.... T Consensus 79 ~~~e~~~a~~~~lr~~~~~~~~~~e~~a~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-~~~~~~ 157 (401) T protein:vir:44 79 VAAEHKDAFVGFLRKGREDGLRDLERKALQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQEATVITVGGSDY-KKLVNL 157 (401) T ss_pred hhHHHHHHHHHHHhhhhhhhhHHHHHHHhhcCCCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCce-EEEEec Confidence 6666788888887432111 0000000 112467767778899999999999999999998854322 333334 Q ss_pred cccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhccccc Q lcl|Aclame:pro 73 TGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTR 152 (355) Q Consensus 73 ~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~ 152 (355) +++.++-+.- +......+...++...|.-++.---+.|+.+.|+. ...+|+..+.+.+.+.++.-.-.--+||+-. T Consensus 158 ~~~~a~wv~E--~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~la~ai~~~~~~~~l~G~G~ 233 (401) T protein:vir:44 158 GGTASGWVGE--TDTRSQTATSRLGLIEPFMGEIYGNPQATQKMLDD--AFFNVEAWINSELATEFAEQEEIAFTTGDGT 233 (401) T ss_pred CCccceeecc--ccccCccccccceeeeeehhheeeehhhhHHHHhc--chHHHHHHHHHHHHHHHHHHHHhhhhccCCC Confidence 4444433221 11221122223444555555555556688888874 2348999999999999998888888888431 Q ss_pred ccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCe Q lcl|Aclame:pro 153 ADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNL 232 (355) Q Consensus 153 A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~L 232 (355) .+| +|.|............ ..+. ...+..|. .+..+.|.++ +++..| ++.|+. .- T Consensus 234 -------~~p------~Gil~~~~~~~~~~~~-----~~~~--~~~~~t~~-~~~~~~d~i~-~~~~~l-~~~~~~--~a 288 (401) T protein:vir:44 234 -------KKP------KGFLAYESTEESDKAR-----AFGK--LQHIVSGE-ATAVTADAII-KLIYTL-RKAHRT--GA 288 (401) T ss_pred -------Ccc------ceeecccccccccccc-----cccc--cccccccc-ccccCHHHHH-HHHHhc-chhhhc--CC Confidence 122 3555433221111111 0111 11111222 2223456544 577665 666665 34 Q ss_pred EEEEcHHHHHHHHHHHHhhccccchhhHHHHH-HhhhhhcccccccCCccCCCc-----EEEecCCCcEEEEeeCcEEEE Q lcl|Aclame:pro 233 VAIVGRKLLADKYFPLVNKQQENSESLAADII-ISQKRIGNLPAVRVPYFPANA-----VLVTTLENLSIYFMDESHRRS 306 (355) Q Consensus 233 VvivG~dLl~~k~~~l~n~~~~~te~~aa~~~-~~~k~iGGlpa~~~PffP~~~-----ilIT~l~NLsIY~Q~gs~RR~ 306 (355) +++|.+.... +-..|-+....|- ....+. ....+|-|+|++..+++|..+ +++=.|+-.-..+.+...+-. T Consensus 289 ~~v~n~~~~~-~L~~lkd~~G~~l--~~~~~~~g~~~~l~G~PVv~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~~~ 365 (401) T protein:vir:44 289 KFMMNNNSLF-AIRLLKDTEGNYL--WRPGLELGQPSSLAGYGIAENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRIL 365 (401) T ss_pred EEEEcHHHHH-HHHHhhccCCcee--ecCCcCCCCCceecceeeEEecCcCCccCCccEEEEeehhccEEEEEecceEEe Confidence 7899988654 2222323322221 000000 123579999999999999644 666666532222333333322 Q ss_pred EEEccchhhhhhhhhhh-hhhhccccccEEEEe--cceecCccCC Q lcl|Aclame:pro 307 IDENPKKDRVENYESMN-IDYVVEVYAAGCLLE--NITLGDFTAP 348 (355) Q Consensus 307 ~~d~p~r~rve~y~s~N-e~YvVEd~~~~a~ie--nI~~~~~~~~ 348 (355) . + +|.-.| .+|.++..--++.++ .+.+...++. T Consensus 366 ~--~-------~~~~~~~v~~~a~~r~d~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 366 R--D-------PYTNKPFVGFYTTKRTGGMLVDSQAIKLLKIAAA 401 (401) T ss_pred e--e-------ccccCCcEEEEEEEEeccEEecccceEEEEeecC Confidence 1 1 111111 223222211122221 3333221111 No 58 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=98.06 E-value=6.5e-07 Score=54.52 Aligned_cols=294 Identities=15% Similarity=0.086 Sum_probs=146.8 Q ss_pred CCHHHHHHHHHHH--------HHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCc-Cccccchhhhhhhhccc Q lcl|Aclame:pro 1 MRPETRFKFNAYL--------TRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKT-INILPVAEMKGEKIGVG 71 (355) Q Consensus 1 M~~~tr~~f~~y~--------~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~-INv~~V~e~~Ge~v~lg 71 (355) +.......+.... .......+.. ......+-|++...++..+.+.+..|+. .+++++..-..-.+-.. T Consensus 85 ~~~~~~~~~r~g~~~~~~~~~~~~~~~~~t~---~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 161 (392) T protein:vir:13 85 ADHDDDAVLRAGNLGEARSFEFAPEKRDGTK---AGNPNVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTVI 161 (392) T ss_pred hhHHHHHHHhccchhhhHHHHhhhhhhcccc---cCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEE Confidence 1111111111110 0011111111 1123345666777766666666655554 46666543222233333 Q ss_pred ccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccc Q lcl|Aclame:pro 72 VTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTT 151 (355) Q Consensus 72 v~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s 151 (355) .+++-++=+ +.+......+ ..++...|..++.---+.|+++.|+.. .++|+..+.+.+.+.++.=.-.--+||+- T Consensus 162 ~~~~~a~~v--~E~~~~~~~~-~~f~~v~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~i~~~~d~~~l~G~G 236 (392) T protein:vir:13 162 TGRATAGIV--GETAEIPESY-PATTQRSMGGFKYGFASVVSYEFATDQ--VLDLVGFLVSDAGPAIGDAMGRHFLTGTG 236 (392) T ss_pred cCCcceeee--cccccccccc-cceeeEEeeeeeEEeeehhHHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHHhcccC Confidence 333434322 2222332333 457778888888888889999999975 34788888888888887655555566631 Q ss_pred cccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCC Q lcl|Aclame:pro 152 RADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPN 231 (355) Q Consensus 152 ~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~ 231 (355) +..| +|+|... +.... .+.. ...+....|.| .+++.+| ++.|+.. T Consensus 237 -------t~~p------~Gil~~~------------~~~~~-----~~~~-~~~~~~~~d~l-~~~~~~l-~~~~~~~-- 281 (392) T protein:vir:13 237 -------TGQP------RGILTDA------------TGANA-----AFGE-ADADSKVSDAL-IDLFHEV-PSAYRKN-- 281 (392) T ss_pred -------Cccc------ccccccc------------ccccc-----cccc-cccccccHHHH-HHHHHhh-hhhhhcC-- Confidence 1123 2555321 10000 0111 12233445554 3567655 6667754 Q ss_pred eEEEEcHHHHHHHHHHHHhhccccc--hhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEE Q lcl|Aclame:pro 232 LVAIVGRKLLADKYFPLVNKQQENS--ESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDE 309 (355) Q Consensus 232 LVvivG~dLl~~k~~~l~n~~~~~t--e~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d 309 (355) -+++|.+.... +...|-+....+- ....+ ....++-|+|++..+++|++.|++-.|+++-| .+++.++=.... T Consensus 282 a~~v~n~~~~~-~l~~lkd~~G~~l~~~~~~~---g~~~~l~G~Pv~~~~~~~~~~i~~Gdf~~~~i-~~~~~~~i~~~~ 356 (392) T protein:vir:13 282 AKFVVNDLRAA-QMRKLKDANGQYLWQSALTV---GAPDTFNGKVVETDDGMPADKVLFADLSKYRV-RFAGSLRVDRSV 356 (392) T ss_pred CEEEEcHHHHH-HHHHhhccCCceeecCCcCC---CCCceecceeeEEcCCCCCCcEEEeeccceeE-EeecceEEEeec Confidence 47888888765 2222333222221 11111 11347899999999999999999999988643 444544432222 Q ss_pred ccchhhhhhhhhhh-hhhhccccccEEEEe--cceecCccCCC Q lcl|Aclame:pro 310 NPKKDRVENYESMN-IDYVVEVYAAGCLLE--NITLGDFTAPA 349 (355) Q Consensus 310 ~p~r~rve~y~s~N-e~YvVEd~~~~a~ie--nI~~~~~~~~~ 349 (355) + .|...| .+|..+.+--+..++ .+.+....+.+ T Consensus 357 ~-------~~~~~~~~~~r~~~r~d~~~~~~~A~~~~~~~~aa 392 (392) T protein:vir:13 357 D-------AKFSTDQIVYRFLQRADGLLVDARGAKVLTVTPAA 392 (392) T ss_pred c-------ccccCCcEEEEEEEEeccEEecccceEEEEeeccC Confidence 2 233333 344444333333332 34433322222 No 59 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=98.05 E-value=2.7e-06 Score=51.12 Aligned_cols=289 Identities=10% Similarity=0.052 Sum_probs=153.1 Q ss_pred CCHHHHHHHHHHHHHH-----HHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcc--ccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRV-----AELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGV--GVT 73 (355) Q Consensus 1 M~~~tr~~f~~y~~~~-----A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~l--gv~ 73 (355) ++..-+..|..|+..- +....... ....+.|-..+...+.+.+.+.+.+++++++++++...|..... ... T Consensus 86 ~~~~~~~~~~~~l~~~~~~~~~~~~~~t~--~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 163 (397) T protein:vir:49 86 VKAGFVKDFKNLVRGRYQNLLDSKTDASG--SDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYEKWTDI 163 (397) T ss_pred HHHHHHHHHHHHHhcchhHHHHHhhcccc--ccCcccccHhHHHHHHHHHHhhhhHHhhhceeecccCccceEEEeeccC Confidence 3444455565554321 11111111 22456776677889999999999999999999999888876533 223 Q ss_pred ccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccc Q lcl|Aclame:pro 74 GTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRA 153 (355) Q Consensus 74 ~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A 153 (355) ++.++-+..+ ......+...++...+.+++.---+.|+.+.|+.- .++|+..+.+.+.++++.-.-.--++|+... T Consensus 164 ~~~a~~v~E~--~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~ 239 (397) T protein:vir:49 164 TGLANIDDEA--GKIADVDDPKLSLIKYTIKRYAGISTVTNSLLADS--AENILAWLSGWIAKKVVVTRNKAILEAIAAL 239 (397) T ss_pred CcceeeecCc--cccccccccceeeEEeeeeeEEeeehhHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Confidence 3444443332 22222333456677788888877788999988753 3589999999999998876666666663311 Q ss_pred cCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeE Q lcl|Aclame:pro 154 DTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLV 233 (355) Q Consensus 154 ~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LV 233 (355) .. . + .-.+.|. +.+++.. +++.|+. .-+ T Consensus 240 ~~---------------------------------------~------~---~~~~~d~-i~~~~~~-l~~~~~~--~a~ 267 (397) T protein:vir:49 240 PT---------------------------------------K------P---TLTKWDD-IIDLEAK-VDPAIKQ--TSF 267 (397) T ss_pred cc---------------------------------------c------c---ccccHHH-HHHHHHh-hhhhhcC--CCE Confidence 10 0 0 0124454 3345654 4666664 458 Q ss_pred EEEcHHHHHHHHHHHHhhccccchhhHHHHH-HhhhhhcccccccCCc--cCCCc-----EEEecCCCcEEEEeeCcEEE Q lcl|Aclame:pro 234 AIVGRKLLADKYFPLVNKQQENSESLAADII-ISQKRIGNLPAVRVPY--FPANA-----VLVTTLENLSIYFMDESHRR 305 (355) Q Consensus 234 vivG~dLl~~k~~~l~n~~~~~te~~aa~~~-~~~k~iGGlpa~~~Pf--fP~~~-----ilIT~l~NLsIY~Q~gs~RR 305 (355) ++|.+..+. .+..+-..++.. .....+. ....+|-|+|++.++. +|.++ +++=.|++.-..+.++..+= T Consensus 268 ~vmn~~~~~--~l~~lkd~~G~~-l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i 344 (397) T protein:vir:49 268 FLTNTSGFT--ALKKVKNALGDY-LMERDVKSPTGYSIDGFAVKEVADRWLANGTGGAMPLYFGDLKQAVTLFDRQHMSL 344 (397) T ss_pred EEEcHHHHH--HHHHhhcCCCce-eeccCcCCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecceEE Confidence 899988765 344442222110 0100111 1235899999987653 66544 66667776433333333332 Q ss_pred EEEEccchhhhhhhhhhh-hhhhccccccEEEEe--c---ceecCccCCCCcCCC-C Q lcl|Aclame:pro 306 SIDENPKKDRVENYESMN-IDYVVEVYAAGCLLE--N---ITLGDFTAPAAPESG-A 355 (355) Q Consensus 306 ~~~d~p~r~rve~y~s~N-e~YvVEd~~~~a~ie--n---I~~~~~~~~~~~~~~-a 355 (355) ...+. .++++..| ..|.++.+--+.... . +++...+++++..+. | T Consensus 345 ~~~~~-----~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~ 396 (397) T protein:vir:49 345 LSTNI-----GGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQKGNLGSTA 396 (397) T ss_pred EEecc-----ccchhhcCceeEEEEeeeCcEEecccceEEEEeecccCCCCCccccc Confidence 22111 12233333 234333332222222 2 333332222211111 1 No 60 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=98.04 E-value=4e-07 Score=55.68 Aligned_cols=303 Identities=13% Similarity=0.091 Sum_probs=147.9 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt 80 (355) ++++-|..|++ +.+..+ ....+.|-+.+.+.+++.+++.|.++++++++++.- ...+-...+++-++-+ T Consensus 75 l~~ee~~~~~~----~~~~t~-----~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~~--~~~i~~~~~~~~a~w~ 143 (395) T protein:vir:95 75 LTSEERKFFND----INYDVG-----YTDEKILPETVVERVFDDLQKDHPLLSKINFQNAGI--KTRVIKADPAGQAVWG 143 (395) T ss_pred cchHHHHHHHH----HhhccC-----CCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEecCCcceEEe Confidence 44444443332 221111 223577888889999999999999999999988752 1223222222322211 Q ss_pred cCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhh Q lcl|Aclame:pro 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) Q Consensus 81 ~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~a 160 (355) +...++.+.....++...+.+++.---..|+.+.|+.= ..+++..+++.+.++++.=.-.--+||+-...+ T Consensus 144 --~e~~~~~~~~~~~f~~i~l~~~kl~~~~~iS~ell~ds--~~~ie~~i~~~la~~ia~~~~~a~i~G~G~~~~----- 214 (395) T protein:vir:95 144 --KVFGEIKGQLDAAFREENFTQYKLTCFVVLPDDLSTFG--PAWIERFVRTQIQEAISVALESAIINGGGAAKT----- 214 (395) T ss_pred --ecccccCccccccceeeeeceeeEEEeecccHHHHhcc--hhHHHHHHHHHHHHHHHHHHhhheeeccCCCCc----- Confidence 11123333333456667788888877788999999742 236888899999999888777777788543321 Q ss_pred hhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHH---hcc----cchhhhCCCCeE Q lcl|Aclame:pro 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDAT---NNL----IDEVYQDDPNLV 233 (355) Q Consensus 161 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~---~~l----id~~~~~~~~LV 233 (355) -| +|+|..+-... .... .+. .++. -.|.++|.++..+. ..+ .....+-...+. T Consensus 215 qP------~Gil~~~~~~~--~~~~-----~~~-~~~~------~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 274 (395) T protein:vir:95 215 QP------VGLMKDVNTNS--GAVT-----DKA-SSGT------LTFADADTTILELNDVLKNLSVDEKGKELKIDGKVA 274 (395) T ss_pred Cc------eeeeecccccc--cccc-----ccc-ccch------hhhhhhHhhHHHHHHHHHhhccccccchhhhcCceE Confidence 13 26664221100 0000 000 0000 12334443322211 100 111123345677 Q ss_pred EEEcHHHHHHHH-HHHHhhccccchhhHHHHHHhhhhhc-ccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEEcc Q lcl|Aclame:pro 234 AIVGRKLLADKY-FPLVNKQQENSESLAADIIISQKRIG-NLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENP 311 (355) Q Consensus 234 vivG~dLl~~k~-~~l~n~~~~~te~~aa~~~~~~k~iG-Glpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p 311 (355) ++|.+.-..+.. .++....+ ++-. ..+| |+|.+..++||++.++.-.+++..|+ .++.++=..-++. T Consensus 275 ~~mn~~t~~~~~g~~~~~~~~-------G~~~---~~lg~g~~v~~~~~~p~~~i~fgdfs~y~i~-~r~~~~i~~~~~~ 343 (395) T protein:vir:95 275 LVVNPRDSWDVQARYTYLTAN-------GGFV---TVLPYNVTIITSEFVPEGKLVAFVTDRYNAV-RGGGLTVKKFDQT 343 (395) T ss_pred EEEcchhhhhcCCcceeccCC-------Ccce---eccCCcceEEEcCCCCCCcEEEEecccEEEE-EecceEEEeccch Confidence 888765443221 11111111 1100 1122 77889999999999999998886554 3444433222211 Q ss_pred --chhhhhhhhhhhhhhhccccccEEEEecceecCccCCCCcCCCC Q lcl|Aclame:pro 312 --KKDRVENYESMNIDYVVEVYAAGCLLENITLGDFTAPAAPESGA 355 (355) Q Consensus 312 --~r~rve~y~s~Ne~YvVEd~~~~a~ienI~~~~~~~~~~~~~~a 355 (355) .++++..+-..--+-.+=|.+++.++ .|++.+++....+.+|. T Consensus 344 ~~~~d~~~f~~~~r~dg~~~~~~A~~~l-~i~~~~~~~~~~~~~~~ 388 (395) T protein:vir:95 344 LALEDAVLFTAKTFAYGQPDDNKASAVY-DLKVASAPRRQTSAGGT 388 (395) T ss_pred hhhCCcEEEEEEEEECCEEeccccEEEE-EeeccCCCCCCCCCCCC Confidence 11212111111112222233333222 24444443333333333 No 61 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=98.04 E-value=2.7e-06 Score=51.12 Aligned_cols=286 Identities=10% Similarity=0.084 Sum_probs=151.6 Q ss_pred CCHHHHHHHHHHHHHH----HHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccc--cc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRV----AELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGV--TG 74 (355) Q Consensus 1 M~~~tr~~f~~y~~~~----A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv--~~ 74 (355) +...-+..|.+|+..- .+...... .....+.|-+.+...+.+.+.+.+.+++.+++++++...|.....-. .+ T Consensus 86 ~~~~~~~~~~~~l~~~~~~~~~~~~~~t-~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 164 (397) T protein:vir:49 86 VKANFVKDFKNLVRGRYQNLLDSKTDGS-GSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKWADIT 164 (397) T ss_pred HHHHHHHHHHHHhhcchhhHHHhhhccC-CccCcceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEeeccCC Confidence 4445555666665321 11111110 11235677666678899999999999999999999888776543321 22 Q ss_pred cccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhccccccc Q lcl|Aclame:pro 75 TIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRAD 154 (355) Q Consensus 75 ~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~ 154 (355) +.+.-+.-+ ......+...++...+.+++.---+.|+.+.|.... .+|+..+.+.+.++++.-.-.--++|+-... T Consensus 165 ~~a~~v~E~--~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~~~~~~~d~ail~G~g~~~ 240 (397) T protein:vir:49 165 GLAKLDDEG--GQIGQNDDPKLSLIRYAIKRYAGISTVTNSLLADSA--ENILAWLSGWIAKKVVVTRNKAILEAIGTLP 240 (397) T ss_pred cceeeeccc--cccccccccceeeeEeeeeeeEeehhhHHHHHhhhh--HHHHHHHHHHHHHHHHHHHHHHHHhcccccc Confidence 333333222 111122223356677888888777888988887532 4789999999999988877777777743110 Q ss_pred CCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEE Q lcl|Aclame:pro 155 TSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVA 234 (355) Q Consensus 155 ~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVv 234 (355) | . +.. .+.|.| .+++.. +++.|+. .-++ T Consensus 241 -------~--------------------------------~------~~~---~~~d~i-~~~~~~-l~~~~~~--~a~~ 268 (397) T protein:vir:49 241 -------N--------------------------------K------PTL---AKWDDI-IDLQAK-VDPAIKQ--TSLF 268 (397) T ss_pred -------c--------------------------------c------ccc---cCHHHH-HHHHHh-hhhhhcC--CCEE Confidence 0 0 001 134443 345544 4666664 3488 Q ss_pred EEcHHHHHHHHHHHH-hhccccchhhHHHHH-HhhhhhcccccccCC--ccCCC-----cEEEecCCCcEEEEeeCcEEE Q lcl|Aclame:pro 235 IVGRKLLADKYFPLV-NKQQENSESLAADII-ISQKRIGNLPAVRVP--YFPAN-----AVLVTTLENLSIYFMDESHRR 305 (355) Q Consensus 235 ivG~dLl~~k~~~l~-n~~~~~te~~aa~~~-~~~k~iGGlpa~~~P--ffP~~-----~ilIT~l~NLsIY~Q~gs~RR 305 (355) +|.+..+. ++..+ +....+- ....+. ....+|-|+|++.++ .+|.. .+++-.|++.-+++.++...= T Consensus 269 v~n~~~~~--~l~~lkd~~g~~l--~~~~~~~g~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i 344 (397) T protein:vir:49 269 LTNTSGFT--ALKKVKNAMGDYL--MERDVKSPTGYSIDGFVVKEISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQHLSL 344 (397) T ss_pred EEcHHHHH--HHHHhhccCCcee--ecccccCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecccEE Confidence 99988765 33333 2221110 000111 113489999998765 45543 467777776444444343332 Q ss_pred EEEEccchhhhhhhhhhh-hhh--------hccccccEEEEecceecCc--cCCCCcCCCC Q lcl|Aclame:pro 306 SIDENPKKDRVENYESMN-IDY--------VVEVYAAGCLLENITLGDF--TAPAAPESGA 355 (355) Q Consensus 306 ~~~d~p~r~rve~y~s~N-e~Y--------vVEd~~~~a~ienI~~~~~--~~~~~~~~~a 355 (355) ...+.. .+++-.| .+| .|-+...++.+ ++... ++|.++..|| T Consensus 345 ~~~~~~-----~~~~~~~~~~~~~~~r~d~~~~~~~a~~~~---~~~~~~~~~~~~~~~~~ 397 (397) T protein:vir:49 345 LSTNIG-----GGAFETDTTKVRVIDRFDVVSTDTEAFVPA---SFKAIADQKAKLSTAGA 397 (397) T ss_pred EEeccc-----cchhhcCeeeEEEEEeeccEEecccceEEE---EecccccccCcccccCC Confidence 221111 1222222 233 33334444433 33222 3345555555 No 62 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=98.02 E-value=2.1e-06 Score=51.67 Aligned_cols=298 Identities=10% Similarity=0.076 Sum_probs=144.0 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhccc---cccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVG---VTGTIA 77 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lg---v~~~ia 77 (355) .....+..+.+.. ..+...+.. ....+.|-+...+.++..+.+.+.+++.+++++++-..+...... +...-+ T Consensus 102 ~~~~~~~~~~~~~-~~~~~~~~~---~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~a 177 (413) T protein:vir:81 102 VGEYVAPRVKAAS-DPASTATLT---DEFQGGYGTTWNRNIIYRRREKLVVADLMDNLTMTNTTIKYLMEKANRVVEGGF 177 (413) T ss_pred hhhhhhhHHHhhh-hhhhhcccc---cccccccchhhHHHHHHHHhhhhhHHhhcceeeccCCceeEEEecccccccccc Confidence 0000000111110 111111111 234566778888999999999999999999998876544322111 111112 Q ss_pred ccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCC Q lcl|Aclame:pro 78 STTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSD 157 (355) Q Consensus 78 ~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td 157 (355) +-+.-+ ......+...++...+..++.--.+.|+.+.|+.. +.|...+++.+.++++.=.-.--+||+- T Consensus 178 ~~v~Eg--~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds---~~l~~~i~~~la~~~~~~~d~~~l~G~G------ 246 (413) T protein:vir:81 178 KTVAEG--GKKPYMRFADFDIVTESLSKIAGLTKITDEMIEDY---DFLVSYINARLLEELAIEEERQLLLGDG------ 246 (413) T ss_pred ceecCc--ccccccCcccceeeEeeeeeEEEeehhhHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhccCC------ Confidence 222211 12212232345566777777766788999999875 4689999999998888766666678732 Q ss_pred hhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhccc-chhhhCCCCeEEEE Q lcl|Aclame:pro 158 RTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLI-DEVYQDDPNLVAIV 236 (355) Q Consensus 158 ~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~li-d~~~~~~~~LVviv 236 (355) +.+| -+|++... +.. .+..+.+.++ .| .+.+++..+. +.-++ +. .++| T Consensus 247 -~~~~-----~~Gi~~~~----------------~~~---~~~~~~~~~~--~~-~i~~~~~~~~~~~~~~--~~-~~vm 295 (413) T protein:vir:81 247 -TGNN-----LTGLLKRD----------------GIQ---TLAVSNKDEL--AD-SIYKAMTNISLATPFQ--AD-ALVI 295 (413) T ss_pred -CCCc-----cccccccc----------------ccc---cccccccchh--HH-HHHHHHHHhhhhccCC--Cc-EEEE Confidence 1111 12554310 000 1112222222 22 2333333222 22222 22 4777 Q ss_pred cHHHHHHHHHHHHhhccccc------hhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEEc Q lcl|Aclame:pro 237 GRKLLADKYFPLVNKQQENS------ESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDEN 310 (355) Q Consensus 237 G~dLl~~k~~~l~n~~~~~t------e~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~ 310 (355) .+..+.. -..|-.....+- .....-......++-|+|++..+++|++.+++-.+++-...+.++...=.+-+. T Consensus 296 n~~~~~~-l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~l~G~pv~~s~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~ 374 (413) T protein:vir:81 296 NPLDYQE-LRLAKDANGQYYGGGVFQGQYGSGGIMLDPAPWGLRTVQSQVVPVGKPVVGAFRSAASVLRKGGVRIDSTNT 374 (413) T ss_pred cHHHHHH-HHHhhccCCceeccccccccccccccccCceecceeeEEcCCCCcccEEEEecccEEEEEEecceEEEEecc Confidence 7765552 222222211110 000111112235788999999999999999999998743333333333222221 Q ss_pred c----chhhhhhhhhhhhhhhccccccEEEEecceecCccCC Q lcl|Aclame:pro 311 P----KKDRVENYESMNIDYVVEVYAAGCLLENITLGDFTAP 348 (355) Q Consensus 311 p----~r~rve~y~s~Ne~YvVEd~~~~a~ienI~~~~~~~~ 348 (355) . .++++.-.-..--++.|-+..+++.+ ++..+.+| T Consensus 375 ~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l---~~~~~~~p 413 (413) T protein:vir:81 375 NVDDFENNLITVRAEERVGLMVTFPEAIVQL---DVAEVVTP 413 (413) T ss_pred ccchhhcCcEEEEEEEeeccEEecccceEEE---EecCCCCC Confidence 1 12222211111123344444444432 34443344 No 63 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=98.02 E-value=2.5e-06 Score=51.26 Aligned_cols=295 Identities=12% Similarity=0.011 Sum_probs=153.5 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCCh-------------HHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhh Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNIST-------------DDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEK 67 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~-------------~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~ 67 (355) ....-...+..+........+... ........+-|.....+.+.+.+.+.+++.++++++.--.... T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 159 (390) T protein:vir:81 80 DMFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEY 159 (390) T ss_pred hhhhhhHHHHHHHHHHhhhhhhhhhHHHHHHHhhccccccCCcceechhhhHHHHHHHhhhhhhhhhcceeeccCCceEE Confidence 000001112222222211111100 0112345677888899999999999999999999887555445 Q ss_pred hcccccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhh Q lcl|Aclame:pro 68 IGVGVTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGF 147 (355) Q Consensus 68 v~lgv~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGf 147 (355) ..+....+-+.-+. .+......+ ..++...+..++.--.+.|+.+.|+.. ++++..+.+.+.+.++.-.-.--+ T Consensus 160 ~~~~~~~~~a~~v~--Eg~~~~~~~-~~~~~i~~~~~k~~~~~~is~ell~d~---~~~~~~i~~~l~~~~~~~~d~a~l 233 (390) T protein:vir:81 160 VQETGFVNNAAIVA--EGALKPESS-LKFAKKTDTTHVIAHTMKATRQILSDA---PQLASYMNNRLIRGLKVKEDAEIL 233 (390) T ss_pred EEEecCCcceeeec--CCccccccc-ceeeEEEEeeeEEEEeehhhHHHHHhH---HHHHHHHHHHHHHHHHHHHHHHHH Confidence 54432222222121 122333333 357778889998888899999999874 579999999999998887777777 Q ss_pred cccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhh Q lcl|Aclame:pro 148 NGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQ 227 (355) Q Consensus 148 nG~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~ 227 (355) ||.- ++ .+| +|.+.. .. ... . ....++....|. +.+++..+.. .++ T Consensus 234 ~G~g----~~--~~~------~Gi~~~------------~~-----~~~--~-~~~~~~~~~~~~-~~~~~~~~~~-~~~ 279 (390) T protein:vir:81 234 RGTG----AN--DGL------LGLIPQ------------AT-----TYA--A-PTTIAGATRVDQ-LRLAMLQASL-AEY 279 (390) T ss_pred hcCC----CC--Ccc------cceeec------------cc-----ccc--c-ccccccchhHHH-HHHHHHhhcc-ccC Confidence 8832 11 112 233321 10 000 0 111222234454 4556665544 344 Q ss_pred CCCCeEEEEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEE Q lcl|Aclame:pro 228 DDPNLVAIVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSI 307 (355) Q Consensus 228 ~~~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~ 307 (355) ... +++|.+..+. +--.|-.....+-=..... ....++-|+|++..+++|++.+++=.+++.-..+.++..+=.. T Consensus 280 ~~~--~~v~~~~~~~-~l~~lkd~~G~~l~~~~~~--~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~ 354 (390) T protein:vir:81 280 NPS--GIVINPIDWA-AIELAKDANNQYLIGNARG--TLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEI 354 (390) T ss_pred CCC--EEEEcHHHHH-HHHHhhcCCCceeecCccc--ccCceecceeeEEcCCCCCCcEEEEehhceEEEEEecceEEEE Confidence 322 7888887654 2122222221111000000 1235788999999999999999999998843334444444333 Q ss_pred EEccchhhhhhhhhhh-hhhhccccccEEEEe--cceecCcc Q lcl|Aclame:pro 308 DENPKKDRVENYESMN-IDYVVEVYAAGCLLE--NITLGDFT 346 (355) Q Consensus 308 ~d~p~r~rve~y~s~N-e~YvVEd~~~~a~ie--nI~~~~~~ 346 (355) .+.+. |...| .+|.++.+--+..+. .+....-+ T Consensus 355 ~~~~~------~~~~~~v~~r~~~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 355 GYVGE------DFQRNMITVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred ecccc------hhhcCcEEEEEEEeeccEEecccceEEEEeC Confidence 22222 22233 234333333222222 22222211 No 64 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=98.00 E-value=2.9e-06 Score=50.92 Aligned_cols=292 Identities=10% Similarity=-0.014 Sum_probs=146.0 Q ss_pred CCHHHHHHHHHHHHHHH------------HHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhh Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVA------------ELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKI 68 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A------------~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v 68 (355) ++++....|..+...-. ...... .......+-|.....+++.+.+.+.++++++++++....+... T Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~ 160 (390) T protein:vir:10 83 VASEQFQASAGRWNDRSARATMNIKAALNTASTDA--AGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTDSALIEYV 160 (390) T ss_pred hhhHHHHHHHHhhhhhhhhhhhHHHHHHHhhhccc--ccccccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEE Confidence 11111111111111100 011111 0123455677788899999999999999999999876554444 Q ss_pred cccccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhc Q lcl|Aclame:pro 69 GVGVTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFN 148 (355) Q Consensus 69 ~lgv~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfn 148 (355) .+....+-++-+. .+......++ ..+...+..++.---+.|+.+.|+.- ++++..+.+.+.+.++.-.-.--++ T Consensus 161 ~~~~~~~~a~~v~--Eg~~~~~~~~-~~~~i~~~~~k~~~~~~is~ell~d~---~~l~~~i~~~l~~~~~~~~~~~il~ 234 (390) T protein:vir:10 161 QETGFVNNAAIVA--EGALKPESSL-KFAKKTDTTHVIAHTMKATRQILSDA---PQLASYMNNRLIRGLKVKEDAEILR 234 (390) T ss_pred EEecCCcceeeec--CCcccccccc-ceeEEEEeeEEEEEeehhhHHHHHhH---HHHHHHHHHHHHHHHHHHHHHHHhh Confidence 4322222222221 1222333333 46778888888888888999988863 5899999999999887755555556 Q ss_pred ccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhC Q lcl|Aclame:pro 149 GTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQD 228 (355) Q Consensus 149 G~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~ 228 (355) |+-. + .+|.+ +++... . . .+..+..+ ....|. +.+++..+ ++.++. T Consensus 235 G~G~----~--~~p~G------------------i~~~~~----~--~-~~~~~~~~-~~~~~~-~~~~~~~l-~~~~~~ 280 (390) T protein:vir:10 235 GTGA----N--DGLLG------------------LIPQAT----T--Y-AAPTTIAG-ATRVDQ-LRLAMLQA-SLAEYP 280 (390) T ss_pred cCCC----C--ccccc------------------cccccc----c--c-cccccccc-cchHHH-HHHHHHhh-ccccCC Confidence 6321 1 11222 221110 0 0 01111111 113343 55566655 454554 Q ss_pred CCCeEEEEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCC-cEEEEeeCcEEEEE Q lcl|Aclame:pro 229 DPNLVAIVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLEN-LSIYFMDESHRRSI 307 (355) Q Consensus 229 ~~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~N-LsIY~Q~gs~RR~~ 307 (355) .. +++|.+.-+. +-..|-+....+-=... .-..+.++-|+|++..+++|++.+++-.+++ .-|+...| .+=.+ T Consensus 281 ~~--~~v~n~~~~~-~L~~lkd~~g~~l~~~~--~~~~~~~l~G~pv~~~~~~p~~~~~~gdf~~~~~~~~~~~-~~i~~ 354 (390) T protein:vir:10 281 AS--GIVINPIDWA-AIELAKDANNQYLIGNA--RGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQWD-ARVEI 354 (390) T ss_pred CC--EEEEcHHHHH-HHHHhhcCCCceeecCC--cCcCCceecceeeEEcCCCCCCcEEEEeccceEEEEEecc-eEEEE Confidence 33 6778877554 11222222211110000 0112457899999999999999999999986 34444333 32222 Q ss_pred EEcc---chhhhhhhhhhhhhhhccccccEEEEecceec Q lcl|Aclame:pro 308 DENP---KKDRVENYESMNIDYVVEVYAAGCLLENITLG 343 (355) Q Consensus 308 ~d~p---~r~rve~y~s~Ne~YvVEd~~~~a~ienI~~~ 343 (355) -+.. .+|.+.-+-..--++.|-+..+++ .|+|+ T Consensus 355 ~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~---~~~~a 390 (390) T protein:vir:10 355 GYVNDDFQRNMVTVLAEERLALVVYRPEALI---SGSFA 390 (390) T ss_pred eecccccccCcEEEEEEEeeccEEeccccEE---EEEeC Confidence 2211 122222111112222333333333 33343 No 65 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=97.98 E-value=6.6e-07 Score=54.45 Aligned_cols=308 Identities=12% Similarity=0.031 Sum_probs=144.8 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt 80 (355) ++.+-|+.|+++.. +.+ ....|.|-+...+++++.+.+.|.+++.++++++.- +.++-...+++.|+=+ T Consensus 65 lt~~e~~~~~~~~~------~~~---~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~--~~~i~~~~~~~~a~w~ 133 (381) T protein:vir:10 65 LSANQRSFFMDINK------NVN---YKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWG 133 (381) T ss_pred ccHHHHHHHHHHhc------ccC---CCCceecCHHHHHHHHHHHHhhccceeheeeEecCc--ceEEEEecCCcceeee Confidence 55555555554322 121 234688999999999999999999999999887752 2234333333333221 Q ss_pred cCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhh Q lcl|Aclame:pro 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) Q Consensus 81 ~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~a 160 (355) . ....+.......++...+.+++.---..|+.+.|++ ...+++..+++.+.+++|.=.-.--+||+- +. T Consensus 134 ~--e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~D--s~~~ie~~i~~~la~~~a~~~~~a~i~G~G-------~~ 202 (381) T protein:vir:10 134 K--IYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDF--GPAWIERFVRVQIEEAFAVALETAFLKGTG-------KD 202 (381) T ss_pred c--ccccccccccccceeeeecceeEEeechhhHHHhhc--CHHHHHHHHHHHHHHHHHHHhhheeEeccC-------CC Confidence 1 112222222234556677777777778899999987 233788889999999888765555556643 12 Q ss_pred hhhhhccchhHHHHHHhhccccccccccccCCccccceeee-CCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHH Q lcl|Aclame:pro 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRV-GKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRK 239 (355) Q Consensus 161 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~-G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~d 239 (355) .| +|+|..+-. ...-+.+.... +.....++. ....-|..|.+++..+-......-..-....+++|.+. T Consensus 203 qP------~Gil~~~~~---~~~~~~g~~~~-~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~ 272 (381) T protein:vir:10 203 QP------IGLNRQVQK---GVSVTEGAYPE-KEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPS 272 (381) T ss_pred Cc------eeeeeccCc---ccccccccccc-cccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccc Confidence 23 355432211 00001110000 000001110 01111334444443322111110011235678999876 Q ss_pred HHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEEccchhhhhhh Q lcl|Aclame:pro 240 LLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPKKDRVENY 319 (355) Q Consensus 240 Ll~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~rve~y 319 (355) -... -.++..-.+.. ++.+. ..--|.+.+..++||++.++.-.+++--|. -++.++=..-+ +. -| T Consensus 273 t~~~-l~~~~~~~~~~-----G~~v~--~l~~g~~vv~s~~~p~~~iifgDfs~Y~i~-~r~~~~i~~~~--~~----~~ 337 (381) T protein:vir:10 273 DAFE-VQAQYTHLNAN-----GVYVT--ALPFNLNVIESTVQEAGKVLTYVKGLYDGY-LAGGINVQKFK--ET----LA 337 (381) T ss_pred cHHh-hccccccCCCC-----Cceee--cCCCCceEEecCCCCcCcEEEEecccEEEE-EecccEEEeec--hh----Hh Confidence 4431 11221111100 11000 011266688899999999999988885443 33333321111 11 11 Q ss_pred hhhhhhhhccccccEEEEe-------cceecCccCCCCcCCCC Q lcl|Aclame:pro 320 ESMNIDYVVEVYAAGCLLE-------NITLGDFTAPAAPESGA 355 (355) Q Consensus 320 ~s~Ne~YvVEd~~~~a~ie-------nI~~~~~~~~~~~~~~a 355 (355) ..-..+|.+-.+--+..++ .|++.+.+...+....- T Consensus 338 ~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~~~~~~~~~~ 380 (381) T protein:vir:10 338 LDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEET 380 (381) T ss_pred hcCCeEEEEEEEEcCEEecCceEEEEEEEecCCCcCccccccc Confidence 1111233222222122221 14443322111111111 No 66 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=97.98 E-value=6.6e-07 Score=54.45 Aligned_cols=308 Identities=12% Similarity=0.031 Sum_probs=144.8 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt 80 (355) ++.+-|+.|+++.. +.+ ....|.|-+...+++++.+.+.|.+++.++++++.- +.++-...+++.|+=+ T Consensus 65 lt~~e~~~~~~~~~------~~~---~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~--~~~i~~~~~~~~a~w~ 133 (381) T protein:vir:95 65 LSANQRSFFMDINK------NVN---YKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWG 133 (381) T ss_pred ccHHHHHHHHHHhc------ccC---CCCceecCHHHHHHHHHHHHhhccceeheeeEecCc--ceEEEEecCCcceeee Confidence 55555555554322 121 234688999999999999999999999999887752 2234333333333221 Q ss_pred cCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhh Q lcl|Aclame:pro 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) Q Consensus 81 ~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~a 160 (355) . ....+.......++...+.+++.---..|+.+.|++ ...+++..+++.+.+++|.=.-.--+||+- +. T Consensus 134 ~--e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~D--s~~~ie~~i~~~la~~~a~~~~~a~i~G~G-------~~ 202 (381) T protein:vir:95 134 K--IYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDF--GPAWIERFVRVQIEEAFAVALETAFLKGTG-------KD 202 (381) T ss_pred c--ccccccccccccceeeeecceeEEeechhhHHHhhc--CHHHHHHHHHHHHHHHHHHHhhheeEeccC-------CC Confidence 1 112222222234556677777777778899999987 233788889999999888765555556643 12 Q ss_pred hhhhhccchhHHHHHHhhccccccccccccCCccccceeee-CCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHH Q lcl|Aclame:pro 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRV-GKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRK 239 (355) Q Consensus 161 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~-G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~d 239 (355) .| +|+|..+-. ...-+.+.... +.....++. ....-|..|.+++..+-......-..-....+++|.+. T Consensus 203 qP------~Gil~~~~~---~~~~~~g~~~~-~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~ 272 (381) T protein:vir:95 203 QP------IGLNRQVQK---GVSVTEGAYPE-KEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPS 272 (381) T ss_pred Cc------eeeeeccCc---ccccccccccc-cccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccc Confidence 23 355432211 00001110000 000001110 01111334444443322111110011235678999876 Q ss_pred HHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEEccchhhhhhh Q lcl|Aclame:pro 240 LLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPKKDRVENY 319 (355) Q Consensus 240 Ll~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~rve~y 319 (355) -... -.++..-.+.. ++.+. ..--|.+.+..++||++.++.-.+++--|. -++.++=..-+ +. -| T Consensus 273 t~~~-l~~~~~~~~~~-----G~~v~--~l~~g~~vv~s~~~p~~~iifgDfs~Y~i~-~r~~~~i~~~~--~~----~~ 337 (381) T protein:vir:95 273 DAFE-VQAQYTHLNAN-----GVYVT--ALPFNLNVIESTVQEAGKVLTYVKGLYDGY-LAGGINVQKFK--ET----LA 337 (381) T ss_pred cHHh-hccccccCCCC-----Cceee--cCCCCceEEecCCCCcCcEEEEecccEEEE-EecccEEEeec--hh----Hh Confidence 4431 11221111100 11000 011266688899999999999988885443 33333321111 11 11 Q ss_pred hhhhhhhhccccccEEEEe-------cceecCccCCCCcCCCC Q lcl|Aclame:pro 320 ESMNIDYVVEVYAAGCLLE-------NITLGDFTAPAAPESGA 355 (355) Q Consensus 320 ~s~Ne~YvVEd~~~~a~ie-------nI~~~~~~~~~~~~~~a 355 (355) ..-..+|.+-.+--+..++ .|++.+.+...+....- T Consensus 338 ~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~~~~~~~~~~ 380 (381) T protein:vir:95 338 LDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEGTEET 380 (381) T ss_pred hcCCeEEEEEEEEcCEEecCceEEEEEEEecCCCcCccccccc Confidence 1111233222222122221 14443322111111111 No 67 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=97.97 E-value=1.8e-06 Score=52.04 Aligned_cols=291 Identities=14% Similarity=0.085 Sum_probs=140.6 Q ss_pred CCHHHHHHHHHHHHH------------HHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHh-CcCccccchhhhhhh Q lcl|Aclame:pro 1 MRPETRFKFNAYLTR------------VAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFL-KTINILPVAEMKGEK 67 (355) Q Consensus 1 M~~~tr~~f~~y~~~------------~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL-~~INv~~V~e~~Ge~ 67 (355) .+......+.+|+.. .....+.. .+....+-|++.++++..+.+.+..| +..+++++....+-. T Consensus 81 ~~~~~~~~~~~~~r~~~~~~~r~~~~~~~~~~~t~---~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~ 157 (390) T protein:vir:62 81 AQRSADVDDDATLRAGNLGEARSFEFAPEKRDGTK---AGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLD 157 (390) T ss_pred chhhcchHHHHHHhhhhhhhhHHHHhhhhhhcccc---cCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeE Confidence 111111112222211 00011111 12234456666666665554444444 566777765322222 Q ss_pred hcccccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhh Q lcl|Aclame:pro 68 IGVGVTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGF 147 (355) Q Consensus 68 v~lgv~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGf 147 (355) +-.-.+++-++-+. ........+ ..++...|..++.---+.|+++.|+.. .++|+..+++.+.++++.=.-.--+ T Consensus 158 ~p~~~~~~~a~wv~--E~~~~~~~~-~~f~~i~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~i~~~~d~~~l 232 (390) T protein:vir:62 158 FTVITGRSSASIVG--ETAEIPESY-PATAQRSMGGFKYGFASVVSYEFATDQ--VLDLVGFLVSDAGPAIGDAMGRHFI 232 (390) T ss_pred EEEEcCCcceeeec--ccccccccc-cceeeeEeeeeeEEeehHHHHHHHhhh--hHHHHHHHHHHHHHHHHHHHHhhhh Confidence 33333334443322 222332333 346777888888888889999999874 3478888999988888765555566 Q ss_pred cccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhh Q lcl|Aclame:pro 148 NGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQ 227 (355) Q Consensus 148 nG~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~ 227 (355) ||+- . | +|++... ...... +..+ ..+-.+.|.|+ +++.+| ++.|+ T Consensus 233 ~G~G-----~----p------~Gi~~~~------------~~~~~~-----~~~~-~~~~~~~~~l~-~~~~~l-~~~~~ 277 (390) T protein:vir:62 233 TGTG-----Q----P------RGILTDA------------SPATAT-----FLAT-DTDSKVSDALI-DLFHEV-PSAYR 277 (390) T ss_pred ccCC-----c----c------ccccccc------------cccccc-----eecc-cccccchHHHH-HHHHhh-hhhhh Confidence 7732 1 2 4655421 000000 1111 11222344333 455555 56666 Q ss_pred CCCCeEEEEcHHHHHHHHHHHH-hhcccc--chhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEE Q lcl|Aclame:pro 228 DDPNLVAIVGRKLLADKYFPLV-NKQQEN--SESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHR 304 (355) Q Consensus 228 ~~~~LVvivG~dLl~~k~~~l~-n~~~~~--te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~R 304 (355) . .-+++|.+..+. ++..+ .+...+ .....+ ....++.|+|++..+++|++.+++=.|+..-|. .++.+. T Consensus 278 ~--~a~~vmn~~~~~--~L~~lkd~~g~~l~~~~~~~---g~~~~l~G~Pv~~~~~~p~~~i~~gd~s~~~i~-~~~~~~ 349 (390) T protein:vir:62 278 A--NAKYVVNDLRAA--QMRKLKDANGQYLWQSGLTV---GAPSLFNGKVVETDDGMPADKILFADLSKYRVR-FAGSLR 349 (390) T ss_pred c--CCEEEEchHHHH--HHHHhhccCCCeeecCCcCC---CccceecccceEEecCCCCccEEEeeccceeEE-eecceE Confidence 5 447899988765 34333 222111 111111 123479999999999999999998777765333 233332 Q ss_pred EEEEEccchhhhhhhhhhh-hhhhccccccEEEEe--cceecCccCCC Q lcl|Aclame:pro 305 RSIDENPKKDRVENYESMN-IDYVVEVYAAGCLLE--NITLGDFTAPA 349 (355) Q Consensus 305 R~~~d~p~r~rve~y~s~N-e~YvVEd~~~~a~ie--nI~~~~~~~~~ 349 (355) =....+ .|...| -+|.++..--+..++ .|.+....+.+ T Consensus 350 v~~~~~-------~~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~~~~a 390 (390) T protein:vir:62 350 VDRSVD-------AKFSTDQIVYRFLQRADGLLVDARGAKVLTVTPGA 390 (390) T ss_pred EEeecc-------ccccCCcEEEEEEEEeCcEeechhheEEEEeecCC Confidence 222112 122233 233333222223332 33332222111 No 68 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=97.97 E-value=5e-06 Score=49.66 Aligned_cols=300 Identities=10% Similarity=0.083 Sum_probs=146.8 Q ss_pred CCHHHHHHHHHHHHHHHHHhC---------------------CChH-HcceeeecCcHHHHHHHHHHHhhHHHhCc-Ccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNN---------------------ISTD-DVSKKFTVEPSVTQTLMNTVQASSAFLKT-INI 57 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ng---------------------v~~~-~v~~~Fsv~P~~~q~L~~~iqess~FL~~-INv 57 (355) +. .-...|..|+..++..-| +... .....+.|-+.+.+.+.+.+++.+.+++. .++ T Consensus 91 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~l~~~~~i~~~~~~~ 169 (435) T protein:vir:14 91 LE-VKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGART 169 (435) T ss_pred hh-hhHHHHHHHHHHHHhhcchhhHHHHHHHhhhhhhhhhhhcccCCcCCCccccchhHHHHHHHHHhhhchhhhhccee Confidence 11 111223333332222111 1000 01123455555678899999998888765 344 Q ss_pred ccchhhhhhhh-cccccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHH Q lcl|Aclame:pro 58 LPVAEMKGEKI-GVGVTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVK 136 (355) Q Consensus 58 ~~V~e~~Ge~v-~lgv~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~ 136 (355) ++.. +|..- -.-.+++-+.-+.- .......+ ..++...|.+++.---+.|+.+.|+.-+-.|.|+..+.+.+.+ T Consensus 170 ~~~~--~~~~~~p~~~~~~~a~~v~E--~~~~~~~~-~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~l~~~i~~~l~~ 244 (435) T protein:vir:14 170 LPLS--NGNITIPRLKGGAIVGYIGA--DTDIPTTQ-QQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTA 244 (435) T ss_pred eecC--CCceEEEEEeCCcceeeecc--Cccccccc-cceeEEEeeeEEEEEeehhhHHHHHhhccCHHHHHHHHHHHHH Confidence 4433 33211 11112233332222 22222233 3467788889888888999999999977778899999999999 Q ss_pred HhhhhHHHHhhcccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHH Q lcl|Aclame:pro 137 RQALDLIMAGFNGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMD 216 (355) Q Consensus 137 ~~alD~i~IGfnG~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d 216 (355) +++.-.-.--++|+..+. +| +|++... .+..+ ...-.++.+..+.+.+.+ T Consensus 245 ai~~~~d~a~l~G~G~~~------~p------~Gi~~~~---~~~~~---------------~~~~~~~~~~~~~~~~~~ 294 (435) T protein:vir:14 245 AIGAREDKAFIRDDGTAN------TP------KGLRFWA---LPSNV---------------ITASDASTLQKIETDLGK 294 (435) T ss_pred HHHHHHHHHhhccCCCCc------cc------cceeecc---cccce---------------eccccccchhhHHHHHHH Confidence 988665555567743221 12 2544210 00001 111122222222222333 Q ss_pred HHhcccchhhhCCCCeEEEEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCC--------cEEE Q lcl|Aclame:pro 217 ATNNLIDEVYQDDPNLVAIVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPAN--------AVLV 288 (355) Q Consensus 217 ~~~~lid~~~~~~~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~--------~ilI 288 (355) ++..+ ...+......+++|.+...+. +..+.-.++..- . -+ ....+|-|+|++..+++|.+ .+++ T Consensus 295 l~~~~-~~~~~~~~~~~~v~n~~~~~~--L~~lkd~~G~~l-~-~~--~~~g~l~G~Pv~~~~~~p~~~~~~~~~~~i~~ 367 (435) T protein:vir:14 295 VILAL-ENADANLTQPGWIMAPRTFRF--LEGLRDGNGNKV-Y-PE--LANGMLKGYPVGKTTQVPINLGETGKESEIYF 367 (435) T ss_pred HHHHh-hhccccccCCEEEEcHHHHHH--HHHhhccCCcee-c-cC--CCCCeeecceeEeeccccccccCCCccceEEE Confidence 33222 111122235689999887752 333322222111 0 00 12357899999999999985 5788 Q ss_pred ecCCCcEEEEeeCcEEEEEEEccchh----hhhhhhhhhh-hh--------hccccccEEEEecceecC Q lcl|Aclame:pro 289 TTLENLSIYFMDESHRRSIDENPKKD----RVENYESMNI-DY--------VVEVYAAGCLLENITLGD 344 (355) Q Consensus 289 T~l~NLsIY~Q~gs~RR~~~d~p~r~----rve~y~s~Ne-~Y--------vVEd~~~~a~ienI~~~~ 344 (355) -.++..- +..++..+-.+.++.... .+.+|+.+|. +| .|=+.++++.+.++.++- T Consensus 368 gd~s~~~-i~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:14 368 TDFGDVF-IGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVAWGA 435 (435) T ss_pred eecccEE-EEEecccEEEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecCCCCC Confidence 8887743 445555555444433211 1123333332 22 222333333333333322 No 69 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=97.96 E-value=2.1e-06 Score=51.67 Aligned_cols=282 Identities=11% Similarity=0.097 Sum_probs=151.4 Q ss_pred CCHHHHHHHHHHHHH-HHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhh-hcccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTR-VAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEK-IGVGVTGTIAS 78 (355) Q Consensus 1 M~~~tr~~f~~y~~~-~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~-v~lgv~~~ia~ 78 (355) ++.+.+..|..|+.. ..++..... .....+.|-+.+...+.+.+.+.|.+++.+++++++-..|.. +..+.+++-+. T Consensus 71 ~~~~~~~~~~~~l~~~~~~a~~~~t-~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~~~~~~a~ 149 (371) T protein:vir:81 71 VKENEVEAFVNHIRTRFRNAMSEGS-NQDGGYTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKKRSQQTGFV 149 (371) T ss_pred hHHHHHHHHHHHHHHHHHHhhccCC-CccCceeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccee Confidence 555566667666543 333333322 123456777778899999999999999999999998766664 33333333333 Q ss_pred cccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCCh Q lcl|Aclame:pro 79 TTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDR 158 (355) Q Consensus 79 Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~ 158 (355) -+.. +......+...++.....+++.---+.|+.+.|+... ++|+..+.+.+.++++.-.-..-++|+.... T Consensus 150 ~v~E--g~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~a~~~~~~~~i~~g~g~~~---- 221 (371) T protein:vir:81 150 EVAE--GAAIGEKATPQFTLLQYQVKKYAGFFRVTNELLNDST--EAIVNTLVRWIGDESRVTRNGLIINVLNTKA---- 221 (371) T ss_pred eecc--ccccccccccceeeEEeeeeEEEEeehhhHHHHhhhh--HHHHHHHHHHHHHHHHHHHHHHHHhhccccc---- Confidence 2222 2222222333566777888888888899999998643 5889999999998877655544455532110 Q ss_pred hhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcH Q lcl|Aclame:pro 159 TKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGR 238 (355) Q Consensus 159 ~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~ 238 (355) | . .. .+.|.+.. ++...+++.|+. ..+++|.+ T Consensus 222 ---------------------~----------~-----------~~---~~~~~i~~-~~~~~l~~~~~~--~a~~vmn~ 253 (371) T protein:vir:81 222 ---------------------K----------T-----------AI---ADLDGLKQ-IINVQLDPVFRS--TSSVIVNQ 253 (371) T ss_pred ---------------------c----------c-----------cc---ccHHHHHH-HHHhhcchhhhc--CCEEEEcH Confidence 0 0 00 13344333 344456777764 56889998 Q ss_pred HHHHHHHHHHHhhccccchhhHHHHH-HhhhhhcccccccCCccCCC------------cEEEecCCCc-EEEEeeCcEE Q lcl|Aclame:pro 239 KLLADKYFPLVNKQQENSESLAADII-ISQKRIGNLPAVRVPYFPAN------------AVLVTTLENL-SIYFMDESHR 304 (355) Q Consensus 239 dLl~~k~~~l~n~~~~~te~~aa~~~-~~~k~iGGlpa~~~PffP~~------------~ilIT~l~NL-sIY~Q~gs~R 304 (355) ...+ .+..+--.++.. .....+. ....++-|+|++..+++|.+ .+++=.+++. .|+.+.+ .+ T Consensus 254 ~~~~--~L~~lkd~~g~~-l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~~~i~~Gd~~~~~~~~~~~~-~~ 329 (371) T protein:vir:81 254 DAFN--WLDTLKDQNGQY-LLQPSISSPTGRQLLGLPVVIVSNKVLANRVDGGTGAQFAPIIVGDLKEAVVMFDRQR-TE 329 (371) T ss_pred HHHH--HHHHhhccCCCe-eeecccCCCCCceecceeEEEecccccCccccccccCCcceEEEEehhceEEEEeecc-eE Confidence 7765 233332111110 0000000 12357889999999999854 3455555542 2222222 22 Q ss_pred EEEEEccchhhhhhhhhhh-hhhhccccccEEEEe--cceecCccCC Q lcl|Aclame:pro 305 RSIDENPKKDRVENYESMN-IDYVVEVYAAGCLLE--NITLGDFTAP 348 (355) Q Consensus 305 R~~~d~p~r~rve~y~s~N-e~YvVEd~~~~a~ie--nI~~~~~~~~ 348 (355) =.+.+ .. .+++..| ..|.+|-+--+..+. .+.+....+. T Consensus 330 i~~~~-~~----~~~f~~~~v~~~~~~r~d~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 330 IMSSN-VA----MDAFETDATLWRAIERMDVKMRDDEAFVFGEVQLA 371 (371) T ss_pred EEEec-cc----cchhhcCceEEEEEEeeccEEecccceEEEEEecC Confidence 11111 11 1222233 355554443333332 4444332222 No 70 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=97.93 E-value=4.2e-06 Score=50.04 Aligned_cols=287 Identities=11% Similarity=0.060 Sum_probs=148.1 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHc-ceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcc--ccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDV-SKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGV--GVTGTIA 77 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v-~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~l--gv~~~ia 77 (355) ....++...+.++..+-+.-....... ...+.|-+.+...+.+.+.+.+.+++.+++++++...|....+ ...++.+ T Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a 165 (395) T protein:vir:38 86 GKPDAQAMKNQFVKDFKNLVTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEKLADITPLK 165 (395) T ss_pred hhHHHHHHHHHHHHHHHHHHhhccCccCCCceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEeeccCCccc Confidence 333334444444433322211111111 2345666677889999999999999999999999888876432 2223334 Q ss_pred ccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCC Q lcl|Aclame:pro 78 STTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSD 157 (355) Q Consensus 78 ~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td 157 (355) +-+..+ ......+...++...+.+++.---+.|+.+.|+.. .++|+..+.+.+.+.++.-.-.--+||.-.... T Consensus 166 ~~v~E~--~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~-- 239 (395) T protein:vir:38 166 DLDDES--ALIGDNDDPELTVVKYLIHRYAGITTVTNTLLKDT--VDNIIQWLVNWAAKKDVVTRNAKILEVMGKAPK-- 239 (395) T ss_pred cccccc--cccccccccceeeEEeeeeeeEeehhhHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-- Confidence 332222 11111222345667788887777788888888763 347899999999998887655555555321100 Q ss_pred hhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEc Q lcl|Aclame:pro 158 RTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVG 237 (355) Q Consensus 158 ~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG 237 (355) .+....| |.++ ++++..+++.|+. .-+++|. T Consensus 240 -------------------------------------------~~~~~~~---~~i~-~~~~~~l~~~~~~--~a~~v~n 270 (395) T protein:vir:38 240 -------------------------------------------KPTISQF---DNIK-DLENNTLDPAIES--TSSFITN 270 (395) T ss_pred -------------------------------------------ccccccH---HHHH-HHHHHhhhhhhcC--CCEEEEc Confidence 0111223 3332 3444456777774 5689999 Q ss_pred HHHHHHHHHHHH-hhccccc--hhhHHHHHHhhhhhcccccccCCccCCC------cEEEecCCCcEEEEeeCcEEEEEE Q lcl|Aclame:pro 238 RKLLADKYFPLV-NKQQENS--ESLAADIIISQKRIGNLPAVRVPYFPAN------AVLVTTLENLSIYFMDESHRRSID 308 (355) Q Consensus 238 ~dLl~~k~~~l~-n~~~~~t--e~~aa~~~~~~k~iGGlpa~~~PffP~~------~ilIT~l~NLsIY~Q~gs~RR~~~ 308 (355) +..+.. +..+ .....+- ..... ....+|-|+|++..+..|.. .+++-.|++...-+.++...=.+. T Consensus 271 ~~~~~~--L~~lkd~~G~~l~~~~~~~---~~~~~l~G~pV~~~~~~~~~~~~~~~~i~~gd~~~~~~i~~~~~~~i~~~ 345 (395) T protein:vir:38 271 QSGYNI--LSKVKDADGRYLMQPDVTS---PDKYLIDGKPVIRIADKWLPDVSGSHPLYFGDLKQGITLFDRQQMQIDTT 345 (395) T ss_pred HHHHHH--HHHhhccCCceeeccCcCC---CCcceeccceeEEecccccCcCCCcceEEEEeccccEEEEEecceEEEEe Confidence 887652 3222 2221111 00000 12347899999998764333 377777776432233333332222 Q ss_pred Eccchhhhhhhhhhh-h--------hhhccccccEEEEecceecCcc-CCCCcCCCC Q lcl|Aclame:pro 309 ENPKKDRVENYESMN-I--------DYVVEVYAAGCLLENITLGDFT-APAAPESGA 355 (355) Q Consensus 309 d~p~r~rve~y~s~N-e--------~YvVEd~~~~a~ienI~~~~~~-~~~~~~~~a 355 (355) +.+ .++...| . ++.|-+..+++.++ +.... .|++....+ T Consensus 346 ~~~-----~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~---~~~~~~~~~~~~~~~ 394 (395) T protein:vir:38 346 NVG-----AGSFEHDTTKLRFIDRFDVQLIDDGAFAAAS---FKTVANQAQGTAGTG 394 (395) T ss_pred ccc-----cchhhcCceEEEEEEeeccEEecccceEEEE---eecccCCCCCccCCC Confidence 222 1222222 2 33444444554443 22211 111221222 No 71 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=97.93 E-value=7.8e-06 Score=48.58 Aligned_cols=299 Identities=13% Similarity=0.112 Sum_probs=164.3 Q ss_pred CCHHHHHHHH--HHHHH---HHHHh--CCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhccccc Q lcl|Aclame:pro 1 MRPETRFKFN--AYLTR---VAELN--NISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVT 73 (355) Q Consensus 1 M~~~tr~~f~--~y~~~---~A~~n--gv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~ 73 (355) |.+.-..+++ .|... ..+.+ ++.. .......|-+.+...+++.+++.|.++++..++++.--. -++-.-.+ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~-~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~ip~~~~ 78 (324) T protein:vir:93 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMM-HEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTE-KKFTFWAD 78 (324) T ss_pred CchhHHHHHHHHHHHHhhhhhhhcccccccc-cCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCc-eEEEEEec Confidence 7755554443 22222 22211 1111 012345677788999999999999999999988865321 12222223 Q ss_pred ccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccc Q lcl|Aclame:pro 74 GTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRA 153 (355) Q Consensus 74 ~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A 153 (355) ++-++-+ +.+.+. +......+...+..++.---..|+.+.|++.. ++|...+.+.+.++++.-.-.--++|.-. T Consensus 79 ~~~a~~v--~Eg~~~-~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~aia~~~d~a~l~G~g~- 152 (324) T protein:vir:93 79 KPGAYWV--GEGQKI-ETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGN- 152 (324) T ss_pred Ccceeee--cCCccc-cccccceeEEEEEeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhcCCCC- Confidence 4444322 222233 33345677888999998888999999999764 68999999999998887666666788421 Q ss_pred cCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeE Q lcl|Aclame:pro 154 DTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLV 233 (355) Q Consensus 154 ~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LV 233 (355) ... |.-++...... ..... +.-.|..| .+++..+ ++.+++.. + T Consensus 153 -~~~----------------------~~~~~~~~~~~------~~~~~-~~~~~~~i----~~~~~~l-~~~~~~~~--~ 195 (324) T protein:vir:93 153 -NPF----------------------GKSIAQSIEKT------NKVIK-GDFTQDNI----IDLEALL-EDDELEAN--A 195 (324) T ss_pred -CCc----------------------Ccccccccccc------ceecc-ccccHHHH----HHHHHhh-hhccCCCC--E Confidence 110 11111110000 00111 11223333 3455443 44444432 6 Q ss_pred EEEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCc--cCCCcEEEecCCCcEEEEeeCcEEEEEEEcc Q lcl|Aclame:pro 234 AIVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPY--FPANAVLVTTLENLSIYFMDESHRRSIDENP 311 (355) Q Consensus 234 vivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~Pf--fP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p 311 (355) ++|.+..+.. -..+-+....+- ... ....++-|+|++..|. .+.+.+++-.++++ +|...+..+=.+.++. T Consensus 196 ~v~n~~~~~~-L~~l~d~~G~~~--~~~---~~~~~l~G~PVv~~~~~~~~~~~i~~gdfs~~-~~~~~~~~~i~~~~~~ 268 (324) T protein:vir:93 196 FISKTQNRSL-LRKIVDPETKER--IYD---RNSDSLDGLPVVNLKSSNLKRGELITGDFDKL-IYGIPQLIEYKIDETA 268 (324) T ss_pred EEEcHHHHHH-HHHhhCCCCCee--ecC---CCCCcccceeeEeecCCCCCcceEEEEecceE-EEEEecCcEEEEeecc Confidence 8888877652 122322221111 110 1245789999988666 45556888889887 4555555544444432 Q ss_pred ch-------hhhhhhhhhhh---------hhhccccccEEEEecceecCccCCCCc Q lcl|Aclame:pro 312 KK-------DRVENYESMNI---------DYVVEVYAAGCLLENITLGDFTAPAAP 351 (355) Q Consensus 312 ~r-------~rve~y~s~Ne---------~YvVEd~~~~a~ienI~~~~~~~~~~~ 351 (355) .. ...-+++..|. ++.|-+.++++.+...+.+..+.|.+. T Consensus 269 ~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:93 269 QLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) T ss_pred cccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccccCCCCCCCC Confidence 11 11112333343 667888888887766666554544444 No 72 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=97.93 E-value=3.5e-06 Score=50.53 Aligned_cols=295 Identities=8% Similarity=0.010 Sum_probs=146.6 Q ss_pred CCHHHHHHHHHHHH-------------HH--HHHhCCChHHcceeeecCcHHHHH-HHHHHHhhHHHhCcCccccchhhh Q lcl|Aclame:pro 1 MRPETRFKFNAYLT-------------RV--AELNNISTDDVSKKFTVEPSVTQT-LMNTVQASSAFLKTINILPVAEMK 64 (355) Q Consensus 1 M~~~tr~~f~~y~~-------------~~--A~~ngv~~~~v~~~Fsv~P~~~q~-L~~~iqess~FL~~INv~~V~e~~ 64 (355) -+...+..|..++. .+ +...++.. ....+.|-+.+... +...+.+.+.+.+..++++. . T Consensus 217 ~~~~~~~a~~~~~~~~~~~~l~~~e~~~~~~~~~~~~t~--~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~---~ 291 (543) T protein:vir:81 217 SSPAYLRAWSKMARNPHAAILTEEEKRAINEVRAMGLTK--ADGGYLVPFQLDPTVIITSNGSLNDIRRFARQVVA---T 291 (543) T ss_pred hhhhhhhHHHHHHHhhHHHHhhhhhhhhhhhhhhccccc--ccCcccCchhhhhHHHHHHHhhhchhhhhcccccC---C Confidence 00111111111111 00 11222221 12234444455544 45667788888888887665 3 Q ss_pred hh-hhcccccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHH Q lcl|Aclame:pro 65 GE-KIGVGVTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLI 143 (355) Q Consensus 65 Ge-~v~lgv~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i 143 (355) |. .+....+++.+.-+. .+... +.....++...+..++.--.+.|+.+.|+. + ++|...|.+.+.+.++.-.- T Consensus 292 g~~~~~~~~~~~~a~~v~--Eg~~~-~~~~~~~~~i~~~~~k~~~~~~is~ell~d-~--~~~~~~i~~~l~~~~~~~~d 365 (543) T protein:vir:81 292 GDVWHGVSSAAVQWSWDA--EFEEV-SDDSPEFGQPEIPVKKAQGFVPISIEALQD-E--ANVTETVALLFAEGKDELEA 365 (543) T ss_pred cceEEEEecCCcceeecc--cCccc-cccccccceeeeeeeeeEeeehhhHHHHhc-c--HHHHHHHHHHHHHHHHHHHH Confidence 43 222333444443332 22223 334445778889999999899999999974 2 69999999999999998887 Q ss_pred HHhhcccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeC--CCcchhhHHHHHHHHHhcc Q lcl|Aclame:pro 144 MAGFNGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVG--KNGDYENIDALVMDATNNL 221 (355) Q Consensus 144 ~IGfnG~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G--~ggdy~nLDaLv~d~~~~l 221 (355) .-.|||.-.+ ..|. |.+.. ...... .+..+ ..-.|..+.+ ++.. T Consensus 366 ~ail~G~Gt~------~~p~------Gi~~~------------~~~~~~-----~~~~~~~~~~~~~~~~~----~~~~- 411 (543) T protein:vir:81 366 VTLTTGTGQG------NQPT------GIVTA------------LAGTAA-----EIAPVTAETFALADVYA----VYEQ- 411 (543) T ss_pred HHHhccCCCC------cccc------cchhh------------cccccc-----cccccccccccHHHHHH----HHHh- Confidence 8888983211 1222 32221 100000 01111 1223444443 4443 Q ss_pred cchhhhCCCCeEEEEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCc----------EEEecC Q lcl|Aclame:pro 222 IDEVYQDDPNLVAIVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANA----------VLVTTL 291 (355) Q Consensus 222 id~~~~~~~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~----------ilIT~l 291 (355) +++.|+. .-+++|.+..+.. -..+-.....+--..... ....+|-|+|++..+++|.+. |++-.+ T Consensus 412 l~~~~~~--~~~~v~n~~~~~~-l~~lkd~~G~~l~~~~~~--g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~i~~gd~ 486 (543) T protein:vir:81 412 LAARHRR--QGAWLANNLIYNK-IRQFDTQGGAGLWTTIGN--GEPSQLLGRPVGEAEAMDANWNTSASADNFVLLYGNF 486 (543) T ss_pred hhccccC--CcEEEEcHHHHHH-HHHhhcCCCceeccCcCC--CCCccccceeeEEeccccccccccccCCcceEEEeec Confidence 3555554 4588999887652 122222221111100000 123478899999999999875 778888 Q ss_pred CCcEEEEeeC-cEEEEEEEccchhhhhhhhhhhhhhhccccccEEEEe--cceecCccCCC Q lcl|Aclame:pro 292 ENLSIYFMDE-SHRRSIDENPKKDRVENYESMNIDYVVEVYAAGCLLE--NITLGDFTAPA 349 (355) Q Consensus 292 ~NLsIY~Q~g-s~RR~~~d~p~r~rve~y~s~Ne~YvVEd~~~~a~ie--nI~~~~~~~~~ 349 (355) +++.|....| ++.+ .|+...-.++..-.-+|.++-+-.++... .+.+...++.+ T Consensus 487 ~~~~i~~~~~~~i~~----~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~~~a 543 (543) T protein:vir:81 487 QNYVIADRIGMTVEF----IPHLFGTNRRPNGSRGWFAYYRMGADVVNPNAFRLLNVETAS 543 (543) T ss_pred cceeEEeecccEEEE----eccccccchhhcCceEEEEEEeeccEeecccceEEEEecccC Confidence 8876654443 2222 12211111122222345444333333332 33333323222 No 73 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=97.92 E-value=5.7e-06 Score=49.32 Aligned_cols=287 Identities=10% Similarity=0.060 Sum_probs=150.8 Q ss_pred CCHHHHHHHHHHHHHHHH-----HhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcc--ccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAE-----LNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGV--GVT 73 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~-----~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~l--gv~ 73 (355) +.+.-+..|..|+..--. ...... ....+.|-+.+...+.+.+.+.+.+++++++++++...|...-. ... T Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~t~--~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 163 (397) T protein:vir:48 86 VKAGFVKDFKNLVRGRYQNLLDSKTDASG--SDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADI 163 (397) T ss_pred HHHHHHHHHHHHHhhhhhHHHHHhhccCC--ccccccccHHHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCC Confidence 334444444444432210 011100 12357787888899999999999999999999999888876643 223 Q ss_pred ccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccc Q lcl|Aclame:pro 74 GTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRA 153 (355) Q Consensus 74 ~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A 153 (355) ++.+..+..+. .....+...++...+..++.---+.|+.+.|+.. ..+|+..+.+.+.+.++.-.-.--+||+..+ T Consensus 164 ~~~a~~v~E~~--~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds--~~~l~~~v~~~l~~~~~~~~d~~il~G~g~~ 239 (397) T protein:vir:48 164 TGLAKLDDEAG--SIGTNDDPKLYPIRYAIKRYAGISTVTNSLLADS--AENILAWLSGWIAKKVVVTRNKAILEAIATL 239 (397) T ss_pred Ccceeeecccc--ccccccccceeeEEeeheeeeeehhhHHHHHhhc--hHHHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 33444333221 1111222234455555555555578899988764 3478888888888888877766667774321 Q ss_pred cCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeE Q lcl|Aclame:pro 154 DTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLV 233 (355) Q Consensus 154 ~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LV 233 (355) . .. +..- +.|.++ +++..| ++.|+. .-+ T Consensus 240 ~---------------------------------------~~------~~~~---~~d~i~-~~~~~l-~~~~~~--~a~ 267 (397) T protein:vir:48 240 P---------------------------------------TK------PTLT---KWDDII-DLQAKV-DPAIKQ--TSF 267 (397) T ss_pred c---------------------------------------cc------cccc---cHHHHH-HHHHHh-hhhhcC--CCE Confidence 0 00 0111 334433 455444 566664 458 Q ss_pred EEEcHHHHHHHHHHHHhhcc-ccc--hhhHHHHHHhhhhhcccccccCC--ccC-----CCcEEEecCCCcEEEEeeCcE Q lcl|Aclame:pro 234 AIVGRKLLADKYFPLVNKQQ-ENS--ESLAADIIISQKRIGNLPAVRVP--YFP-----ANAVLVTTLENLSIYFMDESH 303 (355) Q Consensus 234 vivG~dLl~~k~~~l~n~~~-~~t--e~~aa~~~~~~k~iGGlpa~~~P--ffP-----~~~ilIT~l~NLsIY~Q~gs~ 303 (355) ++|.+...+ .+..+-..+ .+- ..... ....+|-|+|++.++ ++| ...+++=.|++...++.++.. T Consensus 268 ~v~n~~~~~--~L~~lkd~~G~~i~~~~~~~---~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~ 342 (397) T protein:vir:48 268 FLTNTSGFT--ALKKVKNAFGDYLMERDVKS---PTGYSIDGFAVKEVADRWLANASSGAMPLYFGDLKQAVTLFDRQQM 342 (397) T ss_pred EEECHHHHH--HHHHhhcCCCceeeccCcCC---CCCceeccceeEEecccccCCcCCCceEEEEEeccceEEEEeecce Confidence 889988765 344332111 111 11111 123589999998765 344 334777788876656666655 Q ss_pred EEEEEEccchhhhhhhhhhh-hhhhccccccEEEEe--c---ceecCccCCCCcCCCC Q lcl|Aclame:pro 304 RRSIDENPKKDRVENYESMN-IDYVVEVYAAGCLLE--N---ITLGDFTAPAAPESGA 355 (355) Q Consensus 304 RR~~~d~p~r~rve~y~s~N-e~YvVEd~~~~a~ie--n---I~~~~~~~~~~~~~~a 355 (355) +=.+.+..+ +|...| ..|.++-+--++.+. . +++....+|.+..+.- T Consensus 343 ~i~~~~~~~-----~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~ 395 (397) T protein:vir:48 343 SLLSTNIGG-----GAFETDTTKIRVIDRFDVVATDTESFVPASFKAIADQKGNLGST 395 (397) T ss_pred EEEEeccch-----hhhhcCceeEEEEeeeccEEecccceEEEEecccccCCCCcccc Confidence 543333221 233333 233222222222222 2 3343322222211111 No 74 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=97.92 E-value=1.5e-06 Score=52.55 Aligned_cols=300 Identities=10% Similarity=0.013 Sum_probs=147.6 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHH--c-ceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDD--V-SKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIA 77 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~--v-~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia 77 (355) |-..-. +. +...|.+... . ...-.|-+++.+.+++.+++.|.+++++.++++.--... +-.-..++.+ T Consensus 1 ~~~~~e--~~------~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~-ip~~~~~~~a 71 (338) T protein:vir:78 1 MATLNE--LA------PNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPISYGETI-IPTTVKRPEV 71 (338) T ss_pred CcchHH--hh------hhhcccccccceecccccccchHHHHHHHHHHHhhchhhhhcceeeccCCceE-EEEEecCccc Confidence 211000 00 1111111110 0 112246677789999999999999999999887643222 2222222222 Q ss_pred ccccC------CCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccc Q lcl|Aclame:pro 78 STTDT------SGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTT 151 (355) Q Consensus 78 ~Rt~T------~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s 151 (355) .-+.. +.+.. .+.....++...+.+++.---..|+.+.|+... ++|+..+.+.+.++++.-.-.--+||+. T Consensus 72 ~~v~~~~~~~~~Eg~~-~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~--~~~~~~i~~~la~a~~~~~d~~~l~G~g 148 (338) T protein:vir:78 72 GQVGVGTSNEQREGGT-KPLSGTAWDTRSVAPIKLATIVTVSEEFARMNP--SGLYTKLQADLAYAIGRGIDLAVFHGKS 148 (338) T ss_pred eeeccccccccccccc-ccccccceeEEEEEEEEEEEeehhhHHHHhcCH--HHHHHHHHHHHHHHHHHHHHHHhhcccC Confidence 22111 11112 233334577788999999888999999988743 6899999999999999888888889866 Q ss_pred cccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCC Q lcl|Aclame:pro 152 RADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPN 231 (355) Q Consensus 152 ~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~ 231 (355) ....+.| .|++ +....... ........+....|..|.. ++..+...... .. T Consensus 149 ~~~~~~~----------~gi~------------~~~~~~~~-~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~--~~ 199 (338) T protein:vir:78 149 PLTGSAL----------QGID------------TNNVIVNT-TNVDYLQTGTTPLLDRFLD----GYDLVSANTDV--DF 199 (338) T ss_pred CCccccc----------cccc------------cccccccc-cccccccccchhhHHHHHH----HHHHhhhhccc--cc Confidence 4433222 1221 11111100 0111111222223333333 33322221111 23 Q ss_pred eEEEEcHHHHHHH-HHH-HHhhccccc--hhhHHHHHHhhhhhcccccccCCccCCC---------cEEEecCCCcEEEE Q lcl|Aclame:pro 232 LVAIVGRKLLADK-YFP-LVNKQQENS--ESLAADIIISQKRIGNLPAVRVPYFPAN---------AVLVTTLENLSIYF 298 (355) Q Consensus 232 LVvivG~dLl~~k-~~~-l~n~~~~~t--e~~aa~~~~~~k~iGGlpa~~~PffP~~---------~ilIT~l~NLsIY~ 298 (355) -+++|.++..+.- .++ +-+....+- +... -....+|-|+|++..+++|++ .+++-.+++.-| . T Consensus 200 ~~~~m~~~~~~~L~~~~~l~d~~g~~l~~~~~~---~~~~~~l~G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~-~ 275 (338) T protein:vir:78 200 NGWAADPRYRARLLRSQAYRDANGNVDPTRINL---AASAGDLLGLPVQFGKAVGGDLGAATDSKVRVVGGDFSQLKY-G 275 (338) T ss_pred eEEEEchHHHHHHHHHhhhccCCCceeeccccc---CCCCceeeeeeEEEccccCccccccCCcccEEEEEecceEEE-E Confidence 4788887654410 111 112221111 1111 122358999999999999974 266677777543 3 Q ss_pred eeCcEEEEEEEccchhhh-------hhhhhhh-h--------hhhccccccEEEEecceecCccCCCC Q lcl|Aclame:pro 299 MDESHRRSIDENPKKDRV-------ENYESMN-I--------DYVVEVYAAGCLLENITLGDFTAPAA 350 (355) Q Consensus 299 Q~gs~RR~~~d~p~r~rv-------e~y~s~N-e--------~YvVEd~~~~a~ienI~~~~~~~~~~ 350 (355) .++.++=.+.++...... -++..+| - ++.|-+.++++.+. ++.+|.+ T Consensus 276 ~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~l~-----~~~~~~~ 338 (338) T protein:vir:78 276 FADEIRVKMSDTATLTDNTSPTPQTVSMWQTNQIAILIEVTFGWLLGDKQAFVKFV-----DDEDPDA 338 (338) T ss_pred eecccEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEeecccceEEEe-----cccCCCC Confidence 334444444443332222 2233333 2 33444444444433 3333333 No 75 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=97.91 E-value=4.2e-06 Score=50.05 Aligned_cols=299 Identities=12% Similarity=0.041 Sum_probs=148.3 Q ss_pred CCHHHHHHHHHHHHH-----HHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTR-----VAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGT 75 (355) Q Consensus 1 M~~~tr~~f~~y~~~-----~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ 75 (355) -..+-|..|..|+.. -+...++.. ....|.|-+.+.+.+++.+.+.+.+.+..+++++.- +-++-+-..++ T Consensus 119 ~~~e~r~a~~~~l~~~~~~~e~~a~~~~t--~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~~~~--~~~~p~~~~~~ 194 (434) T protein:vir:62 119 KETEIRSVFANYIVGNIDEKEARALGLVT--GNGSVTIPDFLSKEIITYAQEENFLRRLGTGVKTKE--NIKYPVLVKKA 194 (434) T ss_pred HHHHHHHHHHHHhccccchhhhhhhcccc--cccceecchhhHHHHHHhhhhhhhhhhhcceeccCC--ceEEEEEecCC Confidence 112335556666542 222233322 123577777778889999999999999999887641 11222212222 Q ss_pred ccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccC Q lcl|Aclame:pro 76 IASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADT 155 (355) Q Consensus 76 ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~ 155 (355) .++-..........+.....++...+.+++.---+.|+.+.|+.- ..+|+..+.+.+.++++.-.-.--+||+-.. T Consensus 195 ~a~~~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~-- 270 (434) T protein:vir:62 195 EAQGHKNERTNNEMPETDIEFDEIELSPTEFDALATVTKKLLART--GLPIEQIVMDELKKAYVRKETQYMVNGDEAN-- 270 (434) T ss_pred cccceecccccccccccccceeeEEeeheeeEeehhhHHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHHhccCCCC-- Confidence 221111111111222222345556666666666678899988864 3589999999999999887777777884322 Q ss_pred CChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEE Q lcl|Aclame:pro 156 SDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAI 235 (355) Q Consensus 156 Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvi 235 (355) +|.. | ++... + +... ..+-...|.|+ +++..+ ++.|+. .-+++ T Consensus 271 -----~~~~-----g------------~~~~~----~------~~~~-~~~~~~~d~l~-~l~~~l-~~~~~~--~a~~v 313 (434) T protein:vir:62 271 -----NIND-----G------------ALAKK----A------VEFK-TDEKNLYDALV-KMKNTP-VKEVRK--KARWV 313 (434) T ss_pred -----cccc-----c------------eeecc----c------cccc-ccccchhhHHH-HHHhhc-chhhhc--CCEEE Confidence 2211 1 11100 0 1111 11112345443 566655 666664 44889 Q ss_pred EcHHHHHHHHHHHH-hhccccchhhHHHHH-HhhhhhcccccccCCccCCCc------EEEecCCCcEEEEeeCcEEEEE Q lcl|Aclame:pro 236 VGRKLLADKYFPLV-NKQQENSESLAADII-ISQKRIGNLPAVRVPYFPANA------VLVTTLENLSIYFMDESHRRSI 307 (355) Q Consensus 236 vG~dLl~~k~~~l~-n~~~~~te~~aa~~~-~~~k~iGGlpa~~~PffP~~~------ilIT~l~NLsIY~Q~gs~RR~~ 307 (355) |.+..+. ++..+ .....|-=....+.. ....+|-|+|++..+++|... |++=.|+..-|+-..|...=.. T Consensus 314 ~n~~~~~--~L~~lkd~~G~~l~~~~~~~~~g~~~tl~G~pV~~~~~~~~~~~~~~~~i~~Gdfs~~~i~~~~g~~~i~~ 391 (434) T protein:vir:62 314 LNTAALT--KIETMKTDDGFPLLRPFNQAEGGIGYTLLGFPVEEEDAIDIPDSPDTPVFYFGDFSKFYIQDVIGSLEVQK 391 (434) T ss_pred EcHHHHH--HHHHhhccCCCEeeccCCCccCCCCceecceeeEEecCccCccCCCceEEEEeeccceEEEEeeceeEEEe Confidence 9888665 23333 222111100000000 112378899999999999765 5555555544443334332111 Q ss_pred EEccchhhhhhhhhhh-hhhhccccccEEEE---ecceecCccCCCCcCCCC Q lcl|Aclame:pro 308 DENPKKDRVENYESMN-IDYVVEVYAAGCLL---ENITLGDFTAPAAPESGA 355 (355) Q Consensus 308 ~d~p~r~rve~y~s~N-e~YvVEd~~~~a~i---enI~~~~~~~~~~~~~~a 355 (355) . .+.|...| .+|.++..--+-+| +.+.+.... ..++.+| T Consensus 392 ~-------~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~--~~~~~~~ 434 (434) T protein:vir:62 392 L-------VELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYV--LKAPTGA 434 (434) T ss_pred e-------hhhhcccCceEEEEEeeecceeecCcccceEEEEE--eccCCCC Confidence 1 12233333 35655554433343 233221110 1111122 No 76 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=97.89 E-value=3.1e-06 Score=50.76 Aligned_cols=290 Identities=10% Similarity=0.018 Sum_probs=152.9 Q ss_pred HHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccccCCCCcCcccccccc Q lcl|Aclame:pro 16 VAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTTDTSGDKERQTADFTA 95 (355) Q Consensus 16 ~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt~T~~~~~r~~~~~~~ 95 (355) || .+. ..+..+.|-+...+.+++.++++|.++++.+++++.-- +-++-.-.+++-|+-+.-+ +..+..-.. T Consensus 1 Ma--~~~---~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~-~~~ip~~~~~~~a~wv~Eg---~~~~~s~~~ 71 (315) T protein:vir:80 1 MA--DDF---LSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFG-PVKGAVFSGVPRAKIVGEG---EVKPSASVD 71 (315) T ss_pred CC--CCc---CCcCceEcchHHHHHHHHHHHhhchhhhhcceeecCCC-ceEEEEEeCCcceEEeeCC---ccccccccc Confidence 33 122 12457889999999999999999999999999887532 1233333445555544322 223444456 Q ss_pred ccCcceeEEeeeecceeCHHHHHhhcc--cchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhhhhhhhccchhHHH Q lcl|Aclame:pro 96 LESSKYECNQINFDFHLKYKTLDLWAR--FQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTKNTLLQDVAVGWLQ 173 (355) Q Consensus 96 l~~~~Y~c~qTn~d~~i~y~~LD~WA~--~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~anPllqDVNkGWlq 173 (355) ++.....+++.---+.|+-+.|.+..- ...++..+.+.+.+.++.=.-.-.|||+.-...+.+. T Consensus 72 f~~v~l~~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~-------------- 137 (315) T protein:vir:80 72 VSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAAS-------------- 137 (315) T ss_pred eeeeEeeeeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCcccc-------------- Confidence 777888888888888898888865442 2347788888888888876667788996422111110 Q ss_pred HHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 174 KYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKLLADKYFPLVNKQQ 253 (355) Q Consensus 174 ~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dLl~~k~~~l~n~~~ 253 (355) .+.... ......+ ...+..|.+|+.++. ++. ...++ .+. +++|.+.....- .+|..... T Consensus 138 ---------~~~~~~----~~~~~~~-~~~~~~~~d~~~~~~-~~~---~~~~~-~~~-~~imn~~~~~~L-~~l~~~~g 196 (315) T protein:vir:80 138 ---------AVHTSL----NKTKNIV-DATDSATADLVKAVG-LIA---GAGLQ-VPN-GVALDPAFSFAL-STEVYPKG 196 (315) T ss_pred ---------cccccc----cccccee-eccccchHHHHHHHH-HHh---hccCc-cce-EEEEcHHHHHHH-HHHhhccC Confidence 000000 0011111 123345777776654 232 22222 122 688887766531 22222111 Q ss_pred ccch-h-hHHHHH-HhhhhhcccccccCCccCCCc---------EEEecCCCcEEEEeeCcEEEEEEEccchhhh-hhhh Q lcl|Aclame:pro 254 ENSE-S-LAADII-ISQKRIGNLPAVRVPYFPANA---------VLVTTLENLSIYFMDESHRRSIDENPKKDRV-ENYE 320 (355) Q Consensus 254 ~~te-~-~aa~~~-~~~k~iGGlpa~~~PffP~~~---------ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~rv-e~y~ 320 (355) .+.- . +--.+. ....++-|+|++..+++|++. +++-.++++ +|-..+.++-.+.+..+-+.. .++. T Consensus 197 ~~~~g~~~~~~~~~g~~~tl~G~PV~~~~~~~~~~~~~~~~~~~~~~GDfs~~-~~g~~~~~~i~i~~~~~~~~~~~~~~ 275 (315) T protein:vir:80 197 SPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRV-HWGFQRNFPIELIEYGDPDQTGRDLK 275 (315) T ss_pred CcccccccccccccCCCceecceeeEecCcCCcccccccccccEEEEeecccE-EEEEecCeeEEEeccccccCcccchh Confidence 1110 0 000000 112479999999999999764 455667775 343444444333333221111 1233 Q ss_pred hhh---------hhhhccccccEEEEecceecCccCCCCcCCCC Q lcl|Aclame:pro 321 SMN---------IDYVVEVYAAGCLLENITLGDFTAPAAPESGA 355 (355) Q Consensus 321 s~N---------e~YvVEd~~~~a~ienI~~~~~~~~~~~~~~a 355 (355) .+| -++.|.+.++++.+++..- |-+..++. T Consensus 276 ~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a-----~~~~~~~~ 314 (315) T protein:vir:80 276 GHNEVMVRAEAVLYVAIESLDSFAVVKEKAA-----PKPNPPAE 314 (315) T ss_pred hcCcEEEEEEEEecceeecccceEEEeeccC-----CCCCCCCC Confidence 333 3444555555555443322 22222222 No 77 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=97.85 E-value=1.5e-06 Score=52.46 Aligned_cols=299 Identities=13% Similarity=0.064 Sum_probs=160.9 Q ss_pred CC--H-HHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhccccccccc Q lcl|Aclame:pro 1 MR--P-ETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIA 77 (355) Q Consensus 1 M~--~-~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia 77 (355) |. + +++..+... ..+...+.. .+....|-|++.+.+.+.+++.+..+++++++++.-- +.++-.-.+++-+ T Consensus 1 ~~~~~~r~~~~~~~~---e~~a~~~~~--~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~-~~~~p~~~~~~~a 74 (326) T protein:vir:42 1 MAVNPDRTTPFLGVN---DPKVAQTGD--SMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMGTT-GQKIPHWTGDVSA 74 (326) T ss_pred CCCCccchhhhcCcc---hhhheeccc--cCCcceechhhHHHHHHHHHhcchhhhhcceeeccCC-ceEEEEEeCCcce Confidence 43 3 222222221 122222222 1223458888999999999999999999999987632 2234333445555 Q ss_pred ccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCC Q lcl|Aclame:pro 78 STTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSD 157 (355) Q Consensus 78 ~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td 157 (355) +.+.- +.+. |.....++...+..++.---..|+.+.|+. ...+|+..+.+.+.++++.-.-.-.|||+- T Consensus 75 ~~v~E--g~~~-~~~~~~f~~i~~~~~k~~~~v~iS~ell~~--s~~~~~~~i~~~l~~a~~~~~d~a~l~G~g------ 143 (326) T protein:vir:42 75 SWIGE--GDMK-PITKGNMTSQTIAPHKIATIFVASAETVRA--NPANYLGTMRTKVATAFAMAFDNAAINGTD------ 143 (326) T ss_pred EEecC--Cccc-cccccceeEEEEeeEEEEEeehhhHHHHhc--CHHHHHHHHHHHHHHHHHHHHHHHhhcccC------ Confidence 54432 2233 333455777888888888888888888874 346899999999999999988888899944 Q ss_pred hhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCC--CcchhhHHHHHHHHHhcccchhhhCCCCeEEE Q lcl|Aclame:pro 158 RTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGK--NGDYENIDALVMDATNNLIDEVYQDDPNLVAI 235 (355) Q Consensus 158 ~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~--ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvi 235 (355) +.+|.+ ++.... ..+... ....+. +..+..++ ..++.. ++.+.++. ..+++ T Consensus 144 -s~~p~g------------------i~~~~~-~~~~~~--~~~~~~~~~~~~~~~~--~~~~~~-~~~~~~~~--~a~~v 196 (326) T protein:vir:42 144 -SPFPTF------------------LAQTTK-EVSLVD--PDGTGSNADLTVYDAV--AVNALS-LLVNAGKK--WTHTL 196 (326) T ss_pred -CCcccc------------------cccccc-ccceee--cccccccccchhHHHH--HHHHHh-hhhhhccC--ccEEE Confidence 122321 010000 000000 011112 22333333 333332 34554443 45788 Q ss_pred EcHHHHHHHHHHHHhhccccc--hhh-HHH-HHHhhhhhcccccccCCccCCCcEEE--ecCCCcEEEEeeCcEEEEEEE Q lcl|Aclame:pro 236 VGRKLLADKYFPLVNKQQENS--ESL-AAD-IIISQKRIGNLPAVRVPYFPANAVLV--TTLENLSIYFMDESHRRSIDE 309 (355) Q Consensus 236 vG~dLl~~k~~~l~n~~~~~t--e~~-aa~-~~~~~k~iGGlpa~~~PffP~~~ilI--T~l~NLsIY~Q~gs~RR~~~d 309 (355) |.+..+.. -..|-.....+- +.. ... ......++-|+|++..+++|++..++ ..++++- |...+...=.+.+ T Consensus 197 ~n~~~~~~-L~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~Gd~s~~~-~~~~~~~~v~~~~ 274 (326) T protein:vir:42 197 LDDITEPI-LNGAKDKSGRPLFIESTYTEENSPFRLGRIVARPTILSDHVASGTVVGYQGDFRQLV-WGQVGGLSFDVTD 274 (326) T ss_pred EeHHHHHH-HHHhhccCCceeeccccccCccccccCceeeeeeEEEcCCCCCCceEEEEeecceEE-EEEecceEEEEee Confidence 88877652 222322211111 000 000 00113478899999999999998654 5778773 4455544433333 Q ss_pred c--------cchhhhhhhhhhhh--------hhhccccccEEEEecceecCc Q lcl|Aclame:pro 310 N--------PKKDRVENYESMNI--------DYVVEVYAAGCLLENITLGDF 345 (355) Q Consensus 310 ~--------p~r~rve~y~s~Ne--------~YvVEd~~~~a~ienI~~~~~ 345 (355) + +.-..+..|+.-.- ++.|.+.++++.+.++..+++ T Consensus 275 e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v~~~~a~~~l~~~~~~~~ 326 (326) T protein:vir:42 275 QATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHCNDKDAFVKLTNVDATEA 326 (326) T ss_pred cceeeecccccccchhhhhcCcEEEEEEEEeccEEecccceEEEeeccccCC Confidence 2 22233333433222 345666666666555554443 No 78 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=97.83 E-value=6.9e-06 Score=48.89 Aligned_cols=283 Identities=14% Similarity=0.075 Sum_probs=153.5 Q ss_pred HHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccccCCCCcCcccccccc Q lcl|Aclame:pro 16 VAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTTDTSGDKERQTADFTA 95 (355) Q Consensus 16 ~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt~T~~~~~r~~~~~~~ 95 (355) +|.. ..+...-|-|++...+++.+++.|.++++.+++++.--. ..+-.-.+++-|+-+.- +.+ .|..-.. T Consensus 1 ma~~------t~~~G~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~-~~~p~~~~~~~a~wv~E--g~~-~~~s~~~ 70 (300) T protein:vir:95 1 MSEA------QLSKGNLFNPELVTKVINKVKGHSSIAKLSPQKPIPFNG-QREFVFDFDSDIDIVAE--NGK-KTHGGVS 70 (300) T ss_pred Cccc------ccCCcceechhhHHHHHHHHHhhhhhhhhcceeeccCCc-eEEEEEecCcceEEeeC--Ccc-ccccccc Confidence 2211 122345688899999999999999999988888765422 22333334444543332 222 3444456 Q ss_pred ccCcceeEEeeeecceeCHHHHHhhc-ccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhhhhhhhccchhHHHH Q lcl|Aclame:pro 96 LESSKYECNQINFDFHLKYKTLDLWA-RFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTKNTLLQDVAVGWLQK 174 (355) Q Consensus 96 l~~~~Y~c~qTn~d~~i~y~~LD~WA-~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~anPllqDVNkGWlq~ 174 (355) ++...+.+++.--.+.|+.+.|.++. ..+++++.+.+.+.+.++.=.-.-.|+|+-...-+. T Consensus 71 f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~----------------- 133 (300) T protein:vir:95 71 LDPVTIVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQA----------------- 133 (300) T ss_pred ceeeEeeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCC----------------- Confidence 77888999999999999999997775 468999999999999999877788889853221111 Q ss_pred HHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHHHHHHHHHHHhhccc Q lcl|Aclame:pro 175 YRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKLLADKYFPLVNKQQE 254 (355) Q Consensus 175 ~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dLl~~k~~~l~n~~~~ 254 (355) ..+.... ...+.. ...+..+..-.|.+|..++.. ++..+++ +. +++|.+..... -..|-..... T Consensus 134 ------~~~~~~~-~~~~~~-~~~~~~~~~~~~~~i~~~~~~-----~~~~~~~-~~-~~vmn~~~~~~-L~~lkd~~G~ 197 (300) T protein:vir:95 134 ------STIIGDN-CFDKKV-TQTVPFKDTNPDESMEDAVGM-----IDGSERD-IT-GAILDPIFTTA-LSKMKNAEGG 197 (300) T ss_pred ------ccccccc-cccccc-ceeecccccchHHHHHHHHHH-----hhhcCCC-cc-EEEECHHHHHH-HHHhhccCCC Confidence 0000000 000000 001111112225555544432 2333343 33 68888776552 2222222211 Q ss_pred cchhhHHHHHHhhhhhcccccccCCccCCCc------EEEecCCCcEEEEeeCcEEEEEEEccchh-hhhhhhhhh-hhh Q lcl|Aclame:pro 255 NSESLAADIIISQKRIGNLPAVRVPYFPANA------VLVTTLENLSIYFMDESHRRSIDENPKKD-RVENYESMN-IDY 326 (355) Q Consensus 255 ~te~~aa~~~~~~k~iGGlpa~~~PffP~~~------ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~-rve~y~s~N-e~Y 326 (355) +-=.. ...-....++-|+|++..+++|... +++-.++++-.|..+....-++.+..+.| .-.+|...| -+| T Consensus 198 ~i~~~-~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~GDf~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~ 276 (300) T protein:vir:95 198 KLYPE-LAWGGVPDAINGLAVDKNRTVSYSQTDPKNTAIVGDFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYI 276 (300) T ss_pred eeccC-ccccCCCceecceeeEEecCCCCCCCCCccEEEEeeccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEE Confidence 11000 0001224689999999999999776 67778887755655555555554433221 112344444 333 Q ss_pred --------hccccccEEEEecceecCccCCCCcCCC Q lcl|Aclame:pro 327 --------VVEVYAAGCLLENITLGDFTAPAAPESG 354 (355) Q Consensus 327 --------vVEd~~~~a~ienI~~~~~~~~~~~~~~ 354 (355) .|.+..+++.+. .++| T Consensus 277 r~~~r~d~~v~~~~a~~~l~------------~~~g 300 (300) T protein:vir:95 277 RCEAYIGWGIMDAASFARIV------------KTGG 300 (300) T ss_pred EEEEeecceeecccceEEEe------------cCCC Confidence 333333333322 1112 No 79 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=97.81 E-value=7.4e-06 Score=48.70 Aligned_cols=327 Identities=10% Similarity=0.020 Sum_probs=156.4 Q ss_pred CCHHHHHHHHHHHHHHHH--HhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAE--LNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIAS 78 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~--~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~ 78 (355) ...+.+..|..+....+. .+.... ...-.+.|-|.+...+++.+++.+.++++++++++.--...........+-++ T Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~ 208 (497) T protein:vir:10 130 AAAELMGAFADGETAPAAIGQNPFGS-TGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAA 208 (497) T ss_pred HHHHHHHHHhhhhhhHHHHHhhhccc-CcccccccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcce Confidence 111122222222222111 111111 11235778899999999999999999999999988653222211111111222 Q ss_pred cccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhccccccc---- Q lcl|Aclame:pro 79 TTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRAD---- 154 (355) Q Consensus 79 Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~---- 154 (355) -+. .+... |..-..++...+..++.---+.|+.+.|+.. |+++..|.+.+.+.++.=.-.--+||+-... T Consensus 209 wv~--E~~~~-~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~---~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gi 282 (497) T protein:vir:10 209 AVA--EAGTY-PFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGL 282 (497) T ss_pred eec--cCccc-ccccccceeeEeeeeeeEeecHhHHHHHHhH---HHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccc Confidence 222 22222 2223346677777777777788999999874 5789999999999888654444445421100 Q ss_pred --------C---------CChh--------hhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhh Q lcl|Aclame:pro 155 --------T---------SDRT--------KNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYEN 209 (355) Q Consensus 155 --------~---------Td~~--------anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~n 209 (355) . +... ..-....+|..|+..++..+...-.... .+ .+..+..-++.. T Consensus 283 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~-----~~~~~~~~~~~~ 354 (497) T protein:vir:10 283 LQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGS---GS-----GVAGSYPTAAEI 354 (497) T ss_pred ccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhh---cc-----chhccccchhhh Confidence 0 0000 0011234555566665553322211100 00 011112223344 Q ss_pred HHHHHHHHHhcccchhhhCCCCeEEEEcHHHHHHHHHHHHhhcc------ccchhhHHHHHHhhhhhcccccccCCccCC Q lcl|Aclame:pro 210 IDALVMDATNNLIDEVYQDDPNLVAIVGRKLLADKYFPLVNKQQ------ENSESLAADIIISQKRIGNLPAVRVPYFPA 283 (355) Q Consensus 210 LDaLv~d~~~~lid~~~~~~~~LVvivG~dLl~~k~~~l~n~~~------~~te~~aa~~~~~~k~iGGlpa~~~PffP~ 283 (355) ++-++ +++..+... ....++ +++|.+.-+. .+.++.-.+ .+....+.+.....+++-|+|++..|++|+ T Consensus 355 ~~~~~-~~~~~~~~~-~~~~~~-~~vmn~~~~~--~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~ 429 (497) T protein:vir:10 355 AENVF-DAFVDIQLT-LFQTPN-AVVMNPRDWE--LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPL 429 (497) T ss_pred hhHHH-HHHhhhhhh-cccCCC-eEEEchHHHH--HHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCC Confidence 44322 233333222 233344 4556654322 122221111 111222233333346888999999999999 Q ss_pred CcEEEecCCCcEEE-EeeCcEEEEEEEccchhhhhhhhhhh-hhhhccccccEEEE--ecceecCccCCCCcC Q lcl|Aclame:pro 284 NAVLVTTLENLSIY-FMDESHRRSIDENPKKDRVENYESMN-IDYVVEVYAAGCLL--ENITLGDFTAPAAPE 352 (355) Q Consensus 284 ~~ilIT~l~NLsIY-~Q~gs~RR~~~d~p~r~rve~y~s~N-e~YvVEd~~~~a~i--enI~~~~~~~~~~~~ 352 (355) +.+++-.++...+. +.++..+=.+-+. ..+|+.+| -+|.+|..-.++.. |.|...+-.++++.. T Consensus 430 ~~~~~Gd~~~~~~~i~~r~~~~v~~~~~-----~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~~~~ 497 (497) T protein:vir:10 430 GTILVGHFAPSVIQTARREGVTMQMTNS-----NGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) T ss_pred CceEEeecccceEEEEEecccEEEeecc-----cchhhhcCcEEEEEEEeecceeeccccEEEEEecCCccCC Confidence 99999888765443 3344444333221 11233334 34444333222222 244444433333333 No 80 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=97.81 E-value=7.4e-06 Score=48.70 Aligned_cols=327 Identities=10% Similarity=0.020 Sum_probs=156.4 Q ss_pred CCHHHHHHHHHHHHHHHH--HhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAE--LNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIAS 78 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~--~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~ 78 (355) ...+.+..|..+....+. .+.... ...-.+.|-|.+...+++.+++.+.++++++++++.--...........+-++ T Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~ 208 (497) T protein:vir:78 130 AAAELMGAFADGETAPAAIGQNPFGS-TGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAA 208 (497) T ss_pred HHHHHHHHHhhhhhhHHHHHhhhccc-CcccccccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcce Confidence 111122222222222111 111111 11235778899999999999999999999999988653222211111111222 Q ss_pred cccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhccccccc---- Q lcl|Aclame:pro 79 TTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRAD---- 154 (355) Q Consensus 79 Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~---- 154 (355) -+. .+... |..-..++...+..++.---+.|+.+.|+.. |+++..|.+.+.+.++.=.-.--+||+-... T Consensus 209 wv~--E~~~~-~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~---~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gi 282 (497) T protein:vir:78 209 AVA--EAGTY-PFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGL 282 (497) T ss_pred eec--cCccc-ccccccceeeEeeeeeeEeecHhHHHHHHhH---HHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccc Confidence 222 22222 2223346677777777777788999999874 5789999999999888654444445421100 Q ss_pred --------C---------CChh--------hhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhh Q lcl|Aclame:pro 155 --------T---------SDRT--------KNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYEN 209 (355) Q Consensus 155 --------~---------Td~~--------anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~n 209 (355) . +... ..-....+|..|+..++..+...-.... .+ .+..+..-++.. T Consensus 283 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~-----~~~~~~~~~~~~ 354 (497) T protein:vir:78 283 LQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGS---GS-----GVAGSYPTAAEI 354 (497) T ss_pred ccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhh---cc-----chhccccchhhh Confidence 0 0000 0011234555566665553322211100 00 011112223344 Q ss_pred HHHHHHHHHhcccchhhhCCCCeEEEEcHHHHHHHHHHHHhhcc------ccchhhHHHHHHhhhhhcccccccCCccCC Q lcl|Aclame:pro 210 IDALVMDATNNLIDEVYQDDPNLVAIVGRKLLADKYFPLVNKQQ------ENSESLAADIIISQKRIGNLPAVRVPYFPA 283 (355) Q Consensus 210 LDaLv~d~~~~lid~~~~~~~~LVvivG~dLl~~k~~~l~n~~~------~~te~~aa~~~~~~k~iGGlpa~~~PffP~ 283 (355) ++-++ +++..+... ....++ +++|.+.-+. .+.++.-.+ .+....+.+.....+++-|+|++..|++|+ T Consensus 355 ~~~~~-~~~~~~~~~-~~~~~~-~~vmn~~~~~--~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~ 429 (497) T protein:vir:78 355 AENVF-DAFVDIQLT-LFQTPN-AVVMNPRDWE--LLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPL 429 (497) T ss_pred hhHHH-HHHhhhhhh-cccCCC-eEEEchHHHH--HHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCC Confidence 44322 233333222 233344 4556654322 122221111 111222233333346888999999999999 Q ss_pred CcEEEecCCCcEEE-EeeCcEEEEEEEccchhhhhhhhhhh-hhhhccccccEEEE--ecceecCccCCCCcC Q lcl|Aclame:pro 284 NAVLVTTLENLSIY-FMDESHRRSIDENPKKDRVENYESMN-IDYVVEVYAAGCLL--ENITLGDFTAPAAPE 352 (355) Q Consensus 284 ~~ilIT~l~NLsIY-~Q~gs~RR~~~d~p~r~rve~y~s~N-e~YvVEd~~~~a~i--enI~~~~~~~~~~~~ 352 (355) +.+++-.++...+. +.++..+=.+-+. ..+|+.+| -+|.+|..-.++.. |.|...+-.++++.. T Consensus 430 ~~~~~Gd~~~~~~~i~~r~~~~v~~~~~-----~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~~~~ 497 (497) T protein:vir:78 430 GTILVGHFAPSVIQTARREGVTMQMTNS-----NGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATGS 497 (497) T ss_pred CceEEeecccceEEEEEecccEEEeecc-----cchhhhcCcEEEEEEEeecceeeccccEEEEEecCCccCC Confidence 99999888765443 3344444333221 11233334 34444333222222 244444433333333 No 81 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=97.76 E-value=3.7e-06 Score=50.34 Aligned_cols=279 Identities=15% Similarity=0.110 Sum_probs=159.7 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt 80 (355) |+-+.-..++... + .+....|-+++.+.+.+.+.+.|.++++.+++++.-..+..+-...+++-++-+ T Consensus 1 m~~~~~~~~~~~~----------t--~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v 68 (297) T protein:vir:95 1 MTVQTFNPENVLV----------S--QKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGISAYWV 68 (297) T ss_pred CCccccccccccc----------c--CCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCceeEEe Confidence 5544332222211 1 122346778888999999999999999999998765444455445555555444 Q ss_pred cCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhh Q lcl|Aclame:pro 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) Q Consensus 81 ~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~a 160 (355) . .+... +......+...+.+++.---..|+.+.|+... ++|+..+.+.+.+.++...-.-.+||+-... T Consensus 69 ~--Eg~~~-~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~--~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~------ 137 (297) T protein:vir:95 69 N--ETEKI-KTDKPEVVPVTLKAHKLGIILVTSREALNYTW--KKFFEDMKPQIVEAFYKKIDEAGLLGHDTPF------ 137 (297) T ss_pred e--cCccc-cccccceeEEEEeeEEEEEeehhhHHHHhcCH--HHHHHHHHHHHHHHHHHHHHHHHhcccCCcc------ Confidence 3 22233 33335577888899988888889999888654 6899999999999999888888889953211 Q ss_pred hhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHH Q lcl|Aclame:pro 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKL 240 (355) Q Consensus 161 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dL 240 (355) | .| +++... .. ....+++-+|.+|-+ ++..+.+. +.+ .-+++|.++. T Consensus 138 -~------~g------------i~~~~~----~~---~~~~~~~~t~~~i~~----~~~~l~~~-~~~--~~~~v~~~~~ 184 (297) T protein:vir:95 138 -A------NS------------VAKAAK----DA---NKVIGGPINYDNILK----LQDALYDA-DVE--PNAFVSKIQN 184 (297) T ss_pred -c------cc------------cccccc----cc---ceecccccCHHHHHH----HHHHhhhc-cCC--cCEEEEcHHH Confidence 1 11 111110 00 011122334665544 44444333 332 2378999997 Q ss_pred HHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCc--cCCCcEEEecCCCcEEEEeeCcEEEEEEEccch----- Q lcl|Aclame:pro 241 LADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPY--FPANAVLVTTLENLSIYFMDESHRRSIDENPKK----- 313 (355) Q Consensus 241 l~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~Pf--fP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r----- 313 (355) ... ...|-+....+- ...+..++-|+|++..|. .+++.+++-.++++ +|...+..+-.+.++... T Consensus 185 ~~~-L~~l~d~~G~~i------~~~~~~~l~G~Pv~~~~~~~~~~~~~~~gd~s~~-~~~~~~~~~i~~~~~~~~~~~~~ 256 (297) T protein:vir:95 185 RSA-LREARDGNKVSI------YDKAANTIDGITTVDLKSARFEKGDLLAGDFDNL-IYGVPYNITYKISEEGQISTITN 256 (297) T ss_pred HHH-HHHhhccCCcee------ecCCCCcccceeeEeecCCCCCCceEEEEecccE-EEEEecCeEEEEeeccccccccc Confidence 662 223333221111 012235788999986554 68889999999997 455556555544444322 Q ss_pred ---hhhhhhhhhhh--------hhhccccccEEEEecceecCccCCC Q lcl|Aclame:pro 314 ---DRVENYESMNI--------DYVVEVYAAGCLLENITLGDFTAPA 349 (355) Q Consensus 314 ---~rve~y~s~Ne--------~YvVEd~~~~a~ienI~~~~~~~~~ 349 (355) ..+..|+.-.- ++.|-+.+++|.+ ..++|. T Consensus 257 ~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l------~~at~~ 297 (297) T protein:vir:95 257 ADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKL------TPAERV 297 (297) T ss_pred cCccchhhhhcCcEEEEEEEEeccEeecccceEEE------eecCCC Confidence 22222332223 3344444444443 223333 No 82 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=97.75 E-value=5.2e-06 Score=49.55 Aligned_cols=316 Identities=13% Similarity=0.058 Sum_probs=139.2 Q ss_pred CCHHHHHH------HHHHHHHHHHHhCCChHHccee-eecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhccccc Q lcl|Aclame:pro 1 MRPETRFK------FNAYLTRVAELNNISTDDVSKK-FTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVT 73 (355) Q Consensus 1 M~~~tr~~------f~~y~~~~A~~ngv~~~~v~~~-Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~ 73 (355) |..+.|.. +..+..++........ ..+.. ..|-..+...+++.+.+.+.+++.+++++++-.. .+.+... T Consensus 123 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~g~~~~vP~~~~~~i~~~l~~~~~l~~~~~v~~~~g~~--~~~~~~~ 199 (466) T protein:vir:80 123 MPYEQRAALIARSEVKEFLAQVRTLAQQKR-AVSGAELTIPDVMLELLRDNMHRYSKLISKVRLRPLKGTA--RQNIAGA 199 (466) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHhhhhh-hhccccccccHHHHHHHHHhhhhhhhhhhheeeeecCcee--EeeeecC Confidence 22222222 1222222222221111 12222 3344446788889999999999999998885211 1222222 Q ss_pred ccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccc Q lcl|Aclame:pro 74 GTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRA 153 (355) Q Consensus 74 ~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A 153 (355) ++.+.-+ +...+....+ ...+...|.+++.---+.|+.+.|+. ..++|+..+++.+.++++.=.-.--+||+- T Consensus 200 ~~~a~wv--~E~~~~~~~~-~~f~~i~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~la~~~~~~~~~ail~G~G-- 272 (466) T protein:vir:80 200 IPEGVWT--EAVANLNELS-LSFSQIEVDGYKVGGFIPIPNSTLED--SDLNLADEILDAIGQAIGFALDKAILYGTG-- 272 (466) T ss_pred Ccceeec--cccccccccc-ccccceeecceeeeeehhhhHHHHhc--chHHHHHHHHHHHHHHHHHHHhhheeeccC-- Confidence 2333222 2222333333 33666778888877788899999973 224789999999999877655555556632 Q ss_pred cCCChhhhhhhhccchhHHHHHHhhccccccccccccC-C---ccc----cceeeeCCCcchhhHHHHHHHHHhcccchh Q lcl|Aclame:pro 154 DTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDAD-G---KVV----SAVIRVGKNGDYENIDALVMDATNNLIDEV 225 (355) Q Consensus 154 ~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~-g---~~~----~~~i~~G~ggdy~nLDaLv~d~~~~lid~~ 225 (355) ..+| +|+|...= ...-....... . ... ......+..+.+...| ++.. +..+.+. T Consensus 273 -----~~~P------~Gil~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~-~~~~~~~- 334 (466) T protein:vir:80 273 -----TKMP------VGIVTRLA----QTTQPPNWGTKAPAWTNLSTTNLLKIDPTGKSAEEFFSE-LVLK-LSKARAN- 334 (466) T ss_pred -----CCCc------ceeeeccc----ccccccccccccccccccchhhhhhhhhhccchhhHHHH-HHHH-HHhhhcc- Confidence 1123 35543210 00000000000 0 000 0000112222222233 2222 1122222 Q ss_pred hhCCCCeEEEEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEE Q lcl|Aclame:pro 226 YQDDPNLVAIVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRR 305 (355) Q Consensus 226 ~~~~~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR 305 (355) ..++..++++....... .+.+.-..+... ...... .....+.|+|.+.-|++|.+.++.-.++..-|+. +..++- T Consensus 335 -~~~~~~~w~~~~~~~~~-l~~~~~~~~~~g-~~~~~~-~~~~~i~G~pvv~s~~~~~~~~~~g~~~~y~i~~-r~~~~i 409 (466) T protein:vir:80 335 -YSNGMKFWAMSSNTHAV-LMSKAITFNSAG-ALVASL-NNTMPIVGGDIVILDFIPDNDIIGGYGSLYLLAE-RADIKL 409 (466) T ss_pred -ccCCceeEEecchhHHH-hhcccccccCCc-cccccC-CCcccccccceeecCccCccceeeeccccEEEEe-ecceEE Confidence 34566678876664331 222210011111 111111 1112478999999999999999888877755443 223322 Q ss_pred EEEEccchhhhhhhhhhhhhhhcc--------ccccEEEEecceecCccCCCCcCCCC Q lcl|Aclame:pro 306 SIDENPKKDRVENYESMNIDYVVE--------VYAAGCLLENITLGDFTAPAAPESGA 355 (355) Q Consensus 306 ~~~d~p~r~rve~y~s~Ne~YvVE--------d~~~~a~ienI~~~~~~~~~~~~~~a 355 (355) ..- ++. .|..-+..|.+. +.+.+..++-=++.+...+.+.+..| T Consensus 410 ~~~--~~~----~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~~~~~~~~~~~~~~~ 461 (466) T protein:vir:80 410 AQS--EHV----RFIEDQTVFKGTARYDGKPVFGEGFVAVNIANANPTTSITFAPDEA 461 (466) T ss_pred Eec--hhh----hhhcCcEEEEEEEEEccEEeccCceEEEEecCCCcccceeeecCcC Confidence 211 111 111112233222 22333333211112222222222222 No 83 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=97.74 E-value=1e-05 Score=47.94 Aligned_cols=282 Identities=10% Similarity=0.111 Sum_probs=150.1 Q ss_pred CCHHHHHHHHHHHHHHH------------------HHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchh Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVA------------------ELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAE 62 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A------------------~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e 62 (355) +...+...++++...+. ...+... ....+.|-+.....+++.+.+.+.+++++++++++. T Consensus 87 ~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~--~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~~ 164 (397) T protein:vir:12 87 NEERQQQYSKAFLKGLRGKRLTDEERDLLDSPEFRAMSGIND--EDGGILIPEDIGRQIHEFKRQFEPLEQYVTVEPVTT 164 (397) T ss_pred hhHHHHHHHHHHHHHHhccCCcHHHHHHHhhhhhhhcccccc--ccCcccCchhHHHHHHHhhhhhhhHHhhcceeeccC Confidence 22222222222222111 0011111 123466666677889999999999999999999998 Q ss_pred hhhhhhc-ccccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhh Q lcl|Aclame:pro 63 MKGEKIG-VGVTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALD 141 (355) Q Consensus 63 ~~Ge~v~-lgv~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD 141 (355) .+|+... ...+++.+.-+..+. .....+...++...+.+++.---+.|+.+.|+... .+|+..+.+.+.++++.- T Consensus 165 ~~~~~~~~~~~~~~~a~~v~Eg~--~~~~~~~~~~~~v~~~~~k~~~~~~is~e~l~ds~--~~l~~~i~~~l~~~~~~~ 240 (397) T protein:vir:12 165 RSGTRLLEKNADMVPFSPVEELG--NLPEIDQPRFTKVSYSIIDYGGIMTLSNSMLNDSD--QAIMTYVAKWFAKKSVVT 240 (397) T ss_pred CceeEEEEEecCCcceeeecccc--cccccccccceeEEeeheeeEeeehhhHHHHhhch--HHHHHHHHHHHHHHHHHH Confidence 8887533 333444343322221 11112233456677777777777889999886543 478888999999988876 Q ss_pred HHHHhhcccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcc Q lcl|Aclame:pro 142 LIMAGFNGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNL 221 (355) Q Consensus 142 ~i~IGfnG~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~l 221 (355) .-.--++|+.... |.- -.+.|.|+ ++++.. T Consensus 241 ~d~~il~G~g~~~-------------------------~~g------------------------~~~~~~i~-~~~~~~ 270 (397) T protein:vir:12 241 RNNLILAAIASLK-------------------------KVD------------------------IDGLDGIK-KALNVT 270 (397) T ss_pred HHHHHHhcccccc-------------------------ccc------------------------cccHHHHH-HHHhhc Confidence 6666677743110 100 01345543 355556 Q ss_pred cchhhhCCCCeEEEEcHHHHHHHHHHHH-hhccccchhhHHHHH-HhhhhhcccccccCCc-cCCCc-----EEEecCCC Q lcl|Aclame:pro 222 IDEVYQDDPNLVAIVGRKLLADKYFPLV-NKQQENSESLAADII-ISQKRIGNLPAVRVPY-FPANA-----VLVTTLEN 293 (355) Q Consensus 222 id~~~~~~~~LVvivG~dLl~~k~~~l~-n~~~~~te~~aa~~~-~~~k~iGGlpa~~~Pf-fP~~~-----ilIT~l~N 293 (355) +++.|+. ..+++|.+...+ .+..+ +....+- ....+. ..+.++-|+|++..+. +|+.+ +++-.+++ T Consensus 271 l~~~~~~--~a~~~~n~~~~~--~L~~lkd~~G~~l--~~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~ 344 (397) T protein:vir:12 271 LDPMVAP--GSIVLTNQDGYD--WLDTLKDGTGRYL--LQPDPTNPTKKLLDGRPVVPFTNRVLKTQKGKAPLIIGNLKE 344 (397) T ss_pred cchhhhC--CCEEEEcHHHHH--HHHHhhccCCcee--ecccccCCCCccccceeeEEecccccccCCCccEEEEEehhc Confidence 6887774 568999988765 23333 2221110 000111 1235888999987665 44332 78888888 Q ss_pred cEEEEeeCcEEEEEEEccchhhhhhhhhhhhhhhccccccEEEEe--cceecCccCC Q lcl|Aclame:pro 294 LSIYFMDESHRRSIDENPKKDRVENYESMNIDYVVEVYAAGCLLE--NITLGDFTAP 348 (355) Q Consensus 294 LsIY~Q~gs~RR~~~d~p~r~rve~y~s~Ne~YvVEd~~~~a~ie--nI~~~~~~~~ 348 (355) .-+..-+....=.+.+.+. ..|..-..+|.++-+-.+.... .+.+..-++. T Consensus 345 ~~~~~~~~~~~i~~~~~~~----~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 345 AIVLFDREQQSIASTDTGA----GAFETNSTKVRGIEREDVRKWDEDAVVFGQITVE 397 (397) T ss_pred eEEEEeecceEEEEecccc----chhhcCceEEEEEEeeccEEecccceEEEEEeeC Confidence 5433333333322222221 1122223456555444444443 3433332222 No 84 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=97.67 E-value=5e-06 Score=49.65 Aligned_cols=283 Identities=13% Similarity=0.066 Sum_probs=157.1 Q ss_pred hCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccccCCCCcCccccccccccCc Q lcl|Aclame:pro 20 NNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTTDTSGDKERQTADFTALESS 99 (355) Q Consensus 20 ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt~T~~~~~r~~~~~~~l~~~ 99 (355) .|+. .+..+.|-+.+.+.+.+.+++.|.++++.+++++.--. .++-.-.+++-|.-+. .+.. .|..-..++.. T Consensus 1 m~t~---t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~-~~ip~~~~~~~a~wv~--E~~~-~~~s~~~f~~v 73 (303) T protein:vir:97 1 MGTE---TSKASLFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNG-SKEFTFTLDSDIDVVA--ENGK-KTHGGLSLEPV 73 (303) T ss_pred Cccc---CCCCeEcchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEecCcceEEee--cCcc-ccccccceeeE Confidence 4443 23468899999999999999999999999998876322 2333323444444332 2222 23333456677 Q ss_pred ceeEEeeeecceeCHHHHHhh-cccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhhhhhhhccchhHHHHHHhh Q lcl|Aclame:pro 100 KYECNQINFDFHLKYKTLDLW-ARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTKNTLLQDVAVGWLQKYRNE 178 (355) Q Consensus 100 ~Y~c~qTn~d~~i~y~~LD~W-A~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~anPllqDVNkGWlq~~Re~ 178 (355) .+..++.---+.|+-+.|.+= ...++|.+.+.+.+.++++.-+-.-.+||+.-+..+.-. + +|+.- T Consensus 74 ~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~--~------~~~~~----- 140 (303) T protein:vir:97 74 TIVPIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASD--V------IGTNH----- 140 (303) T ss_pred EeeeEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccc--c------ccccc----- Confidence 788888887788888877432 346789999999999999988888888996422222211 1 11100 Q ss_pred ccccccccccccCCccccceeeeCCCc-chhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHHHHHHHHHHHhhccccch Q lcl|Aclame:pro 179 APARVMSNITDADGKVVSAVIRVGKNG-DYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKLLADKYFPLVNKQQENSE 257 (355) Q Consensus 179 a~~~v~~~~~~~~g~~~~~~i~~G~gg-dy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dLl~~k~~~l~n~~~~~te 257 (355) .. ......+..+.+. -|.+|.+++. .+ .+.+++. . .++|.+..... -..+-+....+-- T Consensus 141 -----~~-------~~~~~~~~~~~~~~~~~~i~~~~~----~~-~~~~~~~-~-~~vmn~~~~~~-L~~lkd~~g~~~~ 200 (303) T protein:vir:97 141 -----FD-------SKVTQVVKFTESEDADANIEAAVN----LI-QGAEGVV-T-GLAMDTEFSTA-LAKVTNGEMGPKM 200 (303) T ss_pred -----cc-------cccccccccccccchHHHHHHHHH----HH-hhcCCCc-c-EEEEcHHHHHH-HHHhhccCCCeEE Confidence 00 0011111222222 2555555433 22 2323322 2 48888776652 2233222221111 Q ss_pred hhHHHHHHhhhhhcccccccCCccCCCc--------EEEecCCCcEEEEeeCcEEEEEEEccchhh-hhhhhhhh----- Q lcl|Aclame:pro 258 SLAADIIISQKRIGNLPAVRVPYFPANA--------VLVTTLENLSIYFMDESHRRSIDENPKKDR-VENYESMN----- 323 (355) Q Consensus 258 ~~aa~~~~~~k~iGGlpa~~~PffP~~~--------ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~r-ve~y~s~N----- 323 (355) ....+.-....+|-|+|++...++|... +++=.+++.-.|..+...+=.+.+.-+.+. -.+|.-.| T Consensus 201 ~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r 280 (303) T protein:vir:97 201 YPELAWGANPDSINGLKSSVNTTVGAGADEAESKDLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLR 280 (303) T ss_pred ecCccCCCCCceecceeeEEecccCCccccCCCccEEEEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEE Confidence 0000000112378899999999998653 556667777666666655554443322111 11333334 Q ss_pred ----hhhhccccccEEEEeccee Q lcl|Aclame:pro 324 ----IDYVVEVYAAGCLLENITL 342 (355) Q Consensus 324 ----e~YvVEd~~~~a~ienI~~ 342 (355) -++.|-+.++++.+.+.++ T Consensus 281 ~~~r~~~~v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 281 AEAYIGWGILDAKSFARVTKGEV 303 (303) T ss_pred EEEEeccEeecccceEEeeCCCC Confidence 3446667777777777666 No 85 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=97.67 E-value=9.3e-06 Score=48.17 Aligned_cols=296 Identities=13% Similarity=0.082 Sum_probs=145.2 Q ss_pred HHHHHHH----hCCChHHc---ceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccccCCC- Q lcl|Aclame:pro 13 LTRVAEL----NNISTDDV---SKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTTDTSG- 84 (355) Q Consensus 13 ~~~~A~~----ngv~~~~v---~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt~T~~- 84 (355) ++.+.++ -|.+.+.. .....|-+++...+++.+++.|.++++++++++.- .+.++-.-.+++.|+-+..+. T Consensus 1 ~a~l~el~~~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~-~~~~~p~~~~~~~a~~v~eg~~ 79 (333) T protein:vir:78 1 MATLNELLPNSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISY-GETIIPTTVKRPEVGQVGVGTS 79 (333) T ss_pred CchhHHhhhhcccccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEeCCceeEeecCccc Confidence 2222222 12222111 01225667788999999999999999999988763 122333333333332222110 Q ss_pred ----CcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhh Q lcl|Aclame:pro 85 ----DKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) Q Consensus 85 ----~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~a 160 (355) .....+......+......++.---..|+.+.|+. ..++|+..+++.+.+.++.-.---.+||+-....+-+ T Consensus 80 ~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~--s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~-- 155 (333) T protein:vir:78 80 NEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARM--NPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSAL-- 155 (333) T ss_pred ccccccccccccccceeEEEEeeEEEEEeehhhHHHHhc--CHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCccc-- Confidence 01122333344555666777777778888888752 2358999999999999999888888898664433221 Q ss_pred hhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHH Q lcl|Aclame:pro 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKL 240 (355) Q Consensus 161 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dL 240 (355) . |.+ +..... ........+..++ ..+|.++ +++..+.....++ .-+++|.+.. T Consensus 156 --~------g~~------------~~~~~~---~~~~~~~~~~~~~-~~~~~i~-~~~~~~~~~~~~~--~~~~vmn~~~ 208 (333) T protein:vir:78 156 --Q------GID------------TDNVIA---NTTNVDYLQETGD-PLLDRLL-DGYDLVSANTDVE--FNGWAVDPRF 208 (333) T ss_pred --c------ccc------------cccccc---ccccccccccccc-hhHHHHH-HHHHhhccccccC--ceEEEEcchH Confidence 1 111 110000 0111122223332 2344432 3444333222222 2267777665 Q ss_pred HHHH-HHHHHhhccccchhhHHHHH--HhhhhhcccccccCCccCCC---------cEEEecCCCcEEEEeeCcEEEEEE Q lcl|Aclame:pro 241 LADK-YFPLVNKQQENSESLAADII--ISQKRIGNLPAVRVPYFPAN---------AVLVTTLENLSIYFMDESHRRSID 308 (355) Q Consensus 241 l~~k-~~~l~n~~~~~te~~aa~~~--~~~k~iGGlpa~~~PffP~~---------~ilIT~l~NLsIY~Q~gs~RR~~~ 308 (355) .+.- ..... .+....-+..... ....+|-|+|++..+++|++ .+++..+++.-|... +.++=.+. T Consensus 209 ~~~L~~~~~~--~d~~G~~i~~~~~~~~~~~~l~G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~~~~g~~-~~~~i~~~ 285 (333) T protein:vir:78 209 RAHLLRAQAY--RDANGNVDPSRINLAAQTGDVLGLPAQFGRAVGGDLGAAVDSKTRIIGGDFSQLKFGFA-DEIRIKMS 285 (333) T ss_pred HHHHHHHhhh--cCCCCceeecCccccCCCceeeceeeEEccccCCCccccCCCccEEEEEecccEEEEEe-eccEEEEe Confidence 4321 11111 1111111100000 12357899999999999976 488888988655433 33333332 Q ss_pred Eccch----hhhhhhhhhh-h--------hhhccccccEEEEecceecCccCC Q lcl|Aclame:pro 309 ENPKK----DRVENYESMN-I--------DYVVEVYAAGCLLENITLGDFTAP 348 (355) Q Consensus 309 d~p~r----~rve~y~s~N-e--------~YvVEd~~~~a~ienI~~~~~~~~ 348 (355) +.-.. ....++...| - ++.|.+.++++.+ ..+.+| T Consensus 286 ~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l-----~~~~a~ 333 (333) T protein:vir:78 286 DTATLTDSGSATVSMWQTNQIAILIEVTFGWLLGDKQAFVKF-----VDDEQP 333 (333) T ss_pred ccccccccccceeehhhcCcEEEEEEEEEccEEecccceEEE-----eccCCC Confidence 22111 1111222222 2 4444555444443 333444 No 86 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=97.66 E-value=1.9e-05 Score=46.48 Aligned_cols=299 Identities=11% Similarity=0.050 Sum_probs=156.0 Q ss_pred CCHH----HHHHHHHHHHHHHHHhC---------------------CChHHcceeeecCcHHHHHHHHHHHhhHHHhCc- Q lcl|Aclame:pro 1 MRPE----TRFKFNAYLTRVAELNN---------------------ISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKT- 54 (355) Q Consensus 1 M~~~----tr~~f~~y~~~~A~~ng---------------------v~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~- 54 (355) .+++ .-..|..|...+|..-| +.....+-.+.|-+++...+++.+.+.+.+.+. T Consensus 20 ~~~~~~~~kg~~~~~~~~a~a~~~g~~~~a~~~a~~~~~~~~~~~a~~~~~~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg 99 (366) T protein:vir:57 20 IKEELQQYKGAGMTRMVMSIAAGKGNLADAAKFAATELGDTGLSMAISTAAGSGGALIPQNMQNEVIELLRDRTVVRILG 99 (366) T ss_pred cccccccccchhHHHHHHHHHhcccchhHHHHHHHHhhcchhhhhhccccccCCccccchhHHHHHHHHHhhhcchhhhc Confidence 0000 00112222221111111 110001224446556778899999988877665 Q ss_pred Cccccchhhhhhh-hcccccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHH Q lcl|Aclame:pro 55 INILPVAEMKGEK-IGVGVTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDA 133 (355) Q Consensus 55 INv~~V~e~~Ge~-v~lgv~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~ 133 (355) .++++.. .|.. +-.-.+++-++-+ +.+......+ ..++...+..++.---+.|+-+.|+.- .++|+..+++. T Consensus 100 ~~~v~~~--~g~~~~p~~t~~~~a~wv--~E~~~~~~s~-~~f~~i~~~~~k~~~~~~iS~ell~ds--~~~~~~~i~~~ 172 (366) T protein:vir:57 100 ARSIPLP--NGNLSMPRLSGGATAGYV--GEGKDVVATG-ATFDDVKLSAKTMIALVPVSNQLIGRA--GFNVEQLLLGD 172 (366) T ss_pred eeeeecC--CCceEEEEEeCCcceeee--ccCccccccc-cceeEEEEeeEEEEEeehhhHHHHhhh--hHHHHHHHHHH Confidence 5655543 3321 1111233333322 2223333333 346777888888888888999988743 35899999999 Q ss_pred HHHHhhhhHHHHhhcccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHH Q lcl|Aclame:pro 134 IVKRQALDLIMAGFNGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDAL 213 (355) Q Consensus 134 i~~~~alD~i~IGfnG~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaL 213 (355) +.++++.-.-.--++|.-.+ .+|. |.+.. .... .......|.+.++..+|++ T Consensus 173 l~~a~~~~~d~a~l~G~G~~------~~p~------Gi~~~------------~~~~----~~~~~~~~t~~~~~~~~~~ 224 (366) T protein:vir:57 173 ILSAIATREDKAFLRDDGTG------DTPK------GMKAV------------ATAA----NRLVAWTGTAINLTTIDEY 224 (366) T ss_pred HHHHHHHHHHHHhhccCCCC------cccc------ceeec------------cccc----cceeeccccccchhhHHHH Confidence 99999977777777884311 1232 32211 1001 1112234667788888876 Q ss_pred HHHHHhcccchhhhCCCCeEEEEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCC--------c Q lcl|Aclame:pro 214 VMDATNNLIDEVYQDDPNLVAIVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPAN--------A 285 (355) Q Consensus 214 v~d~~~~lid~~~~~~~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~--------~ 285 (355) +- ++.......-......+++|.+..... +..+-..++.. +--. ..+.++-|+|++..+++|++ . T Consensus 225 ~~-~~~~~~~~~~~~~~~a~~vmn~~~~~~--L~~lkd~~G~~--l~~~--~~~g~l~G~Pvv~s~~ip~~~~~~~~~~~ 297 (366) T protein:vir:57 225 LD-SLILKHMDSNSNMIRCGWGLSNRTYMT--LFGLRDGNGNK--VYPE--MSQGILKGYPIQRTSAIPANLGDDGNESE 297 (366) T ss_pred HH-HHHHhhhccccccccCEEEecHHHHHH--HHhhhccCCce--eccC--CCCCeecceeeEEccccccccccCCCccE Confidence 43 322221111122346678898886652 33221111111 0000 12457899999999999984 3 Q ss_pred EEEecCCCcEEEEeeCcEEEEEEEccch----hhhhhhhhhh---------hhhhccccccEEEEeccee Q lcl|Aclame:pro 286 VLVTTLENLSIYFMDESHRRSIDENPKK----DRVENYESMN---------IDYVVEVYAAGCLLENITL 342 (355) Q Consensus 286 ilIT~l~NLsIY~Q~gs~RR~~~d~p~r----~rve~y~s~N---------e~YvVEd~~~~a~ienI~~ 342 (355) +++-.++++- +-.++.++=.+-+++.. -.+.+-+.+| -++.|-+.++++.+.+|.+ T Consensus 298 i~~gdfs~~~-i~~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 298 IYFCDFNDVV-IGEDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred EEEEecceEE-EEEecceEEEEeeccccccccccchhhhhcCceeEEeeeeeCcEeeccccEEEEecccC Confidence 7777777763 34555554433333211 0111122223 3456668899999999999 No 87 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=97.59 E-value=1.6e-05 Score=46.81 Aligned_cols=291 Identities=9% Similarity=-0.025 Sum_probs=149.2 Q ss_pred HHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccccCCCCcCcccccccc Q lcl|Aclame:pro 16 VAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTTDTSGDKERQTADFTA 95 (355) Q Consensus 16 ~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt~T~~~~~r~~~~~~~ 95 (355) +|- . ..+..+.|-+.+.+.+.+.+.+.|.+++..+++++..- +.++-.-.+++-|+-+. .+.+. |..... T Consensus 1 Mat---~---tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~-~~~~p~~~~~~~a~wv~--Eg~~~-~~~~~~ 70 (311) T protein:vir:99 1 MAT---F---GTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFG-NEDIITFNGRPKAEFVG--EGQQK-SSTTGE 70 (311) T ss_pred Cce---e---cCCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEeCCceeEEee--cCccc-ccccce Confidence 331 1 12346778777889999999999999999999988742 23333333445454432 22233 333345 Q ss_pred ccCcceeEEeeeecceeCHHHHHhhcc-cchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhhhhhhhccchhHHHH Q lcl|Aclame:pro 96 LESSKYECNQINFDFHLKYKTLDLWAR-FQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTKNTLLQDVAVGWLQK 174 (355) Q Consensus 96 l~~~~Y~c~qTn~d~~i~y~~LD~WA~-~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~anPllqDVNkGWlq~ 174 (355) ++...+..++.---+.|+.+.|.++.. ..+|.+.+++.+.++++.-.-.-.|+|.-....+ +|.+ ..+|+.+ T Consensus 71 f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~----~~~g---~~~~~~~ 143 (311) T protein:vir:99 71 FDFVTSTPKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGT----VIPG---WSNYLGA 143 (311) T ss_pred eeEEEEeeEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCc----cccc---ccccccc Confidence 777888888888899999999988754 6799999999999999999988899985422211 1111 1112211 Q ss_pred HHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHHHHHHHHHHHhhccc Q lcl|Aclame:pro 175 YRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKLLADKYFPLVNKQQE 254 (355) Q Consensus 175 ~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dLl~~k~~~l~n~~~~ 254 (355) . ...+..+ ..+-..+|+.+.+++..+ .....+-.--.++|.+..... -..|-..... T Consensus 144 ~--------------------~~~~~~~-~~~~~~~~~~i~~~~~~~-~~~~~~~~~~~~vmn~~~~~~-L~~lkd~~G~ 200 (311) T protein:vir:99 144 A--------------------SKRVELT-ADTIANPDLAIEAAVGLL-VANGHPTPVNGLALHPSIAWG-LSTARYTDGR 200 (311) T ss_pred c--------------------cceeecc-ccccchhHHHHHHHHHHH-hhhccCCCccEEEEcHHHHHH-HHhhhccCCC Confidence 0 0111111 122234555555555422 222222222247888776552 2222222111 Q ss_pred cchhhHHHHHHhhhhhcccccccCCccCCCcEE----------------EecCCCcEEEEeeCcEEEEEEEccchhhhhh Q lcl|Aclame:pro 255 NSESLAADIIISQKRIGNLPAVRVPYFPANAVL----------------VTTLENLSIYFMDESHRRSIDENPKKDRVEN 318 (355) Q Consensus 255 ~te~~aa~~~~~~k~iGGlpa~~~PffP~~~il----------------IT~l~NLsIY~Q~gs~RR~~~d~p~r~rve~ 318 (355) +-=. ....-....++-|+|++...++|++... +-.++++--|..+....=++.+..+-+.-.+ T Consensus 201 ~l~~-~~~~~~~~~~l~G~Pv~~s~~i~~~~~~~~~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~ 279 (311) T protein:vir:99 201 KKFP-ELGLGIGVSSFEGIDASVSDTVNGGDEADPDDEDLDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPDGQGD 279 (311) T ss_pred eeec-CcccCCCCceecceeeEeecccccccccccccchhhccCcceEEEeeccccEEEEEecCceEEEeecCCCCcchh Confidence 1100 0000011357899999999988866543 2333443222111111111111111121223 Q ss_pred hhhhh-hhhhccccccEEEEe--cceecCccC Q lcl|Aclame:pro 319 YESMN-IDYVVEVYAAGCLLE--NITLGDFTA 347 (355) Q Consensus 319 y~s~N-e~YvVEd~~~~a~ie--nI~~~~~~~ 347 (355) +..+| -+|-+|.+-.++... -+.+.++.+ T Consensus 280 ~~~~d~~~~r~~~r~d~~v~~~~~v~~~~~~A 311 (311) T protein:vir:99 280 LKRHNQIALRLEIVYGWYVFTDRFVVIENAVA 311 (311) T ss_pred hhhcCcEEEEEEEeecceecChhHeeeecccC Confidence 33333 444333333333232 244444333 No 88 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=97.59 E-value=4e-05 Score=44.68 Aligned_cols=283 Identities=10% Similarity=0.063 Sum_probs=140.0 Q ss_pred CCHH----HHHHHHHHHHHH---HHH-hCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccc Q lcl|Aclame:pro 1 MRPE----TRFKFNAYLTRV---AEL-NNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGV 72 (355) Q Consensus 1 M~~~----tr~~f~~y~~~~---A~~-ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv 72 (355) |... -+..|..|+..- ++. .+... ....|.|-+.+.+.+.+.+.+.+.+++.++++++.-.+|....+-- T Consensus 83 ~~~~~~~~~~~~~~~~lr~~~~~~~~~~~~t~--~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 160 (389) T protein:vir:10 83 LSKKPIDAKKKAINDFIHSHGKVIDATSKVTS--TEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKR 160 (389) T ss_pred cchhHHHHHHHHHHHHhhcchhhhhhhccccc--CCcceeehHHHHHHHHHHHHhhhhHHhhcceeeccCCeeEEEEEec Confidence 3322 234566665422 111 11111 2235777667778899999999999999999999877776544322 Q ss_pred cccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhccccc Q lcl|Aclame:pro 73 TGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTR 152 (355) Q Consensus 73 ~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~ 152 (355) ++.-+.- .+........+...++...+..++.---+.|+.+.|+. ..++|+..+.+.+.++++.-+-.-=.+|... T Consensus 161 ~~~~~~~--~~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~la~~~~~~~~~~i~~g~~~ 236 (389) T protein:vir:10 161 ATDRFSS--VAELAENPKLAEPEFNKVDWSVATYRGAIPLSEEAIAD--SAVDLTALVGQSIKEKSVNTYNAMIAPVLQS 236 (389) T ss_pred CCCcccc--ccccccccccccccceeeeeeheeeEeeehhhHHHHhh--hhHHHHHHHHHHHHHHHHHHHHHHHhhhhcc Confidence 2221111 11122222223334556666676666666778887764 3457899999998888775221111111100 Q ss_pred ccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCe Q lcl|Aclame:pro 153 ADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNL 232 (355) Q Consensus 153 A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~L 232 (355) +. . +......+.|.|+ ++++..+++.+. = T Consensus 237 ---------------------------------------~~--~-----~~~~~~~~~d~l~-~~~~~~~~~~~~----a 265 (389) T protein:vir:10 237 ---------------------------------------FT--A-----KKTTTDTLVDSLK-HILNVDLDPAYS----R 265 (389) T ss_pred ---------------------------------------cc--c-----ccccccccHHHHH-HHHHhhhhhhhC----c Confidence 00 0 0111233556554 456556677663 2 Q ss_pred EEEEcHHHHHHHHHHHHhhccc-cc--hhhHHHHH-HhhhhhcccccccCCc-cCCCc-----EEEecCCCcEEEEeeCc Q lcl|Aclame:pro 233 VAIVGRKLLADKYFPLVNKQQE-NS--ESLAADII-ISQKRIGNLPAVRVPY-FPANA-----VLVTTLENLSIYFMDES 302 (355) Q Consensus 233 VvivG~dLl~~k~~~l~n~~~~-~t--e~~aa~~~-~~~k~iGGlpa~~~Pf-fP~~~-----ilIT~l~NLsIY~Q~gs 302 (355) +++|.+..+. .+..+--.++ |- ........ ....+|-|+|++.++. +|+.. +++-.|++.-.++-++. T Consensus 266 ~~~~n~~~~~--~L~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~ 343 (389) T protein:vir:10 266 ALVVTQSLFN--TLDTLKDKNGRYLLHDASDSITDGTAKGTILGVPVYVVGDTLLGSLAGDQKAFVGDLKRGVLFTDRQQ 343 (389) T ss_pred EEEecHHHHH--HHHHhhccCCCeeeecCcccccccccccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecc Confidence 7888887654 3333322221 10 00000000 1234799999987653 44432 88889988644433333 Q ss_pred EEEEEEEccchhhhhhhhhhhhhh-hccccccEEEEe-----cceecCccCCCCcC Q lcl|Aclame:pro 303 HRRSIDENPKKDRVENYESMNIDY-VVEVYAAGCLLE-----NITLGDFTAPAAPE 352 (355) Q Consensus 303 ~RR~~~d~p~r~rve~y~s~Ne~Y-vVEd~~~~a~ie-----nI~~~~~~~~~~~~ 352 (355) .+=...++.. |.. .+ +++.++.. .+. .+++.+.+++.+.+ T Consensus 344 ~~i~~~~~~~------~~~---~~~~~~r~d~~-~~~~~a~~~~~~~~~~~~~~~~ 389 (389) T protein:vir:10 344 VTLAWEDSKI------YGK---YLGAAFRFGVQ-KADSKAGYFVTNTDVPGSALGK 389 (389) T ss_pred eEEEeecccc------ccc---eEEEEEEeccE-EecccceEEEEeeccCCCCCCC Confidence 3333222211 111 22 22333322 222 23444333332222 No 89 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=97.56 E-value=1.7e-05 Score=46.70 Aligned_cols=297 Identities=10% Similarity=0.025 Sum_probs=149.7 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt 80 (355) |--... ...++.. +- ....-.+-|.+.+.+.+.+++.+.+++..+++++.-... ++-.-.+++-+.-+ T Consensus 1 ~g~~~e------~~~~~~~-~t----~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~-~ip~~~~~~~a~wv 68 (397) T protein:vir:23 1 MGFSAD------HSQIAQT-KD----TMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGI-VIPHWTGDVSAQWI 68 (397) T ss_pred CCcCHH------HHHHhhc-cC----CCCccccchhHHHHHHHHHHhccchhhhcceeeccCCce-EEEEEcCCcceEEe Confidence 321111 1111111 11 111235788999999999999999999999888763221 22222334444333 Q ss_pred cCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhh Q lcl|Aclame:pro 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) Q Consensus 81 ~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~a 160 (355) . .+... +.....++...|..++.---..|+.+.|+.= .++|+..+++.+.++++.-.-.--+||.-.. T Consensus 69 ~--Eg~~~-~~s~~~f~~v~l~~~k~~~~v~iS~ell~ds--~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~------- 136 (397) T protein:vir:23 69 G--EGDMK-PITKGNMTKRDVHPAKIATIFVASAETVRAN--PANYLGTMRTKVATAIAMAFDNAALHGTNAP------- 136 (397) T ss_pred c--CCccc-cccccceeEEEEeeEEEEEeehhhHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHHhhcccCC------- Confidence 2 22222 3333457778888888888889999988832 3789999999999999988888888885421 Q ss_pred hhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHH Q lcl|Aclame:pro 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKL 240 (355) Q Consensus 161 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dL 240 (355) .|. .||+-.. + ..........|.. + .++...|. +-+++ .-+++|.+.. T Consensus 137 ~~~-----~~~~~~~----------------~----~~~~~~~~~~~~~---~-~~~~~~l~-~~~~~--~a~~vmn~~~ 184 (397) T protein:vir:23 137 SAF-----QGYLDQS----------------N----KTQSISPNAYQGL---G-VSGLTKLV-TDGKK--WTHTLLDDTV 184 (397) T ss_pred ccc-----ccccccc----------------c----ceeeecccchhHH---H-HHHHHhhh-hcccC--CCEEEEcHHH Confidence 111 1222100 0 0111122222322 2 23333343 33343 3578888876 Q ss_pred HHHHHHHHHhhccccc--hhhHHH--HHHhhhhhcccccccCCccCCCcE--EEecCCCcEEEEeeCcEEEEEEEccc-- Q lcl|Aclame:pro 241 LADKYFPLVNKQQENS--ESLAAD--IIISQKRIGNLPAVRVPYFPANAV--LVTTLENLSIYFMDESHRRSIDENPK-- 312 (355) Q Consensus 241 l~~k~~~l~n~~~~~t--e~~aa~--~~~~~k~iGGlpa~~~PffP~~~i--lIT~l~NLsIY~Q~gs~RR~~~d~p~-- 312 (355) .. +-..+-.....+- +..... ......++-|+|++..+++|++.+ ++..++++-| ...+..+-.+.++.. T Consensus 185 ~~-~L~~lkd~~G~~i~~~~~~~~~~~~~~~~tl~G~Pv~~s~~~~~g~~~~~~gDfs~~~i-~~~~~i~i~~~~e~~~~ 262 (397) T protein:vir:23 185 EP-VLNGSVDANGRPLFVESTYESLTTPFREGRILGRPTILSDHVAEGDVVGYAGDFSQIIW-GQVGGLSFDVTDQATLN 262 (397) T ss_pred HH-HHHHhhccCCceeecccccccccccccCceeeeeeEEEeCCCCCCceEEEEeecceEEE-EEEeceEEEEeeeeeee Confidence 55 2122222211110 001111 111234788999999999999976 4668888754 333334333333321 Q ss_pred --------------hhhhhhhhhhhhhhhccccccEEEEecceecCcc-CCCCcCCCC Q lcl|Aclame:pro 313 --------------KDRVENYESMNIDYVVEVYAAGCLLENITLGDFT-APAAPESGA 355 (355) Q Consensus 313 --------------r~rve~y~s~Ne~YvVEd~~~~a~ienI~~~~~~-~~~~~~~~a 355 (355) +|++.-.-..--++.|-+.++++.+..-...... ....+..+. T Consensus 263 ~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~~~~ 320 (397) T protein:vir:23 263 LGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPVLTTYALDLDGASAG 320 (397) T ss_pred eccccccceeeeeeccceeEEEEeeeccceecccceEEEeeccccceeeecccccCcc Confidence 1222111112234455555555555411111111 111111122 No 90 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=97.54 E-value=2.4e-05 Score=45.96 Aligned_cols=287 Identities=10% Similarity=0.066 Sum_probs=130.0 Q ss_pred CCHHHHHHHHHHHHHHHH--HhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAE--LNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIAS 78 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~--~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~ 78 (355) .....+..|..++..--. ..... .....|.|-..+...+. .+.+.+.+++.+++++++...+.......+++.++ T Consensus 136 ~~~~~~~~~~~~~~~~e~~~~~~~~--~~~~g~lvp~~~~~~i~-~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 212 (437) T protein:vir:10 136 IADKKVTAFADYLKTGEVRDVTGIA--LKDGKVIIPETILTPEK-EVHQFPRLGSLVRTESVTTTTGKLPIFNNSTDLLT 212 (437) T ss_pred HHHhhhhhhHHHHHhhhhhhhhhcc--cccccccchHHHHHHHH-HhhhhhhhhhcceeEeeccCceeeEEeeccccccc Confidence 222222334444432111 11111 12234555445555544 45667778899999998877766544433333332 Q ss_pred cccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCCh Q lcl|Aclame:pro 79 TTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDR 158 (355) Q Consensus 79 Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~ 158 (355) -+.-+ ......+...++...+..++.-.-+.|+.+.|+... ++|+..+.+.+.++++.-.-.-=+||.. T Consensus 213 ~~~e~--~~~~e~~~~~~~~v~~~~~k~~~~~~is~ell~ds~--~~~~~~i~~~l~~~~~~~~~~~i~~g~g------- 281 (437) T protein:vir:10 213 AHTEY--GQTTKNATPVITPILWDLKTYTGGYVFSQELISDSS--YDWQAELQSRLIELRDNTDDSLIITALT------- 281 (437) T ss_pred ccccc--ccccccccccceeeeeehhheeeehhhhHHHHhhhH--HHHHHHHHHHHHHHHHHHHHHHHhhhhc------- Confidence 22211 111111222344445555555455778888888643 4788888888888876533222233321 Q ss_pred hhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcH Q lcl|Aclame:pro 159 TKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGR 238 (355) Q Consensus 159 ~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~ 238 (355) +| ....+ + +. +.|. +.|+++.-+++.|+.+ -+|+|.+ T Consensus 282 --------------------------------~~--~~~~~--~-~~---~~~~-~~~~~~~~l~~~~~~~--~~~~~~~ 318 (437) T protein:vir:10 282 --------------------------------DG--IKKTT--S-TY---LLGD-LKKVLNVTLKPQDSAA--ASIVMSQ 318 (437) T ss_pred --------------------------------cc--ccccc--c-cc---chhh-HHHHHHhhhhhhhhcC--CEEEEcH Confidence 00 00001 1 11 1222 2344544467778754 4899999 Q ss_pred HHHHHHHHHHH-hhccccchhhHHHHH-HhhhhhcccccccCCcc--CCCc-----EEEecCCCcEEEEeeCcEEEEEEE Q lcl|Aclame:pro 239 KLLADKYFPLV-NKQQENSESLAADII-ISQKRIGNLPAVRVPYF--PANA-----VLVTTLENLSIYFMDESHRRSIDE 309 (355) Q Consensus 239 dLl~~k~~~l~-n~~~~~te~~aa~~~-~~~k~iGGlpa~~~Pff--P~~~-----ilIT~l~NLsIY~Q~gs~RR~~~d 309 (355) ..+. .+..+ +....+- ...++. ....++-|+|++.++.+ |..+ +++=.|++.-+.+-+...+=.. T Consensus 319 ~~~~--~l~~lkd~~g~~~--~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~r~~~~~~~-- 392 (437) T protein:vir:10 319 SAYN--LFDMATDAMGRPL--LQPNVTAATGYTLLGKTVVIVDDKLFPSASAGDVNIVVAPLKKAVINFKLTEITGQF-- 392 (437) T ss_pred HHHH--HHHHhhccCCCee--eccCccCCCCcccccceeEEecccccCCcCCCceEEEEeeccccEEEEeeeceEEEE-- Confidence 8765 34433 2221111 111111 12358999999998865 5443 6777777653333222221111 Q ss_pred ccchhhhhhhhhhhhhhhccccccEEEE-eccee--cCccCCCCcCCCC Q lcl|Aclame:pro 310 NPKKDRVENYESMNIDYVVEVYAAGCLL-ENITL--GDFTAPAAPESGA 355 (355) Q Consensus 310 ~p~r~rve~y~s~Ne~YvVEd~~~~a~i-enI~~--~~~~~~~~~~~~a 355 (355) .+.++ .+.- -..+++-+++...- +.|.+ ++.++....++++ T Consensus 393 ~~~~~---~~~~--~~~~~~r~d~~~~~~~a~~~l~~~~~~~~~~~~~~ 436 (437) T protein:vir:10 393 QDTYD---IWYK--QLGIFLRQNVVQASKDLIVNLTGKLKAVTVVQSTA 436 (437) T ss_pred ecccc---cccc--eeeEEEEEccEEecccceEEEEeeccccccCCCCC Confidence 11111 1111 11233333332222 13332 2223332222222 No 91 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=97.53 E-value=1.3e-05 Score=47.35 Aligned_cols=273 Identities=11% Similarity=0.072 Sum_probs=139.6 Q ss_pred HHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcc--cccccccccccCCCCcCcccccc Q lcl|Aclame:pro 16 VAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGV--GVTGTIASTTDTSGDKERQTADF 93 (355) Q Consensus 16 ~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~l--gv~~~ia~Rt~T~~~~~r~~~~~ 93 (355) +.+....... ....+.|-+.+.+.+++.+++.+.+++..+++++....|..... ...++.++-+.- +.+....+. T Consensus 1 ~l~~~~~~t~-~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~E--g~~~~~~~~ 77 (293) T protein:vir:48 1 MLDSKTDHSG-SDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDE--AGKIADIDD 77 (293) T ss_pred Cceeeccccc-CcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecC--Ccccccccc Confidence 3333322221 12357777777899999999999999999999998888875433 333344443322 222222334 Q ss_pred ccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhhhhhhhccchhHHH Q lcl|Aclame:pro 94 TALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTKNTLLQDVAVGWLQ 173 (355) Q Consensus 94 ~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~anPllqDVNkGWlq 173 (355) ..++...+.|++.---..|+.+.|+... .+++..+++.+.++++.-.-.--++|.. T Consensus 78 ~~~~~i~l~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~la~~~~~~~~~~i~~g~~---------------------- 133 (293) T protein:vir:48 78 PKLSLIKYTIKRYAGISTVTNSLLADSA--ENILAWLSGWIAKKVVVTRNKAILGVVD---------------------- 133 (293) T ss_pred cceeEEEEeeeEEEEeehhhHHHHhhhh--HHHHHHHHHHHHHHHHHHHHhHHhhccc---------------------- Confidence 4577788999999999999999998654 4788888888888876522221122211 Q ss_pred HHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHHHHHHHHHHH-hhc Q lcl|Aclame:pro 174 KYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKLLADKYFPLV-NKQ 252 (355) Q Consensus 174 ~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dLl~~k~~~l~-n~~ 252 (355) .... .+..- +.|.| .+++..+ ++.++.. -+++|.+..++ ++..+ ... T Consensus 134 -------------------~~~~----~~~~~---~~d~i-~~~~~~l-~~~~~~~--a~~vmn~~~~~--~L~~lkd~~ 181 (293) T protein:vir:48 134 -------------------KLPT----KPTLT---KWDDI-IDLEAKV-DPAIKQT--SFFLTNTSGFT--ALKKVKNAL 181 (293) T ss_pred -------------------cccc----ccccc---CHHHH-HHHHHhh-hhhhcCC--CEEEEcHHHHH--HHHHhhccC Confidence 0000 01112 34443 3355544 5556643 47888887765 23322 211 Q ss_pred cccchhhHHHHH-HhhhhhcccccccCC--ccCCC-----cEEEecCCCc-EEEEeeCcEEEEEEEccchhhhhhhhhhh Q lcl|Aclame:pro 253 QENSESLAADII-ISQKRIGNLPAVRVP--YFPAN-----AVLVTTLENL-SIYFMDESHRRSIDENPKKDRVENYESMN 323 (355) Q Consensus 253 ~~~te~~aa~~~-~~~k~iGGlpa~~~P--ffP~~-----~ilIT~l~NL-sIY~Q~gs~RR~~~d~p~r~rve~y~s~N 323 (355) ..+- ...++. ....+|-|+|++.++ ++|.. .+++-.+++. -|..+.+ .+=...+. ..++...| T Consensus 182 g~~l--~~~~~~~~~~~~l~G~Pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~-~~i~~~~~-----~~~~~~~~ 253 (293) T protein:vir:48 182 GDYL--MERDVKSPTGYSIAGFAVKEISDRWLPNASSGVMPLYFGDLKQAVTLFDRQQ-MSLLSTNI-----GGGAFETD 253 (293) T ss_pred CceE--eecCcCCCCCceecceeeEEecccccCCccCCceEEEEEeccceEEEEEecc-eEEEEecc-----cchhhhcC Confidence 1110 000000 123589999998754 45543 2677777764 3333333 22221111 11233333 Q ss_pred -hhhhccccccEEEEe--cc---eecCccCCCCcC-CCC Q lcl|Aclame:pro 324 -IDYVVEVYAAGCLLE--NI---TLGDFTAPAAPE-SGA 355 (355) Q Consensus 324 -e~YvVEd~~~~a~ie--nI---~~~~~~~~~~~~-~~a 355 (355) .+|.++.+--+...+ .| ++....+|.+.. .-| T Consensus 254 ~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~~~~~~~~~~ 292 (293) T protein:vir:48 254 TTKVRVIDRFDVVATDTEAFVPASFKAIADQKGNIGSTA 292 (293) T ss_pred eEEEEEEEeeCcEEecccceEEEEeeccccCCccccccC Confidence 334333322222222 33 333322221111 111 No 92 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=97.49 E-value=5.6e-05 Score=43.88 Aligned_cols=285 Identities=12% Similarity=0.123 Sum_probs=142.9 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhh-hccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEK-IGVGVTGTIAST 79 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~-v~lgv~~~ia~R 79 (355) ++.+.+........+.+ .+.... ....+.|-+.....+++.+++.|.+++++++++|.-..|.. +....+++-++- T Consensus 89 ~~~~~~~~~~~~~~~~~-~~~~t~--~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~ 165 (392) T protein:vir:10 89 LNAEEREFLEDDLEQRA-MSGLTG--EDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAE 165 (392) T ss_pred ccHHHHHHHhhhhhhhh-cccccc--CCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCcccee Confidence 12222222222222211 111111 12467787777789999999999999999999998777764 333333333333 Q ss_pred ccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChh Q lcl|Aclame:pro 80 TDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRT 159 (355) Q Consensus 80 t~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~ 159 (355) +.-+ ......+...++.....+++.---+.|+.+.|+.. .++|...+.+.+.+.++.-.-.--++|...+. T Consensus 166 v~E~--~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~----- 236 (392) T protein:vir:10 166 ITEM--GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS--DQNILKYVTKWLGKKSKVTRNVLILGVIEKLT----- 236 (392) T ss_pred eccc--ccccccccccceeEEeeeeeEEEeehhhHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHHhhcccccc----- Confidence 2222 22222233346677788888888888999999874 46889999999988887643333333322110 Q ss_pred hhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHH Q lcl|Aclame:pro 160 KNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRK 239 (355) Q Consensus 160 anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~d 239 (355) . .... +.|.++ ++++..+++.|+ +..+++|.+. T Consensus 237 ----------------------------------~-------~~~~---~~d~i~-~~~~~~l~~~~~--~~a~~vm~~~ 269 (392) T protein:vir:10 237 ----------------------------------K-------QAIK---SLDDIK-DVLNVKLDPAIS--PNAILLTNQD 269 (392) T ss_pred ----------------------------------c-------cCcc---CHHHHH-HHHHHhhhhhhc--cCCEEEEcHH Confidence 0 0111 234433 345445677776 4578999988 Q ss_pred HHHHHHHHHH-hhccccc--hhhHHHHHHhhhhhccccc-ccCCcc-CCC------c--EEEecCCCcEEEEeeCcEEEE Q lcl|Aclame:pro 240 LLADKYFPLV-NKQQENS--ESLAADIIISQKRIGNLPA-VRVPYF-PAN------A--VLVTTLENLSIYFMDESHRRS 306 (355) Q Consensus 240 Ll~~k~~~l~-n~~~~~t--e~~aa~~~~~~k~iGGlpa-~~~Pff-P~~------~--ilIT~l~NLsIY~Q~gs~RR~ 306 (355) .+.. +..+ .....+- ..... ....+|-|.|. ++.+.+ |.+ . +++=.|++...-..++..+=. T Consensus 270 ~~~~--L~~lkd~~G~~l~~~~~~~---~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~ 344 (392) T protein:vir:10 270 GFNY--LDKLKDKDGKYILQSDPTQ---KNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELA 344 (392) T ss_pred HHHH--HHHhhccCCCeEeecCccC---CccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEE Confidence 7652 3322 1111110 00000 11345667654 444332 211 1 444455553322223333322 Q ss_pred EEEccchhhhhhhhhhh-hhhhccccccEEEEe--cceec--CccCCCCcCCC Q lcl|Aclame:pro 307 IDENPKKDRVENYESMN-IDYVVEVYAAGCLLE--NITLG--DFTAPAAPESG 354 (355) Q Consensus 307 ~~d~p~r~rve~y~s~N-e~YvVEd~~~~a~ie--nI~~~--~~~~~~~~~~~ 354 (355) . .+.- .+++..| -+|.++-+--++... .|... ...+|+++++| T Consensus 345 ~--~~~~---~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 345 S--TDVG---GKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred E--eccc---cchhhcCceEEEEEEeeccEEecccceEEEEecccccccCCCC Confidence 2 2211 1233334 456555544444443 55542 23556665555 No 93 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=97.49 E-value=5.6e-05 Score=43.88 Aligned_cols=285 Identities=12% Similarity=0.123 Sum_probs=142.9 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhh-hccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEK-IGVGVTGTIAST 79 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~-v~lgv~~~ia~R 79 (355) ++.+.+........+.+ .+.... ....+.|-+.....+++.+++.|.+++++++++|.-..|.. +....+++-++- T Consensus 89 ~~~~~~~~~~~~~~~~~-~~~~t~--~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~ 165 (392) T protein:vir:10 89 LNAEEREFLEDDLEQRA-MSGLTG--EDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAE 165 (392) T ss_pred ccHHHHHHHhhhhhhhh-cccccc--CCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCcccee Confidence 12222222222222211 111111 12467787777789999999999999999999998777764 333333333333 Q ss_pred ccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChh Q lcl|Aclame:pro 80 TDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRT 159 (355) Q Consensus 80 t~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~ 159 (355) +.-+ ......+...++.....+++.---+.|+.+.|+.. .++|...+.+.+.+.++.-.-.--++|...+. T Consensus 166 v~E~--~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~----- 236 (392) T protein:vir:10 166 ITEM--GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS--DQNILKYVTKWLGKKSKVTRNVLILGVIEKLT----- 236 (392) T ss_pred eccc--ccccccccccceeEEeeeeeEEEeehhhHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHHhhcccccc----- Confidence 2222 22222233346677788888888888999999874 46889999999988887643333333322110 Q ss_pred hhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHH Q lcl|Aclame:pro 160 KNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRK 239 (355) Q Consensus 160 anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~d 239 (355) . .... +.|.++ ++++..+++.|+ +..+++|.+. T Consensus 237 ----------------------------------~-------~~~~---~~d~i~-~~~~~~l~~~~~--~~a~~vm~~~ 269 (392) T protein:vir:10 237 ----------------------------------K-------QAIK---SLDDIK-DVLNVKLDPAIS--PNAILLTNQD 269 (392) T ss_pred ----------------------------------c-------cCcc---CHHHHH-HHHHHhhhhhhc--cCCEEEEcHH Confidence 0 0111 234433 345445677776 4578999988 Q ss_pred HHHHHHHHHH-hhccccc--hhhHHHHHHhhhhhccccc-ccCCcc-CCC------c--EEEecCCCcEEEEeeCcEEEE Q lcl|Aclame:pro 240 LLADKYFPLV-NKQQENS--ESLAADIIISQKRIGNLPA-VRVPYF-PAN------A--VLVTTLENLSIYFMDESHRRS 306 (355) Q Consensus 240 Ll~~k~~~l~-n~~~~~t--e~~aa~~~~~~k~iGGlpa-~~~Pff-P~~------~--ilIT~l~NLsIY~Q~gs~RR~ 306 (355) .+.. +..+ .....+- ..... ....+|-|.|. ++.+.+ |.+ . +++=.|++...-..++..+=. T Consensus 270 ~~~~--L~~lkd~~G~~l~~~~~~~---~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~ 344 (392) T protein:vir:10 270 GFNY--LDKLKDKDGKYILQSDPTQ---KNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELA 344 (392) T ss_pred HHHH--HHHhhccCCCeEeecCccC---CccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEE Confidence 7652 3322 1111110 00000 11345667654 444332 211 1 444455553322223333322 Q ss_pred EEEccchhhhhhhhhhh-hhhhccccccEEEEe--cceec--CccCCCCcCCC Q lcl|Aclame:pro 307 IDENPKKDRVENYESMN-IDYVVEVYAAGCLLE--NITLG--DFTAPAAPESG 354 (355) Q Consensus 307 ~~d~p~r~rve~y~s~N-e~YvVEd~~~~a~ie--nI~~~--~~~~~~~~~~~ 354 (355) . .+.- .+++..| -+|.++-+--++... .|... ...+|+++++| T Consensus 345 ~--~~~~---~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 345 S--TDVG---GKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred E--eccc---cchhhcCceEEEEEEeeccEEecccceEEEEecccccccCCCC Confidence 2 2211 1233334 456555544444443 55542 23556665555 No 94 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=97.49 E-value=5.6e-05 Score=43.88 Aligned_cols=285 Identities=12% Similarity=0.123 Sum_probs=142.9 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhh-hccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEK-IGVGVTGTIAST 79 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~-v~lgv~~~ia~R 79 (355) ++.+.+........+.+ .+.... ....+.|-+.....+++.+++.|.+++++++++|.-..|.. +....+++-++- T Consensus 89 ~~~~~~~~~~~~~~~~~-~~~~t~--~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~ 165 (392) T protein:vir:10 89 LNAEEREFLEDDLEQRA-MSGLTG--EDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAE 165 (392) T ss_pred ccHHHHHHHhhhhhhhh-cccccc--CCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCcccee Confidence 12222222222222211 111111 12467787777789999999999999999999998777764 333333333333 Q ss_pred ccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChh Q lcl|Aclame:pro 80 TDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRT 159 (355) Q Consensus 80 t~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~ 159 (355) +.-+ ......+...++.....+++.---+.|+.+.|+.. .++|...+.+.+.+.++.-.-.--++|...+. T Consensus 166 v~E~--~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~----- 236 (392) T protein:vir:10 166 ITEM--GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS--DQNILKYVTKWLGKKSKVTRNVLILGVIEKLT----- 236 (392) T ss_pred eccc--ccccccccccceeEEeeeeeEEEeehhhHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHHhhcccccc----- Confidence 2222 22222233346677788888888888999999874 46889999999988887643333333322110 Q ss_pred hhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHH Q lcl|Aclame:pro 160 KNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRK 239 (355) Q Consensus 160 anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~d 239 (355) . .... +.|.++ ++++..+++.|+ +..+++|.+. T Consensus 237 ----------------------------------~-------~~~~---~~d~i~-~~~~~~l~~~~~--~~a~~vm~~~ 269 (392) T protein:vir:10 237 ----------------------------------K-------QAIK---SLDDIK-DVLNVKLDPAIS--PNAILLTNQD 269 (392) T ss_pred ----------------------------------c-------cCcc---CHHHHH-HHHHHhhhhhhc--cCCEEEEcHH Confidence 0 0111 234433 345445677776 4578999988 Q ss_pred HHHHHHHHHH-hhccccc--hhhHHHHHHhhhhhccccc-ccCCcc-CCC------c--EEEecCCCcEEEEeeCcEEEE Q lcl|Aclame:pro 240 LLADKYFPLV-NKQQENS--ESLAADIIISQKRIGNLPA-VRVPYF-PAN------A--VLVTTLENLSIYFMDESHRRS 306 (355) Q Consensus 240 Ll~~k~~~l~-n~~~~~t--e~~aa~~~~~~k~iGGlpa-~~~Pff-P~~------~--ilIT~l~NLsIY~Q~gs~RR~ 306 (355) .+.. +..+ .....+- ..... ....+|-|.|. ++.+.+ |.+ . +++=.|++...-..++..+=. T Consensus 270 ~~~~--L~~lkd~~G~~l~~~~~~~---~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~ 344 (392) T protein:vir:10 270 GFNY--LDKLKDKDGKYILQSDPTQ---KNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELA 344 (392) T ss_pred HHHH--HHHhhccCCCeEeecCccC---CccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEE Confidence 7652 3322 1111110 00000 11345667654 444332 211 1 444455553322223333322 Q ss_pred EEEccchhhhhhhhhhh-hhhhccccccEEEEe--cceec--CccCCCCcCCC Q lcl|Aclame:pro 307 IDENPKKDRVENYESMN-IDYVVEVYAAGCLLE--NITLG--DFTAPAAPESG 354 (355) Q Consensus 307 ~~d~p~r~rve~y~s~N-e~YvVEd~~~~a~ie--nI~~~--~~~~~~~~~~~ 354 (355) . .+.- .+++..| -+|.++-+--++... .|... ...+|+++++| T Consensus 345 ~--~~~~---~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 345 S--TDVG---GKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred E--eccc---cchhhcCceEEEEEEeeccEEecccceEEEEecccccccCCCC Confidence 2 2211 1233334 456555544444443 55542 23556665555 No 95 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=97.49 E-value=5.6e-05 Score=43.88 Aligned_cols=285 Identities=12% Similarity=0.123 Sum_probs=142.9 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhh-hccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEK-IGVGVTGTIAST 79 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~-v~lgv~~~ia~R 79 (355) ++.+.+........+.+ .+.... ....+.|-+.....+++.+++.|.+++++++++|.-..|.. +....+++-++- T Consensus 89 ~~~~~~~~~~~~~~~~~-~~~~t~--~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~ 165 (392) T protein:vir:10 89 LNAEEREFLEDDLEQRA-MSGLTG--EDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAE 165 (392) T ss_pred ccHHHHHHHhhhhhhhh-cccccc--CCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCcccee Confidence 12222222222222211 111111 12467787777789999999999999999999998777764 333333333333 Q ss_pred ccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChh Q lcl|Aclame:pro 80 TDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRT 159 (355) Q Consensus 80 t~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~ 159 (355) +.-+ ......+...++.....+++.---+.|+.+.|+.. .++|...+.+.+.+.++.-.-.--++|...+. T Consensus 166 v~E~--~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~----- 236 (392) T protein:vir:10 166 ITEM--GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS--DQNILKYVTKWLGKKSKVTRNVLILGVIEKLT----- 236 (392) T ss_pred eccc--ccccccccccceeEEeeeeeEEEeehhhHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHHhhcccccc----- Confidence 2222 22222233346677788888888888999999874 46889999999988887643333333322110 Q ss_pred hhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHH Q lcl|Aclame:pro 160 KNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRK 239 (355) Q Consensus 160 anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~d 239 (355) . .... +.|.++ ++++..+++.|+ +..+++|.+. T Consensus 237 ----------------------------------~-------~~~~---~~d~i~-~~~~~~l~~~~~--~~a~~vm~~~ 269 (392) T protein:vir:10 237 ----------------------------------K-------QAIK---SLDDIK-DVLNVKLDPAIS--PNAILLTNQD 269 (392) T ss_pred ----------------------------------c-------cCcc---CHHHHH-HHHHHhhhhhhc--cCCEEEEcHH Confidence 0 0111 234433 345445677776 4578999988 Q ss_pred HHHHHHHHHH-hhccccc--hhhHHHHHHhhhhhccccc-ccCCcc-CCC------c--EEEecCCCcEEEEeeCcEEEE Q lcl|Aclame:pro 240 LLADKYFPLV-NKQQENS--ESLAADIIISQKRIGNLPA-VRVPYF-PAN------A--VLVTTLENLSIYFMDESHRRS 306 (355) Q Consensus 240 Ll~~k~~~l~-n~~~~~t--e~~aa~~~~~~k~iGGlpa-~~~Pff-P~~------~--ilIT~l~NLsIY~Q~gs~RR~ 306 (355) .+.. +..+ .....+- ..... ....+|-|.|. ++.+.+ |.+ . +++=.|++...-..++..+=. T Consensus 270 ~~~~--L~~lkd~~G~~l~~~~~~~---~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~ 344 (392) T protein:vir:10 270 GFNY--LDKLKDKDGKYILQSDPTQ---KNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELA 344 (392) T ss_pred HHHH--HHHhhccCCCeEeecCccC---CccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEE Confidence 7652 3322 1111110 00000 11345667654 444332 211 1 444455553322223333322 Q ss_pred EEEccchhhhhhhhhhh-hhhhccccccEEEEe--cceec--CccCCCCcCCC Q lcl|Aclame:pro 307 IDENPKKDRVENYESMN-IDYVVEVYAAGCLLE--NITLG--DFTAPAAPESG 354 (355) Q Consensus 307 ~~d~p~r~rve~y~s~N-e~YvVEd~~~~a~ie--nI~~~--~~~~~~~~~~~ 354 (355) . .+.- .+++..| -+|.++-+--++... .|... ...+|+++++| T Consensus 345 ~--~~~~---~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 345 S--TDVG---GKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred E--eccc---cchhhcCceEEEEEEeeccEEecccceEEEEecccccccCCCC Confidence 2 2211 1233334 456555544444443 55542 23556665555 No 96 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=97.48 E-value=4.5e-06 Score=49.88 Aligned_cols=295 Identities=13% Similarity=0.044 Sum_probs=138.4 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt 80 (355) ++++.|..|+++.. +. .....|.|-+++..++.+.+.+.|.++++++++++.- +-++-...+++.|+=+ T Consensus 72 lt~~e~~~~~~~~~------~~---~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~~~--~~~i~~~~~~~~a~w~ 140 (383) T protein:vir:78 72 ITNEEIKFFNDINK------EV---GYKEETLLPQTVVDEIFEDLTTEHPFLASIGMRTTGL--RTKFLKSETSGVAVWG 140 (383) T ss_pred hhHHHHHHHHHHhc------cC---CCCCccccCHHHHHHHHHHHHhhccceeeeeeEecCC--ceEEEEEcCCcceEEe Confidence 44555554443321 11 1234688888899999999999999999999888752 1233333344433321 Q ss_pred cCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhh Q lcl|Aclame:pro 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) Q Consensus 81 ~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~a 160 (355) . ...++.......++...+.+++.=--..|+.+.|+.=. .+++..+++.+.+++|.=.-.--++|+- +. T Consensus 141 ~--e~~~~~~~~~~~f~~i~l~~~kl~~~i~is~ell~Ds~--~~ie~~i~~~l~~~~a~~~~~a~i~G~G-------~~ 209 (383) T protein:vir:78 141 K--IFGEIKGQLDATFSDEESIQNKLTAFVVVPKDLEKFGP--AWVKRFVVTQIEEAFAVALESAYIVGDG-------ND 209 (383) T ss_pred e--cccccccccCcceeeEeecceeeEeeccchHHHhhccH--HHHHHHHHHHHHHHHHHHHhhheEeccC-------CC Confidence 1 11222222222355556666666666889999998532 2678888888888887655455556732 11 Q ss_pred hhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHH---HHHHHHhccc----chhhhCCCCeE Q lcl|Aclame:pro 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDA---LVMDATNNLI----DEVYQDDPNLV 233 (355) Q Consensus 161 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDa---Lv~d~~~~li----d~~~~~~~~LV 233 (355) . -+|+|..+= . .+... +....+....|. -.+..++. ++..+.+... ....+-...++ T Consensus 210 q------P~Gil~~~~---~-----~~~~~-~~~~~~~~~~~~-~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 273 (383) T protein:vir:78 210 K------PIGLNRKVG---K-----GSTVV-DGVYAEKAATGT-LTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVT 273 (383) T ss_pred C------ceeeeeccC---C-----ccccc-ccccccccccch-hhhhhhHHHHHHHHHHHhccchhcccchhhhcCceE Confidence 1 236653210 0 00000 011111111111 11222222 2222221110 11112245678 Q ss_pred EEEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhc--ccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEEcc Q lcl|Aclame:pro 234 AIVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIG--NLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENP 311 (355) Q Consensus 234 vivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iG--Glpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p 311 (355) ++|++.-..+ -.|.+...+.+ ++- .++- |++.+..+++|++.++.-.++.--| ..++.+|=..-+ T Consensus 274 ~~~n~~~~~~-~~~~~~~~~~~-----G~~----~t~l~~~~~iv~s~~~p~~~iifgdfs~Y~i-~~r~~~~i~~~~-- 340 (383) T protein:vir:78 274 LLVNPTDAWD-VKKQYTSLNAN-----GVY----VTALPFNLNIIESLFVPEKKAISYVAERYDA-LIGGPLDIGTYD-- 340 (383) T ss_pred EEEcCcchhh-hccchhccCCC-----Cce----eeecCCCceEEecCCCCcccEEEeeccceEE-EecccceEEecc-- Confidence 8888632111 12222111111 110 1233 4446778999999999888888544 344444422111 Q ss_pred chhhhhhhhhhhhhhhcc--------ccccEEEEecceecCccCCCCcCC Q lcl|Aclame:pro 312 KKDRVENYESMNIDYVVE--------VYAAGCLLENITLGDFTAPAAPES 353 (355) Q Consensus 312 ~r~rve~y~s~Ne~YvVE--------d~~~~a~ienI~~~~~~~~~~~~~ 353 (355) +. -|..-..+|..= |.+++..++ |++ ++.+..|++ T Consensus 341 ~~----~f~~d~~~f~~~~r~dG~~~~~~A~~vl~-~~~--~~~~~~~~~ 383 (383) T protein:vir:78 341 QT----LAIEDLNLYAAKQFAYGKAKDDKAAAVWT-LNI--NPAEQTPEG 383 (383) T ss_pred hh----hhhcCceEEEEEEEEcCEEecCCeEEEEE-EEe--cCCCCCCCC Confidence 11 011111222111 222233333 333 334444554 No 97 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=97.47 E-value=5.3e-05 Score=44.00 Aligned_cols=302 Identities=11% Similarity=0.119 Sum_probs=143.9 Q ss_pred CCHHHHHHHHHHHHHHHHH-----------hCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAEL-----------NNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIG 69 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~-----------ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~ 69 (355) .....+.....++...... ..... .....+.|-+.+...+.+.+++.+.+++++++++|....|.... T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~~-~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~ 158 (404) T protein:vir:10 80 GALFVRAIADNLLKQKNQRGLNLSEKEINAISENI-DEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRTY 158 (404) T ss_pred HHHHHHHHHHHHHHHHHhhhhcchhhHHhhhcccc-CCCCceeechhHHHHHHHHHhhhhhHhhhhceeeccCCccceEE Confidence 1111122222222222111 11110 12345677778889999999999999999999999988886532 Q ss_pred -ccccccccccccCCCCcCccccc--cccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHh Q lcl|Aclame:pro 70 -VGVTGTIASTTDTSGDKERQTAD--FTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAG 146 (355) Q Consensus 70 -lgv~~~ia~Rt~T~~~~~r~~~~--~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IG 146 (355) ...+++-+.-+..+. .. +.. ...++...+..++.---+.|+.+.|+. ..++|+..+++.+.+.++.-.-.-= T Consensus 159 ~~~~~~~~~~~v~e~~--~~-~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~la~~~~~~~~~~i 233 (404) T protein:vir:10 159 EKRSKQKPMKPLSENQ--QI-PTNGDNGKLERFNFKLKDLADFMSIPNDLLKF--ADKSLEDWIINWFVDKVRITRNAEI 233 (404) T ss_pred EEecCCcceeeccccc--cc-cccccccceeeeEeeheeeEeeehhhHHHHhh--cHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222333333332221 11 111 122344555555555556777777764 1247888888888887776544444 Q ss_pred hcccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhh Q lcl|Aclame:pro 147 FNGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVY 226 (355) Q Consensus 147 fnG~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~ 226 (355) ++|+- ++ .+|.+ +++.. ....+..+....|..|..++. .-+++-| T Consensus 234 l~G~g----~~--~~~~g------------------i~~~~-------~~~~~~~~~~~~~~~~~~~~~----~~l~~~~ 278 (404) T protein:vir:10 234 LYGAG----GD--EHATG------------------IMTAN-------KFKKITLPKSPALKDFKKCKN----VELLNVF 278 (404) T ss_pred hhcCC----CC--Ccccc------------------eeecc-------ccceeeccccccHHHHHHHHH----hhhhccc Confidence 56622 10 11211 11100 111234455566766665332 2235555 Q ss_pred hCCCCeEEEEcHHHHHHHHHHHHhhccccchhhHHHHH-HhhhhhcccccccCC-ccCCCc-----EEEecCCCcEEEEe Q lcl|Aclame:pro 227 QDDPNLVAIVGRKLLADKYFPLVNKQQENSESLAADII-ISQKRIGNLPAVRVP-YFPANA-----VLVTTLENLSIYFM 299 (355) Q Consensus 227 ~~~~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~-~~~k~iGGlpa~~~P-ffP~~~-----ilIT~l~NLsIY~Q 299 (355) + +..+++|.+..++ .+..+--.++.. .....+. ....+|-|+|++.+| .+|+.+ +++-.+++.-..+. T Consensus 279 ~--~~~~~v~n~~~~~--~L~~lkd~~G~~-l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~gd~s~~~~~~~ 353 (404) T protein:vir:10 279 K--ATSSWIVNQDGFN--YLDSLEDKTGRP-YLQPDPKDPTQYRFLGLPVIELPNDLLLSTESAIPVLLGDTKEAYKYVS 353 (404) T ss_pred c--CCCEEEEcHHHHH--HHHHhhccCCce-eeccCcCCCCCccccceeeEEecccccCCCCCccEEEEEeccccEEEEE Confidence 4 3568899988765 233331111110 0000000 123478899998654 456554 77788887544444 Q ss_pred eCcEEEEEEEccchhhhhhhhhhhhhhhccccccEEEEe--cceecCccCCCCcC Q lcl|Aclame:pro 300 DESHRRSIDENPKKDRVENYESMNIDYVVEVYAAGCLLE--NITLGDFTAPAAPE 352 (355) Q Consensus 300 ~gs~RR~~~d~p~r~rve~y~s~Ne~YvVEd~~~~a~ie--nI~~~~~~~~~~~~ 352 (355) ++...=.+.+.+. -+|..-..+|.++-+--++... .+.+...++.++|+ T Consensus 354 ~~~~~i~~~~~~~----~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~aa~~~ 404 (404) T protein:vir:10 354 DGAYELATTNIGA----GAFETNTTKARIIMRIDGNVKDSEALLIAEIPVESVQA 404 (404) T ss_pred ecceEEEEecccc----chhhcCceEEEEEEeeccEEecccceEEEEeecccCCC Confidence 4444433322221 1121111334444333333332 33333333333333 No 98 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=97.46 E-value=2.2e-05 Score=46.13 Aligned_cols=277 Identities=12% Similarity=0.054 Sum_probs=138.0 Q ss_pred CCHHHHHHHHHHHHHHHHH-------hCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAEL-------NNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVT 73 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~-------ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~ 73 (355) ........+..+....... .|+.. ....+.|-+.....+++.+.+.+.+++.++++++..-.+....+..+ T Consensus 103 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~--~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 180 (394) T protein:vir:97 103 NDSLRFEGKDEVLMPINETTPVEPQKDGIKK--ENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRA 180 (394) T ss_pred hhhhhhhhHHHHHHHHHhhhhhhhhcccccc--ccccccChHHHHHHHHHHhhhhhhhhhhceeeeccCcceEEEEEecC Confidence 1111222233333322221 12221 12345666777889999999999999999999998877765544333 Q ss_pred ccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccc Q lcl|Aclame:pro 74 GTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRA 153 (355) Q Consensus 74 ~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A 153 (355) ++-+.-+ +.+......+...++...+.+++.---+.|+.+.|+.= .++|+..+.+.+.++++.-.-.--.+|.. T Consensus 181 ~~~~~~v--~E~~~~~~~~~~~~~~v~l~~~k~~~~i~is~ell~ds--~~~~~~~i~~~la~~~~~~~~~~i~~g~~-- 254 (394) T protein:vir:97 181 TTKMVTV--AELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDA--DVDLVGIVSESISQIKVNTTNDAIAKVLK-- 254 (394) T ss_pred CCcccee--cccccccccccccceeEEeehhheeeehhhHHHHHhhh--hHHHHHHHHHHHHHHHHHHHHHHHhhccc-- Confidence 3222211 21222222233345666677776666677888877632 24788888888887777522111111110 Q ss_pred cCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeE Q lcl|Aclame:pro 154 DTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLV 233 (355) Q Consensus 154 ~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LV 233 (355) . +....-.+.|.|+ ++++..+++.+. =+ T Consensus 255 ---------------------------------------~--------~~~~~~~~~~~~~-~~~~~~~~~~~~----a~ 282 (394) T protein:vir:97 255 ---------------------------------------S--------FTTKTVKNLDEIK-ALLNGGFDPAYN----VS 282 (394) T ss_pred ---------------------------------------c--------ccccccccHHHHH-HHHHhhhhhhhC----CE Confidence 0 0111122455544 456666776543 26 Q ss_pred EEEcHHHHHHHHHHHHhhccccchhhHHHHH-HhhhhhcccccccCCc--cCCCcEEEecCCCcEEEEeeCcEEEEEEEc Q lcl|Aclame:pro 234 AIVGRKLLADKYFPLVNKQQENSESLAADII-ISQKRIGNLPAVRVPY--FPANAVLVTTLENLSIYFMDESHRRSIDEN 310 (355) Q Consensus 234 vivG~dLl~~k~~~l~n~~~~~te~~aa~~~-~~~k~iGGlpa~~~Pf--fP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~ 310 (355) ++|.+.-.. .+..+-..++.. .....+. ....+|-|+|++..|. +|.+.+++=.+++...++-+....=...++ T Consensus 283 ~v~n~~~~~--~l~~lkd~~G~~-i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~ 359 (394) T protein:vir:97 283 LIVSQSFYQ--TLDTLKDGNGRY-LLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADN 359 (394) T ss_pred EEEcHHHHH--HHHHhhccCCCe-eeecCcCCCCCceeccceeEEecccccCCccEEEeeccccEEEEEecceEEEEecc Confidence 888887654 233332122110 0000010 1134789999998774 777778888887754444333332222222 Q ss_pred cchhhhhhhhhhhhhhhccc-cccEEEE-e---cceecCccCCC Q lcl|Aclame:pro 311 PKKDRVENYESMNIDYVVEV-YAAGCLL-E---NITLGDFTAPA 349 (355) Q Consensus 311 p~r~rve~y~s~Ne~YvVEd-~~~~a~i-e---nI~~~~~~~~~ 349 (355) +. -+.+|.++. ++....- + -|++...++|- T Consensus 360 ~~---------~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~p~ 394 (394) T protein:vir:97 360 EI---------YGQYLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) T ss_pred cc---------cceeEEEEEEEccEEecccceEEEEecccccCC Confidence 11 122333322 2222222 1 33444444444 No 99 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=97.44 E-value=2.5e-05 Score=45.82 Aligned_cols=294 Identities=12% Similarity=0.061 Sum_probs=158.3 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt 80 (355) |+.-+. |+.-...++.... ......|-|.+.+.+.+.+++.+.+++.++++++.-... ++-.-.+++-+.-+ T Consensus 1 ~~~~~~--~~~e~~~~~~~~~-----~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-~ip~~~~~~~a~~v 72 (318) T protein:vir:24 1 MAAGTA--FAVDHAQIAQTGD-----TMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQ-KIPHWVGDVSAQWI 72 (318) T ss_pred CCCCCC--CCHHHHHhhcccC-----cccceeechhHHHHHHHHHHhhchhhhhcceeeccCCce-EEEEEeCCcceEEe Confidence 444322 2222222222111 223567888899999999999999999999998864322 22222334444332 Q ss_pred cCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhh Q lcl|Aclame:pro 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) Q Consensus 81 ~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~a 160 (355) . .+.+. +.....++...+.+++.---+.|+.+.|+. ..++|+..+++.+.++++.-.-.--+||+-... T Consensus 73 ~--Eg~~~-~~~~~~f~~i~~~~~k~~~~~~iS~e~l~d--s~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~------ 141 (318) T protein:vir:24 73 G--EGDMK-PITKGNMTSQTIAPHKIATIFVASAETVRA--NPANYLGTMRTKVATAFAMAFDGAAMHGTDSPF------ 141 (318) T ss_pred c--CCccc-cccccceeEEEEeeEEEEEeehhhHHHhhc--ChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCC------ Confidence 2 22223 333345778889999988888899988875 236899999999999999877777789854211 Q ss_pred hhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHH Q lcl|Aclame:pro 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKL 240 (355) Q Consensus 161 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dL 240 (355) |. |=++ .. ..... .+..+.=...|..+.+++.. +.+.++ ...+++|.+.. T Consensus 142 -~~------~~~~------------~~----~~~~~----~~~~~~~~~~~~~~~~~~~~-~~~~~~--~~~~~v~n~~~ 191 (318) T protein:vir:24 142 -PT------YIGQ------------TT----KAISI----ADTTGATTVYDQVAVNGLSL-LVNDGK--KWTHTLLDDIT 191 (318) T ss_pred -Cc------cccc------------cc----ccccc----cccccccchHHHHHHHHHHh-hccccC--CCCEEEEcHHH Confidence 11 1000 00 00000 00111112333344555543 344343 34588999887 Q ss_pred HHHHHHHHHhhcccc------chhhHHHHHHhhhhhcccccccCCccCCCcE--EEecCCCcEEEEeeCcEEEEEEEccc Q lcl|Aclame:pro 241 LADKYFPLVNKQQEN------SESLAADIIISQKRIGNLPAVRVPYFPANAV--LVTTLENLSIYFMDESHRRSIDENPK 312 (355) Q Consensus 241 l~~k~~~l~n~~~~~------te~~aa~~~~~~k~iGGlpa~~~PffP~~~i--lIT~l~NLsIY~Q~gs~RR~~~d~p~ 312 (355) ... ...+-+....+ ...-.. .....++-|+|++..|..|++.. ++-.++.+ +|...+..+=.+.++.. T Consensus 192 ~~~-L~~lkd~~G~~l~~~~~~~~~~~--~~~~~~i~g~pv~~~~~~~~~~~~~~~gdfs~~-~~~~~~~l~i~~~~~~~ 267 (318) T protein:vir:24 192 EPI-LNGAKDQNGRPLFIESTYGEAAS--PFRSGRIVARPTILSDHVVEGTTVGFMGDFSQL-IWGQIGGLSFDVTDQAT 267 (318) T ss_pred HHH-HHHhhccCCceeecCccccCccc--cccCceEEEEeeEEeCCCCCCccEEEEeecceE-EEEEecCeEEEEeeccc Confidence 652 22332221111 111111 11235788999999999998864 55677776 45555554443333322 Q ss_pred --------hhhhhhhhhh--------hhhhhccccccEEEEecceecCccC Q lcl|Aclame:pro 313 --------KDRVENYESM--------NIDYVVEVYAAGCLLENITLGDFTA 347 (355) Q Consensus 313 --------r~rve~y~s~--------Ne~YvVEd~~~~a~ienI~~~~~~~ 347 (355) -..+..|++- --++.|.+.++++.|.++.-+...- T Consensus 268 ~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~a~~~~~ 318 (318) T protein:vir:24 268 LNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVALTNVVSGGGEG 318 (318) T ss_pred eeccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeeccCCCCC Confidence 1111222211 1244566666666655544332211 No 100 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=97.38 E-value=2.5e-05 Score=45.85 Aligned_cols=297 Identities=12% Similarity=0.021 Sum_probs=151.3 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt 80 (355) ++++.|..|++++.. +.+ ....|.|-+++..++.+.+.+.|.++++++++++.- +.++-...+++-|+=+ T Consensus 67 lt~ee~~~~~~~~~~-----~~~---~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~--~~~i~~~~~~~~a~wv 136 (377) T protein:vir:96 67 LTAEEIKFFNDIDKN-----VGG---KDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL--RLKALTAETSGTAVWG 136 (377) T ss_pred cCHHHHHHHHHHHhc-----CCC---CCCceecCHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEecCCcceeEe Confidence 677777777665432 111 234677877889999999999999999999988742 2344444444433322 Q ss_pred cCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhh Q lcl|Aclame:pro 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) Q Consensus 81 ~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~a 160 (355) . ...++........+...+.+++.---..|+++.|+.=. .+++..+++.+.++++.=.-.--+||+=.. T Consensus 137 ~--e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~~ll~ds~--~~le~~i~~~l~~~~~~~~~~a~i~G~G~~------- 205 (377) T protein:vir:96 137 D--IFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGP--KWLKQFITEQLKEAIAVALELAIVKGNGLL------- 205 (377) T ss_pred e--cccccccccCccceeEeeeeeeEEeechhhHHHhhcch--hhHHHHHHHHHHHHHHHHHhhceEeccCCC------- Confidence 1 11233333334566778888888888899999997522 368888999999988875555566773311 Q ss_pred hhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCC--CcchhhHHHHHHHHHhccc-c---hhhhCCCCeEE Q lcl|Aclame:pro 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGK--NGDYENIDALVMDATNNLI-D---EVYQDDPNLVA 234 (355) Q Consensus 161 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~--ggdy~nLDaLv~d~~~~li-d---~~~~~~~~LVv 234 (355) - -+|+|...... -+-.......+.....+...|+ ..+..++.-++++++..+- + ...+..+..|+ T Consensus 206 ~------P~Gil~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~ 276 (377) T protein:vir:96 206 Q------PVGLLKDLSQP---TVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKL 276 (377) T ss_pred c------ceeeeeccccc---cccccccccccceeeccccccccccCChhHHHHHHHHHHHhhccccccccccccCceEE Confidence 1 23555432110 0000000011111111111221 1223333334444332210 0 01122457889 Q ss_pred EEcHHHHHHHH--HHHHhhccccchhhHHHHHHhhhhhcccc--cccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEEc Q lcl|Aclame:pro 235 IVGRKLLADKY--FPLVNKQQENSESLAADIIISQKRIGNLP--AVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDEN 310 (355) Q Consensus 235 ivG~dLl~~k~--~~l~n~~~~~te~~aa~~~~~~k~iGGlp--a~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~ 310 (355) +|-+.-..+-. ....+. + ++- .++.|+| .+.-+++|++.++.-.+++--| ..++.+|=.. T Consensus 277 ~mn~~t~~~~~~~~~~~~~-~-------G~~----~~~l~~p~~v~~s~~~p~~~i~fgdf~~Y~i-~~r~~~~i~~--- 340 (377) T protein:vir:96 277 LLNPEDRWTLEAKFTSRNQ-F-------GEY----VTVLPHGITILESLAVETGKAIAFVANRYDA-FMATASTIEE--- 340 (377) T ss_pred EEchhhHHhccccccccCC-C-------CCc----eeccCCCceEEecCCCCcccEEEEEcCcEEE-EEecccEEEe--- Confidence 98876433211 001111 1 111 1344444 6778999999999999988433 3333333211 Q ss_pred cchhhhhhhhhhhhhhhccccccEEEEe--cceecCccC-C-CCcCCC Q lcl|Aclame:pro 311 PKKDRVENYESMNIDYVVEVYAAGCLLE--NITLGDFTA-P-AAPESG 354 (355) Q Consensus 311 p~r~rve~y~s~Ne~YvVEd~~~~a~ie--nI~~~~~~~-~-~~~~~~ 354 (355) +.+.|..+|.-.+-++. +-...+..+ . -.-..| T Consensus 341 -----------~~~~~~~~d~~~f~~~~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 341 -----------YDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred -----------ehhhhhhcCCeEEEEEEEEcCEEecCCcEEEEEEecC Confidence 12346666666555554 111111111 1 111111 No 101 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=97.33 E-value=9e-05 Score=42.77 Aligned_cols=299 Identities=10% Similarity=0.076 Sum_probs=143.7 Q ss_pred CC--HHHHHHHHHHHHHHH---------------------HHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCc-Cc Q lcl|Aclame:pro 1 MR--PETRFKFNAYLTRVA---------------------ELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKT-IN 56 (355) Q Consensus 1 M~--~~tr~~f~~y~~~~A---------------------~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~-IN 56 (355) +. ......|..+...++ ....+........+.|-......+.+.+++++.+++. .+ T Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~liP~~~~~~ii~~l~~~~~l~~~~~~ 162 (428) T protein:vir:10 83 AEPKQYTGAGMTRMVMSIAAAQGNLQDAAKFASDELNDQSVSMAISTAAGSGGVLIPQNIHSEVIELLRDRTIVRKLGAR 162 (428) T ss_pred cccchhhhHHHHHHHHHHHHhhhhHHHHHHHhhhhhhhhhHhhhhcccccCCccccchhHHHHHHHHHhhhchhhhhcce Confidence 00 000001111111100 0000110000113445556677899999999988776 45 Q ss_pred cccchhhhhh-hhcccccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHH Q lcl|Aclame:pro 57 ILPVAEMKGE-KIGVGVTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIV 135 (355) Q Consensus 57 v~~V~e~~Ge-~v~lgv~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~ 135 (355) +++.. +|. ++-.-.+++-++-+.-+ ......+ ..++...|.-++.---+.|+.+.|+. ..++|+..+.+.+. T Consensus 163 ~~~~~--~g~~~~p~~~~~~~a~~v~Eg--~~~~~~~-~~f~~i~~~~~k~~~~v~is~ell~d--s~~~l~~~i~~~l~ 235 (428) T protein:vir:10 163 SIPLP--NGNMSLPRLAGGATASYTGEN--QDAKVSE-ARFDDVKLTAKTMIAMVPISNALIGR--AGFNVEQLVLQDIL 235 (428) T ss_pred eeecC--CcceEEEEEeCCcceeeeccC--ccccccc-cceeeEEeeeEEEEEeehhhHHHHhh--hhHHHHHHHHHHHH Confidence 55443 232 11111223333333222 2222222 33556667777776778899998874 14689999999999 Q ss_pred HHhhhhHHHHhhcccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHH Q lcl|Aclame:pro 136 KRQALDLIMAGFNGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVM 215 (355) Q Consensus 136 ~~~alD~i~IGfnG~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~ 215 (355) ++++.-+-.--+||.-.. .+|.+ +++.......... ...+...++..+|.++. T Consensus 236 ~ai~~~~d~~~l~G~G~~------~~p~G------------------i~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~ 288 (428) T protein:vir:10 236 TAISVREDKAFMRDDGTG------DTPIG------------------MKARATQWNRLLP---WAADAAVNLDTIDTYLD 288 (428) T ss_pred HHHHHHHHHHHhccCCCC------ccccc------------------ccccccccccccc---ccccccccHHHHHHHHH Confidence 999876666677884311 22321 2211111110000 11224455555554332 Q ss_pred HHHh-cccchhhhCCCCeEEEEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCc--------E Q lcl|Aclame:pro 216 DATN-NLIDEVYQDDPNLVAIVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANA--------V 286 (355) Q Consensus 216 d~~~-~lid~~~~~~~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~--------i 286 (355) -+.. ..... ......+++|...... .+..+-..++ .-+--. ..+.+|.|+|++..+++|++. + T Consensus 289 ~~~~~~~~~~--~~~~~~~~v~n~~~~~--~L~~lkd~~G--~~i~~~--~~~g~l~G~pv~~~~~~p~~~~~~~~~~~i 360 (428) T protein:vir:10 289 SIILMSMDGN--SNMISSGWGMSNRTYM--KLFGLRDGNG--NKVYPE--MAQGMLKGYPIQRTSAIPANLGEGGKESEI 360 (428) T ss_pred HHHHhhhccc--cccccCEEEEcHHHHH--HHHHhhccCC--ceeccC--CCCCeeeceeeEEeccccccccCCCccceE Confidence 2211 01111 1223468899887664 2333321111 111101 124579999999999999863 6 Q ss_pred EEecCCCcEEEEeeCcEEEEEEEccch----hhhhhhhhhhh---------hhhccccccEEEEeccee Q lcl|Aclame:pro 287 LVTTLENLSIYFMDESHRRSIDENPKK----DRVENYESMNI---------DYVVEVYAAGCLLENITL 342 (355) Q Consensus 287 lIT~l~NLsIY~Q~gs~RR~~~d~p~r----~rve~y~s~Ne---------~YvVEd~~~~a~ienI~~ 342 (355) ++-.++++-| ..++..+-..-++... ..+.+++..|. ++.|=+.++++.+.+|++ T Consensus 361 ~~gd~s~~~i-~~~~~i~i~~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 361 YFADFNDVVI-GEDGNMKVDFSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred EEEecceEEE-EEecceEEEeecccccccccccccchhhcchhheeeeeeeCceeeccceEEEEeccCC Confidence 6666766533 3445554433222211 12224444443 335556677777788888 No 102 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=97.32 E-value=4.4e-05 Score=44.49 Aligned_cols=277 Identities=12% Similarity=0.057 Sum_probs=134.5 Q ss_pred CCHHH-------HHHHHHHHHHHH--------HHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhh Q lcl|Aclame:pro 1 MRPET-------RFKFNAYLTRVA--------ELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKG 65 (355) Q Consensus 1 M~~~t-------r~~f~~y~~~~A--------~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~G 65 (355) +.... +.....+....+ ...++.. ....+.|-+.....+++.+.+.+.+++.+++++|+...| T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 178 (400) T protein:vir:38 101 TRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNAGVKA--ADAASTIPETISNTPQRELQTVVDLKPFTNVFQASTQKG 178 (400) T ss_pred hHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhcccc--cCCcccccHHHHHHHHHHHHhhhhhhhcceeEeccCcce Confidence 00000 000011111000 0112221 122455666778889999999999999999999988777 Q ss_pred hhhcccccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHH Q lcl|Aclame:pro 66 EKIGVGVTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMA 145 (355) Q Consensus 66 e~v~lgv~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~I 145 (355) ....+..+++.++-+..+ ..........++...+.+++.--=+.|+.+.|+. ..++|+..+.+.+.++++.=.-.- T Consensus 179 ~~~~~~~~~~~~~~~~E~--~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~d--s~~~~~~~i~~~l~~~~~~~~~~~ 254 (400) T protein:vir:38 179 TYPTVANATTKMVTVAEL--EKNPAMAKPEFKPVNWSVETYRQALPVSQESIDD--SAIDLVGLIAQNGQQIKVNTTNGA 254 (400) T ss_pred EEEEEecCCCcccccccc--ccccccccccceeeEeehhheeeehhhHHHHHhh--hHHHHHHHHHHHHHHHHHHHHHHh Confidence 655443333323222111 1221122233445555555555556777777763 134788888888888776544333 Q ss_pred hhcccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchh Q lcl|Aclame:pro 146 GFNGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEV 225 (355) Q Consensus 146 GfnG~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~ 225 (355) .++|+... . ...-.+.|.+ .+++...+++. T Consensus 255 i~~~~~~~---------------------------------------~----------~~~~~~~~~~-~~~~~~~~~~~ 284 (400) T protein:vir:38 255 VATLLKGF---------------------------------------T----------AKTISSVDDL-KHINNVDLDPA 284 (400) T ss_pred hhhccccc---------------------------------------c----------ccccccHHHH-HHHHHhhhhhh Confidence 33442210 0 0001134433 34566666665 Q ss_pred hhCCCCeEEEEcHHHHHHHHHHHHhhccccchhhHHHHH-HhhhhhcccccccCCccCCCc-----EEEecCCCcEEEEe Q lcl|Aclame:pro 226 YQDDPNLVAIVGRKLLADKYFPLVNKQQENSESLAADII-ISQKRIGNLPAVRVPYFPANA-----VLVTTLENLSIYFM 299 (355) Q Consensus 226 ~~~~~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~-~~~k~iGGlpa~~~PffP~~~-----ilIT~l~NLsIY~Q 299 (355) + .-+++|.+..+.. +..+-..++.. ....++. ....++-|+|++..+++|... +++=.|++..+.+- T Consensus 285 ~----~a~~v~~~~~~~~--l~~lkd~~G~~-i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~g~~~~~~gd~s~~~~~~~ 357 (400) T protein:vir:38 285 Y----SRVIIASQSFYNF--LDTVKDGNGRY-LLQDSILTPSGKSVLGMPIAVVSDDTLGAAGEAHAFLGDIKRAILFAN 357 (400) T ss_pred h----CcEEEEcHHHHHH--HHHhhccCCCe-eeecCcCCCCccccccceeEEecccccCCCCceEEEEEeccccEEEEe Confidence 4 2488998887652 43332222110 0000110 123579999999999999654 67778888655554 Q ss_pred eCcEEEEEEEccchhhhhhhhhhhhhhhccccccEEEEe--cceecCccCCC Q lcl|Aclame:pro 300 DESHRRSIDENPKKDRVENYESMNIDYVVEVYAAGCLLE--NITLGDFTAPA 349 (355) Q Consensus 300 ~gs~RR~~~d~p~r~rve~y~s~Ne~YvVEd~~~~a~ie--nI~~~~~~~~~ 349 (355) +....-...++... +.+|.+.-+-.+..+. .|.+....+.| T Consensus 358 ~~~~~~~~~~~~~~---------~~~~~~~~r~d~~~~~~~a~~~l~~~~~a 400 (400) T protein:vir:38 358 RADFMVRWVDDQIY---------GQFLQAGMRFGVSVADEKAGYFLTYTPKA 400 (400) T ss_pred ecceEEEEeccccc---------ceeEEEEEEeccEEecccceEEEEeecCC Confidence 44444333332211 1233332222222221 22222211111 No 103 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=97.31 E-value=3.5e-05 Score=45.02 Aligned_cols=289 Identities=10% Similarity=0.040 Sum_probs=149.7 Q ss_pred HHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccccCCCCcCcccccccc Q lcl|Aclame:pro 16 VAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTTDTSGDKERQTADFTA 95 (355) Q Consensus 16 ~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt~T~~~~~r~~~~~~~ 95 (355) +| . -.+..|.|-+...+.+.+.++++|..+++.+++++.--. .++-.-.+++-|+-+. .+.... ..-.. T Consensus 1 ma----t---~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~-~~~p~~~~~~~a~wv~--Eg~~~~-~~~~~ 69 (311) T protein:vir:81 1 MV----A---LATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVG--EGAQKS-ESTAT 69 (311) T ss_pred Cc----e---ecCCceEcchhHHHHHHHHHHhcchhhhhcceeecCCCc-eEEEEEeCCceeEEee--cCcccc-cccce Confidence 11 1 012368888888999999999999999999998864321 2222223445554333 222333 33345 Q ss_pred ccCcceeEEeeeecceeCHHHHHhhc-ccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhhhhhhhccchhHHHH Q lcl|Aclame:pro 96 LESSKYECNQINFDFHLKYKTLDLWA-RFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTKNTLLQDVAVGWLQK 174 (355) Q Consensus 96 l~~~~Y~c~qTn~d~~i~y~~LD~WA-~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~anPllqDVNkGWlq~ 174 (355) ++..++.+++.--.+.|+.+.|..+. ...+|++.+.+.+.++++.-.-.-.+||+.....+.+. |.+.. T Consensus 70 f~~v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~----------gi~~~ 139 (311) T protein:vir:81 70 FAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALS----------GSPAK 139 (311) T ss_pred eeEEEEeeEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccc----------ccccc Confidence 78888999999888999999997765 45689999999999999999888899996422222111 11111 Q ss_pred HHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHHHHHHHHHHHhhccc Q lcl|Aclame:pro 175 YRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKLLADKYFPLVNKQQE 254 (355) Q Consensus 175 ~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dLl~~k~~~l~n~~~~ 254 (355) + ... .+ .+.. ..++-.++|.++.+++. ++... .-++. .++|.+..+.. -..|-..... T Consensus 140 ~------------~~~-~~----~~~~-~~~~~~~~~~~i~~~~~-~~~~~-~~~~~-~~vmn~~~~~~-l~~lkd~~G~ 197 (311) T protein:vir:81 140 I------------LDT-TN----IVEL-TTGTSATPDLAVEAAVG-LVLGD-NLSPD-GVALDNTFSFM-LATQRDSQGR 197 (311) T ss_pred c------------ccc-ce----eeee-cccccchHHHHHHHHHH-Hhhhc-CCCce-EEEEcHHHHHH-HHhhhccCCC Confidence 0 000 00 0111 12222344544554543 33332 22333 47777776642 2222222211 Q ss_pred cchhhHHHHHHhhhhhcccccccCCccCCCcE------------------EEecCCCcEEEEeeCcEEEEEEEccchhhh Q lcl|Aclame:pro 255 NSESLAADIIISQKRIGNLPAVRVPYFPANAV------------------LVTTLENLSIYFMDESHRRSIDENPKKDRV 316 (355) Q Consensus 255 ~te~~aa~~~~~~k~iGGlpa~~~PffP~~~i------------------lIT~l~NLsIY~Q~gs~RR~~~d~p~r~rv 316 (355) + .-.....-....++-|+|++..-++|++.. ++=.++++- |-.++..+=.+.++.+-+.. T Consensus 198 ~-l~~~~~~~~~~~tl~G~Pv~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~~~-i~~~~~~~~~~~~~~~~~~~ 275 (311) T protein:vir:81 198 K-LYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFR-WGVQVSIPLELIEFGDPDGL 275 (311) T ss_pred e-eecCccccCCCceecceeEEecccccccccccccccchhcccCCccEEEEEecccEE-EEEeccceEEEeccCCCCcc Confidence 1 100011112346888999999888887653 333444432 21222222233333222333 Q ss_pred hhhhhhh-hhhhc-cccccEEEE-ecceecCccCCC Q lcl|Aclame:pro 317 ENYESMN-IDYVV-EVYAAGCLL-ENITLGDFTAPA 349 (355) Q Consensus 317 e~y~s~N-e~YvV-Ed~~~~a~i-enI~~~~~~~~~ 349 (355) .+|..+| -+|-+ +.++....- +.|........+ T Consensus 276 ~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a~~~ 311 (311) T protein:vir:81 276 GDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) T ss_pred hhhhhcCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 3445555 34433 333322222 233332211111 No 104 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=97.25 E-value=2.4e-05 Score=45.93 Aligned_cols=305 Identities=12% Similarity=0.023 Sum_probs=139.0 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt 80 (355) ++.+-|+.|+++.. . .+ ....|.|-++...++.+.+.+.|.+++.++++++.- +.++....+++.|+=+ T Consensus 65 l~~~e~~~~~~~~~----~--t~---~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~~~~~--~~~i~~~~~~~~a~W~ 133 (381) T protein:vir:10 65 LSANQRNFFMDINK----S--VG---YKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWG 133 (381) T ss_pred cCHHHHHHHHHHhh----c--CC---CCCceecCHHHHHHHHHHHHhhcceeeeeeeEecCc--ceEEEeecCCcceEEe Confidence 44444444443221 1 11 223578888899999999999999999999988742 2344444444433221 Q ss_pred cCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhh Q lcl|Aclame:pro 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) Q Consensus 81 ~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~a 160 (355) . ...++.......++...+.+++.---..|+.+.|+...- +++..+++.+.+++|.=.-.-=.||+- +. T Consensus 134 ~--e~~~~~~~~~~~f~~i~l~~~kl~a~i~is~elL~Ds~~--~le~~i~~~la~~~a~~~~~afi~GdG-------~~ 202 (381) T protein:vir:10 134 K--IYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPA--WIERFVRVQIEEAFAVALETAFLKGTG-------KD 202 (381) T ss_pred e--cccccccccCccceeEeecceeEEeeccccHHHHhccHH--HHHHHHHHHHHHHHHHHhhceeEeccc-------CC Confidence 1 112222222234556667777777778899999998643 577788888888776533333335632 11 Q ss_pred hhhhhccchhHHHHHHhhccccccccccccCCccccceeee-CCCcchhhHHHHHHHHHhcccchh--hhCCCCeEEEEc Q lcl|Aclame:pro 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRV-GKNGDYENIDALVMDATNNLIDEV--YQDDPNLVAIVG 237 (355) Q Consensus 161 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~-G~ggdy~nLDaLv~d~~~~lid~~--~~~~~~LVvivG 237 (355) -| +|+|..+ ++...-..+.... +.....++. .....|..|.+++..+. +..-. .......+++|. T Consensus 203 qP------~Gil~~~---~~~~~~~~g~~~~-~~~~~~~t~~~~~~~~~~l~~~~~~~~--~~~~~~~~~~~~~~~~vmn 270 (381) T protein:vir:10 203 QP------IGLNRQV---QKGVSVTDGAYPE-KEEQGTLTFANPRATVNELTQVFKYHS--TNEKGKSVAVKGNVTMVVN 270 (381) T ss_pred Cc------eeeeecC---Ccccccccccccc-ccccccccccchhhHHHHHHHHHHhhh--hhhccccccccCceEEEEc Confidence 22 4665321 1111111110000 000001110 01122344444333221 11110 112346678887 Q ss_pred HHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEEccchhhhh Q lcl|Aclame:pro 238 RKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPKKDRVE 317 (355) Q Consensus 238 ~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~rve 317 (355) +.-.. +-.++..-.+.. ++-+. ..--|.|.+..|+||++.|+.-.+++--|.- ++.+|=..-+ + T Consensus 271 ~~t~~-~l~~~~~~~~~~-----G~~v~--~lp~g~~vv~~~~~p~~~i~fGDfs~Y~i~~-r~~~~i~~~~--~----- 334 (381) T protein:vir:10 271 PSDAF-EVQAQYTHLNAN-----GVYVT--ALPFNLNVIESTVQEAGKVLTYVKGLYDGYL-AGGINVQKFK--E----- 334 (381) T ss_pred hhhHH-hhccccccCCCC-----Cceee--cCCCCceeEEcCCCCcCcEEEEEcccEEEEE-ecccEEEeec--h----- Confidence 66333 112221111110 11110 0012677888999999999999998865543 3333321111 1 Q ss_pred hhhhh-hhhhhccccccEEEEe-------cceecC-ccCCCCcCCCC Q lcl|Aclame:pro 318 NYESM-NIDYVVEVYAAGCLLE-------NITLGD-FTAPAAPESGA 355 (355) Q Consensus 318 ~y~s~-Ne~YvVEd~~~~a~ie-------nI~~~~-~~~~~~~~~~a 355 (355) .|... -.+|..--+--+..++ .|++.+ +|+...++--- T Consensus 335 ~~~~~d~~~f~a~~r~dG~~~~~~A~~v~~l~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:10 335 TLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPALEDTEETL 381 (381) T ss_pred hhhhcCceEEEEEEEEcCEEecCCcEEEEEEeecCCccccccccccC Confidence 11111 1233222222222221 133322 11111111111 No 105 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=97.20 E-value=0.00013 Score=41.81 Aligned_cols=311 Identities=14% Similarity=0.101 Sum_probs=144.7 Q ss_pred CCHHHH-----HHHHHHHH----------------HHHHHhCCChHHcceeeecCcHH-HHHHHHHHHhhHHHhCcCccc Q lcl|Aclame:pro 1 MRPETR-----FKFNAYLT----------------RVAELNNISTDDVSKKFTVEPSV-TQTLMNTVQASSAFLKTINIL 58 (355) Q Consensus 1 M~~~tr-----~~f~~y~~----------------~~A~~ngv~~~~v~~~Fsv~P~~-~q~L~~~iqess~FL~~INv~ 58 (355) |.+..+ ..+..... ...+.-.+........+.|-|.. ...+++.+++++.+++.+.++ T Consensus 115 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~ 194 (477) T protein:vir:84 115 LAMQTVGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYRDLDRNGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTE 194 (477) T ss_pred HHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhccccccCCCcceeeccchhHHHHHHHhhhcchHHHhhcee Confidence 000000 00000000 00000011111122345566663 678999999999999999999 Q ss_pred cchhhhhhhhccc-ccccccc-cccCCC--CcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHH Q lcl|Aclame:pro 59 PVAEMKGEKIGVG-VTGTIAS-TTDTSG--DKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAI 134 (355) Q Consensus 59 ~V~e~~Ge~v~lg-v~~~ia~-Rt~T~~--~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i 134 (355) +++...|..-..- .+|+..+ -+.-+. .....|.....++...+.+++.---+.|+.+.|+..+ ++++..+++.+ T Consensus 195 ~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l 272 (477) T protein:vir:84 195 PLPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLTDGFVQANVKTIAGQQGIAIQLLDQAA--VSVDEFVFRDL 272 (477) T ss_pred eecCCcceeEEEEEecCcceeeeeccCcccccccccccccceeeEEEeeeeEEeeeHHHHHHHhccc--hhHHHHHHHHH Confidence 9888777532111 1222211 111110 0111222233466677888888888889999998765 58999999999 Q ss_pred HHHhhhhHHHHhhcccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHH Q lcl|Aclame:pro 135 VKRQALDLIMAGFNGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALV 214 (355) Q Consensus 135 ~~~~alD~i~IGfnG~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv 214 (355) .+.++.=.-.--++|+-.+ .+|. |.+.. .. +......+.+.++..+|.+. T Consensus 273 ~~~~~~~~d~~~l~G~Gt~------~~p~------Gi~~~----~~--------------~~~~~~~~~~~t~~~~~~~~ 322 (477) T protein:vir:84 273 AADYANKLNVQVISGTGSN------NQVV------GVRAT----AG--------------ITQVTATSAGSALEKHQIIY 322 (477) T ss_pred HHHHHHHHHHHHhccCCCC------Cccc------eeeec----cc--------------cccccccccccchhhHHHHH Confidence 9998866666677884211 1332 33321 00 00001123455677887765 Q ss_pred HH---HHhcccchhhhCCCCeEEEEcHHHHHHHHHHHHhhccccc-----h------hhHHHHH-HhhhhhcccccccCC Q lcl|Aclame:pro 215 MD---ATNNLIDEVYQDDPNLVAIVGRKLLADKYFPLVNKQQENS-----E------SLAADII-ISQKRIGNLPAVRVP 279 (355) Q Consensus 215 ~d---~~~~lid~~~~~~~~LVvivG~dLl~~k~~~l~n~~~~~t-----e------~~aa~~~-~~~k~iGGlpa~~~P 279 (355) .+ ++.. +++-++..+.. +++....++ ....|......|- . .+.+... ....++-|+|++..+ T Consensus 323 ~~i~~~~~~-~~~~~~~~~~~-~v~~~~~~~-~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~pVv~s~ 399 (477) T protein:vir:84 323 QKIADAIQR-VHTSRFLEPEV-IVMHPRRWA-SFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHGLPVVTDP 399 (477) T ss_pred HHHHHHHhh-ccccccCCccE-EEEcHHHHH-HHHHhhccCCCeeeecCcccccccccccccccccccchhcccceEecC Confidence 43 3332 34444544444 455544333 1222322221110 0 0011111 113478899999999 Q ss_pred ccCCC--------cEEEecCCCcEEEEeeCcEEEEEEEccchhhhhhhhhhhhhhhccccccEEEE---ecceecCccCC Q lcl|Aclame:pro 280 YFPAN--------AVLVTTLENLSIYFMDESHRRSIDENPKKDRVENYESMNIDYVVEVYAAGCLL---ENITLGDFTAP 348 (355) Q Consensus 280 ffP~~--------~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~rve~y~s~Ne~YvVEd~~~~a~i---enI~~~~~~~~ 348 (355) ++|++ .+++-.++.+-| ++ ++++-. ..++ .+.++ ....|.|.-|-.+.++ +.+.+....+. T Consensus 400 ~~p~~~~~~~d~~~i~~gd~~~~~i-~~-~~~~~~--~~~~--~~~~~--~~~~~~v~~~~~~~~~r~~~afv~~t~~~~ 471 (477) T protein:vir:84 400 TLPTTLGTGTDQDVIHVLRASDLAL-FE-SSVRMR--ALQE--TRAEN--LSVLLQVYGYLAFTAARFPQSVVEIGGTAL 471 (477) T ss_pred cccccccccCCcceEEEEEeceEEE-Ee-eceeEE--eccc--ccccc--ceeeeeehhhhhhhhhccccceEEeecccc Confidence 99975 467777776522 33 333322 2221 11111 1112222221111111 23333222222 Q ss_pred CCcCCC Q lcl|Aclame:pro 349 AAPESG 354 (355) Q Consensus 349 ~~~~~~ 354 (355) .+|-=+ T Consensus 472 ~~~~~~ 477 (477) T protein:vir:84 472 TAPTFA 477 (477) T ss_pred cccccC Confidence 222222 No 106 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=97.17 E-value=0.00013 Score=41.81 Aligned_cols=303 Identities=15% Similarity=0.121 Sum_probs=138.9 Q ss_pred CC-HH-HHHHHHHHHHHHHHHhC---------CC---hH--HcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhh Q lcl|Aclame:pro 1 MR-PE-TRFKFNAYLTRVAELNN---------IS---TD--DVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMK 64 (355) Q Consensus 1 M~-~~-tr~~f~~y~~~~A~~ng---------v~---~~--~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~ 64 (355) +. +. ....+..+...+.+... +. .. .-...+.|-+.+.+.+.+.+++++.+++.++++++.--. T Consensus 126 ~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 205 (458) T protein:vir:10 126 TQENFEDEVEKLVLLSYVMEKGVFETEHGQRHLKAVNQSSSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSKI 205 (458) T ss_pred hhhhHHHHHHHHHHHHHHHhhccchhhhhhhhhhhhhhcccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCcc Confidence 11 00 00111112221111100 00 00 001345677788999999999999999999998875322 Q ss_pred hhhhcccccccccccccCCC-CcCccc--cccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhh Q lcl|Aclame:pro 65 GEKIGVGVTGTIASTTDTSG-DKERQT--ADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALD 141 (355) Q Consensus 65 Ge~v~lgv~~~ia~Rt~T~~-~~~r~~--~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD 141 (355) .. +..-..++-|+-+.-+. .++-.. .....++...+..++.--.+.|+.+.|+... ++|+..+.+.+.+.++.- T Consensus 206 ~~-~~~~~~~~~a~~v~e~~~~~~~~~~~~~~~~~~~i~~~~~k~~~~v~is~ell~ds~--~~~~~~i~~~l~~~i~~~ 282 (458) T protein:vir:10 206 LT-MLVEPDAGKATWVAASTYGTDTTTGEEVKGALKEIHFSTYKLAAKSFITDETEEDAI--FSLLPLLRKRLIEAHAVS 282 (458) T ss_pred eE-EEEecCCcceeecccccccccccccccccccceeeEeeeeeEEeeehhhHHHHhcch--HHHHHHHHHHHHHHHHHH Confidence 11 11112222221111111 111100 0111245556777777777899999887644 689999999999998866 Q ss_pred HHHHhhcccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcc Q lcl|Aclame:pro 142 LIMAGFNGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNL 221 (355) Q Consensus 142 ~i~IGfnG~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~l 221 (355) .-.--+||+- +..| +|.+. .....++..+.. . -+...+-.+.|.|+ +++..+ T Consensus 283 ~d~~~l~G~G-------~~~p------~Gi~~------------~~~~~~~~~~~~-~-~~~~~~~~~~~~i~-~~~~~l 334 (458) T protein:vir:10 283 IEEAFMTGDG-------SGKP------KGLLT------------LASEDSAKVVTE-A-KADGSVLVTAKTIS-KLRRKL 334 (458) T ss_pred HHHHhhcCCC-------CCcc------ceeee------------cccccccceeec-c-cccccccccHHHHH-HHHHhh Confidence 6666678842 1122 23322 222222221111 0 11112223344433 455544 Q ss_pred cchhhhCCCCeEEEEcHHHHHHHHHHHHhhccc-cch--hhH-HHHHHhhhhhcccccccCCccCCCc----EEEecCCC Q lcl|Aclame:pro 222 IDEVYQDDPNLVAIVGRKLLADKYFPLVNKQQE-NSE--SLA-ADIIISQKRIGNLPAVRVPYFPANA----VLVTTLEN 293 (355) Q Consensus 222 id~~~~~~~~LVvivG~dLl~~k~~~l~n~~~~-~te--~~a-a~~~~~~k~iGGlpa~~~PffP~~~----ilIT~l~N 293 (355) ++.|+. .-+++|.+..+. ++..+-..++ +-- ... ........++-|+|++...++|+.+ +++=.+.+ T Consensus 335 -~~~~~~--~~~~v~~~~~~~--~l~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~~~~~~~f~~ 409 (458) T protein:vir:10 335 -GRHGLK--LSKLVLIVSMDA--YYDLLEDEEWQDVAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAKANSAEFAVIVYKD 409 (458) T ss_pred -hhhhcC--CCEEEEcHHHHH--HHHhhcccCCceeeccccccccccCcCceecceeeEEccccccccCCcceEEEEecc Confidence 565654 457888887765 3333322221 110 011 1111223478899999999999863 55555543 Q ss_pred cEEEEeeCcEEEEEEEccchhhhhhhhhhh-hhhhcc-ccccEEEE-ecceecCccCC Q lcl|Aclame:pro 294 LSIYFMDESHRRSIDENPKKDRVENYESMN-IDYVVE-VYAAGCLL-ENITLGDFTAP 348 (355) Q Consensus 294 LsIY~Q~gs~RR~~~d~p~r~rve~y~s~N-e~YvVE-d~~~~a~i-enI~~~~~~~~ 348 (355) --..+.++..+- . + ++|-..| .+|..| ..|..+.. +.+..+..++. T Consensus 410 ~~~~~~~~~~~v--~----~---d~~~~~~~~~~~~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 410 NFVMPRQRAVTV--E----R---ERQAGKQRDAYYVTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred cEEEEEeeceEE--E----e---ecccCCCceEEEEEEEecceEecccceEEEeeccC Confidence 222233333332 1 1 1232222 222222 22222222 23333222222 No 107 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=97.06 E-value=0.00019 Score=41.02 Aligned_cols=285 Identities=9% Similarity=0.061 Sum_probs=134.7 Q ss_pred CCHHHHHHHHHHHHHH----HHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRV----AELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTI 76 (355) Q Consensus 1 M~~~tr~~f~~y~~~~----A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~i 76 (355) .+..-+..|..++... ....++.. ....+.|-+.+...+++.+++.+.++++++++++..-.+...-. ..++. T Consensus 92 ~~~~~~~~~~~~~~~~~~~~~~ra~~t~--~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~-~~~~~ 168 (421) T protein:vir:13 92 KRSLQLSAMSKTIRGIQLSEEERDIMSS--TNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVIPVNRNAGKMPVR-AGASV 168 (421) T ss_pred HHHHHHHHHHHhhhccchhHHHhhcccc--CCcceecchhhHHHHHHHHHhhhhhhhhceeeeccCCceEEEEe-ecCCc Confidence 1122222333333211 11122221 12356676667788999999999999999999988766654322 11222 Q ss_pred cccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCC Q lcl|Aclame:pro 77 ASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTS 156 (355) Q Consensus 77 a~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~T 156 (355) ++=...+.+.. .+.....++...+..++.---..|+.+.|+. + .++|+..+++.+.+++++ -.||. T Consensus 169 ~~~~~~~E~~~-~~~s~~~f~~i~~~~~k~~~~v~iS~ell~d-s-~~~l~~~i~~~la~~~~~-----~~~~~------ 234 (421) T protein:vir:13 169 DKLANLAKDTE-LVKAMLKTQPMAYDIDDYGLLAPIDNSLLED-S-EINFLEFVNEEFAEFAVN-----TENAE------ 234 (421) T ss_pred cceeecccccc-ccccccceeEEEeeeeeeEeehhhhHHHHhh-h-HHHHHHHHHHHHHHHHHH-----Hhhhh------ Confidence 11111111111 1222223444555555555556677777764 2 247888888888877653 11221 Q ss_pred ChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEE Q lcl|Aclame:pro 157 DRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIV 236 (355) Q Consensus 157 d~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVviv 236 (355) .. | +-+|. ++ .....+|..| .+++..+ .+.|+. .-+++| T Consensus 235 -i~-~-----~~~g~------------~~---------------~~~~~~~d~i----~~~~~~l-~~~~~~--~a~~v~ 273 (421) T protein:vir:13 235 -IV-K-----QAKAV------------LA---------------EETINDYAGL----VKTINSL-VPNARK--RAIIVT 273 (421) T ss_pred -Hh-h-----hhhhc------------cc---------------cccccchHHH----HHHHHHh-hhhhcC--CCEEEE Confidence 00 0 00111 10 0011234333 3456554 444554 347888 Q ss_pred cHHHHHHHHHHHH-hhccccchhhHHHHH-HhhhhhcccccccCCccCCCc-----EEEecCCCcEEEEeeCcEEEEEEE Q lcl|Aclame:pro 237 GRKLLADKYFPLV-NKQQENSESLAADII-ISQKRIGNLPAVRVPYFPANA-----VLVTTLENLSIYFMDESHRRSIDE 309 (355) Q Consensus 237 G~dLl~~k~~~l~-n~~~~~te~~aa~~~-~~~k~iGGlpa~~~PffP~~~-----ilIT~l~NLsIY~Q~gs~RR~~~d 309 (355) .+.... .+..+ .....+- -.... ....+|-|+|++..+++|... +++-.+++.-..+.++..+=...+ T Consensus 274 n~~~~~--~l~~lkd~~G~~i---~~~~~~~~~~tl~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~ 348 (421) T protein:vir:13 274 NSDGRA--YLDGLMDKQGRPL---LKELSDGGDLVFKGRPVIELEESIFDVGDETKFIVSDFKTLIKFMDRKQYLIDQSK 348 (421) T ss_pred cHHHHH--HHHHhhcCCCcee---ecCcCCCCCceecceeeEEeccccccCCCceEEEEEeccccEEEEEecceEEEeec Confidence 887665 23333 2211111 01110 113579999999999999764 788889985444555555544444 Q ss_pred ccchhhhhhhhhhhh-hhhc--------cccccEEEEecce----ecCccCCCCcCCCC Q lcl|Aclame:pro 310 NPKKDRVENYESMNI-DYVV--------EVYAAGCLLENIT----LGDFTAPAAPESGA 355 (355) Q Consensus 310 ~p~r~rve~y~s~Ne-~YvV--------Ed~~~~a~ienI~----~~~~~~~~~~~~~a 355 (355) ++ |+.+|. +|.+ =+.+.++.+.-.. +.....|++...++ T Consensus 349 ~~-------~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~a~v~~~~~~~~~~~~~ 400 (421) T protein:vir:13 349 EA-------GYTKNETIARIIERFDVNSPLDKSSDAEKIRKFGVIVKLQEVLKSSPRSG 400 (421) T ss_pred cc-------ccccCeeEEEEEeeecceeecchhhheeeecccceeeccccccCCCCcCC Confidence 33 222332 3322 2222332222111 11111222222222 No 108 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=97.06 E-value=8.5e-05 Score=42.89 Aligned_cols=286 Identities=9% Similarity=0.057 Sum_probs=140.1 Q ss_pred CCHHHHHHHHHHHHHHH----------------HHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhh Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVA----------------ELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMK 64 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A----------------~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~ 64 (355) +...+...|..|+.... ...+... +....+.|-+.+...+++.+.+.+.++++++++++...+ T Consensus 98 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~t-~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~~~ 176 (402) T protein:vir:93 98 DNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGN-DSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGLE 176 (402) T ss_pred hhHHHHHHHHHHHHHHHhhhhHHHHHHhHHHHHhhhccCC-CcCCccccchhHHHHHHHhHHhhhhhhhhceeeecCCce Confidence 33333333333332211 1111110 112357787778889999999999999999999987655 Q ss_pred hhhhcccccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHH Q lcl|Aclame:pro 65 GEKIGVGVTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIM 144 (355) Q Consensus 65 Ge~v~lgv~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~ 144 (355) +-++.. +++-++-+. .+......+ ...+...|..++.---+.|+++.|+.. .++|+..+.+.++++++.=..- T Consensus 177 ~p~~~~--~~~~a~~v~--Eg~~~~~~~-~~f~~i~~~~~k~~~~i~iS~ell~Ds--~~~l~~~i~~~la~~~~~~e~~ 249 (402) T protein:vir:93 177 IPRVSY--TLDDDDFIT--DVETAKELK-AKGDTVKFTTNKFKVFAAISDTVIHGS--DVDLVNWVENALQSGLAAKERK 249 (402) T ss_pred eeeeec--cCCcccccc--ccccccccc-cccceeeecceeeeeechhhHHHHhhh--HHHHHHHHHHHHHHHHHHHHHH Confidence 544322 222232222 122222223 234556666666666678999988865 3478888999999888762111 Q ss_pred Hhh-cccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccc Q lcl|Aclame:pro 145 AGF-NGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLID 223 (355) Q Consensus 145 IGf-nG~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid 223 (355) .-| +| +.+..| .|++- .. +...+ ++.+ ..|.|+ +++.+| + T Consensus 250 ~~~~~g-------~g~g~p------~g~~~------------~~---~~~~~-------~~~~--~~d~l~-~~~~~l-~ 290 (402) T protein:vir:93 250 DALAVS-------PKSGLE------HMSFY------------NG---SVKEV-------EGAD--MYDAII-NALADL-H 290 (402) T ss_pred hHhhcC-------CCcccc------ceeee------------cc---ccccc-------cccc--hHHHHH-HHHhcc-C Confidence 122 22 222222 12221 00 00011 1111 246544 567655 6 Q ss_pred hhhhCCCCeEEEEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcE Q lcl|Aclame:pro 224 EVYQDDPNLVAIVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESH 303 (355) Q Consensus 224 ~~~~~~~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~ 303 (355) +.|+.. -+|||.+.-+.. -..++...+. .-.+ ....+|-|+|++....+|. +++=. +|-||.. + T Consensus 291 ~~y~~n--a~~imn~~t~~~-~~~~~~d~~~--~~~~----~~~~~llG~PV~~t~~~~~--i~~GD---f~~~~~~--~ 354 (402) T protein:vir:93 291 EDYRDN--ATIYMRYADYVK-IISVLSNGTT--NFFD----TPAEKVFGKPVVFTDAAVK--PIVGD---FNYFGIN--Y 354 (402) T ss_pred hhhhcC--CEEEEechHHHH-HHHHHhcCCC--cccc----cCCccccccceEEecCCCc--eeeec---hhhhhhh--h Confidence 777754 367776542221 2233322221 1111 2345788999999999875 55544 4545421 2 Q ss_pred EEEEEEccchhhhhhhhhhhhhhhccccccEEEE--ecceecCccCCCCcCCC Q lcl|Aclame:pro 304 RRSIDENPKKDRVENYESMNIDYVVEVYAAGCLL--ENITLGDFTAPAAPESG 354 (355) Q Consensus 304 RR~~~d~p~r~rve~y~s~Ne~YvVEd~~~~a~i--enI~~~~~~~~~~~~~~ 354 (355) +|.. .++..+...-..+|+...+--+..+ |.|.+....+++.+-|. T Consensus 355 ~~~~-----~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~ik~~~~~~~~ 402 (402) T protein:vir:93 355 DGTT-----YDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENTGPLPS 402 (402) T ss_pred hhhh-----hhhhhcccCCceEEEEEEEeCcEEechhheEEEEeecCCCCCCC Confidence 2221 2222222223344544443334444 25555444344333333 No 109 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=97.03 E-value=3.5e-05 Score=45.03 Aligned_cols=277 Identities=13% Similarity=0.071 Sum_probs=134.9 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt 80 (355) .....+..|..++......-..........+.|-+...+.+.+ ..+.+..++.+++++++...|.......++.-++-. T Consensus 112 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~-~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 190 (397) T protein:vir:96 112 ELAEKRSAINAFVKSKGAEKRDGFTSVEGGALIPQELLQPQLE-PKDIVDLSKYVRSVPVNSASGKFPVISKSGSKMATV 190 (397) T ss_pred HHHHHHHHHHHHHHhhhhhhhhcccccccccchhHHHHHHHHH-hhhhhhHHHhhhhccccccceeEEEEeccCCccccc Confidence 2233444555554433211111111123466677777777776 466777899999999988887765544433333221 Q ss_pred cCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhh Q lcl|Aclame:pro 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) Q Consensus 81 ~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~a 160 (355) .-+ ...........+...+.+++.---+.++.+.|+... ++|+..+.+.+.+.++.-.-.--++|+..+. T Consensus 191 ~E~--~~~~~~~~~~~~~i~~~~~~~~~~~~~s~ell~ds~--~~l~~~i~~~l~~~~~~~~~~~i~~g~g~~~------ 260 (397) T protein:vir:96 191 QQL--EKNPQLANPKMVEIDYSVATRRGYIPISQEMIDDAS--YDVTGLIADEIQDQSLNTKNADIAAVLKTAT------ 260 (397) T ss_pred ccc--ccccccccccccceeecHhHhhcchhhHHHHHhhhH--HHHHHHHHHHHHHHHHHHHHHHHhhcccccc------ Confidence 111 112112222344555555555445667788887653 4688888888888777644333334422110 Q ss_pred hhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHH Q lcl|Aclame:pro 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKL 240 (355) Q Consensus 161 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dL 240 (355) | .+ .. +.|. +.+++...+++.+ .-+++|.+.. T Consensus 261 -------------------~---------------~~------~~---~~d~-~~~~~~~~~~~~~----~a~~v~n~~~ 292 (397) T protein:vir:96 261 -------------------A---------------KS------VV---GVDG-LKDLINKEIKKVY----DVKLFISASM 292 (397) T ss_pred -------------------c---------------cc------cc---chHH-HHHHHHHhhhhhc----CcEEEEcHHH Confidence 0 00 01 2333 2345555555543 3489999877 Q ss_pred HHHHHHHHHhhccccchhhHHHHH-HhhhhhcccccccCCccCCC------cEEEecCCCcEEEEeeCcEEEEEEEccch Q lcl|Aclame:pro 241 LADKYFPLVNKQQENSESLAADII-ISQKRIGNLPAVRVPYFPAN------AVLVTTLENLSIYFMDESHRRSIDENPKK 313 (355) Q Consensus 241 l~~k~~~l~n~~~~~te~~aa~~~-~~~k~iGGlpa~~~PffP~~------~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r 313 (355) +. .+..+-..++.. ....++. ....++-|+|++..+..+.+ .+++-.|++....+-++...-...++. T Consensus 293 ~~--~l~~lkd~~G~~-~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~~-- 367 (397) T protein:vir:96 293 YS--ELDKLKDKNGRY-LLQDSITAASGKQLLGKEVVVLDDDVIGKSVGNVVGFIGDAKAFASFFDRKQVSVSWVDNN-- 367 (397) T ss_pred HH--HHHHhhccCCCe-EeccCccCCCcccccccceEEecccccCCCCCceEEEEeehhcceEeEeecceEEEEeccc-- Confidence 65 333332111110 0101111 12347899999876654333 278888888644444444443333221 Q ss_pred hhhhhhhhhhhhh-hccccccEEEE-ecceecCccCC Q lcl|Aclame:pro 314 DRVENYESMNIDY-VVEVYAAGCLL-ENITLGDFTAP 348 (355) Q Consensus 314 ~rve~y~s~Ne~Y-vVEd~~~~a~i-enI~~~~~~~~ 348 (355) ++.. ++ +++.++....- +.+.....++. T Consensus 368 -----~~~~--~~~~~~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 368 -----IYGQ--LLAGIIRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred -----ccce--eEEEEEEEccEEecccceEEEEeecC Confidence 1111 22 33334332222 13333222211 No 110 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=97.01 E-value=0.00012 Score=42.02 Aligned_cols=285 Identities=8% Similarity=0.075 Sum_probs=145.4 Q ss_pred CCHHHH--HHHHHHHHH----------------HHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchh Q lcl|Aclame:pro 1 MRPETR--FKFNAYLTR----------------VAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAE 62 (355) Q Consensus 1 M~~~tr--~~f~~y~~~----------------~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e 62 (355) +....+ ..|..|... ..+..+... .....|.|-..+...+++.+++.+.+.+.++++++.. T Consensus 46 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~al~~~~-~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~~~ 124 (352) T protein:vir:78 46 LNDNEKLVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGN-DSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKG 124 (352) T ss_pred cchhhhHHHHHHHHHHHHhhhhHHHHHHhhHHHHHHHhccCC-CCCCceeccHhHHHHHHHHHHhhcchhhheeeEecCC Confidence 111110 111111111 111222211 1223567766788899999999999999999988765 Q ss_pred hhhhhhcccccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhh-- Q lcl|Aclame:pro 63 MKGEKIGVGVTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQAL-- 140 (355) Q Consensus 63 ~~Ge~v~lgv~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~al-- 140 (355) ....++.. +++-++-+.-+ ......+ ...+...|..++.---+.|+++.|+.=+ ++++..+.+.+.+.++. T Consensus 125 ~~~p~~~~--~~~~a~~v~E~--~~~~~~~-~~f~~v~~~~~k~~~~i~is~ell~Ds~--~~l~~~i~~~la~~~~~~e 197 (352) T protein:vir:78 125 LEIPRVSY--TLDDDDFITDV--ETAKELK-LKGDTVKFTTNKFKVFAAISDTVIHGSD--VDLVNWVENALQSGLAAKE 197 (352) T ss_pred ceEEEEec--CCCcccccccc--ccccccc-ccceeeeecceeEEeechhhHHHHhhhh--HHHHHHHHHHHHHHHHHHH Confidence 44333322 22333332222 2222222 3456677777777777889999988633 57888899999988864 Q ss_pred hHHHHhhcccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhc Q lcl|Aclame:pro 141 DLIMAGFNGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNN 220 (355) Q Consensus 141 D~i~IGfnG~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~ 220 (355) +-..+| +| +.+..|. |.|. .. .... ++ | ...| |.| -+++.. T Consensus 198 ~~~~~~-~g-------~g~~~~~------g~l~------------~~---~~~~----~t-~-~~~~---d~i-~~~~~~ 238 (352) T protein:vir:78 198 RKDALA-VS-------PKSGLEH------MSFY------------NG---SVKE----VE-G-ANMY---DAI-INALAD 238 (352) T ss_pred HHhhhh-cC-------CCCcccc------ccee------------cc---cccc----cc-c-cchH---HHH-HHHHhc Confidence 222222 23 2222222 2211 00 0000 10 1 1123 543 346664 Q ss_pred ccchhhhCCCCeEEEEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEee Q lcl|Aclame:pro 221 LIDEVYQDDPNLVAIVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMD 300 (355) Q Consensus 221 lid~~~~~~~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~ 300 (355) +++-|++. -+++|.+.-... -..+....+.+ -.. ....+|-|+|++....+|. +++=. +|-||.. T Consensus 239 -l~~~~~~~--a~~~mn~~t~~~-l~~~~~~~~~~--~~~----~~~~~llG~PV~~~~~~~~--~~~Gd---f~~~~~~ 303 (352) T protein:vir:78 239 -LHEDYRDN--ATIYMRYADYVK-IISVLSNGTTN--FFD----TPAEKVFGKPVVFTDAAVK--PIVGD---FNYFGIN 303 (352) T ss_pred -cChhhhcC--CEEEEehHHHHH-HHHHHhccCCc--ccc----cCCccccccceEEecCCCc--eeEee---hhhhhhh Confidence 47778764 477887653331 22333222222 121 1234788999999998875 55544 4444431 Q ss_pred CcEEEEEEEccchhhhhhhhhhhhhhhccccccEEEE--ecceecCccCCCCcCCC Q lcl|Aclame:pro 301 ESHRRSIDENPKKDRVENYESMNIDYVVEVYAAGCLL--ENITLGDFTAPAAPESG 354 (355) Q Consensus 301 gs~RR~~~d~p~r~rve~y~s~Ne~YvVEd~~~~a~i--enI~~~~~~~~~~~~~~ 354 (355) +++.. .++..++..-..+|+...+--+.++ |.+.+...++++.+.|. T Consensus 304 --~~~~~-----~~~~~~~~~g~~~f~~~~r~Dg~~~~~eA~~~l~~~a~~~~~~~ 352 (352) T protein:vir:78 304 --YDGTT-----YDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKESTGSLPS 352 (352) T ss_pred --hhhhe-----eeeeccccCCeeEEEEEeeeCceeechhheEEEEeecccCCCCC Confidence 11111 1222333444566766555555556 37777666666666666 No 111 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=96.80 E-value=0.00031 Score=39.83 Aligned_cols=287 Identities=9% Similarity=0.051 Sum_probs=146.2 Q ss_pred CCHHHHHHHHHHHHHHHHHh----------------CCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhh Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELN----------------NISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMK 64 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~n----------------gv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~ 64 (355) .+......|..|+.+..... +... ...-.+.|-..+...+++.+.+.+.+.+.++++++...+ T Consensus 83 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~al~~~t-~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~~~ 161 (387) T protein:vir:93 83 DHEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGN-DSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGLE 161 (387) T ss_pred hhhHHHHHHHHHHHHHhhhhhhhhhhhhhHHHHHhhccCc-CCCCceeechhHHHHHHHHHHhhchhhhheeeeecCCce Confidence 22222233444444332111 1100 012257777777888999999999999999999887655 Q ss_pred hhhhcccccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHH Q lcl|Aclame:pro 65 GEKIGVGVTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIM 144 (355) Q Consensus 65 Ge~v~lgv~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~ 144 (355) .-++.. ++.-++-+.-+ ......+ ...+...|.+++.---+.|+++.|+. ..++|+..+.+.++++++.=..- T Consensus 162 ~p~~~~--~~~~a~~v~E~--~~~~~~~-~~f~~v~~~~~k~~~~~~iS~ell~D--s~~~l~~~i~~~la~~~~~~e~~ 234 (387) T protein:vir:93 162 IPRVSY--TLDDDDFITDV--ETAKELK-LKGDTVKFTTNKFKVFAAISDTVIHG--SDVDLVNWVENALQSGLAAKERK 234 (387) T ss_pred EEEEee--cCCccccccCc--ccccccc-cccceeeeeheeeeeechhhHHHHhh--hHHHHHHHHHHHHHHHHHHHHHH Confidence 433322 22333332222 2222222 33556677777777678888888863 13479999999999988753222 Q ss_pred HhhcccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccch Q lcl|Aclame:pro 145 AGFNGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDE 224 (355) Q Consensus 145 IGfnG~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~ 224 (355) ..|. +.+.+..| .|+|- .. .... + ..... .|.+ -+++.+ +++ T Consensus 235 ~~~~------~g~g~g~p------~g~l~------------~~---~~~~----v--~~~~~---~d~i-~~~~~~-l~~ 276 (387) T protein:vir:93 235 DALA------VSPKSGLD------HMSFY------------NG---SVKE----V--EGADM---YDAI-INALAD-LHE 276 (387) T ss_pred hHhh------cCCCcccc------ceeee------------cc---cccc----c--cccch---HHHH-HHHHhc-cCh Confidence 2232 11222222 23331 00 0000 1 11112 3543 456765 477 Q ss_pred hhhCCCCeEEEEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEE Q lcl|Aclame:pro 225 VYQDDPNLVAIVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHR 304 (355) Q Consensus 225 ~~~~~~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~R 304 (355) .|+.. -+++|.+.-.. +..+++...+.+ -.+ ....+|-|+|++....+|. +++-.|+. ||.. ++ T Consensus 277 ~~~~~--a~~~mn~~t~~-~~~~~~~d~~~~--~~~----~~~~~llG~PV~~~~~~~~--~~~GDf~~---~~~~--~~ 340 (387) T protein:vir:93 277 DYRDN--ATIYMRYADYV-KIISVLSNGTTN--FFD----TPAEKVFGKPVVFTDAAVK--PIVGDFNY---FGIN--YD 340 (387) T ss_pred hhhcC--CEEEEechHHH-HHHHHHhcCCCc--ccc----cCCccccccceEEecCCCc--eeeeehhh---hhee--hh Confidence 78764 47777754222 123343322221 111 2345888999999998875 56665554 4321 22 Q ss_pred EEEEEccchhhhhhhhhhhhhhhccccccEEEE--ecceecCccCCCCcCCC Q lcl|Aclame:pro 305 RSIDENPKKDRVENYESMNIDYVVEVYAAGCLL--ENITLGDFTAPAAPESG 354 (355) Q Consensus 305 R~~~d~p~r~rve~y~s~Ne~YvVEd~~~~a~i--enI~~~~~~~~~~~~~~ 354 (355) +... ++...+..-..+|+....--+..+ |.+.+....+++++.|. T Consensus 341 ~~~~-----~~~~~~~~~~~~~~~~~r~d~~v~~~eA~~~l~~k~~~~~~~~ 387 (387) T protein:vir:93 341 GTTY-----DTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENTGSLPS 387 (387) T ss_pred hhee-----eecccccCCceeEEEEeeeCceeechhheEEEEeecCCCCCCC Confidence 2222 222333334455655533333444 36666555555555555 No 112 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=96.73 E-value=0.00038 Score=39.33 Aligned_cols=282 Identities=10% Similarity=0.013 Sum_probs=128.4 Q ss_pred CCHHHHHHHHHH---HHHHHHH--hCCChH-----HcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcc Q lcl|Aclame:pro 1 MRPETRFKFNAY---LTRVAEL--NNISTD-----DVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGV 70 (355) Q Consensus 1 M~~~tr~~f~~y---~~~~A~~--ngv~~~-----~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~l 70 (355) ..+.....+... ..++... .++... .....+.|-+.....+...+.+.+.++++++++++.--....... T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 156 (379) T protein:vir:10 77 KSDSLVKSITENFNDIKEVRNGKSIQVKAVGDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSISGGTYTFVRE 156 (379) T ss_pred cchhHHHHHHHHHHhHHHHHhhhhhhhhhhcccccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeeccCCceEEEEe Confidence 001111111110 1111000 011100 011233455667778888888999999999998876433332211 Q ss_pred -cccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhh--hhHHHHhh Q lcl|Aclame:pro 71 -GVTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQA--LDLIMAGF 147 (355) Q Consensus 71 -gv~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~a--lD~i~IGf 147 (355) |.++.-+. ..+.+... |.....++...|..++.---+.|+-+.|+.. |.++..+.+.+.+.++ +|.-.+|- T Consensus 157 ~~~~~~~~~--~v~Eg~~~-~~~~~~f~~i~~~~~k~~~~~~iS~ell~D~---~~l~~~i~~~la~~~~~~~~~~~~~g 230 (379) T protein:vir:10 157 NGAGEGAIG--AQVEGATK-GQKDYDISMIDVNTDFIAGFTRYSKKMANNL---PFLTSFIPNALRRDYAKAENAAFNAV 230 (379) T ss_pred ecCCCcccc--cccCCccc-cccccceeeeEeeeeeEEeeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 22221111 11222222 2222346666666666666677888888764 5688888887777665 33333332 Q ss_pred cccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhh Q lcl|Aclame:pro 148 NGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQ 227 (355) Q Consensus 148 nG~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~ 227 (355) .|+. ++..+ . ....+..+|.++ +++..+.+. ++ T Consensus 231 ~~~~---------------------------~~~~~---------------~---~~~~~~~~d~i~-~~~~~~~~~-~~ 263 (379) T protein:vir:10 231 LAAN---------------------------ATAST---------------E---IITNKNKVEMLI-NEIAKQENL-DF 263 (379) T ss_pred cccc---------------------------ccccc---------------c---cccCcccHHHHH-HHHHhhhhc-cC Confidence 2211 00000 0 112234566544 455555444 33 Q ss_pred CCCCeEEEEcHHHHHHHHHHHHhhccccchhhHH-HHH---HhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcE Q lcl|Aclame:pro 228 DDPNLVAIVGRKLLADKYFPLVNKQQENSESLAA-DII---ISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESH 303 (355) Q Consensus 228 ~~~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa-~~~---~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~ 303 (355) ... +++|.+.-+. .+..+-..++ .-+.. ... ....++-|+|++..|.+|++.+++=.++...+-+.+|. T Consensus 264 ~~~--~~vmn~~~~~--~l~~lkd~~G--~~l~~~~~~~~~~~~~~l~G~pvv~s~~~~ag~~~~gdf~~~~~~~~~~~- 336 (379) T protein:vir:10 264 PVT--AIVLRPTDYY--DILVTQKSVG--AGYGLPGVVTQDNGVLRINGIPLFRATWLAANKYYVGDWTRVTKVTTEGL- 336 (379) T ss_pred CCC--EEEEcHHHHH--HHHHhhccCC--ceeccCCccCCCCCcceecceeeEecCCCCCCceEEeecccEEEEEEece- Confidence 322 5778776433 2332211111 11100 000 11247889999999999999999988887544443332 Q ss_pred EEEEEEcc----chhhhhhhhhhhhhhhccccccEEEEecceecCccCC Q lcl|Aclame:pro 304 RRSIDENP----KKDRVENYESMNIDYVVEVYAAGCLLENITLGDFTAP 348 (355) Q Consensus 304 RR~~~d~p----~r~rve~y~s~Ne~YvVEd~~~~a~ienI~~~~~~~~ 348 (355) +-.+-.++ .+|.+.-.-..=-+..|=+.++++.++ -++- T Consensus 337 ~i~~~~~~~~~f~~~~~~~r~~~R~~~~v~~p~a~v~~~------~~~~ 379 (379) T protein:vir:10 337 SLEFSEVEGTNFVKNNITARIEAQVALAVEQPAALIFGD------FTAV 379 (379) T ss_pred EEEEeecccccccCCcEEEEEEEEeccEEecCccEEEEE------ecCC Confidence 11222221 122211111111233333444444433 1111 No 113 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=96.67 E-value=0.00021 Score=40.70 Aligned_cols=286 Identities=9% Similarity=0.052 Sum_probs=136.4 Q ss_pred CCHHHHHHHHHHHHHHHH----------------HhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhh Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAE----------------LNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMK 64 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~----------------~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~ 64 (355) ........|..|+..... ...... ...-.+.|-+.+...+++.+.+.+.++++++++++...+ T Consensus 83 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~-~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~ 161 (387) T protein:vir:96 83 DNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGN-DSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGLE 161 (387) T ss_pred hhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCC-CCCCceeechhHHHHHHHHHHhhchhhhhceeeecCCce Confidence 222222233333322211 111100 112357787778999999999999999999999987665 Q ss_pred hhhhcccccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHH Q lcl|Aclame:pro 65 GEKIGVGVTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIM 144 (355) Q Consensus 65 Ge~v~lgv~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~ 144 (355) .-++..+ ++-++-+.-+ ......+ ...+...|..++.---+.|+++.|+.. .++|+..+.+.+.++++.-..- T Consensus 162 ~p~~~~~--~~~a~~v~Eg--~~~~~~~-~~f~~v~l~~~k~~~~i~iS~ell~ds--~~~l~~~i~~~la~~~~~~e~~ 234 (387) T protein:vir:96 162 IPRVSYT--LDDDDFITDV--ETAKELK-AKGDTVKFTTNKFKVFAAISDTVIHGS--DVDLVNWVENALQSGLAAKERK 234 (387) T ss_pred eeeeecc--CCcccccccc--ccccccc-cccceeeechheeeeechhhHHHHhhh--HHHHHHHHHHHHHHHHHHHHHH Confidence 5443332 2223222211 1121122 123334444444444577888988865 3578888999999988764222 Q ss_pred Hhh-cccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccc Q lcl|Aclame:pro 145 AGF-NGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLID 223 (355) Q Consensus 145 IGf-nG~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid 223 (355) .-| +| +.+.-| .|.+ ... +...++ | .. ..|.|+ +++.+| + T Consensus 235 ~~~~~g-------~g~g~~------~g~~------------~~~---~~~~~~-----~-~~---~~d~i~-~~~~~l-~ 275 (387) T protein:vir:96 235 DALAVS-------PKSGLE------HMSF------------YNG---SVKEVE-----G-AD---MYDAII-NALADL-H 275 (387) T ss_pred hHhhcC-------CCcccc------ceee------------ecc---cccccc-----c-cc---hHHHHH-HHHhcc-C Confidence 222 22 111112 1211 000 000010 1 11 246544 566654 6 Q ss_pred hhhhCCCCeEEEEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcE Q lcl|Aclame:pro 224 EVYQDDPNLVAIVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESH 303 (355) Q Consensus 224 ~~~~~~~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~ 303 (355) +.|+.. -+|+|.+.-+.. -..++...+.+ -.. ....+|-|+|++...++|. +++=.| |-||. + + T Consensus 276 ~~y~~n--a~~imn~~t~~~-~~~~~~~~~~~--~~~----~~~~~llG~PV~~~~~~~~--~~~GDf---~~~~~-~-~ 339 (387) T protein:vir:96 276 EDYRDN--ATIYMRYADYVK-IISVLSNGTTN--FFD----TPAEKVFGKPVVFTDAAVK--PIVGDF---NYFGI-N-Y 339 (387) T ss_pred hhhhcC--CEEEEechHHHH-HHHHHhcCCCc--ccc----cCCccccccceEEecCCCc--eeeech---hhhhh-h-h Confidence 777764 367776543221 22333222221 111 2345788999999998875 555444 44432 1 2 Q ss_pred EEEEEEccchhhhhhhhhhhhhhhccccccEEEE--ecceecCccCCCCcCCC Q lcl|Aclame:pro 304 RRSIDENPKKDRVENYESMNIDYVVEVYAAGCLL--ENITLGDFTAPAAPESG 354 (355) Q Consensus 304 RR~~~d~p~r~rve~y~s~Ne~YvVEd~~~~a~i--enI~~~~~~~~~~~~~~ 354 (355) ++... ++..+...-..+|++...--+..+ +.|.+....++..|.|- T Consensus 340 ~~~~~-----~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~~~~~~~ 387 (387) T protein:vir:96 340 DGTTY-----DTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENTGPLPS 387 (387) T ss_pred hhhhh-----eecccccCCceEEEEEEEeCcEeechhheEEEEeecCCCCCCC Confidence 22221 122222222344544433333334 25555544444444444 No 114 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=96.67 E-value=0.00021 Score=40.70 Aligned_cols=286 Identities=9% Similarity=0.052 Sum_probs=136.4 Q ss_pred CCHHHHHHHHHHHHHHHH----------------HhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhh Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAE----------------LNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMK 64 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~----------------~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~ 64 (355) ........|..|+..... ...... ...-.+.|-+.+...+++.+.+.+.++++++++++...+ T Consensus 83 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~-~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~ 161 (387) T protein:vir:26 83 DNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGN-DSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGLE 161 (387) T ss_pred hhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCC-CCCCceeechhHHHHHHHHHHhhchhhhhceeeecCCce Confidence 222222233333322211 111100 112357787778999999999999999999999987665 Q ss_pred hhhhcccccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHH Q lcl|Aclame:pro 65 GEKIGVGVTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIM 144 (355) Q Consensus 65 Ge~v~lgv~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~ 144 (355) .-++..+ ++-++-+.-+ ......+ ...+...|..++.---+.|+++.|+.. .++|+..+.+.+.++++.-..- T Consensus 162 ~p~~~~~--~~~a~~v~Eg--~~~~~~~-~~f~~v~l~~~k~~~~i~iS~ell~ds--~~~l~~~i~~~la~~~~~~e~~ 234 (387) T protein:vir:26 162 IPRVSYT--LDDDDFITDV--ETAKELK-AKGDTVKFTTNKFKVFAAISDTVIHGS--DVDLVNWVENALQSGLAAKERK 234 (387) T ss_pred eeeeecc--CCcccccccc--ccccccc-cccceeeechheeeeechhhHHHHhhh--HHHHHHHHHHHHHHHHHHHHHH Confidence 5443332 2223222211 1121122 123334444444444577888988865 3578888999999988764222 Q ss_pred Hhh-cccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccc Q lcl|Aclame:pro 145 AGF-NGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLID 223 (355) Q Consensus 145 IGf-nG~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid 223 (355) .-| +| +.+.-| .|.+ ... +...++ | .. ..|.|+ +++.+| + T Consensus 235 ~~~~~g-------~g~g~~------~g~~------------~~~---~~~~~~-----~-~~---~~d~i~-~~~~~l-~ 275 (387) T protein:vir:26 235 DALAVS-------PKSGLE------HMSF------------YNG---SVKEVE-----G-AD---MYDAII-NALADL-H 275 (387) T ss_pred hHhhcC-------CCcccc------ceee------------ecc---cccccc-----c-cc---hHHHHH-HHHhcc-C Confidence 222 22 111112 1211 000 000010 1 11 246544 566654 6 Q ss_pred hhhhCCCCeEEEEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcE Q lcl|Aclame:pro 224 EVYQDDPNLVAIVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESH 303 (355) Q Consensus 224 ~~~~~~~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~ 303 (355) +.|+.. -+|+|.+.-+.. -..++...+.+ -.. ....+|-|+|++...++|. +++=.| |-||. + + T Consensus 276 ~~y~~n--a~~imn~~t~~~-~~~~~~~~~~~--~~~----~~~~~llG~PV~~~~~~~~--~~~GDf---~~~~~-~-~ 339 (387) T protein:vir:26 276 EDYRDN--ATIYMRYADYVK-IISVLSNGTTN--FFD----TPAEKVFGKPVVFTDAAVK--PIVGDF---NYFGI-N-Y 339 (387) T ss_pred hhhhcC--CEEEEechHHHH-HHHHHhcCCCc--ccc----cCCccccccceEEecCCCc--eeeech---hhhhh-h-h Confidence 777764 367776543221 22333222221 111 2345788999999998875 555444 44432 1 2 Q ss_pred EEEEEEccchhhhhhhhhhhhhhhccccccEEEE--ecceecCccCCCCcCCC Q lcl|Aclame:pro 304 RRSIDENPKKDRVENYESMNIDYVVEVYAAGCLL--ENITLGDFTAPAAPESG 354 (355) Q Consensus 304 RR~~~d~p~r~rve~y~s~Ne~YvVEd~~~~a~i--enI~~~~~~~~~~~~~~ 354 (355) ++... ++..+...-..+|++...--+..+ +.|.+....++..|.|- T Consensus 340 ~~~~~-----~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~~~~~~~ 387 (387) T protein:vir:26 340 DGTTY-----DTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENTGPLPS 387 (387) T ss_pred hhhhh-----eecccccCCceEEEEEEEeCcEeechhheEEEEeecCCCCCCC Confidence 22221 122222222344544433333334 25555544444444444 No 115 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=96.67 E-value=0.00021 Score=40.70 Aligned_cols=286 Identities=9% Similarity=0.052 Sum_probs=136.4 Q ss_pred CCHHHHHHHHHHHHHHHH----------------HhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhh Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAE----------------LNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMK 64 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~----------------~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~ 64 (355) ........|..|+..... ...... ...-.+.|-+.+...+++.+.+.+.++++++++++...+ T Consensus 83 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~-~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~ 161 (387) T protein:vir:94 83 DNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGN-DSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGLE 161 (387) T ss_pred hhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCC-CCCCceeechhHHHHHHHHHHhhchhhhhceeeecCCce Confidence 222222233333322211 111100 112357787778999999999999999999999987665 Q ss_pred hhhhcccccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHH Q lcl|Aclame:pro 65 GEKIGVGVTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIM 144 (355) Q Consensus 65 Ge~v~lgv~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~ 144 (355) .-++..+ ++-++-+.-+ ......+ ...+...|..++.---+.|+++.|+.. .++|+..+.+.+.++++.-..- T Consensus 162 ~p~~~~~--~~~a~~v~Eg--~~~~~~~-~~f~~v~l~~~k~~~~i~iS~ell~ds--~~~l~~~i~~~la~~~~~~e~~ 234 (387) T protein:vir:94 162 IPRVSYT--LDDDDFITDV--ETAKELK-AKGDTVKFTTNKFKVFAAISDTVIHGS--DVDLVNWVENALQSGLAAKERK 234 (387) T ss_pred eeeeecc--CCcccccccc--ccccccc-cccceeeechheeeeechhhHHHHhhh--HHHHHHHHHHHHHHHHHHHHHH Confidence 5443332 2223222211 1121122 123334444444444577888988865 3578888999999988764222 Q ss_pred Hhh-cccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccc Q lcl|Aclame:pro 145 AGF-NGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLID 223 (355) Q Consensus 145 IGf-nG~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid 223 (355) .-| +| +.+.-| .|.+ ... +...++ | .. ..|.|+ +++.+| + T Consensus 235 ~~~~~g-------~g~g~~------~g~~------------~~~---~~~~~~-----~-~~---~~d~i~-~~~~~l-~ 275 (387) T protein:vir:94 235 DALAVS-------PKSGLE------HMSF------------YNG---SVKEVE-----G-AD---MYDAII-NALADL-H 275 (387) T ss_pred hHhhcC-------CCcccc------ceee------------ecc---cccccc-----c-cc---hHHHHH-HHHhcc-C Confidence 222 22 111112 1211 000 000010 1 11 246544 566654 6 Q ss_pred hhhhCCCCeEEEEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcE Q lcl|Aclame:pro 224 EVYQDDPNLVAIVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESH 303 (355) Q Consensus 224 ~~~~~~~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~ 303 (355) +.|+.. -+|+|.+.-+.. -..++...+.+ -.. ....+|-|+|++...++|. +++=.| |-||. + + T Consensus 276 ~~y~~n--a~~imn~~t~~~-~~~~~~~~~~~--~~~----~~~~~llG~PV~~~~~~~~--~~~GDf---~~~~~-~-~ 339 (387) T protein:vir:94 276 EDYRDN--ATIYMRYADYVK-IISVLSNGTTN--FFD----TPAEKVFGKPVVFTDAAVK--PIVGDF---NYFGI-N-Y 339 (387) T ss_pred hhhhcC--CEEEEechHHHH-HHHHHhcCCCc--ccc----cCCccccccceEEecCCCc--eeeech---hhhhh-h-h Confidence 777764 367776543221 22333222221 111 2345788999999998875 555444 44432 1 2 Q ss_pred EEEEEEccchhhhhhhhhhhhhhhccccccEEEE--ecceecCccCCCCcCCC Q lcl|Aclame:pro 304 RRSIDENPKKDRVENYESMNIDYVVEVYAAGCLL--ENITLGDFTAPAAPESG 354 (355) Q Consensus 304 RR~~~d~p~r~rve~y~s~Ne~YvVEd~~~~a~i--enI~~~~~~~~~~~~~~ 354 (355) ++... ++..+...-..+|++...--+..+ +.|.+....++..|.|- T Consensus 340 ~~~~~-----~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~~~~~~~ 387 (387) T protein:vir:94 340 DGTTY-----DTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKENTGPLPS 387 (387) T ss_pred hhhhh-----eecccccCCceEEEEEEEeCcEeechhheEEEEeecCCCCCCC Confidence 22221 122222222344544433333334 25555544444444444 No 116 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=96.54 E-value=0.00053 Score=38.56 Aligned_cols=292 Identities=12% Similarity=0.107 Sum_probs=144.6 Q ss_pred CC-----HH---------HHHHH-HHHHHHHHHHhCCChH------------------HcceeeecCcHH-HHHHHHHHH Q lcl|Aclame:pro 1 MR-----PE---------TRFKF-NAYLTRVAELNNISTD------------------DVSKKFTVEPSV-TQTLMNTVQ 46 (355) Q Consensus 1 M~-----~~---------tr~~f-~~y~~~~A~~ngv~~~------------------~v~~~Fsv~P~~-~q~L~~~iq 46 (355) |+ +. ....| ..+...+++.+|-... ..+..+.|-|.+ .+.+++.+. T Consensus 304 ~~~~~l~rai~a~a~~~~~~a~~~~e~a~~~a~~~G~~arg~~~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr 383 (632) T protein:vir:96 304 LQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILR 383 (632) T ss_pred HHHHHHHHHHHhhhccchhhhhhhhHHHHHHHHhhhhhhhhhhhhHHHHHHhhhhcccccccccccccccchHHHHHHHh Confidence 00 00 00011 1112233333332110 011244555564 578999998 Q ss_pred hhHHHhCcCccccchhhhhhhh-cccccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccch Q lcl|Aclame:pro 47 ASSAFLKTINILPVAEMKGEKI-GVGVTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQD 125 (355) Q Consensus 47 ess~FL~~INv~~V~e~~Ge~v-~lgv~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~d 125 (355) +++-+.+ +.+-.+.-..|..- -.-.+|+-++=+. .+... +......+...+..++.---..|+.+.|+.. .++ T Consensus 384 ~~s~i~~-l~~~~~~~~~g~~~ip~~~~~~~a~wv~--E~~~~-~~s~~~f~~i~l~~~k~~~~v~iS~ell~ds--~~~ 457 (632) T protein:vir:96 384 NKAIIGQ-MGARMLPGLVGDVDIPKKTSGANFYWIG--EDEDV-QDSDFDFTTLSFSPKTIAGAVPVTRKLRKQS--SIH 457 (632) T ss_pred hcchhhh-hcceEeecCCcceEEEEEeCCceeEeec--CCccc-cccccceeeEEeeeeEEEEehhhHHHHHhcc--chH Confidence 8776544 33333333333321 1111233332222 22222 2222346667777777777788888888764 568 Q ss_pred HHHHHHHHHHHHhhhhHHHHhhcccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCc Q lcl|Aclame:pro 126 FQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNG 205 (355) Q Consensus 126 F~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~gg 205 (355) ++..+++.+...++.-.-.-.++|+-.+ .+|. |.+.. + + +......+.+- T Consensus 458 ~~~~i~~~l~~a~~~~~d~a~l~G~G~~------~~p~------Gi~~~----~------------~--~~~~~~~~~~~ 507 (632) T protein:vir:96 458 VENLIREDLIEGIGVALDLAMLTGTGLA------NDPV------GLLNM----T------------G--VPALTYPAGGV 507 (632) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcccCCC------Cccc------eeeec----c------------c--ccceecccccC Confidence 9999999999999865555556774322 1232 33321 0 0 00000112334 Q ss_pred chhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCc Q lcl|Aclame:pro 206 DYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANA 285 (355) Q Consensus 206 dy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ 285 (355) +|.++..|... +...+.+....+++|....... +.+..-.+... .-+....++-|+|++...++|++. T Consensus 508 ~~~~i~~~~~~-----i~~~~~~~~~~~~~~~~~~~~~--l~~~~l~d~~G-----~~i~~~~~l~G~pv~~s~~ip~~~ 575 (632) T protein:vir:96 508 DWASVVDMETK-----ISTFNADAGRLAYLTSVTQRGA--AKKAQVFDNTG-----ERIWQNNEVNGYRAEASNQIPADT 575 (632) T ss_pred CHHHHHHHHHH-----HhhcccccCccEEEEchhHHHH--HHHHhccCCCC-----ceeecCCeecccceEeccccccCc Confidence 67776655432 3344555667899998765542 22111111111 112234678999999999999999 Q ss_pred EEEecCCCcEEEEeeCcEEEEEEEccchhhhhhhhhhhhhhh-ccccccEEEE-ecceecCccC Q lcl|Aclame:pro 286 VLVTTLENLSIYFMDESHRRSIDENPKKDRVENYESMNIDYV-VEVYAAGCLL-ENITLGDFTA 347 (355) Q Consensus 286 ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~rve~y~s~Ne~Yv-VEd~~~~a~i-enI~~~~~~~ 347 (355) +++-.++.+-| ...+.++=.+- ++..+.+-...|. .++++....- |.+.+....+ T Consensus 576 ~~~gd~s~~~i-~~~~~~~i~~~------~~~~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 576 WIFGDWSQIVI-AMWGVLDLKVD------PYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred EEEeecceEEE-EEecceEEEEc------cccccccCceEEEEEeecCceeechhhhhheeecC Confidence 99999998633 34455554332 1122222222332 2333332222 3566655444 No 117 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=96.47 E-value=0.00059 Score=38.29 Aligned_cols=300 Identities=10% Similarity=0.106 Sum_probs=140.0 Q ss_pred CCHH-HHHHHHHHHHHHH-------------HH-----------------hCCCh-HHcceeeecCcHHHHHHHHHHHhh Q lcl|Aclame:pro 1 MRPE-TRFKFNAYLTRVA-------------EL-----------------NNIST-DDVSKKFTVEPSVTQTLMNTVQAS 48 (355) Q Consensus 1 M~~~-tr~~f~~y~~~~A-------------~~-----------------ngv~~-~~v~~~Fsv~P~~~q~L~~~iqes 48 (355) ..+. ....|..++..++ +. .|... ....-.|.|.....+.+++.+.+. T Consensus 286 ~~~~~kg~~f~~~~~al~~~~g~~~~a~e~a~~~~~~~~~~~~~~~~a~~~~~~~~~~~~Gg~~vp~~~~~~ii~~l~~~ 365 (645) T protein:vir:93 286 EQKLDKGIGFARFAKSLAAAKGVRSEALEVARRQYPDDSRLHHVLKSAVGAGTTTDPQWAGSLSEYQEYAQDFIDYLRPQ 365 (645) T ss_pred hhhhhhhhhHHHHHHHHHhcccchhHHHHHHHhhcccchhhhhhhhhhhhccccccccccCCccCchhhHHHHHHhhhhh Confidence 0000 0011222222111 11 11100 011235666667788899999998 Q ss_pred HHHhCcCcc-cc-chhhh-hhhhcccccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccch Q lcl|Aclame:pro 49 SAFLKTINI-LP-VAEMK-GEKIGVGVTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQD 125 (355) Q Consensus 49 s~FL~~INv-~~-V~e~~-Ge~v~lgv~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~d 125 (355) |-+.+.-.. ++ ..... +..+-.-.+|+.++=+.-+ ... |.....++...+..++.---+.|+=+.|+.. .++ T Consensus 366 svv~~l~~~~~~~~~~~~~~~~ip~~t~~~~a~wv~Eg--~~~-~~s~~~f~~v~l~~~kla~~~~iS~ell~ds--~~~ 440 (645) T protein:vir:93 366 TIIGRFGQGGIPALRQVPFNIRVHAQVSGGAAGWVGEG--KTK-PLTKFDFESITFSHAKVSAIAVLTEELIRFS--SPA 440 (645) T ss_pred hhHHhhccccccccccccCceeeeeeecCcceEEeccC--ccc-cccccceeEEEEeeEEEEEeehhHHHHHhhc--hHH Confidence 877655322 11 11111 2233333345555444332 222 3333356666666766555555666666543 367 Q ss_pred HHHHHHHHHHHHhhhhHHHHhhccccccc-CCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCC Q lcl|Aclame:pro 126 FQRRIRDAIVKRQALDLIMAGFNGTTRAD-TSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKN 204 (355) Q Consensus 126 F~~~i~~~i~~~~alD~i~IGfnG~s~A~-~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~g 204 (355) ++..+++.+.+.++.=.-.--++|+-.+. ... |.+ +..... ..-..+ T Consensus 441 ~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~~----p~g------------------i~~~~~----------~~~~~~ 488 (645) T protein:vir:93 441 ADALVRNALAEAVVARLDTDFVDPKKAAVADVS----PAS------------------ITHDVK----------GTASSG 488 (645) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcCCCcccCCcc----ccc------------------eecccc----------cccccc Confidence 88888888888776433333345533221 111 211 110000 001122 Q ss_pred cchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCC Q lcl|Aclame:pro 205 GDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPAN 284 (355) Q Consensus 205 gdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~ 284 (355) ..+.++..+...+...-+ +-..-|++|.+..... +..+ .+.+....--++-....++-|+|++...++|++ T Consensus 489 ~~~~d~~~~~~~~~~a~~-----~~~~a~~vmn~~~~~~--L~~l--kd~~G~~~~~~~~~~~~tL~G~PV~~s~~vp~~ 559 (645) T protein:vir:93 489 NPDADAEAAFGQFVAANL-----QPTGAVWLMSSTNALA--LSMR--KNALGQKEYPDMTLLGGSFQGLPVIVSQYVGDQ 559 (645) T ss_pred chHHHHHHHHHHHHhcCC-----CccccEEEEcHHHHHH--HHhc--cccCCceeecCCCCCCceeeceeeEEeccCCcc Confidence 234555554443332221 2235689999886542 2221 111111111111123458999999999999987 Q ss_pred cEEEecCCCcEEEEeeCcEEE--------EEEEccchhh-------hhhhhhhh-h--------hhhccccccEEEEecc Q lcl|Aclame:pro 285 AVLVTTLENLSIYFMDESHRR--------SIDENPKKDR-------VENYESMN-I--------DYVVEVYAAGCLLENI 340 (355) Q Consensus 285 ~ilIT~l~NLsIY~Q~gs~RR--------~~~d~p~r~r-------ve~y~s~N-e--------~YvVEd~~~~a~ienI 340 (355) -++. .++.+- +-..+..+= .+.+.|.-+- ..+.+..| - +|.|=+.++++.|.+| T Consensus 560 ~~~g-d~s~~~-ig~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt~~ 637 (645) T protein:vir:93 560 LVLV-NAPDIY-LADDGGVAVDMSREASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVITGV 637 (645) T ss_pred eeEe-ccccEE-EEEecceEEEeecceeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEEEecc Confidence 5554 556542 233343332 2222232221 11222232 2 5566678888888899 Q ss_pred eecCccCC Q lcl|Aclame:pro 341 TLGDFTAP 348 (355) Q Consensus 341 ~~~~~~~~ 348 (355) +++-+.-. T Consensus 638 ~~g~~~~~ 645 (645) T protein:vir:93 638 NYGSASGG 645 (645) T ss_pred cCCcccCC Confidence 88764333 No 118 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=95.83 E-value=0.001 Score=37.01 Aligned_cols=285 Identities=11% Similarity=0.005 Sum_probs=143.9 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt 80 (355) +.++.|..|+++++. +.+ ....+.|-+++..++.+.+.+.|..+++++++++.- +-++-...+++-|+=+ T Consensus 67 lt~ee~~~~~~~~~~-----~~~---~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~~--~~~~~~~~~~~~a~w~ 136 (377) T protein:vir:98 67 LTAEEIKFFNDIDKN-----VGG---KDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL--RLKALTAETSGTAVWG 136 (377) T ss_pred cCHHHHHHHHHHHhc-----cCC---CCCccccCHHHHHHHHHHHHHhhhhhhheeeEecCc--ceEEEEecCCcceeEe Confidence 777778777776543 222 233678888899999999999999999999887641 1233333333333221 Q ss_pred cCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhh Q lcl|Aclame:pro 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) Q Consensus 81 ~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~a 160 (355) . ...++.......++...+.+++.---..|+.+.|+.=. .+++..+++.+.++++.=.-.--+||+= +. T Consensus 137 ~--e~~~~~~~~~~~f~~i~l~~~kl~a~~~is~elL~ds~--~~ie~~i~~~la~~~a~~~~~a~i~G~G-------~~ 205 (377) T protein:vir:98 137 D--IFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGP--KWIKQFITEQLKEAIAVALELAIVKGDG-------LL 205 (377) T ss_pred e--cccccCcccCccceeEeecceeEEeeecccHHhhhccH--hHHHHHHHHHHHHHHHHHHhhceEeccC-------CC Confidence 1 11223333333455667777777777889999997532 2688889999999988766666667732 11 Q ss_pred hhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhh------------- Q lcl|Aclame:pro 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQ------------- 227 (355) Q Consensus 161 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~------------- 227 (355) - -+|+|..+- . ..+......+..+.|...|++ .++...+ ++-|+ T Consensus 206 q------P~Gil~~~~---------~------~~~~~~~~~~~~~~~~~~~~~-~~l~~~~-~~~~~~~a~~~m~~~t~~ 262 (377) T protein:vir:98 206 Q------PVGLLKDLS---------Q------PTVDQSTGRDITTYKTDKEAI-ADLSDLT-PDNAPKKLVPVMKHLSVN 262 (377) T ss_pred c------ceeeeeccc---------c------cccccccccccccccchhhhH-hhhhhhc-hhHHHHHHHHHHHHHHHH Confidence 1 235553210 0 001111111223334333432 2233322 22233 Q ss_pred -------CCCCeEEEEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccc--cccCCccCCCcEEEecCCCcEEEE Q lcl|Aclame:pro 228 -------DDPNLVAIVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLP--AVRVPYFPANAVLVTTLENLSIYF 298 (355) Q Consensus 228 -------~~~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlp--a~~~PffP~~~ilIT~l~NLsIY~ 298 (355) -++..+++|.+.--- +-+|.+.-.+ ..++ -.++-|+| .+..+++|++.+++-.+++--|.. T Consensus 263 ~~~klkd~~G~~i~~~n~~~~~-~~~p~~~~~~-----~~G~----~~t~lg~p~~vv~s~~~p~~~i~fgdf~~Y~i~~ 332 (377) T protein:vir:98 263 DKKRPLKIAGQVKLILNPEDRW-ALEAQFTSRN-----QFGE----YVTVLPHGITILESLAVETGKAIAFVANRYDAFM 332 (377) T ss_pred HHhhhhccCCceEEEecccchh-hccccccccC-----CCCc----cccccCCCceEEecCCCCcccEEEEEecceeEEe Confidence 233334433321000 0011000000 0000 01334445 667889999999988888844433 Q ss_pred eeCcEEEEEEEccchhhhhhhhhhhhhhhccccccEEEEe--cceecCccC--CCCcCCC Q lcl|Aclame:pro 299 MDESHRRSIDENPKKDRVENYESMNIDYVVEVYAAGCLLE--NITLGDFTA--PAAPESG 354 (355) Q Consensus 299 Q~gs~RR~~~d~p~r~rve~y~s~Ne~YvVEd~~~~a~ie--nI~~~~~~~--~~~~~~~ 354 (355) . +.++= ..+.+.|..+|.-.+-++. +-+..+..+ .-.-..| T Consensus 333 r-~~~~i--------------~~~~~~~~~~d~~~f~~~~r~dg~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 333 A-TASTI--------------EEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred e-cceEE--------------EeechhhhhcCceEEEEEEEEcCEEeccCcEEEEEEecC Confidence 2 22221 1223456777766666554 222222111 1111112 No 119 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=93.81 E-value=0.003 Score=34.40 Aligned_cols=272 Identities=10% Similarity=0.090 Sum_probs=133.5 Q ss_pred hCCChHHcceeeecCcHHHHHHHHHHHh----hHHHhCcCccccchhhhh---hhhc------ccccccccccccCCCCc Q lcl|Aclame:pro 20 NNISTDDVSKKFTVEPSVTQTLMNTVQA----SSAFLKTINILPVAEMKG---EKIG------VGVTGTIASTTDTSGDK 86 (355) Q Consensus 20 ngv~~~~v~~~Fsv~P~~~q~L~~~iqe----ss~FL~~INv~~V~e~~G---e~v~------lgv~~~ia~Rt~T~~~~ 86 (355) .+++.++.+--|.++ +.+.+...+.| .=...+.| ||...-| |.+. .|....+... .. T Consensus 1 ~~~~~a~~~~~f~~~--ql~~id~~v~e~~~~~l~~~~~i---~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~-----~~ 70 (296) T protein:vir:10 1 MGVDKADAAGIWTVK--QLTASLNKAYETEYDQNSVVNLF---PVSNEIPGYAKYFEYPVFDGVGIAQIVADY-----TD 70 (296) T ss_pred CcccchhhhHHHHHH--HHHHHHHHHHhhhhcccccceec---ccccCCCCceeEEEeeeeeccCceeEeCCC-----cc Confidence 555544433334432 33333333332 22233333 3332211 1221 1222222111 11 Q ss_pred CccccccccccCcceeEEeeeecceeCHHHHHhhcccc-hHHHHHHHHHHHHhhhhHHHHhhcccccccCCChhhhhhhh Q lcl|Aclame:pro 87 ERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQ-DFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTKNTLLQ 165 (355) Q Consensus 87 ~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~-dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~Td~~anPllq 165 (355) +- |......+.........--+..+++..|.+.+... +...+-..+.++..+..+=.+.|+|.+..-.+=.-.+|.+. T Consensus 71 di-p~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g~~GLlN~p~v~ 149 (296) T protein:vir:10 71 DL-PLVDALATERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHGIPSVFDYPNIN 149 (296) T ss_pred cc-ceeeccceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEeecCCCc Confidence 11 22111122222233333445556677888887753 68888888889999999999999994432221111222221 Q ss_pred ccch--hHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHHHHH Q lcl|Aclame:pro 166 DVAV--GWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKLLAD 243 (355) Q Consensus 166 DVNk--GWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dLl~~ 243 (355) =++. -|- -...-|..+++++..+... .-....|+- .++..+ T Consensus 150 ~~~~~~~W~-----------------------------~~t~i~~Di~~~~~~l~~~---s~g~~~p~~-l~L~p~---- 192 (296) T protein:vir:10 150 NVVSGGSWS-----------------------------QPTTAVSDITSLLDIIETS---TNGQHRATH-LLLPTT---- 192 (296) T ss_pred cccccCCcc-----------------------------CHHHHHHHHHHHHHHHHHh---hCceeccee-EEeCHH---- Confidence 0100 120 0112356666666555431 112233443 333444 Q ss_pred HHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCC------cEEE--ecCCCcEEEEeeCcEEEEEEEccchhh Q lcl|Aclame:pro 244 KYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPAN------AVLV--TTLENLSIYFMDESHRRSIDENPKKDR 315 (355) Q Consensus 244 k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~------~ilI--T~l~NLsIY~Q~gs~RR~~~d~p~r~r 315 (355) .+.++++....+.....+.+ .+.+.++..+.+|.+... .+++ +..+|+++=+-. .+|+.-.+--..+- T Consensus 193 -~~~~L~~~~~~~~~t~l~~i--k~~~~~l~i~~~~~l~~a~~~g~~~~v~~~~~~~~~~~~v~~-~~~~~~~e~~~l~~ 268 (296) T protein:vir:10 193 -ARRIMQNLVPGTSVSYGEFF--RQNNSGVTVEFVQYLNDYNGTGTSAAIAYEKDPNNMAIEIPE-ATNALPAQPKDLHF 268 (296) T ss_pred -HHHHHhhccCCCCccHHHHH--HHhcCCceEEEeeeeccCCCCcceEEEEEEcCCceEEEEcCc-ceeeecccccCceE Confidence 33444444344555555555 356789999999999763 2343 667787775422 33444332222334 Q ss_pred hhhhhhhhhhhhccccccEEEEecceec Q lcl|Aclame:pro 316 VENYESMNIDYVVEVYAAGCLLENITLG 343 (355) Q Consensus 316 ve~y~s~Ne~YvVEd~~~~a~ienI~~~ 343 (355) .+.|..+--+-+|=...++|-+++|+|+ T Consensus 269 ~~~~~~~~~Gv~i~~P~ai~~~dGI~~~ 296 (296) T protein:vir:10 269 KIPVTSKATGLIVYRPLTMAVMKGITFA 296 (296) T ss_pred EEeeEeeEEEEEEECCceeEEEeeeecC Confidence 4444555556677788899999999998 No 120 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=89.27 E-value=0.027 Score=29.16 Aligned_cols=289 Identities=13% Similarity=0.154 Sum_probs=138.5 Q ss_pred CCHH---HHHHHHHHHHHHHHHhCCChHHc---ceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccc Q lcl|Aclame:pro 1 MRPE---TRFKFNAYLTRVAELNNISTDDV---SKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTG 74 (355) Q Consensus 1 M~~~---tr~~f~~y~~~~A~~ngv~~~~v---~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~ 74 (355) |++- ||.-+ |=..+++ -+.|+ -...++.+.++-|+.+.++-.++ .|..+-+- T Consensus 1 ms~~~~~t~~~~-----------~~s~~d~al~le~f~------geV~~af~~~s~~~~~~~~rti~--~g~s~~~~--- 58 (335) T protein:vir:78 1 MSFLNDLTRPNY-----------AGKNADVDIHLEEHL------GIVDKHFAYTSKFAPLMNIRDLR--GSNVVRLD--- 58 (335) T ss_pred CCcccccccccc-----------ccccchhhhhhhhhh------hHHHHHHHHhhhhccccceeeec--cceeEEEe--- Confidence 6643 34322 1111122 13343 34556788899999888877653 24444442 Q ss_pred cccccccCCCCcCccccccccccCcceeEEe-eeecceeCHHHHHhhcccchHHHHHHHHHHHHhhh--hHHHHhhcccc Q lcl|Aclame:pro 75 TIASTTDTSGDKERQTADFTALESSKYECNQ-INFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQAL--DLIMAGFNGTT 151 (355) Q Consensus 75 ~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~q-Tn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~al--D~i~IGfnG~s 151 (355) ..||+.-...+..++.+.......+..+.= +-.=++..-+.||.|-.+=|+-..+.+.+-+.+|- |+-.+ =...+ T Consensus 59 -~iG~~~~~~~~pG~~l~~~~~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~-~~l~~ 136 (335) T protein:vir:78 59 -RLGNVEAKGRRAGEELERSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACL-IQVIK 136 (335) T ss_pred -eeeeeeecccccCcccCCCCcccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHH-HHHHh Confidence 223332221222222222222233322221 11223445678999998888888888888888877 55332 11222 Q ss_pred cccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceee-eCCCcchhhHHHHHHHHHhcccchhhhCCC Q lcl|Aclame:pro 152 RADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIR-VGKNGDYENIDALVMDATNNLIDEVYQDDP 230 (355) Q Consensus 152 ~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~-~G~ggdy~nLDaLv~d~~~~lid~~~~~~~ 230 (355) +|....|...|. ||. .|......++ .....++..|-.++.++...|. + ++-| T Consensus 137 aa~~~a~~~~~~------~~~------------------~G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~l~-e--kdvP 189 (335) T protein:vir:78 137 AAAMDAPVDLED------AFS------------------PGVLEKLDLTGLTAKEAAEKIVRMHRRVVETFI-E--RDLG 189 (335) T ss_pred hcccccccccCC------CcC------------------CCcceeeeeccccccccHHHHHHHHHHHHHHHH-h--ccCC Confidence 232223222111 110 1111111111 1133578888888999887664 4 4443 Q ss_pred -----CeEEEEcHH----HHHHHHHHHHhhccccchh---hHHHHHHhhhhhcccccccCCccCCCcEEEecCCC----- Q lcl|Aclame:pro 231 -----NLVAIVGRK----LLADKYFPLVNKQQENSES---LAADIIISQKRIGNLPAVRVPYFPANAVLVTTLEN----- 293 (355) Q Consensus 231 -----~LVvivG~d----Ll~~k~~~l~n~~~~~te~---~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~N----- 293 (355) |.|++|..+ |+.++ +++|..-..+.. .+.-- -..+-|.|++..|.||..++--++|.| T Consensus 190 ~~~~~~rv~vv~P~~y~~Ll~~~--~l~n~~~~~s~~~~~~~~g~---v~~v~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~ 264 (335) T protein:vir:78 190 DAVYSEGLTPMSPRVFSLLLEHD--KLMSVEYQATGATNDYVKSR---VAILNGVKVLETPRFATKAISAHPLGRHFNVS 264 (335) T ss_pred CCCCCccEEEeChHHHHHHhccc--ccccccccccccccccccce---eEEeeceEEEeeccCCCCCCccccccccCCcc Confidence 589999955 33332 344442111111 11111 126889999999999988755556543 Q ss_pred -------cEEEEeeC----------cEEEEEEEccchhhhhhhhhhhhhhhccccccEEEEe--cceecCccC Q lcl|Aclame:pro 294 -------LSIYFMDE----------SHRRSIDENPKKDRVENYESMNIDYVVEVYAAGCLLE--NITLGDFTA 347 (355) Q Consensus 294 -------LsIY~Q~g----------s~RR~~~d~p~r~rve~y~s~Ne~YvVEd~~~~a~ie--nI~~~~~~~ 347 (355) --+++|.. +.+..-.....=+.+..|++.+ -.+=..++++.|+ +|.-.+-.+ T Consensus 265 ~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~~~i~~~~a~G--~g~lRPe~a~~i~~tg~~~~~~~~ 335 (335) T protein:vir:78 265 AEEAERQIALFLPSKTLITAQVAPVQAKLWEDHDQFSWVLDTFQMYN--IGARRPDTAGAIELKGIEAFDITA 335 (335) T ss_pred cccccceEEEEEecceEEEEEEEecccceeeccchhhHhhhHHHHcC--CcccCcceEEEEEecCCCcccccC Confidence 22335544 1122111111113333333311 1344566777776 433333222 No 121 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=88.78 E-value=0.03 Score=28.92 Aligned_cols=260 Identities=14% Similarity=0.118 Sum_probs=118.9 Q ss_pred HHHHhCCChHHcceeeecCcHH-HHHHHHHHHhhHHHhCcCcccc-chhhhhhhhcccccccccccccCCCCcCcccccc Q lcl|Aclame:pro 16 VAELNNISTDDVSKKFTVEPSV-TQTLMNTVQASSAFLKTINILP-VAEMKGEKIGVGVTGTIASTTDTSGDKERQTADF 93 (355) Q Consensus 16 ~A~~ngv~~~~v~~~Fsv~P~~-~q~L~~~iqess~FL~~INv~~-V~e~~Ge~v~lgv~~~ia~Rt~T~~~~~r~~~~~ 93 (355) +|..+ .. .+ =-+.|++ .+.+.+.+++++.|-+..++.. ...+.|..|.+=.-..+..-.+.+.+. .-+... T Consensus 1 MA~~~--T~--~~--~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~-~i~~~~ 73 (272) T protein:vir:98 1 MAVGT--TK--MA--QMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGE-AIPMTQ 73 (272) T ss_pred CCCcc--cc--ch--heechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCC-cccccc Confidence 32111 00 01 1234533 4455677777777755554421 112334444432211111111222221 222223 Q ss_pred ccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhh--HHHHhhcccccccCCChhhhhhhhccchhH Q lcl|Aclame:pro 94 TALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALD--LIMAGFNGTTRADTSDRTKNTLLQDVAVGW 171 (355) Q Consensus 94 ~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD--~i~IGfnG~s~A~~Td~~anPllqDVNkGW 171 (355) ...+......++ .-..++...++.....+|+...+.+.+.+.++.. ...++- T Consensus 74 ~~~~~~~~~~~~--~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~------------------------ 127 (272) T protein:vir:98 74 LGFKKTTMTIKK--AGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDA------------------------ 127 (272) T ss_pred cccceEEEEeee--eeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHH------------------------ Confidence 334444444444 3445666777777778899999999988887643 222221 Q ss_pred HHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHHHHHH-HHHHHh Q lcl|Aclame:pro 172 LQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKLLADK-YFPLVN 250 (355) Q Consensus 172 lq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dLl~~k-~~~l~n 250 (355) + +..+ ..++.+. ++|+ +.|++..| +.. ..+.-+++|.++....- ...+.+ T Consensus 128 ---~---------~~a~----------~~~~~~~---t~d~-i~da~~~l-~~~--~~~~~~~vv~p~~~~~L~k~~~~~ 178 (272) T protein:vir:98 128 ---L---------SKST----------QTVEATA---TVDG-VSKALDIF-NDE--DDAETVIVMNPADASTLRLDAAKE 178 (272) T ss_pred ---h---------cccc----------ccccccc---CHHH-HHHHHHHH-hcc--CCCccEEEEcHHHHHHHHHhcccc Confidence 0 0000 0112222 3454 44566544 442 23345888998865421 111111 Q ss_pred hccccchhhHHHHHH--hhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEEccchhhhhhhhhhhhhhhc Q lcl|Aclame:pro 251 KQQENSESLAADIII--SQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPKKDRVENYESMNIDYVV 328 (355) Q Consensus 251 ~~~~~te~~aa~~~~--~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~rve~y~s~Ne~YvV 328 (355) - ...++... ..+. ...+|.|+|++.-+++|++.+++-.-..+.++.+.+.. -....++++ ..+.. ++- T Consensus 179 ~-~~~~~~~~-~~~~~g~ig~i~G~~Vi~s~~~p~~t~~~~~~~a~~~~~~~~~~-ve~~r~~~~--~~~~i-----~~~ 248 (272) T protein:vir:98 179 W-LGATEVGA-NRVVSGVYGEVLGVQIVRSRKCPKGTAYMVRKGALRIMLKRNTM-VETDRDITK--AINQI-----VAN 248 (272) T ss_pred c-cccccccc-cccccccchhhcCeeEEEcCCCCcceEEEEcCCeEEEEecCCce-eeecccccc--ceeEE-----EEE Confidence 1 11111110 1111 12479999999999999999999998888877766532 111122222 11211 122 Q ss_pred cccccEEEEe-----cceecCccCC Q lcl|Aclame:pro 329 EVYAAGCLLE-----NITLGDFTAP 348 (355) Q Consensus 329 Ed~~~~a~ie-----nI~~~~~~~~ 348 (355) .-|+.. .+. .++++.+... T Consensus 249 ~~~~~~-v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:98 249 KHYGVY-LYKAEKAVKITLKDAAKK 272 (272) T ss_pred EEEEEE-EEcCCceEEEEecccccC Confidence 333322 221 2233322222 No 122 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=88.78 E-value=0.03 Score=28.92 Aligned_cols=260 Identities=14% Similarity=0.118 Sum_probs=118.9 Q ss_pred HHHHhCCChHHcceeeecCcHH-HHHHHHHHHhhHHHhCcCcccc-chhhhhhhhcccccccccccccCCCCcCcccccc Q lcl|Aclame:pro 16 VAELNNISTDDVSKKFTVEPSV-TQTLMNTVQASSAFLKTINILP-VAEMKGEKIGVGVTGTIASTTDTSGDKERQTADF 93 (355) Q Consensus 16 ~A~~ngv~~~~v~~~Fsv~P~~-~q~L~~~iqess~FL~~INv~~-V~e~~Ge~v~lgv~~~ia~Rt~T~~~~~r~~~~~ 93 (355) +|..+ .. .+ =-+.|++ .+.+.+.+++++.|-+..++.. ...+.|..|.+=.-..+..-.+.+.+. .-+... T Consensus 1 MA~~~--T~--~~--~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~-~i~~~~ 73 (272) T protein:vir:30 1 MAVGT--TK--MA--QMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGE-AIPMTQ 73 (272) T ss_pred CCCcc--cc--ch--heechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCC-cccccc Confidence 32111 00 01 1234533 4455677777777755554421 112334444432211111111222221 222223 Q ss_pred ccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhh--HHHHhhcccccccCCChhhhhhhhccchhH Q lcl|Aclame:pro 94 TALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALD--LIMAGFNGTTRADTSDRTKNTLLQDVAVGW 171 (355) Q Consensus 94 ~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD--~i~IGfnG~s~A~~Td~~anPllqDVNkGW 171 (355) ...+......++ .-..++...++.....+|+...+.+.+.+.++.. ...++- T Consensus 74 ~~~~~~~~~~~~--~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~------------------------ 127 (272) T protein:vir:30 74 LGFKKTTMTIKK--AGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDA------------------------ 127 (272) T ss_pred cccceEEEEeee--eeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHH------------------------ Confidence 334444444444 3445666777777778899999999988887643 222221 Q ss_pred HHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHHHHHH-HHHHHh Q lcl|Aclame:pro 172 LQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKLLADK-YFPLVN 250 (355) Q Consensus 172 lq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dLl~~k-~~~l~n 250 (355) + +..+ ..++.+. ++|+ +.|++..| +.. ..+.-+++|.++....- ...+.+ T Consensus 128 ---~---------~~a~----------~~~~~~~---t~d~-i~da~~~l-~~~--~~~~~~~vv~p~~~~~L~k~~~~~ 178 (272) T protein:vir:30 128 ---L---------SKST----------QTVEATA---TVDG-VSKALDIF-NDE--DDAETVIVMNPADASTLRLDAAKE 178 (272) T ss_pred ---h---------cccc----------ccccccc---CHHH-HHHHHHHH-hcc--CCCccEEEEcHHHHHHHHHhcccc Confidence 0 0000 0112222 3454 44566544 442 23345888998865421 111111 Q ss_pred hccccchhhHHHHHH--hhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEEccchhhhhhhhhhhhhhhc Q lcl|Aclame:pro 251 KQQENSESLAADIII--SQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPKKDRVENYESMNIDYVV 328 (355) Q Consensus 251 ~~~~~te~~aa~~~~--~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~rve~y~s~Ne~YvV 328 (355) - ...++... ..+. ...+|.|+|++.-+++|++.+++-.-..+.++.+.+.. -....++++ ..+.. ++- T Consensus 179 ~-~~~~~~~~-~~~~~g~ig~i~G~~Vi~s~~~p~~t~~~~~~~a~~~~~~~~~~-ve~~r~~~~--~~~~i-----~~~ 248 (272) T protein:vir:30 179 W-LGATEVGA-NRVVSGVYGEVLGVQIVRSRKCPKGTAYMVRKGALRIMLKRNTM-VETDRDITK--AINQI-----VAN 248 (272) T ss_pred c-cccccccc-cccccccchhhcCeeEEEcCCCCcceEEEEcCCeEEEEecCCce-eeecccccc--ceeEE-----EEE Confidence 1 11111110 1111 12479999999999999999999998888877766532 111122222 11211 122 Q ss_pred cccccEEEEe-----cceecCccCC Q lcl|Aclame:pro 329 EVYAAGCLLE-----NITLGDFTAP 348 (355) Q Consensus 329 Ed~~~~a~ie-----nI~~~~~~~~ 348 (355) .-|+.. .+. .++++.+... T Consensus 249 ~~~~~~-v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:30 249 KHYGVY-LYKAEKAVKITLKDAAKK 272 (272) T ss_pred EEEEEE-EEcCCceEEEEecccccC Confidence 333322 221 2233322222 No 123 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=82.05 E-value=0.08 Score=26.59 Aligned_cols=297 Identities=7% Similarity=-0.014 Sum_probs=138.3 Q ss_pred CCHHHHHHHHHH-HHHHHHHhCCChHHcce--eeecCcHHHHHHHHHHHhh-HHHhCcCccccchhhhh---hhhcc--- Q lcl|Aclame:pro 1 MRPETRFKFNAY-LTRVAELNNISTDDVSK--KFTVEPSVTQTLMNTVQAS-SAFLKTINILPVAEMKG---EKIGV--- 70 (355) Q Consensus 1 M~~~tr~~f~~y-~~~~A~~ngv~~~~v~~--~Fsv~P~~~q~L~~~iqes-s~FL~~INv~~V~e~~G---e~v~l--- 70 (355) |+...--.+..+ ++.-++..|+..+.... -|. ..+-+.+...+.|- -.=|.--.+++|...-| |.+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~da~~~~g~~~--~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~ 78 (319) T protein:vir:10 1 MTTKKFDEADKSNVEMYLIQAGVKQDAAATMGIWT--AQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTDKTFEYMTF 78 (319) T ss_pred CCCcchhHHhhHHHHHHHhhccchhhhhhhhhhHH--HHHHHHHHHHHHhhhhcceechhhcccccCCCCceEEEEeeee Confidence 887554444333 33444456665433211 233 23334444333331 11111112223221111 11111 Q ss_pred ---cccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccc-hHHHHHHHHHHHHhhhhHHHHh Q lcl|Aclame:pro 71 ---GVTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQ-DFQRRIRDAIVKRQALDLIMAG 146 (355) Q Consensus 71 ---gv~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~-dF~~~i~~~i~~~~alD~i~IG 146 (355) |....+... ..+ -|......+..........-+..+++..|.+++... +...+-+.+..+..+..+=.|+ T Consensus 79 ~~~G~a~~~~d~-----~~d-ip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~ 152 (319) T protein:vir:10 79 DKVGTAQIIADY-----TDD-LPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLV 152 (319) T ss_pred ccccceeeecCc-----ccc-ccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEE Confidence 112222111 111 122112222222334444456667788899998754 6777888888899999999999 Q ss_pred hcccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhh Q lcl|Aclame:pro 147 FNGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVY 226 (355) Q Consensus 147 fnG~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~ 226 (355) |+|..-.-.+=.-.+|.++ .++-+...+... -....-|+.+.+++..+... .-+ T Consensus 153 f~G~~~~g~~GLlN~p~~~-----------------~~~~~~~~~~~t------~t~~~i~~di~~~~~~l~~~---s~g 206 (319) T protein:vir:10 153 FKGSAPHKIVSVFNHPNIT-----------------KITSGKWIDVST------MKPETAEAELTQAIETIETI---TRG 206 (319) T ss_pred EeecccccceeEEeCCCce-----------------eeecCCCCCccc------cCHHHHHHHHHHHHHHHHHh---cCc Confidence 9994322221111222211 110000000000 00001134455555544321 001 Q ss_pred hCCCCeEEEEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCc-------EE-EecCCCcEEEE Q lcl|Aclame:pro 227 QDDPNLVAIVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANA-------VL-VTTLENLSIYF 298 (355) Q Consensus 227 ~~~~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~-------il-IT~l~NLsIY~ 298 (355) ...|+ ..++..++ +.+++.....+.....+.+. +.+.++..+.+|.+...+ ++ ...-+|+++=+ T Consensus 207 ~~~p~-~L~L~p~~-----~~~L~~~~~~~~~t~l~~lk--~~~~~l~I~~~pel~~ag~~g~~~~v~y~~~~~~~~~~v 278 (319) T protein:vir:10 207 QHRAT-NILIPPSM-----RKVLAIRMPETTMSYLDYFK--SQNSGIEIDSIAELEDIDGAGTKGVLVYEKNPMNMSIEI 278 (319) T ss_pred eeece-EEEecHHH-----HHhhhcccCCCCeeHHHHHH--HhcCCceEEEeeeecccCCCcceEEEEEecCCceEEEec Confidence 22233 45545553 33344333345555555553 466789999999998632 22 33467776654 Q ss_pred eeCcEEEEEEEccchhhhhhhhhhhhhhhccccccEEEEecc Q lcl|Aclame:pro 299 MDESHRRSIDENPKKDRVENYESMNIDYVVEVYAAGCLLENI 340 (355) Q Consensus 299 Q~gs~RR~~~d~p~r~rve~y~s~Ne~YvVEd~~~~a~ienI 340 (355) -. ..|+.-.+--...-.+.|..+--+-+|=...++|-+++| T Consensus 279 ~~-~~~~~~~e~~~l~~~~~~~~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 279 PE-AFNMLPAQPKDLHFKVPCTSKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred Cc-ceeeeeeeecCceEEEeeeeeeEEEEEEccceeEeeecC Confidence 22 444443333334455556666666677778888888899 No 124 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=81.75 E-value=0.056 Score=27.43 Aligned_cols=284 Identities=13% Similarity=0.111 Sum_probs=120.2 Q ss_pred HHHHHHHhCCChHHcceeeecCc-----------HHHHHHHHHHHhhHHHhCcCccccchhhhhhhhccccccccccccc Q lcl|Aclame:pro 13 LTRVAELNNISTDDVSKKFTVEP-----------SVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTTD 81 (355) Q Consensus 13 ~~~~A~~ngv~~~~v~~~Fsv~P-----------~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt~ 81 (355) +.-++. .+..+....-.+.+ .-.-.+.++.+.+|-|+.++++-.++ .|..+.+-.-|.+.-..- T Consensus 1 ~~~~~~---~~~~~~~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~r~i~--~G~tv~i~~ig~~~~~~~ 75 (332) T protein:vir:78 1 MTTLSN---FSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLR--GGKSKQFMFTGKLSAGYH 75 (332) T ss_pred Cccccc---ccCCccccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhcccccccc--ccceEEEEeccceeEeee Confidence 111111 11111110001111 11234567788899999999887665 588887765554322111 Q ss_pred CCCCcCcccccc-ccccCcceeEEee--eecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHH--HHhhcccccccCC Q lcl|Aclame:pro 82 TSGDKERQTADF-TALESSKYECNQI--NFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLI--MAGFNGTTRADTS 156 (355) Q Consensus 82 T~~~~~r~~~~~-~~l~~~~Y~c~qT--n~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i--~IGfnG~s~A~~T 156 (355) + .+..... ..+...+-.|.-. -+.. ..=+.||.|....|+...+.+.....+|...= .++- -+.+| .+ T Consensus 76 ~----~g~~l~~~~~~~~~~~~l~ID~~ky~~-~~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~-l~~aa-~~ 148 (332) T protein:vir:78 76 T----PGTPIVGDAGIKANEKTLVMDDLLVSS-QFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARV-LAKAS-AE 148 (332) T ss_pred c----CCCCCCCCCCCCCceEEEEEehhhhhH-HHHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHH-HHhhh-cc Confidence 1 1111111 1233333333322 2222 22257999999888888888777666664321 1110 01111 11 Q ss_pred ChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCc--chhhHHHHHHHHHhcccchhhhCCCCeEE Q lcl|Aclame:pro 157 DRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNG--DYENIDALVMDATNNLIDEVYQDDPNLVA 234 (355) Q Consensus 157 d~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~gg--dy~nLDaLv~d~~~~lid~~~~~~~~LVv 234 (355) .+|..-.. |. ..+.++.++ +=.++-..+.|+... +|+..-...+.++ T Consensus 149 ---------------------~~~~~~~~------g~---~~~~~~~~~~~~~~~~~~~i~~a~~~-Lde~~VP~~gR~~ 197 (332) T protein:vir:78 149 ---------------------ASPVTGEP------GG---FHVNIGAGNTNDAQAIVDGFFEAAAV-LDERSAPQEGRVA 197 (332) T ss_pred ---------------------cCcccccc------cc---cccccCCccccCHHHHHHHHHHHHHH-HhhcCCCccCCEE Confidence 11100000 00 011122221 113454456777764 5887777778899 Q ss_pred EEcHH----HHHHHHHHHHhhccccchh-hHHHHHHhhhhhcccccccCCccCCCcEEEecCCCc--------------- Q lcl|Aclame:pro 235 IVGRK----LLADKYFPLVNKQQENSES-LAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENL--------------- 294 (355) Q Consensus 235 ivG~d----Ll~~k~~~l~n~~~~~te~-~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NL--------------- 294 (355) ||+.. ||..+..+++|..-..+.. +..... -.++.|.+++..|.+|..+.--....+. T Consensus 198 vv~P~~y~~Ll~~~d~~~~n~~~~~~~~~~~~g~~--i~~i~G~~V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~ 275 (332) T protein:vir:78 198 VLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKG--LYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALA 275 (332) T ss_pred EeCHHHHHHHHhhcCceeeeeeccccccceeccee--eeEEeeeEEEecCccccCcccccccccccccccccccccccce Confidence 99875 2222222333332222221 221111 2478899999999999765433322221 Q ss_pred EEEEeeCcEE------EEEEEccchhhhhhhhhhhhhhhccccccEEEE---e-cceecCc Q lcl|Aclame:pro 295 SIYFMDESHR------RSIDENPKKDRVENYESMNIDYVVEVYAAGCLL---E-NITLGDF 345 (355) Q Consensus 295 sIY~Q~gs~R------R~~~d~p~r~rve~y~s~Ne~YvVEd~~~~a~i---e-nI~~~~~ 345 (355) .+.+++...= +++ +.-+..|-++|+ -++|+.-+-..|-+ | -++|..+ T Consensus 276 ~~~~h~~a~~~v~~~~~~~-~~t~~~~~~~~~---~d~i~~~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 276 GLIFHREAAGCIQSVAPTI-QTTSGDFNVQYQ---GDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred EEeecccceeeeeeeccch-hhhhcccchhhh---HhhhhhhhhhcCceecccceEEEeeC Confidence 1223332210 000 011122233333 23333332222221 1 1111111 No 125 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=73.17 E-value=0.17 Score=24.75 Aligned_cols=286 Identities=10% Similarity=0.102 Sum_probs=132.5 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHh----hHHHhCcCccccchhhhhh---hh----- Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQA----SSAFLKTINILPVAEMKGE---KI----- 68 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqe----ss~FL~~INv~~V~e~~Ge---~v----- 68 (355) |+=+. -...-..++-+.-+.+ .+.+--|.++ +.+.+...+.| +=...+ ++||+..-++ .+ T Consensus 3 ~~~~~--~~~~~~~~~~~~~~~~-~d~~~~fl~~--ql~~id~~v~e~~~~~~~~~~---~i~v~~~~~~~~et~~~~~~ 74 (314) T protein:vir:10 3 IKFDA--EQAKITTHLEQMGVEK-ADAAGIWAVS--QLTAALNRAYEKEYAENSVVN---IFPVTNEIPGHAKYFEYPEF 74 (314) T ss_pred cchHH--HHHHHHHHHHhhcccc-hhhhHHHHHH--HHHHHHHHHhhhhccccccce---eeccccCCCCceeEEEeeee Confidence 66442 2333333433333333 2322235443 22333333332 222333 3333322221 11 Q ss_pred -cccccccccccccCCCCcCccccccccccCcceeE--EeeeecceeCHHHHHhhcccc-hHHHHHHHHHHHHhhhhHHH Q lcl|Aclame:pro 69 -GVGVTGTIASTTDTSGDKERQTADFTALESSKYEC--NQINFDFHLKYKTLDLWARFQ-DFQRRIRDAIVKRQALDLIM 144 (355) Q Consensus 69 -~lgv~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c--~qTn~d~~i~y~~LD~WA~~~-dF~~~i~~~i~~~~alD~i~ 144 (355) ..|....+...++ +- |. .+.+.....- ..---+..+++..|.+.+... +...+-+.+..+..+..+=. T Consensus 75 e~~G~a~~~~d~~~-----di-p~--vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~ 146 (314) T protein:vir:10 75 DGVGIAQIIADYSD-----DL-PL--VDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDK 146 (314) T ss_pred ccccceeeeCCccc-----cc-ce--eecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhce Confidence 1122222221111 11 11 1122222333 333344445567777887654 67788888888888888999 Q ss_pred HhhcccccccCCChhhhhhhhcc--chhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhccc Q lcl|Aclame:pro 145 AGFNGTTRADTSDRTKNTLLQDV--AVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLI 222 (355) Q Consensus 145 IGfnG~s~A~~Td~~anPllqDV--NkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~li 222 (355) |+|+|.+.-..+=.-.+|.+.=+ ..+|- . .+.=|.-+++++..+...- T Consensus 147 i~f~G~~~~g~~GLlN~p~v~~~~~~~~Wa------T-----------------------~~ei~~Di~~~~~~l~~~s- 196 (314) T protein:vir:10 147 LVWSGSAPHGIVSVFDQPNINNVVATPNWS------V-----------------------PQNAIDDVTAMIDAVESST- 196 (314) T ss_pred EEEeecccccceeEeecCCCccccCCCCcc------c-----------------------HHHHHHHHHHHHHHHHHhc- Confidence 99999432222221222211000 01110 0 0011344444444433210 Q ss_pred chhhhCCCCeEEEEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCc--------EEEecCCCc Q lcl|Aclame:pro 223 DEVYQDDPNLVAIVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANA--------VLVTTLENL 294 (355) Q Consensus 223 d~~~~~~~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~--------ilIT~l~NL 294 (355) . +...|+-+ + |+..++.+++.....+..-..+.+. +..=+|....+|.+-..+ +..+.-+|+ T Consensus 197 ~--g~~~p~~l-~-----Lpp~~~~~L~~~~~~~~~tvl~~l~--~n~~~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~ 266 (314) T protein:vir:10 197 Q--GLHHVTDI-L-----LPASARRVMQGLVPQTNLSYGELFT--RNNPGLTIRFLQFLDNYDGAGGKAALAFEKSPLNM 266 (314) T ss_pred C--ccccceeE-E-----ecHHHHHhhcccccCCCccHHHHHH--HhCCCcEEEEcccccccCCCcceEEEEEecCCcEE Confidence 0 12234433 2 3444556666655555555556554 445688899999988655 222555665 Q ss_pred EEEEeeCcEEEEEEEccchhhhhhhhhhhhhhhccccccEEEEecceec Q lcl|Aclame:pro 295 SIYFMDESHRRSIDENPKKDRVENYESMNIDYVVEVYAAGCLLENITLG 343 (355) Q Consensus 295 sIY~Q~gs~RR~~~d~p~r~rve~y~s~Ne~YvVEd~~~~a~ienI~~~ 343 (355) ++=+-. ..|+.-.+--...-.+.|..+--+-+|=...++|-+++|+|+ T Consensus 267 ~~~vp~-~~~~l~~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~dGI~~~ 314 (314) T protein:vir:10 267 SIEIPE-VTNVLPAQPKDLHFRYPVTSKATGLIVYRPLTMAVIKGITFA 314 (314) T ss_pred EEecCc-cceeecceecCceEEEcceeeeEEEEEECcceeEeeeeeecC Confidence 553322 333332222223444555555556667777888888999998 No 126 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=72.25 E-value=0.19 Score=24.60 Aligned_cols=298 Identities=12% Similarity=0.094 Sum_probs=128.5 Q ss_pred CCHHHHH-HHHH-HHHHHHHHhCCChH-Hcceeee------cCcHHHHHHHHHHHhhHHHhCcCccccchhhh---hhhh Q lcl|Aclame:pro 1 MRPETRF-KFNA-YLTRVAELNNISTD-DVSKKFT------VEPSVTQTLMNTVQASSAFLKTINILPVAEMK---GEKI 68 (355) Q Consensus 1 M~~~tr~-~f~~-y~~~~A~~ngv~~~-~v~~~Fs------v~P~~~q~L~~~iqess~FL~~INv~~V~e~~---Ge~v 68 (355) |+++-+. .|++ .+...++.-+.... +..--|. |+|.+-++....+. -..|+.--+.++-..++ +-.- T Consensus 6 ~~~~~~~d~~~~~~~a~~~~~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~-~~~~i~i~~~~~~~~~~~t~~~~~ 84 (329) T protein:vir:79 6 MSKEMKYDEFEANVIANHMQLRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGS-ALRVFPVTSELSDTDKTFEYQTFD 84 (329) T ss_pred hhhhhccchhhhhhHhhhcccccceeccchhhHHHHHHHHHHHHHHHhhhhcccc-hhhhcccccCCCCceeEEEeeeee Confidence 3332221 1111 22223332332211 0011232 22333322222222 12233322221110000 0000 Q ss_pred cccccccccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccc-hHHHHHHHHHHHHhhhhHHHHhh Q lcl|Aclame:pro 69 GVGVTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQ-DFQRRIRDAIVKRQALDLIMAGF 147 (355) Q Consensus 69 ~lgv~~~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~-dF~~~i~~~i~~~~alD~i~IGf 147 (355) ..|....++...+ + -|..-.......-.....--+..+++..|.+.+... +...+-+.+..+..+..+=.|+| T Consensus 85 ~~G~a~~~~d~~~-----d-ip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f 158 (329) T protein:vir:79 85 KVGHAKIIADYTD-----D-LSTVDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVNHLVF 158 (329) T ss_pred cceeeeeecCccc-----c-cceeecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEE Confidence 1122222221111 1 121111222222333444445567778888887654 67888888899999999999999 Q ss_pred cccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchh-- Q lcl|Aclame:pro 148 NGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEV-- 225 (355) Q Consensus 148 nG~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~-- 225 (355) +|.......=.-.+|.++-+..| +.+. +.-..++-|.++.|+.. ++... T Consensus 159 ~G~~~~g~~GLlN~p~v~~~~~~-------------------~~~~---------~~w~~kt~~ei~~di~~-~~~~l~~ 209 (329) T protein:vir:79 159 KGSKPHKIISVFEHPNLTTINSA-------------------GWNN---------AAGTGKKPETAQDELEQ-AIEKIET 209 (329) T ss_pred eecccccceeeecCCCccccccC-------------------CCCC---------ccccccCHHHHHHHHHH-HHHHHHH Confidence 99432222222222222211110 0000 00111233434444322 21211 Q ss_pred ---hhCCCCeEEEEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCC------cEEE--ecCCCc Q lcl|Aclame:pro 226 ---YQDDPNLVAIVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPAN------AVLV--TTLENL 294 (355) Q Consensus 226 ---~~~~~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~------~ilI--T~l~NL 294 (355) +...|+ ..++..+ .+.+++.....+.....+.+. +..-++....+|.+=.. .+++ +.-+|+ T Consensus 210 ~s~g~~~p~-~L~Lpp~-----~~~~L~~~~~~~~~tvl~~lk--~~~~~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~ 281 (329) T protein:vir:79 210 LTNGQHRAN-MILIPPS-----MRKVLMVRMPETTMSYLDYFK--QQNGGITIESISELEDIDGAGTKAALVYEKDPMNM 281 (329) T ss_pred hcCceeccc-EEEecHH-----HHHHhhcccCCCCccHHHHHH--HhCCCcEEEEcccccccCCCCceEEEEEecCCceE Confidence 223333 3444444 344444434444555556553 44556777888887542 2232 566666 Q ss_pred EEEEeeCcEEEEEEEccchhhhhhhhhhhhhhhccccccEEEEecceec Q lcl|Aclame:pro 295 SIYFMDESHRRSIDENPKKDRVENYESMNIDYVVEVYAAGCLLENITLG 343 (355) Q Consensus 295 sIY~Q~gs~RR~~~d~p~r~rve~y~s~Ne~YvVEd~~~~a~ienI~~~ 343 (355) .+-+-. .+|+.-.+--...-.+.|..+--+-+|=-..++|-+++|.++ T Consensus 282 ~~~vp~-~~~~l~~q~~~~~~~v~~~~r~~Gv~i~~P~ai~~~dGI~~~ 329 (329) T protein:vir:79 282 SIEIPE-AFNMLTAQPKDLHFKVPCTSKCTGLTIYRPLTLVLIKGLVVG 329 (329) T ss_pred EEecCc-ceeeeeceecCceEEEceeeeEEEEEEECcceeeeeeeeeeC Confidence 654322 344433322233444556666666677778888888999998 No 127 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=66.16 E-value=0.27 Score=23.68 Aligned_cols=287 Identities=10% Similarity=0.050 Sum_probs=120.9 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChH--H-c---ceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTD--D-V---SKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTG 74 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~--~-v---~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~ 74 (355) |-|.+ +-++++--.|-..+ | + -+.|+ -.+..+.|.+|-|+.++++-.++ .|..+.+-.-| T Consensus 1 ~a~~~------~~~~~~~~~g~~~~~~d~~al~ie~~~------geV~~~f~~~s~~~~~~~~r~i~--~G~sv~~~~iG 66 (347) T protein:vir:88 1 MANAT------GGQQIGANQGKGQSAADKLALFLKVFG------GEVLTAFVRRSVTMDKHMVRTIQ--NGKSASFPVMG 66 (347) T ss_pred CCCcc------cchhhhccCCCCccccchHHHHHHHHH------HHHHHHHHHHhhhhhcccccccc--CcceEEEeeec Confidence 66554 33344333333211 1 1 12332 34455778889999999987665 58888876655 Q ss_pred cccccccCCCCcCccccc--cccccCcceeEEeeeecc-eeCHHHHHhhcccchHHHHHHHHHHHHhhh--hHHHHhhc- Q lcl|Aclame:pro 75 TIASTTDTSGDKERQTAD--FTALESSKYECNQINFDF-HLKYKTLDLWARFQDFQRRIRDAIVKRQAL--DLIMAGFN- 148 (355) Q Consensus 75 ~ia~Rt~T~~~~~r~~~~--~~~l~~~~Y~c~qTn~d~-~i~y~~LD~WA~~~dF~~~i~~~i~~~~al--D~i~IGfn- 148 (355) .....--+. ++..+ ...+...+-.|.=.++.+ ...=+.+|.|...-|+...+.+...+.+|. |...++=- T Consensus 67 ~~~~~~~~~----g~~l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~ 142 (347) T protein:vir:88 67 RTKGYYLAP----GENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMA 142 (347) T ss_pred ceeeeeecc----ccCCCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 543221111 11211 123444555554444322 334468999999888887777776665443 33222111 Q ss_pred -ccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcch-------hh-HHHHHHHHHh Q lcl|Aclame:pro 149 -GTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDY-------EN-IDALVMDATN 219 (355) Q Consensus 149 -G~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy-------~n-LDaLv~d~~~ 219 (355) +...++.++..-. ||-. .. .+.+|.+++- .+ .|+ +.++.. T Consensus 143 ~~a~~~~~~~~~~~--------g~~~--------------------~~--~~~~~~~~~~~~~~~~~~~~~~~-i~~a~~ 191 (347) T protein:vir:88 143 KLCNLPAASNENIA--------GLGQ--------------------AV--VLNIGAAADLVDVEARGKAILKG-LTLARA 191 (347) T ss_pred HhhccccccccccC--------Cccc--------------------cc--cccccccccccchhhhHHHHHHH-HHHHHH Confidence 1111111111111 1100 00 0112222221 11 343 444554 Q ss_pred cccchhhhCCCCeEEEEcHHHHHHHHHHHHh-----hccccchhhHHHHHHhhhhhcccccccCCccCCCcEE------- Q lcl|Aclame:pro 220 NLIDEVYQDDPNLVAIVGRKLLADKYFPLVN-----KQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVL------- 287 (355) Q Consensus 220 ~lid~~~~~~~~LVvivG~dLl~~k~~~l~n-----~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~il------- 287 (355) . +|+..-...+.++||+.+- |..|++ ..+..+...... -...++-|++++..|.+|-...- T Consensus 192 ~-Lde~~VP~~gR~~vv~P~~----y~~Ll~~~~~~~~~~~~~~~~~~--G~vg~i~G~~V~~s~nlp~~~~~~~~~~~~ 264 (347) T protein:vir:88 192 R-LTKNYVPAGDRRFYCAPED----YSAILSALMPNAANYAALIDPET--GNIRNVMGFEVIEVPHLTVGGAGDNNPADG 264 (347) T ss_pred H-HhhcCCCCCCCEEEeCHHH----HHHHhcchhhhhhhhccccchhc--ceeeeeccceEEEeeccccccccccccccc Confidence 3 4776655668999998752 222332 222222211110 00124569999999999943222 Q ss_pred --EecCCC-----c------------EEEEeeCc---EEE---EEEEccchhhhhhhhhhhhhh--hccccccEEEEecc Q lcl|Aclame:pro 288 --VTTLEN-----L------------SIYFMDES---HRR---SIDENPKKDRVENYESMNIDY--VVEVYAAGCLLENI 340 (355) Q Consensus 288 --IT~l~N-----L------------sIY~Q~gs---~RR---~~~d~p~r~rve~y~s~Ne~Y--vVEd~~~~a~ienI 340 (355) +|.... + .++||+.. .+- +++-.-+-++..++--.=..| -|=..++++.|+ T Consensus 265 ~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~~e~~r~~~~~~d~i~~~~~~G~~~~rPe~a~~~~-- 342 (347) T protein:vir:88 265 VAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQADQIIGKYAMGHGGLRPEAAGALV-- 342 (347) T ss_pred ccccccccccccccccccccccCcEEEEEechhhhhheecccceeeeeechhhHHHHhhhhhhhcCceeccceEEEEE-- Confidence 222111 1 12333221 100 011011111222211111111 122333333322 Q ss_pred eecCcc Q lcl|Aclame:pro 341 TLGDFT 346 (355) Q Consensus 341 ~~~~~~ 346 (355) +..++ T Consensus 343 -~~~a~ 347 (347) T protein:vir:88 343 -FTPAA 347 (347) T ss_pred -eCCCC Confidence 21111 No 128 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=64.13 E-value=0.31 Score=23.41 Aligned_cols=303 Identities=12% Similarity=0.059 Sum_probs=125.0 Q ss_pred CCHHHH-HHHHHHHHHHHH--HhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhccccccccc Q lcl|Aclame:pro 1 MRPETR-FKFNAYLTRVAE--LNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIA 77 (355) Q Consensus 1 M~~~tr-~~f~~y~~~~A~--~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia 77 (355) |-|..- .+.+ .+.+. .+|-..+-.-+.|+ -.+..+.+.+|-|+.+.|+-.+. .|..+.+-.-|... T Consensus 1 ~~~~~~~~~~~---t~~g~~~~~~~~~al~ie~~~------g~V~~~f~~~s~~~~~v~~r~~~--~G~sv~i~~iG~~t 69 (347) T protein:vir:33 1 MANIQGGQQIG---TNQGKGQSAADKLALFLKVFG------GEVLTAFARTSVTMPRHMLRSIA--SGKSAQFPVIGRTK 69 (347) T ss_pred CCCCccCcccc---cccccCCcccchHHHHHHHHH------HHHHHHHHHHHhhhhhhcccccc--ccceeEeeecccee Confidence 221100 0000 01111 11110000113333 45566788899999999986554 48888887766654 Q ss_pred ccccCCCCcCccccccccccCcceeEEeeeec-ceeCHHHHHhhcccchHHHHHHHHHHHHhhhh--HHHHhhccccccc Q lcl|Aclame:pro 78 STTDTSGDKERQTADFTALESSKYECNQINFD-FHLKYKTLDLWARFQDFQRRIRDAIVKRQALD--LIMAGFNGTTRAD 154 (355) Q Consensus 78 ~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d-~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD--~i~IGfnG~s~A~ 154 (355) ...-|.++. -..++..+...+..|.--+.- .+..-+.||.|-..-|+...+.+.....+|-. .-.+.- T Consensus 70 ~~~~~~g~~--l~~~~~~~~~~e~~ltiD~~~y~~~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~------- 140 (347) T protein:vir:33 70 AAYLKPGEN--LDDKRKDIKHTEKVIHIDGLLTADVLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAE------- 140 (347) T ss_pred eeeecCCCC--CCCCCCCCccceEEEEechhhhhhHHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHH------- Confidence 332222111 011222233344334311111 12334589999988888888877776666543 222211 Q ss_pred CCChhhhhhhhccchhHHHHHHhhccccccccccc-cCCccccceeeeCCCcc----hhhHHHHHHHHHhcccchhhhCC Q lcl|Aclame:pro 155 TSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITD-ADGKVVSAVIRVGKNGD----YENIDALVMDATNNLIDEVYQDD 229 (355) Q Consensus 155 ~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~-~~g~~~~~~i~~G~ggd----y~nLDaLv~d~~~~lid~~~~~~ 229 (355) |.+++..++..+...... ..+........-|...| =.++=..+.++... +++..-.. T Consensus 141 -----------------l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~-Lde~~VP~ 202 (347) T protein:vir:33 141 -----------------LAGLVNLPDGSNENIEGLGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARAS-LTKNYVPA 202 (347) T ss_pred -----------------HHHhhhhhcccccccccccccccccccccccccccchhhhHHHHHHHHHHHHHH-HhhcCCCc Confidence 001111000000000000 00000000000011111 02333345556654 47755555 Q ss_pred CCeEEEEcHHHHHHH--HHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEec----------------- Q lcl|Aclame:pro 230 PNLVAIVGRKLLADK--YFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTT----------------- 290 (355) Q Consensus 230 ~~LVvivG~dLl~~k--~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~----------------- 290 (355) .+.+++|+.+....- --++.+..-..++.++--. -.++.|.+++..|.||..++--+. T Consensus 203 ~gR~~vv~P~~y~~Ll~~~~~~~~d~~~~~~~~~G~---V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~ 279 (347) T protein:vir:33 203 ADRTFYTTPDNYSAILAALMPNAANYQALLDPERGT---IRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSST 279 (347) T ss_pred cCcEEEeCHHHHHHHhccccccccccccccccccce---eEEEeceeEEEecccccCccccccccccccccccccCCccc Confidence 688999997633210 0122222211112111111 136899999999999987643221 Q ss_pred -----CCCcE-EEEeeCcE-EEEEEE-ccchhhhhhhhhhhhhhhccccccEEEE---e---cceecCccC Q lcl|Aclame:pro 291 -----LENLS-IYFMDESH-RRSIDE-NPKKDRVENYESMNIDYVVEVYAAGCLL---E---NITLGDFTA 347 (355) Q Consensus 291 -----l~NLs-IY~Q~gs~-RR~~~d-~p~r~rve~y~s~Ne~YvVEd~~~~a~i---e---nI~~~~~~~ 347 (355) ++++. +.||+... -=...+ ..++.|-++|+. ++|+.-|-..|-+ | .|++....+ T Consensus 280 ~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~~~~~~---d~i~~~~~~G~~vlrP~~av~i~~~~~~~ 347 (347) T protein:vir:33 280 TVKVALDNVVGLFQHRSAVGTVKLKDLALERARRANYQA---DQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred ceeccccceeeeeecchhheeeeeeceeeeeccchhhhh---HhhhhhhhcCCceecccceEEEecCCCCC Confidence 22222 23343322 111222 334444444443 3333333222222 1 234433333 No 129 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=60.28 E-value=0.28 Score=23.58 Aligned_cols=290 Identities=12% Similarity=0.137 Sum_probs=121.5 Q ss_pred CCHHHHHHHHHHHHHHHHHh--CCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccc-- Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELN--NISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTI-- 76 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~n--gv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~i-- 76 (355) -++.-|-.|.....++=++- .+.-++. .+. .....++.+.+.+.+.|++|..+....| +|-....-....+ T Consensus 5 ~~~~~~~~~~~~~~~~p~l~m~alTLaea-~~l-~~d~~~~~VIE~l~~~s~iL~~lpf~~v---e~~~~~~~r~~~lp~ 79 (330) T protein:vir:94 5 CTPPLRGRWRTLTHQFPELKMPTVTLAES-AKL-SQDHLVSGLIETIVEVNPLYEMMPFTEI---EGNALAYNRENVLGD 79 (330) T ss_pred cCCccccceeehhccccccchhhhhhhHH-hhc-CchhhHHHHHHhhhccchHHhhcccccc---cCCcceeeeeecCCc Confidence 22333333433332211111 1111121 122 2457788889999999999987755443 3322211110000 Q ss_pred -ccc-ccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhccccccc Q lcl|Aclame:pro 77 -AST-TDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRAD 154 (355) Q Consensus 77 -a~R-t~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~ 154 (355) +-| .+++-. +..+.........|.-..-+..+.=...|...+.-|+...-....++.++....--=+||-|. T Consensus 80 a~~r~~n~~~~----~~~~~Tf~q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~~e~~linGDs~-- 153 (330) T protein:vir:94 80 VQFLAVGGTIT----AKNPATFTKVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQYQASMITGDGT-- 153 (330) T ss_pred ceeeecccccc----ccCcceeeeeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhhccCCC-- Confidence 001 011100 000100111122222211112222222333333335665555666666666666666788332 Q ss_pred CCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCC-CCeE Q lcl|Aclame:pro 155 TSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDD-PNLV 233 (355) Q Consensus 155 ~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~-~~LV 233 (355) |.-+ -|=++.+ ++.++++ .|..|-.-++|. +..||+..+..+ ..-+ T Consensus 154 ------~~~F----~GL~~~~---~~~q~i~---------------tg~~gg~~T~d~-----LDeLl~~v~~~~g~~~~ 200 (330) T protein:vir:94 154 ------GNSF----QGMMGLV---AASQTIS---------------AGANGGTLTFEL-----LDQLLDLVKDKDGQVDY 200 (330) T ss_pred ------Cccc----cchhhcC---CcccEEe---------------cCCCCCCCCHHH-----HHHHHHHhcCCCCCCcE Confidence 1101 1333322 3444432 232223334442 233444443322 1236 Q ss_pred EEEcHHHHHHHHHHHHhhc------cccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEee-C----- Q lcl|Aclame:pro 234 AIVGRKLLADKYFPLVNKQ------QENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMD-E----- 301 (355) Q Consensus 234 vivG~dLl~~k~~~l~n~~------~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~-g----- 301 (355) +++.+.... + ..-+-+. -+++...-+..+ -+++|.|.+..-+.|.+.--.|.-.==|||.-+ | T Consensus 201 ~l~n~a~~r-~-I~a~~R~~~~~~v~~~~~~~~G~~v---~~~~GvPi~~~d~ip~~~~~~~~~~ttsIyav~~G~~~~~ 275 (330) T protein:vir:94 201 LMSSFAMRR-K-YFSLLRALGGAAIGEVMTLPSGRQI---PTYRGVPWFVNDFIPSNMTQGTATNATAIFAGTFDDGSNK 275 (330) T ss_pred EEechhHHH-H-HHHHHHhccCCCCCCcccccCCCEE---eeeCCeEEEecccccCCCCcccCCCceeEEEEeecccccc Confidence 666655443 1 2222221 112222222222 268888888777777753222222233566433 1 Q ss_pred --------------cEEE--EEEEcc-chhhhhhhhhhhhhhhccccccEEEEecceec Q lcl|Aclame:pro 302 --------------SHRR--SIDENP-KKDRVENYESMNIDYVVEVYAAGCLLENITLG 343 (355) Q Consensus 302 --------------s~RR--~~~d~p-~r~rve~y~s~Ne~YvVEd~~~~a~ienI~~~ 343 (355) +.|- .+.+-+ .|=+|+-|. +-+|-...+++.++||+++ T Consensus 276 qgV~Gl~~~g~~glsVr~~G~~~~k~v~~~~v~~y~----~~av~~~~a~~~L~~V~~g 330 (330) T protein:vir:94 276 YGIAGLTARGSAGLRVQNVGAKENADETITRVKMYC----GFANFSQLGLAAIKGLIPG 330 (330) T ss_pred cceEeecCCCCCcceeeeCCCccccceeeEEEEEee----eeEEechhheeeeccccCC Confidence 1111 011111 223444443 3478888899999999999 No 130 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=53.19 E-value=0.53 Score=22.07 Aligned_cols=294 Identities=12% Similarity=0.159 Sum_probs=129.5 Q ss_pred CCH---HHHHHHHHHHHHHHHHhCCChHHcc---eeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccc Q lcl|Aclame:pro 1 MRP---ETRFKFNAYLTRVAELNNISTDDVS---KKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTG 74 (355) Q Consensus 1 M~~---~tr~~f~~y~~~~A~~ngv~~~~v~---~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~ 74 (355) |++ .||.-+ .|.. +++. +.|+ -...++++.++-|+.+.++-.++. |..+-+-.- T Consensus 1 ms~~~~~tr~~~----------~~s~-~d~al~le~f~------geV~~af~~~s~~~~~~~~rti~~--g~s~~~~~i- 60 (335) T protein:vir:63 1 MSFLNDLTRPNY----------AGKN-ADVDIHLEEHL------GIVDKHFAYTSKFAPLMNIRDLRG--SNVVRLDRL- 60 (335) T ss_pred CCCcccchhhhc----------cccc-chhheehhhhh------hhHHHHHHhhhhhccccceeeecc--ceeEEEeee- Confidence 653 344332 1111 2222 3443 345678888999998888876532 444444322 Q ss_pred cccccccCCCCcCccccccccccCcceeEE-eeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhc-cccc Q lcl|Aclame:pro 75 TIASTTDTSGDKERQTADFTALESSKYECN-QINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFN-GTTR 152 (355) Q Consensus 75 ~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~-qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfn-G~s~ 152 (355) |++.-......++.+.......+..+. .+-.=++..-+.||.|-.+-|+-..+.+.+-+.+|-..=.--|. -.++ T Consensus 61 ---G~~~~~~~~pG~~l~~~~~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~a 137 (335) T protein:vir:63 61 ---GNVEAKGRRAGEELERSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKA 137 (335) T ss_pred ---eeeeeecccCCcCcCCCCccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 222221111112221111112222211 11122345567899999988888888887776666432111100 0111 Q ss_pred ccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeee-CCCcchhhHHHHHHHHHhcccchhhhCC-- Q lcl|Aclame:pro 153 ADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRV-GKNGDYENIDALVMDATNNLIDEVYQDD-- 229 (355) Q Consensus 153 A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~-G~ggdy~nLDaLv~d~~~~lid~~~~~~-- 229 (355) |..+.|...|. ||. .|......++- +...++..|.+.++++.+.| ++..-.+ T Consensus 138 a~~~a~~~~~~------~~~------------------~G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~L-~e~dVP~~~ 192 (335) T protein:vir:63 138 AAMDAPVDLED------AFS------------------PGVLEKLDLTGLTAKQAADKIVRMHRRVVETF-IDRDLGDAV 192 (335) T ss_pred ccccCccccCC------CcC------------------CCcceeeeeccCcccccHHHHHHHHHHHHHHH-HhccCCCcc Confidence 11222221111 111 01111001111 11125667777788888866 5533322 Q ss_pred -CCeEEEEcHH----HHHHHHHHHHhhccccchh---hHHHHHHhhhhhcccccccCCccCCCcEEEecCCC-------- Q lcl|Aclame:pro 230 -PNLVAIVGRK----LLADKYFPLVNKQQENSES---LAADIIISQKRIGNLPAVRVPYFPANAVLVTTLEN-------- 293 (355) Q Consensus 230 -~~LVvivG~d----Ll~~k~~~l~n~~~~~te~---~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~N-------- 293 (355) .|.|++|..+ |+.++ +++|..-..+.. .+.- .--.+-|.|++..|.||...+--++|.| T Consensus 193 ~~dr~~vv~P~~y~~Ll~~~--~l~n~~~~~s~~~~~~~~g---~v~~v~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d 267 (335) T protein:vir:63 193 YSEGLTPMSPRVFSLLLEHD--KLMNVEYQATGATNDYVKS---RVAILNGVKVLETPRFATKAIAAHPLGRHFNVSAEE 267 (335) T ss_pred cCceEEEeChHHHHHHhccc--cccccccccccccccccCc---eeEEeeceEEEeeccCCCCCcccccccccCCccccc Confidence 2489999865 33322 234432111111 1110 0126889999999999998866666644 Q ss_pred ----cEEEEeeCc---EEEE-----EEEccch--hhhhhhhhhhhhhhccccccEEEEecceecCccCCC Q lcl|Aclame:pro 294 ----LSIYFMDES---HRRS-----IDENPKK--DRVENYESMNIDYVVEVYAAGCLLENITLGDFTAPA 349 (355) Q Consensus 294 ----LsIY~Q~gs---~RR~-----~~d~p~r--~rve~y~s~Ne~YvVEd~~~~a~ienI~~~~~~~~~ 349 (355) --+++|+.. .+-. +..++++ +.+..|++.+ -.+=..++++.|+==-++-..-.+ T Consensus 268 ~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~~~~~~i~~~~a~G--~g~lRPe~a~~i~~tg~~~~~~~~ 335 (335) T protein:vir:63 268 SERQIALFLPSKTLITAQVAPVQAKLWEDNEKFSWVLDTFQMYN--IGARRPDTAGAIELKGIGAFDITA 335 (335) T ss_pred cceeEEEEEecceEEEEEEeecccceeeccchhhHHhHHHHHcC--CcccccceEEEEEEcCCCceeecC Confidence 233444432 0000 1111111 2333333311 133456667766611111111111 No 131 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=39.49 E-value=1 Score=20.55 Aligned_cols=279 Identities=13% Similarity=0.101 Sum_probs=126.2 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhcc------cccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGV------GVTG 74 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~l------gv~~ 74 (355) |.+..--.|-+ ++ -=.|+|.+-+.+...+. ..+|+.--..++- --|.+.. |... T Consensus 1 ~~~~~~g~f~~---~~-------------l~~id~~v~e~~~~~l~-~r~l~~v~~~~~~---~~~~~~~~~~~~~G~~~ 60 (301) T protein:vir:80 1 MQGKITATIEA---RD-------------LQAIDNVIYEPKQEELT-ARSVFPQKFDVNE---GAESYSFDVMTRSGAAK 60 (301) T ss_pred CCccccchhhH---HH-------------HHHHHHHHHHhhhhhhh-hhhhcccccCCCC---ceEEEEEeeeccceeEE Confidence 33332222110 00 00122333333333333 2223222111111 1111111 1111 Q ss_pred cccccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccc-hHHHHHHHHHHHHhhhhHHHHhhcccccc Q lcl|Aclame:pro 75 TIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQ-DFQRRIRDAIVKRQALDLIMAGFNGTTRA 153 (355) Q Consensus 75 ~ia~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~-dF~~~i~~~i~~~~alD~i~IGfnG~s~A 153 (355) .+.. +..+ -|..-...+.......+.--+..+.|..|.+.+..+ +...+-+.+.++..+..+=.+.|+|.+-. T Consensus 61 ~~~~-----~~~d-ip~~~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~ 134 (301) T protein:vir:80 61 IIAN-----GADD-LPLVDVDMVRKSVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKY 134 (301) T ss_pred EecC-----cccc-cccccccceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccc Confidence 2211 1111 122222233334455555666778889999998764 67778888889999999999999994422 Q ss_pred cCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchh--hHHHHHHHHHhcccchh-----h Q lcl|Aclame:pro 154 DTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYE--NIDALVMDATNNLIDEV-----Y 226 (355) Q Consensus 154 ~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~--nLDaLv~d~~~~lid~~-----~ 226 (355) -.+=.-.+|.++ .+..+ ..+ .|...+++ +-|.++.|+. .++... + T Consensus 135 g~~GLlN~p~~~-----------------~~~~~--~~~--------~~~~~~w~~~t~~ei~~di~-~~~~~l~~~s~g 186 (301) T protein:vir:80 135 AIKGAFEATGIQ-----------------IDVSP--TTG--------VGNVSKWEKKTAEQIIDEIG-EAHTKITVLPGY 186 (301) T ss_pred cceeeecCCCcc-----------------ccccc--Ccc--------cccccccccCCHHHHHHHHH-HHHHHHHHhcCc Confidence 211111111110 00000 000 01111222 2233333322 222221 1 Q ss_pred hCCCCeEEEEcHHHHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCc-------EEE-ecCCCcEEEE Q lcl|Aclame:pro 227 QDDPNLVAIVGRKLLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANA-------VLV-TTLENLSIYF 298 (355) Q Consensus 227 ~~~~~LVvivG~dLl~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~-------ilI-T~l~NLsIY~ 298 (355) ...| ...++..++...-.-+.+ +..+.....+.+ ++.+.++..+++|.+...+ ++. ..-+|+++-+ T Consensus 187 ~~~p-~~L~L~p~~~~~L~~~~~---~~~~~~tvl~~l--~~~~~~~~I~~~p~L~~~g~~g~~~~v~~~~~~d~~~~~v 260 (301) T protein:vir:80 187 GTAS-LKLCLPPKQFELINKKRY---SNEDSRSVLKVL--QDNAWFSAIVRVPDLAGMGTAGSDSFAVIHDSNETAELII 260 (301) T ss_pred eecc-cEEEecHHHHHhhhhccc---cCCCCeeHHHHH--HHHcCcceEEEcceeccCCCCcccEEEEEecCCcEEEEEe Confidence 1222 345556554332111222 222333334444 3467789999999998754 333 3477877765 Q ss_pred eeCcEEEEEEEccchhhhhhhhhhhhhhhccccccEEEEecc Q lcl|Aclame:pro 299 MDESHRRSIDENPKKDRVENYESMNIDYVVEVYAAGCLLENI 340 (355) Q Consensus 299 Q~gs~RR~~~d~p~r~rve~y~s~Ne~YvVEd~~~~a~ienI 340 (355) -. .+|+.-.+--...-.+.|+.+--+-+|=...++|-+++| T Consensus 261 ~~-~~~~~~~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 261 PM-DITRHPEEYSFPRTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred cC-ceeeecceecCceeEeeeeeeeEEEEEEccceEEEEecC Confidence 43 344443322223444566666666777778889999999 No 132 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=36.59 E-value=0.83 Score=21.02 Aligned_cols=304 Identities=16% Similarity=0.090 Sum_probs=125.1 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHH---cceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDD---VSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIA 77 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~---v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia 77 (355) |.+.....+-+--..-...+|-..++ .-+.|+ -.+..+.+.+|-|+.+.++-.++ .|..+.+-.-|... T Consensus 1 ~~~~~~~~~~~~n~~t~~~~~~~~~~~al~le~f~------geV~~~f~~~si~~~~~~~rti~--~Gksv~f~~iG~~t 72 (375) T protein:vir:10 1 MANANQVALGRSNLSTGTGYGGATDKYALYLKLFS------GEMFKGFQHETIARDLVTKRTLK--NGKSLQFIYTGRMT 72 (375) T ss_pred CccccccccCccccCCccccccccchHHHHHHHHh------HHHHHHHHHHHhhhccccccccc--cCceEEEEeeeeeE Confidence 55443322211000000000000011 123444 45677888899999888876554 36666555444333 Q ss_pred ccccCCCCc-Cccccc-------cccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhc- Q lcl|Aclame:pro 78 STTDTSGDK-ERQTAD-------FTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFN- 148 (355) Q Consensus 78 ~Rt~T~~~~-~r~~~~-------~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfn- 148 (355) ...-|.++. ...+.. ...++..+|. +..-+.+|.|...-||...+.+..-+.+|-..=.-=+. T Consensus 73 ~~~~t~G~~i~~~~~~d~~~te~~l~ID~~~y~--------~~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~ 144 (375) T protein:vir:10 73 SSFHTPGTPILGNADKAPPVAEKTIVMDDLLIS--------SAFVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRS 144 (375) T ss_pred EeeecCCcCcCCccccCCCCCceEEEecchhhh--------hhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 222222211 111110 1222222222 23445899999999999988888776666432211010 Q ss_pred ccccccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCC------cchhhHHHHHHHHHhccc Q lcl|Aclame:pro 149 GTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKN------GDYENIDALVMDATNNLI 222 (355) Q Consensus 149 G~s~A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~g------gdy~nLDaLv~d~~~~li 222 (355) =..+|....|.. ..+. ..+|. ..+..|.+ .+=.++-..+.++.. .+ T Consensus 145 l~kaa~~~~p~~-----------------------~~~~-~~~Gg---~~i~~~sg~~~~~~~ta~~~~~ai~~a~~-~L 196 (375) T protein:vir:10 145 ITRGARSASPVS-----------------------ATNF-VEPGG---TQIRVGSGTNESDAFTASALVNAFYDAAA-AM 196 (375) T ss_pred HHHhhhhccccc-----------------------cccc-cccCc---ceeeeccccccccccCHHHHHHHHHHHHH-HH Confidence 000011111100 0000 00111 01122211 112345344557775 45 Q ss_pred chhhhCCCCeEEEEcHHH----HHHHHH-HHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcE----------- Q lcl|Aclame:pro 223 DEVYQDDPNLVAIVGRKL----LADKYF-PLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAV----------- 286 (355) Q Consensus 223 d~~~~~~~~LVvivG~dL----l~~k~~-~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~i----------- 286 (355) |+..-.+.+.+++|+.+. |.++.- +++|..-......+.-. --++.|++.+..+.+|-.+. T Consensus 197 de~~VP~~~R~~vv~P~~y~~Ll~~~d~~~~~n~d~~~~~~~~~g~---v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~ 273 (375) T protein:vir:10 197 DEKGVSSQGRCAVLNPRQYYALIQDIGSNGLVNRDVQGSALQSGNG---VIEIAGIHIYKSMNIPFLGKYGVKYGGTTGE 273 (375) T ss_pred hhcCCCCCCCEEEeChHHHHHHHhcCCccceeeecccccceeccce---EEEEeceEEEEeccccccccccccccccccc Confidence 787777778999999763 332211 23333211111111000 12688999999988885432 Q ss_pred ---------EEecCCC-------cEEEEeeC---cEEEEEEEccch--------hhhhh----hhhhh-hhhhccccccE Q lcl|Aclame:pro 287 ---------LVTTLEN-------LSIYFMDE---SHRRSIDENPKK--------DRVEN----YESMN-IDYVVEVYAAG 334 (355) Q Consensus 287 ---------lIT~l~N-------LsIY~Q~g---s~RR~~~d~p~r--------~rve~----y~s~N-e~YvVEd~~~~ 334 (355) +..+.+| =+-|+... +..=-+...|+- -++|- |+-+- .+|+|.-|-.. T Consensus 274 ~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~~~~~A~g~v~~~~~~~~~~~~~~~~~~q~~~i~~~~a~G 353 (375) T protein:vir:10 274 TSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLIFQKEAAGVVEAIGPQVQVTNGDVSVIYQGDVILGRMAMG 353 (375) T ss_pred cchhhhhccccccCCcceeeccccccccccccccCceEEEEEchhheeeeeeeccccccccchhhheeeeeeeeeeeeec Confidence 1111111 11343333 211111111111 11221 22222 34455555544 Q ss_pred EEEe----cceecCccCCCCcCCCC Q lcl|Aclame:pro 335 CLLE----NITLGDFTAPAAPESGA 355 (355) Q Consensus 335 a~ie----nI~~~~~~~~~~~~~~a 355 (355) +-+- -++|.. + +++++| T Consensus 354 ~~~lrp~~av~l~~---~-~~~~~~ 374 (375) T protein:vir:10 354 ADYLNPAAAVELYI---G-ATAPSA 374 (375) T ss_pred cCccCceeEEEEec---C-cCcccc Confidence 4443 233321 1 233334 No 133 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=31.22 E-value=1.5 Score=19.59 Aligned_cols=292 Identities=17% Similarity=0.140 Sum_probs=132.0 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcc---eeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVS---KKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIA 77 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~---~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia 77 (355) |.+-.--.+ .+-+ ..|.. +++. +.|+ -...++.+.++-|+.+.++-.++ .|..+-+-.-|... T Consensus 1 m~~~~~~~~----t~~~-~~~~~-~~~~l~le~~~------geV~~af~~~s~~~~~~~~r~i~--~G~s~~~~~iG~~~ 66 (334) T protein:vir:80 1 MTYPAANTH----TRPG-WGGAN-SDVSLHIEEHL------GLVDASFMYSSKFASWMNVRSLR--GTNQLRVDRVGAST 66 (334) T ss_pred CCCCcCCCc----cccc-ccccc-chheehhhhhh------hHHHHHHHHhhhhhccceeeecc--ccceEEEeeeccee Confidence 443211000 0000 00111 1111 2222 34466888899999888886553 27777765544432 Q ss_pred ccccCCCCcCccccccccccCcceeEEeee-ecceeCHHHHHhhcccchHHHHHHHHHHHHhhh--hHHHHhhccccccc Q lcl|Aclame:pro 78 STTDTSGDKERQTADFTALESSKYECNQIN-FDFHLKYKTLDLWARFQDFQRRIRDAIVKRQAL--DLIMAGFNGTTRAD 154 (355) Q Consensus 78 ~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn-~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~al--D~i~IGfnG~s~A~ 154 (355) .......++.+...+...+-.|.=-+ .=++..-+.||.|-.+-||...+.+..-..+|- |+-.+.= ..++|. T Consensus 67 ----~~~~~~g~~l~~~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~-l~kaa~ 141 (334) T protein:vir:80 67 ----IAGRKAGEELVVQKNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQ-LQKCGD 141 (334) T ss_pred ----eeeecCCCCCCCCCcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHH-HHHhhh Confidence 22222333444444445555555433 334556678999999999999999999998888 7533211 112222 Q ss_pred CCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcch-hhHHHH---HHHHHhcccchhhhCC- Q lcl|Aclame:pro 155 TSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDY-ENIDAL---VMDATNNLIDEVYQDD- 229 (355) Q Consensus 155 ~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy-~nLDaL---v~d~~~~lid~~~~~~- 229 (355) ...|..++. +| ..|..... ...|...+. .+-|+| +.++.+.| ++..-.+ T Consensus 142 ~~~~~~~~~------~~------------------~~G~~~~~-~~~g~~~~~~~~~~~l~~a~~~a~~~L-~e~dvp~~ 195 (334) T protein:vir:80 142 FLAPAHLKP------AF------------------HDGILLPS-TISGLAADAAADADVLVAAHRQGVEAM-VFRDLGDQ 195 (334) T ss_pred hcccccccc------cc------------------cCCcceee-cccccccchhhhHHHHHHHHHHHHHHH-HhcCCCCC Confidence 222221110 00 01111100 001221222 223333 44666544 5533331 Q ss_pred --CCeEEEEcHH----HHHHHHHHHHhhccccchh---hHHHHHHhhhhhcccccccCCccCCCcEEEecC--------- Q lcl|Aclame:pro 230 --PNLVAIVGRK----LLADKYFPLVNKQQENSES---LAADIIISQKRIGNLPAVRVPYFPANAVLVTTL--------- 291 (355) Q Consensus 230 --~~LVvivG~d----Ll~~k~~~l~n~~~~~te~---~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l--------- 291 (355) .+.|++|+.+ ||.++ +++|..=..+.. .+. ..-.++.|.|++..+.||...+--..+ T Consensus 196 ~~~~R~~vv~P~~y~~Ll~~~--r~~n~d~~~s~~~~~~~~---g~i~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~ag 270 (334) T protein:vir:80 196 LMSEGVTLLDPVIFSFLLEHD--RLMNVEFGAKEGGNSFVG---GRIAMLNGVRVVETPRFPQSAITANALGADFNVTDA 270 (334) T ss_pred cCCceEEEeChHHHHHHhccc--ccccceeccccccccccc---eeEEEEeceEEEeecCCCCccccccccccccccccc Confidence 3689999954 44443 345542111111 111 112478899999999999775332211 Q ss_pred --CCcE-EEEeeCcEE---E-----EEEEccch--hhhhhhhhhhhhhhccccccEEEEecceecCc Q lcl|Aclame:pro 292 --ENLS-IYFMDESHR---R-----SIDENPKK--DRVENYESMNIDYVVEVYAAGCLLENITLGDF 345 (355) Q Consensus 292 --~NLs-IY~Q~gs~R---R-----~~~d~p~r--~rve~y~s~Ne~YvVEd~~~~a~ienI~~~~~ 345 (355) .... +++|+...= - .+..++++ +.+..|++.+- -+=..++++.+| +++.++ T Consensus 271 d~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~d~i~~~~a~G~--g~lRPeaa~vv~-~~~~~~ 334 (334) T protein:vir:80 271 EVRRKMITFIPSMALISAQVHPVSAQFWEEKKDFGHYLDTFQSYNI--GQRRPDAVAVHD-ITVTNP 334 (334) T ss_pred cccceEEEEEeCceEEEEEEeecceeeeechhhHHHHHHHHHHcCC--ceeccceEEEEE-EeeecC Confidence 1122 233433210 0 00111111 22222222221 233556666665 333332 No 134 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=22.30 E-value=2.5 Score=18.43 Aligned_cols=298 Identities=11% Similarity=0.067 Sum_probs=115.8 Q ss_pred HHHHHHHhCCC-hHHc----ceeeecCcHH-HHHHHHHHHhhHHHhCcCccccchhhhhhhhcccccccccccccCCCCc Q lcl|Aclame:pro 13 LTRVAELNNIS-TDDV----SKKFTVEPSV-TQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTTDTSGDK 86 (355) Q Consensus 13 ~~~~A~~ngv~-~~~v----~~~Fsv~P~~-~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia~Rt~T~~~~ 86 (355) |++|. .||-. +... ...|- |++ ...+....++++-|++..+-..-.-..|..|.+-.-|.... .+ .. T Consensus 1 ~~~~~-~~~~~~~~~~~~t~~~~fi--Pev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g~~~a-~d---~~ 73 (381) T protein:vir:80 1 MATIQ-GTGGYKGSAVDLSNVQVFI--PEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNISRAAV-YD---KQ 73 (381) T ss_pred Cceec-ccccccCcccchhhHHhhh--hHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccCccee-ee---ec Confidence 33333 22211 1111 12333 433 34455566666667665443322223567776655443321 11 11 Q ss_pred CccccccccccCcceeEEeeee-cceeCHHHHHhhcccchHHHHHHHHHHHHhh--hhHHHHhhcccccccCCChhhhhh Q lcl|Aclame:pro 87 ERQTADFTALESSKYECNQINF-DFHLKYKTLDLWARFQDFQRRIRDAIVKRQA--LDLIMAGFNGTTRADTSDRTKNTL 163 (355) Q Consensus 87 ~r~~~~~~~l~~~~Y~c~qTn~-d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~a--lD~i~IGfnG~s~A~~Td~~anPl 163 (355) +..+.....+...+-.+.=..+ ...+....+|.|...-|+...+.+.....+| .|...++.........+. T Consensus 74 ~g~~i~~~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~------ 147 (381) T protein:vir:80 74 PQTPVNLQARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQ------ 147 (381) T ss_pred CCCcccccccCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc------ Confidence 2234444444444444443222 2235666899998888888888776666665 366666554322221110 Q ss_pred hhccchhHHHHHHhhccccccccccccCCccccceee-eCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEEcHHHHH Q lcl|Aclame:pro 164 LQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIR-VGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKLLA 242 (355) Q Consensus 164 lqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~-~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVvivG~dLl~ 242 (355) +..+. ...+..+ ... ...+ .+.+-.|+. +.++.. .+|+-.-...+.|++|+.+... T Consensus 148 ---~~~t~---------~~~i~~~-----~~~-~~~t~~~~~~t~~~----i~~a~~-~Lde~~VP~egR~lvv~P~~~~ 204 (381) T protein:vir:80 148 ---RIYSY---------DTTLGDG-----TVN-AHLTGTPAPLTYAA----LLLAKQ-KLDEADVPQEGRIVMVSPAQYI 204 (381) T ss_pred ---ccccc---------ccccccc-----ccc-cccccchhhHHHHH----HHHHHH-HHhhcCCCcCCcEEEeCHHHHH Confidence 00000 0000000 000 0000 011112333 334443 4466444345679999976444 Q ss_pred HH--HHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEEccchhhhhhhh Q lcl|Aclame:pro 243 DK--YFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPKKDRVENYE 320 (355) Q Consensus 243 ~k--~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r~rve~y~ 320 (355) +- .-.+.+..-..+..+..-. -.++-|.+++..+.+|....- -..+.-..- ...........+.. ..++ T Consensus 205 ~Ll~~~~~~~ad~~~~~~l~~G~---Ig~i~G~~Vv~Sn~lp~~~~t--~~~~~agap---~~~~~~~~~~~~~g-~~s~ 275 (381) T protein:vir:80 205 DLLSINQFISVDFSQVKPVTSGV---VGTILGMEVIVTTQIGINSLT--GYVNGQGAP---TQPTPGVLGSPYLP-DQAG 275 (381) T ss_pred HHhhchhhhhhhhccchhhhcee---eeEEcceEEEeeccccccccc--ceeeecccc---cccccccccccccc-cccc Confidence 21 1122333222222121111 136779999999999976442 111111000 00000000000000 0011 Q ss_pred hhhhhhhccccccEEEEe--cceecCccCCCCcCCCC Q lcl|Aclame:pro 321 SMNIDYVVEVYAAGCLLE--NITLGDFTAPAAPESGA 355 (355) Q Consensus 321 s~Ne~YvVEd~~~~a~ie--nI~~~~~~~~~~~~~~a 355 (355) -.+.-+.+-+|+..-..+ .+..-...-....++.. T Consensus 276 ~a~av~~~k~yd~~~~~~~~~~~~~~g~~~~~~~~~~ 312 (381) T protein:vir:80 276 TANVVNTGSASDLAVSLSYFGLPVFSGAGATAADGGQ 312 (381) T ss_pred ceeeeeeeeeeceeeeeeeccceeeecceeeecCCCc Confidence 122223333444433333 11111100000000000 No 135 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=21.87 E-value=2.5 Score=18.37 Aligned_cols=267 Identities=13% Similarity=0.092 Sum_probs=102.9 Q ss_pred CCHHHHHHHHHHHHHHHHHhCCChHHcceeeecCcHHH-HHHHHHHHhhHHHhCcCccccchhhhh---hhhcccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVT-QTLMNTVQASSAFLKTINILPVAEMKG---EKIGVGVTGTI 76 (355) Q Consensus 1 M~~~tr~~f~~y~~~~A~~ngv~~~~v~~~Fsv~P~~~-q~L~~~iqess~FL~~INv~~V~e~~G---e~v~lgv~~~i 76 (355) |-+.+- +++ + -+-|++- ..+.+++.+..-|-+...+. +++.| ..|.+-.-..+ T Consensus 1 Ma~~~T--------~l~--------d-----~i~Pev~~~~v~~~~~~~~~~~~~~~~~--~~l~g~~G~ti~iP~~~~i 57 (276) T protein:vir:10 1 MAQGTT--------TKS--------T-----QIVPEVLAPMMQAELDKKLRFAQFADID--STLVGQPGDTLTFPAFVYS 57 (276) T ss_pred CCccee--------ehh--------h-----hhchHHHHHHHHHHHHhhhhhcccceec--ccccCCCCCEEEeeeecCC Confidence 332110 000 0 0233332 23334444444454443332 33333 33433211111 Q ss_pred cccccCCCCcCccccccccccCcceeEEeeeecceeCHHHHHhhcccchHHHHHHHHHHHHhhhhHHHHhhcccccccCC Q lcl|Aclame:pro 77 ASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTS 156 (355) Q Consensus 77 a~Rt~T~~~~~r~~~~~~~l~~~~Y~c~qTn~d~~i~y~~LD~WA~~~dF~~~i~~~i~~~~alD~i~IGfnG~s~A~~T 156 (355) ..=.+-..+.+ -+...+..+......++-.+-+. ...++.=....|+...+.+++...+|..+-.-- T Consensus 58 gda~~~~eg~~-i~~~~lt~~~~~a~i~~~~k~~~--~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~---------- 124 (276) T protein:vir:10 58 GDATVVPEGQK-IPVDKIETNRREAKIHKIGKGTD--ITDEALLSGYGDPQGEAVRQHGLAIANKVDNDV---------- 124 (276) T ss_pred CccccccCCCc-cCccccccceeeEEeehcccccc--ccHHHHHhhccchHHHHHHHHHHHHHHHHHHHH---------- Confidence 00000111111 12222223334444444433333 334444444567777777666666664333211 Q ss_pred ChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcchhhHHHHHHHHHhcccchhhhCCCCeEEEE Q lcl|Aclame:pro 157 DRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIV 236 (355) Q Consensus 157 d~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggdy~nLDaLv~d~~~~lid~~~~~~~~LVviv 236 (355) +..+.. . ......+. - +.| .+.|++..| ++ +++..-+++| T Consensus 125 ---------------~~~l~~---------~---~~~~~~~~------~---t~d-~i~~A~~~l-gd--~~~~~~~ivv 164 (276) T protein:vir:10 125 ---------------LEALRG---------T---KLTVSADI------G---TLA-GLEAAIDTF-DD--EDLEPMVLFI 164 (276) T ss_pred ---------------HHHHhc---------c---cccccccc------c---CHH-HHHHHHHHh-cc--ccCcccEEEE Confidence 111111 1 01111111 1 233 344666544 43 2334457778 Q ss_pred cHHHHHHHHHHH-HhhccccchhhHHHHHH--hhhhhcccccccCCccCCCcEEEecCCCcEEEEeeCcEEEEEEEccch Q lcl|Aclame:pro 237 GRKLLADKYFPL-VNKQQENSESLAADIII--SQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPKK 313 (355) Q Consensus 237 G~dLl~~k~~~l-~n~~~~~te~~aa~~~~--~~k~iGGlpa~~~PffP~~~ilIT~l~NLsIY~Q~gs~RR~~~d~p~r 313 (355) .++..+.-. ++ ...-...++.. ...+. .-.++.|+++|.-+..|.+..++-.-.-+-++.+++-. +|-+-+- T Consensus 165 ~p~~~~~L~-k~~~~~f~~~s~~g-~~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gAi~~~~~~~~~---vE~dRd~ 239 (276) T protein:vir:10 165 NPKDAGKLR-SSASDNFTRATELG-DNIIVKGAFGEALGAVIVRSKKLDEGEAILAKRGAVKLITKRDFF---LETDRDP 239 (276) T ss_pred cHHHHHHHH-Hhcccccccccccc-ccceeccccceecceeEEEcCCCCcceEEEEeccceeeeecCCce---eecccch Confidence 877544211 11 00101111110 11111 12478999999999999988877766666544443311 2222222 Q ss_pred hhhhhhhhhhhhhhccccccEEEEe--cceecCccCCCCcCCCC Q lcl|Aclame:pro 314 DRVENYESMNIDYVVEVYAAGCLLE--NITLGDFTAPAAPESGA 355 (355) Q Consensus 314 ~rve~y~s~Ne~YvVEd~~~~a~ie--nI~~~~~~~~~~~~~~a 355 (355) ++..+..+.. +-|+. .+++ .+..........|. || T Consensus 240 ~~~~d~i~~~-----~~y~~-~~~~~~~vv~~t~~~~~~~~-~~ 276 (276) T protein:vir:10 240 STKTTALYSD-----KHYVA-YLYDESKAVKVTKGAGTTDS-GA 276 (276) T ss_pred hhcccEEEEe-----eEEEE-EEEcCcceEEEecCCcCCcC-CC Confidence 2222222222 22322 3333 33333323323333 33 No 136 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=21.22 E-value=2.6 Score=18.27 Aligned_cols=297 Identities=14% Similarity=0.123 Sum_probs=126.3 Q ss_pred CCHHHHHHHHHHH-HHHHHHhCCChHH--cceeeecCcHHHHHHHHHHHhhHHHhCcCccccchhhhhhhhccccccccc Q lcl|Aclame:pro 1 MRPETRFKFNAYL-TRVAELNNISTDD--VSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIA 77 (355) Q Consensus 1 M~~~tr~~f~~y~-~~~A~~ngv~~~~--v~~~Fsv~P~~~q~L~~~iqess~FL~~INv~~V~e~~Ge~v~lgv~~~ia 77 (355) |=+..- ..++ .+.+..+...-.+ .-+.|+ -.+..+.+.+|-|+.+.++-.++ .|..+.+-.-|... T Consensus 1 ma~~~~---~~~~~t~~~~~~~~~~~~a~~ie~f~------g~V~~~f~~~s~~~~~~~~~~~~--~G~sv~i~~ig~~t 69 (347) T protein:vir:15 1 MANIQG---GQQIGTNQGKGQSAADKLALFLKVFG------GEVLTAFARTSVTMPRHMLRSIA--SGKSAQFPVIGRTK 69 (347) T ss_pred CCcccc---CCccccccccCCCcchHHHHHHHHHH------HHHHHHHHHhhhhhhcccccccc--ccceeEeeecccee Confidence 222111 1111 1112111110000 113333 34566778899999999887655 48888887666554 Q ss_pred ccccCCCCcCccccccccccCcceeE--Eeeeecc-eeCHHHHHhhcccchHHHHHHHHHHHHhhh--hHHHHhhccccc Q lcl|Aclame:pro 78 STTDTSGDKERQTADFTALESSKYEC--NQINFDF-HLKYKTLDLWARFQDFQRRIRDAIVKRQAL--DLIMAGFNGTTR 152 (355) Q Consensus 78 ~Rt~T~~~~~r~~~~~~~l~~~~Y~c--~qTn~d~-~i~y~~LD~WA~~~dF~~~i~~~i~~~~al--D~i~IGfnG~s~ 152 (355) ...-|.++. -+.++..+...+-.| -+.-+.. .| +.||.|...-|+...+.+.....+|. |.-.++-- T Consensus 70 ~~~~~~g~~--l~~~~~~~~~~e~~ltID~~~~~~~~V--ddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l---- 141 (347) T protein:vir:15 70 AAYLKPGEN--LDDKRKDIKHTEKVIHIDGLLTADVLI--YDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAEL---- 141 (347) T ss_pred eeeeccCCC--CCCCCCCCccceEEEEechhhhhhHHh--hhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHH---- Confidence 322221110 011222233333333 3333322 33 68999998888888888877777765 33222111 Q ss_pred ccCCChhhhhhhhccchhHHHHHHhhccccccccccccCCccccceeeeCCCcc-------hhhHHHHHHHHHhcccchh Q lcl|Aclame:pro 153 ADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGD-------YENIDALVMDATNNLIDEV 225 (355) Q Consensus 153 A~~Td~~anPllqDVNkGWlq~~Re~a~~~v~~~~~~~~g~~~~~~i~~G~ggd-------y~nLDaLv~d~~~~lid~~ 225 (355) .+.+..+|... .+....|.........+.+|+ |.++=.++.++... +|+. T Consensus 142 --------------------~~~~~~~~~~~--~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~-Lde~ 198 (347) T protein:vir:15 142 --------------------AGLVNLPDASN--ENIEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARAS-LTKN 198 (347) T ss_pred --------------------HHHhhcccccc--ccccccCccccccccccccccchhhhhHHHHHHHHHHHHHHH-Hhhc Confidence 01111111000 000000111000011112222 34433345555543 4776 Q ss_pred hhCCCCeEEEEcHH----HHHHHHHHHHhhccccchhhHHHHHHhhhhhcccccccCCccCCCcEEEe------------ Q lcl|Aclame:pro 226 YQDDPNLVAIVGRK----LLADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVT------------ 289 (355) Q Consensus 226 ~~~~~~LVvivG~d----Ll~~k~~~l~n~~~~~te~~aa~~~~~~k~iGGlpa~~~PffP~~~ilIT------------ 289 (355) .-...+.+++|+.+ ||.+.. +++..-..++.+.--.+ .++.|.+++..+.+|..+.-=+ T Consensus 199 ~VP~~gR~~vv~P~~y~~LL~~~~--~~~~d~~~~~~~~~G~V---g~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~ 273 (347) T protein:vir:15 199 YVPAADRTFYTTPDNYSAILAALM--PNAANYQALIDHERGTI---RNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAF 273 (347) T ss_pred CCCccCCEEEeCHHHHHHHhcccc--cccccccccccccceEE---EEEeceEEEecccccccccccccccccccccccc Confidence 66566889999964 444332 22222222222221111 3688999999999996543211 Q ss_pred ----------cCCCcE-EEEeeCcE-EEEEEE-ccchhhhhhhhhhhhhhhccccccEEEE---e---cceecCccC Q lcl|Aclame:pro 290 ----------TLENLS-IYFMDESH-RRSIDE-NPKKDRVENYESMNIDYVVEVYAAGCLL---E---NITLGDFTA 347 (355) Q Consensus 290 ----------~l~NLs-IY~Q~gs~-RR~~~d-~p~r~rve~y~s~Ne~YvVEd~~~~a~i---e---nI~~~~~~~ 347 (355) .+++.. +-||+... -=..++ ..++.|-++|+. +.|+.-|-..+-+ | .|++....+ T Consensus 274 ~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~~~~~~~---d~i~~~~~~G~~vlrP~~av~~~~~~~~~ 347 (347) T protein:vir:15 274 PATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRANYQA---DQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred cccccceeeeccccceeeeeccceeeeeEeeceeeeecccchhhh---hhhehhhhcCCceeccccEEEEecCCCCC Confidence 112222 12232211 001111 333444444443 3333333222222 1 233333333 Done!