Query         lcl|Aclame:protein:vir:104011|NCBI_annot:P2 family phage major capsid protein|genbank:acc:YP_293748;genbank:gi:72537718;genbank:GeneID:3608142
Match_columns 337
No_of_seqs    131 out of 267
Neff          5.1
Searched_HMMs 1612
Date          Sun Dec 1 21:12:16 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_43 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_43_vs_rec_db.hhr

 No Hit                             Prob E-value P-value  Score   SS Cols Query HMM Template HMM
  1 protein:vir:78186 Length: 337  100.0  1E-192  6E-196 1073.2 30.6  337   1-337    1-337 (337)
  2 protein:vir:79171 Length: 337  100.0  1E-191  7E-195 1067.6 31.2  337   1-337    1-337 (337)
  3 protein:vir:79157 Length: 339  100.0  9E-192  6E-195 1067.9 30.3  337   1-337    1-338 (339)
  4 protein:vir:104011 Length: 337 100.0  1E-191  8E-195 1067.0 31.1  337   1-337    1-337 (337)
  5 protein:vir:100331 Length: 342 100.0  8E-190  5E-193 1057.3 30.5  336   1-337    1-341 (342)
  6 protein:vir:6061 Length: 357 # 100.0  2E-189  1E-192 1055.6 30.3  337   1-337    1-345 (357)
  7 protein:vir:2016 Length: 357 # 100.0  2E-189  1E-192 1055.7 30.0  337   1-337    1-345 (357)
  8 protein:vir:5694 Length: 357 # 100.0  2E-189  1E-192 1055.6 30.1  337   1-337    1-345 (357)
  9 protein:vir:98566 Length: 355  100.0  1E-188  7E-192 1050.9 30.7  337   1-337    1-345 (355)
 10 protein:vir:1829 Length: 355 # 100.0  2E-188  1E-191 1049.6 30.7  337   1-337    1-345 (355)
 11 protein:vir:1153 Length: 338 # 100.0  2E-187  1E-190 1044.0 30.9  335   1-336    1-338 (338)
 12 protein:vir:78777 Length: 358  100.0  2E-182  1E-185 1016.9 30.1  330   1-337    5-341 (358)
 13 protein:vir:98856 Length: 343  100.0  2E-176  1E-179  984.4 30.0  328   1-337    1-336 (343)
 14 protein:vir:3746 Length: 336 # 100.0  4E-176  3E-179  982.1 29.6  326   4-337    1-333 (336)
 15 protein:vir:3783 Length: 336 # 100.0  8E-176  5E-179  980.4 29.5  326   4-337    1-333 (336)
 16 protein:vir:270 Length: 341 #  100.0  4E-174  2E-177  971.3 27.0  323   1-337    5-332 (341)
 17 protein:vir:3158 Length: 321 # 100.0 3.5E-79 2.2E-82  450.7 24.5  307   4-337    1-315 (321)
 18 protein:vir:99424 Length: 360  100.0 1.5E-45 9.4E-49  266.3 21.7  325   1-337    1-360 (360)
 19 protein:vir:4197 Length: 314 # 100.0 1.1E-41 6.9E-45  245.1 22.6  304   1-335    1-314 (314)
 20 protein:vir:4159 Length: 315 # 100.0 2.4E-37 1.5E-40  221.3 19.0  306   1-332    1-315 (315)
 21 protein:vir:4092 Length: 390 #  98.8 4.3E-09 2.6E-12   66.5 19.0  293   1-337   72-373 (390)
 22 protein:vir:100247 Length: 425  98.7 7.5E-09 4.7E-12   65.1 19.0  307   1-337  109-424 (425)
 23 protein:vir:100135 Length: 418  98.7 2.9E-08 1.8E-11   61.9 20.5  295   1-337  104-415 (418)
 24 protein:vir:4339 Length: 395 #  98.6 4.7E-08 2.9E-11   60.8 20.3  297   1-337   89-395 (395)
 25 protein:vir:98339 Length: 415   98.6 4.3E-08 2.7E-11   61.0 20.1  299   1-337  101-407 (415)
 26 protein:vir:81100 Length: 415   98.6 4.3E-08 2.7E-11   61.0 20.1  299   1-337  101-407 (415)
 27 protein:vir:79987 Length: 415   98.6 4.3E-08 2.7E-11   61.0 20.1  299   1-337  101-407 (415)
 28 protein:vir:9410 Length: 415 #  98.6 4.1E-08 2.5E-11   61.1 19.5  297   1-337  101-407 (415)
 29 protein:vir:105905 Length: 304  98.5 3.7E-08 2.3E-11   61.3 16.8  280   1-333    1-304 (304)
 30 protein:vir:94142 Length: 304   98.5 3.7E-08 2.3E-11   61.3 16.8  280   1-333    1-304 (304)
 31 protein:vir:4511 Length: 409 #  98.5 1.5E-07 9.6E-11   57.9 19.9  296   1-337   84-406 (409)
 32 protein:vir:4600 Length: 415 #  98.5 2.4E-07 1.5E-10   56.9 20.4  295   1-337  101-404 (415)
 33 protein:vir:4700 Length: 415 #  98.5 2.4E-07 1.5E-10   56.9 20.4  295   1-337  101-404 (415)
 34 protein:vir:94771 Length: 298   98.5 8.3E-08 5.1E-11   59.4 17.8  281  16-333    1-298 (298)
 35 protein:vir:97053 Length: 390   98.4 2.1E-07 1.3E-10   57.2 19.7  293   1-335   80-390 (390)
 36 protein:vir:104085 Length: 320  98.4   1E-07 6.4E-11   58.9 17.5  298   1-337    1-320 (320)
 37 protein:vir:95376 Length: 425   98.4 2.8E-07 1.8E-10   56.5 19.7  291   1-337  111-425 (425)
 38 protein:vir:10364 Length: 390   98.4 6.8E-07 4.2E-10   54.4 21.0  292   1-335   83-390 (390)
 39 protein:vir:4456 Length: 401 #  98.4 5.2E-07 3.3E-10   55.0 19.8  306   1-337   79-401 (401)
 40 protein:vir:103955 Length: 324  98.4   5E-07 3.1E-10   55.1 19.6  292   1-337    1-318 (324)
 41 protein:vir:6242 Length: 390 #  98.3 5.7E-07 3.5E-10   54.8 19.2  291   1-337   81-389 (390)
 42 protein:vir:4226 Length: 326 #  98.3 4.6E-07 2.9E-10   55.3 17.8  299   1-337    3-326 (326)
 43 protein:vir:1638 Length: 298 #  98.3 4.6E-07 2.9E-10   55.3 17.7  281  16-333    1-298 (298)
 44 protein:vir:80376 Length: 435   98.3 1.4E-06 8.7E-10   52.7 20.3  299   1-336   88-435 (435)
 45 protein:vir:41 Length: 299 # N  98.3 5.7E-07 3.5E-10   54.8 18.1  272  20-337    1-298 (299)
 46 protein:vir:7771 Length: 330 #  98.3 3.3E-07   2E-10   56.1 16.6  296   1-337    1-326 (330)
 47 protein:vir:2504 Length: 305 #  98.3 1.3E-06 7.9E-10   52.9 19.7  281  16-337    1-301 (305)
 48 protein:vir:81070 Length: 390   98.2 1.3E-06 7.9E-10   52.9 19.6  293   1-335   80-390 (390)
 49 protein:vir:1886 Length: 385 #  98.2 8.5E-07 5.3E-10   53.9 18.2  294   1-337   70-384 (385)
 50 protein:vir:191 Length: 385 #   98.2 8.5E-07 5.3E-10   53.9 18.2  294   1-337   70-384 (385)
 51 protein:vir:96223 Length: 324   98.2 2.2E-06 1.3E-09   51.6 20.3  292   1-337    1-318 (324)
 52 protein:vir:78523 Length: 338   98.2 1.5E-06 9.3E-10   52.5 19.2  298   1-337    1-338 (338)
 53 protein:vir:96392 Length: 324   98.2 2.5E-06 1.5E-09   51.3 20.2  292   1-337    1-319 (324)
 54 protein:vir:78830 Length: 324   98.2 2.5E-06 1.5E-09   51.3 20.2  292   1-337    1-319 (324)
 55 protein:vir:1328 Length: 392 #  98.2 1.3E-06   8E-10   52.9 18.6  296   1-337   85-391 (392)
 56 protein:vir:94673 Length: 419   98.2 1.7E-06 1.1E-09   52.2 18.9  299   1-337   98-417 (419)
 57 protein:vir:3991 Length: 404 #  98.2 2.8E-06 1.8E-09   51.0 19.9  284   1-337   89-396 (404)
 58 protein:vir:99749 Length: 324   98.2 3.3E-06 2.1E-09   50.6 20.2  292   1-337    1-318 (324)
 59 protein:vir:95763 Length: 297   98.1 1.2E-06 7.2E-10   53.1 17.3  279   1-335    1-297 (297)
 60 protein:vir:100172 Length: 394  98.1 4.1E-06 2.5E-09   50.1 20.2  279   1-337   88-384 (394)
 61 protein:vir:1025 Length: 408 #  98.1 2.9E-06 1.8E-09   50.9 19.1  283   1-337   89-396 (408)
 62 protein:vir:7855 Length: 497 #  98.1 4.9E-06   3E-09   49.7 20.4  324   1-337  130-496 (497)
 63 protein:vir:101650 Length: 497  98.1 4.9E-06   3E-09   49.7 20.4  324   1-337  130-496 (497)
 64 protein:vir:8102 Length: 543 #  98.1 2.8E-06 1.7E-09   51.1 18.4  294   1-337  217-542 (543)
 65 protein:vir:81160 Length: 371   98.1 5.9E-06 3.6E-09   49.3 20.2  279   1-337   71-371 (371)
 66 protein:vir:485 Length: 407 #   98.0 6.1E-06 3.8E-09   49.2 20.6  303   1-337   78-400 (407)
 67 protein:vir:97148 Length: 324   98.0 7.8E-06 4.8E-09   48.6 20.3  292   1-337    1-318 (324)
 68 protein:vir:1433 Length: 435 #  98.0 6.2E-06 3.8E-09   49.1 18.3  297   1-336   91-435 (435)
 69 protein:vir:95963 Length: 395   97.9 3.7E-06 2.3E-09   50.4 16.6  292   1-337   75-378 (395)
 70 protein:vir:9759 Length: 303 #  97.9 5.2E-06 3.2E-09   49.6 17.4  283  20-334    1-303 (303)
 71 protein:vir:7409 Length: 408 #  97.9 1.1E-05 6.6E-09   47.9 19.9  285   1-337   82-396 (408)
 72 protein:vir:2430 Length: 318 #  97.9 6.7E-06 4.1E-09   49.0 17.5  292   1-337    1-316 (318)
 73 protein:vir:1268 Length: 397 #  97.8 1.3E-05 7.9E-09   47.4 18.2  275   1-333   87-397 (397)
 74 protein:vir:101607 Length: 379  97.8 1.7E-05 1.1E-08   46.7 18.8  280   1-332   77-379 (379)
 75 protein:vir:9509 Length: 381 #  97.8 1.7E-05 1.1E-08   46.7 18.7  294   1-337   65-370 (381)
 76 protein:vir:101291 Length: 381  97.8 1.7E-05 1.1E-08   46.7 18.7  294   1-337   65-370 (381)
 77 protein:vir:81227 Length: 413   97.8 1.8E-05 1.1E-08   46.5 21.0  293   1-337   85-410 (413)
 78 protein:vir:3870 Length: 400 #  97.8 1.9E-05 1.2E-08   46.4 18.2  278   1-337  101-399 (400)
 79 protein:vir:9643 Length: 377 #  97.8 1.8E-05 1.1E-08   46.7 17.8  285   1-337   67-368 (377)
 80 protein:vir:9574 Length: 300 #  97.8 2.2E-05 1.4E-08   46.1 18.2  283  16-334    1-300 (300)
 81 protein:vir:5739 Length: 366 #  97.7 2.8E-05 1.8E-08   45.5 19.7  296   1-334   20-366 (366)
 82 protein:vir:78223 Length: 333   97.7 3.1E-05 1.9E-08   45.3 18.7  297   6-337    1-332 (333)
 83 protein:vir:6212 Length: 434 #  97.7 3.2E-05   2E-08   45.3 19.2  291   1-337  119-431 (434)
 84 protein:vir:9309 Length: 324 #  97.7 3.3E-05   2E-08   45.2 20.5  292   1-337    1-318 (324)
 85 protein:vir:2344 Length: 397 #  97.6 2.9E-05 1.8E-08   45.5 17.2  287   1-337    1-309 (397)
 86 protein:vir:4953 Length: 397 #  97.6 3.7E-05 2.3E-08   44.9 20.1  279   1-337   86-385 (397)
 87 protein:vir:104256 Length: 458  97.6 3.8E-05 2.4E-08   44.8 19.7  299   1-337  123-458 (458)
 88 protein:vir:80684 Length: 315   97.6 3.8E-05 2.4E-08   44.8 18.9  285  16-337    1-309 (315)
 89 protein:vir:3845 Length: 395 #  97.5 5.1E-05 3.2E-08   44.1 18.8  282   1-337   86-387 (395)
 90 protein:vir:102119 Length: 404  97.5 5.5E-05 3.4E-08   44.0 19.2  298   1-337   80-404 (404)
 91 protein:vir:4830 Length: 397 #  97.5 6.1E-05 3.8E-08   43.7 18.3  280   1-337   86-385 (397)
 92 protein:vir:100884 Length: 389  97.4 6.5E-05 4.1E-08   43.5 19.8  277   1-337   83-382 (389)
 93 protein:vir:8187 Length: 311 #  97.4 8.1E-05   5E-08   43.0 17.6  283  16-335    1-311 (311)
 94 protein:vir:962 Length: 397 #   97.3   9E-05 5.6E-08   42.8 16.2  272   1-337  112-397 (397)
 95 protein:vir:4856 Length: 293 #  97.3 0.00011 6.6E-08   42.4 16.9  266  16-337    1-281 (293)
 96 protein:vir:8420 Length: 477 #  97.3 8.9E-05 5.5E-08   42.8 15.9  298   1-337  115-471 (477)
 97 protein:vir:99920 Length: 311   97.3 0.00011 6.7E-08   42.3 17.4  288  16-337    1-309 (311)
 98 protein:vir:4997 Length: 397 #  97.2 0.00012 7.3E-08   42.1 19.7  284   1-337   86-388 (397)
 99 protein:vir:9704 Length: 394 #  97.2 0.00012 7.6E-08   42.0 17.5  277   1-337  103-390 (394)
100 protein:vir:1084 Length: 437 #  97.2 0.00015 9.3E-08   41.6 17.8  282   1-337  136-434 (437)
101 protein:vir:1383 Length: 421 #  97.1 0.00016   1E-07   41.4 17.2  284   1-337   92-392 (421)
102 protein:vir:93616 Length: 645   97.0 0.00022 1.4E-07   40.7 17.8  293   1-337  286-642 (645)
103 protein:vir:78350 Length: 383   97.0 0.00012 7.6E-08   42.0 14.0  292   1-337   72-377 (383)
104 protein:vir:100632 Length: 381  96.9 0.00013 8.3E-08   41.8 13.4  281   1-337   65-373 (381)
105 protein:vir:105038 Length: 428  96.8 0.00031 1.9E-07   39.8 19.6  294   1-334   83-428 (428)
106 protein:vir:9361 Length: 402 #  96.5 0.00057 3.5E-07   38.4 17.4  277   1-337   98-396 (402)
107 protein:vir:80128 Length: 466   96.4 0.00062 3.8E-07   38.2 18.2  310   1-337  123-451 (466)
108 protein:vir:96978 Length: 387   96.4 0.00066 4.1E-07   38.0 16.8  276   1-337   83-381 (387)
109 protein:vir:2685 Length: 387 #  96.4 0.00066 4.1E-07   38.0 16.8  276   1-337   83-381 (387)
110 protein:vir:94424 Length: 387   96.4 0.00066 4.1E-07   38.0 16.8  276   1-337   83-381 (387)
111 protein:vir:3033 Length: 272 #  96.0  0.0012 7.3E-07   36.6 18.1  260  16-337    1-269 (272)
112 protein:vir:9820 Length: 272 #  96.0  0.0012 7.3E-07   36.6 18.1  260  16-337    1-269 (272)
113 protein:vir:98635 Length: 377   95.1  0.0028 1.7E-06   34.6 14.6  283   1-337   67-368 (377)
114 protein:vir:78640 Length: 352   95.1  0.0029 1.8E-06   34.5 19.4  275   1-337   46-346 (352)
115 protein:vir:102082 Length: 392  94.9  0.0033 2.1E-06   34.2 20.2  279   1-337   89-384 (392)
116 protein:vir:102873 Length: 392  94.9  0.0033 2.1E-06   34.2 20.2  279   1-337   89-384 (392)
117 protein:vir:105004 Length: 392  94.9  0.0033 2.1E-06   34.2 20.2  279   1-337   89-384 (392)
118 protein:vir:107593 Length: 392  94.9  0.0033 2.1E-06   34.2 20.2  279   1-337   89-384 (392)
119 protein:vir:93881 Length: 387   94.9  0.0033 2.1E-06   34.2 17.9  274   1-337   83-381 (387)
120 protein:vir:96762 Length: 632   94.7  0.0037 2.3E-06   33.9 15.5  286   1-337  309-630 (632)
121 protein:vir:103285 Length: 296  93.8  0.0064   4E-06   32.6 14.1  273  20-335    1-296 (296)
122 protein:vir:107687 Length: 319  84.7   0.058 3.6E-05   27.4 14.2  296   1-332    1-319 (319)
123 protein:vir:80068 Length: 301   76.5    0.13 8.3E-05   25.4 17.6  283   1-332    1-301 (301)
124 protein:vir:78739 Length: 332   73.4   0.092 5.7E-05   26.3  6.5  272   1-337    1-299 (332)
125 protein:vir:78935 Length: 335   61.1    0.36 0.00022   23.0 10.3  296   1-337    1-333 (335)
126 protein:vir:104342 Length: 314  57.3    0.44 0.00027   22.6 16.7  288   1-335    3-314 (314)
127 protein:vir:79642 Length: 329   53.3    0.53 0.00033   22.1 16.9  292   1-335    6-329 (329)
128 protein:vir:2201 Length: 345 #  49.6    0.64 0.00039   21.7  7.4  301   1-333    1-345 (345)
129 protein:vir:80213 Length: 334   29.2     1.7   0.001   19.4  9.8  279   1-337    1-297 (334)
130 protein:vir:94933 Length: 330   28.7     1.7  0.0011   19.3 10.9  285   1-335    5-330 (330)

No 1
>protein:vir:78186 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family:
family:all:201 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111152;genbank:gi:134288735;genbank:GeneID:4960646 Probab=100.00 E-value=9.8e-193 Score=1073.23 Aligned_cols=337 Identities=100% Similarity=1.433 Sum_probs=335.9 Q ss_pred CChHHHHHHHHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCC Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t 80 (337) |+++||++|++|++++|++|||++++++|+|+|++||+|+++|||||+||++|||++|+|++||+|++|++|||||||+| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt 80 (337) T protein:vir:78 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) T ss_pred CChHHHHHHHHHHHHHHHhcChhhhcceeecChHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCcceeeeecC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhhh Q lcl|Aclame:pro 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) Q Consensus 81 ~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anPl 160 (337) ++++|+|++++++++++|+|+|||||+||+|++||+|||||||++|+++++.+|+|||||||||||+|+|++|||++||| T Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPl 160 (337) T protein:vir:78 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) T ss_pred CCcccccccccccCCCccEEEEeceecccCHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeccCCChhhCcC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHHHH Q lcl|Aclame:pro 161 
LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPI 240 (337) Q Consensus 161 lqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~~l 240 (337) ||||||||||++||++|+|||++++.++|+|++|+||||+||||||+|++++|||||||++||||||||||||++||||| T Consensus 161 lqDVN~GWlQ~~Re~ap~rVl~~~~~~~~~i~iG~~gdy~NLDalV~d~~~~lI~~~~~~d~dLVvivG~dLladk~~~l 240 (337) T protein:vir:78 161 LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLIGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPI 240 (337) T ss_pred ccccchHHHHHHHhcchhhhhccccccCCceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcccccceeceeeeeeeeee Q lcl|Aclame:pro 241 VNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYESSNDAYVV 320 (337) Q Consensus 241 ~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~y~s~Ne~YvV 320 (337) +|++++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||||+|||||| T Consensus 241 ~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~Ne~YvV 320 (337) T protein:vir:78 241 VNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYESSNDAYVV 320 (337) T ss_pred HhcCCCcHHHHHHHHHHHhhhhcCcceEEccccCCCceEEeechhcEEEEecCcEEEEEEeccccccccchhhccceeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eccccEEEeecceeccC Q lcl|Aclame:pro 321 EDFGCGCVAENIELAAA 337 (337) Q Consensus 321 Ed~~~~a~ieni~~~~a 337 (337) ||||++|+||||+|++| T Consensus 321 Ed~~~~a~iEnI~~~~a 337 (337) T protein:vir:78 321 EDFGCGCVAENIELAAA 337 (337) T ss_pred eccccEEEEeceeecCC Confidence 99999999999999999 No 2 >protein:vir:79171 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: 
genbank:acc:YP_001111033;genbank:gi:134288740;genbank:GeneID:4960690 Probab=100.00 E-value=1.1e-191 Score=1067.58 Aligned_cols=337 Identities=99% Similarity=1.426 Sum_probs=335.9 Q ss_pred CChHHHHHHHHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCC Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t 80 (337) |+++||++|++|++++|++|||++++++|+|+|++||+|+++|||||+||++||+++|+|++||+|++|++|||||||+| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrt~t 80 (337) T protein:vir:79 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) T ss_pred CChHHHHHHHHHHHHHHHhcChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccCcceeeeecC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhhh Q lcl|Aclame:pro 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) Q Consensus 81 ~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anPl 160 (337) ++++|+|++++++++++|+|+|||||+||+|++||+|||||||++|+++++.+|+|||||||||||+|+|++|||++||| T Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~nPl 160 (337) T protein:vir:79 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDAWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) T ss_pred CCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCcC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHHHH Q lcl|Aclame:pro 161 LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPI 240 (337) Q Consensus 161 
lqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~~l 240 (337) ||||||||||++|+++|+|||++++.++|+|++|+||||+||||||+|++++|||||||++||||||||||||++||||| T Consensus 161 lqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dLladk~~~l 240 (337) T protein:vir:79 161 LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVAICGRELLHDKYFPI 240 (337) T ss_pred ccccchhHHHHHHhcchhhhhccccccCcceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcccccceeceeeeeeeeee Q lcl|Aclame:pro 241 VNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYESSNDAYVV 320 (337) Q Consensus 241 ~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~y~s~Ne~YvV 320 (337) +|++++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||||+|||||| T Consensus 241 ~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~Ne~YvV 320 (337) T protein:vir:79 241 VNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYESSNDAYVV 320 (337) T ss_pred hccCCCcHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEEccccccccchhhccceeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eccccEEEeecceeccC Q lcl|Aclame:pro 321 EDFGCGCVAENIELAAA 337 (337) Q Consensus 321 Ed~~~~a~ieni~~~~a 337 (337) ||||++|+||||+|++| T Consensus 321 Ed~~~~a~ienI~~~~a 337 (337) T protein:vir:79 321 EDFGCGCVAENIELAAA 337 (337) T ss_pred eccccEEEEeceeecCC Confidence 99999999999999999 No 3 >protein:vir:79157 Length: 339 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165257;genbank:gi:145708082;genbank:GeneID:5247168 Probab=100.00 E-value=9.2e-192 Score=1067.92 Aligned_cols=337 Identities=63% Similarity=1.035 Sum_probs=335.2 
Q ss_pred CChHHHHHHHHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCC Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t 80 (337) |+++||++|++|++++|++|||++++++|+|+|++||+|+++|||||+||++|||++|+|++||+|++|++||||||||| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt 80 (339) T protein:vir:79 1 MRNDTRRLFAAYKAAIAKLNGVERVDEKFSVAPSVQQKLETKVQESSDFLKSINFYGVPEQEGEKIGLGVSGPVASTTDT 80 (339) T ss_pred CChHHHHHHHHHHHHHHHHhCcccccceeeecHHHHHHHHHHHHHHHHHhccCcccccccceeeEEeeccCcceeecccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhhh Q lcl|Aclame:pro 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) Q Consensus 81 ~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anPl 160 (337) ++++|+|++++++++++|+|+|||||+||+|++||+|||||||++|+++++.+|+|||||||||||+|+|++|||++||| T Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPl 160 (339) T protein:vir:79 81 TQQDRETSDISTMDGRRYRCEQTNSDTHITYQKLDAWAKFADFQTRIRDAIIKRQALDRIMIGFNGVSRAATSDRVANPM 160 (339) T ss_pred CCCCcccccccccCCCccEEEEeeeeceecHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeecCCChhhCcC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhccchhHHHHHHHhchhhhccccccccCceee-cCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHHH Q lcl|Aclame:pro 161 LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLV-GKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFP 239 (337) Q Consensus 161 lqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~-g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~~ 239 (337) ||||||||||++||++|+|||++++.+++||++ |+||||+||||||+|++++|||||||++||||||||||||++|||| T Consensus 161 
lqDVN~GWlQ~~Re~ap~rV~~~g~~~s~~i~~~G~ggdy~NLDalV~d~~~~lId~~~~~d~dLVvivG~dLla~k~~~ 240 (339) T protein:vir:79 161 LQDVNKGWLQNLREQAPQRVMKEGKAAAGKITVGGAGADYGNLDALVYDITNHLVEPWYAEDPDLVVVCGRNLLSDKYFP 240 (339) T ss_pred ccccchhHHHHHHhhhhhhhhccceeccceeEeccCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhhHhhh Confidence 999999999999999999999999989999988 9999999999999999999999999999999999999999999999 Q ss_pred HHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcccccceeceeeeeeeee Q lcl|Aclame:pro 240 IVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYESSNDAYV 319 (337) Q Consensus 240 l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~y~s~Ne~Yv 319 (337) |+|++++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||||+||||| T Consensus 241 l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~Ne~Yv 320 (339) T protein:vir:79 241 LVNRDRDPVQQIAADLIISQKRIGNLPAIRVPYFPANGLLVTRLDNLSIYYQEGGRRRTILDNAKRDRIENYESSNDAYV 320 (339) T ss_pred HhhcCCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEeccccccccchhhccceee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeccccEEEeecceeccC Q lcl|Aclame:pro 320 VEDFGCGCVAENIELAAA 337 (337) Q Consensus 320 VEd~~~~a~ieni~~~~a 337 (337) |||||++|+||||+|++| T Consensus 321 VEd~~~~a~iEni~~~~a 338 (339) T protein:vir:79 321 IEDLACAAMAENIALAAA 338 (339) T ss_pred eeccccEEEeeeeecccC Confidence 999999999999999999 No 4 >protein:vir:104011 Length: 337 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293748;genbank:gi:72537718;genbank:GeneID:3608142 Probab=100.00 E-value=1.3e-191 Score=1067.04 Aligned_cols=337 Identities=100% Similarity=1.434 Sum_probs=335.9 Q ss_pred CChHHHHHHHHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCC Q lcl|Aclame:pro 1 
MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t 80 (337) |+++||++|++|++++|++|||++++++|+|+|++||+|+++|||||+||++||+++|+|++||+|++|++|||||||+| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrt~t 80 (337) T protein:vir:10 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) T ss_pred CChHHHHHHHHHHHHHHHhcChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccCcceeeeecC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhhh Q lcl|Aclame:pro 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) Q Consensus 81 ~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anPl 160 (337) ++++|+|++++++++++|+|+|||||+||+|++||+|||||||++|+++++.+|+|||||||||||+|+|++|||++||| T Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~nPl 160 (337) T protein:vir:10 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) T ss_pred CCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCcC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHHHH Q lcl|Aclame:pro 161 LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPI 240 (337) Q Consensus 161 lqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~~l 240 (337) ||||||||||++|+++|+|||++++.++|+|++|+||||+||||||+|++++|||||||++||||||||||||++||||| T Consensus 161 lqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dLladk~~~l 240 (337) T protein:vir:10 161 
LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPI 240 (337) T ss_pred ccccchhHHHHHHhcchhhhhccccccCcceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcccccceeceeeeeeeeee Q lcl|Aclame:pro 241 VNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYESSNDAYVV 320 (337) Q Consensus 241 ~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~y~s~Ne~YvV 320 (337) +|++++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||||+|||||| T Consensus 241 ~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~Ne~YvV 320 (337) T protein:vir:10 241 VNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYESSNDAYVV 320 (337) T ss_pred hccCCCcHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEEccccccccchhhccceeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eccccEEEeecceeccC Q lcl|Aclame:pro 321 EDFGCGCVAENIELAAA 337 (337) Q Consensus 321 Ed~~~~a~ieni~~~~a 337 (337) ||||++|+||||+|++| T Consensus 321 Ed~~~~a~ienI~~~~a 337 (337) T protein:vir:10 321 EDFGCGCVAENIELAAA 337 (337) T ss_pred eccccEEEEeceeecCC Confidence 99999999999999999 No 5 >protein:vir:100331 Length: 342 # NCBI annotation: major capsid protein N # Family: family:all:201 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655472;genbank:gi:109289940;genbank:GeneID:4157374 Probab=100.00 E-value=7.9e-190 Score=1057.30 Aligned_cols=336 Identities=52% Similarity=0.862 Sum_probs=331.6 Q ss_pred CChHHHHHHHHHHHHHHHhhCch----hhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeeccccccccc Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTG----DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIAS 76 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~----~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~ 76 (337) 
|+++||++|++|++++|++|||+ +++++|+|+|++||+|+++|||||+||++|||++|+|++||+|++|++||||| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iag 80 (342) T protein:vir:10 1 MKDLTLEKYNAYLARQAELNNLPFNALATGIKFTVQPSVQQKLYEKVRESSDFLKSISFVFVDEQTGETLGLDSAHTVAS 80 (342) T ss_pred CChHHHHHHHHHHHHHHHHhCCChhHccccceeecChHHHHHHHHHHHHHHHHhccCcccccccceeeEEecccCccccc Confidence 99999999999999999999998 78899999999999999999999999999999999999999999999999999 Q ss_pred ccCCC-CcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCCh Q lcl|Aclame:pro 77 RTDTT-KAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDR 155 (337) Q Consensus 77 Rt~t~-~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~ 155 (337) ||||+ +++|+|++++++++++|+|+|||||+||+|++||+|||||||++|+++++.+|+|||||||||||+|+|++||| T Consensus 81 rtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~ 160 (342) T protein:vir:10 81 TTDTSGDGERKTTSIAKLVKQTYHCQQINFDTHINYKQLDMWAKFPDFQQKVANVAAKQRKRDLIMIGFNGTSRAATSDR 160 (342) T ss_pred ccccCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeccCCCh Confidence 99987 46899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHH Q lcl|Aclame:pro 156 QANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHD 235 (337) Q Consensus 156 ~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~ 235 (337) ++|||||||||||||++|+++|+|||++++ .+++|++|+||||+||||||+|++++|||||||++||||||||||||+| T Consensus 161 ~~nPllqDVN~GWlQ~~Re~ap~rv~~~~~-~~~~i~iG~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLlad 239 (342) T protein:vir:10 161 NSNPLLQDVAKGWLQKMREDAKERVMNGES-TDNQVLVGKGQEYANLDALVMDATEELIDEWHRDDTDLVVITGRKLLAD 239 (342) T ss_pred hhCcCccccchHHHHHHHhhhhhhhcccce-eccceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHH Confidence 
999999999999999999999999999887 4799999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcccccceeceeeee Q lcl|Aclame:pro 236 KYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYESSN 315 (337) Q Consensus 236 k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~y~s~N 315 (337) |||||+|++++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+|++||+++|||+||||||||||| T Consensus 240 k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~N 319 (342) T protein:vir:10 240 KYFPIVNQQNAPTEELAADIVISQKRIGGLKAVRVPFFPANAILITKLENLAIYVQEGTTRKHIENVPKKDRIETYESEN 319 (342) T ss_pred HHHHHHhcCCChHHHHHHHHHHhhhhhcCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchhhhc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 316 DAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 316 e~YvVEd~~~~a~ieni~~~~a 337 (337) |||||||||++|+||||+|+|+ T Consensus 320 e~YvVEd~~~~a~iE~i~i~~~ 341 (342) T protein:vir:10 320 IDYVVEDYGCAALIENITLKDK 341 (342) T ss_pred cceeeeccccEEEeecceecCC Confidence 9999999999999999999999 No 6 >protein:vir:6061 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878202;genbank:gi:33438901;genbank:GeneID:1457736 Probab=100.00 E-value=1.6e-189 Score=1055.56 Aligned_cols=337 Identities=55% Similarity=0.955 Sum_probs=330.5 Q ss_pred CChHHHHHHHHHHHHHHHhhCch--hhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeeccccccccccc Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTG--DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRT 78 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~--~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt 78 (337) |+++||++|++|++++|++|||+ +++++|+|+|++||+|+++|||||+||++|||++|+|++||+|++|++||||||| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrt 80 (357) T 
protein:vir:60 1 MRQETRFKFNAYLSRVAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTT 80 (357) T ss_pred CChHHHHHHHHHHHHHHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccc Confidence 99999999999999999999996 6889999999999999999999999999999999999999999999999999999 Q ss_pred CCCC-cccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhh Q lcl|Aclame:pro 79 DTTK-AARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQA 157 (337) Q Consensus 79 ~t~~-~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~a 157 (337) ||++ ++|+|++++++++++|+|+|||||+||+|++||+|||||||++|+++++.+|+|||||||||||+|+|++|||++ T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~ 160 (357) T protein:vir:60 81 DTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDLIMAGFNGVRRAETSDRSS 160 (357) T ss_pred ccCCCCCcccccccccCCCccEEEEeeeeccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChhh Confidence 9976 689999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhccchhHHHHHHHhchhhhccccccccCc-----eeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHH Q lcl|Aclame:pro 158 NPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGK-----VLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) Q Consensus 158 nPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~-----i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dL 232 (337) |||||||||||||++|+++|+|||++++..+|+ |++|+||||+||||||+|++++|||||||++||||||||||| T Consensus 161 nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dL 240 (357) T protein:vir:60 161 NQMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQL 240 (357) T ss_pred CcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 999999999999999999999999987665554 899999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcccccceecee Q lcl|Aclame:pro 233 
LHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYE 312 (337) Q Consensus 233 l~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~y~ 312 (337) |++|||||+|++++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||| T Consensus 241 la~k~~~l~n~~~~pTE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~riE~y~ 320 (357) T protein:vir:60 241 LADKYFPIVNREQDNSEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMDDSHRRVIEENPKLDRVENYE 320 (357) T ss_pred hhHHhhhHhhcCCChHHHHHHHHHHHhhhhcCcceEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 313 SSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 313 s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) ||||||||||||++|+||||+++++ T Consensus 321 s~Ne~YvVEd~~~~a~iE~i~~~~~ 345 (357) T protein:vir:60 321 SMNIDYVVEDYAAGCLVEKIKVGDF 345 (357) T ss_pred hhcceeeeeccccEEEeeeeeeccC Confidence 9999999999999999999999987 No 7 >protein:vir:2016 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046760;genbank:gi:9630331;genbank:GeneID:1261541 Probab=100.00 E-value=1.5e-189 Score=1055.73 Aligned_cols=337 Identities=55% Similarity=0.966 Sum_probs=330.4 Q ss_pred CChHHHHHHHHHHHHHHHhhCch--hhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeeccccccccccc Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTG--DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRT 78 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~--~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt 78 (337) |+++||++|++|++++|++|||+ +++++|+|+|++||+|+++|||||+||++|||++|+|++||+|++|++||||||| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrt 80 (357) T protein:vir:20 1 MRQETRFKFNAYLSRVAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTT 80 (357) T ss_pred 
CChHHHHHHHHHHHHHHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccc Confidence 99999999999999999999996 6889999999999999999999999999999999999999999999999999999 Q ss_pred CCCC-cccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhh Q lcl|Aclame:pro 79 DTTK-AARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQA 157 (337) Q Consensus 79 ~t~~-~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~a 157 (337) +|++ ++|+|++++++++++|+|+|||||+||+|++||+|||||||++||++++.+|+|||||||||||+|+|++|||++ T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~ 160 (357) T protein:vir:20 81 DTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRIRNAIIKRQSLDFIMAGFNGVKRAETSDRSS 160 (357) T ss_pred cCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChhh Confidence 9976 689999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhccchhHHHHHHHhchhhhccccccccCc-----eeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHH Q lcl|Aclame:pro 158 NPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGK-----VLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) Q Consensus 158 nPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~-----i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dL 232 (337) |||||||||||||++|+++|+|||++++..+|+ |++|+||||+||||||+|++++|||||||++||||||||||| T Consensus 161 nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dL 240 (357) T protein:vir:20 161 NPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGRTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQL 240 (357) T ss_pred CcCccccchhHHHHHHhhchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 999999999999999999999999987665554 889999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcccccceecee Q lcl|Aclame:pro 233 LHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYE 312 (337) Q Consensus 233 
l~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~y~ 312 (337) |++|||||+|++++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||| T Consensus 241 la~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~riE~y~ 320 (357) T protein:vir:20 241 LADKYFPIVNKEQDNSEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMDDSHRRVIEENPKLDRVENYE 320 (357) T ss_pred hhhhhhhHhhccCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 313 SSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 313 s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) ||||||||||||++|+||||+++++ T Consensus 321 s~Ne~YvVEd~~~~a~iE~i~~~~~ 345 (357) T protein:vir:20 321 SMNIDYVVEDYAAGCLVEKIKVGDF 345 (357) T ss_pred hhcceeeeeccccEEEeeeeeeccc Confidence 9999999999999999999999987 No 8 >protein:vir:5694 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839853;genbank:gi:30065708;genbank:GeneID:1260602 Probab=100.00 E-value=1.7e-189 Score=1055.55 Aligned_cols=337 Identities=55% Similarity=0.963 Sum_probs=330.6 Q ss_pred CChHHHHHHHHHHHHHHHhhCch--hhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeeccccccccccc Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTG--DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRT 78 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~--~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt 78 (337) |+++||++|++|++++|++|||+ +++++|+|+|++||+|+++|||||+||++|||++|+|++||+|++|++||||||| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrt 80 (357) T protein:vir:56 1 MRQETRFKFNAYLSRVAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTT 80 (357) T ss_pred CChHHHHHHHHHHHHHHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccc Confidence 99999999999999999999996 
6889999999999999999999999999999999999999999999999999999 Q ss_pred CCCC-cccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhh Q lcl|Aclame:pro 79 DTTK-AARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQA 157 (337) Q Consensus 79 ~t~~-~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~a 157 (337) +|++ ++|+|++++++++++|+|+|||||+||+|++||+|||||||++|+++++.+|+|||||||||||+|+|++|||++ T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~ 160 (357) T protein:vir:56 81 DTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDFIMAGFNGVKRAETSDRSS 160 (357) T ss_pred cCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChhh Confidence 9976 689999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhccchhHHHHHHHhchhhhccccccccCc-----eeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHH Q lcl|Aclame:pro 158 NPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGK-----VLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) Q Consensus 158 nPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~-----i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dL 232 (337) |||||||||||||++|+++|+|||++++..+|+ |++|+||||+||||||+|++++|||||||++||||||||||| T Consensus 161 nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dL 240 (357) T protein:vir:56 161 NPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQL 240 (357) T ss_pred CcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 999999999999999999999999987665554 889999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcccccceecee Q lcl|Aclame:pro 233 LHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYE 312 (337) Q Consensus 233 l~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~y~ 312 (337) 
|++|||||+|++++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||| T Consensus 241 la~k~~~l~n~~~~pTE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~riE~y~ 320 (357) T protein:vir:56 241 LADKYFPIVNKEQDNSEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMDDSHRRVIEENPKLDRVENYE 320 (357) T ss_pred hhhhhhhHhhccCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 313 SSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 313 s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) ||||||||||||++|+||||+++++ T Consensus 321 s~Ne~YvVEd~~~~a~iE~i~i~~~ 345 (357) T protein:vir:56 321 SMNIDYVVEDYAAGCLVEKIKVGDF 345 (357) T ss_pred hhcceeeeeccccEEEeeeeeeccC Confidence 9999999999999999999999988 No 9 >protein:vir:98566 Length: 355 # NCBI annotation: gp5 # Family: family:all:201 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958060;genbank:gi:41057357;genbank:GeneID:2744237 Probab=100.00 E-value=1.1e-188 Score=1050.94 Aligned_cols=337 Identities=54% Similarity=0.905 Sum_probs=330.3 Q ss_pred CChHHHHHHHHHHHHHHHhhCch--hhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeeccccccccccc Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTG--DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRT 78 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~--~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt 78 (337) |+++||++|++|++++|++|||+ +++++|+|+|++||+|+++|||||+||++||+++|+|++||+|++|++||||||| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv~g~iagrt 80 (355) T protein:vir:98 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) T ss_pred CChHHHHHHHHHHHHHHHHhCCChhHccceeecCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEeeeccCccccccc Confidence 99999999999999999999995 6899999999999999999999999999999999999999999999999999999 Q ss_pred 
CCCC-cccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhh Q lcl|Aclame:pro 79 DTTK-AARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQA 157 (337) Q Consensus 79 ~t~~-~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~a 157 (337) +|++ ++|+|++++++++++|+|+|||||+||+|++||+|||||||++|+++++.+|+|||||||||||+|+|++|||++ T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~Td~~~ 160 (355) T protein:vir:98 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) T ss_pred cCCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeeccCChhh Confidence 9984 689999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhccchhHHHHHHHhchhhhcccccccc-----CceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHH Q lcl|Aclame:pro 158 NPLLQDVNIGWLQQYRERAAQRVLHEGAKQA-----GKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) Q Consensus 158 nPllqDVN~GWlq~~Re~a~~~v~~~~~~~~-----~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dL 232 (337) |||||||||||||++|+++|+|||++++..+ ++|++|+||||+||||||+|++++|||||||++||||||||||| T Consensus 161 nPllqDVNkGWlQ~~Re~ap~~v~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~D~~~~lI~~~~~~d~dLVvivG~dL 240 (355) T protein:vir:98 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKL 240 (355) T ss_pred CcCccccchhHHHHHHhcchhhhhhhhcccCccccccceeeCCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 9999999999999999999999999886544 56789999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcccccceecee Q lcl|Aclame:pro 233 LHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYE 312 (337) Q Consensus 233 l~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~y~ 312 (337) |++|||||+|++++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||| T Consensus 241 
la~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~ 320 (355) T protein:vir:98 241 LADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPKKDRVENYE 320 (355) T ss_pred hHHHhhhHhhccCCcHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 313 SSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 313 s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) ||||||||||||++|+||||+|+++ T Consensus 321 s~Ne~YvVEd~~~~a~ienI~~~~~ 345 (355) T protein:vir:98 321 SMNIDYVVEVYAAGCLLENITLGDF 345 (355) T ss_pred hhcceeeeeccccEEEeeceeeeCC Confidence 9999999999999999999999988 No 10 >protein:vir:1829 Length: 355 # NCBI annotation: major capsid protein # Family: family:all:201 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052253;genbank:gi:9634060;genbank:GeneID:1262428 Probab=100.00 E-value=2e-188 Score=1049.56 Aligned_cols=337 Identities=55% Similarity=0.915 Sum_probs=330.5 Q ss_pred CChHHHHHHHHHHHHHHHhhCch--hhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeeccccccccccc Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTG--DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRT 78 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~--~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt 78 (337) |+++||++|++|++++|++|||+ +++++|+|+|++||+|+++|||||+||++||+++|+|++||+|++|++||||||| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv~g~iagrt 80 (355) T protein:vir:18 1 MRQETRFKFNAYLTQLAKLNGISVDDVSKKFTVEPSVTQTLMNTVQASSAFLQMINILPVAEMKGEKIGVGVTGTIASTT 80 (355) T ss_pred CChHHHHHHHHHHHHHHHHhCCChhHccceeccCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEEeeccCcceeecc Confidence 99999999999999999999995 7899999999999999999999999999999999999999999999999999999 Q ss_pred CCCC-cccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhh Q lcl|Aclame:pro 79 
DTTK-AARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQA 157 (337) Q Consensus 79 ~t~~-~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~a 157 (337) +|++ ++|+|++++++++++|+|+|||||+||+|++||+|||||||++|+++++.+|+|||||||||||+|+|++|||++ T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~Td~~~ 160 (355) T protein:vir:18 81 DTSGDKERQTADFTALESNKYECNQINFDFHLTYKRLDLWARFQDFQRRIRDAIVQRQALDFIMAGFNGTTRADTSDRVK 160 (355) T ss_pred ccCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeeccCChhh Confidence 9985 689999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhccchhHHHHHHHhchhhhccccccc-----cCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHH Q lcl|Aclame:pro 158 NPLLQDVNIGWLQQYRERAAQRVLHEGAKQ-----AGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) Q Consensus 158 nPllqDVN~GWlq~~Re~a~~~v~~~~~~~-----~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dL 232 (337) |||||||||||||++|+++|+|||++++.. 
+++|++|+||||+||||||+|++++|||||||++||||||||||| T Consensus 161 nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~d~~~~lI~~~~~~d~dLVvivG~dL 240 (355) T protein:vir:18 161 NPMLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENLDALVMDGTNTLIDEIYQDDPKLVAIVGRKL 240 (355) T ss_pred CcCccccchhHHHHHHhcchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 999999999999999999999999988654 456899999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcccccceecee Q lcl|Aclame:pro 233 LHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYE 312 (337) Q Consensus 233 l~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~y~ 312 (337) |++|||||+|++++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||| T Consensus 241 la~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~ 320 (355) T protein:vir:18 241 LADKYFPLVNKQQENTESLAADIIISQKRIGNLPAVRVPYFPANAVFVTTLENLSIYFMDESHRRSIDENPKKDRVENYE 320 (355) T ss_pred hHHHHhHHhhccCChHHHHHHHHHHHHHhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 313 SSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 313 s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) ||||||||||||++|+||||+|+++ T Consensus 321 s~Ne~YvVEd~~~~a~ieni~~~~~ 345 (355) T protein:vir:18 321 SMNIDYVVEAYAAGCLLENITLGDF 345 (355) T ss_pred hhcceeeeeccccEEEEeeeeecCC Confidence 9999999999999999999999998 No 11 >protein:vir:1153 Length: 338 # NCBI annotation: predicted major capsid protein # Family: family:all:201 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490602;genbank:gi:17313222;genbank:GeneID:927319 Probab=100.00 E-value=2.2e-187 Score=1043.96 Aligned_cols=335 Identities=65% Similarity=1.058 Sum_probs=329.4 Q ss_pred 
CChHHHHHHHHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCC Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t 80 (337) |+++||++|++|++++|++|||++++++|+|+|++||+|+++|||||+||++||+++|+|++||+|++|++|||||||+| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrtdT 80 (338) T protein:vir:11 1 MRNETRKQFDAYLAQLAKLNGVNSAVQTFAVEPSVQQKLEQRIQESSEFLKQINVYGVDELQGEKIGIGVSGTIASRTDT 80 (338) T ss_pred CCHHHHHHHHHHHHHHHHHhCCCcccceeeeCHHHHHHHHHHHHHHHHhhccCceecccceeeeEeeeccCccccccccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CC-cccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhh Q lcl|Aclame:pro 81 TK-AARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANP 159 (337) Q Consensus 81 ~~-~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anP 159 (337) +. 
.+|+|++++++++++|+|+|||||+||+|++||+|||||||++|+++++.+|+|||||||||||+|+|++|||++|| T Consensus 81 ~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfnG~s~A~~Td~~~nP 160 (338) T protein:vir:11 81 TGDGVRKPRDVSALDNQRYECKHTDFDTAITYAMLDAWAKFPEFQALLRDAILKRQALDRLMIGFNGTSAAATTNRAANP 160 (338) T ss_pred CCCCccccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCc Confidence 75 46999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhccchhHHHHHHHhchhhhccccccccCceeecCC--cccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHH Q lcl|Aclame:pro 160 LLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKA--GDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKY 237 (337) Q Consensus 160 llqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~g--gdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~ 237 (337) |||||||||||++|+++|+|||++++ .+++|.+|.| |||+||||||+|++++|||||||++||||||||||||++|| T Consensus 161 llqDVNkGWlQ~~Re~ap~rv~~~~~-~~~~i~i~~g~~gdy~nLDalV~d~~~~lI~~~~~~d~dLVvivG~dLladk~ 239 (338) T protein:vir:11 161 LLQDVNIGWFQQYRNNAPARVLKEGK-TTGKVVVGNGADADYKNLDALVFDVVSSLIDPWHRRDPGLVVILGRELVHDKY 239 (338) T ss_pred CccccchhHHHHHHhhhhhhhhhccc-ccceeeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHH Confidence 99999999999999999999999986 5788988655 99999999999999999999999999999999999999999 Q ss_pred HHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcccccceeceeeeeee Q lcl|Aclame:pro 238 FPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYESSNDA 317 (337) Q Consensus 238 ~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~y~s~Ne~ 317 (337) |||+|++++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||||+||| T Consensus 240 ~~l~n~~~~ptE~~Aa~~~~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~Ne~ 319 (338) T protein:vir:11 240 FPMVNKDQPATEKIATDLILSQKRMGGLPPVEVPYVPEKGLMVTTLKNLSLYWQIGGRRRYLKEVPEKNRIENYESSNDA 319 (338) T ss_pred hHHHhcCCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchhhhccc Confidence 
99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeccccEEEeecceecc Q lcl|Aclame:pro 318 YVVEDFGCGCVAENIELAA 336 (337) Q Consensus 318 YvVEd~~~~a~ieni~~~~ 336 (337) |||||||++|+||||+|+| T Consensus 320 YvVEd~~~~a~ieni~~~~ 338 (338) T protein:vir:11 320 YVVEDYGLGCLVENIEVAE 338 (338) T ss_pred eeeeccccEEEeecceecC Confidence 9999999999999999999 No 12 >protein:vir:78777 Length: 358 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285647;genbank:gi:148727153;genbank:GeneID:5220125 Probab=100.00 E-value=1.9e-182 Score=1016.90 Aligned_cols=330 Identities=29% Similarity=0.479 Sum_probs=320.8 Q ss_pred CChHHHHHHHHHHHHHHHhhCch--hhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeeccccccccccc Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTG--DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRT 78 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~--~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt 78 (337) |+++||++|++|++++|++|||+ +++++|+|+||+||+|+++|||||+||++|||++|+|++||+|++|++|+||||| T Consensus 5 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~Fsv~p~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrt 84 (358) T protein:vir:78 5 LTVQAEQRLNKYCDALAKAYGIDISKLDKQFSVTGPVETTLRSALLASVEFLGLITCLDVDQIKGQVVQVGVGQLYTGRK 84 (358) T ss_pred ccHHHHHHHHHHHHHHHHHhCCChhHccceeeeChHHHHHHHHHHHHHHHHhhcCcccccccceeeEEeecCCcccceec Confidence 99999999999999999999994 7899999999999999999999999999999999999999999999999999999 Q ss_pred CCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCCh---hHHHHHHHHHHHHHhhhhHHhcccccccCCcCCh Q lcl|Aclame:pro 79 DTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFA---DFQQRIRDVILNQGALDRIMIGWNGVKAAATTDR 155 (337) Q Consensus 79 ~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~---dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~ 155 (337) +| |+|++++++++++|+|+|||||+||+|++||+||||| ||++||++++.+|+|||||||||||+|+|++||| T Consensus 85 ~t----r~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~f~~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~ 160 (358) T 
protein:vir:78 85 KG----GRFKGKVGVDGNTYELTETDSCASLDWATLCTWANAGSEGEFIKLVGEFVNKAFALDMLRVGWNGVSAADDTDP 160 (358) T ss_pred CC----CccccccccCCCccEEEEeceeeeccHHHHHHHHhCCChhHHHHHHHHHHHHHHhhccceecccceeeccCCCh Confidence 98 8899999999999999999999999999999999998 8999999999999999999999999999999999 Q ss_pred hhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCC--cccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHH Q lcl|Aclame:pro 156 QANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKA--GDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELL 233 (337) Q Consensus 156 ~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~g--gdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl 233 (337) ++|||||||||||||++|+++|+|||++++. +++|++|+| |||+||||||+|++++|||||||++|||||||||||| T Consensus 161 ~~nPllqDVN~GWlQ~~Re~a~~~v~~~~~~-~~~i~ig~g~~Gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLl 239 (358) T protein:vir:78 161 TANPLGQDVNKGWHQLAREWKGGSQIIKAAA-GEKIYFDPDGKGEYKTLDEMASDLINTTIDPLFQQDPRLVVLVGTDLV 239 (358) T ss_pred hhCcCccccchHHHHHHHhhchhhhhccccc-cCceeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEEchhhh Confidence 9999999999999999999999999999885 467777755 9999999999999999999999999999999999999 Q ss_pred HHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcccccceeceee Q lcl|Aclame:pro 234 HDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYES 313 (337) Q Consensus 234 ~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~y~s 313 (337) ++|||||+|++++|||++|+|+++ |+||||||++|||||+++||||+|||||||||+|++||+++|||+||||||||| T Consensus 240 a~k~~~l~n~~~~pTE~~Aa~~i~--k~iGGlpa~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~riE~y~s 317 (358) T protein:vir:78 240 AAAQAKLYSEATKPSEQIAAQQLA--KSIAGRKAYIPPFFPGKRMVVTTLDNLHCYTQRGTRKRKADDNQDSKSFDNQYW 317 (358) T ss_pred hHHhhhHhhcCCCcHHHHHHHHHH--HHhCCCeEEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchhh Confidence 999999999999999999999985 899999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 314 SNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 314 
~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) |||||||||||++|+||||+|..+ T Consensus 318 ~Ne~YvVEd~~~~a~iE~i~v~~~ 341 (358) T protein:vir:78 318 RMEGYALGEHKAYGGFEEADIEIG 341 (358) T ss_pred hcceeeeeccccEEEEeeeeeeeC Confidence 999999999999999999998743 No 13 >protein:vir:98856 Length: 343 # NCBI annotation: hypothetical protein # Family: family:all:201 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654732;genbank:gi:109302917;genbank:GeneID:4156061 Probab=100.00 E-value=1.5e-176 Score=984.44 Aligned_cols=328 Identities=29% Similarity=0.415 Sum_probs=314.8 Q ss_pred CChHHHHHHHHHHHHHHHhhCch----hhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeeccccccccc Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTG----DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIAS 76 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~----~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~ 76 (337) |+++||++|++|++++|++|||+ +++++|+|+||+||+|+++|||||+||++|||++|+|++|+++.+|.+|+++| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~q~~g~v~~~~~sg~~t~ 80 (343) T protein:vir:98 1 MNKTAQELFYSLIGDAAEYYGANPALALAGKQFSIEAPKESVLLGAIQQRSNFLEKINCVFSERYQRAIDLRSNRKRHYG 80 (343) T ss_pred CChHHHHHHHHHHHHHHHHhCCccchhccCceeeecHHHHHHHHHHHHHHHHHhhcCceecchhhcceEEEeecCccccC Confidence 99999999999999999999996 67899999999999999999999999999999999999999999999999999 Q ss_pred ccCCC-Cc-ccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChh-HHHHHHHHHHHHHhhhhHHhcccccccCCcC Q lcl|Aclame:pro 77 RTDTT-KA-ARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFAD-FQQRIRDVILNQGALDRIMIGWNGVKAAATT 153 (337) Q Consensus 77 Rt~t~-~~-~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~d-F~~r~~~~i~~~~aLD~i~IGfnG~s~A~~T 153 (337) |++|. 
++ +|.| .++++|+|+|||||+||+|++||+|||||| |++|+++++.+|+|||||||||||+|+|++| T Consensus 81 r~~t~~~~~~~~~-----~~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~deF~~r~~~~i~~~~ALD~i~IGfNGts~A~~T 155 (343) T protein:vir:98 81 AHDRRTPIQQRWT-----RQVMSMNVSRQIQACLIPWAKLDQWGHLKDKFASLYAEFVQNQIALDMIKIGFYGTSVGTDT 155 (343) T ss_pred ccccCCCcccccc-----CCCCccEEEEeeeeeeccHHHHHHhhcChhHHHHHHHHHHHHHHhhccceecccceeeccCC Confidence 99884 33 5644 455689999999999999999999999998 9999999999999999999999999999999 Q ss_pred ChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHH Q lcl|Aclame:pro 154 DRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELL 233 (337) Q Consensus 154 d~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl 233 (337) +|||||||||||||++||++|+|||++++.+++++.+|+||||+||||||+|+++ +||||||++|||||||||||| T Consensus 156 ---~nPllqDVN~GWLQ~~Re~ap~rVm~~~~~~~~~~~~G~ggdy~NLDalV~D~~~-~I~~~~~~d~dLVvivG~dLl 231 (343) T protein:vir:98 156 ---SDPNLADVNKGWIQFVRENKATQILTQGATSGEIRLFGEGADYVNLDELAYDLKQ-GLDARHRDAGDLVFLVGADLV 231 (343) T ss_pred ---CCcchhhcchHHHHHHHhcchhhhhccceeccceeEecCCCCcccHHHHHHHHHh-cCchHHhcCCCEEEEEchhhh Confidence 6999999999999999999999999999887777889999999999999999985 899999999999999999999 Q ss_pred HHHHHHHHhc-cCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcccccceecee Q lcl|Aclame:pro 234 HDKYFPIVNA-TQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYE 312 (337) Q Consensus 234 ~~k~~~l~n~-~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~y~ 312 (337) ++|||||+|+ +++|||++|+++++++|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||| T Consensus 232 a~~~~~l~n~~~~~ptEk~Aa~~~~~~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~ 311 (343) T protein:vir:98 232 AKEASLVYKGNGLIATEKAALNTHDLMKSFGGMPAMIVPNMPPRAAIVTSLSNLSIYTQEGSMRRGMKDDDDKKAVRDSY 311 (343) T ss_pred hhhhhhhhhhcCCChHHHHHHHHHHHHHhhCCCeeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchh Confidence 9999999997 
679999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 313 SSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 313 s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) ||||||||||||++|+||||+++-+ T Consensus 312 s~Ne~YvVEd~~~~a~iE~i~v~~~ 336 (343) T protein:vir:98 312 YRNEAYAVEDCGKFMAVDFTKVKLS 336 (343) T ss_pred hhcceeeeeccccEEEeeeeeeeec Confidence 9999999999999999999999888 No 14 >protein:vir:3746 Length: 336 # NCBI annotation: orf15 # Family: family:all:201 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043487;genbank:gi:9628622;genbank:GeneID:1261135 Probab=100.00 E-value=4e-176 Score=982.15 Aligned_cols=326 Identities=30% Similarity=0.443 Sum_probs=312.5 Q ss_pred HHHHHHHHHHHHHHHhhCchh----hcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccC Q lcl|Aclame:pro 4 ETRQAYEKYAAQIAKLNDTGD----VSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTD 79 (337) Q Consensus 4 ~tr~~~~~y~~~~a~~ngv~~----~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~ 79 (337) -||++|++|++++|++|||++ ++++|+|+|++||+|+++|||||+||++||+++|+|++||+|++|++|||||||+ T Consensus 1 mtr~~~~~y~~~~A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtd 80 (336) T protein:vir:37 1 MNKQAYYALAAALAKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKQINMIQVAHTKGQKLFGATEKGVTGRKQ 80 (336) T ss_pred CcHHHHHHHHHHHHHHhCCChhhhccCceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEeeeccCcccccccC Confidence 677999999999999999964 4589999999999999999999999999999999999999999999999999999 Q ss_pred CCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHH-HHHHHHHHHHHhhhhHHhcccccccCCcCChhhh Q lcl|Aclame:pro 80 TTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQ-QRIRDVILNQGALDRIMIGWNGVKAAATTDRQAN 158 (337) Q Consensus 80 t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~-~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~an 158 (337) |+ |+|+++ ++++++|+|+|||||+||+|++||+|||||||+ .+++.++.+|+|||||||||||+|+|++|| | T Consensus 81 t~---R~~~~~-~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~df~~~~~~~~~~r~iALD~i~IGfnG~s~A~~Td---n 153 (336) T 
protein:vir:37 81 TG---RNLANL-DHTQNGFELAETDSGIIVPWALFDSFAIFKDRLVELYSEYFQNQVALDILQIGWNGQSVADNTT---K 153 (336) T ss_pred CC---cccccc-CcCCcccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHHhhchhhhcccceeeccCCC---C Confidence 96 666675 899999999999999999999999999999966 567778888999999999999999999998 9 Q ss_pred hhhhccchhHHHHHHHhchhhhccccccccCcee-ecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHH Q lcl|Aclame:pro 159 PLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVL-VGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKY 237 (337) Q Consensus 159 PllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~-~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~ 237 (337) ||||||||||||++||++|+|||++++.++|||. +|+||||+||||||+|+++ +||||||++||||||||||||++|| T Consensus 154 PllqDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~-~I~~~~~~d~dLVvivG~dLla~~~ 232 (336) T protein:vir:37 154 ADLSDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQ-GLDFRHQNRNDLVFLVGADLVSKET 232 (336) T ss_pred CcccccchhHHHHHHhccchhhcccccccCCceEEecCCCCcccHHHHHHHHHh-cCchHHhcCCCeEEEEchhhhhhhh Confidence 9999999999999999999999999988889975 5999999999999999997 6899999999999999999999999 Q ss_pred HHHHhc-cCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcccccceeceeeeee Q lcl|Aclame:pro 238 FPIVNA-TQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYESSND 316 (337) Q Consensus 238 ~~l~n~-~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~y~s~Ne 316 (337) ++|+|+ +++|||++|+++++++|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||||||| T Consensus 233 ~~l~~~~~~~PtE~~Aa~~~~~~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~Ne 312 (336) T protein:vir:37 233 KLIQQKHGLTPTEKAALGSHNLMGSFGGMNAITPPNFPARAAAVTTLKNLSVYTEAESVRRSLRNDEDKKGLVTSYYRQE 312 (336) T ss_pred hhhhhhcCCCHHHHHHHHHHHHHHhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEEccccccccchhhhcc Confidence 999997 5799999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeccccEEEeecceeccC Q lcl|Aclame:pro 317 AYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 317 ~YvVEd~~~~a~ieni~~~~a 
337 (337) ||||||||++|+||||++... T Consensus 313 ~YvVEd~~~~a~iE~i~v~~~ 333 (336) T protein:vir:37 313 GYVVEDLGLMTAIDHTKVKLN 333 (336) T ss_pred eeeeeccccEEEeeeeeeeec Confidence 999999999999999999987 No 15 >protein:vir:3783 Length: 336 # NCBI annotation: capsid # Family: family:all:201 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536823;genbank:gi:17981832;genbank:GeneID:929211 Probab=100.00 E-value=8.5e-176 Score=980.39 Aligned_cols=326 Identities=30% Similarity=0.431 Sum_probs=311.2 Q ss_pred HHHHHHHHHHHHHHHhhCchh----hcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccC Q lcl|Aclame:pro 4 ETRQAYEKYAAQIAKLNDTGD----VSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTD 79 (337) Q Consensus 4 ~tr~~~~~y~~~~a~~ngv~~----~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~ 79 (337) -||++|++|++++|++|||++ ++++|+|+||+||+|+++|||||+||++||+++|+|++||+|++|++|||||||+ T Consensus 1 mtr~~~~~y~~~~A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtd 80 (336) T protein:vir:37 1 MNKQAYYALAAALAKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKGINMVQVAHTKGTKLFGATEKGVTGRKQ 80 (336) T ss_pred CcHHHHHHHHHHHHHHhCCChhhhcccceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEEeeccCcccccccC Confidence 678999999999999999964 4589999999999999999999999999999999999999999999999999999 Q ss_pred CCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHH-HHHHHHHHHHHhhhhHHhcccccccCCcCChhhh Q lcl|Aclame:pro 80 TTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQ-QRIRDVILNQGALDRIMIGWNGVKAAATTDRQAN 158 (337) Q Consensus 80 t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~-~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~an 158 (337) |++.++. 
.++++++|+|+|||||+||+|++||+|||||||+ .+++.++.+|+|||||||||||+|+|++|| | T Consensus 81 t~r~r~~----~~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~d~~~~~~~~~~~r~iALD~i~IGfnG~s~A~~Td---n 153 (336) T protein:vir:37 81 TGRNLAT----LDHSQNGYELSETDSGILVNWSLFDSFAIFKDRLVELYSEYFQNQVALDILQIGWNGQSVATNTT---K 153 (336) T ss_pred CCCCccc----cCCCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHHhcchhhhcccceeeccCCC---C Confidence 9865333 5799999999999999999999999999999955 567788888899999999999999999999 9 Q ss_pred hhhhccchhHHHHHHHhchhhhccccccccCcee-ecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHH Q lcl|Aclame:pro 159 PLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVL-VGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKY 237 (337) Q Consensus 159 PllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~-~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~ 237 (337) ||||||||||||++||++|+|||++++.++|||. +|+||||+||||||+|+++ +||||||++||||||||||||++|| T Consensus 154 PllqDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~-~I~~~~~~d~dLVvivG~dLla~~~ 232 (336) T protein:vir:37 154 TDLSDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQ-GLDFRHQNRNDLVFLVGADLVSKET 232 (336) T ss_pred ccccccchhHHHHHHhccchhhcccccccCCceEEecCCCCcccHHHHHHHHHh-ccchHHhcCCCeEEEEchhhhhhhh Confidence 9999999999999999999999999988889976 5999999999999999997 7999999999999999999999999 Q ss_pred HHHHhc-cCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcccccceeceeeeee Q lcl|Aclame:pro 238 FPIVNA-TQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYESSND 316 (337) Q Consensus 238 ~~l~n~-~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~y~s~Ne 316 (337) +||+|+ +++|||++|+++++++|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||||||| T Consensus 233 ~~l~~~~~~~PtE~~Aa~~~~~~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~Ne 312 (336) T protein:vir:37 233 KLIQQKHGLTPTEKAALGSHNLMGSFGGMNAITPPNFPARAAAVTTLKNLSVYTEAESVRRSLRNDEDKKGLVTSYYRQE 312 (336) T ss_pred hhhhhhcCCCHHHHHHHHHHHHHHhhCCceEEEccccCCCceEEeeccccEEEEecCcEEEEEEEccccccccchhhhcc Confidence 999997 
5799999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeccccEEEeecceeccC Q lcl|Aclame:pro 317 AYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 317 ~YvVEd~~~~a~ieni~~~~a 337 (337) ||||||||++|+||||++... T Consensus 313 ~YvVEd~~~~a~iE~i~v~~~ 333 (336) T protein:vir:37 313 GYVVEDLGLMTAIDHTKVKLN 333 (336) T ss_pred eeeeeccccEEEeeeeeeecc Confidence 999999999999999999997 No 16 >protein:vir:270 Length: 341 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536650;genbank:gi:17975128;genbank:GeneID:929084 Probab=100.00 E-value=3.8e-174 Score=971.30 Aligned_cols=323 Identities=32% Similarity=0.519 Sum_probs=311.9 Q ss_pred CChHHHHHHHHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCC Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t 80 (337) |+++||++|++|++++|++|||++++++|+|+|++||+|+++|||||+||++||+++|+|++||+|++|++|+|||||+| T Consensus 5 m~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrtdt 84 (341) T protein:vir:27 5 LTQSAREYMDNFAQQLAKSYGVSNVAELFNVSPQLETKLRAAITESAEFLKMITVTTVDQIEGQVVDVGVSGLYTGRKAG 84 (341) T ss_pred ccHHHHHHHHHHHHHHHHHcCcccccceEeecHHHHHHHHHHHHhhHHhhhcCccccccceeeeEeecccccceeeccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhC---ChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhh Q lcl|Aclame:pro 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAK---FADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQA 157 (337) Q Consensus 81 ~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~---~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~a 157 (337) ++++|+ + ++++++|+|+|||||+||+|++||+||| ||||++|+++++++|||||||||||||+|+|++|||++ T Consensus 85 ~R~~r~---~-~l~~~~Y~c~qtn~dt~i~y~~lDaWA~~g~~~dF~~r~~~~i~~~~ALD~i~IGfnGts~A~~Td~~a 160 (341) T 
protein:vir:27 85 GRFTKQ---V-GVGGHKYKLAETDSCAAITWAMLCQWANQGGRDQFMKHLTEFSNQMFALDIMRIGWNGVSAEADTDPSA 160 (341) T ss_pred Cceecc---c-ccCCcceEEEEeeeeeeecHHHHHHHHhcCCChHHHHHHHHHHHHHHhhhhhhhcccceeeccCCChhh Confidence 766555 4 7999999999999999999999999999 99999999999999999999999999999999999999 Q ss_pred hhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHH Q lcl|Aclame:pro 158 NPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKY 237 (337) Q Consensus 158 nPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~ 237 (337) |||||||||||||++||++|+|||+++ ++++|+||||+||||||+|++++|||||||++||||||||||||++|| T Consensus 161 nPllqDVNkGWlQ~~Re~a~~rVl~~~-----~~~~g~~gdy~nLDAlV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~ 235 (341) T protein:vir:27 161 NPLGQDVNEGWIAFVKNRKASQVVDVD-----VYFDETNGDYRTLDAMASDIINNQIHPMFRNDPRLTVFVGSGLIGAAQ 235 (341) T ss_pred cccccccchhHHHHHHhhcccceeccc-----eeeccCCCccccHHHHHHHHHhcccChHHhcCCCEEEEEchhhhhhhh Confidence 999999999999999999999999864 567799999999999999999999999999999999999999999999 Q ss_pred HHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcccccceeceeeeeee Q lcl|Aclame:pro 238 FPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYESSNDA 317 (337) Q Consensus 238 ~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~y~s~Ne~ 317 (337) +||+|++++|||++|+|++ +|+||||||++|||||++++|||+|||||||||+|++||+++|||+|||||+|+| + T Consensus 236 ~~l~n~~~~ptE~~Aa~~i--~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~yes---~ 310 (341) T protein:vir:27 236 AKLYDKADKPSEQIAAQKL--DKTIAGRPAYVPPFLPDNAMVVTIPENLQVLTQHGTAQRKAKHESDRKRSKTHTG---A 310 (341) T ss_pred hhhhccCCCCHHHHHHHHH--HHhhCCCeEEEccccCCCceEEeeccceEEEEecCcEEEEEEeccccccccchhh---h Confidence 9999999999999999987 7899999999999999999999999999999999999999999999999999977 8 Q ss_pred eeeeccccEEEee--cceeccC Q lcl|Aclame:pro 318 YVVEDFGCGCVAE--NIELAAA 337 (337) Q Consensus 318 YvVEd~~~~a~ie--ni~~~~a 337 (337) 
||||||||++++| +|++.-+ T Consensus 311 YvVEdyg~~~~~~~~~vkl~~~ 332 (341) T protein:vir:27 311 WKVTQWVCWKRSPLTTQKKSTS 332 (341) T ss_pred heeehhhhhhhccccccccCcc Confidence 9999999999999 5555555 No 17 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=100.00 E-value=3.5e-79 Score=450.72 Aligned_cols=307 Identities=14% Similarity=0.149 Sum_probs=262.3 Q ss_pred HHHHHHHHHHHHHHHhhCc--hhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCC- Q lcl|Aclame:pro 4 ETRQAYEKYAAQIAKLNDT--GDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT- 80 (337) Q Consensus 4 ~tr~~~~~y~~~~a~~ngv--~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t- 80 (337) -+++.|++|++++++.+++ +++...|+|.|+++|+|+++++|+|.||++||+++|++.+|+++.+|+++++. |+.+ T Consensus 1 ~~~k~~~~~l~~~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~~~~~~~~~-~~~~e 79 (321) T protein:vir:31 1 MASRTINNDLSRITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKTRIPTLNIGERHR-RPQDE 79 (321) T ss_pred CchHHHHHHHHHHHHhccccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcceeeeeeccCCccc-ccccc Confidence 5678899999999999986 67888999999999999999999999999999999999999999999988776 5554 Q ss_pred CCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhhh Q lcl|Aclame:pro 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) Q Consensus 81 ~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anPl 160 (337) +..+|++.++ .+++.+|.|++++++++|+|++||+||++|||++++++.+++++|+|++++||||++++.++ T Consensus 80 ~~~~~~~~~~-~~~~~~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~------- 151 (321) T protein:vir:31 80 GEWNENESDV-STGTIDISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDS------- 151 (321) T ss_pred 
cccccccccc-eeeeeeeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCc------- Confidence 4455666665 58999999999999999999999999999999999999999999999999999999876543 Q ss_pred hhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHHHH Q lcl|Aclame:pro 161 LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPI 240 (337) Q Consensus 161 lqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~~l 240 (337) +++||+||||++|++++ +++.+++..++|.+ .+++. .||++||+++++|+|||++++.+.+.+| T Consensus 152 ~~~~n~G~l~~a~~~~~--------------~~~~~~~~~~~d~l-~~l~~-~l~~~yr~~~~~v~im~~~~~~~~~~~l 215 (321) T protein:vir:31 152 FENQNDGFITVAEGDVE--------------TIDAADDILDNDLV-IRTIA-GLDSKYRARMNPALIVSEDQLLSYHYTL 215 (321) T ss_pred ccccchhhhhhhccccc--------------cccccccccCHHHH-HHHHH-hccHhHhcCCCeEEEechHHHHHHHHHH Confidence 68999999999887532 23445566667754 46665 6799999999999999999999888887 Q ss_pred HhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcc----cccceeceeeeee Q lcl|Aclame:pro 241 VNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVP----ERDRIENYESSND 316 (337) Q Consensus 241 ~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p----~r~rve~y~s~Ne 316 (337) .+.. .|.+..+. .-...++|+|+|++.+||||++++++|+|+||++|++.+.++|+..+.+ +++|+++|+++|+ T Consensus 216 ~~~~-~~~~~~~l-~~~~~~tl~G~pvv~~~~mP~~~il~t~~~nl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 293 (321) T protein:vir:31 216 TDRD-TPLGDNVI-MGEADVNPFSFPIIGSGLWPDDKAMFTDPQNLIYALYRDLEIDVLTESDKVSERDLHARYFMRGDD 293 (321) T ss_pred hcCC-Cccccchh-hccccccccceeEEEcCCCCCCcEEEeccccEEEEEeeccEEEEeecCccccccceeeEeeeeeec Confidence 6653 45554322 2235678999999999999999999999999999999998777766644 5799999999999 Q ss_pred eeeeeccccEEEeecceecc-C Q lcl|Aclame:pro 317 AYVVEDFGCGCVAENIELAA-A 337 (337) Q Consensus 317 ~YvVEd~~~~a~ieni~~~~-a 337 (337) +||||||+++|++|||+... . 
T Consensus 294 ~~~ve~~~a~a~~~~i~~~~~~ 315 (321) T protein:vir:31 294 DFAIENTEAVVLAEGLGDPLEH 315 (321) T ss_pred ceeEeccccEEEEecCCcchhc Confidence 99999999999999998632 2 No 18 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=100.00 E-value=1.5e-45 Score=266.29 Aligned_cols=325 Identities=14% Similarity=0.159 Sum_probs=233.4 Q ss_pred CChHHH--HHHHHHHHHHHHhhCc-hhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhce--eeecccccccc Q lcl|Aclame:pro 1 MRKETR--QAYEKYAAQIAKLNDT-GDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGE--KLGLSVSGPIA 75 (337) Q Consensus 1 M~~~tr--~~~~~y~~~~a~~ngv-~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge--~v~lgv~g~ia 75 (337) |.+++- +.-|+++..+++.+-. ++++ +|.+.|+++++|..++|+++.||++|+++++...+|+ +|++|.-...+ T Consensus 1 ~~~~~~~~~~~n~~~~~i~k~~it~~~l~-~g~L~p~~a~~Fl~~v~~~t~iL~~~r~~~~~s~~~ei~kig~G~r~~r~ 79 (360) T protein:vir:99 1 MSSNSTIDSVRNQNMNSLSQKDIGLAELD-GFQLPVDVTEEFLERMQKGVQILGMADTMTLARLEMEVPQFGVPRLSGHT 79 (360) T ss_pred CcchhHHHHHhhhHHHHHHhhhccccccC-ceeecHHHHHHHHHHHhhccchhhhcceeecccccccccccccceeeccc Confidence 877653 5678999999988764 5654 7999999999999999999999999999999999999 66665533333 Q ss_pred ccc---CCCCcccccccccccCCceeEEEEeeeeeecCHHHHHH--HhCChhHHHHHHHHHHHHHhhhhHHhcccccccC Q lcl|Aclame:pro 76 SRT---DTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDM--WAKFADFQQRIRDVILNQGALDRIMIGWNGVKAA 150 (337) Q Consensus 76 ~Rt---~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~--WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A 150 (337) +-. ++.+..+++..+.-..-..+.|.. +|.++.+.. |....+|++.+.++++++++.|+.++||||.+.. 
T Consensus 80 ~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~-----~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~g~~ds 154 (360) T protein:vir:99 80 RDEEGSRTENSEAESGSVKFNATDKSYYIL-----VEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIRAGASS 154 (360) T ss_pred cccCCCCCcCCcCccccCccccccceeeEe-----echHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhhccchh Confidence 211 122233333233322222333332 455565555 5556689999999999999999999999999887 Q ss_pred CcC--ChhhhhhhhccchhHHHHHHHhchhhhcccc----ccccCc------------eeecCCcccccHHHHHHHHHhc Q lcl|Aclame:pro 151 ATT--DRQANPLLQDVNIGWLQQYRERAAQRVLHEG----AKQAGK------------VLVGKAGDYENLDALVMDIVSS 212 (337) Q Consensus 151 ~~T--d~~anPllqDVN~GWlq~~Re~a~~~v~~~~----~~~~~~------------i~~g~ggdy~nLDaLv~d~~~~ 212 (337) .++ |-..+|++ ++|+|||++++.+ ++.+-..+ ...+++ ..-|.|+-|....+|+.+++.. T Consensus 155 ~d~~~~~~~d~fl-~~~dGwlKka~~~-~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~~~~~~ 232 (360) T protein:vir:99 155 GNLQSIGGAAELD-NTFKGWIARAEGD-AQSVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLFNETIQT 232 (360) T ss_pred cccccCcccchhh-hhhHHHHHHhhcc-cchhhccccccccccccccccccccchhhhccccccccccchHHHHHHHHHh Confidence 654 33456776 9999999999976 33321100 000110 1236667789999999999987 Q ss_pred ccChhHcCCC--CEEEEECHHHHHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEE Q lcl|Aclame:pro 213 MIDPWFQEDT--GLVVICGRELLHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYY 290 (337) Q Consensus 213 lid~~~r~~~--~LVvivG~dLl~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~ 290 (337) | +..||+.+ .++++++.+........|.++....... +.+-....++-|.|++.||+||++.+|+|+++||.++. 
T Consensus 233 L-p~kyr~~~~~~~~~~~s~~~~~~yr~~L~~R~t~LGd~--~l~g~~~~~~~Gipi~~v~~~pd~~~mlT~p~NLi~g~ 309 (360) T protein:vir:99 233 L-DSRYRESDAYSPVLMTSPNQVQSYTMSLTEREDPLGSA--VIFGDSDITPFSYDLVGVNGFPDEYMMFTDPNNLAFGL 309 (360) T ss_pred c-chhhhcCcccceEEEccCchHHHHHHHHhccCcccchh--heecccccccceeeeEEcCCCCCCceEEeccCceeEEe Confidence 6 55689877 4589999998876666665554332221 11112345777999999999999999999999997767 Q ss_pred ecCceEEEEEEcccc---cc--eeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 291 QEGARRRTLKEVPER---DR--IENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 291 Q~gs~RR~~~d~p~r---~r--ve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) -++.+.++..+ |+| +| +.++.+...+|++||++++|+++||+-.+| T Consensus 310 ~~~iri~~~~e-~~~~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~~~~~~~ 360 (360) T protein:vir:99 310 YEEMELDQSTD-TDKVHEQRLHSRNWLEGQFDFQIKEQQAGVLVTDLETPTA 360 (360) T ss_pred eeeeEEeeccc-chhhhhhceeeeEEEEEEeeEEEEecccEEEEecCCCCCC Confidence 66666665444 333 33 566778899999999999999999999999 No 19 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=100.00 E-value=1.1e-41 Score=245.10 Aligned_cols=304 Identities=15% Similarity=0.140 Sum_probs=224.0 Q ss_pred CChHHHHHHHHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceec-cchhhceeeecccccccccccC Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLP-VTELEGEKLGLSVSGPIASRTD 79 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~-V~~~~Ge~v~lgv~g~ia~Rt~ 79 (337) |.- -|+.|+ +=+.-.+++.+ .+.+.|.+.++|.++|+|+|.||+.++++. 
+...+++.-.+|+.+.+++..+ T Consensus 1 ~~~-~~~~~~-----~~k~it~~d~~-gG~L~P~~~~~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~~~~ 73 (314) T protein:vir:41 1 MDF-LNKPFQ-----ITPKIDVPDLG-KGILAVQRFGEFVREVRENSAIIKDARVLNALKSYEVDISRISLGVELEPGRN 73 (314) T ss_pred Cch-hhhHHH-----hhcccccccCC-CceeChHHHHHHHHHHHhccchhhheeeecccCccceeecccccCcccccccc Confidence 321 122222 11122355554 578999999999999999999999999984 5666676666777777665555 Q ss_pred CCCc-ccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhh Q lcl|Aclame:pro 80 TTKA-ARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQAN 158 (337) Q Consensus 80 t~~~-~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~an 158 (337) .++. ...|.+-..++..+|.|++....+.|+|+.|++||..|||++.+.+.+++|+|.|+.+++|||.+...+ ++ T Consensus 74 ~~~~~~~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s----~~ 149 (314) T protein:vir:41 74 TSGTKVAPTADEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTT----GR 149 (314) T ss_pred cccCCccCCcccccccceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcC----cc Confidence 4443 334666677999999999999999999999999999999999999999999999999999999875544 45 Q ss_pred hhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHH Q lcl|Aclame:pro 159 PLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYF 238 (337) Q Consensus 159 PllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~ 238 (337) |+++ +++|||++.. +.++...+++|.+.+.++.+++.+|.++|+++.+++|+||+++.+ .++. 
T Consensus 150 ~~~~-~p~G~l~~a~---------------~~~~~~~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~-~~~r 212 (314) T protein:vir:41 150 ELYR-INDGWMKLAG---------------NQYTDAEPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIY-NGYR 212 (314) T ss_pred cchh-cchhhhhhcc---------------cceeecCccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHH-HHHH Confidence 7777 9999999732 122233567889999999999998877777788899999999977 5666 Q ss_pred HHHhccCCh-HHHHHHHHHHhhhhhcCccccccCcc-----CCCceEEecchhcEEEEecCceEEEEEEcccccceecee Q lcl|Aclame:pro 239 PIVNATQAP-TERLAADLIVSQKRIGNLPAVRVPFF-----PKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYE 312 (337) Q Consensus 239 ~l~n~~~~p-tE~~A~~~~~~~k~iGGlpa~~vPff-----P~~~iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~y~ 312 (337) .++...++| .... ..-....+|.|+|++.+|+| |++.+++|.++|| ||.-.-..||..+-+++.+++..+. T Consensus 213 ~~l~~~~~~l~~~~--~~~~~~~~l~G~PV~~~~~~~~~~~~~~~i~fgd~~nl-v~~~~~~ir~~~~~~a~~~~~~~~~ 289 (314) T protein:vir:41 213 KQLLVRETGLGDSA--LIGATGLQYDGIPIQYVPALDALGDDKARALLTVPTNL-VYGFWRNIRIEPKRDAAMRRTEYIA 289 (314) T ss_pred HHHhccCCcccchh--hhCCCCceecceeeEecccccccCCCCceEEEechhhe-EEEeeceeEEeecccCcCCeEEEEE Confidence 665433322 1111 11123457899999999987 6799999999999 5544445566666667778999999 Q ss_pred eeeeeeeeecccc--EEEeecceec Q lcl|Aclame:pro 313 SSNDAYVVEDFGC--GCVAENIELA 335 (337) Q Consensus 313 s~Ne~YvVEd~~~--~a~ieni~~~ 335 (337) ...-++.+|+.+. 
.+.+++..=+ T Consensus 290 ~~r~d~~~~~~~aa~~~~~~~~~~~ 314 (314) T protein:vir:41 290 SLRADCNYEDENAAVAAVIDMSSGG 314 (314) T ss_pred EEEeceEEEEcCcEEEEEeeccCCC Confidence 9888877765544 4444543333 No 20 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=100.00 E-value=2.4e-37 Score=221.33 Aligned_cols=306 Identities=13% Similarity=0.098 Sum_probs=204.0 Q ss_pred CChHHHHHHHHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceec-cchhhceeeecccccccc-ccc Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLP-VTELEGEKLGLSVSGPIA-SRT 78 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~-V~~~~Ge~v~lgv~g~ia-~Rt 78 (337) |--..-.+.++...-+ +.-++++. ..|.+.|++.++|+++++|+|.||++|+++. ....+++.-.+|+.+++. |++ T Consensus 1 ~~~~~~~~~~~~~~~~-k~~t~~d~-~Gg~l~P~~~~~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~~~~~g~~ 78 (315) T protein:vir:41 1 MLTIEDIRGGKPFEIV-PKIDVPDL-GRGVLSVDRFGEFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVLDVGPGRD 78 (315) T ss_pred CcccchhhcCChhhhh-hhcCCcCC-CCceechHHHHHHHHHHHhhhhhhhhceeeeccccccccccccccCcccccccc Confidence 1111111111111111 23456665 4788999999999999999999999999864 455666655566555543 555 Q ss_pred CCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhh Q lcl|Aclame:pro 79 DTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQAN 158 (337) Q Consensus 79 ~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~an 158 (337) .++...+.|.....++..+|.|++..+.++|+|+.|+.|+..|||++.+.+.+++++|.|+.+++|||.+.+.++ T Consensus 79 ~~~~~~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s~~p----- 153 (315) T protein:vir:41 79 ETGQKLAPPESTAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDTSSSDP----- 153 (315) T ss_pred 
cccCcCCCCCCccccceeeeceeeeeeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccCCcCcCc----- Confidence 555566667676789999999999999999999999999999999999999999999999999999998876543 Q ss_pred hhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHH Q lcl|Aclame:pro 159 PLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYF 238 (337) Q Consensus 159 PllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~ 238 (337) ++ ..|+|||++++..+...... ++.++. ..| ++.+++.+|.++++++.+++|+||+++.+.. +. T Consensus 154 -~~-~~~~G~l~~a~~~~~~~~~~-----------~~a~~~-~~d-~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~-~r 217 (315) T protein:vir:41 154 -LL-RMSDGWLKLASEKLTESDVD-----------PEAEDW-PMN-LFDTMIESLPTPYRNNLPNMKFYVTWDIYRA-YR 217 (315) T ss_pred -cc-cccccceecccccccccccc-----------cccccc-cHH-HHHHHHHhcChHHhhcCCceEEEEcHHHHHH-HH Confidence 22 36899999866542211110 111110 122 4556777765555556689999999999864 45 Q ss_pred HHHhccCChHHHHHHHHHHhhhhhcCccccccCcc-----CCCceEEecchhcEEEEecCceEEEEEEcccccceeceee Q lcl|Aclame:pro 239 PIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFF-----PKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYES 313 (337) Q Consensus 239 ~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPff-----P~~~iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~y~s 313 (337) .+......+--.. ........+|.|+|++.+|.| |++.+++|.++||.+....+.+++.- .+++..++..|.. 
T Consensus 218 klk~~~g~~lw~~-~~~~g~~~tl~G~PV~~~~~m~~~~~~~~~ilf~d~~nl~~~~~~~i~i~~~-~~a~~~~~~~~~~ 295 (315) T protein:vir:41 218 DALKGRETGLGDQ-ALTGANSILYDGRPVQYVPALEALNDGKSRALFVVPTQLVYGFWRNIKVVPD-YDAEMRLTKYVAS 295 (315) T ss_pred HHhccCCCccccc-hhhcCCCceecccceEecccccccCCCCccEEEecccceEEEeccccEEEee-ecCCCCceEEEEE Confidence 5543322221110 011112358889999888776 67889999999998766655444333 3344444443332 Q ss_pred -e-eeeeeeeccccEEEeecc Q lcl|Aclame:pro 314 -S-NDAYVVEDFGCGCVAENI 332 (337) Q Consensus 314 -~-Ne~YvVEd~~~~a~ieni 332 (337) | .-+|++|++ +++++.+| T Consensus 296 ~r~d~~~~~~~~-~a~~~~~v 315 (315) T protein:vir:41 296 LRTDNHYEDEEG-AVSATITV 315 (315) T ss_pred EEeceeEEeccc-eeEeeeeC Confidence 2 445788885 67777788 No 21 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=98.77 E-value=4.3e-09 Score=66.47 Aligned_cols=293 Identities=15% Similarity=0.072 Sum_probs=163.9 Q ss_pred CChHHHHHHHHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCC Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t 80 (337) |+++-|+.++++.+. .+. ..-.+.|-+.+.+.+.+.+.+.|.++++++++++.--.+...... +++-+.-... 
T Consensus 72 l~~~~r~~~~~~~~~----~~~--~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~i~~~~-~~~~a~~~~E 144 (390) T protein:vir:40 72 LTSDESKYYNEVIAG----NGF--AGVTALLPPTVFERVFEDLTVEHPLLSKINFVNTTATTEWIISVG-DVATAWWGPL 144 (390) T ss_pred ccHHHHHHHHHHHhc----cCc--ccCcccccHHHHHHHHHHHHhhhhhhhhceeeecCCceeEEEEEc-CCcceeeecc Confidence 566666666555432 222 223456777889999999999999999999999875433333222 2222222221 Q ss_pred CCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhhh Q lcl|Aclame:pro 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) Q Consensus 81 ~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anPl 160 (337) .....+..-..++...|.+++.--...|+.+.|+... .+|++.+++.++++++.-.-.--++|+-.. -| T Consensus 145 -~~~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~ds~--~~l~~~i~~~la~~i~~~~~~a~l~G~G~~-------~P- 213 (390) T protein:vir:40 145 -CAEIKEVLDNGFDKIQTGMYKLSAYIPVCNAMLDLGP--SWLDQYVRTILGEAMALGLEAGIVNGSGKD-------QP- 213 (390) T ss_pred -ccccCccccccceeeEeeeeeEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHhhhhcccCCC-------cc- Confidence 1233333334577888888888888899999998553 479999999999999887776777775311 12 Q ss_pred hhccchhHHHHHHHhchhhhccccccccCceeecCCc--ccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHH Q lcl|Aclame:pro 161 LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAG--DYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYF 238 (337) Q Consensus 161 lqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~gg--dy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~ 238 (337) .|+|... .-+ +.+.......+ .+.+...++..+...+.+...+..+..|++|.+....++-. 
T Consensus 214 -----~Gil~~~-----~~~------~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~ 277 (390) T protein:vir:40 214 -----IGMMRDL-----NNV------TAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIY 277 (390) T ss_pred -----ceeeecc-----ccc------cccccccccccccchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHH Confidence 4655321 000 01111111112 23334444444444333322334457899999754332211 Q ss_pred --HHH-hccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcc--cccceeceee Q lcl|Aclame:pro 239 --PIV-NATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVP--ERDRIENYES 313 (337) Q Consensus 239 --~l~-n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p--~r~rve~y~s 313 (337) ..+ +....+- ......|+|++.-+++|++.+++-.+++.-|+. ++..+-..-++. .++.+.-.-. T Consensus 278 ~~~~~~d~~G~~v---------~~~~~~g~pvv~~~~~p~~~i~~Gd~s~~~i~~-~~~~~v~~~~~~~f~~~~~~~r~~ 347 (390) T protein:vir:40 278 AATSYMTPQGVWV---------TGILPVPLEIVQSVAVPVGKAVAGRAKDYFMGI-GSEQVIRTSTEYRLLDDETLYYAK 347 (390) T ss_pred HHhhccCCCCccc---------cccCCCceeEEEcCCCCCCcEEEEeeceEEEEe-ecceEEEecchhhhhcCcEEEEEE Confidence 122 2222221 122346999999999999999999998875554 333433222222 2344433334 Q ss_pred eeeeeeeeccccEEEee--cceeccC Q lcl|Aclame:pro 314 SNDAYVVEDFGCGCVAE--NIELAAA 337 (337) Q Consensus 314 ~Ne~YvVEd~~~~a~ie--ni~~~~a 337 (337) .--+..|-|.++++.++ .++=..+ T Consensus 348 ~r~dg~v~~~~A~~~l~~~~~~~~~~ 373 (390) T protein:vir:40 348 QYANGRPKDNSSFLVFDITGLEGSPA 373 (390) T ss_pred EEeCCEEecccceEEEEeeccCCCCC Confidence 44455555666555543 2211111 No 22 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=98.73 E-value=7.5e-09 Score=65.11 Aligned_cols=307 Identities=13% Similarity=0.088 Sum_probs=168.7 Q ss_pred CChHHHHHHHHHHHHHHHhhCc---hhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccc Q lcl|Aclame:pro 1 
MRKETRQAYEKYAAQIAKLNDT---GDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASR 77 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv---~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~R 77 (337) =+.+.+..|..|+..--....+ .+..-.|.|-+.+.+.+.+.+++.+.+++.++++++....+.. -...+++.++- T Consensus 109 ~~~~~~~af~~~l~~~e~~~al~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~-~~~~~~~~a~w 187 (425) T protein:vir:10 109 RDPEYTEAFKAHVKRGDVQAALNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQLCRVQPVSKAGFSK-LFNMGGTTSGW 187 (425) T ss_pred ccHHHHHHHHHHhhhhhhHHHhhcCcCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeeccCCceEE-EEEcCCcceee Confidence 2233466677777543221111 1223346677777889999999999999999999998655443 33444555533 Q ss_pred cCCCCccccc-ccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChh Q lcl|Aclame:pro 78 TDTTKAARQP-IDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQ 156 (337) Q Consensus 78 t~t~~~~R~p-~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~ 156 (337) +. .+...| .+...++...|.+++.---+.|+.+.|+... ++|+..+.+.+.+.++.=.-.--+||+-.. T Consensus 188 v~--E~~~~~~~~~~~f~~v~~~~~k~~~~i~iS~ell~ds~--~~l~~~i~~~la~ai~~~~d~~~l~G~G~~------ 257 (425) T protein:vir:10 188 VG--EASQRPQTNAATFQPLSFASGEIYANPAATQQILDDAE--IDLESWLATEVQTEFAKQEGKAFLAGDGTN------ 257 (425) T ss_pred ec--cccccccccccccceeeeeheeeEeehHhHHHHHhcch--hHHHHHHHHHHHHHHHHHHHhhhhcccCCC------ Confidence 22 222223 3444677788999999889999999998653 789999999999999886666667774311 Q ss_pred hhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHH Q lcl|Aclame:pro 157 ANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDK 236 (337) Q Consensus 157 anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k 236 (337) + ..|+|...-..........+. ...+..+..+. .+.|.|+ |++.+| ++.|+.. -+++|.+..+.. 
T Consensus 258 -~------p~Gil~~~~~~~~~~~~~~~~--~~~~~~~~~~~-~~~d~l~-~l~~~l-~~~~~~~--a~~vmn~~~~~~- 322 (425) T protein:vir:10 258 -K------PNGLLTYIAGGANAAKHPFGA--IEVVNSGAAAD-ITSDGII-DLVYDL-PSAFTGN--ARFAMNRNTQRQ- 322 (425) T ss_pred -C------cceeeeccccccccccccccc--ccccccccccc-ccHHHHH-HHHhhh-hhhhccC--CEEEEchHHHHH- Confidence 1 236664322111000000000 01112222222 3455555 667665 6777764 478899887653 Q ss_pred HHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCC-----CceEEecchhcEEEEecCceEEEEEEcccccceece Q lcl|Aclame:pro 237 YFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPK-----RALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENY 311 (337) Q Consensus 237 ~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~-----~~iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~y 311 (337) -..|-+..+.|--.--.+ -....+|-|+|++..+++|. ..+++=.+++.-..+.+...+.....--.++.+.-+ T Consensus 323 L~~lkD~~G~~l~~~~~~-~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~v~~d~~~~~~~~~~~ 401 (425) T protein:vir:10 323 VRKLKDGQGNYLWQPSYV-AGQPATLAGYPVTEVPDMPDVAANSTPILFGDFQQTYLIIDRIGVRVLRDPYTAKPYVLFY 401 (425) T ss_pred HHHhhcCCCceeeccCcc-CCCCceecceeeEEecCcCCccCCccEEEEEehhccEEEEEecceEEEecccccCCcEEEE Confidence 222222222221000000 01234788999999999995 336776777654445555554433222223332222 Q ss_pred eeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 312 ESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 312 ~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) -..--+..|-|.+.++. 
++++-| T Consensus 402 ~~~r~d~~v~~~~A~~~---l~~~as 424 (425) T protein:vir:10 402 TTKRVGGGLLNPEPMRA---MKVAAS 424 (425) T ss_pred EEEEeccEeecccceEE---EEeecc Confidence 22223344444444433 333333 No 23 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=98.67 E-value=2.9e-08 Score=61.93 Aligned_cols=295 Identities=12% Similarity=-0.020 Sum_probs=159.5 Q ss_pred CChHHHHHHHHHHHHHH-------------HhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeee Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIA-------------KLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLG 67 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a-------------~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~ 67 (337) .+..-...|..++.... ...+.......+.|-+.+.+.+.+.+.+.+.+++.++++++..-.+..+. T Consensus 104 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~ 183 (418) T protein:vir:10 104 TESEEMKGMDGSARKSVRVRVDRKSIMNVPATVGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTSSSSIEYTV 183 (418) T ss_pred hhHHHHHHHHHHHhhhhhhhhHHHHHHHhhhhccCCCCCCccccchhHHHHHHHHHhhhhhHHhhcceeeccCCceeEEE Confidence 11122222222222211 11122222345568888889999999999999999999998766666665 Q ss_pred cccccccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhccccc Q lcl|Aclame:pro 68 LSVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGV 147 (337) Q Consensus 68 lgv~g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~ 147 (337) ....++-++=+. 
.+...|..-..++...+.+++.---+.|+.+.|+.- ++|+..+++.+.++++.-.-.--+||+ T Consensus 184 ~~~~~~~a~~v~--E~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds---~~l~~~i~~~l~~a~~~~~d~a~l~G~ 258 (418) T protein:vir:10 184 ETGFTNNAAAVA--EGAQKPTSDLKFNLKNQPVRTIAHLFKASRQILDDA---PALQSYIDGRARYGLQLTEEGQILKGD 258 (418) T ss_pred EecCCCceeeec--cCccccccccceeeEEEeeeeEEEeehhhHHHHHhH---HHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 433333332221 122234444567888888888888888999999864 689999999999998887777777884 Q ss_pred ccCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEE Q lcl|Aclame:pro 148 KAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVI 227 (337) Q Consensus 148 s~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvi 227 (337) -.. .+|. |.+... ... ... ...++..++|.++. ++..+-+. ++... +++ T Consensus 259 g~~------~~p~------Gi~~~~------------~~~--~~~-~~~~~~~~~~~i~~-~~~~~~~~-~~~~~--~~v 307 (418) T protein:vir:10 259 GTG------ANIL------GILPQA------------SAF--MPS-ITLANATPIDKIRL-ALLQAVLA-EFPAT--GIV 307 (418) T ss_pred CCC------cccc------cccccc------------ccc--ccc-ccccccccHHHHHH-HHHhhccc-cCCCC--EEE Confidence 321 1232 333321 000 011 12223344555443 34444333 33322 688 Q ss_pred ECHHHHHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcc---- Q lcl|Aclame:pro 228 CGRELLHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVP---- 303 (337) Q Consensus 228 vG~dLl~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p---- 303 (337) |.+..... ...+-.....|-=.-... ....+|-|+|++..+++|++.+++-.+++....+..+...=.+-.+. 
T Consensus 308 ~n~~~~~~-L~~lkd~~G~~i~~~~~~--~~~~~l~G~pV~~~~~~p~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~f 384 (418) T protein:vir:10 308 LNPIDWAS-IELTKDSQGRYIVGNPVN--GTTPRLWNLPVVETQAMTANEFLVGAFSMAAQIFDRMEIEVLLSTENVDDF 384 (418) T ss_pred EcHHHHHH-HHHhhcCCCceecccccc--CCCceecceeeEEcCCCCCCcEEEeeccceEEEEEecceEEEEecccchhh Confidence 99987653 222222222111000001 12458899999999999999999998887433232233222221111 Q ss_pred cccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 304 ERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 304 ~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) .+|.+.-.-..--++.|-+...++.+ ++..+ T Consensus 385 ~~~~~~~r~~~~~d~~~~~~~a~~~~---~~~~~ 415 (418) T protein:vir:10 385 EKNMVSIRAEERLALAVYRPESFVTG---ALVEQ 415 (418) T ss_pred hcCceEEEEEEeeccEEecccceEEE---EeccC Confidence 12222221122224445555555543 34444 No 24 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=98.62 E-value=4.7e-08 Score=60.78 Aligned_cols=297 Identities=9% Similarity=-0.064 Sum_probs=157.3 Q ss_pred CChHHHHHHHHHHHHHHH------hhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeeccccccc Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAK------LNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPI 74 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~------~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~i 74 (337) .....+..|..+...... 
.....+.+..+.|-|...+.+.+.+.+.+.+++.++++++.--.+.........+- T Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~ 168 (395) T protein:vir:43 89 AESLKEQGVTSSLRGSHRVSMPRSAITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTESNSVEYVRETGFVNN 168 (395) T ss_pred HHHHHHHHHHHHhhhhhhhhhhhhhhcccCCCCccccchhhHHHHHHHHHhhhhHHhhccceecCCCceEEEEEecCCCc Confidence 111122222222221111 01111223345688889999999999999999999999987544444433221122 Q ss_pred ccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCC Q lcl|Aclame:pro 75 ASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTD 154 (337) Q Consensus 75 a~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td 154 (337) ++-+ +.+.-.|..-..++...+.+++.--.+.|+.+.|+.. ++++..+++.++++++.-.-.--+||+-.. T Consensus 169 a~~v--~E~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~---~~l~~~v~~~la~a~~~~~d~~~l~G~g~~---- 239 (395) T protein:vir:43 169 AAPV--SEGTQKPYSDLTFELENAPVRTIAHLFKASRQILDDA---SALQSYIDARARYGLMLVEECQLLYGNGTG---- 239 (395) T ss_pred eeee--cCCccccccccceeEEEEeeeeEEEeehhhHHHHHhH---HHHHHHHHHHHHHHHHHHHHHHHHhccCCC---- Confidence 2211 1122234344567888899999888899999998863 678899999999998885555566774321 Q ss_pred hhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHH Q lcl|Aclame:pro 155 RQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLH 234 (337) Q Consensus 155 ~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~ 234 (337) +|. .| +++.......... +.......+|.+ .+++..+ ++.++.. -+++|.+.... 
T Consensus 240 ---~~~-----~G------------i~~~~~~~~~~~~-~~~~~~~~~~~i-~~~~~~~-~~~~~~~--~~~vmn~~~~~ 294 (395) T protein:vir:43 240 ---ANL-----HG------------IIPQAQAYAPPSG-VVVTAEQRIDRI-RLAILQA-QLAEFPA--SGIVLNPIDWA 294 (395) T ss_pred ---Ccc-----cc------------ccccccccccccc-cccccchhHHHH-HHHHHhh-ccccCCC--cEEEEcHHHHH Confidence 111 11 2221111111111 112222334433 3444444 4444432 37889998765 Q ss_pred HHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEccc----ccceec Q lcl|Aclame:pro 235 DKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPE----RDRIEN 310 (337) Q Consensus 235 ~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p~----r~rve~ 310 (337) . ...+-...+.|-=..... ....++-|+|++..+++|++.+++-.+++....+-++...=.+-++.. +|.+.- T Consensus 295 ~-l~~lkd~~G~~i~~~~~~--~~~~~l~G~pVv~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~ 371 (395) T protein:vir:43 295 L-IELNKDAENRYIIGSPQN--GTTPTLWRLPVVETQAITQDEFLTGAFSLGAQIFDRMDIEVLVSTENDKDFENNMVTI 371 (395) T ss_pred H-HHHhhccCCceecccccc--CCCceecceeeEEcCCCCCCcEEEEeccceEEEEEecceEEEEeccccchhhcCcEEE Confidence 3 222222222211000001 124578899999999999999999999986544433332222222221 333322 Q ss_pred eeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 311 YESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 311 y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) +-..--++.|-+..+++.+ ++.-| T Consensus 372 r~~~r~d~~v~~~~a~~~~---~~taa 395 (395) T protein:vir:43 372 RAEERLAFAVYRPEAFVTG---SLTAS 395 (395) T ss_pred EEEEeeccEEecccceEEE---EeccC Confidence 2223334555555555544 55556 No 25 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=98.62 E-value=4.3e-08 Score=60.96 Aligned_cols=299 Identities=11% Similarity=0.079 Sum_probs=157.7 Q ss_pred CChHHHHHHHHHHHHHHHh--hCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccc-cccccc Q lcl|Aclame:pro 1 
MRKETRQAYEKYAAQIAKL--NDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVS-GPIASR 77 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~--ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~-g~ia~R 77 (337) +....+..|..++...... .++....-.+.|-..+...+.+.+.+.+.+++.+++++++...|.....-.+ ++-+.- T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (415) T protein:vir:98 101 VTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEK 180 (415) T ss_pred hHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCcccee Confidence 2222333333333222111 1121222233444467889999999999999999999999888876544322 222222 Q ss_pred cCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhh Q lcl|Aclame:pro 78 TDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQA 157 (337) Q Consensus 78 t~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~a 157 (337) ... +......+...++...+..++.---+.|+.+.|+.. ..+|+..+.+.+.++++.-.-.--++|.-....... T Consensus 181 v~E-~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~-- 255 (415) T protein:vir:98 181 VEE-LEENPELAVKPFFQLAYDINTHRGYFRISREAIEDA--KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGST-- 255 (415) T ss_pred ecc-ccccCcccccceeeEEeeeeeeEeeehhhHHHHhhc--hHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccc-- Confidence 211 122222333456677777777776788999998863 257889999999988877555555565433221110 Q ss_pred hhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHH Q lcl|Aclame:pro 158 NPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKY 237 (337) Q Consensus 158 nPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~ 237 (337) -.++ ............ .+.|.++ +++..+.+++++.. +++|.+..+..-. 
T Consensus 256 -------~~~~----------------~~~~~~~~~~~~---~~~~~i~-~~~~~~~~~~~~~~---~~v~n~~~~~~l~ 305 (415) T protein:vir:98 256 -------SSGF----------------EKEGKKLEVKKA---KSLDDIK-DAINLNVKPNYEHN---VAIVSQTMFAKLD 305 (415) T ss_pred -------cccc----------------cccccccccccc---cchhHHH-HHHHhhhhhccCCC---EEEEcHHHHHHHH Confidence 0000 000001111122 3355554 66776766655443 7899998876321 Q ss_pred HHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCc-----eEEecchhcEEEEecCceEEEEEEcccccceecee Q lcl|Aclame:pro 238 FPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRA-----LMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYE 312 (337) Q Consensus 238 ~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~-----iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~y~ 312 (337) .+-...+.|-=.. .-.-....+|-|+|++..|++|... +++-.++++-+.+.++..+-.+.+.....+.---+ T Consensus 306 -~lkd~~G~~l~~~-~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ 383 (415) T protein:vir:98 306 -KMKDKLGNYLIQP-DVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIA 383 (415) T ss_pred -HhhccCCceeecc-CcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEeccccCceEEEEE Confidence 2222211111000 0000124589999999999999765 78888888766666555554443322111111111 Q ss_pred eeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 313 SSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 313 s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) - --+..|-+..+++.++--.-+.- T Consensus 384 ~-r~d~~v~~~~a~~~~~~~~~~~~ 407 (415) T protein:vir:98 384 V-RQDCRILDYKSAIVIEYDDSERG 407 (415) T ss_pred E-EeccEEeccccEEEEEEeccCCC Confidence 1 12445556666666542222221 No 26 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=98.62 E-value=4.3e-08 Score=60.96 Aligned_cols=299 Identities=11% Similarity=0.079 Sum_probs=157.7 Q ss_pred CChHHHHHHHHHHHHHHHh--hCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccc-cccccc Q lcl|Aclame:pro 1 
MRKETRQAYEKYAAQIAKL--NDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVS-GPIASR 77 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~--ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~-g~ia~R 77 (337) +....+..|..++...... .++....-.+.|-..+...+.+.+.+.+.+++.+++++++...|.....-.+ ++-+.- T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (415) T protein:vir:81 101 VTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEK 180 (415) T ss_pred hHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCcccee Confidence 2222333333333222111 1121222233444467889999999999999999999999888876544322 222222 Q ss_pred cCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhh Q lcl|Aclame:pro 78 TDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQA 157 (337) Q Consensus 78 t~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~a 157 (337) ... +......+...++...+..++.---+.|+.+.|+.. ..+|+..+.+.+.++++.-.-.--++|.-....... T Consensus 181 v~E-~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~-- 255 (415) T protein:vir:81 181 VEE-LEENPELAVKPFFQLAYDINTHRGYFRISREAIEDA--KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGST-- 255 (415) T ss_pred ecc-ccccCcccccceeeEEeeeeeeEeeehhhHHHHhhc--hHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccc-- Confidence 211 122222333456677777777776788999998863 257889999999988877555555565433221110 Q ss_pred hhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHH Q lcl|Aclame:pro 158 NPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKY 237 (337) Q Consensus 158 nPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~ 237 (337) -.++ ............ .+.|.++ +++..+.+++++.. +++|.+..+..-. 
T Consensus 256 -------~~~~----------------~~~~~~~~~~~~---~~~~~i~-~~~~~~~~~~~~~~---~~v~n~~~~~~l~ 305 (415) T protein:vir:81 256 -------SSGF----------------EKEGKKLEVKKA---KSLDDIK-DAINLNVKPNYEHN---VAIVSQTMFAKLD 305 (415) T ss_pred -------cccc----------------cccccccccccc---cchhHHH-HHHHhhhhhccCCC---EEEEcHHHHHHHH Confidence 0000 000001111122 3355554 66776766655443 7899998876321 Q ss_pred HHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCc-----eEEecchhcEEEEecCceEEEEEEcccccceecee Q lcl|Aclame:pro 238 FPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRA-----LMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYE 312 (337) Q Consensus 238 ~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~-----iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~y~ 312 (337) .+-...+.|-=.. .-.-....+|-|+|++..|++|... +++-.++++-+.+.++..+-.+.+.....+.---+ T Consensus 306 -~lkd~~G~~l~~~-~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ 383 (415) T protein:vir:81 306 -KMKDKLGNYLIQP-DVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIA 383 (415) T ss_pred -HhhccCCceeecc-CcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEeccccCceEEEEE Confidence 2222211111000 0000124589999999999999765 78888888766666555554443322111111111 Q ss_pred eeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 313 SSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 313 s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) - --+..|-+..+++.++--.-+.- T Consensus 384 ~-r~d~~v~~~~a~~~~~~~~~~~~ 407 (415) T protein:vir:81 384 V-RQDCRILDYKSAIVIEYDDSERG 407 (415) T ss_pred E-EeccEEeccccEEEEEEeccCCC Confidence 1 12445556666666542222221 No 27 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=98.62 E-value=4.3e-08 Score=60.96 Aligned_cols=299 Identities=11% Similarity=0.079 Sum_probs=157.7 Q ss_pred CChHHHHHHHHHHHHHHHh--hCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccc-cccccc Q lcl|Aclame:pro 1 
MRKETRQAYEKYAAQIAKL--NDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVS-GPIASR 77 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~--ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~-g~ia~R 77 (337) +....+..|..++...... .++....-.+.|-..+...+.+.+.+.+.+++.+++++++...|.....-.+ ++-+.- T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (415) T protein:vir:79 101 VTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEK 180 (415) T ss_pred hHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCcccee Confidence 2222333333333222111 1121222233444467889999999999999999999999888876544322 222222 Q ss_pred cCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhh Q lcl|Aclame:pro 78 TDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQA 157 (337) Q Consensus 78 t~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~a 157 (337) ... +......+...++...+..++.---+.|+.+.|+.. ..+|+..+.+.+.++++.-.-.--++|.-....... T Consensus 181 v~E-~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~-- 255 (415) T protein:vir:79 181 VEE-LEENPELAVKPFFQLAYDINTHRGYFRISREAIEDA--KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGST-- 255 (415) T ss_pred ecc-ccccCcccccceeeEEeeeeeeEeeehhhHHHHhhc--hHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccc-- Confidence 211 122222333456677777777776788999998863 257889999999988877555555565433221110 Q ss_pred hhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHH Q lcl|Aclame:pro 158 NPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKY 237 (337) Q Consensus 158 nPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~ 237 (337) -.++ ............ .+.|.++ +++..+.+++++.. +++|.+..+..-. 
T Consensus 256 -------~~~~----------------~~~~~~~~~~~~---~~~~~i~-~~~~~~~~~~~~~~---~~v~n~~~~~~l~ 305 (415) T protein:vir:79 256 -------SSGF----------------EKEGKKLEVKKA---KSLDDIK-DAINLNVKPNYEHN---VAIVSQTMFAKLD 305 (415) T ss_pred -------cccc----------------cccccccccccc---cchhHHH-HHHHhhhhhccCCC---EEEEcHHHHHHHH Confidence 0000 000001111122 3355554 66776766655443 7899998876321 Q ss_pred HHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCc-----eEEecchhcEEEEecCceEEEEEEcccccceecee Q lcl|Aclame:pro 238 FPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRA-----LMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYE 312 (337) Q Consensus 238 ~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~-----iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~y~ 312 (337) .+-...+.|-=.. .-.-....+|-|+|++..|++|... +++-.++++-+.+.++..+-.+.+.....+.---+ T Consensus 306 -~lkd~~G~~l~~~-~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ 383 (415) T protein:vir:79 306 -KMKDKLGNYLIQP-DVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIA 383 (415) T ss_pred -HhhccCCceeecc-CcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEeccccCceEEEEE Confidence 2222211111000 0000124589999999999999765 78888888766666555554443322111111111 Q ss_pred eeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 313 SSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 313 s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) - --+..|-+..+++.++--.-+.- T Consensus 384 ~-r~d~~v~~~~a~~~~~~~~~~~~ 407 (415) T protein:vir:79 384 V-RQDCRILDYKSAIVIEYDDSERG 407 (415) T ss_pred E-EeccEEeccccEEEEEEeccCCC Confidence 1 12445556666666542222221 No 28 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=98.60 E-value=4.1e-08 Score=61.09 Aligned_cols=297 Identities=11% Similarity=0.085 Sum_probs=157.9 Q ss_pred CChHHHHHHHHHHHHHHH--hhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecc-cccccccc Q lcl|Aclame:pro 1 
MRKETRQAYEKYAAQIAK--LNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLS-VSGPIASR 77 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~--~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lg-v~g~ia~R 77 (337) +...-+..|..++..... ..+.....-.+.|-+.+...+.+.+.+.+.+++.+++++++...|...... .+++-++- T Consensus 101 ~~~~e~~~~~~~~~~~~~~~~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (415) T protein:vir:94 101 VTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEK 180 (415) T ss_pred hhHHHHHHHHHHhhhhhhhhhhccccccccccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEeecCCcccee Confidence 222223334333333221 111112223445555678899999999999999999999987777754332 23333322 Q ss_pred cCCCCccccc-ccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChh Q lcl|Aclame:pro 78 TDTTKAARQP-IDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQ 156 (337) Q Consensus 78 t~t~~~~R~p-~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~ 156 (337) .. .+...| .+...++...+..++.---+.|+.+.|+.-. .+|+..+.+.+.++++.-.-.--++|.-...... T Consensus 181 v~--Eg~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~--~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~-- 254 (415) T protein:vir:94 181 VE--ELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMARTIAATRNKAIIDVITKGSTGS-- 254 (415) T ss_pred cc--ccccccccccccceeeEeeheeeeeechhhHHHHhhch--HHHHHHHHHHHHHHHHHHHHHHHhhccccCcccc-- Confidence 22 222222 3344577777777777777788999888543 6899999999999888766555566643322111 Q ss_pred hhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHH Q lcl|Aclame:pro 157 ANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDK 236 (337) Q Consensus 157 anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k 236 (337) ...++.. ............| |.+ .+++..+.+++++.. 
+++|.+.....- T Consensus 255 -------~~~~~~~----------------~~~~~~~~~~~~~---~~i-~~~~~~~~~~~~~~~---~~vmn~~~~~~l 304 (415) T protein:vir:94 255 -------TSSGFEK----------------EGKKLEVKKAKSL---DDI-KDAINLNVKPNYEHN---VAIVSQTMFAKL 304 (415) T ss_pred -------ccccccc----------------cccccccccccch---HHH-HHHHHhhhhhccCCC---EEEEcHHHHHHH Confidence 0111100 0001111222334 433 356666666665543 788998776522 Q ss_pred HHHHHhccCChHHHHHHHH-HHhhhhhcCccccccCccCCCc-----eEEecchhcEEEEecCceEEEEEEcccccceec Q lcl|Aclame:pro 237 YFPIVNATQAPTERLAADL-IVSQKRIGNLPAVRVPFFPKRA-----LMVTKLSNLSIYYQEGARRRTLKEVPERDRIEN 310 (337) Q Consensus 237 ~~~l~n~~~~ptE~~A~~~-~~~~k~iGGlpa~~vPffP~~~-----iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~ 310 (337) ..+-...+.|- ..... -....+|-|+|++..|++|.+. +++-.++++-+.+.++..+-...+.. .+..-. T Consensus 305 -~~lkd~~G~~l--~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~-~~~~~~ 380 (415) T protein:vir:94 305 -DKMKDKLGNYL--IQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM-HFGECL 380 (415) T ss_pred -HHhhccCCCee--eccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEeccc-cCceEE Confidence 22222222210 00000 0124578899999999999776 78888898765555454443333321 111111 Q ss_pred eeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 311 YESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 311 y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) .-..--+..|-+..+++.++--+-+.- T Consensus 381 r~~~r~d~~~~~~~a~~~~~~~~~~~~ 407 (415) T protein:vir:94 381 MIAVRQDCRILDYKSAIVIEYDDSERG 407 (415) T ss_pred EEEEEeccEEeccccEEEEEEeccCCC Confidence 111123455666666666653222222 No 29 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=98.50 E-value=3.7e-08 Score=61.33 Aligned_cols=280 Identities=14% Similarity=0.067 Sum_probs=161.3 Q ss_pred CChHHHHHHHHHHHHHHHhhCch-hhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccC Q lcl|Aclame:pro 1 
MRKETRQAYEKYAAQIAKLNDTG-DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTD 79 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~-~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~ 79 (337) |--.+. ..-++. ...-.+.|-+.+.+.+.+.+.+.+.+++.++++++.--.. ++-.-.+++.++-.. T Consensus 1 ma~~~~-----------~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-~ip~~~~~~~a~~v~ 68 (304) T protein:vir:10 1 MATPTY-----------TPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKK-KFTYLAKGVGAYWVS 68 (304) T ss_pred Cccccc-----------ccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCce-EEEEEeCCcceEEee Confidence 222221 111121 1223567888888999999999999999999998764322 222222344443332 Q ss_pred CCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhh Q lcl|Aclame:pro 80 TTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANP 159 (337) Q Consensus 80 t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anP 159 (337) . ....|..-..++...+..++.---+.|+.+.|..= ..+|+..+.+.+.++++.-.-.-.+||+-....+....+. T Consensus 69 E--~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~ 144 (304) T protein:vir:10 69 E--TERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWT--AKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKP 144 (304) T ss_pred c--CcccccccceeeEEEEEEEEEEEeehhhHHHHhcc--hHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccc Confidence 2 22334444667778888888777888888887732 3789999999999999999888889996543322111111 Q ss_pred hhhccchhHHHHHHHhchhhhccccccccCceee-cCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHH Q lcl|Aclame:pro 160 LLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLV-GKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYF 238 (337) Q Consensus 160 llqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~-g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~ 238 (337) .+..+ . ...... +..-.|.+|-. ++..+. +.++... +++|.+..+..- . 
T Consensus 145 ~~~~~--------------------~--~~~~~~~~~~~~~~~i~~----~~~~l~-~~~~~~~--~~v~~~~~~~~L-~ 194 (304) T protein:vir:10 145 LVEGA--------------------E--EKGNVVTDTNNLYVDLSA----LMATIE-DEELDPN--GVLTTRSFRSKM-R 194 (304) T ss_pred ccccc--------------------c--ccccccccccchHHHHHH----HHHHhh-hccCCcC--EEEEcHHHHHHH-H Confidence 11111 0 000111 11112555444 444333 3333332 788999888742 2 Q ss_pred HHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCc----eEEecchhcEEEEecCceEEEEEEcc----------- Q lcl|Aclame:pro 239 PIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRA----LMVTKLSNLSIYYQEGARRRTLKEVP----------- 303 (337) Q Consensus 239 ~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~----iliT~l~NLsiY~Q~gs~RR~~~d~p----------- 303 (337) .+-.....|- ......++-|+|++..+++|... +++..++++- +...+..+-.+.+++ T Consensus 195 ~lkd~~G~~l------~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~gd~~~~~-~~~~~~~~i~~~~e~~~~~~~~~~~~ 267 (304) T protein:vir:10 195 NALDANDRPL------FDANGNEIMGLPLSYTGADVYDKKKSLALMGDWDYAR-YGILQGIEYAISEDATLTTLQASDAS 267 (304) T ss_pred HhhccCCcEe------ecCCCccccceeeEEecccccCCCCcEEEEEehhhEE-EEEecceEEEEeecceeeeecccccC Confidence 3333322221 11123578899999999999665 8888999874 444444444444432 Q ss_pred -------cccceeceeeeeeeeeeeccccEEEeecce Q lcl|Aclame:pro 304 -------ERDRIENYESSNDAYVVEDFGCGCVAENIE 333 (337) Q Consensus 304 -------~r~rve~y~s~Ne~YvVEd~~~~a~ieni~ 333 (337) ++|++.-.-..--++.|.+.++++.+...+ T Consensus 268 g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 268 GQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred ccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 223333223333566778888888776666 No 30 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=98.50 E-value=3.7e-08 Score=61.33 Aligned_cols=280 Identities=14% Similarity=0.067 Sum_probs=161.3 Q ss_pred CChHHHHHHHHHHHHHHHhhCch-hhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccC Q lcl|Aclame:pro 1 
MRKETRQAYEKYAAQIAKLNDTG-DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTD 79 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~-~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~ 79 (337) |--.+. ..-++. ...-.+.|-+.+.+.+.+.+.+.+.+++.++++++.--.. ++-.-.+++.++-.. T Consensus 1 ma~~~~-----------~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-~ip~~~~~~~a~~v~ 68 (304) T protein:vir:94 1 MATPTY-----------TPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKK-KFTYLAKGVGAYWVS 68 (304) T ss_pred Cccccc-----------ccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCce-EEEEEeCCcceEEee Confidence 222221 111121 1223567888888999999999999999999998764322 222222344443332 Q ss_pred CCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhh Q lcl|Aclame:pro 80 TTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANP 159 (337) Q Consensus 80 t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anP 159 (337) . ....|..-..++...+..++.---+.|+.+.|..= ..+|+..+.+.+.++++.-.-.-.+||+-....+....+. T Consensus 69 E--~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~ 144 (304) T protein:vir:94 69 E--TERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWT--AKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKP 144 (304) T ss_pred c--CcccccccceeeEEEEEEEEEEEeehhhHHHHhcc--hHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccc Confidence 2 22334444667778888888777888888887732 3789999999999999999888889996543322111111 Q ss_pred hhhccchhHHHHHHHhchhhhccccccccCceee-cCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHH Q lcl|Aclame:pro 160 LLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLV-GKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYF 238 (337) Q Consensus 160 llqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~-g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~ 238 (337) .+..+ . ...... +..-.|.+|-. ++..+. +.++... +++|.+..+..- . 
T Consensus 145 ~~~~~--------------------~--~~~~~~~~~~~~~~~i~~----~~~~l~-~~~~~~~--~~v~~~~~~~~L-~ 194 (304) T protein:vir:94 145 LVEGA--------------------E--EKGNVVTDTNNLYVDLSA----LMATIE-DEELDPN--GVLTTRSFRSKM-R 194 (304) T ss_pred ccccc--------------------c--ccccccccccchHHHHHH----HHHHhh-hccCCcC--EEEEcHHHHHHH-H Confidence 11111 0 000111 11112555444 444333 3333332 788999888742 2 Q ss_pred HHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCc----eEEecchhcEEEEecCceEEEEEEcc----------- Q lcl|Aclame:pro 239 PIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRA----LMVTKLSNLSIYYQEGARRRTLKEVP----------- 303 (337) Q Consensus 239 ~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~----iliT~l~NLsiY~Q~gs~RR~~~d~p----------- 303 (337) .+-.....|- ......++-|+|++..+++|... +++..++++- +...+..+-.+.+++ T Consensus 195 ~lkd~~G~~l------~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~gd~~~~~-~~~~~~~~i~~~~e~~~~~~~~~~~~ 267 (304) T protein:vir:94 195 NALDANDRPL------FDANGNEIMGLPLSYTGADVYDKKKSLALMGDWDYAR-YGILQGIEYAISEDATLTTLQASDAS 267 (304) T ss_pred HhhccCCcEe------ecCCCccccceeeEEecccccCCCCcEEEEEehhhEE-EEEecceEEEEeecceeeeecccccC Confidence 3333322221 11123578899999999999665 8888999874 444444444444432 Q ss_pred -------cccceeceeeeeeeeeeeccccEEEeecce Q lcl|Aclame:pro 304 -------ERDRIENYESSNDAYVVEDFGCGCVAENIE 333 (337) Q Consensus 304 -------~r~rve~y~s~Ne~YvVEd~~~~a~ieni~ 333 (337) ++|++.-.-..--++.|.+.++++.+...+ T Consensus 268 g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 268 GQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred ccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 223333223333566778888888776666 No 31 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=98.49 E-value=1.5e-07 Score=57.93 Aligned_cols=296 Identities=12% Similarity=0.100 Sum_probs=156.7 Q ss_pred CChHHHHHHHHHHHHHH--------------HhhCc-hhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhcee Q lcl|Aclame:pro 1 
MRKETRQAYEKYAAQIA--------------KLNDT-GDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEK 65 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a--------------~~ngv-~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~ 65 (337) ...+.++.|.+|+.... +..++ .+..-.|.|-+.....+.+.+++.+.+++.++++++..-.... T Consensus 84 ~~~~~~~a~~~~l~~~~~~~~~~e~~~~~~~~a~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 163 (409) T protein:vir:45 84 QDEKRAQVFDKWMRHGASELTSEERKALRELRAQGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILTTSDGRTME 163 (409) T ss_pred hhHHHHHHHHHHHHhhhhhccHHHHHHHHHHhhccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEE Confidence 22333444555654321 11221 1222346677777788999999999999999999986533222 Q ss_pred e-ecccccccccccCCCCcccccccccccCCceeEEEEeee-eeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhc Q lcl|Aclame:pro 66 L-GLSVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDY-DTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIG 143 (337) Q Consensus 66 v-~lgv~g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~-d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IG 143 (337) + ..+..+..+. -.+.....|..-..++.....-++.-. -+.|+.+.|+... ++|+..+.+.+++++++-.-.-- T Consensus 164 ~~~~~~~~~~~~--~v~E~~~~~~~~~~f~~~~l~~~k~~~~~i~is~ell~ds~--~~l~~~i~~~la~a~~~~~~~a~ 239 (409) T protein:vir:45 164 WATADGTSEVGV--LLGENEEAGEEDTDFGMGSLGALKMTSKIIRVSNELLQDSA--IDMEAYLARRIAERIGRGEARYL 239 (409) T ss_pred EEeeccCccccc--cccccccccccccccceeeeeeeeeeeeehhhhHHHHhccH--HHHHHHHHHHHHHHHHHHHHHHh Confidence 2 1111112221 122222333333334444443333322 2358999998853 79999999999999998777777 Q ss_pred ccccccCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCC Q lcl|Aclame:pro 144 WNGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTG 223 (337) Q Consensus 144 fnG~s~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~ 223 (337) +||+-...+. .| .-++.... +....+..++ .+.|.++ +++.. |++.|+..+. 
T Consensus 240 l~G~G~~~~~----~p------------------~Gil~~~~---~~~~~~~~~~-~~~d~i~-~l~~~-l~~~~~~~a~ 291 (409) T protein:vir:45 240 IQGTGAGTPK----QP------------------KGLAASVT---GTTQTAAANA-VKWQEIL-ALKHS-IDPAYRRGPK 291 (409) T ss_pred hccCCCCCcc----cc------------------ceeeeccc---cccccccccc-cchHHHH-HHHHh-hhhhhccCCe Confidence 7886543322 12 12222111 1111122222 3445444 56654 5788899899 Q ss_pred EEEEECHHHHHHHHHHHH-hccCChH--HHHHHHHHHhhhhhcCccccccCccCC-----CceEEecchhcEEEEecCce Q lcl|Aclame:pro 224 LVVICGRELLHDKYFPIV-NATQAPT--ERLAADLIVSQKRIGNLPAVRVPFFPK-----RALMVTKLSNLSIYYQEGAR 295 (337) Q Consensus 224 LVvivG~dLl~~k~~~l~-n~~~~pt--E~~A~~~~~~~k~iGGlpa~~vPffP~-----~~iliT~l~NLsiY~Q~gs~ 295 (337) .+++|.+..+.. +..+ ...+.|- .-... ....++-|+|++...++|. ..+++=.+++.-|..+.+.. T Consensus 292 ~~~~~n~~~~~~--l~~lkd~~G~~i~~~~~~~---~~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~i~~~~~~~ 366 (409) T protein:vir:45 292 FRLAFNDNTLKL--ISEMEDGQGRPLWLPDIVG---VAPASVLNVPYVIDQEIDDIGAGKKFMFCGDFDRFIIRRVRYMI 366 (409) T ss_pred EEEEECHHHHHH--HHHhhcCCCceeeccCcCC---CCCceecceeeEEecCcCCccCCccEEEEeehhhhheeeccceE Confidence 999999988753 3333 2222221 00001 1235788999999999996 34666677776555443333 Q ss_pred EEEEEEcc--cccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 296 RRTLKEVP--ERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 296 RR~~~d~p--~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) -+.. +++ +++.+--+-..--++.|-|.+.++. 
+++..| T Consensus 367 ~~~~-~d~~~~~~~~~~~~~~r~d~~~~~~~A~~~---l~~k~s 406 (409) T protein:vir:45 367 LKRL-VERYAEYDQTGFLAFHRFDCILEDTSAIKA---LVGKGS 406 (409) T ss_pred EEEe-ecccccCCcEEEEEEEEeccEeechhheEE---EEeccC Confidence 2222 222 2233322222233444555554443 334444 No 32 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=98.47 E-value=2.4e-07 Score=56.92 Aligned_cols=295 Identities=11% Similarity=0.085 Sum_probs=154.6 Q ss_pred CChHHHHHHHHHHHHHHHh--hCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeec-ccccccccc Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKL--NDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGL-SVSGPIASR 77 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~--ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~l-gv~g~ia~R 77 (337) +....+..|..+....... .++....-.+.|-..+...+.+.+.+.+.+++.+++++++...|..... ..+++-++- T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (415) T protein:vir:46 101 VTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEK 180 (415) T ss_pred hhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceee Confidence 2222333343333322211 1111112233455567788999999999999999999999887765332 222333333 Q ss_pred cCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhh Q lcl|Aclame:pro 78 TDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQA 157 (337) Q Consensus 78 t~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~a 157 (337) ...+ ......+...++...+..++.---+.|+.+.|+... .+|+..+.+.+.++++.-.-.--++|.-...... 
T Consensus 181 v~Eg-~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~--- 254 (415) T protein:vir:46 181 VEEL-EENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMARTIAATRNKAIIDVITKGSTGS--- 254 (415) T ss_pred cccc-cccccccccceeeEEeeeeeeEeeehhhHHHHhhch--HHHHHHHHHHHHHHHHHHHHHHHhhccccCCccc--- Confidence 3221 112123344577778888877777889999997643 5889999999999988766666666643222110 Q ss_pred hhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHH Q lcl|Aclame:pro 158 NPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKY 237 (337) Q Consensus 158 nPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~ 237 (337) . ..++ .............. .|.+ .+++..+.++++... +++|.+..+.. T Consensus 255 --~----~~~~----------------~~~~~~~~~~~~~~---~~~i-~~~~~~~~~~~~~~~---~~v~n~~~~~~-- 303 (415) T protein:vir:46 255 --T----SSGF----------------EKEGKKLEVKKAKS---LDDI-KDAINLNVKPNYEHN---VAIVSQTMFAK-- 303 (415) T ss_pred --c----cccc----------------ccccceeccccccc---hHHH-HHHHHhhhhhccCCC---EEEEcHHHHHH-- Confidence 0 0000 00000111112223 4433 366666666665443 78899988763 Q ss_pred HHHHh-ccCChHHHHHHHHHHhhhhhcCccccccCccCCCc-----eEEecchhcEEEEecCceEEEEEEcccccceece Q lcl|Aclame:pro 238 FPIVN-ATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRA-----LMVTKLSNLSIYYQEGARRRTLKEVPERDRIENY 311 (337) Q Consensus 238 ~~l~n-~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~-----iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~y 311 (337) +..+. ..+.|-=.... .-....+|-|+|++..|++|... +++=.++++.+.+.+....-...+... +..-.. 
T Consensus 304 L~~lkd~~G~~i~~~~~-~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~-~~~~~~ 381 (415) T protein:vir:46 304 LDKMKDKLGNYLIQPDV-KEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMH-FGECLM 381 (415) T ss_pred HHHhhccCCCeeeccCc-CCCCCccccceeeEEeccccccCCCccEEEEEehhccEEEEeecceEEEeecccc-CceEEE Confidence 22232 22111100000 01134588999999999999654 777788876555554444333332211 111111 Q ss_pred eeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 312 ESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 312 ~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) -..--+..|-+.++++.+ ++..+ T Consensus 382 ~~~r~d~~v~~~~a~~~~---~~~~~ 404 (415) T protein:vir:46 382 IAVRQDCRILDYKSAIVI---EYDDS 404 (415) T ss_pred EEEEeccEEeccccEEEE---Eeecc Confidence 111224445555555554 33333 No 33 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=98.47 E-value=2.4e-07 Score=56.92 Aligned_cols=295 Identities=11% Similarity=0.085 Sum_probs=154.6 Q ss_pred CChHHHHHHHHHHHHHHHh--hCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeec-ccccccccc Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKL--NDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGL-SVSGPIASR 77 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~--ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~l-gv~g~ia~R 77 (337) +....+..|..+....... .++....-.+.|-..+...+.+.+.+.+.+++.+++++++...|..... 
..+++-++- T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (415) T protein:vir:47 101 VTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEK 180 (415) T ss_pred hhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceee Confidence 2222333343333322211 1111112233455567788999999999999999999999887765332 222333333 Q ss_pred cCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhh Q lcl|Aclame:pro 78 TDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQA 157 (337) Q Consensus 78 t~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~a 157 (337) ...+ ......+...++...+..++.---+.|+.+.|+... .+|+..+.+.+.++++.-.-.--++|.-...... T Consensus 181 v~Eg-~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~--- 254 (415) T protein:vir:47 181 VEEL-EENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMARTIAATRNKAIIDVITKGSTGS--- 254 (415) T ss_pred cccc-cccccccccceeeEEeeeeeeEeeehhhHHHHhhch--HHHHHHHHHHHHHHHHHHHHHHHhhccccCCccc--- Confidence 3221 112123344577778888877777889999997643 5889999999999988766666666643222110 Q ss_pred hhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHH Q lcl|Aclame:pro 158 NPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKY 237 (337) Q Consensus 158 nPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~ 237 (337) . ..++ .............. .|.+ .+++..+.++++... +++|.+..+.. 
T Consensus 255 --~----~~~~----------------~~~~~~~~~~~~~~---~~~i-~~~~~~~~~~~~~~~---~~v~n~~~~~~-- 303 (415) T protein:vir:47 255 --T----SSGF----------------EKEGKKLEVKKAKS---LDDI-KDAINLNVKPNYEHN---VAIVSQTMFAK-- 303 (415) T ss_pred --c----cccc----------------ccccceeccccccc---hHHH-HHHHHhhhhhccCCC---EEEEcHHHHHH-- Confidence 0 0000 00000111112223 4433 366666666665443 78899988763 Q ss_pred HHHHh-ccCChHHHHHHHHHHhhhhhcCccccccCccCCCc-----eEEecchhcEEEEecCceEEEEEEcccccceece Q lcl|Aclame:pro 238 FPIVN-ATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRA-----LMVTKLSNLSIYYQEGARRRTLKEVPERDRIENY 311 (337) Q Consensus 238 ~~l~n-~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~-----iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~y 311 (337) +..+. ..+.|-=.... .-....+|-|+|++..|++|... +++=.++++.+.+.+....-...+... +..-.. T Consensus 304 L~~lkd~~G~~i~~~~~-~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~-~~~~~~ 381 (415) T protein:vir:47 304 LDKMKDKLGNYLIQPDV-KEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMH-FGECLM 381 (415) T ss_pred HHHhhccCCCeeeccCc-CCCCCccccceeeEEeccccccCCCccEEEEEehhccEEEEeecceEEEeecccc-CceEEE Confidence 22232 22111100000 01134588999999999999654 777788876555554444333332211 111111 Q ss_pred eeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 312 ESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 312 ~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) -..--+..|-+.++++.+ ++..+ T Consensus 382 ~~~r~d~~v~~~~a~~~~---~~~~~ 404 (415) T protein:vir:47 382 IAVRQDCRILDYKSAIVI---EYDDS 404 (415) T ss_pred EEEEeccEEeccccEEEE---Eeecc Confidence 111224445555555554 33333 No 34 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=98.46 E-value=8.3e-08 Score=59.42 Aligned_cols=281 Identities=10% Similarity=-0.004 Sum_probs=167.5 Q ss_pred HHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCCCCcccccccccccCC Q lcl|Aclame:pro 16 
IAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDTTKAARQPIDPTALDS 95 (337) Q Consensus 16 ~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t~~~~R~p~~~~~l~~ 95 (337) ||- +-.+.|-|...+.+.+.++++|.+++..+++++.--+. ++-.-.+++-++-... +...|..-..++. T Consensus 1 ma~-------~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~-~~p~~~~~~~a~~v~E--g~~~~~~~~~f~~ 70 (298) T protein:vir:94 1 MVL-------NKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGE-KVFTFTMDSEIDVVAE--SGKKTHGGVTLAP 70 (298) T ss_pred Cee-------ccccccChhHHHHHHHHHHhhchhhhhcceeeccCCce-EEEEEecCcceEEeeC--CccccccccceeE Confidence 332 22346778889999999999999999999998865322 3333234444544432 2333444456788 Q ss_pred ceeEEEEeeeeeecCHHHHHHHh-CChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhhhhhccchhHHHHHHH Q lcl|Aclame:pro 96 NRYRCEKTDYDTAIPYRKLDMWA-KFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYRE 174 (337) Q Consensus 96 ~~Y~c~qtn~d~~i~y~~LD~WA-~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anPllqDVN~GWlq~~Re 174 (337) ....+++.---+.|+.+.|.+.. ...+|.+.+.+.++++++...-.--+||+....-++.. +. ..-++.... T Consensus 71 v~l~~~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~----~~-~~~~~~~~~-- 143 (298) T protein:vir:94 71 QTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASA----VI-GTNHFDSKV-- 143 (298) T ss_pred EEEeeeEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccc----cc-ccccccccc-- Confidence 88888888889999999997665 35789999999999999988877778885322211110 00 000111110 Q ss_pred hchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHHHHHhccCChHHHHHHH Q lcl|Aclame:pro 175 RAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNATQAPTERLAAD 254 (337) Q Consensus 175 ~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~~l~n~~~~ptE~~A~~ 254 (337) ...+ ..+..-..++..+.+++..+.+..+.. . +++|.+.....- ..|-.....|-=.. .. 
T Consensus 144 -------------~~~~--~~~~~~~~~~~~i~~~~~~~~~~~~~~--~-~~vmn~~~~~~l-~~lkd~~G~~l~~~-~~ 203 (298) T protein:vir:94 144 -------------TQKV--EAPRGIADPNGAIENAVELLTGVDADV--T-GIAINPSFRSAL-AKQKDLQGNALFPE-LK 203 (298) T ss_pred -------------cccc--ccccccccHHHHHHHHHHhhhhcCCCc--c-EEEEcHHHHHHH-HHhhccCCCeeecC-cc Confidence 0011 111222344555666665443333222 2 688988777632 22322222221000 00 Q ss_pred HHHhhhhhcCccccccCccCCC------ceEEecchhcEEEEecCceEEEEEEcccccce-eceeeee---------eee Q lcl|Aclame:pro 255 LIVSQKRIGNLPAVRVPFFPKR------ALMVTKLSNLSIYYQEGARRRTLKEVPERDRI-ENYESSN---------DAY 318 (337) Q Consensus 255 ~~~~~k~iGGlpa~~vPffP~~------~iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rv-e~y~s~N---------e~Y 318 (337) .-....++-|+|++..+++|.+ .+++-.++++-.|..++..+-.+.+..+-++. .+|..+| -++ T Consensus 204 ~~~~~~tl~G~PV~~~~~v~~~~~~~~~~~~~Gdfs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~ 283 (298) T protein:vir:94 204 WGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGW 283 (298) T ss_pred cCCCCceecceeeEEecccccccCCCccEEEEeeccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEecc Confidence 0012357889999999999975 47778888887777676666666553322221 1222222 467 Q ss_pred eeeccccEEEeecce Q lcl|Aclame:pro 319 VVEDFGCGCVAENIE 333 (337) Q Consensus 319 vVEd~~~~a~ieni~ 333 (337) .|.+.++++.+.+++ T Consensus 284 ~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 284 GILDATKFARVTEAN 298 (298) T ss_pred EeecccceEEEEecC Confidence 888999999999888 No 35 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=98.45 E-value=2.1e-07 Score=57.23 Aligned_cols=293 Identities=10% Similarity=-0.040 Sum_probs=155.2 Q ss_pred CChHHHHHHHHHHHHHHHh------------hC---chhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhcee Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKL------------ND---TGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEK 65 (337) Q Consensus 1 
M~~~tr~~~~~y~~~~a~~------------ng---v~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~ 65 (337) -.......+..+....... +. .......+.|-|...+.+.+.+++.+.+++.++++++..-.... T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~ 159 (390) T protein:vir:97 80 DMFVASEQFQASTGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEY 159 (390) T ss_pred hhhhhhHHHHHHHHHhhhhhhhhhhHHHHHHHhhhcccccccccccchhhhHHHHHHHhhhhhhHhhcceeeccCCceEE Confidence 0000011122222211111 11 11223345678888899999999999999999999987655555 Q ss_pred eecccccccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhccc Q lcl|Aclame:pro 66 LGLSVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWN 145 (337) Q Consensus 66 v~lgv~g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfn 145 (337) .......+-++-+. .+.. .|..-..++...+..++.--.+.|+.+.|+.. ++++..+.+.+++.++.-.-.--++ T Consensus 160 ~~~~~~~~~a~~v~-Eg~~-~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds---~~l~~~i~~~la~a~~~~~d~a~l~ 234 (390) T protein:vir:97 160 VQETGFVNNAAIVA-EGAL-KPESSLKFAKKTDTTHVIAHTMKATRQILSDA---PQLASYMNNRLIRGLKVKEDAEILR 234 (390) T ss_pred EEEecCCcceeeec-CCcc-ccccccceeEEEEeeeeEEEeehhhHHHHHhH---HHHHHHHHHHHHHHHHHHHHHHHhh Confidence 54432222232222 1222 23333457777888887777788899888764 6799999999999988866666677 Q ss_pred ccccCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEE Q lcl|Aclame:pro 146 GVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLV 225 (337) Q Consensus 146 G~s~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LV 225 (337) |+-.+ .+| +|.+. ... ......+.. .-..+|. +.+++..+ ++.++... 
+ T Consensus 235 G~g~~------~~p------~Gi~~------------~~~--~~~~~~~~~-~~~~~d~-~~~~~~~~-~~~~~~~~--~ 283 (390) T protein:vir:97 235 GTGAN------DGL------LGLIP------------QAT--TYAAPTTIA-GATRVDQ-LRLAMLQA-SLAEYPAS--G 283 (390) T ss_pred cCCCC------ccc------cceee------------ccc--ccccccccc-ccchHHH-HHHHHHhh-ccccCCCC--E Confidence 73211 112 23331 111 001111111 1233443 44455555 44444332 6 Q ss_pred EEECHHHHHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEc-c- Q lcl|Aclame:pro 226 VICGRELLHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEV-P- 303 (337) Q Consensus 226 vivG~dLl~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~-p- 303 (337) ++|.+..+..- ..|-.....|=-....+ ....++-|+|++..+++|++.+++-.+++--.++.+....=.+.+. + T Consensus 284 ~v~n~~~~~~L-~~lkd~~G~~l~~~~~~--~~~~~l~G~pV~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~ 360 (390) T protein:vir:97 284 IVINPIDWAAI-ELAKDANNQYLIGNARG--TLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVNDD 360 (390) T ss_pred EEEcHHHHHHH-HHhhcCCCceeecCccC--CCCceecceeeEEcCCCCCCcEEEEeccceEEEEEecceEEEEeecccc Confidence 88898776522 22322222221000111 2356889999999999999999999998733333333333333332 2 Q ss_pred -cccceeceeeeeeeeeeeccccEEEeecceec Q lcl|Aclame:pro 304 -ERDRIENYESSNDAYVVEDFGCGCVAENIELA 335 (337) Q Consensus 304 -~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~ 335 (337) .++.+.-.-..--++.|=+...++.+ +|+ T Consensus 361 f~~~~~~~r~~~r~d~~v~~~~a~v~~---~~a 390 (390) T protein:vir:97 361 FQRNMVTVLAEERLALVVYRPEALITG---SFA 390 (390) T ss_pred cccCcEEEEEEEeeccEEeccccEEEE---EeC Confidence 24444333333334445555554443 444 No 36 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=98.42 E-value=1e-07 Score=58.87 Aligned_cols=298 Identities=13% Similarity=0.050 Sum_probs=172.4 Q ss_pred CChHHHHHHHHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCC Q lcl|Aclame:pro 1 
MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t 80 (337) |...+. |+.=...+++.. +....-.|.|.+.+.+.+.+.+.|.++++++++++.-... ++-.-.+++-++-.. T Consensus 1 ~~~~~~--~~~~~~~~~~t~---~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~-~~p~~~~~~~a~~v~- 73 (320) T protein:vir:10 1 MAAGTA--FQVDHAQIAQTG---DTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQ-KIPHWIGDVSAQWIG- 73 (320) T ss_pred CCCCcc--CCHHHHHhhccc---cccccccccHHHHHHHHHHHHhccchhhhcceeeccCCce-EEEEEeCCcceEEec- Confidence 333222 111111122111 1122235889999999999999999999999998864322 222222344443333 Q ss_pred CCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhhh Q lcl|Aclame:pro 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) Q Consensus 81 ~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anPl 160 (337) .....|..-..++...+.+++.---..|+.+.|+.= .++++..+.+.+.++++...-.--++|+-....+. T Consensus 74 -E~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds--~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~------ 144 (320) T protein:vir:10 74 -EGDMKPITKGNMTSQNIAPHKIATIFVASAETVRAN--PANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTY------ 144 (320) T ss_pred -CCccccccccceeEEEEeeEEEEEeehhhHHHHhcC--hHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCcc------ Confidence 233345555668888999999999999999998842 37899999999999999877777788864221111 Q ss_pred hhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHHHH Q lcl|Aclame:pro 161 LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPI 240 (337) Q Consensus 161 lqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~~l 240 (337) +....+..... ........+-..+|.+..++.. +++..+++ ..+++|.+.....-. 
.| T Consensus 145 ----------------~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~--~~~~v~n~~~~~~L~-~l 202 (320) T protein:vir:10 145 ----------------LAQTTKSVSLA--DPGGATASDLTAYDAVAVNGLS-LLVNAKKK--WTHTLLDDIVEPILN-GA 202 (320) T ss_pred ----------------cccccccccce--ecccccccccccHHHHHHHHHh-hhhcccCC--CcEEEEcHHHHHHHH-Hh Confidence 11111111100 0111223334446666666664 44555444 448899998866432 22 Q ss_pred HhccCChH----HHHHHHHHHhhhhhcCccccccCccCCCce--EEecchhcEEEEecCceEEEEEEcc----------- Q lcl|Aclame:pro 241 VNATQAPT----ERLAADLIVSQKRIGNLPAVRVPFFPKRAL--MVTKLSNLSIYYQEGARRRTLKEVP----------- 303 (337) Q Consensus 241 ~n~~~~pt----E~~A~~~~~~~k~iGGlpa~~vPffP~~~i--liT~l~NLsiY~Q~gs~RR~~~d~p----------- 303 (337) -.....+- -..-........++-|+|++..+++|++.. ++..++++-| ...+..+-.+.++. T Consensus 203 kd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~~~~~~~~~gd~~~~~~-~~~~~~~i~~~~~~~~~~~~~~~~~ 281 (320) T protein:vir:10 203 KDKNGRPLFIESTYTDENSPFRAGRIVSRPTILSDHVADGTTVGYMGDFRNVIW-GQVGGLSFDVTDQATLNLGTPTEPN 281 (320) T ss_pred hccCCceeeccccccCccccccCceeeeeeeEecCCCCCCceEEEEeecceEEE-EEecCeEEEEeecceeeeccccccc Confidence 22211110 000001112345789999999999999974 4577777643 33344433333222 Q ss_pred -----cccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 304 -----ERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 304 -----~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) ++|++.----.--++.|.+.++++.+.++.=.+| T Consensus 282 ~~~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~~~~ap~~ 320 (320) T protein:vir:10 282 FVSLWQHNLVAVRVEAEYAFHNNDKDAFVKLTNVVTPDA 320 (320) T ss_pred cchhhhcCcEEEEEEEeeccEEecccceEEEEeccCCCC Confidence 2333332222334788999999999999998888 No 37 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=98.42 E-value=2.8e-07 Score=56.47 Aligned_cols=291 Identities=12% Similarity=0.106 Sum_probs=145.3 Q ss_pred 
CChHHHHHH-----------HHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecc Q lcl|Aclame:pro 1 MRKETRQAY-----------EKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLS 69 (337) Q Consensus 1 M~~~tr~~~-----------~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lg 69 (337) .+...+..+ ..+..... +......-.+.|-+.+...+.+.+++.+.+++.++++++.- +.++-.- T Consensus 111 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~gg~~vP~~~~~~Ii~~l~~~~~i~~~~~~~~~~g--~~~ip~~ 186 (425) T protein:vir:95 111 NRLQVREMLKTGEYYKRSEVVEFYEKFR--NLRAVAGGELTIPEVVVNRIMDIMGDYTTLYPLVDKIRVKG--TTRILVD 186 (425) T ss_pred HHHHHHHHHhhhhhhhhhHHHHHHHHHH--hhcccccCceeccHHHHHHHHHHHHhhhhHHHhhceeecCc--eeEEEEe Confidence 000111100 00111110 00111123345555678889999999999999999988742 2233322 Q ss_pred cccccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhccccccc Q lcl|Aclame:pro 70 VSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKA 149 (337) Q Consensus 70 v~g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~ 149 (337) .+++-++=+.-+ ......+...++...+..++.---+.|+.+.|+.+. ++|+..+++.+.+.++.-.-.--++|+-. T Consensus 187 ~~~~~a~~v~E~-~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~~i~~~~d~~il~G~G~ 263 (425) T protein:vir:95 187 TDTSPATWIEQS-GALPTGDVGTIASIDFDGFKVGKVTFVDNYLLQDSI--INLDDYVTKKIARAIAKALDLAIVKGTGA 263 (425) T ss_pred cCCccccccccc-cccccccccccceeeeeheeeeeeehhhHHHHhccH--HHHHHHHHHHHHHHHHHHHHHHhhccCCC Confidence 223323222211 112222323455666666666666788999888775 37999999999999988777777788533 Q ss_pred CCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCcee-ecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEE Q lcl|Aclame:pro 150 AATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVL-VGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVIC 228 (337) Q Consensus 150 A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~-~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVviv 228 (337) ..+ -|+ |+|..+-. .. .++ .+....|.+|..+ +. 
++.+-++....++++| T Consensus 264 ~~~-----~p~------Gil~~~~~------------~~-~~~~~~~~~~~~~~~~~----~~-~~~~~~~~~~~~~~v~ 314 (425) T protein:vir:95 264 ANK-----QPL------GIIPSLPP------------EN-QVTVEADNNLLKNLVKQ----IG-LIDTGDDSVGEIVAVM 314 (425) T ss_pred Ccc-----ccc------eeeccccc------------cc-ccccccccchHHHHHHH----HH-hhhhhccccCceEEEE Confidence 211 122 55432110 00 111 1122345555443 32 3456667777889888 Q ss_pred CHHHHHHHHHHH--H-hccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcccc Q lcl|Aclame:pro 229 GRELLHDKYFPI--V-NATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPER 305 (337) Q Consensus 229 G~dLl~~k~~~l--~-n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p~r 305 (337) .+.-+-.+-..| . ...+.+--... .....++-|+|++.-+++|++.+++=.+++..|. .++...-..-++. T Consensus 315 ~~~~~~~~l~~l~~~kd~~g~~i~~~~---~~~~~~l~G~pvv~~~~~~~~~i~~Gd~~~~~~~-~~~~~~i~~~~~~-- 388 (425) T protein:vir:95 315 KRSTYYNRLVEFSIQVDSNGNVVGKLP---NLRTPDLLGLRVVFNNFLDDDTVLFGEFEQYTLV-ERENITIDSSTHV-- 388 (425) T ss_pred eChHHHHHHHHHHhhcCCCCceeeccC---CCCCccccceeeEEcCcCCCccEEEEecccEEEE-eecceEEEeeccc-- Confidence 875432211122 1 11111110000 1124467799999999999999999888884443 3333333332221 Q ss_pred cceeceeeeeeeeeeec--------cccEEEee-cceeccC Q lcl|Aclame:pro 306 DRIENYESSNDAYVVED--------FGCGCVAE-NIELAAA 337 (337) Q Consensus 306 ~rve~y~s~Ne~YvVEd--------~~~~a~ie-ni~~~~a 337 (337) .|..-.-+|.++. 
++.++.++ .-....| T Consensus 389 ----~f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~~~~~g~ 425 (425) T protein:vir:95 389 ----KFTEDQTAFRGKGRFDGKPVKPEAFVLVTITDPVQGA 425 (425) T ss_pred ----ccccCceEEEEEEeeCcEeecccceEEEEecCcCCCC Confidence 1222223444433 33333332 1111122 No 38 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=98.38 E-value=6.8e-07 Score=54.40 Aligned_cols=292 Identities=11% Similarity=-0.066 Sum_probs=152.5 Q ss_pred CChHHHHHHHHHHHHH------------HHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeec Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQI------------AKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGL 68 (337) Q Consensus 1 M~~~tr~~~~~y~~~~------------a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~l 68 (337) .+......+..+...- ..............+-|.....+.+.+.+.+.+++.++++++..-.+....+ T Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 162 (390) T protein:vir:10 83 VASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTDSALIEYVQE 162 (390) T ss_pred hhhHHHHHHHHhhhhhhhhhhhHHHHHHHhhhcccccccccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEE Confidence 1111111111111100 0011111112334567778889999999999999999999987655555544 Q ss_pred ccccccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccc Q lcl|Aclame:pro 69 SVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVK 148 (337) Q Consensus 69 gv~g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s 148 (337) ....+-++-.. 
.+......+ ..++...+..++.---+.|+.+.|+.- ++++..+.+.++++++.-.-.--++|+- T Consensus 163 ~~~~~~a~~v~-Eg~~~~~~~-~~~~~i~~~~~k~~~~~~is~ell~d~---~~l~~~i~~~l~~~~~~~~~~~il~G~G 237 (390) T protein:vir:10 163 TGFVNNAAIVA-EGALKPESS-LKFAKKTDTTHVIAHTMKATRQILSDA---PQLASYMNNRLIRGLKVKEDAEILRGTG 237 (390) T ss_pred ecCCcceeeec-CCccccccc-cceeEEEEeeEEEEEeehhhHHHHHhH---HHHHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 32222222221 222232333 457788888888888889999988863 6899999999999987743333345531 Q ss_pred cCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEE Q lcl|Aclame:pro 149 AAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVIC 228 (337) Q Consensus 149 ~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVviv 228 (337) .+ .+|.+ +++.... ..+..+..++ ...| .+.+++..+. +.++... +++| T Consensus 238 ~~------~~p~G------------------i~~~~~~--~~~~~~~~~~-~~~~-~~~~~~~~l~-~~~~~~~--~~v~ 286 (390) T protein:vir:10 238 AN------DGLLG------------------LIPQATT--YAAPTTIAGA-TRVD-QLRLAMLQAS-LAEYPAS--GIVI 286 (390) T ss_pred CC------ccccc------------------ccccccc--cccccccccc-chHH-HHHHHHHhhc-cccCCCC--EEEE Confidence 11 11222 2221110 1111122221 1234 3555666554 4444433 6778 Q ss_pred CHHHHHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchh-cEEEEecCceEEEEEEc---cc Q lcl|Aclame:pro 229 GRELLHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSN-LSIYYQEGARRRTLKEV---PE 304 (337) Q Consensus 229 G~dLl~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~N-LsiY~Q~gs~RR~~~d~---p~ 304 (337) .+..+.. -..|-.....|-=.. .......++-|+|++..+++|++.+++-.+++ .-++...|. +=.+.+. -. 
T Consensus 287 n~~~~~~-L~~lkd~~g~~l~~~--~~~~~~~~l~G~pv~~~~~~p~~~~~~gdf~~~~~~~~~~~~-~i~~~~~~~~~~ 362 (390) T protein:vir:10 287 NPIDWAA-IELAKDANNQYLIGN--ARGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDA-RVEIGYVNDDFQ 362 (390) T ss_pred cHHHHHH-HHHhhcCCCceeecC--CcCcCCceecceeeEEcCCCCCCcEEEEeccceEEEEEecce-EEEEeecccccc Confidence 8876652 222222222210000 01123457899999999999999999998886 445544443 2222221 12 Q ss_pred ccceeceeeeeeeeeeeccccEEEeecceec Q lcl|Aclame:pro 305 RDRIENYESSNDAYVVEDFGCGCVAENIELA 335 (337) Q Consensus 305 r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~ 335 (337) ++.+.-+-..--++.|-++.+++. |+|+ T Consensus 363 ~~~~~~r~~~r~d~~v~~~~a~~~---~~~a 390 (390) T protein:vir:10 363 RNMVTVLAEERLALVVYRPEALIS---GSFA 390 (390) T ss_pred cCcEEEEEEEeeccEEeccccEEE---EEeC Confidence 344433333334455555555544 4455 No 39 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=98.35 E-value=5.2e-07 Score=55.02 Aligned_cols=306 Identities=13% Similarity=0.126 Sum_probs=162.8 Q ss_pred CChHHHHHHHHHHHHHHHh--hCc--------hhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeeccc Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKL--NDT--------GDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSV 70 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~--ngv--------~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv 70 (337) +..+.|..|..|+...... ... .+..-.+.|-+.+.+.+.+.+++.+.+++.++++++.-... ++.... 
T Consensus 79 ~~~e~~~a~~~~lr~~~~~~~~~~e~~a~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-~~~~~~ 157 (401) T protein:vir:44 79 VAAEHKDAFVGFLRKGREDGLRDLERKALQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQEATVITVGGSDY-KKLVNL 157 (401) T ss_pred hhHHHHHHHHHHHhhhhhhhhHHHHHHHhhcCCCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCce-EEEEec Confidence 6666788888887532111 000 01122466767778899999999999999999998864332 333334 Q ss_pred ccccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccC Q lcl|Aclame:pro 71 SGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAA 150 (337) Q Consensus 71 ~g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A 150 (337) +++.++-+.-+ ..+...+...++...|..++.---+.|+.+.|+. ...+|+..+.+.+++.++.-.-.--+||+-.- T Consensus 158 ~~~~a~wv~E~-~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~la~ai~~~~~~~~l~G~G~~ 234 (401) T protein:vir:44 158 GGTASGWVGET-DTRSQTATSRLGLIEPFMGEIYGNPQATQKMLDD--AFFNVEAWINSELATEFAEQEEIAFTTGDGTK 234 (401) T ss_pred CCccceeeccc-cccCccccccceeeeeehhheeeehhhhHHHHhc--chHHHHHHHHHHHHHHHHHHHHhhhhccCCCC Confidence 44444332221 1121122234555556655555556788888874 23589999999999999887777777774321 Q ss_pred CcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECH Q lcl|Aclame:pro 151 ATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGR 230 (337) Q Consensus 151 ~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~ 230 (337) +| +|.|..............+. ...+..+.. +..+.|.++ +++..| ++.|+.. 
-|++|.+ T Consensus 235 -------~p------~Gil~~~~~~~~~~~~~~~~--~~~~~t~~~-~~~~~d~i~-~~~~~l-~~~~~~~--a~~v~n~ 294 (401) T protein:vir:44 235 -------KP------KGFLAYESTEESDKARAFGK--LQHIVSGEA-TAVTADAII-KLIYTL-RKAHRTG--AKFMMNN 294 (401) T ss_pred -------cc------ceeecccccccccccccccc--ccccccccc-cccCHHHHH-HHHHhc-chhhhcC--CEEEEcH Confidence 12 35554433221111111111 111222222 224466655 566655 6666664 3788999 Q ss_pred HHHHHHHHHHHhccCChHHHHHHHHH-HhhhhhcCccccccCccCCCc-----eEEecchh-cEEEEecCceEEEEEEcc Q lcl|Aclame:pro 231 ELLHDKYFPIVNATQAPTERLAADLI-VSQKRIGNLPAVRVPFFPKRA-----LMVTKLSN-LSIYYQEGARRRTLKEVP 303 (337) Q Consensus 231 dLl~~k~~~l~n~~~~ptE~~A~~~~-~~~k~iGGlpa~~vPffP~~~-----iliT~l~N-LsiY~Q~gs~RR~~~d~p 303 (337) ..+. +-..|-+..+.|- .-..+. ....++-|+|++..+++|..+ +++=.++- ..|+-..| .+-...+.- T Consensus 295 ~~~~-~L~~lkd~~G~~l--~~~~~~~g~~~~l~G~PVv~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~-~~~~~~~~~ 370 (401) T protein:vir:44 295 NSLF-AIRLLKDTEGNYL--WRPGLELGQPSSLAGYGIAENEQMPDIAADAKAIAFGNFKRGYTIVDRIG-TRILRDPYT 370 (401) T ss_pred HHHH-HHHHhhccCCcee--ecCCcCCCCCceecceeeEEecCcCCccCCccEEEEeehhccEEEEEecc-eEEeeeccc Confidence 7764 2222323322221 000000 123579999999999999644 66655643 33332222 332222222 Q ss_pred cccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 304 ERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 304 ~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) .++.+.-+-..--|..|=|.+++++ ++++-| T Consensus 371 ~~~~v~~~a~~r~d~~~~~~~a~~~---l~~~aa 401 (401) T protein:vir:44 371 NKPFVGFYTTKRTGGMLVDSQAIKL---LKIAAA 401 (401) T ss_pred cCCcEEEEEEEEeccEEecccceEE---EEeecC Confidence 2333333333334445555555544 455556 No 40 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=98.35 E-value=5e-07 Score=55.11 Aligned_cols=292 Identities=11% Similarity=0.034 Sum_probs=165.0 Q ss_pred 
CChHHH-----HHHHHHHHHHHHhhC--ch-hhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeeccccc Q lcl|Aclame:pro 1 MRKETR-----QAYEKYAAQIAKLND--TG-DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSG 72 (337) Q Consensus 1 M~~~tr-----~~~~~y~~~~a~~ng--v~-~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g 72 (337) |++.-. ++|..++.+.+..+- +. .......|-+.+.+.+.+.+.+.|.+++..+++++.-.... +-.-.++ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~-~p~~~~~ 79 (324) T protein:vir:10 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKK-FTFWADK 79 (324) T ss_pred CCCchHHHHHHHHHHHHhhccceecccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceE-EEEEeCC Confidence 665433 333333333333221 11 11223467778889999999999999999999988743322 2222234 Q ss_pred ccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCc Q lcl|Aclame:pro 73 PIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAAT 152 (337) Q Consensus 73 ~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~ 152 (337) +.+.-.. .+...|..-..++...+.+++.---..|+.+.|+... ++|+..+.+.+.++++.-.-.-.++|.-.. T Consensus 80 ~~a~~v~--Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~ai~~~~d~a~l~G~g~~-- 153 (324) T protein:vir:10 80 PGAYWVG--EGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNN-- 153 (324) T ss_pred cceeEec--cCccccccccceeEEEEeeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhhcCCCC-- Confidence 4443332 2233344456788888999998888899999998663 689999999999988876555666774211 Q ss_pred CChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHH Q lcl|Aclame:pro 153 TDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) Q Consensus 153 Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dL 232 (337) +. |. | ++.... .+.......-.|..|. +++..+ ++.++... +++|.+.. 
T Consensus 154 ~~----~~------~------------i~~~~~--~~~~~~~~~~t~~~i~----~~~~~l-~~~~~~~~--~~v~n~~~ 202 (324) T protein:vir:10 154 PF----GK------S------------IAQSIE--KTNKVIKGDFTQDNII----DLEALL-EDDELEAN--AFISKTQN 202 (324) T ss_pred cc----Cc------c------------cccccc--ccceeccccCCHHHHH----HHHHhh-hhccCCCC--EEEEcHHH Confidence 11 11 0 111000 0000011111233333 445444 34444332 67888888 Q ss_pred HHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCC--ceEEecchhcEEEEecCceEEEEEEcc------- Q lcl|Aclame:pro 233 LHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKR--ALMVTKLSNLSIYYQEGARRRTLKEVP------- 303 (337) Q Consensus 233 l~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~--~iliT~l~NLsiY~Q~gs~RR~~~d~p------- 303 (337) +.. -..+-.....|- .. -....++-|+|++..|..|.+ .+++..++++-|-... ..+-.+.++. T Consensus 203 ~~~-L~~l~d~~g~~~--~~---~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~~~~~~-~~~i~~~~~~~~~~~~~ 275 (324) T protein:vir:10 203 RSL-LRKIVDPETKER--IY---DRNSDTLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQ-LIEYKIDETAQLSTVKN 275 (324) T ss_pred HHH-HHHhhccCCcee--ec---CCCCccccceeEEeecCCCCCcceEEEEecccEEEEEec-CcEEEEeeccccccccc Confidence 763 222322222221 00 012357899999998886644 5888999987543333 3443333332 Q ss_pred ---------cccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 304 ---------ERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 304 ---------~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) ++|.+.-.-..--++.|-+.++++.+.+.+.+.. 
T Consensus 276 ~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~ 318 (324) T protein:vir:10 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTD 318 (324) T ss_pred ccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCC Confidence 2333333333445778889999888877666554 No 41 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=98.32 E-value=5.7e-07 Score=54.82 Aligned_cols=291 Identities=11% Similarity=0.052 Sum_probs=144.1 Q ss_pred CChHHHHHHHHHHHH------------HHHhhCchhhcceEeechHHHHHHHHH-HHhhHHHhcccceeccchhhceeee Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQ------------IAKLNDTGDVSKKFAVEPTVQQRLETK-MQESSEFLKRINVLPVTELEGEKLG 67 (337) Q Consensus 1 M~~~tr~~~~~y~~~------------~a~~ngv~~~~~~Fsv~P~~~q~L~~~-iqess~FL~~Inv~~V~~~~Ge~v~ 67 (337) .+........+|+.. .....+. ..+....+-|++.+.+... +.+++.+.+..+++++....+-.+- T Consensus 81 ~~~~~~~~~~~~~r~~~~~~~r~~~~~~~~~~~t-~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p 159 (390) T protein:vir:62 81 AQRSADVDDDATLRAGNLGEARSFEFAPEKRDGT-KAGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDFT 159 (390) T ss_pred chhhcchHHHHHHhhhhhhhhHHHHhhhhhhccc-ccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEE Confidence 111111111112111 0011111 1222334556666665554 4455544445578776543222333 Q ss_pred cccccccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhccccc Q lcl|Aclame:pro 68 LSVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGV 147 (337) Q Consensus 68 lgv~g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~ 147 (337) .-.+++.++-+. .....|..-..++...|..++.=--+.|+++.|+.. 
.++|+..+++.+.++++.=.-.--+||+ T Consensus 160 ~~~~~~~a~wv~--E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~i~~~~d~~~l~G~ 235 (390) T protein:vir:62 160 VITGRSSASIVG--ETAEIPESYPATAQRSMGGFKYGFASVVSYEFATDQ--VLDLVGFLVSDAGPAIGDAMGRHFITGT 235 (390) T ss_pred EEcCCcceeeec--ccccccccccceeeeEeeeeeEEeehHHHHHHHhhh--hHHHHHHHHHHHHHHHHHHHHhhhhccC Confidence 333334443322 222233334457778888888888889999999873 3589999999999888764444445773 Q ss_pred ccCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEE Q lcl|Aclame:pro 148 KAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVI 227 (337) Q Consensus 148 s~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvi 227 (337) - . | +|++.. .......+..+.. +-.+.|.|+ +++.+| ++.|+.. -+++ T Consensus 236 G-----~----p------~Gi~~~------------~~~~~~~~~~~~~-~~~~~~~l~-~~~~~l-~~~~~~~--a~~v 283 (390) T protein:vir:62 236 G-----Q----P------RGILTD------------ASPATATFLATDT-DSKVSDALI-DLFHEV-PSAYRAN--AKYV 283 (390) T ss_pred C-----c----c------cccccc------------ccccccceecccc-cccchHHHH-HHHHhh-hhhhhcC--CEEE Confidence 2 1 2 355532 1111111222211 223345443 455554 6667654 4789 Q ss_pred ECHHHHHHHHHHHHh-ccCCh--HHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcc- Q lcl|Aclame:pro 228 CGRELLHDKYFPIVN-ATQAP--TERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVP- 303 (337) Q Consensus 228 vG~dLl~~k~~~l~n-~~~~p--tE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p- 303 (337) |.+..+.. +..+. ....| ..-++. ....++.|+|++..+++|++.+++=.++..-|....+. 
.-....++ T Consensus 284 mn~~~~~~--L~~lkd~~g~~l~~~~~~~---g~~~~l~G~Pv~~~~~~p~~~i~~gd~s~~~i~~~~~~-~v~~~~~~~ 357 (390) T protein:vir:62 284 VNDLRAAQ--MRKLKDANGQYLWQSGLTV---GAPSLFNGKVVETDDGMPADKILFADLSKYRVRFAGSL-RVDRSVDAK 357 (390) T ss_pred EchHHHHH--HHHhhccCCCeeecCCcCC---CccceecccceEEecCCCCccEEEeeccceeEEeecce-EEEeecccc Confidence 99988753 22232 22211 010111 12347999999999999999998877776544433322 22211222 Q ss_pred -cccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 304 -ERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 304 -~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) .+|++.-.-..--+..|-|.+++..+ ++..| T Consensus 358 ~~~~~~~~~~~~r~d~~~~~~~A~~~l---~~~~~ 389 (390) T protein:vir:62 358 FSTDQIVYRFLQRADGLLVDARGAKVL---TVTPG 389 (390) T ss_pred ccCCcEEEEEEEEeCcEeechhheEEE---EeecC Confidence 23334333333334444555554444 34455 No 42 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=98.27 E-value=4.6e-07 Score=55.31 Aligned_cols=299 Identities=13% Similarity=0.067 Sum_probs=169.2 Q ss_pred CCh-HHHHHHHHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccC Q lcl|Aclame:pro 1 MRK-ETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTD 79 (337) Q Consensus 1 M~~-~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~ 79 (337) |++ +++..+.....+ .-.+...+..-.|-|++.+.+.+.+++.+..+++++++++.-- +.++-.-.+++-++... 
T Consensus 3 ~~~~r~~~~~~~~e~~---a~~~~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~-~~~~p~~~~~~~a~~v~ 78 (326) T protein:vir:42 3 VNPDRTTPFLGVNDPK---VAQTGDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMGTT-GQKIPHWTGDVSASWIG 78 (326) T ss_pred CCccchhhhcCcchhh---heeccccCCcceechhhHHHHHHHHHhcchhhhhcceeeccCC-ceEEEEEeCCcceEEec Confidence 444 333322222222 1222222223358889999999999999999999999988732 23443334555555543 Q ss_pred CCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhh Q lcl|Aclame:pro 80 TTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANP 159 (337) Q Consensus 80 t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anP 159 (337) - +...|..-..++...+..++.---..|+.+.|+. ...+|+..+.+.+.++++.-.-.-.|||+-.. +| T Consensus 79 E--g~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~--s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~-------~p 147 (326) T protein:vir:42 79 E--GDMKPITKGNMTSQTIAPHKIATIFVASAETVRA--NPANYLGTMRTKVATAFAMAFDNAAINGTDSP-------FP 147 (326) T ss_pred C--CccccccccceeEEEEeeEEEEEeehhhHHHHhc--CHHHHHHHHHHHHHHHHHHHHHHHhhcccCCC-------cc Confidence 2 2333444466888889999888888888888774 34789999999999999998888888995421 12 Q ss_pred hhhccchhHHHHHHHhchhhhccccc--cccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHH Q lcl|Aclame:pro 160 LLQDVNIGWLQQYRERAAQRVLHEGA--KQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKY 237 (337) Q Consensus 160 llqDVN~GWlq~~Re~a~~~v~~~~~--~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~ 237 (337) .+ ++.... ........+..++-..-|....++.. .+.+.++. ..+++|.+..+..-. 
T Consensus 148 ~g------------------i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~--~a~~v~n~~~~~~L~ 206 (326) T protein:vir:42 148 TF------------------LAQTTKEVSLVDPDGTGSNADLTVYDAVAVNALS-LLVNAGKK--WTHTLLDDITEPILN 206 (326) T ss_pred cc------------------ccccccccceeecccccccccchhHHHHHHHHHh-hhhhhccC--ccEEEEeHHHHHHHH Confidence 11 111000 00011111222222223333334433 33454444 347788887775322 Q ss_pred HHHHhccCChH----HHHHHHHHHhhhhhcCccccccCccCCCceEE--ecchhcEEEEecCceEEEEEEccc------- Q lcl|Aclame:pro 238 FPIVNATQAPT----ERLAADLIVSQKRIGNLPAVRVPFFPKRALMV--TKLSNLSIYYQEGARRRTLKEVPE------- 304 (337) Q Consensus 238 ~~l~n~~~~pt----E~~A~~~~~~~k~iGGlpa~~vPffP~~~ili--T~l~NLsiY~Q~gs~RR~~~d~p~------- 304 (337) .|-.....|- -..-........++-|+|++..+++|++..++ ..++++-+ ...+...-.+.++.- T Consensus 207 -~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~Gd~s~~~~-~~~~~~~v~~~~e~~~~~~~~~ 284 (326) T protein:vir:42 207 -GAKDKSGRPLFIESTYTEENSPFRLGRIVARPTILSDHVASGTVVGYQGDFRQLVW-GQVGGLSFDVTDQATLNLGTPQ 284 (326) T ss_pred -HhhccCCceeeccccccCccccccCceeeeeeEEEcCCCCCCceEEEEeecceEEE-EEecceEEEEeecceeeecccc Confidence 2322211111 00000011124578899999999999998654 57777643 344444333333221 Q ss_pred ---------ccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 305 ---------RDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 305 ---------r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) +|++.-.--.--++.|.+..+++.+.++.-++| T Consensus 285 ~~~~~~~~~~d~~~~r~~~~~d~~v~~~~a~~~l~~~~~~~~ 326 (326) T protein:vir:42 285 APNFVSLWQHNLVAVRVEAEYAFHCNDKDAFVKLTNVDATEA 326 (326) T ss_pred cccchhhhhcCcEEEEEEEEeccEEecccceEEEeeccccCC Confidence 233322222223678999999999999999999 No 43 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=98.27 E-value=4.6e-07 Score=55.32 Aligned_cols=281 Identities=10% Similarity=0.001 Sum_probs=163.1 Q ss_pred 
HHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCCCCcccccccccccCC Q lcl|Aclame:pro 16 IAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDTTKAARQPIDPTALDS 95 (337) Q Consensus 16 ~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t~~~~R~p~~~~~l~~ 95 (337) ||. +-.+.|-|...+.+.+.++++|.+++...++++.--. .++-.-.+++-++-..- ....|..-..++. T Consensus 1 ma~-------~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~-~~ip~~~~~~~a~~v~E--~~~~~~~~~~f~~ 70 (298) T protein:vir:16 1 MVL-------NKGTLFDPTLVTDLISKVAGKSSIARLSAQKPIPFNG-EKVFTFTMDSEIDVVAE--SGKKTHGGVTLAP 70 (298) T ss_pred Ccc-------cCcceechhHHHHHHHHHHhhhhhhhhcceeeccCCc-eEEEEEecCcceEEecC--CccccccccceeE Confidence 332 2234688899999999999999999999999886422 33433344455544432 2333444456788 Q ss_pred ceeEEEEeeeeeecCHHHHH-HHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhhhhhccchhHHHHHHH Q lcl|Aclame:pro 96 NRYRCEKTDYDTAIPYRKLD-MWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYRE 174 (337) Q Consensus 96 ~~Y~c~qtn~d~~i~y~~LD-~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anPllqDVN~GWlq~~Re 174 (337) ..+..++.---+.|+.+.|- .+-...+|++.+.+.++++++.-.-.-.+||+-...-+... +.+.. T Consensus 71 v~l~~~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~--~~~~~----------- 137 (298) T protein:vir:16 71 QTMVPIKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASA--VIGTN----------- 137 (298) T ss_pred EEEeeeeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcccc--ccccc----------- Confidence 88888888888899999874 34456789999999999998887777778884332211100 00000 Q ss_pred hchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHHHHHhccCChHHHHHHH Q lcl|Aclame:pro 175 RAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNATQAPTERLAAD 254 (337) Q Consensus 175 ~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~~l~n~~~~ptE~~A~~ 254 (337) .... . .......+. .-.++++.+.+++..+ ...+++- . +++|.+.....- ..+-...+.|-=. ... 
T Consensus 138 -----~~~~-~-~~~~~~~~~--~~~~~~~~i~~~~~~~-~~~~~~~-~-~~vmn~~~~~~l-~~lkd~~G~~i~~-~~~ 203 (298) T protein:vir:16 138 -----HFDS-K-VTQKVEAPR--GIADPNGAIENAVELL-TGVDADV-T-GIAINPSFRSAL-AKQKDLQDNALFP-ELK 203 (298) T ss_pred -----cccc-c-ccccccccc--ccccHHHHHHHHHHHh-hhcCCCc-c-EEEEcHHHHHHH-HHhhccCCCeeec-Ccc Confidence 0000 0 000011111 1123444445555433 3333322 2 588888777632 2222222222100 000 Q ss_pred HHHhhhhhcCccccccCccCCC------ceEEecchhcEEEEecCceEEEEEEccc----------ccceeceeeeeeee Q lcl|Aclame:pro 255 LIVSQKRIGNLPAVRVPFFPKR------ALMVTKLSNLSIYYQEGARRRTLKEVPE----------RDRIENYESSNDAY 318 (337) Q Consensus 255 ~~~~~k~iGGlpa~~vPffP~~------~iliT~l~NLsiY~Q~gs~RR~~~d~p~----------r~rve~y~s~Ne~Y 318 (337) .-....++-|+|++..+++|+. .+++--+++.-.|..++..+-.+.+.-+ +|++.-.-..--++ T Consensus 204 ~~~~~~~l~G~PV~~~~~v~~~~~~~~~~~~~GDfs~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~ 283 (298) T protein:vir:16 204 WGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGW 283 (298) T ss_pred cCCCCceecceeeEEecccccccCCCccEEEEeeccceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEcc Confidence 1112358899999999999974 4666788888777766666655544322 22222222233467 Q ss_pred eeeccccEEEeecce Q lcl|Aclame:pro 319 VVEDFGCGCVAENIE 333 (337) Q Consensus 319 vVEd~~~~a~ieni~ 333 (337) .|-+..++|.+++++ T Consensus 284 ~v~~~~a~~~l~~at 298 (298) T protein:vir:16 284 GILDATKFARVTEAN 298 (298) T ss_pred EeecccceEEEeecC Confidence 888899999998888 No 44 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=98.27 E-value=1.4e-06 Score=52.68 Aligned_cols=299 Identities=9% Similarity=0.037 Sum_probs=159.6 Q ss_pred CC--hHHHHHHHHHHHHHHHh------------------------hCchhhcceEeechHHHHHHHHHHHhhHHHhcc-c Q lcl|Aclame:pro 1 MR--KETRQAYEKYAAQIAKL------------------------NDTGDVSKKFAVEPTVQQRLETKMQESSEFLKR-I 53 (337) Q Consensus 1 
M~--~~tr~~~~~y~~~~a~~------------------------ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~-I 53 (337) .+ ......|..++..++.. +...+..-.+.|-..+.+.+.+.+++.+.+++. . T Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~ 167 (435) T protein:vir:80 88 PKAPEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGA 167 (435) T ss_pred cchhhhhHHHHHHHHHHHHhccchhHHHHHHHHhhhhhhhhhhhhcccCCCCCccccchhHHHHHHHHHhhhchhhhccc Confidence 00 00111122333222211 111111123345556678899999988877664 3 Q ss_pred ceeccchhhceeeecccccccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHH Q lcl|Aclame:pro 54 NVLPVTELEGEKLGLSVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILN 133 (337) Q Consensus 54 nv~~V~~~~Ge~v~lgv~g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~ 133 (337) ++++...-. .++-.-.+++-++-+.- ....|..-..++...+..++.---+.|+.+.|+..+-.|+++..+.+.+.+ T Consensus 168 ~~v~~~~~~-~~~p~~~~~~~a~~v~E--~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~l~~~i~~~l~~ 244 (435) T protein:vir:80 168 RTLPLSNGN-ITIPRLKGGAIVGYIGA--DTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTA 244 (435) T ss_pred eeeecCCCc-eEEEEEeCCcceeeecc--CccccccccceeeEEEeeEEEEEeehhhHHHHHhhcccHHHHHHHHHHHHH Confidence 455443321 22222223444433332 223344445678888999999999999999999988889999999999999 Q ss_pred HHhhhhHHhcccccccCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcc Q lcl|Aclame:pro 134 QGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSM 213 (337) Q Consensus 134 ~~aLD~i~IGfnG~s~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~l 213 (337) +++.-.-.--+||+..+. .| +|++.. .. 
.........++.+...++.+.+++..+ T Consensus 245 a~~~~~d~a~l~G~G~~~------~p------~Gi~~~------------~~-~~~~~~~~~~~~~~~~~~d~~~~~~~~ 299 (435) T protein:vir:80 245 AIGAREDKAFIRDDGTAN------TP------KGLRFW------------AL-PGNVITASDGSTLQKIETDLGKAILAL 299 (435) T ss_pred HHHHHHHHHhhccCCCCC------cc------cceeec------------cc-ccceeecccccchhhHHHHHHHHHHHh Confidence 999876666678743221 12 243321 10 011112223344444444444444433 Q ss_pred cChhHcCCCCEEEEECHHHHHHHHHHHHh-ccCChHHHHHHHHHHhhhhhcCccccccCccCCC--------ceEEecch Q lcl|Aclame:pro 214 IDPWFQEDTGLVVICGRELLHDKYFPIVN-ATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKR--------ALMVTKLS 284 (337) Q Consensus 214 id~~~r~~~~LVvivG~dLl~~k~~~l~n-~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~--------~iliT~l~ 284 (337) ... .......+++|.+..... +..+. ....|-=. + ....++-|+|++..+++|.+ .+++-.++ T Consensus 300 ~~~-~~~~~~~~~vmn~~~~~~--L~~lkd~~G~~l~~---~--~~~~~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s 371 (435) T protein:vir:80 300 ENA-DANLTQPGWIMAPRTFRF--LEGLRDGNGNKVYP---E--LANGMLKGYPVGKTTQVPINLGEAGKESEIYFTDFG 371 (435) T ss_pred hcc-ccccccCEEEEcHHHHHH--HHhhhccCCceecc---C--CCCCeEeeeeeEEeccccccccCCCCcceEEEEEcc Confidence 221 122234688999987752 22222 22222100 0 13458999999999999985 57777777 Q ss_pred hcEEEEecCceEEEEEEccccc----ceeceeeeee---------eeeeeccccEEEeecceecc Q lcl|Aclame:pro 285 NLSIYYQEGARRRTLKEVPERD----RIENYESSND---------AYVVEDFGCGCVAENIELAA 336 (337) Q Consensus 285 NLsiY~Q~gs~RR~~~d~p~r~----rve~y~s~Ne---------~YvVEd~~~~a~ieni~~~~ 336 (337) +.-|. ..+..+-.+.++.... .+.+++.+|. ++.|=+.++++.+.+|.++. 
T Consensus 372 ~~~i~-~~~~~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:80 372 DVFIG-EEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVAWGA 435 (435) T ss_pred cEEEE-eecceEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEeccCCCC Confidence 75433 3344443333332211 1112222222 34455888888999999988 No 45 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=98.27 E-value=5.7e-07 Score=54.83 Aligned_cols=272 Identities=14% Similarity=0.089 Sum_probs=161.2 Q ss_pred hCchhh------cceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCCCCccccccccccc Q lcl|Aclame:pro 20 NDTGDV------SKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDTTKAARQPIDPTAL 93 (337) Q Consensus 20 ngv~~~------~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t~~~~R~p~~~~~l 93 (337) -|-... .....|-+.+.+.+.+.+++.|.+++..+++++.-....... .+++-++=+ +.+...|..-..+ T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~--~~~~~a~~v--~E~~~~~~~~~~f 76 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKPEEEFTF--MSGVGAFWV--DEAERIQTSKPTF 76 (299) T ss_pred CCcCCCcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecCCCcEEEEE--EcCCceeee--ecCccccccccce Confidence 333211 123457778889999999999999999999998754333332 234444322 2333445445678 Q ss_pred CCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhhhhhccchhHHHHHH Q lcl|Aclame:pro 94 DSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYR 173 (337) Q Consensus 94 ~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anPllqDVN~GWlq~~R 173 (337) +...+..++.--.+.|+.+.|+. .-++|+..+.+.+.+.++.-.-.-=+||+-.. .| .|.|+... 
T Consensus 77 ~~v~l~~~k~~~~~~is~ell~d--s~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~-------~~------~gil~~~~ 141 (299) T protein:vir:41 77 TKAKMRSKKMGVIIPTTKENLNY--SVTNFFSLMQAEIVEAFYKKFDQAVFTGVESP-------YN------WNILKSAT 141 (299) T ss_pred eEEEEeeEEEEEeehhhHHHHhc--CHHHHHHHHHHHHHHHHHHHHHHHHhhcccCc-------cc------cccccccc Confidence 88899999988888999999983 23789999999999998876555556875211 12 24444211 Q ss_pred HhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHHHHHhccCChHHHHHH Q lcl|Aclame:pro 174 ERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNATQAPTERLAA 253 (337) Q Consensus 174 e~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~~l~n~~~~ptE~~A~ 253 (337) . ...+...+ -.++|.| .+++..+ .+.++... +++|.+..... ...+-.....|--.. T Consensus 142 ~--------------~~~~~~~~--~~~~~~l-~~~~~~l-~~~~~~~~--~~v~n~~~~~~-L~~lkd~~G~~l~~~-- 198 (299) T protein:vir:41 142 D--------------ASNLVEET--ANKYDDL-NEAIGLI-EAEDLEPN--GIATIRKQRVK-YRSTKDGNGMPIFNT-- 198 (299) T ss_pred c--------------cceeeccc--cccHHHH-HHHHHhh-hcccCCcC--EEEEcHHHHHH-HHHhhccCCceeecC-- Confidence 1 00011111 1234443 4566554 45544432 68999987653 223333332221110 Q ss_pred HHHHhhhhhcCccccccCccCCCc----eEEecchhcEEEEecCceEEEEEEcc----------------cccceeceee Q lcl|Aclame:pro 254 DLIVSQKRIGNLPAVRVPFFPKRA----LMVTKLSNLSIYYQEGARRRTLKEVP----------------ERDRIENYES 313 (337) Q Consensus 254 ~~~~~~k~iGGlpa~~vPffP~~~----iliT~l~NLsiY~Q~gs~RR~~~d~p----------------~r~rve~y~s 313 (337) .......++-|+|++..+++|.++ +++-.++++-+....+ .+-.+.++. 
.++.+.-.-- T Consensus 199 ~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~gdfs~~~i~~~~~-~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~ 277 (299) T protein:vir:41 199 ATSNGVDDVLGLPIAYTPKYTFGDKDISELVGDWNQAYYGILRG-VEYEILTEATLTTVADETGKPLNLAERDMAAIKAT 277 (299) T ss_pred CcCCCCceecceeeEEecccCCCCCceEEEEEecccEEEEEecC-cEEEEeecccccccccccccchhhhhcCcEEEEEE Confidence 111123578899999999999998 9999999976655544 333333332 2333332223 Q ss_pred eeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 314 SNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 314 ~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) ..-+..|.+.++++.+. .+-| T Consensus 278 ~~~d~~v~~~~A~~~l~---~~aa 298 (299) T protein:vir:41 278 FEVGFMVVKDEAFSAVQ---PKAG 298 (299) T ss_pred EEeccEEecccceEEEE---eccC Confidence 34466777777777774 3333 No 46 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=98.25 E-value=3.3e-07 Score=56.12 Aligned_cols=296 Identities=12% Similarity=0.006 Sum_probs=166.1 Q ss_pred CChHHHHHHHHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCC Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t 80 (337) |.-++..... .....+..-.+-|.+.+.+.+.+++.+.+++.++++++..-... 
+-.-.+++-++...- T Consensus 1 m~~~~~~a~~----------~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~-~p~~~~~~~a~~v~E 69 (330) T protein:vir:77 1 MAGSTVPSTQ----------VALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGPTGIS-IPHWTGAVSASWTGE 69 (330) T ss_pred Ccccccchhh----------ccccCCCcceechhHHHHHHHHHHhccchhhhcceeeccCCceE-EEEEcCCcceeEecC Confidence 3322211111 01112233457788999999999999999999999887753333 222234444444332 Q ss_pred CCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhhh Q lcl|Aclame:pro 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) Q Consensus 81 ~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anPl 160 (337) +...|..-..++...+.+++.--...|+.+.|+. .-++|+..+.+.++++++.-.-.--|||+-... +| T Consensus 70 --g~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~d--s~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~------~~- 138 (330) T protein:vir:77 70 --AERKPITKGSFGKQELEPVKITTIFAESAEVVRL--NPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPS------AF- 138 (330) T ss_pred --CCccccccceeeEEEEeEEEEEEeehhhHHHHhc--chHHHHHHHHHHHHHHHHHHHHHHhhcccCCCC------cc- Confidence 2333444456788899999998888999998874 347899999999999999988888889964221 11 Q ss_pred hhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHHHH Q lcl|Aclame:pro 161 LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPI 240 (337) Q Consensus 161 lqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~~l 240 (337) .|++..... ....... 
..++ +.+..-..+|.|+ +++..+ ...++ ..-+++|.+..+..- ..+ T Consensus 139 -----~g~~~~~~~---~~~~~~~----~~~~-~~~~~~~~~~~l~-~~~~~~-~~~~~--~~~~~vmn~~~~~~l-~~l 200 (330) T protein:vir:77 139 -----KGYLAETTK---VVSLADT----NLTT-ASGPQGNAYLAVN-NALSLL-VNSGK--KWTGTLLDNVTEPIL-NTA 200 (330) T ss_pred -----ccccccccc---cceeecc----cccc-cccccchhHHHHH-HHHHhh-hhcCC--CccEEEEcHHHHHHH-HHH Confidence 344443211 1111111 0111 1111212233332 333333 23222 234789999887632 222 Q ss_pred HhccCChH----HHHHHHHHHhhhhhcCccccccCccCCCc------eEEecchhcEEEEecCceEEEEEEc-------- Q lcl|Aclame:pro 241 VNATQAPT----ERLAADLIVSQKRIGNLPAVRVPFFPKRA------LMVTKLSNLSIYYQEGARRRTLKEV-------- 302 (337) Q Consensus 241 ~n~~~~pt----E~~A~~~~~~~k~iGGlpa~~vPffP~~~------iliT~l~NLsiY~Q~gs~RR~~~d~-------- 302 (337) -...+.|- ............++-|+|++..+++|++. +++..+++.-|..+.|. .-.+.++ T Consensus 201 kd~~G~~l~~~~~~~~~~~~~~~~~l~G~PV~~~~~~p~~~~~~~~~~~~gd~s~~~i~~~~~~-~i~~~~e~~~~~~~~ 279 (330) T protein:vir:77 201 VDGNGRPLFVESTYTEQVGAIREGRILGRPTYVADNVVNGTVGNRVVGVMGDFSQVIWGQIGGL-SFDVTDQATLDFGEE 279 (330) T ss_pred hccCCceeecCccccccccccCCceecceeeEEeccccCCCCCCccEEEEEecceEEEEEecCc-EEEEeecceeeeccc Confidence 22222111 00000011234578899999999999876 88888888765544443 2222222 Q ss_pred ------------ccccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 303 ------------PERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 303 ------------p~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) =.+|++.-.-..--++.|-+.++++.|.+..-+.= T Consensus 280 ~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~~~~~ 326 (330) T protein:vir:77 280 QGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDKDAFVKLTDQVAGTD 326 (330) T ss_pred ccccccccccchhhcCcEEEEEEEEeccEEecccceEEEEeccCCcC Confidence 13344444444445788899999888866552222 No 47 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=98.25 E-value=1.3e-06 Score=52.92 Aligned_cols=281 Identities=10% 
Similarity=0.063 Sum_probs=160.8 Q ss_pred HHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCCCCc--c-cccccccc Q lcl|Aclame:pro 16 IAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDTTKA--A-RQPIDPTA 92 (337) Q Consensus 16 ~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t~~~--~-R~p~~~~~ 92 (337) || ......-..-|-+.+.+.+.+.+++.+.+++..+++++.--. .++-.-.+++-++-...+.. + ..|..-.. T Consensus 1 ma---~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~wv~E~~~~~~~~~~~s~~~ 76 (305) T protein:vir:25 1 MA---DISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKT-THLPVLATLPEADWVGESATDPKGVKPTSKVT 76 (305) T ss_pred CC---CccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCc-EEEEEEeCCcceEEeecccccccccccccccc Confidence 33 333334456788888899999999999999999999886432 22222223344433322211 1 12333455 Q ss_pred cCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhhhhhccchhHHHHH Q lcl|Aclame:pro 93 LDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQY 172 (337) Q Consensus 93 l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anPllqDVN~GWlq~~ 172 (337) ++...+..++.---..|+.+.|+.-. ++|+..+++.++++++.-.-.--|||+-... .. T Consensus 77 f~~i~~~~~k~~~~~~is~ell~ds~--~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~--~~----------------- 135 (305) T protein:vir:25 77 WANRTLVAEEIAVIIPVHENVIDDAT--VAVLTEVAELGGQAIGKKLDQAVIFGTDKPA--SW----------------- 135 (305) T ss_pred eeeEEeeeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHhhhheeccCCCC--Cc----------------- Confidence 67778888888888899999997643 6899999999999999988888889964211 10 Q ss_pred HHhchhhhccccccc-cCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHHHHHhccCChHHHH Q lcl|Aclame:pro 173 RERAAQRVLHEGAKQ-AGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNATQAPTERL 251 (337) Q Consensus 173 Re~a~~~v~~~~~~~-~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~~l~n~~~~ptE~~ 251 (337) .+..+....... ....+.+..-.+.++..++..+...+.+..+... 
.++|.+.....- ..+-.....| T Consensus 136 ---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~v~~~~~~~~l-~~lkd~~G~~---- 204 (305) T protein:vir:25 136 ---VSPALIPAAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGWAPD---TLLSSLALRYEV-ANIRDANGNP---- 204 (305) T ss_pred ---cccccccccccccccccccccchhhhHHHHHHHHHHHhhhhcccccc---eeEecHHHHHHH-HHhhccCCce---- Confidence 011111111100 0001111222334444444444433322222222 367787766542 2222222222 Q ss_pred HHHHHHhhhhhcCccccccCccCCC----ceEEecchhcEEEEecCceEEEEEEc----ccccceeceee--------ee Q lcl|Aclame:pro 252 AADLIVSQKRIGNLPAVRVPFFPKR----ALMVTKLSNLSIYYQEGARRRTLKEV----PERDRIENYES--------SN 315 (337) Q Consensus 252 A~~~~~~~k~iGGlpa~~vPffP~~----~iliT~l~NLsiY~Q~gs~RR~~~d~----p~r~rve~y~s--------~N 315 (337) +....++-|+|++..++.|.. .+++-.++++-|..+.|..= .+.++ ....++.-|++ .- T Consensus 205 ----i~~~~~l~G~Pv~~~~~~~~~~~~~~~~~gd~s~~~i~~~~~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r 279 (305) T protein:vir:25 205 ----VFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITV-KFLDQATLGTGENQINLAERDMVALRLKAR 279 (305) T ss_pred ----eecCCcccccceEEcCccCCCCCccEEEEEecceEEEEEecCeEE-EEeeeeeeecCCceeeeeecCcEEEEEEEe Confidence 223457899999999998754 57788889876666655422 22221 11122222221 11 Q ss_pred eeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 316 DAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 316 e~YvVEd~~~~a~ieni~~~~a 337 (337) -|+.|-++.+++.+.+++++.. 
T Consensus 280 ~~~~v~~p~a~v~~~~~~~~~~ 301 (305) T protein:vir:25 280 FAYVLGVSATAQGANKTPVAVV 301 (305) T ss_pred ecceeeCcccEEEEcccccccc Confidence 3678899999999999887653 No 48 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=98.24 E-value=1.3e-06 Score=52.90 Aligned_cols=293 Identities=10% Similarity=-0.039 Sum_probs=153.3 Q ss_pred CChHHHHHHHHHHHHHHHhhCc---------------hhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhcee Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDT---------------GDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEK 65 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv---------------~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~ 65 (337) ....-...+..+.......-+. ........+-|.....+.+.+.+.+.+++.++++++..-.... T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 159 (390) T protein:vir:81 80 DMFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEY 159 (390) T ss_pred hhhhhhHHHHHHHHHHhhhhhhhhhHHHHHHHhhccccccCCcceechhhhHHHHHHHhhhhhhhhhcceeeccCCceEE Confidence 0000011112222221111110 1112334577788889999999999999999999987655555 Q ss_pred eecccccccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhccc Q lcl|Aclame:pro 66 LGLSVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWN 145 (337) Q Consensus 66 v~lgv~g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfn 145 (337) ..+....+-+.-+. .+. ..|..-..++...+..++.--.+.|+.+.|+.. 
++++..+.+.+++.++.-.-.--+| T Consensus 160 ~~~~~~~~~a~~v~-Eg~-~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~---~~~~~~i~~~l~~~~~~~~d~a~l~ 234 (390) T protein:vir:81 160 VQETGFVNNAAIVA-EGA-LKPESSLKFAKKTDTTHVIAHTMKATRQILSDA---PQLASYMNNRLIRGLKVKEDAEILR 234 (390) T ss_pred EEEecCCcceeeec-CCc-ccccccceeeEEEEeeeEEEEeehhhHHHHHhH---HHHHHHHHHHHHHHHHHHHHHHHHh Confidence 55433222222121 122 223333468888899999988999999999874 5799999999999888866666667 Q ss_pred ccccCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEE Q lcl|Aclame:pro 146 GVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLV 225 (337) Q Consensus 146 G~s~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LV 225 (337) |.-.. .+| +|.+ +.... ..+. ...++....|. +.+++..+.+..+... + T Consensus 235 G~g~~------~~~------~Gi~------------~~~~~--~~~~-~~~~~~~~~~~-~~~~~~~~~~~~~~~~---~ 283 (390) T protein:vir:81 235 GTGAN------DGL------LGLI------------PQATT--YAAP-TTIAGATRVDQ-LRLAMLQASLAEYNPS---G 283 (390) T ss_pred cCCCC------Ccc------ccee------------ecccc--cccc-cccccchhHHH-HHHHHHhhccccCCCC---E Confidence 73211 112 2333 11110 0111 11222233454 4455665644443333 7 Q ss_pred EEECHHHHHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEccc- Q lcl|Aclame:pro 226 VICGRELLHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPE- 304 (337) Q Consensus 226 vivG~dLl~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p~- 304 (337) ++|.+..+.. -..|-.....|-=..... ....++-|+|++..+++|++.+++=.+++.-..+.++..+-...+.+. 
T Consensus 284 ~v~~~~~~~~-l~~lkd~~G~~l~~~~~~--~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~ 360 (390) T protein:vir:81 284 IVINPIDWAA-IELAKDANNQYLIGNARG--TLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVGED 360 (390) T ss_pred EEEcHHHHHH-HHHhhcCCCceeecCccc--ccCceecceeeEEcCCCCCCcEEEEehhceEEEEEecceEEEEecccch Confidence 8889987652 222322222211000001 124578899999999999999999999874333433444433333222 Q ss_pred --ccceeceeeeeeeeeeeccccEEEeecceec Q lcl|Aclame:pro 305 --RDRIENYESSNDAYVVEDFGCGCVAENIELA 335 (337) Q Consensus 305 --r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~ 335 (337) ++.+.-.-..--++.|-+...++ -|+++ T Consensus 361 ~~~~~v~~r~~~r~d~~v~~~~a~v---~~t~a 390 (390) T protein:vir:81 361 FQRNMITVLAEERLALVVYRPEALI---SGSFA 390 (390) T ss_pred hhcCcEEEEEEEeeccEEecccceE---EEEeC Confidence 12221111111122333333333 33444 No 49 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=98.22 E-value=8.5e-07 Score=53.87 Aligned_cols=294 Identities=9% Similarity=-0.048 Sum_probs=152.9 Q ss_pred CCh----HHHHHHHHHHHHHHHhh---C---------chhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhce Q lcl|Aclame:pro 1 MRK----ETRQAYEKYAAQIAKLN---D---------TGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGE 64 (337) Q Consensus 1 M~~----~tr~~~~~y~~~~a~~n---g---------v~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge 64 (337) +.. ..+...+.+........ . 
.........|-|.+...+.+.+.+.+.+++.++++++.--.++ T Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 149 (385) T protein:vir:18 70 NPGEKKSFSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALE 149 (385) T ss_pred ccchhhhhHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceE Confidence 111 11122222222211100 0 0111223457788899999999999999999999998765555 Q ss_pred eeecccccccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcc Q lcl|Aclame:pro 65 KLGLSVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGW 144 (337) Q Consensus 65 ~v~lgv~g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGf 144 (337) .......++-++-+. .+...|..-..++...+..++.--.+.|+.+.|+.. ++++..+.+.++++++.-.-.--+ T Consensus 150 ~~~~~~~~~~a~~v~--E~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~---~~l~~~i~~~la~a~~~~~d~~~l 224 (385) T protein:vir:18 150 YVREEVFTNNADVVA--EKALKPESDITFSKQTANVKTIAHWVQASRQVMDDA---PMLQSYINNRLMYGLALKEEGQLL 224 (385) T ss_pred EEEEecCCcceeeec--cCccccccccceeEEEEeeeeEEEeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHHH Confidence 554433233332221 122234334468888889888888889999988864 678999999999998874444444 Q ss_pred cccccCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCE Q lcl|Aclame:pro 145 NGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGL 224 (337) Q Consensus 145 nG~s~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~L 224 (337) +|.-.. +| |.-+++.... .....+..+ ...+|.|+. ++..+ .+.+++.. 
T Consensus 225 ~G~g~~-------~~-----------------~~Gi~~~~~~--~~~~~~~~~-~~~~d~i~~-~~~~l-~~~~~~~~-- 273 (385) T protein:vir:18 225 NGDGTG-------DN-----------------LEGLNKVATA--YDTSLNATG-DTRADIIAH-AIYQV-TESEFSAS-- 273 (385) T ss_pred hccCCC-------Cc-----------------cccccccccc--ccccccccc-cchHHHHHH-HHHhh-ccccCCCC-- Confidence 662111 11 1112211111 111122222 245665544 44444 44444432 Q ss_pred EEEECHHHHHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchh-cEEEEecCceEEEEEEcc Q lcl|Aclame:pro 225 VVICGRELLHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSN-LSIYYQEGARRRTLKEVP 303 (337) Q Consensus 225 VvivG~dLl~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~N-LsiY~Q~gs~RR~~~d~p 303 (337) +++|.+..+.. ...+-.....|-=... .-....++-|+|++..+++|++.+++-.+++ .-|+.+.|.. -.+.++. T Consensus 274 ~~~~~~~~~~~-l~~lkd~~G~~l~~~~--~~~~~~~l~G~pV~~~~~~p~~~~~~gd~~~~~~~~~~~~~~-v~~~~~~ 349 (385) T protein:vir:18 274 GIVLNPRDWHN-IALLKDNEGRYIFGGP--QAFTSNIMWGLPVVPTKAQAAGTFTVGGFDMASQVWDRMDAT-VEVSRED 349 (385) T ss_pred EEEEcHHHHHH-HHHhhcCCCceeccCc--ccCCCceecceeeEEcCcCCCCcEEEeecccEEEEEEecceE-EEEeccc Confidence 88999987652 1222222111110000 0123567889999999999999999998876 4455444432 2221111 Q ss_pred ----cccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 304 ----ERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 304 ----~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) .++.+.-+-..--++.|-+..+++ -+++..| T Consensus 350 ~~~~~~~~~~~~~~~r~~~~v~~~~a~~---~~~~~aa 384 (385) T protein:vir:18 350 RDNFVKNMLTILCEERLALAHYRPTAII---KGTFSSG 384 (385) T ss_pred cchhhcCcEEEEEEEeeccEEecccceE---EEEeccC Confidence 122222222222233333333333 3455555 No 50 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=98.22 E-value=8.5e-07 Score=53.87 Aligned_cols=294 Identities=9% Similarity=-0.048 Sum_probs=152.9 Q ss_pred 
CCh----HHHHHHHHHHHHHHHhh---C---------chhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhce Q lcl|Aclame:pro 1 MRK----ETRQAYEKYAAQIAKLN---D---------TGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGE 64 (337) Q Consensus 1 M~~----~tr~~~~~y~~~~a~~n---g---------v~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge 64 (337) +.. ..+...+.+........ . .........|-|.+...+.+.+.+.+.+++.++++++.--.++ T Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ 149 (385) T protein:vir:19 70 NPGEKKSFSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALE 149 (385) T ss_pred ccchhhhhHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceE Confidence 111 11122222222211100 0 0111223457788899999999999999999999998765555 Q ss_pred eeecccccccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcc Q lcl|Aclame:pro 65 KLGLSVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGW 144 (337) Q Consensus 65 ~v~lgv~g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGf 144 (337) .......++-++-+. .+...|..-..++...+..++.--.+.|+.+.|+.. ++++..+.+.++++++.-.-.--+ T Consensus 150 ~~~~~~~~~~a~~v~--E~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~---~~l~~~i~~~la~a~~~~~d~~~l 224 (385) T protein:vir:19 150 YVREEVFTNNADVVA--EKALKPESDITFSKQTANVKTIAHWVQASRQVMDDA---PMLQSYINNRLMYGLALKEEGQLL 224 (385) T ss_pred EEEEecCCcceeeec--cCccccccccceeEEEEeeeeEEEeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHHH Confidence 554433233332221 122234334468888889888888889999988864 678999999999998874444444 Q ss_pred cccccCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCE Q lcl|Aclame:pro 145 NGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGL 224 (337) Q Consensus 145 nG~s~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~L 224 (337) +|.-.. +| |.-+++.... .....+..+ ...+|.|+. ++..+ .+.+++.. 
T Consensus 225 ~G~g~~-------~~-----------------~~Gi~~~~~~--~~~~~~~~~-~~~~d~i~~-~~~~l-~~~~~~~~-- 273 (385) T protein:vir:19 225 NGDGTG-------DN-----------------LEGLNKVATA--YDTSLNATG-DTRADIIAH-AIYQV-TESEFSAS-- 273 (385) T ss_pred hccCCC-------Cc-----------------cccccccccc--ccccccccc-cchHHHHHH-HHHhh-ccccCCCC-- Confidence 662111 11 1112211111 111122222 245665544 44444 44444432 Q ss_pred EEEECHHHHHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchh-cEEEEecCceEEEEEEcc Q lcl|Aclame:pro 225 VVICGRELLHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSN-LSIYYQEGARRRTLKEVP 303 (337) Q Consensus 225 VvivG~dLl~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~N-LsiY~Q~gs~RR~~~d~p 303 (337) +++|.+..+.. ...+-.....|-=... .-....++-|+|++..+++|++.+++-.+++ .-|+.+.|.. -.+.++. T Consensus 274 ~~~~~~~~~~~-l~~lkd~~G~~l~~~~--~~~~~~~l~G~pV~~~~~~p~~~~~~gd~~~~~~~~~~~~~~-v~~~~~~ 349 (385) T protein:vir:19 274 GIVLNPRDWHN-IALLKDNEGRYIFGGP--QAFTSNIMWGLPVVPTKAQAAGTFTVGGFDMASQVWDRMDAT-VEVSRED 349 (385) T ss_pred EEEEcHHHHHH-HHHhhcCCCceeccCc--ccCCCceecceeeEEcCcCCCCcEEEeecccEEEEEEecceE-EEEeccc Confidence 88999987652 1222222111110000 0123567889999999999999999998876 4455444432 2221111 Q ss_pred ----cccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 304 ----ERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 304 ----~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) .++.+.-+-..--++.|-+..+++ -+++..| T Consensus 350 ~~~~~~~~~~~~~~~r~~~~v~~~~a~~---~~~~~aa 384 (385) T protein:vir:19 350 RDNFVKNMLTILCEERLALAHYRPTAII---KGTFSSG 384 (385) T ss_pred cchhhcCcEEEEEEEeeccEEecccceE---EEEeccC Confidence 122222222222233333333333 3455555 No 51 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=98.22 E-value=2.2e-06 Score=51.64 Aligned_cols=292 Identities=11% Similarity=0.016 Sum_probs=162.9 Q ss_pred 
CChH-----HHHHHHHHHHHHHHhhCc--h-hhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeeccccc Q lcl|Aclame:pro 1 MRKE-----TRQAYEKYAAQIAKLNDT--G-DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSG 72 (337) Q Consensus 1 M~~~-----tr~~~~~y~~~~a~~ngv--~-~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g 72 (337) |++. ..++|..++...+..+-. . .......|-+.+...+.+.+.+.|.+++..+++++.-.... +-.=.++ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~-~p~~~~~ 79 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKK-FTFWADK 79 (324) T ss_pred CCcchhhhHHHHHHHHhhhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceE-EEEEecC Confidence 5443 233344444433333221 1 11234457778889999999999999999999988743222 2221233 Q ss_pred ccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCc Q lcl|Aclame:pro 73 PIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAAT 152 (337) Q Consensus 73 ~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~ 152 (337) +.+.-. +.+...|..-..++...+..++.--...|+.+.|++.. ++|...+.+.+.++++.-.-.--|+|.-.. T Consensus 80 ~~a~~v--~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~--~~l~~~i~~~l~~aia~~~d~~~l~G~g~~-- 153 (324) T protein:vir:96 80 PGAYWV--GEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNN-- 153 (324) T ss_pred cceeee--cCCccccccccceeEEEEEeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhhcCCCC-- Confidence 344333 22333344446788889999999888999999999754 789999999999998877666667874311 Q ss_pred CChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHH Q lcl|Aclame:pro 153 TDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) Q Consensus 153 Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dL 232 (337) . .|. .++.... .........-.|.+|- +++..+ ++.+++. + +++|.+.. 
T Consensus 154 ~----~~~------------------~~~~~~~--~~~~~~~~~~~~~~i~----~~~~~i-~~~~~~~-~-~~i~n~~~ 202 (324) T protein:vir:96 154 P----FGK------------------SIAQSIK--KTNKVIKGDFTQDNII----DLEALL-EDDELEA-N-AFISKTQN 202 (324) T ss_pred C----cCc------------------ccccccc--ccceecccccchHHHH----HHHHhh-hhccCCC-C-EEEEcHHH Confidence 1 111 1111100 0001111122344443 344433 4444332 2 68898887 Q ss_pred HHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccC--CCceEEecchhcEEEEecCceEEEEEEcc------- Q lcl|Aclame:pro 233 LHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFP--KRALMVTKLSNLSIYYQEGARRRTLKEVP------- 303 (337) Q Consensus 233 l~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP--~~~iliT~l~NLsiY~Q~gs~RR~~~d~p------- 303 (337) +..- ..+-.....|-- . -....++-|+|++..|..+ ++.+++-.++++-|- ..+..+-.+-++. T Consensus 203 ~~~L-~~lkd~~G~~~~--~---~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~s~~~~~-~~~~~~i~~~~~~~~~~~~~ 275 (324) T protein:vir:96 203 RSLL-RKIVDPETKERI--Y---DRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYG-IPQLIEYKIDETAQLSTVKN 275 (324) T ss_pred HHHH-HHhhCCCCCeee--c---CCCCCcccceeeEeecCCCCCcceEEEEecceEEEE-EecCcEEEEeeccccccccc Confidence 6632 222222222211 0 0134578999999877654 445888888886543 3344444443332 Q ss_pred ---------cccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 304 ---------ERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 304 ---------~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) .+|.+.---..--++.|-+.++++.+...+-+.. 
T Consensus 276 ~~~~~~~~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a~~~~~ 318 (324) T protein:vir:96 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTD 318 (324) T ss_pred ccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccccCC Confidence 2233322222333788888888887765544444 No 52 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=98.21 E-value=1.5e-06 Score=52.51 Aligned_cols=298 Identities=11% Similarity=0.024 Sum_probs=162.8 Q ss_pred CChHHHHHHHHHHHHHHHhhCchhhc-----ceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccc Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTGDVS-----KKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIA 75 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~-----~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia 75 (337) |-. ++.+... ..|.+.-. ..--|-+++.+.+.+.+++.|.+++.++++++.--.. ++-.-..++.+ T Consensus 1 ~~~-----~~e~~~~---~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~-~ip~~~~~~~a 71 (338) T protein:vir:78 1 MAT-----LNELAPN---TAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPISYGET-IIPTTVKRPEV 71 (338) T ss_pred Ccc-----hHHhhhh---hcccccccceecccccccchHHHHHHHHHHHhhchhhhhcceeeccCCce-EEEEEecCccc Confidence 222 1111111 12221111 1114666788999999999999999999998764333 23222344444 Q ss_pred cccCC------CCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhccccccc Q lcl|Aclame:pro 76 SRTDT------TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKA 149 (337) Q Consensus 76 ~Rt~t------~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~ 149 (337) ..+.. +.+...|..-..++...+.+++.---..|+.+.|+... ++|+..+++.+.++++.-.-.--+||+.. 
T Consensus 72 ~~v~~~~~~~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~--~~~~~~i~~~la~a~~~~~d~~~l~G~g~ 149 (338) T protein:vir:78 72 GQVGVGTSNEQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARMNP--SGLYTKLQADLAYAIGRGIDLAVFHGKSP 149 (338) T ss_pred eeecccccccccccccccccccceeEEEEEEEEEEEeehhhHHHHhcCH--HHHHHHHHHHHHHHHHHHHHHHhhcccCC Confidence 33321 11233344445678888999999888899999888633 78999999999999998887778888765 Q ss_pred CCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEEC Q lcl|Aclame:pro 150 AATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICG 229 (337) Q Consensus 150 A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG 229 (337) ...+.| .|++.-. .....+ .......+....|..|.. ++..+.... +...-+++|. T Consensus 150 ~~~~~~----------~gi~~~~-------~~~~~~-~~~~~~~~~~~~~~~~~~----~~~~~~~~~--~~~~~~~~m~ 205 (338) T protein:vir:78 150 LTGSAL----------QGIDTNN-------VIVNTT-NVDYLQTGTTPLLDRFLD----GYDLVSANT--DVDFNGWAAD 205 (338) T ss_pred Cccccc----------ccccccc-------cccccc-ccccccccchhhHHHHHH----HHHHhhhhc--cccceEEEEc Confidence 543322 1211100 000000 011111111222333332 332221111 1123478888 Q ss_pred HHHHHHH-HHH-HHhccCChH--HHHHHHHHHhhhhhcCccccccCccCCC---------ceEEecchhcEEEEecCceE Q lcl|Aclame:pro 230 RELLHDK-YFP-IVNATQAPT--ERLAADLIVSQKRIGNLPAVRVPFFPKR---------ALMVTKLSNLSIYYQEGARR 296 (337) Q Consensus 230 ~dLl~~k-~~~-l~n~~~~pt--E~~A~~~~~~~k~iGGlpa~~vPffP~~---------~iliT~l~NLsiY~Q~gs~R 296 (337) +...+.- ..+ +-+....|- +- ..-....+|-|+|++..+++|++ .+++--+++.-+....+ .. 
T Consensus 206 ~~~~~~L~~~~~l~d~~g~~l~~~~---~~~~~~~~l~G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~-~~ 281 (338) T protein:vir:78 206 PRYRARLLRSQAYRDANGNVDPTRI---NLAASAGDLLGLPVQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGFADE-IR 281 (338) T ss_pred hHHHHHHHHHhhhccCCCceeeccc---ccCCCCceeeeeeEEEccccCccccccCCcccEEEEEecceEEEEeecc-cE Confidence 7665421 111 222222221 11 01123458889999999999964 25666776655544444 23 Q ss_pred EEEEEcc----------------cccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 297 RTLKEVP----------------ERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 297 R~~~d~p----------------~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) -.+.++. .+|++.-.-..--++.|-+.++++.+.+.+=.+| T Consensus 282 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 338 (338) T protein:vir:78 282 VKMSDTATLTDNTSPTPQTVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDEDPDA 338 (338) T ss_pred EEEeecccccccccccccchhhhhcCcEEEEEEEEeccEeecccceEEEecccCCCC Confidence 2333322 1333333334445788999999999999888888 No 53 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=98.20 E-value=2.5e-06 Score=51.30 Aligned_cols=292 Identities=11% Similarity=0.037 Sum_probs=163.7 Q ss_pred CCh-----HHHHHHHHHHHHHHHhhC--ch-hhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeeccccc Q lcl|Aclame:pro 1 MRK-----ETRQAYEKYAAQIAKLND--TG-DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSG 72 (337) Q Consensus 1 M~~-----~tr~~~~~y~~~~a~~ng--v~-~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g 72 (337) |++ .+++.|..+....+..+. +- .....+.|-+.+...+.+.+.+.|.+++.++++++.-... 
++-.-.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~-~~p~~~~~ 79 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEK-KFTFWADK 79 (324) T ss_pred CCcchhhhHHHHHHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCce-EEEEEecC Confidence 554 344445545444443332 11 1223456777888999999999999999999998763222 22222233 Q ss_pred ccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCc Q lcl|Aclame:pro 73 PIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAAT 152 (337) Q Consensus 73 ~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~ 152 (337) +-++=. +.+...|..-..++...+..++.---..|+.+.|+.-. ++|...+.+.+.++++.-.-.-.|+|+-... T Consensus 80 ~~a~~v--~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~--~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~- 154 (324) T protein:vir:96 80 PGAYWV--GEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNNP- 154 (324) T ss_pred cceeEe--cCCccccccccceeEEEEeeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhccCCCCC- Confidence 333322 22333344445678888888888888888888888543 7899999999999998877777788853111 Q ss_pred CChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHH Q lcl|Aclame:pro 153 TDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) Q Consensus 153 Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dL 232 (337) . | .-+..... .........-.|.+|-. ++..+ ++.+++.. +++|.+.. 
T Consensus 155 -~----~------------------~gi~~~~~--~~~~~~~~~~t~~~i~~----~~~~l-~~~~~~~~--~~vmn~~~ 202 (324) T protein:vir:96 155 -F----G------------------KSIAQSIE--KTNKVIKGDFTQDNIID----LEALL-EDDELEAN--AFISKTQN 202 (324) T ss_pred -c----C------------------cccccccc--ccceeccccccHHHHHH----HHHhh-hhccCCCC--EEEEcHHH Confidence 0 1 11111100 00001111223444443 33333 44444432 68888877 Q ss_pred HHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCcc--CCCceEEecchhcEEEEecCceEEEEEEcc------- Q lcl|Aclame:pro 233 LHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFF--PKRALMVTKLSNLSIYYQEGARRRTLKEVP------- 303 (337) Q Consensus 233 l~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPff--P~~~iliT~l~NLsiY~Q~gs~RR~~~d~p------- 303 (337) +.. -..+-.....|- +.. ....++-|+|++..|.. +++.+++-.++++- +-..+..+-.+.+++ T Consensus 203 ~~~-L~~l~d~~G~~~--~~~---~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~-~g~~~~~~i~~~~~~~~~~~~~ 275 (324) T protein:vir:96 203 RSL-LRKIVDPETKER--IYD---RNSDSLDGLPVVNLKSSNLKRGELITGDFDKLI-YGIPQLIEYKIDETAQLSTVKN 275 (324) T ss_pred HHH-HHHhhccCCCee--ecC---CCCCcccceeeEeeCCCCCCcceEEEEecceEE-EEEecCcEEEEeeccccccccc Confidence 663 222322222221 110 13457899999988875 55568888888864 333444444443332 Q ss_pred ---------cccceeceeeeeeeeeeeccccEEEeecceecc-C Q lcl|Aclame:pro 304 ---------ERDRIENYESSNDAYVVEDFGCGCVAENIELAA-A 337 (337) Q Consensus 304 ---------~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~-a 337 (337) .+|++.-.--.--++.|-+.+++|.+.+.+.+. 
| T Consensus 276 ~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~ 319 (324) T protein:vir:96 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDS 319 (324) T ss_pred ccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccccCCC Confidence 233333333334467788888888776554443 3 No 54 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=98.20 E-value=2.5e-06 Score=51.30 Aligned_cols=292 Identities=11% Similarity=0.037 Sum_probs=163.7 Q ss_pred CCh-----HHHHHHHHHHHHHHHhhC--ch-hhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeeccccc Q lcl|Aclame:pro 1 MRK-----ETRQAYEKYAAQIAKLND--TG-DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSG 72 (337) Q Consensus 1 M~~-----~tr~~~~~y~~~~a~~ng--v~-~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g 72 (337) |++ .+++.|..+....+..+. +- .....+.|-+.+...+.+.+.+.|.+++.++++++.-... ++-.-.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~-~~p~~~~~ 79 (324) T protein:vir:78 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEK-KFTFWADK 79 (324) T ss_pred CCcchhhhHHHHHHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCce-EEEEEecC Confidence 554 344445545444443332 11 1223456777888999999999999999999998763222 22222233 Q ss_pred ccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCc Q lcl|Aclame:pro 73 PIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAAT 152 (337) Q Consensus 73 ~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~ 152 (337) +-++=. +.+...|..-..++...+..++.---..|+.+.|+.-. ++|...+.+.+.++++.-.-.-.|+|+-... 
T Consensus 80 ~~a~~v--~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~--~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~- 154 (324) T protein:vir:78 80 PGAYWV--GEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNNP- 154 (324) T ss_pred cceeEe--cCCccccccccceeEEEEeeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhccCCCCC- Confidence 333322 22333344445678888888888888888888888543 7899999999999998877777788853111 Q ss_pred CChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHH Q lcl|Aclame:pro 153 TDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) Q Consensus 153 Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dL 232 (337) . | .-+..... .........-.|.+|-. ++..+ ++.+++.. +++|.+.. T Consensus 155 -~----~------------------~gi~~~~~--~~~~~~~~~~t~~~i~~----~~~~l-~~~~~~~~--~~vmn~~~ 202 (324) T protein:vir:78 155 -F----G------------------KSIAQSIE--KTNKVIKGDFTQDNIID----LEALL-EDDELEAN--AFISKTQN 202 (324) T ss_pred -c----C------------------cccccccc--ccceeccccccHHHHHH----HHHhh-hhccCCCC--EEEEcHHH Confidence 0 1 11111100 00001111223444443 33333 44444432 68888877 Q ss_pred HHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCcc--CCCceEEecchhcEEEEecCceEEEEEEcc------- Q lcl|Aclame:pro 233 LHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFF--PKRALMVTKLSNLSIYYQEGARRRTLKEVP------- 303 (337) Q Consensus 233 l~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPff--P~~~iliT~l~NLsiY~Q~gs~RR~~~d~p------- 303 (337) +.. -..+-.....|- +.. ....++-|+|++..|.. 
+++.+++-.++++- +-..+..+-.+.+++ T Consensus 203 ~~~-L~~l~d~~G~~~--~~~---~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~-~g~~~~~~i~~~~~~~~~~~~~ 275 (324) T protein:vir:78 203 RSL-LRKIVDPETKER--IYD---RNSDSLDGLPVVNLKSSNLKRGELITGDFDKLI-YGIPQLIEYKIDETAQLSTVKN 275 (324) T ss_pred HHH-HHHhhccCCCee--ecC---CCCCcccceeeEeeCCCCCCcceEEEEecceEE-EEEecCcEEEEeeccccccccc Confidence 663 222322222221 110 13457899999988875 55568888888864 333444444443332 Q ss_pred ---------cccceeceeeeeeeeeeeccccEEEeecceecc-C Q lcl|Aclame:pro 304 ---------ERDRIENYESSNDAYVVEDFGCGCVAENIELAA-A 337 (337) Q Consensus 304 ---------~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~-a 337 (337) .+|++.-.--.--++.|-+.+++|.+.+.+.+. | T Consensus 276 ~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~ 319 (324) T protein:vir:78 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDS 319 (324) T ss_pred ccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccccCCC Confidence 233333333334467788888888776554443 3 No 55 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=98.19 E-value=1.3e-06 Score=52.86 Aligned_cols=296 Identities=12% Similarity=0.045 Sum_probs=146.4 Q ss_pred CChHHHHHHHHHH--------HHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhccc-ceeccchhhceeeecccc Q lcl|Aclame:pro 1 MRKETRQAYEKYA--------AQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRI-NVLPVTELEGEKLGLSVS 71 (337) Q Consensus 1 M~~~tr~~~~~y~--------~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~I-nv~~V~~~~Ge~v~lgv~ 71 (337) ........++... .......+.... 
..-.+-|++...+...+.+.+..|+.+ +++++..-..-.+-...+ T Consensus 85 ~~~~~~~~~r~g~~~~~~~~~~~~~~~~~t~~~-~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 163 (392) T protein:vir:13 85 ADHDDDAVLRAGNLGEARSFEFAPEKRDGTKAG-NPNVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTVITG 163 (392) T ss_pred hhHHHHHHHhccchhhhHHHHhhhhhhcccccC-CCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcC Confidence 1111111111110 001111222111 122455666666666665555555554 666654433333333333 Q ss_pred cccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCC Q lcl|Aclame:pro 72 GPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAA 151 (337) Q Consensus 72 g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~ 151 (337) ++-++=+ +.....|..-..++...|..++.---+.|+++.|+.. -++|+..+.+.+.+.++.=.-.-=+||+- T Consensus 164 ~~~a~~v--~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~i~~~~d~~~l~G~G--- 236 (392) T protein:vir:13 164 RATAGIV--GETAEIPESYPATTQRSMGGFKYGFASVVSYEFATDQ--VLDLVGFLVSDAGPAIGDAMGRHFLTGTG--- 236 (392) T ss_pred Ccceeee--cccccccccccceeeEEeeeeeEEeeehhHHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHHhcccC--- Confidence 4444322 2222234344567888888888888889999999975 35888999999988887643333445531 Q ss_pred cCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHH Q lcl|Aclame:pro 152 TTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRE 231 (337) Q Consensus 152 ~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~d 231 (337) | ..| +|+|... ......+.. ..++....|.|+ +++.+| ++.|+... +++|.+. 
T Consensus 237 -t---~~p------~Gil~~~------------~~~~~~~~~-~~~~~~~~d~l~-~~~~~l-~~~~~~~a--~~v~n~~ 289 (392) T protein:vir:13 237 -T---GQP------RGILTDA------------TGANAAFGE-ADADSKVSDALI-DLFHEV-PSAYRKNA--KFVVNDL 289 (392) T ss_pred -C---ccc------ccccccc------------ccccccccc-cccccccHHHHH-HHHHhh-hhhhhcCC--EEEEcHH Confidence 1 123 2554321 111111111 122334566554 566654 67777643 6888888 Q ss_pred HHHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcc--ccccee Q lcl|Aclame:pro 232 LLHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVP--ERDRIE 309 (337) Q Consensus 232 Ll~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p--~r~rve 309 (337) .+.. ...|-+....|-=....+ .....++.|+|++..+++|++.+++-.++++-|.. .+..+-....++ .++++. T Consensus 290 ~~~~-l~~lkd~~G~~l~~~~~~-~g~~~~l~G~Pv~~~~~~~~~~i~~Gdf~~~~i~~-~~~~~i~~~~~~~~~~~~~~ 366 (392) T protein:vir:13 290 RAAQ-MRKLKDANGQYLWQSALT-VGAPDTFNGKVVETDDGMPADKVLFADLSKYRVRF-AGSLRVDRSVDAKFSTDQIV 366 (392) T ss_pred HHHH-HHHhhccCCceeecCCcC-CCCCceecceeeEEcCCCCCCcEEEeeccceeEEe-ecceEEEeeccccccCCcEE Confidence 7663 222333322221000000 01235789999999999999999999998865543 333333222222 112222 Q ss_pred ceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 310 NYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 310 ~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) -+-..--++.|-|.+++.. 
+++..| T Consensus 367 ~r~~~r~d~~~~~~~A~~~---~~~~~a 391 (392) T protein:vir:13 367 YRFLQRADGLLVDARGAKV---LTVTPA 391 (392) T ss_pred EEEEEEeccEEecccceEE---EEeecc Confidence 2222222333444444332 344444 No 56 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=98.17 E-value=1.7e-06 Score=52.21 Aligned_cols=299 Identities=10% Similarity=-0.022 Sum_probs=150.5 Q ss_pred CChHHHHHHHHHHHHHHH---hhCch----hhcceEeechHHHHHHHHHH-HhhHHHhcccceeccchhhceeeec-ccc Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAK---LNDTG----DVSKKFAVEPTVQQRLETKM-QESSEFLKRINVLPVTELEGEKLGL-SVS 71 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~---~ngv~----~~~~~Fsv~P~~~q~L~~~i-qess~FL~~Inv~~V~~~~Ge~v~l-gv~ 71 (337) |....+..+..+...... .+... .....+.+.|.......... ..++.+.+.++++++.--....... +.+ T Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 177 (419) T protein:vir:94 98 RARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGT 177 (419) T ss_pred HHhhhhhhhhHHHHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHhhhhhhhhcceeeeccCCceeeeeecccc Confidence 111111122222211110 00000 11334566777766665555 4445566778888875433222211 111 Q ss_pred ccccccc----CCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhccccc Q lcl|Aclame:pro 72 GPIASRT----DTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGV 147 (337) Q Consensus 72 g~ia~Rt----~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~ 147 (337) .+..+.. -.+.+...|..-..++...+..++.---+.|+.+.|+.. 
++|+..+.+.++++++.=.-.-.+||+ T Consensus 178 ~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~---~~l~~~i~~~la~a~~~~~d~aii~G~ 254 (419) T protein:vir:94 178 AGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN---SQLMGYIQGRLTYGLRFLRDRQLLNGN 254 (419) T ss_pred ccccccCcccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHHhH---HHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 1111111 111122233333457777888888877788999999864 679999999999999876666667874 Q ss_pred ccCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCc-ccccHHHHHHHHHhcccChhHcCCCCEEE Q lcl|Aclame:pro 148 KAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAG-DYENLDALVMDIVSSMIDPWFQEDTGLVV 226 (337) Q Consensus 148 s~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~gg-dy~nLDaLv~d~~~~lid~~~r~~~~LVv 226 (337) -. .+|. |++..-. +... ....+ ..+. +...+|. +.+++..+..+.++.. ++ T Consensus 255 G~-------~~p~------Gi~~~~~------~~~~-~~~~~----~~~~t~~~~~~~-l~~~~~~~~~~~~~~~---~~ 306 (419) T protein:vir:94 255 GS-------TEMQ------GILTTPG------IGTY-QQPKP----TAPATDEPPLVD-IRRAKTVAEIAGFPPD---GV 306 (419) T ss_pred Cc-------cccc------ceecccc------cccc-ccccc----ccccccchhHHH-HHHHHHhhhhccCCCC---EE Confidence 32 1333 7765311 1000 00000 0111 1122332 3334544544444433 78 Q ss_pred EECHHHHHHHHHHHHhccCCh---HHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcc Q lcl|Aclame:pro 227 ICGRELLHDKYFPIVNATQAP---TERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVP 303 (337) Q Consensus 227 ivG~dLl~~k~~~l~n~~~~p---tE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p 303 (337) +|.+..+..- ..+....+.+ .+- +.+ ....+|-|+|++..+++|++.+++-.+++...++.+....-.+.+.. 
T Consensus 307 v~n~~~~~~l-~~~k~~~~~~~~~~~~-~~~--~~~~~l~G~pV~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~ 382 (419) T protein:vir:94 307 VVHPQDWESI-ELDQAPGSGVFRVIAN-VQG--EATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTDSH 382 (419) T ss_pred EEcHHHHHHH-HHHhhcCCCceeecCC-ccc--CCCccccceeeEEcCCCCCccEEEeeccceEEEEEecceEEEEeccc Confidence 8988775532 2222222221 110 001 12458899999999999999999999988665665554444333322 Q ss_pred c----ccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 304 E----RDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 304 ~----r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) . ++.+.-.-..--++.|-+...+|.+ ++..| T Consensus 383 ~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~---~~~aa 417 (419) T protein:vir:94 383 ADFFTANTLVILAEFRANLAVYQPKAFVRV---TFAAA 417 (419) T ss_pred cchhhcCcEEEEEEEeeccEEeccccEEEE---EeccC Confidence 2 3333333333344555666665544 45555 No 57 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=98.16 E-value=2.8e-06 Score=50.99 Aligned_cols=284 Identities=10% Similarity=0.101 Sum_probs=165.3 Q ss_pred CChHHHHHHHHHHHHHHHhh-Cc--------hhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeeccc- Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLN-DT--------GDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSV- 70 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~n-gv--------~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv- 70 (337) .+...+..|..|+....... .. .+..-.+.|-+.+.+.+.+.+.+.+.+++.++++++....|....+-. 
T Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 168 (404) T protein:vir:39 89 LKDKFVKEFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWT 168 (404) T ss_pred hHHHHHHHHHHHHhcchhhhhhhhhhhhhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcceeeccCCcceEEEEeec Confidence 34445555655654322111 11 112234567778889999999999999999999999988888765422 Q ss_pred -ccccccccCCCCccccc-ccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccc Q lcl|Aclame:pro 71 -SGPIASRTDTTKAARQP-IDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVK 148 (337) Q Consensus 71 -~g~ia~Rt~t~~~~R~p-~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s 148 (337) .++.+.-...+. -.| .+...++...+.+++.-=-+.|+.+.|+.. .++|+..+.+.+.+.++.=.-.--++|+. T Consensus 169 ~~~~~a~~v~Eg~--~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~~~~~~d~~il~g~g 244 (404) T protein:vir:39 169 DVTPLTVMDAEDG--KIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDT--AENILAWLSSWIAKKVVVTRNQAIIAAMG 244 (404) T ss_pred CCccceeeecCcc--ccccccccceeeEEeeeeeEEeeehhHHHHHhhc--hHHHHHHHHHHHHHHHHHHHHHHHHhccc Confidence 223333332222 122 234456777777777776678888888763 36788888888888887644444445532 Q ss_pred cCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEE Q lcl|Aclame:pro 149 AAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVIC 228 (337) Q Consensus 149 ~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVviv 228 (337) ... | .+.. .+.|.++ +++...+++.|+.. 
-+++| T Consensus 245 ~~~-------~---------------------------------~~~~---~~~~~i~-~~~~~~~~~~~~~~--a~~v~ 278 (404) T protein:vir:39 245 TVP-------K---------------------------------KPTI---AKFDDVI-TMINTSVDPAIIAT--SSLLT 278 (404) T ss_pred ccc-------c---------------------------------cccc---ccHHHHH-HHHHHhhhhhhccC--CEEEE Confidence 110 0 0011 2355543 34555677877654 58899 Q ss_pred CHHHHHHHHHHHHh-ccCChHHHHHHHHHHhhhhhcCccccccC--ccCCCc-----eEEecchhcEEEEecCceEEEEE Q lcl|Aclame:pro 229 GRELLHDKYFPIVN-ATQAPTERLAADLIVSQKRIGNLPAVRVP--FFPKRA-----LMVTKLSNLSIYYQEGARRRTLK 300 (337) Q Consensus 229 G~dLl~~k~~~l~n-~~~~ptE~~A~~~~~~~k~iGGlpa~~vP--ffP~~~-----iliT~l~NLsiY~Q~gs~RR~~~ 300 (337) .+..+.. +..+. ..+.|-=. ..-.-....+|-|+|++... .+|..+ +++-.|++.-..+.++..+-.+. T Consensus 279 n~~~~~~--L~~lkd~~G~~l~~-~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~ 355 (404) T protein:vir:39 279 NQSGLNK--LALVKTAEGKYLLE-PDPTKPNSYLIKGKKVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPT 355 (404) T ss_pred cHHHHHH--HHHhhccCCceeec-cCcCCCCcceecceeEEEecccccCccCCCccEEEEEeccccEEEEeecceEEEEe Confidence 9987653 22222 22222100 00001133578899999865 456544 77777777655555555554443 Q ss_pred Ecc----cccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 301 EVP----ERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 301 d~p----~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) +.. 
+++.+--.-..--++.|-+..+++.+.--..++| T Consensus 356 ~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~a~~ 396 (404) T protein:vir:39 356 NIGAGAFETDTTKIRVIDRFDVKTTDSEALVAGSFTAIADQ 396 (404) T ss_pred ccchhhhhhceeeEEEEeeeccEEecccceEEEEeeccccC Confidence 332 2333333333344688889999999887777776 No 58 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=98.16 E-value=3.3e-06 Score=50.61 Aligned_cols=292 Identities=12% Similarity=0.048 Sum_probs=165.6 Q ss_pred CChHH-----HHHHHHHHHHHHHhhC--ch-hhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeeccccc Q lcl|Aclame:pro 1 MRKET-----RQAYEKYAAQIAKLND--TG-DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSG 72 (337) Q Consensus 1 M~~~t-----r~~~~~y~~~~a~~ng--v~-~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g 72 (337) |++.- .+.|..++.+.+.... +. .......|-+.+...+.+.+.+.|.+++..+++++.-... ++-.-.++ T Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~-~~p~~~~~ 79 (324) T protein:vir:99 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEPMEGTEK-KFTFWADK 79 (324) T ss_pred CCCchHhhHHHHHHHHHhhhhhhccccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCce-EEEEEecC Confidence 66543 3334444444333222 11 1122346777889999999999999999999998874332 22221223 Q ss_pred ccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCc Q lcl|Aclame:pro 73 PIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAAT 152 (337) Q Consensus 73 ~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~ 152 (337) +-++-.. .+...|..-..++...+.+++.---..|+.+.|+... ++|+..+.+.+.++++.-.-.--++|.-.. 
T Consensus 80 ~~a~~v~--Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~ai~~~~d~~~l~G~g~~-- 153 (324) T protein:vir:99 80 PGAYWVG--EGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNN-- 153 (324) T ss_pred cceeEec--cCccccccccceeEEEEeeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhhcCCCC-- Confidence 3332222 2233344456788888999998888899999998774 789999999999987765555556774311 Q ss_pred CChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHH Q lcl|Aclame:pro 153 TDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) Q Consensus 153 Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dL 232 (337) +. |. | ++.... .. .. ...+. .+.|. +.+++..| ++.++... +++|.+.. T Consensus 154 ~~----~~------~------------~~~~~~-~~--~~-~~~~~-~~~~~-i~~~~~~l-~~~~~~~~--~~v~n~~~ 202 (324) T protein:vir:99 154 PF----GK------S------------IAQSIE-KT--NK-VIKGD-FTQDN-IIDLEALL-EDDELEAN--AFISKTQN 202 (324) T ss_pred cc----Cc------c------------cccccc-cc--ce-ecccc-CCHHH-HHHHHHhh-hhccCCCC--EEEEcHHH Confidence 11 11 0 111100 00 01 11111 12333 33555544 45544443 68889888 Q ss_pred HHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCC--ceEEecchhcEEEEecCceEEEEEEcc------- Q lcl|Aclame:pro 233 LHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKR--ALMVTKLSNLSIYYQEGARRRTLKEVP------- 303 (337) Q Consensus 233 l~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~--~iliT~l~NLsiY~Q~gs~RR~~~d~p------- 303 (337) +.. -..+-.....|- .. -....++-|+|++..|..|.+ .+++..++++- |...+..+-.+.++. 
T Consensus 203 ~~~-L~~l~d~~g~~~--~~---~~~~~~l~G~PVv~~~~~~~~~~~~i~gd~~~~~-~~~~~~~~i~~~~~~~~~~~~~ 275 (324) T protein:vir:99 203 RSL-LRKIVDPETKER--IY---DRNSDTLDGLPVVNLKSSNLKRGELITGDFDKLI-YGIPQLIEYKIDETAQLSTVKN 275 (324) T ss_pred HHH-HHHhhcCCCcee--ec---CCCCccccceeEEeecCCCCCcceEEEEecccEE-EEEecCcEEEEeeccccccccc Confidence 763 222322222211 00 012357889999999987655 58888898864 444444444443332 Q ss_pred ---------cccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 304 ---------ERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 304 ---------~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) .+|.+.---..--+++|.+.++++.+.+.+.+.. T Consensus 276 ~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~~~~ 318 (324) T protein:vir:99 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTD 318 (324) T ss_pred ccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCC Confidence 2344443333445788899999999876665554 No 59 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=98.13 E-value=1.2e-06 Score=53.13 Aligned_cols=279 Identities=11% Similarity=0.056 Sum_probs=164.3 Q ss_pred CChHHHHHHHHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCC Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t 80 (337) |+-+.-..++... 
..+..-.|-+.+.+.+.+.+.+.|.+++..+++++.-..+..+-...+++.++-..- T Consensus 1 m~~~~~~~~~~~~----------t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E 70 (297) T protein:vir:95 1 MTVQTFNPENVLV----------SQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGISAYWVNE 70 (297) T ss_pred CCccccccccccc----------cCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCceeEEeec Confidence 5543332222211 112233577888899999999999999999999886555555555555555554432 Q ss_pred CCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhhh Q lcl|Aclame:pro 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) Q Consensus 81 ~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anPl 160 (337) + . ..|..-..++...+.+++.---..|+.+.|++.. ++|+..+++.++++++...-.-.+||+-....+ T Consensus 71 g-~-~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~--~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~------- 139 (297) T protein:vir:95 71 T-E-KIKTDKPEVVPVTLKAHKLGIILVTSREALNYTW--KKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFAN------- 139 (297) T ss_pred C-c-cccccccceeEEEEeeEEEEEeehhhHHHHhcCH--HHHHHHHHHHHHHHHHHHHHHHHhcccCCcccc------- Confidence 2 2 2333345678888888888888889998888553 689999999999999887777777885422211 Q ss_pred hhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHHHH Q lcl|Aclame:pro 161 LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPI 240 (337) Q Consensus 161 lqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~~l 240 (337) .++.... ......+.+-+|.+|-. ++..+.+..+.. 
-+++|.++....- ..| T Consensus 140 ------------------gi~~~~~--~~~~~~~~~~t~~~i~~----~~~~l~~~~~~~---~~~v~~~~~~~~L-~~l 191 (297) T protein:vir:95 140 ------------------SVAKAAK--DANKVIGGPINYDNILK----LQDALYDADVEP---NAFVSKIQNRSAL-REA 191 (297) T ss_pred ------------------ccccccc--ccceecccccCHHHHHH----HHHHhhhccCCc---CEEEEcHHHHHHH-HHh Confidence 1111110 00011111224544433 444444433222 3789999987633 334 Q ss_pred HhccCChHHHHHHHHHHhhhhhcCccccccCc--cCCCceEEecchhcEEEEecCceEEEEEEcc--------------- Q lcl|Aclame:pro 241 VNATQAPTERLAADLIVSQKRIGNLPAVRVPF--FPKRALMVTKLSNLSIYYQEGARRRTLKEVP--------------- 303 (337) Q Consensus 241 ~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPf--fP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p--------------- 303 (337) -.....|- ...+..++-|+|++..|. .+++.+++-.++++-+.. .+..+-.+.++. T Consensus 192 ~d~~G~~i------~~~~~~~l~G~Pv~~~~~~~~~~~~~~~gd~s~~~~~~-~~~~~i~~~~~~~~~~~~~~~~~~~~~ 264 (297) T protein:vir:95 192 RDGNKVSI------YDKAANTIDGITTVDLKSARFEKGDLLAGDFDNLIYGV-PYNITYKISEEGQISTITNADGTPINL 264 (297) T ss_pred hccCCcee------ecCCCCcccceeeEeecCCCCCCceEEEEecccEEEEE-ecCeEEEEeeccccccccccCccchhh Confidence 33322221 112345788999986554 688889999999876544 444443333332 Q ss_pred -cccceeceeeeeeeeeeeccccEEEeecceec Q lcl|Aclame:pro 304 -ERDRIENYESSNDAYVVEDFGCGCVAENIELA 335 (337) Q Consensus 304 -~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~ 335 (337) ++|.+.-.-...-++.|-+.+++|.+...+=+ T Consensus 265 ~~~~~~~~r~~~~~d~~v~~~~a~~~l~~at~~ 297 (297) T protein:vir:95 265 FEQEMIAIRATMDIAVMITKTDAFAKLTPAERV 297 (297) T ss_pred hhcCcEEEEEEEEeccEeecccceEEEeecCCC Confidence 23444333444557778888887766422222 No 60 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=98.12 E-value=4.1e-06 Score=50.15 Aligned_cols=279 Identities=9% Similarity=0.050 Sum_probs=151.8 Q ss_pred CChHHHHHHHHHHHHHHH-----hhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccc Q lcl|Aclame:pro 
1 MRKETRQAYEKYAAQIAK-----LNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIA 75 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~-----~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia 75 (337) .....+..|..|+..... ..+.....-.+.|-+.+.+.+.+.+.+.+.+++.+++++|...+|.......++.-+ T Consensus 88 ~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 167 (394) T protein:vir:10 88 PIDAKKKAINDFIHSHGKVIDNAAGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKRATDRF 167 (394) T ss_pred HHHHHHHHHHHHHhccchhhhhhhcccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEecCCCcc Confidence 234455567777654221 111222233477877889999999999999999999999988777765443322222 Q ss_pred cccCCCCccccc-ccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCC Q lcl|Aclame:pro 76 SRTDTTKAARQP-IDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTD 154 (337) Q Consensus 76 ~Rt~t~~~~R~p-~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td 154 (337) .= .+.+...| .+...++...+..++.---+.|+.+.|+. ..++|+..+.+.++++++.-.-.--.+|... T Consensus 168 ~~--~~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~la~~~~~~~~~~il~g~g~----- 238 (394) T protein:vir:10 168 SS--VAELAENPALAEPEFEQVDWSVSTYRGAIPLSEEAIAD--SAVDLTSLVGQSINEKSVNTYNAMIAPVLQS----- 238 (394) T ss_pred cc--ccccccccccccccceeEEeeeeeeEeeehhHHHHHhh--hhHHHHHHHHHHHHHHHHHHHHHHHhhcccc----- Confidence 11 11112222 23344666666666665567888888875 3478999999999888776322111222110 Q ss_pred hhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHH Q lcl|Aclame:pro 155 RQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLH 234 (337) Q Consensus 155 ~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~ 234 (337) + ..+......+.|.++ +++...+++.|. =+++|.+..+. 
T Consensus 239 ----------------------------------~--~~~~~~~~~~~d~l~-~~~~~~~~~~~~----a~~vmn~~~~~ 277 (394) T protein:vir:10 239 ----------------------------------F--TAKATTTDTLVDSLK-HILNVDLDPAYS----RALVVTQSLFN 277 (394) T ss_pred ----------------------------------c--ccccccccccHHHHH-HHHHhhhhhhcc----CEEEecHHHHH Confidence 0 111122345677765 455667788874 27999998866 Q ss_pred HHHHHHHhccCCh------HHHHHHHHHHhhhhhcCccccccCcc--CCC----ceEEecchhcEEEEecCceEEEEEEc Q lcl|Aclame:pro 235 DKYFPIVNATQAP------TERLAADLIVSQKRIGNLPAVRVPFF--PKR----ALMVTKLSNLSIYYQEGARRRTLKEV 302 (337) Q Consensus 235 ~k~~~l~n~~~~p------tE~~A~~~~~~~k~iGGlpa~~vPff--P~~----~iliT~l~NLsiY~Q~gs~RR~~~d~ 302 (337) . -..|-.....| +.-.. . ....++-|+|++.++.. |.. .+++-.|++.-+.+-.+..+-...++ T Consensus 278 ~-l~~lkd~~G~~i~~~~~~~~~~-~--~~~~~L~G~PV~~~~~~~~~~~~~~~~i~~gd~s~~~~~~~~~~~~v~~~~~ 353 (394) T protein:vir:10 278 T-LDTLKDKNGRYLLHDASDSITD-G--TAKGTVLGVPVYVVGDALLGSAAGDQKAFVGDLKRGVLFADRQQVTLAWEDS 353 (394) T ss_pred H-HHHhhccCCCeeeecccccccc-C--CcccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEEecc Confidence 4 12222222211 11000 0 12247889999887743 322 17777888733333333344444444 Q ss_pred ccccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 303 PERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 303 p~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) ....+.--.+.|- +..|-+...++.++ +.++ T Consensus 354 ~~~~~~~~~~~r~-d~~~~~~~ai~~~~---~~~~ 384 (394) T protein:vir:10 354 KIYGRYLGAAFRF-GVKQADSNAGYFVT---NTDA 384 (394) T ss_pred cccceeEEEEEEe-ccEEeccccEEEEE---eecc Confidence 4444332222232 34555566666654 3333 No 61 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=98.12 E-value=2.9e-06 Score=50.93 Aligned_cols=283 Identities=12% Similarity=0.134 Sum_probs=158.3 Q ss_pred CChHHHHHHHHHHHHHHHhh---------CchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccc Q lcl|Aclame:pro 1 
MRKETRQAYEKYAAQIAKLN---------DTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVS 71 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~n---------gv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~ 71 (337) .+...+..|.+|+....... ...+..-.+.|-+.+.+.+.+.+.+.+.+++.++++++....|.....-.+ T Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 168 (408) T protein:vir:10 89 LKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWT 168 (408) T ss_pred hHHHHHHHHHHHhhcchhhhhhhhhhhhhcccccCCceeccHhHHHHHHHHHHhhchhhhhcceeeccCCcceEEEeecc Confidence 22223333444433211110 001122346676677889999999999999999999999888886544222 Q ss_pred --cccccccCCCCccccc-ccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccc Q lcl|Aclame:pro 72 --GPIASRTDTTKAARQP-IDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVK 148 (337) Q Consensus 72 --g~ia~Rt~t~~~~R~p-~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s 148 (337) ++.+.-+. .....| .+...++...+..++.---+.|+.+.|+.. ..+|+..+.+.+.++++.-.-.--++|+. T Consensus 169 ~~~~~a~~v~--E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~~~~~~~~~il~g~g 244 (408) T protein:vir:10 169 DVTPLTVMDA--EDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDT--AENILAWLSSWIAKKVVVTRNQAIIEVMK 244 (408) T ss_pred ccccceeeec--CccccccccCcceeeEEeeeeeEEeeehhHHHHHhhc--hHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 12222221 122223 244457777888888877788888888863 35889999999998888654443444432 Q ss_pred cCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEE Q lcl|Aclame:pro 149 AAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVIC 228 (337) Q Consensus 149 ~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVviv 228 (337) ... .+ + .=.+.|.|+. ++...+++.|+.. 
-+++| T Consensus 245 ~~~-------------------------------------~~---~---~~~~~~~l~~-~~~~~~~~~~~~~--a~~v~ 278 (408) T protein:vir:10 245 AAP-------------------------------------KK---P---TIAKFDDVIT-MINTAVDPAIIAT--SSLLT 278 (408) T ss_pred ccc-------------------------------------cc---c---ccccHHHHHH-HHHHhhhhhhccC--CEEEE Confidence 110 00 0 0024566554 3444567777653 58899 Q ss_pred CHHHHHHHHHHHHh-ccCChHHHHHHHH-HHhhhhhcCccccccC--ccCCCc-----eEEecchhcEEEEecCceEEEE Q lcl|Aclame:pro 229 GRELLHDKYFPIVN-ATQAPTERLAADL-IVSQKRIGNLPAVRVP--FFPKRA-----LMVTKLSNLSIYYQEGARRRTL 299 (337) Q Consensus 229 G~dLl~~k~~~l~n-~~~~ptE~~A~~~-~~~~k~iGGlpa~~vP--ffP~~~-----iliT~l~NLsiY~Q~gs~RR~~ 299 (337) .+..+.. +..+. ..+.|- ..... .....++-|+|++.++ .+|..+ +++-.+++.-..+.++...-.+ T Consensus 279 n~~~~~~--l~~lkd~~G~~i--~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~ 354 (408) T protein:vir:10 279 NQSGLNK--LALVKTAEGKYL--LEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLP 354 (408) T ss_pred cHHHHHH--HHHhhccCCceE--eccCcCCCCCceecceeeEEecccccCccCCCceEEEEEehhccEEEEEecceEEEE Confidence 9988764 22232 111211 00000 0123588999999876 567655 7777888754344334444333 Q ss_pred EEcc----cccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 300 KEVP----ERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 300 ~d~p----~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) .+++ .++.+--+-..--+.+|-+...++.++--..+++ T Consensus 355 ~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~~~~~~ 396 (408) T protein:vir:10 355 TNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQ 396 (408) T ss_pred cccccchhhcCceEEEEEEeeccEEeccccEEEEEeeccccC Confidence 3332 2333333334445667777777776653333333 No 62 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=98.09 E-value=4.9e-06 Score=49.70 Aligned_cols=324 Identities=11% Similarity=-0.000 Sum_probs=164.1 Q ss_pred 
CChHHHHHHHHHHHHHHH--hhCc-hhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccc Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAK--LNDT-GDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASR 77 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~--~ngv-~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~R 77 (337) ...+.+..+..+....+. .+.+ .+..-.+.|-|.+...+.+.+++.+.+++.+++++++--...........+-++- T Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~w 209 (497) T protein:vir:78 130 AAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAA 209 (497) T ss_pred HHHHHHHHHhhhhhhHHHHHhhhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCccee Confidence 111112222222222111 1111 1122346788999999999999999999999999887533322221111122222 Q ss_pred cCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccC------- Q lcl|Aclame:pro 78 TDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAA------- 150 (337) Q Consensus 78 t~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A------- 150 (337) + +.+...|..-..++...+..++.---+.|+.+.|+.. |+++..+++.+.+.++.=.-.--+||+-.. T Consensus 210 v--~E~~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~---~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~ 284 (497) T protein:vir:78 210 V--AEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQ 284 (497) T ss_pred e--ccCcccccccccceeeEeeeeeeEeecHhHHHHHHhH---HHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccccc Confidence 2 1223334444557777788777777788999999874 678999999999988864333333332110 Q ss_pred -----CcC---------Chh--------hhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHH Q lcl|Aclame:pro 151 -----ATT---------DRQ--------ANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMD 208 (337) Q Consensus 151 -----~~T---------d~~--------anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d 208 (337) ..+ ... ..-....+|..|+..++..+....... ..+-+..+..-++..++-++. 
T Consensus 285 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~- 360 (497) T protein:vir:78 285 RSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAG---SGSGVAGSYPTAAEIAENVFD- 360 (497) T ss_pred ccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhh---hccchhccccchhhhhhHHHH- Confidence 000 000 001223455666666665433222211 111112222233344443333 Q ss_pred HHhcccChhHcCCCCEEEEECHHHHHHHHHHHHh-ccC-----ChHHHHHHHHHHhhhhhcCccccccCccCCCceEEec Q lcl|Aclame:pro 209 IVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVN-ATQ-----APTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTK 282 (337) Q Consensus 209 ~~~~lid~~~r~~~~LVvivG~dLl~~k~~~l~n-~~~-----~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~ 282 (337) ++..+- .-....++ +++|.+.-+.. ..++. ..+ .+..-.+.+.....+++-|+|++..|++|++.+++-. T Consensus 361 ~~~~~~-~~~~~~~~-~~vmn~~~~~~--l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd 436 (497) T protein:vir:78 361 AFVDIQ-LTLFQTPN-AVVMNPRDWEL--LRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGH 436 (497) T ss_pred HHhhhh-hhcccCCC-eEEEchHHHHH--HHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCCceEEee Confidence 232222 22333344 46677654432 22222 111 1111122333344568889999999999999999987 Q ss_pred chhcEEEE-ecCceEEEEEEc----ccccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 283 LSNLSIYY-QEGARRRTLKEV----PERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 283 l~NLsiY~-Q~gs~RR~~~d~----p~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) ++...+.+ -++..+-.+-+. 
=.+|.+.----.--++.|-+.+.++.++-...+.| T Consensus 437 ~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~~~ 496 (497) T protein:vir:78 437 FAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATG 496 (497) T ss_pred cccceEEEEEecccEEEeecccchhhhcCcEEEEEEEeecceeeccccEEEEEecCCccC Confidence 76644432 333333333221 12333332222233456667777777776666666 No 63 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=98.09 E-value=4.9e-06 Score=49.70 Aligned_cols=324 Identities=11% Similarity=-0.000 Sum_probs=164.1 Q ss_pred CChHHHHHHHHHHHHHHH--hhCc-hhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccc Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAK--LNDT-GDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASR 77 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~--~ngv-~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~R 77 (337) ...+.+..+..+....+. .+.+ .+..-.+.|-|.+...+.+.+++.+.+++.+++++++--...........+-++- T Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~w 209 (497) T protein:vir:10 130 AAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAA 209 (497) T ss_pred HHHHHHHHHhhhhhhHHHHHhhhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCccee Confidence 111112222222222111 1111 1122346788999999999999999999999999887533322221111122222 Q ss_pred cCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccC------- Q lcl|Aclame:pro 78 TDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAA------- 150 (337) Q Consensus 78 t~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A------- 150 (337) + +.+...|..-..++...+..++.---+.|+.+.|+.. |+++..+++.+.+.++.=.-.--+||+-.. 
T Consensus 210 v--~E~~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~---~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~ 284 (497) T protein:vir:10 210 V--AEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQ 284 (497) T ss_pred e--ccCcccccccccceeeEeeeeeeEeecHhHHHHHHhH---HHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccccc Confidence 2 1223334444557777788777777788999999874 678999999999988864333333332110 Q ss_pred -----CcC---------Chh--------hhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHH Q lcl|Aclame:pro 151 -----ATT---------DRQ--------ANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMD 208 (337) Q Consensus 151 -----~~T---------d~~--------anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d 208 (337) ..+ ... ..-....+|..|+..++..+....... ..+-+..+..-++..++-++. T Consensus 285 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~- 360 (497) T protein:vir:10 285 RSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAG---SGSGVAGSYPTAAEIAENVFD- 360 (497) T ss_pred ccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhh---hccchhccccchhhhhhHHHH- Confidence 000 000 001223455666666665433222211 111112222233344443333 Q ss_pred HHhcccChhHcCCCCEEEEECHHHHHHHHHHHHh-ccC-----ChHHHHHHHHHHhhhhhcCccccccCccCCCceEEec Q lcl|Aclame:pro 209 IVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVN-ATQ-----APTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTK 282 (337) Q Consensus 209 ~~~~lid~~~r~~~~LVvivG~dLl~~k~~~l~n-~~~-----~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~ 282 (337) ++..+- .-....++ +++|.+.-+.. ..++. ..+ .+..-.+.+.....+++-|+|++..|++|++.+++-. 
T Consensus 361 ~~~~~~-~~~~~~~~-~~vmn~~~~~~--l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd 436 (497) T protein:vir:10 361 AFVDIQ-LTLFQTPN-AVVMNPRDWEL--LRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGH 436 (497) T ss_pred HHhhhh-hhcccCCC-eEEEchHHHHH--HHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCCceEEee Confidence 232222 22333344 46677654432 22222 111 1111122333344568889999999999999999987 Q ss_pred chhcEEEE-ecCceEEEEEEc----ccccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 283 LSNLSIYY-QEGARRRTLKEV----PERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 283 l~NLsiY~-Q~gs~RR~~~d~----p~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) ++...+.+ -++..+-.+-+. =.+|.+.----.--++.|-+.+.++.++-...+.| T Consensus 437 ~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~~~ 496 (497) T protein:vir:10 437 FAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATG 496 (497) T ss_pred cccceEEEEEecccEEEeecccchhhhcCcEEEEEEEeecceeeccccEEEEEecCCccC Confidence 76644432 333333333221 12333332222233456667777777776666666 No 64 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=98.08 E-value=2.8e-06 Score=51.06 Aligned_cols=294 Identities=10% Similarity=0.003 Sum_probs=149.8 Q ss_pred CChHHHHHHHHHHH--------H-------HHHhhCchhhcceEeechHHHHH-HHHHHHhhHHHhcccceeccchhhce Q lcl|Aclame:pro 1 MRKETRQAYEKYAA--------Q-------IAKLNDTGDVSKKFAVEPTVQQR-LETKMQESSEFLKRINVLPVTELEGE 64 (337) Q Consensus 1 M~~~tr~~~~~y~~--------~-------~a~~ngv~~~~~~Fsv~P~~~q~-L~~~iqess~FL~~Inv~~V~~~~Ge 64 (337) -+...+..+..++. . -+...++.+.+-.+-|-+.+... +...+.+++.+.+..++++. .|. 
T Consensus 217 ~~~~~~~a~~~~~~~~~~~~l~~~e~~~~~~~~~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~---~g~ 293 (543) T protein:vir:81 217 SSPAYLRAWSKMARNPHAAILTEEEKRAINEVRAMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQVVA---TGD 293 (543) T ss_pred hhhhhhhHHHHHHHhhHHHHhhhhhhhhhhhhhhcccccccCcccCchhhhhHHHHHHHhhhchhhhhcccccC---Ccc Confidence 00111111111111 0 01112222222233344455544 45777888888888887665 343 Q ss_pred e-eecccccccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhc Q lcl|Aclame:pro 65 K-LGLSVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIG 143 (337) Q Consensus 65 ~-v~lgv~g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IG 143 (337) . +....+++.+.-+. .+...|..-..++...+..++.---+.|+.+.|+. .++|...+.+.+.+.++.-.-.-. T Consensus 294 ~~~~~~~~~~~a~~v~--Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d---~~~~~~~i~~~l~~~~~~~~d~ai 368 (543) T protein:vir:81 294 VWHGVSSAAVQWSWDA--EFEEVSDDSPEFGQPEIPVKKAQGFVPISIEALQD---EANVTETVALLFAEGKDELEAVTL 368 (543) T ss_pred eEEEEecCCcceeecc--cCccccccccccceeeeeeeeeEeeehhhHHHHhc---cHHHHHHHHHHHHHHHHHHHHHHH Confidence 2 22333344443322 22233445566888899999999999999999974 269999999999999998777777 Q ss_pred ccccccCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCC Q lcl|Aclame:pro 144 WNGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTG 223 (337) Q Consensus 144 fnG~s~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~ 223 (337) |||.-.+ ..|. |.+ +........++.+..+. ..+|. +.+++.. +++.|+. . 
T Consensus 369 l~G~Gt~------~~p~------Gi~------------~~~~~~~~~~~~~~~~~-~~~~~-~~~~~~~-l~~~~~~--~ 419 (543) T protein:vir:81 369 TTGTGQG------NQPT------GIV------------TALAGTAAEIAPVTAET-FALAD-VYAVYEQ-LAARHRR--Q 419 (543) T ss_pred hccCCCC------cccc------cch------------hhccccccccccccccc-ccHHH-HHHHHHh-hhccccC--C Confidence 8884211 1222 222 21111111222222222 12222 2344443 4566654 3 Q ss_pred EEEEECHHHHHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCc----------eEEecchhcEEEEecC Q lcl|Aclame:pro 224 LVVICGRELLHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRA----------LMVTKLSNLSIYYQEG 293 (337) Q Consensus 224 LVvivG~dLl~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~----------iliT~l~NLsiY~Q~g 293 (337) -+++|.+..+..- ..+-...+.|-=.... -....+|-|+|++..+++|.+. +++-.++++-|....| T Consensus 420 ~~~v~n~~~~~~l-~~lkd~~G~~l~~~~~--~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~i~~gd~~~~~i~~~~~ 496 (543) T protein:vir:81 420 GAWLANNLIYNKI-RQFDTQGGAGLWTTIG--NGEPSQLLGRPVGEAEAMDANWNTSASADNFVLLYGNFQNYVIADRIG 496 (543) T ss_pred cEEEEcHHHHHHH-HHhhcCCCceeccCcC--CCCCccccceeeEEeccccccccccccCCcceEEEeeccceeEEeecc Confidence 5889999886532 2232222222110000 0123478899999999999875 7888888887766555 Q ss_pred ceEEEEEE-----cccccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 294 ARRRTLKE-----VPERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 294 s~RR~~~d-----~p~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) ..=....+ +-.++.+--+-..--|+.|-+..+++.+. 
+.-| T Consensus 497 ~~i~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~---~~~~ 542 (543) T protein:vir:81 497 MTVEFIPHLFGTNRRPNGSRGWFAYYRMGADVVNPNAFRLLN---VETA 542 (543) T ss_pred cEEEEeccccccchhhcCceEEEEEEeeccEeecccceEEEE---eccc Confidence 32222111 11112222222222345555555555443 3333 No 65 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=98.05 E-value=5.9e-06 Score=49.27 Aligned_cols=279 Identities=14% Similarity=0.126 Sum_probs=150.2 Q ss_pred CChHHHHHHHHHHHH-HHHhhCch-hhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceee-ecccccccccc Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQ-IAKLNDTG-DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKL-GLSVSGPIASR 77 (337) Q Consensus 1 M~~~tr~~~~~y~~~-~a~~ngv~-~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v-~lgv~g~ia~R 77 (337) ++...+..|..++.. ..+..... ...-.+.|-+.+...+.+.+.+.|.+++.+++++++-..|... ....+++-++- T Consensus 71 ~~~~~~~~~~~~l~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~~~~~~a~~ 150 (371) T protein:vir:81 71 VKENEVEAFVNHIRTRFRNAMSEGSNQDGGYTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKKRSQQTGFVE 150 (371) T ss_pred hHHHHHHHHHHHHHHHHHHhhccCCCccCceeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCcceee Confidence 444555666666543 22222221 2234566777788999999999999999999999987777653 33333333322 Q ss_pred cCCCCccccc-ccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChh Q lcl|Aclame:pro 78 TDTTKAARQP-IDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQ 156 (337) Q Consensus 78 t~t~~~~R~p-~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~ 156 (337) ... +...| .+...++.....+++.---+.|+.+.|+... ++|+.-+.+.+.++++.-.-..=++|+.... 
T Consensus 151 v~E--g~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~a~~~~~~~~i~~g~g~~~----- 221 (371) T protein:vir:81 151 VAE--GAAIGEKATPQFTLLQYQVKKYAGFFRVTNELLNDST--EAIVNTLVRWIGDESRVTRNGLIINVLNTKA----- 221 (371) T ss_pred ecc--ccccccccccceeeEEeeeeEEEEeehhhHHHHhhhh--HHHHHHHHHHHHHHHHHHHHHHHHhhccccc----- Confidence 222 22222 2334567777888888777899999988643 6889999999998877644433344432111 Q ss_pred hhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHH Q lcl|Aclame:pro 157 ANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDK 236 (337) Q Consensus 157 anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k 236 (337) | .+ -.+.|.+... +...+++.|+. ..+++|.+..... T Consensus 222 --~----------------------------~~---------~~~~~~i~~~-~~~~l~~~~~~--~a~~vmn~~~~~~- 258 (371) T protein:vir:81 222 --K----------------------------TA---------IADLDGLKQI-INVQLDPVFRS--TSSVIVNQDAFNW- 258 (371) T ss_pred --c----------------------------cc---------cccHHHHHHH-HHhhcchhhhc--CCEEEEcHHHHHH- Confidence 0 00 0234444433 34456777764 4588899987653 Q ss_pred HHHHHh-ccCChHHHHHHHHHHhhhhhcCccccccCccCCC------------ceEEecchh-cEEEEecCceEEEEEEc Q lcl|Aclame:pro 237 YFPIVN-ATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKR------------ALMVTKLSN-LSIYYQEGARRRTLKEV 302 (337) Q Consensus 237 ~~~l~n-~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~------------~iliT~l~N-LsiY~Q~gs~RR~~~d~ 302 (337) +..+. ....|-=. ..-.-....++-|+|++..+++|.+ .+++=.+++ ..++.+.|..= .+-+. 
T Consensus 259 -L~~lkd~~g~~l~~-~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~~~i~~Gd~~~~~~~~~~~~~~i-~~~~~ 335 (371) T protein:vir:81 259 -LDTLKDQNGQYLLQ-PSISSPTGRQLLGLPVVIVSNKVLANRVDGGTGAQFAPIIVGDLKEAVVMFDRQRTEI-MSSNV 335 (371) T ss_pred -HHHhhccCCCeeee-cccCCCCCceecceeEEEecccccCccccccccCCcceEEEEehhceEEEEeecceEE-EEecc Confidence 22222 21111000 0000023457889999999999854 345555554 33333333321 11111 Q ss_pred c----cccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 303 P----ERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 303 p----~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) . .++.+--.-..--++.|-+...++.+ ++.-| T Consensus 336 ~~~~f~~~~v~~~~~~r~d~~~~~~~a~~~~---~~~~A 371 (371) T protein:vir:81 336 AMDAFETDATLWRAIERMDVKMRDDEAFVFG---EVQLA 371 (371) T ss_pred ccchhhcCceEEEEEEeeccEEecccceEEE---EEecC Confidence 1 12222222222223344444444433 45555 No 66 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=98.04 E-value=6.1e-06 Score=49.16 Aligned_cols=303 Identities=12% Similarity=0.133 Sum_probs=157.5 Q ss_pred CChHHHHHHHHHHHHHHH--hhCch--------hhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeeccc Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAK--LNDTG--------DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSV 70 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~--~ngv~--------~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv 70 (337) ...+.+..|..|+.+-.. +...+ +..-.+.|-+.+...+.+.+++.+.+++.++++++...... +..-. 
T Consensus 78 ~~~e~~~a~~~~l~~g~~~~~~~~e~~a~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~-~~~~~ 156 (407) T protein:vir:48 78 VASEHKEAFIGFMRKGREDGLRELERKALQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATVITLGGSDYK-KLVNL 156 (407) T ss_pred hhhHHHHHHHHHHhccchhhhhHHHHHhhhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhceeeecCCCceE-EEEec Confidence 666677788888653210 00000 11223456556788899999999999999999988754333 32333 Q ss_pred ccccccccCCCCcccccc-cccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhccccccc Q lcl|Aclame:pro 71 SGPIASRTDTTKAARQPI-DPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKA 149 (337) Q Consensus 71 ~g~ia~Rt~t~~~~R~p~-~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~ 149 (337) +++-++-+. .+...|. +...++...|..++.---+.|+.+.|+. ...+|+..+.+.+.+.++.=.-.-=+||+-. T Consensus 157 ~~~~a~~v~--E~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~ 232 (407) T protein:vir:48 157 GGTTSGWVG--ETDARPETATSKLGLIEPFMGEIYGNPQATQKMLDD--AFFNVEDWINSELALEFAEQEEIAFTSGDGS 232 (407) T ss_pred CCcceeeec--ccccccccccccceeEEeeeeeeEeehhhHHHHHhc--chHHHHHHHHHHHHHHHHHHHHhhhhccCCC Confidence 444443332 2222232 2335667778888877778999999985 2257888888888887776433333566321 Q ss_pred CCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEEC Q lcl|Aclame:pro 150 AATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICG 229 (337) Q Consensus 150 A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG 229 (337) ..|. |=|..............+. ...+..+..+ -.+.|.|+ +++.+| ++.|+..+ +++|. 
T Consensus 233 -------~~p~------Gil~~~~~~~~~~~~~~~~--~~~~~~~~~~-~~~~d~i~-~l~~~l-~~~~~~~a--~~v~n 292 (407) T protein:vir:48 233 -------KKPK------GFLAYESTDEDDKTRAFGK--LQHIASGAAS-GVTADAII-KLIYTL-RKAHRSGA--KFMMN 292 (407) T ss_pred -------Cccc------eeeeccccccccccccccc--cccccccccc-ccChHHHH-HHHHhh-chhhhcCC--EEEEc Confidence 1122 2121110000000000000 0112222222 24456654 666654 77777754 67888 Q ss_pred HHHHHHHHHHHHh-ccCChH--HHHHHHHHHhhhhhcCccccccCccCCCc-----eEEecchh-cEEEEecCceEEEEE Q lcl|Aclame:pro 230 RELLHDKYFPIVN-ATQAPT--ERLAADLIVSQKRIGNLPAVRVPFFPKRA-----LMVTKLSN-LSIYYQEGARRRTLK 300 (337) Q Consensus 230 ~dLl~~k~~~l~n-~~~~pt--E~~A~~~~~~~k~iGGlpa~~vPffP~~~-----iliT~l~N-LsiY~Q~gs~RR~~~ 300 (337) +..++. +..+. ..+.|- .-... ....++-|+|++..+++|..+ +++=.|+. ..|+-..| .+-... T Consensus 293 ~~~~~~--L~~lkD~~Gr~l~~~~~~~---g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~-~~i~~d 366 (407) T protein:vir:48 293 NSSLFA--IRLLKDNDGNYLWRPGIEL---GQPSSLAGYGIVENEQMPDIAADAKAIAFGNFKRGYTIVDRIG-TRILRD 366 (407) T ss_pred HHHHHH--HHHhhccCCceeeccCcCC---CCCceecceeeEEecCcCCccCCccEEEEEeccccEEEEEeec-eEEEee Confidence 887642 22232 222221 00000 123478999999999999733 66666654 33332333 332211 Q ss_pred EcccccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 301 EVPERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 301 d~p~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) +.-.++.+.-+-..--++.|-|.++++. 
++++.| T Consensus 367 ~~~~~~~~~~~~~~r~d~~v~~~~a~~~---l~~~aa 400 (407) T protein:vir:48 367 PYTNKPFVGFYTTKRTGGMLVDSQAIKL---MKIGAA 400 (407) T ss_pred ccccCCcEEEEEEEEeccEEecccceEE---EEeecc Confidence 1122344433333344555566665554 344444 No 67 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=97.99 E-value=7.8e-06 Score=48.59 Aligned_cols=292 Identities=12% Similarity=0.042 Sum_probs=165.0 Q ss_pred CChHHH-----HHHHHHHHHHHHhhC--ch-hhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeeccccc Q lcl|Aclame:pro 1 MRKETR-----QAYEKYAAQIAKLND--TG-DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSG 72 (337) Q Consensus 1 M~~~tr-----~~~~~y~~~~a~~ng--v~-~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g 72 (337) |++... ..|..+....+.... +. .......|-+.+.+.+.+.+.+.|.+++..+++++.-... ++-.-.++ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~-~ip~~~~~ 79 (324) T protein:vir:97 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEK-KFTFWADK 79 (324) T ss_pred CccchhHHHHHHHHHHhhhhhhhhccccccccCCCcceechhHHHHHHHHHHhhcchhhhcceeeccCCce-EEEEEecC Confidence 765432 334444444443222 11 1234456667788999999999999999999998763222 22222233 Q ss_pred ccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCc Q lcl|Aclame:pro 73 PIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAAT 152 (337) Q Consensus 73 ~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~ 152 (337) +-+.-+. .+...|..-..++...+.+++.---..|+.+.|+... ++|+..+.+.+.++++.-.-..-++|+-... 
T Consensus 80 ~~a~~v~--Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~--~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~- 154 (324) T protein:vir:97 80 PGAYWVG--EGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNNP- 154 (324) T ss_pred cceeEec--cCccccccccceeEEEEeeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhccCCCCc- Confidence 3333322 2223344456788889999999888899999898654 7899999999999988877777778853211 Q ss_pred CChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHH Q lcl|Aclame:pro 153 TDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) Q Consensus 153 Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dL 232 (337) .|. -+..... .......+.-.|.+|- +++..+ .+-++... +++|.+.. T Consensus 155 -----~~~------------------gi~~~~~--~~~~~~~~~~~~~~i~----~~~~~l-~~~~~~~~--~~v~n~~~ 202 (324) T protein:vir:97 155 -----FGK------------------SIAQSIE--KTNKVIKGDFTQDNII----DLEALL-EDDELEAN--AFISKTQN 202 (324) T ss_pred -----cCc------------------ccccccc--ccceeccccCCHHHHH----HHHHhh-hhccCCCC--EEEEcHHH Confidence 111 1111100 0011112223344443 344444 33333322 67888887 Q ss_pred HHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccC--CCceEEecchhcEEEEecCceEEEEEEcc------- Q lcl|Aclame:pro 233 LHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFP--KRALMVTKLSNLSIYYQEGARRRTLKEVP------- 303 (337) Q Consensus 233 l~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP--~~~iliT~l~NLsiY~Q~gs~RR~~~d~p------- 303 (337) +.. -..+-.....| ... -....++-|+|++..|..| .+.+++-.++++-|-. .+..+-.+-++. 
T Consensus 203 ~~~-L~~lkd~~g~~--~~~---~~~~~tl~G~PV~~~~~~~~~~~~~~~gd~~~~~i~~-~~~~~i~~~~~~~~~~~~~ 275 (324) T protein:vir:97 203 RSL-LRKIVDPETKE--RIY---DRNSDTLDGLPVVNLKSSNLKRGELITGDFDKLIYGI-PQLIEYKIDETAQLSTVKN 275 (324) T ss_pred HHH-HHHhhcCCCce--eec---CCCCccccceeeEeecCCCCCcceEEEEecccEEEEE-ecCcEEEEeeccccccccc Confidence 763 22232222211 000 0123578899999888755 5568888888875433 333433333332 Q ss_pred ---------cccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 304 ---------ERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 304 ---------~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) ++|.+.---..--++.|-+.++++.+.+.+-+.. T Consensus 276 ~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 318 (324) T protein:vir:97 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTD 318 (324) T ss_pred ccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCCCC Confidence 2333333333444778888888888876554333 No 68 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=97.96 E-value=6.2e-06 Score=49.14 Aligned_cols=297 Identities=10% Similarity=0.050 Sum_probs=154.7 Q ss_pred CChHHHHHHHHHHHHHHHhhC------------------------chhhcceEeechHHHHHHHHHHHhhHHHhcc-cce Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLND------------------------TGDVSKKFAVEPTVQQRLETKMQESSEFLKR-INV 55 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ng------------------------v~~~~~~Fsv~P~~~q~L~~~iqess~FL~~-Inv 55 (337) +.. -...|..|+..++..-| ..+..-.+.|-+.+.+.+.+.+++.+.+++. 
.++ T Consensus 91 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~l~~~~~i~~~~~~~ 169 (435) T protein:vir:14 91 LEV-KGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGART 169 (435) T ss_pred hhh-hHHHHHHHHHHHHhhcchhhHHHHHHHhhhhhhhhhhhcccCCcCCCccccchhHHHHHHHHHhhhchhhhhccee Confidence 111 11223333333222111 0111112445556678899999988887764 344 Q ss_pred eccchhhceee-ecccccccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHH Q lcl|Aclame:pro 56 LPVTELEGEKL-GLSVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQ 134 (337) Q Consensus 56 ~~V~~~~Ge~v-~lgv~g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~ 134 (337) ++.. .|..- -.-.+++-++-+.- ....|..-..++...|.+++.---+.|+.+.|+.-+-.|+++..+.+.+.++ T Consensus 170 ~~~~--~~~~~~p~~~~~~~a~~v~E--~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~l~~~i~~~l~~a 245 (435) T protein:vir:14 170 LPLS--NGNITIPRLKGGAIVGYIGA--DTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTAA 245 (435) T ss_pred eecC--CCceEEEEEeCCcceeeecc--CccccccccceeEEEeeeEEEEEeehhhHHHHHhhccCHHHHHHHHHHHHHH Confidence 4443 33211 11112333333222 1222333345778888898888889999999999666688999999999999 Q ss_pred HhhhhHHhcccccccCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhccc Q lcl|Aclame:pro 135 GALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMI 214 (337) Q Consensus 135 ~aLD~i~IGfnG~s~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~li 214 (337) ++.-.-.--++|+..+. +| .|++.. .. 
.......-.++.+..+.+.+.+++..+ T Consensus 246 i~~~~d~a~l~G~G~~~------~p------~Gi~~~------------~~-~~~~~~~~~~~~~~~~~~~~~~l~~~~- 299 (435) T protein:vir:14 246 IGAREDKAFIRDDGTAN------TP------KGLRFW------------AL-PSNVITASDASTLQKIETDLGKVILAL- 299 (435) T ss_pred HHHHHHHHhhccCCCCc------cc------cceeec------------cc-ccceeccccccchhhHHHHHHHHHHHh- Confidence 88644444457743221 12 244321 00 011111112223333333333333322 Q ss_pred ChhHcCCCCEEEEECHHHHHHHHHHHHhcc-CChHHHHHHHHHHhhhhhcCccccccCccCCC--------ceEEecchh Q lcl|Aclame:pro 215 DPWFQEDTGLVVICGRELLHDKYFPIVNAT-QAPTERLAADLIVSQKRIGNLPAVRVPFFPKR--------ALMVTKLSN 285 (337) Q Consensus 215 d~~~r~~~~LVvivG~dLl~~k~~~l~n~~-~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~--------~iliT~l~N 285 (337) ..........+++|.+..+..- ..+.-. ..|- - -+ ....++-|+|++..+++|.+ .+++-.++. T Consensus 300 ~~~~~~~~~~~~v~n~~~~~~L--~~lkd~~G~~l-~--~~--~~~g~l~G~Pv~~~~~~p~~~~~~~~~~~i~~gd~s~ 372 (435) T protein:vir:14 300 ENADANLTQPGWIMAPRTFRFL--EGLRDGNGNKV-Y--PE--LANGMLKGYPVGKTTQVPINLGETGKESEIYFTDFGD 372 (435) T ss_pred hhccccccCCEEEEcHHHHHHH--HHhhccCCcee-c--cC--CCCCeeecceeEeeccccccccCCCccceEEEeeccc Confidence 1111222345899999887542 223222 1221 0 00 12457889999999999985 577777776 Q ss_pred cEEEEecCceEEEEEEccc-------------ccceeceeeeeeeeeeeccccEEEeecceecc Q lcl|Aclame:pro 286 LSIYYQEGARRRTLKEVPE-------------RDRIENYESSNDAYVVEDFGCGCVAENIELAA 336 (337) Q Consensus 286 LsiY~Q~gs~RR~~~d~p~-------------r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~ 336 (337) .-| ..++..+-.+.++.. +|++.---..=-++.|=+..+++.+.++.++. 
T Consensus 373 ~~i-~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:14 373 VFI-GEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVAWGA 435 (435) T ss_pred EEE-EEecccEEEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecCCCCC Confidence 433 344444444333321 22222222222357788888899999988888 No 69 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=97.94 E-value=3.7e-06 Score=50.37 Aligned_cols=292 Identities=14% Similarity=0.108 Sum_probs=151.2 Q ss_pred CChHHHHHHHHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCC Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t 80 (337) ++++-|..++.... +. .....+.|-+.+.+.+.+.+++.|.+++.++++++.- ...+-...+++-++-.. T Consensus 75 l~~ee~~~~~~~~~------~t-~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~~--~~~i~~~~~~~~a~w~~- 144 (395) T protein:vir:95 75 LTSEERKFFNDINY------DV-GYTDEKILPETVVERVFDDLQKDHPLLSKINFQNAGI--KTRVIKADPAGQAVWGK- 144 (395) T ss_pred cchHHHHHHHHHhh------cc-CCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEecCCcceEEee- Confidence 44444444433211 11 1123467878889999999999999999999988752 12333333333332211 Q ss_pred CCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhhh Q lcl|Aclame:pro 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) Q Consensus 81 ~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anPl 160 (337) ...++.+..-..++...+.+++.---..|+.+.|+.= ..+++..+++.++++++.=.-.--+||+-...+. |. 
T Consensus 145 e~~~~~~~~~~~f~~i~l~~~kl~~~~~iS~ell~ds--~~~ie~~i~~~la~~ia~~~~~a~i~G~G~~~~q-----P~ 217 (395) T protein:vir:95 145 VFGEIKGQLDAAFREENFTQYKLTCFVVLPDDLSTFG--PAWIERFVRTQIQEAISVALESAIINGGGAAKTQ-----PV 217 (395) T ss_pred cccccCccccccceeeeeceeeEEEeecccHHHHhcc--hhHHHHHHHHHHHHHHHHHHhhheeeccCCCCcC-----ce Confidence 1233434334456777788888777788999999742 2468888999999999887766677886544311 32 Q ss_pred hhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHH---hcc----cChhHcCCCCEEEEECHHHH Q lcl|Aclame:pro 161 LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIV---SSM----IDPWFQEDTGLVVICGRELL 233 (337) Q Consensus 161 lqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~---~~l----id~~~r~~~~LVvivG~dLl 233 (337) |+|..+-.. .....++. ..+.+ .|.+++.++..+. ..+ .....+....++++|.+..+ T Consensus 218 ------Gil~~~~~~--~~~~~~~~-~~~~~------t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~ 282 (395) T protein:vir:95 218 ------GLMKDVNTN--SGAVTDKA-SSGTL------TFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDS 282 (395) T ss_pred ------eeeeccccc--cccccccc-ccchh------hhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhh Confidence 665322110 00111110 01111 1233333322211 110 11112334567888887655 Q ss_pred HHHHH-HHHh-ccCChHHHHHHHHHHhhhhhc-CccccccCccCCCceEEecchhcEEEEecCceEEEEEEcc--cccce Q lcl|Aclame:pro 234 HDKYF-PIVN-ATQAPTERLAADLIVSQKRIG-NLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVP--ERDRI 308 (337) Q Consensus 234 ~~k~~-~l~n-~~~~ptE~~A~~~~~~~k~iG-Glpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p--~r~rv 308 (337) .+..- ++.. ....|. ..+| |+|++.-+++|++.+++-.+++..|+...| .+-..-++. 
.++++ T Consensus 283 ~~~~g~~~~~~~~G~~~-----------~~lg~g~~v~~~~~~p~~~i~fgdfs~y~i~~r~~-~~i~~~~~~~~~~d~~ 350 (395) T protein:vir:95 283 WDVQARYTYLTANGGFV-----------TVLPYNVTIITSEFVPEGKLVAFVTDRYNAVRGGG-LTVKKFDQTLALEDAV 350 (395) T ss_pred hhcCCcceeccCCCcce-----------eccCCcceEEEcCCCCCCcEEEEecccEEEEEecc-eEEEeccchhhhCCcE Confidence 44221 1111 111111 1122 778899999999999998888866654333 332222221 12333 Q ss_pred eceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 309 ENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 309 e~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) ..+-..--+-.+=|.+.+.+++ |++.++ T Consensus 351 ~f~~~~r~dg~~~~~~A~~~l~-i~~~~~ 378 (395) T protein:vir:95 351 LFTAKTFAYGQPDDNKASAVYD-LKVASA 378 (395) T ss_pred EEEEEEEECCEEeccccEEEEE-eeccCC Confidence 3333333333444444444332 344444 No 70 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=97.93 E-value=5.2e-06 Score=49.57 Aligned_cols=283 Identities=11% Similarity=0.011 Sum_probs=159.8 Q ss_pred hCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCCCCcccccccccccCCceeE Q lcl|Aclame:pro 20 NDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDTTKAARQPIDPTALDSNRYR 99 (337) Q Consensus 20 ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~ 99 (337) .|+.. +-.+.|-+.+.+.+.+.+++.|.+++..+++++.--. .++-.-.+++-+.-.. .....|..-..++...+. 
T Consensus 1 m~t~t-~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~-~~ip~~~~~~~a~wv~--E~~~~~~s~~~f~~v~l~ 76 (303) T protein:vir:97 1 MGTET-SKASLFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNG-SKEFTFTLDSDIDVVA--ENGKKTHGGLSLEPVTIV 76 (303) T ss_pred CcccC-CCCeEcchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEecCcceEEee--cCccccccccceeeEEee Confidence 55543 3457888999999999999999999999999876322 2333323444443322 223334444567778888 Q ss_pred EEEeeeeeecCHHHHHHH-hCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhhhhhccchhHHHHHHHhchh Q lcl|Aclame:pro 100 CEKTDYDTAIPYRKLDMW-AKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQ 178 (337) Q Consensus 100 c~qtn~d~~i~y~~LD~W-A~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anPllqDVN~GWlq~~Re~a~~ 178 (337) .++.---+.++-+.|-+= ...++|.+.+.+.++++++.-.-.-.+||+.-+..++-. | +|+. T Consensus 77 ~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~--~------~~~~--------- 139 (303) T protein:vir:97 77 PIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASD--V------IGTN--------- 139 (303) T ss_pred eEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccc--c------cccc--------- Confidence 888887788888877322 335789999999999999988888888986433322211 1 1110 Q ss_pred hhccccccccCceeecCCcc-cccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHHHHHhccCChHHHHHHHHHH Q lcl|Aclame:pro 179 RVLHEGAKQAGKVLVGKAGD-YENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNATQAPTERLAADLIV 257 (337) Q Consensus 179 ~v~~~~~~~~~~i~~g~ggd-y~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~~l~n~~~~ptE~~A~~~~~ 257 (337) ... +. ....+..+.+.+ |.++.+++ ..+.+..+..+ .++|.+.....- ..+-.....|--....+.-. 
T Consensus 140 -~~~-~~-~~~~~~~~~~~~~~~~i~~~~----~~~~~~~~~~~---~~vmn~~~~~~L-~~lkd~~g~~~~~~~~~~~~ 208 (303) T protein:vir:97 140 -HFD-SK-VTQVVKFTESEDADANIEAAV----NLIQGAEGVVT---GLAMDTEFSTAL-AKVTNGEMGPKMYPELAWGA 208 (303) T ss_pred -ccc-cc-cccccccccccchHHHHHHHH----HHHhhcCCCcc---EEEEcHHHHHHH-HHhhccCCCeEEecCccCCC Confidence 000 00 011111222222 44444443 32222222222 588888777632 22322222221100001111 Q ss_pred hhhhhcCccccccCccCCCc--------eEEecchhcEEEEecCceEEEEEEcc----------cccceeceeeeeeeee Q lcl|Aclame:pro 258 SQKRIGNLPAVRVPFFPKRA--------LMVTKLSNLSIYYQEGARRRTLKEVP----------ERDRIENYESSNDAYV 319 (337) Q Consensus 258 ~~k~iGGlpa~~vPffP~~~--------iliT~l~NLsiY~Q~gs~RR~~~d~p----------~r~rve~y~s~Ne~Yv 319 (337) ...+|-|+|++.-.++|... +++=.+++.-.|..++..+-.+-+.- .+|.+.---..--++. T Consensus 209 ~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~ 288 (303) T protein:vir:97 209 NPDSINGLKSSVNTTVGAGADEAESKDLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWG 288 (303) T ss_pred CCceecceeeEEecccCCccccCCCccEEEEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEeccE Confidence 22378899999999998653 45555666555555555444443321 2333333323334678 Q ss_pred eeccccEEEeeccee Q lcl|Aclame:pro 320 VEDFGCGCVAENIEL 334 (337) Q Consensus 320 VEd~~~~a~ieni~~ 334 (337) |-+.++++.+.+.++ T Consensus 289 v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 289 ILDAKSFARVTKGEV 303 (303) T ss_pred eecccceEEeeCCCC Confidence 888899999988888 No 71 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=97.93 E-value=1.1e-05 Score=47.86 Aligned_cols=285 Identities=11% Similarity=0.121 Sum_probs=155.9 Q ss_pred CChHHHHHHHHHHHHHHHh--------hCc--------hhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhce Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKL--------NDT--------GDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGE 64 (337) Q Consensus 1 
M~~~tr~~~~~y~~~~a~~--------ngv--------~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge 64 (337) +....+.....|....... +.+ ....-.+.|-+.+...+.+.+.+.+.+++.+++++++...|. T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~ 161 (408) T protein:vir:74 82 LNKSENELKDKFVKDFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSSGS 161 (408) T ss_pred ccchhhhhHHHHHHHHHHHHhcchhhhhhhhhhhhcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeeccCCcce Confidence 2222222222222222111 111 122235677778888999999999999999999999988876 Q ss_pred eeecc--cccccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHh Q lcl|Aclame:pro 65 KLGLS--VSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMI 142 (337) Q Consensus 65 ~v~lg--v~g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~I 142 (337) ..... -.++.+..+..+. .....+...++...+.+++.---+.|+.+.|+. ...+|+..+.+.+.+.++.=.-.- T Consensus 162 ~~~~~~~~~~~~~~~v~E~~-~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~l~~~~~~~~d~~ 238 (408) T protein:vir:74 162 RVYEKWTDVTPLKAMDEEDG-KIPDLDNPRLTIIKYLIKRYAGIITATNTLLKD--TAENILAWLSSWIAKKVVVTRNQA 238 (408) T ss_pred EEEEeecCCccccccccccc-ccccccccceeeEEeeeeeEEeeehhHHHHHhh--chHHHHHHHHHHHHHHHHHHHHHH Confidence 54332 2233333332221 121123345677777777777778899998875 235789999999988887644444 Q ss_pred cccccccCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCC Q lcl|Aclame:pro 143 GWNGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDT 222 (337) Q Consensus 143 GfnG~s~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~ 222 (337) -++|+.... | .+--.+.|.++. ++...+++.|+. 
T Consensus 239 il~G~G~~~-------~------------------------------------~~~~~~~~~i~~-~~~~~l~~~~~~-- 272 (408) T protein:vir:74 239 IIAAMGTVP-------K------------------------------------KPTIANFDDVIT-MINTSVDPAIIA-- 272 (408) T ss_pred Hhhcccccc-------c------------------------------------ccccccHHHHHH-HHHHhhhhhhcC-- Confidence 445532110 0 001124555554 344566888876 Q ss_pred CEEEEECHHHHHHHHHHHHhccCChHHHHHHHHH-HhhhhhcCccccccC--ccCCCc-----eEEecchhcEEEEecCc Q lcl|Aclame:pro 223 GLVVICGRELLHDKYFPIVNATQAPTERLAADLI-VSQKRIGNLPAVRVP--FFPKRA-----LMVTKLSNLSIYYQEGA 294 (337) Q Consensus 223 ~LVvivG~dLl~~k~~~l~n~~~~ptE~~A~~~~-~~~k~iGGlpa~~vP--ffP~~~-----iliT~l~NLsiY~Q~gs 294 (337) .-+++|.+..+..-. .|-...+.|- ...... ....+|-|+|++..+ ++|..+ +++=.++..-..+.++. T Consensus 273 ~a~~v~n~~~~~~l~-~lkd~~G~~l--~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~ 349 (408) T protein:vir:74 273 TSSLLTNQSGLNKLA-LVKTAEGKYL--LEPDPTKPNSYLIKGKQVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDREN 349 (408) T ss_pred CCEEEEcHHHHHHHH-HhhcCCCceE--eccCcCCCCCceecceeeEEecCcccccccCCcceEEEEehhccEEEEEecc Confidence 458899998765322 2211212211 000111 123588999999877 477543 66666776555555544 Q ss_pred eEEEEEEcc----cccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 295 RRRTLKEVP----ERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 295 ~RR~~~d~p----~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) .+-.+-+.. 
.++.+--.-..--++.|-+..+++.++--.+..+ T Consensus 350 ~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~ 396 (408) T protein:vir:74 350 MSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFTAIADQ 396 (408) T ss_pred eEEEEeccccchhhcceeeEEEEEeeCcEEecccceEEEEeecccCC Confidence 443332211 2233322222333566777777777764444444 No 72 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=97.90 E-value=6.7e-06 Score=48.96 Aligned_cols=292 Identities=14% Similarity=0.057 Sum_probs=165.4 Q ss_pred CChHHHHHHHHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCC Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t 80 (337) |+.-+. |+.-...++ ...+......|-|.+.+.+.+.+++.+.+++.++++++.-.... +-.-.+++-+.-.. T Consensus 1 ~~~~~~--~~~e~~~~~---~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~-ip~~~~~~~a~~v~- 73 (318) T protein:vir:24 1 MAAGTA--FAVDHAQIA---QTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQK-IPHWVGDVSAQWIG- 73 (318) T ss_pred CCCCCC--CCHHHHHhh---cccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCCceE-EEEEeCCcceEEec- Confidence 444322 222112222 12233444578888999999999999999999999988643322 22222333333222 Q ss_pred CCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhhh Q lcl|Aclame:pro 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) Q Consensus 81 ~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anPl 160 (337) .+...|..-..++...+.+++.---+.|+.+.|+. ..++|+..+++.+.++++.-.-.--+||+-....+. . 
T Consensus 74 -Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~e~l~d--s~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~-----~ 145 (318) T protein:vir:24 74 -EGDMKPITKGNMTSQTIAPHKIATIFVASAETVRA--NPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTY-----I 145 (318) T ss_pred -CCccccccccceeEEEEeeEEEEEeehhhHHHhhc--ChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCcc-----c Confidence 22333444456888899999988888888888874 336899999999999999876666678864222110 0 Q ss_pred hhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHHHH Q lcl|Aclame:pro 161 LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPI 240 (337) Q Consensus 161 lqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~~l 240 (337) +. ... .... .+..+.=...|..+.+++.. +.+-++. ..+++|.+.....- ..+ T Consensus 146 ~~--------------------~~~--~~~~-~~~~~~~~~~~~~~~~~~~~-~~~~~~~--~~~~v~n~~~~~~L-~~l 198 (318) T protein:vir:24 146 GQ--------------------TTK--AISI-ADTTGATTVYDQVAVNGLSL-LVNDGKK--WTHTLLDDITEPIL-NGA 198 (318) T ss_pred cc--------------------ccc--cccc-cccccccchHHHHHHHHHHh-hccccCC--CCEEEEcHHHHHHH-HHh Confidence 00 000 0000 01111113344445555543 3444433 35889999887632 233 Q ss_pred HhccCC------hHHHHHHHHHHhhhhhcCccccccCccCCCce--EEecchhcEEEEecCceEEEEEEcc--------- Q lcl|Aclame:pro 241 VNATQA------PTERLAADLIVSQKRIGNLPAVRVPFFPKRAL--MVTKLSNLSIYYQEGARRRTLKEVP--------- 303 (337) Q Consensus 241 ~n~~~~------ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~i--liT~l~NLsiY~Q~gs~RR~~~d~p--------- 303 (337) -..... ++.--.. .....++-|+|++..|..|++.. ++-.++.+- |...+..+-.+-++. 
T Consensus 199 kd~~G~~l~~~~~~~~~~~--~~~~~~i~g~pv~~~~~~~~~~~~~~~gdfs~~~-~~~~~~l~i~~~~~~~~~~~~~~~ 275 (318) T protein:vir:24 199 KDQNGRPLFIESTYGEAAS--PFRSGRIVARPTILSDHVVEGTTVGFMGDFSQLI-WGQIGGLSFDVTDQATLNLGTVES 275 (318) T ss_pred hccCCceeecCccccCccc--cccCceEEEEeeEEeCCCCCCccEEEEeecceEE-EEEecCeEEEEeeccceecccccc Confidence 222111 1111111 11235788999999999998875 455677653 333333333222221 Q ss_pred -------cccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 304 -------ERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 304 -------~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) .+|++.----.--++.|.+.++++.|.++.-+-+ T Consensus 276 ~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~a~~~ 316 (318) T protein:vir:24 276 PNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVALTNVVSGGG 316 (318) T ss_pred ccchhhhhcCcEEEEEEEEEccEEecccceEEEEeeccCCC Confidence 2333332223344788899999999888887777 No 73 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=97.85 E-value=1.3e-05 Score=47.42 Aligned_cols=275 Identities=12% Similarity=0.118 Sum_probs=144.4 Q ss_pred CChHHHHHHHHHHHHHH------------------HhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhh Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIA------------------KLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELE 62 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a------------------~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~ 62 (337) ....+...++++...+- ...+.....-.+.|-+.....+.+.+.+.+.+++.+++++++... 
T Consensus 87 ~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 166 (397) T protein:vir:12 87 NEERQQQYSKAFLKGLRGKRLTDEERDLLDSPEFRAMSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVEPVTTRS 166 (397) T ss_pred hhHHHHHHHHHHHHHHhccCCcHHHHHHHhhhhhhhccccccccCcccCchhHHHHHHHhhhhhhhHHhhcceeeccCCc Confidence 22222222222222111 011111222345666677788999999999999999999999888 Q ss_pred ceeee-cccccccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHH Q lcl|Aclame:pro 63 GEKLG-LSVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIM 141 (337) Q Consensus 63 Ge~v~-lgv~g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~ 141 (337) |+... ...+++.+.-...+. .....+...++...+.+++.---+.|+.+.|+... .+|++.+.+.++++++.-.-. T Consensus 167 ~~~~~~~~~~~~~a~~v~Eg~-~~~~~~~~~~~~v~~~~~k~~~~~~is~e~l~ds~--~~l~~~i~~~l~~~~~~~~d~ 243 (397) T protein:vir:12 167 GTRLLEKNADMVPFSPVEELG-NLPEIDQPRFTKVSYSIIDYGGIMTLSNSMLNDSD--QAIMTYVAKWFAKKSVVTRNN 243 (397) T ss_pred eeEEEEEecCCcceeeecccc-cccccccccceeEEeeheeeEeeehhhHHHHhhch--HHHHHHHHHHHHHHHHHHHHH Confidence 87643 333333333222221 11112334566777777777777888988886433 578899999999988876555 Q ss_pred hcccccccCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCC Q lcl|Aclame:pro 142 IGWNGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQED 221 (337) Q Consensus 142 IGfnG~s~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~ 221 (337) --++|+.... | . | -.+.|.++. ++...+++.++. 
T Consensus 244 ~il~G~g~~~-------~------------------~----------g---------~~~~~~i~~-~~~~~l~~~~~~- 277 (397) T protein:vir:12 244 LILAAIASLK-------K------------------V----------D---------IDGLDGIKK-ALNVTLDPMVAP- 277 (397) T ss_pred HHHhcccccc-------c------------------c----------c---------cccHHHHHH-HHhhccchhhhC- Confidence 5666643211 1 0 0 023455443 454456888775 Q ss_pred CCEEEEECHHHHHHHHHHHHhccCChHHHHHHHH-HHhhhhhcCccccccCcc-CCCc-----eEEecchhcE-EEEecC Q lcl|Aclame:pro 222 TGLVVICGRELLHDKYFPIVNATQAPTERLAADL-IVSQKRIGNLPAVRVPFF-PKRA-----LMVTKLSNLS-IYYQEG 293 (337) Q Consensus 222 ~~LVvivG~dLl~~k~~~l~n~~~~ptE~~A~~~-~~~~k~iGGlpa~~vPff-P~~~-----iliT~l~NLs-iY~Q~g 293 (337) ..+++|.+.....- ..|-+..+.|- ....+ -....++-|+|++..+.+ |+.+ +++-.+++.- ++...+ T Consensus 278 -~a~~~~n~~~~~~L-~~lkd~~G~~l--~~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~ 353 (397) T protein:vir:12 278 -GSIVLTNQDGYDWL-DTLKDGTGRYL--LQPDPTNPTKKLLDGRPVVPFTNRVLKTQKGKAPLIIGNLKEAIVLFDREQ 353 (397) T ss_pred -CCEEEEcHHHHHHH-HHhhccCCcee--ecccccCCCCccccceeeEEecccccccCCCccEEEEEehhceEEEEeecc Confidence 46889999886532 22322222210 00000 113458889999877654 4332 7888888754 333333 Q ss_pred ceEEEEEEcccccceeceeeeeeeeeee--------ccccEEEee-cce Q lcl|Aclame:pro 294 ARRRTLKEVPERDRIENYESSNDAYVVE--------DFGCGCVAE-NIE 333 (337) Q Consensus 294 s~RR~~~d~p~r~rve~y~s~Ne~YvVE--------d~~~~a~ie-ni~ 333 (337) ..-.+.+.+.. .|..-..+|.++ +...++.+. 
-++ T Consensus 354 -~~i~~~~~~~~----~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 354 -QSIASTDTGAG----AFETNSTKVRGIEREDVRKWDEDAVVFGQITVE 397 (397) T ss_pred -eEEEEeccccc----hhhcCceEEEEEEeeccEEecccceEEEEEeeC Confidence 22222222221 111112234333 333333332 112 No 74 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=97.82 E-value=1.7e-05 Score=46.74 Aligned_cols=280 Identities=9% Similarity=-0.010 Sum_probs=139.0 Q ss_pred CChHHHHHHHHH---HHH-------HHHhhC--chhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeec Q lcl|Aclame:pro 1 MRKETRQAYEKY---AAQ-------IAKLND--TGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGL 68 (337) Q Consensus 1 M~~~tr~~~~~y---~~~-------~a~~ng--v~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~l 68 (337) ..+.....+... ..+ .....| .......+.+-+.....+...+.+.+.+++.++++++.--....... T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 156 (379) T protein:vir:10 77 KSDSLVKSITENFNDIKEVRNGKSIQVKAVGDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSISGGTYTFVRE 156 (379) T ss_pred cchhHHHHHHHHHHhHHHHHhhhhhhhhhhcccccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeeccCCceEEEEe Confidence 000000111000 000 001111 11112233455667778888888899999999998886544333321 Q ss_pred -ccccccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhh--hhHHhccc Q lcl|Aclame:pro 69 -SVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGAL--DRIMIGWN 145 (337) Q Consensus 69 -gv~g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aL--D~i~IGfn 145 (337) |.++.- -.-.+.+...|..-..++...|..++.---+.|+-+.|+.. |.++..+++.+.+.++. |.-.+|-. 
T Consensus 157 ~~~~~~~--~~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~D~---~~l~~~i~~~la~~~~~~~~~~~~~g~ 231 (379) T protein:vir:10 157 NGAGEGA--IGAQVEGATKGQKDYDISMIDVNTDFIAGFTRYSKKMANNL---PFLTSFIPNALRRDYAKAENAAFNAVL 231 (379) T ss_pred ecCCCcc--cccccCCccccccccceeeeEeeeeeEEeeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHHhccc Confidence 222211 11222233444444456677777777766678888888764 66888888888876653 44444433 Q ss_pred ccccCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEE Q lcl|Aclame:pro 146 GVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLV 225 (337) Q Consensus 146 G~s~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LV 225 (337) |+.. + .+. -....+..+|.++. ++..+.+..++.. + T Consensus 232 ~~~~---------------------------~----------~~~---~~~~~~~~~d~i~~-~~~~~~~~~~~~~---~ 267 (379) T protein:vir:10 232 AANA---------------------------T----------AST---EIITNKNKVEMLIN-EIAKQENLDFPVT---A 267 (379) T ss_pred cccc---------------------------c----------ccc---ccccCcccHHHHHH-HHHhhhhccCCCC---E Confidence 3210 0 000 01112334666554 4555544544443 6 Q ss_pred EEECHHHHHHHHHHHHh-ccCChHHH--HHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEc Q lcl|Aclame:pro 226 VICGRELLHDKYFPIVN-ATQAPTER--LAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEV 302 (337) Q Consensus 226 vivG~dLl~~k~~~l~n-~~~~ptE~--~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~ 302 (337) ++|.+.-+.. ...+. ..+.|=-. ..++ -....++-|+|++.-|.+|++.+++=.++...+-+.+|..-....+. 
T Consensus 268 ~vmn~~~~~~--l~~lkd~~G~~l~~~~~~~~-~~~~~~l~G~pvv~s~~~~ag~~~~gdf~~~~~~~~~~~~i~~~~~~ 344 (379) T protein:vir:10 268 IVLRPTDYYD--ILVTQKSVGAGYGLPGVVTQ-DNGVLRINGIPLFRATWLAANKYYVGDWTRVTKVTTEGLSLEFSEVE 344 (379) T ss_pred EEEcHHHHHH--HHHhhccCCceeccCCccCC-CCCcceecceeeEecCCCCCCceEEeecccEEEEEEeceEEEEeecc Confidence 8888865542 22222 11111000 0000 01234788999999999999999988888755544444322211111 Q ss_pred ---ccccceeceeeeeeeeeeeccccEEEee--cc Q lcl|Aclame:pro 303 ---PERDRIENYESSNDAYVVEDFGCGCVAE--NI 332 (337) Q Consensus 303 ---p~r~rve~y~s~Ne~YvVEd~~~~a~ie--ni 332 (337) -.+|.+.-.--.=-+..|=|++.++.++ .| T Consensus 345 ~~~f~~~~~~~r~~~R~~~~v~~p~a~v~~~~~~~ 379 (379) T protein:vir:10 345 GTNFVKNNITARIEAQVALAVEQPAALIFGDFTAV 379 (379) T ss_pred cccccCCcEEEEEEEEeccEEecCccEEEEEecCC Confidence 2333333222222345556666666654 44 No 75 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=97.81 E-value=1.7e-05 Score=46.68 Aligned_cols=294 Identities=11% Similarity=0.021 Sum_probs=146.3 Q ss_pred CChHHHHHHHHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCC Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t 80 (337) ++.+-|+.|+++.. +. 
+....|-|-+...+++.+.+.+.|.+++.++++++.- +.++-...+++.|+=..- T Consensus 65 lt~~e~~~~~~~~~------~~-~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~--~~~i~~~~~~~~a~w~~e 135 (381) T protein:vir:95 65 LSANQRSFFMDINK------NV-NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWGKI 135 (381) T ss_pred ccHHHHHHHHHHhc------cc-CCCCceecCHHHHHHHHHHHHhhccceeheeeEecCc--ceEEEEecCCcceeeecc Confidence 55566655554322 12 2233578999999999999999999999999988752 234444444444433221 Q ss_pred CCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhhh Q lcl|Aclame:pro 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) Q Consensus 81 ~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anPl 160 (337) . .++....-..++...+.+++.---..|+.+.|++ .-.+++..+++.+.+++|.=.-.-=.||+-. . .| T Consensus 136 ~-~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~D--s~~~ie~~i~~~la~~~a~~~~~a~i~G~G~---~----qP- 204 (381) T protein:vir:95 136 Y-GEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDF--GPAWIERFVRVQIEEAFAVALETAFLKGTGK---D----QP- 204 (381) T ss_pred c-ccccccccccceeeeecceeEEeechhhHHHhhc--CHHHHHHHHHHHHHHHHHHHhhheeEeccCC---C----Cc- Confidence 1 2232222234666677777777778899999987 2347889999999999887554445566431 1 23 Q ss_pred hhccchhHHHHHHHhchhhhcccccc----ccCceeecC-CcccccHHHHHHHHHhcccChhHc-----CCCCEEEEECH Q lcl|Aclame:pro 161 LQDVNIGWLQQYRERAAQRVLHEGAK----QAGKVLVGK-AGDYENLDALVMDIVSSMIDPWFQ-----EDTGLVVICGR 230 (337) Q Consensus 161 lqDVN~GWlq~~Re~a~~~v~~~~~~----~~~~i~~g~-ggdy~nLDaLv~d~~~~lid~~~r-----~~~~LVvivG~ 230 (337) +|+|..+-. ....+.+.. ..+.++.-. ..-|..|.+++..+ ..|+. 
-....+++|.+ T Consensus 205 -----~Gil~~~~~---~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~-----~~~~~~~~~~~~~~a~~~mn~ 271 (381) T protein:vir:95 205 -----IGLNRQVQK---GVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYH-----STNEKGKSVAVKGNVTMVVNP 271 (381) T ss_pred -----eeeeeccCc---ccccccccccccccccccccccchhhHHHHHHHHHhh-----ccccccccccccCceEEEEcc Confidence 344432111 011111110 011111100 11123333333332 23322 23467889998 Q ss_pred HHHHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcc--cccce Q lcl|Aclame:pro 231 ELLHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVP--ERDRI 308 (337) Q Consensus 231 dLl~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p--~r~rv 308 (337) ..... ..++....+.. ++-+ ...--|.+++.-+++|++.+++-.+++--|.-..|- +-..-++. .+|++ T Consensus 272 ~t~~~-l~~~~~~~~~~-----G~~v--~~l~~g~~vv~s~~~p~~~iifgDfs~Y~i~~r~~~-~i~~~~~~~~~~d~~ 342 (381) T protein:vir:95 272 SDAFE-VQAQYTHLNAN-----GVYV--TALPFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGI-NVQKFKETLALDDMD 342 (381) T ss_pred ccHHh-hccccccCCCC-----Ccee--ecCCCCceEEecCCCCcCcEEEEecccEEEEEeccc-EEEeechhHhhcCCe Confidence 65542 12221110100 0100 001126668888999999999988888555443332 22111110 01111 Q ss_pred eceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 309 ENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 309 e~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) .-.-..--+-.+=|.+.+..++ |++.++ T Consensus 343 ~f~a~~r~dg~~~~~~A~~v~~-l~~~~~ 370 (381) T protein:vir:95 343 LYTAKQFAYGKAKDNKVAAVWK-LDLKGH 370 (381) T ss_pred EEEEEEEEcCEEecCceEEEEE-EEecCC Confidence 1111111111112223333333 555555 No 76 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=97.81 E-value=1.7e-05 Score=46.68 Aligned_cols=294 Identities=11% Similarity=0.021 Sum_probs=146.3 Q ss_pred CChHHHHHHHHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCC Q lcl|Aclame:pro 1 
MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t 80 (337) ++.+-|+.|+++.. +. +....|-|-+...+++.+.+.+.|.+++.++++++.- +.++-...+++.|+=..- T Consensus 65 lt~~e~~~~~~~~~------~~-~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~--~~~i~~~~~~~~a~w~~e 135 (381) T protein:vir:10 65 LSANQRSFFMDINK------NV-NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWGKI 135 (381) T ss_pred ccHHHHHHHHHHhc------cc-CCCCceecCHHHHHHHHHHHHhhccceeheeeEecCc--ceEEEEecCCcceeeecc Confidence 55566655554322 12 2233578999999999999999999999999988752 234444444444433221 Q ss_pred CCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhhh Q lcl|Aclame:pro 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) Q Consensus 81 ~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anPl 160 (337) . .++....-..++...+.+++.---..|+.+.|++ .-.+++..+++.+.+++|.=.-.-=.||+-. . .| T Consensus 136 ~-~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~D--s~~~ie~~i~~~la~~~a~~~~~a~i~G~G~---~----qP- 204 (381) T protein:vir:10 136 Y-GEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDF--GPAWIERFVRVQIEEAFAVALETAFLKGTGK---D----QP- 204 (381) T ss_pred c-ccccccccccceeeeecceeEEeechhhHHHhhc--CHHHHHHHHHHHHHHHHHHHhhheeEeccCC---C----Cc- Confidence 1 2232222234666677777777778899999987 2347889999999999887554445566431 1 23 Q ss_pred hhccchhHHHHHHHhchhhhcccccc----ccCceeecC-CcccccHHHHHHHHHhcccChhHc-----CCCCEEEEECH Q lcl|Aclame:pro 161 LQDVNIGWLQQYRERAAQRVLHEGAK----QAGKVLVGK-AGDYENLDALVMDIVSSMIDPWFQ-----EDTGLVVICGR 230 (337) Q Consensus 161 lqDVN~GWlq~~Re~a~~~v~~~~~~----~~~~i~~g~-ggdy~nLDaLv~d~~~~lid~~~r-----~~~~LVvivG~ 230 (337) +|+|..+-. ....+.+.. ..+.++.-. ..-|..|.+++..+ ..|+. 
-....+++|.+ T Consensus 205 -----~Gil~~~~~---~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~-----~~~~~~~~~~~~~~a~~~mn~ 271 (381) T protein:vir:10 205 -----IGLNRQVQK---GVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYH-----STNEKGKSVAVKGNVTMVVNP 271 (381) T ss_pred -----eeeeeccCc---ccccccccccccccccccccccchhhHHHHHHHHHhh-----ccccccccccccCceEEEEcc Confidence 344432111 011111110 011111100 11123333333332 23322 23467889998 Q ss_pred HHHHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcc--cccce Q lcl|Aclame:pro 231 ELLHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVP--ERDRI 308 (337) Q Consensus 231 dLl~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p--~r~rv 308 (337) ..... ..++....+.. ++-+ ...--|.+++.-+++|++.+++-.+++--|.-..|- +-..-++. .+|++ T Consensus 272 ~t~~~-l~~~~~~~~~~-----G~~v--~~l~~g~~vv~s~~~p~~~iifgDfs~Y~i~~r~~~-~i~~~~~~~~~~d~~ 342 (381) T protein:vir:10 272 SDAFE-VQAQYTHLNAN-----GVYV--TALPFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGI-NVQKFKETLALDDMD 342 (381) T ss_pred ccHHh-hccccccCCCC-----Ccee--ecCCCCceEEecCCCCcCcEEEEecccEEEEEeccc-EEEeechhHhhcCCe Confidence 65542 12221110100 0100 001126668888999999999988888555443332 22111110 01111 Q ss_pred eceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 309 ENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 309 e~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) .-.-..--+-.+=|.+.+..++ |++.++ T Consensus 343 ~f~a~~r~dg~~~~~~A~~v~~-l~~~~~ 370 (381) T protein:vir:10 343 LYTAKQFAYGKAKDNKVAAVWK-LDLKGH 370 (381) T ss_pred EEEEEEEEcCEEecCceEEEEE-EEecCC Confidence 1111111111112223333333 555555 No 77 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=97.80 E-value=1.8e-05 Score=46.53 Aligned_cols=293 Identities=13% Similarity=0.032 Sum_probs=152.3 Q ss_pred 
CC-----------------hHHHHHHHHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhc Q lcl|Aclame:pro 1 MR-----------------KETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEG 63 (337) Q Consensus 1 M~-----------------~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~G 63 (337) +. ...+.-+.... ..+. .+.......+.|-+...+.+...+.+.+.+++.+++++++-..+ T Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~-~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 162 (413) T protein:vir:81 85 AGDQIKQQAGGAQLNYSVGEYVAPRVKAAS-DPAS-TATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNLTMTNTTI 162 (413) T ss_pred hhhHHHHHHHHHHhhhhhhhhhhhHHHhhh-hhhh-hcccccccccccchhhHHHHHHHHhhhhhHHhhcceeeccCCce Confidence 00 00000000000 0011 11112234456777888999999999999999999998876554 Q ss_pred eeeecc---cccccccccCCCCcccccc-cccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhh Q lcl|Aclame:pro 64 EKLGLS---VSGPIASRTDTTKAARQPI-DPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDR 139 (337) Q Consensus 64 e~v~lg---v~g~ia~Rt~t~~~~R~p~-~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~ 139 (337) ...... +...-++-.. .+...|. +...++...+..++.=-.+.|+.+.|++. +.|...++..++++++.=. T Consensus 163 ~~~~~~~~~~~~~~a~~v~--Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds---~~l~~~i~~~la~~~~~~~ 237 (413) T protein:vir:81 163 KYLMEKANRVVEGGFKTVA--EGGKKPYMRFADFDIVTESLSKIAGLTKITDEMIEDY---DFLVSYINARLLEELAIEE 237 (413) T ss_pred eEEEeccccccccccceec--CcccccccCcccceeeEeeeeeEEEeehhhHHHHHHH---HHHHHHHHHHHHHHHHHHH Confidence 432211 1111111111 1122232 33446666777777666778999999875 5699999999998888755 Q ss_pred HHhcccccccCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcc-cChhH Q lcl|Aclame:pro 140 IMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSM-IDPWF 218 (337) Q Consensus 140 i~IGfnG~s~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~l-id~~~ 218 (337) -.--+||+-. . +| -+|++... 
....+..+.+.++ .| .+.+++..+ ++.-+ T Consensus 238 d~~~l~G~G~---~----~~-----~~Gi~~~~--------------~~~~~~~~~~~~~--~~-~i~~~~~~~~~~~~~ 288 (413) T protein:vir:81 238 ERQLLLGDGT---G----NN-----LTGLLKRD--------------GIQTLAVSNKDEL--AD-SIYKAMTNISLATPF 288 (413) T ss_pred HHHHhccCCC---C----Cc-----cccccccc--------------ccccccccccchh--HH-HHHHHHHHhhhhccC Confidence 5555677421 1 11 12444310 0111222222221 22 233333222 22223 Q ss_pred cCCCCEEEEECHHHHHHHHHHHHhccCChHH------HHHHHHHHhhhhhcCccccccCccCCCceEEecchh-cEEEEe Q lcl|Aclame:pro 219 QEDTGLVVICGRELLHDKYFPIVNATQAPTE------RLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSN-LSIYYQ 291 (337) Q Consensus 219 r~~~~LVvivG~dLl~~k~~~l~n~~~~ptE------~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~N-LsiY~Q 291 (337) +. . .++|.+..+.. -..|-.....|-= ..+.-......++-|+|++..+++|++.+++-.+++ +-++.. T Consensus 289 ~~--~-~~vmn~~~~~~-l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~l~G~pv~~s~~~~~~~~~~gd~~~~~~~~~~ 364 (413) T protein:vir:81 289 QA--D-ALVINPLDYQE-LRLAKDANGQYYGGGVFQGQYGSGGIMLDPAPWGLRTVQSQVVPVGKPVVGAFRSAASVLRK 364 (413) T ss_pred CC--c-EEEEcHHHHHH-HHHhhccCCceeccccccccccccccccCceecceeeEEcCCCCcccEEEEecccEEEEEEe Confidence 32 2 57788876653 2222222222110 000111123457889999999999999999999987 444444 Q ss_pred cCceEEEEEEc----ccccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 292 EGARRRTLKEV----PERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 292 ~gs~RR~~~d~----p~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) .|.. -.+-+. 
-.++.+.-.-..--+..|-+..+++.+ +++.| T Consensus 365 ~~~~-v~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l---~~~~~ 410 (413) T protein:vir:81 365 GGVR-IDSTNTNVDDFENNLITVRAEERVGLMVTFPEAIVQL---DVAEV 410 (413) T ss_pred cceE-EEEeccccchhhcCcEEEEEEEeeccEEecccceEEE---EecCC Confidence 4543 222222 135555544444456777777777765 46666 No 78 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=97.78 E-value=1.9e-05 Score=46.43 Aligned_cols=278 Identities=11% Similarity=0.051 Sum_probs=140.0 Q ss_pred CChHH-------HHHHHHHHHHHH--------HhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhcee Q lcl|Aclame:pro 1 MRKET-------RQAYEKYAAQIA--------KLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEK 65 (337) Q Consensus 1 M~~~t-------r~~~~~y~~~~a--------~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~ 65 (337) +.... +.....+....+ ...++....-.+.|-+.....+.+.+.+.+.+++.+++++|+...|.. T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 180 (400) T protein:vir:38 101 TRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNAGVKAADAASTIPETISNTPQRELQTVVDLKPFTNVFQASTQKGTY 180 (400) T ss_pred hHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhcccccCCcccccHHHHHHHHHHHHhhhhhhhcceeEeccCcceEE Confidence 00000 000111111111 111222223345566678899999999999999999999999887766 Q ss_pred eecccccccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhccc Q lcl|Aclame:pro 66 LGLSVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWN 145 (337) Q Consensus 66 v~lgv~g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfn 145 (337) ..+..+++.++-...+ +.........++...+..++.--=+.|+.+.|+. 
..++|+..+.+.+.++++.=.-.-.++ T Consensus 181 ~~~~~~~~~~~~~~E~-~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~d--s~~~~~~~i~~~l~~~~~~~~~~~i~~ 257 (400) T protein:vir:38 181 PTVANATTKMVTVAEL-EKNPAMAKPEFKPVNWSVETYRQALPVSQESIDD--SAIDLVGLIAQNGQQIKVNTTNGAVAT 257 (400) T ss_pred EEEecCCCcccccccc-ccccccccccceeeEeehhheeeehhhHHHHHhh--hHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 5544333333222211 2221122334555555555555556677777762 135788888888888876533333333 Q ss_pred ccccCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEE Q lcl|Aclame:pro 146 GVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLV 225 (337) Q Consensus 146 G~s~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LV 225 (337) |+... .+..-.+.|.+ .+++...+++.+. -+ T Consensus 258 ~~~~~--------------------------------------------~~~~~~~~~~~-~~~~~~~~~~~~~----a~ 288 (400) T protein:vir:38 258 LLKGF--------------------------------------------TAKTISSVDDL-KHINNVDLDPAYS----RV 288 (400) T ss_pred ccccc--------------------------------------------cccccccHHHH-HHHHHhhhhhhhC----cE Confidence 32211 00111224444 3455556666542 38 Q ss_pred EEECHHHHHHHHHHHHhccCChHHHHHHHH-HHhhhhhcCccccccCccCCCc-----eEEecchhcEEEEecCceEEEE Q lcl|Aclame:pro 226 VICGRELLHDKYFPIVNATQAPTERLAADL-IVSQKRIGNLPAVRVPFFPKRA-----LMVTKLSNLSIYYQEGARRRTL 299 (337) Q Consensus 226 vivG~dLl~~k~~~l~n~~~~ptE~~A~~~-~~~~k~iGGlpa~~vPffP~~~-----iliT~l~NLsiY~Q~gs~RR~~ 299 (337) ++|.+..+..- ..|-...+.|- ....+ -....++-|+|++..+.+|... +++=.|++..+.+-+....-+. 
T Consensus 289 ~v~~~~~~~~l-~~lkd~~G~~i--~~~~~~~~~~~~l~G~pv~~~~~~~~~~~g~~~~~~gd~s~~~~~~~~~~~~~~~ 365 (400) T protein:vir:38 289 IIASQSFYNFL-DTVKDGNGRYL--LQDSILTPSGKSVLGMPIAVVSDDTLGAAGEAHAFLGDIKRAILFANRADFMVRW 365 (400) T ss_pred EEEcHHHHHHH-HHhhccCCCee--eecCcCCCCccccccceeEEecccccCCCCceEEEEEeccccEEEEeecceEEEE Confidence 89998886641 12222211111 00000 0134578999999999998654 6777778765555443444444 Q ss_pred EEcccccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 300 KEVPERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 300 ~d~p~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) .++......--.+- --+..|-+...++. |++..+ T Consensus 366 ~~~~~~~~~~~~~~-r~d~~~~~~~a~~~---l~~~~~ 399 (400) T protein:vir:38 366 VDDQIYGQFLQAGM-RFGVSVADEKAGYF---LTYTPK 399 (400) T ss_pred ecccccceeEEEEE-EeccEEecccceEE---EEeecC Confidence 33332222111111 12333344444444 455555 No 79 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=97.77 E-value=1.8e-05 Score=46.65 Aligned_cols=285 Identities=11% Similarity=0.024 Sum_probs=149.1 Q ss_pred CChHHHHHHHHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCC Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t 80 (337) ++++.|..|++++.. 
..+....+-|-+.+..++.+.+.+.|.+++.++++++.- +.++-...+++-++=..- T Consensus 67 lt~ee~~~~~~~~~~------~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~--~~~i~~~~~~~~a~wv~e 138 (377) T protein:vir:96 67 LTAEEIKFFNDIDKN------VGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL--RLKALTAETSGTAVWGDI 138 (377) T ss_pred cCHHHHHHHHHHHhc------CCCCCCceecCHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEecCCcceeEeec Confidence 777777777665432 223344567877899999999999999999999988742 334444444444433221 Q ss_pred CCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhhh Q lcl|Aclame:pro 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) Q Consensus 81 ~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anPl 160 (337) ..++.+..-..++...+.+++.---..|+++.|+.=. .+++..+++.+.++++.=.-.--+||+-... T Consensus 139 -~~~~~~~~~~~f~~i~l~~~kl~~~~~is~~ll~ds~--~~le~~i~~~l~~~~~~~~~~a~i~G~G~~~--------- 206 (377) T protein:vir:96 139 -FGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGP--KWLKQFITEQLKEAIAVALELAIVKGNGLLQ--------- 206 (377) T ss_pred -ccccccccCccceeEeeeeeeEEeechhhHHHhhcch--hhHHHHHHHHHHHHHHHHHhhceEeccCCCc--------- Confidence 1233333334577788888888888899999997522 4688889999999988755555567743211 Q ss_pred hhccchhHHHHHHHhchhhhccccc--cccCceeecCCcccccHHH---HHHHHHhccc--C--hhHcCCCCEEEEECHH Q lcl|Aclame:pro 161 LQDVNIGWLQQYRERAAQRVLHEGA--KQAGKVLVGKAGDYENLDA---LVMDIVSSMI--D--PWFQEDTGLVVICGRE 231 (337) Q Consensus 161 lqDVN~GWlq~~Re~a~~~v~~~~~--~~~~~i~~g~ggdy~nLDa---Lv~d~~~~li--d--~~~r~~~~LVvivG~d 231 (337) -+|+|...........-..++ ....+...|+ ..+.+-|. +..+++..+- + -..+-.+..|++|-+. 
T Consensus 207 ----P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~ 281 (377) T protein:vir:96 207 ----PVGLLKDLSQPTVDQSTGRDITTYKTDKEAIAD-LSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPE 281 (377) T ss_pred ----ceeeeeccccccccccccccccceeeccccccc-cccCChhHHHHHHHHHHHhhccccccccccccCceEEEEchh Confidence 135544321110000000000 0011111111 11233333 3333332210 0 0112345789999986 Q ss_pred HHHHHH--HHHHhccCChHHHHHHHHHHhhhhhcCcc--ccccCccCCCceEEecchhcEEEEecCceEEEEEEcccccc Q lcl|Aclame:pro 232 LLHDKY--FPIVNATQAPTERLAADLIVSQKRIGNLP--AVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDR 307 (337) Q Consensus 232 Ll~~k~--~~l~n~~~~ptE~~A~~~~~~~k~iGGlp--a~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p~r~r 307 (337) ...+-+ ....++...|. ++.|+| .+.-+++|++.+++-.+++--|....| .|-. T Consensus 282 t~~~~~~~~~~~~~~G~~~------------~~l~~p~~v~~s~~~p~~~i~fgdf~~Y~i~~r~~-~~i~--------- 339 (377) T protein:vir:96 282 DRWTLEAKFTSRNQFGEYV------------TVLPHGITILESLAVETGKAIAFVANRYDAFMATA-STIE--------- 339 (377) T ss_pred hHHhccccccccCCCCCce------------eccCCCceEEecCCCCcccEEEEEcCcEEEEEecc-cEEE--------- Confidence 544321 11111222221 334554 667799999999998888844443332 2221 Q ss_pred eeceeeeeeeeeeeccccEEEee--ccee--ccC Q lcl|Aclame:pro 308 IENYESSNDAYVVEDFGCGCVAE--NIEL--AAA 337 (337) Q Consensus 308 ve~y~s~Ne~YvVEd~~~~a~ie--ni~~--~~a 337 (337) .+.+.|..+|.-.+-++. +-.. 
.+| T Consensus 340 -----~~~~~~~~~d~~~f~~~~r~dG~~~d~~a 368 (377) T protein:vir:96 340 -----EYDQTFAMEDLQLYLTKNYFYGKAKDNHT 368 (377) T ss_pred -----eehhhhhhcCCeEEEEEEEEcCEEecCCc Confidence 112334445444444433 1111 112 No 80 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=97.76 E-value=2.2e-05 Score=46.13 Aligned_cols=283 Identities=10% Similarity=0.015 Sum_probs=153.8 Q ss_pred HHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCCCCcccccccccccCC Q lcl|Aclame:pro 16 IAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDTTKAARQPIDPTALDS 95 (337) Q Consensus 16 ~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t~~~~R~p~~~~~l~~ 95 (337) ||. +..+ .-.-|-|++...+.+.+++.|.+++..+++++.--. ..+-.-.+++-|+=.. .+...|..-..++. T Consensus 1 ma~--~t~~--~G~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~-~~~p~~~~~~~a~wv~--Eg~~~~~s~~~f~~ 73 (300) T protein:vir:95 1 MSE--AQLS--KGNLFNPELVTKVINKVKGHSSIAKLSPQKPIPFNG-QREFVFDFDSDIDIVA--ENGKKTHGGVSLDP 73 (300) T ss_pred Ccc--cccC--CcceechhhHHHHHHHHHhhhhhhhhcceeeccCCc-eEEEEEecCcceEEee--CCccccccccccee Confidence 332 1111 223578899999999999999999988888766432 2233323444443332 23444555567888 Q ss_pred ceeEEEEeeeeeecCHHHHHHHh-CChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhhhhhccchhHHHHHHH Q lcl|Aclame:pro 96 NRYRCEKTDYDTAIPYRKLDMWA-KFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYRE 174 (337) Q Consensus 96 ~~Y~c~qtn~d~~i~y~~LD~WA-~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anPllqDVN~GWlq~~Re 174 (337) ..+.+++.--.+.|+.+.|-++. ..+++.+.+.+.+.+.++.=.-.-.|+|+-...-+.- +|. | . 
T Consensus 74 v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~--~~~------~----~-- 139 (300) T protein:vir:95 74 VTIVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQAS--TII------G----D-- 139 (300) T ss_pred eEeeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCc--ccc------c----c-- Confidence 99999999999999999997764 4689999999999999997777777788532221110 000 0 0 Q ss_pred hchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHHHHHhccCChHHHHHHH Q lcl|Aclame:pro 175 RAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNATQAPTERLAAD 254 (337) Q Consensus 175 ~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~~l~n~~~~ptE~~A~~ 254 (337) ... .+. ....+..+..-.|.+|. +++..+ +..+++ +. +++|.+..... -..|-.....|-=.. .. T Consensus 140 ----~~~-~~~-~~~~~~~~~~~~~~~i~----~~~~~~-~~~~~~-~~-~~vmn~~~~~~-L~~lkd~~G~~i~~~-~~ 204 (300) T protein:vir:95 140 ----NCF-DKK-VTQTVPFKDTNPDESME----DAVGMI-DGSERD-IT-GAILDPIFTTA-LSKMKNAEGGKLYPE-LA 204 (300) T ss_pred ----ccc-ccc-cceeecccccchHHHHH----HHHHHh-hhcCCC-cc-EEEECHHHHHH-HHHhhccCCCeeccC-cc Confidence 000 000 00000011111233333 344323 333333 23 68888877653 222222222221000 00 Q ss_pred HHHhhhhhcCccccccCccCCCc------eEEecchhcEEEEecCceEEEEEEccccc-ceeceeeee---------eee Q lcl|Aclame:pro 255 LIVSQKRIGNLPAVRVPFFPKRA------LMVTKLSNLSIYYQEGARRRTLKEVPERD-RIENYESSN---------DAY 318 (337) Q Consensus 255 ~~~~~k~iGGlpa~~vPffP~~~------iliT~l~NLsiY~Q~gs~RR~~~d~p~r~-rve~y~s~N---------e~Y 318 (337) .-....++-|+|++..+++|... 
+++--++++-.|--+....-++.+..+.+ .-.+|...| -++ T Consensus 205 ~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~GDf~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~ 284 (300) T protein:vir:95 205 WGGVPDAINGLAVDKNRTVSYSQTDPKNTAIVGDFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGW 284 (300) T ss_pred ccCCCceecceeeEEecCCCCCCCCCccEEEEeeccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecc Confidence 01134689999999999999776 56677776554433344444443332211 111222222 345 Q ss_pred eeeccccEEEeeccee Q lcl|Aclame:pro 319 VVEDFGCGCVAENIEL 334 (337) Q Consensus 319 vVEd~~~~a~ieni~~ 334 (337) .|.+..+++.+-++-= T Consensus 285 ~v~~~~a~~~l~~~~g 300 (300) T protein:vir:95 285 GIMDAASFARIVKTGG 300 (300) T ss_pred eeecccceEEEecCCC Confidence 6666666666542211 No 81 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=97.69 E-value=2.8e-05 Score=45.52 Aligned_cols=296 Identities=9% Similarity=0.041 Sum_probs=154.2 Q ss_pred CChH----HHHHHHHHHHHHHHhhC---------------------ch--hhcceEeechHHHHHHHHHHHhhHHHhcc- Q lcl|Aclame:pro 1 MRKE----TRQAYEKYAAQIAKLND---------------------TG--DVSKKFAVEPTVQQRLETKMQESSEFLKR- 52 (337) Q Consensus 1 M~~~----tr~~~~~y~~~~a~~ng---------------------v~--~~~~~Fsv~P~~~q~L~~~iqess~FL~~- 52 (337) .+++ .-..|..|...+|..-| +. ..+-.+.|-+.+...+.+.+.+.+.+.+. 
T Consensus 20 ~~~~~~~~kg~~~~~~~~a~a~~~g~~~~a~~~a~~~~~~~~~~~a~~~~~~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg 99 (366) T protein:vir:57 20 IKEELQQYKGAGMTRMVMSIAAGKGNLADAAKFAATELGDTGLSMAISTAAGSGGALIPQNMQNEVIELLRDRTVVRILG 99 (366) T ss_pred cccccccccchhHHHHHHHHHhcccchhHHHHHHHHhhcchhhhhhccccccCCccccchhHHHHHHHHHhhhcchhhhc Confidence 0000 00112222222221111 10 11123345557778899999988877665 Q ss_pred cceeccchhhcee-eecccccccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHH Q lcl|Aclame:pro 53 INVLPVTELEGEK-LGLSVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVI 131 (337) Q Consensus 53 Inv~~V~~~~Ge~-v~lgv~g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i 131 (337) .++++.. .|.. +-.-.+++-++-+. .....|..-..++...+..++.---+.|+-+.|+.- .++++..+++.+ T Consensus 100 ~~~v~~~--~g~~~~p~~t~~~~a~wv~--E~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds--~~~~~~~i~~~l 173 (366) T protein:vir:57 100 ARSIPLP--NGNLSMPRLSGGATAGYVG--EGKDVVATGATFDDVKLSAKTMIALVPVSNQLIGRA--GFNVEQLLLGDI 173 (366) T ss_pred eeeeecC--CCceEEEEEeCCcceeeec--cCccccccccceeEEEEeeEEEEEeehhhHHHHhhh--hHHHHHHHHHHH Confidence 5665543 3331 11112333333322 222233333557778888888888888898888743 268999999999 Q ss_pred HHHHhhhhHHhcccccccCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccC-ceeecCCcccccHHHHHHHHH Q lcl|Aclame:pro 132 LNQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAG-KVLVGKAGDYENLDALVMDIV 210 (337) Q Consensus 132 ~~~~aLD~i~IGfnG~s~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~-~i~~g~ggdy~nLDaLv~d~~ 210 (337) .++++.-.-.--++|.-.+. +|. |.+ ........ ....|.+.++..+|+++--+. 
T Consensus 174 ~~a~~~~~d~a~l~G~G~~~------~p~------Gi~------------~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~ 229 (366) T protein:vir:57 174 LSAIATREDKAFLRDDGTGD------TPK------GMK------------AVATAANRLVAWTGTAINLTTIDEYLDSLI 229 (366) T ss_pred HHHHHHHHHHHhhccCCCCc------ccc------cee------------eccccccceeeccccccchhhHHHHHHHHH Confidence 99999766666667743221 232 222 11111111 112356678888887654332 Q ss_pred hcccChhHcCCCCEEEEECHHHHHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCC--------ceEEec Q lcl|Aclame:pro 211 SSMIDPWFQEDTGLVVICGRELLHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKR--------ALMVTK 282 (337) Q Consensus 211 ~~lid~~~r~~~~LVvivG~dLl~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~--------~iliT~ 282 (337) . .....-......+++|.+.....- ..|-.....|-= -. ....++-|+|++..+++|++ .+++-. T Consensus 230 ~-~~~~~~~~~~~a~~vmn~~~~~~L-~~lkd~~G~~l~---~~--~~~g~l~G~Pvv~s~~ip~~~~~~~~~~~i~~gd 302 (366) T protein:vir:57 230 L-KHMDSNSNMIRCGWGLSNRTYMTL-FGLRDGNGNKVY---PE--MSQGILKGYPIQRTSAIPANLGDDGNESEIYFCD 302 (366) T ss_pred H-hhhccccccccCEEEecHHHHHHH-HhhhccCCceec---cC--CCCCeecceeeEEccccccccccCCCccEEEEEe Confidence 2 111111122356788998876532 122222222210 01 13457899999999999984 366677 Q ss_pred chhcEEEEecCceEEEEEEccc-------------ccceeceeeeeeeeeeeccccEEEeeccee Q lcl|Aclame:pro 283 LSNLSIYYQEGARRRTLKEVPE-------------RDRIENYESSNDAYVVEDFGCGCVAENIEL 334 (337) Q Consensus 283 l~NLsiY~Q~gs~RR~~~d~p~-------------r~rve~y~s~Ne~YvVEd~~~~a~ieni~~ 334 (337) ++++-|. ..+..+-.+-+++. 
+|.+.-=--.--++.|-+.+.++.+.+|.+ T Consensus 303 fs~~~i~-~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 303 FNDVVIG-EDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred cceEEEE-EecceEEEEeeccccccccccchhhhhcCceeEEeeeeeCcEeeccccEEEEecccC Confidence 7765433 33343333322221 111111111113567779999999999999 No 82 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=97.66 E-value=3.1e-05 Score=45.29 Aligned_cols=297 Identities=12% Similarity=0.034 Sum_probs=155.0 Q ss_pred HHHHHHHHHHHHHhhCchhhc-----ceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCC Q lcl|Aclame:pro 6 RQAYEKYAAQIAKLNDTGDVS-----KKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 6 r~~~~~y~~~~a~~ngv~~~~-----~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t 80 (337) -..++.+... .-|+.... ....|-+.+...+.+.+++.|.++++.+++++.- .+.++-.-.+++.++-... T Consensus 1 ~a~l~el~~~---~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~-~~~~~p~~~~~~~a~~v~e 76 (333) T protein:vir:78 1 MATLNELLPN---SAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISY-GETIIPTTVKRPEVGQVGV 76 (333) T ss_pred CchhHHhhhh---cccccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEeCCceeEeecC Confidence 1111111111 11221111 1124667788999999999999999999998763 2234434334444433322 Q ss_pred CC------cccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCC Q lcl|Aclame:pro 81 TK------AARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTD 154 (337) Q Consensus 81 ~~------~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td 154 (337) +. ....|..-..++......++.---..|+.+.|+. 
..++|+..+++.+.++++.-.---.+||+-....+- T Consensus 77 g~~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~--s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~ 154 (333) T protein:vir:78 77 GTSNEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARM--NPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSA 154 (333) T ss_pred cccccccccccccccccceeEEEEeeEEEEEeehhhHHHHhc--CHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCcc Confidence 21 2233444555666677777777778888888752 236899999999999999877777788876554332 Q ss_pred hhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHH Q lcl|Aclame:pro 155 RQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLH 234 (337) Q Consensus 155 ~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~ 234 (337) + . |.+. .... ...+.....+.+++ ..+|.++. ++..+.....++ .-+++|.+.... T Consensus 155 ~----~------g~~~-------~~~~---~~~~~~~~~~~~~~-~~~~~i~~-~~~~~~~~~~~~--~~~~vmn~~~~~ 210 (333) T protein:vir:78 155 L----Q------GIDT-------DNVI---ANTTNVDYLQETGD-PLLDRLLD-GYDLVSANTDVE--FNGWAVDPRFRA 210 (333) T ss_pred c----c------cccc-------cccc---cccccccccccccc-hhHHHHHH-HHHhhccccccC--ceEEEEcchHHH Confidence 1 1 1110 0000 00111222333333 33554433 343332222222 226777876654 Q ss_pred HH-HHH-HHhccCChHHHHHHHHHHhhhhhcCccccccCccCCC---------ceEEecchhcEEEEecCceEEEEEEcc Q lcl|Aclame:pro 235 DK-YFP-IVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKR---------ALMVTKLSNLSIYYQEGARRRTLKEVP 303 (337) Q Consensus 235 ~k-~~~-l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~---------~iliT~l~NLsiY~Q~gs~RR~~~d~p 303 (337) .- ... 
+-+....|- -..........++-|+|++..+++|++ .+++..+++.-|....+ .+-.+.++- T Consensus 211 ~L~~~~~~~d~~G~~i-~~~~~~~~~~~~l~G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~~~~g~~~~-~~i~~~~~~ 288 (333) T protein:vir:78 211 HLLRAQAYRDANGNVD-PSRINLAAQTGDVLGLPAQFGRAVGGDLGAAVDSKTRIIGGDFSQLKFGFADE-IRIKMSDTA 288 (333) T ss_pred HHHHHhhhcCCCCcee-ecCccccCCCceeeceeeEEccccCCCccccCCCccEEEEEecccEEEEEeec-cEEEEeccc Confidence 21 111 111111110 000000112357889999999999976 48888888866655444 222222211 Q ss_pred -------------cccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 304 -------------ERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 304 -------------~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) .+|.+.-.-..--++.|.|...++.+ +.++| T Consensus 289 ~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l---~~~~a 332 (333) T protein:vir:78 289 TLTDSGSATVSMWQTNQIAILIEVTFGWLLGDKQAFVKF---VDDEQ 332 (333) T ss_pred cccccccceeehhhcCcEEEEEEEEEccEEecccceEEE---eccCC Confidence 12222222223346777777777765 45555 No 83 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=97.66 E-value=3.2e-05 Score=45.26 Aligned_cols=291 Identities=13% Similarity=0.023 Sum_probs=146.9 Q ss_pred CChHHHHHHHHHHHHH-----HHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccc Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQI-----AKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIA 75 (337) Q Consensus 1 M~~~tr~~~~~y~~~~-----a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia 75 (337) -..+-|..|..|+... 
+...++....-.|.|-+.+.+.+.+.+.+.+.+.+..+++++.- +-++-+-..++.+ T Consensus 119 ~~~e~r~a~~~~l~~~~~~~e~~a~~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~~~~--~~~~p~~~~~~~a 196 (434) T protein:vir:62 119 KETEIRSVFANYIVGNIDEKEARALGLVTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGVKTKE--NIKYPVLVKKAEA 196 (434) T ss_pred HHHHHHHHHHHHhccccchhhhhhhcccccccceecchhhHHHHHHhhhhhhhhhhhcceeccCC--ceEEEEEecCCcc Confidence 1123355566665432 12222322334566767778899999999999999889887642 2122222222222 Q ss_pred cccC-CCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCC Q lcl|Aclame:pro 76 SRTD-TTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTD 154 (337) Q Consensus 76 ~Rt~-t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td 154 (337) +-.. .+.+...|..-..++...+..++.---+.|+.+.|+.- ..+|++.+++.++++++.-.-.--+||+-....+ T Consensus 197 ~~~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~- 273 (434) T protein:vir:62 197 QGHKNERTNNEMPETDIEFDEIELSPTEFDALATVTKKLLART--GLPIEQIVMDELKKAYVRKETQYMVNGDEANNIN- 273 (434) T ss_pred cceecccccccccccccceeeEEeeheeeEeehhhHHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccc- Confidence 2111 11222333333456666777777666778888888864 2589999999999999876666666775432211 Q ss_pred hhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHH Q lcl|Aclame:pro 155 RQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLH 234 (337) Q Consensus 155 ~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~ 234 (337) . | ++.. ..+.....+ -...|.++ +++..+ ++.|+.. -+++|.+..+. 
T Consensus 274 -----~------g------------~~~~-----~~~~~~~~~-~~~~d~l~-~l~~~l-~~~~~~~--a~~v~n~~~~~ 320 (434) T protein:vir:62 274 -----D------G------------ALAK-----KAVEFKTDE-KNLYDALV-KMKNTP-VKEVRKK--ARWVLNTAALT 320 (434) T ss_pred -----c------c------------eeec-----ccccccccc-cchhhHHH-HHHhhc-chhhhcC--CEEEEcHHHHH Confidence 1 1 1111 111111111 12345554 566654 6666653 47899998776 Q ss_pred HHHHHHHhccCChHHHHHHHHH-HhhhhhcCccccccCccCCCc------eEEecchhcEEEEecCceEEEEEEcccccc Q lcl|Aclame:pro 235 DKYFPIVNATQAPTERLAADLI-VSQKRIGNLPAVRVPFFPKRA------LMVTKLSNLSIYYQEGARRRTLKEVPERDR 307 (337) Q Consensus 235 ~k~~~l~n~~~~ptE~~A~~~~-~~~k~iGGlpa~~vPffP~~~------iliT~l~NLsiY~Q~gs~RR~~~d~p~r~r 307 (337) . -..|-...+.|-=....+.. ....+|-|+|++..+++|... +++=.|+..-|+-..|...-..-+ T Consensus 321 ~-L~~lkd~~G~~l~~~~~~~~~g~~~tl~G~pV~~~~~~~~~~~~~~~~i~~Gdfs~~~i~~~~g~~~i~~~~------ 393 (434) T protein:vir:62 321 K-IETMKTDDGFPLLRPFNQAEGGIGYTLLGFPVEEEDAIDIPDSPDTPVFYFGDFSKFYIQDVIGSLEVQKLV------ 393 (434) T ss_pred H-HHHhhccCCCEeeccCCCccCCCCceecceeeEEecCccCccCCCceEEEEeeccceEEEEeeceeEEEeeh------ Confidence 3 22232222222100000000 112368899999999999665 444445444343333433222111 Q ss_pred eeceeeee-eeeeeeccccEEEe--------ecceeccC Q lcl|Aclame:pro 308 IENYESSN-DAYVVEDFGCGCVA--------ENIELAAA 337 (337) Q Consensus 308 ve~y~s~N-e~YvVEd~~~~a~i--------eni~~~~a 337 (337) +.|...| -+|.++..--+-+| =.+++..| T Consensus 394 -~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~ 431 (434) T protein:vir:62 394 -ELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAP 431 (434) T ss_pred -hhhcccCceEEEEEeeecceeecCcccceEEEEEeccC Confidence 1122222 23444443322232 12333333 No 84 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=97.65 E-value=3.3e-05 Score=45.18 Aligned_cols=292 Identities=13% Similarity=0.061 Sum_probs=161.3 Q ss_pred CChHHHHHHH--HHHHH---HHHhh--Cch-hhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeeccccc 
Q lcl|Aclame:pro 1 MRKETRQAYE--KYAAQ---IAKLN--DTG-DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSG 72 (337) Q Consensus 1 M~~~tr~~~~--~y~~~---~a~~n--gv~-~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g 72 (337) |...-..+++ .|... ..+.+ ++- .......|-+.+...+.+.+++.|.+++...++++.--. -++-.-.++ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~ip~~~~~ 79 (324) T protein:vir:93 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTE-KKFTFWADK 79 (324) T ss_pred CchhHHHHHHHHHHHHhhhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCc-eEEEEEecC Confidence 7665554443 22221 11111 111 112334677788999999999999999999998876322 122222233 Q ss_pred ccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCc Q lcl|Aclame:pro 73 PIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAAT 152 (337) Q Consensus 73 ~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~ 152 (337) +-++=. +.+...|..-..++...+..++.---..|+.+.|++.. ++|...+++.+.++++.-.-.--++|.-.. T Consensus 80 ~~a~~v--~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~aia~~~d~a~l~G~g~~-- 153 (324) T protein:vir:93 80 PGAYWV--GEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNN-- 153 (324) T ss_pred cceeee--cCCccccccccceeEEEEEeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhcCCCCC-- Confidence 444322 23334454556788889999999888899999998754 789999999999988866555557774211 Q ss_pred CChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHH Q lcl|Aclame:pro 153 TDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) Q Consensus 153 Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dL 232 (337) .. | .-++..... ......+. .+.|. +.+++..+ ++.+++.. +++|.+.. 
T Consensus 154 ~~----~------------------~~~~~~~~~----~~~~~~~~-~~~~~-i~~~~~~l-~~~~~~~~--~~v~n~~~ 202 (324) T protein:vir:93 154 PF----G------------------KSIAQSIEK----TNKVIKGD-FTQDN-IIDLEALL-EDDELEAN--AFISKTQN 202 (324) T ss_pred Cc----C------------------ccccccccc----cceecccc-ccHHH-HHHHHHhh-hhccCCCC--EEEEcHHH Confidence 10 1 111111100 00111111 12333 33455544 44444433 68888887 Q ss_pred HHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCc--cCCCceEEecchhcEEEEecCceEEEEEEcc------- Q lcl|Aclame:pro 233 LHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPF--FPKRALMVTKLSNLSIYYQEGARRRTLKEVP------- 303 (337) Q Consensus 233 l~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPf--fP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p------- 303 (337) +..- ..+-+....|-- . -....++-|+|++..|. .+.+.+++-.++++- |...+..+-.+.++. T Consensus 203 ~~~L-~~l~d~~G~~~~--~---~~~~~~l~G~PVv~~~~~~~~~~~i~~gdfs~~~-~~~~~~~~i~~~~~~~~~~~~~ 275 (324) T protein:vir:93 203 RSLL-RKIVDPETKERI--Y---DRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLI-YGIPQLIEYKIDETAQLSTVKN 275 (324) T ss_pred HHHH-HHhhCCCCCeee--c---CCCCCcccceeeEeecCCCCCcceEEEEecceEE-EEEecCcEEEEeeccccccccc Confidence 6632 233333222210 0 01355788999988665 455568888888864 333443433333332 Q ss_pred ---------cccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 304 ---------ERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 304 ---------~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) ++|.+.-.--.--++.|-+.++++.+.+.+.+.. 
T Consensus 276 ~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a~~~~~ 318 (324) T protein:vir:93 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTD 318 (324) T ss_pred ccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccccCC Confidence 2222222222333788888888887765544442 No 85 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=97.64 E-value=2.9e-05 Score=45.49 Aligned_cols=287 Identities=10% Similarity=0.038 Sum_probs=151.4 Q ss_pred CChHHHHHHHHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCC Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t 80 (337) |--.... ..++. .+-... .-.+-|.+.+.+.+.+++.+.+++..+++++.-... ++-.-..++-+.-.. T Consensus 1 ~g~~~e~------~~~~~-~~t~~~--~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~-~ip~~~~~~~a~wv~- 69 (397) T protein:vir:23 1 MGFSADH------SQIAQ-TKDTMF--TGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGI-VIPHWTGDVSAQWIG- 69 (397) T ss_pred CCcCHHH------HHHhh-ccCCCC--ccccchhHHHHHHHHHHhccchhhhcceeeccCCce-EEEEEcCCcceEEec- Confidence 3222211 11111 111111 224788899999999999999999999888763222 222223344443332 Q ss_pred CCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhhh Q lcl|Aclame:pro 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) Q Consensus 81 ~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anPl 160 (337) .....|..-..++...|..++.---..|+.+.|+. ..++|+..+++.+.++++.-.-.--++|.-... |. 
T Consensus 70 -Eg~~~~~s~~~f~~v~l~~~k~~~~v~iS~ell~d--s~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~-------~~ 139 (397) T protein:vir:23 70 -EGDMKPITKGNMTKRDVHPAKIATIFVASAETVRA--NPANYLGTMRTKVATAIAMAFDNAALHGTNAPS-------AF 139 (397) T ss_pred -CCccccccccceeEEEEeeEEEEEeehhhHHHHhc--chHHHHHHHHHHHHHHHHHHHHHHHhhcccCCc-------cc Confidence 22333444456788888888888888999998873 238899999999999999877777778854211 11 Q ss_pred hhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHHHH Q lcl|Aclame:pro 161 LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPI 240 (337) Q Consensus 161 lqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~~l 240 (337) .||+.. ....+.......| |.+ .++...|... +++ .-+++|.+..... -..+ T Consensus 140 -----~~~~~~---------------~~~~~~~~~~~~~---~~~-~~~~~~l~~~-~~~--~a~~vmn~~~~~~-L~~l 191 (397) T protein:vir:23 140 -----QGYLDQ---------------SNKTQSISPNAYQ---GLG-VSGLTKLVTD-GKK--WTHTLLDDTVEPV-LNGS 191 (397) T ss_pred -----cccccc---------------ccceeeecccchh---HHH-HHHHHhhhhc-ccC--CCEEEEcHHHHHH-HHHh Confidence 122111 0011111222222 222 2344444333 333 2478898877652 2222 Q ss_pred HhccCChHH--HHHH--HHHHhhhhhcCccccccCccCCCce--EEecchhcEEEEecCceEEEEEEcc----------- Q lcl|Aclame:pro 241 VNATQAPTE--RLAA--DLIVSQKRIGNLPAVRVPFFPKRAL--MVTKLSNLSIYYQEGARRRTLKEVP----------- 303 (337) Q Consensus 241 ~n~~~~ptE--~~A~--~~~~~~k~iGGlpa~~vPffP~~~i--liT~l~NLsiY~Q~gs~RR~~~d~p----------- 303 (337) -.....|-= .... .......++-|+|++..+++|++.+ ++..++++-|....+ .+-.+.++. 
T Consensus 192 kd~~G~~i~~~~~~~~~~~~~~~~tl~G~Pv~~s~~~~~g~~~~~~gDfs~~~i~~~~~-i~i~~~~e~~~~~~~~~~~~ 270 (397) T protein:vir:23 192 VDANGRPLFVESTYESLTTPFREGRILGRPTILSDHVAEGDVVGYAGDFSQIIWGQVGG-LSFDVTDQATLNLGSQESPN 270 (397) T ss_pred hccCCceeecccccccccccccCceeeeeeEEEeCCCCCCceEEEEeecceEEEEEEec-eEEEEeeeeeeeeccccccc Confidence 222212110 0000 1111235788999999999999986 456788866544444 333332222 Q ss_pred -----cccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 304 -----ERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 304 -----~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) ++|++.-.--.--++.|-+.+.++.+..-....+ T Consensus 271 ~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~~~~ 309 (397) T protein:vir:23 271 FVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPVLTT 309 (397) T ss_pred eeeeeeccceeEEEEeeeccceecccceEEEeeccccce Confidence 2222222222234455666666666542111111 No 86 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=97.61 E-value=3.7e-05 Score=44.85 Aligned_cols=279 Identities=13% Similarity=0.132 Sum_probs=152.8 Q ss_pred CChHHHHHHHHHHHHH-----HHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeec--ccccc Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQI-----AKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGL--SVSGP 73 (337) Q Consensus 1 M~~~tr~~~~~y~~~~-----a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~l--gv~g~ 73 (337) ++..-+..|..|+..- +...........+.|-..+...+.+.+.+.+.+++.+++++++...|..... 
...++ T Consensus 86 ~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 165 (397) T protein:vir:49 86 VKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYEKWTDITG 165 (397) T ss_pred HHHHHHHHHHHHHhcchhHHHHHhhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeecccCccceEEEeeccCCc Confidence 3344445555554321 1112122223456676678889999999999999999999999888876543 22233 Q ss_pred cccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcC Q lcl|Aclame:pro 74 IASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATT 153 (337) Q Consensus 74 ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~T 153 (337) .++-+..+ ......+...++...+.+++.---+.|+.+.|+.- .++|+..+++.+.++++.-.-.--++|+..... T Consensus 166 ~a~~v~E~-~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~~~- 241 (397) T protein:vir:49 166 LANIDDEA-GKIADVDDPKLSLIKYTIKRYAGISTVTNSLLADS--AENILAWLSGWIAKKVVVTRNKAILEAIAALPT- 241 (397) T ss_pred ceeeecCc-cccccccccceeeEEeeeeeEEeeehhHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc- Confidence 34333322 11222334556777778877777778888888753 368999999999999887655555566332110 Q ss_pred ChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHH Q lcl|Aclame:pro 154 DRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELL 233 (337) Q Consensus 154 d~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl 233 (337) .+.-.+.|.++ +++.. |++.|+.. 
-+++|.+..+ T Consensus 242 ------------------------------------------~~~~~~~d~i~-~~~~~-l~~~~~~~--a~~vmn~~~~ 275 (397) T protein:vir:49 242 ------------------------------------------KPTLTKWDDII-DLEAK-VDPAIKQT--SFFLTNTSGF 275 (397) T ss_pred ------------------------------------------ccccccHHHHH-HHHHh-hhhhhcCC--CEEEEcHHHH Confidence 00112455544 45554 46666553 5889999887 Q ss_pred HHHHHHHHh-ccCChHHHHHHHHH-HhhhhhcCccccccC--ccCCCc-----eEEecchhc-EEEEecCceEEEEEEc- Q lcl|Aclame:pro 234 HDKYFPIVN-ATQAPTERLAADLI-VSQKRIGNLPAVRVP--FFPKRA-----LMVTKLSNL-SIYYQEGARRRTLKEV- 302 (337) Q Consensus 234 ~~k~~~l~n-~~~~ptE~~A~~~~-~~~k~iGGlpa~~vP--ffP~~~-----iliT~l~NL-siY~Q~gs~RR~~~d~- 302 (337) .. +..+. ..+.|= ....+. ....++-|+|++.++ .+|.++ +++=.|++. -++.+.| .+-..-+. T Consensus 276 ~~--l~~lkd~~G~~l--~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~-~~i~~~~~~ 350 (397) T protein:vir:49 276 TA--LKKVKNALGDYL--MERDVKSPTGYSIDGFAVKEVADRWLANGTGGAMPLYFGDLKQAVTLFDRQH-MSLLSTNIG 350 (397) T ss_pred HH--HHHhhcCCCcee--eccCcCCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecc-eEEEEeccc Confidence 52 22232 222210 000000 124589999998765 366654 666666653 3333333 22222111 Q ss_pred ---ccccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 303 ---PERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 303 ---p~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) -.++.+--+-..--++.|-+...++.+ ++..+ T Consensus 351 ~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~---~~~~~ 385 (397) T protein:vir:49 351 GGAFETDTTKVRVIDRFDVVATDTEAFVPA---SFKAI 385 (397) T ss_pred cchhhcCceeEEEEeeeCcEEecccceEEE---Eeecc Confidence 122222222222334555555555554 34443 No 87 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=97.61 E-value=3.8e-05 Score=44.81 Aligned_cols=299 Identities=12% Similarity=0.059 Sum_probs=138.5 Q ss_pred 
CCh--------HHHHHHHHHHHHH------------HHhhC-chhhcceEeechHHHHHHHHHHHhhHHHhcccceeccc Q lcl|Aclame:pro 1 MRK--------ETRQAYEKYAAQI------------AKLND-TGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVT 59 (337) Q Consensus 1 M~~--------~tr~~~~~y~~~~------------a~~ng-v~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~ 59 (337) +.. .-+..|..+..+- ...+. .....-.+.|-+.+.+.+.+.+++++.+++.++++++. T Consensus 123 ~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~ 202 (458) T protein:vir:10 123 LYGTQENFEDEVEKLVLLSYVMEKGVFETEHGQRHLKAVNQSSSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMS 202 (458) T ss_pred chhhhhhHHHHHHHHHHHHHHHhhccchhhhhhhhhhhhhhcccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecC Confidence 000 0111111111110 00000 00112344666788999999999999999999998875 Q ss_pred hhhceeeecccccccccccCCC-Ccccc---cccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHH Q lcl|Aclame:pro 60 ELEGEKLGLSVSGPIASRTDTT-KAARQ---PIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQG 135 (337) Q Consensus 60 ~~~Ge~v~lgv~g~ia~Rt~t~-~~~R~---p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~ 135 (337) --... +..-..++-++=+.-+ ..+-. +..-..++...+..++.--.+.|+.+.|+... ++|+..+.+.+.+.+ T Consensus 203 ~~~~~-~~~~~~~~~a~~v~e~~~~~~~~~~~~~~~~~~~i~~~~~k~~~~v~is~ell~ds~--~~~~~~i~~~l~~~i 279 (458) T protein:vir:10 203 SKILT-MLVEPDAGKATWVAASTYGTDTTTGEEVKGALKEIHFSTYKLAAKSFITDETEEDAI--FSLLPLLRKRLIEAH 279 (458) T ss_pred CcceE-EEEecCCcceeecccccccccccccccccccceeeEeeeeeEEeeehhhHHHHhcch--HHHHHHHHHHHHHHH Confidence 42222 1112222222221111 11111 01112355667777777777899999887643 689999999999998 Q ss_pred hhhhHHhcccccccCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeec-C--CcccccHHHHHHHHHhc Q lcl|Aclame:pro 136 ALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVG-K--AGDYENLDALVMDIVSS 212 (337) Q Consensus 136 aLD~i~IGfnG~s~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g-~--ggdy~nLDaLv~d~~~~ 212 (337) +.-.-.--+||+-. ..| +|.+. .....++.+..+ . ..+-.+.|.|+ +++.. 
T Consensus 280 ~~~~d~~~l~G~G~-------~~p------~Gi~~------------~~~~~~~~~~~~~~~~~~~~~~~~~i~-~~~~~ 333 (458) T protein:vir:10 280 AVSIEEAFMTGDGS-------GKP------KGLLT------------LASEDSAKVVTEAKADGSVLVTAKTIS-KLRRK 333 (458) T ss_pred HHHHHHHhhcCCCC-------Ccc------ceeee------------cccccccceeecccccccccccHHHHH-HHHHh Confidence 86555555777421 122 23222 211111121111 1 11223345444 35554 Q ss_pred ccChhHcCCCCEEEEECHHHHHHHHHHHHh-ccCChHHH---HHHHHHHhhhhhcCccccccCccCCCc----eEEecch Q lcl|Aclame:pro 213 MIDPWFQEDTGLVVICGRELLHDKYFPIVN-ATQAPTER---LAADLIVSQKRIGNLPAVRVPFFPKRA----LMVTKLS 284 (337) Q Consensus 213 lid~~~r~~~~LVvivG~dLl~~k~~~l~n-~~~~ptE~---~A~~~~~~~k~iGGlpa~~vPffP~~~----iliT~l~ 284 (337) +++.++. .-+++|.+..+.. +..+. ....|--. ..........++-|+|++...++|+.+ +++=-+. T Consensus 334 -l~~~~~~--~~~~v~~~~~~~~--l~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~~~~~~~f~ 408 (458) T protein:vir:10 334 -LGRHGLK--LSKLVLIVSMDAY--YDLLEDEEWQDVAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAKANSAEFAVIVYK 408 (458) T ss_pred -hhhhhcC--CCEEEEcHHHHHH--HHhhcccCCceeeccccccccccCcCceecceeeEEccccccccCCcceEEEEec Confidence 4566654 3568899887753 22232 22222100 001111223478899999999999863 4554553 Q ss_pred h-cEEEEecCceEEEEEEcccccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 285 N-LSIYYQEGARRRTLKEVPERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 285 N-LsiY~Q~gs~RR~~~d~p~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) + .-|+- ++..+-...+--..+.+..+-..=-+..|=.+..++ -++++.| T Consensus 409 ~~~~~~~-~~~~~v~~d~~~~~~~~~~~~~~r~~~~v~~~~a~v---~~~~aa~ 458 (458) T protein:vir:10 409 DNFVMPR-QRAVTVERERQAGKQRDAYYVTQRVNLQRYFANGVV---SGTYAAS 458 (458) T ss_pred ccEEEEE-eeceEEEeecccCCCceEEEEEEEecceEecccceE---EEeeccC Confidence 3 33322 222221111111122221111111122222222221 1555555 No 88 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=97.61 E-value=3.8e-05 Score=44.80 Aligned_cols=285 
Identities=11% Similarity=0.012 Sum_probs=156.5 Q ss_pred HHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCCCCcccccccccccCC Q lcl|Aclame:pro 16 IAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDTTKAARQPIDPTALDS 95 (337) Q Consensus 16 ~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t~~~~R~p~~~~~l~~ 95 (337) || .++. .+-.+.|-+...+.+++.++++|.++++.+++++.-- +-++-.-.+++-++-... +...|..-..++. T Consensus 1 Ma--~~~~-~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~-~~~ip~~~~~~~a~wv~E--g~~~~~s~~~f~~ 74 (315) T protein:vir:80 1 MA--DDFL-SAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFG-PVKGAVFSGVPRAKIVGE--GEVKPSASVDVSA 74 (315) T ss_pred CC--CCcC-CcCceEcchHHHHHHHHHHHhhchhhhhcceeecCCC-ceEEEEEeCCcceEEeeC--Cccccccccceee Confidence 44 2332 2456789889999999999999999999999887532 223333344555544332 3344555567888 Q ss_pred ceeEEEEeeeeeecCHHHHHHHhC--ChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhhhhhccchhHHHHHH Q lcl|Aclame:pro 96 NRYRCEKTDYDTAIPYRKLDMWAK--FADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYR 173 (337) Q Consensus 96 ~~Y~c~qtn~d~~i~y~~LD~WA~--~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anPllqDVN~GWlq~~R 173 (337) ....+++.---+.|+-+.|.+..- ...+++.+.+.+++.++.=.-...|||+.-...+.+. T Consensus 75 v~l~~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~----------------- 137 (315) T protein:vir:80 75 FTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAAS----------------- 137 (315) T ss_pred eEeeeeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCcccc----------------- Confidence 888899888888888888755432 2346788888888888876667788996432211110 Q ss_pred HhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHHHHHhccCChHH--HH Q lcl|Aclame:pro 174 ERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNATQAPTE--RL 251 (337) Q Consensus 174 e~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~~l~n~~~~ptE--~~ 251 (337) -+..... ...+.....+..|.+++.++.- + ....++. +. +++|.+.....-. 
.|......++- .+ T Consensus 138 -----~~~~~~~-~~~~~~~~~~~~~~d~~~~~~~-~---~~~~~~~-~~-~~imn~~~~~~L~-~l~~~~g~~~~g~~~ 204 (315) T protein:vir:80 138 -----AVHTSLN-KTKNIVDATDSATADLVKAVGL-I---AGAGLQV-PN-GVALDPAFSFALS-TEVYPKGSPLAGQPM 204 (315) T ss_pred -----ccccccc-cccceeeccccchHHHHHHHHH-H---hhccCcc-ce-EEEEcHHHHHHHH-HHhhccCCccccccc Confidence 0000000 0111111223346777666543 2 2222221 22 6889988766432 22211111110 00 Q ss_pred HHHH-HHhhhhhcCccccccCccCCCc---------eEEecchhcEEEEecCceEEEEEEcc----------cccceece Q lcl|Aclame:pro 252 AADL-IVSQKRIGNLPAVRVPFFPKRA---------LMVTKLSNLSIYYQEGARRRTLKEVP----------ERDRIENY 311 (337) Q Consensus 252 A~~~-~~~~k~iGGlpa~~vPffP~~~---------iliT~l~NLsiY~Q~gs~RR~~~d~p----------~r~rve~y 311 (337) --.+ .....++-|+|++..+++|++. +++--++++-|-..++ .+-.+-+.. ++|++.-. T Consensus 205 ~~~~~~g~~~tl~G~PV~~~~~~~~~~~~~~~~~~~~~~GDfs~~~~g~~~~-~~i~i~~~~~~~~~~~~~~~~~~v~~r 283 (315) T protein:vir:80 205 YPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRN-FPIELIEYGDPDQTGRDLKGHNEVMVR 283 (315) T ss_pred ccccccCCCceecceeeEecCcCCcccccccccccEEEEeecccEEEEEecC-eeEEEeccccccCcccchhhcCcEEEE Confidence 0000 0122478999999999999764 4455666654433322 222222221 12333332 Q ss_pred eeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 312 ESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 312 ~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) --.--+..|.+.+.++.+++..-..+ T Consensus 284 ~~~r~~~~v~~~~a~~~l~~~~a~~~ 309 (315) T protein:vir:80 284 AEAVLYVAIESLDSFAVVKEKAAPKP 309 (315) T ss_pred EEEEecceeecccceEEEeeccCCCC Confidence 22335677788888888765544333 No 89 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=97.52 E-value=5.1e-05 Score=44.12 Aligned_cols=282 Identities=14% Similarity=0.087 Sum_probs=153.7 Q ss_pred CChHHHHHHHHHHHHHHHhh--Cch-hhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeec--ccccccc Q lcl|Aclame:pro 1 
MRKETRQAYEKYAAQIAKLN--DTG-DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGL--SVSGPIA 75 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~n--gv~-~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~l--gv~g~ia 75 (337) ....++...+.++...-+.- ++. ...-.+.|-+.+...+.+.+.+.+.+++.+++++++...|....+ ...++.+ T Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a 165 (395) T protein:vir:38 86 GKPDAQAMKNQFVKDFKNLVTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEKLADITPLK 165 (395) T ss_pred hhHHHHHHHHHHHHHHHHHHhhccCccCCCceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEeeccCCccc Confidence 33333444444443322111 111 222345666677889999999999999999999999888886432 2223334 Q ss_pred cccCCCCccccc-ccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCC Q lcl|Aclame:pro 76 SRTDTTKAARQP-IDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTD 154 (337) Q Consensus 76 ~Rt~t~~~~R~p-~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td 154 (337) +-... +...| .+...++...+.+++.---+.|+.+.|+.. .++|++.+.+.+.+.++.-.-.-=+||.-.... T Consensus 166 ~~v~E--~~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~-- 239 (395) T protein:vir:38 166 DLDDE--SALIGDNDDPELTVVKYLIHRYAGITTVTNTLLKDT--VDNIIQWLVNWAAKKDVVTRNAKILEVMGKAPK-- 239 (395) T ss_pred ccccc--ccccccccccceeeEEeeeeeeEeehhhHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-- Confidence 32222 12223 233456677788887777778888888752 358999999999999886544444454321110 Q ss_pred hhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHH Q lcl|Aclame:pro 155 RQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLH 234 (337) Q Consensus 155 ~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~ 234 (337) .+... +.|.++ ++++..+++.|+. .-+++|.+..+. 
T Consensus 240 --------------------------------------~~~~~---~~~~i~-~~~~~~l~~~~~~--~a~~v~n~~~~~ 275 (395) T protein:vir:38 240 --------------------------------------KPTIS---QFDNIK-DLENNTLDPAIES--TSSFITNQSGYN 275 (395) T ss_pred --------------------------------------ccccc---cHHHHH-HHHHHhhhhhhcC--CCEEEEcHHHHH Confidence 00111 234443 3444456777775 458899998765 Q ss_pred HHHHHHHhccCChH--HHHHHHHHHhhhhhcCccccccCccCCC------ceEEecchhc-EEEEecCceEEEEEEcc-- Q lcl|Aclame:pro 235 DKYFPIVNATQAPT--ERLAADLIVSQKRIGNLPAVRVPFFPKR------ALMVTKLSNL-SIYYQEGARRRTLKEVP-- 303 (337) Q Consensus 235 ~k~~~l~n~~~~pt--E~~A~~~~~~~k~iGGlpa~~vPffP~~------~iliT~l~NL-siY~Q~gs~RR~~~d~p-- 303 (337) . -..|-...+.|- .-... ....+|-|+|++..+..|.. .+++--+++. -|+...|. .-.+.+.+ T Consensus 276 ~-L~~lkd~~G~~l~~~~~~~---~~~~~l~G~pV~~~~~~~~~~~~~~~~i~~gd~~~~~~i~~~~~~-~i~~~~~~~~ 350 (395) T protein:vir:38 276 I-LSKVKDADGRYLMQPDVTS---PDKYLIDGKPVIRIADKWLPDVSGSHPLYFGDLKQGITLFDRQQM-QIDTTNVGAG 350 (395) T ss_pred H-HHHhhccCCceeeccCcCC---CCcceeccceeEEecccccCcCCCcceEEEEeccccEEEEEecce-EEEEeccccc Confidence 3 222222222211 00000 12357889999998764333 2677777763 34434442 22222222 Q ss_pred --cccceeceeeeeeeeeeeccccEEEeecceecc-C Q lcl|Aclame:pro 304 --ERDRIENYESSNDAYVVEDFGCGCVAENIELAA-A 337 (337) Q Consensus 304 --~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~-a 337 (337) .++.+--.-..--+..|-+..+++.++--..+. 
+ T Consensus 351 ~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~ 387 (395) T protein:vir:38 351 SFEHDTTKLRFIDRFDVQLIDDGAFAAASFKTVANQA 387 (395) T ss_pred hhhcCceEEEEEEeeccEEecccceEEEEeecccCCC Confidence 233333333333456777777777776221111 1 No 90 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=97.50 E-value=5.5e-05 Score=43.96 Aligned_cols=298 Identities=13% Similarity=0.132 Sum_probs=145.0 Q ss_pred CChHHHHHHHHHHHHHHHhh------------CchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeee- Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLN------------DTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLG- 67 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~n------------gv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~- 67 (337) .....+.....++......+ .-.+....+.|-+.+...+.+.+++.+.+++.+++++|....|.... T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~ 159 (404) T protein:vir:10 80 GALFVRAIADNLLKQKNQRGLNLSEKEINAISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRTYE 159 (404) T ss_pred HHHHHHHHHHHHHHHHHhhhhcchhhHHhhhccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceeeccCCccceEEE Confidence 11112222222222221111 11122345667778889999999999999999999999988886532 Q ss_pred cccccccccccCCCCccccccc--ccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhccc Q lcl|Aclame:pro 68 LSVSGPIASRTDTTKAARQPID--PTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWN 145 (337) Q Consensus 68 lgv~g~ia~Rt~t~~~~R~p~~--~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfn 145 (337) ...+++-+.-...+. ..|.+ -..++...+..++.---+.|+.+.|+. 
..++|...+++.+++.++.-.-.-=++ T Consensus 160 ~~~~~~~~~~v~e~~--~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~la~~~~~~~~~~il~ 235 (404) T protein:vir:10 160 KRSKQKPMKPLSENQ--QIPTNGDNGKLERFNFKLKDLADFMSIPNDLLKF--ADKSLEDWIINWFVDKVRITRNAEILY 235 (404) T ss_pred EecCCcceeeccccc--cccccccccceeeeEeeheeeEeeehhhHHHHhh--cHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 223333333333222 12221 122444555555555556777777763 124788888888888777533222235 Q ss_pred ccccCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEE Q lcl|Aclame:pro 146 GVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLV 225 (337) Q Consensus 146 G~s~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LV 225 (337) |+- +. .+|.+ ++... +...+..+....|..|..++. ..+++-|+. ..+ T Consensus 236 G~g---~~---~~~~g------------------i~~~~--~~~~~~~~~~~~~~~~~~~~~----~~l~~~~~~--~~~ 283 (404) T protein:vir:10 236 GAG---GD---EHATG------------------IMTAN--KFKKITLPKSPALKDFKKCKN----VELLNVFKA--TSS 283 (404) T ss_pred cCC---CC---Ccccc------------------eeecc--ccceeeccccccHHHHHHHHH----hhhhccccC--CCE Confidence 522 11 11221 11111 111233444555555544332 224555544 457 Q ss_pred EEECHHHHHHHHHHHHhccCChHHHHHHHH-HHhhhhhcCccccccCc-cCCCc-----eEEecchhcEEEEecCceEEE Q lcl|Aclame:pro 226 VICGRELLHDKYFPIVNATQAPTERLAADL-IVSQKRIGNLPAVRVPF-FPKRA-----LMVTKLSNLSIYYQEGARRRT 298 (337) Q Consensus 226 vivG~dLl~~k~~~l~n~~~~ptE~~A~~~-~~~~k~iGGlpa~~vPf-fP~~~-----iliT~l~NLsiY~Q~gs~RR~ 298 (337) ++|.+..++. -..|-...+.|- ..... -....++-|+|++.+|. +|+.+ +++-.+++.-..+.++...=. 
T Consensus 284 ~v~n~~~~~~-L~~lkd~~G~~l--~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~~i~ 360 (404) T protein:vir:10 284 WIVNQDGFNY-LDSLEDKTGRPY--LQPDPKDPTQYRFLGLPVIELPNDLLLSTESAIPVLLGDTKEAYKYVSDGAYELA 360 (404) T ss_pred EEEcHHHHHH-HHHhhccCCcee--eccCcCCCCCccccceeeEEecccccCCCCCccEEEEEeccccEEEEEecceEEE Confidence 8999987653 122211111111 00000 01234788999986554 56554 777777764333333343333 Q ss_pred EEEccc----ccceeceeeeeeeeeeeccccEEEee-cceeccC Q lcl|Aclame:pro 299 LKEVPE----RDRIENYESSNDAYVVEDFGCGCVAE-NIELAAA 337 (337) Q Consensus 299 ~~d~p~----r~rve~y~s~Ne~YvVEd~~~~a~ie-ni~~~~a 337 (337) +.+++. ++.+--+-..--++.|-+...++.+. -..-.+| T Consensus 361 ~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~aa~~~ 404 (404) T protein:vir:10 361 TTNIGAGAFETNTTKARIIMRIDGNVKDSEALLIAEIPVESVQA 404 (404) T ss_pred EeccccchhhcCceEEEEEEeeccEEecccceEEEEeecccCCC Confidence 322322 23333223333355666666666554 2222333 No 91 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=97.47 E-value=6.1e-05 Score=43.68 Aligned_cols=280 Identities=13% Similarity=0.102 Sum_probs=151.4 Q ss_pred CChHHHHHHHHHHHHHH-----HhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeec--ccccc Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIA-----KLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGL--SVSGP 73 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a-----~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~l--gv~g~ 73 (337) +...-+..|..|+..-- .........-.+.|-+.+...+.+.+.+.+.+++..++++++...|..... 
...++ T Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 165 (397) T protein:vir:48 86 VKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADITG 165 (397) T ss_pred HHHHHHHHHHHHHhhhhhHHHHHhhccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCCCc Confidence 33344444444443211 011111122346677788899999999999999999999999888886643 22334 Q ss_pred cccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcC Q lcl|Aclame:pro 74 IASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATT 153 (337) Q Consensus 74 ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~T 153 (337) .+..+..+.. ....+...++...+..++.---+.|+.+.|+.- ..+|+..+++.+.++++.-.-.--+||+..+.. T Consensus 166 ~a~~v~E~~~-~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds--~~~l~~~v~~~l~~~~~~~~d~~il~G~g~~~~- 241 (397) T protein:vir:48 166 LAKLDDEAGS-IGTNDDPKLYPIRYAIKRYAGISTVTNSLLADS--AENILAWLSGWIAKKVVVTRNKAILEAIATLPT- 241 (397) T ss_pred ceeeeccccc-cccccccceeeEEeeheeeeeehhhHHHHHhhc--hHHHHHHHHHHHHHHHHHHHHHHHhhccccccc- Confidence 4433332221 111122334555555555555578888888763 357888899999988887666666677432110 Q ss_pred ChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHH Q lcl|Aclame:pro 154 DRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELL 233 (337) Q Consensus 154 d~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl 233 (337) .+.. .+.|.++ +++.. +++.|+. 
.-+++|.+..+ T Consensus 242 ---------------------------------------~~~~---~~~d~i~-~~~~~-l~~~~~~--~a~~v~n~~~~ 275 (397) T protein:vir:48 242 ---------------------------------------KPTL---TKWDDII-DLQAK-VDPAIKQ--TSFFLTNTSGF 275 (397) T ss_pred ---------------------------------------cccc---ccHHHHH-HHHHH-hhhhhcC--CCEEEECHHHH Confidence 0000 1345444 45544 4566665 35888999887 Q ss_pred HHHHHHHHhccCChH--HHHHHHHHHhhhhhcCccccccC--ccC-----CCceEEecchhcEEEEecCceEEEEEEcc- Q lcl|Aclame:pro 234 HDKYFPIVNATQAPT--ERLAADLIVSQKRIGNLPAVRVP--FFP-----KRALMVTKLSNLSIYYQEGARRRTLKEVP- 303 (337) Q Consensus 234 ~~k~~~l~n~~~~pt--E~~A~~~~~~~k~iGGlpa~~vP--ffP-----~~~iliT~l~NLsiY~Q~gs~RR~~~d~p- 303 (337) +. -..|-+..+.|- .-... ....+|-|+|++.++ ++| ...+++=.|++...++.++..+-.+.+.. T Consensus 276 ~~-L~~lkd~~G~~i~~~~~~~---~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~ 351 (397) T protein:vir:48 276 TA-LKKVKNAFGDYLMERDVKS---PTGYSIDGFAVKEVADRWLANASSGAMPLYFGDLKQAVTLFDRQQMSLLSTNIGG 351 (397) T ss_pred HH-HHHhhcCCCceeeccCcCC---CCCceeccceeEEecccccCCcCCCceEEEEEeccceEEEEeecceEEEEeccch Confidence 53 222222222221 00101 124588999998765 344 33467777777655555554443333222 Q ss_pred ---cccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 304 ---ERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 304 ---~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) .++.+--.-..--++.|-+...++.+ ++..+ T Consensus 352 ~~~~~~~~~~r~~~r~d~~~~~~~a~~~~---~~~~~ 385 (397) T protein:vir:48 352 GAFETDTTKIRVIDRFDVVATDTESFVPA---SFKAI 385 (397) T ss_pred hhhhcCceeEEEEeeeccEEecccceEEE---Eeccc Confidence 22222222222234555566555554 44444 No 92 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=97.45 E-value=6.5e-05 Score=43.52 Aligned_cols=277 Identities=9% Similarity=0.044 Sum_probs=143.5 Q ss_pred CChH----HHHHHHHHHHHHH----HhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeeccccc Q 
lcl|Aclame:pro 1 MRKE----TRQAYEKYAAQIA----KLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSG 72 (337) Q Consensus 1 M~~~----tr~~~~~y~~~~a----~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g 72 (337) |... .+..|..|+..-. ...+.....-.|.|-+.+.+.+.+.+.+.+.+++.++++++...+|....+.-++ T Consensus 83 ~~~~~~~~~~~~~~~~lr~~~~~~~~~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 162 (389) T protein:vir:10 83 LSKKPIDAKKKAINDFIHSHGKVIDATSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKRAT 162 (389) T ss_pred cchhHHHHHHHHHHHHhhcchhhhhhhcccccCCcceeehHHHHHHHHHHHHhhhhHHhhcceeeccCCeeEEEEEecCC Confidence 2222 2345666654221 1112222234567766778889999999999999999999988777765443222 Q ss_pred ccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhh--hHHhcccccccC Q lcl|Aclame:pro 73 PIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALD--RIMIGWNGVKAA 150 (337) Q Consensus 73 ~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD--~i~IGfnG~s~A 150 (337) .-+.-. +.+..+.+.+-..++...+..++.---+.|+.+.|+. ..++|+..+++.++++++.- ...++-.|+ T Consensus 163 ~~~~~~-~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~la~~~~~~~~~~i~~g~~~--- 236 (389) T protein:vir:10 163 DRFSSV-AELAENPKLAEPEFNKVDWSVATYRGAIPLSEEAIAD--SAVDLTALVGQSIKEKSVNTYNAMIAPVLQS--- 236 (389) T ss_pred Cccccc-cccccccccccccceeeeeeheeeEeeehhhHHHHhh--hhHHHHHHHHHHHHHHHHHHHHHHHhhhhcc--- Confidence 222111 1222333233345666677777666666777777764 34589999999999888752 111111110 Q ss_pred CcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECH Q lcl|Aclame:pro 151 ATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGR 230 (337) Q Consensus 151 ~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~ 230 (337) + ..+......+.|.++ ++++..+++.+.. 
+++|.+ T Consensus 237 ---------------------------------~-------~~~~~~~~~~~d~l~-~~~~~~~~~~~~a----~~~~n~ 271 (389) T protein:vir:10 237 ---------------------------------F-------TAKKTTTDTLVDSLK-HILNVDLDPAYSR----ALVVTQ 271 (389) T ss_pred ---------------------------------c-------ccccccccccHHHHH-HHHHhhhhhhhCc----EEEecH Confidence 0 001111234566665 4555567777632 789999 Q ss_pred HHHHHHHHHHHhccCCh------HHHHHHHHHHhhhhhcCccccccCc-cCCCc-----eEEecchhcE-EEEecCceEE Q lcl|Aclame:pro 231 ELLHDKYFPIVNATQAP------TERLAADLIVSQKRIGNLPAVRVPF-FPKRA-----LMVTKLSNLS-IYYQEGARRR 297 (337) Q Consensus 231 dLl~~k~~~l~n~~~~p------tE~~A~~~~~~~k~iGGlpa~~vPf-fP~~~-----iliT~l~NLs-iY~Q~gs~RR 297 (337) ..+..- ..|-...+.| +...++ ....++-|+|++.++- +|+.. +++-.|++.- |+.+.| .+- T Consensus 272 ~~~~~L-~~lkd~~G~~i~~~~~~~~~~~---~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~-~~i 346 (389) T protein:vir:10 272 SLFNTL-DTLKDKNGRYLLHDASDSITDG---TAKGTILGVPVYVVGDTLLGSLAGDQKAFVGDLKRGVLFTDRQQ-VTL 346 (389) T ss_pred HHHHHH-HHhhccCCCeeeecCccccccc---ccccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecc-eEE Confidence 876421 1122221111 111000 1234789999987664 34332 7888888854 443433 333 Q ss_pred EEEEcccccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 298 TLKEVPERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 298 ~~~d~p~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) ...++......---..| -+..|=+...++.+ ++.++ T Consensus 347 ~~~~~~~~~~~~~~~~r-~d~~~~~~~a~~~~---~~~~~ 382 (389) T protein:vir:10 347 AWEDSKIYGKYLGAAFR-FGVQKADSKAGYFV---TNTDV 382 (389) T ss_pred EeeccccccceEEEEEE-eccEEecccceEEE---Eeecc Confidence 33333222221111111 22334444444444 34443 No 93 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=97.38 E-value=8.1e-05 Score=43.03 Aligned_cols=283 Identities=11% Similarity=-0.015 Sum_probs=152.6 Q ss_pred 
HHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCCCCcccccccccccCC Q lcl|Aclame:pro 16 IAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDTTKAARQPIDPTALDS 95 (337) Q Consensus 16 ~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t~~~~R~p~~~~~l~~ 95 (337) ||.. + +-.+.|-+...+.+.+.++++|..++..+++++.--. .++-.-.+++-++-.. .+...|..-..++. T Consensus 1 mat~----~-~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~-~~~p~~~~~~~a~wv~--Eg~~~~~~~~~f~~ 72 (311) T protein:vir:81 1 MVAL----A-TGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVG--EGAQKSESTATFAP 72 (311) T ss_pred Ccee----c-CCceEcchhHHHHHHHHHHhcchhhhhcceeecCCCc-eEEEEEeCCceeEEee--cCcccccccceeeE Confidence 3221 1 2357788888999999999999999999998875422 2222223444444332 22333444456888 Q ss_pred ceeEEEEeeeeeecCHHHHHHHhC-ChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhhhhhccchhHHHHHHH Q lcl|Aclame:pro 96 NRYRCEKTDYDTAIPYRKLDMWAK-FADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYRE 174 (337) Q Consensus 96 ~~Y~c~qtn~d~~i~y~~LD~WA~-~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anPllqDVN~GWlq~~Re 174 (337) ..+.+++.--.+.|+.+.|..+.. ..+|++.+.+.++++++.-.-.-.+||+.....+.+ T Consensus 73 v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~------------------- 133 (311) T protein:vir:81 73 VTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAAL------------------- 133 (311) T ss_pred EEEeeEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCccc------------------- Confidence 899999998889999999976654 568999999999999999888888999653332221 Q ss_pred hchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHHHHHhccCChHHHHHHH Q lcl|Aclame:pro 175 RAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNATQAPTERLAAD 254 (337) Q Consensus 175 ~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~~l~n~~~~ptE~~A~~ 254 (337) .-+..........+.. ..++-.+.|.++..++. ++... ..++. .++|.+..+..- ..|-.....|-=. ... 
T Consensus 134 ---~gi~~~~~~~~~~~~~-~~~~~~~~~~~i~~~~~-~~~~~-~~~~~-~~vmn~~~~~~l-~~lkd~~G~~l~~-~~~ 204 (311) T protein:vir:81 134 ---SGSPAKILDTTNIVEL-TTGTSATPDLAVEAAVG-LVLGD-NLSPD-GVALDNTFSFML-ATQRDSQGRKLYP-ELG 204 (311) T ss_pred ---ccccccccccceeeee-cccccchHHHHHHHHHH-Hhhhc-CCCce-EEEEcHHHHHHH-HhhhccCCCeeec-Ccc Confidence 1111111111111111 12233445666666654 33332 33333 478888776532 2232222222100 001 Q ss_pred HHHhhhhhcCccccccCccCCCceE------------------EecchhcEEEEecCceEEEEEEcccccceeceeeeee Q lcl|Aclame:pro 255 LIVSQKRIGNLPAVRVPFFPKRALM------------------VTKLSNLSIYYQEGARRRTLKEVPERDRIENYESSND 316 (337) Q Consensus 255 ~~~~~k~iGGlpa~~vPffP~~~il------------------iT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~y~s~Ne 316 (337) .-....++-|+|++..-++|.+... +=-++++-|-...+..= .+-++.+-+...++..+|. T Consensus 205 ~~~~~~tl~G~Pv~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~-~~~~~~~~~~~~~~~~~~~ 283 (311) T protein:vir:81 205 FGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPL-ELIEFGDPDGLGDLKRQNQ 283 (311) T ss_pred ccCCCceecceeEEecccccccccccccccchhcccCCccEEEEEecccEEEEEeccceE-EEeccCCCCcchhhhhcCc Confidence 1123567889999998888876533 33333333322333221 2222211111122222222 Q ss_pred ---------eeeeeccccEEEeecceec Q lcl|Aclame:pro 317 ---------AYVVEDFGCGCVAENIELA 335 (337) Q Consensus 317 ---------~YvVEd~~~~a~ieni~~~ 335 (337) ++.|=+.++++.+...+-+ T Consensus 284 v~~r~~~r~d~~v~~~~a~~~l~~a~~~ 311 (311) T protein:vir:81 284 IAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) T ss_pred EEEEEEEEeccEeecccceEEEEeeccC Confidence 2344455555555443333 No 94 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=97.30 E-value=9e-05 Score=42.75 Aligned_cols=272 Identities=11% Similarity=0.074 Sum_probs=136.4 Q ss_pred CChHHHHHHHHHHHHHHH--hhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeeccccccccccc Q lcl|Aclame:pro 1 
MRKETRQAYEKYAAQIAK--LNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRT 78 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~--~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt 78 (337) .....+..+..++..... ..+.......+.|-+...+.+.+ ..+.+..++.+++++++...|.......++..++-. T Consensus 112 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~-~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 190 (397) T protein:vir:96 112 ELAEKRSAINAFVKSKGAEKRDGFTSVEGGALIPQELLQPQLE-PKDIVDLSKYVRSVPVNSASGKFPVISKSGSKMATV 190 (397) T ss_pred HHHHHHHHHHHHHHhhhhhhhhcccccccccchhHHHHHHHHH-hhhhhhHHHhhhhccccccceeEEEEeccCCccccc Confidence 223334445555443321 12233344556677777888877 466777899999999998888766554443333322 Q ss_pred CCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhh Q lcl|Aclame:pro 79 DTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQAN 158 (337) Q Consensus 79 ~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~an 158 (337) .. ...+.......++...+.+++.---+.++.+.|+... ++++..+++.+.+.++.-.-.--++|+..+.. T Consensus 191 ~E-~~~~~~~~~~~~~~i~~~~~~~~~~~~~s~ell~ds~--~~l~~~i~~~l~~~~~~~~~~~i~~g~g~~~~------ 261 (397) T protein:vir:96 191 QQ-LEKNPQLANPKMVEIDYSVATRRGYIPISQEMIDDAS--YDVTGLIADEIQDQSLNTKNADIAAVLKTATA------ 261 (397) T ss_pred cc-cccccccccccccceeecHhHhhcchhhHHHHHhhhH--HHHHHHHHHHHHHHHHHHHHHHHhhccccccc------ Confidence 11 1122112223455556666555445567777777653 57888888888888776433322333221110 Q ss_pred hhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHH Q lcl|Aclame:pro 159 PLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYF 238 (337) Q Consensus 159 PllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~ 238 (337) .+. .+.|.++ +++...+++.+ + -+++|.+..+..- . 
T Consensus 262 -----------------------------~~~---------~~~d~~~-~~~~~~~~~~~-~---a~~v~n~~~~~~l-~ 297 (397) T protein:vir:96 262 -----------------------------KSV---------VGVDGLK-DLINKEIKKVY-D---VKLFISASMYSEL-D 297 (397) T ss_pred -----------------------------ccc---------cchHHHH-HHHHHhhhhhc-C---cEEEEcHHHHHHH-H Confidence 000 1234333 45555566643 3 3899999776532 1 Q ss_pred HHHhccCChHHHHHHHHH-HhhhhhcCccccccCcc-CCC-----ceEEecchhcEEEEecCceEEEEEEcccccceece Q lcl|Aclame:pro 239 PIVNATQAPTERLAADLI-VSQKRIGNLPAVRVPFF-PKR-----ALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENY 311 (337) Q Consensus 239 ~l~n~~~~ptE~~A~~~~-~~~k~iGGlpa~~vPff-P~~-----~iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~y 311 (337) .|-...+.|- ....+. ....++-|+|++..+.. |+. .+++-.|++.-..+-++...-...++..... T Consensus 298 ~lkd~~G~~~--~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~~~~~~---- 371 (397) T protein:vir:96 298 KLKDKNGRYL--LQDSITAASGKQLLGKEVVVLDDDVIGKSVGNVVGFIGDAKAFASFFDRKQVSVSWVDNNIYGQ---- 371 (397) T ss_pred HhhccCCCeE--eccCccCCCcccccccceEEecccccCCCCCceEEEEeehhcceEeEeecceEEEEecccccce---- Confidence 2222222221 000110 12357889999876654 333 2787777775433433444433333221111 Q ss_pred eeeeeee-eeeccccEEEe----ecceeccC Q lcl|Aclame:pro 312 ESSNDAY-VVEDFGCGCVA----ENIELAAA 337 (337) Q Consensus 312 ~s~Ne~Y-vVEd~~~~a~i----eni~~~~a 337 (337) ++ +++.++....- =-+++.-| T Consensus 372 -----~~~~~~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 372 -----LLAGIIRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred -----eEEEEEEEccEEecccceEEEEeecC Confidence 12 22333332211 12334444 No 95 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=97.28 E-value=0.00011 Score=42.37 Aligned_cols=266 Identities=13% Similarity=0.110 Sum_probs=144.1 Q ss_pred HHHhhCch-hhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeec--ccccccccccCCCCcccccccccc Q lcl|Aclame:pro 16 
IAKLNDTG-DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGL--SVSGPIASRTDTTKAARQPIDPTA 92 (337) Q Consensus 16 ~a~~ngv~-~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~l--gv~g~ia~Rt~t~~~~R~p~~~~~ 92 (337) +.+..... ...-.+.|-+.+.+.+.+.+++.+.+++..+++++....|..... ...++.++-+..+. .....+... T Consensus 1 ~l~~~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~-~~~~~~~~~ 79 (293) T protein:vir:48 1 MLDSKTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAG-KIADIDDPK 79 (293) T ss_pred CceeecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCc-ccccccccc Confidence 33222222 223456777777899999999999999999999999888875543 23334444333222 121133456 Q ss_pred cCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhhhhhccchhHHHHH Q lcl|Aclame:pro 93 LDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQY 172 (337) Q Consensus 93 l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anPllqDVN~GWlq~~ 172 (337) ++...+.|++.---..|+.+.|+... .++++.+++.++++++.-.-.--++|..- T Consensus 80 ~~~i~l~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~la~~~~~~~~~~i~~g~~~----------------------- 134 (293) T protein:vir:48 80 LSLIKYTIKRYAGISTVTNSLLADSA--ENILAWLSGWIAKKVVVTRNKAILGVVDK----------------------- 134 (293) T ss_pred eeEEEEeeeEEEEeehhhHHHHhhhh--HHHHHHHHHHHHHHHHHHHHhHHhhcccc----------------------- Confidence 77888999999888899999998654 57888888888888765221111122110 Q ss_pred HHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHHHHHhccCChHHHHH Q lcl|Aclame:pro 173 RERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNATQAPTERLA 252 (337) Q Consensus 173 Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~~l~n~~~~ptE~~A 252 (337) . +..+.=.+.|.|+- ++..+ ++.++.. -+++|.+..++. -..|-.....|- .. 
T Consensus 135 -----------~---------~~~~~~~~~d~i~~-~~~~l-~~~~~~~--a~~vmn~~~~~~-L~~lkd~~g~~l--~~ 187 (293) T protein:vir:48 135 -----------L---------PTKPTLTKWDDIID-LEAKV-DPAIKQT--SFFLTNTSGFTA-LKKVKNALGDYL--ME 187 (293) T ss_pred -----------c---------cccccccCHHHHHH-HHHhh-hhhhcCC--CEEEEcHHHHHH-HHHhhccCCceE--ee Confidence 0 00111123454443 55544 5556654 378889887753 122222222210 00 Q ss_pred HHH-HHhhhhhcCccccccC--ccCCCc-----eEEecchhc-EEEEecCceEEEEEE---cccccceeceeeeeeeeee Q lcl|Aclame:pro 253 ADL-IVSQKRIGNLPAVRVP--FFPKRA-----LMVTKLSNL-SIYYQEGARRRTLKE---VPERDRIENYESSNDAYVV 320 (337) Q Consensus 253 ~~~-~~~~k~iGGlpa~~vP--ffP~~~-----iliT~l~NL-siY~Q~gs~RR~~~d---~p~r~rve~y~s~Ne~YvV 320 (337) ..+ -....++-|+|++.++ ++|..+ +++-.+++. -+..+.+..=..... .-+++.+--+-..--++++ T Consensus 188 ~~~~~~~~~~l~G~Pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~ 267 (293) T protein:vir:48 188 RDVKSPTGYSIAGFAVKEISDRWLPNASSGVMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKVRVIDRFDVVA 267 (293) T ss_pred cCcCCCCCceecceeeEEecccccCCccCCceEEEEEeccceEEEEEecceEEEEecccchhhhcCeEEEEEEEeeCcEE Confidence 000 0134588999998754 455432 566667763 344444432111111 1123333222233335566 Q ss_pred eccccEEEeecceeccC Q lcl|Aclame:pro 321 EDFGCGCVAENIELAAA 337 (337) Q Consensus 321 Ed~~~~a~ieni~~~~a 337 (337) -+..+++.++ +..+ T Consensus 268 ~~~~a~~~l~---~~~~ 281 (293) T protein:vir:48 268 TDTEAFVPAS---FKAI 281 (293) T ss_pred ecccceEEEE---eecc Confidence 6666666554 3333 No 96 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=97.28 E-value=8.9e-05 Score=42.78 Aligned_cols=298 Identities=13% Similarity=0.094 Sum_probs=143.6 Q ss_pred CChHHH-----HHHHHHHHHH------------------HHhhCchhhcceEeechH-HHHHHHHHHHhhHHHhccccee Q lcl|Aclame:pro 1 MRKETR-----QAYEKYAAQI------------------AKLNDTGDVSKKFAVEPT-VQQRLETKMQESSEFLKRINVL 56 (337) Q Consensus 1 
M~~~tr-----~~~~~y~~~~------------------a~~ngv~~~~~~Fsv~P~-~~q~L~~~iqess~FL~~Inv~ 56 (337) +.+..+ .......... +......+..-.+.|-|. +.+.+.+.+++++.+++.+.++ T Consensus 115 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~ 194 (477) T protein:vir:84 115 LAMQTVGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYRDLDRNGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTE 194 (477) T ss_pred HHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhccccccCCCcceeeccchhHHHHHHHhhhcchHHHhhcee Confidence 000000 0000000000 000001111224556665 4678999999999999999999 Q ss_pred ccchhhceeeecc-ccccccc-ccCCCC---cccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHH Q lcl|Aclame:pro 57 PVTELEGEKLGLS-VSGPIAS-RTDTTK---AARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVI 131 (337) Q Consensus 57 ~V~~~~Ge~v~lg-v~g~ia~-Rt~t~~---~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i 131 (337) +++...|..-..- .+|+..+ -+.-+. ....|..-..++...+.+++.---+.|+.+.|+..+ ++++..+++.+ T Consensus 195 ~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l 272 (477) T protein:vir:84 195 PLPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLTDGFVQANVKTIAGQQGIAIQLLDQAA--VSVDEFVFRDL 272 (477) T ss_pred eecCCcceeEEEEEecCcceeeeeccCcccccccccccccceeeEEEeeeeEEeeeHHHHHHHhccc--hhHHHHHHHHH Confidence 9988877642211 1222222 121111 123343344577778888888888889999988765 68999999999 Q ss_pred HHHHhhhhHHhcccccccCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceee-cCCcccccHHHHHHH-- Q lcl|Aclame:pro 132 LNQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLV-GKAGDYENLDALVMD-- 208 (337) Q Consensus 132 ~~~~aLD~i~IGfnG~s~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~-g~ggdy~nLDaLv~d-- 208 (337) .++++.=.-.--++|+-.+ .+|. |.+.. . +.++++. 
+.+..+..+|.+..+ T Consensus 273 ~~~~~~~~d~~~l~G~Gt~------~~p~------Gi~~~------------~--~~~~~~~~~~~~t~~~~~~~~~~i~ 326 (477) T protein:vir:84 273 AADYANKLNVQVISGTGSN------NQVV------GVRAT------------A--GITQVTATSAGSALEKHQIIYQKIA 326 (477) T ss_pred HHHHHHHHHHHHhccCCCC------Cccc------eeeec------------c--ccccccccccccchhhHHHHHHHHH Confidence 9998865555566874321 1232 33311 0 1112222 234567778777544 Q ss_pred -HHhcccChhHcCCCCEEEEECHHHHHHHHHHHHhccCCh----H--H-----HHHHHHH-HhhhhhcCccccccCccCC Q lcl|Aclame:pro 209 -IVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNATQAP----T--E-----RLAADLI-VSQKRIGNLPAVRVPFFPK 275 (337) Q Consensus 209 -~~~~lid~~~r~~~~LVvivG~dLl~~k~~~l~n~~~~p----t--E-----~~A~~~~-~~~k~iGGlpa~~vPffP~ 275 (337) ++.. +++-++..+..+ +|.+..++ ....|-.....| . + .+..... ....++.|+|++..|++|+ T Consensus 327 ~~~~~-~~~~~~~~~~~~-v~~~~~~~-~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~pVv~s~~~p~ 403 (477) T protein:vir:84 327 DAIQR-VHTSRFLEPEVI-VMHPRRWA-SFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHGLPVVTDPTLPT 403 (477) T ss_pred HHHhh-ccccccCCccEE-EEcHHHHH-HHHHhhccCCCeeeecCcccccccccccccccccccchhcccceEecCcccc Confidence 4433 345555555544 44554433 112222221111 0 0 0000110 1234788999999999997 Q ss_pred C--------ceEEecchhcEEEEecCceEEEEEEcccccceeceeeeeeeeeeeccccEEEee------cceeccC Q lcl|Aclame:pro 276 R--------ALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYESSNDAYVVEDFGCGCVAE------NIELAAA 337 (337) Q Consensus 276 ~--------~iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~y~s~Ne~YvVEd~~~~a~ie------ni~~~~a 337 (337) + .+++-.++.+-| -++..+ +...++. 
..++ -.-.|.|.-|-.+.++- -|+...+ T Consensus 404 ~~~~~~d~~~i~~gd~~~~~i--~~~~~~--~~~~~~~--~~~~--~~~~~~v~~~~~~~~~r~~~afv~~t~~~~ 471 (477) T protein:vir:84 404 TLGTGTDQDVIHVLRASDLAL--FESSVR--MRALQET--RAEN--LSVLLQVYGYLAFTAARFPQSVVEIGGTAL 471 (477) T ss_pred cccccCCcceEEEEEeceEEE--Eeecee--EEecccc--cccc--ceeeeeehhhhhhhhhccccceEEeecccc Confidence 5 467777766533 223222 2222221 1111 11123332221111110 1222221 No 97 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=97.28 E-value=0.00011 Score=42.33 Aligned_cols=288 Identities=9% Similarity=-0.080 Sum_probs=148.8 Q ss_pred HHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCCCCcccccccccccCC Q lcl|Aclame:pro 16 IAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDTTKAARQPIDPTALDS 95 (337) Q Consensus 16 ~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t~~~~R~p~~~~~l~~ 95 (337) ||.. +.+-.+.|-+.+.+.+.+.+.+.|.+++..+++++..-. .++-.-.+++.++-.. .....|..-..++. T Consensus 1 Mat~----tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~-~~~p~~~~~~~a~wv~--Eg~~~~~~~~~f~~ 73 (311) T protein:vir:99 1 MATF----GTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGN-EDIITFNGRPKAEFVG--EGQQKSSTTGEFDF 73 (311) T ss_pred Ccee----cCCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCCc-eEEEEEeCCceeEEee--cCcccccccceeeE Confidence 4421 223456787788899999999999999999999887522 3443333444444332 22334444456788 Q ss_pred ceeEEEEeeeeeecCHHHHHHHhC-ChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhhhhhccchhHHHHHHH Q lcl|Aclame:pro 96 NRYRCEKTDYDTAIPYRKLDMWAK-FADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYRE 174 (337) Q Consensus 96 ~~Y~c~qtn~d~~i~y~~LD~WA~-~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anPllqDVN~GWlq~~Re 174 (337) ..+..++.---+.|+.+.|.++.. 
..+|.+.+++.+.++++.-.-.-.|+|.-....+ +|.+ ..+|+.+ T Consensus 74 v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~----~~~g---~~~~~~~--- 143 (311) T protein:vir:99 74 VTSTPKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGT----VIPG---WSNYLGA--- 143 (311) T ss_pred EEEeeEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCc----cccc---ccccccc--- Confidence 888888888899999999987754 5899999999999999998888888886432221 1211 1111111 Q ss_pred hchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHHHHHhccCChHHHHHHH Q lcl|Aclame:pro 175 RAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNATQAPTERLAAD 254 (337) Q Consensus 175 ~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~~l~n~~~~ptE~~A~~ 254 (337) ....++.+. .+-..+++.+.+++..+ .....+-+--.++|.+.....- ..|-.....|-=. ... T Consensus 144 ------------~~~~~~~~~-~~~~~~~~~i~~~~~~~-~~~~~~~~~~~~vmn~~~~~~L-~~lkd~~G~~l~~-~~~ 207 (311) T protein:vir:99 144 ------------ASKRVELTA-DTIANPDLAIEAAVGLL-VANGHPTPVNGLALHPSIAWGL-STARYTDGRKKFP-ELG 207 (311) T ss_pred ------------ccceeeccc-cccchhHHHHHHHHHHH-hhhccCCCccEEEEcHHHHHHH-HhhhccCCCeeec-Ccc Confidence 111222221 22234566666665433 2222222222478888776532 2222222222100 000 Q ss_pred HHHhhhhhcCccccccCccCCCceEE----------------ecchhcEEE-EecCceEEEEEEcccccceec-eeeeee Q lcl|Aclame:pro 255 LIVSQKRIGNLPAVRVPFFPKRALMV----------------TKLSNLSIY-YQEGARRRTLKEVPERDRIEN-YESSND 316 (337) Q Consensus 255 ~~~~~k~iGGlpa~~vPffP~~~ili----------------T~l~NLsiY-~Q~gs~RR~~~d~p~r~rve~-y~s~Ne 316 (337) .-....++-|+|++...++|.+.... -.++++--| ..++..=+... 
..+-+...+ |++-.- T Consensus 208 ~~~~~~~l~G~Pv~~s~~i~~~~~~~~~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~d~~ 286 (311) T protein:vir:99 208 LGIGVSSFEGIDASVSDTVNGGDEADPDDEDLDAARAVRGIVGDFANGIHWGVQRDIPVELIK-YGDPDGQGDLKRHNQI 286 (311) T ss_pred cCCCCceecceeeEeecccccccccccccchhhccCcceEEEeeccccEEEEEecCceEEEee-cCCCCcchhhhhcCcE Confidence 01123578899999999888655432 223332211 22222111111 111111111 222222 Q ss_pred eeeeeccccEEEee--cceeccC Q lcl|Aclame:pro 317 AYVVEDFGCGCVAE--NIELAAA 337 (337) Q Consensus 317 ~YvVEd~~~~a~ie--ni~~~~a 337 (337) +|-+|-+--++..+ -|.+.++ T Consensus 287 ~~r~~~r~d~~v~~~~~v~~~~~ 309 (311) T protein:vir:99 287 ALRLEIVYGWYVFTDRFVVIENA 309 (311) T ss_pred EEEEEEeecceecChhHeeeecc Confidence 23222222222221 2334444 No 98 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=97.25 E-value=0.00012 Score=42.14 Aligned_cols=284 Identities=13% Similarity=0.122 Sum_probs=151.0 Q ss_pred CChHHHHHHHHHHHHH-----HHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeeccc--ccc Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQI-----AKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSV--SGP 73 (337) Q Consensus 1 M~~~tr~~~~~y~~~~-----a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv--~g~ 73 (337) +...-+..|.+|+..- ..........-.+.|-+.+...+.+.+.+.+.+++..++++++...|....... 
.++ T Consensus 86 ~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 165 (397) T protein:vir:49 86 VKANFVKDFKNLVRGRYQNLLDSKTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKWADITG 165 (397) T ss_pred HHHHHHHHHHHHhhcchhhHHHhhhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEeeccCCc Confidence 4444555566665421 111111122234677667778999999999999999999999988887553322 223 Q ss_pred cccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcC Q lcl|Aclame:pro 74 IASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATT 153 (337) Q Consensus 74 ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~T 153 (337) .+.-+..+. .....+...++...+.+++.---+.|+.+.|+.-. .+|...+.+.+.++++.-.-.--++|+-... T Consensus 166 ~a~~v~E~~-~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~~~~~~~d~ail~G~g~~~-- 240 (397) T protein:vir:49 166 LAKLDDEGG-QIGQNDDPKLSLIRYAIKRYAGISTVTNSLLADSA--ENILAWLSGWIAKKVVVTRNKAILEAIGTLP-- 240 (397) T ss_pred ceeeecccc-ccccccccceeeeEeeeeeeEeehhhHHHHHhhhh--HHHHHHHHHHHHHHHHHHHHHHHHhcccccc-- Confidence 333332221 11112233466677777777777788888886532 5789999999999988866666667743210 Q ss_pred ChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHH Q lcl|Aclame:pro 154 DRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELL 233 (337) Q Consensus 154 d~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl 233 (337) | .. .-.+.|.++ +++.. +++.|+.. 
-+++|.+..+ T Consensus 241 -----~---------------------------~~---------~~~~~d~i~-~~~~~-l~~~~~~~--a~~v~n~~~~ 275 (397) T protein:vir:49 241 -----N---------------------------KP---------TLAKWDDII-DLQAK-VDPAIKQT--SLFLTNTSGF 275 (397) T ss_pred -----c---------------------------cc---------cccCHHHHH-HHHHh-hhhhhcCC--CEEEEcHHHH Confidence 0 00 002345544 45554 46666554 4889999887 Q ss_pred HHHHHHHHhccCChHHHHHHHHH-HhhhhhcCccccccC--ccCCC-----ceEEecchhc-EEEEecCceEEEEEE--- Q lcl|Aclame:pro 234 HDKYFPIVNATQAPTERLAADLI-VSQKRIGNLPAVRVP--FFPKR-----ALMVTKLSNL-SIYYQEGARRRTLKE--- 301 (337) Q Consensus 234 ~~k~~~l~n~~~~ptE~~A~~~~-~~~k~iGGlpa~~vP--ffP~~-----~iliT~l~NL-siY~Q~gs~RR~~~d--- 301 (337) .. -..|-+..+.|- ....+. ....++-|+|++.++ .+|.. .+++-.|++- -++.+.|-.=..... T Consensus 276 ~~-l~~lkd~~g~~l--~~~~~~~g~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~ 352 (397) T protein:vir:49 276 TA-LKKVKNAMGDYL--MERDVKSPTGYSIDGFVVKEISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQHLSLLSTNIGGG 352 (397) T ss_pred HH-HHHhhccCCcee--ecccccCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecccEEEEeccccc Confidence 63 222322222220 000010 123578999998755 45643 3677777763 333343332111111 Q ss_pred cccccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 302 VPERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 302 ~p~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) .-.++.+--.-..--+..|-+...++.+.-=..+.+ T Consensus 353 ~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~~~~ 388 (397) T protein:vir:49 353 AFETDTTKVRVIDRFDVVSTDTEAFVPASFKAIADQ 388 (397) T ss_pred hhhcCeeeEEEEEeeccEEecccceEEEEecccccc Confidence 112333222222333455566666665541111111 No 99 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=97.23 E-value=0.00012 Score=42.02 Aligned_cols=277 Identities=8% Similarity=0.028 Sum_probs=137.2 Q ss_pred CChHHHHHHHHHHHHHHH-------hhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccc Q 
lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAK-------LNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGP 73 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~-------~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ 73 (337) ........+..+...... ..|+....-.+.|-+.....+.+.+.+.+.+++.++++++..-.+....+..+++ T Consensus 103 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 182 (394) T protein:vir:97 103 NDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATT 182 (394) T ss_pred hhhhhhhhHHHHHHHHHhhhhhhhhccccccccccccChHHHHHHHHHHhhhhhhhhhhceeeeccCcceEEEEEecCCC Confidence 011112222222222221 1122222334566677888999999999999999999999888777655543332 Q ss_pred cccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcC Q lcl|Aclame:pro 74 IASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATT 153 (337) Q Consensus 74 ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~T 153 (337) -++-+.. +......+...++...+.+++.=--+.|+.+.|+.= .++|+..+.+.++++++.-.-.--.+|... T Consensus 183 ~~~~v~E-~~~~~~~~~~~~~~v~l~~~k~~~~i~is~ell~ds--~~~~~~~i~~~la~~~~~~~~~~i~~g~~~---- 255 (394) T protein:vir:97 183 KMVTVAE-LEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDA--DVDLVGIVSESISQIKVNTTNDAIAKVLKS---- 255 (394) T ss_pred ccceecc-cccccccccccceeEEeehhheeeehhhHHHHHhhh--hHHHHHHHHHHHHHHHHHHHHHHHhhcccc---- Confidence 2222211 112211233456666777776665667777777632 257888888888887775211111111100 Q ss_pred ChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHH Q lcl|Aclame:pro 154 DRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELL 233 (337) Q Consensus 154 d~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl 233 (337) +.+..-.+.|.++ ++++..+++.+. 
=+++|.+..+ T Consensus 256 ----------------------------------------~~~~~~~~~~~~~-~~~~~~~~~~~~----a~~v~n~~~~ 290 (394) T protein:vir:97 256 ----------------------------------------FTTKTVKNLDEIK-ALLNGGFDPAYN----VSLIVSQSFY 290 (394) T ss_pred ----------------------------------------ccccccccHHHHH-HHHHhhhhhhhC----CEEEEcHHHH Confidence 1111223456555 455666777653 2688988776 Q ss_pred HHHHHHHHh-ccCChHHHHHHHHH-HhhhhhcCccccccCc--cCCCceEEecchhcEEEEecCceEEEEEEccccccee Q lcl|Aclame:pro 234 HDKYFPIVN-ATQAPTERLAADLI-VSQKRIGNLPAVRVPF--FPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIE 309 (337) Q Consensus 234 ~~k~~~l~n-~~~~ptE~~A~~~~-~~~k~iGGlpa~~vPf--fP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve 309 (337) .. +..+. ..+.|- ....+. ....++-|+|++..|. +|.+.+++=.+++...++-+....-...+++.....- T Consensus 291 ~~--l~~lkd~~G~~i--~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~~~~~~~~ 366 (394) T protein:vir:97 291 QT--LDTLKDGNGRYL--LQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEIYGQYL 366 (394) T ss_pred HH--HHHhhccCCCee--eecCcCCCCCceeccceeEEecccccCCccEEEeeccccEEEEEecceEEEEecccccceeE Confidence 53 22232 222211 000000 1234788999998774 6777788877776433332333322222222211100 Q ss_pred ceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 310 NYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 310 ~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) -.+-| -+..|-+...++. 
|++..+ T Consensus 367 ~~~~r-~d~~v~~~~a~~~---~~~~~~ 390 (394) T protein:vir:97 367 QAVLR-FGVSKVDDKAGYY---VTFTPE 390 (394) T ss_pred EEEEE-EccEEecccceEE---EEeccc Confidence 01111 1223333333333 333333 No 100 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=97.15 E-value=0.00015 Score=41.56 Aligned_cols=282 Identities=8% Similarity=0.028 Sum_probs=133.5 Q ss_pred CChHHHHHHHHHHHHHHH--hhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeeccccccccccc Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAK--LNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRT 78 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~--~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt 78 (337) .....+..|..++..--. ..........|.|-..+...+.. +.+.+..++.+++++++...+.......+++.++-. T Consensus 136 ~~~~~~~~~~~~~~~~e~~~~~~~~~~~~g~lvp~~~~~~i~~-~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 214 (437) T protein:vir:10 136 IADKKVTAFADYLKTGEVRDVTGIALKDGKVIIPETILTPEKE-VHQFPRLGSLVRTESVTTTTGKLPIFNNSTDLLTAH 214 (437) T ss_pred HHHhhhhhhHHHHHhhhhhhhhhcccccccccchHHHHHHHHH-hhhhhhhhhcceeEeeccCceeeEEeeccccccccc Confidence 122222333333332111 11112223445554555555544 566777888999999887777655443333333222 Q ss_pred CCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhh Q lcl|Aclame:pro 79 DTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQAN 158 (337) Q Consensus 79 ~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~an 158 (337) .-+ ..+...+-..++...+..++.---+.|+.+.|+... 
++|+..+++.+.++++.-.-.-=+||...+ T Consensus 215 ~e~-~~~~e~~~~~~~~v~~~~~k~~~~~~is~ell~ds~--~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~-------- 283 (437) T protein:vir:10 215 TEY-GQTTKNATPVITPILWDLKTYTGGYVFSQELISDSS--YDWQAELQSRLIELRDNTDDSLIITALTDG-------- 283 (437) T ss_pred ccc-ccccccccccceeeeeehhheeeehhhhHHHHhhhH--HHHHHHHHHHHHHHHHHHHHHHHhhhhccc-------- Confidence 221 111111222345555555555555678888888643 578888888888888753222222332100 Q ss_pred hhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHH Q lcl|Aclame:pro 159 PLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYF 238 (337) Q Consensus 159 PllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~ 238 (337) ..+.. .+. +.|.+ .|+++.-+++.|+... +++|.+..+.. -. T Consensus 284 ----------------------------~~~~~---~~~---~~~~~-~~~~~~~l~~~~~~~~--~~~~~~~~~~~-l~ 325 (437) T protein:vir:10 284 ----------------------------IKKTT---STY---LLGDL-KKVLNVTLKPQDSAAA--SIVMSQSAYNL-FD 325 (437) T ss_pred ----------------------------ccccc---ccc---chhhH-HHHHHhhhhhhhhcCC--EEEEcHHHHHH-HH Confidence 00000 011 12222 3445445678887654 88999988663 22 Q ss_pred HHHhccCChHHHHHHHHH-HhhhhhcCccccccCcc--CCCc-----eEEecchhcE-EEEecCceEEEEEEccccccee Q lcl|Aclame:pro 239 PIVNATQAPTERLAADLI-VSQKRIGNLPAVRVPFF--PKRA-----LMVTKLSNLS-IYYQEGARRRTLKEVPERDRIE 309 (337) Q Consensus 239 ~l~n~~~~ptE~~A~~~~-~~~k~iGGlpa~~vPff--P~~~-----iliT~l~NLs-iY~Q~gs~RR~~~d~p~r~rve 309 (337) .|-...+.|- ....+. ....++-|+|++..+.+ |..+ +++=.|++.- |+...+..= .+ .+.++-.. 
T Consensus 326 ~lkd~~g~~~--~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~r~~~~~-~~--~~~~~~~~ 400 (437) T protein:vir:10 326 MATDAMGRPL--LQPNVTAATGYTLLGKTVVIVDDKLFPSASAGDVNIVVAPLKKAVINFKLTEITG-QF--QDTYDIWY 400 (437) T ss_pred HhhccCCCee--eccCccCCCCcccccceeEEecccccCCcCCCceEEEEeeccccEEEEeeeceEE-EE--eccccccc Confidence 2222222221 000111 12458999999998765 5443 6666666543 322233221 11 11121111 Q ss_pred ceee--eeeeeeeeccccEEEee----cceeccC Q lcl|Aclame:pro 310 NYES--SNDAYVVEDFGCGCVAE----NIELAAA 337 (337) Q Consensus 310 ~y~s--~Ne~YvVEd~~~~a~ie----ni~~~~a 337 (337) .+.. .--+..|=|...++.+- -+...+| T Consensus 401 ~~~~~~~r~d~~~~~~~a~~~l~~~~~~~~~~~~ 434 (437) T protein:vir:10 401 KQLGIFLRQNVVQASKDLIVNLTGKLKAVTVVQS 434 (437) T ss_pred ceeeEEEEEccEEecccceEEEEeeccccccCCC Confidence 1110 01144555666666653 3333333 No 101 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=97.12 E-value=0.00016 Score=41.36 Aligned_cols=284 Identities=10% Similarity=0.041 Sum_probs=135.3 Q ss_pred CChHHHHHHHHHHHHH----HHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeeccccccccc Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQI----AKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIAS 76 (337) Q Consensus 1 M~~~tr~~~~~y~~~~----a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~ 76 (337) .+...+..|..++... ....++....-.+.|-+.+...+.+.+++.+.+++.++++++..-.+...-. 
..++.++ T Consensus 92 ~~~~~~~~~~~~~~~~~~~~~~ra~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~-~~~~~~~ 170 (421) T protein:vir:13 92 KRSLQLSAMSKTIRGIQLSEEERDIMSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVIPVNRNAGKMPVR-AGASVDK 170 (421) T ss_pred HHHHHHHHHHHhhhccchhHHHhhccccCCcceecchhhHHHHHHHHHhhhhhhhhceeeeccCCceEEEEe-ecCCccc Confidence 1111222333333211 1112233333455676677788999999999999999999988776654322 1222221 Q ss_pred ccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChh Q lcl|Aclame:pro 77 RTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQ 156 (337) Q Consensus 77 Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~ 156 (337) =...+.+.-.|..-..++...+..++.---..|+.+.|+. + -++|+..+++.+.+++++ -.||. .. T Consensus 171 ~~~~~E~~~~~~s~~~f~~i~~~~~k~~~~v~iS~ell~d-s-~~~l~~~i~~~la~~~~~-----~~~~~-------i~ 236 (421) T protein:vir:13 171 LANLAKDTELVKAMLKTQPMAYDIDDYGLLAPIDNSLLED-S-EINFLEFVNEEFAEFAVN-----TENAE-------IV 236 (421) T ss_pred eeeccccccccccccceeEEEeeeeeeEeehhhhHHHHhh-h-HHHHHHHHHHHHHHHHHH-----Hhhhh-------Hh Confidence 1111222222333344555556555555556677777764 2 257888888888887763 11211 00 Q ss_pred hhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHH Q lcl|Aclame:pro 157 ANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDK 236 (337) Q Consensus 157 anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k 236 (337) | .|.-++. .+ ...+ .|.++ +++..+ ++.++.. 
-+++|.+..+..- T Consensus 237 -~-----------------~~~g~~~----~~------~~~~---~d~i~-~~~~~l-~~~~~~~--a~~v~n~~~~~~l 281 (421) T protein:vir:13 237 -K-----------------QAKAVLA----EE------TIND---YAGLV-KTINSL-VPNARKR--AIIVTNSDGRAYL 281 (421) T ss_pred -h-----------------hhhhccc----cc------cccc---hHHHH-HHHHHh-hhhhcCC--CEEEEcHHHHHHH Confidence 0 1111111 11 1123 34433 455554 4445443 4788888776532 Q ss_pred HHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCc-----eEEecchhcEEEEecCceEEEEEEccc--cccee Q lcl|Aclame:pro 237 YFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRA-----LMVTKLSNLSIYYQEGARRRTLKEVPE--RDRIE 309 (337) Q Consensus 237 ~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~-----iliT~l~NLsiY~Q~gs~RR~~~d~p~--r~rve 309 (337) ..|-.....|-=.-... ....+|-|+|++..+++|... +++-.+++.-..+.++..+-...+++. ++.+- T Consensus 282 -~~lkd~~G~~i~~~~~~--~~~~tl~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~f~~~~~~ 358 (421) T protein:vir:13 282 -DGLMDKQGRPLLKELSD--GGDLVFKGRPVIELEESIFDVGDETKFIVSDFKTLIKFMDRKQYLIDQSKEAGYTKNETI 358 (421) T ss_pred -HHhhcCCCceeecCcCC--CCCceecceeeEEeccccccCCCceEEEEEeccccEEEEEecceEEEeecccccccCeeE Confidence 22222221111000000 123578999999999999764 688888885434444455544444432 11111 Q ss_pred ceeeeeeeeeeeccccEEEeecc------eeccC Q lcl|Aclame:pro 310 NYESSNDAYVVEDFGCGCVAENI------ELAAA 337 (337) Q Consensus 310 ~y~s~Ne~YvVEd~~~~a~ieni------~~~~a 337 (337) ---..--++++=+.+.++++.-. 
.+.++ T Consensus 359 ~r~~~r~d~~~~~~~a~~~~~~~~~~a~v~~~~~ 392 (421) T protein:vir:13 359 ARIIERFDVNSPLDKSSDAEKIRKFGVIVKLQEV 392 (421) T ss_pred EEEEeeecceeecchhhheeeecccceeeccccc Confidence 00001112223333333222211 11111 No 102 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=97.00 E-value=0.00022 Score=40.66 Aligned_cols=293 Identities=12% Similarity=0.128 Sum_probs=140.3 Q ss_pred CChH-HHHHHHHHHHHHH-------------Hh-----------------hCc---hhhcceEeechHHHHHHHHHHHhh Q lcl|Aclame:pro 1 MRKE-TRQAYEKYAAQIA-------------KL-----------------NDT---GDVSKKFAVEPTVQQRLETKMQES 46 (337) Q Consensus 1 M~~~-tr~~~~~y~~~~a-------------~~-----------------ngv---~~~~~~Fsv~P~~~q~L~~~iqes 46 (337) ..+. ....|..++..++ +. .|. ....-.|.|.....+.+.+.+.+. T Consensus 286 ~~~~~kg~~f~~~~~al~~~~g~~~~a~e~a~~~~~~~~~~~~~~~~a~~~~~~~~~~~~Gg~~vp~~~~~~ii~~l~~~ 365 (645) T protein:vir:93 286 EQKLDKGIGFARFAKSLAAAKGVRSEALEVARRQYPDDSRLHHVLKSAVGAGTTTDPQWAGSLSEYQEYAQDFIDYLRPQ 365 (645) T ss_pred hhhhhhhhhHHHHHHHHHhcccchhHHHHHHHhhcccchhhhhhhhhhhhccccccccccCCccCchhhHHHHHHhhhhh Confidence 0000 0011222221111 11 011 011134556666778899999988 Q ss_pred HHHhcccce-ec-cchhhc-eeeecccccccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhH Q lcl|Aclame:pro 47 SEFLKRINV-LP-VTELEG-EKLGLSVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADF 123 (337) Q Consensus 47 s~FL~~Inv-~~-V~~~~G-e~v~lgv~g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF 123 (337) |-+.+.-.. ++ .....| .++-.-.+|+.++=+.. 
+...|..-..++...+..++.---+.|+=+.|+.- -+++ T Consensus 366 svv~~l~~~~~~~~~~~~~~~~ip~~t~~~~a~wv~E--g~~~~~s~~~f~~v~l~~~kla~~~~iS~ell~ds--~~~~ 441 (645) T protein:vir:93 366 TIIGRFGQGGIPALRQVPFNIRVHAQVSGGAAGWVGE--GKTKPLTKFDFESITFSHAKVSAIAVLTEELIRFS--SPAA 441 (645) T ss_pred hhHHhhccccccccccccCceeeeeeecCcceEEecc--CccccccccceeEEEEeeEEEEEeehhHHHHHhhc--hHHH Confidence 877655322 11 111122 23333334444443322 22334344467777777776655555565666533 3788 Q ss_pred HHHHHHHHHHHHhh--hhHHhcccccccCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecC-Ccccc Q lcl|Aclame:pro 124 QQRIRDVILNQGAL--DRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGK-AGDYE 200 (337) Q Consensus 124 ~~r~~~~i~~~~aL--D~i~IGfnG~s~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~-ggdy~ 200 (337) +..+++.+.+.++. |...|+=.|+-.+ .. .|.+ +..... .+.. +..+. T Consensus 442 ~~~i~~~l~~aia~~~d~a~l~g~g~~~~-~~----~p~g------------------i~~~~~------~~~~~~~~~~ 492 (645) T protein:vir:93 442 DALVRNALAEAVVARLDTDFVDPKKAAVA-DV----SPAS------------------ITHDVK------GTASSGNPDA 492 (645) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCCcccC-Cc----cccc------------------eecccc------ccccccchHH Confidence 88899988888875 6665533332211 11 1211 111100 0111 22334 Q ss_pred cHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEE Q lcl|Aclame:pro 201 NLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMV 280 (337) Q Consensus 201 nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~ili 280 (337) ++..+...+...-+ +-+.-|++|.+.....- ..+-.....| +--++-....++-|+|++...++|++-++. 
T Consensus 493 d~~~~~~~~~~a~~-----~~~~a~~vmn~~~~~~L-~~lkd~~G~~---~~~~~~~~~~tL~G~PV~~s~~vp~~~~~g 563 (645) T protein:vir:93 493 DAEAAFGQFVAANL-----QPTGAVWLMSSTNALAL-SMRKNALGQK---EYPDMTLLGGSFQGLPVIVSQYVGDQLVLV 563 (645) T ss_pred HHHHHHHHHHhcCC-----CccccEEEEcHHHHHHH-HhccccCCce---eecCCCCCCceeeceeeEEeccCCcceeEe Confidence 45544443322211 12346899999866532 1121111111 101111234589999999999999875544 Q ss_pred ecchhcEEEEecCceE--------EEEEEccccccee-------c-eee--------eeeeeeeeccccEEEeecceecc Q lcl|Aclame:pro 281 TKLSNLSIYYQEGARR--------RTLKEVPERDRIE-------N-YES--------SNDAYVVEDFGCGCVAENIELAA 336 (337) Q Consensus 281 T~l~NLsiY~Q~gs~R--------R~~~d~p~r~rve-------~-y~s--------~Ne~YvVEd~~~~a~ieni~~~~ 336 (337) .++.+-|- ..+... -.+.+.|.-+... + |+. .--+|.|=+.++++.+.+|+++- T Consensus 564 -d~s~~~ig-~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt~~~~g~ 641 (645) T protein:vir:93 564 -NAPDIYLA-DDGGVAVDMSREASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVITGVNYGS 641 (645) T ss_pred -ccccEEEE-EecceEEEeecceeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEEEecccCCc Confidence 45544322 222222 1122222222111 1 221 22367778899999999999999 Q ss_pred C Q lcl|Aclame:pro 337 A 337 (337) Q Consensus 337 a 337 (337) | T Consensus 642 ~ 642 (645) T protein:vir:93 642 A 642 (645) T ss_pred c Confidence 9 No 103 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=96.97 E-value=0.00012 Score=42.02 Aligned_cols=292 Identities=12% Similarity=0.024 Sum_probs=139.4 Q ss_pred CChHHHHHHHHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCC Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t 80 (337) +.++.|..|+++... . 
+....|.|-+.+..++.+.+.+.|.+++.++++++.- +.++-...+++.++=+.- T Consensus 72 lt~~e~~~~~~~~~~------~-~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~~~--~~~i~~~~~~~~a~w~~e 142 (383) T protein:vir:78 72 ITNEEIKFFNDINKE------V-GYKEETLLPQTVVDEIFEDLTTEHPFLASIGMRTTGL--RTKFLKSETSGVAVWGKI 142 (383) T ss_pred hhHHHHHHHHHHhcc------C-CCCCccccCHHHHHHHHHHHHhhccceeeeeeEecCC--ceEEEEEcCCcceEEeec Confidence 455555555433221 1 2234578888899999999999999999999988752 234444444444432222 Q ss_pred CCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhhh Q lcl|Aclame:pro 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) Q Consensus 81 ~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anPl 160 (337) . .++.+..-..++...+.+++.=--..|+.+.|+.=. .+++..+++.+.+++|.=.-.--++|+-. .. | T Consensus 143 ~-~~~~~~~~~~f~~i~l~~~kl~~~i~is~ell~Ds~--~~ie~~i~~~l~~~~a~~~~~a~i~G~G~---~q----P- 211 (383) T protein:vir:78 143 F-GEIKGQLDATFSDEESIQNKLTAFVVVPKDLEKFGP--AWVKRFVVTQIEEAFAVALESAYIVGDGN---DK----P- 211 (383) T ss_pred c-cccccccCcceeeEeecceeeEeeccchHHHhhccH--HHHHHHHHHHHHHHHHHHHhhheEeccCC---CC----c- Confidence 1 223222223455556666666566789999997522 36888888888888886444445566431 11 2 Q ss_pred hhccchhHHHHHHHhchhhhccccccccCceeecC--CcccccHHHHHHHHHhccc----ChhHcCCCCEEEEECHHHHH Q lcl|Aclame:pro 161 LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGK--AGDYENLDALVMDIVSSMI----DPWFQEDTGLVVICGRELLH 234 (337) Q Consensus 161 lqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~--ggdy~nLDaLv~d~~~~li----d~~~r~~~~LVvivG~dLl~ 234 (337) +|+|..+= .......+.. ..+...|. ..+-.++-.++..+.+..- ....+-...++++|++.-.. 
T Consensus 212 -----~Gil~~~~---~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 282 (383) T protein:vir:78 212 -----IGLNRKVG---KGSTVVDGVY-AEKAATGTLTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDAW 282 (383) T ss_pred -----eeeeeccC---Cccccccccc-ccccccchhhhhhhHHHHHHHHHHHhccchhcccchhhhcCceEEEEcCcchh Confidence 35543110 0000011100 00111111 0111222222222222110 01112345678888874222 Q ss_pred HHHHHHHh---ccCChHHHHHHHHHHhhhhhc--CccccccCccCCCceEEecchhcEEEEecCceEEEEEEccc--ccc Q lcl|Aclame:pro 235 DKYFPIVN---ATQAPTERLAADLIVSQKRIG--NLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPE--RDR 307 (337) Q Consensus 235 ~k~~~l~n---~~~~ptE~~A~~~~~~~k~iG--Glpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p~--r~r 307 (337) + -.|.+. ....|. ++- |++.+.-+++|++.++.-.++.--|.. ++..|-..-++-. +++ T Consensus 283 ~-~~~~~~~~~~~G~~~------------t~l~~~~~iv~s~~~p~~~iifgdfs~Y~i~~-r~~~~i~~~~~~~f~~d~ 348 (383) T protein:vir:78 283 D-VKKQYTSLNANGVYV------------TALPFNLNIIESLFVPEKKAISYVAERYDALI-GGPLDIGTYDQTLAIEDL 348 (383) T ss_pred h-hccchhccCCCCcee------------eecCCCceEEecCCCCcccEEEeeccceEEEe-cccceEEecchhhhhcCc Confidence 1 122221 111111 222 444667799999999988888755543 3333322211110 111 Q ss_pred eeceee-eeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 308 IENYES-SNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 308 ve~y~s-~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) +.-.-. 
|-++ -+=|.+.+..++ |++.++ T Consensus 349 ~~f~~~~r~dG-~~~~~~A~~vl~-~~~~~~ 377 (383) T protein:vir:78 349 NLYAAKQFAYG-KAKDDKAAAVWT-LNINPA 377 (383) T ss_pred eEEEEEEEEcC-EEecCCeEEEEE-EEecCC Confidence 111111 1111 222444444554 566666 No 104 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=96.87 E-value=0.00013 Score=41.81 Aligned_cols=281 Identities=11% Similarity=0.048 Sum_probs=138.0 Q ss_pred CChHHHHHHHHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCC Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t 80 (337) ++.+-|+.|+++... . +..-.+.|-+....++.+.+.+.|.+++.++++++.- +.++....+++.++=..- T Consensus 65 l~~~e~~~~~~~~~~------t-~~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~~~~~--~~~i~~~~~~~~a~W~~e 135 (381) T protein:vir:10 65 LSANQRNFFMDINKS------V-GYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWGKI 135 (381) T ss_pred cCHHHHHHHHHHhhc------C-CCCCceecCHHHHHHHHHHHHhhcceeeeeeeEecCc--ceEEEeecCCcceEEeec Confidence 444444444432211 1 1223467888899999999999999999999998742 334444444444432221 Q ss_pred CCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhhh Q lcl|Aclame:pro 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) Q Consensus 81 ~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anPl 160 (337) ..++.+..-..++...+.+++.---..|+.+.|+.-. .|++..++..+.+++|.=.-.-=.||+-. . 
-| T Consensus 136 -~~~~~~~~~~~f~~i~l~~~kl~a~i~is~elL~Ds~--~~le~~i~~~la~~~a~~~~~afi~GdG~---~----qP- 204 (381) T protein:vir:10 136 -YGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGP--AWIERFVRVQIEEAFAVALETAFLKGTGK---D----QP- 204 (381) T ss_pred -ccccccccCccceeEeecceeEEeeccccHHHHhccH--HHHHHHHHHHHHHHHHHHhhceeEecccC---C----Cc- Confidence 1233322223466666777777677889999998764 36778888888887775332222355431 1 13 Q ss_pred hhccchhHHHHHHHhchhhhcccccc----ccCceeec-CCcccccHHHHHHHHHhcccChh--HcCCCCEEEEECHHHH Q lcl|Aclame:pro 161 LQDVNIGWLQQYRERAAQRVLHEGAK----QAGKVLVG-KAGDYENLDALVMDIVSSMIDPW--FQEDTGLVVICGRELL 233 (337) Q Consensus 161 lqDVN~GWlq~~Re~a~~~v~~~~~~----~~~~i~~g-~ggdy~nLDaLv~d~~~~lid~~--~r~~~~LVvivG~dLl 233 (337) +|+|..+ ++......+.. ..+.++.- ....|..|.+++..+. ...-. .......+++|.+.-. T Consensus 205 -----~Gil~~~---~~~~~~~~g~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~--~~~~~~~~~~~~~~~~vmn~~t~ 274 (381) T protein:vir:10 205 -----IGLNRQV---QKGVSVTDGAYPEKEEQGTLTFANPRATVNELTQVFKYHS--TNEKGKSVAVKGNVTMVVNPSDA 274 (381) T ss_pred -----eeeeecC---CccccccccccccccccccccccchhhHHHHHHHHHHhhh--hhhccccccccCceEEEEchhhH Confidence 4665321 11111111110 01111110 0112333444333321 11111 1123466888887654 Q ss_pred HHHHHHHH---hccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcccccceec Q lcl|Aclame:pro 234 HDKYFPIV---NATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIEN 310 (337) Q Consensus 234 ~~k~~~l~---n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~ 310 (337) .. -.++. ++...+.- ..--|.|++.-|++|++.+++--+++--|.-..|.+=+.. 
+ T Consensus 275 ~~-l~~~~~~~~~~G~~v~----------~lp~g~~vv~~~~~p~~~i~fGDfs~Y~i~~r~~~~i~~~---~------- 333 (381) T protein:vir:10 275 FE-VQAQYTHLNANGVYVT----------ALPFNLNVIESTVQEAGKVLTYVKGLYDGYLAGGINVQKF---K------- 333 (381) T ss_pred Hh-hccccccCCCCCceee----------cCCCCceeEEcCCCCcCcEEEEEcccEEEEEecccEEEee---c------- Confidence 42 22221 11111110 0012677888999999999999988866654443321111 1 Q ss_pred eeeeeeeeeeeccccEEEee----cc-----------e---eccC Q lcl|Aclame:pro 311 YESSNDAYVVEDFGCGCVAE----NI-----------E---LAAA 337 (337) Q Consensus 311 y~s~Ne~YvVEd~~~~a~ie----ni-----------~---~~~a 337 (337) +.|..+|.-.+-++. .+ + ..+| T Consensus 334 -----~~~~~~d~~~f~a~~r~dG~~~~~~A~~v~~l~~~~~~~~ 373 (381) T protein:vir:10 334 -----ETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKPA 373 (381) T ss_pred -----hhhhhcCceEEEEEEEEcCEEecCCcEEEEEEeecCCccc Confidence 123333332222222 01 1 1111 No 105 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=96.83 E-value=0.00031 Score=39.83 Aligned_cols=294 Identities=11% Similarity=0.070 Sum_probs=142.5 Q ss_pred CChH--HHHHHHHHHHHHH---------------------H--hhCchhhcceEeechHHHHHHHHHHHhhHHHhcc-cc Q lcl|Aclame:pro 1 MRKE--TRQAYEKYAAQIA---------------------K--LNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKR-IN 54 (337) Q Consensus 1 M~~~--tr~~~~~y~~~~a---------------------~--~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~-In 54 (337) +.+. ....+..+...++ . ..+....+-.+.|-......+.+.+++++.+++. 
.+ T Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~liP~~~~~~ii~~l~~~~~l~~~~~~ 162 (428) T protein:vir:10 83 AEPKQYTGAGMTRMVMSIAAAQGNLQDAAKFASDELNDQSVSMAISTAAGSGGVLIPQNIHSEVIELLRDRTIVRKLGAR 162 (428) T ss_pred cccchhhhHHHHHHHHHHHHhhhhHHHHHHHhhhhhhhhhHhhhhcccccCCccccchhHHHHHHHHHhhhchhhhhcce Confidence 0000 0000101111000 0 0011111112345556678899999999987776 45 Q ss_pred eeccchhhce-eeecccccccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHH Q lcl|Aclame:pro 55 VLPVTELEGE-KLGLSVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILN 133 (337) Q Consensus 55 v~~V~~~~Ge-~v~lgv~g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~ 133 (337) +++.. .|. ++-.-.+++-++-+..+ ...|..-..++...+..++.---+.|+.+.|+. ..++|+..+.+.+.+ T Consensus 163 ~~~~~--~g~~~~p~~~~~~~a~~v~Eg--~~~~~~~~~f~~i~~~~~k~~~~v~is~ell~d--s~~~l~~~i~~~l~~ 236 (428) T protein:vir:10 163 SIPLP--NGNMSLPRLAGGATASYTGEN--QDAKVSEARFDDVKLTAKTMIAMVPISNALIGR--AGFNVEQLVLQDILT 236 (428) T ss_pred eeecC--CcceEEEEEeCCcceeeeccC--ccccccccceeeEEeeeEEEEEeehhhHHHHhh--hhHHHHHHHHHHHHH Confidence 55543 233 12111233334333221 222322344666667777777778899998874 137899999999999 Q ss_pred HHhhhhHHhcccccccCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCcee--ecCCcccccHHHHHHHHHh Q lcl|Aclame:pro 134 QGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVL--VGKAGDYENLDALVMDIVS 211 (337) Q Consensus 134 ~~aLD~i~IGfnG~s~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~--~g~ggdy~nLDaLv~d~~~ 211 (337) +++.-.-.--+||.-.. .+|. -+++........+. .+...++..+|.++.-+.. 
T Consensus 237 ai~~~~d~~~l~G~G~~------~~p~------------------Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 292 (428) T protein:vir:10 237 AISVREDKAFMRDDGTG------DTPI------------------GMKARATQWNRLLPWAADAAVNLDTIDTYLDSIIL 292 (428) T ss_pred HHHHHHHHHHhccCCCC------cccc------------------ccccccccccccccccccccccHHHHHHHHHHHHH Confidence 98865555556874321 1232 22221111111111 1234445555544332211 Q ss_pred c-ccChhHcCCCCEEEEECHHHHHHHHHHHHh-ccCChHHHHHHHHHHhhhhhcCccccccCccCCCc--------eEEe Q lcl|Aclame:pro 212 S-MIDPWFQEDTGLVVICGRELLHDKYFPIVN-ATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRA--------LMVT 281 (337) Q Consensus 212 ~-lid~~~r~~~~LVvivG~dLl~~k~~~l~n-~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~--------iliT 281 (337) . ..... .....+++|.+..+.. +..+. ....|- --. ..+.+|.|+|++..+++|++. +++- T Consensus 293 ~~~~~~~--~~~~~~~v~n~~~~~~--L~~lkd~~G~~i---~~~--~~~g~l~G~pv~~~~~~p~~~~~~~~~~~i~~g 363 (428) T protein:vir:10 293 MSMDGNS--NMISSGWGMSNRTYMK--LFGLRDGNGNKV---YPE--MAQGMLKGYPIQRTSAIPANLGEGGKESEIYFA 363 (428) T ss_pred hhhcccc--ccccCEEEEcHHHHHH--HHHhhccCCcee---ccC--CCCCeeeceeeEEeccccccccCCCccceEEEE Confidence 0 11111 1224578999887752 22222 221221 001 134579999999999999863 5666 Q ss_pred cchhcEEEEecCceEEEEEEcccc----cceeceeeeee---------eeeeeccccEEEeeccee Q lcl|Aclame:pro 282 KLSNLSIYYQEGARRRTLKEVPER----DRIENYESSND---------AYVVEDFGCGCVAENIEL 334 (337) Q Consensus 282 ~l~NLsiY~Q~gs~RR~~~d~p~r----~rve~y~s~Ne---------~YvVEd~~~~a~ieni~~ 334 (337) .++.+-|.. .+..+-.+-++... ..+.+++..|. 
++.|=+.++++.+.+|++ T Consensus 364 d~s~~~i~~-~~~i~i~~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 364 DFNDVVIGE-DGNMKVDFSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred ecceEEEEE-ecceEEEeecccccccccccccchhhcchhheeeeeeeCceeeccceEEEEeccCC Confidence 666544432 33333332222110 11122333332 456778888888899999 No 106 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=96.50 E-value=0.00057 Score=38.37 Aligned_cols=277 Identities=9% Similarity=0.080 Sum_probs=134.1 Q ss_pred CChHHHHHHHHHHHHHHH--------------hh--Cc-hhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhc Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAK--------------LN--DT-GDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEG 63 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~--------------~n--gv-~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~G 63 (337) +...+...+..|+..... .+ .. .+..-.+.|-+.+...+.+.+.+.+.+++.++++++...++ T Consensus 98 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~~~~ 177 (402) T protein:vir:93 98 DNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGLEI 177 (402) T ss_pred hhHHHHHHHHHHHHHHHhhhhHHHHHHhHHHHHhhhccCCCcCCccccchhHHHHHHHhHHhhhhhhhhceeeecCCcee Confidence 333332233333322110 00 00 11123467777788999999999999999999999876655 Q ss_pred eeeecccccccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhc Q lcl|Aclame:pro 64 EKLGLSVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIG 143 (337) Q Consensus 64 e~v~lgv~g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IG 143 (337) -++.. +++-++-... 
+..+...+ ..++...|..++.---+.|+.+.|+..+ ++|+..+.+.++++++.=..-.- T Consensus 178 p~~~~--~~~~a~~v~E-g~~~~~~~-~~f~~i~~~~~k~~~~i~iS~ell~Ds~--~~l~~~i~~~la~~~~~~e~~~~ 251 (402) T protein:vir:93 178 PRVSY--TLDDDDFITD-VETAKELK-AKGDTVKFTTNKFKVFAAISDTVIHGSD--VDLVNWVENALQSGLAAKERKDA 251 (402) T ss_pred eeeec--cCCccccccc-cccccccc-cccceeeecceeeeeechhhHHHHhhhH--HHHHHHHHHHHHHHHHHHHHHhH Confidence 44432 2222322221 22222222 3356666666666666788999888653 67899999999998876211111 Q ss_pred ccccccCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCC Q lcl|Aclame:pro 144 WNGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTG 223 (337) Q Consensus 144 fnG~s~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~ 223 (337) |. +.+....| .|++ .... ...+ ++.+ ..|.|+ +++.+ |++.|+... T Consensus 252 ~~------~g~g~g~p------~g~~------------~~~~--~~~~---~~~~--~~d~l~-~~~~~-l~~~y~~na- 297 (402) T protein:vir:93 252 LA------VSPKSGLE------HMSF------------YNGS--VKEV---EGAD--MYDAII-NALAD-LHEDYRDNA- 297 (402) T ss_pred hh------cCCCcccc------ceee------------eccc--cccc---cccc--hHHHHH-HHHhc-cChhhhcCC- Confidence 21 11111122 1222 1100 0011 1111 246555 56665 467777643 Q ss_pred EEEEECHHHHHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcc Q lcl|Aclame:pro 224 LVVICGRELLHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVP 303 (337) Q Consensus 224 LVvivG~dLl~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p 303 (337) +++|.+.-+.. ...+....+.+- .+ ....+|-|+|++.....|. +++= |+|.||.. +++... 
.+ T Consensus 298 -~~imn~~t~~~-~~~~~~d~~~~~--~~----~~~~~llG~PV~~t~~~~~--i~~G---Df~~~~~~--~~~~~~-~~ 361 (402) T protein:vir:93 298 -TIYMRYADYVK-IISVLSNGTTNF--FD----TPAEKVFGKPVVFTDAAVK--PIVG---DFNYFGIN--YDGTTY-DT 361 (402) T ss_pred -EEEEechHHHH-HHHHHhcCCCcc--cc----cCCccccccceEEecCCCc--eeee---chhhhhhh--hhhhhh-hh Confidence 67787654332 122333222221 11 1345788999999998875 5554 45555531 222221 12 Q ss_pred cccceeceeeeeeeeeeeccccEEEee--c---ceeccC Q lcl|Aclame:pro 304 ERDRIENYESSNDAYVVEDFGCGCVAE--N---IELAAA 337 (337) Q Consensus 304 ~r~rve~y~s~Ne~YvVEd~~~~a~ie--n---i~~~~a 337 (337) .++ ...-.-+|+...+--+..++ - .++..| T Consensus 362 ~~~----~~~~~~~~~~~~r~Dg~v~~~~A~~~l~ik~~ 396 (402) T protein:vir:93 362 DKD----VKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEN 396 (402) T ss_pred hhc----ccCCceEEEEEEEeCcEEechhheEEEEeecC Confidence 221 11222333332222222221 1 223333 No 107 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=96.45 E-value=0.00062 Score=38.18 Aligned_cols=310 Identities=13% Similarity=0.063 Sum_probs=135.6 Q ss_pred CChHHHHHH------HHHHHHHHHhhCch-hhcc-eEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeeccccc Q lcl|Aclame:pro 1 MRKETRQAY------EKYAAQIAKLNDTG-DVSK-KFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSG 72 (337) Q Consensus 1 M~~~tr~~~------~~y~~~~a~~ngv~-~~~~-~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g 72 (337) |....|..+ ..+...+....... ..+. ...|-..+...+++.+.+.+.+++.+++++++-. 
-.+.+...+ T Consensus 123 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~vP~~~~~~i~~~l~~~~~l~~~~~v~~~~g~--~~~~~~~~~ 200 (466) T protein:vir:80 123 MPYEQRAALIARSEVKEFLAQVRTLAQQKRAVSGAELTIPDVMLELLRDNMHRYSKLISKVRLRPLKGT--ARQNIAGAI 200 (466) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhccccccccHHHHHHHHHhhhhhhhhhhheeeeecCce--eEeeeecCC Confidence 222222221 11211111111111 1111 2334445778899999999999999999988531 122222223 Q ss_pred ccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCc Q lcl|Aclame:pro 73 PIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAAT 152 (337) Q Consensus 73 ~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~ 152 (337) +.++-+.- +......+ ..++...|.+++.---+.|+.+.|+. ..++|+..+++.++++++.=.-.--+||+- + T Consensus 201 ~~a~wv~E-~~~~~~~~-~~f~~i~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~la~~~~~~~~~ail~G~G---~ 273 (466) T protein:vir:80 201 PEGVWTEA-VANLNELS-LSFSQIEVDGYKVGGFIPIPNSTLED--SDLNLADEILDAIGQAIGFALDKAILYGTG---T 273 (466) T ss_pred cceeeccc-cccccccc-ccccceeecceeeeeehhhhHHHHhc--chHHHHHHHHHHHHHHHHHHHhhheeeccC---C Confidence 33322221 12233333 34677778888877778999999973 224789999999999877644444445532 1 Q ss_pred CChhhhhhhhccchhHHHHHHHhchhhhccccccccCce---------eecCCcccccHHHHHHHHHhcccChhHcCCCC Q lcl|Aclame:pro 153 TDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKV---------LVGKAGDYENLDALVMDIVSSMIDPWFQEDTG 223 (337) Q Consensus 153 Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i---------~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~ 223 (337) .+| +|+|...-...........+.....+ ..+..+.+...|. +..+ ..+.+.. ..+. 
T Consensus 274 ----~~P------~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~-~~~~~~~--~~~~ 339 (466) T protein:vir:80 274 ----KMP------VGIVTRLAQTTQPPNWGTKAPAWTNLSTTNLLKIDPTGKSAEEFFSEL-VLKL-SKARANY--SNGM 339 (466) T ss_pred ----CCc------ceeeecccccccccccccccccccccchhhhhhhhhhccchhhHHHHH-HHHH-Hhhhccc--cCCc Confidence 122 35543210000000000000000000 0112222222232 2221 1222222 4455 Q ss_pred EEEEECHHHHHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcc Q lcl|Aclame:pro 224 LVVICGRELLHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVP 303 (337) Q Consensus 224 LVvivG~dLl~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p 303 (337) .++++.......- ..+.-..+....-. ... .....+.|+|++.-|++|++.+++--++..-|+...|. +-..-++. T Consensus 340 ~~w~~~~~~~~~l-~~~~~~~~~~g~~~-~~~-~~~~~i~G~pvv~s~~~~~~~~~~g~~~~y~i~~r~~~-~i~~~~~~ 415 (466) T protein:vir:80 340 KFWAMSSNTHAVL-MSKAITFNSAGALV-ASL-NNTMPIVGGDIVILDFIPDNDIIGGYGSLYLLAERADI-KLAQSEHV 415 (466) T ss_pred eeEEecchhHHHh-hcccccccCCcccc-ccC-CCcccccccceeecCccCccceeeeccccEEEEeecce-EEEechhh Confidence 6777776554421 22210111111110 010 11224789999999999999998887777555543332 22211111 Q ss_pred c--ccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 304 E--RDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 304 ~--r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) . 
+|.+.-.--.--+..|=|.+.+..++-=++.++ T Consensus 416 ~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~~~~~ 451 (466) T protein:vir:80 416 RFIEDQTVFKGTARYDGKPVFGEGFVAVNIANANPT 451 (466) T ss_pred hhhcCcEEEEEEEEEccEEeccCceEEEEecCCCcc Confidence 1 111111111111222233344444331112222 No 108 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=96.41 E-value=0.00066 Score=38.04 Aligned_cols=276 Identities=8% Similarity=0.055 Sum_probs=131.2 Q ss_pred CChHHHHHHHHHHHHHHHh------------------hCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhh Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKL------------------NDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELE 62 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~------------------ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~ 62 (337) ........|..|+...... .|. +..-.+.|-+.+...+.+.+.+.+.+++.++++++...+ T Consensus 83 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~-~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~ 161 (387) T protein:vir:96 83 DNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGN-DSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGLE 161 (387) T ss_pred hhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCC-CCCCceeechhHHHHHHHHHHhhchhhhhceeeecCCce Confidence 2222222233333222110 011 112256787788999999999999999999999998666 Q ss_pred ceeeecccccccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHh Q lcl|Aclame:pro 63 GEKLGLSVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMI 142 (337) Q Consensus 63 Ge~v~lgv~g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~I 142 (337) .-++.. +++-++-..-+ ......+ ..++...|..++.---+.|+++.|+.. .++|+..+.+.++++++.-..-. 
T Consensus 162 ~p~~~~--~~~~a~~v~Eg-~~~~~~~-~~f~~v~l~~~k~~~~i~iS~ell~ds--~~~l~~~i~~~la~~~~~~e~~~ 235 (387) T protein:vir:96 162 IPRVSY--TLDDDDFITDV-ETAKELK-AKGDTVKFTTNKFKVFAAISDTVIHGS--DVDLVNWVENALQSGLAAKERKD 235 (387) T ss_pred eeeeec--cCCcccccccc-ccccccc-cccceeeechheeeeechhhHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHh Confidence 555433 22223322211 1122122 234444455444444578889988865 36789999999999887632222 Q ss_pred cccccccCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCC Q lcl|Aclame:pro 143 GWNGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDT 222 (337) Q Consensus 143 GfnG~s~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~ 222 (337) -|. +.+.+.-|. |.+ .... ...+ ++. ...|.|+ +++.+ +++-|+... T Consensus 236 ~~~------~g~g~g~~~------g~~------------~~~~--~~~~---~~~--~~~d~i~-~~~~~-l~~~y~~na 282 (387) T protein:vir:96 236 ALA------VSPKSGLEH------MSF------------YNGS--VKEV---EGA--DMYDAII-NALAD-LHEDYRDNA 282 (387) T ss_pred Hhh------cCCCccccc------eee------------eccc--cccc---ccc--chHHHHH-HHHhc-cChhhhcCC Confidence 221 111111121 111 1000 0011 111 1256554 45665 467777653 Q ss_pred CEEEEECHHHHHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEc Q lcl|Aclame:pro 223 GLVVICGRELLHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEV 302 (337) Q Consensus 223 ~LVvivG~dLl~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~ 302 (337) +++|.+.-+.. ...+....+.|- .+ ....++-|+|++.....|. +++= |+|-||. 
+ +++...+ T Consensus 283 --~~imn~~t~~~-~~~~~~~~~~~~--~~----~~~~~llG~PV~~~~~~~~--~~~G---Df~~~~~-~-~~~~~~~- 345 (387) T protein:vir:96 283 --TIYMRYADYVK-IISVLSNGTTNF--FD----TPAEKVFGKPVVFTDAAVK--PIVG---DFNYFGI-N-YDGTTYD- 345 (387) T ss_pred --EEEEechHHHH-HHHHHhcCCCcc--cc----cCCccccccceEEecCCCc--eeee---chhhhhh-h-hhhhhhe- Confidence 67787654432 223333333221 11 1345788999999998875 5554 4454542 1 2222221 Q ss_pred ccccceeceeeeeeeeeeeccccEEEee-----cceeccC Q lcl|Aclame:pro 303 PERDRIENYESSNDAYVVEDFGCGCVAE-----NIELAAA 337 (337) Q Consensus 303 p~r~rve~y~s~Ne~YvVEd~~~~a~ie-----ni~~~~a 337 (337) +..+...-.-+|++...--+..++ -+++..| T Consensus 346 ----~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~ 381 (387) T protein:vir:96 346 ----TDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred ----ecccccCCceEEEEEEEeCcEeechhheEEEEeecC Confidence 111111222233332222222221 1333333 No 109 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=96.41 E-value=0.00066 Score=38.04 Aligned_cols=276 Identities=8% Similarity=0.055 Sum_probs=131.2 Q ss_pred CChHHHHHHHHHHHHHHHh------------------hCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhh Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKL------------------NDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELE 62 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~------------------ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~ 62 (337) ........|..|+...... .|. 
+..-.+.|-+.+...+.+.+.+.+.+++.++++++...+ T Consensus 83 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~-~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~ 161 (387) T protein:vir:26 83 DNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGN-DSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGLE 161 (387) T ss_pred hhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCC-CCCCceeechhHHHHHHHHHHhhchhhhhceeeecCCce Confidence 2222222233333222110 011 112256787788999999999999999999999998666 Q ss_pred ceeeecccccccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHh Q lcl|Aclame:pro 63 GEKLGLSVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMI 142 (337) Q Consensus 63 Ge~v~lgv~g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~I 142 (337) .-++.. +++-++-..-+ ......+ ..++...|..++.---+.|+++.|+.. .++|+..+.+.++++++.-..-. T Consensus 162 ~p~~~~--~~~~a~~v~Eg-~~~~~~~-~~f~~v~l~~~k~~~~i~iS~ell~ds--~~~l~~~i~~~la~~~~~~e~~~ 235 (387) T protein:vir:26 162 IPRVSY--TLDDDDFITDV-ETAKELK-AKGDTVKFTTNKFKVFAAISDTVIHGS--DVDLVNWVENALQSGLAAKERKD 235 (387) T ss_pred eeeeec--cCCcccccccc-ccccccc-cccceeeechheeeeechhhHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHh Confidence 555433 22223322211 1122122 234444455444444578889988865 36789999999999887632222 Q ss_pred cccccccCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCC Q lcl|Aclame:pro 143 GWNGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDT 222 (337) Q Consensus 143 GfnG~s~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~ 222 (337) -|. +.+.+.-|. |.+ .... ...+ ++. ...|.|+ +++.+ +++-|+... 
T Consensus 236 ~~~------~g~g~g~~~------g~~------------~~~~--~~~~---~~~--~~~d~i~-~~~~~-l~~~y~~na 282 (387) T protein:vir:26 236 ALA------VSPKSGLEH------MSF------------YNGS--VKEV---EGA--DMYDAII-NALAD-LHEDYRDNA 282 (387) T ss_pred Hhh------cCCCccccc------eee------------eccc--cccc---ccc--chHHHHH-HHHhc-cChhhhcCC Confidence 221 111111121 111 1000 0011 111 1256554 45665 467777653 Q ss_pred CEEEEECHHHHHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEc Q lcl|Aclame:pro 223 GLVVICGRELLHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEV 302 (337) Q Consensus 223 ~LVvivG~dLl~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~ 302 (337) +++|.+.-+.. ...+....+.|- .+ ....++-|+|++.....|. +++= |+|-||. + +++...+ T Consensus 283 --~~imn~~t~~~-~~~~~~~~~~~~--~~----~~~~~llG~PV~~~~~~~~--~~~G---Df~~~~~-~-~~~~~~~- 345 (387) T protein:vir:26 283 --TIYMRYADYVK-IISVLSNGTTNF--FD----TPAEKVFGKPVVFTDAAVK--PIVG---DFNYFGI-N-YDGTTYD- 345 (387) T ss_pred --EEEEechHHHH-HHHHHhcCCCcc--cc----cCCccccccceEEecCCCc--eeee---chhhhhh-h-hhhhhhe- Confidence 67787654432 223333333221 11 1345788999999998875 5554 4454542 1 2222221 Q ss_pred ccccceeceeeeeeeeeeeccccEEEee-----cceeccC Q lcl|Aclame:pro 303 PERDRIENYESSNDAYVVEDFGCGCVAE-----NIELAAA 337 (337) Q Consensus 303 p~r~rve~y~s~Ne~YvVEd~~~~a~ie-----ni~~~~a 337 (337) +..+...-.-+|++...--+..++ -+++..| T Consensus 346 ----~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~ 381 (387) T protein:vir:26 346 ----TDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred ----ecccccCCceEEEEEEEeCcEeechhheEEEEeecC Confidence 111111222233332222222221 1333333 No 110 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=96.41 E-value=0.00066 Score=38.04 Aligned_cols=276 Identities=8% Similarity=0.055 Sum_probs=131.2 Q ss_pred 
CChHHHHHHHHHHHHHHHh------------------hCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhh Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKL------------------NDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELE 62 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~------------------ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~ 62 (337) ........|..|+...... .|. +..-.+.|-+.+...+.+.+.+.+.+++.++++++...+ T Consensus 83 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~-~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~ 161 (387) T protein:vir:94 83 DNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGN-DSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGLE 161 (387) T ss_pred hhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCC-CCCCceeechhHHHHHHHHHHhhchhhhhceeeecCCce Confidence 2222222233333222110 011 112256787788999999999999999999999998666 Q ss_pred ceeeecccccccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHh Q lcl|Aclame:pro 63 GEKLGLSVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMI 142 (337) Q Consensus 63 Ge~v~lgv~g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~I 142 (337) .-++.. +++-++-..-+ ......+ ..++...|..++.---+.|+++.|+.. .++|+..+.+.++++++.-..-. T Consensus 162 ~p~~~~--~~~~a~~v~Eg-~~~~~~~-~~f~~v~l~~~k~~~~i~iS~ell~ds--~~~l~~~i~~~la~~~~~~e~~~ 235 (387) T protein:vir:94 162 IPRVSY--TLDDDDFITDV-ETAKELK-AKGDTVKFTTNKFKVFAAISDTVIHGS--DVDLVNWVENALQSGLAAKERKD 235 (387) T ss_pred eeeeec--cCCcccccccc-ccccccc-cccceeeechheeeeechhhHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHh Confidence 555433 22223322211 1122122 234444455444444578889988865 36789999999999887632222 Q ss_pred cccccccCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCC Q lcl|Aclame:pro 143 GWNGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDT 222 (337) Q Consensus 143 GfnG~s~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~ 222 (337) -|. +.+.+.-|. |.+ .... ...+ ++. ...|.|+ +++.+ +++-|+... 
T Consensus 236 ~~~------~g~g~g~~~------g~~------------~~~~--~~~~---~~~--~~~d~i~-~~~~~-l~~~y~~na 282 (387) T protein:vir:94 236 ALA------VSPKSGLEH------MSF------------YNGS--VKEV---EGA--DMYDAII-NALAD-LHEDYRDNA 282 (387) T ss_pred Hhh------cCCCccccc------eee------------eccc--cccc---ccc--chHHHHH-HHHhc-cChhhhcCC Confidence 221 111111121 111 1000 0011 111 1256554 45665 467777653 Q ss_pred CEEEEECHHHHHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEc Q lcl|Aclame:pro 223 GLVVICGRELLHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEV 302 (337) Q Consensus 223 ~LVvivG~dLl~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~ 302 (337) +++|.+.-+.. ...+....+.|- .+ ....++-|+|++.....|. +++= |+|-||. + +++...+ T Consensus 283 --~~imn~~t~~~-~~~~~~~~~~~~--~~----~~~~~llG~PV~~~~~~~~--~~~G---Df~~~~~-~-~~~~~~~- 345 (387) T protein:vir:94 283 --TIYMRYADYVK-IISVLSNGTTNF--FD----TPAEKVFGKPVVFTDAAVK--PIVG---DFNYFGI-N-YDGTTYD- 345 (387) T ss_pred --EEEEechHHHH-HHHHHhcCCCcc--cc----cCCccccccceEEecCCCc--eeee---chhhhhh-h-hhhhhhe- Confidence 67787654432 223333333221 11 1345788999999998875 5554 4454542 1 2222221 Q ss_pred ccccceeceeeeeeeeeeeccccEEEee-----cceeccC Q lcl|Aclame:pro 303 PERDRIENYESSNDAYVVEDFGCGCVAE-----NIELAAA 337 (337) Q Consensus 303 p~r~rve~y~s~Ne~YvVEd~~~~a~ie-----ni~~~~a 337 (337) +..+...-.-+|++...--+..++ -+++..| T Consensus 346 ----~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l~~ka~ 381 (387) T protein:vir:94 346 ----TDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKEN 381 (387) T ss_pred ----ecccccCCceEEEEEEEeCcEeechhheEEEEeecC Confidence 111111222233332222222221 1333333 No 111 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=95.97 E-value=0.0012 Score=36.63 Aligned_cols=260 Identities=13% Similarity=0.132 Sum_probs=129.8 Q ss_pred 
HHHhhCchhhcceEeechH-HHHHHHHHHHhhHHHhcccceec-cchhhceeeecccccccccccCCCCccccccccccc Q lcl|Aclame:pro 16 IAKLNDTGDVSKKFAVEPT-VQQRLETKMQESSEFLKRINVLP-VTELEGEKLGLSVSGPIASRTDTTKAARQPIDPTAL 93 (337) Q Consensus 16 ~a~~ngv~~~~~~Fsv~P~-~~q~L~~~iqess~FL~~Inv~~-V~~~~Ge~v~lgv~g~ia~Rt~t~~~~R~p~~~~~l 93 (337) ||..+= ..+. -+.|+ ..+.+.+.+++++.|-+..++.. ...+.|..|.+=.-+.+..-.+.+.+.--|..-.+. T Consensus 1 MA~~~T--~~~~--~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~ 76 (272) T protein:vir:30 1 MAVGTT--KMAQ--MLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGF 76 (272) T ss_pred CCCccc--cchh--eechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCccccccccc Confidence 332110 0111 34563 34556677777777755444421 122334445442222222222222233334344445 Q ss_pred CCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhh--hhHHhcccccccCCcCChhhhhhhhccchhHHHH Q lcl|Aclame:pro 94 DSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGAL--DRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQ 171 (337) Q Consensus 94 ~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aL--D~i~IGfnG~s~A~~Td~~anPllqDVN~GWlq~ 171 (337) +......++. -..++...++.....+|+...+.+.+.+.++. |...++- T Consensus 77 ~~~~~~~~~~--~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~--------------------------- 127 (272) T protein:vir:30 77 KKTTMTIKKA--GKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDA--------------------------- 127 (272) T ss_pred ceEEEEeeee--eeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHH--------------------------- Confidence 5555555553 44567777777777899999999999988864 3333321 Q ss_pred HHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHH-HHHhccCChHHH Q lcl|Aclame:pro 172 YRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYF-PIVNATQAPTER 250 (337) Q Consensus 172 ~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~-~l~n~~~~ptE~ 250 (337) +.... ..++.+. ++|+ +.|++..| +.. ..+.-+++|++.....-.. 
.+.+- ...++- T Consensus 128 ---------~~~a~-----~~~~~~~---t~d~-i~da~~~l-~~~--~~~~~~~vv~p~~~~~L~k~~~~~~-~~~~~~ 185 (272) T protein:vir:30 128 ---------LSKST-----QTVEATA---TVDG-VSKALDIF-NDE--DDAETVIVMNPADASTLRLDAAKEW-LGATEV 185 (272) T ss_pred ---------hcccc-----ccccccc---CHHH-HHHHHHHH-hcc--CCCccEEEEcHHHHHHHHHhccccc-cccccc Confidence 00000 0112222 3554 34566544 433 2334589999987654211 11211 111111 Q ss_pred HHHHHH--HhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcccccceeceeeeeeeee--eeccccE Q lcl|Aclame:pro 251 LAADLI--VSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYESSNDAYV--VEDFGCG 326 (337) Q Consensus 251 ~A~~~~--~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~y~s~Ne~Yv--VEd~~~~ 326 (337) . .+.+ ....+|.|+|++.-+++|++.+++-.-..+.++.+.+..-. ...++++. .+......-|. |=+..++ T Consensus 186 ~-~~~~~~g~ig~i~G~~Vi~s~~~p~~t~~~~~~~a~~~~~~~~~~ve-~~r~~~~~--~~~i~~~~~~~~~v~~~~~v 261 (272) T protein:vir:30 186 G-ANRVVSGVYGEVLGVQIVRSRKCPKGTAYMVRKGALRIMLKRNTMVE-TDRDITKA--INQIVANKHYGVYLYKAEKA 261 (272) T ss_pred c-ccccccccchhhcCeeEEEcCCCCcceEEEEcCCeEEEEecCCceee-eccccccc--eeEEEEEEEEEEEEEcCCce Confidence 0 0111 12347999999999999999999999999988887774422 22223222 22222222221 2222222 Q ss_pred EEeecceeccC Q lcl|Aclame:pro 327 CVAENIELAAA 337 (337) Q Consensus 327 a~ieni~~~~a 337 (337) + .+++++| T Consensus 262 v---~~t~~~a 269 (272) T protein:vir:30 262 V---KITLKDA 269 (272) T ss_pred E---EEEeccc Confidence 2 4577888 No 112 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=95.97 E-value=0.0012 Score=36.63 Aligned_cols=260 Identities=13% Similarity=0.132 Sum_probs=129.8 Q ss_pred HHHhhCchhhcceEeechH-HHHHHHHHHHhhHHHhcccceec-cchhhceeeecccccccccccCCCCccccccccccc Q lcl|Aclame:pro 16 IAKLNDTGDVSKKFAVEPT-VQQRLETKMQESSEFLKRINVLP-VTELEGEKLGLSVSGPIASRTDTTKAARQPIDPTAL 93 (337) Q Consensus 16 
~a~~ngv~~~~~~Fsv~P~-~~q~L~~~iqess~FL~~Inv~~-V~~~~Ge~v~lgv~g~ia~Rt~t~~~~R~p~~~~~l 93 (337) ||..+= ..+. -+.|+ ..+.+.+.+++++.|-+..++.. ...+.|..|.+=.-+.+..-.+.+.+.--|..-.+. T Consensus 1 MA~~~T--~~~~--~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~ 76 (272) T protein:vir:98 1 MAVGTT--KMAQ--MLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGF 76 (272) T ss_pred CCCccc--cchh--eechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCccccccccc Confidence 332110 0111 34563 34556677777777755444421 122334445442222222222222233334344445 Q ss_pred CCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhh--hhHHhcccccccCCcCChhhhhhhhccchhHHHH Q lcl|Aclame:pro 94 DSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGAL--DRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQ 171 (337) Q Consensus 94 ~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aL--D~i~IGfnG~s~A~~Td~~anPllqDVN~GWlq~ 171 (337) +......++. -..++...++.....+|+...+.+.+.+.++. |...++- T Consensus 77 ~~~~~~~~~~--~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~--------------------------- 127 (272) T protein:vir:98 77 KKTTMTIKKA--GKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDA--------------------------- 127 (272) T ss_pred ceEEEEeeee--eeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHH--------------------------- Confidence 5555555553 44567777777777899999999999988864 3333321 Q ss_pred HHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHH-HHHhccCChHHH Q lcl|Aclame:pro 172 YRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYF-PIVNATQAPTER 250 (337) Q Consensus 172 ~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~-~l~n~~~~ptE~ 250 (337) +.... ..++.+. ++|+ +.|++..| +.. ..+.-+++|++.....-.. 
.+.+- ...++- T Consensus 128 ---------~~~a~-----~~~~~~~---t~d~-i~da~~~l-~~~--~~~~~~~vv~p~~~~~L~k~~~~~~-~~~~~~ 185 (272) T protein:vir:98 128 ---------LSKST-----QTVEATA---TVDG-VSKALDIF-NDE--DDAETVIVMNPADASTLRLDAAKEW-LGATEV 185 (272) T ss_pred ---------hcccc-----ccccccc---CHHH-HHHHHHHH-hcc--CCCccEEEEcHHHHHHHHHhccccc-cccccc Confidence 00000 0112222 3554 34566544 433 2334589999987654211 11211 111111 Q ss_pred HHHHHH--HhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcccccceeceeeeeeeee--eeccccE Q lcl|Aclame:pro 251 LAADLI--VSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYESSNDAYV--VEDFGCG 326 (337) Q Consensus 251 ~A~~~~--~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p~r~rve~y~s~Ne~Yv--VEd~~~~ 326 (337) . .+.+ ....+|.|+|++.-+++|++.+++-.-..+.++.+.+..-. ...++++. .+......-|. |=+..++ T Consensus 186 ~-~~~~~~g~ig~i~G~~Vi~s~~~p~~t~~~~~~~a~~~~~~~~~~ve-~~r~~~~~--~~~i~~~~~~~~~v~~~~~v 261 (272) T protein:vir:98 186 G-ANRVVSGVYGEVLGVQIVRSRKCPKGTAYMVRKGALRIMLKRNTMVE-TDRDITKA--INQIVANKHYGVYLYKAEKA 261 (272) T ss_pred c-ccccccccchhhcCeeEEEcCCCCcceEEEEcCCeEEEEecCCceee-eccccccc--eeEEEEEEEEEEEEEcCCce Confidence 0 0111 12347999999999999999999999999988887774422 22223222 22222222221 2222222 Q ss_pred EEeecceeccC Q lcl|Aclame:pro 327 CVAENIELAAA 337 (337) Q Consensus 327 a~ieni~~~~a 337 (337) + .+++++| T Consensus 262 v---~~t~~~a 269 (272) T protein:vir:98 262 V---KITLKDA 269 (272) T ss_pred E---EEEeccc Confidence 2 4577888 No 113 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=95.08 E-value=0.0028 Score=34.58 Aligned_cols=283 Identities=12% Similarity=0.037 Sum_probs=141.9 Q ss_pred CChHHHHHHHHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCC Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 
M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t 80 (337) ++++.|..|+++++. +. +....+.|-+.+..++.+.+.+.|..++.++++++.- +-++-...+++-++=..- T Consensus 67 lt~ee~~~~~~~~~~-----~~-~~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~~--~~~~~~~~~~~~a~w~~e 138 (377) T protein:vir:98 67 LTAEEIKFFNDIDKN-----VG-GKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL--RLKALTAETSGTAVWGDI 138 (377) T ss_pred cCHHHHHHHHHHHhc-----cC-CCCCccccCHHHHHHHHHHHHHhhhhhhheeeEecCc--ceEEEEecCCcceeEeec Confidence 777778888776543 22 2233567878899999999999999999999887642 223433333343332221 Q ss_pred CCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhhh Q lcl|Aclame:pro 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) Q Consensus 81 ~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anPl 160 (337) ..++.+..-..++...+.+++.---..|+.+.|+.=. .|++..+++.+.++++.=.-.--+||+-.- - | T Consensus 139 -~~~~~~~~~~~f~~i~l~~~kl~a~~~is~elL~ds~--~~ie~~i~~~la~~~a~~~~~a~i~G~G~~---q----P- 207 (377) T protein:vir:98 139 -FGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGP--KWIKQFITEQLKEAIAVALELAIVKGDGLL---Q----P- 207 (377) T ss_pred -ccccCcccCccceeEeecceeEEeeecccHHhhhccH--hHHHHHHHHHHHHHHHHHHhhceEeccCCC---c----c- Confidence 1233333334566677777777777889999997522 368888999999999876556666774311 1 1 Q ss_pred hhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHHHH Q lcl|Aclame:pro 161 LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPI 240 (337) Q Consensus 161 lqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~~l 240 (337) +|+|..+ +..+.. ..-..+..+.|...|++ .++...+ ++.|+. ..+.+|.+.-+..... 
| T Consensus 208 -----~Gil~~~----~~~~~~------~~~~~~~~~~~~~~~~~-~~l~~~~-~~~~~~--~a~~~m~~~t~~~~~k-l 267 (377) T protein:vir:98 208 -----VGLLKDL----SQPTVD------QSTGRDITTYKTDKEAI-ADLSDLT-PDNAPK--KLVPVMKHLSVNDKKR-P 267 (377) T ss_pred -----eeeeecc----cccccc------cccccccccccchhhhH-hhhhhhc-hhHHHH--HHHHHHHHHHHHHHhh-h Confidence 3554221 000000 00111223334434433 2233222 233333 1223333222221110 0 Q ss_pred HhccC------ChHHHHHHH---HHH-h---hhhhcCcc--ccccCccCCCceEEecchhcEEEEecCceEEEEEEcccc Q lcl|Aclame:pro 241 VNATQ------APTERLAAD---LIV-S---QKRIGNLP--AVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPER 305 (337) Q Consensus 241 ~n~~~------~ptE~~A~~---~~~-~---~k~iGGlp--a~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p~r 305 (337) -...+ .|+.....+ ... + -.++-|+| .+.-+++|++.+++--+++--|+...|-+ T Consensus 268 kd~~G~~i~~~n~~~~~~~~p~~~~~~~~G~~~t~lg~p~~vv~s~~~p~~~i~fgdf~~Y~i~~r~~~~---------- 337 (377) T protein:vir:98 268 LKIAGQVKLILNPEDRWALEAQFTSRNQFGEYVTVLPHGITILESLAVETGKAIAFVANRYDAFMATAST---------- 337 (377) T ss_pred hccCCceEEEecccchhhccccccccCCCCccccccCCCceEEecCCCCcccEEEEEecceeEEeecceE---------- Confidence 00000 111110000 000 0 01333445 56778999999998888774444332211 Q ss_pred cceeceeeeeeeeeeeccccEEEee--cceecc--C Q lcl|Aclame:pro 306 DRIENYESSNDAYVVEDFGCGCVAE--NIELAA--A 337 (337) Q Consensus 306 ~rve~y~s~Ne~YvVEd~~~~a~ie--ni~~~~--a 337 (337) ...+.+.|..+|.-.+-++. 
+-+..+ | T Consensus 338 -----i~~~~~~~~~~d~~~f~~~~r~dg~~~~~~a 368 (377) T protein:vir:98 338 -----IEEYDQTFAMEDLQLYLTKNYFYGKAKDNHT 368 (377) T ss_pred -----EEeechhhhhcCceEEEEEEEEcCEEeccCc Confidence 12233445566555554443 111211 2 No 114 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=95.05 E-value=0.0029 Score=34.52 Aligned_cols=275 Identities=9% Similarity=0.074 Sum_probs=135.7 Q ss_pred CChHHH--HHHHHHHHHHH----------------HhhCch-hhcceEeechHHHHHHHHHHHhhHHHhcccceeccchh Q lcl|Aclame:pro 1 MRKETR--QAYEKYAAQIA----------------KLNDTG-DVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTEL 61 (337) Q Consensus 1 M~~~tr--~~~~~y~~~~a----------------~~ngv~-~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~ 61 (337) +....+ ..|..|..... ...+.. +..-.|.|-..+...+.+.+++.+.+.+.++++++... T Consensus 46 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~al~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~~~~ 125 (352) T protein:vir:78 46 LNDNEKLVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGL 125 (352) T ss_pred cchhhhHHHHHHHHHHHHhhhhHHHHHHhhHHHHHHHhccCCCCCCceeccHhHHHHHHHHHHhhcchhhheeeEecCCc Confidence 111111 11222211111 111111 22235666567888999999999999999999888654 Q ss_pred hceeeecccccccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhh--hh Q lcl|Aclame:pro 62 EGEKLGLSVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGAL--DR 139 (337) Q Consensus 62 ~Ge~v~lgv~g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aL--D~ 139 (337) +..++. .+++-++-... ....|..-..++...|..++.---+.|+++.|+.=+ ++++..+.+.++++++. 
+- T Consensus 126 ~~p~~~--~~~~~a~~v~E--~~~~~~~~~~f~~v~~~~~k~~~~i~is~ell~Ds~--~~l~~~i~~~la~~~~~~e~~ 199 (352) T protein:vir:78 126 EIPRVS--YTLDDDDFITD--VETAKELKLKGDTVKFTTNKFKVFAAISDTVIHGSD--VDLVNWVENALQSGLAAKERK 199 (352) T ss_pred eEEEEe--cCCCccccccc--ccccccccccceeeeecceeEEeechhhHHHHhhhh--HHHHHHHHHHHHHHHHHHHHH Confidence 443332 22233332222 122222234566777777777777899999887633 67889999999998874 22 Q ss_pred HHhcccccccCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHc Q lcl|Aclame:pro 140 IMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQ 219 (337) Q Consensus 140 i~IGfnG~s~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r 219 (337) ..+| +| +....|. |.| .... ...++ | ... .|.++ +++.. |++-|+ T Consensus 200 ~~~~-~g-------~g~~~~~------g~l------------~~~~--~~~~t-~-~~~---~d~i~-~~~~~-l~~~~~ 244 (352) T protein:vir:78 200 DALA-VS-------PKSGLEH------MSF------------YNGS--VKEVE-G-ANM---YDAII-NALAD-LHEDYR 244 (352) T ss_pred hhhh-cC-------CCCcccc------cce------------eccc--ccccc-c-cch---HHHHH-HHHhc-cChhhh Confidence 2332 22 2222222 211 1100 00111 1 112 45443 45554 577787 Q ss_pred CCCCEEEEECHHHHHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEE Q lcl|Aclame:pro 220 EDTGLVVICGRELLHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTL 299 (337) Q Consensus 220 ~~~~LVvivG~dLl~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~ 299 (337) +. -+++|.+..+.. -..+....+.|- ++ ....++-|+|++.....|. +++= |+|.||.. +.+.. 
T Consensus 245 ~~--a~~~mn~~t~~~-l~~~~~~~~~~~--~~----~~~~~llG~PV~~~~~~~~--~~~G---df~~~~~~--~~~~~ 308 (352) T protein:vir:78 245 DN--ATIYMRYADYVK-IISVLSNGTTNF--FD----TPAEKVFGKPVVFTDAAVK--PIVG---DFNYFGIN--YDGTT 308 (352) T ss_pred cC--CEEEEehHHHHH-HHHHHhccCCcc--cc----cCCccccccceEEecCCCc--eeEe---ehhhhhhh--hhhhe Confidence 74 377887755432 223333333321 11 1234677999999998875 4554 45555431 11111 Q ss_pred EEcccccceeceeeeeeeeeeeccccEEEee-----cceeccC Q lcl|Aclame:pro 300 KEVPERDRIENYESSNDAYVVEDFGCGCVAE-----NIELAAA 337 (337) Q Consensus 300 ~d~p~r~rve~y~s~Ne~YvVEd~~~~a~ie-----ni~~~~a 337 (337) .++..++..-..+|+...+--+..++ -++++.| T Consensus 309 -----~~~~~~~~~g~~~f~~~~r~Dg~~~~~eA~~~l~~~a~ 346 (352) T protein:vir:78 309 -----YDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKES 346 (352) T ss_pred -----eeeeccccCCeeEEEEEeeeCceeechhheEEEEeecc Confidence 11222233334455544433333332 2223333 No 115 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=94.86 E-value=0.0033 Score=34.18 Aligned_cols=279 Identities=10% Similarity=0.101 Sum_probs=133.4 Q ss_pred CChHHHHHHHHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceee-ecccccccccccC Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKL-GLSVSGPIASRTD 79 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v-~lgv~g~ia~Rt~ 79 (337) ++...+........+.+. +......-.+.|-+.+...+.+.+.+.|.+++.+++++|.-..|... ....+++-++-.. 
T Consensus 89 ~~~~~~~~~~~~~~~~~~-~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~ 167 (392) T protein:vir:10 89 LNAEEREFLEDDLEQRAM-SGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEIT 167 (392) T ss_pred ccHHHHHHHhhhhhhhhc-cccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeec Confidence 222222222222222111 11112234567877778899999999999999999999988777643 3333333343332 Q ss_pred CCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhh Q lcl|Aclame:pro 80 TTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANP 159 (337) Q Consensus 80 t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anP 159 (337) .+ ......+...++.....+++.---+.|+.+.|+.. .++|...+.+.+.+.++.-.-.-=+||...+.. T Consensus 168 E~-~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~------- 237 (392) T protein:vir:10 168 EM-GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS--DQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK------- 237 (392) T ss_pred cc-ccccccccccceeEEeeeeeEEEeehhhHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHHhhccccccc------- Confidence 22 22222233457777888888877888999999863 378999999999988876332222233221110 Q ss_pred hhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHHH Q lcl|Aclame:pro 160 LLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFP 239 (337) Q Consensus 160 llqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~~ 239 (337) ... .+.|.++. +++..+++.|+. ..+++|.+..+..-. . 
T Consensus 238 ----------------------------------~~~---~~~d~i~~-~~~~~l~~~~~~--~a~~vm~~~~~~~L~-~ 276 (392) T protein:vir:10 238 ----------------------------------QAI---KSLDDIKD-VLNVKLDPAISP--NAILLTNQDGFNYLD-K 276 (392) T ss_pred ----------------------------------cCc---cCHHHHHH-HHHHhhhhhhcc--CCEEEEcHHHHHHHH-H Confidence 011 23454443 344456787764 478999998866421 2 Q ss_pred HHhccCChH--HHHHHHHHHhhhhhcCccc-cccCcc-CC------Cc--eEEecchhcEEEEecCceEEEEEEccc--- Q lcl|Aclame:pro 240 IVNATQAPT--ERLAADLIVSQKRIGNLPA-VRVPFF-PK------RA--LMVTKLSNLSIYYQEGARRRTLKEVPE--- 304 (337) Q Consensus 240 l~n~~~~pt--E~~A~~~~~~~k~iGGlpa-~~vPff-P~------~~--iliT~l~NLsiY~Q~gs~RR~~~d~p~--- 304 (337) |-...+.|- .-.. -....++-|.|. ++.+.. |. +. +++=.+++...-..++..+-.+-+.-. T Consensus 277 lkd~~G~~l~~~~~~---~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f 353 (392) T protein:vir:10 277 LKDKDGKYILQSDPT---QKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAF 353 (392) T ss_pred hhccCCCeEeecCcc---CCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchh Confidence 211111110 0000 012345667654 434332 21 11 344444442221222222222211100 Q ss_pred -ccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 305 -RDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 305 -r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) ++.+--.-..--++.|=+.+.++.+ ++..+ T Consensus 354 ~~~~~~~r~~~r~d~~v~~~~a~~~l---~~~~~ 384 (392) T protein:vir:10 354 TRNTLDLRAIQRDDVQMWDNEAAVYG---EIDLS 384 (392) T ss_pred hcCceEEEEEEeeccEEecccceEEE---Eeccc Confidence 1111111111112333334444433 22222 No 116 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=94.86 E-value=0.0033 Score=34.18 Aligned_cols=279 Identities=10% Similarity=0.101 Sum_probs=133.4 Q ss_pred CChHHHHHHHHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceee-ecccccccccccC Q lcl|Aclame:pro 1 
MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKL-GLSVSGPIASRTD 79 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v-~lgv~g~ia~Rt~ 79 (337) ++...+........+.+. +......-.+.|-+.+...+.+.+.+.|.+++.+++++|.-..|... ....+++-++-.. T Consensus 89 ~~~~~~~~~~~~~~~~~~-~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~ 167 (392) T protein:vir:10 89 LNAEEREFLEDDLEQRAM-SGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEIT 167 (392) T ss_pred ccHHHHHHHhhhhhhhhc-cccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeec Confidence 222222222222222111 11112234567877778899999999999999999999988777643 3333333343332 Q ss_pred CCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhh Q lcl|Aclame:pro 80 TTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANP 159 (337) Q Consensus 80 t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anP 159 (337) .+ ......+...++.....+++.---+.|+.+.|+.. .++|...+.+.+.+.++.-.-.-=+||...+.. T Consensus 168 E~-~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~------- 237 (392) T protein:vir:10 168 EM-GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS--DQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK------- 237 (392) T ss_pred cc-ccccccccccceeEEeeeeeEEEeehhhHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHHhhccccccc------- Confidence 22 22222233457777888888877888999999863 378999999999988876332222233221110 Q ss_pred hhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHHH Q lcl|Aclame:pro 160 LLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFP 239 (337) Q Consensus 160 llqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~~ 239 (337) ... .+.|.++. +++..+++.|+. ..+++|.+..+..-. . 
T Consensus 238 ----------------------------------~~~---~~~d~i~~-~~~~~l~~~~~~--~a~~vm~~~~~~~L~-~ 276 (392) T protein:vir:10 238 ----------------------------------QAI---KSLDDIKD-VLNVKLDPAISP--NAILLTNQDGFNYLD-K 276 (392) T ss_pred ----------------------------------cCc---cCHHHHHH-HHHHhhhhhhcc--CCEEEEcHHHHHHHH-H Confidence 011 23454443 344456787764 478999998866421 2 Q ss_pred HHhccCChH--HHHHHHHHHhhhhhcCccc-cccCcc-CC------Cc--eEEecchhcEEEEecCceEEEEEEccc--- Q lcl|Aclame:pro 240 IVNATQAPT--ERLAADLIVSQKRIGNLPA-VRVPFF-PK------RA--LMVTKLSNLSIYYQEGARRRTLKEVPE--- 304 (337) Q Consensus 240 l~n~~~~pt--E~~A~~~~~~~k~iGGlpa-~~vPff-P~------~~--iliT~l~NLsiY~Q~gs~RR~~~d~p~--- 304 (337) |-...+.|- .-.. -....++-|.|. ++.+.. |. +. +++=.+++...-..++..+-.+-+.-. T Consensus 277 lkd~~G~~l~~~~~~---~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f 353 (392) T protein:vir:10 277 LKDKDGKYILQSDPT---QKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAF 353 (392) T ss_pred hhccCCCeEeecCcc---CCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchh Confidence 211111110 0000 012345667654 434332 21 11 344444442221222222222211100 Q ss_pred -ccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 305 -RDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 305 -r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) ++.+--.-..--++.|=+.+.++.+ ++..+ T Consensus 354 ~~~~~~~r~~~r~d~~v~~~~a~~~l---~~~~~ 384 (392) T protein:vir:10 354 TRNTLDLRAIQRDDVQMWDNEAAVYG---EIDLS 384 (392) T ss_pred hcCceEEEEEEeeccEEecccceEEE---Eeccc Confidence 1111111111112333334444433 22222 No 117 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=94.86 E-value=0.0033 Score=34.18 Aligned_cols=279 Identities=10% Similarity=0.101 Sum_probs=133.4 Q ss_pred CChHHHHHHHHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceee-ecccccccccccC Q lcl|Aclame:pro 1 
MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKL-GLSVSGPIASRTD 79 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v-~lgv~g~ia~Rt~ 79 (337) ++...+........+.+. +......-.+.|-+.+...+.+.+.+.|.+++.+++++|.-..|... ....+++-++-.. T Consensus 89 ~~~~~~~~~~~~~~~~~~-~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~ 167 (392) T protein:vir:10 89 LNAEEREFLEDDLEQRAM-SGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEIT 167 (392) T ss_pred ccHHHHHHHhhhhhhhhc-cccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeec Confidence 222222222222222111 11112234567877778899999999999999999999988777643 3333333343332 Q ss_pred CCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhh Q lcl|Aclame:pro 80 TTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANP 159 (337) Q Consensus 80 t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anP 159 (337) .+ ......+...++.....+++.---+.|+.+.|+.. .++|...+.+.+.+.++.-.-.-=+||...+.. T Consensus 168 E~-~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~------- 237 (392) T protein:vir:10 168 EM-GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS--DQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK------- 237 (392) T ss_pred cc-ccccccccccceeEEeeeeeEEEeehhhHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHHhhccccccc------- Confidence 22 22222233457777888888877888999999863 378999999999988876332222233221110 Q ss_pred hhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHHH Q lcl|Aclame:pro 160 LLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFP 239 (337) Q Consensus 160 llqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~~ 239 (337) ... .+.|.++. +++..+++.|+. ..+++|.+..+..-. . 
T Consensus 238 ----------------------------------~~~---~~~d~i~~-~~~~~l~~~~~~--~a~~vm~~~~~~~L~-~ 276 (392) T protein:vir:10 238 ----------------------------------QAI---KSLDDIKD-VLNVKLDPAISP--NAILLTNQDGFNYLD-K 276 (392) T ss_pred ----------------------------------cCc---cCHHHHHH-HHHHhhhhhhcc--CCEEEEcHHHHHHHH-H Confidence 011 23454443 344456787764 478999998866421 2 Q ss_pred HHhccCChH--HHHHHHHHHhhhhhcCccc-cccCcc-CC------Cc--eEEecchhcEEEEecCceEEEEEEccc--- Q lcl|Aclame:pro 240 IVNATQAPT--ERLAADLIVSQKRIGNLPA-VRVPFF-PK------RA--LMVTKLSNLSIYYQEGARRRTLKEVPE--- 304 (337) Q Consensus 240 l~n~~~~pt--E~~A~~~~~~~k~iGGlpa-~~vPff-P~------~~--iliT~l~NLsiY~Q~gs~RR~~~d~p~--- 304 (337) |-...+.|- .-.. -....++-|.|. ++.+.. |. +. +++=.+++...-..++..+-.+-+.-. T Consensus 277 lkd~~G~~l~~~~~~---~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f 353 (392) T protein:vir:10 277 LKDKDGKYILQSDPT---QKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAF 353 (392) T ss_pred hhccCCCeEeecCcc---CCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchh Confidence 211111110 0000 012345667654 434332 21 11 344444442221222222222211100 Q ss_pred -ccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 305 -RDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 305 -r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) ++.+--.-..--++.|=+.+.++.+ ++..+ T Consensus 354 ~~~~~~~r~~~r~d~~v~~~~a~~~l---~~~~~ 384 (392) T protein:vir:10 354 TRNTLDLRAIQRDDVQMWDNEAAVYG---EIDLS 384 (392) T ss_pred hcCceEEEEEEeeccEEecccceEEE---Eeccc Confidence 1111111111112333334444433 22222 No 118 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=94.86 E-value=0.0033 Score=34.18 Aligned_cols=279 Identities=10% Similarity=0.101 Sum_probs=133.4 Q ss_pred CChHHHHHHHHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceee-ecccccccccccC Q lcl|Aclame:pro 1 
MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKL-GLSVSGPIASRTD 79 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v-~lgv~g~ia~Rt~ 79 (337) ++...+........+.+. +......-.+.|-+.+...+.+.+.+.|.+++.+++++|.-..|... ....+++-++-.. T Consensus 89 ~~~~~~~~~~~~~~~~~~-~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~ 167 (392) T protein:vir:10 89 LNAEEREFLEDDLEQRAM-SGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEIT 167 (392) T ss_pred ccHHHHHHHhhhhhhhhc-cccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeec Confidence 222222222222222111 11112234567877778899999999999999999999988777643 3333333343332 Q ss_pred CCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhh Q lcl|Aclame:pro 80 TTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANP 159 (337) Q Consensus 80 t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anP 159 (337) .+ ......+...++.....+++.---+.|+.+.|+.. .++|...+.+.+.+.++.-.-.-=+||...+.. T Consensus 168 E~-~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~------- 237 (392) T protein:vir:10 168 EM-GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS--DQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK------- 237 (392) T ss_pred cc-ccccccccccceeEEeeeeeEEEeehhhHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHHhhccccccc------- Confidence 22 22222233457777888888877888999999863 378999999999988876332222233221110 Q ss_pred hhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHHH Q lcl|Aclame:pro 160 LLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFP 239 (337) Q Consensus 160 llqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~~ 239 (337) ... .+.|.++. +++..+++.|+. ..+++|.+..+..-. . 
T Consensus 238 ----------------------------------~~~---~~~d~i~~-~~~~~l~~~~~~--~a~~vm~~~~~~~L~-~ 276 (392) T protein:vir:10 238 ----------------------------------QAI---KSLDDIKD-VLNVKLDPAISP--NAILLTNQDGFNYLD-K 276 (392) T ss_pred ----------------------------------cCc---cCHHHHHH-HHHHhhhhhhcc--CCEEEEcHHHHHHHH-H Confidence 011 23454443 344456787764 478999998866421 2 Q ss_pred HHhccCChH--HHHHHHHHHhhhhhcCccc-cccCcc-CC------Cc--eEEecchhcEEEEecCceEEEEEEccc--- Q lcl|Aclame:pro 240 IVNATQAPT--ERLAADLIVSQKRIGNLPA-VRVPFF-PK------RA--LMVTKLSNLSIYYQEGARRRTLKEVPE--- 304 (337) Q Consensus 240 l~n~~~~pt--E~~A~~~~~~~k~iGGlpa-~~vPff-P~------~~--iliT~l~NLsiY~Q~gs~RR~~~d~p~--- 304 (337) |-...+.|- .-.. -....++-|.|. ++.+.. |. +. +++=.+++...-..++..+-.+-+.-. T Consensus 277 lkd~~G~~l~~~~~~---~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f 353 (392) T protein:vir:10 277 LKDKDGKYILQSDPT---QKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAF 353 (392) T ss_pred hhccCCCeEeecCcc---CCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchh Confidence 211111110 0000 012345667654 434332 21 11 344444442221222222222211100 Q ss_pred -ccceeceeeeeeeeeeeccccEEEeecceeccC Q lcl|Aclame:pro 305 -RDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) Q Consensus 305 -r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~~a 337 (337) ++.+--.-..--++.|=+.+.++.+ ++..+ T Consensus 354 ~~~~~~~r~~~r~d~~v~~~~a~~~l---~~~~~ 384 (392) T protein:vir:10 354 TRNTLDLRAIQRDDVQMWDNEAAVYG---EIDLS 384 (392) T ss_pred hcCceEEEEEEeeccEEecccceEEE---Eeccc Confidence 1111111111112333334444433 22222 No 119 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=94.85 E-value=0.0033 Score=34.16 Aligned_cols=274 Identities=9% Similarity=0.044 Sum_probs=136.5 Q ss_pred CChHHHHHHHHHHHHHHHhh----------------Cc-hhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhc Q lcl|Aclame:pro 1 
MRKETRQAYEKYAAQIAKLN----------------DT-GDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEG 63 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~n----------------gv-~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~G 63 (337) .+......|..|+.+..... ++ .+..-.+.|-+.+...+.+.+.+.+.+.+.++++++...+. T Consensus 83 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~al~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~~~~ 162 (387) T protein:vir:93 83 DHEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGLEI 162 (387) T ss_pred hhhHHHHHHHHHHHHHhhhhhhhhhhhhhHHHHHhhccCcCCCCceeechhHHHHHHHHHHhhchhhhheeeeecCCceE Confidence 22222233444443332110 01 01122467767788889999999999999999998876554 Q ss_pred eeeecccccccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhc Q lcl|Aclame:pro 64 EKLGLSVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIG 143 (337) Q Consensus 64 e~v~lgv~g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IG 143 (337) -++. .+++-++-...+ ... |..-..++...|.+++.---+.|+++.|+. ..++|+..+.+.++++++.=..-.. T Consensus 163 p~~~--~~~~~a~~v~E~-~~~-~~~~~~f~~v~~~~~k~~~~~~iS~ell~D--s~~~l~~~i~~~la~~~~~~e~~~~ 236 (387) T protein:vir:93 163 PRVS--YTLDDDDFITDV-ETA-KELKLKGDTVKFTTNKFKVFAAISDTVIHG--SDVDLVNWVENALQSGLAAKERKDA 236 (387) T ss_pred EEEe--ecCCccccccCc-ccc-cccccccceeeeeheeeeeechhhHHHHhh--hHHHHHHHHHHHHHHHHHHHHHHhH Confidence 4432 223333332221 222 222244666777777776667888888863 1257999999999998875322222 Q ss_pred ccccccCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCC Q lcl|Aclame:pro 144 WNGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTG 223 (337) Q Consensus 144 fnG~s~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~ 223 (337) |. +.+....| .|++. .... ..+ ++.+ ..|.+ -+++.+ +++.|+... 
T Consensus 237 ~~------~g~g~g~p------~g~l~------------~~~~--~~v---~~~~--~~d~i-~~~~~~-l~~~~~~~a- 282 (387) T protein:vir:93 237 LA------VSPKSGLD------HMSFY------------NGSV--KEV---EGAD--MYDAI-INALAD-LHEDYRDNA- 282 (387) T ss_pred hh------cCCCcccc------ceeee------------cccc--ccc---cccc--hHHHH-HHHHhc-cChhhhcCC- Confidence 21 11111122 23321 1100 011 1111 13554 356665 577787754 Q ss_pred EEEEECHHHHHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcc Q lcl|Aclame:pro 224 LVVICGRELLHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVP 303 (337) Q Consensus 224 LVvivG~dLl~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p 303 (337) +++|.+.-+.. ...+....+.+- . .....+|-|+|++.....|. +++-.|+ -||. + +++...+ T Consensus 283 -~~~mn~~t~~~-~~~~~~d~~~~~--~----~~~~~~llG~PV~~~~~~~~--~~~GDf~---~~~~-~-~~~~~~~-- 345 (387) T protein:vir:93 283 -TIYMRYADYVK-IISVLSNGTTNF--F----DTPAEKVFGKPVVFTDAAVK--PIVGDFN---YFGI-N-YDGTTYD-- 345 (387) T ss_pred -EEEEechHHHH-HHHHHhcCCCcc--c----ccCCccccccceEEecCCCc--eeeeehh---hhhe-e-hhhheee-- Confidence 67887644332 223343333221 1 12345788999999998875 5665554 4443 1 2222221 Q ss_pred cccceeceeeeeeeeeee--------ccccEEEeecceeccC Q lcl|Aclame:pro 304 ERDRIENYESSNDAYVVE--------DFGCGCVAENIELAAA 337 (337) Q Consensus 304 ~r~rve~y~s~Ne~YvVE--------d~~~~a~ieni~~~~a 337 (337) +...+.....+|+.. |.++++. 
+++..| T Consensus 346 ---~~~~~~~~~~~~~~~~r~d~~v~~~eA~~~---l~~k~~ 381 (387) T protein:vir:93 346 ---TDKDVKKGEYLFVLTAWYDQQRTLDSAFRI---AKAKEN 381 (387) T ss_pred ---ecccccCCceeEEEEeeeCceeechhheEE---EEeecC Confidence 122222333344433 3333332 233333 No 120 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=94.70 E-value=0.0037 Score=33.91 Aligned_cols=286 Identities=11% Similarity=0.078 Sum_probs=137.4 Q ss_pred CChHH---------HHHH-HHHHHHHHHhhCch--------------------hhcceEeechHH-HHHHHHHHHhhHHH Q lcl|Aclame:pro 1 MRKET---------RQAY-EKYAAQIAKLNDTG--------------------DVSKKFAVEPTV-QQRLETKMQESSEF 49 (337) Q Consensus 1 M~~~t---------r~~~-~~y~~~~a~~ngv~--------------------~~~~~Fsv~P~~-~q~L~~~iqess~F 49 (337) |.+.. ...+ ..+...+++.+|.+ +.+-.+.|-|.+ .+.+++.+.+++-+ T Consensus 309 l~rai~a~a~~~~~~a~~~~e~a~~~a~~~G~~arg~~~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i 388 (632) T protein:vir:96 309 LMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAII 388 (632) T ss_pred HHHHHHhhhccchhhhhhhhHHHHHHHHhhhhhhhhhhhhHHHHHHhhhhcccccccccccccccchHHHHHHHhhcchh Confidence 00000 0000 11122233322210 111234455554 57889998887755 Q ss_pred hcccceeccchhhceeee-cccccccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHH Q lcl|Aclame:pro 50 LKRINVLPVTELEGEKLG-LSVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIR 128 (337) Q Consensus 50 L~~Inv~~V~~~~Ge~v~-lgv~g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~ 128 (337) .+ +.+-.+.-..|..-. .-.+|+-++=+.. 
....|..-..++...+..++.---..|+.+.|+.- .++++..++ T Consensus 389 ~~-l~~~~~~~~~g~~~ip~~~~~~~a~wv~E--~~~~~~s~~~f~~i~l~~~k~~~~v~iS~ell~ds--~~~~~~~i~ 463 (632) T protein:vir:96 389 GQ-MGARMLPGLVGDVDIPKKTSGANFYWIGE--DEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQS--SIHVENLIR 463 (632) T ss_pred hh-hcceEeecCCcceEEEEEeCCceeEeecC--CccccccccceeeEEeeeeEEEEehhhHHHHHhcc--chHHHHHHH Confidence 44 333333333343211 1112333322221 11223333456677777777777778888888753 478999999 Q ss_pred HHHHHHHhhhhHHhcccccccCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeec-CCcccccHHHHHH Q lcl|Aclame:pro 129 DVILNQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVG-KAGDYENLDALVM 207 (337) Q Consensus 129 ~~i~~~~aLD~i~IGfnG~s~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g-~ggdy~nLDaLv~ 207 (337) +.+.+.++.-.-.-.++|+..+. +|. |.+.. . ....+... .+-+|.++-.|.. T Consensus 464 ~~l~~a~~~~~d~a~l~G~G~~~------~p~------Gi~~~------------~--~~~~~~~~~~~~~~~~i~~~~~ 517 (632) T protein:vir:96 464 EDLIEGIGVALDLAMLTGTGLAN------DPV------GLLNM------------T--GVPALTYPAGGVDWASVVDMET 517 (632) T ss_pred HHHHHHHHHHHHHHhhcccCCCC------ccc------eeeec------------c--cccceecccccCCHHHHHHHHH Confidence 99999998544444457753221 132 33221 0 01112222 2225666555443 Q ss_pred HHHhcccChhHcCCCCEEEEECHHHHHHHHHH-HHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhc Q lcl|Aclame:pro 208 DIVSSMIDPWFQEDTGLVVICGRELLHDKYFP-IVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNL 286 (337) Q Consensus 208 d~~~~lid~~~r~~~~LVvivG~dLl~~k~~~-l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NL 286 (337) . |...+.+.+..+++|.......-... 
+......| +....++-|+|++...++|++.+++-.++.+ T Consensus 518 ~-----i~~~~~~~~~~~~~~~~~~~~~l~~~~l~d~~G~~--------i~~~~~l~G~pv~~s~~ip~~~~~~gd~s~~ 584 (632) T protein:vir:96 518 K-----ISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGER--------IWQNNEVNGYRAEASNQIPADTWIFGDWSQI 584 (632) T ss_pred H-----HhhcccccCccEEEEchhHHHHHHHHhccCCCCce--------eecCCeecccceEeccccccCcEEEeecceE Confidence 2 33445566678999998665432221 22222222 2234578899999999999999999999886 Q ss_pred EEEEecCceEEEEEEcccccceeceeeeeeeeee-eccccEEEe-ecceeccC Q lcl|Aclame:pro 287 SIYYQEGARRRTLKEVPERDRIENYESSNDAYVV-EDFGCGCVA-ENIELAAA 337 (337) Q Consensus 287 siY~Q~gs~RR~~~d~p~r~rve~y~s~Ne~YvV-Ed~~~~a~i-eni~~~~a 337 (337) -|... |..+-.+ .|. ..+.+-.-.+.+ ++++....- |.+.+..- T Consensus 585 ~i~~~-~~~~i~~--~~~----~~~~~~~v~~~~~~~~d~~v~~~~af~~~k~ 630 (632) T protein:vir:96 585 VIAMW-GVLDLKV--DPY----TKAASDGLVLRVFQDVDAGVRRKEAFCIAKK 630 (632) T ss_pred EEEEe-cceEEEE--ccc----cccccCceEEEEEeecCceeechhhhhheee Confidence 54433 4433322 121 111111122222 222221111 12322222 No 121 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=93.78 E-value=0.0064 Score=32.61 Aligned_cols=273 Identities=12% Similarity=0.068 Sum_probs=131.8 Q ss_pred hCchhhcc--eEeechHHHHHHHHHHHh----hHHHhcccceeccchhhceeeecc------cccccccccCCCCccccc Q lcl|Aclame:pro 20 NDTGDVSK--KFAVEPTVQQRLETKMQE----SSEFLKRINVLPVTELEGEKLGLS------VSGPIASRTDTTKAARQP 87 (337) Q Consensus 20 ngv~~~~~--~Fsv~P~~~q~L~~~iqe----ss~FL~~Inv~~V~~~~Ge~v~lg------v~g~ia~Rt~t~~~~R~p 87 (337) .+++++.. -|.+ +.-+.+...+.| .=...+.|.+...-..--|.+..+ ....+...++. 
-| T Consensus 1 ~~~~~a~~~~~f~~--~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~d-----ip 73 (296) T protein:vir:10 1 MGVDKADAAGIWTV--KQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQIVADYTDD-----LP 73 (296) T ss_pred CcccchhhhHHHHH--HHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCceeEeCCCccc-----cc Confidence 33332222 2222 122223333222 222333333222111111222222 22222222111 11 Q ss_pred ccccccCCceeEEEEeeeeeecCHHHHHHHhCC-hhHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhhhhhhhccch Q lcl|Aclame:pro 88 IDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKF-ADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPLLQDVNI 166 (337) Q Consensus 88 ~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~-~dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~anPllqDVN~ 166 (337) ..-.+.+.........--+..+++..|.+.+.. -+...+-..+.++..+..+=.++|+|.+..-.+=.-.+|.+.=++. T Consensus 74 ~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g~~GLlN~p~v~~~~~ 153 (296) T protein:vir:10 74 LVDALATERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHGIPSVFDYPNINNVVS 153 (296) T ss_pred eeeccceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEeecCCCccccc Confidence 111222222233334444556667778888775 4677888888889999999999999954433222222232211110 Q ss_pred --hHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEECHHHHHHHHHHHHhcc Q lcl|Aclame:pro 167 --GWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNAT 244 (337) Q Consensus 167 --GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG~dLl~~k~~~l~n~~ 244 (337) -|-+ + ..-|..+++++..+... .-....|+ ..++..++...-. 
+.++ T Consensus 154 ~~~W~~------~------------------t~i~~Di~~~~~~l~~~---s~g~~~p~-~l~L~p~~~~~L~-~~~~-- 202 (296) T protein:vir:10 154 GGSWSQ------P------------------TTAVSDITSLLDIIETS---TNGQHRAT-HLLLPTTARRIMQ-NLVP-- 202 (296) T ss_pred cCCccC------H------------------HHHHHHHHHHHHHHHHh---hCceecce-eEEeCHHHHHHHh-hccC-- Confidence 1200 0 01244455555544421 11223444 3444655544321 1222 Q ss_pred CChHHHHHHHHHHhhhhhcCccccccCccCCC------ceEE--ecchhcEEEEecCceEEEEEEcccccceeceeeeee Q lcl|Aclame:pro 245 QAPTERLAADLIVSQKRIGNLPAVRVPFFPKR------ALMV--TKLSNLSIYYQEGARRRTLKEVPERDRIENYESSND 316 (337) Q Consensus 245 ~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~------~ili--T~l~NLsiY~Q~gs~RR~~~d~p~r~rve~y~s~Ne 316 (337) .+-....+.+ .+.+.++..+.+|.+... .+++ +.-+|+++=+-..- |++-..--..+-.+.|..+-- T Consensus 203 --~~~~t~l~~i--k~~~~~l~i~~~~~l~~a~~~g~~~~v~~~~~~~~~~~~v~~~~-~~~~~e~~~l~~~~~~~~~~~ 277 (296) T protein:vir:10 203 --GTSVSYGEFF--RQNNSGVTVEFVQYLNDYNGTGTSAAIAYEKDPNNMAIEIPEAT-NALPAQPKDLHFKIPVTSKAT 277 (296) T ss_pred --CCCccHHHHH--HHhcCCceEEEeeeeccCCCCcceEEEEEEcCCceEEEEcCcce-eeecccccCceEEEeeEeeEE Confidence 2223334443 567789999999999652 2344 67888887664432 333332223344455566666 Q ss_pred eeeeeccccEEEeecceec Q lcl|Aclame:pro 317 AYVVEDFGCGCVAENIELA 335 (337) Q Consensus 317 ~YvVEd~~~~a~ieni~~~ 335 (337) +-+|=.+.++|.+++|+|+ T Consensus 278 Gv~i~~P~ai~~~dGI~~~ 296 (296) T protein:vir:10 278 GLIVYRPLTMAVMKGITFA 296 (296) T ss_pred EEEEECCceeEEEeeeecC Confidence 7888899999999999999 No 122 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=84.72 E-value=0.058 Score=27.35 Aligned_cols=296 Identities=9% Similarity=-0.013 Sum_probs=139.7 Q ss_pred CChHHHHHHHHH-HHHHHHhhCch-hhc---ceEeechHHHHHHHHHHHh----hHHHhcccceeccchhhceeeecccc Q lcl|Aclame:pro 1 MRKETRQAYEKY-AAQIAKLNDTG-DVS---KKFAVEPTVQQRLETKMQE----SSEFLKRINVLPVTELEGEKLGLSVS 71 (337) Q Consensus 1 
M~~~tr~~~~~y-~~~~a~~ngv~-~~~---~~Fsv~P~~~q~L~~~iqe----ss~FL~~Inv~~V~~~~Ge~v~lgv~ 71 (337) |+...-..+... ++.-++..|+. ++. --|. ...-+.+...+.| .=...+.|.+...-..--|.+..++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~da~~~~g~~~--~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~ 78 (319) T protein:vir:10 1 MTTKKFDEADKSNVEMYLIQAGVKQDAAATMGIWT--AQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTDKTFEYMTF 78 (319) T ss_pred CCCcchhHHhhHHHHHHHhhccchhhhhhhhhhHH--HHHHHHHHHHHHhhhhcceechhhcccccCCCCceEEEEeeee Confidence 887554433333 22223344553 221 1232 2223333333322 11122222222111111222222222 Q ss_pred cccccccCC-CC-cccccccccccCCceeEEEEeeeeeecCHHHHHHHhCCh-hHHHHHHHHHHHHHhhhhHHhcccccc Q lcl|Aclame:pro 72 GPIASRTDT-TK-AARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFA-DFQQRIRDVILNQGALDRIMIGWNGVK 148 (337) Q Consensus 72 g~ia~Rt~t-~~-~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~-dF~~r~~~~i~~~~aLD~i~IGfnG~s 148 (337) .+ +|.... +. .+--|..-.+.+.........--+..+++..|.+++... +...+-+.+.++..+..+=.|+|+|.. T Consensus 79 ~~-~G~a~~~~d~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~ 157 (319) T protein:vir:10 79 DK-VGTAQIIADYTDDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLVFKGSA 157 (319) T ss_pred cc-ccceeeecCccccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecc Confidence 11 122211 00 010122112222233334444556677788899998753 577777888888999999999999954 Q ss_pred cCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecC-C--cccccHHHHHHHHHhcccChhHcCCCCEE Q lcl|Aclame:pro 149 AAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGK-A--GDYENLDALVMDIVSSMIDPWFQEDTGLV 225 (337) Q Consensus 149 ~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~-g--gdy~nLDaLv~d~~~~lid~~~r~~~~LV 225 (337) ..-.+=.-.+|.++= ++. +.+ ...++ . .-|..+.+++..+... 
.-....|+ + T Consensus 158 ~~g~~GLlN~p~~~~-----------------~~~---~~~-~~~~t~t~~~i~~di~~~~~~l~~~---s~g~~~p~-~ 212 (319) T protein:vir:10 158 PHKIVSVFNHPNITK-----------------ITS---GKW-IDVSTMKPETAEAELTQAIETIETI---TRGQHRAT-N 212 (319) T ss_pred cccceeEEeCCCcee-----------------eec---CCC-CCccccCHHHHHHHHHHHHHHHHHh---cCceeece-E Confidence 333222222222210 100 000 00000 0 1133344455444321 11222333 5 Q ss_pred EEECHHHHHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCc-------eE-EecchhcEEEEecCceEE Q lcl|Aclame:pro 226 VICGRELLHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRA-------LM-VTKLSNLSIYYQEGARRR 297 (337) Q Consensus 226 vivG~dLl~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~-------il-iT~l~NLsiY~Q~gs~RR 297 (337) .++..++...-..+ .+ .+-....+.+ .+.+.++..+.+|.+...+ ++ ...-+|+++=+-. ..|+ T Consensus 213 L~L~p~~~~~L~~~-~~----~~~~t~l~~l--k~~~~~l~I~~~pel~~ag~~g~~~~v~y~~~~~~~~~~v~~-~~~~ 284 (319) T protein:vir:10 213 ILIPPSMRKVLAIR-MP----ETTMSYLDYF--KSQNSGIEIDSIAELEDIDGAGTKGVLVYEKNPMNMSIEIPE-AFNM 284 (319) T ss_pred EEecHHHHHhhhcc-cC----CCCeeHHHHH--HHhcCCceEEEeeeecccCCCcceEEEEEecCCceEEEecCc-ceee Confidence 55676665432221 22 2223334443 4567799999999997532 33 3357777776644 3344 Q ss_pred EEEEcccccceeceeeeeeeeeeeccccEEEeecc Q lcl|Aclame:pro 298 TLKEVPERDRIENYESSNDAYVVEDFGCGCVAENI 332 (337) Q Consensus 298 ~~~d~p~r~rve~y~s~Ne~YvVEd~~~~a~ieni 332 (337) +-..--...-.+.|..+--|-+|=.+.++|.+++| T Consensus 285 ~~~e~~~l~~~~~~~~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 285 LPAQPKDLHFKVPCTSKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred eeeeecCceEEEeeeeeeEEEEEEccceeEeeecC Confidence 44433345556677777778888899999999999 No 123 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=76.53 E-value=0.13 Score=25.36 Aligned_cols=283 Identities=11% Similarity=0.050 Sum_probs=135.5 Q ss_pred CChHHHHHHHHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCC Q 
lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t 80 (337) |.+..--.|- .+ +-=.|+|.+-+.+..-+. ..+|+ .+...-..-.|.+...+..+ +|.... T Consensus 1 ~~~~~~g~f~---~~-----------~l~~id~~v~e~~~~~l~-~r~l~---~v~~~~~~~~~~~~~~~~~~-~G~~~~ 61 (301) T protein:vir:80 1 MQGKITATIE---AR-----------DLQAIDNVIYEPKQEELT-ARSVF---PQKFDVNEGAESYSFDVMTR-SGAAKI 61 (301) T ss_pred CCccccchhh---HH-----------HHHHHHHHHHHhhhhhhh-hhhhc---ccccCCCCceEEEEEeeecc-ceeEEE Confidence 3332211110 00 000133333333333333 22232 22211111112222111111 111111 Q ss_pred -C-CcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCCh-hHHHHHHHHHHHHHhhhhHHhcccccccCCcCChhh Q lcl|Aclame:pro 81 -T-KAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFA-DFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQA 157 (337) Q Consensus 81 -~-~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~-dF~~r~~~~i~~~~aLD~i~IGfnG~s~A~~Td~~a 157 (337) + +.+--|..-...+.......+.--+..+.|..|.+.+... +...+-+.+.++..+..+=.+.|+|.+-.-.+=.-. T Consensus 62 ~~~~~~dip~~~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g~~GLlN 141 (301) T protein:vir:80 62 IANGADDLPLVDVDMVRKSVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYAIKGAFE 141 (301) T ss_pred ecCcccccccccccceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccccceeeec Confidence 0 0000122222333444455566666788899999998753 577777888889999999999999955332221111 Q ss_pred hhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccc--cHHHHHHHHHhcccChh-----HcCCCCEEEEECH Q lcl|Aclame:pro 158 NPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYE--NLDALVMDIVSSMIDPW-----FQEDTGLVVICGR 230 (337) Q Consensus 158 nPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~--nLDaLv~d~~~~lid~~-----~r~~~~LVvivG~ 230 (337) .|.+ ..+..+... .+...+++ +-|.++.|+.. ++..- +...| ...++.. 
T Consensus 142 ~p~~-----------------~~~~~~~~~-----~~~~~~w~~~t~~ei~~di~~-~~~~l~~~s~g~~~p-~~L~L~p 197 (301) T protein:vir:80 142 ATGI-----------------QIDVSPTTG-----VGNVSKWEKKTAEQIIDEIGE-AHTKITVLPGYGTAS-LKLCLPP 197 (301) T ss_pred CCCc-----------------ccccccCcc-----cccccccccCCHHHHHHHHHH-HHHHHHHhcCceecc-cEEEecH Confidence 1211 001100000 01111222 23333333322 22221 11222 4566777 Q ss_pred HHHHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCc-------eEE-ecchhcEEEEecCceEEEEEEc Q lcl|Aclame:pro 231 ELLHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRA-------LMV-TKLSNLSIYYQEGARRRTLKEV 302 (337) Q Consensus 231 dLl~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~-------ili-T~l~NLsiY~Q~gs~RR~~~d~ 302 (337) +....-..++++....-|+ .+.+ .+.+.++..+.+|.+...+ +++ ..-+|+++-+-.. .|++-.+- T Consensus 198 ~~~~~L~~~~~~~~~~~tv---l~~l--~~~~~~~~I~~~p~L~~~g~~g~~~~v~~~~~~d~~~~~v~~~-~~~~~~e~ 271 (301) T protein:vir:80 198 KQFELINKKRYSNEDSRSV---LKVL--QDNAWFSAIVRVPDLAGMGTAGSDSFAVIHDSNETAELIIPMD-ITRHPEEY 271 (301) T ss_pred HHHHhhhhccccCCCCeeH---HHHH--HHHcCcceEEEcceeccCCCCcccEEEEEecCCcEEEEEecCc-eeeeccee Confidence 6665544444433322232 3433 4577889999999997654 333 3478888877653 33333322 Q ss_pred ccccceeceeeeeeeeeeeccccEEEeecc Q lcl|Aclame:pro 303 PERDRIENYESSNDAYVVEDFGCGCVAENI 332 (337) Q Consensus 303 p~r~rve~y~s~Ne~YvVEd~~~~a~ieni 332 (337) -...-.+.|..+--+-+|=.+.++|.+++| T Consensus 272 ~~~~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 272 SFPRTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred cCceeEeeeeeeeEEEEEEccceEEEEecC Confidence 223455677888778888899999999999 No 124 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=73.40 E-value=0.092 Score=26.25 Aligned_cols=272 Identities=11% Similarity=0.064 Sum_probs=115.8 Q ss_pred CChHHHHHHHHHHHHHHHhhCch--hhcceEeec-hHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccc Q lcl|Aclame:pro 1 
MRKETRQAYEKYAAQIAKLNDTG--DVSKKFAVE-PTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASR 77 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~--~~~~~Fsv~-P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~R 77 (337) |.-.. +-+....+. +|.. +.....++- ..-.-.+..+.+.+|-|+.+.++-.++ .|..+.+-.-|.+.-. T Consensus 1 ~~~~~----~~~~~~~~~-~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~r~i~--~G~tv~i~~ig~~~~~ 73 (332) T protein:vir:78 1 MTTLS----NFSLPNQAN-GGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLR--GGKSKQFMFTGKLSAG 73 (332) T ss_pred Ccccc----cccCCcccc-CCccccccccchhhhhhhhhhhHHHHHHHHhhhhhcccccccc--ccceEEEEeccceeEe Confidence 22100 011111111 1211 111111110 122334567778889999988887665 4887766555443322 Q ss_pred cCCCCcccccccccccCCceeEE--EEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhh--hHHhcccccccCCcC Q lcl|Aclame:pro 78 TDTTKAARQPIDPTALDSNRYRC--EKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALD--RIMIGWNGVKAAATT 153 (337) Q Consensus 78 t~t~~~~R~p~~~~~l~~~~Y~c--~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD--~i~IGfnG~s~A~~T 153 (337) .-+.+..-.+. .+++..+-.| -+.-+.. ..-+.||.|...-|+...+.+.....+|-. .-.++ --..+| .+ T Consensus 74 ~~~~g~~l~~~--~~~~~~~~~l~ID~~ky~~-~~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~-~l~~aa-~~ 148 (332) T protein:vir:78 74 YHTPGTPIVGD--AGIKANEKTLVMDDLLVSS-QFVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIAR-VLAKAS-AE 148 (332) T ss_pred eecCCCCCCCC--CCCCCceEEEEEehhhhhH-HHHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHH-HHHhhh-cc Confidence 11111111110 1233333333 2222222 222579999999888888887777766652 22221 001111 11 Q ss_pred ChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcc--cccHHHHHHHHHhcccChhHcCCCCEEEEECHH Q lcl|Aclame:pro 154 DRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGD--YENLDALVMDIVSSMIDPWFQEDTGLVVICGRE 231 (337) Q Consensus 154 d~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggd--y~nLDaLv~d~~~~lid~~~r~~~~LVvivG~d 231 (337) . +|..-..++ ..+.++.++. =.++-..+.++.. .+|+..-...+.+++|++. 
T Consensus 149 ~---------------------~~~~~~~g~----~~~~~~~~~~~~~~~~~~~i~~a~~-~Lde~~VP~~gR~~vv~P~ 202 (332) T protein:vir:78 149 A---------------------SPVTGEPGG----FHVNIGAGNTNDAQAIVDGFFEAAA-VLDERSAPQEGRVAVLSPR 202 (332) T ss_pred c---------------------Ccccccccc----cccccCCccccCHHHHHHHHHHHHH-HHhhcCCCccCCEEEeCHH Confidence 0 000000000 1122222221 1345455677766 4588877777899999985 Q ss_pred HHHH----HHHHHHhccCChHH-HHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEccccc Q lcl|Aclame:pro 232 LLHD----KYFPIVNATQAPTE-RLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERD 306 (337) Q Consensus 232 Ll~~----k~~~l~n~~~~ptE-~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p~r~ 306 (337) .... +-.+++|..-..+. .+...- ...++.|.+++..|.+|..+.--....+.+ T Consensus 203 ~y~~Ll~~~d~~~~n~~~~~~~~~~~~g~--~i~~i~G~~V~~Sn~lp~~~g~~~~~~~~~------------------- 261 (332) T protein:vir:78 203 QYYSLISSVDTNILNREIGNSQGDMNSGK--GLYSIAGIRILKSNNLAGLYGQDLSSAAVT------------------- 261 (332) T ss_pred HHHHHHhhcCceeeeeeccccccceecce--eeeEEeeeEEEecCccccCccccccccccc------------------- Confidence 4332 11223332111111 111111 135788999999999997654333222221 Q ss_pred ceeceeeeeeeeeee---------ccccEEEee----cceeccC Q lcl|Aclame:pro 307 RIENYESSNDAYVVE---------DFGCGCVAE----NIELAAA 337 (337) Q Consensus 307 rve~y~s~Ne~YvVE---------d~~~~a~ie----ni~~~~a 337 (337) ..+-+|-++ ....++.++ -|+..++ T Consensus 262 ------~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~ 299 (332) T protein:vir:78 262 ------GENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSG 299 (332) T ss_pred ------ccccccccccccceEEeecccceeeeeeeccchhhhhc Confidence 000111111 111111111 1111121 No 125 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=61.09 E-value=0.36 Score=23.02 Aligned_cols=296 Identities=12% Similarity=0.107 Sum_probs=132.2 Q ss_pred 
CChH---HHHHHHHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccc Q lcl|Aclame:pro 1 MRKE---TRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASR 77 (337) Q Consensus 1 M~~~---tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~R 77 (337) |++- ||.-+ +..+ ++.+- | + ..-.-...++.+.++-|+.+.++-.++. |..+-+- ..|| T Consensus 1 ms~~~~~t~~~~-------~~s~--~d~al-~-l-e~f~geV~~af~~~s~~~~~~~~rti~~--g~s~~~~----~iG~ 62 (335) T protein:vir:78 1 MSFLNDLTRPNY-------AGKN--ADVDI-H-L-EEHLGIVDKHFAYTSKFAPLMNIRDLRG--SNVVRLD----RLGN 62 (335) T ss_pred CCcccccccccc-------cccc--chhhh-h-h-hhhhhHHHHHHHHhhhhccccceeeecc--ceeEEEe----eeee Confidence 6553 23221 1111 11111 1 1 1112345567888999998888775532 4444332 3333 Q ss_pred cCCCC-cccccccccccCCceeEEE-EeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhh--hhHHhcccccccCCcC Q lcl|Aclame:pro 78 TDTTK-AARQPIDPTALDSNRYRCE-KTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGAL--DRIMIGWNGVKAAATT 153 (337) Q Consensus 78 t~t~~-~~R~p~~~~~l~~~~Y~c~-qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aL--D~i~IGfnG~s~A~~T 153 (337) +.-.. .+=++.+.......+..+. .+-.=++..-+.||.|-.+=|+-..+.+.+-+..|- |+-.+ =...++|... T Consensus 63 ~~~~~~~pG~~l~~~~~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~-~~l~~aa~~~ 141 (335) T protein:vir:78 63 VEAKGRRAGEELERSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACL-IQVIKAAAMD 141 (335) T ss_pred eeecccccCcccCCCCcccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHH-HHHHhhcccc Confidence 33211 1111111111222222211 111123455678999999888888888888888887 65433 1122223333 Q ss_pred ChhhhhhhhccchhHHHHHHHhchhhhccccccccCceee-cCCcccccHHHHHHHHHhcccChhHcCC---CCEEEEEC Q lcl|Aclame:pro 154 DRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLV-GKAGDYENLDALVMDIVSSMIDPWFQED---TGLVVICG 229 (337) Q Consensus 154 d~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~-g~ggdy~nLDaLv~d~~~~lid~~~r~~---~~LVvivG 229 (337) .|...|. ||. .|...+-.++. ...+++..|-.++.++.+.|. +..-.+ .|.|++|. 
T Consensus 142 a~~~~~~------~~~-------------~G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~l~-ekdvP~~~~~~rv~vv~ 201 (335) T protein:vir:78 142 APVDLED------AFS-------------PGVLEKLDLTGLTAKEAAEKIVRMHRRVVETFI-ERDLGDAVYSEGLTPMS 201 (335) T ss_pred cccccCC------CcC-------------CCcceeeeeccccccccHHHHHHHHHHHHHHHH-hccCCCCCCCccEEEeC Confidence 3222111 111 01000111111 123477788888999887664 432222 16899999 Q ss_pred HHHHHH--HHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchh------------cEEEEecCc- Q lcl|Aclame:pro 230 RELLHD--KYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSN------------LSIYYQEGA- 294 (337) Q Consensus 230 ~dLl~~--k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~N------------LsiY~Q~gs- 294 (337) .+-... +.-+++|..=..+.-.....-.....+.|.|++..|.||.+++--++|.| --+++|... T Consensus 202 P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~~v~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al 281 (335) T protein:vir:78 202 PRVFSLLLEHDKLMSVEYQATGATNDYVKSRVAILNGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTL 281 (335) T ss_pred hHHHHHHhcccccccccccccccccccccceeEEeeceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecceE Confidence 743221 11123443211111000000012336789999999999988755555543 223344441 Q ss_pred ---------eEEEEEEcccccceeceeeeeeeeeeeccccEEEee--cceeccC Q lcl|Aclame:pro 295 ---------RRRTLKEVPERDRIENYESSNDAYVVEDFGCGCVAE--NIELAAA 337 (337) Q Consensus 295 ---------~RR~~~d~p~r~rve~y~s~Ne~YvVEd~~~~a~ie--ni~~~~a 337 (337) .+........=+-|-.|++. 
+=.+=.+++++.|+ ++.-.+- T Consensus 282 ~t~~~~~~~~e~~~~~~~~~~~i~~~~a~--G~g~lRPe~a~~i~~tg~~~~~~ 333 (335) T protein:vir:78 282 ITAQVAPVQAKLWEDHDQFSWVLDTFQMY--NIGARRPDTAGAIELKGIEAFDI 333 (335) T ss_pred EEEEEEecccceeeccchhhHhhhHHHHc--CCcccCcceEEEEEecCCCcccc Confidence 11111111111222222221 12234677777776 3322222 No 126 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=57.31 E-value=0.44 Score=22.55 Aligned_cols=288 Identities=13% Similarity=0.108 Sum_probs=135.1 Q ss_pred CChHHHHHHHHHHHHHHHhhCchhh--cceEeec--hHHHHHHHHHHHhhHHHhcccceeccchhhce---eee------ Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTGDV--SKKFAVE--PTVQQRLETKMQESSEFLKRINVLPVTELEGE---KLG------ 67 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~--~~~Fsv~--P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge---~v~------ 67 (337) |+=.. -+..-..++-+ .++.++ .--|.++ -.+.+++.+....+=...+ ++||+..-++ .+. T Consensus 3 ~~~~~--~~~~~~~~~~~-~~~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~---~i~v~~~~~~~~et~~~~~~e~ 76 (314) T protein:vir:10 3 IKFDA--EQAKITTHLEQ-MGVEKADAAGIWAVSQLTAALNRAYEKEYAENSVVN---IFPVTNEIPGHAKYFEYPEFDG 76 (314) T ss_pred cchHH--HHHHHHHHHHh-hcccchhhhHHHHHHHHHHHHHHHhhhhccccccce---eeccccCCCCceeEEEeeeecc Confidence 55442 22223333322 332222 2234443 1233333333332323333 3343322221 111 Q ss_pred cccccccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCC-hhHHHHHHHHHHHHHhhhhHHhcccc Q lcl|Aclame:pro 68 LSVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKF-ADFQQRIRDVILNQGALDRIMIGWNG 146 (337) Q Consensus 68 lgv~g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~-~dF~~r~~~~i~~~~aLD~i~IGfnG 146 (337) .|....+...++.- |..-.+.+...-....---+..+++..|.+.+.. 
-+...+-+.+.++..+..+-.|+|+| T Consensus 77 ~G~a~~~~d~~~di-----p~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G 151 (314) T protein:vir:10 77 VGIAQIIADYSDDL-----PLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLVWSG 151 (314) T ss_pred ccceeeeCCccccc-----ceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEee Confidence 12222222222210 1111112222222333334445556777777764 36778888888888888899999999 Q ss_pred cccCCcCChhhhhhhhcc--chhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCE Q lcl|Aclame:pro 147 VKAAATTDRQANPLLQDV--NIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGL 224 (337) Q Consensus 147 ~s~A~~Td~~anPllqDV--N~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~L 224 (337) .+.-..+=.-.+|.+.=+ ..+|- .++.+ |.-+++++..+...- -+...|+- T Consensus 152 ~~~~g~~GLlN~p~v~~~~~~~~Wa------T~~ei------------------~~Di~~~~~~l~~~s---~g~~~p~~ 204 (314) T protein:vir:10 152 SAPHGIVSVFDQPNINNVVATPNWS------VPQNA------------------IDDVTAMIDAVESST---QGLHHVTD 204 (314) T ss_pred cccccceeEeecCCCccccCCCCcc------cHHHH------------------HHHHHHHHHHHHHhc---Ccccccee Confidence 443332222222222100 11220 01111 233444444433210 12234443 Q ss_pred EEEECHHHHHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCc--------eEEecchhcEEEEecCceE Q lcl|Aclame:pro 225 VVICGRELLHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRA--------LMVTKLSNLSIYYQEGARR 296 (337) Q Consensus 225 VvivG~dLl~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~--------iliT~l~NLsiY~Q~gs~R 296 (337) .++..+ ++.++......+-.-..+.+ .+..=+|....+|.+-..+ +..+.-+|+++=+-.. 
.| T Consensus 205 -l~Lpp~-----~~~~L~~~~~~~~~tvl~~l--~~n~~~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~-~~ 275 (314) T protein:vir:10 205 -ILLPAS-----ARRVMQGLVPQTNLSYGELF--TRNNPGLTIRFLQFLDNYDGAGGKAALAFEKSPLNMSIEIPEV-TN 275 (314) T ss_pred -EEecHH-----HHHhhcccccCCCccHHHHH--HHhCCCcEEEEcccccccCCCcceEEEEEecCCcEEEEecCcc-ce Confidence 334443 33344433223334444554 4556688999999987554 2336677777655443 23 Q ss_pred EEEEEcccccceeceeeeeeeeeeeccccEEEeecceec Q lcl|Aclame:pro 297 RTLKEVPERDRIENYESSNDAYVVEDFGCGCVAENIELA 335 (337) Q Consensus 297 R~~~d~p~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~ 335 (337) ++-..--...-.+.|..+--|-+|=.+.++|.+++|+|+ T Consensus 276 ~l~~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~dGI~~~ 314 (314) T protein:vir:10 276 VLPAQPKDLHFRYPVTSKATGLIVYRPLTMAVIKGITFA 314 (314) T ss_pred eecceecCceEEEcceeeeEEEEEECcceeEeeeeeecC Confidence 333333334556667777777888899999999999999 No 127 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=53.29 E-value=0.53 Score=22.09 Aligned_cols=292 Identities=12% Similarity=0.064 Sum_probs=131.4 Q ss_pred CChHHHHHHHHHHHH----HHHhhCch-hh--cceEe------echHHHHHHHHHHHhhHHHhcccceeccchhhceeee Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQ----IAKLNDTG-DV--SKKFA------VEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLG 67 (337) Q Consensus 1 M~~~tr~~~~~y~~~----~a~~ngv~-~~--~~~Fs------v~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~ 67 (337) |++. .+++++-.. .++.-+.. +. .--|. |+|.+-++....+. -..|+.--+..+ .-=|.+. 
T Consensus 6 ~~~~--~~~d~~~~~~~a~~~~~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~-~~~~i~i~~~~~---~~~~~~t 79 (329) T protein:vir:79 6 MSKE--MKYDEFEANVIANHMQLRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGS-ALRVFPVTSELS---DTDKTFE 79 (329) T ss_pred hhhh--hccchhhhhhHhhhcccccceeccchhhHHHHHHHHHHHHHHHhhhhcccc-hhhhcccccCCC---CceeEEE Confidence 4432 223322222 22222211 11 11122 22222222222222 122322222111 1111111 Q ss_pred c------ccccccccccCCCCcccccccccccCCceeEEEEeeeeeecCHHHHHHHhCC-hhHHHHHHHHHHHHHhhhhH Q lcl|Aclame:pro 68 L------SVSGPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKF-ADFQQRIRDVILNQGALDRI 140 (337) Q Consensus 68 l------gv~g~ia~Rt~t~~~~R~p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~-~dF~~r~~~~i~~~~aLD~i 140 (337) . |....+.+.++. -|..-.+.+...-.....--+..+.+..|.+.+.. -+...+-+.+.++..+..+= T Consensus 80 ~~~~~~~G~a~~~~d~~~d-----ip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n 154 (329) T protein:vir:79 80 YQTFDKVGHAKIIADYTDD-----LSTVDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVN 154 (329) T ss_pred eeeeecceeeeeecCcccc-----cceeecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhc Confidence 1 222222222111 01111222222233444444556777888888764 36778888888999999999 Q ss_pred HhcccccccCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChh--- Q lcl|Aclame:pro 141 MIGWNGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPW--- 217 (337) Q Consensus 141 ~IGfnG~s~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~--- 217 (337) .|+|+|.+....+=.-.+|.++-+..| ..+ .+.-..++-|.++.|+.+-+..-| T Consensus 155 ~i~f~G~~~~g~~GLlN~p~v~~~~~~-----------------~~~------~~~w~~kt~~ei~~di~~~~~~l~~~s 211 (329) T protein:vir:79 155 HLVFKGSKPHKIISVFEHPNLTTINSA-----------------GWN------NAAGTGKKPETAQDELEQAIEKIETLT 211 (329) T ss_pred cEEEeecccccceeeecCCCccccccC-----------------CCC------CccccccCHHHHHHHHHHHHHHHHHhc Confidence 999999543332222222322211110 000 001111333444444333211111 Q ss_pred -HcCCCCEEEEECHHHHHHHHHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCC------ceEE--ecchhcEE Q 
lcl|Aclame:pro 218 -FQEDTGLVVICGRELLHDKYFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKR------ALMV--TKLSNLSI 288 (337) Q Consensus 218 -~r~~~~LVvivG~dLl~~k~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~------~ili--T~l~NLsi 288 (337) +...|+ ..++..++... +......+-....+.+ .+.+-++....+|.+=.. .+++ +.-+|+.+ T Consensus 212 ~g~~~p~-~L~Lpp~~~~~-----L~~~~~~~~~tvl~~l--k~~~~~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~ 283 (329) T protein:vir:79 212 NGQHRAN-MILIPPSMRKV-----LMVRMPETTMSYLDYF--KQQNGGITIESISELEDIDGAGTKAALVYEKDPMNMSI 283 (329) T ss_pred Cceeccc-EEEecHHHHHH-----hhcccCCCCccHHHHH--HHhCCCcEEEEcccccccCCCCceEEEEEecCCceEEE Confidence 223344 45556654432 2221112223334544 445667888888888432 2232 67777777 Q ss_pred EEecCceEEEEEEcccccceeceeeeeeeeeeeccccEEEeecceec Q lcl|Aclame:pro 289 YYQEGARRRTLKEVPERDRIENYESSNDAYVVEDFGCGCVAENIELA 335 (337) Q Consensus 289 Y~Q~gs~RR~~~d~p~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~ 335 (337) =+-. ..|++-..--...-.+.|..+--+-+|=-+.++|.+++|.++ T Consensus 284 ~vp~-~~~~l~~q~~~~~~~v~~~~r~~Gv~i~~P~ai~~~dGI~~~ 329 (329) T protein:vir:79 284 EIPE-AFNMLTAQPKDLHFKVPCTSKCTGLTIYRPLTLVLIKGLVVG 329 (329) T ss_pred ecCc-ceeeeeceecCceEEEceeeeEEEEEEECcceeeeeeeeeeC Confidence 5544 333333333334455677787777888889999999999999 No 128 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=49.56 E-value=0.64 Score=21.66 Aligned_cols=301 Identities=13% Similarity=0.058 Sum_probs=122.0 Q ss_pred CChHHHHHHHHHHHHHHHhhCchhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccCC Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~t 80 (337) |-+.+-..--.-..+.+. 
+|..+...-|- ..-.-.+..+.+++|-|+.+.++-.++ .|..+-+-..|...-+.-+ T Consensus 1 ~~~~~~~~~~~~~~~~~~-~~~~~~~al~l--e~f~geV~~~f~~~s~~~~~~~~r~i~--~gks~~~~~iG~~~~~~~~ 75 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGV-VAAGDKLALFL--KVFGGEVLTAFARTSVTTSRHMVRSIS--SGKSAQFPVLGRTQAAYLA 75 (345) T ss_pred Ccccccchhccccccccc-ccCCchhHHHH--HHHhHHHHHHHHHHhhhcccceeeecc--ccceEEEeeecceEEEeee Confidence 333222110000000000 01111000000 122345678889999999888876554 2665555544443333222 Q ss_pred CCcc----cc----cccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhh--hhHHhcccccccC Q lcl|Aclame:pro 81 TKAA----RQ----PIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGAL--DRIMIGWNGVKAA 150 (337) Q Consensus 81 ~~~~----R~----p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aL--D~i~IGfnG~s~A 150 (337) .+.+ ++ ....+.+++-.|. +..-+.+|.|..+-||...+.+...+..|- |.-.+.=-+ .+| T Consensus 76 ~G~~l~~~~~~~~~~e~~ltID~~~y~--------~~~VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~-k~a 146 (345) T protein:vir:22 76 PGENLDDKRKDIKHTEKVITIDGLLTA--------DVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIA-GLC 146 (345) T ss_pred cCCCCCCCCCCcccceEEEEecchhhh--------hhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHH-Hhh Confidence 2211 10 1111334444443 334458999999999999888888877665 332221111 112 Q ss_pred CcCChh-hhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCCCCEEEEEC Q lcl|Aclame:pro 151 ATTDRQ-ANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICG 229 (337) Q Consensus 151 ~~Td~~-anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~~~LVvivG 229 (337) ..++|. .||- ||....-- .+...+...+.. ...+ .++=+.+.++.. 
.+|+..-...+.+++|+ T Consensus 147 ~~~~~~~~~~~------~~~~~~~~----~~~~~g~~~t~~---~~~~--~~~~~ai~~a~~-~Lde~~VP~~~R~~vv~ 210 (345) T protein:vir:22 147 NVESKYNENIE------GLGTATVI----ETTQNKAALTDQ---VALG--KEIIAALTKARA-ALTKNYVPAADRVFYCD 210 (345) T ss_pred ccccccccccc------cccccccc----cccccccccccc---ccCH--HHHHHHHHHHHH-HhhhcCCCccCCEEEeC Confidence 222221 1111 22111000 000000000000 0011 233344556654 56888888788999999 Q ss_pred HHHHHHH-HHHHHhccCChHHHHHHHHHHhhhhhcCccccccCccCCCceE----E------ecc------------hh- Q lcl|Aclame:pro 230 RELLHDK-YFPIVNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALM----V------TKL------------SN- 285 (337) Q Consensus 230 ~dLl~~k-~~~l~n~~~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~il----i------T~l------------~N- 285 (337) ++....- ..+.++..+.-....... ....++.|.+++..|.+|....= . +.. +| T Consensus 211 P~~y~~Ll~~~~~~~~~~~~~~~~~~--G~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 288 (345) T protein:vir:22 211 PDSYSAILAALMPNAANYAALIDPEK--GSIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNV 288 (345) T ss_pred hHHHHHHhcccccccccccccccccc--ceEEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCce Confidence 9654421 111111111111000000 12346789999999999853110 0 001 11 Q ss_pred cEEEEecCceE-EEEEE-cccccceeceee-ee-----eeeeeeccccEEEee-cce Q lcl|Aclame:pro 286 LSIYYQEGARR-RTLKE-VPERDRIENYES-SN-----DAYVVEDFGCGCVAE-NIE 333 (337) Q Consensus 286 LsiY~Q~gs~R-R~~~d-~p~r~rve~y~s-~N-----e~YvVEd~~~~a~ie-ni~ 333 (337) --+.+|+...= -+..+ ..|.-|=+.|++ -- .|.-|=.+++++.|. 
.|| T Consensus 289 ~~l~~h~~A~~~v~~~~~~~e~~r~~~~~~d~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 289 IGLFMHRSAVGTVKLRDLALERARRANFQADQIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred EEEEEehhheeeeeeecceeeeeechhHHHHHHHHHHhcCCcccccceeEEEEEeeC Confidence 11333333210 00011 111111122221 00 112233556655554 555 No 129 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=29.24 E-value=1.7 Score=19.35 Aligned_cols=279 Identities=14% Similarity=0.095 Sum_probs=117.7 Q ss_pred CChHHHHHHHHHHHHHHHhhCchhhcceEeec-hHHHHHHHHHHHhhHHHhcccceeccchhhceeeecccccccccccC Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVE-PTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTD 79 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~-P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~Rt~ 79 (337) |.+-.-.-+ .+.+ ++.....-++- ..-.-....+.++++-|+.+.++-.++. |..+-+-.-|...-..- T Consensus 1 m~~~~~~~~----t~~~----~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~--G~s~~~~~iG~~~~~~~ 70 (334) T protein:vir:80 1 MTYPAANTH----TRPG----WGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSLRG--TNQLRVDRVGASTIAGR 70 (334) T ss_pred CCCCcCCCc----cccc----cccccchheehhhhhhhHHHHHHHHhhhhhccceeeeccc--cceEEEeeecceeeeee Confidence 443211000 0000 11111111111 1222345678888999998888765532 76666654443332222 Q ss_pred CCCcccccccccccCCceeEEEEee-eeeecCHHHHHHHhCChhHHHHHHHHHHHHHhh--hhHHhcccccccCCcCChh Q lcl|Aclame:pro 80 TTKAARQPIDPTALDSNRYRCEKTD-YDTAIPYRKLDMWAKFADFQQRIRDVILNQGAL--DRIMIGWNGVKAAATTDRQ 156 (337) Q Consensus 80 t~~~~R~p~~~~~l~~~~Y~c~qtn-~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aL--D~i~IGfnG~s~A~~Td~~ 156 (337) +.+. +-+...+...+-.|.=-+ .=++..-+.||.|-.+-||...+.+...+..|- |+-.+. -..++|....|. 
T Consensus 71 ~~g~---~l~~~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~-~l~kaa~~~~~~ 146 (334) T protein:vir:80 71 KAGE---ELVVQKNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACII-QLQKCGDFLAPA 146 (334) T ss_pred cCCC---CCCCCCcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHH-HHHHhhhhcccc Confidence 2111 222222333444443222 233556678999999999999999999998888 753321 111222222221 Q ss_pred hhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCccc-ccHHHH---HHHHHhcccChhHcC---CCCEEEEEC Q lcl|Aclame:pro 157 ANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDY-ENLDAL---VMDIVSSMIDPWFQE---DTGLVVICG 229 (337) Q Consensus 157 anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy-~nLDaL---v~d~~~~lid~~~r~---~~~LVvivG 229 (337) .++. || ..|..+.-.+ .|...+. .+-|+| ..++.+. +++.--. ..+.|++|+ T Consensus 147 ~~~~------~~-------------~~G~~~~~~~-~g~~~~~~~~~~~l~~a~~~a~~~-L~e~dvp~~~~~~R~~vv~ 205 (334) T protein:vir:80 147 HLKP------AF-------------HDGILLPSTI-SGLAADAAADADVLVAAHRQGVEA-MVFRDLGDQLMSEGVTLLD 205 (334) T ss_pred cccc------cc-------------cCCcceeecc-cccccchhhhHHHHHHHHHHHHHH-HHhcCCCCCcCCceEEEeC Confidence 1100 00 0010000000 1221222 223333 3466654 4554333 236899999 Q ss_pred HHH----HHHHHHHHHhcc--CChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEecCceEEEEEEcc Q lcl|Aclame:pro 230 REL----LHDKYFPIVNAT--QAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVP 303 (337) Q Consensus 230 ~dL----l~~k~~~l~n~~--~~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~gs~RR~~~d~p 303 (337) ..- +.++ +++|.. ...+-...+. 
....++.|.|++..|.||...+--..+ +-+.....-+ T Consensus 206 P~~y~~Ll~~~--r~~n~d~~~s~~~~~~~~--g~i~~v~G~~V~~Sn~~P~~~~t~~~~----------g~~~~~~agd 271 (334) T protein:vir:80 206 PVIFSFLLEHD--RLMNVEFGAKEGGNSFVG--GRIAMLNGVRVVETPRFPQSAITANAL----------GADFNVTDAE 271 (334) T ss_pred hHHHHHHhccc--ccccceeccccccccccc--eeEEEEeceEEEeecCCCCcccccccc----------cccccccccc Confidence 753 3332 245431 1111110011 124478899999999999775322211 1111111111 Q ss_pred cccceeceeeeeeeeeeeccccEEEeecceec-cC Q lcl|Aclame:pro 304 ERDRIENYESSNDAYVVEDFGCGCVAENIELA-AA 337 (337) Q Consensus 304 ~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~-~a 337 (337) --.++.-+..+. ....+|-+.+. |. T Consensus 272 ~t~~~~~~~~~~---------Al~t~~~~~~~~e~ 297 (334) T protein:vir:80 272 VRRKMITFIPSM---------ALISAQVHPVSAQF 297 (334) T ss_pred ccceEEEEEeCc---------eEEEEEEeecceee Confidence 111111111111 11122211111 11 No 130 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=28.66 E-value=1.7 Score=19.28 Aligned_cols=285 Identities=12% Similarity=0.100 Sum_probs=115.0 Q ss_pred CChHHHHHHHHHHHHHHHhh--Cc--hhhcceEeechHHHHHHHHHHHhhHHHhcccceeccchhhceeeeccccccccc Q lcl|Aclame:pro 1 MRKETRQAYEKYAAQIAKLN--DT--GDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIAS 76 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~n--gv--~~~~~~Fsv~P~~~q~L~~~iqess~FL~~Inv~~V~~~~Ge~v~lgv~g~ia~ 76 (337) -++.-|-.|+....+.=++- .+ .++ .+ -.....++.+.+.+.+.|++|+.+....| +|-....- T Consensus 5 ~~~~~~~~~~~~~~~~p~l~m~alTLaea-~~-l~~d~~~~~VIE~l~~~s~iL~~lpf~~v---e~~~~~~~------- 72 (330) T protein:vir:94 5 CTPPLRGRWRTLTHQFPELKMPTVTLAES-AK-LSQDHLVSGLIETIVEVNPLYEMMPFTEI---EGNALAYN------- 72 (330) T ss_pred cCCccccceeehhccccccchhhhhhhHH-hh-cCchhhHHHHHHhhhccchHHhhcccccc---cCCcceee------- Confidence 11111222222211100000 00 011 11 22456788888999999999988765443 33221111 Q ss_pred 
ccCC--CCcccc------cccccccCCceeEEEEeeeeeecCHHHHHHHhCChhHHHHHHHHHHHHHhhhhHHhcccccc Q lcl|Aclame:pro 77 RTDT--TKAARQ------PIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVK 148 (337) Q Consensus 77 Rt~t--~~~~R~------p~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~WA~~~dF~~r~~~~i~~~~aLD~i~IGfnG~s 148 (337) |+.+ +..-|. |..+.........|.-..-+.-+.=...|..-+--|+...--...++.++....--=+||.| T Consensus 73 r~~~lp~a~~r~~n~~~~~~~~~Tf~q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~~e~~linGDs 152 (330) T protein:vir:94 73 RENVLGDVQFLAVGGTITAKNPATFTKVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQYQASMITGDG 152 (330) T ss_pred eeecCCcceeeeccccccccCcceeeeeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhhccCC Confidence 2211 111121 11111111222222221111111111222222223455444555555555555555578843 Q ss_pred cCCcCChhhhhhhhccchhHHHHHHHhchhhhccccccccCceeecCCcccccHHHHHHHHHhcccChhHcCC-CCEEEE Q lcl|Aclame:pro 149 AAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQED-TGLVVI 227 (337) Q Consensus 149 ~A~~Td~~anPllqDVN~GWlq~~Re~a~~~v~~~~~~~~~~i~~g~ggdy~nLDaLv~d~~~~lid~~~r~~-~~LVvi 227 (337) . |.-. -|=++.+ ++.+++. .|..|-.-++|. +..||+..+..+ ..-+++ T Consensus 153 ~--------~~~F----~GL~~~~---~~~q~i~----------tg~~gg~~T~d~-----LDeLl~~v~~~~g~~~~~l 202 (330) T protein:vir:94 153 T--------GNSF----QGMMGLV---AASQTIS----------AGANGGTLTFEL-----LDQLLDLVKDKDGQVDYLM 202 (330) T ss_pred C--------Cccc----cchhhcC---CcccEEe----------cCCCCCCCCHHH-----HHHHHHHhcCCCCCCcEEE Confidence 2 1000 0222221 3334432 232233344443 234444443322 123666 Q ss_pred ECHHHHHHHHHHHHh----ccC-ChHHHHHHHHHHhhhhhcCccccccCccCCCceEEecchhcEEEEec---------- Q lcl|Aclame:pro 228 CGRELLHDKYFPIVN----ATQ-APTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQE---------- 292 (337) Q Consensus 228 vG~dLl~~k~~~l~n----~~~-~ptE~~A~~~~~~~k~iGGlpa~~vPffP~~~iliT~l~NLsiY~Q~---------- 292 (337) +.+..... 
...+.- ..- +++.-.-+.- .-+++|.|.+..-+.|.+.--.|.-.==|||.=+ T Consensus 203 ~n~a~~r~-I~a~~R~~~~~~v~~~~~~~~G~~---v~~~~GvPi~~~d~ip~~~~~~~~~~ttsIyav~~G~~~~~qgV 278 (330) T protein:vir:94 203 SSFAMRRK-YFSLLRALGGAAIGEVMTLPSGRQ---IPTYRGVPWFVNDFIPSNMTQGTATNATAIFAGTFDDGSNKYGI 278 (330) T ss_pred echhHHHH-HHHHHHhccCCCCCCcccccCCCE---EeeeCCeEEEecccccCCCCcccCCCceeEEEEeecccccccce Confidence 66654432 111211 111 1222211111 1267787777766777653211221122444332 Q ss_pred ---------CceEEEEEEc----ccccceeceeeeeeeeeeeccccEEEeecceec Q lcl|Aclame:pro 293 ---------GARRRTLKEV----PERDRIENYESSNDAYVVEDFGCGCVAENIELA 335 (337) Q Consensus 293 ---------gs~RR~~~d~----p~r~rve~y~s~Ne~YvVEd~~~~a~ieni~~~ 335 (337) |-.=|.+-.. -.|=+|+-|. +-+|-...+++.++||+++ T Consensus 279 ~Gl~~~g~~glsVr~~G~~~~k~v~~~~v~~y~----~~av~~~~a~~~L~~V~~g 330 (330) T protein:vir:94 279 AGLTARGSAGLRVQNVGAKENADETITRVKMYC----GFANFSQLGLAAIKGLIPG 330 (330) T ss_pred EeecCCCCCcceeeeCCCccccceeeEEEEEee----eeEEechhheeeeccccCC Confidence 1111222111 1234555554 4688899999999999999 Done!