Query         lcl|NC_015266.1_cdsid_YP_004306451.1 [gene=39] [protein=gp39] [protein_id=YP_004306451.1] [location=complement(26405..27418)]
Match_columns 337
No_of_seqs    113 out of 230
Neff          5.1
Searched_HMMs 1612
Date          Thu Nov 7 12:59:10 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_39 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_39_vs_rec_db.hhr

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM Template HMM
  1 protein:vir:78186 Length: 337  100.0  3E-191  2E-194 1064.9  30.3  337   1-337    1-337 (337)
  2 protein:vir:79171 Length: 337  100.0  3E-190  2E-193 1059.3  30.4  337   1-337    1-337 (337)
  3 protein:vir:104011 Length: 337 100.0  4E-190  2E-193 1058.9  30.4  337   1-337    1-337 (337)
  4 protein:vir:79157 Length: 339  100.0  5E-190  3E-193 1058.2  30.0  337   1-337    1-338 (339)
  5 protein:vir:100331 Length: 342 100.0  4E-188  2E-191 1048.2  30.5  336   1-337    1-341 (342)
  6 protein:vir:6061 Length: 357 # 100.0  7E-188  4E-191 1046.6  30.0  337   1-337    1-345 (357)
  7 protein:vir:2016 Length: 357 # 100.0  7E-188  4E-191 1046.7  29.9  337   1-337    1-345 (357)
  8 protein:vir:5694 Length: 357 # 100.0  8E-188  5E-191 1046.4  29.9  337   1-337    1-345 (357)
  9 protein:vir:98566 Length: 355  100.0  8E-187  5E-190 1041.0  30.2  337   1-337    1-345 (355)
 10 protein:vir:1829 Length: 355 # 100.0  1E-186  7E-190 1039.9  30.4  337   1-337    1-345 (355)
 11 protein:vir:1153 Length: 338 # 100.0  1E-185  7E-189 1034.6  30.5  335   1-336    1-338 (338)
 12 protein:vir:78777 Length: 358  100.0  8E-181  5E-184 1008.1  30.0  330   1-337    5-341 (358)
 13 protein:vir:98856 Length: 343  100.0  5E-175  3E-178  976.3  29.6  328   1-337    1-336 (343)
 14 protein:vir:3746 Length: 336 # 100.0  1E-174  9E-178  973.7  28.7  326   4-337    1-333 (336)
 15 protein:vir:3783 Length: 336 # 100.0  3E-174  2E-177  971.7  28.7  326   4-337    1-333 (336)
 16 protein:vir:270 Length: 341 #  100.0  2E-172  1E-175  962.1  26.4  323   1-337    5-332 (341)
 17 protein:vir:3158 Length: 321 # 100.0 4.2E-79 2.6E-82  450.3  22.5  307   4-337    1-315 (321)
 18 protein:vir:99424 Length: 360  100.0   9E-47 5.6E-50  273.0  20.3  328   1-337    1-360 (360)
 19 protein:vir:4197 Length: 314 # 100.0 4.4E-42 2.7E-45  247.3  19.7  302   1-335    1-314 (314)
 20 protein:vir:4159 Length: 315 # 100.0 7.5E-39 4.7E-42  229.6  17.8  305   1-332    1-315 (315)
 21 protein:vir:100247 Length: 425  98.8 1.1E-09 6.8E-13   69.7  16.0  308   1-337  109-424 (425)
 22 protein:vir:4092 Length: 390 #  98.7 9.9E-09 6.1E-12   64.5  17.8  292   1-337   72-368 (390)
 23 protein:vir:100135 Length: 418  98.6   2E-08 1.3E-11   62.8  17.2  297   1-337  104-418 (418)
 24 protein:vir:9410 Length: 415 #  98.5 3.2E-08   2E-11   61.7  16.7  295   1-337  101-407 (415)
 25 protein:vir:79987 Length: 415   98.5 4.2E-08 2.6E-11   61.0  17.2  296   1-337  101-407 (415)
 26 protein:vir:81100 Length: 415   98.5 4.2E-08 2.6E-11   61.0  17.2  296   1-337  101-407 (415)
 27 protein:vir:98339 Length: 415   98.5 4.2E-08 2.6E-11   61.0  17.2  296   1-337  101-407 (415)
 28 protein:vir:4339 Length: 395 #  98.5 5.6E-08 3.5E-11   60.3  16.8  295   1-337   89-395 (395)
 29 protein:vir:4511 Length: 409 #  98.5 5.6E-08 3.4E-11   60.4  16.7  296   1-337   84-406 (409)
 30 protein:vir:95376 Length: 425   98.4 8.5E-08 5.3E-11   59.3  16.8  292   1-337  111-425 (425)
 31 protein:vir:4700 Length: 415 #  98.4 7.9E-08 4.9E-11   59.5  16.5  296   1-337  101-407 (415)
 32 protein:vir:4600 Length: 415 #  98.4 7.9E-08 4.9E-11   59.5  16.5  296   1-337  101-407 (415)
 33 protein:vir:4456 Length: 401 #  98.4   8E-08   5E-11   59.5  16.3  305   1-337   79-401 (401)
 34 protein:vir:94771 Length: 298   98.4 4.9E-08   3E-11   60.7  14.9  279  16-333    1-298 (298)
 35 protein:vir:1328 Length: 392 #  98.4 6.5E-08   4E-11   60.0  15.4  290   1-337   85-391 (392)
 36 protein:vir:94142 Length: 304   98.4 3.9E-08 2.4E-11   61.2  13.7  280   1-333    1-304 (304)
 37 protein:vir:105905 Length: 304  98.4 3.9E-08 2.4E-11   61.2  13.7  280   1-333    1-304 (304)
 38 protein:vir:6242 Length: 390 #  98.4 1.1E-07 6.9E-11   58.7  16.0  291   1-337   81-389 (390)
 39 protein:vir:485 Length: 407 #   98.3 4.9E-07   3E-10   55.2  18.0  303   1-337   78-400 (407)
 40 protein:vir:104085 Length: 320  98.2   2E-07 1.2E-10   57.3  14.5  298   1-337    1-320 (320)
 41 protein:vir:97053 Length: 390   98.2 4.2E-07 2.6E-10   55.6  15.8  291   1-335   80-390 (390)
 42 protein:vir:78523 Length: 338   98.2 3.7E-07 2.3E-10   55.8  15.4  298   1-337    1-338 (338)
 43 protein:vir:4226 Length: 326 #  98.2 2.8E-07 1.8E-10   56.5  14.5  299   1-337    3-326 (326)
 44 protein:vir:7771 Length: 330 #  98.2 2.1E-07 1.3E-10   57.2  13.7  295   1-337    1-325 (330)
 45 protein:vir:10364 Length: 390   98.2 1.1E-06 6.8E-10   53.3  17.5  292   1-335   83-390 (390)
 46 protein:vir:80376 Length: 435   98.1 1.4E-06 8.4E-10   52.7  17.6  298   1-336   88-435 (435)
 47 protein:vir:3991 Length: 404 #  98.1 8.7E-07 5.4E-10   53.8  16.5  282   1-337   89-396 (404)
 48 protein:vir:1025 Length: 408 #  98.1 9.6E-07 5.9E-10   53.6  16.4  283   1-337   89-396 (408)
 49 protein:vir:94673 Length: 419   98.1 7.4E-07 4.6E-10   54.2  15.5  297   1-337   98-417 (419)
 50 protein:vir:103955 Length: 324  98.1 1.3E-06 8.1E-10   52.8  16.6  292   1-337    1-318 (324)
 51 protein:vir:1638 Length: 298 #  98.1 6.1E-07 3.8E-10   54.6  14.5  281  16-333    1-298 (298)
 52 protein:vir:2504 Length: 305 #  98.1 1.6E-06 9.9E-10   52.4  16.6  281  19-337    1-301 (305)
 53 protein:vir:81160 Length: 371   98.1 2.2E-06 1.4E-09   51.6  17.3  276   1-337   71-371 (371)
 54 protein:vir:81070 Length: 390   98.0   2E-06 1.2E-09   51.9  16.8  290   1-335   80-390 (390)
 55 protein:vir:96223 Length: 324   98.0 3.5E-06 2.1E-09   50.5  18.0  292   1-337    1-318 (324)
 56 protein:vir:100172 Length: 394  98.0 2.8E-06 1.7E-09   51.0  17.2  282   1-337   88-387 (394)
 57 protein:vir:7855 Length: 497 #  98.0 3.5E-06 2.2E-09   50.5  17.6  324   1-337  130-496 (497)
 58 protein:vir:101650 Length: 497  98.0 3.5E-06 2.2E-09   50.5  17.6  324   1-337  130-496 (497)
 59 protein:vir:191 Length: 385 #   98.0 1.1E-06   7E-10   53.2  14.8  291   1-337   74-384 (385)
 60 protein:vir:1886 Length: 385 #  98.0 1.1E-06   7E-10   53.2  14.8  291   1-337   74-384 (385)
 61 protein:vir:95763 Length: 297   98.0   1E-06 6.5E-10   53.4  14.3  279   1-335    1-297 (297)
 62 protein:vir:6212 Length: 434 #  97.9   4E-06 2.5E-09   50.2  16.6  288   1-337  119-431 (434)
 63 protein:vir:78830 Length: 324   97.9 5.2E-06 3.2E-09   49.5  17.1  292   1-337    1-319 (324)
 64 protein:vir:96392 Length: 324   97.9 5.2E-06 3.2E-09   49.5  17.1  292   1-337    1-319 (324)
 65 protein:vir:41 Length: 299 # N  97.9 2.4E-06 1.5E-09   51.4  14.8  272  20-337    1-298 (299)
 66 protein:vir:1268 Length: 397 #  97.9 6.6E-06 4.1E-09   49.0  17.0  280   1-337   87-394 (397)
 67 protein:vir:104256 Length: 458  97.9 5.6E-06 3.5E-09   49.4  16.6  293   1-337  126-458 (458)
 68 protein:vir:8102 Length: 543 #  97.9 6.1E-06 3.8E-09   49.2  16.7  291   1-337  217-542 (543)
 69 protein:vir:99749 Length: 324   97.8 8.8E-06 5.5E-09   48.3  17.2  292   1-337    1-318 (324)
 70 protein:vir:7409 Length: 408 #  97.8 8.6E-06 5.3E-09   48.4  17.1  284   1-337   82-396 (408)
 71 protein:vir:78223 Length: 333   97.8 6.5E-06   4E-09   49.0  16.4  294   6-337    1-332 (333)
 72 protein:vir:97148 Length: 324   97.8 1.3E-05   8E-09   47.4  17.4  292   1-337    1-318 (324)
 73 protein:vir:9759 Length: 303 #  97.7 3.9E-06 2.4E-09   50.2  13.9  283  20-334    1-303 (303)
 74 protein:vir:1433 Length: 435 #  97.7 1.8E-05 1.1E-08   46.6  17.4  297   1-336   91-435 (435)
 75 protein:vir:81227 Length: 413   97.7 1.7E-05 1.1E-08   46.7  17.2  293   1-337   85-410 (413)
 76 protein:vir:3845 Length: 395 #  97.7 1.3E-05 7.9E-09   47.4  16.0  282   1-337   86-386 (395)
 77 protein:vir:4953 Length: 397 #  97.7 2.6E-05 1.6E-08   45.7  17.5  280   1-337   86-385 (397)
 78 protein:vir:3870 Length: 400 #  97.7 1.2E-05 7.4E-09   47.6  15.3  278   1-337  101-399 (400)
 79 protein:vir:9643 Length: 377 #  97.6 2.9E-05 1.8E-08   45.5  17.2  284   1-335   67-377 (377)
 80 protein:vir:9574 Length: 300 #  97.6 1.5E-05 9.6E-09   47.0  15.5  281  16-337    1-300 (300)
 81 protein:vir:9309 Length: 324 #  97.6 3.4E-05 2.1E-08   45.1  17.4  292   1-337    1-318 (324)
 82 protein:vir:2430 Length: 318 #  97.6 1.2E-05 7.3E-09   47.6  14.9  292   1-337    1-316 (318)
 83 protein:vir:95963 Length: 395   97.6 1.3E-05 8.1E-09   47.4  14.4  292   1-337   75-378 (395)
 84 protein:vir:102119 Length: 404  97.5   3E-05 1.8E-08   45.4  15.8  296   1-337   80-400 (404)
 85 protein:vir:100884 Length: 389  97.5 4.8E-05   3E-08   44.2  16.6  279   1-337   83-382 (389)
 86 protein:vir:9509 Length: 381 #  97.5 3.8E-05 2.3E-08   44.8  15.9  287   1-337   65-370 (381)
 87 protein:vir:101291 Length: 381  97.5 3.8E-05 2.3E-08   44.8  15.9  287   1-337   65-370 (381)
 88 protein:vir:4997 Length: 397 #  97.5 6.4E-05   4E-08   43.6  17.2  281   1-337   86-385 (397)
 89 protein:vir:4830 Length: 397 #  97.4 4.8E-05   3E-08   44.2  16.0  279   1-337   86-385 (397)
 90 protein:vir:80684 Length: 315   97.4 7.7E-05 4.8E-08   43.1  17.2  285  16-337    1-309 (315)
 91 protein:vir:8187 Length: 311 #  97.4 6.2E-05 3.9E-08   43.6  15.8  283  16-335    1-311 (311)
 92 protein:vir:1383 Length: 421 #  97.3 4.8E-05   3E-08   44.2  15.0  276   1-337   92-392 (421)
 93 protein:vir:101607 Length: 379  97.3 8.2E-05 5.1E-08   43.0  15.7  280   1-332   77-379 (379)
 94 protein:vir:8420 Length: 477 #  97.3 9.8E-05 6.1E-08   42.6  16.0  298   1-337  115-474 (477)
 95 protein:vir:2344 Length: 397 #  97.2   5E-05 3.1E-08   44.2  14.2  287   1-337    1-309 (397)
 96 protein:vir:1084 Length: 437 #  97.2 8.9E-05 5.5E-08   42.8  15.6  282   1-337  136-434 (437)
 97 protein:vir:9704 Length: 394 #  97.2 7.1E-05 4.4E-08   43.3  14.8  277   1-337  103-390 (394)
 98 protein:vir:99920 Length: 311   97.2 8.2E-05 5.1E-08   43.0  15.1  289  16-337    1-309 (311)
 99 protein:vir:962 Length: 397 #   97.1 6.5E-05   4E-08   43.6  13.8  272   1-337  112-397 (397)
100 protein:vir:4856 Length: 293 #  97.1 0.00012 7.2E-08   42.2  14.9  264  16-337    1-281 (293)
101 protein:vir:105038 Length: 428  97.0 0.00022 1.3E-07   40.7  16.8  294   1-334   83-428 (428)
102 protein:vir:9361 Length: 402 #  97.0 0.00014 8.4E-08   41.8  14.3  273   1-337   98-396 (402)
103 protein:vir:93881 Length: 387   97.0 0.00019 1.2E-07   41.0  15.1  273   1-337   83-381 (387)
104 protein:vir:5739 Length: 366 #  96.9 0.00028 1.8E-07   40.0  16.3  295   1-334   20-366 (366)
105 protein:vir:78640 Length: 352   96.8 0.00033 2.1E-07   39.6  16.2  275   1-337   46-346 (352)
106 protein:vir:78350 Length: 383   96.8 0.00027 1.6E-07   40.2  14.5  295   1-337   72-377 (383)
107 protein:vir:96978 Length: 387   96.8 0.00036 2.3E-07   39.4  15.0  272   1-337   83-381 (387)
108 protein:vir:94424 Length: 387   96.8 0.00036 2.3E-07   39.4  15.0  272   1-337   83-381 (387)
109 protein:vir:2685 Length: 387 #  96.8 0.00036 2.3E-07   39.4  15.0  272   1-337   83-381 (387)
110 protein:vir:93616 Length: 645   96.7  0.0004 2.5E-07   39.2  15.6  293   1-337  286-642 (645)
111 protein:vir:80128 Length: 466   96.6  0.0002 1.3E-07   40.8  12.5  304   1-337  123-451 (466)
112 protein:vir:100632 Length: 381  96.1  0.0005 3.1E-07   38.7  12.2  287   1-337   65-373 (381)
113 protein:vir:98635 Length: 377   95.3  0.0024 1.5E-06   35.0  13.5  281   1-335   67-377 (377)
114 protein:vir:103285 Length: 296  95.0  0.0031 1.9E-06   34.3  13.7  272  20-335    1-296 (296)
115 protein:vir:102082 Length: 392  94.6  0.0041 2.5E-06   33.7  16.9  277   1-337   89-384 (392)
116 protein:vir:102873 Length: 392  94.6  0.0041 2.5E-06   33.7  16.9  277   1-337   89-384 (392)
117 protein:vir:107593 Length: 392  94.6  0.0041 2.5E-06   33.7  16.9  277   1-337   89-384 (392)
118 protein:vir:105004 Length: 392  94.6  0.0041 2.5E-06   33.7  16.9  277   1-337   89-384 (392)
119 protein:vir:107687 Length: 319  94.2  0.0051 3.1E-06   33.2  12.3  292   1-332    1-319 (319)
120 protein:vir:96762 Length: 632   94.1  0.0055 3.4E-06   33.0  13.6  286   1-337  309-630 (632)
121 protein:vir:9820 Length: 272 #  94.0  0.0056 3.5E-06   32.9  15.9  258  16-337    1-269 (272)
122 protein:vir:3033 Length: 272 #  94.0  0.0056 3.5E-06   32.9  15.9  258  16-337    1-269 (272)
123 protein:vir:78935 Length: 335   85.2   0.055 3.4E-05   27.5   9.9  295   1-337    1-333 (335)
124 protein:vir:80068 Length: 301   84.8   0.058 3.6E-05   27.4  15.8  279   1-332    1-301 (301)
125 protein:vir:104342 Length: 314  81.6   0.084 5.2E-05   26.5  16.4  288   1-335    3-314 (314)
126 protein:vir:78739 Length: 332   76.2   0.052 3.2E-05   27.6   5.9  288   1-337    1-332 (332)
127 protein:vir:79642 Length: 329   50.2    0.62 0.00038   21.7  15.3  295   1-335    6-329 (329)
128 protein:vir:1541 Length: 347 #  38.0     1.1 0.00068   20.4  10.0  293   1-336    1-347 (347)
129 protein:vir:8885 Length: 347 #  34.7     1.3 0.00079   20.0  13.4  291   1-337    1-346 (347)
130 protein:vir:94933 Length: 330   33.9     1.3 0.00082   19.9  10.0  285   1-335    5-330 (330)
131 protein:vir:80213 Length: 334   25.6       2  0.0013   18.9  10.2  295   1-337    1-334 (334)
132 protein:vir:6324 Length: 335 #  21.4     2.6  0.0016   18.3  10.8  298   1-337    1-330 (335)
133 protein:vir:93742 Length: 274   21.1     2.7  0.0016   18.3  15.1  256   1-337    1-270 (274)

No 1
>protein:vir:78186 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111152;genbank:gi:134288735;genbank:GeneID:4960646
Probab=100.00 E-value=3.2e-191 Score=1064.93 Aligned_cols=337 Identities=89% Similarity=1.327 Sum_probs=335.9

Q ss_pred             CChHHHHHHHHHHHHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceeccC
Q lcl|NC_015266.    1 MKKETRQAYRKYAAQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTDT   80 (337)
Q Consensus         1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~t   80 (337)
                      |+++||++|++|++++|++|||++++++|+|+|+++|+|+++|||||+||++||+++|+|++||+|++|++|||||||+|
T Consensus         1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt   80 (337)
T protein:vir:78    1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT   80 (337)
T ss_pred             CChHHHHHHHHHHHHHHHhcChhhhcceeecChHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCcceeeeecC
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999

Q ss_pred             CCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhhh
Q lcl|NC_015266.
81 TKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANPL 160 (337) Q Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nPl 160 (337) ++++|+|++++++++++|+|+|||||+||+|++||+|||||||++|+++++.+|+|||||||||||+|+|++|||++||| T Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPl 160 (337) T protein:vir:78 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) T ss_pred CCcccccccccccCCCccEEEEeceecccCHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeccCCChhhCcC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHHHH Q lcl|NC_015266. 161 LQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPI 240 (337) Q Consensus 161 lqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~~l 240 (337) ||||||||||++|+++|+|||++++.++|+|++|+||||+||||||+|++++|||||||++|||||||||||+++||||| T Consensus 161 lqDVN~GWlQ~~Re~ap~rVl~~~~~~~~~i~iG~~gdy~NLDalV~d~~~~lI~~~~~~d~dLVvivG~dLladk~~~l 240 (337) T protein:vir:78 161 LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLIGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPI 240 (337) T ss_pred ccccchHHHHHHHhcchhhhhccccccCCceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeeccccceecchhhhccccee Q lcl|NC_015266. 
241 VNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNPKRDQIENYESSNDAYVV 320 (337) Q Consensus 241 ~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e~y~s~Ne~YvV 320 (337) +|++++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||+|||||||| T Consensus 241 ~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~Ne~YvV 320 (337) T protein:vir:78 241 VNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYESSNDAYVV 320 (337) T ss_pred HhcCCCcHHHHHHHHHHHhhhhcCcceEEccccCCCceEEeechhcEEEEecCcEEEEEEeccccccccchhhccceeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCcEEEeeceeeccC Q lcl|NC_015266. 321 EDFGCGCVAENIELVAA 337 (337) Q Consensus 321 Ed~~~~a~iEnI~~~~a 337 (337) ||||++|+||||+|++| T Consensus 321 Ed~~~~a~iEnI~~~~a 337 (337) T protein:vir:78 321 EDFGCGCVAENIELAAA 337 (337) T ss_pred eccccEEEEeceeecCC Confidence 99999999999999999 No 2 >protein:vir:79171 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111033;genbank:gi:134288740;genbank:GeneID:4960690 Probab=100.00 E-value=3.4e-190 Score=1059.29 Aligned_cols=337 Identities=89% Similarity=1.327 Sum_probs=335.9 Q ss_pred CChHHHHHHHHHHHHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceeccC Q lcl|NC_015266. 
1 MKKETRQAYRKYAAQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~t 80 (337) |+++||++|++|++++|++|||++++++|+|+|+++|+|+++|||||+||++||+++|+|++||+|++|++|||||||+| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrt~t 80 (337) T protein:vir:79 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) T ss_pred CChHHHHHHHHHHHHHHHhcChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccCcceeeeecC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhhh Q lcl|NC_015266. 81 TKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANPL 160 (337) Q Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nPl 160 (337) ++++|+|++++++++++|+|+|||||+||+|++||+|||||||++|+++++.+|+|||||||||||+|+|++|||++||| T Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~nPl 160 (337) T protein:vir:79 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDAWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) T ss_pred CCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCcC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHHHH Q lcl|NC_015266. 
161 LQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPI 240 (337) Q Consensus 161 lqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~~l 240 (337) ||||||||||++|+++|+|||++++.++|+|++|+||||+||||||+|++++|||||||++|||||||||||+++||||| T Consensus 161 lqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dLladk~~~l 240 (337) T protein:vir:79 161 LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVAICGRELLHDKYFPI 240 (337) T ss_pred ccccchhHHHHHHhcchhhhhccccccCcceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeeccccceecchhhhccccee Q lcl|NC_015266. 241 VNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNPKRDQIENYESSNDAYVV 320 (337) Q Consensus 241 ~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e~y~s~Ne~YvV 320 (337) +|++++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||+|||||||| T Consensus 241 ~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~Ne~YvV 320 (337) T protein:vir:79 241 VNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYESSNDAYVV 320 (337) T ss_pred hccCCCcHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEEccccccccchhhccceeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCcEEEeeceeeccC Q lcl|NC_015266. 
321 EDFGCGCVAENIELVAA 337 (337) Q Consensus 321 Ed~~~~a~iEnI~~~~a 337 (337) ||||++|+||||+|++| T Consensus 321 Ed~~~~a~ienI~~~~a 337 (337) T protein:vir:79 321 EDFGCGCVAENIELAAA 337 (337) T ss_pred eccccEEEEeceeecCC Confidence 99999999999999999 No 3 >protein:vir:104011 Length: 337 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293748;genbank:gi:72537718;genbank:GeneID:3608142 Probab=100.00 E-value=4e-190 Score=1058.92 Aligned_cols=337 Identities=89% Similarity=1.328 Sum_probs=335.9 Q ss_pred CChHHHHHHHHHHHHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceeccC Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~t 80 (337) |+++||++|++|++++|++|||++++++|+|+|+++|+|+++|||||+||++||+++|+|++||+|++|++|+|||||+| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrt~t 80 (337) T protein:vir:10 1 MRKETRQAYEKYAAQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRINVLPVTELEGEKLGLSVSGPIASRTDT 80 (337) T ss_pred CChHHHHHHHHHHHHHHHhcChhhhcceeeecHHHHHHHHHHHHHHHHhhccCceeccccceeeEEeeccCcceeeeecC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhhh Q lcl|NC_015266. 
81 TKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANPL 160 (337) Q Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nPl 160 (337) ++++|+|++++++++++|+|+|||||+||+|++||+|||||||++|+++++.+|+|||||||||||+|+|++|||++||| T Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfnG~s~A~~Td~~~nPl 160 (337) T protein:vir:10 81 TKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVILNQGALDRIMIGWNGVKAAATTDRQANPL 160 (337) T ss_pred CCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCcC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHHHH Q lcl|NC_015266. 161 LQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPI 240 (337) Q Consensus 161 lqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~~l 240 (337) ||||||||||++|+++|+|||++++.++|+|++|+||||+||||||+|++++|||||||++|||||||||||+++||||| T Consensus 161 lqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~~lI~~~~~~d~~LVvivG~dLladk~~~l 240 (337) T protein:vir:10 161 LQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPI 240 (337) T ss_pred ccccchhHHHHHHhcchhhhhccccccCcceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeeccccceecchhhhccccee Q lcl|NC_015266. 
241 VNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNPKRDQIENYESSNDAYVV 320 (337) Q Consensus 241 ~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e~y~s~Ne~YvV 320 (337) +|++++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||+|||||||| T Consensus 241 ~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~Ne~YvV 320 (337) T protein:vir:10 241 VNATQAPTERLAADLIVSQKRIGNLPAVRVPFFPKRALMVTKLSNLSIYYQEGARRRTLKEVPERDRIENYESSNDAYVV 320 (337) T ss_pred hccCCCcHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEEccccccccchhhccceeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCcEEEeeceeeccC Q lcl|NC_015266. 321 EDFGCGCVAENIELVAA 337 (337) Q Consensus 321 Ed~~~~a~iEnI~~~~a 337 (337) ||||++|+||||+|++| T Consensus 321 Ed~~~~a~ienI~~~~a 337 (337) T protein:vir:10 321 EDFGCGCVAENIELAAA 337 (337) T ss_pred eccccEEEEeceeecCC Confidence 99999999999999999 No 4 >protein:vir:79157 Length: 339 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165257;genbank:gi:145708082;genbank:GeneID:5247168 Probab=100.00 E-value=5.5e-190 Score=1058.18 Aligned_cols=337 Identities=62% Similarity=1.043 Sum_probs=335.1 Q ss_pred CChHHHHHHHHHHHHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceeccC Q lcl|NC_015266. 
1 MKKETRQAYRKYAAQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~t 80 (337) |+++||++|++|++++|++|||++++++|+|+|+++|+|+++|||||+||++||+++|+|++||+|++|++|||||||+| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtdt 80 (339) T protein:vir:79 1 MRNDTRRLFAAYKAAIAKLNGVERVDEKFSVAPSVQQKLETKVQESSDFLKSINFYGVPEQEGEKIGLGVSGPVASTTDT 80 (339) T ss_pred CChHHHHHHHHHHHHHHHHhCcccccceeeecHHHHHHHHHHHHHHHHHhccCcccccccceeeEEeeccCcceeecccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhhh Q lcl|NC_015266. 81 TKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANPL 160 (337) Q Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nPl 160 (337) ++++|+|++++++++++|+|+|||||+||+|++||+|||||||++|+++++.+|+|||||||||||+|+|++|||++||| T Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPl 160 (339) T protein:vir:79 81 TQQDRETSDISTMDGRRYRCEQTNSDTHITYQKLDAWAKFADFQTRIRDAIIKRQALDRIMIGFNGVSRAATSDRVANPM 160 (339) T ss_pred CCCCcccccccccCCCccEEEEeeeeceecHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeecCCChhhCcC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhccchhHHHHHHhhchhhhcccccccCCceec-CCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHHH Q lcl|NC_015266. 
161 LQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLV-GKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFP 239 (337) Q Consensus 161 lqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~-G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~~ 239 (337) ||||||||||++|+++|+|||++++.++++|++ |+||||+||||||+|++++|||||||++|||||||||||+++|||| T Consensus 161 lqDVN~GWlQ~~Re~ap~rV~~~g~~~s~~i~~~G~ggdy~NLDalV~d~~~~lId~~~~~d~dLVvivG~dLla~k~~~ 240 (339) T protein:vir:79 161 LQDVNKGWLQNLREQAPQRVMKEGKAAAGKITVGGAGADYGNLDALVYDITNHLVEPWYAEDPDLVVVCGRNLLSDKYFP 240 (339) T ss_pred ccccchhHHHHHHhhhhhhhhccceeccceeEeccCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhhhHhhh Confidence 999999999999999999999999988999988 9999999999999999999999999999999999999999999999 Q ss_pred HHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeeccccceecchhhhcccce Q lcl|NC_015266. 240 IVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNPKRDQIENYESSNDAYV 319 (337) Q Consensus 240 l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e~y~s~Ne~Yv 319 (337) |+|++++|||++|+|+++|+|+||||||++|||||+++|+||+|||||||||+|++||+++|||+|||||||+||||||| T Consensus 241 l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~Ne~Yv 320 (339) T protein:vir:79 241 LVNRDRDPVQQIAADLIISQKRIGNLPAIRVPYFPANGLLVTRLDNLSIYYQEGGRRRTILDNAKRDRIENYESSNDAYV 320 (339) T ss_pred HhhcCCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEeccccccccchhhccceee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecCCcEEEeeceeeccC Q lcl|NC_015266. 
320 VEDFGCGCVAENIELVAA 337 (337) Q Consensus 320 VEd~~~~a~iEnI~~~~a 337 (337) |||||++|+||||+|++| T Consensus 321 VEd~~~~a~iEni~~~~a 338 (339) T protein:vir:79 321 IEDLACAAMAENIALAAA 338 (339) T ss_pred eeccccEEEeeeeecccC Confidence 999999999999999999 No 5 >protein:vir:100331 Length: 342 # NCBI annotation: major capsid protein N # Family: family:all:201 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655472;genbank:gi:109289940;genbank:GeneID:4157374 Probab=100.00 E-value=3.7e-188 Score=1048.16 Aligned_cols=336 Identities=53% Similarity=0.862 Sum_probs=331.7 Q ss_pred CChHHHHHHHHHHHHHHHhcCcc----cccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccce Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLNDTD----DVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIAS 76 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~----~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iag 76 (337) |+++||++|++|++++|++|||+ +++++|+|+|+++|+|+++|||||+||++||+++|+|++||+|++|++||||| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iag 80 (342) T protein:vir:10 1 MKDLTLEKYNAYLARQAELNNLPFNALATGIKFTVQPSVQQKLYEKVRESSDFLKSISFVFVDEQTGETLGLDSAHTVAS 80 (342) T ss_pred CChHHHHHHHHHHHHHHHHhCCChhHccccceeecChHHHHHHHHHHHHHHHHhccCcccccccceeeEEecccCccccc Confidence 99999999999999999999998 88899999999999999999999999999999999999999999999999999 Q ss_pred eccCC-CcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCCh Q lcl|NC_015266. 
77 RTDTT-KAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDK 155 (337) Q Consensus 77 Rt~t~-~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~ 155 (337) ||||+ +++|+|++++++++++|+|+|||||+||+|++||+|||||||++|+++++.+|+|||||||||||+|+|++||| T Consensus 81 rtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~ 160 (342) T protein:vir:10 81 TTDTSGDGERKTTSIAKLVKQTYHCQQINFDTHINYKQLDMWAKFPDFQQKVANVAAKQRKRDLIMIGFNGTSRAATSDR 160 (342) T ss_pred ccccCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeccCCCh Confidence 99986 56899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHH Q lcl|NC_015266. 156 AANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHD 235 (337) Q Consensus 156 ~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~ 235 (337) ++|||||||||||||++|+++|+|||++++ .+++|++|+||||+||||||+|++++|||||||++|||||||||||+++ T Consensus 161 ~~nPllqDVN~GWlQ~~Re~ap~rv~~~~~-~~~~i~iG~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLlad 239 (342) T protein:vir:10 161 NSNPLLQDVAKGWLQKMREDAKERVMNGES-TDNQVLVGKGQEYANLDALVMDATEELIDEWHRDDTDLVVITGRKLLAD 239 (342) T ss_pred hhCcCccccchHHHHHHHhhhhhhhcccce-eccceeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHH Confidence 999999999999999999999999999887 4789999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeeccccceecchhhhc Q lcl|NC_015266. 
236 KYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNPKRDQIENYESSN 315 (337) Q Consensus 236 k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e~y~s~N 315 (337) |||||+|++++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||+||| T Consensus 240 k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~N 319 (342) T protein:vir:10 240 KYFPIVNQQNAPTEELAADIVISQKRIGGLKAVRVPFFPANAILITKLENLAIYVQEGTTRKHIENVPKKDRIETYESEN 319 (342) T ss_pred HHHHHHhcCCChHHHHHHHHHHhhhhhcCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchhhhc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 316 DAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 316 e~YvVEd~~~~a~iEnI~~~~a 337 (337) |||||||||++|+||||+++++ T Consensus 320 e~YvVEd~~~~a~iE~i~i~~~ 341 (342) T protein:vir:10 320 IDYVVEDYGCAALIENITLKDK 341 (342) T ss_pred cceeeeccccEEEeecceecCC Confidence 9999999999999999999999 No 6 >protein:vir:6061 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878202;genbank:gi:33438901;genbank:GeneID:1457736 Probab=100.00 E-value=7e-188 Score=1046.65 Aligned_cols=337 Identities=56% Similarity=0.956 Sum_probs=330.7 Q ss_pred CChHHHHHHHHHHHHHHHhcCcc--cccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceec Q lcl|NC_015266. 
1 MKKETRQAYRKYAAQIAKLNDTD--DVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRT 78 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~--~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt 78 (337) |+++||++|++|++++|++|||+ +++++|+|+|+++|+|+++|||||+||++||+++|+|++||+|++|++|+||||| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrt 80 (357) T protein:vir:60 1 MRQETRFKFNAYLSRVAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTT 80 (357) T ss_pred CChHHHHHHHHHHHHHHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccc Confidence 99999999999999999999997 6789999999999999999999999999999999999999999999999999999 Q ss_pred cCCC-cccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhh Q lcl|NC_015266. 79 DTTK-AERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAA 157 (337) Q Consensus 79 ~t~~-~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~ 157 (337) +|++ ++|+|++++++++++|+|+|||||+||+|++||+|||||||++||++++.+|+|||||||||||+|+|++|||++ T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~ 160 (357) T protein:vir:60 81 DTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDLIMAGFNGVRRAETSDRSS 160 (357) T ss_pred ccCCCCCcccccccccCCCccEEEEeeeeccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChhh Confidence 9975 689999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhccchhHHHHHHhhchhhhcccccccCCc-----eecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHH Q lcl|NC_015266. 
158 NPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGK-----VLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) Q Consensus 158 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~-----i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dL 232 (337) |||||||||||||++|+++|+|||++++..+|+ |++|+||||+||||||+|++++|||||||++||||||||||| T Consensus 161 nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dL 240 (357) T protein:vir:60 161 NQMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQL 240 (357) T ss_pred CcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 999999999999999999999999987766654 889999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeeccccceecchh Q lcl|NC_015266. 233 LHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNPKRDQIENYE 312 (337) Q Consensus 233 la~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e~y~ 312 (337) +++|||||+|++++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||| T Consensus 241 la~k~~~l~n~~~~pTE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~riE~y~ 320 (357) T protein:vir:60 241 LADKYFPIVNREQDNSEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMDDSHRRVIEENPKLDRVENYE 320 (357) T ss_pred hhHHhhhHhhcCCChHHHHHHHHHHHhhhhcCcceEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
313 SSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 313 s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) ||||||||||||++|+||||+++++ T Consensus 321 s~Ne~YvVEd~~~~a~iE~i~~~~~ 345 (357) T protein:vir:60 321 SMNIDYVVEDYAAGCLVEKIKVGDF 345 (357) T ss_pred hhcceeeeeccccEEEeeeeeeccC Confidence 9999999999999999999999987 No 7 >protein:vir:2016 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046760;genbank:gi:9630331;genbank:GeneID:1261541 Probab=100.00 E-value=6.9e-188 Score=1046.66 Aligned_cols=337 Identities=57% Similarity=0.966 Sum_probs=330.6 Q ss_pred CChHHHHHHHHHHHHHHHhcCcc--cccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceec Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLNDTD--DVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRT 78 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~--~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt 78 (337) |+++||++|++|++++|++|||+ +++++|+|+|+++|+|+++|||||+||++||+++|+|++||+|++|++||||||| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrt 80 (357) T protein:vir:20 1 MRQETRFKFNAYLSRVAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTT 80 (357) T ss_pred CChHHHHHHHHHHHHHHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccc Confidence 99999999999999999999996 6789999999999999999999999999999999999999999999999999999 Q ss_pred cCCC-cccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhh Q lcl|NC_015266. 
79 DTTK-AERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAA 157 (337) Q Consensus 79 ~t~~-~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~ 157 (337) +|++ ++|+|++++++++++|+|+|||||+||+|++||+|||||||++||++++.+|+|||||||||||+|+|++|||++ T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~ 160 (357) T protein:vir:20 81 DTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRIRNAIIKRQSLDFIMAGFNGVKRAETSDRSS 160 (357) T ss_pred cCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChhh Confidence 9976 689999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhccchhHHHHHHhhchhhhcccccccCCc-----eecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHH Q lcl|NC_015266. 158 NPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGK-----VLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) Q Consensus 158 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~-----i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dL 232 (337) |||||||||||||++|+++|+|||++++..+|+ |++|+||||+||||||+|++++|||||||++||||||||||| T Consensus 161 nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dL 240 (357) T protein:vir:20 161 NPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGRTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQL 240 (357) T ss_pred CcCccccchhHHHHHHhhchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 999999999999999999999999997766654 889999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeeccccceecchh Q lcl|NC_015266. 
233 LHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNPKRDQIENYE 312 (337) Q Consensus 233 la~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e~y~ 312 (337) +++|||||+|++++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||| T Consensus 241 la~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~riE~y~ 320 (357) T protein:vir:20 241 LADKYFPIVNKEQDNSEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMDDSHRRVIEENPKLDRVENYE 320 (357) T ss_pred hhhhhhhHhhccCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 313 SSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 313 s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) ||||||||||||++|+||||+++++ T Consensus 321 s~Ne~YvVEd~~~~a~iE~i~~~~~ 345 (357) T protein:vir:20 321 SMNIDYVVEDYAAGCLVEKIKVGDF 345 (357) T ss_pred hhcceeeeeccccEEEeeeeeeccc Confidence 9999999999999999999999987 No 8 >protein:vir:5694 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839853;genbank:gi:30065708;genbank:GeneID:1260602 Probab=100.00 E-value=7.6e-188 Score=1046.44 Aligned_cols=337 Identities=56% Similarity=0.963 Sum_probs=330.8 Q ss_pred CChHHHHHHHHHHHHHHHhcCcc--cccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceec Q lcl|NC_015266. 
1 MKKETRQAYRKYAAQIAKLNDTD--DVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRT 78 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~--~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt 78 (337) |+++||++|++|++++|++|||+ +++++|+|+|+++|+|+++|||||+||++||+++|+|++||+|++|++|+||||| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lg~~g~iagrt 80 (357) T protein:vir:56 1 MRQETRFKFNAYLSRVAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLTRINIVPVSEMKGEKIGIGVTGSIASTT 80 (357) T ss_pred CChHHHHHHHHHHHHHHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhccCCccccccceeeEEecccCccccccc Confidence 99999999999999999999997 6789999999999999999999999999999999999999999999999999999 Q ss_pred cCCC-cccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhh Q lcl|NC_015266. 79 DTTK-AERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAA 157 (337) Q Consensus 79 ~t~~-~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~ 157 (337) +|++ ++|+|++++++++++|+|+|||||+||+|++||+|||||||++|+++++.+|+|||||||||||+|+|++|||++ T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~ 160 (357) T protein:vir:56 81 DTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNAIIKRQSLDFIMAGFNGVKRAETSDRSS 160 (357) T ss_pred cCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHHHHHHHhhccceecccceeeeccCChhh Confidence 9976 689999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhccchhHHHHHHhhchhhhcccccccCCc-----eecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHH Q lcl|NC_015266. 
158 NPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGK-----VLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) Q Consensus 158 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~-----i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dL 232 (337) |||||||||||||++|+++|+|||++++..+|+ |++|+||||+||||||+|++++|||||||++||||||||||| T Consensus 161 nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dL 240 (357) T protein:vir:56 161 NPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDALVMDATNNLIEPWYQEDPDLVVIVGRQL 240 (357) T ss_pred CcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 999999999999999999999999987766654 889999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeeccccceecchh Q lcl|NC_015266. 233 LHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNPKRDQIENYE 312 (337) Q Consensus 233 la~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e~y~ 312 (337) +++|||||+|++++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||| T Consensus 241 la~k~~~l~n~~~~pTE~~Aa~~i~s~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~riE~y~ 320 (357) T protein:vir:56 241 LADKYFPIVNKEQDNSEMLAADVIISQKRIGNLPAVRVPYFPADAMLITKLENLSIYYMDDSHRRVIEENPKLDRVENYE 320 (357) T ss_pred hhhhhhhHhhccCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
313 SSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 313 s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) ||||||||||||++|+||||+++++ T Consensus 321 s~Ne~YvVEd~~~~a~iE~i~i~~~ 345 (357) T protein:vir:56 321 SMNIDYVVEDYAAGCLVEKIKVGDF 345 (357) T ss_pred hhcceeeeeccccEEEeeeeeeccC Confidence 9999999999999999999999987 No 9 >protein:vir:98566 Length: 355 # NCBI annotation: gp5 # Family: family:all:201 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958060;genbank:gi:41057357;genbank:GeneID:2744237 Probab=100.00 E-value=7.6e-187 Score=1040.95 Aligned_cols=337 Identities=55% Similarity=0.898 Sum_probs=330.3 Q ss_pred CChHHHHHHHHHHHHHHHhcCcc--cccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceec Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLNDTD--DVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRT 78 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~--~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt 78 (337) |+++||++|++|++++|++|||+ +++++|+|+|+++|+|+++|||||+||++||+++|+|++||+|++|++|+||||| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv~g~iagrt 80 (355) T protein:vir:98 1 MRPETRFKFNAYLTRVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLKTINILPVAEMKGEKIGVGVTGTIASTT 80 (355) T ss_pred CChHHHHHHHHHHHHHHHHhCCChhHccceeecCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEeeeccCccccccc Confidence 99999999999999999999995 6899999999999999999999999999999999999999999999999999999 Q ss_pred cCCC-cccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhh Q lcl|NC_015266. 
79 DTTK-AERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAA 157 (337) Q Consensus 79 ~t~~-~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~ 157 (337) +|++ ++|+|++++++++++|+|+|||||+||+|++||+|||||||++|+++++.+|+|||||||||||+|+|++|||++ T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~Td~~~ 160 (355) T protein:vir:98 81 DTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDAIVKRQALDLIMAGFNGTTRADTSDRTK 160 (355) T ss_pred cCCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeeccCChhh Confidence 9974 689999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhccchhHHHHHHhhchhhhcccccccC-----CceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHH Q lcl|NC_015266. 158 NPLLQDVNIGWLQQYRDRAGHRVLHEGAKEA-----GKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) Q Consensus 158 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~-----~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dL 232 (337) |||||||||||||++|+++|+|||++++..+ ++|++|+||||+||||||+|++++|||||||++||||||||||| T Consensus 161 nPllqDVNkGWlQ~~Re~ap~~v~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~D~~~~lI~~~~~~d~dLVvivG~dL 240 (355) T protein:vir:98 161 NTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDALVMDATNNLIDEVYQDDPNLVAIVGRKL 240 (355) T ss_pred CcCccccchhHHHHHHhcchhhhhhhhcccCccccccceeeCCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 9999999999999999999999999987544 45889999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeeccccceecchh Q lcl|NC_015266. 
233 LHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNPKRDQIENYE 312 (337) Q Consensus 233 la~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e~y~ 312 (337) +++|||||+|+.++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||| T Consensus 241 la~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~ 320 (355) T protein:vir:98 241 LADKYFPLVNKQQENSESLAADIIISQKRIGNLPAVRVPYFPANAVLVTTLENLSIYFMDESHRRSIDENPKKDRVENYE 320 (355) T ss_pred hHHHhhhHhhccCCcHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 313 SSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 313 s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) ||||||||||||++|+||||+|+++ T Consensus 321 s~Ne~YvVEd~~~~a~ienI~~~~~ 345 (355) T protein:vir:98 321 SMNIDYVVEVYAAGCLLENITLGDF 345 (355) T ss_pred hhcceeeeeccccEEEeeceeeeCC Confidence 9999999999999999999999988 No 10 >protein:vir:1829 Length: 355 # NCBI annotation: major capsid protein # Family: family:all:201 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052253;genbank:gi:9634060;genbank:GeneID:1262428 Probab=100.00 E-value=1.2e-186 Score=1039.89 Aligned_cols=337 Identities=56% Similarity=0.909 Sum_probs=330.5 Q ss_pred CChHHHHHHHHHHHHHHHhcCcc--cccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceec Q lcl|NC_015266. 
1 MKKETRQAYRKYAAQIAKLNDTD--DVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRT 78 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~--~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt 78 (337) |+++||++|++|++++|++|||+ +++++|+|+|+++|+|+++|||||+||++||+++|+|++||+|++|++|+||||| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~i~lgv~g~iagrt 80 (355) T protein:vir:18 1 MRQETRFKFNAYLTQLAKLNGISVDDVSKKFTVEPSVTQTLMNTVQASSAFLQMINILPVAEMKGEKIGVGVTGTIASTT 80 (355) T ss_pred CChHHHHHHHHHHHHHHHHhCCChhHccceeccCHHHHHHHHHHHHHHHHHhhcCceeccccceeeEEeeccCcceeecc Confidence 99999999999999999999995 8899999999999999999999999999999999999999999999999999999 Q ss_pred cCCC-cccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhh Q lcl|NC_015266. 79 DTTK-AERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAA 157 (337) Q Consensus 79 ~t~~-~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~ 157 (337) +|++ ++|+|++++++++++|+|+|||||+||+|++||+|||||||++|+++++.+|+|||||||||||+|+|++|||++ T Consensus 81 dT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfNG~s~A~~Td~~~ 160 (355) T protein:vir:18 81 DTSGDKERQTADFTALESNKYECNQINFDFHLTYKRLDLWARFQDFQRRIRDAIVQRQALDFIMAGFNGTTRADTSDRVK 160 (355) T ss_pred ccCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeeccCChhh Confidence 9974 689999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhhccchhHHHHHHhhchhhhcccccccC-----CceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHH Q lcl|NC_015266. 
158 NPLLQDVNIGWLQQYRDRAGHRVLHEGAKEA-----GKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) Q Consensus 158 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~-----~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dL 232 (337) |||||||||||||++|+++|+|||++++..+ ++|++|+||||+||||||+|++++|||||||++||||||||||| T Consensus 161 nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~~~~~~i~~G~~gdy~NLDAlV~d~~~~lI~~~~~~d~dLVvivG~dL 240 (355) T protein:vir:18 161 NPMLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENLDALVMDGTNTLIDEIYQDDPKLVAIVGRKL 240 (355) T ss_pred CcCccccchhHHHHHHhcchhhhhccccccccccccceeeecCCCCcccHHHHHHHHHhccCChHHhcCCCEEEEEchhh Confidence 9999999999999999999999999987544 46899999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeeccccceecchh Q lcl|NC_015266. 233 LHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNPKRDQIENYE 312 (337) Q Consensus 233 la~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e~y~ 312 (337) +++|||||+|+.++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||| T Consensus 241 la~k~~~l~n~~~~ptE~~Aa~~i~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~ 320 (355) T protein:vir:18 241 LADKYFPLVNKQQENTESLAADIIISQKRIGNLPAVRVPYFPANAVFVTTLENLSIYFMDESHRRSIDENPKKDRVENYE 320 (355) T ss_pred hHHHHhHHhhccCChHHHHHHHHHHHHHhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
313 SSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 313 s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) ||||||||||||++|+||||+|+++ T Consensus 321 s~Ne~YvVEd~~~~a~ieni~~~~~ 345 (355) T protein:vir:18 321 SMNIDYVVEAYAAGCLLENITLGDF 345 (355) T ss_pred hhcceeeeeccccEEEEeeeeecCC Confidence 9999999999999999999999987 No 11 >protein:vir:1153 Length: 338 # NCBI annotation: predicted major capsid protein # Family: family:all:201 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490602;genbank:gi:17313222;genbank:GeneID:927319 Probab=100.00 E-value=1.1e-185 Score=1034.56 Aligned_cols=335 Identities=62% Similarity=1.028 Sum_probs=329.2 Q ss_pred CChHHHHHHHHHHHHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceeccC Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~t 80 (337) |+++||++|++|++++|++|||++++++|+|+|+++|+|+++|||||+||++||+++|+|++||+|++|++|+|||||+| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrtdT 80 (338) T protein:vir:11 1 MRNETRKQFDAYLAQLAKLNGVNSAVQTFAVEPSVQQKLEQRIQESSEFLKQINVYGVDELQGEKIGIGVSGTIASRTDT 80 (338) T ss_pred CCHHHHHHHHHHHHHHHHHhCCCcccceeeeCHHHHHHHHHHHHHHHHhhccCceecccceeeeEeeeccCccccccccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCc-ccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhh Q lcl|NC_015266. 81 TKA-ERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANP 159 (337) Q Consensus 81 ~~~-~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nP 159 (337) +.. 
+|+|++++++++++|+|+|||||+||+|++||+|||||||++|+++++.+|+|||||||||||+|+|++|||++|| T Consensus 81 ~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~k~~ALD~i~IGfnG~s~A~~Td~~~nP 160 (338) T protein:vir:11 81 TGDGVRKPRDVSALDNQRYECKHTDFDTAITYAMLDAWAKFPEFQALLRDAILKRQALDRLMIGFNGTSAAATTNRAANP 160 (338) T ss_pred CCCCccccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHhhchhhhcccceeeccCCChhhCc Confidence 764 6999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhccchhHHHHHHhhchhhhcccccccCCceecCC--CcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHH Q lcl|NC_015266. 160 LLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGK--GGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKY 237 (337) Q Consensus 160 llqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~--ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~ 237 (337) |||||||||||++|+++|+|||++++ .+++|.+|. +|||+||||||+|++++|||||||++|||||||||||+++|| T Consensus 161 llqDVNkGWlQ~~Re~ap~rv~~~~~-~~~~i~i~~g~~gdy~nLDalV~d~~~~lI~~~~~~d~dLVvivG~dLladk~ 239 (338) T protein:vir:11 161 LLQDVNIGWFQQYRNNAPARVLKEGK-TTGKVVVGNGADADYKNLDALVFDVVSSLIDPWHRRDPGLVVILGRELVHDKY 239 (338) T ss_pred CccccchhHHHHHHhhhhhhhhhccc-ccceeeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEEchhhhHHHH Confidence 99999999999999999999999986 578888865 499999999999999999999999999999999999999999 Q ss_pred HHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeeccccceecchhhhccc Q lcl|NC_015266. 
238 FPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNPKRDQIENYESSNDA 317 (337) Q Consensus 238 ~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e~y~s~Ne~ 317 (337) |||+|+.++|||++|+|+++|+|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||+||||| T Consensus 240 ~~l~n~~~~ptE~~Aa~~~~s~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~Ne~ 319 (338) T protein:vir:11 240 FPMVNKDQPATEKIATDLILSQKRMGGLPPVEVPYVPEKGLMVTTLKNLSLYWQIGGRRRYLKEVPEKNRIENYESSNDA 319 (338) T ss_pred hHHHhcCCChHHHHHHHHHHHhhhhCCceeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchhhhccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeecCCcEEEeeceeecc Q lcl|NC_015266. 318 YVVEDFGCGCVAENIELVA 336 (337) Q Consensus 318 YvVEd~~~~a~iEnI~~~~ 336 (337) |||||||++|+||||++++ T Consensus 320 YvVEd~~~~a~ieni~~~~ 338 (338) T protein:vir:11 320 YVVEDYGLGCLVENIEVAE 338 (338) T ss_pred eeeeccccEEEeecceecC Confidence 9999999999999999999 No 12 >protein:vir:78777 Length: 358 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285647;genbank:gi:148727153;genbank:GeneID:5220125 Probab=100.00 E-value=7.5e-181 Score=1008.09 Aligned_cols=330 Identities=30% Similarity=0.483 Sum_probs=320.3 Q ss_pred CChHHHHHHHHHHHHHHHhcCcc--cccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceec Q lcl|NC_015266. 
1 MKKETRQAYRKYAAQIAKLNDTD--DVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRT 78 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~--~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt 78 (337) |+++||++|++|++++|++|||+ +++++|+|+|+++|+|+++|||||+||++||+++|+|++||+|++|++|+||||| T Consensus 5 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~Fsv~p~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrt 84 (358) T protein:vir:78 5 LTVQAEQRLNKYCDALAKAYGIDISKLDKQFSVTGPVETTLRSALLASVEFLGLITCLDVDQIKGQVVQVGVGQLYTGRK 84 (358) T ss_pred ccHHHHHHHHHHHHHHHHHhCCChhHccceeeeChHHHHHHHHHHHHHHHHhhcCcccccccceeeEEeecCCcccceec Confidence 99999999999999999999995 7899999999999999999999999999999999999999999999999999999 Q ss_pred cCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCc---cHHHHHHHHHHHHHhhchhhhcccceeccCCCCh Q lcl|NC_015266. 79 DTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFP---DFQQRIRNVILNQSALDRIMIGWNGVKAALSTDK 155 (337) Q Consensus 79 ~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~---dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~ 155 (337) +| |+|++++++++++|+|+|||||+||+|++||+||||| ||++||++++.+|+|||||||||||+|+|++||| T Consensus 85 ~t----r~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~f~~~~dF~~r~~~~i~~~~ALD~i~IGfNGts~A~~Td~ 160 (358) T protein:vir:78 85 KG----GRFKGKVGVDGNTYELTETDSCASLDWATLCTWANAGSEGEFIKLVGEFVNKAFALDMLRVGWNGVSAADDTDP 160 (358) T ss_pred CC----CccccccccCCCccEEEEeceeeeccHHHHHHHHhCCChhHHHHHHHHHHHHHHhhccceecccceeeccCCCh Confidence 98 7899999999999999999999999999999999999 7999999999999999999999999999999999 Q ss_pred hhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCC--cccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHH Q lcl|NC_015266. 
156 AANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKG--GDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELL 233 (337) Q Consensus 156 ~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~g--gdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLl 233 (337) ++|||||||||||||++|+++|+|||++++.+ ++|++|+| |||+||||||+|++++|||||||++|||||||||||+ T Consensus 161 ~~nPllqDVN~GWlQ~~Re~a~~~v~~~~~~~-~~i~ig~g~~Gdy~NLDalV~D~~~~lI~~~~~~d~dLVvivG~dLl 239 (358) T protein:vir:78 161 TANPLGQDVNKGWHQLAREWKGGSQIIKAAAG-EKIYFDPDGKGEYKTLDEMASDLINTTIDPLFQQDPRLVVLVGTDLV 239 (358) T ss_pred hhCcCccccchHHHHHHHhhchhhhhcccccc-CceeecCCCCCccccHHHHHHHHHhccCChHHhcCCCEEEEEchhhh Confidence 99999999999999999999999999998854 56777755 9999999999999999999999999999999999999 Q ss_pred HHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeeccccceecchhh Q lcl|NC_015266. 234 HDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNPKRDQIENYES 313 (337) Q Consensus 234 a~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e~y~s 313 (337) ++|||||+|++++|||++|+|+++ |+||||||++|||||+++||||+|||||||||+|++||+++|||+||||||||| T Consensus 240 a~k~~~l~n~~~~pTE~~Aa~~i~--k~iGGlpa~~~PfFP~~~ilVT~L~NLsIY~Q~gs~RR~~~d~p~r~riE~y~s 317 (358) T protein:vir:78 240 AAAQAKLYSEATKPSEQIAAQQLA--KSIAGRKAYIPPFFPGKRMVVTTLDNLHCYTQRGTRKRKADDNQDSKSFDNQYW 317 (358) T ss_pred hHHhhhHhhcCCCcHHHHHHHHHH--HHhCCCeEEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchhh Confidence 999999999999999999999885 799999999999999999999999999999999999999999999999999999 Q ss_pred hcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
314 SNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 314 ~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) |||||||||||++|+||||++..+ T Consensus 318 ~Ne~YvVEd~~~~a~iE~i~v~~~ 341 (358) T protein:vir:78 318 RMEGYALGEHKAYGGFEEADIEIG 341 (358) T ss_pred hcceeeeeccccEEEEeeeeeeeC Confidence 999999999999999999998633 No 13 >protein:vir:98856 Length: 343 # NCBI annotation: hypothetical protein # Family: family:all:201 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654732;genbank:gi:109302917;genbank:GeneID:4156061 Probab=100.00 E-value=4.7e-175 Score=976.29 Aligned_cols=328 Identities=28% Similarity=0.412 Sum_probs=314.2 Q ss_pred CChHHHHHHHHHHHHHHHhcCcc----cccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccce Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLNDTD----DVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIAS 76 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~----~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iag 76 (337) |+++||++|++|++++|++|||+ +++++|+|+|+++|+|+++|||||+||++||+++|+|++|+++.+|.+|+++| T Consensus 1 M~~~tr~~~~~y~~~~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~~INvv~V~q~~g~v~~~~~sg~~t~ 80 (343) T protein:vir:98 1 MNKTAQELFYSLIGDAAEYYGANPALALAGKQFSIEAPKESVLLGAIQQRSNFLEKINCVFSERYQRAIDLRSNRKRHYG 80 (343) T ss_pred CChHHHHHHHHHHHHHHHHhCCccchhccCceeeecHHHHHHHHHHHHHHHHHhhcCceecchhhcceEEEeecCccccC Confidence 99999999999999999999996 67899999999999999999999999999999999999999999999999999 Q ss_pred eccCC-Cc-ccccccccccCccceeeEeeccccccCHHHHHHHhcCcc-HHHHHHHHHHHHHhhchhhhcccceeccCCC Q lcl|NC_015266. 77 RTDTT-KA-ERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPD-FQQRIRNVILNQSALDRIMIGWNGVKAALST 153 (337) Q Consensus 77 Rt~t~-~~-~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~d-F~~~~~~~i~~~~alD~i~IGfNG~s~A~~T 153 (337) |++|. 
++ +|.| +++++|+|+|||||+||+|++||+|||||| |++|+++++.+|+|||||||||||+|+|++| T Consensus 81 r~~t~~~~~~~~~-----~~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~deF~~r~~~~i~~~~ALD~i~IGfNGts~A~~T 155 (343) T protein:vir:98 81 AHDRRTPIQQRWT-----RQVMSMNVSRQIQACLIPWAKLDQWGHLKDKFASLYAEFVQNQIALDMIKIGFYGTSVGTDT 155 (343) T ss_pred ccccCCCcccccc-----CCCCccEEEEeeeeeeccHHHHHHhhcChhHHHHHHHHHHHHHHhhccceecccceeeccCC Confidence 99884 44 4544 456789999999999999999999999998 9999999999999999999999999999998 Q ss_pred ChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHH Q lcl|NC_015266. 154 DKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELL 233 (337) Q Consensus 154 D~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLl 233 (337) +|||||||||||||++|+++|+|||++++.+++++.+|+||||+||||||+|+++ +||||||++|||||||||||+ T Consensus 156 ---~nPllqDVN~GWLQ~~Re~ap~rVm~~~~~~~~~~~~G~ggdy~NLDalV~D~~~-~I~~~~~~d~dLVvivG~dLl 231 (343) T protein:vir:98 156 ---SDPNLADVNKGWIQFVRENKATQILTQGATSGEIRLFGEGADYVNLDELAYDLKQ-GLDARHRDAGDLVFLVGADLV 231 (343) T ss_pred ---CCcchhhcchHHHHHHHhcchhhhhccceeccceeEecCCCCcccHHHHHHHHHh-cCchHHhcCCCEEEEEchhhh Confidence 6999999999999999999999999999887777788999999999999999986 899999999999999999999 Q ss_pred HHHHHHHHhc-cCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeeccccceecchh Q lcl|NC_015266. 
234 HDKYFPIVNT-TQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNPKRDQIENYE 312 (337) Q Consensus 234 a~k~~~l~n~-~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e~y~ 312 (337) ++|||||+|+ +++|||++|+|+++++|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||| T Consensus 232 a~~~~~l~n~~~~~ptEk~Aa~~~~~~k~iGGl~a~~~PfFP~~~llVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~ 311 (343) T protein:vir:98 232 AKEASLVYKGNGLIATEKAALNTHDLMKSFGGMPAMIVPNMPPRAAIVTSLSNLSIYTQEGSMRRGMKDDDDKKAVRDSY 311 (343) T ss_pred hhhhhhhhhhcCCChHHHHHHHHHHHHHhhCCCeeEEccccCCCceEEeeccccEEEEecCcEEEEEEeccccccccchh Confidence 9999999996 679999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 313 SSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 313 s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) ||||||||||||++|+||||++.-+ T Consensus 312 s~Ne~YvVEd~~~~a~iE~i~v~~~ 336 (343) T protein:vir:98 312 YRNEAYAVEDCGKFMAVDFTKVKLS 336 (343) T ss_pred hhcceeeeeccccEEEeeeeeeeec Confidence 9999999999999999999888777 No 14 >protein:vir:3746 Length: 336 # NCBI annotation: orf15 # Family: family:all:201 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043487;genbank:gi:9628622;genbank:GeneID:1261135 Probab=100.00 E-value=1.4e-174 Score=973.69 Aligned_cols=326 Identities=30% Similarity=0.431 Sum_probs=312.1 Q ss_pred HHHHHHHHHHHHHHHhcCcccc----cceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceecc Q lcl|NC_015266. 
4 ETRQAYRKYAAQIAKLNDTDDV----SQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTD 79 (337) Q Consensus 4 ~tr~~~~~y~~~~a~~ngv~~~----~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~ 79 (337) -||++|++|++++|++|||+++ +++|+|+|+++|+|+++|||||+||++||+++|+|++||+|++|++|+|||||+ T Consensus 1 mtr~~~~~y~~~~A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtd 80 (336) T protein:vir:37 1 MNKQAYYALAAALAKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKQINMIQVAHTKGQKLFGATEKGVTGRKQ 80 (336) T ss_pred CcHHHHHHHHHHHHHHhCCChhhhccCceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEeeeccCcccccccC Confidence 6779999999999999999744 489999999999999999999999999999999999999999999999999999 Q ss_pred CCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHH-HHHHHHHHHHHhhchhhhcccceeccCCCChhhh Q lcl|NC_015266. 80 TTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQ-QRIRNVILNQSALDRIMIGWNGVKAALSTDKAAN 158 (337) Q Consensus 80 t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~-~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~n 158 (337) |+ |+|+++ ++++++|+|+|||||+||+|++||+|||||||+ .++++++.+|+|||||||||||+|+|++|| | T Consensus 81 t~---R~~~~~-~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~df~~~~~~~~~~r~iALD~i~IGfnG~s~A~~Td---n 153 (336) T protein:vir:37 81 TG---RNLANL-DHTQNGFELAETDSGIIVPWALFDSFAIFKDRLVELYSEYFQNQVALDILQIGWNGQSVADNTT---K 153 (336) T ss_pred CC---cccccc-CcCCcccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHHHHHhhchhhhcccceeeccCCC---C Confidence 97 666665 899999999999999999999999999999966 667888888999999999999999999998 9 Q ss_pred hhhhccchhHHHHHHhhchhhhcccccccCCcee-cCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHH Q lcl|NC_015266. 159 PLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVL-VGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKY 237 (337) Q Consensus 159 PllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~-~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~ 237 (337) ||||||||||||++|+++|+|||++++.++|+|. 
+|+||||+||||||+|+++ +||||||++|||||||||||+++|| T Consensus 154 PllqDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~-~I~~~~~~d~dLVvivG~dLla~~~ 232 (336) T protein:vir:37 154 ADLSDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQ-GLDFRHQNRNDLVFLVGADLVSKET 232 (336) T ss_pred CcccccchhHHHHHHhccchhhcccccccCCceEEecCCCCcccHHHHHHHHHh-cCchHHhcCCCeEEEEchhhhhhhh Confidence 9999999999999999999999999988889876 4999999999999999997 6899999999999999999999999 Q ss_pred HHHHhc-cCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeeccccceecchhhhcc Q lcl|NC_015266. 238 FPIVNT-TQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNPKRDQIENYESSND 316 (337) Q Consensus 238 ~~l~n~-~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e~y~s~Ne 316 (337) +||+|+ +++|||++|+++++++|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||||||| T Consensus 233 ~~l~~~~~~~PtE~~Aa~~~~~~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~Ne 312 (336) T protein:vir:37 233 KLIQQKHGLTPTEKAALGSHNLMGSFGGMNAITPPNFPARAAAVTTLKNLSVYTEAESVRRSLRNDEDKKGLVTSYYRQE 312 (336) T ss_pred hhhhhhcCCCHHHHHHHHHHHHHHhhCCceeEEccccCCCceEEeechhcEEEEecCcEEEEEEEccccccccchhhhcc Confidence 999997 5799999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceeecCCcEEEeeceeeccC Q lcl|NC_015266. 317 AYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 317 ~YvVEd~~~~a~iEnI~~~~a 337 (337) ||||||||++|+||||++... T Consensus 313 ~YvVEd~~~~a~iE~i~v~~~ 333 (336) T protein:vir:37 313 GYVVEDLGLMTAIDHTKVKLN 333 (336) T ss_pred eeeeeccccEEEeeeeeeeec Confidence 999999999999999999887 No 15 >protein:vir:3783 Length: 336 # NCBI annotation: capsid # Family: family:all:201 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536823;genbank:gi:17981832;genbank:GeneID:929211 Probab=100.00 E-value=3.3e-174 Score=971.70 Aligned_cols=326 Identities=30% Similarity=0.430 Sum_probs=311.0 Q ss_pred HHHHHHHHHHHHHHHhcCcccc----cceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceecc Q lcl|NC_015266. 
4 ETRQAYRKYAAQIAKLNDTDDV----SQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTD 79 (337) Q Consensus 4 ~tr~~~~~y~~~~a~~ngv~~~----~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~ 79 (337) -||++|++|++++|++|||+++ +++|+|+|+++|+|+++|||||+||++||+++|+|++||+|++|++|+|||||+ T Consensus 1 mtr~~~~~y~~~~A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~INvv~V~e~~Ge~v~lg~~g~iagrtd 80 (336) T protein:vir:37 1 MNKQAYYALAAALAKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKGINMVQVAHTKGTKLFGATEKGVTGRKQ 80 (336) T ss_pred CcHHHHHHHHHHHHHHhCCChhhhcccceeecCHHHHHHHHHHHHHHHHHhhcCceeecccccceEEeeccCcccccccC Confidence 6779999999999999999754 489999999999999999999999999999999999999999999999999999 Q ss_pred CCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHH-HHHHHHHHHHHhhchhhhcccceeccCCCChhhh Q lcl|NC_015266. 80 TTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQ-QRIRNVILNQSALDRIMIGWNGVKAALSTDKAAN 158 (337) Q Consensus 80 t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~-~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~n 158 (337) |++.++ +.++++++|+|+|||||+||+|++||+|||||||+ .++++++.+|+|||||||||||+|+|++|| | T Consensus 81 t~r~r~----~~~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~d~~~~~~~~~~~r~iALD~i~IGfnG~s~A~~Td---n 153 (336) T protein:vir:37 81 TGRNLA----TLDHSQNGYELSETDSGILVNWSLFDSFAIFKDRLVELYSEYFQNQVALDILQIGWNGQSVATNTT---K 153 (336) T ss_pred CCCCcc----ccCCCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHHHHHhcchhhhcccceeeccCCC---C Confidence 985533 35799999999999999999999999999999955 667888888999999999999999999999 9 Q ss_pred hhhhccchhHHHHHHhhchhhhcccccccCCcee-cCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHH Q lcl|NC_015266. 159 PLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVL-VGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKY 237 (337) Q Consensus 159 PllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~-~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~ 237 (337) ||||||||||||++|+++|+|||++++.++|+|. 
+|+||||+||||||+|+++ +||||||++|||||||||||+++|| T Consensus 154 PllqDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~-~I~~~~~~d~dLVvivG~dLla~~~ 232 (336) T protein:vir:37 154 TDLSDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQ-GLDFRHQNRNDLVFLVGADLVSKET 232 (336) T ss_pred ccccccchhHHHHHHhccchhhcccccccCCceEEecCCCCcccHHHHHHHHHh-ccchHHhcCCCeEEEEchhhhhhhh Confidence 9999999999999999999999999988889976 4999999999999999997 7999999999999999999999999 Q ss_pred HHHHhc-cCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeeccccceecchhhhcc Q lcl|NC_015266. 238 FPIVNT-TQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNPKRDQIENYESSND 316 (337) Q Consensus 238 ~~l~n~-~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e~y~s~Ne 316 (337) +||+|+ +++|||++|+++++++|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||+|||| T Consensus 233 ~~l~~~~~~~PtE~~Aa~~~~~~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~Ne 312 (336) T protein:vir:37 233 KLIQQKHGLTPTEKAALGSHNLMGSFGGMNAITPPNFPARAAAVTTLKNLSVYTEAESVRRSLRNDEDKKGLVTSYYRQE 312 (336) T ss_pred hhhhhhcCCCHHHHHHHHHHHHHHhhCCceEEEccccCCCceEEeeccccEEEEecCcEEEEEEEccccccccchhhhcc Confidence 999997 5799999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceeecCCcEEEeeceeeccC Q lcl|NC_015266. 317 AYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 317 ~YvVEd~~~~a~iEnI~~~~a 337 (337) ||||||||++|+||||++... T Consensus 313 ~YvVEd~~~~a~iE~i~v~~~ 333 (336) T protein:vir:37 313 GYVVEDLGLMTAIDHTKVKLN 333 (336) T ss_pred eeeeeccccEEEeeeeeeecc Confidence 999999999999999999887 No 16 >protein:vir:270 Length: 341 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536650;genbank:gi:17975128;genbank:GeneID:929084 Probab=100.00 E-value=1.8e-172 Score=962.13 Aligned_cols=323 Identities=31% Similarity=0.505 Sum_probs=312.7 Q ss_pred CChHHHHHHHHHHHHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceeccC Q lcl|NC_015266. 
1 MKKETRQAYRKYAAQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~t 80 (337) |+++||++|++|++++|++|||++++++|+|+|+++|+|+++|||||+||++||+++|++++||+|++|++|+|||||+| T Consensus 5 m~~~tr~~~~~y~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~Invv~V~e~~Ge~v~lg~~g~iagrtdt 84 (341) T protein:vir:27 5 LTQSAREYMDNFAQQLAKSYGVSNVAELFNVSPQLETKLRAAITESAEFLKMITVTTVDQIEGQVVDVGVSGLYTGRKAG 84 (341) T ss_pred ccHHHHHHHHHHHHHHHHHcCcccccceEeecHHHHHHHHHHHHhhHHhhhcCccccccceeeeEeecccccceeeccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCcccccccccccCccceeeEeeccccccCHHHHHHHhc---CccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhh Q lcl|NC_015266. 81 TKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAK---FPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAA 157 (337) Q Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~---~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~ 157 (337) ++++|+ + ++++++|+|+|||||+||+|++||+||| ||||++|+++++.+|+|||||||||||+|+|++|||++ T Consensus 85 ~R~~r~---~-~l~~~~Y~c~qtn~dt~i~y~~lDaWA~~g~~~dF~~r~~~~i~~~~ALD~i~IGfnGts~A~~Td~~a 160 (341) T protein:vir:27 85 GRFTKQ---V-GVGGHKYKLAETDSCAAITWAMLCQWANQGGRDQFMKHLTEFSNQMFALDIMRIGWNGVSAEADTDPSA 160 (341) T ss_pred Cceecc---c-ccCCcceEEEEeeeeeeecHHHHHHHHhcCCChHHHHHHHHHHHHHHhhhhhhhcccceeeccCCChhh Confidence 766665 3 7999999999999999999999999999 89999999999999999999999999999999999999 Q ss_pred hhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHH Q lcl|NC_015266. 
158 NPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKY 237 (337) Q Consensus 158 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~ 237 (337) |||||||||||||++||++|+|||+++ ++..|+||||+||||||+|++++|||||||++|||||||||||+++|| T Consensus 161 nPllqDVNkGWlQ~~Re~a~~rVl~~~-----~~~~g~~gdy~nLDAlV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~ 235 (341) T protein:vir:27 161 NPLGQDVNEGWIAFVKNRKASQVVDVD-----VYFDETNGDYRTLDAMASDIINNQIHPMFRNDPRLTVFVGSGLIGAAQ 235 (341) T ss_pred cccccccchhHHHHHHhhcccceeccc-----eeeccCCCccccHHHHHHHHHhcccChHHhcCCCEEEEEchhhhhhhh Confidence 999999999999999999999999864 567799999999999999999999999999999999999999999999 Q ss_pred HHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeeccccceecchhhhccc Q lcl|NC_015266. 238 FPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNPKRDQIENYESSNDA 317 (337) Q Consensus 238 ~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e~y~s~Ne~ 317 (337) |||+|++++|||++|+|++ +|+||||||++|||||+++||||+|||||||||+|++||+++|||+|||||||+| + T Consensus 236 ~~l~n~~~~ptE~~Aa~~i--~k~iGGlpa~~~PffP~~~~lVT~L~NLsIY~Q~gs~RR~~~d~p~r~rie~yes---~ 310 (341) T protein:vir:27 236 AKLYDKADKPSEQIAAQKL--DKTIAGRPAYVPPFLPDNAMVVTIPENLQVLTQHGTAQRKAKHESDRKRSKTHTG---A 310 (341) T ss_pred hhhhccCCCCHHHHHHHHH--HHhhCCCeEEEccccCCCceEEeeccceEEEEecCcEEEEEEeccccccccchhh---h Confidence 9999999999999999988 7899999999999999999999999999999999999999999999999999977 8 Q ss_pred ceeecCCcEEEee--ceeeccC Q lcl|NC_015266. 
318 YVVEDFGCGCVAE--NIELVAA 337 (337) Q Consensus 318 YvVEd~~~~a~iE--nI~~~~a 337 (337) ||||||||++++| +|++..+ T Consensus 311 YvVEdyg~~~~~~~~~vkl~~~ 332 (341) T protein:vir:27 311 WKVTQWVCWKRSPLTTQKKSTS 332 (341) T ss_pred heeehhhhhhhccccccccCcc Confidence 9999999999999 6777666 No 17 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=100.00 E-value=4.2e-79 Score=450.28 Aligned_cols=307 Identities=13% Similarity=0.160 Sum_probs=261.3 Q ss_pred HHHHHHHHHHHHHHHhcCc--ccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceeccC- Q lcl|NC_015266. 4 ETRQAYRKYAAQIAKLNDT--DDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTDT- 80 (337) Q Consensus 4 ~tr~~~~~y~~~~a~~ngv--~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~t- 80 (337) -+++.|++|++++++.+++ ++++..|+|.|+++|+|.++++|+|+||++||+++|++.+|+++.+|+++++. |+.+ T Consensus 1 ~~~k~~~~~l~~~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~~~~~~~~~-~~~~e 79 (321) T protein:vir:31 1 MASRTINNDLSRITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKTRIPTLNIGERHR-RPQDE 79 (321) T ss_pred CchHHHHHHHHHHHHhccccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcceeeeeeccCCccc-ccccc Confidence 6788899999999999886 67889999999999999999999999999999999999999999999987775 5554 Q ss_pred CCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhhh Q lcl|NC_015266. 
81 TKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANPL 160 (337) Q Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nPl 160 (337) +..+|.+.++ ++++.+|.|++++++++|+|++||+||++|||++++++.+++++|+|++++||||++++.++ T Consensus 80 ~~~~~~~~~~-~~~~~~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~------- 151 (321) T protein:vir:31 80 GEWNENESDV-STGTIDISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDS------- 151 (321) T ss_pred cccccccccc-eeeeeeeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCc------- Confidence 4555665555 68999999999999999999999999999999999999999999999999999999876554 Q ss_pred hhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHHHH Q lcl|NC_015266. 161 LQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPI 240 (337) Q Consensus 161 lqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~~l 240 (337) +++||+||||++|++++. ++.+++..++|.+ .+++. .||++||+++++|+|||++++.+++.+| T Consensus 152 ~~~~n~G~l~~a~~~~~~--------------~~~~~~~~~~d~l-~~l~~-~l~~~yr~~~~~v~im~~~~~~~~~~~l 215 (321) T protein:vir:31 152 FENQNDGFITVAEGDVET--------------IDAADDILDNDLV-IRTIA-GLDSKYRARMNPALIVSEDQLLSYHYTL 215 (321) T ss_pred ccccchhhhhhhcccccc--------------ccccccccCHHHH-HHHHH-hccHhHhcCCCeEEEechHHHHHHHHHH Confidence 789999999998875321 2334455566654 45665 6799999999999999999999888878 Q ss_pred HhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeecc----ccceecchhhhcc Q lcl|NC_015266. 241 VNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNP----KRDQIENYESSND 316 (337) Q Consensus 241 ~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p----~r~r~e~y~s~Ne 316 (337) .+.. +|.+..+.. 
-..+++|+|+|++++||||++++++|+|+||++|++.+.++|+..+.+ +++|+++|+++|+ T Consensus 216 ~~~~-~~~~~~~l~-~~~~~tl~G~pvv~~~~mP~~~il~t~~~nl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 293 (321) T protein:vir:31 216 TDRD-TPLGDNVIM-GEADVNPFSFPIIGSGLWPDDKAMFTDPQNLIYALYRDLEIDVLTESDKVSERDLHARYFMRGDD 293 (321) T ss_pred hcCC-Cccccchhh-ccccccccceeEEEcCCCCCCcEEEeccccEEEEEeeccEEEEeecCccccccceeeEeeeeeec Confidence 7653 455443221 235679999999999999999999999999999999998777766644 5789999999999 Q ss_pred cceeecCCcEEEeeceee-ccC Q lcl|NC_015266. 317 AYVVEDFGCGCVAENIEL-VAA 337 (337) Q Consensus 317 ~YvVEd~~~~a~iEnI~~-~~a 337 (337) |||||||+++|++|||+. .+. T Consensus 294 ~~~ve~~~a~a~~~~i~~~~~~ 315 (321) T protein:vir:31 294 DFAIENTEAVVLAEGLGDPLEH 315 (321) T ss_pred ceeEeccccEEEEecCCcchhc Confidence 999999999999999986 233 No 18 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=100.00 E-value=9e-47 Score=273.04 Aligned_cols=328 Identities=14% Similarity=0.169 Sum_probs=232.9 Q ss_pred CChHHH--HHHHHHHHHHHHhcCc-ccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcccccccccee Q lcl|NC_015266. 1 MKKETR--QAYRKYAAQIAKLNDT-DDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASR 77 (337) Q Consensus 1 M~~~tr--~~~~~y~~~~a~~ngv-~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagR 77 (337) |.+++. +..|+++..+++.+-. ++++ +|.+.|+++++|.+++|+++.||++|+++++.+.+|+.-.+|+++.+. 
| T Consensus 1 ~~~~~~~~~~~n~~~~~i~k~~it~~~l~-~g~L~p~~a~~Fl~~v~~~t~iL~~~r~~~~~s~~~ei~kig~G~r~~-r 78 (360) T protein:vir:99 1 MSSNSTIDSVRNQNMNSLSQKDIGLAELD-GFQLPVDVTEEFLERMQKGVQILGMADTMTLARLEMEVPQFGVPRLSG-H 78 (360) T ss_pred CcchhHHHHHhhhHHHHHHhhhccccccC-ceeecHHHHHHHHHHHhhccchhhhcceeecccccccccccccceeec-c Confidence 877654 4568999999988754 5665 799999999999999999999999999999999999955555533333 4 Q ss_pred ccCCCcccccccccccCccce-eeEeeccccccCHHHHHHHhc--CccHHHHHHHHHHHHHhhchhhhcccceeccCCC- Q lcl|NC_015266. 78 TDTTKAERQPIDPTALDSNRY-RCEKTDYDTAITYRKLDAWAK--FPDFQQRIRNVILNQSALDRIMIGWNGVKAALST- 153 (337) Q Consensus 78 t~t~~~~R~~~~~~~l~~~~Y-~c~qtn~d~~i~y~~LD~wA~--~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~T- 153 (337) ..+.++.........-...+| ..+..-.-++|.++.+....+ ..+|++.+++.++++++.|+.++||||.+...++ T Consensus 79 ~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~g~~ds~d~~ 158 (360) T protein:vir:99 79 TRDEEGSRTENSEAESGSVKFNATDKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIRAGASSGNLQ 158 (360) T ss_pred ccccCCCCCcCCcCccccCccccccceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhhccchhcccc Confidence 332221111111111122223 233333444667777766555 5579999999999999999999999999987654 Q ss_pred -ChhhhhhhhccchhHHHHHHhhchhhhcccccccCC-----------------ceecCCCcccccHHHHHHHHHhcccC Q lcl|NC_015266. 154 -DKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAG-----------------KVLVGKGGDYVNLDALVMDIVSSMID 215 (337) Q Consensus 154 -D~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~-----------------~i~~G~ggdy~nLDalV~da~~~li~ 215 (337) |-..+|++ ++|+||||+++.+ ++.+ .++.++++ ...-|.|+-|....+|+.+++.. 
|| T Consensus 159 ~~~~~d~fl-~~~dGwlKka~~~-~~~i-d~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~~~~~~-Lp 234 (360) T protein:vir:99 159 SIGGAAELD-NTFKGWIARAEGD-AQSV-DDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLFNETIQT-LD 234 (360) T ss_pred cCcccchhh-hhhHHHHHHhhcc-cchh-hccccccccccccccccccccchhhhccccccccccchHHHHHHHHHh-cc Confidence 33345655 9999999999987 3222 22211110 01236677789999999999986 58 Q ss_pred hhHcCCCC--eEEEeChHHHHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecC Q lcl|NC_015266. 216 PWFQEDTG--LVVICGRELLHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEG 293 (337) Q Consensus 216 ~~~r~~~d--LVvivG~dLla~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~g 293 (337) ..||+.+. ++++++.+........|.++....... +.+-....++-|+|++.||+||++.+|+|+++||.++.-++ T Consensus 235 ~kyr~~~~~~~~~~~s~~~~~~yr~~L~~R~t~LGd~--~l~g~~~~~~~Gipi~~v~~~pd~~~mlT~p~NLi~g~~~~ 312 (360) T protein:vir:99 235 SRYRESDAYSPVLMTSPNQVQSYTMSLTEREDPLGSA--VIFGDSDITPFSYDLVGVNGFPDEYMMFTDPNNLAFGLYEE 312 (360) T ss_pred hhhhcCcccceEEEccCchHHHHHHHHhccCcccchh--heecccccccceeeeEEcCCCCCCceEEeccCceeEEeeee Confidence 88998874 489999998776666665554432221 11112335677999999999999999999999996666666 Q ss_pred ceEEeEeecccc---ce--ecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
294 ARRRSLIDNPKR---DQ--IENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 294 s~RR~~~d~p~r---~r--~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) .|.+...+ |+| .| +.+|.+...+|++||++++|+++||+-..| T Consensus 313 iri~~~~e-~~~~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~~~~~~~ 360 (360) T protein:vir:99 313 MELDQSTD-TDKVHEQRLHSRNWLEGQFDFQIKEQQAGVLVTDLETPTA 360 (360) T ss_pred eEEeeccc-chhhhhhceeeeEEEEEEeeEEEEecccEEEEecCCCCCC Confidence 66654444 333 22 456667789999999999999999999999 No 19 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=100.00 E-value=4.4e-42 Score=247.32 Aligned_cols=302 Identities=14% Similarity=0.119 Sum_probs=223.5 Q ss_pred CChHHHHHHHHHHHHHHH---hcCcccccceeeecHHHHHHHHHHHHhhhhhhccccccc-chhhhhhhhccccccccce Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAK---LNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILP-VTELEGEKLGLSVSGPIAS 76 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~---~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~-V~~~~Ge~v~~gv~g~iag 76 (337) |. ++.+..+ .-.+++.+ .+.+.|.+.++|.++++|+|.||++++++. +...+++.-.+|+++.+++ T Consensus 1 ~~---------~~~~~~~~~k~it~~d~~-gG~L~P~~~~~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~ 70 (314) T protein:vir:41 1 MD---------FLNKPFQITPKIDVPDLG-KGILAVQRFGEFVREVRENSAIIKDARVLNALKSYEVDISRISLGVELEP 70 (314) T ss_pred Cc---------hhhhHHHhhcccccccCC-CceeChHHHHHHHHHHHhccchhhheeeecccCccceeecccccCccccc Confidence 32 2222222 23456655 567999999999999999999999999984 5666777767787777665 Q ss_pred eccCCC-cccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCCh Q lcl|NC_015266. 77 RTDTTK-AERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDK 155 (337) Q Consensus 77 Rt~t~~-~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~ 155 (337) ..+.+. 
....|.+..+.++.+|.|++....++|+|+.|+.||..|+|++.+.+.+++++|.|+.+++|||.....++ T Consensus 71 ~~~~~~~~~~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~-- 148 (314) T protein:vir:41 71 GRNTSGTKVAPTADEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTG-- 148 (314) T ss_pred ccccccCCccCCcccccccceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCc-- Confidence 555433 33446666788999999999999999999999999999999999999999999999999999998755554 Q ss_pred hhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHH Q lcl|NC_015266. 156 AANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHD 235 (337) Q Consensus 156 ~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~ 235 (337) +|+++ +++|||++... . +.-.++++|.+.+.++.+++.+|.++|+++.+++|+||+++.+ . T Consensus 149 --~~~~~-~p~G~l~~a~~----~-----------~~~~~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~-~ 209 (314) T protein:vir:41 149 --RELYR-INDGWMKLAGN----Q-----------YTDAEPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIY-N 209 (314) T ss_pred --ccchh-cchhhhhhccc----c-----------eeecCccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHH-H Confidence 46666 99999996321 1 1123456789999999999998866666778899999999976 5 Q ss_pred HHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCcc-----CCCceEEecccccEEEEecCceEEeEeeccccceecc Q lcl|NC_015266. 236 KYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFF-----PKRAMMVTKLENLSIYFQEGARRRSLIDNPKRDQIEN 310 (337) Q Consensus 236 k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPff-----P~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e~ 310 (337) ++..++...+++--..+ ..-....++.|+|++.+|+| |++.|++|.++|| ||.-.-..|+..+-+.+.+++.. 
T Consensus 210 ~~r~~l~~~~~~l~~~~-~~~~~~~~l~G~PV~~~~~~~~~~~~~~~i~fgd~~nl-v~~~~~~ir~~~~~~a~~~~~~~ 287 (314) T protein:vir:41 210 GYRKQLLVRETGLGDSA-LIGATGLQYDGIPIQYVPALDALGDDKARALLTVPTNL-VYGFWRNIRIEPKRDAAMRRTEY 287 (314) T ss_pred HHHHHHhccCCcccchh-hhCCCCceecceeeEecccccccCCCCceEEEechhhe-EEEeeceeEEeecccCcCCeEEE Confidence 67766654333311111 11123457999999999997 6799999999999 55444445555555666778888 Q ss_pred hhhhcccceeecCCcE--EEeeceeec Q lcl|NC_015266. 311 YESSNDAYVVEDFGCG--CVAENIELV 335 (337) Q Consensus 311 y~s~Ne~YvVEd~~~~--a~iEnI~~~ 335 (337) +.+..-++.+|+.+.+ +++++..-+ T Consensus 288 ~~~~r~d~~~~~~~aa~~~~~~~~~~~ 314 (314) T protein:vir:41 288 IASLRADCNYEDENAAVAAVIDMSSGG 314 (314) T ss_pred EEEEEeceEEEEcCcEEEEEeeccCCC Confidence 8888888777766444 444544444 No 20 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=100.00 E-value=7.5e-39 Score=229.58 Aligned_cols=305 Identities=12% Similarity=0.088 Sum_probs=205.2 Q ss_pred CChHHHHHHHHHHHHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhccccccc-chhhhhhhhccccccccc-eec Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILP-VTELEGEKLGLSVSGPIA-SRT 78 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~-V~~~~Ge~v~~gv~g~ia-gRt 78 (337) |-.-.-.+.+....-+ +..++++. ..|.+.|++.++|.++++|+|.||++++++. ....+++.-.+|+++++. 
|++ T Consensus 1 ~~~~~~~~~~~~~~~~-k~~t~~d~-~Gg~l~P~~~~~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~~~~~g~~ 78 (315) T protein:vir:41 1 MLTIEDIRGGKPFEIV-PKIDVPDL-GRGVLSVDRFGEFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVLDVGPGRD 78 (315) T ss_pred CcccchhhcCChhhhh-hhcCCcCC-CCceechHHHHHHHHHHHhhhhhhhhceeeeccccccccccccccCcccccccc Confidence 2211111122222212 33456665 4778999999999999999999999999864 556666666666655543 555 Q ss_pred cCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhh Q lcl|NC_015266. 79 DTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAAN 158 (337) Q Consensus 79 ~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~n 158 (337) -++...+.|....++...+|.|+++.+.++|+|+.|+.|+..|+|++.+.+.+++++|.|+.+++|||.+.+.++ T Consensus 79 ~~~~~~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s~~p----- 153 (315) T protein:vir:41 79 ETGQKLAPPESTAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDTSSSDP----- 153 (315) T ss_pred cccCcCCCCCCccccceeeeceeeeeeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccCCcCcCc----- Confidence 566666667777789999999999999999999999999999999999999999999999999999998866443 Q ss_pred hhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHc-CCCCeEEEeChHHHHHHH Q lcl|NC_015266. 159 PLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQ-EDTGLVVICGRELLHDKY 237 (337) Q Consensus 159 PllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r-~~~dLVvivG~dLla~k~ 237 (337) +...|+|||++.+..+...... ++.++. .. 
.++.+++.+| ++.|| +.+++|+||+++.++ ++ T Consensus 154 --~~~~~~G~l~~a~~~~~~~~~~-----------~~a~~~-~~-d~l~~l~~sl-~~~yr~~~~~~~~imn~~t~~-~~ 216 (315) T protein:vir:41 154 --LLRMSDGWLKLASEKLTESDVD-----------PEAEDW-PM-NLFDTMIESL-PTPYRNNLPNMKFYVTWDIYR-AY 216 (315) T ss_pred --cccccccceecccccccccccc-----------cccccc-cH-HHHHHHHHhc-ChHHhhcCCceEEEEcHHHHH-HH Confidence 2347899999766542211111 111111 11 2445566665 55555 567999999999885 45 Q ss_pred HHHHhccCChhHHHHHHHHHhhhhhcCceeEECCcc-----CCCceEEecccccEEEEecCceEEeEeeccccceecchh Q lcl|NC_015266. 238 FPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFF-----PKRAMMVTKLENLSIYFQEGARRRSLIDNPKRDQIENYE 312 (337) Q Consensus 238 ~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPff-----P~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e~y~ 312 (337) .++....+.+--.- ........++.|+|++.+|.| |++.|++|.++||.+....+.++....+ ++..++..|. T Consensus 217 rklk~~~g~~lw~~-~~~~g~~~tl~G~PV~~~~~m~~~~~~~~~ilf~d~~nl~~~~~~~i~i~~~~~-a~~~~~~~~~ 294 (315) T protein:vir:41 217 RDALKGRETGLGDQ-ALTGANSILYDGRPVQYVPALEALNDGKSRALFVVPTQLVYGFWRNIKVVPDYD-AEMRLTKYVA 294 (315) T ss_pred HHHhccCCCccccc-hhhcCCCceecccceEecccccccCCCCccEEEecccceEEEeccccEEEeeec-CCCCceEEEE Confidence 55554333221110 111112458999999998887 5788999999999876665554444333 2222222222 Q ss_pred -hh-cccceeecCCcEEEeece Q lcl|NC_015266. 313 -SS-NDAYVVEDFGCGCVAENI 332 (337) Q Consensus 313 -s~-Ne~YvVEd~~~~a~iEnI 332 (337) .| .-+|++|+. +++++.+| T Consensus 295 ~~r~d~~~~~~~~-~a~~~~~v 315 (315) T protein:vir:41 295 SLRTDNHYEDEEG-AVSATITV 315 (315) T ss_pred EEEeceeEEeccc-eeEeeeeC Confidence 23 445778884 67777788 No 21 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=98.79 E-value=1.1e-09 Score=69.72 Aligned_cols=308 Identities=13% Similarity=0.101 Sum_probs=166.3 Q ss_pred CChHHHHHHHHHHHHHHH---hcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcccccccccee Q lcl|NC_015266. 
1 MKKETRQAYRKYAAQIAK---LNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASR 77 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~---~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagR 77 (337) =+.+.+..|..|+..--. ++......-.|.|-+.+...+.+.+++.+.+++..+++++....+. +-+..+++.++- T Consensus 109 ~~~~~~~af~~~l~~~e~~~al~~~t~~~gG~lvP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~-~~~~~~~~~a~w 187 (425) T protein:vir:10 109 RDPEYTEAFKAHVKRGDVQAALNKGEDSEGGYLTPIEWDRTITNKLVLISPMRQLCRVQPVSKAGFS-KLFNMGGTTSGW 187 (425) T ss_pred ccHHHHHHHHHHhhhhhhHHHhhcCcCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeeccCCceE-EEEEcCCcceee Confidence 223446678777754221 1111222335677777788999999999999999999998765443 334445555543 Q ss_pred ccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhh Q lcl|NC_015266. 78 TDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAA 157 (337) Q Consensus 78 t~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~ 157 (337) +..+ ......+...+....|.+++.---..|+.+.|+..+ ++|+..+.+.+.+.++.=.-.--+||+-. . T Consensus 188 v~E~-~~~~~~~~~~f~~v~~~~~k~~~~i~iS~ell~ds~--~~l~~~i~~~la~ai~~~~d~~~l~G~G~-------~ 257 (425) T protein:vir:10 188 VGEA-SQRPQTNAATFQPLSFASGEIYANPAATQQILDDAE--IDLESWLATEVQTEFAKQEGKAFLAGDGT-------N 257 (425) T ss_pred eccc-cccccccccccceeeeeheeeEeehHhHHHHHhcch--hHHHHHHHHHHHHHHHHHHHhhhhcccCC-------C Confidence 3222 222223444577788899998889999999998653 68999999999999988777777787531 1 Q ss_pred hhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHH Q lcl|NC_015266. 158 NPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKY 237 (337) Q Consensus 158 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~ 237 (337) + ..|+|...-.......-..+ ....+..++.+. .+.|.++ |++.+| ++.|+... +++|.+..... . 
T Consensus 258 ~------p~Gil~~~~~~~~~~~~~~~--~~~~~~~~~~~~-~~~d~l~-~l~~~l-~~~~~~~a--~~vmn~~~~~~-L 323 (425) T protein:vir:10 258 K------PNGLLTYIAGGANAAKHPFG--AIEVVNSGAAAD-ITSDGII-DLVYDL-PSAFTGNA--RFAMNRNTQRQ-V 323 (425) T ss_pred C------cceeeecccccccccccccc--cccccccccccc-ccHHHHH-HHHhhh-hhhhccCC--EEEEchHHHHH-H Confidence 2 23666432211100000000 001112222222 3456665 567664 78888765 88999887642 2 Q ss_pred HHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCC-----CceEEecccccEEEEecCceEEeEeeccccceecchh Q lcl|NC_015266. 238 FPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPK-----RAMMVTKLENLSIYFQEGARRRSLIDNPKRDQIENYE 312 (337) Q Consensus 238 ~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~-----~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e~y~ 312 (337) ..+-+..+.|--.-..+ -....+|-|+|++..++||. ..|++=.+++.-..+++.+.+....+--.++.+.-+- T Consensus 324 ~~lkD~~G~~l~~~~~~-~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~v~~d~~~~~~~~~~~~ 402 (425) T protein:vir:10 324 RKLKDGQGNYLWQPSYV-AGQPATLAGYPVTEVPDMPDVAANSTPILFGDFQQTYLIIDRIGVRVLRDPYTAKPYVLFYT 402 (425) T ss_pred HHhhcCCCceeeccCcc-CCCCceecceeeEEecCcCCccCCccEEEEEehhccEEEEEecceEEEecccccCCcEEEEE Confidence 22222222221000000 01235788999999999995 3477777776533444444443221111111111100 Q ss_pred hhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 313 SSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 313 s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) ..=-|..|-+.++++. +++..+ T Consensus 403 ~~r~d~~v~~~~A~~~---l~~~as 424 (425) T protein:vir:10 403 TKRVGGGLLNPEPMRA---MKVAAS 424 (425) T ss_pred EEEeccEeecccceEE---EEeecc Confidence 0012222333333322 333334 No 22 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=98.66 E-value=9.9e-09 Score=64.47 Aligned_cols=292 Identities=13% Similarity=0.066 Sum_probs=157.3 Q ss_pred CChHHHHHHHHHHHHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceeccC Q lcl|NC_015266. 
1 MKKETRQAYRKYAAQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~t 80 (337) ++.+-|+.++++.+. .+ ...-.+.|-+.+...+.+.+.+.|.++++++++++.-..+...... +++-+.-... T Consensus 72 l~~~~r~~~~~~~~~----~~--~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~i~~~~-~~~~a~~~~E 144 (390) T protein:vir:40 72 LTSDESKYYNEVIAG----NG--FAGVTALLPPTVFERVFEDLTVEHPLLSKINFVNTTATTEWIISVG-DVATAWWGPL 144 (390) T ss_pred ccHHHHHHHHHHHhc----cC--cccCcccccHHHHHHHHHHHHhhhhhhhhceeeecCCceeEEEEEc-CCcceeeecc Confidence 555556555544322 12 2233556778888999999999999999999999865333322222 2222222221 Q ss_pred CCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhhh Q lcl|NC_015266. 81 TKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANPL 160 (337) Q Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nPl 160 (337) .....+..-..++...|.+++.--...|+.+.|+... .+++..+++.+.++++.-.-.--++|+-.. -| T Consensus 145 -~~~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~ds~--~~l~~~i~~~la~~i~~~~~~a~l~G~G~~-------~P- 213 (390) T protein:vir:40 145 -CAEIKEVLDNGFDKIQTGMYKLSAYIPVCNAMLDLGP--SWLDQYVRTILGEAMALGLEAGIVNGSGKD-------QP- 213 (390) T ss_pred -ccccCccccccceeeEeeeeeEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHhhhhcccCCC-------cc- Confidence 1223333335577888888888888899999998542 379999999999999887777777885311 12 Q ss_pred hhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHH-- Q lcl|NC_015266. 161 LQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYF-- 238 (337) Q Consensus 161 lqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~-- 238 (337) .|+|... .-+ +.+. ....-...-.+.+...++..+...+.+...+.....+++|.+....++.. 
T Consensus 214 -----~Gil~~~-----~~~-~~~~---~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~ 279 (390) T protein:vir:40 214 -----IGMMRDL-----NNV-TAGE---HPVKTATPLTDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAA 279 (390) T ss_pred -----ceeeecc-----ccc-cccc---cccccccccchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHH Confidence 4555311 100 0000 00000111223444444444443332222233446789998754332211 Q ss_pred HHH-hccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeecc--ccceecchhhhc Q lcl|NC_015266. 239 PIV-NTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNP--KRDQIENYESSN 315 (337) Q Consensus 239 ~l~-n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p--~r~r~e~y~s~N 315 (337) ..+ +..+.+ + ......|+|++..+++|++.+++-.+++.-|+. .+..+=..-+.. .++.+--.-..- T Consensus 280 ~~~~d~~G~~---v------~~~~~~g~pvv~~~~~p~~~i~~Gd~s~~~i~~-~~~~~v~~~~~~~f~~~~~~~r~~~r 349 (390) T protein:vir:40 280 TSYMTPQGVW---V------TGILPVPLEIVQSVAVPVGKAVAGRAKDYFMGI-GSEQVIRTSTEYRLLDDETLYYAKQY 349 (390) T ss_pred hhccCCCCcc---c------cccCCCceeEEEcCCCCCCcEEEEeeceEEEEe-ecceEEEecchhhhhcCcEEEEEEEE Confidence 122 222222 1 122446999999999999999999998865543 344432222111 112222211222 Q ss_pred ccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 316 DAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 316 e~YvVEd~~~~a~iEnI~~~~a 337 (337) -+..|-|.++++.++ +..+ T Consensus 350 ~dg~v~~~~A~~~l~---~~~~ 368 (390) T protein:vir:40 350 ANGRPKDNSSFLVFD---ITGL 368 (390) T ss_pred eCCEEecccceEEEE---eecc Confidence 233444444444432 2222 No 23 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=98.58 E-value=2e-08 Score=62.76 Aligned_cols=297 Identities=11% Similarity=-0.023 Sum_probs=158.1 Q ss_pred CChHHHHHHHHHHHH-------------HHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhc Q lcl|NC_015266. 
1 MKKETRQAYRKYAAQ-------------IAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLG 67 (337) Q Consensus 1 M~~~tr~~~~~y~~~-------------~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~ 67 (337) .+......|..++.. .....+.......+.|-+.+.+.+.+.+.+.+.+++.++++++..-.+.... T Consensus 104 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~ 183 (418) T protein:vir:10 104 TESEEMKGMDGSARKSVRVRVDRKSIMNVPATVGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTSSSSIEYTV 183 (418) T ss_pred hhHHHHHHHHHHHhhhhhhhhHHHHHHHhhhhccCCCCCCccccchhHHHHHHHHHhhhhhHHhhcceeeccCCceeEEE Confidence 111122222222221 1112222233345678888889999999999999999999998765555555 Q ss_pred cccccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccce Q lcl|NC_015266. 68 LSVSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGV 147 (337) Q Consensus 68 ~gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~ 147 (337) ....++-++=+.. ....|..-..++...+.+++.---+.|+.+.|+.. ++|+..+++.+.++++.-.-.--+||+ T Consensus 184 ~~~~~~~a~~v~E--~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds---~~l~~~i~~~l~~a~~~~~d~a~l~G~ 258 (418) T protein:vir:10 184 ETGFTNNAAAVAE--GAQKPTSDLKFNLKNQPVRTIAHLFKASRQILDDA---PALQSYIDGRARYGLQLTEEGQILKGD 258 (418) T ss_pred EecCCCceeeecc--CccccccccceeeEEEeeeeEEEeehhhHHHHHhH---HHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 4333333322221 12223334567788888888888888999999864 589999999999988887777778884 Q ss_pred eccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEE Q lcl|NC_015266. 148 KAALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVI 227 (337) Q Consensus 148 s~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvi 227 (337) -.. .+|. |.+.. ..... .. ....+..++|.++. ++..+ .+.++... 
+++ T Consensus 259 g~~------~~p~------Gi~~~------------~~~~~--~~-~~~~~~~~~~~i~~-~~~~~-~~~~~~~~--~~v 307 (418) T protein:vir:10 259 GTG------ANIL------GILPQ------------ASAFM--PS-ITLANATPIDKIRL-ALLQA-VLAEFPAT--GIV 307 (418) T ss_pred CCC------cccc------ccccc------------ccccc--cc-ccccccccHHHHHH-HHHhh-ccccCCCC--EEE Confidence 421 1232 33321 11000 11 12223345565543 34444 44444443 688 Q ss_pred eChHHHHHHHHHHHhccCChhHHHHHHH-HHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeeccc-- Q lcl|NC_015266. 228 CGRELLHDKYFPIVNTTQAPTEQLAADL-IVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNPK-- 304 (337) Q Consensus 228 vG~dLla~k~~~l~n~~~~ptE~~A~~~-~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~-- 304 (337) |.+..... ...+-...+.| +-.+. -....++-|+|++..+++|++.+++-.+++....+..++..=.+-.... T Consensus 308 ~n~~~~~~-L~~lkd~~G~~---i~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~ 383 (418) T protein:vir:10 308 LNPIDWAS-IELTKDSQGRY---IVGNPVNGTTPRLWNLPVVETQAMTANEFLVGAFSMAAQIFDRMEIEVLLSTENVDD 383 (418) T ss_pred EcHHHHHH-HHHhhcCCCce---eccccccCCCceecceeeEEcCCCCCCcEEEeeccceEEEEEecceEEEEecccchh Confidence 89887542 22232222222 00010 0124588999999999999999999998874322222332222211111 Q ss_pred --cceecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 305 --RDQIENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 305 --r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) ++.+.-.-..--++.|-++.+++.++-..-+.. 
T Consensus 384 f~~~~~~~r~~~~~d~~~~~~~a~~~~~~~~~~~g 418 (418) T protein:vir:10 384 FEKNMVSIRAEERLALAVYRPESFVTGALVEQAGG 418 (418) T ss_pred hhcCceEEEEEEeeccEEecccceEEEEeccCCCC Confidence 111111001112334444555554432222222 No 24 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=98.51 E-value=3.2e-08 Score=61.66 Aligned_cols=295 Identities=11% Similarity=0.093 Sum_probs=157.1 Q ss_pred CChHHHHHHHHHHHHHHH--hcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcc-cccccccee Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAK--LNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGL-SVSGPIASR 77 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~--~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~-gv~g~iagR 77 (337) .....+..|..++..... ..+....+-.+.|-+.+...+.+.+.+.+.+++.++++++....|..... ..+++-++- T Consensus 101 ~~~~e~~~~~~~~~~~~~~~~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (415) T protein:vir:94 101 VTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEK 180 (415) T ss_pred hhHHHHHHHHHHhhhhhhhhhhccccccccccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEeecCCcccee Confidence 222333344444433221 12222223455565667889999999999999999999998777664333 223333333 Q ss_pred ccCCCccccc-ccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChh Q lcl|NC_015266. 78 TDTTKAERQP-IDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKA 156 (337) Q Consensus 78 t~t~~~~R~~-~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~ 156 (337) ...+ ...| .+...++...+..++.---..|+.+.|+... .+|+..+.+.+.++++.-.-.--++|.-...... 
T Consensus 181 v~Eg--~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~--~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~-- 254 (415) T protein:vir:94 181 VEEL--EENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMARTIAATRNKAIIDVITKGSTGS-- 254 (415) T ss_pred cccc--ccccccccccceeeEeeheeeeeechhhHHHHhhch--HHHHHHHHHHHHHHHHHHHHHHHhhccccCcccc-- Confidence 3222 1222 3334566777777777767789999888543 4899999999998887766666666644222111 Q ss_pred hhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHH Q lcl|NC_015266. 157 ANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDK 236 (337) Q Consensus 157 ~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k 236 (337) ...++.. ........+...| |.++ +++..+.++.++. -+++|.+..... T Consensus 255 -------~~~~~~~----------------~~~~~~~~~~~~~---~~i~-~~~~~~~~~~~~~---~~~vmn~~~~~~- 303 (415) T protein:vir:94 255 -------TSSGFEK----------------EGKKLEVKKAKSL---DDIK-DAINLNVKPNYEH---NVAIVSQTMFAK- 303 (415) T ss_pred -------ccccccc----------------cccccccccccch---HHHH-HHHHhhhhhccCC---CEEEEcHHHHHH- Confidence 1111100 0001111122334 4433 4666665665543 378888876542 Q ss_pred HHHHHh-ccCCh--hHHHHHHHHHhhhhhcCceeEECCccCCCc-----eEEecccccEEEEecCceEEeEeecccccee Q lcl|NC_015266. 237 YFPIVN-TTQAP--TEQLAADLIVSQKRIGNLPAVRVPFFPKRA-----MMVTKLENLSIYFQEGARRRSLIDNPKRDQI 308 (337) Q Consensus 237 ~~~l~n-~~~~p--tE~~A~~~~~~~k~igGl~a~~vPffP~~~-----ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~ 308 (337) +..+. ..+.| .... .-....++-|+|++..|++|.+. +++-.++++-+.+.+++.+=...+....... T Consensus 304 -l~~lkd~~G~~l~~~~~---~~~~~~~l~G~pV~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~ 379 (415) T protein:vir:94 304 -LDKMKDKLGNYLIQPDV---KEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGEC 379 (415) T ss_pred -HHHhhccCCCeeeccCc---CCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEeccccCceE Confidence 22232 22222 0000 00123578899999999999776 7888899875555555444333321111000 Q ss_pred cchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
309 ENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 309 e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) --.+. --+..|-++.+++.++--+.+.. T Consensus 380 ~r~~~-r~d~~~~~~~a~~~~~~~~~~~~ 407 (415) T protein:vir:94 380 LMIAV-RQDCRILDYKSAIVIEYDDSERG 407 (415) T ss_pred EEEEE-EeccEEeccccEEEEEEeccCCC Confidence 00011 12445556666666653332222 No 25 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=98.50 E-value=4.2e-08 Score=61.00 Aligned_cols=296 Identities=11% Similarity=0.086 Sum_probs=156.8 Q ss_pred CChHHHHHHHHHHHHHHH--hcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcccc-cccccee Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAK--LNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSV-SGPIASR 77 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~--~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv-~g~iagR 77 (337) +....+..|..++..... ..++......+.|-..+...+.+.+.+.+..++.++++++....|.....-. ++.-++- T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (415) T protein:vir:79 101 VTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEK 180 (415) T ss_pred hHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCcccee Confidence 222233334333332221 1122222234445556788999999999999999999999888777544332 2233332 Q ss_pred ccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhh Q lcl|NC_015266. 78 TDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAA 157 (337) Q Consensus 78 t~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~ 157 (337) ...+ ......+...++...+..++.---+.|+.+.|+.. ..+|+..+.+.+.++++.-.-.--++|.-....... 
T Consensus 181 v~E~-~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~-- 255 (415) T protein:vir:79 181 VEEL-EENPELAVKPFFQLAYDINTHRGYFRISREAIEDA--KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGST-- 255 (415) T ss_pred eccc-cccCcccccceeeEEeeeeeeEeeehhhHHHHhhc--hHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccc-- Confidence 2221 22222333456677777777766788999998763 247898899999888877665555666432221110 Q ss_pred hhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHH Q lcl|NC_015266. 158 NPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKY 237 (337) Q Consensus 158 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~ 237 (337) -.++ ..........+. .+.|.++ +++..+.++.++. -+++|.++.+.. T Consensus 256 -------~~~~----------------~~~~~~~~~~~~---~~~~~i~-~~~~~~~~~~~~~---~~~v~n~~~~~~-- 303 (415) T protein:vir:79 256 -------SSGF----------------EKEGKKLEVKKA---KSLDDIK-DAINLNVKPNYEH---NVAIVSQTMFAK-- 303 (415) T ss_pred -------cccc----------------cccccccccccc---cchhHHH-HHHHhhhhhccCC---CEEEEcHHHHHH-- Confidence 0000 000001111122 3445554 5676665555443 378899987653 Q ss_pred HHHHh-ccCChh--HHHHHHHHHhhhhhcCceeEECCccCCCc-----eEEecccccEEEEecCceEEeEeeccccceec Q lcl|NC_015266. 238 FPIVN-TTQAPT--EQLAADLIVSQKRIGNLPAVRVPFFPKRA-----MMVTKLENLSIYFQEGARRRSLIDNPKRDQIE 309 (337) Q Consensus 238 ~~l~n-~~~~pt--E~~A~~~~~~~k~igGl~a~~vPffP~~~-----ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e 309 (337) +..+. ..+.|- .-. .-....+|-|+|++..|++|... +++-.++++-+.+..+..+=...+........ T Consensus 304 l~~lkd~~G~~l~~~~~---~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~ 380 (415) T protein:vir:79 304 LDKMKDKLGNYLIQPDV---KEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECL 380 (415) T ss_pred HHHhhccCCceeeccCc---CCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEeccccCceEE Confidence 22232 211210 000 00123589999999999999765 88888888766666555554443321111110 Q ss_pred chhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
310 NYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 310 ~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) --+-| -+..|-++.+++.++--+-+.- T Consensus 381 ~~~~r-~d~~v~~~~a~~~~~~~~~~~~ 407 (415) T protein:vir:79 381 MIAVR-QDCRILDYKSAIVIEYDDSERG 407 (415) T ss_pred EEEEE-eccEEeccccEEEEEEeccCCC Confidence 01111 2344556666666553332222 No 26 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=98.50 E-value=4.2e-08 Score=61.00 Aligned_cols=296 Identities=11% Similarity=0.086 Sum_probs=156.8 Q ss_pred CChHHHHHHHHHHHHHHH--hcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcccc-cccccee Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAK--LNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSV-SGPIASR 77 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~--~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv-~g~iagR 77 (337) +....+..|..++..... ..++......+.|-..+...+.+.+.+.+..++.++++++....|.....-. ++.-++- T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (415) T protein:vir:81 101 VTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEK 180 (415) T ss_pred hHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCcccee Confidence 222233334333332221 1122222234445556788999999999999999999999888777544332 2233332 Q ss_pred ccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhh Q lcl|NC_015266. 78 TDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAA 157 (337) Q Consensus 78 t~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~ 157 (337) ...+ ......+...++...+..++.---+.|+.+.|+.. ..+|+..+.+.+.++++.-.-.--++|.-....... 
T Consensus 181 v~E~-~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~-- 255 (415) T protein:vir:81 181 VEEL-EENPELAVKPFFQLAYDINTHRGYFRISREAIEDA--KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGST-- 255 (415) T ss_pred eccc-cccCcccccceeeEEeeeeeeEeeehhhHHHHhhc--hHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccc-- Confidence 2221 22222333456677777777766788999998763 247898899999888877665555666432221110 Q ss_pred hhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHH Q lcl|NC_015266. 158 NPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKY 237 (337) Q Consensus 158 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~ 237 (337) -.++ ..........+. .+.|.++ +++..+.++.++. -+++|.++.+.. T Consensus 256 -------~~~~----------------~~~~~~~~~~~~---~~~~~i~-~~~~~~~~~~~~~---~~~v~n~~~~~~-- 303 (415) T protein:vir:81 256 -------SSGF----------------EKEGKKLEVKKA---KSLDDIK-DAINLNVKPNYEH---NVAIVSQTMFAK-- 303 (415) T ss_pred -------cccc----------------cccccccccccc---cchhHHH-HHHHhhhhhccCC---CEEEEcHHHHHH-- Confidence 0000 000001111122 3445554 5676665555443 378899987653 Q ss_pred HHHHh-ccCChh--HHHHHHHHHhhhhhcCceeEECCccCCCc-----eEEecccccEEEEecCceEEeEeeccccceec Q lcl|NC_015266. 238 FPIVN-TTQAPT--EQLAADLIVSQKRIGNLPAVRVPFFPKRA-----MMVTKLENLSIYFQEGARRRSLIDNPKRDQIE 309 (337) Q Consensus 238 ~~l~n-~~~~pt--E~~A~~~~~~~k~igGl~a~~vPffP~~~-----ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e 309 (337) +..+. ..+.|- .-. .-....+|-|+|++..|++|... +++-.++++-+.+..+..+=...+........ T Consensus 304 l~~lkd~~G~~l~~~~~---~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~ 380 (415) T protein:vir:81 304 LDKMKDKLGNYLIQPDV---KEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECL 380 (415) T ss_pred HHHhhccCCceeeccCc---CCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEeccccCceEE Confidence 22232 211210 000 00123589999999999999765 88888888766666555554443321111110 Q ss_pred chhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
310 NYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 310 ~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) --+-| -+..|-++.+++.++--+-+.- T Consensus 381 ~~~~r-~d~~v~~~~a~~~~~~~~~~~~ 407 (415) T protein:vir:81 381 MIAVR-QDCRILDYKSAIVIEYDDSERG 407 (415) T ss_pred EEEEE-eccEEeccccEEEEEEeccCCC Confidence 01111 2344556666666553332222 No 27 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=98.50 E-value=4.2e-08 Score=61.00 Aligned_cols=296 Identities=11% Similarity=0.086 Sum_probs=156.8 Q ss_pred CChHHHHHHHHHHHHHHH--hcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcccc-cccccee Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAK--LNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSV-SGPIASR 77 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~--~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv-~g~iagR 77 (337) +....+..|..++..... ..++......+.|-..+...+.+.+.+.+..++.++++++....|.....-. ++.-++- T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (415) T protein:vir:98 101 VTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEK 180 (415) T ss_pred hHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeecCCcccee Confidence 222233334333332221 1122222234445556788999999999999999999999888777544332 2233332 Q ss_pred ccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhh Q lcl|NC_015266. 78 TDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAA 157 (337) Q Consensus 78 t~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~ 157 (337) ...+ ......+...++...+..++.---+.|+.+.|+.. ..+|+..+.+.+.++++.-.-.--++|.-....... 
T Consensus 181 v~E~-~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~-- 255 (415) T protein:vir:98 181 VEEL-EENPELAVKPFFQLAYDINTHRGYFRISREAIEDA--KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGST-- 255 (415) T ss_pred eccc-cccCcccccceeeEEeeeeeeEeeehhhHHHHhhc--hHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccc-- Confidence 2221 22222333456677777777766788999998763 247898899999888877665555666432221110 Q ss_pred hhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHH Q lcl|NC_015266. 158 NPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKY 237 (337) Q Consensus 158 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~ 237 (337) -.++ ..........+. .+.|.++ +++..+.++.++. -+++|.++.+.. T Consensus 256 -------~~~~----------------~~~~~~~~~~~~---~~~~~i~-~~~~~~~~~~~~~---~~~v~n~~~~~~-- 303 (415) T protein:vir:98 256 -------SSGF----------------EKEGKKLEVKKA---KSLDDIK-DAINLNVKPNYEH---NVAIVSQTMFAK-- 303 (415) T ss_pred -------cccc----------------cccccccccccc---cchhHHH-HHHHhhhhhccCC---CEEEEcHHHHHH-- Confidence 0000 000001111122 3445554 5676665555443 378899987653 Q ss_pred HHHHh-ccCChh--HHHHHHHHHhhhhhcCceeEECCccCCCc-----eEEecccccEEEEecCceEEeEeeccccceec Q lcl|NC_015266. 238 FPIVN-TTQAPT--EQLAADLIVSQKRIGNLPAVRVPFFPKRA-----MMVTKLENLSIYFQEGARRRSLIDNPKRDQIE 309 (337) Q Consensus 238 ~~l~n-~~~~pt--E~~A~~~~~~~k~igGl~a~~vPffP~~~-----ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e 309 (337) +..+. ..+.|- .-. .-....+|-|+|++..|++|... +++-.++++-+.+..+..+=...+........ T Consensus 304 l~~lkd~~G~~l~~~~~---~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~ 380 (415) T protein:vir:98 304 LDKMKDKLGNYLIQPDV---KEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECL 380 (415) T ss_pred HHHhhccCCceeeccCc---CCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEeccccCceEE Confidence 22232 211210 000 00123589999999999999765 88888888766666555554443321111110 Q ss_pred chhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
310 NYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 310 ~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) --+-| -+..|-++.+++.++--+-+.- T Consensus 381 ~~~~r-~d~~v~~~~a~~~~~~~~~~~~ 407 (415) T protein:vir:98 381 MIAVR-QDCRILDYKSAIVIEYDDSERG 407 (415) T ss_pred EEEEE-eccEEeccccEEEEEEeccCCC Confidence 01111 2344556666666553332222 No 28 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=98.46 E-value=5.6e-08 Score=60.34 Aligned_cols=295 Identities=9% Similarity=-0.065 Sum_probs=153.0 Q ss_pred CChHHHHHHHHHHHHHHH------hcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcccccccc Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAK------LNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPI 74 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~------~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~i 74 (337) .....+..|..+...... .....+.+-.+.|-|...+.+.+.+.+.+.+++.++++++.--.+.........+- T Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~ 168 (395) T protein:vir:43 89 AESLKEQGVTSSLRGSHRVSMPRSAITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTESNSVEYVRETGFVNN 168 (395) T ss_pred HHHHHHHHHHHHhhhhhhhhhhhhhhcccCCCCccccchhhHHHHHHHHHhhhhHHhhccceecCCCceEEEEEecCCCc Confidence 111222222222221111 11112223345688999999999999999999999999986443333332211122 Q ss_pred ceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCC Q lcl|NC_015266. 75 ASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTD 154 (337) Q Consensus 75 agRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD 154 (337) ++-+ +...-.|..-..++...+.+++.--.+.|+.+.|+.. ++++..+++.+.++++.-.-.--+||+-.. 
T Consensus 169 a~~v--~E~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~---~~l~~~v~~~la~a~~~~~d~~~l~G~g~~---- 239 (395) T protein:vir:43 169 AAPV--SEGTQKPYSDLTFELENAPVRTIAHLFKASRQILDDA---SALQSYIDARARYGLMLVEECQLLYGNGTG---- 239 (395) T ss_pred eeee--cCCccccccccceeEEEEeeeeEEEeehhhHHHHHhH---HHHHHHHHHHHHHHHHHHHHHHHHhccCCC---- Confidence 2211 1122223334567788899999888899999998863 578899999999888876666667884321 Q ss_pred hhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHH Q lcl|NC_015266. 155 KAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLH 234 (337) Q Consensus 155 ~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla 234 (337) +|. +| +++.......... +.......+|.+ .+++..+ ++.++... +++|.+.... T Consensus 240 ---~~~-----~G------------i~~~~~~~~~~~~-~~~~~~~~~~~i-~~~~~~~-~~~~~~~~--~~vmn~~~~~ 294 (395) T protein:vir:43 240 ---ANL-----HG------------IIPQAQAYAPPSG-VVVTAEQRIDRI-RLAILQA-QLAEFPAS--GIVLNPIDWA 294 (395) T ss_pred ---Ccc-----cc------------ccccccccccccc-cccccchhHHHH-HHHHHhh-ccccCCCc--EEEEcHHHHH Confidence 121 11 2221111111111 112222334443 3344433 55555543 7889988654 Q ss_pred HHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeeccccceecchhhh Q lcl|NC_015266. 235 DKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNPKRDQIENYESS 314 (337) Q Consensus 235 ~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e~y~s~ 314 (337) . ...+-...+.|-=.-... ....++-|+|++..+++|++.+++-.+++....+.+++..=.+-+... ++..+ T Consensus 295 ~-l~~lkd~~G~~i~~~~~~--~~~~~l~G~pVv~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~-----~~f~~ 366 (395) T protein:vir:43 295 L-IELNKDAENRYIIGSPQN--GTTPTLWRLPVVETQAITQDEFLTGAFSLGAQIFDRMDIEVLVSTEND-----KDFEN 366 (395) T ss_pred H-HHHhhccCCceecccccc--CCCceecceeeEEcCCCCCCcEEEEeccceEEEEEecceEEEEecccc-----chhhc Confidence 2 222222222221000000 124578899999999999999999999986544433332222222111 11122 Q ss_pred c-ccceeec-CCcEEEee----ceeeccC Q lcl|NC_015266. 
315 N-DAYVVED-FGCGCVAE----NIELVAA 337 (337) Q Consensus 315 N-e~YvVEd-~~~~a~iE----nI~~~~a 337 (337) | -+|.++- +++...-. .+++..| T Consensus 367 ~~~~~r~~~r~d~~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 367 NMVTIRAEERLAFAVYRPEAFVTGSLTAS 395 (395) T ss_pred CcEEEEEEEeeccEEecccceEEEEeccC Confidence 2 2333333 22222221 2466666 No 29 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=98.46 E-value=5.6e-08 Score=60.36 Aligned_cols=296 Identities=11% Similarity=0.081 Sum_probs=152.6 Q ss_pred CChHHHHHHHHHHHHH--------------HHhcCc-ccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhh Q lcl|NC_015266. 1 MKKETRQAYRKYAAQI--------------AKLNDT-DDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEK 65 (337) Q Consensus 1 M~~~tr~~~~~y~~~~--------------a~~ngv-~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~ 65 (337) ...+.+..|.+|+... ++..++ .+..-.|.|-+.....+.+.+++.+.+++..+++++..-.... T Consensus 84 ~~~~~~~a~~~~l~~~~~~~~~~e~~~~~~~~a~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 163 (409) T protein:vir:45 84 QDEKRAQVFDKWMRHGASELTSEERKALRELRAQGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQILTTSDGRTME 163 (409) T ss_pred hhHHHHHHHHHHHHhhhhhccHHHHHHHHHHhhccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCceEE Confidence 2233344555555331 111222 2223356777778888999999999999999999886432221 Q ss_pred h-ccccccccceeccCCCcccccccccccCccceeeEeecc-ccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhc Q lcl|NC_015266. 66 L-GLSVSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDY-DTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIG 143 (337) Q Consensus 66 v-~~gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~-d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IG 143 (337) + ..+..+..+. -.+.....|..-...+.....-++.-. -+.|+.+.|+.. 
.++|+..+.+.+.+++++-.-.-- T Consensus 164 ~~~~~~~~~~~~--~v~E~~~~~~~~~~f~~~~l~~~k~~~~~i~is~ell~ds--~~~l~~~i~~~la~a~~~~~~~a~ 239 (409) T protein:vir:45 164 WATADGTSEVGV--LLGENEEAGEEDTDFGMGSLGALKMTSKIIRVSNELLQDS--AIDMEAYLARRIAERIGRGEARYL 239 (409) T ss_pred EEeeccCccccc--cccccccccccccccceeeeeeeeeeeeehhhhHHHHhcc--HHHHHHHHHHHHHHHHHHHHHHHh Confidence 1 1111111121 112222233322334444443334322 246899999885 368999999999999998777777 Q ss_pred ccceeccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCC Q lcl|NC_015266. 144 WNGVKAALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTG 223 (337) Q Consensus 144 fNG~s~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~d 223 (337) +||+-...++- |.-++.... +....+..+. .+.|.++ +++.. |++.|+..+. T Consensus 240 l~G~G~~~~~~----------------------p~Gil~~~~---~~~~~~~~~~-~~~d~i~-~l~~~-l~~~~~~~a~ 291 (409) T protein:vir:45 240 IQGTGAGTPKQ----------------------PKGLAASVT---GTTQTAAANA-VKWQEIL-ALKHS-IDPAYRRGPK 291 (409) T ss_pred hccCCCCCccc----------------------cceeeeccc---cccccccccc-cchHHHH-HHHHh-hhhhhccCCe Confidence 88875432221 222232211 1111222222 3445444 56654 5888999888 Q ss_pred eEEEeChHHHHHHHHHHHh-ccCCh--hHHHHHHHHHhhhhhcCceeEECCccCC-----CceEEecccccEEEEecCce Q lcl|NC_015266. 224 LVVICGRELLHDKYFPIVN-TTQAP--TEQLAADLIVSQKRIGNLPAVRVPFFPK-----RAMMVTKLENLSIYFQEGAR 295 (337) Q Consensus 224 LVvivG~dLla~k~~~l~n-~~~~p--tE~~A~~~~~~~k~igGl~a~~vPffP~-----~~ilvT~l~NLsIY~Q~gs~ 295 (337) .+++|.+..+.. +..+. ..+.| ..-... ....++-|+|++...++|. ..|++=.+++.-|....+.. 
T Consensus 292 ~~~~~n~~~~~~--l~~lkd~~G~~i~~~~~~~---~~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~i~~~~~~~ 366 (409) T protein:vir:45 292 FRLAFNDNTLKL--ISEMEDGQGRPLWLPDIVG---VAPASVLNVPYVIDQEIDDIGAGKKFMFCGDFDRFIIRRVRYMI 366 (409) T ss_pred EEEEECHHHHHH--HHHhhcCCCceeeccCcCC---CCCceecceeeEEecCcCCccCCccEEEEeehhhhheeeccceE Confidence 999999887652 22332 22222 000000 1235788999999999996 44666677776554433332 Q ss_pred EEeEeeccc--cceecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 296 RRSLIDNPK--RDQIENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 296 RR~~~d~p~--r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) -...+++. ++.+--+-..--++.|-+.++++. +++..| T Consensus 367 -~~~~~d~~~~~~~~~~~~~~r~d~~~~~~~A~~~---l~~k~s 406 (409) T protein:vir:45 367 -LKRLVERYAEYDQTGFLAFHRFDCILEDTSAIKA---LVGKGS 406 (409) T ss_pred -EEEeecccccCCcEEEEEEEEeccEeechhheEE---EEeccC Confidence 22222221 122111111112333333333332 223222 No 30 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=98.42 E-value=8.5e-08 Score=59.34 Aligned_cols=292 Identities=11% Similarity=0.081 Sum_probs=146.7 Q ss_pred CChHHHHH-----------HHHHHHHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccc Q lcl|NC_015266. 1 MKKETRQA-----------YRKYAAQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLS 69 (337) Q Consensus 1 M~~~tr~~-----------~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~g 69 (337) .+...+.. ...+...+.. 
.....+-.+.|-+.....+.+.+++.+.+++.++++++.- +.++.+- T Consensus 111 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~gg~~vP~~~~~~Ii~~l~~~~~i~~~~~~~~~~g--~~~ip~~ 186 (425) T protein:vir:95 111 NRLQVREMLKTGEYYKRSEVVEFYEKFRN--LRAVAGGELTIPEVVVNRIMDIMGDYTTLYPLVDKIRVKG--TTRILVD 186 (425) T ss_pred HHHHHHHHHhhhhhhhhhHHHHHHHHHHh--hcccccCceeccHHHHHHHHHHHHhhhhHHHhhceeecCc--eeEEEEe Confidence 00000100 0111111111 1111223455656678889999999999999999988741 2222222 Q ss_pred cccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceec Q lcl|NC_015266. 70 VSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKA 149 (337) Q Consensus 70 v~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~ 149 (337) .+++-++=+..+ ......+...++...+..++.---+.|+.+.|+.+.- +|+..+++.+.+.++.-.-.--++|+-. T Consensus 187 ~~~~~a~~v~E~-~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~--~l~~~i~~~l~~~i~~~~d~~il~G~G~ 263 (425) T protein:vir:95 187 TDTSPATWIEQS-GALPTGDVGTIASIDFDGFKVGKVTFVDNYLLQDSII--NLDDYVTKKIARAIAKALDLAIVKGTGA 263 (425) T ss_pred cCCccccccccc-cccccccccccceeeeeheeeeeeehhhHHHHhccHH--HHHHHHHHHHHHHHHHHHHHHhhccCCC Confidence 222222222211 1121223234556667777766677889998887653 6999999999999988777777888642 Q ss_pred cCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeC Q lcl|NC_015266. 150 ALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICG 229 (337) Q Consensus 150 A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG 229 (337) .. .-|+ |+|.. .+........+....|.+|..+ +. ++.+-++....++++|. 
T Consensus 264 ~~-----~~p~------Gil~~------------~~~~~~~~~~~~~~~~~~~~~~----~~-~~~~~~~~~~~~~~v~~ 315 (425) T protein:vir:95 264 AN-----KQPL------GIIPS------------LPPENQVTVEADNNLLKNLVKQ----IG-LIDTGDDSVGEIVAVMK 315 (425) T ss_pred Cc-----cccc------eeecc------------cccccccccccccchHHHHHHH----HH-hhhhhccccCceEEEEe Confidence 21 1122 55432 1110000011122335555443 32 34566677778888888 Q ss_pred hHHHHHHHHHHHh---ccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeeccccc Q lcl|NC_015266. 230 RELLHDKYFPIVN---TTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNPKRD 306 (337) Q Consensus 230 ~dLla~k~~~l~n---~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~ 306 (337) +.-+-.+-..+-. ..+.+--... .....++-|+|++..+++|++.+++=.+++.-| ...+...-..-++. T Consensus 316 ~~~~~~~l~~l~~~kd~~g~~i~~~~---~~~~~~l~G~pvv~~~~~~~~~i~~Gd~~~~~~-~~~~~~~i~~~~~~--- 388 (425) T protein:vir:95 316 RSTYYNRLVEFSIQVDSNGNVVGKLP---NLRTPDLLGLRVVFNNFLDDDTVLFGEFEQYTL-VERENITIDSSTHV--- 388 (425) T ss_pred ChHHHHHHHHHHhhcCCCCceeeccC---CCCCccccceeeEEcCcCCCccEEEEecccEEE-EeecceEEEeeccc--- Confidence 7532221112211 1111100000 012346779999999999999999998888443 33343333332221 Q ss_pred eecchhhhcccceeec--------CCcEEEee-ceeeccC Q lcl|NC_015266. 307 QIENYESSNDAYVVED--------FGCGCVAE-NIELVAA 337 (337) Q Consensus 307 r~e~y~s~Ne~YvVEd--------~~~~a~iE-nI~~~~a 337 (337) .|..-..+|.++. 
+++++.++ .-....| T Consensus 389 ---~f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~~~~~g~ 425 (425) T protein:vir:95 389 ---KFTEDQTAFRGKGRFDGKPVKPEAFVLVTITDPVQGA 425 (425) T ss_pred ---ccccCceEEEEEEeeCcEeecccceEEEEecCcCCCC Confidence 1222234444443 33333332 1112223 No 31 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=98.41 E-value=7.9e-08 Score=59.52 Aligned_cols=296 Identities=11% Similarity=0.083 Sum_probs=154.4 Q ss_pred CChHHHHHHHHHHHHHHHh--cCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcc-cccccccee Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKL--NDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGL-SVSGPIASR 77 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~--ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~-gv~g~iagR 77 (337) +....+..|..+....... .++......+.|-......+.+.+.+.+.+++.++++++...+|..... ..+++-++- T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (415) T protein:vir:47 101 VTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEK 180 (415) T ss_pred hhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceee Confidence 2223333344443332211 1112222344455567788999999999999999999998877754332 223333333 Q ss_pred ccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhh Q lcl|NC_015266. 78 TDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAA 157 (337) Q Consensus 78 t~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~ 157 (337) ...+. .....+...++...+..++.---+.|+.+.|+... .+|+..+.+.+.++++.-.-.--++|.-...... 
T Consensus 181 v~Eg~-~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~--- 254 (415) T protein:vir:47 181 VEELE-ENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMARTIAATRNKAIIDVITKGSTGS--- 254 (415) T ss_pred ccccc-ccccccccceeeEEeeeeeeEeeehhhHHHHhhch--HHHHHHHHHHHHHHHHHHHHHHHhhccccCCccc--- Confidence 32221 11123344567777777777777899999997633 4889999999999988777666677743221110 Q ss_pred hhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHH Q lcl|NC_015266. 158 NPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKY 237 (337) Q Consensus 158 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~ 237 (337) .+.........+...+.. .+|.++ +++..+.++++.. -+++|.++.+.. T Consensus 255 ----------------------~~~~~~~~~~~~~~~~~~---~~~~i~-~~~~~~~~~~~~~---~~~v~n~~~~~~-- 303 (415) T protein:vir:47 255 ----------------------TSSGFEKEGKKLEVKKAK---SLDDIK-DAINLNVKPNYEH---NVAIVSQTMFAK-- 303 (415) T ss_pred ----------------------cccccccccceecccccc---chHHHH-HHHHhhhhhccCC---CEEEEcHHHHHH-- Confidence 000000000111111222 334433 5666665665443 278899887652 Q ss_pred HHHHh-ccCChh--HHHHHHHHHhhhhhcCceeEECCccCCCc-----eEEecccccEEEEecCceEEeEeeccccceec Q lcl|NC_015266. 238 FPIVN-TTQAPT--EQLAADLIVSQKRIGNLPAVRVPFFPKRA-----MMVTKLENLSIYFQEGARRRSLIDNPKRDQIE 309 (337) Q Consensus 238 ~~l~n-~~~~pt--E~~A~~~~~~~k~igGl~a~~vPffP~~~-----ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e 309 (337) +..+. ..+.|- .-.. -....+|-|+|++..+++|... +++=.++++-+.+.+....=...+.......- T Consensus 304 L~~lkd~~G~~i~~~~~~---~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~ 380 (415) T protein:vir:47 304 LDKMKDKLGNYLIQPDVK---EKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECL 380 (415) T ss_pred HHHhhccCCCeeeccCcC---CCCCccccceeeEEeccccccCCCccEEEEEehhccEEEEeecceEEEeeccccCceEE Confidence 22232 221210 0000 0124588999999999999654 78888888655555444443333211111100 Q ss_pred chhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
310 NYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 310 ~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) -.+. --+..|-++.+++.+.--.-+.. T Consensus 381 ~~~~-r~d~~v~~~~a~~~~~~~~~~~~ 407 (415) T protein:vir:47 381 MIAV-RQDCRILDYKSAIVIEYDDSERG 407 (415) T ss_pred EEEE-EeccEEeccccEEEEEeeccCCC Confidence 0011 12344445555555432222222 No 32 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=98.41 E-value=7.9e-08 Score=59.52 Aligned_cols=296 Identities=11% Similarity=0.083 Sum_probs=154.4 Q ss_pred CChHHHHHHHHHHHHHHHh--cCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcc-cccccccee Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKL--NDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGL-SVSGPIASR 77 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~--ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~-gv~g~iagR 77 (337) +....+..|..+....... .++......+.|-......+.+.+.+.+.+++.++++++...+|..... ..+++-++- T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 180 (415) T protein:vir:46 101 VTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSEVAALEK 180 (415) T ss_pred hhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEecCCcceee Confidence 2223333344443332211 1112222344455567788999999999999999999998877754332 223333333 Q ss_pred ccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhh Q lcl|NC_015266. 78 TDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAA 157 (337) Q Consensus 78 t~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~ 157 (337) ...+. .....+...++...+..++.---+.|+.+.|+... .+|+..+.+.+.++++.-.-.--++|.-...... 
T Consensus 181 v~Eg~-~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~--- 254 (415) T protein:vir:46 181 VEELE-ENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMARTIAATRNKAIIDVITKGSTGS--- 254 (415) T ss_pred ccccc-ccccccccceeeEEeeeeeeEeeehhhHHHHhhch--HHHHHHHHHHHHHHHHHHHHHHHhhccccCCccc--- Confidence 32221 11123344567777777777777899999997633 4889999999999988777666677743221110 Q ss_pred hhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHH Q lcl|NC_015266. 158 NPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKY 237 (337) Q Consensus 158 nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~ 237 (337) .+.........+...+.. .+|.++ +++..+.++++.. -+++|.++.+.. T Consensus 255 ----------------------~~~~~~~~~~~~~~~~~~---~~~~i~-~~~~~~~~~~~~~---~~~v~n~~~~~~-- 303 (415) T protein:vir:46 255 ----------------------TSSGFEKEGKKLEVKKAK---SLDDIK-DAINLNVKPNYEH---NVAIVSQTMFAK-- 303 (415) T ss_pred ----------------------cccccccccceecccccc---chHHHH-HHHHhhhhhccCC---CEEEEcHHHHHH-- Confidence 000000000111111222 334433 5666665665443 278899887652 Q ss_pred HHHHh-ccCChh--HHHHHHHHHhhhhhcCceeEECCccCCCc-----eEEecccccEEEEecCceEEeEeeccccceec Q lcl|NC_015266. 238 FPIVN-TTQAPT--EQLAADLIVSQKRIGNLPAVRVPFFPKRA-----MMVTKLENLSIYFQEGARRRSLIDNPKRDQIE 309 (337) Q Consensus 238 ~~l~n-~~~~pt--E~~A~~~~~~~k~igGl~a~~vPffP~~~-----ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e 309 (337) +..+. ..+.|- .-.. -....+|-|+|++..+++|... +++=.++++-+.+.+....=...+.......- T Consensus 304 L~~lkd~~G~~i~~~~~~---~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~ 380 (415) T protein:vir:46 304 LDKMKDKLGNYLIQPDVK---EKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECL 380 (415) T ss_pred HHHhhccCCCeeeccCcC---CCCCccccceeeEEeccccccCCCccEEEEEehhccEEEEeecceEEEeeccccCceEE Confidence 22232 221210 0000 0124588999999999999654 78888888655555444443333211111100 Q ss_pred chhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
310 NYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 310 ~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) -.+. --+..|-++.+++.+.--.-+.. T Consensus 381 ~~~~-r~d~~v~~~~a~~~~~~~~~~~~ 407 (415) T protein:vir:46 381 MIAV-RQDCRILDYKSAIVIEYDDSERG 407 (415) T ss_pred EEEE-EeccEEeccccEEEEEeeccCCC Confidence 0011 12344445555555432222222 No 33 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=98.40 E-value=8e-08 Score=59.49 Aligned_cols=305 Identities=13% Similarity=0.149 Sum_probs=160.4 Q ss_pred CChHHHHHHHHHHHHHHH--hcC-------c-ccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcccc Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAK--LND-------T-DDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSV 70 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~--~ng-------v-~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv 70 (337) +..+.+..|..|+..... ... . .+..-.+.|-+.+.+.+.+.+++.+.+++..+++++.--.. ++.... T Consensus 79 ~~~e~~~a~~~~lr~~~~~~~~~~e~~a~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-~~~~~~ 157 (401) T protein:vir:44 79 VAAEHKDAFVGFLRKGREDGLRDLERKALQVGTDEDGGYAVPEELDRSILSLLKDEVVMRQEATVITVGGSDY-KKLVNL 157 (401) T ss_pred hhHHHHHHHHHHHhhhhhhhhHHHHHHHhhcCCCCCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCce-EEEEec Confidence 666678888888743211 000 0 11122567777788899999999999999999998754322 333334 Q ss_pred ccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceecc Q lcl|NC_015266. 71 SGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAA 150 (337) Q Consensus 71 ~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A 150 (337) +++.++-+..+. .....+...++...|..++.---..|+.+.|+. ...+|+..+...+.+.++.-.-.--+||+-.. 
T Consensus 158 ~~~~a~wv~E~~-~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~la~ai~~~~~~~~l~G~G~~ 234 (401) T protein:vir:44 158 GGTASGWVGETD-TRSQTATSRLGLIEPFMGEIYGNPQATQKMLDD--AFFNVEAWINSELATEFAEQEEIAFTTGDGTK 234 (401) T ss_pred CCccceeecccc-ccCccccccceeeeeehhheeeehhhhHHHHhc--chHHHHHHHHHHHHHHHHHHHHhhhhccCCCC Confidence 444443322221 111122234555566666665567788888884 23489999999999999888777778885421 Q ss_pred CCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeCh Q lcl|NC_015266. 151 LSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGR 230 (337) Q Consensus 151 ~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~ 230 (337) +| +|.|..............+. ...+..|+.++ .+.|.++ +++..| ++.|+... +++|.+ T Consensus 235 -------~p------~Gil~~~~~~~~~~~~~~~~--~~~~~t~~~~~-~~~d~i~-~~~~~l-~~~~~~~a--~~v~n~ 294 (401) T protein:vir:44 235 -------KP------KGFLAYESTEESDKARAFGK--LQHIVSGEATA-VTADAII-KLIYTL-RKAHRTGA--KFMMNN 294 (401) T ss_pred -------cc------ceeecccccccccccccccc--ccccccccccc-cCHHHHH-HHHHhc-chhhhcCC--EEEEcH Confidence 22 35554333221111111111 11122233322 4566665 567654 78888765 889999 Q ss_pred HHHHHHHHHHHhccCChh--HHHHHHHHHhhhhhcCceeEECCccCCCc-----eEEeccc-ccEEEEecCceEEeEeec Q lcl|NC_015266. 231 ELLHDKYFPIVNTTQAPT--EQLAADLIVSQKRIGNLPAVRVPFFPKRA-----MMVTKLE-NLSIYFQEGARRRSLIDN 302 (337) Q Consensus 231 dLla~k~~~l~n~~~~pt--E~~A~~~~~~~k~igGl~a~~vPffP~~~-----ilvT~l~-NLsIY~Q~gs~RR~~~d~ 302 (337) ..+. +...+-...+.|- .-.. 
-....++-|+|++..+++|..+ +++=.++ +..|+ .+.+.+-...+- T Consensus 295 ~~~~-~L~~lkd~~G~~l~~~~~~---~g~~~~l~G~PVv~~~~~p~~~~~~~~i~~Gd~~~~~~i~-~~~~~~~~~~~~ 369 (401) T protein:vir:44 295 NSLF-AIRLLKDTEGNYLWRPGLE---LGQPSSLAGYGIAENEQMPDIAADAKAIAFGNFKRGYTIV-DRIGTRILRDPY 369 (401) T ss_pred HHHH-HHHHhhccCCceeecCCcC---CCCCceecceeeEEecCcCCccCCccEEEEeehhccEEEE-EecceEEeeecc Confidence 8654 2223323322330 1000 0123579999999999999644 6666664 33333 222232211111 Q ss_pred cccceecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 303 PKRDQIENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 303 p~r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) -.++.+.-+-..--|..|=+.+++++ +++..| T Consensus 370 ~~~~~v~~~a~~r~d~~~~~~~a~~~---l~~~aa 401 (401) T protein:vir:44 370 TNKPFVGFYTTKRTGGMLVDSQAIKL---LKIAAA 401 (401) T ss_pred ccCCcEEEEEEEEeccEEecccceEE---EEeecC Confidence 11122111111112233333333333 555556 No 34 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=98.39 E-value=4.9e-08 Score=60.67 Aligned_cols=279 Identities=11% Similarity=0.009 Sum_probs=166.8 Q ss_pred HHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceeccCCCcccccccccccCc Q lcl|NC_015266. 16 IAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTDTTKAERQPIDPTALDS 95 (337) Q Consensus 16 ~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~t~~~~R~~~~~~~l~~ 95 (337) +|. +-.+.|-|...+.+.+.++++|.+++..+++++.--+. ++-.-.+++-++-+..+ ...|..-..++. 
T Consensus 1 ma~-------~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~-~~p~~~~~~~a~~v~Eg--~~~~~~~~~f~~ 70 (298) T protein:vir:94 1 MVL-------NKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGE-KVFTFTMDSEIDVVAES--GKKTHGGVTLAP 70 (298) T ss_pred Cee-------ccccccChhHHHHHHHHHHhhchhhhhcceeeccCCce-EEEEEecCcceEEeeCC--ccccccccceeE Confidence 222 22446778889999999999999999999988765322 33333344555544332 233444456778 Q ss_pred cceeeEeeccccccCHHHHHHHhc-CccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhhhhhccchhHHHHHHh Q lcl|NC_015266. 96 NRYRCEKTDYDTAITYRKLDAWAK-FPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANPLLQDVNIGWLQQYRD 174 (337) Q Consensus 96 ~~Y~c~qtn~d~~i~y~~LD~wA~-~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nPllqDVNkGWlq~~Re 174 (337) ....+++.---+.|+.+.|.+... ..+|++.+.+.+.++++.....--+||+....-++..- -...++.. T Consensus 71 v~l~~~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~-----~~~~~~~~---- 141 (298) T protein:vir:94 71 QTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAV-----IGTNHFDS---- 141 (298) T ss_pred EEEeeeEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccc-----cccccccc---- Confidence 888888888899999999977654 56899999999999999888777788854222111100 00001110 Q ss_pred hchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHHHHHhccCChh--HHHH Q lcl|NC_015266. 175 RAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNTTQAPT--EQLA 252 (337) Q Consensus 175 ~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~~l~n~~~~pt--E~~A 252 (337) . . +.....+..-..++..+.+++..+ ...+++.. +++|.+...+. ...+....+.|- +-.. 
T Consensus 142 --------~----~-~~~~~~~~~~~~~~~~i~~~~~~~-~~~~~~~~--~~vmn~~~~~~-l~~lkd~~G~~l~~~~~~ 204 (298) T protein:vir:94 142 --------K----V-TQKVEAPRGIADPNGAIENAVELL-TGVDADVT--GIAINPSFRSA-LAKQKDLQGNALFPELKW 204 (298) T ss_pred --------c----c-ccccccccccccHHHHHHHHHHhh-hhcCCCcc--EEEEcHHHHHH-HHHhhccCCCeeecCccc Confidence 0 0 001112222234555566666544 43333322 68888876653 222332222221 1000 Q ss_pred HHHHHhhhhhcCceeEECCccCCC------ceEEecccccEEEEecCceEEeEeeccccce-ecchhhhc---------c Q lcl|NC_015266. 253 ADLIVSQKRIGNLPAVRVPFFPKR------AMMVTKLENLSIYFQEGARRRSLIDNPKRDQ-IENYESSN---------D 316 (337) Q Consensus 253 ~~~~~~~k~igGl~a~~vPffP~~------~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r-~e~y~s~N---------e 316 (337) -....++-|+|++..+++|.+ .+++--++++-.|...+..+-.+.+..+-++ ..+|..+| - T Consensus 205 ---~~~~~tl~G~PV~~~~~v~~~~~~~~~~~~~Gdfs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~ 281 (298) T protein:vir:94 205 ---GATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFL 281 (298) T ss_pred ---CCCCceecceeeEEecccccccCCCccEEEEeeccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEe Confidence 012357889999999999975 4777888888667666666666644322111 12333444 4 Q ss_pred cceeecCCcEEEeecee Q lcl|NC_015266. 317 AYVVEDFGCGCVAENIE 333 (337) Q Consensus 317 ~YvVEd~~~~a~iEnI~ 333 (337) +..|.++++++.+.+++ T Consensus 282 ~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 282 GWGILDATKFARVTEAN 298 (298) T ss_pred ccEeecccceEEEEecC Confidence 56788888899998888 No 35 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=98.38 E-value=6.5e-08 Score=59.98 Aligned_cols=290 Identities=11% Similarity=0.077 Sum_probs=144.4 Q ss_pred CChHHHHHHHHHH--------HHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhccc-ccccchhhhhhhhccccc Q lcl|NC_015266. 
1 MKKETRQAYRKYA--------AQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSI-NILPVTELEGEKLGLSVS 71 (337) Q Consensus 1 M~~~tr~~~~~y~--------~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~I-nv~~V~~~~Ge~v~~gv~ 71 (337) ........++... .......+....+ ...+-|++...+...+.+.+..|..+ +++++..-..-.+-...+ T Consensus 85 ~~~~~~~~~r~g~~~~~~~~~~~~~~~~~t~~~~-g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 163 (392) T protein:vir:13 85 ADHDDDAVLRAGNLGEARSFEFAPEKRDGTKAGN-PNVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTVITG 163 (392) T ss_pred hhHHHHHHHhccchhhhHHHHhhhhhhcccccCC-CccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEcC Confidence 1111111111110 0011112222222 22456666666666666665556554 566554322223333333 Q ss_pred cccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccC Q lcl|NC_015266. 72 GPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAAL 151 (337) Q Consensus 72 g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~ 151 (337) ++-++=+. .....|..-..++...|..++.---+.|+.+.|+.. .++|+..+.+.+.+.++.=.-.-=+||+- T Consensus 164 ~~~a~~v~--E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~i~~~~d~~~l~G~G--- 236 (392) T protein:vir:13 164 RATAGIVG--ETAEIPESYPATTQRSMGGFKYGFASVVSYEFATDQ--VLDLVGFLVSDAGPAIGDAMGRHFLTGTG--- 236 (392) T ss_pred Ccceeeec--ccccccccccceeeEEeeeeeEEeeehhHHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHHhcccC--- Confidence 34443221 122223333567788888888888899999999974 34788888888888877644444446632 Q ss_pred CCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChH Q lcl|NC_015266. 152 STDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRE 231 (337) Q Consensus 152 ~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~d 231 (337) | ..| +|+|. ..+.....+..+ .++....|.|+ +++.+ +++.|+... +++|.+. 
T Consensus 237 -t---~~p------~Gil~------------~~~~~~~~~~~~-~~~~~~~d~l~-~~~~~-l~~~~~~~a--~~v~n~~ 289 (392) T protein:vir:13 237 -T---GQP------RGILT------------DATGANAAFGEA-DADSKVSDALI-DLFHE-VPSAYRKNA--KFVVNDL 289 (392) T ss_pred -C---ccc------ccccc------------cccccccccccc-ccccccHHHHH-HHHHh-hhhhhhcCC--EEEEcHH Confidence 1 123 25543 111111111111 22334556554 56665 478888865 7888888 Q ss_pred HHHHHHHHHHhccCCh--hHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeeccccceec Q lcl|NC_015266. 232 LLHDKYFPIVNTTQAP--TEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNPKRDQIE 309 (337) Q Consensus 232 Lla~k~~~l~n~~~~p--tE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e 309 (337) .+.. ...+-+..+.| ..-... ....++.|+|++..+++|++.|++-.++++-|.. .+..+=....++ T Consensus 290 ~~~~-l~~lkd~~G~~l~~~~~~~---g~~~~l~G~Pv~~~~~~~~~~i~~Gdf~~~~i~~-~~~~~i~~~~~~------ 358 (392) T protein:vir:13 290 RAAQ-MRKLKDANGQYLWQSALTV---GAPDTFNGKVVETDDGMPADKVLFADLSKYRVRF-AGSLRVDRSVDA------ 358 (392) T ss_pred HHHH-HHHhhccCCceeecCCcCC---CCCceecceeeEEcCCCCCCcEEEeeccceeEEe-ecceEEEeeccc------ Confidence 7652 22333332222 111000 1235789999999999999999999988865533 333332222121 Q ss_pred chhhhc-ccceeecCCcEEEee-----ceeeccC Q lcl|NC_015266. 310 NYESSN-DAYVVEDFGCGCVAE-----NIELVAA 337 (337) Q Consensus 310 ~y~s~N-e~YvVEd~~~~a~iE-----nI~~~~a 337 (337) |...| -+|..+.+--+..+. -+++..| T Consensus 359 -~~~~~~~~~r~~~r~d~~~~~~~A~~~~~~~~a 391 (392) T protein:vir:13 359 -KFSTDQIVYRFLQRADGLLVDARGAKVLTVTPA 391 (392) T ss_pred -cccCCcEEEEEEEEeccEEecccceEEEEeecc Confidence 22222 244333321122221 2344444 No 36 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=98.36 E-value=3.9e-08 Score=61.22 Aligned_cols=280 Identities=14% Similarity=0.088 Sum_probs=160.1 Q ss_pred CChHHHHHHHHHHHHHHHhcCcc-cccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceecc Q lcl|NC_015266. 
1 MKKETRQAYRKYAAQIAKLNDTD-DVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTD 79 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~-~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~ 79 (337) |--.+. ..-++. ...-.+.|-++..+.+.+.+.+.+.+++..+++++.--.. .+-.-.+++.++-.. T Consensus 1 ma~~~~-----------~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-~ip~~~~~~~a~~v~ 68 (304) T protein:vir:94 1 MATPTY-----------TPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKK-KFTYLAKGVGAYWVS 68 (304) T ss_pred Cccccc-----------ccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCce-EEEEEeCCcceEEee Confidence 222222 111121 1223567888888999999999999999999988764222 222222444444443 Q ss_pred CCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhh Q lcl|NC_015266. 80 TTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANP 159 (337) Q Consensus 80 t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nP 159 (337) .+ ...|..-..++...+..++.---..|+.+.|..= ..+|+..+.+.+.+.++.-.-.-.+||+-....+....+. T Consensus 69 E~--~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~ 144 (304) T protein:vir:94 69 ET--ERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWT--AKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKP 144 (304) T ss_pred cC--cccccccceeeEEEEEEEEEEEeehhhHHHHhcc--hHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccc Confidence 22 2234444667777888888888888999887732 3689999999999999999999999996533222111111 Q ss_pred hhhccchhHHHHHHhhchhhhcccccccCCceec-CCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHH Q lcl|NC_015266. 160 LLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLV-GKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYF 238 (337) Q Consensus 160 llqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~-G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~ 238 (337) .+.... .. .... ++.-.|++|-.| +..+ .+.++... +++|.+..+.. .. 
T Consensus 145 --------------------~~~~~~-~~-~~~~~~~~~~~~~i~~~----~~~l-~~~~~~~~--~~v~~~~~~~~-L~ 194 (304) T protein:vir:94 145 --------------------LVEGAE-EK-GNVVTDTNNLYVDLSAL----MATI-EDEELDPN--GVLTTRSFRSK-MR 194 (304) T ss_pred --------------------cccccc-cc-ccccccccchHHHHHHH----HHHh-hhccCCcC--EEEEcHHHHHH-HH Confidence 111110 00 0111 112235555444 4433 44444443 78899887763 22 Q ss_pred HHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCc----eEEecccccEEEEecCceEEeEeeccc---------c Q lcl|NC_015266. 239 PIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRA----MMVTKLENLSIYFQEGARRRSLIDNPK---------R 305 (337) Q Consensus 239 ~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~----ilvT~l~NLsIY~Q~gs~RR~~~d~p~---------r 305 (337) .+-...+.|- ......++-|+|++..+++|... +++..++++- +...+..+-.+.++.. . T Consensus 195 ~lkd~~G~~l------~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~gd~~~~~-~~~~~~~~i~~~~e~~~~~~~~~~~~ 267 (304) T protein:vir:94 195 NALDANDRPL------FDANGNEIMGLPLSYTGADVYDKKKSLALMGDWDYAR-YGILQGIEYAISEDATLTTLQASDAS 267 (304) T ss_pred HhhccCCcEe------ecCCCccccceeeEEecccccCCCCcEEEEEehhhEE-EEEecceEEEEeecceeeeecccccC Confidence 3333332331 11123578899999999999664 8888998864 4444555444444321 0 Q ss_pred ceecchhhhc---------ccceeecCCcEEEeecee Q lcl|NC_015266. 
306 DQIENYESSN---------DAYVVEDFGCGCVAENIE 333 (337) Q Consensus 306 ~r~e~y~s~N---------e~YvVEd~~~~a~iEnI~ 333 (337) -...++..+| -++.|.++++++.+...+ T Consensus 268 g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 268 GQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred ccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 1122233333 445667777777766555 No 37 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=98.36 E-value=3.9e-08 Score=61.22 Aligned_cols=280 Identities=14% Similarity=0.088 Sum_probs=160.1 Q ss_pred CChHHHHHHHHHHHHHHHhcCcc-cccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceecc Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLNDTD-DVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTD 79 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~-~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~ 79 (337) |--.+. ..-++. ...-.+.|-++..+.+.+.+.+.+.+++..+++++.--.. .+-.-.+++.++-.. T Consensus 1 ma~~~~-----------~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-~ip~~~~~~~a~~v~ 68 (304) T protein:vir:10 1 MATPTY-----------TPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKK-KFTYLAKGVGAYWVS 68 (304) T ss_pred Cccccc-----------ccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCce-EEEEEeCCcceEEee Confidence 222222 111121 1223567888888999999999999999999988764222 222222444444443 Q ss_pred CCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhh Q lcl|NC_015266. 80 TTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANP 159 (337) Q Consensus 80 t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nP 159 (337) .+ ...|..-..++...+..++.---..|+.+.|..= ..+|+..+.+.+.+.++.-.-.-.+||+-....+....+. 
T Consensus 69 E~--~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~ 144 (304) T protein:vir:10 69 ET--ERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWT--AKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKP 144 (304) T ss_pred cC--cccccccceeeEEEEEEEEEEEeehhhHHHHhcc--hHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccc Confidence 22 2234444667777888888888888999887732 3689999999999999999999999996533222111111 Q ss_pred hhhccchhHHHHHHhhchhhhcccccccCCceec-CCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHH Q lcl|NC_015266. 160 LLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLV-GKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYF 238 (337) Q Consensus 160 llqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~-G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~ 238 (337) .+.... .. .... ++.-.|++|-.| +..+ .+.++... +++|.+..+.. .. T Consensus 145 --------------------~~~~~~-~~-~~~~~~~~~~~~~i~~~----~~~l-~~~~~~~~--~~v~~~~~~~~-L~ 194 (304) T protein:vir:10 145 --------------------LVEGAE-EK-GNVVTDTNNLYVDLSAL----MATI-EDEELDPN--GVLTTRSFRSK-MR 194 (304) T ss_pred --------------------cccccc-cc-ccccccccchHHHHHHH----HHHh-hhccCCcC--EEEEcHHHHHH-HH Confidence 111110 00 0111 112235555444 4433 44444443 78899887763 22 Q ss_pred HHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCc----eEEecccccEEEEecCceEEeEeeccc---------c Q lcl|NC_015266. 239 PIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRA----MMVTKLENLSIYFQEGARRRSLIDNPK---------R 305 (337) Q Consensus 239 ~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~----ilvT~l~NLsIY~Q~gs~RR~~~d~p~---------r 305 (337) .+-...+.|- ......++-|+|++..+++|... +++..++++- +...+..+-.+.++.. . T Consensus 195 ~lkd~~G~~l------~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~gd~~~~~-~~~~~~~~i~~~~e~~~~~~~~~~~~ 267 (304) T protein:vir:10 195 NALDANDRPL------FDANGNEIMGLPLSYTGADVYDKKKSLALMGDWDYAR-YGILQGIEYAISEDATLTTLQASDAS 267 (304) T ss_pred HhhccCCcEe------ecCCCccccceeeEEecccccCCCCcEEEEEehhhEE-EEEecceEEEEeecceeeeecccccC Confidence 3333332331 11123578899999999999664 8888998864 4444555444444321 0 Q ss_pred ceecchhhhc---------ccceeecCCcEEEeecee Q lcl|NC_015266. 
306 DQIENYESSN---------DAYVVEDFGCGCVAENIE 333 (337) Q Consensus 306 ~r~e~y~s~N---------e~YvVEd~~~~a~iEnI~ 333 (337) -...++..+| -++.|.++++++.+...+ T Consensus 268 g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 268 GQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred ccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 1122233333 445667777777766555 No 38 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=98.35 E-value=1.1e-07 Score=58.69 Aligned_cols=291 Identities=11% Similarity=0.065 Sum_probs=139.8 Q ss_pred CChHHHHHHHHHHHH------------HHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhc-ccccccchhhhhhhhc Q lcl|NC_015266. 1 MKKETRQAYRKYAAQ------------IAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLK-SINILPVTELEGEKLG 67 (337) Q Consensus 1 M~~~tr~~~~~y~~~------------~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~-~Inv~~V~~~~Ge~v~ 67 (337) .+........+|+.. .....+. ..+....+-|++.+++...+.+.+..|. ..+++++....+-.+- T Consensus 81 ~~~~~~~~~~~~~r~~~~~~~r~~~~~~~~~~~t-~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p 159 (390) T protein:vir:62 81 AQRSADVDDDATLRAGNLGEARSFEFAPEKRDGT-KAGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDFT 159 (390) T ss_pred chhhcchHHHHHHhhhhhhhhHHHHhhhhhhccc-ccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEE Confidence 111111111111110 0011111 1222334556666665555444444444 5577776432222333 Q ss_pred cccccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccce Q lcl|NC_015266. 68 LSVSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGV 147 (337) Q Consensus 68 ~gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~ 147 (337) .-.+++.++-+.- ....|..-..++...|..++.=--+.|+++.|+.. 
.++|+..+++.+.+.++.=.-.--+||+ T Consensus 160 ~~~~~~~a~wv~E--~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~i~~~~d~~~l~G~ 235 (390) T protein:vir:62 160 VITGRSSASIVGE--TAEIPESYPATAQRSMGGFKYGFASVVSYEFATDQ--VLDLVGFLVSDAGPAIGDAMGRHFITGT 235 (390) T ss_pred EEcCCcceeeecc--cccccccccceeeeEeeeeeEEeehHHHHHHHhhh--hHHHHHHHHHHHHHHHHHHHHhhhhccC Confidence 3334444433322 22223333457778888888888889999999873 3478888888888887765444455774 Q ss_pred eccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEE Q lcl|NC_015266. 148 KAALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVI 227 (337) Q Consensus 148 s~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvi 227 (337) - . | +|++. ........+..+.. +-...|.|+ +++.+| ++.|+... +++ T Consensus 236 G-----~----p------~Gi~~------------~~~~~~~~~~~~~~-~~~~~~~l~-~~~~~l-~~~~~~~a--~~v 283 (390) T protein:vir:62 236 G-----Q----P------RGILT------------DASPATATFLATDT-DSKVSDALI-DLFHEV-PSAYRANA--KYV 283 (390) T ss_pred C-----c----c------ccccc------------cccccccceecccc-cccchHHHH-HHHHhh-hhhhhcCC--EEE Confidence 2 1 2 35443 21111111222211 223445444 455554 77788754 889 Q ss_pred eChHHHHHHHHHHHhccC-Ch--hHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeecc- Q lcl|NC_015266. 228 CGRELLHDKYFPIVNTTQ-AP--TEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNP- 303 (337) Q Consensus 228 vG~dLla~k~~~l~n~~~-~p--tE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p- 303 (337) |.+..+.. +..+...+ .| ..-++. 
....++.|+|++..+++|++.|++-.++..-|....+ ..-....++ T Consensus 284 mn~~~~~~--L~~lkd~~g~~l~~~~~~~---g~~~~l~G~Pv~~~~~~p~~~i~~gd~s~~~i~~~~~-~~v~~~~~~~ 357 (390) T protein:vir:62 284 VNDLRAAQ--MRKLKDANGQYLWQSGLTV---GAPSLFNGKVVETDDGMPADKILFADLSKYRVRFAGS-LRVDRSVDAK 357 (390) T ss_pred EchHHHHH--HHHhhccCCCeeecCCcCC---CccceecccceEEecCCCCccEEEeeccceeEEeecc-eEEEeecccc Confidence 99987652 22332222 11 011111 1235799999999999999999987776644433222 211111111 Q ss_pred -ccceecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 304 -KRDQIENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 304 -~r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) .++++.-.-..--|..|-++.++.. +++..| T Consensus 358 ~~~~~~~~~~~~r~d~~~~~~~A~~~---l~~~~~ 389 (390) T protein:vir:62 358 FSTDQIVYRFLQRADGLLVDARGAKV---LTVTPG 389 (390) T ss_pred ccCCcEEEEEEEEeCcEeechhheEE---EEeecC Confidence 1122211111112223333333333 344455 No 39 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=98.28 E-value=4.9e-07 Score=55.20 Aligned_cols=303 Identities=12% Similarity=0.153 Sum_probs=154.9 Q ss_pred CChHHHHHHHHHHHHHHH--hcC--------cccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcccc Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAK--LND--------TDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSV 70 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~--~ng--------v~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv 70 (337) ...+.+..|.+|+.+-.. +.. .....-.+.|-+.+...+.+.+++.+.+++..+++++..... ++.... 
T Consensus 78 ~~~e~~~a~~~~l~~g~~~~~~~~e~~a~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~-~~~~~~ 156 (407) T protein:vir:48 78 VASEHKEAFIGFMRKGREDGLRELERKALQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQEATVITLGGSDY-KKLVNL 156 (407) T ss_pred hhhHHHHHHHHHHhccchhhhhHHHHHhhhcccCCCCcccccHhHHHHHHHHHHhhhhhhhhceeeecCCCce-EEEEec Confidence 666777788888753210 000 011122455666678889999999999999999888765432 222333 Q ss_pred ccccceeccCCCccccc-ccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceec Q lcl|NC_015266. 71 SGPIASRTDTTKAERQP-IDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKA 149 (337) Q Consensus 71 ~g~iagRt~t~~~~R~~-~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~ 149 (337) +++-++-+..+ ...| .+....+...|..++.---..|+.+.|+. ...+|+..+.+.+.+.++.=.-.-=+||+-. T Consensus 157 ~~~~a~~v~E~--~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~ 232 (407) T protein:vir:48 157 GGTTSGWVGET--DARPETATSKLGLIEPFMGEIYGNPQATQKMLDD--AFFNVEDWINSELALEFAEQEEIAFTSGDGS 232 (407) T ss_pred CCcceeeeccc--ccccccccccceeEEeeeeeeEeehhhHHHHHhc--chHHHHHHHHHHHHHHHHHHHHhhhhccCCC Confidence 44444433222 2223 23345666778888887788999999985 2247888888888887765444334566321 Q ss_pred cCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeC Q lcl|NC_015266. 150 ALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICG 229 (337) Q Consensus 150 A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG 229 (337) ..|. |=|..............+ ....+..+..+. .+.|.++ +++.+ |++.|+..+ +++|. 
T Consensus 233 -------~~p~------Gil~~~~~~~~~~~~~~~--~~~~~~~~~~~~-~~~d~i~-~l~~~-l~~~~~~~a--~~v~n 292 (407) T protein:vir:48 233 -------KKPK------GFLAYESTDEDDKTRAFG--KLQHIASGAASG-VTADAII-KLIYT-LRKAHRSGA--KFMMN 292 (407) T ss_pred -------Cccc------eeeecccccccccccccc--cccccccccccc-cChHHHH-HHHHh-hchhhhcCC--EEEEc Confidence 1222 212110000000000000 011122222222 4456665 56665 588888876 78898 Q ss_pred hHHHHHHHHHHHh-ccCCh--hHHHHHHHHHhhhhhcCceeEECCccCCCc-----eEEecccc-cEEEEecCceEEeEe Q lcl|NC_015266. 230 RELLHDKYFPIVN-TTQAP--TEQLAADLIVSQKRIGNLPAVRVPFFPKRA-----MMVTKLEN-LSIYFQEGARRRSLI 300 (337) Q Consensus 230 ~dLla~k~~~l~n-~~~~p--tE~~A~~~~~~~k~igGl~a~~vPffP~~~-----ilvT~l~N-LsIY~Q~gs~RR~~~ 300 (337) +..++. +..+. ..+.| ..-... ....++-|+|++..+++|..+ |++=.|+. ..|+-.. +.+-... T Consensus 293 ~~~~~~--L~~lkD~~Gr~l~~~~~~~---g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~-~~~i~~d 366 (407) T protein:vir:48 293 NSSLFA--IRLLKDNDGNYLWRPGIEL---GQPSSLAGYGIVENEQMPDIAADAKAIAFGNFKRGYTIVDRI-GTRILRD 366 (407) T ss_pred HHHHHH--HHHhhccCCceeeccCcCC---CCCceecceeeEEecCcCCccCCccEEEEEeccccEEEEEee-ceEEEee Confidence 887642 22232 22222 000000 123578999999999999733 66666654 3343222 3322111 Q ss_pred eccccceecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 301 DNPKRDQIENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 301 d~p~r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) +--.++.+.-+-..--|..|-+.++++. 
+++..| T Consensus 367 ~~~~~~~~~~~~~~r~d~~v~~~~a~~~---l~~~aa 400 (407) T protein:vir:48 367 PYTNKPFVGFYTTKRTGGMLVDSQAIKL---MKIGAA 400 (407) T ss_pred ccccCCcEEEEEEEEeccEEecccceEE---EEeecc Confidence 1111222222111223344444444443 344444 No 40 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=98.21 E-value=2e-07 Score=57.30 Aligned_cols=298 Identities=14% Similarity=0.070 Sum_probs=170.5 Q ss_pred CChHHHHHHHHHHHHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceeccC Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~t 80 (337) |...+. |+.=...+++.. +....-.|-|.+.+.+.+.+.+.+.++++++++++.-... .+-.-.+++-++-... T Consensus 1 ~~~~~~--~~~~~~~~~~t~---~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~-~~p~~~~~~~a~~v~E 74 (320) T protein:vir:10 1 MAAGTA--FQVDHAQIAQTG---DTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQ-KIPHWIGDVSAQWIGE 74 (320) T ss_pred CCCCcc--CCHHHHHhhccc---cccccccccHHHHHHHHHHHHhccchhhhcceeeccCCce-EEEEEeCCcceEEecC Confidence 333322 110011122211 1112225889999999999999999999999998863322 2222234444444332 Q ss_pred CCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhhh Q lcl|NC_015266. 81 TKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANPL 160 (337) Q Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nPl 160 (337) ....|..-..++...+.+++.---..|+.+.|+.= .++++..+.+.+.++++...-.--++|+-....+. 
T Consensus 75 --~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds--~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~------ 144 (320) T protein:vir:10 75 --GDMKPITKGNMTSQNIAPHKIATIFVASAETVRAN--PANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTY------ 144 (320) T ss_pred --CccccccccceeEEEEeeEEEEEeehhhHHHHhcC--hHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCcc------ Confidence 22334444667888999999999999999999842 26899999999999998888777899965221111 Q ss_pred hhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHHHH Q lcl|NC_015266. 161 LQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPI 240 (337) Q Consensus 161 lqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~~l 240 (337) +....+..... .....+..+-..+|.+..++.. ++++.+++. .+++|.+.....- ..+ T Consensus 145 ----------------~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~--~~~v~n~~~~~~L-~~l 202 (320) T protein:vir:10 145 ----------------LAQTTKSVSLA--DPGGATASDLTAYDAVAVNGLS-LLVNAKKKW--THTLLDDIVEPIL-NGA 202 (320) T ss_pred ----------------cccccccccce--ecccccccccccHHHHHHHHHh-hhhcccCCC--cEEEEcHHHHHHH-HHh Confidence 11111111100 0111223344556666666664 457766664 4889999876532 222 Q ss_pred HhccCChh--H--HHHHHHHHhhhhhcCceeEECCccCCCce--EEecccccEEEEecCceEEeEeeccc---------- Q lcl|NC_015266. 241 VNTTQAPT--E--QLAADLIVSQKRIGNLPAVRVPFFPKRAM--MVTKLENLSIYFQEGARRRSLIDNPK---------- 304 (337) Q Consensus 241 ~n~~~~pt--E--~~A~~~~~~~k~igGl~a~~vPffP~~~i--lvT~l~NLsIY~Q~gs~RR~~~d~p~---------- 304 (337) -...+.+- + ..-........++-|+|++..+++|++.. ++..++++- +...+..+=.+.++.. T Consensus 203 kd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~~~~~~~~~gd~~~~~-~~~~~~~~i~~~~~~~~~~~~~~~~~ 281 (320) T protein:vir:10 203 KDKNGRPLFIESTYTDENSPFRAGRIVSRPTILSDHVADGTTVGYMGDFRNVI-WGQVGGLSFDVTDQATLNLGTPTEPN 281 (320) T ss_pred hccCCceeeccccccCccccccCceeeeeeeEecCCCCCCceEEEEeecceEE-EEEecCeEEEEeecceeeeccccccc Confidence 22211110 0 00001112345789999999999999974 457777764 3334444333322221 Q ss_pred ------cceecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
305 ------RDQIENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 305 ------r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) ++++.----.--++.|.+.++++.+.++.--+| T Consensus 282 ~~~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~~~~ap~~ 320 (320) T protein:vir:10 282 FVSLWQHNLVAVRVEAEYAFHNNDKDAFVKLTNVVTPDA 320 (320) T ss_pred cchhhhcCcEEEEEEEeeccEEecccceEEEEeccCCCC Confidence 122211111124778889999999998887777 No 41 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=98.19 E-value=4.2e-07 Score=55.56 Aligned_cols=291 Identities=10% Similarity=-0.018 Sum_probs=151.2 Q ss_pred CChHHHHHHHHHHHHHHH------------hcC---cccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhh Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAK------------LND---TDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEK 65 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~------------~ng---v~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~ 65 (337) -.......+..+...... .+. ....+..+.|.|...+.+.+.+.+.+.+++.++++++..-.... T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~ 159 (390) T protein:vir:97 80 DMFVASEQFQASTGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEY 159 (390) T ss_pred hhhhhhHHHHHHHHHhhhhhhhhhhHHHHHHHhhhcccccccccccchhhhHHHHHHHhhhhhhHhhcceeeccCCceEE Confidence 000001111222211111 111 11223455688889999999999999999999999987544444 Q ss_pred hccccccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhccc Q lcl|NC_015266. 66 LGLSVSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWN 145 (337) Q Consensus 66 v~~gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfN 145 (337) .......+-++-+.. +.. .|..-..++...+..++.--.+.|+.+.|+.. 
++++..+.+.+.+.++.=.-.--++ T Consensus 160 ~~~~~~~~~a~~v~E-g~~-~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds---~~l~~~i~~~la~a~~~~~d~a~l~ 234 (390) T protein:vir:97 160 VQETGFVNNAAIVAE-GAL-KPESSLKFAKKTDTTHVIAHTMKATRQILSDA---PQLASYMNNRLIRGLKVKEDAEILR 234 (390) T ss_pred EEEecCCcceeeecC-Ccc-ccccccceeEEEEeeeeEEEeehhhHHHHHhH---HHHHHHHHHHHHHHHHHHHHHHHhh Confidence 443322222322221 122 23333457777888888777888999988764 5799999999999888777777778 Q ss_pred ceeccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeE Q lcl|NC_015266. 146 GVKAALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLV 225 (337) Q Consensus 146 G~s~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLV 225 (337) |+-.. .+| +|.+ +.... .....+..+ -..+|.+ .+++..+ ++.++... + T Consensus 235 G~g~~------~~p------~Gi~------------~~~~~--~~~~~~~~~-~~~~d~~-~~~~~~~-~~~~~~~~--~ 283 (390) T protein:vir:97 235 GTGAN------DGL------LGLI------------PQATT--YAAPTTIAG-ATRVDQL-RLAMLQA-SLAEYPAS--G 283 (390) T ss_pred cCCCC------ccc------ccee------------ecccc--ccccccccc-cchHHHH-HHHHHhh-ccccCCCC--E Confidence 74211 112 1332 21110 011111112 1334443 4455544 55555544 6 Q ss_pred EEeChHHHHHHHHHHHhccCCh--hHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeecc Q lcl|NC_015266. 226 VICGRELLHDKYFPIVNTTQAP--TEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNP 303 (337) Q Consensus 226 vivG~dLla~k~~~l~n~~~~p--tE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p 303 (337) ++|.+..+.. ...+-...+.| .+. 
.+ ..+.++-|+|++..+++|++.+++-.+++--.++......=.+.+.+ T Consensus 284 ~v~n~~~~~~-L~~lkd~~G~~l~~~~--~~--~~~~~l~G~pV~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~ 358 (390) T protein:vir:97 284 IVINPIDWAA-IELAKDANNQYLIGNA--RG--TLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVN 358 (390) T ss_pred EEEcHHHHHH-HHHhhcCCCceeecCc--cC--CCCceecceeeEEcCCCCCCcEEEEeccceEEEEEecceEEEEeecc Confidence 8888876552 22232222222 111 11 13468899999999999999999999987333333344333332222 Q ss_pred ---ccceecchhhhcccceeecCCcEEEeeceeec Q lcl|NC_015266. 304 ---KRDQIENYESSNDAYVVEDFGCGCVAENIELV 335 (337) Q Consensus 304 ---~r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~ 335 (337) .++.+.-.-..--++.|=++.+++ .|+|+ T Consensus 359 ~~f~~~~~~~r~~~r~d~~v~~~~a~v---~~~~a 390 (390) T protein:vir:97 359 DDFQRNMVTVLAEERLALVVYRPEALI---TGSFA 390 (390) T ss_pred cccccCcEEEEEEEeeccEEeccccEE---EEEeC Confidence 122222111111222233333333 23444 No 42 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=98.18 E-value=3.7e-07 Score=55.82 Aligned_cols=298 Identities=11% Similarity=0.003 Sum_probs=158.8 Q ss_pred CChHHHHHHHHHHHHHHHhcCccccc-----ceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccc Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLNDTDDVS-----QKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIA 75 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~-----~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~ia 75 (337) |-. ++++.. ...|.+.-. ..--|-+++.+.+.+.+++.|.+++..+++++.--. 
.++-.-..++.+ T Consensus 1 ~~~-----~~e~~~---~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~ip~~~~~~~a 71 (338) T protein:vir:78 1 MAT-----LNELAP---NTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPISYGE-TIIPTTVKRPEV 71 (338) T ss_pred Ccc-----hHHhhh---hhcccccccceecccccccchHHHHHHHHHHHhhchhhhhcceeeccCCc-eEEEEEecCccc Confidence 221 111111 112222111 122477778899999999999999999988876432 222222333443 Q ss_pred eeccC------CCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceec Q lcl|NC_015266. 76 SRTDT------TKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKA 149 (337) Q Consensus 76 gRt~t------~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~ 149 (337) +.+.. +.....|..-..++...+.+++.---..|+.+.|+... ++|+..+++.+.++++.-.-.--+||+.. T Consensus 72 ~~v~~~~~~~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~--~~~~~~i~~~la~a~~~~~d~~~l~G~g~ 149 (338) T protein:vir:78 72 GQVGVGTSNEQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARMNP--SGLYTKLQADLAYAIGRGIDLAVFHGKSP 149 (338) T ss_pred eeecccccccccccccccccccceeEEEEEEEEEEEeehhhHHHHhcCH--HHHHHHHHHHHHHHHHHHHHHHhhcccCC Confidence 33321 11223344445678888999999988999999988633 68999999999999988888888888775 Q ss_pred cCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeC Q lcl|NC_015266. 150 ALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICG 229 (337) Q Consensus 150 A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG 229 (337) ...+.| .|++.- ......+ .......+....|..|..+ +..+......+ .-+++|. 
T Consensus 150 ~~~~~~----------~gi~~~-------~~~~~~~-~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~--~~~~~m~ 205 (338) T protein:vir:78 150 LTGSAL----------QGIDTN-------NVIVNTT-NVDYLQTGTTPLLDRFLDG----YDLVSANTDVD--FNGWAAD 205 (338) T ss_pred Cccccc----------cccccc-------ccccccc-ccccccccchhhHHHHHHH----HHHhhhhcccc--ceEEEEc Confidence 443322 111110 0000000 0111111222233333333 33222222222 3478888 Q ss_pred hHHHHH-HHH-HHHhccCCh--hHHHHHHHHHhhhhhcCceeEECCccCCC---------ceEEecccccEEEEecCceE Q lcl|NC_015266. 230 RELLHD-KYF-PIVNTTQAP--TEQLAADLIVSQKRIGNLPAVRVPFFPKR---------AMMVTKLENLSIYFQEGARR 296 (337) Q Consensus 230 ~dLla~-k~~-~l~n~~~~p--tE~~A~~~~~~~k~igGl~a~~vPffP~~---------~ilvT~l~NLsIY~Q~gs~R 296 (337) +...+. ... .+.+..+.| .+-. .-....+|-|+|++..+++|++ .+++--+++.-|....+ .+ T Consensus 206 ~~~~~~L~~~~~l~d~~g~~l~~~~~---~~~~~~~l~G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~-~~ 281 (338) T protein:vir:78 206 PRYRARLLRSQAYRDANGNVDPTRIN---LAASAGDLLGLPVQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGFADE-IR 281 (338) T ss_pred hHHHHHHHHHhhhccCCCceeecccc---cCCCCceeeeeeEEEccccCccccccCCcccEEEEEecceEEEEeecc-cE Confidence 765542 111 122222222 1110 1123468899999999999964 25666666655443333 22 Q ss_pred EeEeeccccc-------eecchhhh---------cccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 297 RSLIDNPKRD-------QIENYESS---------NDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 297 R~~~d~p~r~-------r~e~y~s~---------Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) -.+.+..... 
..-++..+ --++.|-+..+++.+.+.+-.+| T Consensus 282 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 338 (338) T protein:vir:78 282 VKMSDTATLTDNTSPTPQTVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDEDPDA 338 (338) T ss_pred EEEeecccccccccccccchhhhhcCcEEEEEEEEeccEeecccceEEEecccCCCC Confidence 2222221111 11122222 24678888888888888888888 No 43 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=98.17 E-value=2.8e-07 Score=56.48 Aligned_cols=299 Identities=13% Similarity=0.091 Sum_probs=168.9 Q ss_pred CChHH-HHHHHHHHHHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceecc Q lcl|NC_015266. 1 MKKET-RQAYRKYAAQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTD 79 (337) Q Consensus 1 M~~~t-r~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~ 79 (337) |++.. +..+..... +...+...+..-.|-|++.+.+.+.+++.+..+++.+++++.--. .++-.-.+++-++... T Consensus 3 ~~~~r~~~~~~~~e~---~a~~~~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~-~~~p~~~~~~~a~~v~ 78 (326) T protein:vir:42 3 VNPDRTTPFLGVNDP---KVAQTGDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMGTTG-QKIPHWTGDVSASWIG 78 (326) T ss_pred CCccchhhhcCcchh---hheeccccCCcceechhhHHHHHHHHHhcchhhhhcceeeccCCc-eEEEEEeCCcceEEec Confidence 44422 222211111 222222222233588899999999999999999999999876322 2343444555555553 Q ss_pred CCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhh Q lcl|NC_015266. 80 TTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANP 159 (337) Q Consensus 80 t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nP 159 (337) . ....|..-..++...+..++.---..|+.+.|+. ...+|+..+.+.+.++++.-.-.-.+||+-. 
.+| T Consensus 79 E--g~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~--s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs-------~~p 147 (326) T protein:vir:42 79 E--GDMKPITKGNMTSQTIAPHKIATIFVASAETVRA--NPANYLGTMRTKVATAFAMAFDNAAINGTDS-------PFP 147 (326) T ss_pred C--CccccccccceeEEEEeeEEEEEeehhhHHHHhc--CHHHHHHHHHHHHHHHHHHHHHHHhhcccCC-------Ccc Confidence 2 2233444466888889999988888899888874 2368999999999999998888888999651 122 Q ss_pred hhhccchhHHHHHHhhchhhhcccccc--cCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHH Q lcl|NC_015266. 160 LLQDVNIGWLQQYRDRAGHRVLHEGAK--EAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKY 237 (337) Q Consensus 160 llqDVNkGWlq~~Re~a~~~v~~~~~~--~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~ 237 (337) .+ ++..... .......+..++-..-|....+++. .+.+.++.. .+++|.+..+.. . T Consensus 148 ~g------------------i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~--a~~v~n~~~~~~-L 205 (326) T protein:vir:42 148 TF------------------LAQTTKEVSLVDPDGTGSNADLTVYDAVAVNALS-LLVNAGKKW--THTLLDDITEPI-L 205 (326) T ss_pred cc------------------ccccccccceeecccccccccchhHHHHHHHHHh-hhhhhccCc--cEEEEeHHHHHH-H Confidence 11 1111000 0011111222332333333444443 345666654 478888877653 2 Q ss_pred HHHHhccCCh--hHHH-HHH-HHHhhhhhcCceeEECCccCCCceEE--ecccccEEEEecCceEEeEeecc-------- Q lcl|NC_015266. 238 FPIVNTTQAP--TEQL-AAD-LIVSQKRIGNLPAVRVPFFPKRAMMV--TKLENLSIYFQEGARRRSLIDNP-------- 303 (337) Q Consensus 238 ~~l~n~~~~p--tE~~-A~~-~~~~~k~igGl~a~~vPffP~~~ilv--T~l~NLsIY~Q~gs~RR~~~d~p-------- 303 (337) ..+-...+.| .+-. +.. ......++-|+|++..+++|++..++ ..++++- |...+...=.+.++. T Consensus 206 ~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~Gd~s~~~-~~~~~~~~v~~~~e~~~~~~~~~ 284 (326) T protein:vir:42 206 NGAKDKSGRPLFIESTYTEENSPFRLGRIVARPTILSDHVASGTVVGYQGDFRQLV-WGQVGGLSFDVTDQATLNLGTPQ 284 (326) T ss_pred HHhhccCCceeeccccccCccccccCceeeeeeEEEcCCCCCCceEEEEeecceEE-EEEecceEEEEeecceeeecccc Confidence 2232221111 0000 000 01123578899999999999997654 5677664 334444433332221 Q ss_pred ccceecchhh--------hcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
304 KRDQIENYES--------SNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 304 ~r~r~e~y~s--------~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) .-..+..|++ .=-++.|.+..+++.+.++.-++| T Consensus 285 ~~~~~~~~~~d~~~~r~~~~~d~~v~~~~a~~~l~~~~~~~~ 326 (326) T protein:vir:42 285 APNFVSLWQHNLVAVRVEAEYAFHCNDKDAFVKLTNVDATEA 326 (326) T ss_pred cccchhhhhcCcEEEEEEEEeccEEecccceEEEeeccccCC Confidence 1111222221 112568889999999999999999 No 44 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=98.16 E-value=2.1e-07 Score=57.16 Aligned_cols=295 Identities=13% Similarity=0.044 Sum_probs=163.9 Q ss_pred CChHHHHHHHHHHHHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceeccC Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~t 80 (337) |.-++..... .....+....+-|.+.+.+.+.+++.+.+++..+++++..-... +-.-.+++-++.... T Consensus 1 m~~~~~~a~~----------~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~-~p~~~~~~~a~~v~E 69 (330) T protein:vir:77 1 MAGSTVPSTQ----------VALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGPTGIS-IPHWTGAVSASWTGE 69 (330) T ss_pred Ccccccchhh----------ccccCCCcceechhHHHHHHHHHHhccchhhhcceeeccCCceE-EEEEcCCcceeEecC Confidence 3322211111 11122334468889999999999999999999998887643322 233334455554432 Q ss_pred CCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhhh Q lcl|NC_015266. 81 TKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANPL 160 (337) Q Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nPl 160 (337) ....|..-..++...+.+++.--...|+.+.|+. ..++|+..+.+.+.++++.-.-.--|||+-.. +|. 
T Consensus 70 --g~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~d--s~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~-------~~~ 138 (330) T protein:vir:77 70 --AERKPITKGSFGKQELEPVKITTIFAESAEVVRL--NPLNYLNTMRTKIAEAIALKFDAAAIHGIDKP-------SAF 138 (330) T ss_pred --CCccccccceeeEEEEeEEEEEEeehhhHHHHhc--chHHHHHHHHHHHHHHHHHHHHHHhhcccCCC-------Ccc Confidence 2233344456788889999999999999998874 24689999999999999988888888996521 111 Q ss_pred hhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHHHH Q lcl|NC_015266. 161 LQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPI 240 (337) Q Consensus 161 lqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~~l 240 (337) .|++..... ........ .+. +.+..-..+|.|+ +++..+ ...+++ .-+++|.+..+.. ...+ T Consensus 139 -----~g~~~~~~~---~~~~~~~~----~~~-~~~~~~~~~~~l~-~~~~~~-~~~~~~--~~~~vmn~~~~~~-l~~l 200 (330) T protein:vir:77 139 -----KGYLAETTK---VVSLADTN----LTT-ASGPQGNAYLAVN-NALSLL-VNSGKK--WTGTLLDNVTEPI-LNTA 200 (330) T ss_pred -----ccccccccc---cceeeccc----ccc-cccccchhHHHHH-HHHHhh-hhcCCC--ccEEEEcHHHHHH-HHHH Confidence 344443211 11111111 111 1112212233333 334333 333333 3478999988763 2223 Q ss_pred HhccCChh--HHHHHH--HHHhhhhhcCceeEECCccCCCc------eEEecccccEEEEecCceEEeEeecc------- Q lcl|NC_015266. 241 VNTTQAPT--EQLAAD--LIVSQKRIGNLPAVRVPFFPKRA------MMVTKLENLSIYFQEGARRRSLIDNP------- 303 (337) Q Consensus 241 ~n~~~~pt--E~~A~~--~~~~~k~igGl~a~~vPffP~~~------ilvT~l~NLsIY~Q~gs~RR~~~d~p------- 303 (337) -...+.|- +..... ......++-|+|++..+++|++. +++..+++.-|..+.+ ..-.+.++. 
T Consensus 201 kd~~G~~l~~~~~~~~~~~~~~~~~l~G~PV~~~~~~p~~~~~~~~~~~~gd~s~~~i~~~~~-~~i~~~~e~~~~~~~~ 279 (330) T protein:vir:77 201 VDGNGRPLFVESTYTEQVGAIREGRILGRPTYVADNVVNGTVGNRVVGVMGDFSQVIWGQIGG-LSFDVTDQATLDFGEE 279 (330) T ss_pred hccCCceeecCccccccccccCCceecceeeEEeccccCCCCCCccEEEEEecceEEEEEecC-cEEEEeecceeeeccc Confidence 22222210 000000 11134578899999999999875 8888888875544433 222222221 Q ss_pred -------------ccceecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 304 -------------KRDQIENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 304 -------------~r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) .+|++.-.-..=-++.|-+++++|.|.... +.| T Consensus 280 ~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~-~~~ 325 (330) T protein:vir:77 280 QGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDKDAFVKLTDQV-AGT 325 (330) T ss_pred ccccccccccchhhcCcEEEEEEEEeccEEecccceEEEEecc-CCc Confidence 111111111112378888888888876655 333 No 45 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=98.15 E-value=1.1e-06 Score=53.26 Aligned_cols=292 Identities=10% Similarity=-0.048 Sum_probs=147.0 Q ss_pred CChHHHHHHHHHHHH------------HHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcc Q lcl|NC_015266. 1 MKKETRQAYRKYAAQ------------IAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGL 68 (337) Q Consensus 1 M~~~tr~~~~~y~~~------------~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~ 68 (337) .+.+....+..+... 
..........+....+-|.....+.+.+.+.+.+++.++++++..-.+....+ T Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 162 (390) T protein:vir:10 83 VASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTDSALIEYVQE 162 (390) T ss_pred hhhHHHHHHHHhhhhhhhhhhhHHHHHHHhhhcccccccccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEE Confidence 111111111111100 00111111122344577788889999999999999999999887554444443 Q ss_pred ccccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhccccee Q lcl|NC_015266. 69 SVSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVK 148 (337) Q Consensus 69 gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s 148 (337) ....+-++-... .......+ ..++...+..++.---+.|+.+.|+.- ++++..+.+.+.+.++.-.-.--++|+- T Consensus 163 ~~~~~~a~~v~E-g~~~~~~~-~~~~~i~~~~~k~~~~~~is~ell~d~---~~l~~~i~~~l~~~~~~~~~~~il~G~G 237 (390) T protein:vir:10 163 TGFVNNAAIVAE-GALKPESS-LKFAKKTDTTHVIAHTMKATRQILSDA---PQLASYMNNRLIRGLKVKEDAEILRGTG 237 (390) T ss_pred ecCCcceeeecC-Cccccccc-cceeEEEEeeEEEEEeehhhHHHHHhH---HHHHHHHHHHHHHHHHHHHHHHHhhcCC Confidence 322222222221 22222233 457778888888888889999988863 5799999999998887644444445532 Q ss_pred ccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEe Q lcl|NC_015266. 149 AALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVIC 228 (337) Q Consensus 149 ~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVviv 228 (337) .+ .+|. -+++..... .+..+..++ ...|. +.+++..+ .+.++... 
+++| T Consensus 238 ~~------~~p~------------------Gi~~~~~~~--~~~~~~~~~-~~~~~-~~~~~~~l-~~~~~~~~--~~v~ 286 (390) T protein:vir:10 238 AN------DGLL------------------GLIPQATTY--AAPTTIAGA-TRVDQ-LRLAMLQA-SLAEYPAS--GIVI 286 (390) T ss_pred CC------cccc------------------ccccccccc--ccccccccc-chHHH-HHHHHHhh-ccccCCCC--EEEE Confidence 11 1122 122221110 111111221 22343 55566655 55566554 6778 Q ss_pred ChHHHHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccc-cEEEEecCceEEeEeecc---c Q lcl|NC_015266. 229 GRELLHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLEN-LSIYFQEGARRRSLIDNP---K 304 (337) Q Consensus 229 G~dLla~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~N-LsIY~Q~gs~RR~~~d~p---~ 304 (337) .+..+.. ...+-...+.|-=.- ..-..+.++-|+|++..+++|++.+++-.+++ .-|+...+ .+=.+.+.. . T Consensus 287 n~~~~~~-L~~lkd~~g~~l~~~--~~~~~~~~l~G~pv~~~~~~p~~~~~~gdf~~~~~~~~~~~-~~i~~~~~~~~~~ 362 (390) T protein:vir:10 287 NPIDWAA-IELAKDANNQYLIGN--ARGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQWD-ARVEIGYVNDDFQ 362 (390) T ss_pred cHHHHHH-HHHhhcCCCceeecC--CcCcCCceecceeeEEcCCCCCCcEEEEeccceEEEEEecc-eEEEEeecccccc Confidence 8876542 222222222210000 00112458899999999999999999998886 44544333 222222211 1 Q ss_pred cceecchhhhcccceeecCCcEEEeeceeec Q lcl|NC_015266. 
305 RDQIENYESSNDAYVVEDFGCGCVAENIELV 335 (337) Q Consensus 305 r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~ 335 (337) ++.+.-+-..--+..|-++.+++ .|+|+ T Consensus 363 ~~~~~~r~~~r~d~~v~~~~a~~---~~~~a 390 (390) T protein:vir:10 363 RNMVTVLAEERLALVVYRPEALI---SGSFA 390 (390) T ss_pred cCcEEEEEEEeeccEEeccccEE---EEEeC Confidence 22222111111222333333333 34444 No 46 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=98.13 E-value=1.4e-06 Score=52.74 Aligned_cols=298 Identities=10% Similarity=0.055 Sum_probs=156.2 Q ss_pred CC--hHHHHHHHHHHHHHHH------------------------hcCcccccceeeecHHHHHHHHHHHHhhhhhhcc-c Q lcl|NC_015266. 1 MK--KETRQAYRKYAAQIAK------------------------LNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKS-I 53 (337) Q Consensus 1 M~--~~tr~~~~~y~~~~a~------------------------~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~-I 53 (337) .+ ......|..++..++. .+......-.+.|-......+.+.+++.+.+++. - T Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~ 167 (435) T protein:vir:80 88 PKAPEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGA 167 (435) T ss_pred cchhhhhHHHHHHHHHHHHhccchhHHHHHHHHhhhhhhhhhhhhcccCCCCCccccchhHHHHHHHHHhhhchhhhccc Confidence 00 0011112222222221 1111122223455556678899999988877664 3 Q ss_pred ccccchhhhhh-hhccccccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHH Q lcl|NC_015266. 54 NILPVTELEGE-KLGLSVSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVIL 132 (337) Q Consensus 54 nv~~V~~~~Ge-~v~~gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~ 132 (337) ++++.. .|. ++-.-.+++-++-+..+ ...|..-..++...+..++.---..|+.+.|+..+-.|+++..+.+.+. 
T Consensus 168 ~~v~~~--~~~~~~p~~~~~~~a~~v~E~--~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~l~~~i~~~l~ 243 (435) T protein:vir:80 168 RTLPLS--NGNITIPRLKGGAIVGYIGAD--TDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLT 243 (435) T ss_pred eeeecC--CCceEEEEEeCCcceeeeccC--ccccccccceeeEEEeeEEEEEeehhhHHHHHhhcccHHHHHHHHHHHH Confidence 344433 232 22222234444443322 2233344567788899999999999999999998888999999999999 Q ss_pred HHHhhchhhhcccceeccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhc Q lcl|NC_015266. 133 NQSALDRIMIGWNGVKAALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSS 212 (337) Q Consensus 133 ~~~alD~i~IGfNG~s~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~ 212 (337) ++++.-.-.--+||+-.+. .| +|++. .... ........++.+..+++.+.+++.. T Consensus 244 ~a~~~~~d~a~l~G~G~~~------~p------~Gi~~------------~~~~-~~~~~~~~~~~~~~~~~d~~~~~~~ 298 (435) T protein:vir:80 244 AAIGAREDKAFIRDDGTAN------TP------KGLRF------------WALP-GNVITASDGSTLQKIETDLGKAILA 298 (435) T ss_pred HHHHHHHHHHhhccCCCCC------cc------cceee------------cccc-cceeecccccchhhHHHHHHHHHHH Confidence 9999877766678854221 12 24332 1110 1111122333444444444444433 Q ss_pred ccChhHcCCCCeEEEeChHHHHHHHHHHHh-ccCChhHHHHHHHHHhhhhhcCceeEECCccCCC--------ceEEecc Q lcl|NC_015266. 213 MIDPWFQEDTGLVVICGRELLHDKYFPIVN-TTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKR--------AMMVTKL 283 (337) Q Consensus 213 li~~~~r~~~dLVvivG~dLla~k~~~l~n-~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~--------~ilvT~l 283 (337) +... .......+++|.+..... +..+. ..+.|-= . 
-....++-|+|++..+++|.+ .+++..+ T Consensus 299 ~~~~-~~~~~~~~~vmn~~~~~~--L~~lkd~~G~~l~----~-~~~~~~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~ 370 (435) T protein:vir:80 299 LENA-DANLTQPGWIMAPRTFRF--LEGLRDGNGNKVY----P-ELANGMLKGYPVGKTTQVPINLGEAGKESEIYFTDF 370 (435) T ss_pred hhcc-ccccccCEEEEcHHHHHH--HHhhhccCCceec----c-CCCCCeEeeeeeEEeccccccccCCCCcceEEEEEc Confidence 3221 112224588999887642 22222 2122210 0 013458999999999999985 5777777 Q ss_pred cccEEEEecCceEEeEeeccccc----eecchhhhcc---------cceeecCCcEEEeeceeecc Q lcl|NC_015266. 284 ENLSIYFQEGARRRSLIDNPKRD----QIENYESSND---------AYVVEDFGCGCVAENIELVA 336 (337) Q Consensus 284 ~NLsIY~Q~gs~RR~~~d~p~r~----r~e~y~s~Ne---------~YvVEd~~~~a~iEnI~~~~ 336 (337) ++.-|. ..+..+=.+.+...+. .+.+++.+|. +..|=++.+++.+.+|..+. T Consensus 371 s~~~i~-~~~~~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:80 371 GDVFIG-EEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVAWGA 435 (435) T ss_pred ccEEEE-eecceEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEeccCCCC Confidence 775433 3344333332222111 1112233332 22333677777778887777 No 47 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=98.13 E-value=8.7e-07 Score=53.80 Aligned_cols=282 Identities=9% Similarity=0.094 Sum_probs=163.3 Q ss_pred CChHHHHHHHHHHHHHHHh-cC--------cccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccc-- Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKL-ND--------TDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLS-- 69 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~-ng--------v~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~g-- 69 (337) .+...+..|..|+...... +. 
....+..+.|-+.+.+.+.+.+.+.+.+++.++++++....|....+- T Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 168 (404) T protein:vir:39 89 LKDKFVKEFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWT 168 (404) T ss_pred hHHHHHHHHHHHHhcchhhhhhhhhhhhhcccccCCceeccHHHHHHHHHHHHhhhhHHhhcceeeccCCcceEEEEeec Confidence 3444555566565432111 11 112233566777888999999999999999999999988888766442 Q ss_pred cccccceeccCCCccccc-ccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhccccee Q lcl|NC_015266. 70 VSGPIASRTDTTKAERQP-IDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVK 148 (337) Q Consensus 70 v~g~iagRt~t~~~~R~~-~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s 148 (337) ..++.+.-...+.. .| .+...++...+.+++.---..|+.+.|+.. .++|+..+.+.+.+.++.=.-.--++|+. T Consensus 169 ~~~~~a~~v~Eg~~--~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~~~~~~d~~il~g~g 244 (404) T protein:vir:39 169 DVTPLTVMDAEDGK--IPDLDNPRLTIIKYLIKRYAGIITATNTLLKDT--AENILAWLSSWIAKKVVVTRNQAIIAAMG 244 (404) T ss_pred CCccceeeecCccc--cccccccceeeEEeeeeeEEeeehhHHHHHhhc--hHHHHHHHHHHHHHHHHHHHHHHHHhccc Confidence 22233333333222 12 234456677777777776778999888763 35788888888888887655444455532 Q ss_pred ccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEe Q lcl|NC_015266. 149 AALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVIC 228 (337) Q Consensus 149 ~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVviv 228 (337) ... . .+.. .+.|.++ +++...+++.|+.. 
-+++| T Consensus 245 ~~~------------------------------------~----~~~~---~~~~~i~-~~~~~~~~~~~~~~--a~~v~ 278 (404) T protein:vir:39 245 TVP------------------------------------K----KPTI---AKFDDVI-TMINTSVDPAIIAT--SSLLT 278 (404) T ss_pred ccc------------------------------------c----cccc---ccHHHHH-HHHHHhhhhhhccC--CEEEE Confidence 110 0 0111 2345543 34555678888775 48899 Q ss_pred ChHHHHHHHHHHHh-ccCCh--hHHHHHHHHHhhhhhcCceeEECC--ccCCCc-----eEEecccccEEEEecCceEEe Q lcl|NC_015266. 229 GRELLHDKYFPIVN-TTQAP--TEQLAADLIVSQKRIGNLPAVRVP--FFPKRA-----MMVTKLENLSIYFQEGARRRS 298 (337) Q Consensus 229 G~dLla~k~~~l~n-~~~~p--tE~~A~~~~~~~k~igGl~a~~vP--ffP~~~-----ilvT~l~NLsIY~Q~gs~RR~ 298 (337) .+..+.. +..+. ..+.| ..-. .-....+|-|+|++... .+|..+ +++-.|++.-..+.++..+=. T Consensus 279 n~~~~~~--L~~lkd~~G~~l~~~~~---~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~ 353 (404) T protein:vir:39 279 NQSGLNK--LALVKTAEGKYLLEPDP---TKPNSYLIKGKKVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLL 353 (404) T ss_pred cHHHHHH--HHHhhccCCceeeccCc---CCCCcceecceeEEEecccccCccCCCccEEEEEeccccEEEEeecceEEE Confidence 9877642 22232 22222 0000 01123588899999865 466543 777778876555555555543 Q ss_pred Eeeccc----cceecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 299 LIDNPK----RDQIENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 299 ~~d~p~----r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) +.+... 
++.+--.-..--++.|-++.+++.+.--...+| T Consensus 354 ~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~a~~ 396 (404) T protein:vir:39 354 PTNIGAGAFETDTTKIRVIDRFDVKTTDSEALVAGSFTAIADQ 396 (404) T ss_pred EeccchhhhhhceeeEEEEeeeccEEecccceEEEEeeccccC Confidence 333221 222222222224667888888888876666665 No 48 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=98.11 E-value=9.6e-07 Score=53.58 Aligned_cols=283 Identities=11% Similarity=0.121 Sum_probs=153.9 Q ss_pred CChHHHHHHHHHHHHHHHh---------cCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcccc- Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKL---------NDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSV- 70 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~---------ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv- 70 (337) .+......|.+|+...... .......-.+.|-+.+.+.+.+.+.+.+.+++.++++++....|.....-. T Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~ 168 (408) T protein:vir:10 89 LKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWT 168 (408) T ss_pred hHHHHHHHHHHHhhcchhhhhhhhhhhhhcccccCCceeccHhHHHHHHHHHHhhchhhhhcceeeccCCcceEEEeecc Confidence 2222333344443321110 011122235677777789999999999999999999999988887654322 Q ss_pred -ccccceeccCCCccccc-ccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhccccee Q lcl|NC_015266. 71 -SGPIASRTDTTKAERQP-IDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVK 148 (337) Q Consensus 71 -~g~iagRt~t~~~~R~~-~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s 148 (337) ..+.+.-+..+ ...| .+...++...+..++.---..|+.+.|+.. ..+|+..+.+.+.++++.-.-.--++|+. 
T Consensus 169 ~~~~~a~~v~E~--~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~~~~~~~~~il~g~g 244 (408) T protein:vir:10 169 DVTPLTVMDAED--GKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDT--AENILAWLSSWIAKKVVVTRNQAIIEVMK 244 (408) T ss_pred ccccceeeecCc--cccccccCcceeeEEeeeeeEEeeehhHHHHHhhc--hHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 11222222221 1222 344556777888888877788999988863 34789999999988888654443344432 Q ss_pred ccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEe Q lcl|NC_015266. 149 AALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVIC 228 (337) Q Consensus 149 ~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVviv 228 (337) .+. .. +.. .+.|.++. ++...+++.|+.. -+++| T Consensus 245 ~~~-------------------------------------~~---~~~---~~~~~l~~-~~~~~~~~~~~~~--a~~v~ 278 (408) T protein:vir:10 245 AAP-------------------------------------KK---PTI---AKFDDVIT-MINTAVDPAIIAT--SSLLT 278 (408) T ss_pred ccc-------------------------------------cc---ccc---ccHHHHHH-HHHHhhhhhhccC--CEEEE Confidence 110 00 011 34566554 3444568888765 48899 Q ss_pred ChHHHHHHHHHHHhc-cCChhHHHHHHH-HHhhhhhcCceeEECC--ccCCCc-----eEEecccccEEEEecCceEEeE Q lcl|NC_015266. 229 GRELLHDKYFPIVNT-TQAPTEQLAADL-IVSQKRIGNLPAVRVP--FFPKRA-----MMVTKLENLSIYFQEGARRRSL 299 (337) Q Consensus 229 G~dLla~k~~~l~n~-~~~ptE~~A~~~-~~~~k~igGl~a~~vP--ffP~~~-----ilvT~l~NLsIY~Q~gs~RR~~ 299 (337) .+..+.. +..+.. .+.|- ..... -....++-|+|++.++ .+|..+ +++-.+++.-..+.++...=.+ T Consensus 279 n~~~~~~--l~~lkd~~G~~i--~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~ 354 (408) T protein:vir:10 279 NQSGLNK--LALVKTAEGKYL--LEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLP 354 (408) T ss_pred cHHHHHH--HHHhhccCCceE--eccCcCCCCCceecceeeEEecccccCccCCCceEEEEEehhccEEEEEecceEEEE Confidence 9987663 222321 11210 00000 0123588999999977 577654 7777888754444344444333 Q ss_pred eeccc----cceecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
300 IDNPK----RDQIENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 300 ~d~p~----r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) .+.+. ++.+--+-..--+.+|-++.+++.++--..+++ T Consensus 355 ~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~~~~~~ 396 (408) T protein:vir:10 355 TNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQ 396 (408) T ss_pred cccccchhhcCceEEEEEEeeccEEeccccEEEEEeeccccC Confidence 32221 222222222223445556666665543332222 No 49 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=98.10 E-value=7.4e-07 Score=54.20 Aligned_cols=297 Identities=10% Similarity=-0.005 Sum_probs=145.8 Q ss_pred CChHHHHHHHHHHHHHHH---hcCcc----cccceeeecHHHHHHHHHHHHh-hhhhhcccccccchhhhhhhh------ Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAK---LNDTD----DVSQKFAVEPSVQQTLETKMQE-SSAFLKSINILPVTELEGEKL------ 66 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~---~ngv~----~~~~~Fsv~P~~~q~L~~~i~e-ss~FL~~Inv~~V~~~~Ge~v------ 66 (337) |....+..+..+...... .+... .....+.+.|.........+.+ ++...+.++++++..-..... T Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 177 (419) T protein:vir:94 98 RARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGT 177 (419) T ss_pred HHhhhhhhhhHHHHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHhhhhhhhhcceeeeccCCceeeeeecccc Confidence 111111222222222111 11110 1133456777777776665544 445666788877653222211 Q ss_pred -ccccccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhccc Q lcl|NC_015266. 67 -GLSVSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWN 145 (337) Q Consensus 67 -~~gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfN 145 (337) .+...++-++=+ +.....|..-..++...+..++.---..|+.+.|+.. 
++|+..+.+.+.++++.=.-.-.+| T Consensus 178 ~~~~~~~~~a~~v--~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~---~~l~~~i~~~la~a~~~~~d~aii~ 252 (419) T protein:vir:94 178 AGAGSTWNKAAVV--PEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN---SQLMGYIQGRLTYGLRFLRDRQLLN 252 (419) T ss_pred ccccccCccccee--cCCccccccccceeeEEeeeeeEEEeehhhHHHHHhH---HHHHHHHHHHHHHHHHHHHHHHHHh Confidence 111111111111 1122223333456777788888877789999999864 5799999999999988777777778 Q ss_pred ceeccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCccc-ccHHHHHHHHHhcccChhHcCCCCe Q lcl|NC_015266. 146 GVKAALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDY-VNLDALVMDIVSSMIDPWFQEDTGL 224 (337) Q Consensus 146 G~s~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy-~nLDalV~da~~~li~~~~r~~~dL 224 (337) |+- +.+|. |++..-. +.. .....+ ..+... ..+|.| .+++..+.++.++. . T Consensus 253 G~G-------~~~p~------Gi~~~~~------~~~-~~~~~~----~~~~t~~~~~~~l-~~~~~~~~~~~~~~-~-- 304 (419) T protein:vir:94 253 GNG-------STEMQ------GILTTPG------IGT-YQQPKP----TAPATDEPPLVDI-RRAKTVAEIAGFPP-D-- 304 (419) T ss_pred ccC-------ccccc------ceecccc------ccc-cccccc----ccccccchhHHHH-HHHHHhhhhccCCC-C-- Confidence 844 22344 7765211 110 000000 111111 223333 33454444444333 2 Q ss_pred EEEeChHHHHHHHHHHHhccCCh---hHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEee Q lcl|NC_015266. 225 VVICGRELLHDKYFPIVNTTQAP---TEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLID 301 (337) Q Consensus 225 VvivG~dLla~k~~~l~n~~~~p---tE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d 301 (337) +++|.+..+.. ...+....+.+ .+-. 
.+ ....+|-|+|++..+++|++.+++-.+++....+.+....=.+.+ T Consensus 305 ~~v~n~~~~~~-l~~~k~~~~~~~~~~~~~-~~--~~~~~l~G~pV~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~ 380 (419) T protein:vir:94 305 GVVVHPQDWES-IELDQAPGSGVFRVIANV-QG--EATPRIWGLNVVSTVAIAQGTALVGGFRQGATLWSRQGITVLMTD 380 (419) T ss_pred EEEEcHHHHHH-HHHHhhcCCCceeecCCc-cc--CCCccccceeeEEcCCCCCccEEEeeccceEEEEEecceEEEEec Confidence 78888876553 22222222221 1100 00 124588999999999999999999999886655555554433322 Q ss_pred ccc----cceecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 302 NPK----RDQIENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 302 ~p~----r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) ... ++.+.-.-..--+..|-+...+|. +++..| T Consensus 381 ~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~---~~~~aa 417 (419) T protein:vir:94 381 SHADFFTANTLVILAEFRANLAVYQPKAFVR---VTFAAA 417 (419) T ss_pred cccchhhcCcEEEEEEEeeccEEeccccEEE---EEeccC Confidence 221 222211111122333444444443 445555 No 50 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=98.08 E-value=1.3e-06 Score=52.84 Aligned_cols=292 Identities=10% Similarity=0.019 Sum_probs=162.2 Q ss_pred CChHHHHHH--HHHHHHH---HHhcC--cc-cccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcccccc Q lcl|NC_015266. 1 MKKETRQAY--RKYAAQI---AKLND--TD-DVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSG 72 (337) Q Consensus 1 M~~~tr~~~--~~y~~~~---a~~ng--v~-~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g 72 (337) |++.-...+ .+|...+ ...+. +. .......|-+++...+.+.+.+.|.+++..+++++.-.... 
+-.-.++ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~-~p~~~~~ 79 (324) T protein:vir:10 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKK-FTFWADK 79 (324) T ss_pred CCCchHHHHHHHHHHHHhhccceecccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceE-EEEEeCC Confidence 766544332 2333332 22211 11 11224467788889999999999999999999887743322 2222234 Q ss_pred ccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCC Q lcl|NC_015266. 73 PIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALS 152 (337) Q Consensus 73 ~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~ 152 (337) +.+.-..- +...|..-..++...+.+++.---..|+.+.|+... ++|+..+.+.+.++++.-.-.-.++|+-. . T Consensus 80 ~~a~~v~E--g~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~ai~~~~d~a~l~G~g~--~ 153 (324) T protein:vir:10 80 PGAYWVGE--GQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGN--N 153 (324) T ss_pred cceeEecc--CccccccccceeEEEEeeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhhcCCC--C Confidence 44444322 223344446678888999998888999999998663 58999999999998887666666777431 1 Q ss_pred CChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHH Q lcl|NC_015266. 153 TDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) Q Consensus 153 TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dL 232 (337) +.|. -++.... .+.....+.-.|..|. +++.. |++.++... +++|.+.. 
T Consensus 154 ~~~~----------------------~i~~~~~--~~~~~~~~~~t~~~i~----~~~~~-l~~~~~~~~--~~v~n~~~ 202 (324) T protein:vir:10 154 PFGK----------------------SIAQSIE--KTNKVIKGDFTQDNII----DLEAL-LEDDELEAN--AFISKTQN 202 (324) T ss_pred ccCc----------------------ccccccc--ccceeccccCCHHHHH----HHHHh-hhhccCCCC--EEEEcHHH Confidence 1111 0111100 0000011112243333 34443 355555544 67888877 Q ss_pred HHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCC--CceEEecccccEEEEecCceEEeEeeccc------ Q lcl|NC_015266. 233 LHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPK--RAMMVTKLENLSIYFQEGARRRSLIDNPK------ 304 (337) Q Consensus 233 la~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~--~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~------ 304 (337) +.. ...+-...+.|- .. -....++-|+|++..|..|. +.+++..++++-| ...+..+-.+.++.. T Consensus 203 ~~~-L~~l~d~~g~~~--~~---~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~~-~~~~~~~i~~~~~~~~~~~~~ 275 (324) T protein:vir:10 203 RSL-LRKIVDPETKER--IY---DRNSDTLDGLPVVNLKSSNLKRGELITGDFDKLIY-GIPQLIEYKIDETAQLSTVKN 275 (324) T ss_pred HHH-HHHhhccCCcee--ec---CCCCccccceeEEeecCCCCCcceEEEEecccEEE-EEecCcEEEEeeccccccccc Confidence 663 222322222221 10 01235789999999988664 4588899998643 333344443433321 Q ss_pred ----------cceecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 305 ----------RDQIENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 305 ----------r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) +|.+.-.-..--|+.|-+.++++.+.+.+-+.. 
T Consensus 276 ~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~ 318 (324) T protein:vir:10 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTD 318 (324) T ss_pred ccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCC Confidence 111111111223567888888888876555543 No 51 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=98.06 E-value=6.1e-07 Score=54.64 Aligned_cols=281 Identities=11% Similarity=0.030 Sum_probs=161.7 Q ss_pred HHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceeccCCCcccccccccccCc Q lcl|NC_015266. 16 IAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTDTTKAERQPIDPTALDS 95 (337) Q Consensus 16 ~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~t~~~~R~~~~~~~l~~ 95 (337) +|. +-.+.|-|...+.+.+.+++.|.+++...++++..-. .++-.-.+++-++-...+ ...|..-..++. T Consensus 1 ma~-------~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~-~~ip~~~~~~~a~~v~E~--~~~~~~~~~f~~ 70 (298) T protein:vir:16 1 MVL-------NKGTLFDPTLVTDLISKVAGKSSIARLSAQKPIPFNG-EKVFTFTMDSEIDVVAES--GKKTHGGVTLAP 70 (298) T ss_pred Ccc-------cCcceechhHHHHHHHHHHhhhhhhhhcceeeccCCc-eEEEEEecCcceEEecCC--ccccccccceeE Confidence 222 2244688899999999999999999999888876422 234343445555444322 233444456778 Q ss_pred cceeeEeeccccccCHHHHHH-HhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhhhhhccchhHHHHHHh Q lcl|NC_015266. 96 NRYRCEKTDYDTAITYRKLDA-WAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANPLLQDVNIGWLQQYRD 174 (337) Q Consensus 96 ~~Y~c~qtn~d~~i~y~~LD~-wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nPllqDVNkGWlq~~Re 174 (337) ..+..++.---..|+.+.|.+ +....+|++.+.+.+.++++.-.-.-.+||+-...-+... +.+. 
T Consensus 71 v~l~~~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~--~~~~------------ 136 (298) T protein:vir:16 71 QTMVPIKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASA--VIGT------------ 136 (298) T ss_pred EEEeeeeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcccc--cccc------------ Confidence 888888888889999998854 4445689999999999998887777788885422111100 0000 Q ss_pred hchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHHHHHhccCChhHHHHHH Q lcl|NC_015266. 175 RAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNTTQAPTEQLAAD 254 (337) Q Consensus 175 ~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~~l~n~~~~ptE~~A~~ 254 (337) ......+ ......+. .-.++++.+.+++..+ ...+++-. +++|.+...+. ...+-...+.|-= .... T Consensus 137 ----~~~~~~~--~~~~~~~~--~~~~~~~~i~~~~~~~-~~~~~~~~--~~vmn~~~~~~-l~~lkd~~G~~i~-~~~~ 203 (298) T protein:vir:16 137 ----NHFDSKV--TQKVEAPR--GIADPNGAIENAVELL-TGVDADVT--GIAINPSFRSA-LAKQKDLQDNALF-PELK 203 (298) T ss_pred ----ccccccc--cccccccc--ccccHHHHHHHHHHHh-hhcCCCcc--EEEEcHHHHHH-HHHhhccCCCeee-cCcc Confidence 0000000 00011111 1133444455555433 34334322 58888876653 2222222222210 0000 Q ss_pred HHHhhhhhcCceeEECCccCCC------ceEEecccccEEEEecCceEEeEeeccccc-eecchhhhc---------ccc Q lcl|NC_015266. 255 LIVSQKRIGNLPAVRVPFFPKR------AMMVTKLENLSIYFQEGARRRSLIDNPKRD-QIENYESSN---------DAY 318 (337) Q Consensus 255 ~~~~~k~igGl~a~~vPffP~~------~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~-r~e~y~s~N---------e~Y 318 (337) .-....++-|+|++..+++|+. .+++--+++.-.|..++..+-.+.+.-+-+ ...+|..+| -++ T Consensus 204 ~~~~~~~l~G~PV~~~~~v~~~~~~~~~~~~~GDfs~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~ 283 (298) T protein:vir:16 204 WGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGW 283 (298) T ss_pred cCCCCceecceeeEEecccccccCCCccEEEEeeccceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEcc Confidence 1112368899999999999974 466678888877766666665554432111 112233333 456 Q ss_pred eeecCCcEEEeecee Q lcl|NC_015266. 
319 VVEDFGCGCVAENIE 333 (337) Q Consensus 319 vVEd~~~~a~iEnI~ 333 (337) .|-+..++|.+++++ T Consensus 284 ~v~~~~a~~~l~~at 298 (298) T protein:vir:16 284 GILDATKFARVTEAN 298 (298) T ss_pred EeecccceEEEeecC Confidence 778888888888887 No 52 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=98.05 E-value=1.6e-06 Score=52.35 Aligned_cols=281 Identities=10% Similarity=0.066 Sum_probs=160.6 Q ss_pred hcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceeccCCCc--c-cccccccccCc Q lcl|NC_015266. 19 LNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTDTTKA--E-RQPIDPTALDS 95 (337) Q Consensus 19 ~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~t~~~--~-R~~~~~~~l~~ 95 (337) ........-.+.|-+++.+.+.+.+++.+.+++..+++++.--. .++-.-.+++-++-...+.. + ..|..-..++. T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~~a~wv~E~~~~~~~~~~~s~~~f~~ 79 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKT-THLPVLATLPEADWVGESATDPKGVKPTSKVTWAN 79 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCc-EEEEEEeCCcceEEeecccccccccccccccceee Confidence 22233333466788888899999999999999999998876332 22222223444443332221 1 12333455677 Q ss_pred cceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhhhhhccchhHHHHHHhh Q lcl|NC_015266. 96 NRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANPLLQDVNIGWLQQYRDR 175 (337) Q Consensus 96 ~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nPllqDVNkGWlq~~Re~ 175 (337) ..+..++.---..|+.+.|+... ++|+..+++.+.++++.-.-.--|||+-.. 
+.+ T Consensus 80 i~~~~~k~~~~~~is~ell~ds~--~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~--~~~-------------------- 135 (305) T protein:vir:25 80 RTLVAEEIAVIIPVHENVIDDAT--VAVLTEVAELGGQAIGKKLDQAVIFGTDKP--ASW-------------------- 135 (305) T ss_pred EEeeeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHhhhheeccCCC--CCc-------------------- Confidence 77888888888899999997633 589999999999999999988899997511 111 Q ss_pred chhhhccccccc-CCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHHHHHhccCChhHHHHHH Q lcl|NC_015266. 176 AGHRVLHEGAKE-AGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNTTQAPTEQLAAD 254 (337) Q Consensus 176 a~~~v~~~~~~~-~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~~l~n~~~~ptE~~A~~ 254 (337) .+..++..+... ......+..-.+.++..++..+...+.+..+... .++|.+.....- ..+-...+.| T Consensus 136 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~v~~~~~~~~l-~~lkd~~G~~------- 204 (305) T protein:vir:25 136 VSPALIPAAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAGWAPD---TLLSSLALRYEV-ANIRDANGNP------- 204 (305) T ss_pred cccccccccccccccccccccchhhhHHHHHHHHHHHhhhhcccccc---eeEecHHHHHHH-HHhhccCCce------- Confidence 011122111110 0001112222344455555555443323222222 367777665532 1222221222 Q ss_pred HHHhhhhhcCceeEECCccCCC----ceEEecccccEEEEecCceEEeEeec----cccceecchhhh--------cccc Q lcl|NC_015266. 255 LIVSQKRIGNLPAVRVPFFPKR----AMMVTKLENLSIYFQEGARRRSLIDN----PKRDQIENYESS--------NDAY 318 (337) Q Consensus 255 ~~~~~k~igGl~a~~vPffP~~----~ilvT~l~NLsIY~Q~gs~RR~~~d~----p~r~r~e~y~s~--------Ne~Y 318 (337) +....++-|+|++..+++|.. .+++-.++++-|..+.+- +-.+.++ .....+.-|++- =-|+ T Consensus 205 -i~~~~~l~G~Pv~~~~~~~~~~~~~~~~~gd~s~~~i~~~~~~-~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~~ 282 (305) T protein:vir:25 205 -VFRDDSFAGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDI-TVKFLDQATLGTGENQINLAERDMVALRLKARFAY 282 (305) T ss_pred -eecCCcccccceEEcCccCCCCCccEEEEEecceEEEEEecCe-EEEEeeeeeeecCCceeeeeecCcEEEEEEEeecc Confidence 223458999999999998854 578888888766555543 2222111 111112112110 1266 Q ss_pred eeecCCcEEEeeceeeccC Q lcl|NC_015266. 
319 VVEDFGCGCVAENIELVAA 337 (337) Q Consensus 319 vVEd~~~~a~iEnI~~~~a 337 (337) .|-++.+++.+.+++++.. T Consensus 283 ~v~~p~a~v~~~~~~~~~~ 301 (305) T protein:vir:25 283 VLGVSATAQGANKTPVAVV 301 (305) T ss_pred eeeCcccEEEEcccccccc Confidence 7889999999999887543 No 53 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=98.05 E-value=2.2e-06 Score=51.58 Aligned_cols=276 Identities=13% Similarity=0.117 Sum_probs=149.5 Q ss_pred CChHHHHHHHHHHHH-HHHhcCcc-cccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhh-hcccccccccee Q lcl|NC_015266. 1 MKKETRQAYRKYAAQ-IAKLNDTD-DVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEK-LGLSVSGPIASR 77 (337) Q Consensus 1 M~~~tr~~~~~y~~~-~a~~ngv~-~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~-v~~gv~g~iagR 77 (337) ++...+..|..|+.. ........ ..+-.+.|-+.+...+.+.+.+.+.+++.++++++....|.. +....+++-++= T Consensus 71 ~~~~~~~~~~~~l~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~~~~~~a~~ 150 (371) T protein:vir:81 71 VKENEVEAFVNHIRTRFRNAMSEGSNQDGGYTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKKRSQQTGFVE 150 (371) T ss_pred hHHHHHHHHHHHHHHHHHHhhccCCCccCceeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCcceee Confidence 455555666666543 22222222 223456777788899999999999999999999998766664 333333344433 Q ss_pred ccCCCccccc-ccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChh Q lcl|NC_015266. 78 TDTTKAERQP-IDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKA 156 (337) Q Consensus 78 t~t~~~~R~~-~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~ 156 (337) ...+ ...| .+...++.....+++.---+.|+.+.|+... ++++..+.+.+.+.++.-.-..=.+|+..... 
T Consensus 151 v~Eg--~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~a~~~~~~~~i~~g~g~~~~---- 222 (371) T protein:vir:81 151 VAEG--AAIGEKATPQFTLLQYQVKKYAGFFRVTNELLNDST--EAIVNTLVRWIGDESRVTRNGLIINVLNTKAK---- 222 (371) T ss_pred eccc--cccccccccceeeEEeeeeEEEEeehhhHHHHhhhh--HHHHHHHHHHHHHHHHHHHHHHHHhhcccccc---- Confidence 3222 2222 2334566777777777777899999988643 58899999999887776544333444321110 Q ss_pred hhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHH Q lcl|NC_015266. 157 ANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDK 236 (337) Q Consensus 157 ~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k 236 (337) . | -.+.|.+... +...+++.|+.. .+++|.+...+. T Consensus 223 -------------------------------~--------~-~~~~~~i~~~-~~~~l~~~~~~~--a~~vmn~~~~~~- 258 (371) T protein:vir:81 223 -------------------------------T--------A-IADLDGLKQI-INVQLDPVFRST--SSVIVNQDAFNW- 258 (371) T ss_pred -------------------------------c--------c-cccHHHHHHH-HHhhcchhhhcC--CEEEEcHHHHHH- Confidence 0 0 0234444433 444568888764 488899877653 Q ss_pred HHHHHhccCCh--hHHHHHHHHHhhhhhcCceeEECCccCCCc------------eEEecccc-cEEEEecCceEEeEee Q lcl|NC_015266. 237 YFPIVNTTQAP--TEQLAADLIVSQKRIGNLPAVRVPFFPKRA------------MMVTKLEN-LSIYFQEGARRRSLID 301 (337) Q Consensus 237 ~~~l~n~~~~p--tE~~A~~~~~~~k~igGl~a~~vPffP~~~------------ilvT~l~N-LsIY~Q~gs~RR~~~d 301 (337) ...+-...+.| ..-.. -....++-|+|++..+++|.+. +++=.+++ ..|+...+.. =.+ + T Consensus 259 L~~lkd~~g~~l~~~~~~---~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~~~i~~Gd~~~~~~~~~~~~~~-i~~-~ 333 (371) T protein:vir:81 259 LDTLKDQNGQYLLQPSIS---SPTGRQLLGLPVVIVSNKVLANRVDGGTGAQFAPIIVGDLKEAVVMFDRQRTE-IMS-S 333 (371) T ss_pred HHHhhccCCCeeeecccC---CCCCceecceeEEEecccccCccccccccCCcceEEEEehhceEEEEeecceE-EEE-e Confidence 22222221111 00000 0134588899999999998543 44555554 2333333222 111 1 Q ss_pred ccccceecchhhhc-ccceeecC-CcEEEee----ceeeccC Q lcl|NC_015266. 
302 NPKRDQIENYESSN-DAYVVEDF-GCGCVAE----NIELVAA 337 (337) Q Consensus 302 ~p~r~r~e~y~s~N-e~YvVEd~-~~~a~iE----nI~~~~a 337 (337) +.. .+++..| -+|.+|-. +....-. -+++..| T Consensus 334 ~~~----~~~f~~~~v~~~~~~r~d~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 334 NVA----MDAFETDATLWRAIERMDVKMRDDEAFVFGEVQLA 371 (371) T ss_pred ccc----cchhhcCceEEEEEEeeccEEecccceEEEEEecC Confidence 111 1112222 24444443 2222221 2456666 No 54 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=98.04 E-value=2e-06 Score=51.87 Aligned_cols=290 Identities=9% Similarity=-0.048 Sum_probs=151.5 Q ss_pred CChHHHHHHHHHHHHHHHhcC---------------cccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhh Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLND---------------TDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEK 65 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ng---------------v~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~ 65 (337) ........+..+.......-+ ....+....+-|.....+.+.+.+.+.+++..+++++..-.... T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 159 (390) T protein:vir:81 80 DMFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEY 159 (390) T ss_pred hhhhhhHHHHHHHHHHhhhhhhhhhHHHHHHHhhccccccCCcceechhhhHHHHHHHhhhhhhhhhcceeeccCCceEE Confidence 000001111222211111111 01123344678888899999999999999999999887544444 Q ss_pred hccccccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhccc Q lcl|NC_015266. 66 LGLSVSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWN 145 (337) Q Consensus 66 v~~gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfN 145 (337) ..+....+-+.=+. . +...|..-..++...+..++.--.+.|+.+.|+.. 
++++..+.+.+.+.++.-.-.--+| T Consensus 160 ~~~~~~~~~a~~v~-E-g~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~---~~~~~~i~~~l~~~~~~~~d~a~l~ 234 (390) T protein:vir:81 160 VQETGFVNNAAIVA-E-GALKPESSLKFAKKTDTTHVIAHTMKATRQILSDA---PQLASYMNNRLIRGLKVKEDAEILR 234 (390) T ss_pred EEEecCCcceeeec-C-CcccccccceeeEEEEeeeEEEEeehhhHHHHHhH---HHHHHHHHHHHHHHHHHHHHHHHHh Confidence 44432222222221 1 12223333467888899999988999999999874 4799999999999888776666677 Q ss_pred ceeccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeE Q lcl|NC_015266. 146 GVKAALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLV 225 (337) Q Consensus 146 G~s~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLV 225 (337) |+-.. .+| +|.+ +..... ... ...++-...|. +.+++..+ .+.++... + T Consensus 235 G~g~~------~~~------~Gi~------------~~~~~~--~~~-~~~~~~~~~~~-~~~~~~~~-~~~~~~~~--~ 283 (390) T protein:vir:81 235 GTGAN------DGL------LGLI------------PQATTY--AAP-TTIAGATRVDQ-LRLAMLQA-SLAEYNPS--G 283 (390) T ss_pred cCCCC------Ccc------ccee------------eccccc--ccc-cccccchhHHH-HHHHHHhh-ccccCCCC--E Confidence 74311 112 2332 221110 111 11222233444 34456555 44444433 7 Q ss_pred EEeChHHHHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeecccc Q lcl|NC_015266. 226 VICGRELLHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNPKR 305 (337) Q Consensus 226 vivG~dLla~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r 305 (337) ++|.+..+. +...+-...+.|-=.-..+ ....++-|+|++..+++|++.+++=.+++.-..+.+++.+=...+.+. 
T Consensus 284 ~v~~~~~~~-~l~~lkd~~G~~l~~~~~~--~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~- 359 (390) T protein:vir:81 284 IVINPIDWA-AIELAKDANNQYLIGNARG--TLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVGE- 359 (390) T ss_pred EEEcHHHHH-HHHHhhcCCCceeecCccc--ccCceecceeeEEcCCCCCCcEEEEehhceEEEEEecceEEEEecccc- Confidence 888888665 2222322222220000001 124578899999999999999999999874323333333333222211 Q ss_pred ceecchhhhc-ccceeec-CCcEEEee----ceeec Q lcl|NC_015266. 306 DQIENYESSN-DAYVVED-FGCGCVAE----NIELV 335 (337) Q Consensus 306 ~r~e~y~s~N-e~YvVEd-~~~~a~iE----nI~~~ 335 (337) +...| .+|.++. ++....-. -|+++ T Consensus 360 -----~~~~~~v~~r~~~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 360 -----DFQRNMITVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred -----hhhcCcEEEEEEEeeccEEecccceEEEEeC Confidence 22222 2333333 23322222 23444 No 55 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=98.03 E-value=3.5e-06 Score=50.53 Aligned_cols=292 Identities=10% Similarity=0.008 Sum_probs=161.9 Q ss_pred CChHHHHH--HHHHHHHH---HHhcC--cc-cccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcccccc Q lcl|NC_015266. 1 MKKETRQA--YRKYAAQI---AKLND--TD-DVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSG 72 (337) Q Consensus 1 M~~~tr~~--~~~y~~~~---a~~ng--v~-~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g 72 (337) |++.-... +..|...+ +..+- +. .......|-+++...+.+.+.+.|.+++..+++++.-... 
++-.-.++ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~-~~p~~~~~ 79 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEK-KFTFWADK 79 (324) T ss_pred CCcchhhhHHHHHHHHhhhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCce-EEEEEecC Confidence 55442222 23333333 22221 11 1123446778888999999999999999999988764222 22222233 Q ss_pred ccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCC Q lcl|NC_015266. 73 PIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALS 152 (337) Q Consensus 73 ~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~ 152 (337) +.+.-.. .+...|..-..++...+..++.--...|+.+.|++.. ++|+..+.+.+.++++.-.-.--++|+-. . T Consensus 80 ~~a~~v~--Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~--~~l~~~i~~~l~~aia~~~d~~~l~G~g~--~ 153 (324) T protein:vir:96 80 PGAYWVG--EGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGN--N 153 (324) T ss_pred cceeeec--CCccccccccceeEEEEEeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhhcCCC--C Confidence 4443332 2233344446688888999999999999999999754 68999999999999887777777788531 1 Q ss_pred CChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHH Q lcl|NC_015266. 153 TDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) Q Consensus 153 TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dL 232 (337) ..| . -++.... .........-.|++|-. ++.. |++.+++.. +++|.+.. 
T Consensus 154 ~~~----~------------------~~~~~~~--~~~~~~~~~~~~~~i~~----~~~~-i~~~~~~~~--~~i~n~~~ 202 (324) T protein:vir:96 154 PFG----K------------------SIAQSIK--KTNKVIKGDFTQDNIID----LEAL-LEDDELEAN--AFISKTQN 202 (324) T ss_pred CcC----c------------------ccccccc--ccceecccccchHHHHH----HHHh-hhhccCCCC--EEEEcHHH Confidence 111 0 1111110 01111112223444433 4443 355555443 67888877 Q ss_pred HHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccC--CCceEEecccccEEEEecCceEEeEeeccccc---- Q lcl|NC_015266. 233 LHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFP--KRAMMVTKLENLSIYFQEGARRRSLIDNPKRD---- 306 (337) Q Consensus 233 la~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP--~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~---- 306 (337) +.. ...+-...+.|-- .. ....++-|+|++..|..+ ++.+++-.++++-| -..+..+-.+.++.... T Consensus 203 ~~~-L~~lkd~~G~~~~--~~---~~~~~l~G~PV~~~~~~~~~~~~~~~gd~s~~~~-~~~~~~~i~~~~~~~~~~~~~ 275 (324) T protein:vir:96 203 RSL-LRKIVDPETKERI--YD---RNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIY-GIPQLIEYKIDETAQLSTVKN 275 (324) T ss_pred HHH-HHHhhCCCCCeee--cC---CCCCcccceeeEeecCCCCCcceEEEEecceEEE-EEecCcEEEEeeccccccccc Confidence 663 2222222222210 00 124578999999877654 44588888888643 33344443333332211 Q ss_pred ---eecchhhhcc---------cceeecCCcEEEeeceeeccC Q lcl|NC_015266. 307 ---QIENYESSND---------AYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 307 ---r~e~y~s~Ne---------~YvVEd~~~~a~iEnI~~~~a 337 (337) ..-++..+|. ++.|-+.++++.+...+-+.. 
T Consensus 276 ~~~~~~~~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a~~~~~ 318 (324) T protein:vir:96 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTD 318 (324) T ss_pred ccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccccCC Confidence 1122333343 677888888887765444433 No 56 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=98.01 E-value=2.8e-06 Score=51.04 Aligned_cols=282 Identities=10% Similarity=0.033 Sum_probs=148.8 Q ss_pred CChHHHHHHHHHHHHHHH-----hcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccc Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAK-----LNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIA 75 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~-----~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~ia 75 (337) .....+..|..|+..... ..+.....-.+.|-+...+.+.+.+.+.+.+++.+++++|...+|.......++.-+ T Consensus 88 ~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 167 (394) T protein:vir:10 88 PIDAKKKAINDFIHSHGKVIDNAAGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKRATDRF 167 (394) T ss_pred HHHHHHHHHHHHHhccchhhhhhhcccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEecCCCcc Confidence 234455667777644221 112222234577877889999999999999999999999887766654433222211 Q ss_pred eeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCCh Q lcl|NC_015266. 76 SRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDK 155 (337) Q Consensus 76 gRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~ 155 (337) .=. .+.......+...++...+..++.---+.|+.+.|+. ..++|+..+.+.+.+.++.=.-.--.+|.. 
T Consensus 168 ~~~-~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~la~~~~~~~~~~il~g~g------- 237 (394) T protein:vir:10 168 SSV-AELAENPALAEPEFEQVDWSVSTYRGAIPLSEEAIAD--SAVDLTSLVGQSINEKSVNTYNAMIAPVLQ------- 237 (394) T ss_pred ccc-cccccccccccccceeEEeeeeeeEeeehhHHHHHhh--hhHHHHHHHHHHHHHHHHHHHHHHHhhccc------- Confidence 111 1111111123344556666666665567888888885 336899999998888777532211111211 Q ss_pred hhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHH Q lcl|NC_015266. 156 AANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHD 235 (337) Q Consensus 156 ~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~ 235 (337) ++ ..+........|.++. ++...+++.|. =+++|.+..+.. T Consensus 238 --------------------------------~~--~~~~~~~~~~~d~l~~-~~~~~~~~~~~----a~~vmn~~~~~~ 278 (394) T protein:vir:10 238 --------------------------------SF--TAKATTTDTLVDSLKH-ILNVDLDPAYS----RALVVTQSLFNT 278 (394) T ss_pred --------------------------------cc--ccccccccccHHHHHH-HHHhhhhhhcc----CEEEecHHHHHH Confidence 01 0111223456677664 55667788874 278999987653 Q ss_pred HHHHHHhccCCh------hHHHHHHHHHhhhhhcCceeEECCc--cCCC----ceEEeccccc-EEEEecCceEEeEeec Q lcl|NC_015266. 236 KYFPIVNTTQAP------TEQLAADLIVSQKRIGNLPAVRVPF--FPKR----AMMVTKLENL-SIYFQEGARRRSLIDN 302 (337) Q Consensus 236 k~~~l~n~~~~p------tE~~A~~~~~~~k~igGl~a~~vPf--fP~~----~ilvT~l~NL-sIY~Q~gs~RR~~~d~ 302 (337) ...|-...+.| ..... -....++-|+|++.++. +|.. .+++-.|++. -|+-+ +..+-...+. T Consensus 279 -l~~lkd~~G~~i~~~~~~~~~~---~~~~~~L~G~PV~~~~~~~~~~~~~~~~i~~gd~s~~~~~~~~-~~~~v~~~~~ 353 (394) T protein:vir:10 279 -LDTLKDKNGRYLLHDASDSITD---GTAKGTVLGVPVYVVGDALLGSAAGDQKAFVGDLKRGVLFADR-QQVTLAWEDS 353 (394) T ss_pred -HHHhhccCCCeeeecccccccc---CCcccccccceeEEecccccCCCCCceEEEEeeccccEEEEee-cceEEEEecc Confidence 22222222111 11000 01124788999998774 3322 2788888873 44433 3344444444 Q ss_pred cccceecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
303 PKRDQIENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 303 p~r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) ....+.--.+.|- +..|-+...++.++--..+.+ T Consensus 354 ~~~~~~~~~~~r~-d~~~~~~~ai~~~~~~~~~~~ 387 (394) T protein:vir:10 354 KIYGRYLGAAFRF-GVKQADSNAGYFVTNTDAASG 387 (394) T ss_pred cccceeEEEEEEe-ccEEeccccEEEEEeecccCC Confidence 3333322222222 234445555555442222111 No 57 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=98.00 E-value=3.5e-06 Score=50.48 Aligned_cols=324 Identities=10% Similarity=0.004 Sum_probs=160.7 Q ss_pred CChHHHHHHHHHHHHHHH--hcCc-ccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcccccccccee Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAK--LNDT-DDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASR 77 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~--~ngv-~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagR 77 (337) ...+.+..+..+....+. .+.+ .+.+-.+.|-|.+...+.+.+.+.+.++++++++++.--...........+-++- T Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~w 209 (497) T protein:vir:78 130 AAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAA 209 (497) T ss_pred HHHHHHHHHhhhhhhHHHHHhhhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCccee Confidence 111112222222222111 1111 1223356788999999999999999999999999886432222111111122222 Q ss_pred ccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceecc------- Q lcl|NC_015266. 78 TDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAA------- 150 (337) Q Consensus 78 t~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A------- 150 (337) +. .....|..-..++...+..++.---+.|+.+.|+.. |+++..|.+.+.+.++.=.-.--+||+-.. 
T Consensus 210 v~--E~~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~---~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~ 284 (497) T protein:vir:78 210 VA--EAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQ 284 (497) T ss_pred ec--cCcccccccccceeeEeeeeeeEeecHhHHHHHHhH---HHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccccc Confidence 21 122333334557777888888777889999999874 568899999998888754433333442110 Q ss_pred -----CC---------CChh--------hhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHH Q lcl|NC_015266. 151 -----LS---------TDKA--------ANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMD 208 (337) Q Consensus 151 -----~~---------TD~~--------~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~d 208 (337) .. +... ..-....+|..|+..++..+........ .+-+..+...++..++-+..- T Consensus 285 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~ 361 (497) T protein:vir:78 285 RSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGS---GSGVAGSYPTAAEIAENVFDA 361 (497) T ss_pred ccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhh---ccchhccccchhhhhhHHHHH Confidence 00 0000 0001235566676666664322222111 111122223334444433332 Q ss_pred HHhcccChhHcCCCCeEEEeChHHHHHHHHHHHh-ccC-----ChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEec Q lcl|NC_015266. 209 IVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVN-TTQ-----APTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTK 282 (337) Q Consensus 209 a~~~li~~~~r~~~dLVvivG~dLla~k~~~l~n-~~~-----~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~ 282 (337) +..+-.. ....++ +++|.+.-+.. ...+. ..+ .+....+.+.....+++-|+|++..|++|.+.+++-. 
T Consensus 362 -~~~~~~~-~~~~~~-~~vmn~~~~~~--l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd 436 (497) T protein:vir:78 362 -FVDIQLT-LFQTPN-AVVMNPRDWEL--LRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGH 436 (497) T ss_pred -Hhhhhhh-cccCCC-eEEEchHHHHH--HHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCCceEEee Confidence 3223222 333334 45666643331 22232 211 1212223344444568889999999999999999988 Q ss_pred ccccEEEE-ecCceEEeEeec--c--ccceecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 283 LENLSIYF-QEGARRRSLIDN--P--KRDQIENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 283 l~NLsIY~-Q~gs~RR~~~d~--p--~r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) ++...+.. -++..+-.+-+. + .+|.+.----.=-+..|-++++++.++-...+.| T Consensus 437 ~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~~~ 496 (497) T protein:vir:78 437 FAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATG 496 (497) T ss_pred cccceEEEEEecccEEEeecccchhhhcCcEEEEEEEeecceeeccccEEEEEecCCccC Confidence 87644432 233333322211 0 1122111111112345557777777766666666 No 58 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=98.00 E-value=3.5e-06 Score=50.48 Aligned_cols=324 Identities=10% Similarity=0.004 Sum_probs=160.7 Q ss_pred CChHHHHHHHHHHHHHHH--hcCc-ccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcccccccccee Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAK--LNDT-DDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASR 77 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~--~ngv-~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagR 77 (337) ...+.+..+..+....+. 
.+.+ .+.+-.+.|-|.+...+.+.+.+.+.++++++++++.--...........+-++- T Consensus 130 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~w 209 (497) T protein:vir:10 130 AAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAA 209 (497) T ss_pred HHHHHHHHHhhhhhhHHHHHhhhcccCcccccccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCccee Confidence 111112222222222111 1111 1223356788999999999999999999999999886432222111111122222 Q ss_pred ccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceecc------- Q lcl|NC_015266. 78 TDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAA------- 150 (337) Q Consensus 78 t~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A------- 150 (337) +. .....|..-..++...+..++.---+.|+.+.|+.. |+++..|.+.+.+.++.=.-.--+||+-.. T Consensus 210 v~--E~~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~---~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~ 284 (497) T protein:vir:10 210 VA--EAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQ 284 (497) T ss_pred ec--cCcccccccccceeeEeeeeeeEeecHhHHHHHHhH---HHHHHHHHHHHHHHHHHHHHHHhhcCCCccccccccc Confidence 21 122333334557777888888777889999999874 568899999998888754433333442110 Q ss_pred -----CC---------CChh--------hhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHH Q lcl|NC_015266. 151 -----LS---------TDKA--------ANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMD 208 (337) Q Consensus 151 -----~~---------TD~~--------~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~d 208 (337) .. +... ..-....+|..|+..++..+........ 
.+-+..+...++..++-+..- T Consensus 285 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~ 361 (497) T protein:vir:10 285 RSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGS---GSGVAGSYPTAAEIAENVFDA 361 (497) T ss_pred ccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhh---ccchhccccchhhhhhHHHHH Confidence 00 0000 0001235566676666664322222111 111122223334444433332 Q ss_pred HHhcccChhHcCCCCeEEEeChHHHHHHHHHHHh-ccC-----ChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEec Q lcl|NC_015266. 209 IVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVN-TTQ-----APTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTK 282 (337) Q Consensus 209 a~~~li~~~~r~~~dLVvivG~dLla~k~~~l~n-~~~-----~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~ 282 (337) +..+-.. ....++ +++|.+.-+.. ...+. ..+ .+....+.+.....+++-|+|++..|++|.+.+++-. T Consensus 362 -~~~~~~~-~~~~~~-~~vmn~~~~~~--l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd 436 (497) T protein:vir:10 362 -FVDIQLT-LFQTPN-AVVMNPRDWEL--LRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGH 436 (497) T ss_pred -Hhhhhhh-cccCCC-eEEEchHHHHH--HHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCCceEEee Confidence 3223222 333334 45666643331 22232 211 1212223344444568889999999999999999988 Q ss_pred ccccEEEE-ecCceEEeEeec--c--ccceecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 283 LENLSIYF-QEGARRRSLIDN--P--KRDQIENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 283 l~NLsIY~-Q~gs~RR~~~d~--p--~r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) ++...+.. -++..+-.+-+. 
+ .+|.+.----.=-+..|-++++++.++-...+.| T Consensus 437 ~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~~~~ 496 (497) T protein:vir:10 437 FAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLKKGATG 496 (497) T ss_pred cccceEEEEEecccEEEeecccchhhhcCcEEEEEEEeecceeeccccEEEEEecCCccC Confidence 87644432 233333322211 0 1122111111112345557777777766666666 No 59 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=98.00 E-value=1.1e-06 Score=53.19 Aligned_cols=291 Identities=10% Similarity=0.013 Sum_probs=153.3 Q ss_pred CChHHHHHHHHHHHHHHHhc------------CcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcc Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLN------------DTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGL 68 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~n------------gv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~ 68 (337) -+...+...+.+....-... .....+....|-|.+...+.+.+.+.+.+++.++++++.--.++.... T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 153 (385) T protein:vir:19 74 KKSFSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVRE 153 (385) T ss_pred hhhhHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEEEEE Confidence 11111122222222111100 011112234577889999999999999999999999887555444444 Q ss_pred ccccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhccccee Q lcl|NC_015266. 69 SVSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVK 148 (337) Q Consensus 69 gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s 148 (337) ...++-++-+.. +...|..-..+....+..++.--...|+.+.|+.. 
++++..+++.+.+.++.-.-.--++|.- T Consensus 154 ~~~~~~a~~v~E--~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~---~~l~~~i~~~la~a~~~~~d~~~l~G~g 228 (385) T protein:vir:19 154 EVFTNNADVVAE--KALKPESDITFSKQTANVKTIAHWVQASRQVMDDA---PMLQSYINNRLMYGLALKEEGQLLNGDG 228 (385) T ss_pred ecCCcceeeecc--CccccccccceeEEEEeeeeEEEeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 322232222211 22233334567888888898888889999988864 5788999999999888755444556632 Q ss_pred ccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEe Q lcl|NC_015266. 149 AALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVIC 228 (337) Q Consensus 149 ~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVviv 228 (337) . .+| |.-+++..... ....+..++ ..+|.++. ++.. |.+.+++.. +++| T Consensus 229 ~-------~~~-----------------~~Gi~~~~~~~--~~~~~~~~~-~~~d~i~~-~~~~-l~~~~~~~~--~~~~ 277 (385) T protein:vir:19 229 T-------GDN-----------------LEGLNKVATAY--DTSLNATGD-TRADIIAH-AIYQ-VTESEFSAS--GIVL 277 (385) T ss_pred C-------CCc-----------------ccccccccccc--ccccccccc-chHHHHHH-HHHh-hccccCCCC--EEEE Confidence 1 111 11122221111 111222222 45566554 3443 456566654 8899 Q ss_pred ChHHHHHHHHHHHhccCChhHHHHHH-HHHhhhhhcCceeEECCccCCCceEEecccc-cEEEEecCceEEeEeeccccc Q lcl|NC_015266. 229 GRELLHDKYFPIVNTTQAPTEQLAAD-LIVSQKRIGNLPAVRVPFFPKRAMMVTKLEN-LSIYFQEGARRRSLIDNPKRD 306 (337) Q Consensus 229 G~dLla~k~~~l~n~~~~ptE~~A~~-~~~~~k~igGl~a~~vPffP~~~ilvT~l~N-LsIY~Q~gs~RR~~~d~p~r~ 306 (337) .+..+.. ...+-...+.| +-.. .-..+.++-|+|++..+++|++.+++-.+++ .-|+.+.+.. =.+.+.. 
T Consensus 278 ~~~~~~~-l~~lkd~~G~~---l~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~~~~~~~~~~~~~~-v~~~~~~--- 349 (385) T protein:vir:19 278 NPRDWHN-IALLKDNEGRY---IFGGPQAFTSNIMWGLPVVPTKAQAAGTFTVGGFDMASQVWDRMDAT-VEVSRED--- 349 (385) T ss_pred cHHHHHH-HHHhhcCCCce---eccCcccCCCceecceeeEEcCcCCCCcEEEeecccEEEEEEecceE-EEEeccc--- Confidence 9876652 22222221111 0000 0113467889999999999999999988876 4454443322 1111111 Q ss_pred eecchhhhcc-cceeec-CCcEEEee----ceeeccC Q lcl|NC_015266. 307 QIENYESSND-AYVVED-FGCGCVAE----NIELVAA 337 (337) Q Consensus 307 r~e~y~s~Ne-~YvVEd-~~~~a~iE----nI~~~~a 337 (337) .+++.+|. +|.++- ++....-. -+++..| T Consensus 350 --~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa 384 (385) T protein:vir:19 350 --RDNFVKNMLTILCEERLALAHYRPTAIIKGTFSSG 384 (385) T ss_pred --cchhhcCcEEEEEEEeeccEEecccceEEEEeccC Confidence 13344443 333333 23322222 3456666 No 60 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=98.00 E-value=1.1e-06 Score=53.19 Aligned_cols=291 Identities=10% Similarity=0.013 Sum_probs=153.3 Q ss_pred CChHHHHHHHHHHHHHHHhc------------CcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcc Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLN------------DTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGL 68 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~n------------gv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~ 68 (337) -+...+...+.+....-... .....+....|-|.+...+.+.+.+.+.+++.++++++.--.++.... 
T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 153 (385) T protein:vir:18 74 KKSFSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEYVRE 153 (385) T ss_pred hhhhHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEEEEE Confidence 11111122222222111100 011112234577889999999999999999999999887555444444 Q ss_pred ccccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhccccee Q lcl|NC_015266. 69 SVSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVK 148 (337) Q Consensus 69 gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s 148 (337) ...++-++-+.. +...|..-..+....+..++.--...|+.+.|+.. ++++..+++.+.+.++.-.-.--++|.- T Consensus 154 ~~~~~~a~~v~E--~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~---~~l~~~i~~~la~a~~~~~d~~~l~G~g 228 (385) T protein:vir:18 154 EVFTNNADVVAE--KALKPESDITFSKQTANVKTIAHWVQASRQVMDDA---PMLQSYINNRLMYGLALKEEGQLLNGDG 228 (385) T ss_pred ecCCcceeeecc--CccccccccceeEEEEeeeeEEEeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHHHhccC Confidence 322232222211 22233334567888888898888889999988864 5788999999999888755444556632 Q ss_pred ccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEe Q lcl|NC_015266. 149 AALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVIC 228 (337) Q Consensus 149 ~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVviv 228 (337) . .+| |.-+++..... ....+..++ ..+|.++. ++.. |.+.+++.. 
+++| T Consensus 229 ~-------~~~-----------------~~Gi~~~~~~~--~~~~~~~~~-~~~d~i~~-~~~~-l~~~~~~~~--~~~~ 277 (385) T protein:vir:18 229 T-------GDN-----------------LEGLNKVATAY--DTSLNATGD-TRADIIAH-AIYQ-VTESEFSAS--GIVL 277 (385) T ss_pred C-------CCc-----------------ccccccccccc--ccccccccc-chHHHHHH-HHHh-hccccCCCC--EEEE Confidence 1 111 11122221111 111222222 45566554 3443 456566654 8899 Q ss_pred ChHHHHHHHHHHHhccCChhHHHHHH-HHHhhhhhcCceeEECCccCCCceEEecccc-cEEEEecCceEEeEeeccccc Q lcl|NC_015266. 229 GRELLHDKYFPIVNTTQAPTEQLAAD-LIVSQKRIGNLPAVRVPFFPKRAMMVTKLEN-LSIYFQEGARRRSLIDNPKRD 306 (337) Q Consensus 229 G~dLla~k~~~l~n~~~~ptE~~A~~-~~~~~k~igGl~a~~vPffP~~~ilvT~l~N-LsIY~Q~gs~RR~~~d~p~r~ 306 (337) .+..+.. ...+-...+.| +-.. .-..+.++-|+|++..+++|++.+++-.+++ .-|+.+.+.. =.+.+.. T Consensus 278 ~~~~~~~-l~~lkd~~G~~---l~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~~~~~~~~~~~~~~-v~~~~~~--- 349 (385) T protein:vir:18 278 NPRDWHN-IALLKDNEGRY---IFGGPQAFTSNIMWGLPVVPTKAQAAGTFTVGGFDMASQVWDRMDAT-VEVSRED--- 349 (385) T ss_pred cHHHHHH-HHHhhcCCCce---eccCcccCCCceecceeeEEcCcCCCCcEEEeecccEEEEEEecceE-EEEeccc--- Confidence 9876652 22222221111 0000 0113467889999999999999999988876 4454443322 1111111 Q ss_pred eecchhhhcc-cceeec-CCcEEEee----ceeeccC Q lcl|NC_015266. 307 QIENYESSND-AYVVED-FGCGCVAE----NIELVAA 337 (337) Q Consensus 307 r~e~y~s~Ne-~YvVEd-~~~~a~iE----nI~~~~a 337 (337) .+++.+|. +|.++- ++....-. 
-+++..| T Consensus 350 --~~~~~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa 384 (385) T protein:vir:18 350 --RDNFVKNMLTILCEERLALAHYRPTAIIKGTFSSG 384 (385) T ss_pred --cchhhcCcEEEEEEEeeccEEecccceEEEEeccC Confidence 13344443 333333 23322222 3456666 No 61 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=97.98 E-value=1e-06 Score=53.37 Aligned_cols=279 Identities=10% Similarity=0.048 Sum_probs=162.3 Q ss_pred CChHHHHHHHHHHHHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceeccC Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~t 80 (337) |+-++-...+... ..+..-.|-+++.+.+.+.+.+.|.+++..+++++.-..+..+-...+++.++-... T Consensus 1 m~~~~~~~~~~~~----------t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E 70 (297) T protein:vir:95 1 MTVQTFNPENVLV----------SQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGISAYWVNE 70 (297) T ss_pred CCccccccccccc----------cCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCceeEEeec Confidence 5544433332211 112233688888899999999999999999999886554555555555666655543 Q ss_pred CCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhhh Q lcl|NC_015266. 81 TKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANPL 160 (337) Q Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nPl 160 (337) +. . .|..-...+...+.+++.---..|+.+.|+... 
++|+..+++.+.+.++...-.-.++|+-....+ T Consensus 71 g~-~-~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~--~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~------- 139 (297) T protein:vir:95 71 TE-K-IKTDKPEVVPVTLKAHKLGIILVTSREALNYTW--KKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFAN------- 139 (297) T ss_pred Cc-c-ccccccceeEEEEeeEEEEEeehhhHHHHhcCH--HHHHHHHHHHHHHHHHHHHHHHHhcccCCcccc------- Confidence 32 2 233335677888888888888899999888653 589999999999999888887778886422111 Q ss_pred hhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHHHH Q lcl|NC_015266. 161 LQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPI 240 (337) Q Consensus 161 lqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~~l 240 (337) -++.... ......+.+-+|++|-. ++..+. +.+.+. -+++|.++.... ...| T Consensus 140 ------------------gi~~~~~--~~~~~~~~~~t~~~i~~----~~~~l~-~~~~~~--~~~v~~~~~~~~-L~~l 191 (297) T protein:vir:95 140 ------------------SVAKAAK--DANKVIGGPINYDNILK----LQDALY-DADVEP--NAFVSKIQNRSA-LREA 191 (297) T ss_pred ------------------ccccccc--ccceecccccCHHHHHH----HHHHhh-hccCCc--CEEEEcHHHHHH-HHHh Confidence 1222111 00111122224554443 444443 334333 378999987663 2234 Q ss_pred HhccCChhHHHHHHHHHhhhhhcCceeEECCc--cCCCceEEecccccEEEEecCceEEeEeeccc-------------- Q lcl|NC_015266. 241 VNTTQAPTEQLAADLIVSQKRIGNLPAVRVPF--FPKRAMMVTKLENLSIYFQEGARRRSLIDNPK-------------- 304 (337) Q Consensus 241 ~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPf--fP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~-------------- 304 (337) -...+.|- ...+..++-|+|++..|. .+++.+++-.++++-+ ...+..+-.+.++.. T Consensus 192 ~d~~G~~i------~~~~~~~l~G~Pv~~~~~~~~~~~~~~~gd~s~~~~-~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 264 (297) T protein:vir:95 192 RDGNKVSI------YDKAANTIDGITTVDLKSARFEKGDLLAGDFDNLIY-GVPYNITYKISEEGQISTITNADGTPINL 264 (297) T ss_pred hccCCcee------ecCCCCcccceeeEeecCCCCCCceEEEEecccEEE-EEecCeEEEEeeccccccccccCccchhh Confidence 33322220 112345788999986554 6888999999998754 444444444433322 Q ss_pred --cceecchhhhcccceeecCCcEEEeeceeec Q lcl|NC_015266. 
305 --RDQIENYESSNDAYVVEDFGCGCVAENIELV 335 (337) Q Consensus 305 --r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~ 335 (337) ++.+.-.-...-++.|-+.+++|.+...+=+ T Consensus 265 ~~~~~~~~r~~~~~d~~v~~~~a~~~l~~at~~ 297 (297) T protein:vir:95 265 FEQEMIAIRATMDIAVMITKTDAFAKLTPAERV 297 (297) T ss_pred hhcCcEEEEEEEEeccEeecccceEEEeecCCC Confidence 1222111112345666677777665432222 No 62 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=97.92 E-value=4e-06 Score=50.20 Aligned_cols=288 Identities=13% Similarity=0.031 Sum_probs=146.8 Q ss_pred CChHHHHHHHHHHHHH-----HHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccc Q lcl|NC_015266. 1 MKKETRQAYRKYAAQI-----AKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIA 75 (337) Q Consensus 1 M~~~tr~~~~~y~~~~-----a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~ia 75 (337) -..+-|..|..|+... +...++....-.|.|-+.+.+.+.+.+.+.+.+.+..+++++.- +-++-+-..++.+ T Consensus 119 ~~~e~r~a~~~~l~~~~~~~e~~a~~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~~~~--~~~~p~~~~~~~a 196 (434) T protein:vir:62 119 KETEIRSVFANYIVGNIDEKEARALGLVTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGVKTKE--NIKYPVLVKKAEA 196 (434) T ss_pred HHHHHHHHHHHHhccccchhhhhhhcccccccceecchhhHHHHHHhhhhhhhhhhhcceeccCC--ceEEEEEecCCcc Confidence 1223355566665421 22223333334667777778889999999999988888877641 1122221222222 Q ss_pred eecc-CCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCC Q lcl|NC_015266. 76 SRTD-TTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTD 154 (337) Q Consensus 76 gRt~-t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD 154 (337) +-.. .+.....|..-..++...+..++.---..|+.+.|+.- ..+|+..+++.+.++++.=.-.--+||+-.. 
T Consensus 197 ~~~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~---- 270 (434) T protein:vir:62 197 QGHKNERTNNEMPETDIEFDEIELSPTEFDALATVTKKLLART--GLPIEQIVMDELKKAYVRKETQYMVNGDEAN---- 270 (434) T ss_pred cceecccccccccccccceeeEEeeheeeEeehhhHHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHHhccCCCC---- Confidence 1111 11122223333445566677777666678888888864 2479999999999999876666666875422 Q ss_pred hhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHH Q lcl|NC_015266. 155 KAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLH 234 (337) Q Consensus 155 ~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla 234 (337) +|. +| ++... .+..++.+ =...|.++ +++.+ |++.|+... +++|.+..+. T Consensus 271 ---~~~-----~g------------~~~~~-----~~~~~~~~-~~~~d~l~-~l~~~-l~~~~~~~a--~~v~n~~~~~ 320 (434) T protein:vir:62 271 ---NIN-----DG------------ALAKK-----AVEFKTDE-KNLYDALV-KMKNT-PVKEVRKKA--RWVLNTAALT 320 (434) T ss_pred ---ccc-----cc------------eeecc-----cccccccc-cchhhHHH-HHHhh-cchhhhcCC--EEEEcHHHHH Confidence 221 01 11111 11111111 12345554 46665 477787655 8899888765 Q ss_pred HHHHHHHhccCCh----hHHHHHHHHHhhhhhcCceeEECCccCCCc------eEEecccccEEEEecCceEEeEeeccc Q lcl|NC_015266. 235 DKYFPIVNTTQAP----TEQLAADLIVSQKRIGNLPAVRVPFFPKRA------MMVTKLENLSIYFQEGARRRSLIDNPK 304 (337) Q Consensus 235 ~k~~~l~n~~~~p----tE~~A~~~~~~~k~igGl~a~~vPffP~~~------ilvT~l~NLsIY~Q~gs~RR~~~d~p~ 304 (337) . ...+-...+.| .-.... ....+|-|+|++..+++|... |++=.|+..-|+-..|...-...+ T Consensus 321 ~-L~~lkd~~G~~l~~~~~~~~~---g~~~tl~G~pV~~~~~~~~~~~~~~~~i~~Gdfs~~~i~~~~g~~~i~~~~--- 393 (434) T protein:vir:62 321 K-IETMKTDDGFPLLRPFNQAEG---GIGYTLLGFPVEEEDAIDIPDSPDTPVFYFGDFSKFYIQDVIGSLEVQKLV--- 393 (434) T ss_pred H-HHHhhccCCCEeeccCCCccC---CCCceecceeeEEecCccCccCCCceEEEEeeccceEEEEeeceeEEEeeh--- Confidence 2 22222222222 100000 112478899999999999765 555455544444333433222111 Q ss_pred cceecchhhhc-ccceeecCCcEEEe---e-----ceeeccC Q lcl|NC_015266. 
305 RDQIENYESSN-DAYVVEDFGCGCVA---E-----NIELVAA 337 (337) Q Consensus 305 r~r~e~y~s~N-e~YvVEd~~~~a~i---E-----nI~~~~a 337 (337) +.|...| .+|.++..--+-+| + .+++-.| T Consensus 394 ----~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~ 431 (434) T protein:vir:62 394 ----ELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAP 431 (434) T ss_pred ----hhhcccCceEEEEEeeecceeecCcccceEEEEEeccC Confidence 2222222 24555443212222 1 2222223 No 63 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=97.91 E-value=5.2e-06 Score=49.54 Aligned_cols=292 Identities=11% Similarity=0.041 Sum_probs=159.6 Q ss_pred CChH-----HHHHHHHHHHHHHHhcC--c-ccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcccccc Q lcl|NC_015266. 1 MKKE-----TRQAYRKYAAQIAKLND--T-DDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSG 72 (337) Q Consensus 1 M~~~-----tr~~~~~y~~~~a~~ng--v-~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g 72 (337) |++. .+..|..+....+..+. + ......+.|-+++...+.+.+.+.|.++++.+++++.-... ++-.-.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~-~~p~~~~~ 79 (324) T protein:vir:78 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEK-KFTFWADK 79 (324) T ss_pred CCcchhhhHHHHHHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCce-EEEEEecC Confidence 6544 33334444433333222 1 12233556777888999999999999999999988763221 22222233 Q ss_pred ccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCC Q lcl|NC_015266. 73 PIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALS 152 (337) Q Consensus 73 ~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~ 152 (337) +-++=+. .....|..-..++...+..++.---..|+.+.|+... ++|+..+.+.+.+.++.-.-.-.++|+-.. 
T Consensus 80 ~~a~~v~--Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~--~~l~~~i~~~la~ai~~~~d~a~l~G~g~~-- 153 (324) T protein:vir:78 80 PGAYWVG--EGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNN-- 153 (324) T ss_pred cceeEec--CCccccccccceeEEEEeeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhccCCCC-- Confidence 4443332 2233344445678888888988888889998888543 689999999999999888777778885411 Q ss_pred CChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHH Q lcl|NC_015266. 153 TDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) Q Consensus 153 TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dL 232 (337) ..| .-++.... .......+.-.|.+|-.+ +.. |++.+++.. +++|.+.. T Consensus 154 ~~~----------------------~gi~~~~~--~~~~~~~~~~t~~~i~~~----~~~-l~~~~~~~~--~~vmn~~~ 202 (324) T protein:vir:78 154 PFG----------------------KSIAQSIE--KTNKVIKGDFTQDNIIDL----EAL-LEDDELEAN--AFISKTQN 202 (324) T ss_pred CcC----------------------cccccccc--ccceeccccccHHHHHHH----HHh-hhhccCCCC--EEEEcHHH Confidence 111 11111110 000111122234444443 333 355555544 68888876 Q ss_pred HHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCc--cCCCceEEecccccEEEEecCceEEeEeeccc------ Q lcl|NC_015266. 233 LHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPF--FPKRAMMVTKLENLSIYFQEGARRRSLIDNPK------ 304 (337) Q Consensus 233 la~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPf--fP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~------ 304 (337) ... ...+-...+.|- +.. ....++-|+|++..|. .+++.+++-.++++- +-..+..+-.+.++.. T Consensus 203 ~~~-L~~l~d~~G~~~--~~~---~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~-~g~~~~~~i~~~~~~~~~~~~~ 275 (324) T protein:vir:78 203 RSL-LRKIVDPETKER--IYD---RNSDSLDGLPVVNLKSSNLKRGELITGDFDKLI-YGIPQLIEYKIDETAQLSTVKN 275 (324) T ss_pred HHH-HHHhhccCCCee--ecC---CCCCcccceeeEeeCCCCCCcceEEEEecceEE-EEEecCcEEEEeeccccccccc Confidence 653 222322222221 110 1245789999999887 455568888888853 3333444433333321 Q ss_pred --cceecchhh--------hcccceeecCCcEEEeeceeecc-C Q lcl|NC_015266. 
305 --RDQIENYES--------SNDAYVVEDFGCGCVAENIELVA-A 337 (337) Q Consensus 305 --r~r~e~y~s--------~Ne~YvVEd~~~~a~iEnI~~~~-a 337 (337) -..+..|++ .--+..|-+++++|.+...+-+. | T Consensus 276 ~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~ 319 (324) T protein:vir:78 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDS 319 (324) T ss_pred ccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccccCCC Confidence 111111211 12355667777777665533332 2 No 64 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=97.91 E-value=5.2e-06 Score=49.54 Aligned_cols=292 Identities=11% Similarity=0.041 Sum_probs=159.6 Q ss_pred CChH-----HHHHHHHHHHHHHHhcC--c-ccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcccccc Q lcl|NC_015266. 1 MKKE-----TRQAYRKYAAQIAKLND--T-DDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSG 72 (337) Q Consensus 1 M~~~-----tr~~~~~y~~~~a~~ng--v-~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g 72 (337) |++. .+..|..+....+..+. + ......+.|-+++...+.+.+.+.|.++++.+++++.-... ++-.-.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~-~~p~~~~~ 79 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEK-KFTFWADK 79 (324) T ss_pred CCcchhhhHHHHHHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCce-EEEEEecC Confidence 6544 33334444433333222 1 12233556777888999999999999999999988763221 22222233 Q ss_pred ccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCC Q lcl|NC_015266. 73 PIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALS 152 (337) Q Consensus 73 ~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~ 152 (337) +-++=+. .....|..-..++...+..++.---..|+.+.|+... ++|+..+.+.+.+.++.-.-.-.++|+-.. 
T Consensus 80 ~~a~~v~--Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~--~~l~~~i~~~la~ai~~~~d~a~l~G~g~~-- 153 (324) T protein:vir:96 80 PGAYWVG--EGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNN-- 153 (324) T ss_pred cceeEec--CCccccccccceeEEEEeeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhccCCCC-- Confidence 4443332 2233344445678888888988888889998888543 689999999999999888777778885411 Q ss_pred CChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHH Q lcl|NC_015266. 153 TDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) Q Consensus 153 TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dL 232 (337) ..| .-++.... .......+.-.|.+|-.+ +.. |++.+++.. +++|.+.. T Consensus 154 ~~~----------------------~gi~~~~~--~~~~~~~~~~t~~~i~~~----~~~-l~~~~~~~~--~~vmn~~~ 202 (324) T protein:vir:96 154 PFG----------------------KSIAQSIE--KTNKVIKGDFTQDNIIDL----EAL-LEDDELEAN--AFISKTQN 202 (324) T ss_pred CcC----------------------cccccccc--ccceeccccccHHHHHHH----HHh-hhhccCCCC--EEEEcHHH Confidence 111 11111110 000111122234444443 333 355555544 68888876 Q ss_pred HHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCc--cCCCceEEecccccEEEEecCceEEeEeeccc------ Q lcl|NC_015266. 233 LHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPF--FPKRAMMVTKLENLSIYFQEGARRRSLIDNPK------ 304 (337) Q Consensus 233 la~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPf--fP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~------ 304 (337) ... ...+-...+.|- +.. ....++-|+|++..|. .+++.+++-.++++- +-..+..+-.+.++.. T Consensus 203 ~~~-L~~l~d~~G~~~--~~~---~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~-~g~~~~~~i~~~~~~~~~~~~~ 275 (324) T protein:vir:96 203 RSL-LRKIVDPETKER--IYD---RNSDSLDGLPVVNLKSSNLKRGELITGDFDKLI-YGIPQLIEYKIDETAQLSTVKN 275 (324) T ss_pred HHH-HHHhhccCCCee--ecC---CCCCcccceeeEeeCCCCCCcceEEEEecceEE-EEEecCcEEEEeeccccccccc Confidence 653 222322222221 110 1245789999999887 455568888888853 3333444433333321 Q ss_pred --cceecchhh--------hcccceeecCCcEEEeeceeecc-C Q lcl|NC_015266. 
305 --RDQIENYES--------SNDAYVVEDFGCGCVAENIELVA-A 337 (337) Q Consensus 305 --r~r~e~y~s--------~Ne~YvVEd~~~~a~iEnI~~~~-a 337 (337) -..+..|++ .--+..|-+++++|.+...+-+. | T Consensus 276 ~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~ 319 (324) T protein:vir:96 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDS 319 (324) T ss_pred ccccchhhhhcCcEEEEEEEEEccEEecccceEEEecccccCCC Confidence 111111211 12355667777777665533332 2 No 65 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=97.89 E-value=2.4e-06 Score=51.38 Aligned_cols=272 Identities=15% Similarity=0.107 Sum_probs=157.1 Q ss_pred cCccc------ccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceeccCCCccccccccccc Q lcl|NC_015266. 20 NDTDD------VSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTDTTKAERQPIDPTAL 93 (337) Q Consensus 20 ngv~~------~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~t~~~~R~~~~~~~l 93 (337) .|-+. ......|-+.+.+.+.+.+++.+.+++..+++++.-...... -.+++-++=+ +.....|..-..+ T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~--~~~~~~a~~v--~E~~~~~~~~~~f 76 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKPEEEFT--FMSGVGAFWV--DEAERIQTSKPTF 76 (299) T ss_pred CCcCCCcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecCCCcEEEE--EEcCCceeee--ecCccccccccce Confidence 34321 112346888899999999999999999999999874433322 2233434332 2233334444668 Q ss_pred CccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhhhhhccchhHHHHHH Q lcl|NC_015266. 94 DSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANPLLQDVNIGWLQQYR 173 (337) Q Consensus 94 ~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nPllqDVNkGWlq~~R 173 (337) +...+..++.--...|+.+.|+.= .++|+..+.+.+.+.++.-.-.-=++|+-. ..| .|.++... 
T Consensus 77 ~~v~l~~~k~~~~~~is~ell~ds--~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~-------~~~------~gil~~~~ 141 (299) T protein:vir:41 77 TKAKMRSKKMGVIIPTTKENLNYS--VTNFFSLMQAEIVEAFYKKFDQAVFTGVES-------PYN------WNILKSAT 141 (299) T ss_pred eEEEEeeEEEEEeehhhHHHHhcC--HHHHHHHHHHHHHHHHHHHHHHHHhhcccC-------ccc------cccccccc Confidence 888999999988999999999842 268999999999999887666666688531 122 24444211 Q ss_pred hhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHHHHHhccCChhHHHHH Q lcl|NC_015266. 174 DRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNTTQAPTEQLAA 253 (337) Q Consensus 174 e~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~~l~n~~~~ptE~~A~ 253 (337) . + ......+. .++|.+ .+++.. +.+.++... +++|.++.... ...+-...+.|--.- T Consensus 142 ~---------~-----~~~~~~~~--~~~~~l-~~~~~~-l~~~~~~~~--~~v~n~~~~~~-L~~lkd~~G~~l~~~-- 198 (299) T protein:vir:41 142 D---------A-----SNLVEETA--NKYDDL-NEAIGL-IEAEDLEPN--GIATIRKQRVK-YRSTKDGNGMPIFNT-- 198 (299) T ss_pred c---------c-----ceeecccc--ccHHHH-HHHHHh-hhcccCCcC--EEEEcHHHHHH-HHHhhccCCceeecC-- Confidence 1 0 00111111 234443 445654 456666544 68899987553 333333332321100 Q ss_pred HHHHhhhhhcCceeEECCccCCCc----eEEecccccEEEEecCceEEeEeecccccee-------cchhhh-------- Q lcl|NC_015266. 254 DLIVSQKRIGNLPAVRVPFFPKRA----MMVTKLENLSIYFQEGARRRSLIDNPKRDQI-------ENYESS-------- 314 (337) Q Consensus 254 ~~~~~~k~igGl~a~~vPffP~~~----ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~-------e~y~s~-------- 314 (337) .......++-|+|++..+++|.+. +++-.++++-|....+ .+-.+.++...... .+++.+ T Consensus 199 ~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~gdfs~~~i~~~~~-~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~ 277 (299) T protein:vir:41 199 ATSNGVDDVLGLPIAYTPKYTFGDKDISELVGDWNQAYYGILRG-VEYEILTEATLTTVADETGKPLNLAERDMAAIKAT 277 (299) T ss_pred CcCCCCceecceeeEEecccCCCCCceEEEEEecccEEEEEecC-cEEEEeecccccccccccccchhhhhcCcEEEEEE Confidence 011123578899999999999997 9999999876544443 33333333221111 111222 Q ss_pred -cccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
315 -NDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 315 -Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) --+..|.++.+++.+ +...| T Consensus 278 ~~~d~~v~~~~A~~~l---~~~aa 298 (299) T protein:vir:41 278 FEVGFMVVKDEAFSAV---QPKAG 298 (299) T ss_pred EEeccEEecccceEEE---EeccC Confidence 234456666666655 33333 No 66 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=97.87 E-value=6.6e-06 Score=48.98 Aligned_cols=280 Identities=11% Similarity=0.097 Sum_probs=146.7 Q ss_pred CChHHHHHHHHHHHHHH------------------HhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhh Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIA------------------KLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELE 62 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a------------------~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~ 62 (337) ....+...++++...+- ...+....+-.+.|-++....+.+.+.+.+.+++.++++++...+ T Consensus 87 ~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 166 (397) T protein:vir:12 87 NEERQQQYSKAFLKGLRGKRLTDEERDLLDSPEFRAMSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVTVEPVTTRS 166 (397) T ss_pred hhHHHHHHHHHHHHHHhccCCcHHHHHHHhhhhhhhccccccccCcccCchhHHHHHHHhhhhhhhHHhhcceeeccCCc Confidence 11111112222221111 011222223455676777888999999999999999999999888 Q ss_pred hhhh-ccccccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhh Q lcl|NC_015266. 63 GEKL-GLSVSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIM 141 (337) Q Consensus 63 Ge~v-~~gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~ 141 (337) |+.. ....+++.+.-...+.. ....+...++...+.+++.---..|+.+.|.... .+|+..+.+.+.++++.-.-. 
T Consensus 167 ~~~~~~~~~~~~~a~~v~Eg~~-~~~~~~~~~~~v~~~~~k~~~~~~is~e~l~ds~--~~l~~~i~~~l~~~~~~~~d~ 243 (397) T protein:vir:12 167 GTRLLEKNADMVPFSPVEELGN-LPEIDQPRFTKVSYSIIDYGGIMTLSNSMLNDSD--QAIMTYVAKWFAKKSVVTRNN 243 (397) T ss_pred eeEEEEEecCCcceeeeccccc-ccccccccceeEEeeheeeEeeehhhHHHHhhch--HHHHHHHHHHHHHHHHHHHHH Confidence 8753 33334444433332221 1112334566667777777777889998886433 478888999998888876666 Q ss_pred hcccceeccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCC Q lcl|NC_015266. 142 IGWNGVKAALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQED 221 (337) Q Consensus 142 IGfNG~s~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~ 221 (337) --++|+.... |. |. .+.|.++. ++...+++.|+.. T Consensus 244 ~il~G~g~~~-------------------------~~----------g~---------~~~~~i~~-~~~~~l~~~~~~~ 278 (397) T protein:vir:12 244 LILAAIASLK-------------------------KV----------DI---------DGLDGIKK-ALNVTLDPMVAPG 278 (397) T ss_pred HHHhcccccc-------------------------cc----------cc---------ccHHHHHH-HHhhccchhhhCC Confidence 6667753211 10 00 23455443 4555568888875 Q ss_pred CCeEEEeChHHHHHHHHHHHhccCChhHHHHHHHH-HhhhhhcCceeEECCc-cCCCc-----eEEecccccEEEEecCc Q lcl|NC_015266. 222 TGLVVICGRELLHDKYFPIVNTTQAPTEQLAADLI-VSQKRIGNLPAVRVPF-FPKRA-----MMVTKLENLSIYFQEGA 294 (337) Q Consensus 222 ~dLVvivG~dLla~k~~~l~n~~~~ptE~~A~~~~-~~~k~igGl~a~~vPf-fP~~~-----ilvT~l~NLsIY~Q~gs 294 (337) .+++|.+...+. ...+-+..+.|- ....+. ..+.++-|+|++..+. +|+.+ +++-.+++.-+..-+.. T Consensus 279 --a~~~~n~~~~~~-L~~lkd~~G~~l--~~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~ 353 (397) T protein:vir:12 279 --SIVLTNQDGYDW-LDTLKDGTGRYL--LQPDPTNPTKKLLDGRPVVPFTNRVLKTQKGKAPLIIGNLKEAIVLFDREQ 353 (397) T ss_pred --CEEEEcHHHHHH-HHHhhccCCcee--ecccccCCCCccccceeeEEecccccccCCCccEEEEEehhceEEEEeecc Confidence 588999987653 222222222220 000010 1245888999987665 44332 78888887543332233 Q ss_pred eEEeEeeccccceecchhhhcccceeecCCcEEEee--ceeeccC Q lcl|NC_015266. 
295 RRRSLIDNPKRDQIENYESSNDAYVVEDFGCGCVAE--NIELVAA 337 (337) Q Consensus 295 ~RR~~~d~p~r~r~e~y~s~Ne~YvVEd~~~~a~iE--nI~~~~a 337 (337) ..=.+.+.+. ..|..-..+|.++-+--+.... .|.++.- T Consensus 354 ~~i~~~~~~~----~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~ 394 (397) T protein:vir:12 354 QSIASTDTGA----GAFETNSTKVRGIEREDVRKWDEDAVVFGQI 394 (397) T ss_pred eEEEEecccc----chhhcCceEEEEEEeeccEEecccceEEEEE Confidence 3322222221 1122222345444432222221 1222221 No 67 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=97.87 E-value=5.6e-06 Score=49.38 Aligned_cols=293 Identities=14% Similarity=0.120 Sum_probs=141.0 Q ss_pred CCh-----HHHHHHHHHHHH------------HHHhcC-cccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhh Q lcl|NC_015266. 1 MKK-----ETRQAYRKYAAQ------------IAKLND-TDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELE 62 (337) Q Consensus 1 M~~-----~tr~~~~~y~~~------------~a~~ng-v~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~ 62 (337) +.. ..+..+..+..+ +...+. .....-.+.|-+...+.+.+.+.+++.+++..+++++.--. T Consensus 126 ~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 205 (458) T protein:vir:10 126 TQENFEDEVEKLVLLSYVMEKGVFETEHGQRHLKAVNQSSSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSKI 205 (458) T ss_pred hhhhHHHHHHHHHHHHHHHhhccchhhhhhhhhhhhhhcccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCcc Confidence 100 011111111111 000000 11112345677788999999999999999999988875322 Q ss_pred hhhhccccccccceeccCC-Ccccc---cccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhc Q lcl|NC_015266. 63 GEKLGLSVSGPIASRTDTT-KAERQ---PIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALD 138 (337) Q Consensus 63 Ge~v~~gv~g~iagRt~t~-~~~R~---~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD 138 (337) .. +..-..++-++=+.-+ ..... +.....++...+..++.--...|+.+.|+... 
++|+..+.+.+.+.++.- T Consensus 206 ~~-~~~~~~~~~a~~v~e~~~~~~~~~~~~~~~~~~~i~~~~~k~~~~v~is~ell~ds~--~~~~~~i~~~l~~~i~~~ 282 (458) T protein:vir:10 206 LT-MLVEPDAGKATWVAASTYGTDTTTGEEVKGALKEIHFSTYKLAAKSFITDETEEDAI--FSLLPLLRKRLIEAHAVS 282 (458) T ss_pred eE-EEEecCCcceeecccccccccccccccccccceeeEeeeeeEEeeehhhHHHHhcch--HHHHHHHHHHHHHHHHHH Confidence 11 1122222222222111 11111 01122355667777777778899999887643 579999999999988865 Q ss_pred hhhhcccceeccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCC---CcccccHHHHHHHHHhcccC Q lcl|NC_015266. 139 RIMIGWNGVKAALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGK---GGDYVNLDALVMDIVSSMID 215 (337) Q Consensus 139 ~i~IGfNG~s~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~---ggdy~nLDalV~da~~~li~ 215 (337) .-.--+||+-. ..| +| +++.....++....+. ..+-.+.|.++. ++.. ++ T Consensus 283 ~d~~~l~G~G~-------~~p------~G------------i~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~-~~~~-l~ 335 (458) T protein:vir:10 283 IEEAFMTGDGS-------GKP------KG------------LLTLASEDSAKVVTEAKADGSVLVTAKTISK-LRRK-LG 335 (458) T ss_pred HHHHhhcCCCC-------Ccc------ce------------eeecccccccceeecccccccccccHHHHHH-HHHh-hh Confidence 55556777431 122 23 2332222222211111 122234455543 5554 47 Q ss_pred hhHcCCCCeEEEeChHHHHHHHHHHHhcc-CChhH--HH-HHHHHHhhhhhcCceeEECCccCCCc----eEEecccc-c Q lcl|NC_015266. 216 PWFQEDTGLVVICGRELLHDKYFPIVNTT-QAPTE--QL-AADLIVSQKRIGNLPAVRVPFFPKRA----MMVTKLEN-L 286 (337) Q Consensus 216 ~~~r~~~dLVvivG~dLla~k~~~l~n~~-~~ptE--~~-A~~~~~~~k~igGl~a~~vPffP~~~----ilvT~l~N-L 286 (337) +.|+..+ +++|.+..+.. +..+... +.|-- .. .........++-|+|++...++|..+ +++--+.+ . 
T Consensus 336 ~~~~~~~--~~v~~~~~~~~--l~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~~~~~~~f~~~~ 411 (458) T protein:vir:10 336 RHGLKLS--KLVLIVSMDAY--YDLLEDEEWQDVAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAKANSAEFAVIVYKDNF 411 (458) T ss_pred hhhcCCC--EEEEcHHHHHH--HHhhcccCCceeeccccccccccCcCceecceeeEEccccccccCCcceEEEEecccE Confidence 7777654 78999887652 2233222 22210 00 01111223578899999999999863 55555543 3 Q ss_pred EEEEecCceEEeEeeccccceecchhhhc-ccceeec-CCcEEEee----ceeeccC Q lcl|NC_015266. 287 SIYFQEGARRRSLIDNPKRDQIENYESSN-DAYVVED-FGCGCVAE----NIELVAA 337 (337) Q Consensus 287 sIY~Q~gs~RR~~~d~p~r~r~e~y~s~N-e~YvVEd-~~~~a~iE----nI~~~~a 337 (337) -| +.+++.+ +. + ++|-..| .+|..|. .|..+..- -++++.+ T Consensus 412 ~~-~~~~~~~--v~----~---d~~~~~~~~~~~~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 412 VM-PRQRAVT--VE----R---ERQAGKQRDAYYVTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred EE-EEeeceE--EE----e---ecccCCCceEEEEEEEecceEecccceEEEeeccC Confidence 33 2222222 11 1 1222222 2333332 23222221 1445555 No 68 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=97.87 E-value=6.1e-06 Score=49.17 Aligned_cols=291 Identities=10% Similarity=0.017 Sum_probs=147.5 Q ss_pred CChHHHHHHHHHHH--------H-------HHHhcCcccccceeeecHHHHHH-HHHHHHhhhhhhcccccccchhhhhh Q lcl|NC_015266. 1 MKKETRQAYRKYAA--------Q-------IAKLNDTDDVSQKFAVEPSVQQT-LETKMQESSAFLKSINILPVTELEGE 64 (337) Q Consensus 1 M~~~tr~~~~~y~~--------~-------~a~~ngv~~~~~~Fsv~P~~~q~-L~~~i~ess~FL~~Inv~~V~~~~Ge 64 (337) =+...+..+..++. . -+...++...+..+.|-+.+... +...+.+++.+.+..++++. .|. 
T Consensus 217 ~~~~~~~a~~~~~~~~~~~~l~~~e~~~~~~~~~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~---~g~ 293 (543) T protein:vir:81 217 SSPAYLRAWSKMARNPHAAILTEEEKRAINEVRAMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQVVA---TGD 293 (543) T ss_pred hhhhhhhHHHHHHHhhHHHHhhhhhhhhhhhhhhcccccccCcccCchhhhhHHHHHHHhhhchhhhhcccccC---Ccc Confidence 00011111111110 0 01122333333344444555545 45667788888888877665 343 Q ss_pred -hhccccccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhc Q lcl|NC_015266. 65 -KLGLSVSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIG 143 (337) Q Consensus 65 -~v~~gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IG 143 (337) .+.+..+++.+.-+. .+...|..-..++...+..+..--.+.|+.+.|+. .++|+..+.+.+.+.++.-.-.-. T Consensus 294 ~~~~~~~~~~~a~~v~--Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d---~~~~~~~i~~~l~~~~~~~~d~ai 368 (543) T protein:vir:81 294 VWHGVSSAAVQWSWDA--EFEEVSDDSPEFGQPEIPVKKAQGFVPISIEALQD---EANVTETVALLFAEGKDELEAVTL 368 (543) T ss_pred eEEEEecCCcceeecc--cCccccccccccceeeeeeeeeEeeehhhHHHHhc---cHHHHHHHHHHHHHHHHHHHHHHH Confidence 222333344443332 22223445566888889999999999999999974 269999999999999998777777 Q ss_pred ccceeccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCC Q lcl|NC_015266. 144 WNGVKAALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTG 223 (337) Q Consensus 144 fNG~s~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~d 223 (337) |||+-.+ ..|. |. ++........+..++.+. ..+|.+ .+++.. +++.|+.. 
T Consensus 369 l~G~Gt~------~~p~------Gi------------~~~~~~~~~~~~~~~~~~-~~~~~~-~~~~~~-l~~~~~~~-- 419 (543) T protein:vir:81 369 TTGTGQG------NQPT------GI------------VTALAGTAAEIAPVTAET-FALADV-YAVYEQ-LAARHRRQ-- 419 (543) T ss_pred hccCCCC------cccc------cc------------hhhccccccccccccccc-ccHHHH-HHHHHh-hhccccCC-- Confidence 8884211 1222 22 222111111122222222 122222 234443 46777764 Q ss_pred eEEEeChHHHHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCc----------eEEecccccEEEEecC Q lcl|NC_015266. 224 LVVICGRELLHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRA----------MMVTKLENLSIYFQEG 293 (337) Q Consensus 224 LVvivG~dLla~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~----------ilvT~l~NLsIY~Q~g 293 (337) -+++|.+..+.. ...+-...+.|-=.-.. -....++-|+|++..+++|.+. |++-.++++-|....| T Consensus 420 ~~~v~n~~~~~~-l~~lkd~~G~~l~~~~~--~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~i~~gd~~~~~i~~~~~ 496 (543) T protein:vir:81 420 GAWLANNLIYNK-IRQFDTQGGAGLWTTIG--NGEPSQLLGRPVGEAEAMDANWNTSASADNFVLLYGNFQNYVIADRIG 496 (543) T ss_pred cEEEEcHHHHHH-HHHhhcCCCceeccCcC--CCCCccccceeeEEeccccccccccccCCcceEEEeeccceeEEeecc Confidence 488999987653 22222222222100000 0123578899999999999875 7788888877765544 Q ss_pred ceEEeEeeccccceecchhhhcc--------cceeecCCcEEEeeceeeccC Q lcl|NC_015266. 294 ARRRSLIDNPKRDQIENYESSND--------AYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 294 s~RR~~~d~p~r~r~e~y~s~Ne--------~YvVEd~~~~a~iEnI~~~~a 337 (337) .. 
+.-.|+...-.++..-.- |+.|-+..+++.+ ++..| T Consensus 497 ~~---i~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~A~~~l---~~~~~ 542 (543) T protein:vir:81 497 MT---VEFIPHLFGTNRRPNGSRGWFAYYRMGADVVNPNAFRLL---NVETA 542 (543) T ss_pred cE---EEEeccccccchhhcCceEEEEEEeeccEeecccceEEE---Eeccc Confidence 22 111122111111111112 3344444444443 33333 No 69 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=97.84 E-value=8.8e-06 Score=48.29 Aligned_cols=292 Identities=11% Similarity=0.047 Sum_probs=162.3 Q ss_pred CChHHHHH-----HHHHHHHHHHhcC--cc-cccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcccccc Q lcl|NC_015266. 1 MKKETRQA-----YRKYAAQIAKLND--TD-DVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSG 72 (337) Q Consensus 1 M~~~tr~~-----~~~y~~~~a~~ng--v~-~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g 72 (337) |++.-... |..++.+.+.... +. .......|-+.+...+.+.+.+.+.+++..+++++.-... ++-.-.++ T Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~-~~p~~~~~ 79 (324) T protein:vir:99 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEPMEGTEK-KFTFWADK 79 (324) T ss_pred CCCchHhhHHHHHHHHHhhhhhhccccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCce-EEEEEecC Confidence 76553333 3333333332221 11 1122346778889999999999999999999998774322 22222223 Q ss_pred ccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCC Q lcl|NC_015266. 73 PIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALS 152 (337) Q Consensus 73 ~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~ 152 (337) +-++-.. .+...|..-..++...+.+++.---..|+.+.|+... ++|+..+.+.+.++++.-.-.--++|+-. . 
T Consensus 80 ~~a~~v~--Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~ai~~~~d~~~l~G~g~--~ 153 (324) T protein:vir:99 80 PGAYWVG--EGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGN--N 153 (324) T ss_pred cceeEec--cCccccccccceeEEEEeeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhhcCCC--C Confidence 3333322 2223344446678888999998888999999999774 68999999999998776665556677431 1 Q ss_pred CChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHH Q lcl|NC_015266. 153 TDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) Q Consensus 153 TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dL 232 (337) +.| . -++.... .. .. ...+. .+.|. +.+++.. |++.+++.. +++|.+.. T Consensus 154 ~~~----~------------------~~~~~~~--~~-~~-~~~~~-~~~~~-i~~~~~~-l~~~~~~~~--~~v~n~~~ 202 (324) T protein:vir:99 154 PFG----K------------------SIAQSIE--KT-NK-VIKGD-FTQDN-IIDLEAL-LEDDELEAN--AFISKTQN 202 (324) T ss_pred ccC----c------------------ccccccc--cc-ce-ecccc-CCHHH-HHHHHHh-hhhccCCCC--EEEEcHHH Confidence 111 1 0111110 00 00 11111 12333 3345554 466666554 68888887 Q ss_pred HHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCC--ceEEecccccEEEEecCceEEeEeeccc------ Q lcl|NC_015266. 233 LHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKR--AMMVTKLENLSIYFQEGARRRSLIDNPK------ 304 (337) Q Consensus 233 la~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~--~ilvT~l~NLsIY~Q~gs~RR~~~d~p~------ 304 (337) ++. ...+-...+.+ ... -....++-|+|++..|..|.+ .+++..++++- |...+..+=.+.++.. T Consensus 203 ~~~-L~~l~d~~g~~--~~~---~~~~~~l~G~PVv~~~~~~~~~~~~i~gd~~~~~-~~~~~~~~i~~~~~~~~~~~~~ 275 (324) T protein:vir:99 203 RSL-LRKIVDPETKE--RIY---DRNSDTLDGLPVVNLKSSNLKRGELITGDFDKLI-YGIPQLIEYKIDETAQLSTVKN 275 (324) T ss_pred HHH-HHHhhcCCCce--eec---CCCCccccceeEEeecCCCCCcceEEEEecccEE-EEEecCcEEEEeeccccccccc Confidence 663 22232222221 100 012357889999999997755 58888888863 4444444444433321 Q ss_pred ----------cceecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
305 ----------RDQIENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 305 ----------r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) +|.+.---..--++.|.+.++++.+.+.+.+.. T Consensus 276 ~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~~~~ 318 (324) T protein:vir:99 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTD 318 (324) T ss_pred ccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCC Confidence 122221111224668888888888876555544 No 70 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=97.84 E-value=8.6e-06 Score=48.35 Aligned_cols=284 Identities=10% Similarity=0.125 Sum_probs=153.0 Q ss_pred CChHHHHHHHHHHHHHHHh--------cC--------cccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhh Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKL--------ND--------TDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGE 64 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~--------ng--------v~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge 64 (337) +....+.....|....... +. .......+.|-+.+...+.+.+.+.+.+++.++++++....|. T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~~~ 161 (408) T protein:vir:74 82 LNKSENELKDKFVKDFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVSTSSGS 161 (408) T ss_pred ccchhhhhHHHHHHHHHHHHhcchhhhhhhhhhhhcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeeccCCcce Confidence 2222222222222221110 11 1122346678788888999999999999999999999887776 Q ss_pred hhcc--ccccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhh Q lcl|NC_015266. 65 KLGL--SVSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMI 142 (337) Q Consensus 65 ~v~~--gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~I 142 (337) .... ...++.+..+..+. .....+...++...+.+++.---..|+.+.|+. 
...+|+..+.+.+.+.++.=.-.- T Consensus 162 ~~~~~~~~~~~~~~~v~E~~-~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~l~~~~~~~~d~~ 238 (408) T protein:vir:74 162 RVYEKWTDVTPLKAMDEEDG-KIPDLDNPRLTIIKYLIKRYAGIITATNTLLKD--TAENILAWLSSWIAKKVVVTRNQA 238 (408) T ss_pred EEEEeecCCccccccccccc-ccccccccceeeEEeeeeeEEeeehhHHHHHhh--chHHHHHHHHHHHHHHHHHHHHHH Confidence 5433 22233333332221 111123345677777777777778899998875 234788888888888877655444 Q ss_pred cccceeccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCC Q lcl|NC_015266. 143 GWNGVKAALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDT 222 (337) Q Consensus 143 GfNG~s~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~ 222 (337) -++|+.... . .|.. .+.|.++. ++...+++.|+.. T Consensus 239 il~G~G~~~------------------------------------~----~~~~---~~~~~i~~-~~~~~l~~~~~~~- 273 (408) T protein:vir:74 239 IIAAMGTVP------------------------------------K----KPTI---ANFDDVIT-MINTSVDPAIIAT- 273 (408) T ss_pred Hhhcccccc------------------------------------c----cccc---ccHHHHHH-HHHHhhhhhhcCC- Confidence 455532110 0 0111 24455554 4445678888875 Q ss_pred CeEEEeChHHHHHHHHHHHh-ccCChhHHHHHHHH-HhhhhhcCceeEECC--ccCCCc-----eEEecccccEEEEecC Q lcl|NC_015266. 223 GLVVICGRELLHDKYFPIVN-TTQAPTEQLAADLI-VSQKRIGNLPAVRVP--FFPKRA-----MMVTKLENLSIYFQEG 293 (337) Q Consensus 223 dLVvivG~dLla~k~~~l~n-~~~~ptE~~A~~~~-~~~k~igGl~a~~vP--ffP~~~-----ilvT~l~NLsIY~Q~g 293 (337) -+++|.+..+.. +..+. ..+.|- ...... ....+|-|+|++..+ ++|..+ +++-.++..-..+.++ T Consensus 274 -a~~v~n~~~~~~--l~~lkd~~G~~l--~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~ 348 (408) T protein:vir:74 274 -SSLLTNQSGLNK--LALVKTAEGKYL--LEPDPTKPNSYLIKGKQVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRE 348 (408) T ss_pred -CEEEEcHHHHHH--HHHhhcCCCceE--eccCcCCCCCceecceeeEEecCcccccccCCcceEEEEehhccEEEEEec Confidence 488999987653 22232 222221 000111 123588999999877 577543 6766777655555555 Q ss_pred ceEEeEeecc----ccceecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
294 ARRRSLIDNP----KRDQIENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 294 s~RR~~~d~p----~r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) +.+-.+-+.. .++.+--.-..--++.|-++.+++.++=-....+ T Consensus 349 ~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~ 396 (408) T protein:vir:74 349 NMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFTAIADQ 396 (408) T ss_pred ceEEEEeccccchhhcceeeEEEEEeeCcEEecccceEEEEeecccCC Confidence 5443332211 1222111111122445666666666653333333 No 71 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=97.84 E-value=6.5e-06 Score=49.02 Aligned_cols=294 Identities=11% Similarity=0.029 Sum_probs=152.5 Q ss_pred HHHHHHHHHHHHHhcCccccc-----ceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceeccC Q lcl|NC_015266. 6 RQAYRKYAAQIAKLNDTDDVS-----QKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 6 r~~~~~y~~~~a~~ngv~~~~-----~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~t 80 (337) -..++++... .-|+.... ....|-+++...+.+.+++.|..+++.+++++.- .+.++-.-.+++.++-+.. T Consensus 1 ~a~l~el~~~---~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~-~~~~~p~~~~~~~a~~v~e 76 (333) T protein:vir:78 1 MATLNELLPN---SAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISY-GETIIPTTVKRPEVGQVGV 76 (333) T ss_pred CchhHHhhhh---cccccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEeCCceeEeecC Confidence 1112222111 11221111 1225677788999999999999999999988763 2234434344444443332 Q ss_pred CC------cccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCC Q lcl|NC_015266. 81 TK------AERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTD 154 (337) Q Consensus 81 ~~------~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD 154 (337) +. ....|.....++......++.-.-..|+.+.|+. 
..++|+..+++.+.+.++.-.---.+||+-....+- T Consensus 77 g~~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~--s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~ 154 (333) T protein:vir:78 77 GTSNEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARM--NPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSA 154 (333) T ss_pred cccccccccccccccccceeEEEEeeEEEEEeehhhHHHHhc--CHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCcc Confidence 21 1233444455666677777777788888888863 225799999999999999887777788877544332 Q ss_pred hhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHH Q lcl|NC_015266. 155 KAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLH 234 (337) Q Consensus 155 ~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla 234 (337) + . |.+. ..... ..+.....+.+++ ..+|.++. ++..+.....++. -+++|.+...+ T Consensus 155 ~----~------g~~~-------~~~~~---~~~~~~~~~~~~~-~~~~~i~~-~~~~~~~~~~~~~--~~~vmn~~~~~ 210 (333) T protein:vir:78 155 L----Q------GIDT-------DNVIA---NTTNVDYLQETGD-PLLDRLLD-GYDLVSANTDVEF--NGWAVDPRFRA 210 (333) T ss_pred c----c------cccc-------ccccc---ccccccccccccc-hhHHHHHH-HHHhhccccccCc--eEEEEcchHHH Confidence 2 1 1110 00000 0111222333443 34554433 3443333333332 26777776544 Q ss_pred HH-HHH-HHhccCCh--hHHHHHHHHHhhhhhcCceeEECCccCCC---------ceEEecccccEEEEecCceEEeEee Q lcl|NC_015266. 235 DK-YFP-IVNTTQAP--TEQLAADLIVSQKRIGNLPAVRVPFFPKR---------AMMVTKLENLSIYFQEGARRRSLID 301 (337) Q Consensus 235 ~k-~~~-l~n~~~~p--tE~~A~~~~~~~k~igGl~a~~vPffP~~---------~ilvT~l~NLsIY~Q~gs~RR~~~d 301 (337) .- ... 
+-+..+.+ .+- -......++-|+|++..+++|.+ .+++..+++.-|....+ .+=.+.+ T Consensus 211 ~L~~~~~~~d~~G~~i~~~~---~~~~~~~~l~G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~~~~g~~~~-~~i~~~~ 286 (333) T protein:vir:78 211 HLLRAQAYRDANGNVDPSRI---NLAAQTGDVLGLPAQFGRAVGGDLGAAVDSKTRIIGGDFSQLKFGFADE-IRIKMSD 286 (333) T ss_pred HHHHHhhhcCCCCceeecCc---cccCCCceeeceeeEEccccCCCccccCCCccEEEEEecccEEEEEeec-cEEEEec Confidence 21 111 11111111 000 00112358889999999999976 48888888866554433 2222221 Q ss_pred c-----cccceecchhhhc---------ccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 302 N-----PKRDQIENYESSN---------DAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 302 ~-----p~r~r~e~y~s~N---------e~YvVEd~~~~a~iEnI~~~~a 337 (337) . ..-..+ ++...| -++.|.+..+++.+ +.++| T Consensus 287 ~~~~~~~~~~~~-~~~~~~~v~~r~~~r~d~~v~~~~a~~~l---~~~~a 332 (333) T protein:vir:78 287 TATLTDSGSATV-SMWQTNQIAILIEVTFGWLLGDKQAFVKF---VDDEQ 332 (333) T ss_pred ccccccccccee-ehhhcCcEEEEEEEEEccEEecccceEEE---eccCC Confidence 1 111111 222222 35566666666654 45555 No 72 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=97.79 E-value=1.3e-05 Score=47.38 Aligned_cols=292 Identities=10% Similarity=0.011 Sum_probs=162.4 Q ss_pred CChHHHH-----HHHHHHHHHHHhcC--cc-cccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcccccc Q lcl|NC_015266. 1 MKKETRQ-----AYRKYAAQIAKLND--TD-DVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSG 72 (337) Q Consensus 1 M~~~tr~-----~~~~y~~~~a~~ng--v~-~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g 72 (337) |++.... .|..+..+.+.... +. .......|-+.+.+.+.+.+.+.+.+++..+++++.-... 
++-.-.++ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~-~ip~~~~~ 79 (324) T protein:vir:97 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEK-KFTFWADK 79 (324) T ss_pred CccchhHHHHHHHHHHhhhhhhhhccccccccCCCcceechhHHHHHHHHHHhhcchhhhcceeeccCCce-EEEEEecC Confidence 7655333 33334443333221 21 1234556777788999999999999999999988763222 22222233 Q ss_pred ccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCC Q lcl|NC_015266. 73 PIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALS 152 (337) Q Consensus 73 ~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~ 152 (337) +-+.-+.. +...|..-...+...+.+++.---..|+.+.|+... ++|+..+.+.+.++++.-.-..-++|+-.. T Consensus 80 ~~a~~v~E--g~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~--~~l~~~i~~~l~~aia~~~d~a~l~G~g~~-- 153 (324) T protein:vir:97 80 PGAYWVGE--GQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQGNN-- 153 (324) T ss_pred cceeEecc--CccccccccceeEEEEeeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHhhccCCCC-- Confidence 34433322 222344446788889999999999999999998653 689999999999998887777778885411 Q ss_pred CChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHH Q lcl|NC_015266. 153 TDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) Q Consensus 153 TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dL 232 (337) .. |. -++.... .......+.-.|++|-. ++..+ .+.++... +++|.+.. 
T Consensus 154 ~~----~~------------------gi~~~~~--~~~~~~~~~~~~~~i~~----~~~~l-~~~~~~~~--~~v~n~~~ 202 (324) T protein:vir:97 154 PF----GK------------------SIAQSIE--KTNKVIKGDFTQDNIID----LEALL-EDDELEAN--AFISKTQN 202 (324) T ss_pred cc----Cc------------------ccccccc--ccceeccccCCHHHHHH----HHHhh-hhccCCCC--EEEEcHHH Confidence 11 11 1111110 01111122334544433 44433 44454443 67888877 Q ss_pred HHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccC--CCceEEecccccEEEEecCceEEeEeeccc------ Q lcl|NC_015266. 233 LHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFP--KRAMMVTKLENLSIYFQEGARRRSLIDNPK------ 304 (337) Q Consensus 233 la~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP--~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~------ 304 (337) +.. ...+-...+.+ ... -....++-|+|++..|..| .+.+++-.++++-| -..+..+=.+.++.. T Consensus 203 ~~~-L~~lkd~~g~~--~~~---~~~~~tl~G~PV~~~~~~~~~~~~~~~gd~~~~~i-~~~~~~~i~~~~~~~~~~~~~ 275 (324) T protein:vir:97 203 RSL-LRKIVDPETKE--RIY---DRNSDTLDGLPVVNLKSSNLKRGELITGDFDKLIY-GIPQLIEYKIDETAQLSTVKN 275 (324) T ss_pred HHH-HHHhhcCCCce--eec---CCCCccccceeeEeecCCCCCcceEEEEecccEEE-EEecCcEEEEeeccccccccc Confidence 653 22222222211 000 0123578899999988755 55688888888744 334444433333322 Q ss_pred ----------cceecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 305 ----------RDQIENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 305 ----------r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) +|.+.---..--++.|-+.++++.+.+.+-+.. 
T Consensus 276 ~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 318 (324) T protein:vir:97 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTD 318 (324) T ss_pred ccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCCCC Confidence 111111111223566778888887775444322 No 73 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=97.75 E-value=3.9e-06 Score=50.24 Aligned_cols=283 Identities=13% Similarity=0.033 Sum_probs=157.9 Q ss_pred cCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceeccCCCcccccccccccCcccee Q lcl|NC_015266. 20 NDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTDTTKAERQPIDPTALDSNRYR 99 (337) Q Consensus 20 ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~ 99 (337) .|+.. +-.+.|-+++.+.+.+.+++.|.++++-+++++.--. .++-.-.+++-+.-... ....|..-..++...+. T Consensus 1 m~t~t-~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~-~~ip~~~~~~~a~wv~E--~~~~~~s~~~f~~v~l~ 76 (303) T protein:vir:97 1 MGTET-SKASLFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNG-SKEFTFTLDSDIDVVAE--NGKKTHGGLSLEPVTIV 76 (303) T ss_pred CcccC-CCCeEcchhHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEEecCcceEEeec--CccccccccceeeEEee Confidence 56554 3357899999999999999999999999988876322 23333334444443332 23334444567778888 Q ss_pred eEeeccccccCHHHHHHH-hcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhhhhhccchhHHHHHHhhchh Q lcl|NC_015266. 100 CEKTDYDTAITYRKLDAW-AKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANPLLQDVNIGWLQQYRDRAGH 178 (337) Q Consensus 100 c~qtn~d~~i~y~~LD~w-A~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nPllqDVNkGWlq~~Re~a~~ 178 (337) .++.---..++-+.|.+= ...++|.+.+.+.+.++++.-+-.-.+||+.-+..++-.. +|+. 
T Consensus 77 ~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~--------~~~~--------- 139 (303) T protein:vir:97 77 PIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDV--------IGTN--------- 139 (303) T ss_pred eEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCcccccc--------cccc--------- Confidence 888888888888877432 3356899999999999999888888889964322222111 1110 Q ss_pred hhcccccccCCceecCCCcc-cccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHHHHHhccCChhHHHHHHHHH Q lcl|NC_015266. 179 RVLHEGAKEAGKVLVGKGGD-YVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNTTQAPTEQLAADLIV 257 (337) Q Consensus 179 ~v~~~~~~~~~~i~~G~ggd-y~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~~l~n~~~~ptE~~A~~~~~ 257 (337) ... +. ....+..+++.+ |.+|.+++. . +.+.+++.. .++|.+.....- ..+-...+.+--....+.-. T Consensus 140 -~~~-~~-~~~~~~~~~~~~~~~~i~~~~~----~-~~~~~~~~~--~~vmn~~~~~~L-~~lkd~~g~~~~~~~~~~~~ 208 (303) T protein:vir:97 140 -HFD-SK-VTQVVKFTESEDADANIEAAVN----L-IQGAEGVVT--GLAMDTEFSTAL-AKVTNGEMGPKMYPELAWGA 208 (303) T ss_pred -ccc-cc-cccccccccccchHHHHHHHHH----H-HhhcCCCcc--EEEEcHHHHHHH-HHhhccCCCeEEecCccCCC Confidence 000 00 011111122222 444444433 2 233333332 588888776532 22322222221100001001 Q ss_pred hhhhhcCceeEECCccCCCc--------eEEecccccEEEEecCceEEeEeeccccce-ecchhhhc---------ccce Q lcl|NC_015266. 258 SQKRIGNLPAVRVPFFPKRA--------MMVTKLENLSIYFQEGARRRSLIDNPKRDQ-IENYESSN---------DAYV 319 (337) Q Consensus 258 ~~k~igGl~a~~vPffP~~~--------ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r-~e~y~s~N---------e~Yv 319 (337) ...+|-|+|++...++|... +++=.+++.-.|..++..+-.+-+.-+-+. .-+|.-.| -++. T Consensus 209 ~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~ 288 (303) T protein:vir:97 209 NPDSINGLKSSVNTTVGAGADEAESKDLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWG 288 (303) T ss_pred CCceecceeeEEecccCCccccCCCccEEEEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEeccE Confidence 22478899999999998753 455556665555555554444433211111 11222333 4557 Q ss_pred eecCCcEEEeeceee Q lcl|NC_015266. 
320 VEDFGCGCVAENIEL 334 (337) Q Consensus 320 VEd~~~~a~iEnI~~ 334 (337) |-++.+++.+.+.++ T Consensus 289 v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 289 ILDAKSFARVTKGEV 303 (303) T ss_pred eecccceEEeeCCCC Confidence 778888888888777 No 74 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=97.73 E-value=1.8e-05 Score=46.55 Aligned_cols=297 Identities=9% Similarity=0.046 Sum_probs=149.3 Q ss_pred CChHHHHHHHHHHHHHHHhcC------------------------cccccceeeecHHHHHHHHHHHHhhhhhhcc-ccc Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLND------------------------TDDVSQKFAVEPSVQQTLETKMQESSAFLKS-INI 55 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ng------------------------v~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~-Inv 55 (337) +. .....+..|+..++..-| .....-.+.|-+.+.+.+.+.+++.+.+++. -++ T Consensus 91 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~l~~~~~i~~~~~~~ 169 (435) T protein:vir:14 91 LE-VKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGART 169 (435) T ss_pred hh-hhHHHHHHHHHHHHhhcchhhHHHHHHHhhhhhhhhhhhcccCCcCCCccccchhHHHHHHHHHhhhchhhhhccee Confidence 11 111223333332221111 1111223455556678899999988888765 334 Q ss_pred ccchhhhhhh-hccccccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHH Q lcl|NC_015266. 56 LPVTELEGEK-LGLSVSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQ 134 (337) Q Consensus 56 ~~V~~~~Ge~-v~~gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~ 134 (337) ++.. .|.. 
+-.-.+++-++-+.-+ ...|..-..++...|.+++.---..|+.+.|+.-+-.|+++..+.+.+.++ T Consensus 170 ~~~~--~~~~~~p~~~~~~~a~~v~E~--~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~l~~~i~~~l~~a 245 (435) T protein:vir:14 170 LPLS--NGNITIPRLKGGAIVGYIGAD--TDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVDQIVVGDLTAA 245 (435) T ss_pred eecC--CCceEEEEEeCCcceeeeccC--ccccccccceeEEEeeeEEEEEeehhhHHHHHhhccCHHHHHHHHHHHHHH Confidence 4432 3321 1111223333333222 122333345777888888888889999999999666678999999999999 Q ss_pred HhhchhhhcccceeccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhccc Q lcl|NC_015266. 135 SALDRIMIGWNGVKAALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMI 214 (337) Q Consensus 135 ~alD~i~IGfNG~s~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li 214 (337) ++.-.-.--+||+-.+. +| +|++. ... .......-.++.+..+.+.+.+++..+ T Consensus 246 i~~~~d~a~l~G~G~~~------~p------~Gi~~------------~~~-~~~~~~~~~~~~~~~~~~~~~~l~~~~- 299 (435) T protein:vir:14 246 IGAREDKAFIRDDGTAN------TP------KGLRF------------WAL-PSNVITASDASTLQKIETDLGKVILAL- 299 (435) T ss_pred HHHHHHHHhhccCCCCc------cc------cceee------------ccc-ccceeccccccchhhHHHHHHHHHHHh- Confidence 88555444457743221 12 24432 110 000011111222333333333333222 Q ss_pred ChhHcCCCCeEEEeChHHHHHHHHHHHhccC-ChhHHHHHHHHHhhhhhcCceeEECCccCCC--------ceEEecccc Q lcl|NC_015266. 215 DPWFQEDTGLVVICGRELLHDKYFPIVNTTQ-APTEQLAADLIVSQKRIGNLPAVRVPFFPKR--------AMMVTKLEN 285 (337) Q Consensus 215 ~~~~r~~~dLVvivG~dLla~k~~~l~n~~~-~ptE~~A~~~~~~~k~igGl~a~~vPffP~~--------~ilvT~l~N 285 (337) ...+......+++|.+..++. +..+...+ .| .-- + ....++-|+|++..+++|.+ .+++-.++. 
T Consensus 300 ~~~~~~~~~~~~v~n~~~~~~--L~~lkd~~G~~-l~~--~--~~~g~l~G~Pv~~~~~~p~~~~~~~~~~~i~~gd~s~ 372 (435) T protein:vir:14 300 ENADANLTQPGWIMAPRTFRF--LEGLRDGNGNK-VYP--E--LANGMLKGYPVGKTTQVPINLGETGKESEIYFTDFGD 372 (435) T ss_pred hhccccccCCEEEEcHHHHHH--HHHhhccCCce-ecc--C--CCCCeeecceeEeeccccccccCCCccceEEEeeccc Confidence 111112223588999987753 22333222 22 000 0 12457889999999999985 577777776 Q ss_pred cEEEEecCceEEeEeeccccc----eecchhhhc---------ccceeecCCcEEEeeceeecc Q lcl|NC_015266. 286 LSIYFQEGARRRSLIDNPKRD----QIENYESSN---------DAYVVEDFGCGCVAENIELVA 336 (337) Q Consensus 286 LsIY~Q~gs~RR~~~d~p~r~----r~e~y~s~N---------e~YvVEd~~~~a~iEnI~~~~ 336 (337) .-| ...+..+-.+.+..... .+.+|+.+| -++.|=++.+++.+.++..+. T Consensus 373 ~~i-~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:14 373 VFI-GEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVAWGA 435 (435) T ss_pred EEE-EEecccEEEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecCCCCC Confidence 433 33344443333222111 011222222 245666777777777777776 No 75 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=97.73 E-value=1.7e-05 Score=46.67 Aligned_cols=293 Identities=12% Similarity=0.048 Sum_probs=146.0 Q ss_pred CC-----------------hHHHHHHHHHHHHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhh Q lcl|NC_015266. 1 MK-----------------KETRQAYRKYAAQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEG 63 (337) Q Consensus 1 M~-----------------~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~G 63 (337) +. ...+..+... ...+. 
.+.......+.|-+...+.+...+.+.+.+++.++++++.-..+ T Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~-~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~ 162 (413) T protein:vir:81 85 AGDQIKQQAGGAQLNYSVGEYVAPRVKAA-SDPAS-TATLTDEFQGGYGTTWNRNIIYRRREKLVVADLMDNLTMTNTTI 162 (413) T ss_pred hhhHHHHHHHHHHhhhhhhhhhhhHHHhh-hhhhh-hcccccccccccchhhHHHHHHHHhhhhhHHhhcceeeccCCce Confidence 00 0000000000 00111 11112234556788889999999999999999999988875544 Q ss_pred hhhccc---cccccceeccCCCccccc-ccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhch Q lcl|NC_015266. 64 EKLGLS---VSGPIASRTDTTKAERQP-IDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDR 139 (337) Q Consensus 64 e~v~~g---v~g~iagRt~t~~~~R~~-~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~ 139 (337) ...... +...-++-+.. +...| .+...++...+..++.=-.+.|+.+.|+.. +.++..++..+.++++.=. T Consensus 163 ~~~~~~~~~~~~~~a~~v~E--g~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds---~~l~~~i~~~la~~~~~~~ 237 (413) T protein:vir:81 163 KYLMEKANRVVEGGFKTVAE--GGKKPYMRFADFDIVTESLSKIAGLTKITDEMIEDY---DFLVSYINARLLEELAIEE 237 (413) T ss_pred eEEEeccccccccccceecC--cccccccCcccceeeEeeeeeEEEeehhhHHHHHHH---HHHHHHHHHHHHHHHHHHH Confidence 322111 11111111111 11122 233345666777777766778999999875 4689999998888887665 Q ss_pred hhhcccceeccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcc-cChhH Q lcl|NC_015266. 140 IMIGWNGVKAALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSM-IDPWF 218 (337) Q Consensus 140 i~IGfNG~s~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~l-i~~~~ 218 (337) -.--+||+-. .+| -+|++.. . ....+..+++.++ .|. 
+.+++..+ ++.-+ T Consensus 238 d~~~l~G~G~-------~~~-----~~Gi~~~------------~--~~~~~~~~~~~~~--~~~-i~~~~~~~~~~~~~ 288 (413) T protein:vir:81 238 ERQLLLGDGT-------GNN-----LTGLLKR------------D--GIQTLAVSNKDEL--ADS-IYKAMTNISLATPF 288 (413) T ss_pred HHHHhccCCC-------CCc-----ccccccc------------c--ccccccccccchh--HHH-HHHHHHHhhhhccC Confidence 5556677421 111 1244431 0 0111222222222 222 23333222 22233 Q ss_pred cCCCCeEEEeChHHHHHHHHHHHhccCChh--HHH----HHHHHHhhhhhcCceeEECCccCCCceEEecccc-cEEEEe Q lcl|NC_015266. 219 QEDTGLVVICGRELLHDKYFPIVNTTQAPT--EQL----AADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLEN-LSIYFQ 291 (337) Q Consensus 219 r~~~dLVvivG~dLla~k~~~l~n~~~~pt--E~~----A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~N-LsIY~Q 291 (337) +. . .++|.+..+.. ...|-...+.|- +.. +.-......++-|+|++..+++|++.+++-.+++ +-|+.. T Consensus 289 ~~--~-~~vmn~~~~~~-l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~l~G~pv~~s~~~~~~~~~~gd~~~~~~~~~~ 364 (413) T protein:vir:81 289 QA--D-ALVINPLDYQE-LRLAKDANGQYYGGGVFQGQYGSGGIMLDPAPWGLRTVQSQVVPVGKPVVGAFRSAASVLRK 364 (413) T ss_pred CC--c-EEEEcHHHHHH-HHHhhccCCceeccccccccccccccccCceecceeeEEcCCCCcccEEEEecccEEEEEEe Confidence 32 2 46778776552 222222222220 000 0011123457889999999999999999999987 444444 Q ss_pred cCceEEeEeecc----ccceecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 292 EGARRRSLIDNP----KRDQIENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 292 ~gs~RR~~~d~p----~r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) .|. .=.+-+.. 
.++.+.-.-..--+..|-+..+++.+ ++..| T Consensus 365 ~~~-~v~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l---~~~~~ 410 (413) T protein:vir:81 365 GGV-RIDSTNTNVDDFENNLITVRAEERVGLMVTFPEAIVQL---DVAEV 410 (413) T ss_pred cce-EEEEeccccchhhcCcEEEEEEEeeccEEecccceEEE---EecCC Confidence 443 22221111 23333322222234555666666654 45666 No 76 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=97.69 E-value=1.3e-05 Score=47.43 Aligned_cols=282 Identities=12% Similarity=0.073 Sum_probs=150.3 Q ss_pred CChHHHHHHHHHHHHHHHh--cCc-ccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcc--ccccccc Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKL--NDT-DDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGL--SVSGPIA 75 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~--ngv-~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~--gv~g~ia 75 (337) ....++...+.+....-+. -++ ...+-.+.|-+.....+.+.+.+.+.+++.++++++....|..... ...++.+ T Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a 165 (395) T protein:vir:38 86 GKPDAQAMKNQFVKDFKNLVTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEKLADITPLK 165 (395) T ss_pred hhHHHHHHHHHHHHHHHHHHhhccCccCCCceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEeeccCCccc Confidence 3333333344443332211 112 2223355677777889999999999999999999998888876432 2223344 Q ss_pred eeccCCCccccc-ccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCC Q lcl|NC_015266. 76 SRTDTTKAERQP-IDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTD 154 (337) Q Consensus 76 gRt~t~~~~R~~-~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD 154 (337) +-...+ ...| .+...++...+.+++.---..|+.+.|+.. .++|+..+.+.+.+.++.-.-.-=+||+-.... 
T Consensus 166 ~~v~E~--~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~-- 239 (395) T protein:vir:38 166 DLDDES--ALIGDNDDPELTVVKYLIHRYAGITTVTNTLLKDT--VDNIIQWLVNWAAKKDVVTRNAKILEVMGKAPK-- 239 (395) T ss_pred cccccc--cccccccccceeeEEeeeeeeEeehhhHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-- Confidence 322222 1222 233456677777777777778888888752 347899999999998886554444454321100 Q ss_pred hhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHH Q lcl|NC_015266. 155 KAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLH 234 (337) Q Consensus 155 ~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla 234 (337) .+... +.|.++ ++++..+++.|+.. -+++|.+..+. T Consensus 240 --------------------------------------~~~~~---~~~~i~-~~~~~~l~~~~~~~--a~~v~n~~~~~ 275 (395) T protein:vir:38 240 --------------------------------------KPTIS---QFDNIK-DLENNTLDPAIEST--SSFITNQSGYN 275 (395) T ss_pred --------------------------------------ccccc---cHHHHH-HHHHHhhhhhhcCC--CEEEEcHHHHH Confidence 01112 234443 34545568888875 48999998765 Q ss_pred HHHHHHHhccCCh--hHHHHHHHHHhhhhhcCceeEECCccCCC------ceEEeccccc-EEEEecCceEEeEeeccc- Q lcl|NC_015266. 235 DKYFPIVNTTQAP--TEQLAADLIVSQKRIGNLPAVRVPFFPKR------AMMVTKLENL-SIYFQEGARRRSLIDNPK- 304 (337) Q Consensus 235 ~k~~~l~n~~~~p--tE~~A~~~~~~~k~igGl~a~~vPffP~~------~ilvT~l~NL-sIY~Q~gs~RR~~~d~p~- 304 (337) . ...+-...+.| ..-... ....+|-|+|++..+..|.. .+++--+++. -|+...| ..=.+.+.+. T Consensus 276 ~-L~~lkd~~G~~l~~~~~~~---~~~~~l~G~pV~~~~~~~~~~~~~~~~i~~gd~~~~~~i~~~~~-~~i~~~~~~~~ 350 (395) T protein:vir:38 276 I-LSKVKDADGRYLMQPDVTS---PDKYLIDGKPVIRIADKWLPDVSGSHPLYFGDLKQGITLFDRQQ-MQIDTTNVGAG 350 (395) T ss_pred H-HHHhhccCCceeeccCcCC---CCcceeccceeEEecccccCcCCCcceEEEEeccccEEEEEecc-eEEEEeccccc Confidence 3 22222222222 000000 12357899999998864333 3777777763 3443333 2222222211 Q ss_pred ---cceecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
305 ---RDQIENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 305 ---r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) ++.+--.-..--+..|-++.+++.++--..+.- T Consensus 351 ~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~ 386 (395) T protein:vir:38 351 SFEHDTTKLRFIDRFDVQLIDDGAFAAASFKTVANQ 386 (395) T ss_pred hhhcCceEEEEEEeeccEEecccceEEEEeecccCC Confidence 222211111112455666666666652111111 No 77 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=97.68 E-value=2.6e-05 Score=45.71 Aligned_cols=280 Identities=12% Similarity=0.119 Sum_probs=150.9 Q ss_pred CChHHHHHHHHHHHHH-----HHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcc--ccccc Q lcl|NC_015266. 1 MKKETRQAYRKYAAQI-----AKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGL--SVSGP 73 (337) Q Consensus 1 M~~~tr~~~~~y~~~~-----a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~--gv~g~ 73 (337) ++..-+..|..|+..- +.........-.+.|-..+...+.+.+.+.+.+++.++++++....|..... ...++ T Consensus 86 ~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 165 (397) T protein:vir:49 86 VKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGSRVYEKWTDITG 165 (397) T ss_pred HHHHHHHHHHHHHhcchhHHHHHhhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeecccCccceEEEeeccCCc Confidence 3344445555554321 1122222233456676778889999999999999999999999888876533 22334 Q ss_pred cceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCC Q lcl|NC_015266. 74 IASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALST 153 (337) Q Consensus 74 iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~T 153 (337) .++-+..+. .....+...++...+.+++.---+.|+.+.|+.- .++|+..+++.+.++++.-.-.--++|+..... 
T Consensus 166 ~a~~v~E~~-~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~~~- 241 (397) T protein:vir:49 166 LANIDDEAG-KIADVDDPKLSLIKYTIKRYAGISTVTNSLLADS--AENILAWLSGWIAKKVVVTRNKAILEAIAALPT- 241 (397) T ss_pred ceeeecCcc-ccccccccceeeEEeeeeeEEeeehhHHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc- Confidence 444443322 1112334556777788888877788999988753 357899999999998887665555566332110 Q ss_pred ChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHH Q lcl|NC_015266. 154 DKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELL 233 (337) Q Consensus 154 D~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLl 233 (337) .+.-.+.|.++ +++.+ |++.|+.. -+++|.+..+ T Consensus 242 ------------------------------------------~~~~~~~d~i~-~~~~~-l~~~~~~~--a~~vmn~~~~ 275 (397) T protein:vir:49 242 ------------------------------------------KPTLTKWDDII-DLEAK-VDPAIKQT--SFFLTNTSGF 275 (397) T ss_pred ------------------------------------------ccccccHHHHH-HHHHh-hhhhhcCC--CEEEEcHHHH Confidence 00012455544 45554 57777765 4889999876 Q ss_pred HHHHHHHHh-ccCChhHHHHHHHH-HhhhhhcCceeEECC--ccCCCc-----eEEecccccEEEEecCceEEeEeec-- Q lcl|NC_015266. 234 HDKYFPIVN-TTQAPTEQLAADLI-VSQKRIGNLPAVRVP--FFPKRA-----MMVTKLENLSIYFQEGARRRSLIDN-- 302 (337) Q Consensus 234 a~k~~~l~n-~~~~ptE~~A~~~~-~~~k~igGl~a~~vP--ffP~~~-----ilvT~l~NLsIY~Q~gs~RR~~~d~-- 302 (337) . + +..+. ..+.|- ....+. ....++-|+|++.++ .+|.++ +++-.|++.-..+.+++.+=..-+. T Consensus 276 ~-~-l~~lkd~~G~~l--~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~ 351 (397) T protein:vir:49 276 T-A-LKKVKNALGDYL--MERDVKSPTGYSIDGFAVKEVADRWLANGTGGAMPLYFGDLKQAVTLFDRQHMSLLSTNIGG 351 (397) T ss_pred H-H-HHHhhcCCCcee--eccCcCCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecceEEEEecccc Confidence 5 2 22332 222220 000010 123589999998865 466654 6666666632222223333222111 Q ss_pred --cccceecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
303 --PKRDQIENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 303 --p~r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) -.++.+--+-..--++.|-++..++. +++..+ T Consensus 352 ~~~~~~~~~~r~~~r~d~~~~~~~a~~~---~~~~~~ 385 (397) T protein:vir:49 352 GAFETDTTKVRVIDRFDVVATDTEAFVP---ASFKAI 385 (397) T ss_pred chhhcCceeEEEEeeeCcEEecccceEE---EEeecc Confidence 11111111111112334444444444 344443 No 78 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=97.65 E-value=1.2e-05 Score=47.58 Aligned_cols=278 Identities=10% Similarity=0.068 Sum_probs=138.1 Q ss_pred CChHH-------HHHHHHHHHHHH--------HhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhh Q lcl|NC_015266. 1 MKKET-------RQAYRKYAAQIA--------KLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEK 65 (337) Q Consensus 1 M~~~t-------r~~~~~y~~~~a--------~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~ 65 (337) +.... +.....+....+ ...++....-.+.|-+.....+.+.+.+.+.+++.++++++....|.. T Consensus 101 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 180 (400) T protein:vir:38 101 TRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNAGVKAADAASTIPETISNTPQRELQTVVDLKPFTNVFQASTQKGTY 180 (400) T ss_pred hHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhcccccCCcccccHHHHHHHHHHHHhhhhhhhcceeEeccCcceEE Confidence 00000 000111111111 112223333355666678899999999999999999999998777765 Q ss_pred hccccccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhccc Q lcl|NC_015266. 66 LGLSVSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWN 145 (337) Q Consensus 66 v~~gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfN 145 (337) -.+..+++.++-...+ ..........++...+..++.--=+.|+.+.|+. 
..++|+..+.+.+.++++.=.-.-.++ T Consensus 181 ~~~~~~~~~~~~~~E~-~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~d--s~~~~~~~i~~~l~~~~~~~~~~~i~~ 257 (400) T protein:vir:38 181 PTVANATTKMVTVAEL-EKNPAMAKPEFKPVNWSVETYRQALPVSQESIDD--SAIDLVGLIAQNGQQIKVNTTNGAVAT 257 (400) T ss_pred EEEecCCCcccccccc-ccccccccccceeeEeehhheeeehhhHHHHHhh--hHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 5443333333222221 1121122234455555555555556777777763 134788888888888776533322233 Q ss_pred ceeccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeE Q lcl|NC_015266. 146 GVKAALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLV 225 (337) Q Consensus 146 G~s~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLV 225 (337) |+... +...-.+.|.+ .+++...+++.+. -+ T Consensus 258 ~~~~~--------------------------------------------~~~~~~~~~~~-~~~~~~~~~~~~~----a~ 288 (400) T protein:vir:38 258 LLKGF--------------------------------------------TAKTISSVDDL-KHINNVDLDPAYS----RV 288 (400) T ss_pred ccccc--------------------------------------------cccccccHHHH-HHHHHhhhhhhhC----cE Confidence 32210 01111234444 3455556676542 38 Q ss_pred EEeChHHHHHHHHHHHhccCChhHHHHHHH-HHhhhhhcCceeEECCccCCCc-----eEEecccccEEEEecCceEEeE Q lcl|NC_015266. 226 VICGRELLHDKYFPIVNTTQAPTEQLAADL-IVSQKRIGNLPAVRVPFFPKRA-----MMVTKLENLSIYFQEGARRRSL 299 (337) Q Consensus 226 vivG~dLla~k~~~l~n~~~~ptE~~A~~~-~~~~k~igGl~a~~vPffP~~~-----ilvT~l~NLsIY~Q~gs~RR~~ 299 (337) ++|.+..+.. ...+-...+.|- ....+ -....++-|+|++..+.+|... +++-.|++..+.+-.....-.. T Consensus 289 ~v~~~~~~~~-l~~lkd~~G~~i--~~~~~~~~~~~~l~G~pv~~~~~~~~~~~g~~~~~~gd~s~~~~~~~~~~~~~~~ 365 (400) T protein:vir:38 289 IIASQSFYNF-LDTVKDGNGRYL--LQDSILTPSGKSVLGMPIAVVSDDTLGAAGEAHAFLGDIKRAILFANRADFMVRW 365 (400) T ss_pred EEEcHHHHHH-HHHhhccCCCee--eecCcCCCCccccccceeEEecccccCCCCceEEEEEeccccEEEEeecceEEEE Confidence 8999887653 222222212210 00000 0124588999999999999654 6777888865555444444444 Q ss_pred eeccccceecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
300 IDNPKRDQIENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 300 ~d~p~r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) .++......--.+-| -|..|-+...++. |++..+ T Consensus 366 ~~~~~~~~~~~~~~r-~d~~~~~~~a~~~---l~~~~~ 399 (400) T protein:vir:38 366 VDDQIYGQFLQAGMR-FGVSVADEKAGYF---LTYTPK 399 (400) T ss_pred ecccccceeEEEEEE-eccEEecccceEE---EEeecC Confidence 333222211111111 1222333333333 455555 No 79 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=97.63 E-value=2.9e-05 Score=45.45 Aligned_cols=284 Identities=11% Similarity=0.008 Sum_probs=147.7 Q ss_pred CChHHHHHHHHHHHHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceeccC Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~t 80 (337) ++++.|..|+++.. +..+....+.|-+++..++.+.+.+.|.++++.+++++.- +.++-...+++-++=..- T Consensus 67 lt~ee~~~~~~~~~------~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~--~~~i~~~~~~~~a~wv~e 138 (377) T protein:vir:96 67 LTAEEIKFFNDIDK------NVGGKDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL--RLKALTAETSGTAVWGDI 138 (377) T ss_pred cCHHHHHHHHHHHh------cCCCCCCceecCHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEecCCcceeEeec Confidence 66666666665542 2223344667888899999999999999999999888742 234444444444433221 Q ss_pred CCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhhh Q lcl|NC_015266. 81 TKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANPL 160 (337) Q Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nPl 160 (337) ...+.+..-..++...+.+++.---..|+++.|+.= -.+++..+++.+.+++|.=.-.--+||+-.. 
-| T Consensus 139 -~~~~~~~~~~~f~~i~l~~~kl~~~~~is~~ll~ds--~~~le~~i~~~l~~~~~~~~~~a~i~G~G~~-------~P- 207 (377) T protein:vir:96 139 -FGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFG--PKWLKQFITEQLKEAIAVALELAIVKGNGLL-------QP- 207 (377) T ss_pred -ccccccccCccceeEeeeeeeEEeechhhHHHhhcc--hhhHHHHHHHHHHHHHHHHHhhceEeccCCC-------cc- Confidence 123333333467777888888888889999999752 2358888999999988875555566775421 12 Q ss_pred hhccchhHHHHHHhhchhhhccccc--ccCCceecCC--CcccccHHHHHHHHHhccc--C--hhHcCCCCeEEEeChHH Q lcl|NC_015266. 161 LQDVNIGWLQQYRDRAGHRVLHEGA--KEAGKVLVGK--GGDYVNLDALVMDIVSSMI--D--PWFQEDTGLVVICGREL 232 (337) Q Consensus 161 lqDVNkGWlq~~Re~a~~~v~~~~~--~~~~~i~~G~--ggdy~nLDalV~da~~~li--~--~~~r~~~dLVvivG~dL 232 (337) +|+|...........-...+ ....+...|+ ..+..++--+..+++..+- . -..+-.+..|++|-+.. T Consensus 208 -----~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t 282 (377) T protein:vir:96 208 -----VGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPED 282 (377) T ss_pred -----eeeeeccccccccccccccccceeeccccccccccCChhHHHHHHHHHHHhhccccccccccccCceEEEEchhh Confidence 35554221110000000000 0001111121 1223333333444332210 0 00122346889998864 Q ss_pred HHHHH--HHHHhccCChhHHHHHHHHHhhhhhcCc--eeEECCccCCCceEEecccccEEEEecCceEEeEeecccccee Q lcl|NC_015266. 233 LHDKY--FPIVNTTQAPTEQLAADLIVSQKRIGNL--PAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNPKRDQI 308 (337) Q Consensus 233 la~k~--~~l~n~~~~ptE~~A~~~~~~~k~igGl--~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~ 308 (337) ..+-+ ....+..+.| -++.|+ +.+.-+++|++.+++-.+++--|... ++.|=..- T Consensus 283 ~~~~~~~~~~~~~~G~~------------~~~l~~p~~v~~s~~~p~~~i~fgdf~~Y~i~~r-~~~~i~~~-------- 341 (377) T protein:vir:96 283 RWTLEAKFTSRNQFGEY------------VTVLPHGITILESLAVETGKAIAFVANRYDAFMA-TASTIEEY-------- 341 (377) T ss_pred HHhccccccccCCCCCc------------eeccCCCceEEecCCCCcccEEEEEcCcEEEEEe-cccEEEee-------- Confidence 43211 1111121222 134444 47778999999999999988544433 23322111 Q ss_pred cchhhhcccceeecCCcEEEee---------------ceeec Q lcl|NC_015266. 
309 ENYESSNDAYVVEDFGCGCVAE---------------NIELV 335 (337) Q Consensus 309 e~y~s~Ne~YvVEd~~~~a~iE---------------nI~~~ 335 (337) .+.|..+|.-.+-+++ .|+++ T Consensus 342 ------~~~~~~~d~~~f~~~~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 342 ------DQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred ------hhhhhhcCCeEEEEEEEEcCEEecCCcEEEEEEecC Confidence 1234444433333322 12222 No 80 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=97.62 E-value=1.5e-05 Score=46.96 Aligned_cols=281 Identities=12% Similarity=0.023 Sum_probs=153.0 Q ss_pred HHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceeccCCCcccccccccccCc Q lcl|NC_015266. 16 IAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTDTTKAERQPIDPTALDS 95 (337) Q Consensus 16 ~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~t~~~~R~~~~~~~l~~ 95 (337) +|. +.. +.-.-|-|++...+.+.+++.|.+++..+++++.--. ..+-.-.+++-++=... ....|..-..++. T Consensus 1 ma~--~t~--~~G~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~-~~~p~~~~~~~a~wv~E--g~~~~~s~~~f~~ 73 (300) T protein:vir:95 1 MSE--AQL--SKGNLFNPELVTKVINKVKGHSSIAKLSPQKPIPFNG-QREFVFDFDSDIDIVAE--NGKKTHGGVSLDP 73 (300) T ss_pred Ccc--ccc--CCcceechhhHHHHHHHHHhhhhhhhhcceeeccCCc-eEEEEEecCcceEEeeC--Cccccccccccee Confidence 222 111 2233588899999999999999999888887765422 22333334455544332 2334555567888 Q ss_pred cceeeEeeccccccCHHHHHHHh-cCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhhhhhccchhHHHHHHh Q lcl|NC_015266. 96 NRYRCEKTDYDTAITYRKLDAWA-KFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANPLLQDVNIGWLQQYRD 174 (337) Q Consensus 96 ~~Y~c~qtn~d~~i~y~~LD~wA-~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nPllqDVNkGWlq~~Re 174 (337) ..+.+++.--.+.|+.+.|.++. ..+++++.+.+.+.+.++.=.-.-.|+|+-...-+.- +|. 
| T Consensus 74 v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~--~~~------~------- 138 (300) T protein:vir:95 74 VTIVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQAS--TII------G------- 138 (300) T ss_pred eEeeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCc--ccc------c------- Confidence 89999999999999999997775 4579999999999999997777777888532211110 000 0 Q ss_pred hchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHHHHHhccCChh--HHHH Q lcl|NC_015266. 175 RAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNTTQAPT--EQLA 252 (337) Q Consensus 175 ~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~~l~n~~~~pt--E~~A 252 (337) .....+. ....+..+..--|.+|..+ +. +++..+++-. +++|.+..... ...+-...+.|- +.. T Consensus 139 ----~~~~~~~-~~~~~~~~~~~~~~~i~~~----~~-~~~~~~~~~~--~~vmn~~~~~~-L~~lkd~~G~~i~~~~~- 204 (300) T protein:vir:95 139 ----DNCFDKK-VTQTVPFKDTNPDESMEDA----VG-MIDGSERDIT--GAILDPIFTTA-LSKMKNAEGGKLYPELA- 204 (300) T ss_pred ----ccccccc-cceeecccccchHHHHHHH----HH-HhhhcCCCcc--EEEECHHHHHH-HHHhhccCCCeeccCcc- Confidence 0000000 0000000111113344333 33 2334444433 68888876552 222222222221 100 Q ss_pred HHHHHhhhhhcCceeEECCccCCCc------eEEecccccEEEEecCceEEeEeeccccc-eecchhhhc---------c Q lcl|NC_015266. 253 ADLIVSQKRIGNLPAVRVPFFPKRA------MMVTKLENLSIYFQEGARRRSLIDNPKRD-QIENYESSN---------D 316 (337) Q Consensus 253 ~~~~~~~k~igGl~a~~vPffP~~~------ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~-r~e~y~s~N---------e 316 (337) .-....++-|+|++..+++|... +++--++++-.|--++...-++.+..+.+ .-.+|...| - T Consensus 205 --~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~GDf~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~ 282 (300) T protein:vir:95 205 --WGGVPDAINGLAVDKNRTVSYSQTDPKNTAIVGDFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYI 282 (300) T ss_pred --ccCCCceecceeeEEecCCCCCCCCCccEEEEeeccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEee Confidence 01134689999999999999876 56677776554433444444443322211 111333333 2 Q ss_pred cceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
317 AYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 317 ~YvVEd~~~~a~iEnI~~~~a 337 (337) +..|.++.+++.+.++. . T Consensus 283 d~~v~~~~a~~~l~~~~---g 300 (300) T protein:vir:95 283 GWGIMDAASFARIVKTG---G 300 (300) T ss_pred cceeecccceEEEecCC---C Confidence 34555555555553221 1 No 81 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=97.62 E-value=3.4e-05 Score=45.12 Aligned_cols=292 Identities=12% Similarity=0.056 Sum_probs=161.2 Q ss_pred CChHHHHHHH--HHHHHHH---Hhc--Ccc-cccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcccccc Q lcl|NC_015266. 1 MKKETRQAYR--KYAAQIA---KLN--DTD-DVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSG 72 (337) Q Consensus 1 M~~~tr~~~~--~y~~~~a---~~n--gv~-~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g 72 (337) |......+++ .|..... ..+ ++- .......|-+.+...+.+.+++.|.+++...++++.--. -++-.-.++ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~ip~~~~~ 79 (324) T protein:vir:93 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTE-KKFTFWADK 79 (324) T ss_pred CchhHHHHHHHHHHHHhhhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCc-eEEEEEecC Confidence 7766555543 2222211 111 111 112344677889999999999999999999888865322 122222234 Q ss_pred ccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCC Q lcl|NC_015266. 73 PIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALS 152 (337) Q Consensus 73 ~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~ 152 (337) +.++=. +.+...|..-..++...+..++.---..|+.+.|+... ++|+..+.+.+.++++.-.-.--++|.- .. 
T Consensus 80 ~~a~~v--~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~aia~~~d~a~l~G~g--~~ 153 (324) T protein:vir:93 80 PGAYWV--GEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY--SQFFEEMKPMIAEAFYKKFDEAGILNQG--NN 153 (324) T ss_pred cceeee--cCCccccccccceeEEEEEeEEEEEeehhhHHHHhcch--HHHHHHHHHHHHHHHHHHHHHHHhcCCC--CC Confidence 444332 22333344446688889999999988999999998753 6899999999999888666555677743 11 Q ss_pred CChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHH Q lcl|NC_015266. 153 TDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) Q Consensus 153 TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dL 232 (337) ..| .-++...... .. ...+. .+.|.+ .+++.. |++.+++.. +++|.+.. T Consensus 154 ~~~----------------------~~~~~~~~~~---~~-~~~~~-~~~~~i-~~~~~~-l~~~~~~~~--~~v~n~~~ 202 (324) T protein:vir:93 154 PFG----------------------KSIAQSIEKT---NK-VIKGD-FTQDNI-IDLEAL-LEDDELEAN--AFISKTQN 202 (324) T ss_pred CcC----------------------cccccccccc---ce-ecccc-ccHHHH-HHHHHh-hhhccCCCC--EEEEcHHH Confidence 111 1111111000 00 11111 123433 345554 455555544 68888877 Q ss_pred HHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCc--cCCCceEEecccccEEEEecCceEEeEeecccc----- Q lcl|NC_015266. 233 LHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPF--FPKRAMMVTKLENLSIYFQEGARRRSLIDNPKR----- 305 (337) Q Consensus 233 la~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPf--fP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r----- 305 (337) +.. ...+-...+.|- +. -....++-|+|++..|. .+.+.+++-.++++- +...+..+-.+.++... T Consensus 203 ~~~-L~~l~d~~G~~~--~~---~~~~~~l~G~PVv~~~~~~~~~~~i~~gdfs~~~-~~~~~~~~i~~~~~~~~~~~~~ 275 (324) T protein:vir:93 203 RSL-LRKIVDPETKER--IY---DRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLI-YGIPQLIEYKIDETAQLSTVKN 275 (324) T ss_pred HHH-HHHhhCCCCCee--ec---CCCCCcccceeeEeecCCCCCcceEEEEecceEE-EEEecCcEEEEeeccccccccc Confidence 663 223332222221 00 01346788999998665 556668888888763 33344333333332210 Q ss_pred --ceecchhhhcc---------cceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
306 --DQIENYESSND---------AYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 306 --~r~e~y~s~Ne---------~YvVEd~~~~a~iEnI~~~~a 337 (337) ...-+++.+|. |+.|-++++++.+...+-+.. T Consensus 276 ~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a~~~~~ 318 (324) T protein:vir:93 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTD 318 (324) T ss_pred ccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccccCC Confidence 01112223333 777888888887765444442 No 82 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=97.62 E-value=1.2e-05 Score=47.61 Aligned_cols=292 Identities=14% Similarity=0.080 Sum_probs=164.0 Q ss_pred CChHHHHHHHHHHHHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceeccC Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~t 80 (337) |+.-+. |+.-...++. ..+.+....|-|.+.+.+.+.+++.+.+++..+++++.-.+. ++-.-.+++-++-+.. T Consensus 1 ~~~~~~--~~~e~~~~~~---~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~-~ip~~~~~~~a~~v~E 74 (318) T protein:vir:24 1 MAAGTA--FAVDHAQIAQ---TGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQ-KIPHWVGDVSAQWIGE 74 (318) T ss_pred CCCCCC--CCHHHHHhhc---ccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCCce-EEEEEeCCcceEEecC Confidence 444332 2211111221 223334557888999999999999999999999988864332 2222333444433322 Q ss_pred CCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhhh Q lcl|NC_015266. 81 TKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANPL 160 (337) Q Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nPl 160 (337) ....|..-..++...+.+++.---..|+.+.|+. ..++|+..+++.+.++++.-.-.--+||+-....+. . 
T Consensus 75 --g~~~~~~~~~f~~i~~~~~k~~~~~~iS~e~l~d--s~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~-----~ 145 (318) T protein:vir:24 75 --GDMKPITKGNMTSQTIAPHKIATIFVASAETVRA--NPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTY-----I 145 (318) T ss_pred --CccccccccceeEEEEeeEEEEEeehhhHHHhhc--ChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCcc-----c Confidence 2223334456788889999988888999988875 236899999999999999777777788865221110 0 Q ss_pred hhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHHHH Q lcl|NC_015266. 161 LQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPI 240 (337) Q Consensus 161 lqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~~l 240 (337) + .... .... .+..+.=...|..+.+++. .+.+.++.. .+++|.+..... ...+ T Consensus 146 ~--------------------~~~~--~~~~-~~~~~~~~~~~~~~~~~~~-~~~~~~~~~--~~~v~n~~~~~~-L~~l 198 (318) T protein:vir:24 146 G--------------------QTTK--AISI-ADTTGATTVYDQVAVNGLS-LLVNDGKKW--THTLLDDITEPI-LNGA 198 (318) T ss_pred c--------------------cccc--cccc-cccccccchHHHHHHHHHH-hhccccCCC--CEEEEcHHHHHH-HHHh Confidence 1 0000 0000 0111111334445555554 345555554 488999987653 2223 Q ss_pred HhccCCh------hHHHHHHHHHhhhhhcCceeEECCccCCCce--EEecccccEEEEecCceEEeEeecc--------- Q lcl|NC_015266. 241 VNTTQAP------TEQLAADLIVSQKRIGNLPAVRVPFFPKRAM--MVTKLENLSIYFQEGARRRSLIDNP--------- 303 (337) Q Consensus 241 ~n~~~~p------tE~~A~~~~~~~k~igGl~a~~vPffP~~~i--lvT~l~NLsIY~Q~gs~RR~~~d~p--------- 303 (337) -...+.| ..--.. .....++-|+|++..|..|++.. ++-.++.+- |...+..+-.+-++. T Consensus 199 kd~~G~~l~~~~~~~~~~~--~~~~~~i~g~pv~~~~~~~~~~~~~~~gdfs~~~-~~~~~~l~i~~~~~~~~~~~~~~~ 275 (318) T protein:vir:24 199 KDQNGRPLFIESTYGEAAS--PFRSGRIVARPTILSDHVVEGTTVGFMGDFSQLI-WGQIGGLSFDVTDQATLNLGTVES 275 (318) T ss_pred hccCCceeecCccccCccc--cccCceEEEEeeEEeCCCCCCccEEEEeecceEE-EEEecCeEEEEeeccceecccccc Confidence 2221111 111111 11235788999999999998864 455666653 333333332222221 Q ss_pred -------ccceecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
304 -------KRDQIENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 304 -------~r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) .+|++.----.--++.|.++++++.|.++.-+.+ T Consensus 276 ~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~a~~~ 316 (318) T protein:vir:24 276 PNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVALTNVVSGGG 316 (318) T ss_pred ccchhhhhcCcEEEEEEEEEccEEecccceEEEEeeccCCC Confidence 1122211111123678889999999888888777 No 83 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=97.56 E-value=1.3e-05 Score=47.36 Aligned_cols=292 Identities=12% Similarity=0.102 Sum_probs=145.3 Q ss_pred CChHHHHHHHHHHHHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceeccC Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~t 80 (337) ++.+-|..++. +.+ +. ..+..+.|-+.+.+.+.+.+.+.|.+++.++++++.- ...+....+++-++-.. T Consensus 75 l~~ee~~~~~~----~~~--~t-~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~~--~~~i~~~~~~~~a~w~~- 144 (395) T protein:vir:95 75 LTSEERKFFND----INY--DV-GYTDEKILPETVVERVFDDLQKDHPLLSKINFQNAGI--KTRVIKADPAGQAVWGK- 144 (395) T ss_pred cchHHHHHHHH----Hhh--cc-CCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC--ceEEEEecCCcceEEee- Confidence 34444433332 211 11 1123567888889999999999999999999988742 12233323333332211 Q ss_pred CCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhhh Q lcl|NC_015266. 
81 TKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANPL 160 (337) Q Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nPl 160 (337) ...++.+..-..++...+.+++.---..|+.+.|+.= ..+++..+++.+.++++.=.-.--+||+-...+ -| T Consensus 145 e~~~~~~~~~~~f~~i~l~~~kl~~~~~iS~ell~ds--~~~ie~~i~~~la~~ia~~~~~a~i~G~G~~~~-----qP- 216 (395) T protein:vir:95 145 VFGEIKGQLDAAFREENFTQYKLTCFVVLPDDLSTFG--PAWIERFVRTQIQEAISVALESAIINGGGAAKT-----QP- 216 (395) T ss_pred cccccCccccccceeeeeceeeEEEeecccHHHHhcc--hhHHHHHHHHHHHHHHHHHHhhheeeccCCCCc-----Cc- Confidence 2233333334556777788888877788999999742 236888899999999887777767788654321 13 Q ss_pred hhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHH---hcc----cChhHcCCCCeEEEeChHHH Q lcl|NC_015266. 161 LQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIV---SSM----IDPWFQEDTGLVVICGRELL 233 (337) Q Consensus 161 lqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~---~~l----i~~~~r~~~dLVvivG~dLl 233 (337) +|+|...-.. .....++. ..+.+ .|.+++.++..+. ..+ .....+....+.++|.+... T Consensus 217 -----~Gil~~~~~~--~~~~~~~~-~~~~~------t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~ 282 (395) T protein:vir:95 217 -----VGLMKDVNTN--SGAVTDKA-SSGTL------TFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDS 282 (395) T ss_pred -----eeeeeccccc--cccccccc-ccchh------hhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhh Confidence 2665322110 00111110 01110 1333333322211 100 01112333456788887544 Q ss_pred HHHH-HHHHh-ccCChhHHHHHHHHHhhhhhc-CceeEECCccCCCceEEecccccEEEEecCceEEeEeecc--cccee Q lcl|NC_015266. 234 HDKY-FPIVN-TTQAPTEQLAADLIVSQKRIG-NLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNP--KRDQI 308 (337) Q Consensus 234 a~k~-~~l~n-~~~~ptE~~A~~~~~~~k~ig-Gl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p--~r~r~ 308 (337) .+.. .++.. ..+.|. ..+| |+|++..++||++.+++-.+++..|+.. ++.+=..-+.. 
.++++ T Consensus 283 ~~~~g~~~~~~~~G~~~-----------~~lg~g~~v~~~~~~p~~~i~fgdfs~y~i~~r-~~~~i~~~~~~~~~~d~~ 350 (395) T protein:vir:95 283 WDVQARYTYLTANGGFV-----------TVLPYNVTIITSEFVPEGKLVAFVTDRYNAVRG-GGLTVKKFDQTLALEDAV 350 (395) T ss_pred hhcCCcceeccCCCcce-----------eccCCcceEEEcCCCCCCcEEEEecccEEEEEe-cceEEEeccchhhhCCcE Confidence 4321 11111 111110 1122 7789999999999999998888666543 33332222211 01222 Q ss_pred cchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 309 ENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 309 e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) ..+-..--+-.+=|.+++.++ .|++.++ T Consensus 351 ~f~~~~r~dg~~~~~~A~~~l-~i~~~~~ 378 (395) T protein:vir:95 351 LFTAKTFAYGQPDDNKASAVY-DLKVASA 378 (395) T ss_pred EEEEEEEECCEEeccccEEEE-EeeccCC Confidence 121111122222333333332 2343443 No 84 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=97.52 E-value=3e-05 Score=45.42 Aligned_cols=296 Identities=13% Similarity=0.097 Sum_probs=140.1 Q ss_pred CChHHHHHHHHHHHHHHHh------------cCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhc- Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKL------------NDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLG- 67 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~------------ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~- 67 (337) .....+.....++...... ..-...+..+.|-+.+...+.+.+++.+.+++.+++++|....|.... T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~ 159 (404) T protein:vir:10 80 GALFVRAIADNLLKQKNQRGLNLSEKEINAISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPVFTRSGSRTYE 159 (404) T ss_pred HHHHHHHHHHHHHHHHHhhhhcchhhHHhhhccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceeeccCCccceEEE Confidence 1111122222222221111 111123345677778889999999999999999999999888876532 Q ss_pred cccccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccce Q lcl|NC_015266. 
68 LSVSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGV 147 (337) Q Consensus 68 ~gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~ 147 (337) ...+++-+.-...+...-.......++...+..++.---..|+.+.|+. ..++|+..+++.+.+.++.-.-.-=++|+ T Consensus 160 ~~~~~~~~~~v~e~~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~la~~~~~~~~~~il~G~ 237 (404) T protein:vir:10 160 KRSKQKPMKPLSENQQIPTNGDNGKLERFNFKLKDLADFMSIPNDLLKF--ADKSLEDWIINWFVDKVRITRNAEILYGA 237 (404) T ss_pred EecCCcceeeccccccccccccccceeeeEeeheeeEeeehhhHHHHhh--cHHHHHHHHHHHHHHHHHHHHHHHHhhcC Confidence 2233333434333321111111122344455555555556777777764 12368888888887777653333223552 Q ss_pred eccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEE Q lcl|NC_015266. 148 KAALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVI 227 (337) Q Consensus 148 s~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvi 227 (337) -.. .+|. | ++... +...+..++...|..|..++. ..+++-|+.. .+++ T Consensus 238 g~~------~~~~------g------------i~~~~--~~~~~~~~~~~~~~~~~~~~~----~~l~~~~~~~--~~~v 285 (404) T protein:vir:10 238 GGD------EHAT------G------------IMTAN--KFKKITLPKSPALKDFKKCKN----VELLNVFKAT--SSWI 285 (404) T ss_pred CCC------Cccc------c------------eeecc--ccceeeccccccHHHHHHHHH----hhhhccccCC--CEEE Confidence 210 1111 1 11111 111233444555655544432 2346666653 5889 Q ss_pred eChHHHHHHHHHHHhccCChh--HHHHHHHHHhhhhhcCceeEECC-ccCCCc-----eEEecccccEEEEecCceEEeE Q lcl|NC_015266. 228 CGRELLHDKYFPIVNTTQAPT--EQLAADLIVSQKRIGNLPAVRVP-FFPKRA-----MMVTKLENLSIYFQEGARRRSL 299 (337) Q Consensus 228 vG~dLla~k~~~l~n~~~~pt--E~~A~~~~~~~k~igGl~a~~vP-ffP~~~-----ilvT~l~NLsIY~Q~gs~RR~~ 299 (337) |.+..++. ...+-...+.|- .-... 
....++-|+|++.+| .+|+.+ +++-.+++.-..+..++..=.+ T Consensus 286 ~n~~~~~~-L~~lkd~~G~~l~~~~~~~---~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~~i~~ 361 (404) T protein:vir:10 286 VNQDGFNY-LDSLEDKTGRPYLQPDPKD---PTQYRFLGLPVIELPNDLLLSTESAIPVLLGDTKEAYKYVSDGAYELAT 361 (404) T ss_pred EcHHHHHH-HHHhhccCCceeeccCcCC---CCCccccceeeEEecccccCCCCCccEEEEEeccccEEEEEecceEEEE Confidence 99987652 222221111110 00000 123578899998654 466654 7777777643333333433322 Q ss_pred eeccc----cceecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 300 IDNPK----RDQIENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 300 ~d~p~----r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) .+.+. ++.+.-+-..=-++.|-++..++.+ ++..| T Consensus 362 ~~~~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~---~~~~a 400 (404) T protein:vir:10 362 TNIGAGAFETNTTKARIIMRIDGNVKDSEALLIA---EIPVE 400 (404) T ss_pred eccccchhhcCceEEEEEEeeccEEecccceEEE---Eeecc Confidence 22221 1111111111123344444444443 44444 No 85 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=97.48 E-value=4.8e-05 Score=44.25 Aligned_cols=279 Identities=10% Similarity=0.061 Sum_probs=141.7 Q ss_pred CChH----HHHHHHHHHHHHH----HhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcccccc Q lcl|NC_015266. 1 MKKE----TRQAYRKYAAQIA----KLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSG 72 (337) Q Consensus 1 M~~~----tr~~~~~y~~~~a----~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g 72 (337) +... .+..|..|+..-. 
...+.....-.|.|-+.+.+.+.+.+.+.+.+++.++++++...+|....+.-++ T Consensus 83 ~~~~~~~~~~~~~~~~lr~~~~~~~~~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 162 (389) T protein:vir:10 83 LSKKPIDAKKKAINDFIHSHGKVIDATSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKRAT 162 (389) T ss_pred cchhHHHHHHHHHHHHhhcchhhhhhhcccccCCcceeehHHHHHHHHHHHHhhhhHHhhcceeeccCCeeEEEEEecCC Confidence 2222 2345666654221 1122223334677766778889999999999999999999987777654433222 Q ss_pred ccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCC Q lcl|NC_015266. 73 PIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALS 152 (337) Q Consensus 73 ~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~ 152 (337) .-+.-. +....+.+.+...++...+..++.---+.|+.+.|+. ..++|+..+++.+.+.++.-+-. T Consensus 163 ~~~~~~-~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~la~~~~~~~~~----------- 228 (389) T protein:vir:10 163 DRFSSV-AELAENPKLAEPEFNKVDWSVATYRGAIPLSEEAIAD--SAVDLTALVGQSIKEKSVNTYNA----------- 228 (389) T ss_pred Cccccc-cccccccccccccceeeeeeheeeEeeehhhHHHHhh--hhHHHHHHHHHHHHHHHHHHHHH----------- Confidence 222111 1222232233345666667777766666788887764 23478999999988888752111 Q ss_pred CChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHH Q lcl|NC_015266. 153 TDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGREL 232 (337) Q Consensus 153 TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dL 232 (337) .++.... ++. .+......+.|.++ ++++..+++.+.. +++|.+.. 
T Consensus 229 --------------------------~i~~g~~--~~~--~~~~~~~~~~d~l~-~~~~~~~~~~~~a----~~~~n~~~ 273 (389) T protein:vir:10 229 --------------------------MIAPVLQ--SFT--AKKTTTDTLVDSLK-HILNVDLDPAYSR----ALVVTQSL 273 (389) T ss_pred --------------------------HHhhhhc--ccc--cccccccccHHHHH-HHHHhhhhhhhCc----EEEecHHH Confidence 0111100 000 01112234566665 4555567777632 78899977 Q ss_pred HHHHHHHHHh-ccCCh------hHHHHHHHHHhhhhhcCceeEECCc-cCCCc-----eEEecccccEEEEecCceEEeE Q lcl|NC_015266. 233 LHDKYFPIVN-TTQAP------TEQLAADLIVSQKRIGNLPAVRVPF-FPKRA-----MMVTKLENLSIYFQEGARRRSL 299 (337) Q Consensus 233 la~k~~~l~n-~~~~p------tE~~A~~~~~~~k~igGl~a~~vPf-fP~~~-----ilvT~l~NLsIY~Q~gs~RR~~ 299 (337) +.. +..+. ..+.| +...++ ....++-|+|++.++- +|+.. +++-.|++.-.++-++..+-.. T Consensus 274 ~~~--L~~lkd~~G~~i~~~~~~~~~~~---~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~ 348 (389) T protein:vir:10 274 FNT--LDTLKDKNGRYLLHDASDSITDG---TAKGTILGVPVYVVGDTLLGSLAGDQKAFVGDLKRGVLFTDRQQVTLAW 348 (389) T ss_pred HHH--HHHhhccCCCeeeecCccccccc---ccccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEe Confidence 542 22222 21111 111000 1235789999987653 44432 7888888854333333333333 Q ss_pred eeccccceecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
300 IDNPKRDQIENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 300 ~d~p~r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) .+.......---..| -|..|=+..+++.+ ++.++ T Consensus 349 ~~~~~~~~~~~~~~r-~d~~~~~~~a~~~~---~~~~~ 382 (389) T protein:vir:10 349 EDSKIYGKYLGAAFR-FGVQKADSKAGYFV---TNTDV 382 (389) T ss_pred eccccccceEEEEEE-eccEEecccceEEE---Eeecc Confidence 222222111111111 22233334444443 34433 No 86 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=97.48 E-value=3.8e-05 Score=44.85 Aligned_cols=287 Identities=12% Similarity=0.034 Sum_probs=145.2 Q ss_pred CChHHHHHHHHHHHHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceeccC Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~t 80 (337) ++.+.|+.|+++. + +. +....|.|-++..+++.+.+.+.|..++.++++++.- +.++....+++.++=..- T Consensus 65 lt~~e~~~~~~~~----~--~~-~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~--~~~i~~~~~~~~a~w~~e 135 (381) T protein:vir:95 65 LSANQRSFFMDIN----K--NV-NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWGKI 135 (381) T ss_pred ccHHHHHHHHHHh----c--cc-CCCCceecCHHHHHHHHHHHHhhccceeheeeEecCc--ceEEEEecCCcceeeecc Confidence 5555555444432 1 12 2233578999999999999999999999999887752 234444444444433221 Q ss_pred CCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhhh Q lcl|NC_015266. 81 TKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANPL 160 (337) Q Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nPl 160 (337) + ..+....-..+....+.+++.---..|+.+.|+. ...+++..++..+.+++|.=.-.-=.||+-. 
.-| T Consensus 136 ~-~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~D--s~~~ie~~i~~~la~~~a~~~~~a~i~G~G~-------~qP- 204 (381) T protein:vir:95 136 Y-GEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDF--GPAWIERFVRVQIEEAFAVALETAFLKGTGK-------DQP- 204 (381) T ss_pred c-ccccccccccceeeeecceeEEeechhhHHHhhc--CHHHHHHHHHHHHHHHHHHHhhheeEeccCC-------CCc- Confidence 1 2222222234666777777777778999999987 2237888999999998886554445566431 123 Q ss_pred hhccchhHHHHHHhhchhhhcccccccCCceecCCC------cccccHHHHHHHHHhcccChhHcC-----CCCeEEEeC Q lcl|NC_015266. 161 LQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKG------GDYVNLDALVMDIVSSMIDPWFQE-----DTGLVVICG 229 (337) Q Consensus 161 lqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~g------gdy~nLDalV~da~~~li~~~~r~-----~~dLVvivG 229 (337) .|+|..+-. ....+.+.. ..+...|+- .-|..|.+++..+ ..|+.. ....+++|. T Consensus 205 -----~Gil~~~~~---~~~~~~g~~-~~~~~~~t~t~~~~~~~~~~l~~~~~~~-----~~~~~~~~~~~~~~a~~~mn 270 (381) T protein:vir:95 205 -----IGLNRQVQK---GVSVTEGAY-PEKEEQGTLTFANPRATVNELTQVFKYH-----STNEKGKSVAVKGNVTMVVN 270 (381) T ss_pred -----eeeeeccCc---ccccccccc-cccccccccccccchhhHHHHHHHHHhh-----ccccccccccccCceEEEEc Confidence 344432111 011111110 011111111 1123344343332 333221 335788999 Q ss_pred hHHHHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeeccccceec Q lcl|NC_015266. 230 RELLHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNPKRDQIE 309 (337) Q Consensus 230 ~dLla~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e 309 (337) +..... ..++....+.. ++-+ ...--|.+++.-++||++.|++-.+++.-|.-. ++.+-..-+. . T Consensus 271 ~~t~~~-l~~~~~~~~~~-----G~~v--~~l~~g~~vv~s~~~p~~~iifgDfs~Y~i~~r-~~~~i~~~~~--~---- 335 (381) T protein:vir:95 271 PSDAFE-VQAQYTHLNAN-----GVYV--TALPFNLNVIESTVQEAGKVLTYVKGLYDGYLA-GGINVQKFKE--T---- 335 (381) T ss_pred cccHHh-hccccccCCCC-----Ccee--ecCCCCceEEecCCCCcCcEEEEecccEEEEEe-cccEEEeech--h---- Confidence 865442 22221111100 1100 001126778899999999999988888555433 3333211111 1 Q ss_pred chhhhcccceee--------cCCcEEEeeceeeccC Q lcl|NC_015266. 
310 NYESSNDAYVVE--------DFGCGCVAENIELVAA 337 (337) Q Consensus 310 ~y~s~Ne~YvVE--------d~~~~a~iEnI~~~~a 337 (337) -|..-..+|.+- |..+++.++ |++.++ T Consensus 336 ~~~~d~~~f~a~~r~dg~~~~~~A~~v~~-l~~~~~ 370 (381) T protein:vir:95 336 LALDDMDLYTAKQFAYGKAKDNKVAAVWK-LDLKGH 370 (381) T ss_pred HhhcCCeEEEEEEEEcCEEecCceEEEEE-EEecCC Confidence 111111222222 222333333 555555 No 87 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=97.48 E-value=3.8e-05 Score=44.85 Aligned_cols=287 Identities=12% Similarity=0.034 Sum_probs=145.2 Q ss_pred CChHHHHHHHHHHHHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceeccC Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~t 80 (337) ++.+.|+.|+++. + +. +....|.|-++..+++.+.+.+.|..++.++++++.- +.++....+++.++=..- T Consensus 65 lt~~e~~~~~~~~----~--~~-~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~--~~~i~~~~~~~~a~w~~e 135 (381) T protein:vir:10 65 LSANQRSFFMDIN----K--NV-NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWGKI 135 (381) T ss_pred ccHHHHHHHHHHh----c--cc-CCCCceecCHHHHHHHHHHHHhhccceeheeeEecCc--ceEEEEecCCcceeeecc Confidence 5555555444432 1 12 2233578999999999999999999999999887752 234444444444433221 Q ss_pred CCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhhh Q lcl|NC_015266. 81 TKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANPL 160 (337) Q Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nPl 160 (337) + ..+....-..+....+.+++.---..|+.+.|+. ...+++..++..+.+++|.=.-.-=.||+-. 
.-| T Consensus 136 ~-~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~D--s~~~ie~~i~~~la~~~a~~~~~a~i~G~G~-------~qP- 204 (381) T protein:vir:10 136 Y-GEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDF--GPAWIERFVRVQIEEAFAVALETAFLKGTGK-------DQP- 204 (381) T ss_pred c-ccccccccccceeeeecceeEEeechhhHHHhhc--CHHHHHHHHHHHHHHHHHHHhhheeEeccCC-------CCc- Confidence 1 2222222234666777777777778999999987 2237888999999998886554445566431 123 Q ss_pred hhccchhHHHHHHhhchhhhcccccccCCceecCCC------cccccHHHHHHHHHhcccChhHcC-----CCCeEEEeC Q lcl|NC_015266. 161 LQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKG------GDYVNLDALVMDIVSSMIDPWFQE-----DTGLVVICG 229 (337) Q Consensus 161 lqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~g------gdy~nLDalV~da~~~li~~~~r~-----~~dLVvivG 229 (337) .|+|..+-. ....+.+.. ..+...|+- .-|..|.+++..+ ..|+.. ....+++|. T Consensus 205 -----~Gil~~~~~---~~~~~~g~~-~~~~~~~t~t~~~~~~~~~~l~~~~~~~-----~~~~~~~~~~~~~~a~~~mn 270 (381) T protein:vir:10 205 -----IGLNRQVQK---GVSVTEGAY-PEKEEQGTLTFANPRATVNELTQVFKYH-----STNEKGKSVAVKGNVTMVVN 270 (381) T ss_pred -----eeeeeccCc---ccccccccc-cccccccccccccchhhHHHHHHHHHhh-----ccccccccccccCceEEEEc Confidence 344432111 011111110 011111111 1123344343332 333221 335788999 Q ss_pred hHHHHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeeccccceec Q lcl|NC_015266. 230 RELLHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNPKRDQIE 309 (337) Q Consensus 230 ~dLla~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e 309 (337) +..... ..++....+.. ++-+ ...--|.+++.-++||++.|++-.+++.-|.-. ++.+-..-+. . T Consensus 271 ~~t~~~-l~~~~~~~~~~-----G~~v--~~l~~g~~vv~s~~~p~~~iifgDfs~Y~i~~r-~~~~i~~~~~--~---- 335 (381) T protein:vir:10 271 PSDAFE-VQAQYTHLNAN-----GVYV--TALPFNLNVIESTVQEAGKVLTYVKGLYDGYLA-GGINVQKFKE--T---- 335 (381) T ss_pred cccHHh-hccccccCCCC-----Ccee--ecCCCCceEEecCCCCcCcEEEEecccEEEEEe-cccEEEeech--h---- Confidence 865442 22221111100 1100 001126778899999999999988888555433 3333211111 1 Q ss_pred chhhhcccceee--------cCCcEEEeeceeeccC Q lcl|NC_015266. 
310 NYESSNDAYVVE--------DFGCGCVAENIELVAA 337 (337) Q Consensus 310 ~y~s~Ne~YvVE--------d~~~~a~iEnI~~~~a 337 (337) -|..-..+|.+- |..+++.++ |++.++ T Consensus 336 ~~~~d~~~f~a~~r~dg~~~~~~A~~v~~-l~~~~~ 370 (381) T protein:vir:10 336 LALDDMDLYTAKQFAYGKAKDNKVAAVWK-LDLKGH 370 (381) T ss_pred HhhcCCeEEEEEEEEcCEEecCceEEEEE-EEecCC Confidence 111111222222 222333333 555555 No 88 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=97.45 E-value=6.4e-05 Score=43.56 Aligned_cols=281 Identities=12% Similarity=0.127 Sum_probs=148.1 Q ss_pred CChHHHHHHHHHHHH-----HHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccc--cccc Q lcl|NC_015266. 1 MKKETRQAYRKYAAQ-----IAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLS--VSGP 73 (337) Q Consensus 1 M~~~tr~~~~~y~~~-----~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~g--v~g~ 73 (337) +....+..|..|+.. ...........-.+.|-+.+...+.+.+.+.+.+++..+++++....|.....- ..++ T Consensus 86 ~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 165 (397) T protein:vir:49 86 VKANFVKDFKNLVRGRYQNLLDSKTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEKWADITG 165 (397) T ss_pred HHHHHHHHHHHHhhcchhhHHHhhhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEeeccCCc Confidence 444455556666532 111111122234567766777899999999999999999999988777654332 2223 Q ss_pred cceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCC Q lcl|NC_015266. 74 IASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALST 153 (337) Q Consensus 74 iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~T 153 (337) .+.-+..+.. ....+...++...+.+++.---..|+.+.|.... .+|+..+++.+.++++.-.-.--++|+-... 
T Consensus 166 ~a~~v~E~~~-~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~~~~~~~d~ail~G~g~~~-- 240 (397) T protein:vir:49 166 LAKLDDEGGQ-IGQNDDPKLSLIRYAIKRYAGISTVTNSLLADSA--ENILAWLSGWIAKKVVVTRNKAILEAIGTLP-- 240 (397) T ss_pred ceeeeccccc-cccccccceeeeEeeeeeeEeehhhHHHHHhhhh--HHHHHHHHHHHHHHHHHHHHHHHHhcccccc-- Confidence 3333332221 1112233456667777777777888888887532 3789999999999888776666667743210 Q ss_pred ChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHH Q lcl|NC_015266. 154 DKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELL 233 (337) Q Consensus 154 D~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLl 233 (337) | . +.. .+.|.++ +++.. +++.|+... +++|.+..+ T Consensus 241 -----~---------------------------~------~~~---~~~d~i~-~~~~~-l~~~~~~~a--~~v~n~~~~ 275 (397) T protein:vir:49 241 -----N---------------------------K------PTL---AKWDDII-DLQAK-VDPAIKQTS--LFLTNTSGF 275 (397) T ss_pred -----c---------------------------c------ccc---cCHHHHH-HHHHh-hhhhhcCCC--EEEEcHHHH Confidence 0 0 001 2345544 45554 477777654 889998876 Q ss_pred HHHHHHHHhccCChhHHHHHHHH-HhhhhhcCceeEECC--ccCCC-----ceEEecccccEEEEecCceEEeEeec--- Q lcl|NC_015266. 234 HDKYFPIVNTTQAPTEQLAADLI-VSQKRIGNLPAVRVP--FFPKR-----AMMVTKLENLSIYFQEGARRRSLIDN--- 302 (337) Q Consensus 234 a~k~~~l~n~~~~ptE~~A~~~~-~~~k~igGl~a~~vP--ffP~~-----~ilvT~l~NLsIY~Q~gs~RR~~~d~--- 302 (337) .. ...|-+..+.|- ....+. ....++-|+|++.++ .+|.. .+++-.|++.-..+..+..+=..-+. T Consensus 276 ~~-l~~lkd~~g~~l--~~~~~~~g~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~ 352 (397) T protein:vir:49 276 TA-LKKVKNAMGDYL--MERDVKSPTGYSIDGFVVKEISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQHLSLLSTNIGGG 352 (397) T ss_pred HH-HHHhhccCCcee--ecccccCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecccEEEEeccccc Confidence 52 222322222220 000110 123589999998765 45643 36777777633333323322111110 Q ss_pred -cccceecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
303 -PKRDQIENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 303 -p~r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) -.++.+--.-..--+..|-++..++.+. +..+ T Consensus 353 ~~~~~~~~~~~~~r~d~~~~~~~a~~~~~---~~~~ 385 (397) T protein:vir:49 353 AFETDTTKVRVIDRFDVVSTDTEAFVPAS---FKAI 385 (397) T ss_pred hhhcCeeeEEEEEeeccEEecccceEEEE---eccc Confidence 1111111111111233444444444443 2222 No 89 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=97.43 E-value=4.8e-05 Score=44.25 Aligned_cols=279 Identities=11% Similarity=0.098 Sum_probs=150.0 Q ss_pred CChHHHHHHHHHHHHHH-----HhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcc--ccccc Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIA-----KLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGL--SVSGP 73 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a-----~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~--gv~g~ 73 (337) +...-+..|..|+..-- .........-.+.|-+.+...+.+.+.+.+.+++..++++++...|..... ...++ T Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 165 (397) T protein:vir:48 86 VKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADITG 165 (397) T ss_pred HHHHHHHHHHHHHhhhhhHHHHHhhccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCCCc Confidence 33444444554443211 111111223456788888899999999999999999999999888876643 23334 Q ss_pred cceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCC Q lcl|NC_015266. 74 IASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALST 153 (337) Q Consensus 74 iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~T 153 (337) .+..+..+.. ....+...++...+..++.---..|+.+.|+.. ..+|+..+++.+.+.++.-.-.--+||+..+.. 
T Consensus 166 ~a~~v~E~~~-~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds--~~~l~~~v~~~l~~~~~~~~d~~il~G~g~~~~- 241 (397) T protein:vir:48 166 LAKLDDEAGS-IGTNDDPKLYPIRYAIKRYAGISTVTNSLLADS--AENILAWLSGWIAKKVVVTRNKAILEAIATLPT- 241 (397) T ss_pred ceeeeccccc-cccccccceeeEEeeheeeeeehhhHHHHHhhc--hHHHHHHHHHHHHHHHHHHHHHHHhhccccccc- Confidence 4444433221 111222334555555555555578899988763 247888888888888887666666677532110 Q ss_pred ChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHH Q lcl|NC_015266. 154 DKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELL 233 (337) Q Consensus 154 D~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLl 233 (337) .+.. .+.|.++ +++.. +++.|+.. -+++|.+..+ T Consensus 242 ---------------------------------------~~~~---~~~d~i~-~~~~~-l~~~~~~~--a~~v~n~~~~ 275 (397) T protein:vir:48 242 ---------------------------------------KPTL---TKWDDII-DLQAK-VDPAIKQT--SFFLTNTSGF 275 (397) T ss_pred ---------------------------------------cccc---ccHHHHH-HHHHH-hhhhhcCC--CEEEECHHHH Confidence 0111 2345544 34544 46777765 4888999876 Q ss_pred HHHHHHHHh-ccCCh--hHHHHHHHHHhhhhhcCceeEECC--ccC-----CCceEEecccccEEEEecCceEEeEeecc Q lcl|NC_015266. 234 HDKYFPIVN-TTQAP--TEQLAADLIVSQKRIGNLPAVRVP--FFP-----KRAMMVTKLENLSIYFQEGARRRSLIDNP 303 (337) Q Consensus 234 a~k~~~l~n-~~~~p--tE~~A~~~~~~~k~igGl~a~~vP--ffP-----~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p 303 (337) +. +..+. ..+.| ..-... ....+|-|+|++.++ ++| ...+++=.|++...++..+..+=.+.+.. T Consensus 276 ~~--L~~lkd~~G~~i~~~~~~~---~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~ 350 (397) T protein:vir:48 276 TA--LKKVKNAFGDYLMERDVKS---PTGYSIDGFAVKEVADRWLANASSGAMPLYFGDLKQAVTLFDRQQMSLLSTNIG 350 (397) T ss_pred HH--HHHhhcCCCceeeccCcCC---CCCceeccceeEEecccccCCcCCCceEEEEEeccceEEEEeecceEEEEeccc Confidence 52 22232 22222 111101 123589999998865 344 33477777787655555555443332221 Q ss_pred c----cceecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
304 K----RDQIENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 304 ~----r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) + ++.+--.-..--++.|-++..++. +++..+ T Consensus 351 ~~~~~~~~~~~r~~~r~d~~~~~~~a~~~---~~~~~~ 385 (397) T protein:vir:48 351 GGAFETDTTKIRVIDRFDVVATDTESFVP---ASFKAI 385 (397) T ss_pred hhhhhcCceeEEEEeeeccEEecccceEE---EEeccc Confidence 1 111111111112334444444444 344444 No 90 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=97.39 E-value=7.7e-05 Score=43.13 Aligned_cols=285 Identities=13% Similarity=0.047 Sum_probs=151.7 Q ss_pred HHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceeccCCCcccccccccccCc Q lcl|NC_015266. 16 IAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTDTTKAERQPIDPTALDS 95 (337) Q Consensus 16 ~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~t~~~~R~~~~~~~l~~ 95 (337) ||. +. ..+-.+.|-+...+.+.+.++++|.++++-+++++.-- +-++-.-.+++-++-+.. ....|..-..++. T Consensus 1 Ma~--~~-~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~-~~~ip~~~~~~~a~wv~E--g~~~~~s~~~f~~ 74 (315) T protein:vir:80 1 MAD--DF-LSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFG-PVKGAVFSGVPRAKIVGE--GEVKPSASVDVSA 74 (315) T ss_pred CCC--Cc-CCcCceEcchHHHHHHHHHHHhhchhhhhcceeecCCC-ceEEEEEeCCcceEEeeC--Cccccccccceee Confidence 332 22 22456789999999999999999999999888887532 223333344555554433 2334445567888 Q ss_pred cceeeEeeccccccCHHHHHHHhc-C-ccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhhhhhccchhHHHHHH Q lcl|NC_015266. 
96 NRYRCEKTDYDTAITYRKLDAWAK-F-PDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANPLLQDVNIGWLQQYR 173 (337) Q Consensus 96 ~~Y~c~qtn~d~~i~y~~LD~wA~-~-~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nPllqDVNkGWlq~~R 173 (337) ....+++.-.-+.|+-+.|....- + ..+++.+.+.+.+.++.=.-.-.|||+.-...+.+ T Consensus 75 v~l~~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~------------------ 136 (315) T protein:vir:80 75 FTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAA------------------ 136 (315) T ss_pred eEeeeeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccc------------------ Confidence 888889888888899888865432 2 34678888888888887666778899642221111 Q ss_pred hhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHHHHHhccCChhH--HH Q lcl|NC_015266. 174 DRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNTTQAPTE--QL 251 (337) Q Consensus 174 e~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~~l~n~~~~ptE--~~ 251 (337) ..+........+.....+.-|.+++.++.- +. ...++. +. +++|.+.....- ..+......++- .+ T Consensus 137 -----~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~-~~---~~~~~~-~~-~~imn~~~~~~L-~~l~~~~g~~~~g~~~ 204 (315) T protein:vir:80 137 -----SAVHTSLNKTKNIVDATDSATADLVKAVGL-IA---GAGLQV-PN-GVALDPAFSFAL-STEVYPKGSPLAGQPM 204 (315) T ss_pred -----cccccccccccceeeccccchHHHHHHHHH-Hh---hccCcc-ce-EEEEcHHHHHHH-HHHhhccCCccccccc Confidence 011111101111111233446777766543 22 222222 22 688888776532 222211112110 00 Q ss_pred HHHH-HHhhhhhcCceeEECCccCCCc---------eEEecccccEEEEecCceEEeEeeccccce-ecchhhhc----- Q lcl|NC_015266. 252 AADL-IVSQKRIGNLPAVRVPFFPKRA---------MMVTKLENLSIYFQEGARRRSLIDNPKRDQ-IENYESSN----- 315 (337) Q Consensus 252 A~~~-~~~~k~igGl~a~~vPffP~~~---------ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r-~e~y~s~N----- 315 (337) --.+ .....++-|+|++..+++|++. +++--++++-|-. .+..+-.+-+..+-+. 
-.++..+| T Consensus 205 ~~~~~~g~~~tl~G~PV~~~~~~~~~~~~~~~~~~~~~~GDfs~~~~g~-~~~~~i~i~~~~~~~~~~~~~~~~~~v~~r 283 (315) T protein:vir:80 205 YPAAGFAGLDNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGF-QRNFPIELIEYGDPDQTGRDLKGHNEVMVR 283 (315) T ss_pred ccccccCCCceecceeeEecCcCCcccccccccccEEEEeecccEEEEE-ecCeeEEEeccccccCcccchhhcCcEEEE Confidence 0000 0123578999999999999764 4455666644422 2222222222211110 01222222 Q ss_pred ----ccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 316 ----DAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 316 ----e~YvVEd~~~~a~iEnI~~~~a 337 (337) -|..|.+.++++.+++..--.+ T Consensus 284 ~~~r~~~~v~~~~a~~~l~~~~a~~~ 309 (315) T protein:vir:80 284 AEAVLYVAIESLDSFAVVKEKAAPKP 309 (315) T ss_pred EEEEecceeecccceEEEeeccCCCC Confidence 4455666666666654332222 No 91 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=97.36 E-value=6.2e-05 Score=43.64 Aligned_cols=283 Identities=10% Similarity=-0.021 Sum_probs=150.6 Q ss_pred HHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceeccCCCcccccccccccCc Q lcl|NC_015266. 16 IAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTDTTKAERQPIDPTALDS 95 (337) Q Consensus 16 ~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~t~~~~R~~~~~~~l~~ 95 (337) +|. -+ +-.+.|-++..+.+.+.+++.|..+++.+++++.--. ..+-.-.+++-++-... +...|..-..++. T Consensus 1 mat----~~-~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~-~~~p~~~~~~~a~wv~E--g~~~~~~~~~f~~ 72 (311) T protein:vir:81 1 MVA----LA-TGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRGEVVGE--GAQKSESTATFAP 72 (311) T ss_pred Cce----ec-CCceEcchhHHHHHHHHHHhcchhhhhcceeecCCCc-eEEEEEeCCceeEEeec--CcccccccceeeE Confidence 221 11 2356788888999999999999999999988864422 22223234555544332 2233344456788 Q ss_pred cceeeEeeccccccCHHHHHHHhc-CccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhhhhhccchhHHHHHHh Q lcl|NC_015266. 
96 NRYRCEKTDYDTAITYRKLDAWAK-FPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANPLLQDVNIGWLQQYRD 174 (337) Q Consensus 96 ~~Y~c~qtn~d~~i~y~~LD~wA~-~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nPllqDVNkGWlq~~Re 174 (337) ..+.+++.--.+.|+.+.|.++.. ..+|++.+.+.+.++++.-.-.-.+||+.....+.+ T Consensus 73 v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~------------------- 133 (311) T protein:vir:81 73 VTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAAL------------------- 133 (311) T ss_pred EEEeeEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCccc------------------- Confidence 889999998889999999977654 467999999999999999988889999753322221 Q ss_pred hchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHHHHHhccCChhHHHHHH Q lcl|NC_015266. 175 RAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNTTQAPTEQLAAD 254 (337) Q Consensus 175 ~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~~l~n~~~~ptE~~A~~ 254 (337) .-+.+........+.. ...+-..+|.++..++. ++... .-++. .++|.+..+.. ...|-...+.|- -.... T Consensus 134 ---~gi~~~~~~~~~~~~~-~~~~~~~~~~~i~~~~~-~~~~~-~~~~~-~~vmn~~~~~~-l~~lkd~~G~~l-~~~~~ 204 (311) T protein:vir:81 134 ---SGSPAKILDTTNIVEL-TTGTSATPDLAVEAAVG-LVLGD-NLSPD-GVALDNTFSFM-LATQRDSQGRKL-YPELG 204 (311) T ss_pred ---ccccccccccceeeee-cccccchHHHHHHHHHH-Hhhhc-CCCce-EEEEcHHHHHH-HHhhhccCCCee-ecCcc Confidence 1111111111111111 12222455666666654 33332 23333 47888876642 222322222220 00011 Q ss_pred HHHhhhhhcCceeEECCccCCCceEE------------------ecccccEEEEecCceEEeEeeccccceecchhhhcc Q lcl|NC_015266. 255 LIVSQKRIGNLPAVRVPFFPKRAMMV------------------TKLENLSIYFQEGARRRSLIDNPKRDQIENYESSND 316 (337) Q Consensus 255 ~~~~~k~igGl~a~~vPffP~~~ilv------------------T~l~NLsIY~Q~gs~RR~~~d~p~r~r~e~y~s~Ne 316 (337) .-....++-|+|++..-++|.+.... =-++++-|-...+- +-.+-+..+-+...++..+|. 
T Consensus 205 ~~~~~~tl~G~Pv~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 283 (311) T protein:vir:81 205 FGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSI-PLELIEFGDPDGLGDLKRQNQ 283 (311) T ss_pred ccCCCceecceeEEecccccccccccccccchhcccCCccEEEEEecccEEEEEeccc-eEEEeccCCCCcchhhhhcCc Confidence 11235688899999999999765433 33333222222221 112222211122223333332 Q ss_pred -cc--------eeecCCcEEEeeceeec Q lcl|NC_015266. 317 -AY--------VVEDFGCGCVAENIELV 335 (337) Q Consensus 317 -~Y--------vVEd~~~~a~iEnI~~~ 335 (337) +| .|=++.+++.+...+-+ T Consensus 284 v~~r~~~r~d~~v~~~~a~~~l~~a~~~ 311 (311) T protein:vir:81 284 IAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) T ss_pred EEEEEEEEeccEeecccceEEEEeeccC Confidence 22 23333444444332222 No 92 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=97.33 E-value=4.8e-05 Score=44.24 Aligned_cols=276 Identities=11% Similarity=0.078 Sum_probs=135.3 Q ss_pred CChHHHHHHHHHHHHH----HHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccce Q lcl|NC_015266. 1 MKKETRQAYRKYAAQI----AKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIAS 76 (337) Q Consensus 1 M~~~tr~~~~~y~~~~----a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iag 76 (337) .+...+..|..++... ....++....-.+.|-+.+...+.+.+++.+.+++.++++++..-.+...-.. .++.++ T Consensus 92 ~~~~~~~~~~~~~~~~~~~~~~ra~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~~~~~~~~~~~~-~~~~~~ 170 (421) T protein:vir:13 92 KRSLQLSAMSKTIRGIQLSEEERDIMSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVIPVNRNAGKMPVRA-GASVDK 170 (421) T ss_pred HHHHHHHHHHHhhhccchhHHHhhccccCCcceecchhhHHHHHHHHHhhhhhhhhceeeeccCCceEEEEee-cCCccc Confidence 1111222333333211 12223333344566766777889999999999999999999887666443221 222221 Q ss_pred eccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChh Q lcl|NC_015266. 
77 RTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKA 156 (337) Q Consensus 77 Rt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~ 156 (337) =...+...-.|..-..++...+..++.---..|+.+.|+. + .++|+..+++.+.+++++ -.|| +.. T Consensus 171 ~~~~~E~~~~~~s~~~f~~i~~~~~k~~~~v~iS~ell~d-s-~~~l~~~i~~~la~~~~~-----~~~~-------~i~ 236 (421) T protein:vir:13 171 LANLAKDTELVKAMLKTQPMAYDIDDYGLLAPIDNSLLED-S-EINFLEFVNEEFAEFAVN-----TENA-------EIV 236 (421) T ss_pred eeeccccccccccccceeEEEeeeeeeEeehhhhHHHHhh-h-HHHHHHHHHHHHHHHHHH-----Hhhh-------hHh Confidence 1111122222333344555666666665566778887764 2 247888888888877753 1121 110 Q ss_pred hhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHH Q lcl|NC_015266. 157 ANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDK 236 (337) Q Consensus 157 ~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k 236 (337) + .|.-+++ .+ ...+| |.++ +++..+ ++.|+.. -+++|.+..... T Consensus 237 ------~------------~~~g~~~----~~------~~~~~---d~i~-~~~~~l-~~~~~~~--a~~v~n~~~~~~- 280 (421) T protein:vir:13 237 ------K------------QAKAVLA----EE------TINDY---AGLV-KTINSL-VPNARKR--AIIVTNSDGRAY- 280 (421) T ss_pred ------h------------hhhhccc----cc------cccch---HHHH-HHHHHh-hhhhcCC--CEEEEcHHHHHH- Confidence 0 1111111 11 11234 4433 455544 5556654 378888876653 Q ss_pred HHHHHhccCChhHHHHHHHH-HhhhhhcCceeEECCccCCCc-----eEEecccccEEEEecCceEEeEeeccccceecc Q lcl|NC_015266. 237 YFPIVNTTQAPTEQLAADLI-VSQKRIGNLPAVRVPFFPKRA-----MMVTKLENLSIYFQEGARRRSLIDNPKRDQIEN 310 (337) Q Consensus 237 ~~~l~n~~~~ptE~~A~~~~-~~~k~igGl~a~~vPffP~~~-----ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e~ 310 (337) ...+-...+.|- -.... ....++-|+|++.++++|... 
+++-.+++.-..+..++.+=...+++ T Consensus 281 l~~lkd~~G~~i---~~~~~~~~~~tl~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~------- 350 (421) T protein:vir:13 281 LDGLMDKQGRPL---LKELSDGGDLVFKGRPVIELEESIFDVGDETKFIVSDFKTLIKFMDRKQYLIDQSKEA------- 350 (421) T ss_pred HHHhhcCCCcee---ecCcCCCCCceecceeeEEeccccccCCCceEEEEEeccccEEEEEecceEEEeeccc------- Confidence 222222222220 00000 123578999999999999764 78888888543333445554443333 Q ss_pred hhhhcc-cceee--------cCCcEEEeec------eeeccC Q lcl|NC_015266. 311 YESSND-AYVVE--------DFGCGCVAEN------IELVAA 337 (337) Q Consensus 311 y~s~Ne-~YvVE--------d~~~~a~iEn------I~~~~a 337 (337) |+.+|. ++.++ +.+.++++.- |.+.++ T Consensus 351 ~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~a~v~~~~~ 392 (421) T protein:vir:13 351 GYTKNETIARIIERFDVNSPLDKSSDAEKIRKFGVIVKLQEV 392 (421) T ss_pred ccccCeeEEEEEeeecceeecchhhheeeecccceeeccccc Confidence 222222 33332 2222222221 111111 No 93 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=97.28 E-value=8.2e-05 Score=42.99 Aligned_cols=280 Identities=9% Similarity=-0.027 Sum_probs=131.6 Q ss_pred CChHHHHHHHH----------HHHHHHHhcC--cccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcc Q lcl|NC_015266. 1 MKKETRQAYRK----------YAAQIAKLND--TDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGL 68 (337) Q Consensus 1 M~~~tr~~~~~----------y~~~~a~~ng--v~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~ 68 (337) ..+.....+.. .........| .......+.+-+.....+...+.+.+..++.++++++..-....... 
T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 156 (379) T protein:vir:10 77 KSDSLVKSITENFNDIKEVRNGKSIQVKAVGDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSISGGTYTFVRE 156 (379) T ss_pred cchhHHHHHHHHHHhHHHHHhhhhhhhhhhcccccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeeccCCceEEEEe Confidence 00000011110 0000011111 11112233466667778888888889999999888875433222211 Q ss_pred -ccccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHh--hchhhhccc Q lcl|NC_015266. 69 -SVSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSA--LDRIMIGWN 145 (337) Q Consensus 69 -gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~a--lD~i~IGfN 145 (337) |.++.-+ .-.+.+...|..-..++...|..++.---+.|+-+.|+.. |.++..+++.+.+.++ +|.-.+|-. T Consensus 157 ~~~~~~~~--~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~D~---~~l~~~i~~~la~~~~~~~~~~~~~g~ 231 (379) T protein:vir:10 157 NGAGEGAI--GAQVEGATKGQKDYDISMIDVNTDFIAGFTRYSKKMANNL---PFLTSFIPNALRRDYAKAENAAFNAVL 231 (379) T ss_pred ecCCCccc--ccccCCccccccccceeeeEeeeeeEEeeehhhHHHHhhH---HHHHHHHHHHHHHHHHHHHHHHHhccc Confidence 1111111 1112223334333456666777777666678888888764 5688888887777664 344333332 Q ss_pred ceeccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeE Q lcl|NC_015266. 146 GVKAALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLV 225 (337) Q Consensus 146 G~s~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLV 225 (337) |+.. +..+ -....+..+|.++. ++..+.+. ++... + T Consensus 232 ~~~~---------------------------~~~~-------------~~~~~~~~~d~i~~-~~~~~~~~-~~~~~--~ 267 (379) T protein:vir:10 232 AANA---------------------------TAST-------------EIITNKNKVEMLIN-EIAKQENL-DFPVT--A 267 (379) T ss_pred cccc---------------------------cccc-------------ccccCcccHHHHHH-HHHhhhhc-cCCCC--E Confidence 2110 0000 01122344666554 45555444 44433 5 Q ss_pred EEeChHHHHHHHHHHHh-ccCChhH--HHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeec Q lcl|NC_015266. 
226 VICGRELLHDKYFPIVN-TTQAPTE--QLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDN 302 (337) Q Consensus 226 vivG~dLla~k~~~l~n-~~~~ptE--~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~ 302 (337) ++|.+.-+.. ...+. ..+.|=- ...++ -....++-|+|++..|.+|++.+++=.++..-+-.-+|..-....+. T Consensus 268 ~vmn~~~~~~--l~~lkd~~G~~l~~~~~~~~-~~~~~~l~G~pvv~s~~~~ag~~~~gdf~~~~~~~~~~~~i~~~~~~ 344 (379) T protein:vir:10 268 IVLRPTDYYD--ILVTQKSVGAGYGLPGVVTQ-DNGVLRINGIPLFRATWLAANKYYVGDWTRVTKVTTEGLSLEFSEVE 344 (379) T ss_pred EEEcHHHHHH--HHHhhccCCceeccCCccCC-CCCcceecceeeEecCCCCCCceEEeecccEEEEEEeceEEEEeecc Confidence 7788764432 22222 1111100 00000 01224788999999999999999988888754433333221111111 Q ss_pred ---cccceecchhhhcccceeecCCcEEEee--ce Q lcl|NC_015266. 303 ---PKRDQIENYESSNDAYVVEDFGCGCVAE--NI 332 (337) Q Consensus 303 ---p~r~r~e~y~s~Ne~YvVEd~~~~a~iE--nI 332 (337) -.+|.+.-.--.=-|..|=++++++.++ .| T Consensus 345 ~~~f~~~~~~~r~~~R~~~~v~~p~a~v~~~~~~~ 379 (379) T protein:vir:10 345 GTNFVKNNITARIEAQVALAVEQPAALIFGDFTAV 379 (379) T ss_pred cccccCCcEEEEEEEEeccEEecCccEEEEEecCC Confidence 1122221111111234445555555544 33 No 94 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=97.27 E-value=9.8e-05 Score=42.55 Aligned_cols=298 Identities=12% Similarity=0.070 Sum_probs=144.0 Q ss_pred CChHHH-----HHHHHHHHHH------------------HHhcCcccccceeeecHH-HHHHHHHHHHhhhhhhcccccc Q lcl|NC_015266. 1 MKKETR-----QAYRKYAAQI------------------AKLNDTDDVSQKFAVEPS-VQQTLETKMQESSAFLKSINIL 56 (337) Q Consensus 1 M~~~tr-----~~~~~y~~~~------------------a~~ngv~~~~~~Fsv~P~-~~q~L~~~i~ess~FL~~Inv~ 56 (337) +....+ .......... +......+.+-.+.|-|. 
....+.+.+++++.+++.+.++ T Consensus 115 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~ 194 (477) T protein:vir:84 115 LAMQTVGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYRDLDRNGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTE 194 (477) T ss_pred HHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhccccccCCCcceeeccchhHHHHHHHhhhcchHHHhhcee Confidence 000000 0000000000 000001111224456665 3678999999999999999999 Q ss_pred cchhhhhhhhccc-cccccce-eccCCC---cccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHH Q lcl|NC_015266. 57 PVTELEGEKLGLS-VSGPIAS-RTDTTK---AERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVI 131 (337) Q Consensus 57 ~V~~~~Ge~v~~g-v~g~iag-Rt~t~~---~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i 131 (337) ++....|..-.-- .+|+..+ -+.-+. ....|..-..++...+.+++.---+.|+.+.|+..+ ++++..+++.+ T Consensus 195 ~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l 272 (477) T protein:vir:84 195 PLPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLTDGFVQANVKTIAGQQGIAIQLLDQAA--VSVDEFVFRDL 272 (477) T ss_pred eecCCcceeEEEEEecCcceeeeeccCcccccccccccccceeeEEEeeeeEEeeeHHHHHHHhccc--hhHHHHHHHHH Confidence 9888777532211 1233222 222111 122233334567778888888888889999988755 58999999999 Q ss_pred HHHHhhchhhhcccceeccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceec-CCCcccccHHHHHH--- Q lcl|NC_015266. 132 LNQSALDRIMIGWNGVKAALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLV-GKGGDYVNLDALVM--- 207 (337) Q Consensus 132 ~~~~alD~i~IGfNG~s~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~-G~ggdy~nLDalV~--- 207 (337) .+.++.=.-.--++|+-.+ .+|. |.+. .. ..+.+.. +.+..+..+|.+.. 
T Consensus 273 ~~~~~~~~d~~~l~G~Gt~------~~p~------Gi~~------------~~--~~~~~~~~~~~~t~~~~~~~~~~i~ 326 (477) T protein:vir:84 273 AADYANKLNVQVISGTGSN------NQVV------GVRA------------TA--GITQVTATSAGSALEKHQIIYQKIA 326 (477) T ss_pred HHHHHHHHHHHHhccCCCC------Cccc------eeee------------cc--ccccccccccccchhhHHHHHHHHH Confidence 9988865555666884321 1232 3331 11 0111222 23456777777754 Q ss_pred HHHhcccChhHcCCCCeEEEeChHHHHHHHHHHHhccCCh----h--H-----HHHHHHH-HhhhhhcCceeEECCccCC Q lcl|NC_015266. 208 DIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNTTQAP----T--E-----QLAADLI-VSQKRIGNLPAVRVPFFPK 275 (337) Q Consensus 208 da~~~li~~~~r~~~dLVvivG~dLla~k~~~l~n~~~~p----t--E-----~~A~~~~-~~~k~igGl~a~~vPffP~ 275 (337) +++.. +++-++..+..+++ ....++ ....+-...+.| . + .+..... ....++.|+|++..+++|+ T Consensus 327 ~~~~~-~~~~~~~~~~~~v~-~~~~~~-~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~pVv~s~~~p~ 403 (477) T protein:vir:84 327 DAIQR-VHTSRFLEPEVIVM-HPRRWA-SFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHGLPVVTDPTLPT 403 (477) T ss_pred HHHhh-ccccccCCccEEEE-cHHHHH-HHHHhhccCCCeeeecCcccccccccccccccccccchhcccceEecCcccc Confidence 44443 45556655555544 444333 222222221111 0 0 0001110 1234788999999999997 Q ss_pred C--------ceEEecccccEEEEecCceEEeEeeccccceecchhhhcccceeec---------CCcEEEeeceeeccC Q lcl|NC_015266. 276 R--------AMMVTKLENLSIYFQEGARRRSLIDNPKRDQIENYESSNDAYVVED---------FGCGCVAENIELVAA 337 (337) Q Consensus 276 ~--------~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e~y~s~Ne~YvVEd---------~~~~a~iEnI~~~~a 337 (337) + .+++-.++.+-| -+++.+- +..++. ..++-. 
..|.|.- +.+++.+-+.-...- T Consensus 404 ~~~~~~d~~~i~~gd~~~~~i--~~~~~~~--~~~~~~--~~~~~~--~~~~v~~~~~~~~~r~~~afv~~t~~~~~~~ 474 (477) T protein:vir:84 404 TLGTGTDQDVIHVLRASDLAL--FESSVRM--RALQET--RAENLS--VLLQVYGYLAFTAARFPQSVVEIGGTALTAP 474 (477) T ss_pred cccccCCcceEEEEEeceEEE--EeeceeE--Eecccc--ccccce--eeeeehhhhhhhhhccccceEEeeccccccc Confidence 5 467777766533 2233222 222221 111111 1222221 233333322211111 No 95 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=97.25 E-value=5e-05 Score=44.16 Aligned_cols=287 Identities=9% Similarity=0.043 Sum_probs=150.0 Q ss_pred CChHHHHHHHHHHHHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceeccC Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~t 80 (337) |--..... .++ ..+-... .-.+-|.+.+.+.+.+++.+..++..+++++.-... ++-.-.+++-+.-+.. T Consensus 1 ~g~~~e~~------~~~-~~~t~~~--~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~-~ip~~~~~~~a~wv~E 70 (397) T protein:vir:23 1 MGFSADHS------QIA-QTKDTMF--TGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGI-VIPHWTGDVSAQWIGE 70 (397) T ss_pred CCcCHHHH------HHh-hccCCCC--ccccchhHHHHHHHHHHhccchhhhcceeeccCCce-EEEEEcCCcceEEecC Confidence 32222211 111 1122221 224789999999999999999999988888763222 2323333444444322 Q ss_pred CCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhhh Q lcl|NC_015266. 81 TKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANPL 160 (337) Q Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nPl 160 (337) ....|..-..++...|..++.---..|+.+.|+.= .++|+..+++.+.++++.-.-.--+||.-.. 
.|+ T Consensus 71 --g~~~~~s~~~f~~v~l~~~k~~~~v~iS~ell~ds--~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~-------~~~ 139 (397) T protein:vir:23 71 --GDMKPITKGNMTKRDVHPAKIATIFVASAETVRAN--PANYLGTMRTKVATAIAMAFDNAALHGTNAP-------SAF 139 (397) T ss_pred --CccccccccceeEEEEeeEEEEEeehhhHHHHhcc--hHHHHHHHHHHHHHHHHHHHHHHHhhcccCC-------ccc Confidence 22334444567888888898888899999988842 3789999999999999988888888886521 111 Q ss_pred hhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHHHH Q lcl|NC_015266. 161 LQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPI 240 (337) Q Consensus 161 lqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~~l 240 (337) .||+- .+ ...........| |.+ .+++..+ .+-+++. -+++|.+..... ...+ T Consensus 140 -----~~~~~-------------~~--~~~~~~~~~~~~---~~~-~~~~~~l-~~~~~~~--a~~vmn~~~~~~-L~~l 191 (397) T protein:vir:23 140 -----QGYLD-------------QS--NKTQSISPNAYQ---GLG-VSGLTKL-VTDGKKW--THTLLDDTVEPV-LNGS 191 (397) T ss_pred -----ccccc-------------cc--cceeeecccchh---HHH-HHHHHhh-hhcccCC--CEEEEcHHHHHH-HHHh Confidence 12211 00 011111122222 222 2344444 4445543 478888876552 2222 Q ss_pred HhccCChh--HHHHHH--HHHhhhhhcCceeEECCccCCCce--EEecccccEEEEecCceEEeEeecc--------ccc Q lcl|NC_015266. 241 VNTTQAPT--EQLAAD--LIVSQKRIGNLPAVRVPFFPKRAM--MVTKLENLSIYFQEGARRRSLIDNP--------KRD 306 (337) Q Consensus 241 ~n~~~~pt--E~~A~~--~~~~~k~igGl~a~~vPffP~~~i--lvT~l~NLsIY~Q~gs~RR~~~d~p--------~r~ 306 (337) -...+.|- .....+ ......++-|+|++..+++|++.+ ++..++++-|....+ .+-.+.++. ... T Consensus 192 kd~~G~~i~~~~~~~~~~~~~~~~tl~G~Pv~~s~~~~~g~~~~~~gDfs~~~i~~~~~-i~i~~~~e~~~~~~~~~~~~ 270 (397) T protein:vir:23 192 VDANGRPLFVESTYESLTTPFREGRILGRPTILSDHVAEGDVVGYAGDFSQIIWGQVGG-LSFDVTDQATLNLGSQESPN 270 (397) T ss_pred hccCCceeecccccccccccccCceeeeeeEEEeCCCCCCceEEEEeecceEEEEEEec-eEEEEeeeeeeeeccccccc Confidence 22211210 000011 111235788999999999999876 456788765443333 333232221 111 Q ss_pred eecchhh--------hcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
307 QIENYES--------SNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 307 r~e~y~s--------~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) .+.-|+. .--++.|-+.++++.+..-..... T Consensus 271 ~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~~~~ 309 (397) T protein:vir:23 271 FVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPVLTT 309 (397) T ss_pred eeeeeeccceeEEEEeeeccceecccceEEEeeccccce Confidence 1111221 113344555555555542111111 No 96 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=97.24 E-value=8.9e-05 Score=42.79 Aligned_cols=282 Identities=9% Similarity=0.051 Sum_probs=132.2 Q ss_pred CChHHHHHHHHHHHHHHH--hcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceec Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAK--LNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRT 78 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~--~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt 78 (337) .....+..|..++..--. ..........|.|-..+...+. .+.+.+..++.++++++....+.......+++.++-. T Consensus 136 ~~~~~~~~~~~~~~~~e~~~~~~~~~~~~g~lvp~~~~~~i~-~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 214 (437) T protein:vir:10 136 IADKKVTAFADYLKTGEVRDVTGIALKDGKVIIPETILTPEK-EVHQFPRLGSLVRTESVTTTTGKLPIFNNSTDLLTAH 214 (437) T ss_pred HHHhhhhhhHHHHHhhhhhhhhhcccccccccchHHHHHHHH-HhhhhhhhhhcceeEeeccCceeeEEeeccccccccc Confidence 222222334444332111 1112222344555445555544 4566777888899998887766654443333333222 Q ss_pred cCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhh Q lcl|NC_015266. 79 DTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAAN 158 (337) Q Consensus 79 ~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~n 158 (337) .-+. .....+...++...+..++.-.-+.|+.+.|+... ++|+..+++.+.+.++.=.-.-=+||+. 
T Consensus 215 ~e~~-~~~e~~~~~~~~v~~~~~k~~~~~~is~ell~ds~--~~~~~~i~~~l~~~~~~~~~~~i~~g~g---------- 281 (437) T protein:vir:10 215 TEYG-QTTKNATPVITPILWDLKTYTGGYVFSQELISDSS--YDWQAELQSRLIELRDNTDDSLIITALT---------- 281 (437) T ss_pred cccc-cccccccccceeeeeehhheeeehhhhHHHHhhhH--HHHHHHHHHHHHHHHHHHHHHHHhhhhc---------- Confidence 2221 11112223344555555555555788888888643 4788888888888876432221122321 Q ss_pred hhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHH Q lcl|NC_015266. 159 PLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYF 238 (337) Q Consensus 159 PllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~ 238 (337) ++.-. +.++ .+.|.+ .++++.-+++.|+... +++|.+..+.. .. T Consensus 282 -----------------------------~~~~~-~~~~--~~~~~~-~~~~~~~l~~~~~~~~--~~~~~~~~~~~-l~ 325 (437) T protein:vir:10 282 -----------------------------DGIKK-TTST--YLLGDL-KKVLNVTLKPQDSAAA--SIVMSQSAYNL-FD 325 (437) T ss_pred -----------------------------ccccc-cccc--cchhhH-HHHHHhhhhhhhhcCC--EEEEcHHHHHH-HH Confidence 00000 1111 112332 3445445688898765 89999987653 22 Q ss_pred HHHhccCChhHHHHHHHH-HhhhhhcCceeEECCcc--CCCc-----eEEecccccEEEEecCceEEeEeeccccceecc Q lcl|NC_015266. 239 PIVNTTQAPTEQLAADLI-VSQKRIGNLPAVRVPFF--PKRA-----MMVTKLENLSIYFQEGARRRSLIDNPKRDQIEN 310 (337) Q Consensus 239 ~l~n~~~~ptE~~A~~~~-~~~k~igGl~a~~vPff--P~~~-----ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e~ 310 (337) .|-...+.|- ...++- ....++-|+|++..+.+ |..+ +++-.|++.-+.+-+...+=.. .+.++.... T Consensus 326 ~lkd~~g~~~--~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~r~~~~~~~--~~~~~~~~~ 401 (437) T protein:vir:10 326 MATDAMGRPL--LQPNVTAATGYTLLGKTVVIVDDKLFPSASAGDVNIVVAPLKKAVINFKLTEITGQF--QDTYDIWYK 401 (437) T ss_pred HhhccCCCee--eccCccCCCCcccccceeEEecccccCCcCCCceEEEEeeccccEEEEeeeceEEEE--ecccccccc Confidence 2322222220 000110 12458999999998865 5443 6766776643222222222111 111111111 Q ss_pred hh---hhcccceeecCCcEEEee----ceeeccC Q lcl|NC_015266. 
311 YE---SSNDAYVVEDFGCGCVAE----NIELVAA 337 (337) Q Consensus 311 y~---s~Ne~YvVEd~~~~a~iE----nI~~~~a 337 (337) +. .| -+..|=|..+++.+. -+....+ T Consensus 402 ~~~~~~r-~d~~~~~~~a~~~l~~~~~~~~~~~~ 434 (437) T protein:vir:10 402 QLGIFLR-QNVVQASKDLIVNLTGKLKAVTVVQS 434 (437) T ss_pred eeeEEEE-EccEEecccceEEEEeeccccccCCC Confidence 00 11 133444556666553 2333333 No 97 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=97.22 E-value=7.1e-05 Score=43.33 Aligned_cols=277 Identities=9% Similarity=0.040 Sum_probs=136.8 Q ss_pred CChHHHHHHHHHHHHHH-------HhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccc Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIA-------KLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGP 73 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a-------~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~ 73 (337) ........+..+..... ...|+......+.|-+.....+.+.+.+.+.+++..+++++..-.+....+..+++ T Consensus 103 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 182 (394) T protein:vir:97 103 NDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATT 182 (394) T ss_pred hhhhhhhhHHHHHHHHHhhhhhhhhccccccccccccChHHHHHHHHHHhhhhhhhhhhceeeeccCcceEEEEEecCCC Confidence 01111222222222221 11223333345667777888999999999999999999999877776554443332 Q ss_pred cceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCC Q lcl|NC_015266. 74 IASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALST 153 (337) Q Consensus 74 iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~T 153 (337) -++-+..+ ......+...++...+.+++.=--+.|+.+.|+.= .++|+..+.+.+.++++.=.-. 
T Consensus 183 ~~~~v~E~-~~~~~~~~~~~~~v~l~~~k~~~~i~is~ell~ds--~~~~~~~i~~~la~~~~~~~~~------------ 247 (394) T protein:vir:97 183 KMVTVAEL-EKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDA--DVDLVGIVSESISQIKVNTTND------------ 247 (394) T ss_pred ccceeccc-ccccccccccceeEEeehhheeeehhhHHHHHhhh--hHHHHHHHHHHHHHHHHHHHHH------------ Confidence 22222111 11111233456666777777666677888877632 2478888888888877752111 Q ss_pred ChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHH Q lcl|NC_015266. 154 DKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELL 233 (337) Q Consensus 154 D~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLl 233 (337) .++.... + +++..-.+.|.++ ++++..+++.+. -+++|.+... T Consensus 248 -------------------------~i~~g~~--~-----~~~~~~~~~~~~~-~~~~~~~~~~~~----a~~v~n~~~~ 290 (394) T protein:vir:97 248 -------------------------AIAKVLK--S-----FTTKTVKNLDEIK-ALLNGGFDPAYN----VSLIVSQSFY 290 (394) T ss_pred -------------------------HHhhccc--c-----ccccccccHHHHH-HHHHhhhhhhhC----CEEEEcHHHH Confidence 1111110 0 1111223456655 455666777653 2688888765 Q ss_pred HHHHHHHHh-ccCChhHHHHHHHH-HhhhhhcCceeEECCc--cCCCceEEecccccEEEEecCceEEeEeeccccceec Q lcl|NC_015266. 234 HDKYFPIVN-TTQAPTEQLAADLI-VSQKRIGNLPAVRVPF--FPKRAMMVTKLENLSIYFQEGARRRSLIDNPKRDQIE 309 (337) Q Consensus 234 a~k~~~l~n-~~~~ptE~~A~~~~-~~~k~igGl~a~~vPf--fP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e 309 (337) .. +..+. ..+.|- ..-.+. ....++-|+|++..|. +|.+.+++=.+++.-.++-+....=...+++.....- T Consensus 291 ~~--l~~lkd~~G~~i--~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~~~~~~~~ 366 (394) T protein:vir:97 291 QT--LDTLKDGNGRYL--LQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEIYGQYL 366 (394) T ss_pred HH--HHHhhccCCCee--eecCcCCCCCceeccceeEEecccccCCccEEEeeccccEEEEEecceEEEEecccccceeE Confidence 42 22232 222220 000000 1235788999998764 7777788877777433333333332222222111100 Q ss_pred chhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
310 NYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 310 ~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) -.+-| -|..|-++..++. |++.++ T Consensus 367 ~~~~r-~d~~v~~~~a~~~---~~~~~~ 390 (394) T protein:vir:97 367 QAVLR-FGVSKVDDKAGYY---VTFTPE 390 (394) T ss_pred EEEEE-EccEEecccceEE---EEeccc Confidence 00011 1122223332222 334344 No 98 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=97.22 E-value=8.2e-05 Score=42.98 Aligned_cols=289 Identities=9% Similarity=-0.070 Sum_probs=148.6 Q ss_pred HHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceeccCCCcccccccccccCc Q lcl|NC_015266. 16 IAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTDTTKAERQPIDPTALDS 95 (337) Q Consensus 16 ~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~t~~~~R~~~~~~~l~~ 95 (337) +|. -..+-.+.|-+.+++.+.+.+.+.|.+++..+++++..- +.++-.-.+++.++-... ....|..-..++. T Consensus 1 Mat----~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~-~~~~p~~~~~~~a~wv~E--g~~~~~~~~~f~~ 73 (311) T protein:vir:99 1 MAT----FGTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFG-NEDIITFNGRPKAEFVGE--GQQKSSTTGEFDF 73 (311) T ss_pred Cce----ecCCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCC-ceEEEEEeCCceeEEeec--CcccccccceeeE Confidence 332 122345678778889999999999999999999988742 233333334555544422 2333443456778 Q ss_pred cceeeEeeccccccCHHHHHHHhc-CccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhhhhhccchhHHHHHHh Q lcl|NC_015266. 96 NRYRCEKTDYDTAITYRKLDAWAK-FPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANPLLQDVNIGWLQQYRD 174 (337) Q Consensus 96 ~~Y~c~qtn~d~~i~y~~LD~wA~-~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nPllqDVNkGWlq~~Re 174 (337) ..+..++.---..|+.+.|.++.. 
..+|++.+++.+.++++.-.-.-.++|+-....+- |++ ..+|+.+ T Consensus 74 v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~----~~g---~~~~~~~--- 143 (311) T protein:vir:99 74 VTSTPKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTV----IPG---WSNYLGA--- 143 (311) T ss_pred EEEeeEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCcc----ccc---ccccccc--- Confidence 888888888899999999988754 57899999999999999988888888865222211 111 1111110 Q ss_pred hchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHHHHHhccCChhHHHHHH Q lcl|NC_015266. 175 RAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNTTQAPTEQLAAD 254 (337) Q Consensus 175 ~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~~l~n~~~~ptE~~A~~ 254 (337) ....+..+ ..+-..+++.+.+++..+ .....+-.--.++|.+..... ...+-...+.|-=. ... T Consensus 144 ------------~~~~~~~~-~~~~~~~~~~i~~~~~~~-~~~~~~~~~~~~vmn~~~~~~-L~~lkd~~G~~l~~-~~~ 207 (311) T protein:vir:99 144 ------------ASKRVELT-ADTIANPDLAIEAAVGLL-VANGHPTPVNGLALHPSIAWG-LSTARYTDGRKKFP-ELG 207 (311) T ss_pred ------------ccceeecc-ccccchhHHHHHHHHHHH-hhhccCCCccEEEEcHHHHHH-HHhhhccCCCeeec-Ccc Confidence 01111121 122245566666665432 222222222247888876652 22222222222100 000 Q ss_pred HHHhhhhhcCceeEECCccCCCceEEe----------------cccccEEE-EecCceEEeEeeccccceecchhhhccc Q lcl|NC_015266. 
255 LIVSQKRIGNLPAVRVPFFPKRAMMVT----------------KLENLSIY-FQEGARRRSLIDNPKRDQIENYESSNDA 317 (337) Q Consensus 255 ~~~~~k~igGl~a~~vPffP~~~ilvT----------------~l~NLsIY-~Q~gs~RR~~~d~p~r~r~e~y~s~Ne~ 317 (337) .-....++-|+|++...++|.+....+ .++++--| ..++..=+..........+.-|++-.-+ T Consensus 208 ~~~~~~~l~G~Pv~~s~~i~~~~~~~~~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~ 287 (311) T protein:vir:99 208 LGIGVSSFEGIDASVSDTVNGGDEADPDDEDLDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIA 287 (311) T ss_pred cCCCCceecceeeEeecccccccccccccchhhccCcceEEEeeccccEEEEEecCceEEEeecCCCCcchhhhhcCcEE Confidence 011235788999999999986654322 22332111 1111110111110001111112222223 Q ss_pred ceeec-CCcEEEee-ceeeccC Q lcl|NC_015266. 318 YVVED-FGCGCVAE-NIELVAA 337 (337) Q Consensus 318 YvVEd-~~~~a~iE-nI~~~~a 337 (337) |-+|. ++....=+ -+.+.++ T Consensus 288 ~r~~~r~d~~v~~~~~v~~~~~ 309 (311) T protein:vir:99 288 LRLEIVYGWYVFTDRFVVIENA 309 (311) T ss_pred EEEEEeecceecChhHeeeecc Confidence 33333 33322111 1444444 No 99 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=97.14 E-value=6.5e-05 Score=43.55 Aligned_cols=272 Identities=13% Similarity=0.110 Sum_probs=137.5 Q ss_pred CChHHHHHHHHHHHHHHH--hcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceec Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAK--LNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRT 78 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~--~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt 78 (337) .....+..+..++..... ..+.......+.|-+...+.+.+ ..+.+..++.++++++....|.......++..++-. 
T Consensus 112 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~-~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 190 (397) T protein:vir:96 112 ELAEKRSAINAFVKSKGAEKRDGFTSVEGGALIPQELLQPQLE-PKDIVDLSKYVRSVPVNSASGKFPVISKSGSKMATV 190 (397) T ss_pred HHHHHHHHHHHHHHhhhhhhhhcccccccccchhHHHHHHHHH-hhhhhhHHHhhhhccccccceeEEEEeccCCccccc Confidence 223344555555544321 22333445566777777888776 466777899999999988877765554443333322 Q ss_pred cCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhh Q lcl|NC_015266. 79 DTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAAN 158 (337) Q Consensus 79 ~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~n 158 (337) ..+ ..........++...+.+++.---..++.+.|+... ++++..+.+.+.+.++.-.-.--++|+..+.. T Consensus 191 ~E~-~~~~~~~~~~~~~i~~~~~~~~~~~~~s~ell~ds~--~~l~~~i~~~l~~~~~~~~~~~i~~g~g~~~~------ 261 (397) T protein:vir:96 191 QQL-EKNPQLANPKMVEIDYSVATRRGYIPISQEMIDDAS--YDVTGLIADEIQDQSLNTKNADIAAVLKTATA------ 261 (397) T ss_pred ccc-ccccccccccccceeecHhHhhcchhhHHHHHhhhH--HHHHHHHHHHHHHHHHHHHHHHHhhccccccc------ Confidence 211 112112223455556666665555677888887643 46888888888887765433323333321110 Q ss_pred hhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHH Q lcl|NC_015266. 159 PLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYF 238 (337) Q Consensus 159 PllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~ 238 (337) .+ . .+.|.++ +++...+++.+ + -+++|.+..+.. -. 
T Consensus 262 -----------------------------~~------~---~~~d~~~-~~~~~~~~~~~-~---a~~v~n~~~~~~-l~ 297 (397) T protein:vir:96 262 -----------------------------KS------V---VGVDGLK-DLINKEIKKVY-D---VKLFISASMYSE-LD 297 (397) T ss_pred -----------------------------cc------c---cchHHHH-HHHHHhhhhhc-C---cEEEEcHHHHHH-HH Confidence 00 0 1233333 45555566644 2 378999877642 12 Q ss_pred HHHhccCChhHHHHHHHH-HhhhhhcCceeEECCcc-CCC-----ceEEecccccEEEEecCceEEeEeeccccceecch Q lcl|NC_015266. 239 PIVNTTQAPTEQLAADLI-VSQKRIGNLPAVRVPFF-PKR-----AMMVTKLENLSIYFQEGARRRSLIDNPKRDQIENY 311 (337) Q Consensus 239 ~l~n~~~~ptE~~A~~~~-~~~k~igGl~a~~vPff-P~~-----~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e~y 311 (337) .|-...+.|- ....+. ....++-|+|++..+.. |+. .+++-.|++.-..+..++.+-...+.... T Consensus 298 ~lkd~~G~~~--~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~~~~------ 369 (397) T protein:vir:96 298 KLKDKNGRYL--LQDSITAASGKQLLGKEVVVLDDDVIGKSVGNVVGFIGDAKAFASFFDRKQVSVSWVDNNIY------ 369 (397) T ss_pred HhhccCCCeE--eccCccCCCcccccccceEEecccccCCCCCceEEEEeehhcceEeEeecceEEEEeccccc------ Confidence 2222222221 000110 12357889999876654 333 27887888754334334444333332211 Q ss_pred hhhcccc-eeecCCcEEEee----ceeeccC Q lcl|NC_015266. 312 ESSNDAY-VVEDFGCGCVAE----NIELVAA 337 (337) Q Consensus 312 ~s~Ne~Y-vVEd~~~~a~iE----nI~~~~a 337 (337) . .++ +++.++....-. -+++..| T Consensus 370 -~--~~~~~~~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 370 -G--QLLAGIIRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred -c--eeEEEEEEEccEEecccceEEEEeecC Confidence 1 112 333344333222 2344444 No 100 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=97.10 E-value=0.00012 Score=42.16 Aligned_cols=264 Identities=11% Similarity=0.115 Sum_probs=140.9 Q ss_pred HHHhcCcc-cccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcc--ccccccceeccCCCcccccccccc Q lcl|NC_015266. 
16 IAKLNDTD-DVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGL--SVSGPIASRTDTTKAERQPIDPTA 92 (337) Q Consensus 16 ~a~~ngv~-~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~--gv~g~iagRt~t~~~~R~~~~~~~ 92 (337) +.+..... ...-.+.|-+.+.+.+.+.+.+.+.+++..+++++....|..... ...++.++-+..+.. ....+... T Consensus 1 ~l~~~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~-~~~~~~~~ 79 (293) T protein:vir:48 1 MLDSKTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGK-IADIDDPK 79 (293) T ss_pred CceeecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCcc-cccccccc Confidence 33322222 223456777777899999999999999999999998888874433 333444444433221 11134456 Q ss_pred cCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhhhhhccchhHHHHH Q lcl|NC_015266. 93 LDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANPLLQDVNIGWLQQY 172 (337) Q Consensus 93 l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nPllqDVNkGWlq~~ 172 (337) ++...+.+++.---..|+.+.|+... .+++..+++.+.++++.-.-.--++|.. T Consensus 80 ~~~i~l~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~la~~~~~~~~~~i~~g~~------------------------ 133 (293) T protein:vir:48 80 LSLIKYTIKRYAGISTVTNSLLADSA--ENILAWLSGWIAKKVVVTRNKAILGVVD------------------------ 133 (293) T ss_pred eeEEEEeeeEEEEeehhhHHHHhhhh--HHHHHHHHHHHHHHHHHHHHhHHhhccc------------------------ Confidence 77888999999888999999998653 4788888888888876422111111111 Q ss_pred HhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHHHHHhccCCh--hHH Q lcl|NC_015266. 173 RDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNTTQAP--TEQ 250 (337) Q Consensus 173 Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~~l~n~~~~p--tE~ 250 (337) .. +..+.=.++|.++. ++.. +++.++... +++|.+..++. ...+-...+.| ... 
T Consensus 134 ----------~~---------~~~~~~~~~d~i~~-~~~~-l~~~~~~~a--~~vmn~~~~~~-L~~lkd~~g~~l~~~~ 189 (293) T protein:vir:48 134 ----------KL---------PTKPTLTKWDDIID-LEAK-VDPAIKQTS--FFLTNTSGFTA-LKKVKNALGDYLMERD 189 (293) T ss_pred ----------cc---------cccccccCHHHHHH-HHHh-hhhhhcCCC--EEEEcHHHHHH-HHHhhccCCceEeecC Confidence 00 01111134455443 5554 466677654 88898887653 22222222221 000 Q ss_pred HHHHHHHhhhhhcCceeEECC--ccCCC-----ceEEeccccc-EEEEecCceEEeEeec----cccceecchhhhcccc Q lcl|NC_015266. 251 LAADLIVSQKRIGNLPAVRVP--FFPKR-----AMMVTKLENL-SIYFQEGARRRSLIDN----PKRDQIENYESSNDAY 318 (337) Q Consensus 251 ~A~~~~~~~k~igGl~a~~vP--ffP~~-----~ilvT~l~NL-sIY~Q~gs~RR~~~d~----p~r~r~e~y~s~Ne~Y 318 (337) .. -....++-|+|++.++ ++|.. .+++-.+++. -|..+.+- +=...+. -+++.+.-+-..--++ T Consensus 190 ~~---~~~~~~l~G~Pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~-~i~~~~~~~~~~~~~~~~~r~~~r~d~ 265 (293) T protein:vir:48 190 VK---SPTGYSIAGFAVKEISDRWLPNASSGVMPLYFGDLKQAVTLFDRQQM-SLLSTNIGGGAFETDTTKVRVIDRFDV 265 (293) T ss_pred cC---CCCCceecceeeEEecccccCCccCCceEEEEEeccceEEEEEecce-EEEEecccchhhhcCeEEEEEEEeeCc Confidence 00 0124589999998754 45543 2566667763 44444332 2111111 0111111111111234 Q ss_pred eeecCCcEEEeeceeeccC Q lcl|NC_015266. 319 VVEDFGCGCVAENIELVAA 337 (337) Q Consensus 319 vVEd~~~~a~iEnI~~~~a 337 (337) ++-+..+++.++ +..+ T Consensus 266 ~~~~~~a~~~l~---~~~~ 281 (293) T protein:vir:48 266 VATDTEAFVPAS---FKAI 281 (293) T ss_pred EEecccceEEEE---eecc Confidence 444555554443 3333 No 101 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=97.00 E-value=0.00022 Score=40.67 Aligned_cols=294 Identities=10% Similarity=0.060 Sum_probs=141.1 Q ss_pred CCh--HHHHHHHHHHHHH-----------------------HHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcc-cc Q lcl|NC_015266. 
1 MKK--ETRQAYRKYAAQI-----------------------AKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKS-IN 54 (337) Q Consensus 1 M~~--~tr~~~~~y~~~~-----------------------a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~-In 54 (337) +.. .....+..+...+ +...+....+-.+.|-......+.+.+++++.+++. .+ T Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~liP~~~~~~ii~~l~~~~~l~~~~~~ 162 (428) T protein:vir:10 83 AEPKQYTGAGMTRMVMSIAAAQGNLQDAAKFASDELNDQSVSMAISTAAGSGGVLIPQNIHSEVIELLRDRTIVRKLGAR 162 (428) T ss_pred cccchhhhHHHHHHHHHHHHhhhhHHHHHHHhhhhhhhhhHhhhhcccccCCccccchhHHHHHHHHHhhhchhhhhcce Confidence 000 0000010111100 000111111123345556678899999999988776 44 Q ss_pred cccchhhhhh-hhccccccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHH Q lcl|NC_015266. 55 ILPVTELEGE-KLGLSVSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILN 133 (337) Q Consensus 55 v~~V~~~~Ge-~v~~gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~ 133 (337) +++.. .|. ++-.-.+++-++-+..+ ...|..-..++...|..++.---..|+.+.|+. ..++|+..+.+.+.+ T Consensus 163 ~~~~~--~g~~~~p~~~~~~~a~~v~Eg--~~~~~~~~~f~~i~~~~~k~~~~v~is~ell~d--s~~~l~~~i~~~l~~ 236 (428) T protein:vir:10 163 SIPLP--NGNMSLPRLAGGATASYTGEN--QDAKVSEARFDDVKLTAKTMIAMVPISNALIGR--AGFNVEQLVLQDILT 236 (428) T ss_pred eeecC--CcceEEEEEeCCcceeeeccC--ccccccccceeeEEeeeEEEEEeehhhHHHHhh--hhHHHHHHHHHHHHH Confidence 54443 232 11111233444333222 222222344666667777777778999998874 136899999999999 Q ss_pred HHhhchhhhcccceeccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCcee--cCCCcccccHHHHHHHHHh Q lcl|NC_015266. 134 QSALDRIMIGWNGVKAALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVL--VGKGGDYVNLDALVMDIVS 211 (337) Q Consensus 134 ~~alD~i~IGfNG~s~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~--~G~ggdy~nLDalV~da~~ 211 (337) +++.-+-.--+||+-.. .+|. -+++........+. .+...++..+|.++.-+.. 
T Consensus 237 ai~~~~d~~~l~G~G~~------~~p~------------------Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 292 (428) T protein:vir:10 237 AISVREDKAFMRDDGTG------DTPI------------------GMKARATQWNRLLPWAADAAVNLDTIDTYLDSIIL 292 (428) T ss_pred HHHHHHHHHHhccCCCC------cccc------------------ccccccccccccccccccccccHHHHHHHHHHHHH Confidence 88866655666885321 1222 22222111111111 1234455555544332211 Q ss_pred -cccChhHcCCCCeEEEeChHHHHHHHHHHHh-ccCChhHHHHHHHHHhhhhhcCceeEECCccCCCc--------eEEe Q lcl|NC_015266. 212 -SMIDPWFQEDTGLVVICGRELLHDKYFPIVN-TTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRA--------MMVT 281 (337) Q Consensus 212 -~li~~~~r~~~dLVvivG~dLla~k~~~l~n-~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~--------ilvT 281 (337) ......+.. ..+++|.+..+.. +..+. ..+.| +--. ..+.+|.|+|++..+++|.+. +++- T Consensus 293 ~~~~~~~~~~--~~~~v~n~~~~~~--L~~lkd~~G~~---i~~~--~~~g~l~G~pv~~~~~~p~~~~~~~~~~~i~~g 363 (428) T protein:vir:10 293 MSMDGNSNMI--SSGWGMSNRTYMK--LFGLRDGNGNK---VYPE--MAQGMLKGYPIQRTSAIPANLGEGGKESEIYFA 363 (428) T ss_pred hhhccccccc--cCEEEEcHHHHHH--HHHhhccCCce---eccC--CCCCeeeceeeEEeccccccccCCCccceEEEE Confidence 111222222 3578998887652 22232 21122 1001 134579999999999999863 5666 Q ss_pred cccccEEEEecCceEEeEeeccc----cceecchhhhcc---------cceeecCCcEEEeeceee Q lcl|NC_015266. 282 KLENLSIYFQEGARRRSLIDNPK----RDQIENYESSND---------AYVVEDFGCGCVAENIEL 334 (337) Q Consensus 282 ~l~NLsIY~Q~gs~RR~~~d~p~----r~r~e~y~s~Ne---------~YvVEd~~~~a~iEnI~~ 334 (337) .++++-|.. .+..+-.+-+... -..+.+++..|. ++.|=++++++.+.+|+. 
T Consensus 364 d~s~~~i~~-~~~i~i~~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 364 DFNDVVIGE-DGNMKVDFSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred ecceEEEEE-ecceEEEeecccccccccccccchhhcchhheeeeeeeCceeeccceEEEEeccCC Confidence 666544432 2333322211111 011223333333 345666777777777777 No 102 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=96.98 E-value=0.00014 Score=41.79 Aligned_cols=273 Identities=10% Similarity=0.090 Sum_probs=134.2 Q ss_pred CChHHHHHHHHHHHHHHH--------------hcC--c-ccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhh Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAK--------------LND--T-DDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEG 63 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~--------------~ng--v-~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~G 63 (337) +.......+..|+..... .+. . .+..-.+.|-+.+...+.+.+.+.+.+++.++++++...++ T Consensus 98 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~~~~ 177 (402) T protein:vir:93 98 DNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGLEI 177 (402) T ss_pred hhHHHHHHHHHHHHHHHhhhhHHHHHHhHHHHHhhhccCCCcCCccccchhHHHHHHHhHHhhhhhhhhceeeecCCcee Confidence 333332333333322110 011 1 11223567877788899999999999999999998875554 Q ss_pred hhhccccccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhc Q lcl|NC_015266. 64 EKLGLSVSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIG 143 (337) Q Consensus 64 e~v~~gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IG 143 (337) -++.. +++-++-... .......+ ...+...|..++.---+.|+.+.|+.. 
.++|+..+.+.++++++.=..-.- T Consensus 178 p~~~~--~~~~a~~v~E-g~~~~~~~-~~f~~i~~~~~k~~~~i~iS~ell~Ds--~~~l~~~i~~~la~~~~~~e~~~~ 251 (402) T protein:vir:93 178 PRVSY--TLDDDDFITD-VETAKELK-AKGDTVKFTTNKFKVFAAISDTVIHGS--DVDLVNWVENALQSGLAAKERKDA 251 (402) T ss_pred eeeec--cCCccccccc-cccccccc-cccceeeecceeeeeechhhHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHhH Confidence 33322 2222322221 12222222 345666676666666688999988864 346899999999998876211111 Q ss_pred c-cceeccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCC Q lcl|NC_015266. 144 W-NGVKAALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDT 222 (337) Q Consensus 144 f-NG~s~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~ 222 (337) | +| +....| .|++ ..... ..+ ++.+ ..|.|+ +++.+ |++.|+... T Consensus 252 ~~~g-------~g~g~p------~g~~------------~~~~~--~~~---~~~~--~~d~l~-~~~~~-l~~~y~~na 297 (402) T protein:vir:93 252 LAVS-------PKSGLE------HMSF------------YNGSV--KEV---EGAD--MYDAII-NALAD-LHEDYRDNA 297 (402) T ss_pred hhcC-------CCcccc------ceee------------ecccc--ccc---cccc--hHHHHH-HHHhc-cChhhhcCC Confidence 2 22 211122 1222 11100 001 1111 246555 46665 578888754 Q ss_pred CeEEEeChHHHHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeec Q lcl|NC_015266. 223 GLVVICGRELLHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDN 302 (337) Q Consensus 223 dLVvivG~dLla~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~ 302 (337) +++|.+.-+.. ...+....+.+- . .....++-|+|++....+|. +++= |+|.||.. +++... . 
T Consensus 298 --~~imn~~t~~~-~~~~~~d~~~~~--~----~~~~~~llG~PV~~t~~~~~--i~~G---Df~~~~~~--~~~~~~-~ 360 (402) T protein:vir:93 298 --TIYMRYADYVK-IISVLSNGTTNF--F----DTPAEKVFGKPVVFTDAAVK--PIVG---DFNYFGIN--YDGTTY-D 360 (402) T ss_pred --EEEEechHHHH-HHHHHhcCCCcc--c----ccCCccccccceEEecCCCc--eeee---chhhhhhh--hhhhhh-h Confidence 67777653332 222333333321 1 12346788999999999885 5554 45555532 222221 1 Q ss_pred cccceecchhhhcccceeec--------CCcEEEeeceeeccC Q lcl|NC_015266. 303 PKRDQIENYESSNDAYVVED--------FGCGCVAENIELVAA 337 (337) Q Consensus 303 p~r~r~e~y~s~Ne~YvVEd--------~~~~a~iEnI~~~~a 337 (337) +.++ ...-..+|+... .++++.+ ++..| T Consensus 361 ~~~~----~~~~~~~~~~~~r~Dg~v~~~~A~~~l---~ik~~ 396 (402) T protein:vir:93 361 TDKD----VKKGEYLFVLTAWYDQQRTLDSAFRIA---KAKEN 396 (402) T ss_pred hhhc----ccCCceEEEEEEEeCcEEechhheEEE---EeecC Confidence 2221 111223333322 2222221 22222 No 103 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=96.98 E-value=0.00019 Score=40.96 Aligned_cols=273 Identities=10% Similarity=0.056 Sum_probs=137.1 Q ss_pred CChHHHHHHHHHHHHHHHh--------------c--Cc-ccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhh Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKL--------------N--DT-DDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEG 63 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~--------------n--gv-~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~G 63 (337) .+......|..|+.+.... + +. .+..-.+.|-+.+...+.+.+.+.+.+.+.++++++...+. 
T Consensus 83 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~al~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~~~~~~ 162 (387) T protein:vir:93 83 DHEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGLEI 162 (387) T ss_pred hhhHHHHHHHHHHHHHhhhhhhhhhhhhhHHHHHhhccCcCCCCceeechhHHHHHHHHHHhhchhhhheeeeecCCceE Confidence 2222223334444333210 0 11 11123567877788889999999999999999988875443 Q ss_pred hhhccccccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhc Q lcl|NC_015266. 64 EKLGLSVSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIG 143 (337) Q Consensus 64 e~v~~gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IG 143 (337) -++ ..++.-++-...+ .... ..-...+...|.+++.---+.|+.+.|+. ..++|+..+.+.++++++.=..-.. T Consensus 163 p~~--~~~~~~a~~v~E~-~~~~-~~~~~f~~v~~~~~k~~~~~~iS~ell~D--s~~~l~~~i~~~la~~~~~~e~~~~ 236 (387) T protein:vir:93 163 PRV--SYTLDDDDFITDV-ETAK-ELKLKGDTVKFTTNKFKVFAAISDTVIHG--SDVDLVNWVENALQSGLAAKERKDA 236 (387) T ss_pred EEE--eecCCccccccCc-cccc-ccccccceeeeeheeeeeechhhHHHHhh--hHHHHHHHHHHHHHHHHHHHHHHhH Confidence 332 2222333333222 1222 22244666777777777778889888863 1247999999999998875321112 Q ss_pred c-cceeccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCC Q lcl|NC_015266. 144 W-NGVKAALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDT 222 (337) Q Consensus 144 f-NG~s~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~ 222 (337) | +| +....| .|++- .... .. + ++.+ ..|.+ -+++.+ +++.|+... 
T Consensus 237 ~~~g-------~g~g~p------~g~l~------------~~~~--~~--v-~~~~--~~d~i-~~~~~~-l~~~~~~~a 282 (387) T protein:vir:93 237 LAVS-------PKSGLD------HMSFY------------NGSV--KE--V-EGAD--MYDAI-INALAD-LHEDYRDNA 282 (387) T ss_pred hhcC-------CCcccc------ceeee------------cccc--cc--c-cccc--hHHHH-HHHHhc-cChhhhcCC Confidence 2 22 211122 23321 1100 00 1 1111 13554 356665 588888765 Q ss_pred CeEEEeChHHHHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeec Q lcl|NC_015266. 223 GLVVICGRELLHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDN 302 (337) Q Consensus 223 dLVvivG~dLla~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~ 302 (337) +++|.+.-+.. ...++...+.+- + .....+|-|+|++....+|. +++-.|+ -||.. +++...+ T Consensus 283 --~~~mn~~t~~~-~~~~~~d~~~~~--~----~~~~~~llG~PV~~~~~~~~--~~~GDf~---~~~~~--~~~~~~~- 345 (387) T protein:vir:93 283 --TIYMRYADYVK-IISVLSNGTTNF--F----DTPAEKVFGKPVVFTDAAVK--PIVGDFN---YFGIN--YDGTTYD- 345 (387) T ss_pred --EEEEechHHHH-HHHHHhcCCCcc--c----ccCCccccccceEEecCCCc--eeeeehh---hhhee--hhhheee- Confidence 77887643322 233444333331 1 12346888999999999885 5655554 44431 2222221 Q ss_pred cccceecchhhhcccceee--------cCCcEEEeeceeeccC Q lcl|NC_015266. 303 PKRDQIENYESSNDAYVVE--------DFGCGCVAENIELVAA 337 (337) Q Consensus 303 p~r~r~e~y~s~Ne~YvVE--------d~~~~a~iEnI~~~~a 337 (337) +..++.....+|+.. |.++++. 
.++..| T Consensus 346 ----~~~~~~~~~~~~~~~~r~d~~v~~~eA~~~---l~~k~~ 381 (387) T protein:vir:93 346 ----TDKDVKKGEYLFVLTAWYDQQRTLDSAFRI---AKAKEN 381 (387) T ss_pred ----ecccccCCceeEEEEeeeCceeechhheEE---EEeecC Confidence 222222333344433 3333332 223222 No 104 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=96.87 E-value=0.00028 Score=40.03 Aligned_cols=295 Identities=10% Similarity=0.054 Sum_probs=151.4 Q ss_pred CChH----HHHHHHHHHHHHHHhcC---------------------c--ccccceeeecHHHHHHHHHHHHhhhhhhcc- Q lcl|NC_015266. 1 MKKE----TRQAYRKYAAQIAKLND---------------------T--DDVSQKFAVEPSVQQTLETKMQESSAFLKS- 52 (337) Q Consensus 1 M~~~----tr~~~~~y~~~~a~~ng---------------------v--~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~- 52 (337) .+++ .-..|..|...+|..-| + ...+-.+.|-++++..+.+.+.+.+.+.+. T Consensus 20 ~~~~~~~~kg~~~~~~~~a~a~~~g~~~~a~~~a~~~~~~~~~~~a~~~~~~~Gg~lvP~~~~~~ii~~l~~~s~l~~lg 99 (366) T protein:vir:57 20 IKEELQQYKGAGMTRMVMSIAAGKGNLADAAKFAATELGDTGLSMAISTAAGSGGALIPQNMQNEVIELLRDRTVVRILG 99 (366) T ss_pred cccccccccchhHHHHHHHHHhcccchhHHHHHHHHhhcchhhhhhccccccCCccccchhHHHHHHHHHhhhcchhhhc Confidence 1100 00112222222221111 1 111223446556777899999988877665 Q ss_pred cccccchhhhhh-hhccccccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHH Q lcl|NC_015266. 53 INILPVTELEGE-KLGLSVSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVI 131 (337) Q Consensus 53 Inv~~V~~~~Ge-~v~~gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i 131 (337) .++++.. .|. .+-.-.+++-++-+.. 
....|..-..++...+..++.---..|+-+.|+.- .++++..+++.+ T Consensus 100 ~~~v~~~--~g~~~~p~~t~~~~a~wv~E--~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds--~~~~~~~i~~~l 173 (366) T protein:vir:57 100 ARSIPLP--NGNLSMPRLSGGATAGYVGE--GKDVVATGATFDDVKLSAKTMIALVPVSNQLIGRA--GFNVEQLLLGDI 173 (366) T ss_pred eeeeecC--CCceEEEEEeCCcceeeecc--CccccccccceeEEEEeeEEEEEeehhhHHHHhhh--hHHHHHHHHHHH Confidence 5555443 232 1112223344433322 22223333557777888888888888999988743 358999999999 Q ss_pred HHHHhhchhhhcccceeccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCC-ceecCCCcccccHHHHHHHHH Q lcl|NC_015266. 132 LNQSALDRIMIGWNGVKAALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAG-KVLVGKGGDYVNLDALVMDIV 210 (337) Q Consensus 132 ~~~~alD~i~IGfNG~s~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~-~i~~G~ggdy~nLDalV~da~ 210 (337) .++++.-.-.--++|+-.+ .+|. |.+ ........ ....|++.++..+|+++--+. T Consensus 174 ~~a~~~~~d~a~l~G~G~~------~~p~------Gi~------------~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~ 229 (366) T protein:vir:57 174 LSAIATREDKAFLRDDGTG------DTPK------GMK------------AVATAANRLVAWTGTAINLTTIDEYLDSLI 229 (366) T ss_pred HHHHHHHHHHHhhccCCCC------cccc------cee------------eccccccceeeccccccchhhHHHHHHHHH Confidence 9999876666667775321 1222 222 11111111 122356778888888754332 Q ss_pred hcc-cChhHcCCCCeEEEeChHHHHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCC--------ceEEe Q lcl|NC_015266. 211 SSM-IDPWFQEDTGLVVICGRELLHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKR--------AMMVT 281 (337) Q Consensus 211 ~~l-i~~~~r~~~dLVvivG~dLla~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~--------~ilvT 281 (337) ... +...++. ..+++|.+..... ...+-...+.|-= -. 
..+.++-|+|++..+++|++ .+++- T Consensus 230 ~~~~~~~~~~~--~a~~vmn~~~~~~-L~~lkd~~G~~l~---~~--~~~g~l~G~Pvv~s~~ip~~~~~~~~~~~i~~g 301 (366) T protein:vir:57 230 LKHMDSNSNMI--RCGWGLSNRTYMT-LFGLRDGNGNKVY---PE--MSQGILKGYPIQRTSAIPANLGDDGNESEIYFC 301 (366) T ss_pred Hhhhccccccc--cCEEEecHHHHHH-HHhhhccCCceec---cC--CCCCeecceeeEEccccccccccCCCccEEEEE Confidence 211 1122222 4578898887653 2222222222210 00 12457889999999999984 36667 Q ss_pred cccccEEEEecCceEEeEeeccccc----eecchhhhc---------ccceeecCCcEEEeeceee Q lcl|NC_015266. 282 KLENLSIYFQEGARRRSLIDNPKRD----QIENYESSN---------DAYVVEDFGCGCVAENIEL 334 (337) Q Consensus 282 ~l~NLsIY~Q~gs~RR~~~d~p~r~----r~e~y~s~N---------e~YvVEd~~~~a~iEnI~~ 334 (337) .++++-|. ..+..+-.+-+++.+. .+.+-+.+| -++.|-+.+++|.+.+|.- T Consensus 302 dfs~~~i~-~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 302 DFNDVVIG-EDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred ecceEEEE-EecceEEEEeeccccccccccchhhhhcCceeEEeeeeeCcEeeccccEEEEecccC Confidence 77765433 3333332222222110 011111222 2445567888888888887 No 105 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=96.80 E-value=0.00033 Score=39.65 Aligned_cols=275 Identities=9% Similarity=0.091 Sum_probs=135.8 Q ss_pred CChHHH--HHHHHHHHHHH----------------HhcCcc-cccceeeecHHHHHHHHHHHHhhhhhhcccccccchhh Q lcl|NC_015266. 1 MKKETR--QAYRKYAAQIA----------------KLNDTD-DVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTEL 61 (337) Q Consensus 1 M~~~tr--~~~~~y~~~~a----------------~~ngv~-~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~ 61 (337) +....+ ..+..|..... ...+.. +..-.|.|-.++...+.+.+++.+.+.+..+++++... 
T Consensus 46 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~al~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~~~~ 125 (352) T protein:vir:78 46 LNDNEKLVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGL 125 (352) T ss_pred cchhhhHHHHHHHHHHHHhhhhHHHHHHhhHHHHHHHhccCCCCCCceeccHhHHHHHHHHHHhhcchhhheeeEecCCc Confidence 111111 11111111111 111111 22335667667888999999999999999998887643 Q ss_pred hhhhhccccccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhh--ch Q lcl|NC_015266. 62 EGEKLGLSVSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSAL--DR 139 (337) Q Consensus 62 ~Ge~v~~gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~al--D~ 139 (337) +.-++ ..+++-++-...+ ...|..-...+...|..++.---+.|+.+.|+.=+ ++++..+.+.+.+.++. +- T Consensus 126 ~~p~~--~~~~~~a~~v~E~--~~~~~~~~~f~~v~~~~~k~~~~i~is~ell~Ds~--~~l~~~i~~~la~~~~~~e~~ 199 (352) T protein:vir:78 126 EIPRV--SYTLDDDDFITDV--ETAKELKLKGDTVKFTTNKFKVFAAISDTVIHGSD--VDLVNWVENALQSGLAAKERK 199 (352) T ss_pred eEEEE--ecCCCcccccccc--cccccccccceeeeecceeEEeechhhHHHHhhhh--HHHHHHHHHHHHHHHHHHHHH Confidence 33222 2222233322221 11222234566777888877777899999988633 57888899999998864 22 Q ss_pred hhhcccceeccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHc Q lcl|NC_015266. 140 IMIGWNGVKAALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQ 219 (337) Q Consensus 140 i~IGfNG~s~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r 219 (337) ..+| +| +....|. |.| ..... .++ ++.. 
..|.++ +++.+ |++-|+ T Consensus 200 ~~~~-~g-------~g~~~~~------g~l------------~~~~~--~~~---t~~~--~~d~i~-~~~~~-l~~~~~ 244 (352) T protein:vir:78 200 DALA-VS-------PKSGLEH------MSF------------YNGSV--KEV---EGAN--MYDAII-NALAD-LHEDYR 244 (352) T ss_pred hhhh-cC-------CCCcccc------cce------------ecccc--ccc---cccc--hHHHHH-HHHhc-cChhhh Confidence 2222 23 2222222 211 11100 011 1111 135544 45554 588888 Q ss_pred CCCCeEEEeChHHHHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeE Q lcl|NC_015266. 220 EDTGLVVICGRELLHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSL 299 (337) Q Consensus 220 ~~~dLVvivG~dLla~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~ 299 (337) +.. +++|.+..... -..+....+.| .++ ....++-|+|++....+|. +++= |+|.||.. +.+.. T Consensus 245 ~~a--~~~mn~~t~~~-l~~~~~~~~~~--~~~----~~~~~llG~PV~~~~~~~~--~~~G---df~~~~~~--~~~~~ 308 (352) T protein:vir:78 245 DNA--TIYMRYADYVK-IISVLSNGTTN--FFD----TPAEKVFGKPVVFTDAAVK--PIVG---DFNYFGIN--YDGTT 308 (352) T ss_pred cCC--EEEEehHHHHH-HHHHHhccCCc--ccc----cCCccccccceEEecCCCc--eeEe---ehhhhhhh--hhhhe Confidence 865 77887654432 22333333333 111 1235788999999999886 4544 45555531 11111 Q ss_pred eeccccceecchhhhcccceeecCCcEEEe--eceee---ccC Q lcl|NC_015266. 
300 IDNPKRDQIENYESSNDAYVVEDFGCGCVA--ENIEL---VAA 337 (337) Q Consensus 300 ~d~p~r~r~e~y~s~Ne~YvVEd~~~~a~i--EnI~~---~~a 337 (337) +++..++.....+|+....--+..+ |-+.+ ..+ T Consensus 309 -----~~~~~~~~~g~~~f~~~~r~Dg~~~~~eA~~~l~~~a~ 346 (352) T protein:vir:78 309 -----YDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAKES 346 (352) T ss_pred -----eeeeccccCCeeEEEEEeeeCceeechhheEEEEeecc Confidence 1222233333455554443222223 12222 222 No 106 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=96.79 E-value=0.00027 Score=40.19 Aligned_cols=295 Identities=12% Similarity=0.026 Sum_probs=138.8 Q ss_pred CChHHHHHHHHHHHHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceeccC Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~t 80 (337) +.++.|..|+++. + + ......|.|-+++..++.+.+.+.|.+++.++++++.- +.++-...+++.++=..- T Consensus 72 lt~~e~~~~~~~~----~--~-~~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~~~--~~~i~~~~~~~~a~w~~e 142 (383) T protein:vir:78 72 ITNEEIKFFNDIN----K--E-VGYKEETLLPQTVVDEIFEDLTTEHPFLASIGMRTTGL--RTKFLKSETSGVAVWGKI 142 (383) T ss_pred hhHHHHHHHHHHh----c--c-CCCCCccccCHHHHHHHHHHHHhhccceeeeeeEecCC--ceEEEEEcCCcceEEeec Confidence 4444444443322 1 1 12234578888999999999999999999999888742 223444444444432221 Q ss_pred CCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhhh Q lcl|NC_015266. 81 TKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANPL 160 (337) Q Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nPl 160 (337) . .++.......++...+.+++.=--..|+.+.|+.=+ .+++..+++.+.+++|.=.-.--++|+-. 
.-| T Consensus 143 ~-~~~~~~~~~~f~~i~l~~~kl~~~i~is~ell~Ds~--~~ie~~i~~~l~~~~a~~~~~a~i~G~G~-------~qP- 211 (383) T protein:vir:78 143 F-GEIKGQLDATFSDEESIQNKLTAFVVVPKDLEKFGP--AWVKRFVVTQIEEAFAVALESAYIVGDGN-------DKP- 211 (383) T ss_pred c-cccccccCcceeeEeecceeeEeeccchHHHhhccH--HHHHHHHHHHHHHHHHHHHhhheEeccCC-------CCc- Confidence 1 223222223455666666666666789999997522 26788888888888876444445566431 112 Q ss_pred hhccchhHHHHHHhhchhhhcccccccCCceecCC--CcccccHHHHHHHHHhccc----ChhHcCCCCeEEEeChHHHH Q lcl|NC_015266. 161 LQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGK--GGDYVNLDALVMDIVSSMI----DPWFQEDTGLVVICGRELLH 234 (337) Q Consensus 161 lqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~--ggdy~nLDalV~da~~~li----~~~~r~~~dLVvivG~dLla 234 (337) +|+|..+= +......+.. ..+...|. ..+-.++-.++..+.+..- ....+-...++++|++.-.. T Consensus 212 -----~Gil~~~~---~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~ 282 (383) T protein:vir:78 212 -----IGLNRKVG---KGSTVVDGVY-AEKAATGTLTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDAW 282 (383) T ss_pred -----eeeeeccC---Cccccccccc-ccccccchhhhhhhHHHHHHHHHHHhccchhcccchhhhcCceEEEEcCcchh Confidence 35553110 0000011110 00111111 1122222222222222110 01112234578888874222 Q ss_pred HHHHHHHhccCChhHHHHHHHHHhhhhhc--CceeEECCccCCCceEEecccccEEEEecCceEEeEeeccc--cceecc Q lcl|NC_015266. 235 DKYFPIVNTTQAPTEQLAADLIVSQKRIG--NLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNPK--RDQIEN 310 (337) Q Consensus 235 ~k~~~l~n~~~~ptE~~A~~~~~~~k~ig--Gl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~--r~r~e~ 310 (337) +. .|.+...+.+ ++- -++- |++.+.-+++|++.+++-.++.--|.. +++.|-..-+... ++++.- T Consensus 283 ~~-~~~~~~~~~~-----G~~----~t~l~~~~~iv~s~~~p~~~iifgdfs~Y~i~~-r~~~~i~~~~~~~f~~d~~~f 351 (383) T protein:vir:78 283 DV-KKQYTSLNAN-----GVY----VTALPFNLNIIESLFVPEKKAISYVAERYDALI-GGPLDIGTYDQTLAIEDLNLY 351 (383) T ss_pred hh-ccchhccCCC-----Cce----eeecCCCceEEecCCCCcccEEEeeccceEEEe-cccceEEecchhhhhcCceEE Confidence 11 2222111111 110 0222 445777899999999988888855543 3333322211110 000000 Q ss_pred -hhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
311 -YESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 311 -y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) -..|-+|= +=|.+++..++ |++.++ T Consensus 352 ~~~~r~dG~-~~~~~A~~vl~-~~~~~~ 377 (383) T protein:vir:78 352 AAKQFAYGK-AKDDKAAAVWT-LNINPA 377 (383) T ss_pred EEEEEEcCE-EecCCeEEEEE-EEecCC Confidence 00112221 12334444444 555555 No 107 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=96.75 E-value=0.00036 Score=39.44 Aligned_cols=272 Identities=8% Similarity=0.066 Sum_probs=131.4 Q ss_pred CChHHHHHHHHHHHHHHH------------------hcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhh Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAK------------------LNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELE 62 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~------------------~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~ 62 (337) ........+..|+..... ..|. +..-.+.|-+.++..+.+.+.+.+.+++.++++++...+ T Consensus 83 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~-~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~ 161 (387) T protein:vir:96 83 DNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGN-DSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGLE 161 (387) T ss_pred hhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCC-CCCCceeechhHHHHHHHHHHhhchhhhhceeeecCCce Confidence 222222223333322210 0111 112356787788999999999999999999999987655 Q ss_pred hhhhccccccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhh Q lcl|NC_015266. 63 GEKLGLSVSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMI 142 (337) Q Consensus 63 Ge~v~~gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~I 142 (337) .-++.. +++-++-...+. .....+ ...+...|..++.---+.|+++.|+.. .++|+..+.+.+.++++.-..-. 
T Consensus 162 ~p~~~~--~~~~a~~v~Eg~-~~~~~~-~~f~~v~l~~~k~~~~i~iS~ell~ds--~~~l~~~i~~~la~~~~~~e~~~ 235 (387) T protein:vir:96 162 IPRVSY--TLDDDDFITDVE-TAKELK-AKGDTVKFTTNKFKVFAAISDTVIHGS--DVDLVNWVENALQSGLAAKERKD 235 (387) T ss_pred eeeeec--cCCccccccccc-cccccc-cccceeeechheeeeechhhHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHh Confidence 443322 222233222211 111122 334445555555555578899988864 35788999999999886632222 Q ss_pred cc-cceeccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCC Q lcl|NC_015266. 143 GW-NGVKAALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQED 221 (337) Q Consensus 143 Gf-NG~s~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~ 221 (337) -| +| +...-| .|.+ ..... ..+ ++. ...|.++ +++.+ +++.|+.. T Consensus 236 ~~~~g-------~g~g~~------~g~~------------~~~~~--~~~---~~~--~~~d~i~-~~~~~-l~~~y~~n 281 (387) T protein:vir:96 236 ALAVS-------PKSGLE------HMSF------------YNGSV--KEV---EGA--DMYDAII-NALAD-LHEDYRDN 281 (387) T ss_pred HhhcC-------CCcccc------ceee------------ecccc--ccc---ccc--chHHHHH-HHHhc-cChhhhcC Confidence 22 12 111111 1211 11000 000 111 1246555 45665 58888876 Q ss_pred CCeEEEeChHHHHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEee Q lcl|NC_015266. 222 TGLVVICGRELLHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLID 301 (337) Q Consensus 222 ~dLVvivG~dLla~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d 301 (337) . +++|.+.-+.. ...+....+.+- . .....++-|+|++....+|. +++= |+|-||- .+++...+ T Consensus 282 a--~~imn~~t~~~-~~~~~~~~~~~~--~----~~~~~~llG~PV~~~~~~~~--~~~G---Df~~~~~--~~~~~~~~ 345 (387) T protein:vir:96 282 A--TIYMRYADYVK-IISVLSNGTTNF--F----DTPAEKVFGKPVVFTDAAVK--PIVG---DFNYFGI--NYDGTTYD 345 (387) T ss_pred C--EEEEechHHHH-HHHHHhcCCCcc--c----ccCCccccccceEEecCCCc--eeee---chhhhhh--hhhhhhhe Confidence 5 67777654332 223343333331 1 12346788999999999885 5554 4454542 12222221 Q ss_pred ccccceecchhhhcccceeec--------CCcEEEeeceeeccC Q lcl|NC_015266. 
302 NPKRDQIENYESSNDAYVVED--------FGCGCVAENIELVAA 337 (337) Q Consensus 302 ~p~r~r~e~y~s~Ne~YvVEd--------~~~~a~iEnI~~~~a 337 (337) +.+ +...-.-+|++.. .++++. +++..| T Consensus 346 -~~~----~~~~~~~~~~~~~r~Dg~v~~~~A~~~---l~~ka~ 381 (387) T protein:vir:96 346 -TDK----DVKKGEYLFVLTAWYDQQRTLDSAFRI---AKAKEN 381 (387) T ss_pred -ecc----cccCCceEEEEEEEeCcEeechhheEE---EEeecC Confidence 111 1111222333322 222222 223222 No 108 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=96.75 E-value=0.00036 Score=39.44 Aligned_cols=272 Identities=8% Similarity=0.066 Sum_probs=131.4 Q ss_pred CChHHHHHHHHHHHHHHH------------------hcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhh Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAK------------------LNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELE 62 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~------------------~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~ 62 (337) ........+..|+..... ..|. +..-.+.|-+.++..+.+.+.+.+.+++.++++++...+ T Consensus 83 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~-~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~ 161 (387) T protein:vir:94 83 DNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGN-DSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGLE 161 (387) T ss_pred hhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCC-CCCCceeechhHHHHHHHHHHhhchhhhhceeeecCCce Confidence 222222223333322210 0111 112356787788999999999999999999999987655 Q ss_pred hhhhccccccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhh Q lcl|NC_015266. 63 GEKLGLSVSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMI 142 (337) Q Consensus 63 Ge~v~~gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~I 142 (337) .-++.. +++-++-...+. .....+ ...+...|..++.---+.|+++.|+.. .++|+..+.+.+.++++.-..-. 
T Consensus 162 ~p~~~~--~~~~a~~v~Eg~-~~~~~~-~~f~~v~l~~~k~~~~i~iS~ell~ds--~~~l~~~i~~~la~~~~~~e~~~ 235 (387) T protein:vir:94 162 IPRVSY--TLDDDDFITDVE-TAKELK-AKGDTVKFTTNKFKVFAAISDTVIHGS--DVDLVNWVENALQSGLAAKERKD 235 (387) T ss_pred eeeeec--cCCccccccccc-cccccc-cccceeeechheeeeechhhHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHh Confidence 443322 222233222211 111122 334445555555555578899988864 35788999999999886632222 Q ss_pred cc-cceeccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCC Q lcl|NC_015266. 143 GW-NGVKAALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQED 221 (337) Q Consensus 143 Gf-NG~s~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~ 221 (337) -| +| +...-| .|.+ ..... ..+ ++. ...|.++ +++.+ +++.|+.. T Consensus 236 ~~~~g-------~g~g~~------~g~~------------~~~~~--~~~---~~~--~~~d~i~-~~~~~-l~~~y~~n 281 (387) T protein:vir:94 236 ALAVS-------PKSGLE------HMSF------------YNGSV--KEV---EGA--DMYDAII-NALAD-LHEDYRDN 281 (387) T ss_pred HhhcC-------CCcccc------ceee------------ecccc--ccc---ccc--chHHHHH-HHHhc-cChhhhcC Confidence 22 12 111111 1211 11000 000 111 1246555 45665 58888876 Q ss_pred CCeEEEeChHHHHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEee Q lcl|NC_015266. 222 TGLVVICGRELLHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLID 301 (337) Q Consensus 222 ~dLVvivG~dLla~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d 301 (337) . +++|.+.-+.. ...+....+.+- . .....++-|+|++....+|. +++= |+|-||- .+++...+ T Consensus 282 a--~~imn~~t~~~-~~~~~~~~~~~~--~----~~~~~~llG~PV~~~~~~~~--~~~G---Df~~~~~--~~~~~~~~ 345 (387) T protein:vir:94 282 A--TIYMRYADYVK-IISVLSNGTTNF--F----DTPAEKVFGKPVVFTDAAVK--PIVG---DFNYFGI--NYDGTTYD 345 (387) T ss_pred C--EEEEechHHHH-HHHHHhcCCCcc--c----ccCCccccccceEEecCCCc--eeee---chhhhhh--hhhhhhhe Confidence 5 67777654332 223343333331 1 12346788999999999885 5554 4454542 12222221 Q ss_pred ccccceecchhhhcccceeec--------CCcEEEeeceeeccC Q lcl|NC_015266. 
302 NPKRDQIENYESSNDAYVVED--------FGCGCVAENIELVAA 337 (337) Q Consensus 302 ~p~r~r~e~y~s~Ne~YvVEd--------~~~~a~iEnI~~~~a 337 (337) +.+ +...-.-+|++.. .++++. +++..| T Consensus 346 -~~~----~~~~~~~~~~~~~r~Dg~v~~~~A~~~---l~~ka~ 381 (387) T protein:vir:94 346 -TDK----DVKKGEYLFVLTAWYDQQRTLDSAFRI---AKAKEN 381 (387) T ss_pred -ecc----cccCCceEEEEEEEeCcEeechhheEE---EEeecC Confidence 111 1111222333322 222222 223222 No 109 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=96.75 E-value=0.00036 Score=39.44 Aligned_cols=272 Identities=8% Similarity=0.066 Sum_probs=131.4 Q ss_pred CChHHHHHHHHHHHHHHH------------------hcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhh Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAK------------------LNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELE 62 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~------------------~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~ 62 (337) ........+..|+..... ..|. +..-.+.|-+.++..+.+.+.+.+.+++.++++++...+ T Consensus 83 ~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~-~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~ 161 (387) T protein:vir:26 83 DNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGN-DSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGLE 161 (387) T ss_pred hhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCC-CCCCceeechhHHHHHHHHHHhhchhhhhceeeecCCce Confidence 222222223333322210 0111 112356787788999999999999999999999987655 Q ss_pred hhhhccccccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhh Q lcl|NC_015266. 63 GEKLGLSVSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMI 142 (337) Q Consensus 63 Ge~v~~gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~I 142 (337) .-++.. +++-++-...+. .....+ ...+...|..++.---+.|+++.|+.. .++|+..+.+.+.++++.-..-. 
T Consensus 162 ~p~~~~--~~~~a~~v~Eg~-~~~~~~-~~f~~v~l~~~k~~~~i~iS~ell~ds--~~~l~~~i~~~la~~~~~~e~~~ 235 (387) T protein:vir:26 162 IPRVSY--TLDDDDFITDVE-TAKELK-AKGDTVKFTTNKFKVFAAISDTVIHGS--DVDLVNWVENALQSGLAAKERKD 235 (387) T ss_pred eeeeec--cCCccccccccc-cccccc-cccceeeechheeeeechhhHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHh Confidence 443322 222233222211 111122 334445555555555578899988864 35788999999999886632222 Q ss_pred cc-cceeccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCC Q lcl|NC_015266. 143 GW-NGVKAALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQED 221 (337) Q Consensus 143 Gf-NG~s~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~ 221 (337) -| +| +...-| .|.+ ..... ..+ ++. ...|.++ +++.+ +++.|+.. T Consensus 236 ~~~~g-------~g~g~~------~g~~------------~~~~~--~~~---~~~--~~~d~i~-~~~~~-l~~~y~~n 281 (387) T protein:vir:26 236 ALAVS-------PKSGLE------HMSF------------YNGSV--KEV---EGA--DMYDAII-NALAD-LHEDYRDN 281 (387) T ss_pred HhhcC-------CCcccc------ceee------------ecccc--ccc---ccc--chHHHHH-HHHhc-cChhhhcC Confidence 22 12 111111 1211 11000 000 111 1246555 45665 58888876 Q ss_pred CCeEEEeChHHHHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEee Q lcl|NC_015266. 222 TGLVVICGRELLHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLID 301 (337) Q Consensus 222 ~dLVvivG~dLla~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d 301 (337) . +++|.+.-+.. ...+....+.+- . .....++-|+|++....+|. +++= |+|-||- .+++...+ T Consensus 282 a--~~imn~~t~~~-~~~~~~~~~~~~--~----~~~~~~llG~PV~~~~~~~~--~~~G---Df~~~~~--~~~~~~~~ 345 (387) T protein:vir:26 282 A--TIYMRYADYVK-IISVLSNGTTNF--F----DTPAEKVFGKPVVFTDAAVK--PIVG---DFNYFGI--NYDGTTYD 345 (387) T ss_pred C--EEEEechHHHH-HHHHHhcCCCcc--c----ccCCccccccceEEecCCCc--eeee---chhhhhh--hhhhhhhe Confidence 5 67777654332 223343333331 1 12346788999999999885 5554 4454542 12222221 Q ss_pred ccccceecchhhhcccceeec--------CCcEEEeeceeeccC Q lcl|NC_015266. 
302 NPKRDQIENYESSNDAYVVED--------FGCGCVAENIELVAA 337 (337) Q Consensus 302 ~p~r~r~e~y~s~Ne~YvVEd--------~~~~a~iEnI~~~~a 337 (337) +.+ +...-.-+|++.. .++++. +++..| T Consensus 346 -~~~----~~~~~~~~~~~~~r~Dg~v~~~~A~~~---l~~ka~ 381 (387) T protein:vir:26 346 -TDK----DVKKGEYLFVLTAWYDQQRTLDSAFRI---AKAKEN 381 (387) T ss_pred -ecc----cccCCceEEEEEEEeCcEeechhheEE---EEeecC Confidence 111 1111222333322 222222 223222 No 110 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=96.70 E-value=0.0004 Score=39.22 Aligned_cols=293 Identities=14% Similarity=0.172 Sum_probs=139.3 Q ss_pred CChH-HHHHHHHHHHHH-------------HHh-----------------cCc---ccccceeeecHHHHHHHHHHHHhh Q lcl|NC_015266. 1 MKKE-TRQAYRKYAAQI-------------AKL-----------------NDT---DDVSQKFAVEPSVQQTLETKMQES 46 (337) Q Consensus 1 M~~~-tr~~~~~y~~~~-------------a~~-----------------ngv---~~~~~~Fsv~P~~~q~L~~~i~es 46 (337) ..+. ....|..++..+ |+. .|. ...+-.|.+.....+.+.+.+.+. T Consensus 286 ~~~~~kg~~f~~~~~al~~~~g~~~~a~e~a~~~~~~~~~~~~~~~~a~~~~~~~~~~~~Gg~~vp~~~~~~ii~~l~~~ 365 (645) T protein:vir:93 286 EQKLDKGIGFARFAKSLAAAKGVRSEALEVARRQYPDDSRLHHVLKSAVGAGTTTDPQWAGSLSEYQEYAQDFIDYLRPQ 365 (645) T ss_pred hhhhhhhhhHHHHHHHHHhcccchhHHHHHHHhhcccchhhhhhhhhhhhccccccccccCCccCchhhHHHHHHhhhhh Confidence 0000 001122222111 111 111 111235666666788899999988 Q ss_pred hhhhccccc-cc-chhhhh-hhhccccccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccH Q lcl|NC_015266. 47 SAFLKSINI-LP-VTELEG-EKLGLSVSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDF 123 (337) Q Consensus 47 s~FL~~Inv-~~-V~~~~G-e~v~~gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF 123 (337) |-+.+.-.. ++ .....| ..+-.-.+|+.++=+..+ ...|..-..++...+..++.---+.|+=+.|+.. 
.+++ T Consensus 366 svv~~l~~~~~~~~~~~~~~~~ip~~t~~~~a~wv~Eg--~~~~~s~~~f~~v~l~~~kla~~~~iS~ell~ds--~~~~ 441 (645) T protein:vir:93 366 TIIGRFGQGGIPALRQVPFNIRVHAQVSGGAAGWVGEG--KTKPLTKFDFESITFSHAKVSAIAVLTEELIRFS--SPAA 441 (645) T ss_pred hhHHhhccccccccccccCceeeeeeecCcceEEeccC--ccccccccceeEEEEeeEEEEEeehhHHHHHhhc--hHHH Confidence 877654322 11 111122 233333344555444322 2233333456667777777655556666666533 3678 Q ss_pred HHHHHHHHHHHHh--hchhhhcccceecc-CCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccc Q lcl|NC_015266. 124 QQRIRNVILNQSA--LDRIMIGWNGVKAA-LSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYV 200 (337) Q Consensus 124 ~~~~~~~i~~~~a--lD~i~IGfNG~s~A-~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~ 200 (337) +..+++.+.+.++ +|...| +|+..+ ....| . -+....... -..+..+. T Consensus 442 ~~~i~~~l~~aia~~~d~a~l--~g~g~~~~~~~p----~------------------gi~~~~~~~-----~~~~~~~~ 492 (645) T protein:vir:93 442 DALVRNALAEAVVARLDTDFV--DPKKAAVADVSP----A------------------SITHDVKGT-----ASSGNPDA 492 (645) T ss_pred HHHHHHHHHHHHHHHHHHHhh--cCCCcccCCccc----c------------------ceecccccc-----ccccchHH Confidence 8888888888776 455555 443322 11112 1 111111000 01122344 Q ss_pred cHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEE Q lcl|NC_015266. 201 NLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMV 280 (337) Q Consensus 201 nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilv 280 (337) ++..+...+...-+++ +.-|++|.+..... ...+-...+.+ +--++-....++-|+|++...++|++-++. 
T Consensus 493 d~~~~~~~~~~a~~~~-----~~a~~vmn~~~~~~-L~~lkd~~G~~---~~~~~~~~~~tL~G~PV~~s~~vp~~~~~g 563 (645) T protein:vir:93 493 DAEAAFGQFVAANLQP-----TGAVWLMSSTNALA-LSMRKNALGQK---EYPDMTLLGGSFQGLPVIVSQYVGDQLVLV 563 (645) T ss_pred HHHHHHHHHHhcCCCc-----cccEEEEcHHHHHH-HHhccccCCce---eecCCCCCCceeeceeeEEeccCCcceeEe Confidence 5555544433222221 24689999986553 11121111111 101111234589999999999999875544 Q ss_pred ecccccEEEEecCceEE--------eEeeccccce--------ecchhhhc--------ccceeecCCcEEEeeceeecc Q lcl|NC_015266. 281 TKLENLSIYFQEGARRR--------SLIDNPKRDQ--------IENYESSN--------DAYVVEDFGCGCVAENIELVA 336 (337) Q Consensus 281 T~l~NLsIY~Q~gs~RR--------~~~d~p~r~r--------~e~y~s~N--------e~YvVEd~~~~a~iEnI~~~~ 336 (337) .++.+-|- ..+...- .+-+.|.-+. |.-|+.-. -+|.|=++++++.|.+|+.+. T Consensus 564 -d~s~~~ig-~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt~~~~g~ 641 (645) T protein:vir:93 564 -NAPDIYLA-DDGGVAVDMSREASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVITGVNYGS 641 (645) T ss_pred -ccccEEEE-EecceEEEeecceeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEEEecccCCc Confidence 55543221 2222221 1122222221 11122211 366677899999999999999 Q ss_pred C Q lcl|NC_015266. 337 A 337 (337) Q Consensus 337 a 337 (337) | T Consensus 642 ~ 642 (645) T protein:vir:93 642 A 642 (645) T ss_pred c Confidence 9 No 111 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=96.57 E-value=0.0002 Score=40.84 Aligned_cols=304 Identities=11% Similarity=0.059 Sum_probs=133.1 Q ss_pred CChHHHHH------HHHHHHHHHHhcCc--ccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcccccc Q lcl|NC_015266. 1 MKKETRQA------YRKYAAQIAKLNDT--DDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSG 72 (337) Q Consensus 1 M~~~tr~~------~~~y~~~~a~~ngv--~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g 72 (337) |....|.. +..+...+...... 
........|-..+...+.+.+.+.+.+++.++++++.-. ..+.+...+ T Consensus 123 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~vP~~~~~~i~~~l~~~~~l~~~~~v~~~~g~--~~~~~~~~~ 200 (466) T protein:vir:80 123 MPYEQRAALIARSEVKEFLAQVRTLAQQKRAVSGAELTIPDVMLELLRDNMHRYSKLISKVRLRPLKGT--ARQNIAGAI 200 (466) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhccccccccHHHHHHHHHhhhhhhhhhhheeeeecCce--eEeeeecCC Confidence 22222211 12222222211111 111122344455778889999999999999999888521 122222223 Q ss_pred ccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCC Q lcl|NC_015266. 73 PIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALS 152 (337) Q Consensus 73 ~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~ 152 (337) +.++-+..+ ......+ ...+...|.+++.---+.|+.+.|+. ..++|+..+++.+.++++.=.-.--+||+- T Consensus 201 ~~a~wv~E~-~~~~~~~-~~f~~i~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~la~~~~~~~~~ail~G~G---- 272 (466) T protein:vir:80 201 PEGVWTEAV-ANLNELS-LSFSQIEVDGYKVGGFIPIPNSTLED--SDLNLADEILDAIGQAIGFALDKAILYGTG---- 272 (466) T ss_pred cceeecccc-ccccccc-ccccceeecceeeeeehhhhHHHHhc--chHHHHHHHHHHHHHHHHHHHhhheeeccC---- Confidence 333322221 2222233 44667788888887788999999973 123789999999999776544444445532 Q ss_pred CChhhhhhhhccchhHHHHHHhh--chhhhcccccc---cC-C---ceecCCCcccccHHHHHHHHHhcccChhHcCCCC Q lcl|NC_015266. 153 TDKAANPLLQDVNIGWLQQYRDR--AGHRVLHEGAK---EA-G---KVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTG 223 (337) Q Consensus 153 TD~~~nPllqDVNkGWlq~~Re~--a~~~v~~~~~~---~~-~---~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~d 223 (337) ..+| +|+|...-.. ++......... +. . ....+..+.+...|. +.. +..+.+.. ..+. 
T Consensus 273 ---~~~P------~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~-~~~~~~~~--~~~~ 339 (466) T protein:vir:80 273 ---TKMP------VGIVTRLAQTTQPPNWGTKAPAWTNLSTTNLLKIDPTGKSAEEFFSEL-VLK-LSKARANY--SNGM 339 (466) T ss_pred ---CCCc------ceeeecccccccccccccccccccccchhhhhhhhhhccchhhHHHHH-HHH-HHhhhccc--cCCc Confidence 1122 3555321000 00000000000 00 0 000112222222222 221 12222222 2334 Q ss_pred eEEEeChHHHHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeecc Q lcl|NC_015266. 224 LVVICGRELLHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNP 303 (337) Q Consensus 224 LVvivG~dLla~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p 303 (337) .++++..+.... .+.+.-..+....-. ... .....+.|+|++.-|++|.+.+++--++..-|+...+- +-.. .+ T Consensus 340 ~~w~~~~~~~~~-l~~~~~~~~~~g~~~-~~~-~~~~~i~G~pvv~s~~~~~~~~~~g~~~~y~i~~r~~~-~i~~--~~ 413 (466) T protein:vir:80 340 KFWAMSSNTHAV-LMSKAITFNSAGALV-ASL-NNTMPIVGGDIVILDFIPDNDIIGGYGSLYLLAERADI-KLAQ--SE 413 (466) T ss_pred eeEEecchhHHH-hhcccccccCCcccc-ccC-CCcccccccceeecCccCccceeeeccccEEEEeecce-EEEe--ch Confidence 567776664432 122210111111111 110 11224789999999999999998877776555433322 2111 11 Q ss_pred ccceecchhhhcccceee--------cCCcEEEeeceeeccC Q lcl|NC_015266. 304 KRDQIENYESSNDAYVVE--------DFGCGCVAENIELVAA 337 (337) Q Consensus 304 ~r~r~e~y~s~Ne~YvVE--------d~~~~a~iEnI~~~~a 337 (337) +.. |..-+.+|.+. 
+.+.+..++-=++.++ T Consensus 414 ~~~----f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~~~~~ 451 (466) T protein:vir:80 414 HVR----FIEDQTVFKGTARYDGKPVFGEGFVAVNIANANPT 451 (466) T ss_pred hhh----hhcCcEEEEEEEEEccEEeccCceEEEEecCCCcc Confidence 111 11111222222 2233333321111111 No 112 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=96.12 E-value=0.0005 Score=38.70 Aligned_cols=287 Identities=12% Similarity=0.032 Sum_probs=135.1 Q ss_pred CChHHHHHHHHHHHHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceeccC Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~t 80 (337) ++.+-|..|+++. + +. ...-.|.|-++...++.+.+.+.|.+++.++++++.- +.++....+++.++=..- T Consensus 65 l~~~e~~~~~~~~----~--~t-~~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~~~~~--~~~i~~~~~~~~a~W~~e 135 (381) T protein:vir:10 65 LSANQRNFFMDIN----K--SV-GYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGL--RLKFLKSETSGVAVWGKI 135 (381) T ss_pred cCHHHHHHHHHHh----h--cC-CCCCceecCHHHHHHHHHHHHhhcceeeeeeeEecCc--ceEEEeecCCcceEEeec Confidence 3333333333221 1 11 1223567888899999999999999999999988742 334444444444432211 Q ss_pred CCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhhh Q lcl|NC_015266. 81 TKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANPL 160 (337) Q Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nPl 160 (337) ...+.+..-..++...+.+++.---..|+.+.|+...- +++..++..+.+++|.=.-.-=.||+-. 
.-| T Consensus 136 -~~~~~~~~~~~f~~i~l~~~kl~a~i~is~elL~Ds~~--~le~~i~~~la~~~a~~~~~afi~GdG~-------~qP- 204 (381) T protein:vir:10 136 -YGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPA--WIERFVRVQIEEAFAVALETAFLKGTGK-------DQP- 204 (381) T ss_pred -ccccccccCccceeEeecceeEEeeccccHHHHhccHH--HHHHHHHHHHHHHHHHHhhceeEecccC-------CCc- Confidence 12333232334666667777777778899999987542 5788888888887764333323356431 112 Q ss_pred hhccchhHHHHHHhhchhhhcccccccCCceecCC------CcccccHHHHHHHHHhcccChh--HcCCCCeEEEeChHH Q lcl|NC_015266. 161 LQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGK------GGDYVNLDALVMDIVSSMIDPW--FQEDTGLVVICGREL 232 (337) Q Consensus 161 lqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~------ggdy~nLDalV~da~~~li~~~--~r~~~dLVvivG~dL 232 (337) +|+|..+ ++......++. ..+...|. ..-|..|.+++..+- +..-. .......+++|.+.- T Consensus 205 -----~Gil~~~---~~~~~~~~g~~-~~~~~~~~~t~~~~~~~~~~l~~~~~~~~--~~~~~~~~~~~~~~~~vmn~~t 273 (381) T protein:vir:10 205 -----IGLNRQV---QKGVSVTDGAY-PEKEEQGTLTFANPRATVNELTQVFKYHS--TNEKGKSVAVKGNVTMVVNPSD 273 (381) T ss_pred -----eeeeecC---Ccccccccccc-ccccccccccccchhhHHHHHHHHHHhhh--hhhccccccccCceEEEEchhh Confidence 4665311 11111111111 11111111 111333333333221 11110 012235678888764 Q ss_pred HHHHHHHHH---hccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeeccccceec Q lcl|NC_015266. 233 LHDKYFPIV---NTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNPKRDQIE 309 (337) Q Consensus 233 la~k~~~l~---n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e 309 (337) .. +..++. +..+.+. .. .--|+|++.-|+||++.|++--+++--|.-..+ .|=..-+ +.- T Consensus 274 ~~-~l~~~~~~~~~~G~~v-------~~---lp~g~~vv~~~~~p~~~i~fGDfs~Y~i~~r~~-~~i~~~~--~~~--- 336 (381) T protein:vir:10 274 AF-EVQAQYTHLNANGVYV-------TA---LPFNLNVIESTVQEAGKVLTYVKGLYDGYLAGG-INVQKFK--ETL--- 336 (381) T ss_pred HH-hhccccccCCCCCcee-------ec---CCCCceeEEcCCCCcCcEEEEEcccEEEEEecc-cEEEeec--hhh--- Confidence 44 222222 1111110 00 112778999999999999999998866654333 3221111 110 Q ss_pred chhhhcccceeecC--------CcEEEeeceeec---cC Q lcl|NC_015266. 
310 NYESSNDAYVVEDF--------GCGCVAENIELV---AA 337 (337) Q Consensus 310 ~y~s~Ne~YvVEd~--------~~~a~iEnI~~~---~a 337 (337) |..--.+|..--+ ++++.++ |++. +| T Consensus 337 -~~~d~~~f~a~~r~dG~~~~~~A~~v~~-l~~~~~~~~ 373 (381) T protein:vir:10 337 -ALDDMDLYTAKQFAYGKAKDNKVAAVWK-LDLKGHKPA 373 (381) T ss_pred -hhcCceEEEEEEEEcCEEecCCcEEEEE-EeecCCccc Confidence 1111112222111 1111111 1111 11 No 113 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=95.27 E-value=0.0024 Score=34.95 Aligned_cols=281 Identities=11% Similarity=0.021 Sum_probs=141.8 Q ss_pred CChHHHHHHHHHHHHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceeccC Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTDT 80 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~t 80 (337) +.++.|..|+++++. |.+ ....+.|-+++..++.+.+.+.|..++.++++++.- +.++-...+++-++=..- T Consensus 67 lt~ee~~~~~~~~~~-----~~~-~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~~--~~~~~~~~~~~~a~w~~e 138 (377) T protein:vir:98 67 LTAEEIKFFNDIDKN-----VGG-KDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL--RLKALTAETSGTAVWGDI 138 (377) T ss_pred cCHHHHHHHHHHHhc-----cCC-CCCccccCHHHHHHHHHHHHHhhhhhhheeeEecCc--ceEEEEecCCcceeEeec Confidence 777777777766532 222 233567888899999999999999999999887641 223333334343332221 Q ss_pred CCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhhh Q lcl|NC_015266. 81 TKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANPL 160 (337) Q Consensus 81 ~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nPl 160 (337) ...+.+.....++...+.+++.---..|+.+.|+.=+ .+++..+++.+.+++|.=.-.--+||+-. 
.-| T Consensus 139 -~~~~~~~~~~~f~~i~l~~~kl~a~~~is~elL~ds~--~~ie~~i~~~la~~~a~~~~~a~i~G~G~-------~qP- 207 (377) T protein:vir:98 139 -FGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGP--KWIKQFITEQLKEAIAVALELAIVKGDGL-------LQP- 207 (377) T ss_pred -ccccCcccCccceeEeecceeEEeeecccHHhhhccH--hHHHHHHHHHHHHHHHHHHhhceEeccCC-------Ccc- Confidence 1233333334566677777777777889999997522 26888899999999887666666677541 112 Q ss_pred hhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHHHH Q lcl|NC_015266. 161 LQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPI 240 (337) Q Consensus 161 lqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~~l 240 (337) +|+|..+ ..++. ...-..+.++.|...|++. ++... +++.|+.. .+++|-+.-+.... .+ T Consensus 208 -----~Gil~~~---------~~~~~-~~~~~~~~~~~~~~~~~~~-~l~~~-~~~~~~~~--a~~~m~~~t~~~~~-kl 267 (377) T protein:vir:98 208 -----VGLLKDL---------SQPTV-DQSTGRDITTYKTDKEAIA-DLSDL-TPDNAPKK--LVPVMKHLSVNDKK-RP 267 (377) T ss_pred -----eeeeecc---------ccccc-ccccccccccccchhhhHh-hhhhh-chhHHHHH--HHHHHHHHHHHHHh-hh Confidence 3555211 00000 0011112233344334432 23332 34444442 23333333222111 01 Q ss_pred HhccC------ChhHHHHHH---HHH-h---hhhhcCce--eEECCccCCCceEEecccccEEEEecCceEEeEeecccc Q lcl|NC_015266. 241 VNTTQ------APTEQLAAD---LIV-S---QKRIGNLP--AVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNPKR 305 (337) Q Consensus 241 ~n~~~------~ptE~~A~~---~~~-~---~k~igGl~--a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r 305 (337) -...+ .|+.....+ ... + ..++-|+| ++.-+++|++.+++--+++--|+...+ .+= T Consensus 268 kd~~G~~i~~~n~~~~~~~~p~~~~~~~~G~~~t~lg~p~~vv~s~~~p~~~i~fgdf~~Y~i~~r~~-~~i-------- 338 (377) T protein:vir:98 268 LKIAGQVKLILNPEDRWALEAQFTSRNQFGEYVTVLPHGITILESLAVETGKAIAFVANRYDAFMATA-STI-------- 338 (377) T ss_pred hccCCceEEEecccchhhccccccccCCCCccccccCCCceEEecCCCCcccEEEEEecceeEEeecc-eEE-------- Confidence 11000 111111000 000 0 01334455 678899999999988888855544332 221 Q ss_pred ceecchhhhcccceeecCCcEEEee---------------ceeec Q lcl|NC_015266. 
306 DQIENYESSNDAYVVEDFGCGCVAE---------------NIELV 335 (337) Q Consensus 306 ~r~e~y~s~Ne~YvVEd~~~~a~iE---------------nI~~~ 335 (337) ..+.+.|..+|.-.+-++. .|.++ T Consensus 339 ------~~~~~~~~~~d~~~f~~~~r~dg~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 339 ------EEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred ------EeechhhhhcCceEEEEEEEEcCEEeccCcEEEEEEecC Confidence 1222334555543333332 12222 No 114 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=94.95 E-value=0.0031 Score=34.33 Aligned_cols=272 Identities=12% Similarity=0.066 Sum_probs=135.0 Q ss_pred cCcccccceeeecHHHHHHHHHHHHh----hhhhhcccccccchhhhh---hhhc------cccccccceeccCCCcccc Q lcl|NC_015266. 20 NDTDDVSQKFAVEPSVQQTLETKMQE----SSAFLKSINILPVTELEG---EKLG------LSVSGPIASRTDTTKAERQ 86 (337) Q Consensus 20 ngv~~~~~~Fsv~P~~~q~L~~~i~e----ss~FL~~Inv~~V~~~~G---e~v~------~gv~g~iagRt~t~~~~R~ 86 (337) .+++++...-...-++-+.+...+.| .=...+.| +|...-| |.+. .|....+...+++- T Consensus 1 ~~~~~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i---~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~di----- 72 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLTASLNKAYETEYDQNSVVNLF---PVSNEIPGYAKYFEYPVFDGVGIAQIVADYTDDL----- 72 (296) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHhhhhcccccceec---ccccCCCCceeEEEeeeeeccCceeEeCCCcccc----- Confidence 44443322111122223333333332 21222222 3322111 1111 12222222222111 Q ss_pred cccccccCccceeeEeeccccccCHHHHHHHhcCc-cHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhhhhhccc Q lcl|NC_015266. 
87 PIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFP-DFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANPLLQDVN 165 (337) Q Consensus 87 ~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~-dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nPllqDVN 165 (337) |..-.+.+.........--+..+++..|.+.+..+ +...+-..+.++..+..+=.|.|+|.+....+=.-.+|.+.=++ T Consensus 73 p~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g~~GLlN~p~v~~~~ 152 (296) T protein:vir:10 73 PLVDALATERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHGIPSVFDYPNINNVV 152 (296) T ss_pred ceeeccceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEeecCCCcccc Confidence 11112222223333444455566778888888864 68888888889999999999999995433221111122211000 Q ss_pred --hhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHHHHHhc Q lcl|NC_015266. 166 --IGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFPIVNT 243 (337) Q Consensus 166 --kGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~~l~n~ 243 (337) .-|- . .+..|..+++++..+... ......|+ ..++..++.. +++. T Consensus 153 ~~~~W~------~------------------~t~i~~Di~~~~~~l~~~---s~g~~~p~-~l~L~p~~~~-----~L~~ 199 (296) T protein:vir:10 153 SGGSWS------Q------------------PTTAVSDITSLLDIIETS---TNGQHRAT-HLLLPTTARR-----IMQN 199 (296) T ss_pred ccCCcc------C------------------HHHHHHHHHHHHHHHHHh---hCceecce-eEEeCHHHHH-----HHhh Confidence 0120 0 012355566666554421 11233444 3444555443 2332 Q ss_pred cCChhHHHHHHHHHhhhhhcCceeEECCccCCC------ceEE--ecccccEEEEecCceEEeEeeccccceecchhhhc Q lcl|NC_015266. 244 TQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKR------AMMV--TKLENLSIYFQEGARRRSLIDNPKRDQIENYESSN 315 (337) Q Consensus 244 ~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~------~ilv--T~l~NLsIY~Q~gs~RR~~~d~p~r~r~e~y~s~N 315 (337) ....+-....+.+ .+.+.++.++.+|.+... 
.+++ +.-+|+++=+-..- |+.-..-....-.+.|..+- T Consensus 200 ~~~~~~~t~l~~i--k~~~~~l~i~~~~~l~~a~~~g~~~~v~~~~~~~~~~~~v~~~~-~~~~~e~~~l~~~~~~~~~~ 276 (296) T protein:vir:10 200 LVPGTSVSYGEFF--RQNNSGVTVEFVQYLNDYNGTGTSAAIAYEKDPNNMAIEIPEAT-NALPAQPKDLHFKIPVTSKA 276 (296) T ss_pred ccCCCCccHHHHH--HHhcCCceEEEeeeeccCCCCcceEEEEEEcCCceEEEEcCcce-eeecccccCceEEEeeEeeE Confidence 2222333345554 457789999999999763 3344 67888887664433 33333333344455556666 Q ss_pred ccceeecCCcEEEeeceeec Q lcl|NC_015266. 316 DAYVVEDFGCGCVAENIELV 335 (337) Q Consensus 316 e~YvVEd~~~~a~iEnI~~~ 335 (337) -|-+|=.+.++|.+++|+|+ T Consensus 277 ~Gv~i~~P~ai~~~dGI~~~ 296 (296) T protein:vir:10 277 TGLIVYRPLTMAVMKGITFA 296 (296) T ss_pred EEEEEECCceeEEEeeeecC Confidence 67888889999999999999 No 115 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=94.56 E-value=0.0041 Score=33.68 Aligned_cols=277 Identities=10% Similarity=0.070 Sum_probs=135.1 Q ss_pred CChHHHHHHHHHHHHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhh-hccccccccceecc Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEK-LGLSVSGPIASRTD 79 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~-v~~gv~g~iagRt~ 79 (337) ++...+........+.+. +......-.+.|-+.....+.+.+.+.|.+++..++++|.-..|.. +....+++-++-+. T Consensus 89 ~~~~~~~~~~~~~~~~~~-~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~ 167 (392) T protein:vir:10 89 LNAEEREFLEDDLEQRAM-SGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEIT 167 (392) T ss_pred ccHHHHHHHhhhhhhhhc-cccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeec Confidence 122222222222111111 1112223466787778889999999999999999999998777764 33333444443333 Q ss_pred CCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhh Q lcl|NC_015266. 
80 TTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANP 159 (337) Q Consensus 80 t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nP 159 (337) .+ ......+...++......++.---+.|+.+.|+.. .++|+..+.+.+.+.++.-.-.-=+||...+. T Consensus 168 E~-~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~-------- 236 (392) T protein:vir:10 168 EM-GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS--DQNILKYVTKWLGKKSKVTRNVLILGVIEKLT-------- 236 (392) T ss_pred cc-ccccccccccceeEEeeeeeEEEeehhhHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-------- Confidence 22 22212233456777788888877889999999863 36899999999988887633222223322110 Q ss_pred hhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHHH Q lcl|NC_015266. 160 LLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFP 239 (337) Q Consensus 160 llqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~~ 239 (337) .... .+.|.++. +++..+++.|+.. .+++|.+..+.. -.. T Consensus 237 ---------------------------------~~~~---~~~d~i~~-~~~~~l~~~~~~~--a~~vm~~~~~~~-L~~ 276 (392) T protein:vir:10 237 ---------------------------------KQAI---KSLDDIKD-VLNVKLDPAISPN--AILLTNQDGFNY-LDK 276 (392) T ss_pred ---------------------------------ccCc---cCHHHHHH-HHHHhhhhhhccC--CEEEEcHHHHHH-HHH Confidence 0011 23455443 3444568888764 589999987653 112 Q ss_pred HHhccCChh--HHHHHHHHHhhhhhcCce-eEECCccC-CC------c--eEEecccccEEEEecCceEEeEeeccccce Q lcl|NC_015266. 240 IVNTTQAPT--EQLAADLIVSQKRIGNLP-AVRVPFFP-KR------A--MMVTKLENLSIYFQEGARRRSLIDNPKRDQ 307 (337) Q Consensus 240 l~n~~~~pt--E~~A~~~~~~~k~igGl~-a~~vPffP-~~------~--ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r 307 (337) +-...+.|- .-.. -....++-|+| +++++.++ .. . +++-.+++.-.-..++..+- +..+.. 
T Consensus 277 lkd~~G~~l~~~~~~---~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~--~~~~~~-- 349 (392) T protein:vir:10 277 LKDKDGKYILQSDPT---QKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMEL--ASTDVG-- 349 (392) T ss_pred hhccCCCeEeecCcc---CCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEE--EEeccc-- Confidence 211111110 0000 01234666765 44444332 11 1 44444444222122222222 222211 Q ss_pred ecchhhhc-ccceeecC-CcEEEe-ece---eeccC Q lcl|NC_015266. 308 IENYESSN-DAYVVEDF-GCGCVA-ENI---ELVAA 337 (337) Q Consensus 308 ~e~y~s~N-e~YvVEd~-~~~a~i-EnI---~~~~a 337 (337) .+++..| -+|.++-+ +..... +.| ++..+ T Consensus 350 -~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 384 (392) T protein:vir:10 350 -GKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) T ss_pred -cchhhcCceEEEEEEeeccEEecccceEEEEeccc Confidence 1222222 34444443 222222 122 33333 No 116 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=94.56 E-value=0.0041 Score=33.68 Aligned_cols=277 Identities=10% Similarity=0.070 Sum_probs=135.1 Q ss_pred CChHHHHHHHHHHHHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhh-hccccccccceecc Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEK-LGLSVSGPIASRTD 79 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~-v~~gv~g~iagRt~ 79 (337) ++...+........+.+. +......-.+.|-+.....+.+.+.+.|.+++..++++|.-..|.. +....+++-++-+. 
T Consensus 89 ~~~~~~~~~~~~~~~~~~-~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~ 167 (392) T protein:vir:10 89 LNAEEREFLEDDLEQRAM-SGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEIT 167 (392) T ss_pred ccHHHHHHHhhhhhhhhc-cccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeec Confidence 122222222222111111 1112223466787778889999999999999999999998777764 33333444443333 Q ss_pred CCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhh Q lcl|NC_015266. 80 TTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANP 159 (337) Q Consensus 80 t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nP 159 (337) .+ ......+...++......++.---+.|+.+.|+.. .++|+..+.+.+.+.++.-.-.-=+||...+. T Consensus 168 E~-~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~-------- 236 (392) T protein:vir:10 168 EM-GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS--DQNILKYVTKWLGKKSKVTRNVLILGVIEKLT-------- 236 (392) T ss_pred cc-ccccccccccceeEEeeeeeEEEeehhhHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-------- Confidence 22 22212233456777788888877889999999863 36899999999988887633222223322110 Q ss_pred hhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHHH Q lcl|NC_015266. 160 LLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFP 239 (337) Q Consensus 160 llqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~~ 239 (337) .... .+.|.++. +++..+++.|+.. .+++|.+..+.. -.. 
T Consensus 237 ---------------------------------~~~~---~~~d~i~~-~~~~~l~~~~~~~--a~~vm~~~~~~~-L~~ 276 (392) T protein:vir:10 237 ---------------------------------KQAI---KSLDDIKD-VLNVKLDPAISPN--AILLTNQDGFNY-LDK 276 (392) T ss_pred ---------------------------------ccCc---cCHHHHHH-HHHHhhhhhhccC--CEEEEcHHHHHH-HHH Confidence 0011 23455443 3444568888764 589999987653 112 Q ss_pred HHhccCChh--HHHHHHHHHhhhhhcCce-eEECCccC-CC------c--eEEecccccEEEEecCceEEeEeeccccce Q lcl|NC_015266. 240 IVNTTQAPT--EQLAADLIVSQKRIGNLP-AVRVPFFP-KR------A--MMVTKLENLSIYFQEGARRRSLIDNPKRDQ 307 (337) Q Consensus 240 l~n~~~~pt--E~~A~~~~~~~k~igGl~-a~~vPffP-~~------~--ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r 307 (337) +-...+.|- .-.. -....++-|+| +++++.++ .. . +++-.+++.-.-..++..+- +..+.. T Consensus 277 lkd~~G~~l~~~~~~---~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~--~~~~~~-- 349 (392) T protein:vir:10 277 LKDKDGKYILQSDPT---QKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMEL--ASTDVG-- 349 (392) T ss_pred hhccCCCeEeecCcc---CCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEE--EEeccc-- Confidence 211111110 0000 01234666765 44444332 11 1 44444444222122222222 222211 Q ss_pred ecchhhhc-ccceeecC-CcEEEe-ece---eeccC Q lcl|NC_015266. 308 IENYESSN-DAYVVEDF-GCGCVA-ENI---ELVAA 337 (337) Q Consensus 308 ~e~y~s~N-e~YvVEd~-~~~a~i-EnI---~~~~a 337 (337) .+++..| -+|.++-+ +..... 
+.| ++..+ T Consensus 350 -~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 384 (392) T protein:vir:10 350 -GKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) T ss_pred -cchhhcCceEEEEEEeeccEEecccceEEEEeccc Confidence 1222222 34444443 222222 122 33333 No 117 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=94.56 E-value=0.0041 Score=33.68 Aligned_cols=277 Identities=10% Similarity=0.070 Sum_probs=135.1 Q ss_pred CChHHHHHHHHHHHHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhh-hccccccccceecc Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEK-LGLSVSGPIASRTD 79 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~-v~~gv~g~iagRt~ 79 (337) ++...+........+.+. +......-.+.|-+.....+.+.+.+.|.+++..++++|.-..|.. +....+++-++-+. T Consensus 89 ~~~~~~~~~~~~~~~~~~-~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~ 167 (392) T protein:vir:10 89 LNAEEREFLEDDLEQRAM-SGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEIT 167 (392) T ss_pred ccHHHHHHHhhhhhhhhc-cccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeec Confidence 122222222222111111 1112223466787778889999999999999999999998777764 33333444443333 Q ss_pred CCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhh Q lcl|NC_015266. 80 TTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANP 159 (337) Q Consensus 80 t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nP 159 (337) .+ ......+...++......++.---+.|+.+.|+.. .++|+..+.+.+.+.++.-.-.-=+||...+. 
T Consensus 168 E~-~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~-------- 236 (392) T protein:vir:10 168 EM-GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS--DQNILKYVTKWLGKKSKVTRNVLILGVIEKLT-------- 236 (392) T ss_pred cc-ccccccccccceeEEeeeeeEEEeehhhHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-------- Confidence 22 22212233456777788888877889999999863 36899999999988887633222223322110 Q ss_pred hhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHHH Q lcl|NC_015266. 160 LLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFP 239 (337) Q Consensus 160 llqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~~ 239 (337) .... .+.|.++. +++..+++.|+.. .+++|.+..+.. -.. T Consensus 237 ---------------------------------~~~~---~~~d~i~~-~~~~~l~~~~~~~--a~~vm~~~~~~~-L~~ 276 (392) T protein:vir:10 237 ---------------------------------KQAI---KSLDDIKD-VLNVKLDPAISPN--AILLTNQDGFNY-LDK 276 (392) T ss_pred ---------------------------------ccCc---cCHHHHHH-HHHHhhhhhhccC--CEEEEcHHHHHH-HHH Confidence 0011 23455443 3444568888764 589999987653 112 Q ss_pred HHhccCChh--HHHHHHHHHhhhhhcCce-eEECCccC-CC------c--eEEecccccEEEEecCceEEeEeeccccce Q lcl|NC_015266. 240 IVNTTQAPT--EQLAADLIVSQKRIGNLP-AVRVPFFP-KR------A--MMVTKLENLSIYFQEGARRRSLIDNPKRDQ 307 (337) Q Consensus 240 l~n~~~~pt--E~~A~~~~~~~k~igGl~-a~~vPffP-~~------~--ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r 307 (337) +-...+.|- .-.. -....++-|+| +++++.++ .. . +++-.+++.-.-..++..+- +..+.. T Consensus 277 lkd~~G~~l~~~~~~---~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~--~~~~~~-- 349 (392) T protein:vir:10 277 LKDKDGKYILQSDPT---QKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMEL--ASTDVG-- 349 (392) T ss_pred hhccCCCeEeecCcc---CCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEE--EEeccc-- Confidence 211111110 0000 01234666765 44444332 11 1 44444444222122222222 222211 Q ss_pred ecchhhhc-ccceeecC-CcEEEe-ece---eeccC Q lcl|NC_015266. 
308 IENYESSN-DAYVVEDF-GCGCVA-ENI---ELVAA 337 (337) Q Consensus 308 ~e~y~s~N-e~YvVEd~-~~~a~i-EnI---~~~~a 337 (337) .+++..| -+|.++-+ +..... +.| ++..+ T Consensus 350 -~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 384 (392) T protein:vir:10 350 -GKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) T ss_pred -cchhhcCceEEEEEEeeccEEecccceEEEEeccc Confidence 1222222 34444443 222222 122 33333 No 118 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=94.56 E-value=0.0041 Score=33.68 Aligned_cols=277 Identities=10% Similarity=0.070 Sum_probs=135.1 Q ss_pred CChHHHHHHHHHHHHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhh-hccccccccceecc Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEK-LGLSVSGPIASRTD 79 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~-v~~gv~g~iagRt~ 79 (337) ++...+........+.+. +......-.+.|-+.....+.+.+.+.|.+++..++++|.-..|.. +....+++-++-+. T Consensus 89 ~~~~~~~~~~~~~~~~~~-~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~ 167 (392) T protein:vir:10 89 LNAEEREFLEDDLEQRAM-SGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEPVRTRSGSRVLEKNSDMIPFAEIT 167 (392) T ss_pred ccHHHHHHHhhhhhhhhc-cccccCCCceecchhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCCccceeec Confidence 122222222222111111 1112223466787778889999999999999999999998777764 33333444443333 Q ss_pred CCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCCChhhhh Q lcl|NC_015266. 80 TTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALSTDKAANP 159 (337) Q Consensus 80 t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~TD~~~nP 159 (337) .+ ......+...++......++.---+.|+.+.|+.. .++|+..+.+.+.+.++.-.-.-=+||...+. 
T Consensus 168 E~-~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~-------- 236 (392) T protein:vir:10 168 EM-GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDS--DQNILKYVTKWLGKKSKVTRNVLILGVIEKLT-------- 236 (392) T ss_pred cc-ccccccccccceeEEeeeeeEEEeehhhHHHHhhh--HHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-------- Confidence 22 22212233456777788888877889999999863 36899999999988887633222223322110 Q ss_pred hhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHHHH Q lcl|NC_015266. 160 LLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKYFP 239 (337) Q Consensus 160 llqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~~~ 239 (337) .... .+.|.++. +++..+++.|+.. .+++|.+..+.. -.. T Consensus 237 ---------------------------------~~~~---~~~d~i~~-~~~~~l~~~~~~~--a~~vm~~~~~~~-L~~ 276 (392) T protein:vir:10 237 ---------------------------------KQAI---KSLDDIKD-VLNVKLDPAISPN--AILLTNQDGFNY-LDK 276 (392) T ss_pred ---------------------------------ccCc---cCHHHHHH-HHHHhhhhhhccC--CEEEEcHHHHHH-HHH Confidence 0011 23455443 3444568888764 589999987653 112 Q ss_pred HHhccCChh--HHHHHHHHHhhhhhcCce-eEECCccC-CC------c--eEEecccccEEEEecCceEEeEeeccccce Q lcl|NC_015266. 240 IVNTTQAPT--EQLAADLIVSQKRIGNLP-AVRVPFFP-KR------A--MMVTKLENLSIYFQEGARRRSLIDNPKRDQ 307 (337) Q Consensus 240 l~n~~~~pt--E~~A~~~~~~~k~igGl~-a~~vPffP-~~------~--ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r 307 (337) +-...+.|- .-.. -....++-|+| +++++.++ .. . +++-.+++.-.-..++..+- +..+.. T Consensus 277 lkd~~G~~l~~~~~~---~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~--~~~~~~-- 349 (392) T protein:vir:10 277 LKDKDGKYILQSDPT---QKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMEL--ASTDVG-- 349 (392) T ss_pred hhccCCCeEeecCcc---CCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEE--EEeccc-- Confidence 211111110 0000 01234666765 44444332 11 1 44444444222122222222 222211 Q ss_pred ecchhhhc-ccceeecC-CcEEEe-ece---eeccC Q lcl|NC_015266. 
308 IENYESSN-DAYVVEDF-GCGCVA-ENI---ELVAA 337 (337) Q Consensus 308 ~e~y~s~N-e~YvVEd~-~~~a~i-EnI---~~~~a 337 (337) .+++..| -+|.++-+ +..... +.| ++..+ T Consensus 350 -~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 384 (392) T protein:vir:10 350 -GKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) T ss_pred -cchhhcCceEEEEEEeeccEEecccceEEEEeccc Confidence 1222222 34444443 222222 122 33333 No 119 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=94.21 E-value=0.0051 Score=33.16 Aligned_cols=292 Identities=10% Similarity=0.009 Sum_probs=144.6 Q ss_pred CChHHHHHHHHH-HHHHHHhcCcccc-cc---eeeecHHHHHHHHHHHHhh-hhhhcccccccchhhhh---hhhcc--- Q lcl|NC_015266. 1 MKKETRQAYRKY-AAQIAKLNDTDDV-SQ---KFAVEPSVQQTLETKMQES-SAFLKSINILPVTELEG---EKLGL--- 68 (337) Q Consensus 1 M~~~tr~~~~~y-~~~~a~~ngv~~~-~~---~Fsv~P~~~q~L~~~i~es-s~FL~~Inv~~V~~~~G---e~v~~--- 68 (337) |+...-..+... ++.-++..|+... .. -|. .++-+.+...+.|- -.=|.--.+++|...-| +.+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~da~~~~g~~~--~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~ 78 (319) T protein:vir:10 1 MTTKKFDEADKSNVEMYLIQAGVKQDAAATMGIWT--AQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTDKTFEYMTF 78 (319) T ss_pred CCCcchhHHhhHHHHHHHhhccchhhhhhhhhhHH--HHHHHHHHHHHHhhhhcceechhhcccccCCCCceEEEEeeee Confidence 888665555444 2334455666422 11 232 22333333333321 11111112223321111 11111 Q ss_pred ---ccccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCc-cHHHHHHHHHHHHHhhchhhhcc Q lcl|NC_015266. 
69 ---SVSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFP-DFQQRIRNVILNQSALDRIMIGW 144 (337) Q Consensus 69 ---gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~-dF~~~~~~~i~~~~alD~i~IGf 144 (337) |....+...+++ -|..-.+.+.....+...--+..+++..|.+.+..+ +...+-+.+..+..+..+=.|.| T Consensus 79 ~~~G~a~~~~d~~~d-----ip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f 153 (319) T protein:vir:10 79 DKVGTAQIIADYTDD-----LPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLVF 153 (319) T ss_pred ccccceeeecCcccc-----ccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEE Confidence 111222222111 011112222223344555556678889999998865 67788888888999999999999 Q ss_pred cceeccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCC-C--cccccHHHHHHHHHhcccChhHcCC Q lcl|NC_015266. 145 NGVKAALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGK-G--GDYVNLDALVMDIVSSMIDPWFQED 221 (337) Q Consensus 145 NG~s~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~-g--gdy~nLDalV~da~~~li~~~~r~~ 221 (337) +|......+=.-.+|.++ .++.+ ++ ...++ . .-|+.+.+++..+... .-+... T Consensus 154 ~G~~~~g~~GLlN~p~~~-----------------~~~~~---~~-~~~~t~t~~~i~~di~~~~~~l~~~---s~g~~~ 209 (319) T protein:vir:10 154 KGSAPHKIVSVFNHPNIT-----------------KITSG---KW-IDVSTMKPETAEAELTQAIETIETI---TRGQHR 209 (319) T ss_pred eecccccceeEEeCCCce-----------------eeecC---CC-CCccccCHHHHHHHHHHHHHHHHHh---cCceee Confidence 995432221111122111 11100 00 00010 0 1234455555554421 112233 Q ss_pred CCeEEEeChHHHHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCc-------eE-EecccccEEEEecC Q lcl|NC_015266. 222 TGLVVICGRELLHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRA-------MM-VTKLENLSIYFQEG 293 (337) Q Consensus 222 ~dLVvivG~dLla~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~-------il-vT~l~NLsIY~Q~g 293 (337) | ...++..++... ++.....+-....+.+ .+...++.++.+|.+...+ ++ ...-+|+++=+-.. 
T Consensus 210 p-~~L~L~p~~~~~-----L~~~~~~~~~t~l~~l--k~~~~~l~I~~~pel~~ag~~g~~~~v~y~~~~~~~~~~v~~~ 281 (319) T protein:vir:10 210 A-TNILIPPSMRKV-----LAIRMPETTMSYLDYF--KSQNSGIEIDSIAELEDIDGAGTKGVLVYEKNPMNMSIEIPEA 281 (319) T ss_pred c-eEEEecHHHHHh-----hhcccCCCCeeHHHHH--HHhcCCceEEEeeeecccCCCcceEEEEEecCCceEEEecCcc Confidence 4 355666665542 3322222334445554 4567899999999998532 33 33577777766443 Q ss_pred ceEEeEeeccccceecchhhhcccceeecCCcEEEeece Q lcl|NC_015266. 294 ARRRSLIDNPKRDQIENYESSNDAYVVEDFGCGCVAENI 332 (337) Q Consensus 294 s~RR~~~d~p~r~r~e~y~s~Ne~YvVEd~~~~a~iEnI 332 (337) .|+.-..-....-.+.|..+--|-+|=.+.++|.+++| T Consensus 282 -~~~~~~e~~~l~~~~~~~~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 282 -FNMLPAQPKDLHFKVPCTSKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred -eeeeeeeecCceEEEeeeeeeEEEEEEccceeEeeecC Confidence 33444434445566677777778888889999999999 No 120 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=94.07 E-value=0.0055 Score=32.97 Aligned_cols=286 Identities=12% Similarity=0.110 Sum_probs=137.3 Q ss_pred CChH---------HHHHH-HHHHHHHHHhcCcc--------------------cccceeeecHHH-HHHHHHHHHhhhhh Q lcl|NC_015266. 1 MKKE---------TRQAY-RKYAAQIAKLNDTD--------------------DVSQKFAVEPSV-QQTLETKMQESSAF 49 (337) Q Consensus 1 M~~~---------tr~~~-~~y~~~~a~~ngv~--------------------~~~~~Fsv~P~~-~q~L~~~i~ess~F 49 (337) |.+. 
....+ ..+...+++.+|.+ ..+-.+.|-|++ .+.+.+.+.+++-+ T Consensus 309 l~rai~a~a~~~~~~a~~~~e~a~~~a~~~G~~arg~~~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i 388 (632) T protein:vir:96 309 LMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAII 388 (632) T ss_pred HHHHHHhhhccchhhhhhhhHHHHHHHHhhhhhhhhhhhhHHHHHHhhhhcccccccccccccccchHHHHHHHhhcchh Confidence 0000 00001 11122233322210 112234455554 57889998887765 Q ss_pred hcccccccchhhhhhhh-ccccccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHH Q lcl|NC_015266. 50 LKSINILPVTELEGEKL-GLSVSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIR 128 (337) Q Consensus 50 L~~Inv~~V~~~~Ge~v-~~gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~ 128 (337) .+ +.+-.+.-..|..- -.-.+|+-++=+..+ ...|..-...+...+..++.---..|+.+.|+.. .++++..++ T Consensus 389 ~~-l~~~~~~~~~g~~~ip~~~~~~~a~wv~E~--~~~~~s~~~f~~i~l~~~k~~~~v~iS~ell~ds--~~~~~~~i~ 463 (632) T protein:vir:96 389 GQ-MGARMLPGLVGDVDIPKKTSGANFYWIGED--EDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQS--SIHVENLIR 463 (632) T ss_pred hh-hcceEeecCCcceEEEEEeCCceeEeecCC--ccccccccceeeEEeeeeEEEEehhhHHHHHhcc--chHHHHHHH Confidence 44 33323333333321 111123333322221 1222233456677777777777788888888753 568999999 Q ss_pred HHHHHHHhhchhhhcccceeccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceec-CCCcccccHHHHHH Q lcl|NC_015266. 129 NVILNQSALDRIMIGWNGVKAALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLV-GKGGDYVNLDALVM 207 (337) Q Consensus 129 ~~i~~~~alD~i~IGfNG~s~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~-G~ggdy~nLDalV~ 207 (337) +.+.+.++.-.-.-.++|+-.+. +|. |.+. ... .+.+.. +.+-+|..+-.|.. 
T Consensus 464 ~~l~~a~~~~~d~a~l~G~G~~~------~p~------Gi~~------------~~~--~~~~~~~~~~~~~~~i~~~~~ 517 (632) T protein:vir:96 464 EDLIEGIGVALDLAMLTGTGLAN------DPV------GLLN------------MTG--VPALTYPAGGVDWASVVDMET 517 (632) T ss_pred HHHHHHHHHHHHHHhhcccCCCC------ccc------eeee------------ccc--ccceecccccCCHHHHHHHHH Confidence 99999998554444557754221 232 3221 110 011111 12235666655543 Q ss_pred HHHhcccChhHcCCCCeEEEeChHHHHHHHH-HHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEeccccc Q lcl|NC_015266. 208 DIVSSMIDPWFQEDTGLVVICGRELLHDKYF-PIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENL 286 (337) Q Consensus 208 da~~~li~~~~r~~~dLVvivG~dLla~k~~-~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NL 286 (337) . |...+.+.+..+++|.......-.. .+....+.| +....++-|+|++...++|++.+++-.++.+ T Consensus 518 ~-----i~~~~~~~~~~~~~~~~~~~~~l~~~~l~d~~G~~--------i~~~~~l~G~pv~~s~~ip~~~~~~gd~s~~ 584 (632) T protein:vir:96 518 K-----ISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGER--------IWQNNEVNGYRAEASNQIPADTWIFGDWSQI 584 (632) T ss_pred H-----HhhcccccCccEEEEchhHHHHHHHHhccCCCCce--------eecCCeecccceEeccccccCcEEEeecceE Confidence 2 3444556667899998765543221 122222222 2235688899999999999999999999886 Q ss_pred EEEEecCceEEeEeeccccceecchhhhccccee-ecCCcEEEe-eceeeccC Q lcl|NC_015266. 287 SIYFQEGARRRSLIDNPKRDQIENYESSNDAYVV-EDFGCGCVA-ENIELVAA 337 (337) Q Consensus 287 sIY~Q~gs~RR~~~d~p~r~r~e~y~s~Ne~YvV-Ed~~~~a~i-EnI~~~~a 337 (337) -|.. .++.+-.+ .|. 
..+.+-.-.+.+ ++++....- |.|.++.- T Consensus 585 ~i~~-~~~~~i~~--~~~----~~~~~~~v~~~~~~~~d~~v~~~~af~~~k~ 630 (632) T protein:vir:96 585 VIAM-WGVLDLKV--DPY----TKAASDGLVLRVFQDVDAGVRRKEAFCIAKK 630 (632) T ss_pred EEEE-ecceEEEE--ccc----cccccCceEEEEEeecCceeechhhhhheee Confidence 4443 34433322 111 111111112222 222222222 22222222 No 121 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=94.03 E-value=0.0056 Score=32.92 Aligned_cols=258 Identities=13% Similarity=0.118 Sum_probs=124.0 Q ss_pred HHHhcCcccccceeeecHHH-HHHHHHHHHhhhhhhccccccc-chhhhhhhhccccccccceeccCCCccccccccccc Q lcl|NC_015266. 16 IAKLNDTDDVSQKFAVEPSV-QQTLETKMQESSAFLKSINILP-VTELEGEKLGLSVSGPIASRTDTTKAERQPIDPTAL 93 (337) Q Consensus 16 ~a~~ngv~~~~~~Fsv~P~~-~q~L~~~i~ess~FL~~Inv~~-V~~~~Ge~v~~gv~g~iagRt~t~~~~R~~~~~~~l 93 (337) +|..+ .+..=-+.|++ ++.+.+.+++++.|-+..++.. ...+.|..+.+=.-..+..-..-+.+.--|..-.+. T Consensus 1 MA~~~----T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~ 76 (272) T protein:vir:98 1 MAVGT----TKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGF 76 (272) T ss_pred CCCcc----ccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCccccccccc Confidence 22111 00000355644 3445567777777655444422 111223333331111111111112222234444555 Q ss_pred CccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhch--hhhcccceeccCCCChhhhhhhhccchhHHHH Q lcl|NC_015266. 94 DSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDR--IMIGWNGVKAALSTDKAANPLLQDVNIGWLQQ 171 (337) Q Consensus 94 ~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~--i~IGfNG~s~A~~TD~~~nPllqDVNkGWlq~ 171 (337) +......++.- ..++...++.....+|+...+.+.+...++... ..++ . 
T Consensus 77 ~~~~~~~~~~~--~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~---------------------------~ 127 (272) T protein:vir:98 77 KKTTMTIKKAG--KGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLD---------------------------A 127 (272) T ss_pred ceEEEEeeeee--eeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHH---------------------------H Confidence 56666666643 456666677777789999999999888876432 2221 0 Q ss_pred HHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHH-HHHHhccCChhHH Q lcl|NC_015266. 172 YRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKY-FPIVNTTQAPTEQ 250 (337) Q Consensus 172 ~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~-~~l~n~~~~ptE~ 250 (337) ++... ..++.+. ++|+ +.|++.. ++... .+.-+++|+++..+.-+ ..+.+- ...++- T Consensus 128 ---------~~~a~-----~~~~~~~---t~d~-i~da~~~-l~~~~--~~~~~~vv~p~~~~~L~k~~~~~~-~~~~~~ 185 (272) T protein:vir:98 128 ---------LSKST-----QTVEATA---TVDG-VSKALDI-FNDED--DAETVIVMNPADASTLRLDAAKEW-LGATEV 185 (272) T ss_pred ---------hcccc-----ccccccc---CHHH-HHHHHHH-HhccC--CCccEEEEcHHHHHHHHHhccccc-cccccc Confidence 11000 1112222 3454 3455654 34432 33458899998655321 111211 111111 Q ss_pred HHHHHHH--hhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeeccccceecchhhhcccceeecCCcEEE Q lcl|NC_015266. 251 LAADLIV--SQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNPKRDQIENYESSNDAYVVEDFGCGCV 328 (337) Q Consensus 251 ~A~~~~~--~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e~y~s~Ne~YvVEd~~~~a~ 328 (337) . .+.+. ...+|.|+|++.-+++|++.+++-.-..+.++.+.+.. ++.+.+-++..+.- ++-.-|+..-. T Consensus 186 ~-~~~~~~g~ig~i~G~~Vi~s~~~p~~t~~~~~~~a~~~~~~~~~~---ve~~r~~~~~~~~i-----~~~~~~~~~v~ 256 (272) T protein:vir:98 186 G-ANRVVSGVYGEVLGVQIVRSRKCPKGTAYMVRKGALRIMLKRNTM---VETDRDITKAINQI-----VANKHYGVYLY 256 (272) T ss_pred c-ccccccccchhhcCeeEEEcCCCCcceEEEEcCCeEEEEecCCce---eeeccccccceeEE-----EEEEEEEEEEE Confidence 1 01111 12489999999999999999999998999888877643 22221111122222 22233332211 Q ss_pred ee----ceeeccC Q lcl|NC_015266. 
329 AE----NIELVAA 337 (337) Q Consensus 329 iE----nI~~~~a 337 (337) -+ .+++.+| T Consensus 257 ~~~~vv~~t~~~a 269 (272) T protein:vir:98 257 KAEKAVKITLKDA 269 (272) T ss_pred cCCceEEEEeccc Confidence 11 3577777 No 122 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=94.03 E-value=0.0056 Score=32.92 Aligned_cols=258 Identities=13% Similarity=0.118 Sum_probs=124.0 Q ss_pred HHHhcCcccccceeeecHHH-HHHHHHHHHhhhhhhccccccc-chhhhhhhhccccccccceeccCCCccccccccccc Q lcl|NC_015266. 16 IAKLNDTDDVSQKFAVEPSV-QQTLETKMQESSAFLKSINILP-VTELEGEKLGLSVSGPIASRTDTTKAERQPIDPTAL 93 (337) Q Consensus 16 ~a~~ngv~~~~~~Fsv~P~~-~q~L~~~i~ess~FL~~Inv~~-V~~~~Ge~v~~gv~g~iagRt~t~~~~R~~~~~~~l 93 (337) +|..+ .+..=-+.|++ ++.+.+.+++++.|-+..++.. ...+.|..+.+=.-..+..-..-+.+.--|..-.+. T Consensus 1 MA~~~----T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i~~~~~~~ 76 (272) T protein:vir:30 1 MAVGT----TKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAIPMTQLGF 76 (272) T ss_pred CCCcc----ccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCCCccccccccc Confidence 22111 00000355644 3445567777777655444422 111223333331111111111112222234444555 Q ss_pred CccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhch--hhhcccceeccCCCChhhhhhhhccchhHHHH Q lcl|NC_015266. 94 DSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDR--IMIGWNGVKAALSTDKAANPLLQDVNIGWLQQ 171 (337) Q Consensus 94 ~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~--i~IGfNG~s~A~~TD~~~nPllqDVNkGWlq~ 171 (337) +......++.- ..++...++.....+|+...+.+.+...++... ..++ . 
T Consensus 77 ~~~~~~~~~~~--~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~---------------------------~ 127 (272) T protein:vir:30 77 KKTTMTIKKAG--KGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLD---------------------------A 127 (272) T ss_pred ceEEEEeeeee--eeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHH---------------------------H Confidence 56666666643 456666677777789999999999888876432 2221 0 Q ss_pred HHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHHHHHH-HHHHhccCChhHH Q lcl|NC_015266. 172 YRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELLHDKY-FPIVNTTQAPTEQ 250 (337) Q Consensus 172 ~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLla~k~-~~l~n~~~~ptE~ 250 (337) ++... ..++.+. ++|+ +.|++.. ++... .+.-+++|+++..+.-+ ..+.+- ...++- T Consensus 128 ---------~~~a~-----~~~~~~~---t~d~-i~da~~~-l~~~~--~~~~~~vv~p~~~~~L~k~~~~~~-~~~~~~ 185 (272) T protein:vir:30 128 ---------LSKST-----QTVEATA---TVDG-VSKALDI-FNDED--DAETVIVMNPADASTLRLDAAKEW-LGATEV 185 (272) T ss_pred ---------hcccc-----ccccccc---CHHH-HHHHHHH-HhccC--CCccEEEEcHHHHHHHHHhccccc-cccccc Confidence 11000 1112222 3454 3455654 34432 33458899998655321 111211 111111 Q ss_pred HHHHHHH--hhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeeccccceecchhhhcccceeecCCcEEE Q lcl|NC_015266. 251 LAADLIV--SQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNPKRDQIENYESSNDAYVVEDFGCGCV 328 (337) Q Consensus 251 ~A~~~~~--~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e~y~s~Ne~YvVEd~~~~a~ 328 (337) . .+.+. ...+|.|+|++.-+++|++.+++-.-..+.++.+.+.. ++.+.+-++..+.- ++-.-|+..-. T Consensus 186 ~-~~~~~~g~ig~i~G~~Vi~s~~~p~~t~~~~~~~a~~~~~~~~~~---ve~~r~~~~~~~~i-----~~~~~~~~~v~ 256 (272) T protein:vir:30 186 G-ANRVVSGVYGEVLGVQIVRSRKCPKGTAYMVRKGALRIMLKRNTM---VETDRDITKAINQI-----VANKHYGVYLY 256 (272) T ss_pred c-ccccccccchhhcCeeEEEcCCCCcceEEEEcCCeEEEEecCCce---eeeccccccceeEE-----EEEEEEEEEEE Confidence 1 01111 12489999999999999999999998999888877643 22221111122222 22233332211 Q ss_pred ee----ceeeccC Q lcl|NC_015266. 
329 AE----NIELVAA 337 (337) Q Consensus 329 iE----nI~~~~a 337 (337) -+ .+++.+| T Consensus 257 ~~~~vv~~t~~~a 269 (272) T protein:vir:30 257 KAEKAVKITLKDA 269 (272) T ss_pred cCCceEEEEeccc Confidence 11 3577777 No 123 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=85.17 E-value=0.055 Score=27.50 Aligned_cols=295 Identities=12% Similarity=0.102 Sum_probs=134.8 Q ss_pred CChH---HHHHHHHHHHHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcccccccccee Q lcl|NC_015266. 1 MKKE---TRQAYRKYAAQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASR 77 (337) Q Consensus 1 M~~~---tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagR 77 (337) |++- ||.-+ +..+ ++++-. + ..-.-...++.+.++-|+++.++-.++. |..+-+ |..|| T Consensus 1 ms~~~~~t~~~~-------~~s~--~d~al~--l-e~f~geV~~af~~~s~~~~~~~~rti~~--g~s~~~----~~iG~ 62 (335) T protein:vir:78 1 MSFLNDLTRPNY-------AGKN--ADVDIH--L-EEHLGIVDKHFAYTSKFAPLMNIRDLRG--SNVVRL----DRLGN 62 (335) T ss_pred CCcccccccccc-------cccc--chhhhh--h-hhhhhHHHHHHHHhhhhccccceeeecc--ceeEEE----eeeee Confidence 5543 22211 1111 111111 1 1112344567788888998877765422 333332 23334 Q ss_pred ccCC-Cccccccccccc--CccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhh--chhhhcccceeccCC Q lcl|NC_015266. 78 TDTT-KAERQPIDPTAL--DSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSAL--DRIMIGWNGVKAALS 152 (337) Q Consensus 78 t~t~-~~~R~~~~~~~l--~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~al--D~i~IGfNG~s~A~~ 152 (337) +.-. ..+-++.+.... +.....+.++ .=++..-+.||.|-.+=|+-..+...+-+..|- |+-.+ =...++|.. 
T Consensus 63 ~~~~~~~pG~~l~~~~~~~~k~~itID~l-l~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~-~~l~~aa~~ 140 (335) T protein:vir:78 63 VEAKGRRAGEELERSRVVNDKWNLTVDTL-LYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACL-IQVIKAAAM 140 (335) T ss_pred eeecccccCcccCCCCcccCCeEEEecce-eechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHH-HHHHhhccc Confidence 3321 111111111111 1112222111 123445678999999989888888888888777 55332 112222322 Q ss_pred CChhhhhhhhccchhHHHHHHhhchhhhcccccccCCcee-cCCCcccccHHHHHHHHHhcccChhHcCC---CCeEEEe Q lcl|NC_015266. 153 TDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVL-VGKGGDYVNLDALVMDIVSSMIDPWFQED---TGLVVIC 228 (337) Q Consensus 153 TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~-~G~ggdy~nLDalV~da~~~li~~~~r~~---~dLVviv 228 (337) ..|...|- ||. +|...+-.+. .....++..|-.++.++.+.| ++..-.+ .|.|++| T Consensus 141 ~a~~~~~~------~~~-------------~G~~~~~~~tg~~~~~~~~~l~~a~~~a~~~l-~ekdvP~~~~~~rv~vv 200 (335) T protein:vir:78 141 DAPVDLED------AFS-------------PGVLEKLDLTGLTAKEAAEKIVRMHRRVVETF-IERDLGDAVYSEGLTPM 200 (335) T ss_pred ccccccCC------CcC-------------CCcceeeeeccccccccHHHHHHHHHHHHHHH-HhccCCCCCCCccEEEe Confidence 23322111 111 0100000111 112347788888999988866 4543322 1589999 Q ss_pred ChHHHHH--HHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccc------------cEEEEecC- Q lcl|NC_015266. 229 GRELLHD--KYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLEN------------LSIYFQEG- 293 (337) Q Consensus 229 G~dLla~--k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~N------------LsIY~Q~g- 293 (337) ..+-... +.-+++|..-..+.......--....+.|.|++..|.||.+++--++|.| .-..++.. 
T Consensus 201 ~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~~v~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~A 280 (335) T protein:vir:78 201 SPRVFSLLLEHDKLMSVEYQATGATNDYVKSRVAILNGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKT 280 (335) T ss_pred ChHHHHHHhcccccccccccccccccccccceeEEeeceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecce Confidence 9752221 11224443211111100000001246789999999999988766556654 23334444 Q ss_pred ---------ceEEeEeeccccceecchhhhcccceeecCCcEEEee--ceeeccC Q lcl|NC_015266. 294 ---------ARRRSLIDNPKRDQIENYESSNDAYVVEDFGCGCVAE--NIELVAA 337 (337) Q Consensus 294 ---------s~RR~~~d~p~r~r~e~y~s~Ne~YvVEd~~~~a~iE--nI~~~~a 337 (337) +-+.......+=+-+-.|++ .|-.+=.+++++.|+ +|.-.+- T Consensus 281 l~t~~~~~~~~e~~~~~~~~~~~i~~~~a--~G~g~lRPe~a~~i~~tg~~~~~~ 333 (335) T protein:vir:78 281 LITAQVAPVQAKLWEDHDQFSWVLDTFQM--YNIGARRPDTAGAIELKGIEAFDI 333 (335) T ss_pred EEEEEEEecccceeeccchhhHhhhHHHH--cCCcccCcceEEEEEecCCCcccc Confidence 22222222223334444444 333445677888877 3322221 No 124 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=84.77 E-value=0.058 Score=27.37 Aligned_cols=279 Identities=11% Similarity=0.073 Sum_probs=135.1 Q ss_pred CChHHHHHHHHHHHHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcc------cccccc Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGL------SVSGPI 74 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~------gv~g~i 74 (337) |.+...-.|- . .+-=.|.|.+-+.+..-+. ..+|+.--..++ .--+.+.. 
|....+ T Consensus 1 ~~~~~~g~f~---~-----------~~l~~id~~v~e~~~~~l~-~r~l~~v~~~~~---~~~~~~~~~~~~~~G~~~~~ 62 (301) T protein:vir:80 1 MQGKITATIE---A-----------RDLQAIDNVIYEPKQEELT-ARSVFPQKFDVN---EGAESYSFDVMTRSGAAKII 62 (301) T ss_pred CCccccchhh---H-----------HHHHHHHHHHHHhhhhhhh-hhhhcccccCCC---CceEEEEEeeeccceeEEEe Confidence 2222111110 0 0000133444444433333 222322211111 11111111 111111 Q ss_pred ceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCc-cHHHHHHHHHHHHHhhchhhhcccceeccCCC Q lcl|NC_015266. 75 ASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFP-DFQQRIRNVILNQSALDRIMIGWNGVKAALST 153 (337) Q Consensus 75 agRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~-dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~T 153 (337) ....+. -|..-...+.....+.+.--+..+.|..|.+.+..+ +...+-.++.++..+..+=.+.|+|.+....+ T Consensus 63 ~~~~~d-----ip~~~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g~~ 137 (301) T protein:vir:80 63 ANGADD-----LPLVDVDMVRKSVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYAIK 137 (301) T ss_pred cCcccc-----cccccccceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccccce Confidence 111110 122222233444555666667788899999998875 67788888889999999999999995532211 Q ss_pred ChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccc--cHHHHHHHHHhcccCh-h----HcCCCCeEE Q lcl|NC_015266. 154 DKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYV--NLDALVMDIVSSMIDP-W----FQEDTGLVV 226 (337) Q Consensus 154 D~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~--nLDalV~da~~~li~~-~----~r~~~dLVv 226 (337) = ||-+=|. ..+..++. + .|...+++ +-|.++.|+.. ++.. | +...| ... 
T Consensus 138 G-----LlN~p~~------------~~~~~~~~--~---~~~~~~w~~~t~~ei~~di~~-~~~~l~~~s~g~~~p-~~L 193 (301) T protein:vir:80 138 G-----AFEATGI------------QIDVSPTT--G---VGNVSKWEKKTAEQIIDEIGE-AHTKITVLPGYGTAS-LKL 193 (301) T ss_pred e-----eecCCCc------------ccccccCc--c---cccccccccCCHHHHHHHHHH-HHHHHHHhcCceecc-cEE Confidence 1 1111110 11111111 0 11222222 23333333322 1111 1 22233 456 Q ss_pred EeChHHHHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCc-------eEEe-cccccEEEEecCceEEe Q lcl|NC_015266. 227 ICGRELLHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRA-------MMVT-KLENLSIYFQEGARRRS 298 (337) Q Consensus 227 ivG~dLla~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~-------ilvT-~l~NLsIY~Q~gs~RR~ 298 (337) +++.+....=..++++....-|+ .+.+ .+...++.++.+|.+...+ +++. .-+|+++-+-..-+ ++ T Consensus 194 ~L~p~~~~~L~~~~~~~~~~~tv---l~~l--~~~~~~~~I~~~p~L~~~g~~g~~~~v~~~~~~d~~~~~v~~~~~-~~ 267 (301) T protein:vir:80 194 CLPPKQFELINKKRYSNEDSRSV---LKVL--QDNAWFSAIVRVPDLAGMGTAGSDSFAVIHDSNETAELIIPMDIT-RH 267 (301) T ss_pred EecHHHHHhhhhccccCCCCeeH---HHHH--HHHcCcceEEEcceeccCCCCcccEEEEEecCCcEEEEEecCcee-ee Confidence 67776655433334433222233 3443 4567889999999998754 3333 47888887765433 33 Q ss_pred EeeccccceecchhhhcccceeecCCcEEEeece Q lcl|NC_015266. 
299 LIDNPKRDQIENYESSNDAYVVEDFGCGCVAENI 332 (337) Q Consensus 299 ~~d~p~r~r~e~y~s~Ne~YvVEd~~~~a~iEnI 332 (337) -..-....-.+.|..+--|-+|=.+.++|.+++| T Consensus 268 ~~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 268 PEEYSFPRTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred cceecCceeEeeeeeeeEEEEEEccceEEEEecC Confidence 3322223455677777778888889999999999 No 125 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=81.60 E-value=0.084 Score=26.48 Aligned_cols=288 Identities=13% Similarity=0.079 Sum_probs=137.9 Q ss_pred CChHHHHHHHHHHHHHHHhcCccccc--ceeeec--HHHHHHHHHHHHhhhhhhcccccccchhhhhh---------hhc Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLNDTDDVS--QKFAVE--PSVQQTLETKMQESSAFLKSINILPVTELEGE---------KLG 67 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~--~~Fsv~--P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge---------~v~ 67 (337) |+=+. -+..-..++ ...++.++. --|.++ -.+.+++.+.....-...+. ++|+..-+. .-. T Consensus 3 ~~~~~--~~~~~~~~~-~~~~~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~---i~v~~~~~~~~et~~~~~~e~ 76 (314) T protein:vir:10 3 IKFDA--EQAKITTHL-EQMGVEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNI---FPVTNEIPGHAKYFEYPEFDG 76 (314) T ss_pred cchHH--HHHHHHHHH-HhhcccchhhhHHHHHHHHHHHHHHHhhhhcccccccee---eccccCCCCceeEEEeeeecc Confidence 55442 222222222 233332222 234332 12222333322223223333 333322111 111 Q ss_pred cccccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCc-cHHHHHHHHHHHHHhhchhhhcccc Q lcl|NC_015266. 
68 LSVSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFP-DFQQRIRNVILNQSALDRIMIGWNG 146 (337) Q Consensus 68 ~gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~-dF~~~~~~~i~~~~alD~i~IGfNG 146 (337) .|....+.+.+++- |..-.+.+...-.....-.+..+++..|.+.+..+ +...+-..+..+..+..+-.|+|+| T Consensus 77 ~G~a~~~~d~~~di-----p~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G 151 (314) T protein:vir:10 77 VGIAQIIADYSDDL-----PLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLVWSG 151 (314) T ss_pred ccceeeeCCccccc-----ceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEee Confidence 22222222222211 11111122222233334445556668888887765 6788888888889999999999999 Q ss_pred eeccCCCChhhhhhhh--ccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCe Q lcl|NC_015266. 147 VKAALSTDKAANPLLQ--DVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGL 224 (337) Q Consensus 147 ~s~A~~TD~~~nPllq--DVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dL 224 (337) .+.-..+=.-.+|.+. -...+| ..++++ |.-+++++..+... .-+...|+ T Consensus 152 ~~~~g~~GLlN~p~v~~~~~~~~W------aT~~ei------------------~~Di~~~~~~l~~~---s~g~~~p~- 203 (314) T protein:vir:10 152 SAPHGIVSVFDQPNINNVVATPNW------SVPQNA------------------IDDVTAMIDAVESS---TQGLHHVT- 203 (314) T ss_pred cccccceeEeecCCCccccCCCCc------ccHHHH------------------HHHHHHHHHHHHHh---cCccccce- Confidence 5432221111222110 011122 011111 33444444443321 01223444 Q ss_pred EEEeChHHHHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCc--------eEEecccccEEEEecCceE Q lcl|NC_015266. 225 VVICGRELLHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRA--------MMVTKLENLSIYFQEGARR 296 (337) Q Consensus 225 VvivG~dLla~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~--------ilvT~l~NLsIY~Q~gs~R 296 (337) .+++..+. 
+.++......+-.-..+.+ .+..=+|.+..+|.+-..+ +..+.-+|+++=+-..-+ T Consensus 204 ~l~Lpp~~-----~~~L~~~~~~~~~tvl~~l--~~n~~~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~- 275 (314) T protein:vir:10 204 DILLPASA-----RRVMQGLVPQTNLSYGELF--TRNNPGLTIRFLQFLDNYDGAGGKAALAFEKSPLNMSIEIPEVTN- 275 (314) T ss_pred eEEecHHH-----HHhhcccccCCCccHHHHH--HHhCCCcEEEEcccccccCCCcceEEEEEecCCcEEEEecCccce- Confidence 33344443 3345543333444455655 4456689999999998655 223666777765544333 Q ss_pred EeEeeccccceecchhhhcccceeecCCcEEEeeceeec Q lcl|NC_015266. 297 RSLIDNPKRDQIENYESSNDAYVVEDFGCGCVAENIELV 335 (337) Q Consensus 297 R~~~d~p~r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~ 335 (337) +.-..-....-.+.|..+--|-+|=.+.++|.+++|+|+ T Consensus 276 ~l~~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~dGI~~~ 314 (314) T protein:vir:10 276 VLPAQPKDLHFRYPVTSKATGLIVYRPLTMAVIKGITFA 314 (314) T ss_pred eecceecCceEEEcceeeeEEEEEECcceeEeeeeeecC Confidence 333333335556667777777888889999999999999 No 126 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=76.17 E-value=0.052 Score=27.62 Aligned_cols=288 Identities=11% Similarity=0.029 Sum_probs=123.5 Q ss_pred CChHHHHHHHHHHHHHHHh----cCcc--cccceeeec-HHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccc Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKL----NDTD--DVSQKFAVE-PSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGP 73 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~----ngv~--~~~~~Fsv~-P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~ 73 (337) |. +++.+..- +|.. +.....++- ..-.-.+..+.+.+|-|+.++++-.++ .|..+.+-.-|. 
T Consensus 1 ~~---------~~~~~~~~~~~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~r~i~--~G~tv~i~~ig~ 69 (332) T protein:vir:78 1 MT---------TLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLR--GGKSKQFMFTGK 69 (332) T ss_pred Cc---------ccccccCCccccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhcccccccc--ccceEEEEeccc Confidence 22 22222111 1211 111111110 122334566777888899888887665 477776665554 Q ss_pred cceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchh--hhcccceeccC Q lcl|NC_015266. 74 IASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRI--MIGWNGVKAAL 151 (337) Q Consensus 74 iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i--~IGfNG~s~A~ 151 (337) +.-..-+.+.+-.+...+..+.....+-+.-+..+ .=+.||.|....|+...+.+......|...= .++ --+.+| T Consensus 70 ~~~~~~~~g~~l~~~~~~~~~~~~l~ID~~ky~~~-~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~-~l~~aa- 146 (332) T protein:vir:78 70 LSAGYHTPGTPIVGDAGIKANEKTLVMDDLLVSSQ-FVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIAR-VLAKAS- 146 (332) T ss_pred eeEeeecCCCCCCCCCCCCCceEEEEEehhhhhHH-HHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHH-HHHhhh- Confidence 43332222221111111112222333333333322 2256999999989888887777666654321 111 001111 Q ss_pred CCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcc--cccHHHHHHHHHhcccChhHcCCCCeEEEeC Q lcl|NC_015266. 152 STDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGD--YVNLDALVMDIVSSMIDPWFQEDTGLVVICG 229 (337) Q Consensus 152 ~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggd--y~nLDalV~da~~~li~~~~r~~~dLVvivG 229 (337) .+. +|..-...+ -.+.++.++. =.++-..+.++.. 
.|++..-...+.+++|+ T Consensus 147 ~~~---------------------~~~~~~~g~----~~~~~~~~~~~~~~~~~~~i~~a~~-~Lde~~VP~~gR~~vv~ 200 (332) T protein:vir:78 147 AEA---------------------SPVTGEPGG----FHVNIGAGNTNDAQAIVDGFFEAAA-VLDERSAPQEGRVAVLS 200 (332) T ss_pred ccc---------------------Ccccccccc----cccccCCccccCHHHHHHHHHHHHH-HHhhcCCCccCCEEEeC Confidence 111 000000000 1122222221 1345555677776 45888777778899999 Q ss_pred hHHHHH----HHHHHHhccCChhH-HHHHHHHHhhhhhcCceeEECCccCCCceEEeccccc---------------EEE Q lcl|NC_015266. 230 RELLHD----KYFPIVNTTQAPTE-QLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENL---------------SIY 289 (337) Q Consensus 230 ~dLla~----k~~~l~n~~~~ptE-~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NL---------------sIY 289 (337) +..... +-..++|..-..+. .+.... ...++.|.+++..|.+|..+.--....+. .+- T Consensus 201 P~~y~~Ll~~~d~~~~n~~~~~~~~~~~~g~--~i~~i~G~~V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~ 278 (332) T protein:vir:78 201 PRQYYSLISSVDTNILNREIGNSQGDMNSGK--GLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLI 278 (332) T ss_pred HHHHHHHHhhcCceeeeeeccccccceecce--eeeEEeeeEEEecCccccCcccccccccccccccccccccccceEEe Confidence 853331 11122232111111 111111 13578899999999999765433322221 122 Q ss_pred EecCc-------------eEEeEeeccccceecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 290 FQEGA-------------RRRSLIDNPKRDQIENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 290 ~Q~gs-------------~RR~~~d~p~r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) ++... .|+...++.+.+.|-..+. 
.|.-|=.+++++ +|..| T Consensus 279 ~h~~a~~~v~~~~~~~~~t~~~~~~~~~~d~i~~~~~--~G~~v~rPe~~v-----~l~~a 332 (332) T protein:vir:78 279 FHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLA--MGCGSLRTSVAG-----SFQAA 332 (332) T ss_pred ecccceeeeeeeccchhhhhcccchhhhHhhhhhhhh--hcCceecccceE-----EEeeC Confidence 22221 1122222222233332222 222223333333 33334 No 127 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=50.21 E-value=0.62 Score=21.74 Aligned_cols=295 Identities=12% Similarity=0.056 Sum_probs=133.7 Q ss_pred CChHHHHHHHHHH----HHHHHhcCcc-cc--cceee------ecHHHHHHHHHHHHhhhhhhcccccccchhhh---hh Q lcl|NC_015266. 1 MKKETRQAYRKYA----AQIAKLNDTD-DV--SQKFA------VEPSVQQTLETKMQESSAFLKSINILPVTELE---GE 64 (337) Q Consensus 1 M~~~tr~~~~~y~----~~~a~~ngv~-~~--~~~Fs------v~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~---Ge 64 (337) |++.-+ ++++- ...++.-+.. ++ .--|. |+|.+-++....+. -..|+.--+..+--.++ +- T Consensus 6 ~~~~~~--~d~~~~~~~a~~~~~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~-~~~~i~i~~~~~~~~~~~t~~~ 82 (329) T protein:vir:79 6 MSKEMK--YDEFEANVIANHMQLRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGS-ALRVFPVTSELSDTDKTFEYQT 82 (329) T ss_pred hhhhhc--cchhhhhhHhhhcccccceeccchhhHHHHHHHHHHHHHHHhhhhcccc-hhhhcccccCCCCceeEEEeee Confidence 333222 22222 2222222221 11 11222 22333322222222 12232222211110000 00 Q ss_pred hhccccccccceeccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCc-cHHHHHHHHHHHHHhhchhhhc Q lcl|NC_015266. 
65 KLGLSVSGPIASRTDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFP-DFQQRIRNVILNQSALDRIMIG 143 (337) Q Consensus 65 ~v~~gv~g~iagRt~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~-dF~~~~~~~i~~~~alD~i~IG 143 (337) .-..|....+.+..+.- |..-.+.+...-.....--+..++|..|.+.+..+ +...+-+.+..+..+..+-.|+ T Consensus 83 ~~~~G~a~~~~d~~~di-----p~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~ 157 (329) T protein:vir:79 83 FDKVGHAKIIADYTDDL-----STVDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVNHLV 157 (329) T ss_pred eecceeeeeecCccccc-----ceeecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEE Confidence 00112222222222110 11111122222334444445567788888888765 6888888889999999999999 Q ss_pred ccceeccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccCh----hHc Q lcl|NC_015266. 144 WNGVKAALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDP----WFQ 219 (337) Q Consensus 144 fNG~s~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~----~~r 219 (337) |+|.+....+=.-.+|.++-+.. ++ +. ++.--.++-|.++.|+..-+..- .+. T Consensus 158 f~G~~~~g~~GLlN~p~v~~~~~-----------------~~---~~---~~~w~~kt~~ei~~di~~~~~~l~~~s~g~ 214 (329) T protein:vir:79 158 FKGSKPHKIISVFEHPNLTTINS-----------------AG---WN---NAAGTGKKPETAQDELEQAIEKIETLTNGQ 214 (329) T ss_pred EeecccccceeeecCCCcccccc-----------------CC---CC---CccccccCHHHHHHHHHHHHHHHHHhcCce Confidence 99954222211112222111110 00 00 00111133344444433211111 123 Q ss_pred CCCCeEEEeChHHHHHHHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCC------ceEE--ecccccEEEEe Q lcl|NC_015266. 220 EDTGLVVICGRELLHDKYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKR------AMMV--TKLENLSIYFQ 291 (337) Q Consensus 220 ~~~dLVvivG~dLla~k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~------~ilv--T~l~NLsIY~Q 291 (337) ..|+ .+++..++.. +++.....+-....+.+ .+..-++..+.+|.+=.. 
.+++ +.-+|+++=+- T Consensus 215 ~~p~-~L~Lpp~~~~-----~L~~~~~~~~~tvl~~l--k~~~~~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp 286 (329) T protein:vir:79 215 HRAN-MILIPPSMRK-----VLMVRMPETTMSYLDYF--KQQNGGITIESISELEDIDGAGTKAALVYEKDPMNMSIEIP 286 (329) T ss_pred eccc-EEEecHHHHH-----HhhcccCCCCccHHHHH--HHhCCCcEEEEcccccccCCCCceEEEEEecCCceEEEecC Confidence 3443 4555665443 33322222334445654 345667888999998542 2233 67777777654 Q ss_pred cCceEEeEeeccccceecchhhhcccceeecCCcEEEeeceeec Q lcl|NC_015266. 292 EGARRRSLIDNPKRDQIENYESSNDAYVVEDFGCGCVAENIELV 335 (337) Q Consensus 292 ~gs~RR~~~d~p~r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~ 335 (337) ..-+ +.-..-....-.+.|..+--|-+|=-+.++|.+++|.++ T Consensus 287 ~~~~-~l~~q~~~~~~~v~~~~r~~Gv~i~~P~ai~~~dGI~~~ 329 (329) T protein:vir:79 287 EAFN-MLTAQPKDLHFKVPCTSKCTGLTIYRPLTLVLIKGLVVG 329 (329) T ss_pred ccee-eeeceecCceEEEceeeeEEEEEEECcceeeeeeeeeeC Confidence 4333 333333334556677777777888889999999999999 No 128 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=38.01 E-value=1.1 Score=20.38 Aligned_cols=293 Identities=10% Similarity=0.035 Sum_probs=124.5 Q ss_pred CChHHHHHHHHHH-HHHHHhcCcccc----cceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccc Q lcl|NC_015266. 
1 MKKETRQAYRKYA-AQIAKLNDTDDV----SQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIA 75 (337) Q Consensus 1 M~~~tr~~~~~y~-~~~a~~ngv~~~----~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~ia 75 (337) |-+..- ..++ .+.+..+...++ -+.| .-.+..+.+.+|-|+.+.++-.++ .|..+.+-.-|..+ T Consensus 1 ma~~~~---~~~~~t~~~~~~~~~~~~a~~ie~f------~g~V~~~f~~~s~~~~~~~~~~~~--~G~sv~i~~ig~~t 69 (347) T protein:vir:15 1 MANIQG---GQQIGTNQGKGQSAADKLALFLKVF------GGEVLTAFARTSVTMPRHMLRSIA--SGKSAQFPVIGRTK 69 (347) T ss_pred CCcccc---CCccccccccCCCcchHHHHHHHHH------HHHHHHHHHHhhhhhhcccccccc--ccceeEeeecccee Confidence 221110 0000 111111111110 0122 234456677888899888887655 48888877776666 Q ss_pred eeccCCCccccccccccc--CccceeeEeecccc-ccCHHHHHHHhcCccHHHHHHHHHHHHHhh--chhhhc----ccc Q lcl|NC_015266. 76 SRTDTTKAERQPIDPTAL--DSNRYRCEKTDYDT-AITYRKLDAWAKFPDFQQRIRNVILNQSAL--DRIMIG----WNG 146 (337) Q Consensus 76 gRt~t~~~~R~~~~~~~l--~~~~Y~c~qtn~d~-~i~y~~LD~wA~~~dF~~~~~~~i~~~~al--D~i~IG----fNG 146 (337) ...-+.+.+- +.++..+ +...+.+-+.-+.. .| +.+|.|...-|+...+.+......|. |.-.++ -.. T Consensus 70 ~~~~~~g~~l-~~~~~~~~~~e~~ltID~~~~~~~~V--ddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~ 146 (347) T protein:vir:15 70 AAYLKPGENL-DDKRKDIKHTEKVIHIDGLLTADVLI--YDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVN 146 (347) T ss_pred eeeeccCCCC-CCCCCCCccceEEEEechhhhhhHHh--hhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 5443332211 1112222 22334444443432 33 68999999989988888777777665 332221 100 Q ss_pred eeccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcc-------cccHHHHHHHHHhcccChhHc Q lcl|NC_015266. 147 VKAALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGD-------YVNLDALVMDIVSSMIDPWFQ 219 (337) Q Consensus 147 ~s~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggd-------y~nLDalV~da~~~li~~~~r 219 (337) .+.+.. .+...| ||- .+... ..+.+|+ +.++=.++.++.. 
.|++..- T Consensus 147 ~~~~~~-~~~~~~-------g~~----------~~~~~-------~~~~~~~~~~~~~~~~~i~d~~~~a~~-~Lde~~V 200 (347) T protein:vir:15 147 LPDASN-ENIEGL-------GKP----------TVLTL-------VKPTTGDLTDPVELGKAIIAQLTIARA-SLTKNYV 200 (347) T ss_pred cccccc-cccccc-------Ccc----------ccccc-------cccccccchhhhhHHHHHHHHHHHHHH-HHhhcCC Confidence 000000 000000 000 00000 0112222 3343333444444 3466666 Q ss_pred CCCCeEEEeChHHHHH--HHHHHHhccCChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEec--------------- Q lcl|NC_015266. 220 EDTGLVVICGRELLHD--KYFPIVNTTQAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTK--------------- 282 (337) Q Consensus 220 ~~~dLVvivG~dLla~--k~~~l~n~~~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~--------------- 282 (337) ...+.+++|+++.... +.-.+.+....-++.+.-.. ..++.|.+++..+.+|..+.--+. T Consensus 201 P~~gR~~vv~P~~y~~LL~~~~~~~~d~~~~~~~~~G~---Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~ 277 (347) T protein:vir:15 201 PAADRTFYTTPDNYSAILAALMPNAANYQALIDHERGT---IRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATS 277 (347) T ss_pred CccCCEEEeCHHHHHHHhcccccccccccccccccceE---EEEEeceEEEecccccccccccccccccccccccccccc Confidence 6667899999754332 01112222211122221111 246889999999999975432111 Q ss_pred -------ccccE-EEEecCce-EEeEee-ccccceecchhhhc------ccceeecCCcEEEeeceeecc Q lcl|NC_015266. 283 -------LENLS-IYFQEGAR-RRSLID-NPKRDQIENYESSN------DAYVVEDFGCGCVAENIELVA 336 (337) Q Consensus 283 -------l~NLs-IY~Q~gs~-RR~~~d-~p~r~r~e~y~s~N------e~YvVEd~~~~a~iEnI~~~~ 336 (337) +++.. +-+|+... --...+ .-++.|-+.|+... 
-|.-|=++++++.|+==++.| T Consensus 278 ~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~~~~~~~d~i~~~~~~G~~vlrP~~av~~~~~~~~~ 347 (347) T protein:vir:15 278 STTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRANYQADQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred cceeeeccccceeeeeccceeeeeEeeceeeeecccchhhhhhhehhhhcCCceeccccEEEEecCCCCC Confidence 11111 11122111 000111 22222333332210 133344556666665333333 No 129 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=34.67 E-value=1.3 Score=20.00 Aligned_cols=291 Identities=11% Similarity=0.051 Sum_probs=126.0 Q ss_pred CChHHHHHHHHHHHHHHHhcCccccc-ceeeec-HHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceec Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLNDTDDVS-QKFAVE-PSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRT 78 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~-~~Fsv~-P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt 78 (337) |-|.+ +..+++...|...++ ..-++- ..-.-.+..+.+.+|-|+.++++-.++ .|..+.+-.-|...... T Consensus 1 ~a~~~------~~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~--~G~sv~~~~iG~~~~~~ 72 (347) T protein:vir:88 1 MANAT------GGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQ--NGKSASFPVMGRTKGYY 72 (347) T ss_pred CCCcc------cchhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhcccccccc--CcceEEEeeecceeeee Confidence 66555 455555555553221 111111 112234455677888899999887654 48877777666655433 Q ss_pred cCCCcccc-cccccccCccceee--EeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhh--chhhhcc--cceeccC Q lcl|NC_015266. 79 DTTKAERQ-PIDPTALDSNRYRC--EKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSAL--DRIMIGW--NGVKAAL 151 (337) Q Consensus 79 ~t~~~~R~-~~~~~~l~~~~Y~c--~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~al--D~i~IGf--NG~s~A~ 151 (337) -+.+.+.. |. ..++..+-.| -+.-+. ...=+.+|.|...-|+...+.+...+..|. 
|...++= .+...++ T Consensus 73 ~~~g~~l~~~~--~~~~~~~~~i~ID~~~y~-~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~ 149 (347) T protein:vir:88 73 LAPGENLDDKR--KDIKHSEKVIQIDGLLTS-DVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPA 149 (347) T ss_pred eccccCCCCCC--CCCccceEEEEEechhhh-hhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Confidence 22222211 11 1233333333 222221 233358999999988887777777665443 3322211 1111122 Q ss_pred CCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCccc----c---c-HHHHHHHHHhcccChhHcCCCC Q lcl|NC_015266. 152 STDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDY----V---N-LDALVMDIVSSMIDPWFQEDTG 223 (337) Q Consensus 152 ~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy----~---n-LDalV~da~~~li~~~~r~~~d 223 (337) .+++... ||-. .-.+.+|.+++- . . .|+ +.++.. .+++..-...+ T Consensus 150 ~~~~~~~--------g~~~-----------------~~~~~~~~~~~~~~~~~~~~~~~~~-i~~a~~-~Lde~~VP~~g 202 (347) T protein:vir:88 150 ASNENIA--------GLGQ-----------------AVVLNIGAAADLVDVEARGKAILKG-LTLARA-RLTKNYVPAGD 202 (347) T ss_pred ccccccC--------Cccc-----------------cccccccccccccchhhhHHHHHHH-HHHHHH-HHhhcCCCCCC Confidence 2222111 1100 001112322221 1 1 233 333444 35776666668 Q ss_pred eEEEeChHHHHHH-HHHHHhccCChhHHHHHHHHHh-hhhhcCceeEECCccCCCceE---------Eecccc-----c- Q lcl|NC_015266. 224 LVVICGRELLHDK-YFPIVNTTQAPTEQLAADLIVS-QKRIGNLPAVRVPFFPKRAMM---------VTKLEN-----L- 286 (337) Q Consensus 224 LVvivG~dLla~k-~~~l~n~~~~ptE~~A~~~~~~-~k~igGl~a~~vPffP~~~il---------vT~l~N-----L- 286 (337) .+++|+++-..+- .-+..+..+..+.. ++... ..++.|.+++..|.+|-...- +|.... 
+ T Consensus 203 R~~vv~P~~y~~Ll~~~~~~~~~~~~~~---~~~~G~vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~ 279 (347) T protein:vir:88 203 RRFYCAPEDYSAILSALMPNAANYAALI---DPETGNIRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATAT 279 (347) T ss_pred CEEEeCHHHHHHHhcchhhhhhhhcccc---chhcceeeeeccceEEEeecccccccccccccccccccccccccccccc Confidence 8999988633210 01112222222222 11111 134569999999999953332 222111 1 Q ss_pred -----------EEEEecCc----------eEEeEeeccccceecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 287 -----------SIYFQEGA----------RRRSLIDNPKRDQIENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 287 -----------sIY~Q~gs----------~RR~~~d~p~r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) .++++... .......+.+-+.|--++. .|.-|=.+++++.| ++..| T Consensus 280 ~~~~~d~~~~~~l~~~~~a~g~v~~~d~~~e~~r~~~~~~d~i~~~~~--~G~~~~rPe~a~~~---~~~~a 346 (347) T protein:vir:88 280 GDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQADQIIGKYA--MGHGGLRPEAAGAL---VFTPA 346 (347) T ss_pred cccccccCcEEEEEechhhhhheecccceeeeeechhhHHHHhhhhhh--hcCceeccceEEEE---EeCCC Confidence 12333221 1111111111222222222 22333344444443 33333 No 130 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=33.86 E-value=1.3 Score=19.91 Aligned_cols=285 Identities=12% Similarity=0.098 Sum_probs=119.0 Q ss_pred CChHHHHHHHHHHHHHHHhcC--c--ccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccce Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLND--T--DDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIAS 76 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ng--v--~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iag 76 (337) -.+.-|-.|+....+.=++-- + .+ +.+. .....++.+.+.+.+.|++|..+....|. |-.... 
- T Consensus 5 ~~~~~~~~~~~~~~~~p~l~m~alTLae-a~~l-~~d~~~~~VIE~l~~~s~iL~~lpf~~ve---~~~~~~-------~ 72 (330) T protein:vir:94 5 CTPPLRGRWRTLTHQFPELKMPTVTLAE-SAKL-SQDHLVSGLIETIVEVNPLYEMMPFTEIE---GNALAY-------N 72 (330) T ss_pred cCCccccceeehhccccccchhhhhhhH-Hhhc-CchhhHHHHHHhhhccchHHhhccccccc---CCccee-------e Confidence 112222222222211111000 0 01 1121 24567788888999999999887654443 221111 1 Q ss_pred eccC--CCcccc------cccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhccccee Q lcl|NC_015266. 77 RTDT--TKAERQ------PIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVK 148 (337) Q Consensus 77 Rt~t--~~~~R~------~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s 148 (337) |+.+ +..-|. |..+.+.......|.-..-+..+.=...|..-+--|+...-....++.++..+.--=+||.+ T Consensus 73 r~~~lp~a~~r~~n~~~~~~~~~Tf~q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~~e~~linGDs 152 (330) T protein:vir:94 73 RENVLGDVQFLAVGGTITAKNPATFTKVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQYQASMITGDG 152 (330) T ss_pred eeecCCcceeeeccccccccCcceeeeeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhhccCC Confidence 2222 101111 00111111222223222222222222233232223566555666666666666666678854 Q ss_pred ccCCCChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCC-CCeEEE Q lcl|NC_015266. 149 AALSTDKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQED-TGLVVI 227 (337) Q Consensus 149 ~A~~TD~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~-~dLVvi 227 (337) . +.. + .|=+. .-+++++++- |..|-.-++|. 
+..||+..+..+ ..-+++ T Consensus 153 ~-----~~~-F------~GL~~---~~~~~q~i~t----------g~~gg~~T~d~-----LDeLl~~v~~~~g~~~~~l 202 (330) T protein:vir:94 153 T-----GNS-F------QGMMG---LVAASQTISA----------GANGGTLTFEL-----LDQLLDLVKDKDGQVDYLM 202 (330) T ss_pred C-----Ccc-c------cchhh---cCCcccEEec----------CCCCCCCCHHH-----HHHHHHHhcCCCCCCcEEE Confidence 2 100 1 12222 2234455432 22222233332 233344433322 123666 Q ss_pred eChHHHHHHHHHHHhcc----CC-hhHHHHHHHHHhhhhhcCceeEECCccCCCceEEecccccEEEEec---------- Q lcl|NC_015266. 228 CGRELLHDKYFPIVNTT----QA-PTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQE---------- 292 (337) Q Consensus 228 vG~dLla~k~~~l~n~~----~~-ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~---------- 292 (337) +.+-... +...+.-+. -. +++-.-+.. .-+++|.|++..-+.|.+.--.|.-.==|||.=+ T Consensus 203 ~n~a~~r-~I~a~~R~~~~~~v~~~~~~~~G~~---v~~~~GvPi~~~d~ip~~~~~~~~~~ttsIyav~~G~~~~~qgV 278 (330) T protein:vir:94 203 SSFAMRR-KYFSLLRALGGAAIGEVMTLPSGRQ---IPTYRGVPWFVNDFIPSNMTQGTATNATAIFAGTFDDGSNKYGI 278 (330) T ss_pred echhHHH-HHHHHHHhccCCCCCCcccccCCCE---EeeeCCeEEEecccccCCCCcccCCCceeEEEEeecccccccce Confidence 6665333 222222211 11 222221211 1267888888777777753211111112444322 Q ss_pred ------C--ceE-Ee---Eeec-cccceecchhhhcccceeecCCcEEEeeceeec Q lcl|NC_015266. 293 ------G--ARR-RS---LIDN-PKRDQIENYESSNDAYVVEDFGCGCVAENIELV 335 (337) Q Consensus 293 ------g--s~R-R~---~~d~-p~r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~ 335 (337) | +.. |. +.+. 
-.|=+|+-|.| -+|-...+++.+|||+++ T Consensus 279 ~Gl~~~g~~glsVr~~G~~~~k~v~~~~v~~y~~----~av~~~~a~~~L~~V~~g 330 (330) T protein:vir:94 279 AGLTARGSAGLRVQNVGAKENADETITRVKMYCG----FANFSQLGLAAIKGLIPG 330 (330) T ss_pred EeecCCCCCcceeeeCCCccccceeeEEEEEeee----eEEechhheeeeccccCC Confidence 0 111 11 1111 13346666654 578888999999999999 No 131 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=25.61 E-value=2 Score=18.89 Aligned_cols=295 Identities=13% Similarity=0.070 Sum_probs=128.6 Q ss_pred CChHHHHHHHHHHHHHHHhcCcccccceeeec-HHHHHHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceecc Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLNDTDDVSQKFAVE-PSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTD 79 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~-P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~ 79 (337) |.+-.-..+ .+. . ++.+...-++- ..-.-....+.++++-|+++.++-.++. |..+-+-.-|......- T Consensus 1 m~~~~~~~~----t~~--~--~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~--G~s~~~~~iG~~~~~~~ 70 (334) T protein:vir:80 1 MTYPAANTH----TRP--G--WGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSLRG--TNQLRVDRVGASTIAGR 70 (334) T ss_pred CCCCcCCCc----ccc--c--cccccchheehhhhhhhHHHHHHHHhhhhhccceeeeccc--cceEEEeeecceeeeee Confidence 333211000 000 0 11111111111 2222344667888898998887765532 66666655444443322 Q ss_pred CCCcccccccccccCccceeeEeec-cccccCHHHHHHHhcCccHHHHHHHHHHHHHhh--chhhhcccceeccCCCChh Q lcl|NC_015266. 80 TTKAERQPIDPTALDSNRYRCEKTD-YDTAITYRKLDAWAKFPDFQQRIRNVILNQSAL--DRIMIGWNGVKAALSTDKA 156 (337) Q Consensus 80 t~~~~R~~~~~~~l~~~~Y~c~qtn-~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~al--D~i~IGfNG~s~A~~TD~~ 156 (337) +... +-+...+...+-.|.=-+ .=++..-+.||.|-.+-||...+.+..-+..|- |+-.+. ...++|...-|. 
T Consensus 71 ~~g~---~l~~~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~-~l~kaa~~~~~~ 146 (334) T protein:vir:80 71 KAGE---ELVVQKNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACII-QLQKCGDFLAPA 146 (334) T ss_pred cCCC---CCCCCCcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHH-HHHHhhhhcccc Confidence 2211 111112222233332222 223455578999999999999999999888887 753321 111122222221 Q ss_pred hhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCccc-ccHHHH---HHHHHhcccChhHcCC---CCeEEEeC Q lcl|NC_015266. 157 ANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDY-VNLDAL---VMDIVSSMIDPWFQED---TGLVVICG 229 (337) Q Consensus 157 ~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy-~nLDal---V~da~~~li~~~~r~~---~dLVvivG 229 (337) .++. +|. +|....-. ..|...+. .+-|+| ..+|.+. +++.--.+ .+.|++|+ T Consensus 147 ~~~~------~~~-------------~G~~~~~~-~~g~~~~~~~~~~~l~~a~~~a~~~-L~e~dvp~~~~~~R~~vv~ 205 (334) T protein:vir:80 147 HLKP------AFH-------------DGILLPST-ISGLAADAAADADVLVAAHRQGVEA-MVFRDLGDQLMSEGVTLLD 205 (334) T ss_pred cccc------ccc-------------CCcceeec-ccccccchhhhHHHHHHHHHHHHHH-HHhcCCCCCcCCceEEEeC Confidence 1110 000 01000000 01222222 223333 3466653 46654442 36899999 Q ss_pred hH----HHHHHHHHHHhcc--CChhHHHHHHHHHhhhhhcCceeEECCccCCCceEEeccc---c--------c-EEEEe Q lcl|NC_015266. 230 RE----LLHDKYFPIVNTT--QAPTEQLAADLIVSQKRIGNLPAVRVPFFPKRAMMVTKLE---N--------L-SIYFQ 291 (337) Q Consensus 230 ~d----Lla~k~~~l~n~~--~~ptE~~A~~~~~~~k~igGl~a~~vPffP~~~ilvT~l~---N--------L-sIY~Q 291 (337) .. |+.++ +++|.. ...+-..-+. ....++.|.+++..+.||...+--..+- | . -.++| T Consensus 206 P~~y~~Ll~~~--r~~n~d~~~s~~~~~~~~--g~i~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~ 281 (334) T protein:vir:80 206 PVIFSFLLEHD--RLMNVEFGAKEGGNSFVG--GRIAMLNGVRVVETPRFPQSAITANALGADFNVTDAEVRRKMITFIP 281 (334) T ss_pred hHHHHHHhccc--ccccceeccccccccccc--eeEEEEeceEEEeecCCCCccccccccccccccccccccceEEEEEe Confidence 75 33332 245542 1111110011 1134788999999999997754332211 1 1 12333 Q ss_pred cCce----------EEeEeeccccceecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 
292 EGAR----------RRSLIDNPKRDQIENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 292 ~gs~----------RR~~~d~p~r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) .... +........-+.+-.|++ .|--+=.+++++.+| +++.+- T Consensus 282 ~~Al~t~~~~~~~~e~~~~~~~~~d~i~~~~a--~G~g~lRPeaa~vv~-~~~~~~ 334 (334) T protein:vir:80 282 SMALISAQVHPVSAQFWEEKKDFGHYLDTFQS--YNIGQRRPDAVAVHD-ITVTNP 334 (334) T ss_pred CceEEEEEEeecceeeeechhhHHHHHHHHHH--cCCceeccceEEEEE-EeeecC Confidence 3321 111111122233333433 233445677777777 222222 No 132 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=21.45 E-value=2.6 Score=18.31 Aligned_cols=298 Identities=12% Similarity=0.086 Sum_probs=128.1 Q ss_pred CChH---HHHHHHHHHHHHHHhcCcccccceeeecHHHHHHHHHHHHhhhhhhcccccccchhhhhhhhcccccccccee Q lcl|NC_015266. 1 MKKE---TRQAYRKYAAQIAKLNDTDDVSQKFAVEPSVQQTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASR 77 (337) Q Consensus 1 M~~~---tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagR 77 (337) |++- ||.-+ +..+ ++++ -| + ..-.-...++.+.++-|+.+.++-.++. |..+-+-..|..... T Consensus 1 ms~~~~~tr~~~-------~~s~--~d~a-l~-l-e~f~geV~~af~~~s~~~~~~~~rti~~--g~s~~~~~iG~~~~~ 66 (335) T protein:vir:63 1 MSFLNDLTRPNY-------AGKN--ADVD-IH-L-EEHLGIVDKHFAYTSKFAPLMNIRDLRG--SNVVRLDRLGNVEAK 66 (335) T ss_pred CCCcccchhhhc-------cccc--chhh-ee-h-hhhhhhHHHHHHhhhhhccccceeeecc--ceeEEEeeeeeeeee Confidence 5543 23221 1111 1222 11 1 1123345667788899998887765522 333333222222111 Q ss_pred ccCCCcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcc-cceeccCCCChh Q lcl|NC_015266. 78 TDTTKAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGW-NGVKAALSTDKA 156 (337) Q Consensus 78 t~t~~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGf-NG~s~A~~TD~~ 156 (337) --+...+-....+. .+.....+.. -.=++..-+.||.|-.+-|+-..+...+-+..|...=.-=| --.++|..+.|. 
T Consensus 67 ~~~pG~~l~~~~~~-~~k~~itVD~-ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~ 144 (335) T protein:vir:63 67 GRRAGEELERSRVV-NDKWNLTVDT-LLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPV 144 (335) T ss_pred cccCCcCcCCCCcc-ccceEEEecc-eeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCcc Confidence 11111111100010 1111111111 11234556789999999888888877776665543211000 001112222222 Q ss_pred hhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCC---CCeEEEeChHHH Q lcl|NC_015266. 157 ANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQED---TGLVVICGRELL 233 (337) Q Consensus 157 ~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~---~dLVvivG~dLl 233 (337) ..|- ||.. . +.. ....++ .+...++..|.+.+.++.+.| ++..-.+ .|.|++|..+-. T Consensus 145 ~~~~------~~~~-------G-~~~-~~~~tg---~~~~~~~~~l~~a~~~a~~~L-~e~dVP~~~~~dr~~vv~P~~y 205 (335) T protein:vir:63 145 DLED------AFSP-------G-VLE-KLDLTG---LTAKQAADKIVRMHRRVVETF-IDRDLGDAVYSEGLTPMSPRVF 205 (335) T ss_pred ccCC------CcCC-------C-cce-eeeecc---CcccccHHHHHHHHHHHHHHH-HhccCCCcccCceEEEeChHHH Confidence 2111 1110 0 000 000011 011124667777788888865 6654432 358999997532 Q ss_pred HH--HHHHHHhccCChhHHHHHHHH-HhhhhhcCceeEECCccCCCceEEecccc------------cEEEEecCc---- Q lcl|NC_015266. 234 HD--KYFPIVNTTQAPTEQLAADLI-VSQKRIGNLPAVRVPFFPKRAMMVTKLEN------------LSIYFQEGA---- 294 (337) Q Consensus 234 a~--k~~~l~n~~~~ptE~~A~~~~-~~~k~igGl~a~~vPffP~~~ilvT~l~N------------LsIY~Q~gs---- 294 (337) .. +.-+++|..-.++.... ... -....+.|.|++..|.||...+--++|.| .-..+|... 
T Consensus 206 ~~Ll~~~~l~n~~~~~s~~~~-~~~~g~v~~v~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~ 284 (335) T protein:vir:63 206 SLLLEHDKLMNVEYQATGATN-DYVKSRVAILNGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITA 284 (335) T ss_pred HHHhccccccccccccccccc-cccCceeEEeeceEEEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEE Confidence 21 11224443211111000 000 01236789999999999998876666644 233333322 Q ss_pred ------eEEeEeeccccceecchhhhcccceeecCCcEEEeeceeeccC Q lcl|NC_015266. 295 ------RRRSLIDNPKRDQIENYESSNDAYVVEDFGCGCVAENIELVAA 337 (337) Q Consensus 295 ------~RR~~~d~p~r~r~e~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) -+.......+=+-+-.|++ .|--+=.+++++.++= +=..| T Consensus 285 ~~~~vt~e~~~~~~~~~~~i~~~~a--~G~g~lRPe~a~~i~~-tg~~~ 330 (335) T protein:vir:63 285 QVAPVQAKLWEDNEKFSWVLDTFQM--YNIGARRPDTAGAIEL-KGIGA 330 (335) T ss_pred EEeecccceeeccchhhHHhHHHHH--cCCcccccceEEEEEE-cCCCc Confidence 1111111112233334444 3333446777887771 11122 No 133 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=21.11 E-value=2.7 Score=18.26 Aligned_cols=256 Identities=13% Similarity=0.079 Sum_probs=113.4 Q ss_pred CChHHHHHHHHHHHHHHHhcCcccccceeeecHHHH-HHHHHHHHhhhhhhcccccccchhhhhhhhccccccccceecc Q lcl|NC_015266. 1 MKKETRQAYRKYAAQIAKLNDTDDVSQKFAVEPSVQ-QTLETKMQESSAFLKSINILPVTELEGEKLGLSVSGPIASRTD 79 (337) Q Consensus 1 M~~~tr~~~~~y~~~~a~~ngv~~~~~~Fsv~P~~~-q~L~~~i~ess~FL~~Inv~~V~~~~Ge~v~~gv~g~iagRt~ 79 (337) |-+.+.++- . -+-|++- ..+.+++.+..-|-+..++. .++.|+. |-.-.|+..+. 
T Consensus 1 ma~~~T~~~-----------------~--~iiPev~~~~v~~~~~~~~~~~~~~~~~--~~l~g~~---G~tv~ip~~~~ 56 (274) T protein:vir:93 1 MPQGITKTS-----------------N--QIIPEVLAPMMQAQLEKKLRFASFAEVD--STLQGQP---GDTLTFPAFVY 56 (274) T ss_pred CCccceehh-----------------h--eechHHHHHHHHHHHHhhhhhccccccc--ccccCCC---CCEEEEEeecc Confidence 222221110 0 1334332 22333333333343433332 2333431 21112222222 Q ss_pred CC------CcccccccccccCccceeeEeeccccccCHHHHHHHhcCccHHHHHHHHHHHHHhhchhhhcccceeccCCC Q lcl|NC_015266. 80 TT------KAERQPIDPTALDSNRYRCEKTDYDTAITYRKLDAWAKFPDFQQRIRNVILNQSALDRIMIGWNGVKAALST 153 (337) Q Consensus 80 t~------~~~R~~~~~~~l~~~~Y~c~qtn~d~~i~y~~LD~wA~~~dF~~~~~~~i~~~~alD~i~IGfNG~s~A~~T 153 (337) .+ .+.--+.+-++.+..+...++-.+ .+.+..++.=...+|+...+.+.+...+|...- T Consensus 57 ~g~~~~~~eg~~i~~~~it~~~~~~~i~~~~~--~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d------------- 121 (274) T protein:vir:93 57 SGDAQVVAEGEKIPTDILETKKREAKIRKIAK--GTSITDEALLSGYGDPQGEQVRQHGLAHANKVD------------- 121 (274) T ss_pred CCCcccccCCCcccccccccceeEEEeeeecc--cccccHHHHHhhccchHHHHHHHHHHHHHHHHH------------- Confidence 11 111113333444555566655443 455666666666678887777666666554332 Q ss_pred ChhhhhhhhccchhHHHHHHhhchhhhcccccccCCceecCCCcccccHHHHHHHHHhcccChhHcCCCCeEEEeChHHH Q lcl|NC_015266. 154 DKAANPLLQDVNIGWLQQYRDRAGHRVLHEGAKEAGKVLVGKGGDYVNLDALVMDIVSSMIDPWFQEDTGLVVICGRELL 233 (337) Q Consensus 154 D~~~nPllqDVNkGWlq~~Re~a~~~v~~~~~~~~~~i~~G~ggdy~nLDalV~da~~~li~~~~r~~~dLVvivG~dLl 233 (337) +-.+..+.. ++.. + .++--+.|+++ ||+.. ++.. ++..-+++|+++.. T Consensus 122 ------------~~~~~~~~~---------a~~~-----~--~~~~~~~d~i~-dA~~~-l~d~--~~~~~~ivv~p~~~ 169 (274) T protein:vir:93 122 ------------NDVLEALMG---------AKLT-----V--NADITKLNGLQ-SAIDK-FNDE--DLEPMVLFINPLDA 169 (274) T ss_pred ------------HHHHHHHhc---------cccc-----c--cccccCHHHHH-HHHHH-hhhc--cCCccEEEeCHHHH Confidence 111222211 1100 0 11112345543 56654 4553 44556899998866 Q ss_pred HHHHH-HHHhccCChhHHHHHHHHH--hhhhhcCceeEECCccCCCceEEecccccEEEEecCceEEeEeeccccceecc Q lcl|NC_015266. 
234 HDKYF-PIVNTTQAPTEQLAADLIV--SQKRIGNLPAVRVPFFPKRAMMVTKLENLSIYFQEGARRRSLIDNPKRDQIEN 310 (337) Q Consensus 234 a~k~~-~l~n~~~~ptE~~A~~~~~--~~k~igGl~a~~vPffP~~~ilvT~l~NLsIY~Q~gs~RR~~~d~p~r~r~e~ 310 (337) +.-.. +..+ --..++.-. ..+. ...++.|++++.-+.+|.+..++..-.++-++.+.+.. ++ .+|+ +. T Consensus 170 ~~L~k~~~~~-f~~~s~~g~-~~~~~G~ig~~~G~~Vi~s~~~p~~t~~l~~~gai~~~~~~~~~---vE--~~Rd--~~ 240 (274) T protein:vir:93 170 GKLRGDASTN-FTRATELGD-DIIVKGAFGEALGAIIVRTNKLEAGTAILAKKGAVKLILKRDFF---LE--VARD--AS 240 (274) T ss_pred HHHHhhhhhc-ccccccccc-cceeecccceecCeeEEEcCCCCcceEEEEeCCeEEEEecCCcc---cc--cccc--hh Confidence 53211 1111 011122110 1111 13479999999999999999999999998887665422 22 2222 22 Q ss_pred hhhhcccceeecCCcEEEee----ceeeccC Q lcl|NC_015266. 311 YESSNDAYVVEDFGCGCVAE----NIELVAA 337 (337) Q Consensus 311 y~s~Ne~YvVEd~~~~a~iE----nI~~~~a 337 (337) ..+ +.=++..-|++.-.-+ -++++.| T Consensus 241 ~~~-d~i~~~~~y~~~~~~~~~~v~~t~~~~ 270 (274) T protein:vir:93 241 TKT-TALYSDKHYVAYLYDESKAVKITKGSG 270 (274) T ss_pred hcc-cEEEEEEEEEEEEEcCCceEEEeeCcc Confidence 221 1222233333333333 2444555 Done!