Query         lcl|NC_019525.1_cdsid_YP_007007129.1 [gene=F395_gp50] [protein=hypothetical protein] [protein_id=YP_007007129.1] [location=27322..28353]
Match_columns 343
No_of_seqs    98 out of 106
Neff          6.8
Searched_HMMs 1612
Date          Thu Nov 7 18:39:20 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_50 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_50_vs_rec_db.hhr

 No Hit                             Prob E-value P-value Score   SS Cols Query HMM Template HMM
  1 protein:vir:5255 Length: 304 # 100.0 9.2E-98 6E-101  552.6 28.8  297 33-341    1-304 (304)
  2 protein:vir:104342 Length: 314 100.0 8.8E-83 5.4E-86 470.4 27.7  307 12-343    1-312 (314)
  3 protein:vir:79642 Length: 329  100.0 4.5E-82 2.8E-85 466.6 27.6  318 1-343     6-327 (329)
  4 protein:vir:107687 Length: 319 100.0 4.2E-79 2.6E-82 450.3 29.2  312 1-342     1-319 (319)
  5 protein:vir:80068 Length: 301  100.0 2.3E-78 1.4E-81 446.3 26.6  294 30-342    1-301 (301)
  6 protein:vir:103285 Length: 296 100.0 1.7E-77 1.1E-80 441.5 26.4  292 27-343    1-294 (296)
  7 protein:vir:94070 Length: 339  100.0 2.6E-73 1.6E-76 418.5 24.9  309 1-342     13-339 (339)
  8 protein:vir:78558 Length: 336  100.0 1.5E-67 9.3E-71 386.9 24.5  312 1-342     10-336 (336)
  9 protein:vir:101557 Length: 336 100.0 1.1E-66 6.7E-70 382.2 25.0  315 1-342     10-336 (336)
 10 protein:vir:3643 Length: 336 # 100.0 1.5E-66 9.5E-70 381.4 25.0  315 1-342     10-336 (336)
 11 protein:vir:106734 Length: 336 100.0 6.4E-66 4E-69   378.0 24.7  312 1-342     10-336 (336)
 12 protein:vir:96079 Length: 382  100.0 5.8E-66 3.6E-69 378.2 23.9  322 1-342     1-382 (382)
 13 protein:vir:107732 Length: 379 100.0 3.3E-65 2E-68   374.1 25.8  312 1-342     1-379 (379)
 14 protein:vir:99576 Length: 388  100.0 6.8E-65 4.2E-68 372.4 23.8  318 1-342     38-388 (388)
 15 protein:vir:7771 Length: 330 #  98.6 1.5E-08 9.4E-12  63.5 16.0  296 1-343     1-322 (330)
 16 protein:vir:105778 Length: 358  98.3 2.1E-08 1.3E-11  62.7 10.9  311 5-343     1-358 (358)
 17 protein:vir:4339 Length: 395 #  98.1 4.4E-07 2.7E-10  55.5 14.9  305 1-343     77-394 (395)
 18 protein:vir:94673 Length: 419   98.1 6.3E-07 3.9E-10  54.6 15.7  315 1-343     71-416 (419)
 19 protein:vir:96392 Length: 324   97.9 5.4E-06 3.3E-09  49.5 17.5  297 1-343     1-314 (324)
 20 protein:vir:78830 Length: 324   97.9 5.4E-06 3.3E-09  49.5 17.5  297 1-343     1-314 (324)
 21 protein:vir:100135 Length: 418  97.9 1.8E-06 1.1E-09  52.0 14.7  303 1-343     90-414 (418)
 22 protein:vir:97148 Length: 324   97.9 7.5E-06 4.7E-09  48.7 17.6  304 1-343     1-314 (324)
 23 protein:vir:2504 Length: 305 #  97.9 7.4E-06 4.6E-09  48.7 17.3  282 27-343    1-298 (305)
 24 protein:vir:81227 Length: 413   97.9 2.3E-06 1.4E-09  51.5 14.5  306 1-343     77-409 (413)
 25 protein:vir:103955 Length: 324  97.9 6E-06   3.7E-09  49.2 16.8  291 1-343     1-314 (324)
 26 protein:vir:101650 Length: 497  97.9 1.8E-06 1.1E-09  52.1 13.9  312 1-343     108-492 (497)
 27 protein:vir:7855 Length: 497 #  97.9 1.8E-06 1.1E-09  52.1 13.9  312 1-343     108-492 (497)
 28 protein:vir:9309 Length: 324 #  97.8 6.8E-06 4.2E-09  48.9 16.5  291 1-343     1-314 (324)
 29 protein:vir:9574 Length: 300 #  97.8 4.3E-06 2.7E-09  50.0 15.3  282 28-343    1-299 (300)
 30 protein:vir:99749 Length: 324   97.8 9.2E-06 5.7E-09  48.2 17.1  297 1-343     1-314 (324)
 31 protein:vir:96223 Length: 324   97.8 8.7E-06 5.4E-09  48.3 16.9  297 1-343     1-314 (324)
 32 protein:vir:8187 Length: 311 #  97.8 4E-06   2.5E-09  50.2 15.1  288 29-343    1-309 (311)
 33 protein:vir:10364 Length: 390   97.8 4E-06   2.5E-09  50.2 15.0  300 1-342     88-390 (390)
 34 protein:vir:1638 Length: 298 #  97.8 5.4E-06 3.4E-09  49.4 15.7  281 30-343    1-298 (298)
 35 protein:vir:94771 Length: 298   97.8 5.3E-06 3.3E-09  49.5 15.6  282 30-343    1-298 (298)
 36 protein:vir:97053 Length: 390   97.8 3.7E-06 2.3E-09  50.3 14.6  300 1-342     88-390 (390)
 37 protein:vir:1433 Length: 435 #  97.8 1.2E-05 7.6E-09  47.5 17.2  308 1-343     65-433 (435)
 38 protein:vir:1886 Length: 385 #  97.8 1.2E-05 7.2E-09  47.6 16.9  303 1-343     61-383 (385)
 39 protein:vir:191 Length: 385 #   97.8 1.2E-05 7.2E-09  47.6 16.9  303 1-343     61-383 (385)
 40 protein:vir:95763 Length: 297   97.7 1.8E-05 1.1E-08  46.6 16.3  279 16-343    1-297 (297)
 41 protein:vir:9759 Length: 303 #  97.6 1.5E-05 9.3E-09  47.0 15.7  288 28-343    1-303 (303)
 42 protein:vir:81070 Length: 390   97.6 9.8E-06 6.1E-09  48.0 14.6  301 1-342     64-390 (390)
 43 protein:vir:104085 Length: 320  97.6 9.1E-06 5.6E-09  48.2 14.0  294 16-343    1-316 (320)
 44 protein:vir:99920 Length: 311   97.6 8.2E-06 5.1E-09  48.5 13.7  292 27-343    1-311 (311)
 45 protein:vir:78223 Length: 333   97.6 2.7E-05 1.7E-08  45.7 16.4  308 16-342    1-333 (333)
 46 protein:vir:8102 Length: 543 #  97.6 1.2E-05 7.3E-09  47.6 14.1  307 1-343     222-541 (543)
 47 protein:vir:102119 Length: 404  97.5 1.9E-05 1.2E-08  46.5 14.6  307 1-343     61-399 (404)
 48 protein:vir:2344 Length: 397 #  97.4 5E-05   3.1E-08  44.2 15.4  290 5-343     1-305 (397)
 49 protein:vir:5739 Length: 366 #  97.4 7.4E-05 4.6E-08  43.2 16.2  308 1-343     32-365 (366)
 50 protein:vir:104256 Length: 458  97.3 3.2E-05 2E-08    45.2 14.1  312 1-343     115-457 (458)
 51 protein:vir:2430 Length: 318 #  97.3 3.8E-05 2.4E-08  44.8 14.1  294 5-343     1-312 (318)
 52 protein:vir:4456 Length: 401 #  97.2 4.3E-05 2.7E-08  44.5 13.8  300 1-343     87-400 (401)
 53 protein:vir:485 Length: 407 #   97.2 0.0001  6.3E-08  42.5 15.7  299 1-343     86-399 (407)
 54 protein:vir:6242 Length: 390 #  97.2 4.9E-05 3.1E-08  44.2 13.8  298 1-343     74-388 (390)
 55 protein:vir:80376 Length: 435   97.2 0.00014 8.4E-08  41.8 17.5  309 1-343     80-433 (435)
 56 protein:vir:78523 Length: 338   97.2 0.00014 8.9E-08  41.7 17.2  312 4-343     1-334 (338)
 57 protein:vir:9410 Length: 415 #  97.1 0.00019 1.2E-07  41.0 16.1  303 1-343     92-403 (415)
 58 protein:vir:1328 Length: 392 #  97.0 0.00012 7.3E-08  42.1 14.1  301 1-343     71-390 (392)
 59 protein:vir:80684 Length: 315   97.0 0.00021 1.3E-07  40.8 15.1  287 27-343    1-305 (315)
 60 protein:vir:41 Length: 299 # N  96.9 0.00024 1.5E-07  40.4 16.0  275 23-343    1-297 (299)
 61 protein:vir:8420 Length: 477 #  96.8 0.00013 8.1E-08  41.9 13.0  321 1-343     98-475 (477)
 62 protein:vir:4600 Length: 415 #  96.8 0.00032 2E-07    39.7 16.4  303 1-343     71-403 (415)
 63 protein:vir:4700 Length: 415 #  96.8 0.00032 2E-07    39.7 16.4  303 1-343     71-403 (415)
 64 protein:vir:81100 Length: 415   96.8 0.00032 2E-07    39.7 15.8  303 1-343     93-403 (415)
 65 protein:vir:98339 Length: 415   96.8 0.00032 2E-07    39.7 15.8  303 1-343     93-403 (415)
 66 protein:vir:79987 Length: 415   96.8 0.00032 2E-07    39.7 15.8  303 1-343     93-403 (415)
 67 protein:vir:4226 Length: 326 #  96.8 0.00035 2.1E-07  39.6 15.3  302 4-343     1-322 (326)
 68 protein:vir:6212 Length: 434 #  96.8 0.00014 8.4E-08  41.8 12.7  308 1-343     102-432 (434)
 69 protein:vir:4092 Length: 390 #  96.7 0.00041 2.5E-07  39.2 16.3  302 1-343     58-376 (390)
 70 protein:vir:100247 Length: 425  96.7 0.00043 2.7E-07  39.0 16.7  306 1-343     100-423 (425)
 71 protein:vir:4511 Length: 409 #  96.4 0.0006  3.7E-07  38.2 13.8  306 1-343     72-405 (409)
 72 protein:vir:4997 Length: 397 #  96.3 0.00082 5.1E-07  37.5 14.1  289 1-343     75-384 (397)
 73 protein:vir:105905 Length: 304  96.2 0.00091 5.6E-07  37.3 16.5  288 16-343    1-304 (304)
 74 protein:vir:94142 Length: 304   96.2 0.00091 5.6E-07  37.3 16.5  288 16-343    1-304 (304)
 75 protein:vir:95376 Length: 425   96.1 0.0008  4.9E-07  37.6 13.2  302 1-343     96-420 (425)
 76 protein:vir:4856 Length: 293 #  96.0 0.0012  7.3E-07  36.7 14.7  269 15-343    1-280 (293)
 77 protein:vir:96762 Length: 632   96.0 0.0012  7.3E-07  36.6 14.6  298 1-343     313-632 (632)
 78 protein:vir:4159 Length: 315 #  95.7 0.0016  9.7E-07  36.0 18.0  306 4-341     1-315 (315)
 79 protein:vir:4197 Length: 314 #  95.6 0.0018  1.1E-06  35.6 17.1  301 11-343    1-312 (314)
 80 protein:vir:7409 Length: 408 #  95.4 0.0022  1.4E-06  35.1 13.7  292 1-343     74-393 (408)
 81 protein:vir:93881 Length: 387   94.6 0.0023  1.4E-06  35.0 10.9  286 1-343     71-380 (387)
 82 protein:vir:4830 Length: 397 #  94.5 0.0041  2.6E-06  33.7 12.2  289 1-343     72-384 (397)
 83 protein:vir:9643 Length: 377 #  94.4 0.0046  2.9E-06  33.4 16.9  301 1-343     54-376 (377)
 84 protein:vir:1268 Length: 397 #  94.3 0.0048  3E-06    33.3 14.3  289 1-343     78-396 (397)
 85 protein:vir:78350 Length: 383   94.3 0.0049  3E-06    33.3 14.1  293 1-343     59-374 (383)
 86 protein:vir:4953 Length: 397 #  93.9 0.0062  3.8E-06  32.7 13.6  291 1-343     68-384 (397)
 87 protein:vir:95963 Length: 395   93.6 0.007   4.4E-06  32.4 15.8  295 1-343     62-375 (395)
 88 protein:vir:105038 Length: 428  93.4 0.0077  4.8E-06  32.2 14.7  309 1-343     76-427 (428)
 89 protein:vir:3991 Length: 404 #  92.9 0.0096  5.9E-06  31.7 13.7  290 1-343     77-392 (404)
 90 protein:vir:1025 Length: 408 #  92.6 0.011   6.6E-06  31.4 14.7  291 1-343     74-392 (408)
 91 protein:vir:96978 Length: 387   91.8 0.014   8.8E-06  30.7 11.8  285 1-343     95-380 (387)
 92 protein:vir:2685 Length: 387 #  91.8 0.014   8.8E-06  30.7 11.8  285 1-343     95-380 (387)
 93 protein:vir:94424 Length: 387   91.8 0.014   8.8E-06  30.7 11.8  285 1-343     95-380 (387)
 94 protein:vir:78640 Length: 352   91.8 0.014   8.9E-06  30.7 12.4  288 1-343     36-345 (352)
 95 protein:vir:100172 Length: 394  91.7 0.015   9E-06    30.7 13.4  289 1-343     75-383 (394)
 96 protein:vir:80128 Length: 466   91.2 0.017   1E-05    30.3 16.1  309 1-343     116-447 (466)
 97 protein:vir:102873 Length: 392  91.0 0.018   1.1E-05  30.1 15.1  293 1-343     62-383 (392)
 98 protein:vir:102082 Length: 392  91.0 0.018   1.1E-05  30.1 15.1  293 1-343     62-383 (392)
 99 protein:vir:105004 Length: 392  91.0 0.018   1.1E-05  30.1 15.1  293 1-343     62-383 (392)
100 protein:vir:107593 Length: 392  91.0 0.018   1.1E-05  30.1 15.1  293 1-343     62-383 (392)
101 protein:vir:9361 Length: 402 #  90.6 0.02    1.2E-05  29.9 12.0  289 1-343     86-395 (402)
102 protein:vir:962 Length: 397 #   88.5 0.032   2E-05    28.8 12.3  284 1-343     103-396 (397)
103 protein:vir:101291 Length: 381  88.4 0.032   2E-05    28.8 15.8  303 1-343     52-367 (381)
104 protein:vir:9509 Length: 381 #  88.4 0.032   2E-05    28.8 15.8  303 1-343     52-367 (381)
105 protein:vir:107882 Length: 307  87.9 0.035   2.2E-05  28.5 11.3  274 29-343    1-300 (307)
106 protein:vir:101607 Length: 379  87.7 0.037   2.3E-05  28.4 14.1  289 1-343     72-378 (379)
107 protein:vir:81160 Length: 371   87.6 0.038   2.3E-05  28.4 15.0  295 1-343     61-370 (371)
108 protein:vir:3158 Length: 321 #  86.7 0.044   2.7E-05  28.0 17.2  297 1-343     1-311 (321)
109 protein:vir:1383 Length: 421 #  83.7 0.066   4.1E-05  27.1 13.2  289 1-343     75-394 (421)
110 protein:vir:99675 Length: 324   83.7 0.066   4.1E-05  27.0 11.3  257 63-343    1-295 (324)
111 protein:vir:79078 Length: 307   83.4 0.069   4.3E-05  26.9 10.6  278 29-343    1-300 (307)
112 protein:vir:3845 Length: 395 #  83.0 0.072   4.5E-05  26.8 17.0  290 1-343     71-382 (395)
113 protein:vir:9704 Length: 394 #  80.5 0.094   5.8E-05  26.2 14.7  281 1-343     75-389 (394)
114 protein:vir:1084 Length: 437 #  77.0 0.13    8.1E-05  25.4 13.1  290 1-343     120-428 (437)
115 protein:vir:98635 Length: 377   74.4 0.16    9.8E-05  25.0 13.3  300 1-343     32-376 (377)
116 protein:vir:102655 Length: 322  71.7 0.19    0.00012  24.5 15.0  298 14-343    1-320 (322)
117 protein:vir:3870 Length: 400 #  64.1 0.31    0.00019  23.4 13.4  286 1-343     79-398 (400)
118 protein:vir:108211 Length: 318  62.4 0.34    0.00021  23.2 11.7  277 19-343    1-316 (318)
119 protein:vir:94933 Length: 330   55.7 0.47    0.00029  22.4 13.1  306 1-343     1-329 (330)
120 protein:vir:100632 Length: 381  39.9 1       0.00062  20.6 17.1  301 1-343     30-367 (381)
121 protein:vir:9820 Length: 272 #  38.7 1.1     0.00065  20.5 14.9  263 19-343    1-268 (272)
122 protein:vir:3033 Length: 272 #  38.7 1.1     0.00065  20.5 14.9  263 19-343    1-268 (272)
123 protein:vir:100884 Length: 389  38.6 1.1     0.00066  20.5 15.9  289 1-343     73-381 (389)
124 protein:vir:8843 Length: 317 #  37.8 1.1     0.00068  20.4 13.6  277 36-343    1-315 (317)
125 protein:vir:98480 Length: 348   36.3 1.2     0.00073  20.2 13.7  278 23-343    1-348 (348)
126 protein:vir:106590 Length: 349  31.6 1.5     0.00092  19.6 14.4  287 1-343     1-323 (349)
127 protein:vir:99888 Length: 309   28.0 1.8     0.0011   19.2 14.8  274 23-343    1-301 (309)
128 protein:vir:6378 Length: 346 #  24.6 2.2     0.0013   18.8 19.1  284 35-343    1-320 (346)
129 protein:vir:97397 Length: 517   23.2 2.3     0.0015   18.6 13.5  303 1-343     172-515 (517)
130 protein:vir:97255 Length: 310   21.0 2.7     0.0017   18.2 11.9  285 1-343     1-309 (310)
131 protein:vir:97331 Length: 319   20.8 2.7     0.0017   18.2 12.4  286 1-343     1-294 (319)
132 protein:vir:94800 Length: 319   20.8 2.7     0.0017   18.2 12.4  286 1-343     1-294 (319)
133 protein:vir:94711 Length: 347   20.6 2.7
0.0017 18.2 10.7 290 6-343 1-345 (347) No 1 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=100.00 E-value=9.2e-98 Score=552.57 Aligned_cols=297 Identities=21% Similarity=0.234 Sum_probs=274.1 Q ss_pred hhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEe--eeccchhh--hhhcccccCCccccccccccc Q lcl|NC_019525. 33 DLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTT--YRSFSLAE--DFATGIIDTGNSNGKLAAVDT 108 (343) Q Consensus 33 d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~--~~~vg~a~--~ia~g~~~~g~~a~Dip~vd~ 108 (343) =+|++|+++||++||++|||.|||+++++++|||+++++.|++++++ ++.+|+|+ +|+ ++++|||++|+ T Consensus 1 ~~~lafl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~~d~~G~a~~~~i~-------~~a~dip~vd~ 73 (304) T protein:vir:52 1 MSLLAYVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYGADEHGSLDDGLIT-------VGTSTLDQVEV 73 (304) T ss_pred CchHHHHHHHHHHHhhhhhccccccchhhhhccccCCCCcccceEEEeeeeccCcccccccC-------CcCCccceeec Confidence 48899999999999999999999999999999999999999988765 57789999 655 56799999999 Q ss_pred cccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeeccc- Q lcl|NC_019525. 
109 GVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTF- 187 (343) Q Consensus 109 ~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~- 187 (343) +++++..+|+.||++|+||++||++|+++| +||+++|+++||+++++++|+++|+|+++.+|++||||||+|++..++ T Consensus 74 ~~~~~~~~i~~~~~~~~y~~~El~~a~~~g-~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g~~GllN~p~v~~~~~~~ 152 (304) T protein:vir:52 74 GFTPTRSYIVPWAKSVTWTKPELEQGKLLG-LALNTAKIMALNKNAQQTLQKVAFLGHAKDSRLTGLLNNKSVEVYAIKG 152 (304) T ss_pred ccceeEEEEEEEeeeeeecHHHHHHHHHhC-CCcHHHHHHHHHHHHHhhhceEEEEeeccccceEEEEeCCCcceeeecC Confidence 999999999999999999999999999998 599999999999999999999999998666799999999999864332 Q ss_pred --CCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCCCcchhhhhHHHhhcchhcCCcceE Q lcl|NC_019525. 188 --LTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTKQVLEDTFKEITRNSSFE 265 (343) Q Consensus 188 --~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~ 265 (343) .++.|++||++||++|||++++++|.+|++.++|+||+|||++|.+|+++++++ .++|+++||+.|+++ .+|++|+ T Consensus 153 ~~a~~~w~~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp~~~~~l~~~~~~~-~~~Tvl~~l~~n~~~-~~g~~l~ 230 (304) T protein:vir:52 153 AAQNTKVQAMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDSLDLAHLALVQRAN-TDTTALEFLTKHLSA-AAGRQVA 230 (304) T ss_pred CccCCccccCCHHHHHHHHHHHHHHHHhccCceecCceEEeCHHHHHHHhhccCCC-CCchHHHHHHHhccc-ccCCcce Confidence 345699999999999999999999999999999999999999999999988776 568999999998886 5799999 Q ss_pred EeechhhhhcccccCCCccEEEEEEcCcceEEEecCCchhhcchhhcCCceEEeceeeeeccEEEEcCceEEeeec Q lcl|NC_019525. 
266 ILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPVDYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDI 341 (343) Q Consensus 266 I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~ 341 (343) |++++|+ ++++|.|||||||+|++|+++++|+||||++++++|+++.+.+++||++|+|||+||||++++|+|- T Consensus 231 I~~v~~~--~~~~g~~g~~r~vvY~~d~~~~~~~vP~p~~~l~~q~~~~~~~~vp~~~r~gGv~v~~P~a~~y~D~ 304 (304) T protein:vir:52 231 IKALPSN--YGTRVTDGKTRAMVYVNSKEHVIFDVPMSPTVLDAQPKGLLAFESGLRMAFGGVTFMEPDSALYVDY 304 (304) T ss_pred EEEeccc--ccccCCCCceEEEEEecChhheEEecCccccccchhhcCCceEEecceeeeeeEEEEccceeeeecC Confidence 9999873 4567888999999999999999999999999999999876789999999999999999999999999 No 2 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=100.00 E-value=8.8e-83 Score=470.45 Aligned_cols=307 Identities=15% Similarity=0.162 Sum_probs=271.9 Q ss_pred hHHHHHHHHhhhccchhh---hhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEe--eeccchh Q lcl|NC_019525. 12 EKILLNAQEAKIAGVIQR---LCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTT--YRSFSLA 86 (343) Q Consensus 12 ~~~~~~a~~~~~~~~~~~---~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~--~~~vg~a 86 (343) --+.-+|+++.+..++++ ..+|++++|+++||++||++|||.+||+|+++++|||.++++.|++++++ ++.+|.| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~~~~et~~~~~~e~~G~a 80 (314) T protein:vir:10 1 MAIKFDAEQAKITTHLEQMGVEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIPGHAKYFEYPEFDGVGIA 80 (314) T ss_pred CccchHHHHHHHHHHHHhhcccchhhhHHHHHHHHHHHHHHHhhhhccccccceeeccccCCCCceeEEEeeeeccccce Confidence 233345777777776666 44788899999999999999999999999999999999999999987765 6788898 Q ss_pred hhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeee Q lcl|NC_019525. 
87 EDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGM 166 (343) Q Consensus 87 ~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~ 166 (343) ++++ ++++|||++|+++++++.+++.|+.+|+|+++||++|++.| +||+++|+.+|++++++++|+++|+|+ T Consensus 81 ~~~~-------d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g-~~l~~~k~~aA~~~~~~~~n~i~f~G~ 152 (314) T protein:vir:10 81 QIIA-------DYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATG-QSLSARKQALAFEAHDNLLDKLVWSGS 152 (314) T ss_pred eeeC-------CcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhC-CChHHHHHHHHHHHHHHhhceEEEeec Confidence 8775 66889999999999999999999999999999999999997 699999999999999999999999996 Q ss_pred ccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCCCcchh Q lcl|NC_019525. 167 KDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFPIKS 246 (343) Q Consensus 167 ~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~t 246 (343) .. .|++||||+|+|+..++ ++.| +|++||++||+++++++|.+|++.|.|++|+|||++|.+|+++ +++++.| T Consensus 153 ~~-~g~~GLlN~p~v~~~~~--~~~W--aT~~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~~L~~~--~~~~~~t 225 (314) T protein:vir:10 153 AP-HGIVSVFDQPNINNVVA--TPNW--SVPQNAIDDVTAMIDAVESSTQGLHHVTDILLPASARRVMQGL--VPQTNLS 225 (314) T ss_pred cc-ccceeEeecCCCccccC--CCCc--ccHHHHHHHHHHHHHHHHHhcCccccceeEEecHHHHHhhccc--ccCCCcc Confidence 55 59999999999986544 5668 5899999999999999999999999999999999999999543 3456789 Q ss_pred hhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecCCchhhcchhhcCCceEEeceeeeec Q lcl|NC_019525. 
247 TKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPVDYTSTLANSVNNFQFQNAAYGQFT 326 (343) Q Consensus 247 l~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~~~l~p~~~~~l~~~v~~~~r~G 326 (343) +++||++|++ +|+|++++||.+ +|.+|++|||+|++++++++|++||+++++++ +.+++.+++||++|+| T Consensus 226 vl~~l~~n~~------~l~I~~~~el~~---ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~~-e~~~~~~~~~~~~r~~ 295 (314) T protein:vir:10 226 YGELFTRNNP------GLTIRFLQFLDN---YDGAGGKAALAFEKSPLNMSIEIPEVTNVLPA-QPKDLHFRYPVTSKAT 295 (314) T ss_pred HHHHHHHhCC------CcEEEEcccccc---cCCCcceEEEEEecCCcEEEEecCccceeecc-eecCceEEEcceeeeE Confidence 9999998875 578999998764 55678999999999999999999999999976 5557999999999999 Q ss_pred cEEEEcCceEEeeecCC Q lcl|NC_019525. 327 GVLAYRPKELLYLDIPV 343 (343) Q Consensus 327 Gv~v~yP~a~~Y~D~~~ 343 (343) ||+||||.+++|+|==- T Consensus 296 Gv~i~~P~ai~~~dGI~ 312 (314) T protein:vir:10 296 GLIVYRPLTMAVIKGIT 312 (314) T ss_pred EEEEECcceeEeeeeee Confidence 99999999999999433 No 3 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=100.00 E-value=4.5e-82 Score=466.57 Aligned_cols=318 Identities=14% Similarity=0.105 Sum_probs=272.7 Q ss_pred CCceeeecCCchHHHHHHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEe- Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTT- 79 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~- 79 (343) |.|-...|.. ... 
+.+...+.......+|++.+|+++||++||++|||.++++++++++|||.++.+.|++++++ T Consensus 6 ~~~~~~~d~~--~~~--~~a~~~~~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~~~~~~~t~~ 81 (329) T protein:vir:79 6 MSKEMKYDEF--EAN--VIANHMQLRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELSDTDKTFEYQ 81 (329) T ss_pred hhhhhccchh--hhh--hHhhhcccccceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCCCceeEEEee Confidence 4433333322 111 12222233444455688889999999999999999999999999999999999999987665 Q ss_pred -eeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhh Q lcl|NC_019525. 80 -YRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGI 158 (343) Q Consensus 80 -~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~ 158 (343) ++.+|.+++++ ++++|||++|+++.+++.+++.|+.+|+|+++||++|+++| +||+++|+.+|++++++++ T Consensus 82 ~~~~~G~a~~~~-------d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g-~~l~~~k~~aA~~~~~~~~ 153 (329) T protein:vir:79 82 TFDKVGHAKIIA-------DYTDDLSTVDALMTSEFGKVFRLGNAFLISIDEIKAGQRTG-KSLSTRKANAAQNAHDQLV 153 (329) T ss_pred eeecceeeeeec-------CcccccceeecccceeEEEEEEEEEEEEecHHHHHHHHHhC-CChHHHHHHHHHHHHHHhh Confidence 67789998775 66789999999999999999999999999999999999998 6999999999999999999 Q ss_pred hheEeeeeccccceeeeeecCCceeec--ccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhc Q lcl|NC_019525. 159 QEIAFVGMKDNANVKGLLTQTGNVVNN--TFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAG 236 (343) Q Consensus 159 n~v~~~G~~~~~g~~GLlN~p~v~~~~--a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~ 236 (343) |+++|+|+ +..|++||||+|++++.. 
+.+++.|++||++||++||+++++++|.+|++.+.|++|+|||++|.+|++ T Consensus 154 n~i~f~G~-~~~g~~GLlN~p~v~~~~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~Lpp~~~~~L~~ 232 (329) T protein:vir:79 154 NHLVFKGS-KPHKIISVFEHPNLTTINSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILIPPSMRKVLMV 232 (329) T ss_pred ccEEEeec-ccccceeeecCCCccccccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHHhhc Confidence 99999995 446899999999997533 334557999999999999999999999999999999999999999999976 Q ss_pred cccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecCCchhhcchhhcCCce Q lcl|NC_019525. 237 AASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPVDYTSTLANSVNNFQ 316 (343) Q Consensus 237 ~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~~~l~p~~~~~l~ 316 (343) +.. ++ +.|+++||++|+++ |+|++++||++ +|.+|+|||++|+++++++++++|||++++++ +++++. T Consensus 233 ~~~-~~-~~tvl~~lk~~~~~------l~I~~~~el~~---ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~~-q~~~~~ 300 (329) T protein:vir:79 233 RMP-ET-TMSYLDYFKQQNGG------ITIESISELED---IDGAGTKAALVYEKDPMNMSIEIPEAFNMLTA-QPKDLH 300 (329) T ss_pred ccC-CC-CccHHHHHHHhCCC------cEEEEcccccc---cCCCCceEEEEEecCCceEEEecCcceeeeec-eecCce Confidence 643 33 69999999988764 78999988765 55678999999999999999999999999976 556799 Q ss_pred EEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 
317 FQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 317 ~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) |++||++|+|||+||||.+++|+|==| T Consensus 301 ~~v~~~~r~~Gv~i~~P~ai~~~dGI~ 327 (329) T protein:vir:79 301 FKVPCTSKCTGLTIYRPLTLVLIKGLV 327 (329) T ss_pred EEEceeeeEEEEEEECcceeeeeeeee Confidence 999999999999999999999999666 No 4 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=100.00 E-value=4.2e-79 Score=450.30 Aligned_cols=312 Identities=14% Similarity=0.116 Sum_probs=271.4 Q ss_pred CCceeeecCCchHHHHHHHHhhhcc-----chhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCcee Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAKIAG-----VIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWST 75 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~~~~-----~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~ 75 (343) ||.+= ..+|++..|.. .++....++...|.++||++||+++||.++++++++++|||.+..+.|++ T Consensus 1 ~~~~~---------~~~~~~~~~~~~~~~~~~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~ 71 (319) T protein:vir:10 1 MTTKK---------FDEADKSNVEMYLIQAGVKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTDK 71 (319) T ss_pred CCCcc---------hhHHhhHHHHHHHhhccchhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCCCceE Confidence 76532 22333332222 23333334455799999999999999999999999999999999999998 Q ss_pred EEEe--eeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHH Q lcl|NC_019525. 
76 MLTT--YRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKN 153 (343) Q Consensus 76 ~~~~--~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a 153 (343) ++++ ++.+|.+++++ ++++|||++++++.++..|++.|+.+|+|+++||++|++.| +||+++|+.+|+++ T Consensus 72 ~~~~~~~~~~G~a~~~~-------d~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g-~~l~~~k~~aA~~~ 143 (319) T protein:vir:10 72 TFEYMTFDKVGTAQIIA-------DYTDDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATG-RPLSTRKASACQLA 143 (319) T ss_pred EEEeeeeccccceeeec-------CccccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhC-CChHHHHHHHHHHH Confidence 8765 67889998775 66789999999999999999999999999999999999997 69999999999999 Q ss_pred HHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHH Q lcl|NC_019525. 154 WDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTG 233 (343) Q Consensus 154 ~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~ 233 (343) +++++|+++|+|++. .|++||||+||+++.+++.++.|++||++||++||+.+++++|.+|++.+.|++|+|||++|.+ T Consensus 144 ~~~~~n~i~f~G~~~-~g~~GLlN~p~~~~~~~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~ 222 (319) T protein:vir:10 144 HDQLVNRLVFKGSAP-HKIVSVFNHPNITKITSGKWIDVSTMKPETAEAELTQAIETIETITRGQHRATNILIPPSMRKV 222 (319) T ss_pred HHHhhceEEEeeccc-ccceeEEeCCCceeeecCCCCCccccCHHHHHHHHHHHHHHHHHhcCceeeceEEEecHHHHHh Confidence 999999999999655 5999999999999988877788999999999999999999999999999999999999999999 Q ss_pred HhccccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecCCchhhcchhhcC Q lcl|NC_019525. 
234 LAGAASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPVDYTSTLANSVN 313 (343) Q Consensus 234 L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~~~l~p~~~~ 313 (343) |+++ .+++ +.|++++|+.|+++ |+|++++|+.+ +|.+|+||||+|+++++++++++||++++++++ .+ T Consensus 223 L~~~-~~~~-~~t~l~~lk~~~~~------l~I~~~pel~~---ag~~g~~~~v~y~~~~~~~~~~v~~~~~~~~~e-~~ 290 (319) T protein:vir:10 223 LAIR-MPET-TMSYLDYFKSQNSG------IEIDSIAELED---IDGAGTKGVLVYEKNPMNMSIEIPEAFNMLPAQ-PK 290 (319) T ss_pred hhcc-cCCC-CeeHHHHHHHhcCC------ceEEEeeeecc---cCCCcceEEEEEecCCceEEEecCcceeeeeee-ec Confidence 9755 4444 68999999987764 78999988765 555789999999999999999999999999764 55 Q ss_pred CceEEeceeeeeccEEEEcCceEEeeecC Q lcl|NC_019525. 314 NFQFQNAAYGQFTGVLAYRPKELLYLDIP 342 (343) Q Consensus 314 ~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~ 342 (343) ++.|++||++|+|||+||||.+++|+|== T Consensus 291 ~l~~~~~~~~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 291 DLHFKVPCTSKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred CceEEEeeeeeeEEEEEEccceeEeeecC Confidence 79999999999999999999999999954 No 5 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=100.00 E-value=2.3e-78 Score=446.26 Aligned_cols=294 Identities=17% Similarity=0.146 Sum_probs=269.9 Q ss_pred hhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEe--eeccchhhhhhcccccCCcccccccccc Q lcl|NC_019525. 
30 LCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTT--YRSFSLAEDFATGIIDTGNSNGKLAAVD 107 (343) Q Consensus 30 ~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~--~~~vg~a~~ia~g~~~~g~~a~Dip~vd 107 (343) +.+|+..+|+++||++||+++||++++++.+++++||.+..+.|++++++ ++.+|++++++ ++++|||+++ T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~-------~~~~dip~~~ 73 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIA-------NGADDLPLVD 73 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEec-------Cccccccccc Confidence 77888889999999999999999999999999999999999999987765 46778888765 5678999999 Q ss_pred ccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceee--- Q lcl|NC_019525. 108 TGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVN--- 184 (343) Q Consensus 108 ~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~--- 184 (343) +.++++..|++.|+.+|+|+++||++|++.| +||+++|+++|++++++++|+++|+|++. .|++||+|+|+++.. T Consensus 74 ~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g-~~l~~~k~~aa~~~~~~~~n~~~f~G~~~-~g~~GLlN~p~~~~~~~~ 151 (301) T protein:vir:80 74 VDMVRKSVPIYSIGIGLSYTIQDLRAARMQG-TTVDAAKATTVRRAIAEKENSIAFRGEKK-YAIKGAFEATGIQIDVSP 151 (301) T ss_pred ccceeEEEEEEEEEeeeeecHHHHHHHHHhC-CChHHHHHHHHHHHHHHhhceEEeeeccc-ccceeeecCCCccccccc Confidence 9999999999999999999999999999997 69999999999999999999999999655 699999999998653 Q ss_pred --cccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCCCcchhhhhHHHhhcchhcCCc Q lcl|NC_019525. 
185 --NTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTKQVLEDTFKEITRNS 262 (343) Q Consensus 185 --~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~ 262 (343) ++++.+.|++||++||++||+++++++|.+|++.+.|++|+|||++|.+|+++++++.++.|++++++.|+++ T Consensus 152 ~~~~~~~~~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~~~tvl~~l~~~~~~----- 226 (301) T protein:vir:80 152 TTGVGNVSKWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINKKRYSNEDSRSVLKVLQDNAWF----- 226 (301) T ss_pred CcccccccccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhhccccCCCCeeHHHHHHHHcCc----- Confidence 4455678999999999999999999999999999999999999999999999999998899999999988876 Q ss_pred ceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecCCchhhcchhhcCCceEEeceeeeeccEEEEcCceEEeeecC Q lcl|NC_019525. 263 SFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPVDYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIP 342 (343) Q Consensus 263 ~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~ 342 (343) ++|++++||++ +|.+|+|||++|+++++++++++|||+++++++++ ++.|++||++|+|||+||||.+++|+|== T Consensus 227 -~~I~~~p~L~~---~g~~g~~~~v~~~~~~d~~~~~v~~~~~~~~~e~~-~~~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 227 -SAIVRVPDLAG---MGTAGSDSFAVIHDSNETAELIIPMDITRHPEEYS-FPRTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred -ceEEEcceecc---CCCCcccEEEEEecCCcEEEEEecCceeeecceec-CceeEeeeeeeeEEEEEEccceEEEEecC Confidence 67999887765 55678999999999999999999999999987655 68999999999999999999999999954 No 6 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=100.00 E-value=1.7e-77 Score=441.47 Aligned_cols=292 Identities=12% Similarity=0.146 Sum_probs=260.3 Q ss_pred hhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEe--eeccchhhhhhcccccCCccccccc Q lcl|NC_019525. 
27 IQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTT--YRSFSLAEDFATGIIDTGNSNGKLA 104 (343) Q Consensus 27 ~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~--~~~vg~a~~ia~g~~~~g~~a~Dip 104 (343) +-...+|++++|+++||++||+++||+++++|+++++|||.+..+.|++++++ ++.+|.+++++ ++++||| T Consensus 1 ~~~~~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~-------~~~~dip 73 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQIVA-------DYTDDLP 73 (296) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCceeEeC-------CCccccc Confidence 33445689999999999999999999999999999999999999999988765 56788888775 6678999 Q ss_pred cccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceee Q lcl|NC_019525. 105 AVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVN 184 (343) Q Consensus 105 ~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~ 184 (343) ++|+++++++.+++.|+.+|+|+++||++|++.| +||+++|+.+|++++++++|+++|+|++. +|++||||+|+++.. T Consensus 74 ~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g-~~l~~~ka~aA~~~~~~~~n~~~f~G~~~-~g~~GLlN~p~v~~~ 151 (296) T protein:vir:10 74 LVDALATERQGKVFRFGNAFLISIDEIKVGQATG-QSLSTRKQSLAFEAHDKLLDKLVWSGSTA-HGIPSVFDYPNINNV 151 (296) T ss_pred eeeccceeEEEEEEEEEeeeeecHHHHHHHHHhC-CChHHHHHHHHHHHHHHhhceEEEeeccc-ccceeEeecCCCccc Confidence 9999999999999999999999999999999997 69999999999999999999999999655 699999999999876 Q ss_pred cccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCCCcchhhhhHHHhhcchhcCCcce Q lcl|NC_019525. 185 NTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTKQVLEDTFKEITRNSSF 264 (343) Q Consensus 185 ~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l 264 (343) ++ +++|+++| ||++||+++++++|.+|++.+.|++|+|||++|.+|+++ ++. 
.+.|++++|++|+++ + T Consensus 152 ~~--~~~W~~~t--~i~~Di~~~~~~l~~~s~g~~~p~~l~L~p~~~~~L~~~-~~~-~~~t~l~~ik~~~~~------l 219 (296) T protein:vir:10 152 VS--GGSWSQPT--TAVSDITSLLDIIETSTNGQHRATHLLLPTTARRIMQNL-VPG-TSVSYGEFFRQNNSG------V 219 (296) T ss_pred cc--cCCccCHH--HHHHHHHHHHHHHHHhhCceecceeEEeCHHHHHHHhhc-cCC-CCccHHHHHHHhcCC------c Confidence 55 45697655 999999999999999999999999999999999999655 444 458999999988764 7 Q ss_pred EEeechhhhhcccccCCCccEEEEEEcCcceEEEecCCchhhcchhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 265 EILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPVDYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 265 ~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) +|++++||.+ ++.+||+|||+|++++++++|++||++++++++ .+++.+++||++|+|||+||||.+++|+|==- T Consensus 220 ~i~~~~~l~~---a~~~g~~~~v~~~~~~~~~~~~v~~~~~~~~~e-~~~l~~~~~~~~~~~Gv~i~~P~ai~~~dGI~ 294 (296) T protein:vir:10 220 TVEFVQYLND---YNGTGTSAAIAYEKDPNNMAIEIPEATNALPAQ-PKDLHFKIPVTSKATGLIVYRPLTMAVMKGIT 294 (296) T ss_pred eEEEeeeecc---CCCCcceEEEEEEcCCceEEEEcCcceeeeccc-ccCceEEEeeEeeEEEEEEECCceeEEEeeee Confidence 8999887765 556789999999999999999999999999765 45799999999999999999999999998433 No 7 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=100.00 E-value=2.6e-73 Score=418.54 Aligned_cols=309 Identities=11% Similarity=0.083 Sum_probs=254.8 Q ss_pred CCce-eeecCCchHHHHHHHHhhhccchhhhhhhhhh-----------hhhHHHHHHhhhhhhhcccccccchhhcceec Q lcl|NC_019525. 1 MKKF-VIRNSKGEKILLNAQEAKIAGVIQRLCNDLGF-----------EIDVTTLTTLMKKIIEQKFFEISPADYMPIRV 68 (343) Q Consensus 1 ~~~~-~~~~~~~~~~~~~a~~~~~~~~~~~~~~d~~~-----------~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~ 68 (343) +.++ ++.|+.+.+. ++..+..+.+|.+. 
-+-++||++||+++|+.+||+++++++||+++ T Consensus 13 l~~~g~~~~~~~~~~--------~~~~~~~~a~d~~~~~~~~~~~~~~~i~a~~~~~i~~~vy~~~~~~~~~~~l~pv~t 84 (339) T protein:vir:94 13 LEKVGIIFDGYSPKS--------ISSEVSAYAMDAVNLTPTLQTTANAGIPAWMTTFVDRRVIDIQLAPMAAAKIFPEVK 84 (339) T ss_pred HHhhceeeccchhhh--------cchhhHhhhccccccccccccccccchhhhhhhhhchhheeecccccchhhhccccc Confidence 1111 1122222221 23333444444321 14467999999999999999999999999988 Q ss_pred CCCCc-eeEEE--eeeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHH Q lcl|NC_019525. 69 GEGAW-STMLT--TYRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITA 145 (343) Q Consensus 69 ~~~~w-~~~~~--~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~ 145 (343) ++ +| +++++ .++.+|+|++++ +++| +|+++++.+..+..++..+++|+|+++|+++|+++| ++|+++ T Consensus 85 ~g-~w~~~t~~y~~~e~~G~a~~yg-------d~ad-~Pl~~~~v~~~~~~v~~~~~g~~y~~~E~~~A~~~g-~~l~~~ 154 (339) T protein:vir:94 85 KG-DWTTTYGVFIIAEPVGQVATYS-------DWSA-NGMSKANVNFESRQNYRYQTWTEYGDLEMATYGEAG-IDYVAR 154 (339) T ss_pred CC-CCcccEEEEeeeecccceEEcc-------cccC-CCcccccceeeEEeEEEEEEEEeecHHHHHHHHhhC-CChHHH Confidence 75 56 56655 468899999885 5664 499999999999999999999999999999999997 799999 Q ss_pred HHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCcee---cCC Q lcl|NC_019525. 146 LEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTA---MPN 222 (343) Q Consensus 146 k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~---~p~ 222 (343) |+.+||+++++++|+++|+|+++ .|++||||||++++.. 
++++.|++||++||++|||++++++|.+|++.+ .|+ T Consensus 155 Ka~aA~~al~~~~N~i~~~Gd~~-~~~~GLlN~P~l~~~v-~~s~~Wa~kT~~eI~~Di~~~~~~l~~~s~g~~~~~~~~ 232 (339) T protein:vir:94 155 QEISASLVMAKFANSSYLLGVAG-IANYGLMNDPSLPAPV-AATVNWATAAPEDIANDVVAMVGRLISQSGGLITGQERM 232 (339) T ss_pred HHHHHHHHHHHhhceEEeeeecc-cceEEEEeCCCccccc-cCCCCcccCCHHHHHHHHHHHHHHHHHhcCCeeeeccCc Confidence 99999999999999999999655 6899999999997643 346789999999999999999999999999764 677 Q ss_pred eEEeCHHHHHHHhccccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecCC Q lcl|NC_019525. 223 KFTIPESDYTGLAGAASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPV 302 (343) Q Consensus 223 tl~lp~~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~ 302 (343) +|+|||++|.+|+++ +..+.|+++||++|+++ |+|+.++|+. +++.++..+|+.|.++++++++++|| T Consensus 233 ~L~LP~~~~~~L~~~---n~~~~Tvl~~lk~n~pn------l~i~~~~el~---~a~g~~~~~~~~~~~~~~~~~~~~p~ 300 (339) T protein:vir:94 233 VMALAPSALNNVNRT---NNFGLSAGAKIAQTYPN------IQFVAVPEFD---TASGRLVQLWVPEVNGQPTGEVAFAE 300 (339) T ss_pred EEEecHHHHHhcccC---CcCCccHHHHHHHhcCC------cEEEEccccc---cCCCceEEEEEEeccCCcceEEEcch Confidence 999999999999865 34568999999999876 6899988764 45556778889999999999999999 Q ss_pred chhhcchhhcCCceEEeceeeeeccEEEEcCceEEeeecC Q lcl|NC_019525. 
303 DYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIP 342 (343) Q Consensus 303 ~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~ 342 (343) ++++|++ |++++.|++||.+|+|||+||||.+++|+|== T Consensus 301 ~~~~lpv-q~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 301 KLRSHSI-ERYSTTTRQKHSGATFGAVIYQPWAVTQELGV 339 (339) T ss_pred hhhcccc-EEcCceEEecceeeeeeEEEEccceeeeeecC Confidence 9999976 55679999999999999999999999999844 No 8 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=100.00 E-value=1.5e-67 Score=386.95 Aligned_cols=312 Identities=13% Similarity=0.089 Sum_probs=249.9 Q ss_pred CCce-eeecCCchHHHHHH----H-HhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCce Q lcl|NC_019525. 1 MKKF-VIRNSKGEKILLNA----Q-EAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWS 74 (343) Q Consensus 1 ~~~~-~~~~~~~~~~~~~a----~-~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~ 74 (343) ..|+ |+.+..-.++..+. + ++..++.++...+-.--+|+. ++||+++|+.+++++.+.+++|+++. |+|. T Consensus 10 l~~~gi~~~~~~~~~~~~~~~~a~da~d~~~~~~t~~~~g~~~~l~---~~i~p~~~~~~~~~~~~~~l~~v~t~-g~W~ 85 (336) T protein:vir:78 10 LARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLT---TYVDPSVIDILVAPMKAAELVGESKK-GDWT 85 (336) T ss_pred HhccCeecchhhhhhhHHHHHHHHhhhhhccccccCCCcchHHHHH---HhcccceeeehhhhhhhhhhcccccC-CCcc Confidence 2222 22322212233221 1 222333333333333344664 69999999999999999999999885 7897 Q ss_pred e-EEE--eeeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHH Q lcl|NC_019525. 75 T-MLT--TYRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRK 151 (343) Q Consensus 75 ~-~~~--~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar 151 (343) . 
+++ .++.+|+|++++ |++ |+|++|+..++.+.+++.++++|+|+++|+++|+++| ++|+++|+.+|| T Consensus 86 ~~~~~~~~~e~~G~a~~yg-------d~~-D~P~vd~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g-~~l~~~Ka~aA~ 156 (336) T protein:vir:78 86 TLVAAFITAEPTTTVATYG-------DYS-SDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGR-VDLASELNYSSA 156 (336) T ss_pred ccEEEEeeeecceeeEEee-------ccc-CCCeeecceeeEEEEEEEEEeeeeecHHHHHHHHHhC-CCcHHHHHHHHH Confidence 4 433 468899999874 555 5599999999999999999999999999999999997 799999999999 Q ss_pred HHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCce---ecCCeEEeCH Q lcl|NC_019525. 152 KNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYT---AMPNKFTIPE 228 (343) Q Consensus 152 ~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~---~~p~tl~lp~ 228 (343) +++++++|+++|+|+.. +|++||||||++++..+++++.|++||++||++||+.++++++.+|++. +.|+||+||| T Consensus 157 ~ale~~~N~~~~~Gd~~-~~~~GllN~P~l~a~~t~~~~~w~~~T~~~I~~Di~~~~~~l~~qt~g~~~~~~~~tL~Lp~ 235 (336) T protein:vir:78 157 LGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPP 235 (336) T ss_pred HHHHHhhCeEEEEeccc-cceEEEEeCCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEech Confidence 99999999999999655 6899999999998766666777999999999999999999999999875 5788999999 Q ss_pred HHHHHHhccccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEE---EcCcceEEEecCCchh Q lcl|NC_019525. 229 SDYTGLAGAASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALY---NDNEDSLRMDIPVDYT 305 (343) Q Consensus 229 ~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y---~~d~~~v~~~iP~~~~ 305 (343) +++.+|+++ +..+.|+++||++|+++ |+|+.++++ +++|. 
+++.+| ..+++++++++|++|+ T Consensus 236 ~~~~~L~~~---n~~g~tv~~~lk~n~Pn------l~i~t~pel---~~Agg---~~~~~~~~~~~~~~t~~~~~p~~f~ 300 (336) T protein:vir:78 236 TAMSDLSKT---NQYGLSAAAKLKEIFPK------LEFVTIPEY---DTASG---RLVQLWAPRVEGKDTATCGFTEKMR 300 (336) T ss_pred HHHHhccCC---CccCccHHHHHHHhcCc------cEEEEcccc---cccCc---ceEEEEEeeccCCcceeeecchhhh Confidence 999999765 34568999999999886 578887654 44443 455555 4447899999999999 Q ss_pred hcchhhcCCceEEeceeeeeccEEEEcCceEEeeecC Q lcl|NC_019525. 306 STLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIP 342 (343) Q Consensus 306 ~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~ 342 (343) +|+. +.+++.|++||.+|+|||.||||.+++|+|== T Consensus 301 ~lpv-q~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:78 301 AHSI-ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred ccce-eecCceeEeccccceeeeeeeccchheeeccC Confidence 9976 56679999999999999999999999999844 No 9 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=100.00 E-value=1.1e-66 Score=382.24 Aligned_cols=315 Identities=12% Similarity=0.085 Sum_probs=257.8 Q ss_pred CCce-eeecCCc-----hHHHHHHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCce Q lcl|NC_019525. 1 MKKF-VIRNSKG-----EKILLNAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWS 74 (343) Q Consensus 1 ~~~~-~~~~~~~-----~~~~~~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~ 74 (343) ..|+ |+.++.- |+..+.-.++.+++.++...+-.--+|+. ++||+++|+.+++++.+..++|+++ .|+|. 
T Consensus 10 l~~~gi~~~~~~~~~~~~~~~~~~da~d~~~~~~~~~~~~i~~~l~---~~i~p~~~~~~~~p~~a~~l~pv~t-~g~W~ 85 (336) T protein:vir:10 10 LARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLT---TYVDPAVIDILVAPMKAAELVGESK-KGDWT 85 (336) T ss_pred HhhcCeeecchhhhhhhhHHHhhhhhhhccCccccCCCchhHHHHH---hhcccceeeehhhhhhhhhhccccc-cCCcc Confidence 2333 2222221 22222333556677777666666666775 7999999999999999999999987 67897 Q ss_pred e-EE--EeeeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHH Q lcl|NC_019525. 75 T-ML--TTYRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRK 151 (343) Q Consensus 75 ~-~~--~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar 151 (343) . ++ ..++.+|+|+.++ +++ |+|++|++..+.+.+++.++.+|+|+++|+++|+++| ++|.++|+.+|| T Consensus 86 ~~~~~~~~~e~~G~a~~yg-------d~~-D~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g-~~l~~~Ka~aA~ 156 (336) T protein:vir:10 86 TLVAAFITAEPTTKVATYG-------DYS-SDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGR-VDLASELNYSSA 156 (336) T ss_pred ceeEEEeeeeceeeEEEee-------ccC-CCceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhC-CCcHHHHHHHHH Confidence 4 33 3457789998874 555 5699999999999999999999999999999999986 899999999999 Q ss_pred HHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCce---ecCCeEEeCH Q lcl|NC_019525. 152 KNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYT---AMPNKFTIPE 228 (343) Q Consensus 152 ~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~---~~p~tl~lp~ 228 (343) +++++++|+++|+|+.. .+++||||||++++..+++++.|.++|++||++||+.++++++.+|++. 
+.|+||+||| T Consensus 157 ~ale~~~N~i~~~Gd~~-~~~yGllN~P~l~a~~t~~t~~~~~~t~eei~~Di~~~~~~l~~qs~G~i~~~~~~tL~LP~ 235 (336) T protein:vir:10 157 LGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPP 235 (336) T ss_pred HHHHHhhCcEEEEeccc-cceEEEEeCCCCccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeecccCcceEEecH Confidence 99999999999999655 5899999999998766666777899999999999999999999999986 7899999999 Q ss_pred HHHHHHhccccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecCCchhhcc Q lcl|NC_019525. 229 SDYTGLAGAASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPVDYTSTL 308 (343) Q Consensus 229 ~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~~~l~ 308 (343) +++.+|+++ ++ .+.|+++||++|+++ |+|+.+.++. ++|.++..+|+.+..+.+.+++++|++|++|+ T Consensus 236 ~~~~~Ls~~--n~-~g~Tvl~~lk~n~Pn------l~i~t~pEl~---~a~G~~~~l~~~~~~~~~t~~~~~p~~~~~l~ 303 (336) T protein:vir:10 236 TAMSDLSKT--NQ-YGLAAAAKLKDIFPK------LEFVTIPEYD---TASGRLVQLWAPRVEGKDTATCGFTEKMRAHS 303 (336) T ss_pred HHHHhccCC--Cc-cCccHHHHHHHhcCc------cEEEEccccc---cCCCceEEEEEEecCCCcceeeecchhhhccc Confidence 999999754 33 468999999999887 5788877654 34434445555556778899999999999998 Q ss_pred hhhcCCceEEeceeeeeccEEEEcCceEEeeecC Q lcl|NC_019525. 309 ANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIP 342 (343) Q Consensus 309 p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~ 342 (343) . 
+.+++.|++||.+|+|||.||||.+++|+|== T Consensus 304 v-q~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:10 304 I-ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred e-eecCceeEeccccceeeeeeeccchheeeecC Confidence 7 55579999999999999999999999999844 No 10 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=100.00 E-value=1.5e-66 Score=381.40 Aligned_cols=315 Identities=12% Similarity=0.085 Sum_probs=253.7 Q ss_pred CCce-eeecCCc-----hHHHHHHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCce Q lcl|NC_019525. 1 MKKF-VIRNSKG-----EKILLNAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWS 74 (343) Q Consensus 1 ~~~~-~~~~~~~-----~~~~~~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~ 74 (343) ..|+ |+.++.- |...+.-.++.+++......+-.-..|+. ++||+++|+.+++++.+..++|+++ .|+|. T Consensus 10 l~~~gi~~~~~~~~~~~~~~~~~~da~d~~~~~~~~~~~~~~~~l~---~~i~p~~~~~~~~~~~~~~l~pv~t-~g~W~ 85 (336) T protein:vir:36 10 LARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLT---TYVDPSVIDILVAPMKAAELVGESK-KGDWT 85 (336) T ss_pred HhhcCeeecchhhhhhhHHHHhhhhhhhccCccccCCCcchHHHHH---HhhccceEeeecchhhhhhhccccc-cCCcc Confidence 2222 2222211 22222223344555555554444455664 6999999999999999999999987 67897 Q ss_pred e-EE--EeeeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHH Q lcl|NC_019525. 75 T-ML--TTYRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRK 151 (343) Q Consensus 75 ~-~~--~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar 151 (343) . 
++ ..++.+|+|+.++ +++ |+|++|++..+.+.+++.++.+|+|+++|+++|+++| ++|.++|+.+|| T Consensus 86 ~~~~~~~~~e~~G~a~~yg-------d~~-D~P~~d~~~~~~~~~v~~~~~g~~yg~~E~~~Aa~~~-~~l~~~Ka~aA~ 156 (336) T protein:vir:36 86 TLVAAFITAEPTTKVATYG-------DYS-SDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGR-VDLASELNYSSA 156 (336) T ss_pred ceeEEEeeeeceeeEEEee-------ccC-CCceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhC-CCcHHHHHHHHH Confidence 4 33 3457789998874 555 5699999999999999999999999999999999886 899999999999 Q ss_pred HHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCce---ecCCeEEeCH Q lcl|NC_019525. 152 KNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYT---AMPNKFTIPE 228 (343) Q Consensus 152 ~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~---~~p~tl~lp~ 228 (343) +++++++|+++|+|+.. .+++||||||++++..++.++.|.++|++||++||+.++++++.+|++. +.|+||+||| T Consensus 157 ~ale~~~N~i~~~Gd~~-~~~yGllNdP~l~a~~t~~t~~~~~~t~~ei~~Di~~~~~~l~~qt~G~i~~~~~~tL~LP~ 235 (336) T protein:vir:36 157 LGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPP 235 (336) T ss_pred HHHHHhhCcEEEEeccc-cceEEEEecCCCccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccEEEech Confidence 99999999999999655 5899999999998766666777899999999999999999999999865 7899999999 Q ss_pred HHHHHHhccccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecCCchhhcc Q lcl|NC_019525. 229 SDYTGLAGAASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPVDYTSTL 308 (343) Q Consensus 229 ~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~~~l~ 308 (343) +++.+|+++ ++ .+.|+++||++|+++ |+|+.+.++. 
++|.++..+|+.+..+.+.+++++|+++++|+ T Consensus 236 ~~~~~Ls~~--n~-~g~Tvl~~lk~n~Pn------l~i~t~pEl~---~a~g~~~~l~~~~~~~~~t~~~~~p~~~~~l~ 303 (336) T protein:vir:36 236 TAMSDLSKT--NQ-YGLAAAAKLKDIFPK------LEFVTIPEYD---TASGRLVQLWAPRVEGKDTATCGFTEKMRAHS 303 (336) T ss_pred HHHHhccCC--Cc-cCccHHHHHHHhcCc------cEEEEccccc---cCCCceEEEEEEecCCCcceeeecchhhhccc Confidence 999999754 33 468999999999887 5688877654 34334445555556778899999999999998 Q ss_pred hhhcCCceEEeceeeeeccEEEEcCceEEeeecC Q lcl|NC_019525. 309 ANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIP 342 (343) Q Consensus 309 p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~ 342 (343) . +.+++.|++||.+|+|||.||||.+++|+|== T Consensus 304 v-q~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:36 304 I-ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred e-eecCceeEeccccceeeeeeeccchheeeecC Confidence 7 55579999999999999999999999999844 No 11 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=100.00 E-value=6.4e-66 Score=378.01 Aligned_cols=312 Identities=13% Similarity=0.099 Sum_probs=248.5 Q ss_pred CCce-eeecCCchHHHHHH----H-HhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCce Q lcl|NC_019525. 1 MKKF-VIRNSKGEKILLNA----Q-EAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWS 74 (343) Q Consensus 1 ~~~~-~~~~~~~~~~~~~a----~-~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~ 74 (343) ..|+ |+.+..-.++..+. + ++..++.++...+-.--+|+. ++||+++|+.+++++++.+++||++. |+|. 
T Consensus 10 l~~~gi~~~~~~~~~~~~~~~~a~da~d~~~~~~t~~~~g~~~~l~---~~i~p~~~~~~~~~~~~~~l~~v~t~-g~w~ 85 (336) T protein:vir:10 10 LARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLT---TYVDPSVIDILVAPMKAAELVGESKK-GDWT 85 (336) T ss_pred HhccCeecchhhhhhhHHHHHHHHhhhhhccccccCCCcchHHHHH---hhcCcceeeeeechhchhhhcccccC-CCcc Confidence 2222 22322212233221 1 222333333333333344664 69999999999999999999999985 6786 Q ss_pred eEEE---eeeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHH Q lcl|NC_019525. 75 TMLT---TYRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRK 151 (343) Q Consensus 75 ~~~~---~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar 151 (343) ..+. ..+.+|.|..+ |++ +|+|++|+..++.+.+++.++.+|+|+++|+++|+++| ++|+++|+.+|| T Consensus 86 ~~~~~~~~~e~~G~a~~y-------gd~-~d~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g-~~l~~~Ka~aA~ 156 (336) T protein:vir:10 86 TLVAAFITAEPTTKVATY-------GDY-SSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGR-VDLASELNYSSA 156 (336) T ss_pred eeeEEEEeeeeeeeEEEc-------ccc-CCCcceeeeeeeeeeeEEEEEEEEeeCHHHHHHHHHhC-CCcHHHHHHHHH Confidence 4433 34677777654 344 58999999999999999999999999999999999997 799999999999 Q ss_pred HHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCce---ecCCeEEeCH Q lcl|NC_019525. 152 KNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYT---AMPNKFTIPE 228 (343) Q Consensus 152 ~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~---~~p~tl~lp~ 228 (343) +++++++|+++|+|+.+ +|++||||||++++..+++++.|++||++||++||+.++++++.+|++. 
+.|++|+||| T Consensus 157 ~ale~~~N~~~~~Gd~~-~~~~GllN~P~l~a~~t~~~~~w~~~T~~eI~~Di~~~~~~l~~qt~g~i~~~~~~tL~Lp~ 235 (336) T protein:vir:10 157 LGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPP 235 (336) T ss_pred HHHHHhhCeEEEEeecc-cceEEEeecCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEech Confidence 99999999999999655 5899999999998766666777999999999999999999999999986 5788999999 Q ss_pred HHHHHHhccccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcC---cceEEEecCCchh Q lcl|NC_019525. 229 SDYTGLAGAASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDN---EDSLRMDIPVDYT 305 (343) Q Consensus 229 ~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d---~~~v~~~iP~~~~ 305 (343) +++.+|+++ +..+.|+++||++|+++ |+|+.++++ +++| ++++.+|.++ ++++++++|++|+ T Consensus 236 ~~~~~L~~~---n~~g~tv~~~lk~n~Pn------l~i~t~pel---~~Ag---g~~~~~~~~~~~~~~t~~~~~P~~f~ 300 (336) T protein:vir:10 236 TAMSDLSKT---NQYGLSAAAKLKEIFPK------LEFVTIPEY---DTAS---GRLVQLWAPRVEGKDTATCGFTEKMR 300 (336) T ss_pred HHHHhccCC---CccCccHHHHHHHhCCc------cEEEEcccc---cccC---CceEEEEEecccCCcceeeecChhhh Confidence 999999765 34568999999999886 578887654 4443 2566666555 7899999999999 Q ss_pred hcchhhcCCceEEeceeeeeccEEEEcCceEEeeecC Q lcl|NC_019525. 306 STLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIP 342 (343) Q Consensus 306 ~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~ 342 (343) +|+. 
|.+++.|++||.+|+|||+||||.+++|+|== T Consensus 301 ~lpv-q~~~~~~~v~~~~rt~Gv~i~rP~ai~~~~GI 336 (336) T protein:vir:10 301 AHSI-ERYSSYFRQKKSAGTWGAVIFRPFAVAQMLGV 336 (336) T ss_pred ccce-eecCceeEeccccceeeeeeeccchheeeccC Confidence 9976 56679999999999999999999999999844 No 12 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=100.00 E-value=5.8e-66 Score=378.23 Aligned_cols=322 Identities=11% Similarity=0.046 Sum_probs=255.7 Q ss_pred CCc-------------------------------e-eeecCCchHHHHHHHHhhhccchhhhhhhhh---hh------hh Q lcl|NC_019525. 1 MKK-------------------------------F-VIRNSKGEKILLNAQEAKIAGVIQRLCNDLG---FE------ID 39 (343) Q Consensus 1 ~~~-------------------------------~-~~~~~~~~~~~~~a~~~~~~~~~~~~~~d~~---~~------f~ 39 (343) ||+ + |..|+..-+.-+++.. +.......+.+|.. .. +. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~-~~~~~~~~~amDa~~~~~~t~~~~g~p 79 (382) T protein:vir:96 1 MSHISKTHSRLAGRHAKPFDLKNVTHEAVAALGRIGLVFDHAVVQDQIKALA-KAGAFRSGSAMDSNFTAPVTTPSIPTP 79 (382) T ss_pred CCCcceeeeecCCccccchhhhcccHHHHHHHhccccccCcccchhHhhhhh-hhhhhhhhcccccccCCccccCCccHH Confidence 221 1 1111111111111100 00000011222211 11 57 Q ss_pred HHHHHHhhhhhhhcccccccchhhcceecCCCCce-eEEE--eeeccchhhhhhcccccCCccccccccccccccceeee Q lcl|NC_019525. 40 VTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWS-TMLT--TYRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIK 116 (343) Q Consensus 40 ~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~-~~~~--~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~ 116 (343) +..|+++++.+|+..++++.+.+++|+++. 
|+|+ ++++ .++.+|+|++++ |++ |+|++|++....+.+ T Consensus 80 ~~~l~~~~p~~~~~~~~p~~~~~l~pv~t~-g~W~~~t~ty~~~e~~G~A~~yg-------d~~-D~Pl~d~~~~~~~r~ 150 (382) T protein:vir:96 80 IQFLQTWLPGFVKVMTAARKIDEIIGIDTV-GSWEDQEIVQGIVEPAGTAVEYG-------DHT-NIPLTSWNANFERRT 150 (382) T ss_pred HHHHhhhhhhhhhhhhhhhhhhhhcccccc-CCccceEEEEeeeecccceEEee-------ccc-CCCccccccceeEEE Confidence 778999999999999999999999999885 7897 4444 467899999874 555 559999999999999 Q ss_pred eEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeecc--ccceeeeeecCCceeecccCCccccc Q lcl|NC_019525. 117 TYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKD--NANVKGLLTQTGNVVNNTFLTKSIKS 194 (343) Q Consensus 117 v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~--~~g~~GLlN~p~v~~~~a~~~~~w~~ 194 (343) ++.++++|+|+..|+.+|+++| ++|.++|+.+||+++++++|+++|+|+.+ ..|++||||||++++..++.++.|++ T Consensus 151 v~~~~~g~~yg~lE~~rAa~~~-~~l~~~Ka~aA~~ale~~~N~i~f~G~~~g~~~~~yGllNdP~l~a~~t~a~~~Wa~ 229 (382) T protein:vir:96 151 IVRGELGLLVGTLEEGRASAIR-LNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPPFQTPPSQGWAT 229 (382) T ss_pred EEEEEEeeeecHHHHHHHHhhC-CCcHHHHHHHHHHHHHHhhceEEEEeeecCcCcceEEEEeCCCcccccccCCCCccc Confidence 9999999999988888888786 79999999999999999999999999643 35799999999999877766788999 Q ss_pred CCHHHHHHHHHHHHHHHHhcCCceecCC----eEEeCHHHHHHHhccccCCCcchhhhhHHHhhcchhcCCcceEEeech Q lcl|NC_019525. 195 MTPAELKVLCAGIIDVYRQGCDYTAMPN----KFTIPESDYTGLAGAASADFPIKSTKQVLEDTFKEITRNSSFEILPCV 270 (343) Q Consensus 195 kT~~eIl~Din~~l~~v~~~s~~~~~p~----tl~lp~~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~ 270 (343) ||++||++||+.++++++.+|++.|.|+ +|+|||++|.+|+++ + ..+.|+++||++|+++ ++|+.+. 
T Consensus 230 kT~~eI~~Di~~l~~~i~~qt~G~~~~~~~~~~L~LP~~~~~~Ls~~--n-~~g~Tvl~~lk~n~Pn------l~i~t~p 300 (382) T protein:vir:96 230 ADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITMALATSKVDYLSVT--T-PYGISVSDWIEQTYPK------MRIVSAP 300 (382) T ss_pred ccHHHHHHHHHHHHHHHHhccCCeeeecccceEEeechHHHhhcccc--C-ccCccHHHHHHHhcCC------cEEEEcc Confidence 9999999999999999999999988765 799999999999754 3 3468999999999886 5788999 Q ss_pred hhhhcccccCCCccEEEEEEcCcc---eEEEecCCchhhc-------chhhcCCceEEeceeeeeccEEEEcCceEEeee Q lcl|NC_019525. 271 YADKITAQVPAVAKRYALYNDNED---SLRMDIPVDYTST-------LANSVNNFQFQNAAYGQFTGVLAYRPKELLYLD 340 (343) Q Consensus 271 ~~~~~~~~g~gg~drmv~Y~~d~~---~v~~~iP~~~~~l-------~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D 340 (343) ++++++..|.|++++|++|.++.+ ..++++|++|+|. .|.++..+.|++||.+|+|||.||||.+++|+| T Consensus 301 eL~~a~~~g~g~~~~~~~~~~e~~~~~~~s~~~p~~f~q~~p~~~~~l~ve~~~~~~~~~~s~~t~Gv~i~~P~ai~~~~ 380 (382) T protein:vir:96 301 ELSGVQMQGKTPEDALVLFVEEVDASVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYL 380 (382) T ss_pred ccccccCCCccceeEEEEecchhhhhcccccccCcceeccccceeeeccceeecceeEeccccceeeeEEEcchhhhhcc Confidence 998888888889999999999876 5778889998764 255667899999999999999999999999998 Q ss_pred cC Q lcl|NC_019525. 341 IP 342 (343) Q Consensus 341 ~~ 342 (343) == T Consensus 381 GI 382 (382) T protein:vir:96 381 GI 382 (382) T ss_pred CC Confidence 44 No 13 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=100.00 E-value=3.3e-65 Score=374.12 Aligned_cols=312 Identities=10% Similarity=0.077 Sum_probs=242.1 Q ss_pred CCc---------------------------------e-eeecCCchHHHHHHHHhhhccchhhhhhhhhh--------h- Q lcl|NC_019525. 1 MKK---------------------------------F-VIRNSKGEKILLNAQEAKIAGVIQRLCNDLGF--------E- 37 (343) Q Consensus 1 ~~~---------------------------------~-~~~~~~~~~~~~~a~~~~~~~~~~~~~~d~~~--------~- 37 (343) ||+ | |+.+++ .+.++++... + +..+|.+. . 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~gi~~~~~-~~~~~~~~~~----a--md~~~~~~~~~~~~~l~~ 73 (379) T protein:vir:10 1 MPQISKIHSSLNARQMTQMVMDSADVTLDNLKHLESYGIHLNGR-KNKLFELMQF----A--MDSNDIGPIPTPLSPLSP 73 (379) T ss_pred CCCcceeeeecCccccchhhhccccccHHHHHHHHhcCccccch-hhhhhhhhhh----h--hccccccccccccCcccc Confidence 222 2 112222 2222222111 0 11111110 0 Q ss_pred ----hhHHHHHHhhhhhhhcccccccchhhcceecCCCCce-eEEE--eeeccchhhhhhcccccCCccccccccccccc Q lcl|NC_019525. 38 ----IDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWS-TMLT--TYRSFSLAEDFATGIIDTGNSNGKLAAVDTGV 110 (343) Q Consensus 38 ----f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~-~~~~--~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~ 110 (343) =....|+.--+.+++.-...+++.++||+++ .|+|. +++. .++.+|.|+.++ +++ |+|.++++. T Consensus 74 ~~~~g~~~~l~~~~p~~i~~~tap~~a~~l~pv~t-~g~W~~~~~~~~v~e~~G~A~~yg-------d~~-d~pl~d~~~ 144 (379) T protein:vir:10 74 VSIPGLIQFLQNWLPGHVRILTAVREADEFLGLST-VGQWDDEQIVQRVLEGLGTAQPYT-------DGG-NMALMSWTP 144 (379) T ss_pred ccccchHHHHHhhcchHHHHHhhhhhhhhhccccc-CCCceeeeEEEeeeeeeeeeEEec-------ccc-CCCeeeeee Confidence 0023445555889999999999999999988 46786 4433 467889999875 555 569999999 Q ss_pred cceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeee-ccccceeeeeecCCcee-----e Q lcl|NC_019525. 
111 DAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGM-KDNANVKGLLTQTGNVV-----N 184 (343) Q Consensus 111 ~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~-~~~~g~~GLlN~p~v~~-----~ 184 (343) +..+.+++.++.+|+|+.+|+++|+++| ++|+++|+.+||+++++++|+++|+|+ ++..+++||||||++++ + T Consensus 145 ~~~~r~v~~~~~g~~yg~~El~~Aa~~g-~~l~~~Ka~aA~~ale~~~N~i~f~G~~d~~~~~yGllNdP~l~a~~t~at 223 (379) T protein:vir:10 145 TFETRTVVRFEAGLQVAPLEEARSSRVQ-VSSADEKRAMVGEALEVQRNRVAFYGYNDGSGRTFGFLNDPNLPAYVAVPN 223 (379) T ss_pred eeeeeeeEEEEEEEeecHHHHHHHHHhC-CChHHHHHHHHHHHHHHhhceEEEEeecCCCcceEEEEeCCCCcccccccC Confidence 9999999999999999999999999997 799999999999999999999999996 34568999999999863 3 Q ss_pred cccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCC----eEEeCHHHHHHHhccccCCCcchhhhhHHHhhcchhcC Q lcl|NC_019525. 185 NTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPN----KFTIPESDYTGLAGAASADFPIKSTKQVLEDTFKEITR 260 (343) Q Consensus 185 ~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~----tl~lp~~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~ 260 (343) ++++++.|++||++||++||+.+++++|.+|++.|.|+ +|+|||+++.+|+++ + ..+.|+++||++|+++ T Consensus 224 g~~~~t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~~~~~~~tL~LP~~~~~~L~~~--n-~~g~Tvl~~lk~n~Pn--- 297 (379) T protein:vir:10 224 GAGGSPLWAQKTTLEIIADLRNGLTALQVQSMGRIKSNKTPITIGIPNAYENYITTP--T-ELGYSVAQYMRESYPN--- 297 (379) T ss_pred CcccccccccCCHHHHHHHHHHHHHHHHHhhCCeecccccceeEEecHHHHHhhccc--c-ccCccHHHHHHHhcCC--- Confidence 44566779999999999999999999999999987665 899999999999865 3 3468999999999886 Q ss_pred CcceEEeechhhhhcccccCCCccEEEEEEc-------CcceEEEecCCchhhcchhhcCCceEEeceeeeeccEEEEcC Q lcl|NC_019525. 261 NSSFEILPCVYADKITAQVPAVAKRYALYND-------NEDSLRMDIPVDYTSTLANSVNNFQFQNAAYGQFTGVLAYRP 333 (343) Q Consensus 261 g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~-------d~~~v~~~iP~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP 333 (343) |+|+.+.+++ ++|.|+++++++++. 
+++.+.+++||++++|++ +.+++.|++||.+|+|||.|||| T Consensus 298 ---l~i~t~pEL~---~aggg~~~~~~~~~~~~~~~t~~~~~~~~~~p~k~~~l~v-e~~~~~~~~~~~~rt~Gv~ir~P 370 (379) T protein:vir:10 298 ---VTFVSAPELN---DANGGSSAIYYYADAVENNGTDDGRTWLQVVPTKMFTLGV-EKKIKGYAEGYTNATAGAMLKRP 370 (379) T ss_pred ---cEEEEccccc---ccCCCccEEEEEeeccCCCccCCcceEEEecchhhhhccc-eecCceeEeccccceeeeeeecc Confidence 5788877654 455555565555543 455788999999999976 56679999999999999999999 Q ss_pred ceEEeeecC Q lcl|NC_019525. 334 KELLYLDIP 342 (343) Q Consensus 334 ~a~~Y~D~~ 342 (343) .+++|+|=. T Consensus 371 ~Ai~~~~G~ 379 (379) T protein:vir:10 371 FATYRQTGA 379 (379) T ss_pred hhhheecCC Confidence 999999999 No 14 >protein:vir:99576 Length: 388 # NCBI annotation: hypothetical protein # Family: family:all:1653 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Probab=100.00 E-value=6.8e-65 Score=372.38 Aligned_cols=318 Identities=11% Similarity=0.063 Sum_probs=247.6 Q ss_pred CCceeeecCCchHHHHHHHHhhhccchhhhhhhh---------hhhhhHHHHHHhhhhhhhcccccccchhhcceecCCC Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAKIAGVIQRLCNDL---------GFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEG 71 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~d~---------~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~ 71 (343) +.++=+--++ .+.+...... .+....++++|. +.-+.+..|+++++.+|+..++++++.++||+++. | T Consensus 38 l~~~g~~~~~-~~~~~~~~~~-~~~~~~~~a~da~~~~~~t~~~~gip~~~~~~~~p~~~~~~~~p~~~~~l~pv~t~-g 114 (388) T protein:vir:99 38 LKKFGLVFDH-ATVKRQIELL-HEGGVATQAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTV-G 114 (388) T ss_pred hhhcceeccC-ccchhhhhhh-hhhhhhhcccCcccccccccCcccHHHHHhhhhccceeeeeechhhhhhhcccccc-C Confidence 1111111111 1111100000 000111122221 22277889999999999999999999999999985 7 Q ss_pred Ccee-EE--EeeeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHH Q lcl|NC_019525. 
72 AWST-ML--TTYRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEK 148 (343) Q Consensus 72 ~w~~-~~--~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~ 148 (343) +|.. ++ ..++.+|.|++++ +++ |+|++|++.++.+.+++.++++|+|+.+|+++|+++| ++|+++|+. T Consensus 115 ~W~~~~~~f~v~e~~G~A~~yg-------d~~-D~Pl~d~~~~~~~r~v~~~~~g~~yg~~El~~A~~~g-~~l~~~Ka~ 185 (388) T protein:vir:99 115 SWEDQEIVQGIVEPAGTAMEYG-------DLT-NIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMR-INSAEVKRQ 185 (388) T ss_pred CccceeEEEeeeecceeEEEee-------ccc-CCCceeccceeeeeeEEEEEeeeeecHHHHHHHHhhC-CCcHHHHHH Confidence 8974 33 3457789999874 555 5699999999999999999999999999999999986 799999999 Q ss_pred HHHHHHHhhhhheEeeeecc--ccceeeeeecCCcee----ecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCC Q lcl|NC_019525. 149 SRKKNWDLGIQEIAFVGMKD--NANVKGLLTQTGNVV----NNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPN 222 (343) Q Consensus 149 aar~a~~~~~n~v~~~G~~~--~~g~~GLlN~p~v~~----~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~ 222 (343) +||+++++++|+++|+|++. ..+++||||||++++ ++.+++++|++||++||++||+.++++++.+|++.|.|+ T Consensus 186 AA~~ale~~~N~i~f~G~~g~~~~~~yGllNdP~l~a~v~at~~~~~~~Wa~kT~~eI~~Di~~~~~~i~~qs~g~~~~~ 265 (388) T protein:vir:99 186 GAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPE 265 (388) T ss_pred HHHHHHHhhhceEEEEeecCCCccceEEEeeCCCcccccccccCCcCcccccCCHHHHHHHHHHHHHHHHHhcCCeeeec Confidence 99999999999999999753 237999999999863 344567789999999999999999999999999998776 Q ss_pred ----eEEeCHHHHHHHhccccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCc----- Q lcl|NC_019525. 223 ----KFTIPESDYTGLAGAASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNE----- 293 (343) Q Consensus 223 ----tl~lp~~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~----- 293 (343) +|+|||++|.+|+.+ +. .+.|+++||++|+++ |+|+.+.+++++.. .||.+.++.|.++. 
T Consensus 266 ~~~~tL~LP~~~~~~Ls~~--n~-~g~Tvl~~lk~n~Pn------l~i~t~pEl~~a~~--tgg~~~~~~~~~~~~~~~~ 334 (388) T protein:vir:99 266 DVDITLVLPMNKVDMLSVV--TD-LGISVRDWLKQTYPR------VRVMSAPELQGGNP--DDGKDIAYMFLDSVDTAVD 334 (388) T ss_pred ccceEEEechHHHHhcccc--Cc-CCccHHHHHHHhcCC------cEEEEecccccccc--cCCceeEEEEecccccccc Confidence 799999999999754 33 468999999999886 57888876665433 45667888887654 Q ss_pred ------ceEEEecCCchhhcchhhcCCceEEeceeeeeccEEEEcCceEEeeecC Q lcl|NC_019525. 294 ------DSLRMDIPVDYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIP 342 (343) Q Consensus 294 ------~~v~~~iP~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~ 342 (343) +...+++||+++++++ +++++.|++||.+|+|||.||||.+++|+|== T Consensus 335 ~~~~~~~t~~~~~p~~~~~l~v-q~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~GI 388 (388) T protein:vir:99 335 GSTDGGDTWAQLVQSKFVTLGV-EKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) T ss_pred cCccCcceeEEecccccccccc-eecCceeEeccccceeeeEEeccchhheeccC Confidence 3466678999999976 55579999999999999999999999999844 No 15 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=98.56 E-value=1.5e-08 Score=63.45 Aligned_cols=296 Identities=7% Similarity=-0.086 Sum_probs=156.7 Q ss_pred CCceeeecCCchHHHHHHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEee Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTTY 80 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~~ 80 (343) |.+.+.+ +.....-+++..+...++ . ..+++...+.-.-+.+.++....+ ....+-.. 
T Consensus 1 m~~~~~~------------------a~~~~~t~~~g~~i~~~~--~-~~ii~~~~~~s~l~~~~~~~~~~~-~~~~~p~~ 58 (330) T protein:vir:77 1 MAGSTVP------------------STQVALTGDFSAFLTPEQ--S-QDYFAEIEKTSIVQRIARKVPMGP-TGISIPHW 58 (330) T ss_pred Ccccccc------------------hhhccccCCCcceechhH--H-HHHHHHHHhccchhhhcceeeccC-CceEEEEE Confidence 2222211 111111222323443332 1 223443334444444444432211 12233333 Q ss_pred eccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhh Q lcl|NC_019525. 81 RSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQE 160 (343) Q Consensus 81 ~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~ 160 (343) +..+-|.+++.+ ..+|.-+..+++.....++++.-+.+|.+=|+.+ . .++.+.-.+...+++...+|+ T Consensus 59 ~~~~~a~~v~Eg--------~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds---~-~~~~~~i~~~l~~ai~~~~~~ 126 (330) T protein:vir:77 59 TGAVSASWTGEA--------ERKPITKGSFGKQELEPVKITTIFAESAEVVRLN---P-LNYLNTMRTKIAEAIALKFDA 126 (330) T ss_pred cCCcceeEecCC--------CccccccceeeEEEEeEEEEEEeehhhHHHHhcc---h-HHHHHHHHHHHHHHHHHHHHH Confidence 333445555432 3678888889999999999999888888655543 2 478888888999999999999 Q ss_pred eEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccC Q lcl|NC_019525. 
161 IAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASA 240 (343) Q Consensus 161 v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s 240 (343) ..++|+....+..|++|..............-.+.+...+++|+.+++..+...- ..+..++|.++.+..|..-+ T Consensus 127 ~~l~G~g~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~---~~~~~~vmn~~~~~~l~~lk-- 201 (330) T protein:vir:77 127 AAIHGIDKPSAFKGYLAETTKVVSLADTNLTTASGPQGNAYLAVNNALSLLVNSG---KKWTGTLLDNVTEPILNTAV-- 201 (330) T ss_pred HhhcccCCCCccccccccccccceeecccccccccccchhHHHHHHHHHhhhhcC---CCccEEEEcHHHHHHHHHHh-- Confidence 9999976666778999886532222211222234455667788888888775542 24568999999999996432 Q ss_pred CCcchhhhh-HHHhhcch-----hcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecCCchhhc------- Q lcl|NC_019525. 241 DFPIKSTKQ-VLEDTFKE-----ITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPVDYTST------- 307 (343) Q Consensus 241 ~~~~~tl~~-~l~~n~~~-----~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~~~l------- 307 (343) ++.++.+.. -++...+. .-.|.|+-+.. ..-....+++..+++-+-+.=++...-.+.+... T Consensus 202 d~~G~~l~~~~~~~~~~~~~~~~~l~G~PV~~~~-----~~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~ 276 (330) T protein:vir:77 202 DGNGRPLFVESTYTEQVGAIREGRILGRPTYVAD-----NVVNGTVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDF 276 (330) T ss_pred ccCCceeecCccccccccccCCceecceeeEEec-----cccCCCCCCccEEEEEecceEEEEEecCcEEEEeecceeee Confidence 333333321 11111111 11255543322 2222222334444443333222222112222111 Q ss_pred -------------chhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 308 -------------LANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 308 -------------~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) -..+++ ...+-|+.|+++.. +.|++++.+..-. 
T Consensus 277 ~~~~~~~~~~~~~~~f~~~--~~~~r~~~r~d~~v-~~~~a~~~i~~~~ 322 (330) T protein:vir:77 277 GEEQGGVWVPKLISLWQHN--MVAVRCEAEFAFMV-NDKDAFVKLTDQV 322 (330) T ss_pred cccccccccccccchhhcC--cEEEEEEEEeccEE-ecccceEEEEecc Confidence 012232 23445688888766 6699999987766 No 16 >protein:vir:105778 Length: 358 # NCBI annotation: gp9 # Family: family:all:10995 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224147;genbank:gi:62362222;genbank:GeneID:3342531 Probab=98.30 E-value=2.1e-08 Score=62.69 Aligned_cols=311 Identities=11% Similarity=0.074 Sum_probs=175.4 Q ss_pred eeecCCchHHHHHHHHhh--------------------------hccchhhhhhhhhhhhhHHHHHHhhhhhhhccccc- Q lcl|NC_019525. 5 VIRNSKGEKILLNAQEAK--------------------------IAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFE- 57 (343) Q Consensus 5 ~~~~~~~~~~~~~a~~~~--------------------------~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~- 57 (343) ++-+ | |++-..++... ..+... .++..+ .|...=-..+|.++.+.-.|+ T Consensus 1 ~~f~-K-~~~an~~~~~~qw~~L~~~Rna~n~~~~a~maan~a~~~~~~~-~~NAv~-~v~~D~wr~~D~~~~q~fr~e~ 76 (358) T protein:vir:10 1 MYFS-K-ETLATNSRLGGHWNELWANRNMWNAQHDAMIAANRSNMTPEWL-AVNAVG-GFTRDFWAEIDRQVLQLRDQEV 76 (358) T ss_pred Ceec-h-hhhhhHHHHHHHHHHHHHHHHHhhhhhhhHHhhhHHHhhhhhh-eecccc-cCCHHHHHHHhhhhhhhcccch Confidence 1111 1 22222221111 001110 001101 112222345666666665554 Q ss_pred -cc-chhhcceecCCCCceeEEEeeecc-chhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHH Q lcl|NC_019525. 58 -IS-PADYMPIRVGEGAWSTMLTTYRSF-SLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEA 134 (343) Q Consensus 58 -l~-~~~~~pv~~~~~~w~~~~~~~~~v-g~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A 134 (343) +. .-.|+++.++.+-. +++..|... 
+.+..+.- .=.|.....+-.+...++.+-.||+..|.+++|-..+..+ T Consensus 77 ~~~l~NDLm~ls~sv~Ig-ktv~~y~~~gd~~~~v~~--SmsGQ~~~~lD~~~y~~dGtpiPIfdsg~~f~WR~~~~~~- 152 (358) T protein:vir:10 77 GMEIVNDLIGVQTVLPVG-KTAKLYNVIGDIADDVSV--SIDGQAPFSFDHTEYASDGDPIPVFTAGYGVNWRHAAGLN- 152 (358) T ss_pred hHHHHhhhhhccccccHH-HHHHHHhhhcCCCceEEE--EecccCcccccceeeeccCCEeeeeccCccccccchhhcC- Confidence 33 45677887776533 222222211 21221100 0012333467788899999999999999999887666554 Q ss_pred HHhcCCCcHHHHHHHHHHHHHhhhhheEeeeecc----ccceeeeeecCCceeeccc-----CCcccccCCHHHHHHHHH Q lcl|NC_019525. 135 MKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKD----NANVKGLLTQTGNVVNNTF-----LTKSIKSMTPAELKVLCA 205 (343) Q Consensus 135 ~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~----~~g~~GLlN~p~v~~~~a~-----~~~~w~~kT~~eIl~Din 205 (343) -.| +++..+-+..-.++..+++-+.+++|+.+ ..-.+||-|||++...+-+ .+-+.+++|++++....+ T Consensus 153 -~~g-~d~~~daQ~~~~~kv~~~~vdy~lNG~~~I~v~g~t~~Glrn~~n~~qv~l~~~s~g~NiDlttat~~a~~~~f~ 230 (358) T protein:vir:10 153 -SLG-IDLVLDSQMAKMRKFNQKRVNYYLNGDPNIQVQSYPAQGIKNHRNTKKINLGSGSGGANIDLTTADMTALFAFFG 230 (358) T ss_pred -ccc-cchhHHHHHHHHHHHHHHHHhhhhccCCceeecCcccccccCCcceeEEEeccCCCcceeeeccCCHHHHHHHHH Confidence 355 77777777777888888888999999632 1236899999998632221 123588999999888885 Q ss_pred HHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCCCc-chhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCcc Q lcl|NC_019525. 206 GIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFP-IKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAK 284 (343) Q Consensus 206 ~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~-~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~d 284 (343) ..+-..-...++...-.+++..|+-++-+..+...+.. ..|++..+..-.. .. 
+|.+...|.| + T Consensus 231 ~~l~~~~~~~N~~~~~~~~~vs~ei~~n~~r~Y~~~~~~~gTIl~~vl~~~~----va--~I~~~~~Lsg---------N 295 (358) T protein:vir:10 231 KGAFGTLARANKVAQYDVMWVSPEIWANLAQPYVVNGVVSGNVLNAVLPFAP----VR--EIRQTFALSG---------N 295 (358) T ss_pred HHHHHHHHhhcccceeeEEEEcHHHHhhhhcccccccccchhhHHHhhcccC----cc--cccccccCCC---------c Confidence 55444444455555678999999999999776665421 4567766654221 12 4555443332 6 Q ss_pred EEEEEEcCcceEEEecCCchhhcc---hhhcCCceEEeceeeeeccEEEEcCc----eEEeeecCC Q lcl|NC_019525. 285 RYALYNDNEDSLRMDIPVDYTSTL---ANSVNNFQFQNAAYGQFTGVLAYRPK----ELLYLDIPV 343 (343) Q Consensus 285 rmv~Y~~d~~~v~~~iP~~~~~l~---p~~~~~l~~~v~~~~r~GGv~v~yP~----a~~Y~D~~~ 343 (343) -+++|.+..+++.-.+-||+--.| |.+..+..|.| ++ --|++|+.-. .++|.---| T Consensus 296 eii~~~~~~~vi~plvG~~~gt~~~pR~~p~ddY~f~v--ws-A~glqik~D~~Gks~Vv~~~~~~ 358 (358) T protein:vir:10 296 EFIAYVRRQDIISPLVGMAVGVVPLPRPLPNVNYNFQI--MS-AEGLQITADDQGLSGVVYGANLV 358 (358) T ss_pred cEEEEEeCCceeeeeecceeeeecCCCCCCCcchhhhh--hh-hhceeeeeccccceeeEeecccC Confidence 899999999999988877765543 22223333433 22 3355555432 333433333 No 17 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=98.14 E-value=4.4e-07 Score=55.46 Aligned_cols=305 Identities=8% Similarity=-0.031 Sum_probs=146.2 Q ss_pred CCceeeecCCchHHHHHHHHhhhcc-----ch---hh---hhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecC Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAKIAG-----VI---QR---LCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVG 69 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~~~~-----~~---~~---~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~ 69 (343) .++.+...+. +......++.-... .. .+ .....+......+ +...|++...+....+.++++... 
T Consensus 77 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~vp~~---~~~~ii~~~~~~~~l~~l~~~~~~ 152 (395) T protein:vir:43 77 GEEAPKTAGQ-MVAESLKEQGVTSSLRGSHRVSMPRSAITSIDGSGGALVAPD---RRPGVVAAPQRRLTIRDLVAPGTT 152 (395) T ss_pred ccchhhhHHH-HHHHHHHHHHHHHHhhhhhhhhhhhhhhcccCCCCccccchh---hHHHHHHHHHhhhhHHhhccceec Confidence 1111110000 00000000000000 00 00 0011111122221 223455544444444455454322 Q ss_pred CCCceeEEEeee-ccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHH Q lcl|NC_019525. 70 EGAWSTMLTTYR-SFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEK 148 (343) Q Consensus 70 ~~~w~~~~~~~~-~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~ 148 (343) .+ ..-.+.... ..+.|.+++. . ..+|..+..+++.....+.++..+.+|.+=|+. ++ .|.+--.. T Consensus 153 ~~-~~~~~~~~~~~~~~a~~v~E-------~-~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d---~~--~l~~~v~~ 218 (395) T protein:vir:43 153 ES-NSVEYVRETGFVNNAAPVSE-------G-TQKPYSDLTFELENAPVRTIAHLFKASRQILDD---AS--ALQSYIDA 218 (395) T ss_pred CC-CceEEEEEecCCCceeeecC-------C-ccccccccceeEEEEeeeeEEEeehhhHHHHHh---HH--HHHHHHHH Confidence 21 111222221 1234444432 2 368888889999999999999999999764432 22 46777777 Q ss_pred HHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCH Q lcl|NC_019525. 149 SRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPE 228 (343) Q Consensus 149 aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~ 228 (343) ...+++...+|+..++|+-....+.|+++.+++.....+ ...+.+..+++|.+++..+...-. 
.+..++|.| T Consensus 219 ~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~-----~~~~~~~~~~~i~~~~~~~~~~~~---~~~~~vmn~ 290 (395) T protein:vir:43 219 RARYGLMLVEECQLLYGNGTGANLHGIIPQAQAYAPPSG-----VVVTAEQRIDRIRLAILQAQLAEF---PASGIVLNP 290 (395) T ss_pred HHHHHHHHHHHHHHHhccCCCCccccccccccccccccc-----cccccchhHHHHHHHHHhhccccC---CCcEEEEcH Confidence 777888888889899996554556799998887543322 123344567777777766643322 456899999 Q ss_pred HHHHHHhccccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCC-ccEEEEEEcCcceEEEecCCchhhc Q lcl|NC_019525. 229 SDYTGLAGAASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAV-AKRYALYNDNEDSLRMDIPVDYTST 307 (343) Q Consensus 229 ~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg-~drmv~Y~~d~~~v~~~iP~~~~~l 307 (343) +.|..|..-+ ++.+.-+..-..+.....-.|.|+-+ ..++.. +..-.|. ++...++++. -+.+.+- .... T Consensus 291 ~~~~~l~~lk--d~~G~~i~~~~~~~~~~~l~G~pVv~--~~~~~~-~~~~~gd~~~~~~~~~~~--~~~i~~~--~~~~ 361 (395) T protein:vir:43 291 IDWALIELNK--DAENRYIIGSPQNGTTPTLWRLPVVE--TQAITQ-DEFLTGAFSLGAQIFDRM--DIEVLVS--TEND 361 (395) T ss_pred HHHHHHHHhh--ccCCceeccccccCCCceecceeeEE--cCCCCC-CcEEEEeccceEEEEEec--ceEEEEe--cccc Confidence 9999986543 44443332111111111113555432 222221 1111122 2223344332 2222211 0001 Q ss_pred chhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 308 LANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 308 ~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) ...+++-..+ -++.|+++ .++.|.++++++++. 
T Consensus 362 ~~f~~~~~~~--r~~~r~d~-~v~~~~a~~~~~~ta 394 (395) T protein:vir:43 362 KDFENNMVTI--RAEERLAF-AVYRPEAFVTGSLTA 394 (395) T ss_pred chhhcCcEEE--EEEEeecc-EEecccceEEEEecc Confidence 1123332233 33556655 568999999999999 No 18 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=98.13 E-value=6.3e-07 Score=54.58 Aligned_cols=315 Identities=9% Similarity=-0.027 Sum_probs=147.3 Q ss_pred CCce----eeecCCchHHHH-HHHHhhhccc---------------hhhhhhhh------hhhhhHHHHHHhhhhhhhcc Q lcl|NC_019525. 1 MKKF----VIRNSKGEKILL-NAQEAKIAGV---------------IQRLCNDL------GFEIDVTTLTTLMKKIIEQK 54 (343) Q Consensus 1 ~~~~----~~~~~~~~~~~~-~a~~~~~~~~---------------~~~~~~d~------~~~f~~~qL~~i~~~iye~~ 54 (343) .++- -..+..+++... +......... ........ +...... +.+...+.+.. T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p--~~~~~~i~~~~ 148 (419) T protein:vir:94 71 TPLTPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLP--QLVPGIVPTTP 148 (419) T ss_pred ccccccccccccchhhhhhhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhccccccccccCCcccccc--hhhhHHHHHHH Confidence 1100 001111111100 0000000000 00000000 0001111 11222222222 Q ss_pred cccccchhhcceecCCCCceeEEEeee--ccchhhhh-hcccccCCccccccccccccccceeeeeEEEEeeEeecHHHH Q lcl|NC_019525. 55 FFEISPADYMPIRVGEGAWSTMLTTYR--SFSLAEDF-ATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPEL 131 (343) Q Consensus 55 ~~~l~~~~~~pv~~~~~~w~~~~~~~~--~vg~a~~i-a~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL 131 (343) ..+...+.++.+.+.. +..+.+.. ....+... 
.....|.+..+ .+|.-+..+.+.+..++.++.-+.+|.+=| T Consensus 149 ~~~~~i~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~-~~~~~~~~~~~i~~~~~k~~~~~~is~ell 224 (419) T protein:vir:94 149 DLPLLVADLLDQQNAD---YNVLEYIRDTSGTAGAGSTWNKAAVVPEGT-AKPQSTLSFDTITTTLKTVAHWLPITRQAA 224 (419) T ss_pred hhhhhhhhcceeeecc---CCceeeeeeccccccccccCcccceecCCc-cccccccceeeEEeeeeeEEEeehhhHHHH Confidence 3333333344332111 11111110 00000000 01112333433 588888899999999999999999997766 Q ss_pred HHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHH Q lcl|NC_019525. 132 AEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVY 211 (343) Q Consensus 132 ~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v 211 (343) +.+ + .|.+--.....+++...+|+.+++|+... ...|++|.+++...... ..+...|....++||.+++..+ T Consensus 225 ~d~---~--~l~~~i~~~la~a~~~~~d~aii~G~G~~-~p~Gi~~~~~~~~~~~~--~~~~~~t~~~~~~~l~~~~~~~ 296 (419) T protein:vir:94 225 DDN---S--QLMGYIQGRLTYGLRFLRDRQLLNGNGST-EMQGILTTPGIGTYQQP--KPTAPATDEPPLVDIRRAKTVA 296 (419) T ss_pred HhH---H--HHHHHHHHHHHHHHHHHHHHHHHhccCcc-cccceeccccccccccc--ccccccccchhHHHHHHHHHhh Confidence 543 2 36666667777777788889999996553 57899999998654432 3355667777788999988887 Q ss_pred HhcCCceecCCeEEeCHHHHHHHhccccCCCcchhhhh-HHHhhcchhcCCcceEEeechhhhhcccccCCCccE-EEEE Q lcl|NC_019525. 212 RQGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTKQ-VLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKR-YALY 289 (343) Q Consensus 212 ~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~-~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~dr-mv~Y 289 (343) ...- ..++.++|.|+.|..|...+.+..+ ..++. ...+.....-.|.|+.+.. ++.. 
+..-.|.-.+ ..++ T Consensus 297 ~~~~---~~~~~~v~n~~~~~~l~~~k~~~~~-~~~~~~~~~~~~~~~l~G~pV~~~~--~~~~-~~~~~gd~~~~~~~~ 369 (419) T protein:vir:94 297 EIAG---FPPDGVVVHPQDWESIELDQAPGSG-VFRVIANVQGEATPRIWGLNVVSTV--AIAQ-GTALVGGFRQGATLW 369 (419) T ss_pred hhcc---CCCCEEEEcHHHHHHHHHHhhcCCC-ceeecCCcccCCCccccceeeEEcC--CCCC-ccEEEeeccceEEEE Confidence 5432 2567899999999999765543322 21111 1111111112355544322 2211 1111121222 2333 Q ss_pred EcCcceEEEecCCchhhcchhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 290 NDNEDSLRMDIPVDYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 290 ~~d~~~v~~~iP~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) ++.. +.+.+ ...-.....++-..+ -++.|+++. ++.|.+++++.+.- T Consensus 370 ~~~~--~~v~~--~~~~~~~~~~~~~~~--r~~~r~d~~-v~~~~a~~~~~~~a 416 (419) T protein:vir:94 370 SRQG--ITVLM--TDSHADFFTANTLVI--LAEFRANLA-VYQPKAFVRVTFAA 416 (419) T ss_pred Eecc--eEEEE--eccccchhhcCcEEE--EEEEeeccE-EeccccEEEEEecc Confidence 3322 22211 000000122332333 347778765 47799999999987 No 19 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=97.93 E-value=5.4e-06 Score=49.47 Aligned_cols=297 Identities=9% Similarity=-0.042 Sum_probs=143.1 Q ss_pred CCceeeecCCchHHHH-HHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEe Q lcl|NC_019525. 1 MKKFVIRNSKGEKILL-NAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTT 79 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~-~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~ 79 (343) |++.=..+-. .+... +.+......+.-....+.+....-.+ +...|++.-.+.-..+.++++.+..+ ....+-. 
T Consensus 1 ~~~~~~~~~~-~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~---~~~~ii~~~~~~s~l~~l~~~~~~~~-~~~~~p~ 75 (324) T protein:vir:96 1 MEQTQKLKLN-LQHFASNNVKPQVFNPDNVMMHEKKDGTLMNE---FTTPILQEVMENSKIMQLGKYEPMEG-TEKKFTF 75 (324) T ss_pred CCcchhhhHH-HHHHHHHhhhhhhhccccccccCcCccccchh---HHHHHHHHHHhhchhhhhcceeeccC-CceEEEE Confidence 8877433333 22222 11111111111111122222222222 22344444333333444444332211 1222333 Q ss_pred eeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhh Q lcl|NC_019525. 80 YRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQ 159 (343) Q Consensus 80 ~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n 159 (343) .+..+.|.+++.+ ..+|.-+..+.+.....+.++.-+.+|.+-|+.+. .+|.+.-.+...+++...+| T Consensus 76 ~~~~~~a~~v~Eg--------~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~----~~l~~~i~~~la~ai~~~~d 143 (324) T protein:vir:96 76 WADKPGAYWVGEG--------QKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFD 143 (324) T ss_pred EecCcceeEecCC--------ccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHH Confidence 3333455565432 46888889999999999999998888886666442 36777777888888888889 Q ss_pred heEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhcccc Q lcl|NC_019525. 160 EIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAAS 239 (343) Q Consensus 160 ~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~ 239 (343) +.+++|......-.|+++..+.....+. .+.|. +||.+++..+... 
+ ..++.++|-++.+..|...+ T Consensus 144 ~a~l~G~g~~~~~~gi~~~~~~~~~~~~-----~~~t~----~~i~~~~~~l~~~--~-~~~~~~vmn~~~~~~L~~l~- 210 (324) T protein:vir:96 144 EAGILNQGNNPFGKSIAQSIEKTNKVIK-----GDFTQ----DNIIDLEALLEDD--E-LEANAFISKTQNRSLLRKIV- 210 (324) T ss_pred HHHhccCCCCCcCccccccccccceecc-----ccccH----HHHHHHHHhhhhc--c-CCCCEEEEcHHHHHHHHHhh- Confidence 9999996433223466665443222221 12234 4445555544322 2 35678999999999996543 Q ss_pred CCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcC------cceEEEecCCchhhcc----- Q lcl|NC_019525. 240 ADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDN------EDSLRMDIPVDYTSTL----- 308 (343) Q Consensus 240 s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d------~~~v~~~iP~~~~~l~----- 308 (343) +..+..++ ..-....-.|.|+.+.+ +...++..++.-+.+ .+.+.+++-....... T Consensus 211 -d~~G~~~~---~~~~~~~l~G~PV~~~~---------~~~~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~ 277 (324) T protein:vir:96 211 -DPETKERI---YDRNSDSLDGLPVVNLK---------SSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNED 277 (324) T ss_pred -ccCCCeee---cCCCCCcccceeeEeeC---------CCCCCcceEEEEecceEEEEEecCcEEEEeeccccccccccc Confidence 32232221 11111122355543322 111122333332221 1222332211100000 Q ss_pred -----hhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 309 -----ANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 309 -----p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) -.+++-.. +-+++|+++. ++.|++++++-.-. 
T Consensus 278 ~~~~~~f~~d~~~--~r~~~r~d~~-v~~~~A~~~l~~a~ 314 (324) T protein:vir:96 278 GTPVNLFEQDMVA--LRATMHVALH-IADDKAFAKLVPAD 314 (324) T ss_pred ccchhhhhcCcEE--EEEEEEEccE-EecccceEEEeccc Confidence 12333223 3457777655 55599998876544 No 20 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=97.93 E-value=5.4e-06 Score=49.47 Aligned_cols=297 Identities=9% Similarity=-0.042 Sum_probs=143.1 Q ss_pred CCceeeecCCchHHHH-HHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEe Q lcl|NC_019525. 1 MKKFVIRNSKGEKILL-NAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTT 79 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~-~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~ 79 (343) |++.=..+-. .+... +.+......+.-....+.+....-.+ +...|++.-.+.-..+.++++.+..+ ....+-. T Consensus 1 ~~~~~~~~~~-~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~---~~~~ii~~~~~~s~l~~l~~~~~~~~-~~~~~p~ 75 (324) T protein:vir:78 1 MEQTQKLKLN-LQHFASNNVKPQVFNPDNVMMHEKKDGTLMNE---FTTPILQEVMENSKIMQLGKYEPMEG-TEKKFTF 75 (324) T ss_pred CCcchhhhHH-HHHHHHHhhhhhhhccccccccCcCccccchh---HHHHHHHHHHhhchhhhhcceeeccC-CceEEEE Confidence 8877433333 22222 11111111111111122222222222 22344444333333444444332211 1222333 Q ss_pred eeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhh Q lcl|NC_019525. 80 YRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQ 159 (343) Q Consensus 80 ~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n 159 (343) .+..+.|.+++.+ ..+|.-+..+.+.....+.++.-+.+|.+-|+.+. 
.+|.+.-.+...+++...+| T Consensus 76 ~~~~~~a~~v~Eg--------~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~----~~l~~~i~~~la~ai~~~~d 143 (324) T protein:vir:78 76 WADKPGAYWVGEG--------QKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFD 143 (324) T ss_pred EecCcceeEecCC--------ccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHH Confidence 3333455565432 46888889999999999999998888886666442 36777777888888888889 Q ss_pred heEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhcccc Q lcl|NC_019525. 160 EIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAAS 239 (343) Q Consensus 160 ~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~ 239 (343) +.+++|......-.|+++..+.....+. .+.|. +||.+++..+... + ..++.++|-++.+..|...+ T Consensus 144 ~a~l~G~g~~~~~~gi~~~~~~~~~~~~-----~~~t~----~~i~~~~~~l~~~--~-~~~~~~vmn~~~~~~L~~l~- 210 (324) T protein:vir:78 144 EAGILNQGNNPFGKSIAQSIEKTNKVIK-----GDFTQ----DNIIDLEALLEDD--E-LEANAFISKTQNRSLLRKIV- 210 (324) T ss_pred HHHhccCCCCCcCccccccccccceecc-----ccccH----HHHHHHHHhhhhc--c-CCCCEEEEcHHHHHHHHHhh- Confidence 9999996433223466665443222221 12234 4445555544322 2 35678999999999996543 Q ss_pred CCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcC------cceEEEecCCchhhcc----- Q lcl|NC_019525. 240 ADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDN------EDSLRMDIPVDYTSTL----- 308 (343) Q Consensus 240 s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d------~~~v~~~iP~~~~~l~----- 308 (343) +..+..++ ..-....-.|.|+.+.+ +...++..++.-+.+ .+.+.+++-....... 
T Consensus 211 -d~~G~~~~---~~~~~~~l~G~PV~~~~---------~~~~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~ 277 (324) T protein:vir:78 211 -DPETKERI---YDRNSDSLDGLPVVNLK---------SSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNED 277 (324) T ss_pred -ccCCCeee---cCCCCCcccceeeEeeC---------CCCCCcceEEEEecceEEEEEecCcEEEEeeccccccccccc Confidence 32232221 11111122355543322 111122333332221 1222332211100000 Q ss_pred -----hhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 309 -----ANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 309 -----p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) -.+++-.. +-+++|+++. ++.|++++++-.-. T Consensus 278 ~~~~~~f~~d~~~--~r~~~r~d~~-v~~~~A~~~l~~a~ 314 (324) T protein:vir:78 278 GTPVNLFEQDMVA--LRATMHVALH-IADDKAFAKLVPAD 314 (324) T ss_pred ccchhhhhcCcEE--EEEEEEEccE-EecccceEEEeccc Confidence 12333223 3457777655 55599998876544 No 21 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=97.92 E-value=1.8e-06 Score=52.03 Aligned_cols=303 Identities=10% Similarity=0.011 Sum_probs=138.4 Q ss_pred CCceeeecCCchHHH-------------------HHHHHhh-hccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccc Q lcl|NC_019525. 1 MKKFVIRNSKGEKIL-------------------LNAQEAK-IAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISP 60 (343) Q Consensus 1 ~~~~~~~~~~~~~~~-------------------~~a~~~~-~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~ 60 (343) -.........++.+- .+..... ..........+.+ .+... 
.+...|++...+...- T Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-~lvp~---~~~~~ii~~~~~~~~l 165 (418) T protein:vir:10 90 SAELETPKTLGQLVTESEEMKGMDGSARKSVRVRVDRKSIMNVPATVGSGVSGSN-SLVVA---DRQAGIIAPPQRKMTI 165 (418) T ss_pred ccccchhhhhhHHhhhHHHHHHHHHHHhhhhhhhhHHHHHHHhhhhccCCCCCCc-cccch---hHHHHHHHHHhhhhhH Confidence 111111111111100 0000000 0000000011112 12222 1223344443344444 Q ss_pred hhhcceecCCCCceeEEEeeec-cchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcC Q lcl|NC_019525. 61 ADYMPIRVGEGAWSTMLTTYRS-FSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGN 139 (343) Q Consensus 61 ~~~~pv~~~~~~w~~~~~~~~~-vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr 139 (343) +.++++....+ ....+-.... ...|.+++.+ ..+|.-+..+++.....+.++..+.+|.+=|+. ++ T Consensus 166 ~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~E~--------~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~d---s~- 232 (418) T protein:vir:10 166 RDLLMPGQTSS-SSIEYTVETGFTNNAAAVAEG--------AQKPTSDLKFNLKNQPVRTIAHLFKASRQILDD---AP- 232 (418) T ss_pred HhhcceeeccC-CceeEEEEecCCCceeeeccC--------ccccccccceeeEEEeeeeEEEeehhhHHHHHh---HH- Confidence 44444322111 1111211111 2334444322 357888888999999999999988888764442 22 Q ss_pred CCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCcee Q lcl|NC_019525. 140 WDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTA 219 (343) Q Consensus 140 ~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~ 219 (343) +|.+--.....+++...+|+.+++|+.......|++|.+++...+.+ ... ..+. +||.+++..+... + . 
T Consensus 233 -~l~~~i~~~l~~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~-~~~--~~~~----~~i~~~~~~~~~~-~--~ 301 (418) T protein:vir:10 233 -ALQSYIDGRARYGLQLTEEGQILKGDGTGANILGILPQASAFMPSIT-LAN--ATPI----DKIRLALLQAVLA-E--F 301 (418) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccc-ccc--cccH----HHHHHHHHhhccc-c--C Confidence 47777777778888888999999996544447899999876543321 111 1233 4455555444322 1 2 Q ss_pred cCCeEEeCHHHHHHHhccccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccE-EEEEEcCcceEEE Q lcl|NC_019525. 220 MPNKFTIPESDYTGLAGAASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKR-YALYNDNEDSLRM 298 (343) Q Consensus 220 ~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~dr-mv~Y~~d~~~v~~ 298 (343) .++.++|.|..|..|...+ ++.+.-|..-..+.....-.|.|+-+ ..++.. +..-.|.-.. ..++++ .-+.+ T Consensus 302 ~~~~~v~n~~~~~~L~~lk--d~~G~~i~~~~~~~~~~~l~G~pV~~--~~~~p~-~~~~~gd~s~~~~~~~~--~~~~i 374 (418) T protein:vir:10 302 PATGIVLNPIDWASIELTK--DSQGRYIVGNPVNGTTPRLWNLPVVE--TQAMTA-NEFLVGAFSMAAQIFDR--MEIEV 374 (418) T ss_pred CCCEEEEcHHHHHHHHHhh--cCCCceeccccccCCCceecceeeEE--cCCCCC-CcEEEeeccceEEEEEe--cceEE Confidence 4567999999999997544 33333332111111111113555432 122211 1111111111 223322 12222 Q ss_pred ecCCchhhcchhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 299 DIPVDYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 299 ~iP~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) .+ ......-..++... 
+-++.|+++ .++.|.+++|+++.- T Consensus 375 ~~--~~~~~~~f~~~~~~--~r~~~~~d~-~~~~~~a~~~~~~~~ 414 (418) T protein:vir:10 375 LL--STENVDDFEKNMVS--IRAEERLAL-AVYRPESFVTGALVE 414 (418) T ss_pred EE--ecccchhhhcCceE--EEEEEeecc-EEecccceEEEEecc Confidence 11 00000001232222 335778887 588999999999987 No 22 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=97.89 E-value=7.5e-06 Score=48.67 Aligned_cols=304 Identities=9% Similarity=-0.031 Sum_probs=142.4 Q ss_pred CCceeeecCCchHHHHHHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEee Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTTY 80 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~~ 80 (343) |+|.-..+..-.+.-....+.....+.-....+.+..+.-.++ ...|.+...+.-.-+.+.++.... .....+-.. T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~~iP~~~---~~~ii~~~~~~s~l~~~~~~~~~~-~~~~~ip~~ 76 (324) T protein:vir:97 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEF---TTPILQEVMENSKIMQLGKYEPME-GTEKKFTFW 76 (324) T ss_pred CccchhHHHHHHHHHHhhhhhhhhccccccccCCCcceechhH---HHHHHHHHHhhcchhhhcceeecc-CCceEEEEE Confidence 7765333322111111111110000100111222322332322 233443322333333333332211 112233333 Q ss_pred eccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhh Q lcl|NC_019525. 81 RSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQE 160 (343) Q Consensus 81 ~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~ 160 (343) +..+.|.+++.+ ..+|.-+..+++.....+.++.-+.+|.+-|+.+. 
.++.+.-.....+++...+|+ T Consensus 77 ~~~~~a~~v~Eg--------~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~----~~l~~~i~~~l~~aia~~~d~ 144 (324) T protein:vir:97 77 ADKPGAYWVGEG--------QKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDE 144 (324) T ss_pred ecCcceeEeccC--------ccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHH Confidence 444556666533 36888889999999999999999988886666442 468888888888888899999 Q ss_pred eEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccC Q lcl|NC_019525. 161 IAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASA 240 (343) Q Consensus 161 v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s 240 (343) .++.|+....--.|+++........+. ++.|.++ |.+++..+... + ..++.++|.|..|..|.+-. T Consensus 145 a~l~G~g~~~~~~gi~~~~~~~~~~~~-----~~~~~~~----i~~~~~~l~~~--~-~~~~~~v~n~~~~~~L~~lk-- 210 (324) T protein:vir:97 145 AGILNQGNNPFGKSIAQSIEKTNKVIK-----GDFTQDN----IIDLEALLEDD--E-LEANAFISKTQNRSLLRKIV-- 210 (324) T ss_pred HhhccCCCCccCccccccccccceecc-----ccCCHHH----HHHHHHhhhhc--c-CCCCEEEEcHHHHHHHHHhh-- Confidence 999996443223466665443221111 2234444 44444444322 2 35678999999999997543 Q ss_pred CCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecCCchhhc----------chh Q lcl|NC_019525. 241 DFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPVDYTST----------LAN 310 (343) Q Consensus 241 ~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~~~l----------~p~ 310 (343) +..+..+. ..-....-.|.|+.+.+. +..+.+..-.|...++++-. .+.+.+++--..... --. 
T Consensus 211 d~~g~~~~---~~~~~~tl~G~PV~~~~~-~~~~~~~~~~gd~~~~~i~~--~~~~~i~~~~~~~~~~~~~~~~~~~~~f 284 (324) T protein:vir:97 211 DPETKERI---YDRNSDTLDGLPVVNLKS-SNLKRGELITGDFDKLIYGI--PQLIEYKIDETAQLSTVKNEDGTPVNLF 284 (324) T ss_pred cCCCceee---cCCCCccccceeeEeecC-CCCCcceEEEEecccEEEEE--ecCcEEEEeecccccccccccccchhhh Confidence 22222221 111111224666543221 00000000001111222111 122333221111000 012 Q ss_pred hcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 311 SVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 311 ~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) +++...+ -+++|+++..+ .|++++.+..-. T Consensus 285 ~~d~~~~--r~~~r~d~~v~-~~~a~~~l~~~~ 314 (324) T protein:vir:97 285 EQDMVAL--RATMHVALHIA-DDKAFAKLVPAD 314 (324) T ss_pred hcCcEEE--EEEEEeccEEe-cccceEEEEecc Confidence 3332233 44778866655 599999887655 No 23 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=97.88 E-value=7.4e-06 Score=48.70 Aligned_cols=282 Identities=6% Similarity=-0.065 Sum_probs=143.1 Q ss_pred hhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEeeeccchhhhhhcccccCCccccccccc Q lcl|NC_019525. 27 IQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAAV 106 (343) Q Consensus 27 ~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~v 106 (343) +......+|..+.-.++ ...|++...+.-.-+++.++.+... ....+-....-..|.+++.+.. 
....++|.- T Consensus 1 ma~~t~~~gg~liP~~~---~~~Ii~~~~~~s~l~~l~~~~~~~~-~~~~~p~~~~~~~a~wv~E~~~---~~~~~~~~s 73 (305) T protein:vir:25 1 MADISRAEVASLIQEAY---SDTLLAAAKQGSTVLSAFQNVNMGT-KTTHLPVLATLPEADWVGESAT---DPKGVKPTS 73 (305) T ss_pred CCCccCCccceecCHHH---HHHHHHHHHhhchhhhhcceeeccC-CcEEEEEEeCCcceEEeecccc---ccccccccc Confidence 22222233323222222 3444444333333344444322211 1222322333345666654321 112357777 Q ss_pred cccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCcee--- Q lcl|NC_019525. 107 DTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVV--- 183 (343) Q Consensus 107 d~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~--- 183 (343) +..+++.....++.+.-+.+|.+=|+.+. .++.+.-.+...+++.+.+|+..++|+.+.. |+.+...++. T Consensus 74 ~~~f~~i~~~~~k~~~~~~is~ell~ds~----~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~---~~~~~~~~~~~~~ 146 (305) T protein:vir:25 74 KVTWANRTLVAEEIAVIIPVHENVIDDAT----VAVLTEVAELGGQAIGKKLDQAVIFGTDKPA---SWVSPALIPAAVT 146 (305) T ss_pred ccceeeEEeeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHhhhheeccCCCC---Ccccccccccccc Confidence 88899999999999998888886555332 4688888999999999999999999964432 3333322221 Q ss_pred ecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCCCcchhhhhHHHhhcchhcCCcc Q lcl|NC_019525. 184 NNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTKQVLEDTFKEITRNSS 263 (343) Q Consensus 184 ~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~ 263 (343) .+......-...+.+++++++..+...+... + ..++.++|.+..|..|... -++.++-+. +. 
..-.|.| T Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~--~~~~~~v~~~~~~~~l~~l--kd~~G~~i~---~~---~~l~G~P 215 (305) T protein:vir:25 147 AGQAVEVVGGVANESDIVGATNRAAKAVASA-G--WAPDTLLSSLALRYEVANI--RDANGNPVF---RD---DSFAGFR 215 (305) T ss_pred ccccccccccchhhhHHHHHHHHHHHhhhhc-c--cccceeEecHHHHHHHHHh--hccCCceee---cC---Ccccccc Confidence 1111122222334566777777776655332 2 3567799999999999643 344333322 11 1234677 Q ss_pred eEEeechhhhhcccccCCCccEEEEEEcCc------ceEEEecC------CchhhcchhhcCCceEEeceeeeeccEEEE Q lcl|NC_019525. 264 FEILPCVYADKITAQVPAVAKRYALYNDNE------DSLRMDIP------VDYTSTLANSVNNFQFQNAAYGQFTGVLAY 331 (343) Q Consensus 264 l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~------~~v~~~iP------~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~ 331 (343) +.+... + .. ..++..++.-+-+. +-+.+++- +.-.+.--++++. ..+-++.|+| +.++ T Consensus 216 v~~~~~--~---~~--~~~~~~~~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~--~~~R~~~r~~-~~v~ 285 (305) T protein:vir:25 216 TFFNRN--G---AW--DADAAIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDM--VALRLKARFA-YVLG 285 (305) T ss_pred eEEcCc--c---CC--CCCccEEEEEecceEEEEEecCeEEEEeeeeeeecCCceeeeeecCc--EEEEEEEeec-ceee Confidence 654321 1 11 11111222212111 11111110 0001111123322 2234577887 4578 Q ss_pred cCceEEeeecC-C Q lcl|NC_019525. 332 RPKELLYLDIP-V 343 (343) Q Consensus 332 yP~a~~Y~D~~-~ 343 (343) .|++++.++.- + T Consensus 286 ~p~a~v~~~~~~~ 298 (305) T protein:vir:25 286 VSATAQGANKTPV 298 (305) T ss_pred CcccEEEEccccc Confidence 89999999973 3 No 24 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=97.87 E-value=2.3e-06 Score=51.53 Aligned_cols=306 Identities=14% Similarity=0.070 Sum_probs=141.8 Q ss_pred CCceeeecCCc-hHHHH-------------HHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcce Q lcl|NC_019525. 
1 MKKFVIRNSKG-EKILL-------------NAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPI 66 (343) Q Consensus 1 ~~~~~~~~~~~-~~~~~-------------~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv 66 (343) ........... ..... ..+....................+. +.+...|++...+...-+.++++ T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp--~~~~~~ii~~~~~~~~l~~~~~~ 154 (413) T protein:vir:81 77 IGEFFAKRAGDQIKQQAGGAQLNYSVGEYVAPRVKAASDPASTATLTDEFQGGYG--TTWNRNIIYRRREKLVVADLMDN 154 (413) T ss_pred hhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhHHHhhhhhhhhcccccccccccc--hhhHHHHHHHHhhhhhHHhhcce Confidence 11110000000 00000 0000001111111111111112222 22444566655555555555553 Q ss_pred ecCCC---CceeEEEeeeccchhhhhhcccccCCccccccccccc-cccceeeeeEEEEeeEeecHHHHHHHHHhcCCCc Q lcl|NC_019525. 67 RVGEG---AWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVDT-GVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDL 142 (343) Q Consensus 67 ~~~~~---~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~-~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L 142 (343) .+... .|......-...+.|.+++.| ..+|..+. .+++.+..++.++.-+.+|.+=|+.+ . .| T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg--------~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds---~--~l 221 (413) T protein:vir:81 155 LTMTNTTIKYLMEKANRVVEGGFKTVAEG--------GKKPYMRFADFDIVTESLSKIAGLTKITDEMIEDY---D--FL 221 (413) T ss_pred eeccCCceeEEEeccccccccccceecCc--------ccccccCcccceeeEeeeeeEEEeehhhHHHHHHH---H--HH Confidence 22211 111111000111334444432 35676664 68889999999998888887644433 2 26 Q ss_pred HHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCC Q lcl|NC_019525. 143 ITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPN 222 (343) Q Consensus 143 ~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~ 222 (343) .+--......++...+|+..++|+-..+.+.|+++.+++...... 
+.+...+++..++..+....++ .++ T Consensus 222 ~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~Gi~~~~~~~~~~~~--------~~~~~~~~i~~~~~~~~~~~~~--~~~ 291 (413) T protein:vir:81 222 VSYINARLLEELAIEEERQLLLGDGTGNNLTGLLKRDGIQTLAVS--------NKDELADSIYKAMTNISLATPF--QAD 291 (413) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCCCccccccccccccccccc--------ccchhHHHHHHHHHHhhhhccC--CCc Confidence 666666667777778888899996455557899999887432221 2234567777777776555554 467 Q ss_pred eEEeCHHHHHHHhccccCCCcchhhh-hHHHhhcch-------hcCCcceEEeechhhhhcccccCCCcc-EEEEEEcCc Q lcl|NC_019525. 223 KFTIPESDYTGLAGAASADFPIKSTK-QVLEDTFKE-------ITRNSSFEILPCVYADKITAQVPAVAK-RYALYNDNE 293 (343) Q Consensus 223 tl~lp~~~~~~L~~~~~s~~~~~tl~-~~l~~n~~~-------~~~g~~l~I~~~~~~~~~~~~g~gg~d-rmv~Y~~d~ 293 (343) .++|.++.|..|..-+ ++.+.-|+ ..+...+.+ .-.|.|+.+-. .+. .+..-.|.-+ -..++++ T Consensus 292 ~~vmn~~~~~~l~~lk--d~~G~~l~~~~~~~~~~~~~~~~~~~l~G~pv~~s~--~~~-~~~~~~gd~~~~~~~~~~-- 364 (413) T protein:vir:81 292 ALVINPLDYQELRLAK--DANGQYYGGGVFQGQYGSGGIMLDPAPWGLRTVQSQ--VVP-VGKPVVGAFRSAASVLRK-- 364 (413) T ss_pred EEEEcHHHHHHHHHhh--ccCCceeccccccccccccccccCceecceeeEEcC--CCC-cccEEEEecccEEEEEEe-- Confidence 8999999999986443 33333332 122111111 11255543221 111 1111111111 1233332 Q ss_pred ceEEEecCCchhhcchhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 
294 DSLRMDIPVDYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 294 ~~v~~~iP~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) .-+.+.+- ...-....++ ...+-++.|+++ .+++|.+++++++-- T Consensus 365 ~~~~v~~~--~~~~~~~~~~--~~~~r~~~r~d~-~~~~~~a~~~l~~~~ 409 (413) T protein:vir:81 365 GGVRIDST--NTNVDDFENN--LITVRAEERVGL-MVTFPEAIVQLDVAE 409 (413) T ss_pred cceEEEEe--ccccchhhcC--cEEEEEEEeecc-EEecccceEEEEecC Confidence 11222110 0000112232 223445778875 457899999998744 No 25 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=97.87 E-value=6e-06 Score=49.21 Aligned_cols=291 Identities=10% Similarity=-0.027 Sum_probs=140.1 Q ss_pred CCceeeecCCchHHHHHHHHhh------hcc-chhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCc Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAK------IAG-VIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAW 73 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~------~~~-~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w 73 (343) |||.=..+ ++.++-. ..- +.-....+.+......++ -..|.+.-.+.-.-+++.++.+... . T Consensus 1 ~~~~~~~~-------~~~~~f~~~~~~~~~~~a~~~~~~~~~~~liP~~~---~~~ii~~~~~~s~l~~~~~~~~~~~-~ 69 (324) T protein:vir:10 1 MEQTQKLK-------LNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDF---TTPILQEVMENSKIMQLGKYEPMEG-T 69 (324) T ss_pred CCCchHHH-------HHHHHHHHHhhccceecccceeccCCCcceechhH---HHHHHHHHHhhchhhhhcceeeccC-C Confidence 88763332 3322211 010 111111222222222222 2333332222222333333322111 1 Q ss_pred eeEEEeeeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHH Q lcl|NC_019525. 74 STMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKN 153 (343) Q Consensus 74 ~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a 153 (343) ...+...+..+.|++++.+ ..+|..+..+.+.....+.++.-+.+|.+-|+.+. 
.+|.+.-.+...++ T Consensus 70 ~~~~p~~~~~~~a~~v~Eg--------~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~a 137 (324) T protein:vir:10 70 EKKFTFWADKPGAYWVGEG--------QKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEA 137 (324) T ss_pred ceEEEEEeCCcceeEeccC--------ccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHH Confidence 2233334444556666543 46888888999999999999999888887666442 36777777888888 Q ss_pred HHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHH Q lcl|NC_019525. 154 WDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTG 233 (343) Q Consensus 154 ~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~ 233 (343) +.+.+|+.+++|......-.|+++........+ -.+.|.+ ||.+++..+... -..++.++|.|+.|.. T Consensus 138 i~~~~d~a~l~G~g~~~~~~~i~~~~~~~~~~~-----~~~~t~~----~i~~~~~~l~~~---~~~~~~~v~n~~~~~~ 205 (324) T protein:vir:10 138 FYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVI-----KGDFTQD----NIIDLEALLEDD---ELEANAFISKTQNRSL 205 (324) T ss_pred HHHHHHHHhhhcCCCCccCccccccccccceec-----cccCCHH----HHHHHHHhhhhc---cCCCCEEEEcHHHHHH Confidence 888899999999644322345555433211111 1233444 444555554322 2357789999999999 Q ss_pred HhccccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcC------cceEEEecCCchhhc Q lcl|NC_019525. 234 LAGAASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDN------EDSLRMDIPVDYTST 307 (343) Q Consensus 234 L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d------~~~v~~~iP~~~~~l 307 (343) |.+-+ +..+..+. ..-....-.|.|+.+.+. ...++..+++-+-+ .+.+.+++--..... 
T Consensus 206 L~~l~--d~~g~~~~---~~~~~~~l~G~PV~~~~~---------~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~ 271 (324) T protein:vir:10 206 LRKIV--DPETKERI---YDRNSDTLDGLPVVNLKS---------SNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLS 271 (324) T ss_pred HHHhh--ccCCceee---cCCCCccccceeEEeecC---------CCCCcceEEEEecccEEEEEecCcEEEEeeccccc Confidence 96443 22222221 111111224666543221 11122233322211 122222221110000 Q ss_pred c----------hhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 308 L----------ANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 308 ~----------p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) . -.+.+- ..+-++.|+|+..+ .|++++.+-.-. T Consensus 272 ~~~~~~~~~~~~~~~~~--~~~r~~~r~d~~v~-~~~A~~~l~~a~ 314 (324) T protein:vir:10 272 TVKNEDGTPVNLFEQDM--VALRATMHVALHIA-DDKAFAKLVPAD 314 (324) T ss_pred ccccccccchhhhhcCc--EEEEEEEEEccEEe-cccceEEEEecc Confidence 0 022322 23345778876655 599999987655 No 26 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=97.87 E-value=1.8e-06 Score=52.10 Aligned_cols=312 Identities=14% Similarity=0.069 Sum_probs=150.9 Q ss_pred CCceeeec--------CCchHHH-HHHHHhhhcc-------chhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhc Q lcl|NC_019525. 1 MKKFVIRN--------SKGEKIL-LNAQEAKIAG-------VIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYM 64 (343) Q Consensus 1 ~~~~~~~~--------~~~~~~~-~~a~~~~~~~-------~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~ 64 (343) ++...... ...+... .+........ .......+..+.+++. 
..+...|++...+...-+.++ T Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp--~~~~~~ii~~~~~~~~i~~l~ 185 (497) T protein:vir:10 108 FEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGIL--PTFLPGIVEQLFYELSLADLI 185 (497) T ss_pred hhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccccccc--hhhhHHHHHHHHhhhhHHhhc Confidence 11111100 0000000 0001000000 0001112222233333 223345666655555556665 Q ss_pred ceecCCCCceeEEEeeec-cchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcH Q lcl|NC_019525. 65 PIRVGEGAWSTMLTTYRS-FSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLI 143 (343) Q Consensus 65 pv~~~~~~w~~~~~~~~~-vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~ 143 (343) ++-+..+ ..-.+-.... .+.|.+++.+ ..+|.-+..+++.+...+.++.-+.+|.+=|+.+ . .|. T Consensus 186 ~~~~~~~-~~~~~~~~~~~~~~a~wv~E~--------~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~---~--~l~ 251 (497) T protein:vir:10 186 SSRPVTS-PNLSYLTESAAHNNAAAVAEA--------GTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---P--ELF 251 (497) T ss_pred cccccCC-CceEEEEEcCCCCcceeeccC--------cccccccccceeeEeeeeeeEeecHhHHHHHHhH---H--HHH Confidence 5422221 1112211111 2345555432 3688888999999999999998887777544433 2 367 Q ss_pred HHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCc--------------------------------- Q lcl|NC_019525. 144 TALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTK--------------------------------- 190 (343) Q Consensus 144 ~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~--------------------------------- 190 (343) +--.+...+++...+|.-.+.|+ ...+..|+++++.+......... 
T Consensus 252 ~~i~~~l~~~i~~~~d~~~l~G~-G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 330 (497) T protein:vir:10 252 NFVQGRLLEGIQRKEEVQLLAGG-GYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKY 330 (497) T ss_pred HHHHHHHHHHHHHHHHHHhhcCC-CcccccccccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHH Confidence 77777788888888999999995 33457899999876432111100 Q ss_pred --------------ccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCCCcchhhhh-HHHh-h Q lcl|NC_019525. 191 --------------SIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTKQ-VLED-T 254 (343) Q Consensus 191 --------------~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~-~l~~-n 254 (343) .-...|....+.++..++..+.... ...|+.++|-|..|..|..- -++.++.++. .... . T Consensus 331 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~vmn~~~~~~l~~l--kd~~G~~i~~~~~~~~~ 406 (497) T protein:vir:10 331 GRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTL--FQTPNAVVMNPRDWELLRLT--KDANGQYMGGNFFGNAY 406 (497) T ss_pred HHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhc--ccCCCeEEEchHHHHHHHHh--hcCCCceeccCcccccc Confidence 0011234455666666666665533 34678999999999998533 3444433321 1000 0 Q ss_pred c-----chhcCCcceEEeechhhhhcccccCCCccE--EEEEEcCcceEEEecCCchhhcchhhcCCceEEeceeeeecc Q lcl|NC_019525. 255 F-----KEITRNSSFEILPCVYADKITAQVPAVAKR--YALYNDNEDSLRMDIPVDYTSTLANSVNNFQFQNAAYGQFTG 327 (343) Q Consensus 255 ~-----~~~~~g~~l~I~~~~~~~~~~~~g~gg~dr--mv~Y~~d~~~v~~~iP~~~~~l~p~~~~~l~~~v~~~~r~GG 327 (343) . ...-.|.|+.+.+- +. .+..-.|.-+. ..++++ .-+++.+ .......++++ ...+-++.|+++ T Consensus 407 ~~~~~~~~~l~G~pV~~t~~--~~-~~~~~~Gd~~~~~~~i~~r--~~~~v~~--~~~~~~~f~~n--~v~~r~~~r~~~ 477 (497) T protein:vir:10 407 GNPVNGGKNIWGVPVVTTPL--IP-LGTILVGHFAPSVIQTARR--EGVTMQM--TNSNGTDFVDG--KVTVRAEERLGL 477 (497) T ss_pred cccccCCceeeceeeEecCC--CC-CCceEEeecccceEEEEEe--cccEEEe--ecccchhhhcC--cEEEEEEEeecc Confidence 0 00112455433221 10 11110111111 122222 3333322 11111224443 333456889988 Q ss_pred EEEEcCceEEeeecCC Q lcl|NC_019525. 
328 VLAYRPKELLYLDIPV 343 (343) Q Consensus 328 v~v~yP~a~~Y~D~~~ 343 (343) .++.|++++++++.- T Consensus 478 -~v~~p~A~~~l~~~~ 492 (497) T protein:vir:10 478 -LVYRPSAFQLIQLKK 492 (497) T ss_pred -eeeccccEEEEEecC Confidence 778999999999987 No 27 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=97.87 E-value=1.8e-06 Score=52.10 Aligned_cols=312 Identities=14% Similarity=0.069 Sum_probs=150.9 Q ss_pred CCceeeec--------CCchHHH-HHHHHhhhcc-------chhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhc Q lcl|NC_019525. 1 MKKFVIRN--------SKGEKIL-LNAQEAKIAG-------VIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYM 64 (343) Q Consensus 1 ~~~~~~~~--------~~~~~~~-~~a~~~~~~~-------~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~ 64 (343) ++...... ...+... .+........ .......+..+.+++. ..+...|++...+...-+.++ T Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp--~~~~~~ii~~~~~~~~i~~l~ 185 (497) T protein:vir:78 108 FEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGIL--PTFLPGIVEQLFYELSLADLI 185 (497) T ss_pred hhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccccccc--hhhhHHHHHHHHhhhhHHhhc Confidence 11111100 0000000 0001000000 0001112222233333 223345666655555556665 Q ss_pred ceecCCCCceeEEEeeec-cchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcH Q lcl|NC_019525. 65 PIRVGEGAWSTMLTTYRS-FSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLI 143 (343) Q Consensus 65 pv~~~~~~w~~~~~~~~~-vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~ 143 (343) ++-+..+ ..-.+-.... .+.|.+++.+ ..+|.-+..+++.+...+.++.-+.+|.+=|+.+ . .|. 
T Consensus 186 ~~~~~~~-~~~~~~~~~~~~~~a~wv~E~--------~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~---~--~l~ 251 (497) T protein:vir:78 186 SSRPVTS-PNLSYLTESAAHNNAAAVAEA--------GTYPFSSEEFARVYEQVGKVANALTITDEGLRDA---P--ELF 251 (497) T ss_pred cccccCC-CceEEEEEcCCCCcceeeccC--------cccccccccceeeEeeeeeeEeecHhHHHHHHhH---H--HHH Confidence 5422221 1112211111 2345555432 3688888999999999999998887777544433 2 367 Q ss_pred HHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCc--------------------------------- Q lcl|NC_019525. 144 TALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTK--------------------------------- 190 (343) Q Consensus 144 ~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~--------------------------------- 190 (343) +--.+...+++...+|.-.+.|+ ...+..|+++++.+......... T Consensus 252 ~~i~~~l~~~i~~~~d~~~l~G~-G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 330 (497) T protein:vir:78 252 NFVQGRLLEGIQRKEEVQLLAGG-GYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKY 330 (497) T ss_pred HHHHHHHHHHHHHHHHHHhhcCC-CcccccccccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHH Confidence 77777788888888999999995 33457899999876432111100 Q ss_pred --------------ccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCCCcchhhhh-HHHh-h Q lcl|NC_019525. 191 --------------SIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTKQ-VLED-T 254 (343) Q Consensus 191 --------------~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~-~l~~-n 254 (343) .-...|....+.++..++..+.... ...|+.++|-|..|..|..- -++.++.++. .... . T Consensus 331 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~vmn~~~~~~l~~l--kd~~G~~i~~~~~~~~~ 406 (497) T protein:vir:78 331 GRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTL--FQTPNAVVMNPRDWELLRLT--KDANGQYMGGNFFGNAY 406 (497) T ss_pred HHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhc--ccCCCeEEEchHHHHHHHHh--hcCCCceeccCcccccc Confidence 0011234455666666666665533 34678999999999998533 3444433321 1000 0 Q ss_pred c-----chhcCCcceEEeechhhhhcccccCCCccE--EEEEEcCcceEEEecCCchhhcchhhcCCceEEeceeeeecc Q lcl|NC_019525. 
255 F-----KEITRNSSFEILPCVYADKITAQVPAVAKR--YALYNDNEDSLRMDIPVDYTSTLANSVNNFQFQNAAYGQFTG 327 (343) Q Consensus 255 ~-----~~~~~g~~l~I~~~~~~~~~~~~g~gg~dr--mv~Y~~d~~~v~~~iP~~~~~l~p~~~~~l~~~v~~~~r~GG 327 (343) . ...-.|.|+.+.+- +. .+..-.|.-+. ..++++ .-+++.+ .......++++ ...+-++.|+++ T Consensus 407 ~~~~~~~~~l~G~pV~~t~~--~~-~~~~~~Gd~~~~~~~i~~r--~~~~v~~--~~~~~~~f~~n--~v~~r~~~r~~~ 477 (497) T protein:vir:78 407 GNPVNGGKNIWGVPVVTTPL--IP-LGTILVGHFAPSVIQTARR--EGVTMQM--TNSNGTDFVDG--KVTVRAEERLGL 477 (497) T ss_pred cccccCCceeeceeeEecCC--CC-CCceEEeecccceEEEEEe--cccEEEe--ecccchhhhcC--cEEEEEEEeecc Confidence 0 00112455433221 10 11110111111 122222 3333322 11111224443 333456889988 Q ss_pred EEEEcCceEEeeecCC Q lcl|NC_019525. 328 VLAYRPKELLYLDIPV 343 (343) Q Consensus 328 v~v~yP~a~~Y~D~~~ 343 (343) .++.|++++++++.- T Consensus 478 -~v~~p~A~~~l~~~~ 492 (497) T protein:vir:78 478 -LVYRPSAFQLIQLKK 492 (497) T ss_pred -eeeccccEEEEEecC Confidence 778999999999987 No 28 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=97.83 E-value=6.8e-06 Score=48.92 Aligned_cols=291 Identities=10% Similarity=-0.037 Sum_probs=139.9 Q ss_pred CCceeeecCCchHHHHHHHHhhhcc-chhh------hhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCc Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAKIAG-VIQR------LCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAW 73 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~~~~-~~~~------~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w 73 (343) |.|.-+.... .+.-.-.. ..+. ...+.+..+.-.+ +-..|.+...+.-.-+++.++....+ . 
T Consensus 1 ~~~~~~~~~~-------~~~f~~~~~~~~~~~a~~~~~~~~~~~liP~~---~~~~ii~~~~~~s~l~~l~~~~~~~~-~ 69 (324) T protein:vir:93 1 MEQTQKLKLN-------LQHFASNNVKPQVFNPDNVMMHEKKDGTLLND---FTTPILQEVMENSKIMQLGKYEPMEG-T 69 (324) T ss_pred CchhHHHHHH-------HHHHHHhhhhhhhcccccccccCCCcceechh---HHHHHHHHHHhhchhhhhcceeeccC-C Confidence 6665433322 22111000 0011 1112222222222 22334333222222333333221111 1 Q ss_pred eeEEEeeeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHH Q lcl|NC_019525. 74 STMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKN 153 (343) Q Consensus 74 ~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a 153 (343) .-.+-..+....|.+++.| ..+|.-+..+++.+...+.++.-+.+|.+-|+.+. .+|.+.-.+...++ T Consensus 70 ~~~ip~~~~~~~a~~v~Eg--------~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~a 137 (324) T protein:vir:93 70 EKKFTFWADKPGAYWVGEG--------QKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEA 137 (324) T ss_pred ceEEEEEecCcceeeecCC--------ccccccccceeEEEEEeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHH Confidence 1123233334445555432 46888888999999999999998888887666543 35777777777788 Q ss_pred HHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHH Q lcl|NC_019525. 154 WDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTG 233 (343) Q Consensus 154 ~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~ 233 (343) +...+|+.++.|........|+++....+...+.+ +.| .+||.+++..+... ...++.++|.++.|.. 
T Consensus 138 ia~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~-----~~~----~~~i~~~~~~l~~~---~~~~~~~v~n~~~~~~ 205 (324) T protein:vir:93 138 FYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKG-----DFT----QDNIIDLEALLEDD---ELEANAFISKTQNRSL 205 (324) T ss_pred HHHHHHHHHhcCCCCCCcCccccccccccceeccc-----ccc----HHHHHHHHHhhhhc---cCCCCEEEEcHHHHHH Confidence 88888899999954332234566554432222211 223 34566666655433 2356789999999999 Q ss_pred HhccccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCc------ceEEEecCCchhhc Q lcl|NC_019525. 234 LAGAASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNE------DSLRMDIPVDYTST 307 (343) Q Consensus 234 L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~------~~v~~~iP~~~~~l 307 (343) |.+-+ +..+.-+ +..-....-.|.|+...+ +...++..+++-+.+. +-+++++--..... T Consensus 206 L~~l~--d~~G~~~---~~~~~~~~l~G~PVv~~~---------~~~~~~~~i~~gdfs~~~~~~~~~~~i~~~~~~~~~ 271 (324) T protein:vir:93 206 LRKIV--DPETKER---IYDRNSDSLDGLPVVNLK---------SSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLS 271 (324) T ss_pred HHHhh--CCCCCee---ecCCCCCcccceeeEeec---------CCCCCcceEEEEecceEEEEEecCcEEEEeeccccc Confidence 96443 3223222 111111222355653221 1112233333332221 11222210000000 Q ss_pred c----------hhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 308 L----------ANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 308 ~----------p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) . -.+.+- ..+-+++|+|+. +..|.+++.+.... 
T Consensus 272 ~~~~~~~~~~~~f~~n~--~~~r~~~r~d~~-v~~~~a~~~l~~a~ 314 (324) T protein:vir:93 272 TVKNEDGTPVNLFEQDM--VALRATMHVALH-IADDKAFAKLVPAD 314 (324) T ss_pred ccccccccchhhhhcCc--EEEEEEEEeccE-EecccceEEEeccc Confidence 0 023322 334457778655 66799999987655 No 29 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=97.83 E-value=4.3e-06 Score=49.99 Aligned_cols=282 Identities=10% Similarity=-0.024 Sum_probs=144.0 Q ss_pred hhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEeeeccchhhhhhcccccCCcccccccccc Q lcl|NC_019525. 28 QRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVD 107 (343) Q Consensus 28 ~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd 107 (343) .+...+++..+...+ +...|++.-.++-.-+++.++..... ....+..++.-+.|.+++.+ .++|.-+ T Consensus 1 ma~~t~~~G~lip~~---~~~~ii~~l~~~s~i~~l~~~~~~~~-~~~~~p~~~~~~~a~wv~Eg--------~~~~~s~ 68 (300) T protein:vir:95 1 MSEAQLSKGNLFNPE---LVTKVINKVKGHSSIAKLSPQKPIPF-NGQREFVFDFDSDIDIVAEN--------GKKTHGG 68 (300) T ss_pred CcccccCCcceechh---hHHHHHHHHHhhhhhhhhcceeeccC-CceEEEEEecCcceEEeeCC--------ccccccc Confidence 333334443443332 23334444333333333444322111 12233333333445555433 4788888 Q ss_pred ccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeeccc----cceeeeeecCCcee Q lcl|NC_019525. 108 TGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDN----ANVKGLLTQTGNVV 183 (343) Q Consensus 108 ~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~----~g~~GLlN~p~v~~ 183 (343) ..+++...+.++++.-..+|.+=|++.--.. .+|.+.-.+...+++...+++.+++|.++. .++.|..+.+++.. 
T Consensus 69 ~~f~~v~l~~~k~~~~~~iS~ell~~~~d~~-~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~ 147 (300) T protein:vir:95 69 VSLDPVTIVPLKVEYGARVSDEFLHASEEAK-VDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVT 147 (300) T ss_pred ccceeeEeeeEEEEEeehhhHHHhccCCCCH-HHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccc Confidence 9999999999999998888876554321122 468888888888999999999999995332 23445555555433 Q ss_pred ecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCCCcchhhhh-HHHhhcchhcCCc Q lcl|NC_019525. 184 NNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTKQ-VLEDTFKEITRNS 262 (343) Q Consensus 184 ~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~-~l~~n~~~~~~g~ 262 (343) ..+.++ .+++ .++|.+++..+.... ..|+.++|-|+.+..|..-+ ++.+..+.. ......+..-.|. T Consensus 148 ~~~~~~----~~~~---~~~i~~~~~~~~~~~---~~~~~~vmn~~~~~~L~~lk--d~~G~~i~~~~~~~~~~~~l~G~ 215 (300) T protein:vir:95 148 QTVPFK----DTNP---DESMEDAVGMIDGSE---RDITGAILDPIFTTALSKMK--NAEGGKLYPELAWGGVPDAINGL 215 (300) T ss_pred eeeccc----ccch---HHHHHHHHHHhhhcC---CCccEEEECHHHHHHHHHhh--ccCCCeeccCccccCCCceecce Confidence 222111 1222 356666666554322 35778999999999996443 444433321 1111112222355 Q ss_pred ceEEeechhhhhcccccCCCccEEEEEEcC-------cceEEEecCCchhhc--c---hhhcCCceEEeceeeeeccEEE Q lcl|NC_019525. 263 SFEILPCVYADKITAQVPAVAKRYALYNDN-------EDSLRMDIPVDYTST--L---ANSVNNFQFQNAAYGQFTGVLA 330 (343) Q Consensus 263 ~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d-------~~~v~~~iP~~~~~l--~---p~~~~~l~~~v~~~~r~GGv~v 330 (343) |+.+- ........+.++.+++-+-+ .+.+++++- +.... . -++.+- .-+-+++|+| ..+ T Consensus 216 Pv~~s-----~~v~~~~~~~~~~~~~GDf~~~~~~~~~~~~~~~v~-~~~~~d~~~~~~f~~~~--v~~r~~~r~d-~~v 286 (300) T protein:vir:95 216 AVDKN-----RTVSYSQTDPKNTAIVGDFETMFKWGYAKEVPMEII-KYGDPDNSGRDLKGYNQ--IYIRCEAYIG-WGI 286 (300) T ss_pred eeEEe-----cCCCCCCCCCccEEEEeeccceEEEEEecccEEEEe-eccCCCCcchhhhhcCc--EEEEEEEeec-cee Confidence 54332 22323333344444433322 222333321 00000 0 023322 2234577775 456 Q ss_pred EcCceEEeeecCC Q lcl|NC_019525. 
331 YRPKELLYLDIPV 343 (343) Q Consensus 331 ~yP~a~~Y~D~~~ 343 (343) +.|.+++.+--.= T Consensus 287 ~~~~a~~~l~~~~ 299 (300) T protein:vir:95 287 MDAASFARIVKTG 299 (300) T ss_pred ecccceEEEecCC Confidence 6699999984444 No 30 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=97.82 E-value=9.2e-06 Score=48.19 Aligned_cols=297 Identities=9% Similarity=-0.046 Sum_probs=140.3 Q ss_pred CCceeeecCCchHHHHHHHHhhhcc-chhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEe Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAKIAG-VIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTT 79 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~~~~-~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~ 79 (343) |||.=.++-. .+-..++......- +.-....+.+......+ +-..|++.-.+.-.-++++++.+..+ ....+.. T Consensus 1 ~~k~~~~~~~-~~~~~~~~~~~~~~~a~~~~~~~~~~~lip~~---~~~~ii~~~~~~s~l~~~~~~~~~~~-~~~~~p~ 75 (324) T protein:vir:99 1 MEQTQKLKLN-LQHFASNNVKPQVFNPDNVMMHEKKDGTLLND---FTTPILQEVMENSKIMRLGKYEPMEG-TEKKFTF 75 (324) T ss_pred CCCchHhhHH-HHHHHHHhhhhhhccccceeccCCCcceechh---HHHHHHHHHHhhchhhhhcceeeccC-CceEEEE Confidence 8887333321 11011111110110 11111122222222222 22334333222222333333222111 1223333 Q ss_pred eeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhh Q lcl|NC_019525. 80 YRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQ 159 (343) Q Consensus 80 ~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n 159 (343) .+..+.|.+++.+ ..+|..+..+.+.....+.++.-..+|.+-|+.+. 
.++.+.-.....+++...++ T Consensus 76 ~~~~~~a~~v~Eg--------~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~ai~~~~d 143 (324) T protein:vir:99 76 WADKPGAYWVGEG--------QKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFD 143 (324) T ss_pred EecCcceeEeccC--------ccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHH Confidence 3334456666543 46888889999999999999999998987776543 35777777777888888889 Q ss_pred heEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhcccc Q lcl|NC_019525. 160 EIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAAS 239 (343) Q Consensus 160 ~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~ 239 (343) +.+++|.....--.|+++........+. .+.|. +||.+++..+... -..++.++|.|+.|..|.+-+ T Consensus 144 ~~~l~G~g~~~~~~~~~~~~~~~~~~~~-----~~~~~----~~i~~~~~~l~~~---~~~~~~~v~n~~~~~~L~~l~- 210 (324) T protein:vir:99 144 EAGILNQGNNPFGKSIAQSIEKTNKVIK-----GDFTQ----DNIIDLEALLEDD---ELEANAFISKTQNRSLLRKIV- 210 (324) T ss_pred HHhhhcCCCCccCccccccccccceecc-----ccCCH----HHHHHHHHhhhhc---cCCCCEEEEcHHHHHHHHHhh- Confidence 9999995433212355554332111111 12333 4455555554322 235678999999999996432 Q ss_pred CCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcC------cceEEEecCCchhhcc----- Q lcl|NC_019525. 240 ADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDN------EDSLRMDIPVDYTSTL----- 308 (343) Q Consensus 240 s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d------~~~v~~~iP~~~~~l~----- 308 (343) +..+..+. .......-.|.|+.+.+ . ...++..+++-+.+ .+.+.+++--...... 
T Consensus 211 -d~~g~~~~---~~~~~~~l~G~PVv~~~-----~----~~~~~~~~i~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 277 (324) T protein:vir:99 211 -DPETKERI---YDRNSDTLDGLPVVNLK-----S----SNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNED 277 (324) T ss_pred -cCCCceee---cCCCCccccceeEEeec-----C----CCCCcceEEEEecccEEEEEecCcEEEEeeccccccccccc Confidence 32322221 11112222466653322 1 11122333322221 1222222110100000 Q ss_pred -----hhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 309 -----ANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 309 -----p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) -.+++- ..+-++.|+++... .|++++.+..-. T Consensus 278 ~~~~~~f~~~~--~~~r~~~r~d~~v~-~~~a~~~lt~a~ 314 (324) T protein:vir:99 278 GTPVNLFEQDM--VALRATMHVALHIA-DDKAFAKLVPAD 314 (324) T ss_pred ccchhhhhcCc--EEEEEEEEEccEEe-cccceEEEEecc Confidence 022322 23345778866655 599999987765 No 31 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=97.82 E-value=8.7e-06 Score=48.32 Aligned_cols=297 Identities=9% Similarity=-0.039 Sum_probs=141.4 Q ss_pred CCceeeecCCchHHHHHHHHhhhc-cchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEe Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAKIA-GVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTT 79 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~~~-~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~ 79 (343) |+|.=.++-. .+-..++...... .+.-....+.+..+...++ -..|++.-.+.-.-+.++++.+..+ ....+-. 
T Consensus 1 ~~~~~~~~~~-~~~f~~~~~~~~~~~a~~~~~~~~~~~lip~~~---~~~ii~~~~~~s~l~~l~~~~~~~~-~~~~~p~ 75 (324) T protein:vir:96 1 MEQTQKLKLN-LQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDF---TTPILQEVMENSKIMQLGKYEPMEG-TEKKFTF 75 (324) T ss_pred CCcchhhhHH-HHHHHHhhhhhhhcccccccccCCCcceechhH---HHHHHHHHHhhchhhhhcceeeccC-CceEEEE Confidence 8776444322 1111111111000 1111111122222333332 2334433222322333333322111 1222333 Q ss_pred eeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhh Q lcl|NC_019525. 80 YRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQ 159 (343) Q Consensus 80 ~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n 159 (343) .+..+.|++++.+ ..+|.-+..+.+.+...+.++.-+.+|.+-|+.+. .+|.+.-.+...+++...+| T Consensus 76 ~~~~~~a~~v~Eg--------~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~----~~l~~~i~~~l~~aia~~~d 143 (324) T protein:vir:96 76 WADKPGAYWVGEG--------QKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFD 143 (324) T ss_pred EecCcceeeecCC--------ccccccccceeEEEEEeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHH Confidence 3334455665433 46888889999999999999988888876666442 36778888888888888999 Q ss_pred heEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhcccc Q lcl|NC_019525. 160 EIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAAS 239 (343) Q Consensus 160 ~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~ 239 (343) +.+++|......-.|+++....... ....+.|.++|+ +++..+... 
...++.++|.++.+..|..-+ T Consensus 144 ~~~l~G~g~~~~~~~~~~~~~~~~~-----~~~~~~~~~~i~----~~~~~i~~~---~~~~~~~i~n~~~~~~L~~lk- 210 (324) T protein:vir:96 144 EAGILNQGNNPFGKSIAQSIKKTNK-----VIKGDFTQDNII----DLEALLEDD---ELEANAFISKTQNRSLLRKIV- 210 (324) T ss_pred HHhhhcCCCCCcCccccccccccce-----ecccccchHHHH----HHHHhhhhc---cCCCCEEEEcHHHHHHHHHhh- Confidence 9999995433222345443222111 112234445444 444444222 235778999999999997543 Q ss_pred CCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcC------cceEEEecCCchhhcc----- Q lcl|NC_019525. 240 ADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDN------EDSLRMDIPVDYTSTL----- 308 (343) Q Consensus 240 s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d------~~~v~~~iP~~~~~l~----- 308 (343) +..+..++ ..-....-.|.|+.+-+ +...++..+++-+.+ .+-+++++--...... T Consensus 211 -d~~G~~~~---~~~~~~~l~G~PV~~~~---------~~~~~~~~~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~ 277 (324) T protein:vir:96 211 -DPETKERI---YDRNSDSLDGLPVVNLK---------SSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNED 277 (324) T ss_pred -CCCCCeee---cCCCCCcccceeeEeec---------CCCCCcceEEEEecceEEEEEecCcEEEEeeccccccccccc Confidence 33333221 11111122456653321 111122223222221 1222222211100000 Q ss_pred -----hhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 309 -----ANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 309 -----p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) -++++-. .+-+++|+++. ++.|.+++++-... 
T Consensus 278 ~~~~~~~~~n~v--~~r~~~r~d~~-v~~~~a~~~l~~a~ 314 (324) T protein:vir:96 278 GTPVNLFEQDMV--ALRATMHVALH-IADDKAFAKLVPAD 314 (324) T ss_pred ccchhhhhcCcE--EEEEEEEeccE-EecccceEEEeccc Confidence 0233222 33457778665 66699999998766 No 32 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=97.82 E-value=4e-06 Score=50.16 Aligned_cols=288 Identities=8% Similarity=-0.062 Sum_probs=143.8 Q ss_pred hhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEeeeccchhhhhhcccccCCccccccccccc Q lcl|NC_019525. 29 RLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVDT 108 (343) Q Consensus 29 ~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~ 108 (343) +.....|..+.-.+ +...|++.-.+.-.-+++.++.... ...-++...+.-+.|.+++.| ..+|.-+. T Consensus 1 mat~~~gg~lvP~~---~~~~ii~~~~~~s~i~~~~~~i~~~-~~~~~~p~~~~~~~a~wv~Eg--------~~~~~~~~ 68 (311) T protein:vir:81 1 MVALATGTFQLPKH---LVPGVWQKAQGQSVLARLSMAEPQE-FGEQQYMTLTAPPRGEVVGEG--------AQKSESTA 68 (311) T ss_pred CceecCCceEcchh---HHHHHHHHHHhcchhhhhcceeecC-CCceEEEEEeCCceeEEeecC--------cccccccc Confidence 22222332222222 2334444433333334444432211 112233334444455555433 46888888 Q ss_pred cccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeeccc--cceeeeeecCCceeecc Q lcl|NC_019525. 109 GVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDN--ANVKGLLTQTGNVVNNT 186 (343) Q Consensus 109 ~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~--~g~~GLlN~p~v~~~~a 186 (343) .+++.+...++++.-..+|.+=|+..--.. .+|.+.-...+.+++...+|+.+++|..+. .+..|+++...-+.. . 
T Consensus 69 ~f~~v~l~~~kl~~~~~iS~ell~~~~d~~-~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~-~ 146 (311) T protein:vir:81 69 TFAPVTAIPRKVQVTQRFSQEVKWADESRQ-LGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTN-I 146 (311) T ss_pred eeeEEEEeeEEEEEeehhhHHHhhcCcccH-HHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccce-e Confidence 899999999999887777766444322222 368888888888999999999999996432 234566654221111 0 Q ss_pred cCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCCCcchhhh-hHHHhhcchhcCCcceE Q lcl|NC_019525. 187 FLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTK-QVLEDTFKEITRNSSFE 265 (343) Q Consensus 187 ~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~-~~l~~n~~~~~~g~~l~ 265 (343) ....+.+...+..+|..++..+... ...|+.++|-|..+..|..- -+..+.-+. .......+..-.|.|+. T Consensus 147 ---~~~~~~~~~~~~~~i~~~~~~~~~~---~~~~~~~vmn~~~~~~l~~l--kd~~G~~l~~~~~~~~~~~tl~G~Pv~ 218 (311) T protein:vir:81 147 ---VELTTGTSATPDLAVEAAVGLVLGD---NLSPDGVALDNTFSFMLATQ--RDSQGRKLYPELGFGTDVASFAGLNAA 218 (311) T ss_pred ---eeecccccchHHHHHHHHHHHhhhc---CCCceEEEEcHHHHHHHHhh--hccCCCeeecCccccCCCceecceeEE Confidence 1111222223345566666655433 23567899999999999543 233332222 11111122222355554 Q ss_pred Eeechhhhhcc---------cccCCCccEEEEEEcCcceEEEecCCchhhcc---------hhhcCCceEEeceeeeecc Q lcl|NC_019525. 266 ILPCVYADKIT---------AQVPAVAKRYALYNDNEDSLRMDIPVDYTSTL---------ANSVNNFQFQNAAYGQFTG 327 (343) Q Consensus 266 I~~~~~~~~~~---------~~g~gg~drmv~Y~~d~~~v~~~iP~~~~~l~---------p~~~~~l~~~v~~~~r~GG 327 (343) +-.- +.+.. .+..+++.++++.+-+.=.+.+.-.+.+.... -.+.+- ..+-++.|+|+ T Consensus 219 ~~~~--i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~--v~~r~~~r~d~ 294 (311) T protein:vir:81 219 VSDT--VRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQ--IAIRAEVVYGI 294 (311) T ss_pred eccc--ccccccccccccchhcccCCccEEEEEecccEEEEEeccceEEEeccCCCCcchhhhhcCc--EEEEEEEEecc Confidence 3210 10000 01123445666655543222222222222111 023322 23345677765 Q ss_pred EEEEcCceEEeeecCC Q lcl|NC_019525. 
328 VLAYRPKELLYLDIPV 343 (343) Q Consensus 328 v~v~yP~a~~Y~D~~~ 343 (343) .++.|++++++--.+ T Consensus 295 -~v~~~~a~~~l~~a~ 309 (311) T protein:vir:81 295 -GIMSTDAFAVVRDAD 309 (311) T ss_pred -EeecccceEEEEeec Confidence 566799999997777 No 33 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=97.82 E-value=4e-06 Score=50.18 Aligned_cols=300 Identities=12% Similarity=0.013 Sum_probs=142.4 Q ss_pred CCceeeecCCchHHHHHHH-HhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEe Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQ-EAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTT 79 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~-~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~ 79 (343) ++.|......++ .....+ ...............| .+...++ + ..+++...+...-+.++++.+... ....+-. T Consensus 88 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~g-~~~~~~~--~-~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~ 161 (390) T protein:vir:10 88 FQASAGRWNDRS-ARATMNIKAALNTASTDAAGSAG-ALTTPNR--L-PGFITQPDARLTVRDLIGSGRTDS-ALIEYVQ 161 (390) T ss_pred HHHHHHhhhhhh-hhhhhHHHHHHHhhhcccccccc-cccchhH--H-HHHHHHHHhhchhhhhcceeeccC-CceEEEE Confidence 111100000000 000000 0000000001111222 2333221 2 234444334334444444432211 1222222 Q ss_pred ee-ccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhh Q lcl|NC_019525. 80 YR-SFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGI 158 (343) Q Consensus 80 ~~-~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~ 158 (343) .+ ..+.+.+++.+ ..+|..+..+++....++.++.-+.+|.+=|+. +. 
+|.+--.....+++...+ T Consensus 162 ~~~~~~~a~~v~Eg--------~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d---~~--~l~~~i~~~l~~~~~~~~ 228 (390) T protein:vir:10 162 ETGFVNNAAIVAEG--------ALKPESSLKFAKKTDTTHVIAHTMKATRQILSD---AP--QLASYMNNRLIRGLKVKE 228 (390) T ss_pred EecCCcceeeecCC--------ccccccccceeEEEEeeEEEEEeehhhHHHHHh---HH--HHHHHHHHHHHHHHHHHH Confidence 22 22344444432 357888889999999999999988888753332 22 477777777888888889 Q ss_pred hheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccc Q lcl|NC_019525. 159 QEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAA 238 (343) Q Consensus 159 n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~ 238 (343) |+..++|+-......|++|.++++..... .+..+ ..+++.+++..+...- ..++.++|.|+.|..|..-+ T Consensus 229 ~~~il~G~G~~~~p~Gi~~~~~~~~~~~~----~~~~~---~~~~~~~~~~~l~~~~---~~~~~~v~n~~~~~~L~~lk 298 (390) T protein:vir:10 229 DAEILRGTGANDGLLGLIPQATTYAAPTT----IAGAT---RVDQLRLAMLQASLAE---YPASGIVINPIDWAAIELAK 298 (390) T ss_pred HHHHhhcCCCCcccccccccccccccccc----ccccc---hHHHHHHHHHhhcccc---CCCCEEEEcHHHHHHHHHhh Confidence 99999996555557899999876432211 11112 2455566655554322 24678999999999997543 Q ss_pred cCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccE-EEEEEcCcceEEEecCCchhhcchhhcCCceE Q lcl|NC_019525. 239 SADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKR-YALYNDNEDSLRMDIPVDYTSTLANSVNNFQF 317 (343) Q Consensus 239 ~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~dr-mv~Y~~d~~~v~~~iP~~~~~l~p~~~~~l~~ 317 (343) ++.+.-|+.--.......-.|.|+-+.+ ++. .+..-.|.-+. ..++++ ..+.++.... ....+.+ + . 
T Consensus 299 --d~~g~~l~~~~~~~~~~~l~G~pv~~~~--~~p-~~~~~~gdf~~~~~~~~~--~~~~i~~~~~---~~~~~~~-~-~ 366 (390) T protein:vir:10 299 --DANNQYLIGNARGTLTPTLWGLPVVATQ--AMA-PGEFLVGAFDLAAQIFDQ--WDARVEIGYV---NDDFQRN-M-V 366 (390) T ss_pred --cCCCceeecCCcCcCCceecceeeEEcC--CCC-CCcEEEEeccceEEEEEe--cceEEEEeec---ccccccC-c-E Confidence 4444333211111111112355543321 211 11111111111 122222 3333322111 1122333 2 2 Q ss_pred EeceeeeeccEEEEcCceEEeeecC Q lcl|NC_019525. 318 QNAAYGQFTGVLAYRPKELLYLDIP 342 (343) Q Consensus 318 ~v~~~~r~GGv~v~yP~a~~Y~D~~ 342 (343) .+-++.|+++ .++.|.+++++++. T Consensus 367 ~~r~~~r~d~-~v~~~~a~~~~~~a 390 (390) T protein:vir:10 367 TVLAEERLAL-VVYRPEALISGSFA 390 (390) T ss_pred EEEEEEeecc-EEeccccEEEEEeC Confidence 3345778877 78899999999999 No 34 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=97.81 E-value=5.4e-06 Score=49.45 Aligned_cols=281 Identities=11% Similarity=0.006 Sum_probs=144.6 Q ss_pred hhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEeeeccchhhhhhcccccCCcccccccccccc Q lcl|NC_019525. 30 LCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVDTG 109 (343) Q Consensus 30 ~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~ 109 (343) +.-+.|. +...+ +.+.|++...+.-.-+++.++..... ....+.....-+.|.+++.+ .++|.-+.. T Consensus 1 ma~~gG~-lvp~~---~~~~ii~~~~~~s~i~~l~~~~~~~~-~~~~ip~~~~~~~a~~v~E~--------~~~~~~~~~ 67 (298) T protein:vir:16 1 MVLNKGT-LFDPT---LVTDLISKVAGKSSIARLSAQKPIPF-NGEKVFTFTMDSEIDVVAES--------GKKTHGGVT 67 (298) T ss_pred CcccCcc-eechh---HHHHHHHHHHhhhhhhhhcceeeccC-CceEEEEEecCcceEEecCC--------ccccccccc Confidence 2222232 22222 22334444333333444444322211 12233333444556666432 478888899 Q ss_pred ccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeeccccc----eeeeeecCCceeec Q lcl|NC_019525. 
110 VDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDNAN----VKGLLTQTGNVVNN 185 (343) Q Consensus 110 ~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g----~~GLlN~p~v~~~~ 185 (343) +++.+...++++.-...|.+=|+.+--.. .+|.+.-.+...+++...+++..++|.++..| +.|+....+.+... T Consensus 68 f~~v~l~~~k~a~~~~iS~ell~~s~d~~-~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~ 146 (298) T protein:vir:16 68 LAPQTMVPIKVEYGARISDEFMYASDEEK-INILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQK 146 (298) T ss_pred eeEEEEeeeeEEEeehhhHHHhhcCcccH-HHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcccccccccccccccccc Confidence 99999999999988888877776443232 46888888888888889999999999543222 33333222221111 Q ss_pred ccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCCCcchhhhh-HHHhhcchhcCCcce Q lcl|NC_019525. 186 TFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTKQ-VLEDTFKEITRNSSF 264 (343) Q Consensus 186 a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~-~l~~n~~~~~~g~~l 264 (343) ... ......+.+||.+++..+...- ..+..++|-++.+..|..-+ ++.+..+.. ....-.+..-.|.|+ T Consensus 147 ~~~-----~~~~~~~~~~i~~~~~~~~~~~---~~~~~~vmn~~~~~~l~~lk--d~~G~~i~~~~~~~~~~~~l~G~PV 216 (298) T protein:vir:16 147 VEA-----PRGIADPNGAIENAVELLTGVD---ADVTGIAINPSFRSALAKQK--DLQDNALFPELKWGATPDTINGLPV 216 (298) T ss_pred ccc-----ccccccHHHHHHHHHHHhhhcC---CCccEEEEcHHHHHHHHHhh--ccCCCeeecCcccCCCCceecceee Confidence 111 1112334667888877765432 34667999999999996433 444433321 111111122235555 Q ss_pred EEeechhhhhcccccCCCccEEEEEEcC-------cceEEEecCCchhhc--c---hhhcCCceEEeceeeeeccEEEEc Q lcl|NC_019525. 265 EILPCVYADKITAQVPAVAKRYALYNDN-------EDSLRMDIPVDYTST--L---ANSVNNFQFQNAAYGQFTGVLAYR 332 (343) Q Consensus 265 ~I~~~~~~~~~~~~g~gg~drmv~Y~~d-------~~~v~~~iP~~~~~l--~---p~~~~~l~~~v~~~~r~GGv~v~y 332 (343) .+. ........++++.+++-+-+ .+.+++.+- +.... . -.+.+-.. +-+++|+++ .++. 
T Consensus 217 ~~~-----~~v~~~~~~~~~~~~~GDfs~~~~~~~~~~~~~~~~-~~~~~~~~~~~~f~~~~v~--~ra~~r~d~-~v~~ 287 (298) T protein:vir:16 217 DVN-----KTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVI-QYGDPDNSGLDLKGYNQVY--IRAELFLGW-GILD 287 (298) T ss_pred EEe-----cccccccCCCccEEEEeeccceEEEEEecCceEEEe-eccCCcCcchhhhhcCcEE--EEEEEEEcc-Eeec Confidence 432 23333334455555543332 222233221 00000 0 02232222 344666654 5778 Q ss_pred CceEEeeecCC Q lcl|NC_019525. 333 PKELLYLDIPV 343 (343) Q Consensus 333 P~a~~Y~D~~~ 343 (343) |++++|+--.- T Consensus 288 ~~a~~~l~~at 298 (298) T protein:vir:16 288 ATKFARVTEAN 298 (298) T ss_pred ccceEEEeecC Confidence 99999984433 No 35 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=97.81 E-value=5.3e-06 Score=49.51 Aligned_cols=282 Identities=12% Similarity=0.001 Sum_probs=139.8 Q ss_pred hhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEeeeccchhhhhhcccccCCcccccccccccc Q lcl|NC_019525. 30 LCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVDTG 109 (343) Q Consensus 30 ~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~ 109 (343) +.-+.|..+- .+ +.+.|++...+.-.-+++.++.+... ....+.....-+.|.+++.| .++|.-+.. T Consensus 1 ma~~gG~lip-~~---~~~~ii~~~~~~s~i~~~~~~~~~~~-~~~~~p~~~~~~~a~~v~Eg--------~~~~~~~~~ 67 (298) T protein:vir:94 1 MVLNKGTLFD-PE---LVTDLISKVAGKSSIARLSAQKPIPF-NGEKVFTFTMDSEIDVVAES--------GKKTHGGVT 67 (298) T ss_pred CeeccccccC-hh---HHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEecCcceEEeeCC--------ccccccccc Confidence 2223332222 22 23334444333333444444332211 12233333334455666543 368888888 Q ss_pred ccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeeccccc----eeeeeecCCceeec Q lcl|NC_019525. 
110 VDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDNAN----VKGLLTQTGNVVNN 185 (343) Q Consensus 110 ~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g----~~GLlN~p~v~~~~ 185 (343) +++.....++++.-...|.+=|++..-.. .+|.+.-.....+++...+++..+.|.++..| ..|..+..+.+.. T Consensus 68 f~~v~l~~~k~~~~~~iS~ell~~~~~~~-~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~- 145 (298) T protein:vir:94 68 LAPQTMVPIKVEYGARISDEFMYASDEEK-INILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQ- 145 (298) T ss_pred eeEEEEeeeEEEEeeehhHHHhccCCccH-HHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccccccccc- Confidence 99999999999988888876555332221 35788888888889999999999999543211 1122111111111 Q ss_pred ccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCCCcchhhh-hHHHhhcchhcCCcce Q lcl|NC_019525. 186 TFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTK-QVLEDTFKEITRNSSF 264 (343) Q Consensus 186 a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~-~~l~~n~~~~~~g~~l 264 (343) . .-.......+.+|+.+++..+...- ..+..++|-|+.+..|...+ ++.+.-+. .....-.+..-.|.|+ T Consensus 146 ~----~~~~~~~~~~~~~i~~~~~~~~~~~---~~~~~~vmn~~~~~~l~~lk--d~~G~~l~~~~~~~~~~~tl~G~PV 216 (298) T protein:vir:94 146 K----VEAPRGIADPNGAIENAVELLTGVD---ADVTGIAINPSFRSALAKQK--DLQGNALFPELKWGATPDTINGLPV 216 (298) T ss_pred c----cccccccccHHHHHHHHHHhhhhcC---CCccEEEEcHHHHHHHHHhh--ccCCCeeecCcccCCCCceecceee Confidence 1 1111223346778888888765542 35678999999999996533 33333221 1111111111235554 Q ss_pred EEeechhhhhcccccCCCccEEEEEEcCcce-EEEecCCchhhcc----------hhhcCCceEEeceeeeeccEEEEcC Q lcl|NC_019525. 265 EILPCVYADKITAQVPAVAKRYALYNDNEDS-LRMDIPVDYTSTL----------ANSVNNFQFQNAAYGQFTGVLAYRP 333 (343) Q Consensus 265 ~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~-v~~~iP~~~~~l~----------p~~~~~l~~~v~~~~r~GGv~v~yP 333 (343) -+ .......+.++++.+++-+-+.-+ +.+.-.+.+.... 
-++.+-..+ -++.|+| +.++.| T Consensus 217 ~~-----~~~v~~~~~~~~~~~~~Gdfs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~--r~~~r~~-~~~~~~ 288 (298) T protein:vir:94 217 DV-----NKTVSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYI--RAELFLG-WGILDA 288 (298) T ss_pred EE-----ecccccccCCCccEEEEeeccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEE--EEEEEec-cEeecc Confidence 32 233333333344554443332211 1111111111111 123332333 3466664 566779 Q ss_pred ceEEeeecCC Q lcl|NC_019525. 334 KELLYLDIPV 343 (343) Q Consensus 334 ~a~~Y~D~~~ 343 (343) ++++++---- T Consensus 289 ~a~~~l~~~t 298 (298) T protein:vir:94 289 TKFARVTEAN 298 (298) T ss_pred cceEEEEecC Confidence 9998873333 No 36 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=97.80 E-value=3.7e-06 Score=50.33 Aligned_cols=300 Identities=11% Similarity=0.003 Sum_probs=138.5 Q ss_pred CCceeeecCCchHHHHHHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceec-CCCCceeEEEe Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRV-GEGAWSTMLTT 79 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~-~~~~w~~~~~~ 79 (343) ++.|.....++............+........+.|. +...+ +...|++...+.-.-+.++++.. ..+ ...+-. T Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~-lip~~---~~~~ii~~~~~~~~i~~~~~~~~~~~~--~~~~~~ 161 (390) T protein:vir:97 88 FQASTGRWNDRSARATMNIKAALNTASTDAAGSAGA-LTTPN---RLPGFITPPDARLTVRDLIGSGRTDSA--LIEYVQ 161 (390) T ss_pred HHHHHHHhhhhhhhhhhHHHHHHHhhhccccccccc-ccchh---hhHHHHHHHhhhhhhHhhcceeeccCC--ceEEEE Confidence 111111111110000000000001111111112222 22211 11234433233323333333221 222 222222 Q ss_pred eec-cchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhh Q lcl|NC_019525. 
80 YRS-FSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGI 158 (343) Q Consensus 80 ~~~-vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~ 158 (343) .+. .+.+.+++.| ..+|.-+..+++.+...+.++.-..+|.+=|+. +. +|.+.-.....+++...+ T Consensus 162 ~~~~~~~a~~v~Eg--------~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d---s~--~l~~~i~~~la~a~~~~~ 228 (390) T protein:vir:97 162 ETGFVNNAAIVAEG--------ALKPESSLKFAKKTDTTHVIAHTMKATRQILSD---AP--QLASYMNNRLIRGLKVKE 228 (390) T ss_pred EecCCcceeeecCC--------ccccccccceeEEEEeeeeEEEeehhhHHHHHh---HH--HHHHHHHHHHHHHHHHHH Confidence 222 2345555432 357888888999999999999888888753332 22 477777788888888889 Q ss_pred hheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccc Q lcl|NC_019525. 159 QEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAA 238 (343) Q Consensus 159 n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~ 238 (343) |+-.++|+.......|++|.+++..... ..+.+...+++.+++..+...- ..++.++|-|+.|..|.+-+ T Consensus 229 d~a~l~G~g~~~~p~Gi~~~~~~~~~~~-------~~~~~~~~d~~~~~~~~~~~~~---~~~~~~v~n~~~~~~L~~lk 298 (390) T protein:vir:97 229 DAEILRGTGANDGLLGLIPQATTYAAPT-------TIAGATRVDQLRLAMLQASLAE---YPASGIVINPIDWAAIELAK 298 (390) T ss_pred HHHHhhcCCCCccccceeeccccccccc-------cccccchHHHHHHHHHhhcccc---CCCCEEEEcHHHHHHHHHhh Confidence 9999999655545789999887543211 1122333556666666553222 25678999999999997543 Q ss_pred cCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccE-EEEEEcCcceEEEecCCchhhcchhhcCCceE Q lcl|NC_019525. 239 SADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKR-YALYNDNEDSLRMDIPVDYTSTLANSVNNFQF 317 (343) Q Consensus 239 ~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~dr-mv~Y~~d~~~v~~~iP~~~~~l~p~~~~~l~~ 317 (343) ++.+.-+..--.......-.|.|+.+.+ ++. .+..-.|.-+. ..++++ .-+.+..--. ....+++-.. 
T Consensus 299 --d~~G~~l~~~~~~~~~~~l~G~pV~~~~--~~~-~~~~~~gd~~~~~~~~~~--~~~~i~~~~~---~~~f~~~~~~- 367 (390) T protein:vir:97 299 --DANNQYLIGNARGTLTPTLWGLPVVATQ--AMA-PGEFLVGAFDLAAQIFDQ--WDARVEIGYV---NDDFQRNMVT- 367 (390) T ss_pred --cCCCceeecCccCCCCceecceeeEEcC--CCC-CCcEEEEeccceEEEEEe--cceEEEEeec---ccccccCcEE- Confidence 4444333211011111111355543321 211 01111111111 112222 1111211100 0112233222 Q ss_pred EeceeeeeccEEEEcCceEEeeecC Q lcl|NC_019525. 318 QNAAYGQFTGVLAYRPKELLYLDIP 342 (343) Q Consensus 318 ~v~~~~r~GGv~v~yP~a~~Y~D~~ 342 (343) +-++.|+++ .++.|.+++++++. T Consensus 368 -~r~~~r~d~-~v~~~~a~v~~~~a 390 (390) T protein:vir:97 368 -VLAEERLAL-VVYRPEALITGSFA 390 (390) T ss_pred -EEEEEeecc-EEeccccEEEEEeC Confidence 234566655 67889999999999 No 37 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=97.79 E-value=1.2e-05 Score=47.52 Aligned_cols=308 Identities=10% Similarity=-0.027 Sum_probs=149.0 Q ss_pred CCce---------------eeecCCchHHHHH-------H------------HHhhh-------ccchhhhhhhhhhhhh Q lcl|NC_019525. 1 MKKF---------------VIRNSKGEKILLN-------A------------QEAKI-------AGVIQRLCNDLGFEID 39 (343) Q Consensus 1 ~~~~---------------~~~~~~~~~~~~~-------a------------~~~~~-------~~~~~~~~~d~~~~f~ 39 (343) +.+- +....+.....-. + ..... ...........|. ++ T Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg-~~ 143 (435) T protein:vir:14 65 AAVPVDPNPTAVAAPAAAPVHAQPKALEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGG-VL 143 (435) T ss_pred hcccccchhhhhhhccccccccccchhhhhHHHHHHHHHHHHhhcchhhHHHHHHHhhhhhhhhhhhcccCCcCCCc-cc Confidence 0000 0000000000000 0 00000 0000001111121 22 Q ss_pred HHHHHHhhhhhhhcccccccchh----hcceecCCCCceeEEEeeeccchhhhhhcccccCCccccccccccccccceee Q lcl|NC_019525. 
40 VTTLTTLMKKIIEQKFFEISPAD----YMPIRVGEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSI 115 (343) Q Consensus 40 ~~qL~~i~~~iye~~~~~l~~~~----~~pv~~~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~ 115 (343) +. +.+...|++...+....+. .+|. ..+ ...+...+..+.+.+++.+ +.+|.-+..+...+. T Consensus 144 vP--~~~~~~ii~~l~~~~~i~~~~~~~~~~--~~~--~~~~p~~~~~~~a~~v~E~--------~~~~~~~~~f~~i~~ 209 (435) T protein:vir:14 144 VP--ENLSSEVIELLRPKSVVRKLGARTLPL--SNG--NITIPRLKGGAIVGYIGAD--------TDIPTTQQQFDDLKL 209 (435) T ss_pred cc--hhHHHHHHHHHhhhchhhhhcceeeec--CCC--ceEEEEEeCCcceeeeccC--------ccccccccceeEEEe Confidence 22 2233445554333322222 2222 111 1222233333444444332 357888888999999 Q ss_pred eeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccC Q lcl|NC_019525. 116 KTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSM 195 (343) Q Consensus 116 ~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~k 195 (343) ..+.++.-+.+|.+=|+.|.. ..+|.+.-......++...+|+..++|+-......|+++...++..... -... T Consensus 210 ~~~k~~~~~~iS~ell~ds~~--~~~l~~~i~~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~----~~~~ 283 (435) T protein:vir:14 210 TAKKMAALVPIANDLIKYAGV--NPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPSNVITA----SDAS 283 (435) T ss_pred eeEEEEEeehhhHHHHHhhcc--CHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccceecc----cccc Confidence 999999988888766665432 1247777788888888889999999995433346799987665332221 1234 Q ss_pred CHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhc Q lcl|NC_019525. 196 TPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKI 275 (343) Q Consensus 196 T~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~ 275 (343) |.+.+..|+.+++..+.....+. .+..++|.+..|..|..-+ +..+.-++ .......-.|.|+.+.. .+.. 
T Consensus 284 ~~~~~~~~~~~l~~~~~~~~~~~-~~~~~v~n~~~~~~L~~lk--d~~G~~l~---~~~~~g~l~G~Pv~~~~--~~p~- 354 (435) T protein:vir:14 284 TLQKIETDLGKVILALENADANL-TQPGWIMAPRTFRFLEGLR--DGNGNKVY---PELANGMLKGYPVGKTT--QVPI- 354 (435) T ss_pred chhhHHHHHHHHHHHhhhccccc-cCCEEEEcHHHHHHHHHhh--ccCCceec---cCCCCCeeecceeEeec--cccc- Confidence 67778888999988887664432 3567999999999996443 33333332 11111122466654432 1111 Q ss_pred ccccCCCcc-EEEEEEcCcceE-EEecCCchhhcc-------------hhhcCCceEEeceeeeeccEEEEcCceEEeee Q lcl|NC_019525. 276 TAQVPAVAK-RYALYNDNEDSL-RMDIPVDYTSTL-------------ANSVNNFQFQNAAYGQFTGVLAYRPKELLYLD 340 (343) Q Consensus 276 ~~~g~gg~d-rmv~Y~~d~~~v-~~~iP~~~~~l~-------------p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D 340 (343) . .+.+++. .+++-+-+ +++ ...-++.+.... -.+++-.. +-+++|+++ .++.|++++++. T Consensus 355 ~-~~~~~~~~~i~~gd~s-~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~--~r~~~r~d~-~~~~~~a~~~l~ 429 (435) T protein:vir:14 355 N-LGETGKESEIYFTDFG-DVFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQTL--IRVIAKNDF-GPRHVESIAVLA 429 (435) T ss_pred c-ccCCCccceEEEeecc-cEEEEEecccEEEEeccccccccccchhhhhhcChhh--eeeeeeeCc-eeecccceEEEe Confidence 0 1111221 22222222 121 111111111110 12232222 345788887 889999999885 Q ss_pred -cCC Q lcl|NC_019525. 341 -IPV 343 (343) Q Consensus 341 -~~~ 343 (343) ++. T Consensus 430 ~~~~ 433 (435) T protein:vir:14 430 GVAW 433 (435) T ss_pred cCCC Confidence 444 No 38 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=97.77 E-value=1.2e-05 Score=47.63 Aligned_cols=303 Identities=10% Similarity=-0.039 Sum_probs=144.8 Q ss_pred CCcee-------eecCCchHHHHHHHHhhh-----------ccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchh Q lcl|NC_019525. 1 MKKFV-------IRNSKGEKILLNAQEAKI-----------AGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPAD 62 (343) Q Consensus 1 ~~~~~-------~~~~~~~~~~~~a~~~~~-----------~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~ 62 (343) .++-. 
..+..+++...+-.+... ...+. ...+.+..+...+ +...|++.-.+...-+. T Consensus 61 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~g~~i~~~---~~~~ii~~~~~~~~l~~ 136 (385) T protein:vir:18 61 EQKLASGAENPGEKKSFSERAAEELIKSWDGKQGTFGAKTFNKSLG-SDADSAGSLIQPM---QIPGIIMPGLRRLTIRD 136 (385) T ss_pred HHHhhccccccchhhhhHHHHHHHHHHHHHHhhccchhhHHHhhhc-cccccCCceecch---hhhHHHHHhhhccchhh Confidence 00000 001111111111111000 00000 1112222222222 22335554444444555 Q ss_pred hcceecCCCCceeEEEeeec-cchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCC Q lcl|NC_019525. 63 YMPIRVGEGAWSTMLTTYRS-FSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWD 141 (343) Q Consensus 63 ~~pv~~~~~~w~~~~~~~~~-vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~ 141 (343) ++++....+ ..-.+...+. .+.+.+++ .+ ..+|..+..+.+.....+..+..+.+|.+=|+.+ . . T Consensus 137 ~~~~~~~~~-~~~~~~~~~~~~~~a~~v~-------E~-~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~---~--~ 202 (385) T protein:vir:18 137 LLAQGRTSS-NALEYVREEVFTNNADVVA-------EK-ALKPESDITFSKQTANVKTIAHWVQASRQVMDDA---P--M 202 (385) T ss_pred hcceecccC-cceEEEEEecCCcceeeec-------cC-ccccccccceeEEEEeeeeEEEeehhhHHHHhhH---H--H Confidence 555432221 1112222222 23344443 22 4688888899999999999999999996533322 2 3 Q ss_pred cHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecC Q lcl|NC_019525. 142 LITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMP 221 (343) Q Consensus 142 L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p 221 (343) |.+.-.....+++...+|+..+.|+.......|+++.+++...... 
.+.+..+++|.+++..+...- ..+ T Consensus 203 l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~-------~~~~~~~d~i~~~~~~l~~~~---~~~ 272 (385) T protein:vir:18 203 LQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLN-------ATGDTRADIIAHAIYQVTESE---FSA 272 (385) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccc-------ccccchHHHHHHHHHhhcccc---CCC Confidence 6666677777778888889999996555567899998876443221 111223566666666653322 346 Q ss_pred CeEEeCHHHHHHHhccccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCc-cEEEEEEcCcceEEEec Q lcl|NC_019525. 222 NKFTIPESDYTGLAGAASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVA-KRYALYNDNEDSLRMDI 300 (343) Q Consensus 222 ~tl~lp~~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~-drmv~Y~~d~~~v~~~i 300 (343) +.++|.|+.|..|..-+ ++.+..|..-........-.|.|+-+ ..++.. +..-.|.- .-..++++ .-+.+.+ T Consensus 273 ~~~~~~~~~~~~l~~lk--d~~G~~l~~~~~~~~~~~l~G~pV~~--~~~~p~-~~~~~gd~~~~~~~~~~--~~~~v~~ 345 (385) T protein:vir:18 273 SGIVLNPRDWHNIALLK--DNEGRYIFGGPQAFTSNIMWGLPVVP--TKAQAA-GTFTVGGFDMASQVWDR--MDATVEV 345 (385) T ss_pred CEEEEcHHHHHHHHHhh--cCCCceeccCcccCCCceecceeeEE--cCcCCC-CcEEEeecccEEEEEEe--cceEEEE Confidence 79999999999996543 44444332111111111113555432 222221 11111111 11222222 1122221 Q ss_pred CCchhhcchhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 301 PVDYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 301 P~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) .. ......+++ ...+-++.|+++ .++.|.+++.+++.. 
T Consensus 346 ~~--~~~~~~~~~--~~~~~~~~r~~~-~v~~~~a~~~~~~~a 383 (385) T protein:vir:18 346 SR--EDRDNFVKN--MLTILCEERLAL-AHYRPTAIIKGTFSS 383 (385) T ss_pred ec--cccchhhcC--cEEEEEEEeecc-EEecccceEEEEecc Confidence 10 111123343 223345778885 558899999999999 No 39 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=97.77 E-value=1.2e-05 Score=47.63 Aligned_cols=303 Identities=10% Similarity=-0.039 Sum_probs=144.8 Q ss_pred CCcee-------eecCCchHHHHHHHHhhh-----------ccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchh Q lcl|NC_019525. 1 MKKFV-------IRNSKGEKILLNAQEAKI-----------AGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPAD 62 (343) Q Consensus 1 ~~~~~-------~~~~~~~~~~~~a~~~~~-----------~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~ 62 (343) .++-. ..+..+++...+-.+... ...+. ...+.+..+...+ +...|++.-.+...-+. T Consensus 61 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~g~~i~~~---~~~~ii~~~~~~~~l~~ 136 (385) T protein:vir:19 61 EQKLASGAENPGEKKSFSERAAEELIKSWDGKQGTFGAKTFNKSLG-SDADSAGSLIQPM---QIPGIIMPGLRRLTIRD 136 (385) T ss_pred HHHhhccccccchhhhhHHHHHHHHHHHHHHhhccchhhHHHhhhc-cccccCCceecch---hhhHHHHHhhhccchhh Confidence 00000 001111111111111000 00000 1112222222222 22335554444444555 Q ss_pred hcceecCCCCceeEEEeeec-cchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCC Q lcl|NC_019525. 63 YMPIRVGEGAWSTMLTTYRS-FSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWD 141 (343) Q Consensus 63 ~~pv~~~~~~w~~~~~~~~~-vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~ 141 (343) ++++....+ ..-.+...+. .+.+.+++ .+ ..+|..+..+.+.....+..+..+.+|.+=|+.+ . . 
T Consensus 137 ~~~~~~~~~-~~~~~~~~~~~~~~a~~v~-------E~-~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~---~--~ 202 (385) T protein:vir:19 137 LLAQGRTSS-NALEYVREEVFTNNADVVA-------EK-ALKPESDITFSKQTANVKTIAHWVQASRQVMDDA---P--M 202 (385) T ss_pred hcceecccC-cceEEEEEecCCcceeeec-------cC-ccccccccceeEEEEeeeeEEEeehhhHHHHhhH---H--H Confidence 555432221 1112222222 23344443 22 4688888899999999999999999996533322 2 3 Q ss_pred cHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecC Q lcl|NC_019525. 142 LITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMP 221 (343) Q Consensus 142 L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p 221 (343) |.+.-.....+++...+|+..+.|+.......|+++.+++...... .+.+..+++|.+++..+...- ..+ T Consensus 203 l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~-------~~~~~~~d~i~~~~~~l~~~~---~~~ 272 (385) T protein:vir:19 203 LQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLN-------ATGDTRADIIAHAIYQVTESE---FSA 272 (385) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccc-------ccccchHHHHHHHHHhhcccc---CCC Confidence 6666677777778888889999996555567899998876443221 111223566666666653322 346 Q ss_pred CeEEeCHHHHHHHhccccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCc-cEEEEEEcCcceEEEec Q lcl|NC_019525. 222 NKFTIPESDYTGLAGAASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVA-KRYALYNDNEDSLRMDI 300 (343) Q Consensus 222 ~tl~lp~~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~-drmv~Y~~d~~~v~~~i 300 (343) +.++|.|+.|..|..-+ ++.+..|..-........-.|.|+-+ ..++.. 
+..-.|.- .-..++++ .-+.+.+ T Consensus 273 ~~~~~~~~~~~~l~~lk--d~~G~~l~~~~~~~~~~~l~G~pV~~--~~~~p~-~~~~~gd~~~~~~~~~~--~~~~v~~ 345 (385) T protein:vir:19 273 SGIVLNPRDWHNIALLK--DNEGRYIFGGPQAFTSNIMWGLPVVP--TKAQAA-GTFTVGGFDMASQVWDR--MDATVEV 345 (385) T ss_pred CEEEEcHHHHHHHHHhh--cCCCceeccCcccCCCceecceeeEE--cCcCCC-CcEEEeecccEEEEEEe--cceEEEE Confidence 79999999999996543 44444332111111111113555432 222221 11111111 11222222 1122221 Q ss_pred CCchhhcchhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 301 PVDYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 301 P~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) .. ......+++ ...+-++.|+++ .++.|.+++.+++.. T Consensus 346 ~~--~~~~~~~~~--~~~~~~~~r~~~-~v~~~~a~~~~~~~a 383 (385) T protein:vir:19 346 SR--EDRDNFVKN--MLTILCEERLAL-AHYRPTAIIKGTFSS 383 (385) T ss_pred ec--cccchhhcC--cEEEEEEEeecc-EEecccceEEEEecc Confidence 10 111123343 223345778885 558899999999999 No 40 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=97.66 E-value=1.8e-05 Score=46.62 Aligned_cols=279 Identities=10% Similarity=-0.030 Sum_probs=135.4 Q ss_pred HHHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEeeeccchhhhhhccccc Q lcl|NC_019525. 16 LNAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTTYRSFSLAEDFATGIID 95 (343) Q Consensus 16 ~~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~~~~vg~a~~ia~g~~~ 95 (343) |+.+....-+.. .. 
..+......++ ...|.+...+.-.-+.+.++....+.....+.....-..|.+++.| T Consensus 1 m~~~~~~~~~~~--~t-~~~~~lvP~~~---~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg--- 71 (297) T protein:vir:95 1 MTVQTFNPENVL--VS-QKKDGTLHKEF---TDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGISAYWVNET--- 71 (297) T ss_pred CCcccccccccc--cc-CCCcceechhH---HHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCceeEEeecC--- Confidence 444333222221 11 12222222222 2334443333323333334322222122222222222334555432 Q ss_pred CCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeeccccceeee Q lcl|NC_019525. 96 TGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGL 175 (343) Q Consensus 96 ~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GL 175 (343) ..+|..+..+++.....+.++..+.+|.+-|+.+. .++.+.-.+...+++.+.+|+.+++|+... +-.|+ T Consensus 72 -----~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~----~~l~~~i~~~la~ai~~~~d~a~l~G~g~~-~~~gi 141 (297) T protein:vir:95 72 -----EKIKTDKPEVVPVTLKAHKLGIILVTSREALNYTW----KKFFEDMKPQIVEAFYKKIDEAGLLGHDTP-FANSV 141 (297) T ss_pred -----ccccccccceeEEEEeeEEEEEeehhhHHHHhcCH----HHHHHHHHHHHHHHHHHHHHHHHhcccCCc-ccccc Confidence 35788888999999999999999999987776543 368888888888999999999999996543 34677 Q ss_pred eecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCCCcchhhhhHHHhhc Q lcl|NC_019525. 176 LTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTKQVLEDTF 255 (343) Q Consensus 176 lN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~~l~~n~ 255 (343) ++.........+ ...|.++ |.+++.++...- ..++.++|-+..+..|.+-+ ++.+.-+. .. . 
T Consensus 142 ~~~~~~~~~~~~-----~~~t~~~----i~~~~~~l~~~~---~~~~~~v~~~~~~~~L~~l~--d~~G~~i~---~~-~ 203 (297) T protein:vir:95 142 AKAAKDANKVIG-----GPINYDN----ILKLQDALYDAD---VEPNAFVSKIQNRSALREAR--DGNKVSIY---DK-A 203 (297) T ss_pred cccccccceecc-----cccCHHH----HHHHHHHhhhcc---CCcCEEEEcHHHHHHHHHhh--ccCCceee---cC-C Confidence 776543222111 1234444 445555554332 24678999999999996432 33332221 11 1 Q ss_pred chhcCCcceEEeechhhhhcccccCCCccEEEEEEcCc------ceEEEecCCchhhc----------chhhcCCceEEe Q lcl|NC_019525. 256 KEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNE------DSLRMDIPVDYTST----------LANSVNNFQFQN 319 (343) Q Consensus 256 ~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~------~~v~~~iP~~~~~l----------~p~~~~~l~~~v 319 (343) ...-.|.|+. +... . ...++.+++-+-+. +.+++++--+.... --++.+... + T Consensus 204 ~~~l~G~Pv~-----~~~~--~--~~~~~~~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~ 272 (297) T protein:vir:95 204 ANTIDGITTV-----DLKS--A--RFEKGDLLAGDFDNLIYGVPYNITYKISEEGQISTITNADGTPINLFEQEMIA--I 272 (297) T ss_pred CCcccceeeE-----eecC--C--CCCCceEEEEecccEEEEEecCeEEEEeeccccccccccCccchhhhhcCcEE--E Confidence 1111244432 2111 0 11122222222221 12222221111000 002333222 3 Q ss_pred ceeeeeccEEEEcCceEEeeec--CC Q lcl|NC_019525. 320 AAYGQFTGVLAYRPKELLYLDI--PV 343 (343) Q Consensus 320 ~~~~r~GGv~v~yP~a~~Y~D~--~~ 343 (343) -+++|+|+. ++.|++++.+=. || T Consensus 273 r~~~~~d~~-v~~~~a~~~l~~at~~ 297 (297) T protein:vir:95 273 RATMDIAVM-ITKTDAFAKLTPAERV 297 (297) T ss_pred EEEEEeccE-eecccceEEEeecCCC Confidence 446677665 556998887654 33 No 41 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=97.64 E-value=1.5e-05 Score=47.04 Aligned_cols=288 Identities=12% Similarity=0.017 Sum_probs=138.7 Q ss_pred hhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEeeeccchhhhhhcccccCCcccccccccc Q lcl|NC_019525. 
28 QRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVD 107 (343) Q Consensus 28 ~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd 107 (343) .+.....|. ++. +.+...|++.-.+.-.-+.+.++..... ..-.+..++.-+.|.+++.+ ..+|.-+ T Consensus 1 m~t~t~gg~--liP--~~~~~~ii~~l~~~s~i~~l~~~~~~~~-~~~~ip~~~~~~~a~wv~E~--------~~~~~s~ 67 (303) T protein:vir:97 1 MGTETSKAS--LFD--KHLVSDLINKVKGHSSLAKLSSQKPIPF-NGSKEFTFTLDSDIDVVAEN--------GKKTHGG 67 (303) T ss_pred CcccCCCCe--Ecc--hhHHHHHHHHHHhhchhhhhcceeecCC-CceEEEEEecCcceEEeecC--------ccccccc Confidence 222222332 222 1222334444333333444443322111 12233333444556666533 4678888 Q ss_pred ccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeeccc Q lcl|NC_019525. 108 TGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTF 187 (343) Q Consensus 108 ~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~ 187 (343) ..+++...+.++++.-+.+|.+=|++.--.. .+|.+.-.+.+.+++...+|+..++|..+..|..+......... +.. T Consensus 68 ~~f~~v~l~~~kl~~~~~iS~ell~~~~d~~-~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~-~~~ 145 (303) T protein:vir:97 68 LSLEPVTIVPIKVEYGARLSDEFLYATEEEK-IDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFD-SKV 145 (303) T ss_pred cceeeEEeeeEEEEEeehhhHHHhhcCccch-HHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccccccccccc-ccc Confidence 8999999999999999888877554322221 46888888888899999999999999654333222221111110 000 Q ss_pred CCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCCCcchhhhh-HHH-hhcchhcCCcceE Q lcl|NC_019525. 188 LTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTKQ-VLE-DTFKEITRNSSFE 265 (343) Q Consensus 188 ~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~-~l~-~n~~~~~~g~~l~ 265 (343) +..-+..+.+...+||.+++..+... + ..|+.++|.|+.+..|..-+ ++.+..++. -+. ...+..-.|.|+. 
T Consensus 146 -~~~~~~~~~~~~~~~i~~~~~~~~~~-~--~~~~~~vmn~~~~~~L~~lk--d~~g~~~~~~~~~~~~~~~~l~G~Pv~ 219 (303) T protein:vir:97 146 -TQVVKFTESEDADANIEAAVNLIQGA-E--GVVTGLAMDTEFSTALAKVT--NGEMGPKMYPELAWGANPDSINGLKSS 219 (303) T ss_pred -ccccccccccchHHHHHHHHHHHhhc-C--CCccEEEEcHHHHHHHHHhh--ccCCCeEEecCccCCCCCceecceeeE Confidence 00001111222356777777766443 2 35678999999999996433 332222221 001 1111122355554 Q ss_pred EeechhhhhcccccCCCccEEEEEEc-------CcceEEEecCCchhhc-----chhhcCCceEEeceeeeeccEEEEcC Q lcl|NC_019525. 266 ILPCVYADKITAQVPAVAKRYALYND-------NEDSLRMDIPVDYTST-----LANSVNNFQFQNAAYGQFTGVLAYRP 333 (343) Q Consensus 266 I~~~~~~~~~~~~g~gg~drmv~Y~~-------d~~~v~~~iP~~~~~l-----~p~~~~~l~~~v~~~~r~GGv~v~yP 333 (343) +-. .+..-...+ .+++.+++-+- ..+.+++++- +...- --.+.+-.. +-+++|+++. ++.| T Consensus 220 ~s~--~v~~~~~~~-~~~~~~~~Gdf~~~~~~~~~~~~~~~~~-~~~~~d~~~~~~~~~n~~~--~r~~~r~~~~-v~~p 292 (303) T protein:vir:97 220 VNT--TVGAGADEA-ESKDLVIIGDFESMFKWGYAKQIPMEII-KYGDPDNSGKDLKGYNQIY--LRAEAYIGWG-ILDA 292 (303) T ss_pred Eec--ccCCccccC-CCccEEEEeeccccEEEEEecCcEEEEe-eccCCCCcchhhhhcCcEE--EEEEEEeccE-eecc Confidence 321 121111111 12222222121 1233344331 11100 002332222 3457777654 5669 Q ss_pred ceEEe-eecCC Q lcl|NC_019525. 334 KELLY-LDIPV 343 (343) Q Consensus 334 ~a~~Y-~D~~~ 343 (343) +++++ .|.+| T Consensus 293 ~af~~l~~~~~ 303 (303) T protein:vir:97 293 KSFARVTKGEV 303 (303) T ss_pred cceEEeeCCCC Confidence 99854 58899 No 42 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=97.63 E-value=9.8e-06 Score=48.05 Aligned_cols=301 Identities=11% Similarity=-0.007 Sum_probs=140.3 Q ss_pred CCceeeec-----------CCchH--HHHHHHHh----------hhccchhhhhhhhhhhhhHHHHHHhhhhhhhccccc Q lcl|NC_019525. 
1 MKKFVIRN-----------SKGEK--ILLNAQEA----------KIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFE 57 (343) Q Consensus 1 ~~~~~~~~-----------~~~~~--~~~~a~~~----------~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~ 57 (343) +.+....+ ...+. -....... .............+..+...+ ++ ..|++...+. T Consensus 64 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~--~~-~~ii~~~~~~ 140 (390) T protein:vir:81 64 LEGNGAGGDVQHVSVGDMFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPN--RL-PGFITPPDAR 140 (390) T ss_pred HHhcccccccccccchhhhhhhHHHHHHHHHHhhhhhhhhhHHHHHHHhhccccccCCcceechh--hh-HHHHHHHhhh Confidence 00000000 00000 00000000 000000011112222333332 12 2344433333 Q ss_pred ccchhhcceecCCCCceeEEEeee-ccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHH Q lcl|NC_019525. 58 ISPADYMPIRVGEGAWSTMLTTYR-SFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMK 136 (343) Q Consensus 58 l~~~~~~pv~~~~~~w~~~~~~~~-~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~ 136 (343) ..-+.+.++....+ ....+.... ..+.+.+++.| ..+|..+..+++....++.++.-+.+|.+=|+. T Consensus 141 ~~l~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~Eg--------~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d--- 208 (390) T protein:vir:81 141 LTVRDLIGSGRTDS-ALIEYVQETGFVNNAAIVAEG--------ALKPESSLKFAKKTDTTHVIAHTMKATRQILSD--- 208 (390) T ss_pred hhhhhhcceeeccC-CceEEEEEecCCcceeeecCC--------cccccccceeeEEEEeeeEEEEeehhhHHHHHh--- Confidence 33344444322111 111222222 22344454432 358888889999999999999998888753332 Q ss_pred hcCCCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCC Q lcl|NC_019525. 137 SGNWDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCD 216 (343) Q Consensus 137 ~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~ 216 (343) +. +|.+--.....+++...+|+..++|+-......|++|.+++...... .+.....+|+.+++..+... 
T Consensus 209 ~~--~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~Gi~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~-- 277 (390) T protein:vir:81 209 AP--QLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTT-------IAGATRVDQLRLAMLQASLA-- 277 (390) T ss_pred HH--HHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccceeecccccccccc-------cccchhHHHHHHHHHhhccc-- Confidence 22 47777777778888888899999996555568899998876432211 11111245566666555432 Q ss_pred ceecCCeEEeCHHHHHHHhccccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccE-EEEEEcCcce Q lcl|NC_019525. 217 YTAMPNKFTIPESDYTGLAGAASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKR-YALYNDNEDS 295 (343) Q Consensus 217 ~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~dr-mv~Y~~d~~~ 295 (343) + ..++.++|-|+.|..|..-+ ++.+.-++.-........-.|.|+-+. .++. .+..-.|.-+. ..++++ .- T Consensus 278 ~-~~~~~~v~~~~~~~~l~~lk--d~~G~~l~~~~~~~~~~~l~G~pv~~~--~~~p-~~~~~~gd~~~~~~~~~~--~~ 349 (390) T protein:vir:81 278 E-YNPSGIVINPIDWAAIELAK--DANNQYLIGNARGTLTPTLWGLPVVAT--QAMA-PGEFLVGAFDLAAQIFDQ--WD 349 (390) T ss_pred c-CCCCEEEEcHHHHHHHHHhh--cCCCceeecCcccccCceecceeeEEc--CCCC-CCcEEEEehhceEEEEEe--cc Confidence 2 35678999999999996443 433333321111111111235554332 1221 11111111111 222221 22 Q ss_pred EEEecCCchhhc-chhhcCCceEEeceeeeeccEEEEcCceEEeeecC Q lcl|NC_019525. 296 LRMDIPVDYTST-LANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIP 342 (343) Q Consensus 296 v~~~iP~~~~~l-~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~ 342 (343) +.++. ... .-.+.+... +-++.|+++ .++.|.+++.+.+. 
T Consensus 350 ~~v~~----~~~~~~~~~~~v~--~r~~~r~d~-~v~~~~a~v~~t~a 390 (390) T protein:vir:81 350 ARVEI----GYVGEDFQRNMIT--VLAEERLAL-VVYRPEALISGSFA 390 (390) T ss_pred eEEEE----ecccchhhcCcEE--EEEEEeecc-EEecccceEEEEeC Confidence 22211 111 112233222 345777776 78899999999999 No 43 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=97.60 E-value=9.1e-06 Score=48.22 Aligned_cols=294 Identities=8% Similarity=-0.042 Sum_probs=133.4 Q ss_pred HHHHH-hhhccchhhhh--hhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEeeeccchhhhhhcc Q lcl|NC_019525. 16 LNAQE-AKIAGVIQRLC--NDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTTYRSFSLAEDFATG 92 (343) Q Consensus 16 ~~a~~-~~~~~~~~~~~--~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~~~~vg~a~~ia~g 92 (343) |.+.. ........+.. .+.+. +...++ -..+++.-.+.-.-+.+.++..... ....+-..+.-+.|.+++.+ T Consensus 1 ~~~~~~~~~~~~~~~~t~~~~~~~-~ip~~~---~~~ii~~~~~~s~l~~~~~~~~~~~-~~~~~p~~~~~~~a~~v~E~ 75 (320) T protein:vir:10 1 MAAGTAFQVDHAQIAQTGDTMFKG-YLEPEQ---AKDYFAEAEKTSIVQQFAQKVPMGT-TGQKIPHWIGDVSAQWIGEG 75 (320) T ss_pred CCCCccCCHHHHHhhccccccccc-cccHHH---HHHHHHHHHhccchhhhcceeeccC-CceEEEEEeCCcceEEecCC Confidence 11111 11111111111 12222 222221 2223333223333334444322111 12222223333344555432 Q ss_pred cccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeecccc-- Q lcl|NC_019525. 93 IIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDNA-- 170 (343) Q Consensus 93 ~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~-- 170 (343) .++|..+..+.+.+.+.++++.-+.+|.+=|+.+. .+|.+.-...+.+++...+|+..+.|+.+.. 
T Consensus 76 --------~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~----~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~ 143 (320) T protein:vir:10 76 --------DMKPITKGNMTSQNIAPHKIATIFVASAETVRANP----ANYLGTMRTKVATAFAMAFDSAALNGTDSPFPT 143 (320) T ss_pred --------ccccccccceeEEEEeeEEEEEeehhhHHHHhcCh----HHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCc Confidence 46899999999999999999999999988777443 4688888888889999999999999964321 Q ss_pred ceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCCCcchhhh-h Q lcl|NC_019525. 171 NVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTK-Q 249 (343) Q Consensus 171 g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~-~ 249 (343) ++.|.++..++.. + ++.+++..+.. .+++.+++..+ ......+..++|-|+.+..|..-+ ++.+..+. . T Consensus 144 ~~~~~~~~~~~~~--~-~~~~~~~~~~~--~~~~~~~~~~~---~~~~~~~~~~v~n~~~~~~L~~lk--d~~G~~l~~~ 213 (320) T protein:vir:10 144 YLAQTTKSVSLAD--P-GGATASDLTAY--DAVAVNGLSLL---VNAKKKWTHTLLDDIVEPILNGAK--DKNGRPLFIE 213 (320) T ss_pred cccccccccccee--c-ccccccccccH--HHHHHHHHhhh---hcccCCCcEEEEcHHHHHHHHHhh--ccCCceeecc Confidence 2223333322221 1 12222222221 12232333332 222235678999999999996433 33332221 1 Q ss_pred HHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcC------cceEEEecCCchh--hc--------chhhcC Q lcl|NC_019525. 250 VLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDN------EDSLRMDIPVDYT--ST--------LANSVN 313 (343) Q Consensus 250 ~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d------~~~v~~~iP~~~~--~l--------~p~~~~ 313 (343) -........-+|.++--.|+....... .++..++.-+-+ .+.+++++--... .. .-++++ T Consensus 214 ~~~~~~~~~~~~~~i~g~pv~~~~~~~----~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~ 289 (320) T protein:vir:10 214 STYTDENSPFRAGRIVSRPTILSDHVA----DGTTVGYMGDFRNVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHN 289 (320) T ss_pred ccccCccccccCceeeeeeeEecCCCC----CCceEEEEeecceEEEEEecCeEEEEeecceeeeccccccccchhhhcC Confidence 111111111223344444443322221 122222111111 1122222110000 00 002232 Q ss_pred CceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 
314 NFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 314 ~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) -.. +-++.|+ ++.+.+|++++.+---+ T Consensus 290 ~~~--~r~~~~~-d~~v~~~~a~~~l~~~~ 316 (320) T protein:vir:10 290 LVA--VRVEAEY-AFHNNDKDAFVKLTNVV 316 (320) T ss_pred cEE--EEEEEee-ccEEecccceEEEEecc Confidence 222 3446666 56678999998876444 No 44 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=97.59 E-value=8.2e-06 Score=48.45 Aligned_cols=292 Identities=9% Similarity=-0.030 Sum_probs=145.0 Q ss_pred hhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEeeeccchhhhhhcccccCCccccccccc Q lcl|NC_019525. 27 IQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAAV 106 (343) Q Consensus 27 ~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~v 106 (343) +-....+.| .++.+ .+...|++...+.-.-+.+.++.... .....+......+.|++++.+ ..+|.. T Consensus 1 Mat~tt~~g--~~vP~--~~~~~ii~~~~~~s~l~~~~~~i~~~-~~~~~~p~~~~~~~a~wv~Eg--------~~~~~~ 67 (311) T protein:vir:99 1 MATFGTGNL--KNLPR--NIADGMVKDVVQGSTVAVLSARKPQR-FGNEDIITFNGRPKAEFVGEG--------QQKSST 67 (311) T ss_pred CceecCCCc--eeccH--HHHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEeCCceeEEeecC--------cccccc Confidence 111122222 22221 22334555433443334443322111 112234333444556666533 468888 Q ss_pred cccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeeccc--cceeeeeecCCceee Q lcl|NC_019525. 107 DTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDN--ANVKGLLTQTGNVVN 184 (343) Q Consensus 107 d~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~--~g~~GLlN~p~v~~~ 184 (343) +..+++.....++++.-...|.+=|+++-... .+|.+.-.+.+.+++...+|+..++|+.+. .+..|+.+-...+.. 
T Consensus 68 ~~~f~~v~l~~~k~~~~~~iS~ell~~~~d~~-~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~ 146 (311) T protein:vir:99 68 TGEFDFVTSTPKKAQVTMRFNEEVQWADEDYQ-LGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASK 146 (311) T ss_pred cceeeEEEEeeEEEEEeehhhHHHhhcccccH-HHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccc Confidence 88999999999999988888876554332232 468888889999999999999999996432 234455544333221 Q ss_pred cccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCCCcchhhhh-HHHhhcchhcCCcc Q lcl|NC_019525. 185 NTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTKQ-VLEDTFKEITRNSS 263 (343) Q Consensus 185 ~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~-~l~~n~~~~~~g~~ 263 (343) .. +-.+.+......|+..++..+...... ..++.++|-+..+..|..-+ +..++-++. .........-.|.| T Consensus 147 ~~----~~~~~~~~~~~~~i~~~~~~~~~~~~~-~~~~~~vmn~~~~~~L~~lk--d~~G~~l~~~~~~~~~~~~l~G~P 219 (311) T protein:vir:99 147 RV----ELTADTIANPDLAIEAAVGLLVANGHP-TPVNGLALHPSIAWGLSTAR--YTDGRKKFPELGLGIGVSSFEGID 219 (311) T ss_pred ee----eccccccchhHHHHHHHHHHHhhhccC-CCccEEEEcHHHHHHHHhhh--ccCCCeeecCcccCCCCceeccee Confidence 11 111222333456677777666544332 35667999999999996432 333333321 10111111223555 Q ss_pred eEEeech-----hhhhcccccCCCccEEEEEEcCcceEEEecCCch--hhcc---------hhhcCCceEEeceeeeecc Q lcl|NC_019525. 264 FEILPCV-----YADKITAQVPAVAKRYALYNDNEDSLRMDIPVDY--TSTL---------ANSVNNFQFQNAAYGQFTG 327 (343) Q Consensus 264 l~I~~~~-----~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~--~~l~---------p~~~~~l~~~v~~~~r~GG 327 (343) +-+-... +.........+.++.+++-+.+ +.+++.+-... .... -++++-.. +-|+.|+|+ T Consensus 220 v~~s~~i~~~~~~~~~~~~~~~~~~~~~~~Gdf~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~--~r~~~r~d~ 296 (311) T protein:vir:99 220 ASVSDTVNGGDEADPDDEDLDAARAVRGIVGDFA-NGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIA--LRLEIVYGW 296 (311) T ss_pred eEeecccccccccccccchhhccCcceEEEeecc-ccEEEEEecCceEEEeecCCCCcchhhhhcCcEE--EEEEEeecc Confidence 4332100 0000000111233333332322 22222221111 1110 12333223 456889999 Q ss_pred EEEEcCceEEeeecCC Q lcl|NC_019525. 
328 VLAYRPKELLYLDIPV 343 (343) Q Consensus 328 v~v~yP~a~~Y~D~~~ 343 (343) . ++.|..+++.|.-- T Consensus 297 ~-v~~~~~v~~~~~~A 311 (311) T protein:vir:99 297 Y-VFTDRFVVIENAVA 311 (311) T ss_pred e-ecChhHeeeecccC Confidence 6 67798887777666 No 45 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=97.58 E-value=2.7e-05 Score=45.66 Aligned_cols=308 Identities=10% Similarity=-0.053 Sum_probs=144.3 Q ss_pred HHHHHhhhccc----hhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEeeeccchhhhhhc Q lcl|NC_019525. 16 LNAQEAKIAGV----IQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTTYRSFSLAEDFAT 91 (343) Q Consensus 16 ~~a~~~~~~~~----~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~~~~vg~a~~ia~ 91 (343) |.+.+.-.... .+....+.+....- +.+-..|++.-.+.-..+++.++..... -...+-.......|.+++. T Consensus 1 ~a~l~el~~~~~~~~~~g~~~~~~~~liP---~~~~~~ii~~l~~~s~l~~~~~~~~~~~-~~~~~p~~~~~~~a~~v~e 76 (333) T protein:vir:78 1 MATLNELLPNSAGSNHQGRLAHVPSDLLP---KEIVGPIFDKAQESSLVLRMGEQIPISY-GETIIPTTVKRPEVGQVGV 76 (333) T ss_pred CchhHHhhhhcccccccCceecCCccccc---hhHHHHHHHHHHhhchhhhhcceeeccC-CceEEEEEeCCceeEeecC Confidence 43333211110 01010111111111 2223344444334433344444322111 1222222333344445443 Q ss_pred ccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeecc--c Q lcl|NC_019525. 92 GIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKD--N 169 (343) Q Consensus 92 g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~--~ 169 (343) |..........+|.-+..+++.+...++++.-...|.+=|+.+. .++.+--.....+++...+|.-.++|+.+ . 
T Consensus 77 g~~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~s~----~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~ 152 (333) T protein:vir:78 77 GTSNEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARMNP----SGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTG 152 (333) T ss_pred cccccccccccccccccceeEEEEeeEEEEEeehhhHHHHhcCH----HHHHHHHHHHHHHHHHHHHHHHHhcccCCCCC Confidence 32211122346888899999999999999998888876555432 36788888888888889999999999643 2 Q ss_pred cceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccc-cCCCcchhhh Q lcl|NC_019525. 170 ANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAA-SADFPIKSTK 248 (343) Q Consensus 170 ~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~-~s~~~~~tl~ 248 (343) .++.|+++...+...+.. .....+.+-.+++|.+++..+..+ +...++.++|-|..|..|..-. ..++.+..+. T Consensus 153 ~~~~g~~~~~~~~~~~~~---~~~~~~~~~~~~~i~~~~~~~~~~--~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~ 227 (333) T protein:vir:78 153 SALQGIDTDNVIANTTNV---DYLQETGDPLLDRLLDGYDLVSAN--TDVEFNGWAVDPRFRAHLLRAQAYRDANGNVDP 227 (333) T ss_pred cccccccccccccccccc---cccccccchhHHHHHHHHHhhccc--cccCceEEEEcchHHHHHHHHhhhcCCCCceee Confidence 456788877665432211 111222223356666666655333 3345678999998888775322 2333333332 Q ss_pred hH-HHhhcchhcCCcceEEeechhhh-hcccccCCCccEEEEEEcCcceEEEecCCchhhcc-------------hhhcC Q lcl|NC_019525. 249 QV-LEDTFKEITRNSSFEILPCVYAD-KITAQVPAVAKRYALYNDNEDSLRMDIPVDYTSTL-------------ANSVN 313 (343) Q Consensus 249 ~~-l~~n~~~~~~g~~l~I~~~~~~~-~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~~~l~-------------p~~~~ 313 (343) .. ........-.|.|+.+-. .+. ..+ .+.+++..+++-+.+.=.+...-.+.+.... 
-++++ T Consensus 228 ~~~~~~~~~~~l~G~Pv~~~~--~i~~~~~-~~~~~~~~~~~gD~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 304 (333) T protein:vir:78 228 SRINLAAQTGDVLGLPAQFGR--AVGGDLG-AAVDSKTRIIGGDFSQLKFGFADEIRIKMSDTATLTDSGSATVSMWQTN 304 (333) T ss_pred cCccccCCCceeeceeeEEcc--ccCCCcc-ccCCCccEEEEEecccEEEEEeeccEEEEeccccccccccceeehhhcC Confidence 11 111111112355543211 111 111 1122333344333332112111111111110 12222 Q ss_pred CceEEeceeeeeccEEEEcCceEEee---ecC Q lcl|NC_019525. 314 NFQFQNAAYGQFTGVLAYRPKELLYL---DIP 342 (343) Q Consensus 314 ~l~~~v~~~~r~GGv~v~yP~a~~Y~---D~~ 342 (343) . ..+-+++|+++. ++.|++++++ +-| T Consensus 305 ~--v~~r~~~r~d~~-v~~~~a~~~l~~~~a~ 333 (333) T protein:vir:78 305 Q--IAILIEVTFGWL-LGDKQAFVKFVDDEQP 333 (333) T ss_pred c--EEEEEEEEEccE-EecccceEEEeccCCC Confidence 1 223456666654 5888887764 566 No 46 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=97.56 E-value=1.2e-05 Score=47.62 Aligned_cols=307 Identities=9% Similarity=-0.036 Sum_probs=138.6 Q ss_pred CCceee--ecCCchHHHHHHHHhhhccchhh-hhhhhhhhhhHHHHHHhhhhhhhccccccc-chhhcceecCCCCceeE Q lcl|NC_019525. 1 MKKFVI--RNSKGEKILLNAQEAKIAGVIQR-LCNDLGFEIDVTTLTTLMKKIIEQKFFEIS-PADYMPIRVGEGAWSTM 76 (343) Q Consensus 1 ~~~~~~--~~~~~~~~~~~a~~~~~~~~~~~-~~~d~~~~f~~~qL~~i~~~iye~~~~~l~-~~~~~pv~~~~~~w~~~ 76 (343) .+.+.. ++..+.. +...+.......... ....+|......+ +...++.....+.. .+.+..+.+..+ ... T Consensus 222 ~~a~~~~~~~~~~~~-l~~~e~~~~~~~~~~~~t~~~gg~lip~~---~~~~ii~~~~~~~~~l~~~~~~~~~~g--~~~ 295 (543) T protein:vir:81 222 LRAWSKMARNPHAAI-LTEEEKRAINEVRAMGLTKADGGYLVPFQ---LDPTVIITSNGSLNDIRRFARQVVATG--DVW 295 (543) T ss_pred hhHHHHHHHhhHHHH-hhhhhhhhhhhhhhcccccccCcccCchh---hhhHHHHHHHhhhchhhhhcccccCCc--ceE Confidence 110100 1111000 000000001110000 1111221111122 12222222222221 223333322211 222 Q ss_pred EEeeeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHh Q lcl|NC_019525. 
77 LTTYRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDL 156 (343) Q Consensus 77 ~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~ 156 (343) +...+..+.|.+++-| ..+|.-+..+......++.++.-+.+|.+=|+. . .+|.+.-.....+++.. T Consensus 296 ~~~~~~~~~a~~v~Eg--------~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d---~--~~~~~~i~~~l~~~~~~ 362 (543) T protein:vir:81 296 HGVSSAAVQWSWDAEF--------EEVSDDSPEFGQPEIPVKKAQGFVPISIEALQD---E--ANVTETVALLFAEGKDE 362 (543) T ss_pred EEEecCCcceeecccC--------ccccccccccceeeeeeeeeEeeehhhHHHHhc---c--HHHHHHHHHHHHHHHHH Confidence 2222333445555432 357778888999999999999999998854332 2 36888888888888899 Q ss_pred hhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhc Q lcl|NC_019525. 157 GIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAG 236 (343) Q Consensus 157 ~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~ 236 (343) .+|+.+++|+.......|+++++..+..... ...+..-..+|+.+++..+- ..+. ....++|.|..|..|.. T Consensus 363 ~~d~ail~G~Gt~~~p~Gi~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~l~--~~~~-~~~~~v~n~~~~~~l~~ 434 (543) T protein:vir:81 363 LEAVTLTTGTGQGNQPTGIVTALAGTAAEIA-----PVTAETFALADVYAVYEQLA--ARHR-RQGAWLANNLIYNKIRQ 434 (543) T ss_pred HHHHHHhccCCCCcccccchhhccccccccc-----ccccccccHHHHHHHHHhhh--cccc-CCcEEEEcHHHHHHHHH Confidence 9999999996433246799988765332211 11111122455666665553 2221 22368999999999975 Q ss_pred cccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhh-cccccCCCccEEEEEEcCcceEEEec--CCchhhcc----h Q lcl|NC_019525. 237 AASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADK-ITAQVPAVAKRYALYNDNEDSLRMDI--PVDYTSTL----A 309 (343) Q Consensus 237 ~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~-~~~~g~gg~drmv~Y~~d~~~v~~~i--P~~~~~l~----p 309 (343) -+ ++.+.-+..-..+.....-.|.|+.+..- +.. .......++..+++-+. +.+.+-. .+.+...+ . 
T Consensus 435 lk--d~~G~~l~~~~~~g~~~~l~G~pv~~~~~--~~~~~~~~~~~~~~~i~~gd~--~~~~i~~~~~~~i~~~~~~~~~ 508 (543) T protein:vir:81 435 FD--TQGGAGLWTTIGNGEPSQLLGRPVGEAEA--MDANWNTSASADNFVLLYGNF--QNYVIADRIGMTVEFIPHLFGT 508 (543) T ss_pred hh--cCCCceeccCcCCCCCccccceeeEEecc--ccccccccccCCcceEEEeec--cceeEEeecccEEEEecccccc Confidence 43 33333222111111111224666554331 111 11111112222222222 2222211 11122111 0 Q ss_pred --hhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 310 --NSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 310 --~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) ..++...+ -.+.|+|+. ++.|.+++++.++. T Consensus 509 ~~~~~~~~~~--~~~~r~d~~-v~~~~A~~~l~~~~ 541 (543) T protein:vir:81 509 NRRPNGSRGW--FAYYRMGAD-VVNPNAFRLLNVET 541 (543) T ss_pred chhhcCceEE--EEEEeeccE-eecccceEEEEecc Confidence 11222233 336677775 56799999999999 No 47 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=97.50 E-value=1.9e-05 Score=46.49 Aligned_cols=307 Identities=12% Similarity=-0.012 Sum_probs=139.3 Q ss_pred CCceeee--cCCc-hHHHHHHHHh------h--------hc-c--chhhh--hhhhhhhhhHHHHHHhhhhhhhcccccc Q lcl|NC_019525. 1 MKKFVIR--NSKG-EKILLNAQEA------K--------IA-G--VIQRL--CNDLGFEIDVTTLTTLMKKIIEQKFFEI 58 (343) Q Consensus 1 ~~~~~~~--~~~~-~~~~~~a~~~------~--------~~-~--~~~~~--~~d~~~~f~~~qL~~i~~~iye~~~~~l 58 (343) -+..+.. +... +.....+... . .. . ..+++ ..+....+.+. 
+.+...|++...+.- T Consensus 61 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP--~~~~~~ii~~~~~~~ 138 (404) T protein:vir:10 61 NEDNVKSLNTGKEENVIYNGALFVRAIADNLLKQKNQRGLNLSEKEINAISENIDEDGGYAVP--EDIQTKINTRLKDTT 138 (404) T ss_pred hhhhccccccccchhhHHHHHHHHHHHHHHHHHHHHhhhhcchhhHHhhhccccCCCCceeec--hhHHHHHHHHHhhhh Confidence 0000000 0000 0000000000 0 00 0 00000 00111112222 223344554433333 Q ss_pred cchhhcceec-CCCCceeEEEeeeccchhhhhhcccccCCcccccccc--ccccccceeeeeEEEEeeEeecHHHHHHHH Q lcl|NC_019525. 59 SPADYMPIRV-GEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAA--VDTGVDAVSIKTYKWAKTIGWTLPELAEAM 135 (343) Q Consensus 59 ~~~~~~pv~~-~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~--vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~ 135 (343) ....++++.. ..+.+...+........+.+++.+. .+|. .+..++..+...+.++.-+.+|.+=|+.+. T Consensus 139 ~l~~l~~~~~~~~~~g~~~~~~~~~~~~~~~v~e~~--------~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~ 210 (404) T protein:vir:10 139 DLYNMVDYEPVFTRSGSRTYEKRSKQKPMKPLSENQ--------QIPTNGDNGKLERFNFKLKDLADFMSIPNDLLKFAD 210 (404) T ss_pred hHhhhhceeeccCCccceEEEEecCCcceeeccccc--------cccccccccceeeeEeeheeeEeeehhhHHHHhhcH Confidence 3333333322 1222222232233334444554331 2332 345678888888888888888875544332 Q ss_pred HhcCCCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcC Q lcl|NC_019525. 136 KSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGC 215 (343) Q Consensus 136 ~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s 215 (343) .+|.+--...+.+++...+|+..+.|+.......|+++.+++...+..+. .+.+++.+.++..+ .. 
T Consensus 211 ----~~l~~~i~~~la~~~~~~~~~~il~G~g~~~~~~gi~~~~~~~~~~~~~~-----~~~~~~~~~~~~~l-----~~ 276 (404) T protein:vir:10 211 ----KSLEDWIINWFVDKVRITRNAEILYGAGGDEHATGIMTANKFKKITLPKS-----PALKDFKKCKNVEL-----LN 276 (404) T ss_pred ----HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcccceeeccccceeecccc-----ccHHHHHHHHHhhh-----hc Confidence 35777777778888888899999999655556779998887654333222 23444444433222 12 Q ss_pred CceecCCeEEeCHHHHHHHhccccCCCcchhhhh-HHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcc Q lcl|NC_019525. 216 DYTAMPNKFTIPESDYTGLAGAASADFPIKSTKQ-VLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNED 294 (343) Q Consensus 216 ~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~-~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~ 294 (343) ++. ....++|-|..|..|..-+ ++.+.-+.. -+.+.....-.|.|+.+.+... ..++++...+++-+-+ + T Consensus 277 ~~~-~~~~~v~n~~~~~~L~~lk--d~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~-----~~~~~~~~~~~~gd~s-~ 347 (404) T protein:vir:10 277 VFK-ATSSWIVNQDGFNYLDSLE--DKTGRPYLQPDPKDPTQYRFLGLPVIELPNDL-----LLSTESAIPVLLGDTK-E 347 (404) T ss_pred ccc-CCCEEEEcHHHHHHHHHhh--ccCCceeeccCcCCCCCccccceeeEEecccc-----cCCCCCccEEEEEecc-c Confidence 222 2346899999999996533 333332221 1112222222467765433211 1122233333333222 2 Q ss_pred eEEEe--cCCchhhc----chhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 295 SLRMD--IPVDYTST----LANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 295 ~v~~~--iP~~~~~l----~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) .+.+. -.+.+... .-.+.+. 
..+-++.|+++ .++.|.+++.+.+.+ T Consensus 348 ~~~~~~~~~~~i~~~~~~~~~~~~~~--~~~~~~~r~d~-~v~~~~a~~~~~~~~ 399 (404) T protein:vir:10 348 AYKYVSDGAYELATTNIGAGAFETNT--TKARIIMRIDG-NVKDSEALLIAEIPV 399 (404) T ss_pred cEEEEEecceEEEEeccccchhhcCc--eEEEEEEeecc-EEecccceEEEEeec Confidence 22111 12222211 1112222 22345777766 788899999999999 No 48 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=97.37 E-value=5e-05 Score=44.17 Aligned_cols=290 Identities=9% Similarity=-0.026 Sum_probs=136.8 Q ss_pred eeecCCchHHHHHHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEeeeccc Q lcl|NC_019525. 5 VIRNSKGEKILLNAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTTYRSFS 84 (343) Q Consensus 5 ~~~~~~~~~~~~~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~~~~vg 84 (343) .-- ++..... ......+.+ .++..++ + ..+++.-.++-.-+++.++..-. .....+-...... T Consensus 1 ~g~---------~~e~~~~---~~~~t~~~~-g~l~~~~--~-~~ii~~l~~~s~i~~l~~~~~~~-~~~~~ip~~~~~~ 63 (397) T protein:vir:23 1 MGF---------SADHSQI---AQTKDTMFT-GYLDPVQ--A-KDYFAEAEKTSIVQRVAQKIPMG-ATGIVIPHWTGDV 63 (397) T ss_pred CCc---------CHHHHHH---hhccCCCCc-cccchhH--H-HHHHHHHHhccchhhhcceeecc-CCceEEEEEcCCc Confidence 000 1111000 000011112 1232222 1 22333222222223333322211 1122333334444 Q ss_pred hhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEee Q lcl|NC_019525. 85 LAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFV 164 (343) Q Consensus 85 ~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~ 164 (343) .|.+++.+ ..+|.-+..+++....+++++.-+.+|.+=|+.+. 
.++.+.-.+...+++...+|+.+++ T Consensus 64 ~a~wv~Eg--------~~~~~s~~~f~~v~l~~~k~~~~v~iS~ell~ds~----~~l~~~i~~~l~~aia~~~d~a~l~ 131 (397) T protein:vir:23 64 SAQWIGEG--------DMKPITKGNMTKRDVHPAKIATIFVASAETVRANP----ANYLGTMRTKVATAIAMAFDNAALH 131 (397) T ss_pred ceEEecCC--------ccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHHhh Confidence 55665432 36788888999999999999999988877666442 4688888889999999999999999 Q ss_pred eeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCCCcc Q lcl|NC_019525. 165 GMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFPI 244 (343) Q Consensus 165 G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~ 244 (343) |+...++..|+.+..+.....+ ...+.+.+++.+..+... + ..+..++|-++.|..|.+-+ ++.+ T Consensus 132 G~gt~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~l~~~------~-~~~a~~vmn~~~~~~L~~lk--d~~G 196 (397) T protein:vir:23 132 GTNAPSAFQGYLDQSNKTQSIS------PNAYQGLGVSGLTKLVTD------G-KKWTHTLLDDTVEPVLNGSV--DANG 196 (397) T ss_pred cccCCcccccccccccceeeec------ccchhHHHHHHHHhhhhc------c-cCCCEEEEcHHHHHHHHHhh--ccCC Confidence 9766556667776665433211 122334444444433322 1 24568999999999997533 3333 Q ss_pred hhhhh-HHHhhcchhcCCcceEEeechhhhhccccc----CCCccEEEEEEcCcceEEEecCCchhhc----------ch Q lcl|NC_019525. 245 KSTKQ-VLEDTFKEITRNSSFEILPCVYADKITAQV----PAVAKRYALYNDNEDSLRMDIPVDYTST----------LA 309 (343) Q Consensus 245 ~tl~~-~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g----~gg~drmv~Y~~d~~~v~~~iP~~~~~l----------~p 309 (343) +.++. -..........+..|.=.|+.......... .|.-.++++. +.+.+.+++--..... 
-- T Consensus 197 ~~i~~~~~~~~~~~~~~~~tl~G~Pv~~s~~~~~g~~~~~~gDfs~~~i~--~~~~i~i~~~~e~~~~~~~~~~~~~~~l 274 (397) T protein:vir:23 197 RPLFVESTYESLTTPFREGRILGRPTILSDHVAEGDVVGYAGDFSQIIWG--QVGGLSFDVTDQATLNLGSQESPNFVSL 274 (397) T ss_pred ceeecccccccccccccCceeeeeeEEEeCCCCCCceEEEEeecceEEEE--EEeceEEEEeeeeeeeeccccccceeee Confidence 32211 111111111111112222332222211100 0111122211 1222333221111100 00 Q ss_pred hhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 310 NSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 310 ~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) .+++-.. +-++.|+++ .++.|+++.+++... T Consensus 275 f~~d~v~--~ra~~r~d~-~v~~~~a~~~~~~~~ 305 (397) T protein:vir:23 275 WQHNLVA--VRVEAEYGL-LINDVNAFVKLTFDP 305 (397) T ss_pred eecccee--EEEEeeecc-ceecccceEEEeecc Confidence 1222122 334666766 888999999999855 No 49 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=97.35 E-value=7.4e-05 Score=43.24 Aligned_cols=308 Identities=10% Similarity=0.024 Sum_probs=143.0 Q ss_pred CCceeeecCCchHHHHHHHHh-hhc-------cchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccch----hhcceec Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEA-KIA-------GVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPA----DYMPIRV 68 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~-~~~-------~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~----~~~pv~~ 68 (343) |-.++.--..++.-...|... +.. -.+.. ..+.|. +++.+ .+...|++...+.-..+ +.+|.. 
T Consensus 32 ~~~~~~a~a~~~g~~~~a~~~a~~~~~~~~~~~a~~~-~~~~Gg-~lvP~--~~~~~ii~~l~~~s~l~~lg~~~v~~~- 106 (366) T protein:vir:57 32 MTRMVMSIAAGKGNLADAAKFAATELGDTGLSMAIST-AAGSGG-ALIPQ--NMQNEVIELLRDRTVVRILGARSIPLP- 106 (366) T ss_pred HHHHHHHHHhcccchhHHHHHHHHhhcchhhhhhccc-cccCCc-cccch--hHHHHHHHHHhhhcchhhhceeeeecC- Confidence 112332111111111111111 000 00111 112222 23332 12334554422222222 223321 Q ss_pred CCCCceeEEEeeeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHH Q lcl|NC_019525. 69 GEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEK 148 (343) Q Consensus 69 ~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~ 148 (343) .+ ...+-..+.-..|.+++.+ .++|.-+..+++.+.+.+.++.-..+|.+=|+.+. .++++--.+ T Consensus 107 -~g--~~~~p~~t~~~~a~wv~E~--------~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~----~~~~~~i~~ 171 (366) T protein:vir:57 107 -NG--NLSMPRLSGGATAGYVGEG--------KDVVATGATFDDVKLSAKTMIALVPVSNQLIGRAG----FNVEQLLLG 171 (366) T ss_pred -CC--ceEEEEEeCCcceeeeccC--------ccccccccceeEEEEeeEEEEEeehhhHHHHhhhh----HHHHHHHHH Confidence 11 1122222233344444432 46888888999999999999998888866665442 467787888 Q ss_pred HHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCH Q lcl|NC_019525. 149 SRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPE 228 (343) Q Consensus 149 aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~ 228 (343) ...+++...+|+..+.|+-....-.|++|.+.+...... 
.+-...+.+.+..++..+.........+ ......+|.+ T Consensus 172 ~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~--~~~t~~~~~~~~~~~~~~~~~~~~~~~~-~~~a~~vmn~ 248 (366) T protein:vir:57 172 DILSAIATREDKAFLRDDGTGDTPKGMKAVATAANRLVA--WTGTAINLTTIDEYLDSLILKHMDSNSN-MIRCGWGLSN 248 (366) T ss_pred HHHHHHHHHHHHHhhccCCCCccccceeeccccccceee--ccccccchhhHHHHHHHHHHhhhccccc-cccCEEEecH Confidence 888888889999999995433346799998865332221 1222445555666555554443333332 2345788999 Q ss_pred HHHHHHhccccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCC-CccEEEEEEcCcceEEEecCCchhhc Q lcl|NC_019525. 229 SDYTGLAGAASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPA-VAKRYALYNDNEDSLRMDIPVDYTST 307 (343) Q Consensus 229 ~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~g-g~drmv~Y~~d~~~v~~~iP~~~~~l 307 (343) ..|..|..-+ +..+..++. ......-.|.|+-+.. .+.. ..+.+ .+..++.-+-+.=.+...-.+..... T Consensus 249 ~~~~~L~~lk--d~~G~~l~~---~~~~g~l~G~Pvv~s~--~ip~--~~~~~~~~~~i~~gdfs~~~i~~~~~i~i~~~ 319 (366) T protein:vir:57 249 RTYMTLFGLR--DGNGNKVYP---EMSQGILKGYPIQRTS--AIPA--NLGDDGNESEIYFCDFNDVVIGEDGMMKVDFS 319 (366) T ss_pred HHHHHHHhhh--ccCCceecc---CCCCCeecceeeEEcc--cccc--ccccCCCccEEEEEecceEEEEEecceEEEEe Confidence 9999997543 333333321 1111112355543321 1111 11111 12222222222111111111111111 Q ss_pred -------------chhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 308 -------------LANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 308 -------------~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) -.++.+...+ -+++|++ +.+++|++++++---. 
T Consensus 320 ~ea~~~~~~g~~~~~f~~~~~~i--R~~~~~d-~~v~~~~a~~~lt~~~ 365 (366) T protein:vir:57 320 TEATYKDADGQLVSAFARNQSLI--RVVTEHD-IGFRHPEGLVLGTGVI 365 (366) T ss_pred eccccccccccchhhhhcCceeE--EeeeeeC-cEeeccccEEEEeccc Confidence 0122332333 4466665 4568999999887666 No 50 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=97.35 E-value=3.2e-05 Score=45.25 Aligned_cols=312 Identities=13% Similarity=0.030 Sum_probs=136.6 Q ss_pred CCceeeecCCc-hH-HHHHHHHhhhcc-------------------chhhhhhhhhhhhhHHHHHHhhhhhhhccccccc Q lcl|NC_019525. 1 MKKFVIRNSKG-EK-ILLNAQEAKIAG-------------------VIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEIS 59 (343) Q Consensus 1 ~~~~~~~~~~~-~~-~~~~a~~~~~~~-------------------~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~ 59 (343) ...-......+ ++ ...+........ .........| .+++. +.+...|++...+... T Consensus 115 ~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g-~~~ip--~~~~~~ii~~~~~~~~ 191 (458) T protein:vir:10 115 VGDSVAKALYGTQENFEDEVEKLVLLSYVMEKGVFETEHGQRHLKAVNQSSSVEVS-SESYE--TIFSQRIIRDLQKELV 191 (458) T ss_pred hhhhhhccchhhhhhHHHHHHHHHHHHHHHhhccchhhhhhhhhhhhhhcccCccc-cceeh--hhHhHHHHHHHHhhhh Confidence 00000000000 00 000000000000 0000011111 11221 2333444444333333 Q ss_pred chhhcceec-CCCCceeEEEeeeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhc Q lcl|NC_019525. 60 PADYMPIRV-GEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSG 138 (343) Q Consensus 60 ~~~~~pv~~-~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~G 138 (343) .+.+.++.. ..+ ...+......+.|.+++-+... ...+..+..+..+++.....+.++.-+.+|.+=|+.+. 
T Consensus 192 l~~~~~~~~~~~~--~~~~~~~~~~~~a~~v~e~~~~--~~~~~~~~~~~~~~~i~~~~~k~~~~v~is~ell~ds~--- 264 (458) T protein:vir:10 192 VGALFEELPMSSK--ILTMLVEPDAGKATWVAASTYG--TDTTTGEEVKGALKEIHFSTYKLAAKSFITDETEEDAI--- 264 (458) T ss_pred HHhhcceeecCCc--ceEEEEecCCcceeeccccccc--ccccccccccccceeeEeeeeeEEeeehhhHHHHhcch--- Confidence 333333221 222 1122222233445555433211 01111223455678888888888888888877555432 Q ss_pred CCCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCc---ccccCCHHHHHHHHHHHHHHHHhcC Q lcl|NC_019525. 139 NWDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTK---SIKSMTPAELKVLCAGIIDVYRQGC 215 (343) Q Consensus 139 r~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~---~w~~kT~~eIl~Din~~l~~v~~~s 215 (343) .+|.+--.+.+..++...+|+-+++|+-. ....|++|++.+...++.... .-...|.+.|++-+..+-. T Consensus 265 -~~~~~~i~~~l~~~i~~~~d~~~l~G~G~-~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~------ 336 (458) T protein:vir:10 265 -FSLLPLLRKRLIEAHAVSIEEAFMTGDGS-GKPKGLLTLASEDSAKVVTEAKADGSVLVTAKTISKLRRKLGR------ 336 (458) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHhhcCCCC-CccceeeecccccccceeecccccccccccHHHHHHHHHhhhh------ Confidence 35777777777788888889999999533 346799999876533222111 2334566666654443322 Q ss_pred CceecCCeEEeCHHHHHHHhccccCCCcchhhhh-HHHhhcch----hcCCcceEEeechhhhhcccccCCCccEEEEEE Q lcl|NC_019525. 216 DYTAMPNKFTIPESDYTGLAGAASADFPIKSTKQ-VLEDTFKE----ITRNSSFEILPCVYADKITAQVPAVAKRYALYN 290 (343) Q Consensus 216 ~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~-~l~~n~~~----~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~ 290 (343) .+ ..+..++|.+..|..|..- -++.+.-+.. .+...... .-.|.|+.+. .++. 
+ +.+.++ + +|- T Consensus 337 ~~-~~~~~~v~~~~~~~~l~~l--kd~~G~~i~~~~~~~~~~~~~~~~l~G~pv~~~--~~~p---~-~~~~~~-~-~~~ 405 (458) T protein:vir:10 337 HG-LKLSKLVLIVSMDAYYDLL--EDEEWQDVAQVGNDSVKLQGQVGRIYGLPVVVS--EYFP---A-KANSAE-F-AVI 405 (458) T ss_pred hh-cCCCEEEEcHHHHHHHHhh--cccCCceeeccccccccccCcCceecceeeEEc--cccc---c-ccCCcc-e-EEE Confidence 22 2356789999999998643 2433332221 22222111 1236665432 2222 1 111122 1 222 Q ss_pred cCcceEEEecCCchhhc-chhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 291 DNEDSLRMDIPVDYTST-LANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 291 ~d~~~v~~~iP~~~~~l-~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) ...+.+.+..-..++.. -+.-..+ ...+-...|+| ..+++|.+++....+- T Consensus 406 ~f~~~~~~~~~~~~~v~~d~~~~~~-~~~~~~~~r~~-~~v~~~~a~v~~~~aa 457 (458) T protein:vir:10 406 VYKDNFVMPRQRAVTVERERQAGKQ-RDAYYVTQRVN-LQRYFANGVVSGTYAA 457 (458) T ss_pred EecccEEEEEeeceEEEeecccCCC-ceEEEEEEEec-ceEecccceEEEeecc Confidence 22222222111111111 1111112 22234467775 6788999999988888 No 51 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=97.30 E-value=3.8e-05 Score=44.79 Aligned_cols=294 Identities=9% Similarity=-0.030 Sum_probs=138.7 Q ss_pred eeecCCchHHHHHHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEeeeccc Q lcl|NC_019525. 5 VIRNSKGEKILLNAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTTYRSFS 84 (343) Q Consensus 5 ~~~~~~~~~~~~~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~~~~vg 84 (343) +.|... +-.-++... .....+.+..+ ..+ +...|++...+.-.-+.+.++.... 
.....+-....-+ T Consensus 1 ~~~~~~--~~~e~~~~~------~~~~~~~~~~i-p~~---~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~ip~~~~~~ 67 (318) T protein:vir:24 1 MAAGTA--FAVDHAQIA------QTGDTMFKGYL-EPE---QAKDYFAEAEKTSIVQQFAQKVPMG-TTGQKIPHWVGDV 67 (318) T ss_pred CCCCCC--CCHHHHHhh------cccCcccceee-chh---HHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEeCCc Confidence 222221 111111111 01112222222 221 2233333323332223333322111 1122222233334 Q ss_pred hhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEee Q lcl|NC_019525. 85 LAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFV 164 (343) Q Consensus 85 ~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~ 164 (343) .|.+++.+ ..+|..+..+++.....+.++.-..+|.+-|+.+. .++.+.-.+...+++...+|+..++ T Consensus 68 ~a~~v~Eg--------~~~~~~~~~f~~i~~~~~k~~~~~~iS~e~l~ds~----~~~~~~i~~~l~~~~~~~~d~a~l~ 135 (318) T protein:vir:24 68 SAQWIGEG--------DMKPITKGNMTSQTIAPHKIATIFVASAETVRANP----ANYLGTMRTKVATAFAMAFDGAAMH 135 (318) T ss_pred ceEEecCC--------ccccccccceeEEEEeeEEEEEeehhhHHHhhcCh----HHHHHHHHHHHHHHHHHHHHHhhhc Confidence 45555432 46888888999999999999998888887666432 4688888888888899999999999 Q ss_pred eeccccceeeeeecCCc-eeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCCCc Q lcl|NC_019525. 165 GMKDNANVKGLLTQTGN-VVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFP 243 (343) Q Consensus 165 G~~~~~g~~GLlN~p~v-~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~ 243 (343) |..... -.|+++.... ......+..++ .+ +++.+++..+... -..+..++|-|+.|..|.+.+ +.. 
T Consensus 136 G~g~~~-~~~~~~~~~~~~~~~~~~~~~~----~~---~~~~~~~~~~~~~---~~~~~~~v~n~~~~~~L~~lk--d~~ 202 (318) T protein:vir:24 136 GTDSPF-PTYIGQTTKAISIADTTGATTV----YD---QVAVNGLSLLVND---GKKWTHTLLDDITEPILNGAK--DQN 202 (318) T ss_pred ccCCCC-Ccccccccccccccccccccch----HH---HHHHHHHHhhccc---cCCCCEEEEcHHHHHHHHHhh--ccC Confidence 964322 2355544322 12111111111 11 2233333333222 234568999999999997543 333 Q ss_pred chhh-hhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcC------cceEEEecCCchhhc--------- Q lcl|NC_019525. 244 IKST-KQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDN------EDSLRMDIPVDYTST--------- 307 (343) Q Consensus 244 ~~tl-~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d------~~~v~~~iP~~~~~l--------- 307 (343) +..+ ..-..........|.++.-.|+.....+. .++..++.-+-+ .+-+++++--..... T Consensus 203 G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~----~~~~~~~~gdfs~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~ 278 (318) T protein:vir:24 203 GRPLFIESTYGEAASPFRSGRIVARPTILSDHVV----EGTTVGFMGDFSQLIWGQIGGLSFDVTDQATLNLGTVESPNF 278 (318) T ss_pred CceeecCccccCccccccCceEEEEeeEEeCCCC----CCccEEEEeecceEEEEEecCeEEEEeeccceeccccccccc Confidence 3322 11122222222234555555655433322 233333322222 222223221111100 Q ss_pred -chhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 
308 -LANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 308 -~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) --++++- ..+-+.+|+++ .+++|++++.+-..+ T Consensus 279 ~~~f~~~~--~~~r~~~r~d~-~v~~~~a~~~i~~~~ 312 (318) T protein:vir:24 279 VSLWQHNL--VAVRVEAEYAF-HCNDAEAFVALTNVV 312 (318) T ss_pred hhhhhcCc--EEEEEEEEEcc-EEecccceEEEEeec Confidence 0023332 23345777855 457899999988877 No 52 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=97.24 E-value=4.3e-05 Score=44.52 Aligned_cols=300 Identities=11% Similarity=-0.014 Sum_probs=138.7 Q ss_pred CCceeeecCCchHHHHHHHHhhhccchhhhh-hhhhhhhhHHHHHHhhhhhhhcccccccchhhcceec-CCCCceeEEE Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAKIAGVIQRLC-NDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRV-GEGAWSTMLT 78 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~-~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~-~~~~w~~~~~ 78 (343) +.+|+. +...+ -+.+.+.. .+.... .+.| +++. +.+.+.|++...+...-+.+.++.+ ..+ ...+. T Consensus 87 ~~~~lr-~~~~~-~~~~~e~~----a~~~~~~~~GG--~~iP--~~~~~~ii~~~~~~~~l~~~~~~~~~~~~--~~~~~ 154 (401) T protein:vir:44 87 FVGFLR-KGRED-GLRDLERK----ALQVGTDEDGG--YAVP--EELDRSILSLLKDEVVMRQEATVITVGGS--DYKKL 154 (401) T ss_pred HHHHHh-hhhhh-hhHHHHHH----HhhcCCCCCCc--eecc--HhHHHHHHHHHHhhhhhhhhceeeecCCC--ceEEE Confidence 222221 11110 00000000 000111 1222 3333 2334445554333322333333221 221 11121 Q ss_pred eeeccchhhhhhcccccCCcccccccccc-ccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhh Q lcl|NC_019525. 79 TYRSFSLAEDFATGIIDTGNSNGKLAAVD-TGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLG 157 (343) Q Consensus 79 ~~~~vg~a~~ia~g~~~~g~~a~Dip~vd-~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~ 157 (343) ....-..+.+++.+ ...|..+ ..+++....++.++.-+.+|.+=|+.+ . .+|.+.-......++... 
T Consensus 155 ~~~~~~~a~wv~E~--------~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds---~-~~l~~~i~~~la~ai~~~ 222 (401) T protein:vir:44 155 VNLGGTASGWVGET--------DTRSQTATSRLGLIEPFMGEIYGNPQATQKMLDDA---F-FNVEAWINSELATEFAEQ 222 (401) T ss_pred EecCCccceeeccc--------cccCccccccceeeeeehhheeeehhhhHHHHhcc---h-HHHHHHHHHHHHHHHHHH Confidence 11222233444322 2345444 368888888888888888887766643 2 468888888888888899 Q ss_pred hhheEeeeeccccceeeeeecCCceeecccCC---------cccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCH Q lcl|NC_019525. 158 IQEIAFVGMKDNANVKGLLTQTGNVVNNTFLT---------KSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPE 228 (343) Q Consensus 158 ~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~---------~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~ 228 (343) ++...++|+-. ....|+++++.+...+.+.. ..-+.-|.++|++-++.+... +. ....++|.+ T Consensus 223 ~~~~~l~G~G~-~~p~Gil~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~~~~l~~~------~~-~~a~~v~n~ 294 (401) T protein:vir:44 223 EEIAFTTGDGT-KKPKGFLAYESTEESDKARAFGKLQHIVSGEATAVTADAIIKLIYTLRKA------HR-TGAKFMMNN 294 (401) T ss_pred HHhhhhccCCC-CccceeeccccccccccccccccccccccccccccCHHHHHHHHHhcchh------hh-cCCEEEEcH Confidence 99999999544 34679999887644322111 011223345555444433222 21 123688999 Q ss_pred HHHHHHhccccCCCcchhhh-hHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecCCchhhc Q lcl|NC_019525. 229 SDYTGLAGAASADFPIKSTK-QVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPVDYTST 307 (343) Q Consensus 229 ~~~~~L~~~~~s~~~~~tl~-~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~~~l 307 (343) +.|..|..- -+..+.-|. .-+++..+..-.|.|+.+. 
...-..++ +++.+++-+-+ +.+.+---+.++.+ T Consensus 295 ~~~~~L~~l--kd~~G~~l~~~~~~~g~~~~l~G~PVv~~-----~~~p~~~~-~~~~i~~Gd~~-~~~~i~~~~~~~~~ 365 (401) T protein:vir:44 295 NSLFAIRLL--KDTEGNYLWRPGLELGQPSSLAGYGIAEN-----EQMPDIAA-DAKAIAFGNFK-RGYTIVDRIGTRIL 365 (401) T ss_pred HHHHHHHHh--hccCCceeecCCcCCCCCceecceeeEEe-----cCcCCccC-CccEEEEeehh-ccEEEEEecceEEe Confidence 999999643 344443332 1111212212235654332 22222222 22333222222 22222111111111 Q ss_pred c-hhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 308 L-ANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 308 ~-p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) . +.-..+ ...+-++.|+||..+. |.+++.+-++- T Consensus 366 ~~~~~~~~-~v~~~a~~r~d~~~~~-~~a~~~l~~~a 400 (401) T protein:vir:44 366 RDPYTNKP-FVGFYTTKRTGGMLVD-SQAIKLLKIAA 400 (401) T ss_pred eeccccCC-cEEEEEEEEeccEEec-ccceEEEEeec Confidence 1 111112 2334457788887666 99999999999 No 53 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=97.22 E-value=0.0001 Score=42.48 Aligned_cols=299 Identities=11% Similarity=-0.029 Sum_probs=137.4 Q ss_pred CCceeeecCCchHHHHHHHHhhhccchhh-hhhhhhhhhhHHHHHHhhhhhhhcccccccchhhccee-cCCCCceeEEE Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAKIAGVIQR-LCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIR-VGEGAWSTMLT 78 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~-~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~-~~~~~w~~~~~ 78 (343) ..+|+. ++.++ .+. .... ..+.. ...+.| +++. +.+...|++.......-+.+..+- +..+ ...+. 
T Consensus 86 ~~~~l~-~g~~~-~~~-~~e~---~a~~~~t~~~gG--~~iP--~~~~~~I~~~~~~~~~l~~~~~~~~~~~~--~~~~~ 153 (407) T protein:vir:48 86 FIGFMR-KGRED-GLR-ELER---KALQVGNDEDGG--YAIP--EELDRTILTLLKDEVVMRQEATVITLGGS--DYKKL 153 (407) T ss_pred HHHHHh-ccchh-hhh-HHHH---HhhhcccCCCCc--cccc--HhHHHHHHHHHHhhhhhhhhceeeecCCC--ceEEE Confidence 111111 11100 000 0000 00000 111222 2222 233445555433333333333321 1221 11221 Q ss_pred eeeccchhhhhhcccccCCcccccccccc-ccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhh Q lcl|NC_019525. 79 TYRSFSLAEDFATGIIDTGNSNGKLAAVD-TGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLG 157 (343) Q Consensus 79 ~~~~vg~a~~ia~g~~~~g~~a~Dip~vd-~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~ 157 (343) ....-..+.+++.+ ..+|..+ ..+......++.++.-+.+|.+=|+.+. .+|.+--.....+++... T Consensus 154 ~~~~~~~a~~v~E~--------~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~~i~~~ 221 (407) T protein:vir:48 154 VNLGGTTSGWVGET--------DARPETATSKLGLIEPFMGEIYGNPQATQKMLDDAF----FNVEDWINSELALEFAEQ 221 (407) T ss_pred EecCCcceeeeccc--------ccccccccccceeEEeeeeeeEeehhhHHHHHhcch----HHHHHHHHHHHHHHHHHH Confidence 11222334444332 2455444 3678888888998888888877666432 367777778888888888 Q ss_pred hhheEeeeeccccceeeeeecCCceeecccCC---------cccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCH Q lcl|NC_019525. 158 IQEIAFVGMKDNANVKGLLTQTGNVVNNTFLT---------KSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPE 228 (343) Q Consensus 158 ~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~---------~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~ 228 (343) +++..++|+-.. ...|+++++.+...+.... ..-...|.++|++.++.+...... 
...++|.+ T Consensus 222 ~~~a~l~G~G~~-~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~-------~a~~v~n~ 293 (407) T protein:vir:48 222 EEIAFTSGDGSK-KPKGFLAYESTDEDDKTRAFGKLQHIASGAASGVTADAIIKLIYTLRKAHRS-------GAKFMMNN 293 (407) T ss_pred HHhhhhccCCCC-ccceeeecccccccccccccccccccccccccccChHHHHHHHHhhchhhhc-------CCEEEEcH Confidence 999999996443 4679999988654322110 111223455555555443332221 22578889 Q ss_pred HHHHHHhccccCCCcchhhh-hHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEE-ecCCchhh Q lcl|NC_019525. 229 SDYTGLAGAASADFPIKSTK-QVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRM-DIPVDYTS 306 (343) Q Consensus 229 ~~~~~L~~~~~s~~~~~tl~-~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~-~iP~~~~~ 306 (343) ..|..|.. +-+..+.-|+ .=+....+..-.|.|+-+.. .+. ..+. +++.+++-+-+.-+.-+ ..=+.+.. T Consensus 294 ~~~~~L~~--lkD~~Gr~l~~~~~~~g~~~~l~G~PV~~~~--~~p---~~~~-~~~~i~~Gd~~~~~~i~~~~~~~i~~ 365 (407) T protein:vir:48 294 SSLFAIRL--LKDNDGNYLWRPGIELGQPSSLAGYGIVENE--QMP---DIAA-DAKAIAFGNFKRGYTIVDRIGTRILR 365 (407) T ss_pred HHHHHHHH--hhccCCceeeccCcCCCCCceecceeeEEec--CcC---CccC-CccEEEEEeccccEEEEEeeceEEEe Confidence 99988854 3344443331 11111112122356654332 122 2222 23333322222211111 01111111 Q ss_pred cchh-hcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 307 TLAN-SVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 307 l~p~-~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) -+. ..+ ....-++.|++|. ++.|++++++.+.. 
T Consensus 366 -d~~~~~~--~~~~~~~~r~d~~-v~~~~a~~~l~~~a 399 (407) T protein:vir:48 366 -DPYTNKP--FVGFYTTKRTGGM-LVDSQAIKLMKIGA 399 (407) T ss_pred -eccccCC--cEEEEEEEEeccE-EecccceEEEEeec Confidence 111 122 2233467888875 55699999999988 No 54 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=97.21 E-value=4.9e-05 Score=44.20 Aligned_cols=298 Identities=12% Similarity=0.044 Sum_probs=135.1 Q ss_pred CCceeeecCCchHHHHHH----------HHhhhccch-hhhhhhhhhhhhHHHHHHhhhhhhhc--cccccc-chhhcce Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNA----------QEAKIAGVI-QRLCNDLGFEIDVTTLTTLMKKIIEQ--KFFEIS-PADYMPI 66 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a----------~~~~~~~~~-~~~~~d~~~~f~~~qL~~i~~~iye~--~~~~l~-~~~~~pv 66 (343) .++-..-+.+-.....++ +........ .......|. +...+. ....|.+. +++.|. ....+|+ T Consensus 74 ~~~~~~~~~~~~~~~~~~~~r~~~~~~~r~~~~~~~~~~~t~~~~g~-~~~~~~--~~~~i~~~~~~~~~l~~~~~~~~~ 150 (390) T protein:vir:62 74 LQGSGSGAQRSADVDDDATLRAGNLGEARSFEFAPEKRDGTKAGNPN-VLSRTL--YGQLIAQAVERSAIMRGGATTFTT 150 (390) T ss_pred cccccccchhhcchHHHHHHhhhhhhhhHHHHhhhhhhcccccCCCc-cccccc--hHHHHHHHHhhhhhhhhcceeeec Confidence 111111000000000000 000000000 001111221 221111 11112211 122221 1122332 Q ss_pred ecCCCCceeEEEeeeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHH Q lcl|NC_019525. 67 RVGEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITAL 146 (343) Q Consensus 67 ~~~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k 146 (343) . .+ -.-.+......+.|.+++.+ ..+|.-+..+++.+..++.++.-+..|.+=|+.+. 
++|.+-- T Consensus 151 ~--~~-~~~~~p~~~~~~~a~wv~E~--------~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~----~~l~~~i 215 (390) T protein:vir:62 151 S--DA-NPLDFTVITGRSSASIVGET--------AEIPESYPATAQRSMGGFKYGFASVVSYEFATDQV----LDLVGFL 215 (390) T ss_pred C--CC-ceeEEEEEcCCcceeeeccc--------ccccccccceeeeEeeeeeEEeehHHHHHHHhhhh----HHHHHHH Confidence 1 11 11122223333455555432 36788888999999999999998888877776543 4688877 Q ss_pred HHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEe Q lcl|NC_019525. 147 EKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTI 226 (343) Q Consensus 147 ~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~l 226 (343) ......++...+|+..++|+ . +. .|++|+++....+...+ .-...|.+++++.++.+...... --.++| T Consensus 216 ~~~l~~~i~~~~d~~~l~G~-G-~p-~Gi~~~~~~~~~~~~~~-~~~~~~~~~l~~~~~~l~~~~~~-------~a~~vm 284 (390) T protein:vir:62 216 VSDAGPAIGDAMGRHFITGT-G-QP-RGILTDASPATATFLAT-DTDSKVSDALIDLFHEVPSAYRA-------NAKYVV 284 (390) T ss_pred HHHHHHHHHHHHHhhhhccC-C-cc-ccccccccccccceecc-cccccchHHHHHHHHhhhhhhhc-------CCEEEE Confidence 88888888889999999994 3 22 58999887643222111 11345677776666554333211 125788 Q ss_pred CHHHHHHHhccccCCCcchhhh-hHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecCCchh Q lcl|NC_019525. 227 PESDYTGLAGAASADFPIKSTK-QVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPVDYT 305 (343) Q Consensus 227 p~~~~~~L~~~~~s~~~~~tl~-~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~~ 305 (343) .++.|..|..-+ +..+.-|+ .-+....+..-.|.|+.+.. ... .+.++.-+-+.=.+...-.+... 
T Consensus 285 n~~~~~~L~~lk--d~~g~~l~~~~~~~g~~~~l~G~Pv~~~~-----~~p------~~~i~~gd~s~~~i~~~~~~~v~ 351 (390) T protein:vir:62 285 NDLRAAQMRKLK--DANGQYLWQSGLTVGAPSLFNGKVVETDD-----GMP------ADKILFADLSKYRVRFAGSLRVD 351 (390) T ss_pred chHHHHHHHHhh--ccCCCeeecCCcCCCccceecccceEEec-----CCC------CccEEEeeccceeEEeecceEEE Confidence 889899986432 33332221 11111111222455554332 111 12222111111111111122222 Q ss_pred hcc-h-hhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 306 STL-A-NSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 306 ~l~-p-~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) ... + ...+... +-++.|++| .++.|.+++.+.+-= T Consensus 352 ~~~~~~~~~~~~~--~~~~~r~d~-~~~~~~A~~~l~~~~ 388 (390) T protein:vir:62 352 RSVDAKFSTDQIV--YRFLQRADG-LLVDARGAKVLTVTP 388 (390) T ss_pred eeccccccCCcEE--EEEEEEeCc-EeechhheEEEEeec Confidence 111 1 1222222 345788886 588899998888655 No 55 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=97.19 E-value=0.00014 Score=41.79 Aligned_cols=309 Identities=10% Similarity=-0.009 Sum_probs=145.4 Q ss_pred CCceeeecCCchHHHHH--HHH-----------------hhhc-------cchhhhhhhhhhhhhHHHHHHhhhhhhhcc Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLN--AQE-----------------AKIA-------GVIQRLCNDLGFEIDVTTLTTLMKKIIEQK 54 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~--a~~-----------------~~~~-------~~~~~~~~d~~~~f~~~qL~~i~~~iye~~ 54 (343) ..+-+.-....++..-. ++. .... ..........|. +++. +.+...|++.. 
T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg-~lvP--~~~~~~ii~~l 156 (435) T protein:vir:80 80 AAAPVYAQPKAPEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGG-VLVP--ENLSSEVIELL 156 (435) T ss_pred cccccccccchhhhhHHHHHHHHHHHHhccchhHHHHHHHHhhhhhhhhhhhhcccCCCCCc-cccc--hhHHHHHHHHH Confidence 11111000000000000 000 0000 000000111111 1222 12233444432 Q ss_pred cccccchh----hcceecCCCCceeEEEeeeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHH Q lcl|NC_019525. 55 FFEISPAD----YMPIRVGEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPE 130 (343) Q Consensus 55 ~~~l~~~~----~~pv~~~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~E 130 (343) .+.-..+. .+|. ..+ ...+...+..+.+.+++.+ +.+|.-+..+++.+...+.++.-+.+|.+= T Consensus 157 ~~~~~i~~~~~~~v~~--~~~--~~~~p~~~~~~~a~~v~E~--------~~~~~~~~~f~~i~~~~~k~~~~~~is~el 224 (435) T protein:vir:80 157 RPKSVVRKLGARTLPL--SNG--NITIPRLKGGAIVGYIGAD--------TDIPTTQQQFDDLKLTAKKMAALVPIANDL 224 (435) T ss_pred hhhchhhhccceeeec--CCC--ceEEEEEeCCcceeeeccC--------ccccccccceeeEEEeeEEEEEeehhhHHH Confidence 22222222 2332 111 1222223333334444332 357888889999999999999998888776 Q ss_pred HHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHH Q lcl|NC_019525. 131 LAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDV 210 (343) Q Consensus 131 L~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~ 210 (343) |+.+.. +.+|.+--.+....++...+++..++|+-......|++++..+..... .-+..|.+.+..|+.+++.. 
T Consensus 225 l~ds~~--~~~l~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~----~~~~~~~~~~~~d~~~~~~~ 298 (435) T protein:vir:80 225 IKYAGV--NPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPGNVIT----ASDGSTLQKIETDLGKAILA 298 (435) T ss_pred HHhhcc--cHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCcccceeecccccceee----cccccchhhHHHHHHHHHHH Confidence 665432 135777778888888888999999999533334679999876533211 12245667777888888877 Q ss_pred HHhcCCceecCCeEEeCHHHHHHHhccccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhh-cccccCCCccEEEEE Q lcl|NC_019525. 211 YRQGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADK-ITAQVPAVAKRYALY 289 (343) Q Consensus 211 v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~-~~~~g~gg~drmv~Y 289 (343) +.....+ ..+..++|.+..+..|..-+ +..+..++. ......-.|.|+.+.. .+.. .+.+ +....+++- T Consensus 299 ~~~~~~~-~~~~~~vmn~~~~~~L~~lk--d~~G~~l~~---~~~~~~l~G~pv~~~~--~~p~~~~~~--~~~~~i~~g 368 (435) T protein:vir:80 299 LENADAN-LTQPGWIMAPRTFRFLEGLR--DGNGNKVYP---ELANGMLKGYPVGKTT--QVPINLGEA--GKESEIYFT 368 (435) T ss_pred hhccccc-cccCEEEEcHHHHHHHHhhh--ccCCceecc---CCCCCeEeeeeeEEec--cccccccCC--CCcceEEEE Confidence 7665433 24568999999999996544 333333321 1111112366654322 1211 1111 111122222 Q ss_pred EcCc------ceEEEecCCchhh-------cchhhcCCceEEeceeeeeccEEEEcCceEEeeec-CC Q lcl|NC_019525. 290 NDNE------DSLRMDIPVDYTS-------TLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDI-PV 343 (343) Q Consensus 290 ~~d~------~~v~~~iP~~~~~-------l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~-~~ 343 (343) +-+. .-+++++--.... ..-++++.. 
.+-++.|++ +.++.|.+++++.- .+ T Consensus 369 d~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~f~~n~~--~~r~~~r~d-~~~~~~~a~~~l~~~~~ 433 (435) T protein:vir:80 369 DFGDVFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQT--LIRVIAKND-FGPRHVESIAVLSGVAW 433 (435) T ss_pred EcccEEEEeecceEEEEeccccccccccchhhhhhcCcc--eeeeeeeeC-cEeecccceEEEeccCC Confidence 2211 1222221000000 011333322 234566665 46788999998853 33 No 56 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=97.17 E-value=0.00014 Score=41.67 Aligned_cols=312 Identities=11% Similarity=-0.064 Sum_probs=142.4 Q ss_pred eeeecCCchHHHHHHHHhhhcc-chhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEeeec Q lcl|NC_019525. 4 FVIRNSKGEKILLNAQEAKIAG-VIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTTYRS 82 (343) Q Consensus 4 ~~~~~~~~~~~~~~a~~~~~~~-~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~~~~ 82 (343) +..+ +..+....+ ..+......+..+...+ +...|++.-.+.-.-+.+.++..-. .....+-.... T Consensus 1 ~~~~---------~e~~~~~~~~~~~~~~~~~~~~liP~~---~~~~ii~~~~~~s~l~~l~~~~~~~-~~~~~ip~~~~ 67 (338) T protein:vir:78 1 MATL---------NELAPNTAGSNHQGRLAHVPSDLLPKE---IVGPIFDKAQESSLVLRLGENIPIS-YGETIIPTTVK 67 (338) T ss_pred Ccch---------HHhhhhhcccccccceecccccccchH---HHHHHHHHHHhhchhhhhcceeecc-CCceEEEEEec Confidence 1111 111111111 11111111111222222 2233444433333344444433211 12222222221 Q ss_pred cchhhhhhcc-cccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhhe Q lcl|NC_019525. 83 FSLAEDFATG-IIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEI 161 (343) Q Consensus 83 vg~a~~ia~g-~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v 161 (343) -..|.+++.+ ..+.+. ...+|.-+..+++.....++++.-+.+|.+=|+.+. .++.+.-.+...+++...+|+. 
T Consensus 68 ~~~a~~v~~~~~~~~~E-g~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~----~~~~~~i~~~la~a~~~~~d~~ 142 (338) T protein:vir:78 68 RPEVGQVGVGTSNEQRE-GGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARMNP----SGLYTKLQADLAYAIGRGIDLA 142 (338) T ss_pred Cccceeecccccccccc-cccccccccceeEEEEEEEEEEEeehhhHHHHhcCH----HHHHHHHHHHHHHHHHHHHHHH Confidence 1122222211 112222 246888888899999999999988888876555432 4677777888888888999999 Q ss_pred Eeeeeccc--cceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhcc-c Q lcl|NC_019525. 162 AFVGMKDN--ANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGA-A 238 (343) Q Consensus 162 ~~~G~~~~--~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~-~ 238 (343) .++|+.+. .+..|++++..+...+.. +..+ ...+..++++.+++..+..+..+ .++.++|-|..+..|..- . T Consensus 143 ~l~G~g~~~~~~~~gi~~~~~~~~~~~~-~~~~--~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~m~~~~~~~L~~~~~ 217 (338) T protein:vir:78 143 VFHGKSPLTGSALQGIDTNNVIVNTTNV-DYLQ--TGTTPLLDRFLDGYDLVSANTDV--DFNGWAADPRYRARLLRSQA 217 (338) T ss_pred hhcccCCCcccccccccccccccccccc-cccc--ccchhhHHHHHHHHHHhhhhccc--cceEEEEchHHHHHHHHHhh Confidence 99996532 345677776665332221 1111 22334567777777766544333 466799999988887542 2 Q ss_pred cCCCcchhhhh-HHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecCCchhhcc--------- Q lcl|NC_019525. 239 SADFPIKSTKQ-VLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPVDYTSTL--------- 308 (343) Q Consensus 239 ~s~~~~~tl~~-~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~~~l~--------- 308 (343) ..+..+.-++. .........-.|.|+.+-. .+.....+..+.+..+++-+-+.=.+...-.+.+.... 
T Consensus 218 l~d~~g~~l~~~~~~~~~~~~l~G~PV~~~~--~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~ 295 (338) T protein:vir:78 218 YRDANGNVDPTRINLAASAGDLLGLPVQFGK--AVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRVKMSDTATLTDNTS 295 (338) T ss_pred hccCCCceeecccccCCCCceeeeeeEEEcc--ccCccccccCCcccEEEEEecceEEEEeecccEEEEeeccccccccc Confidence 33443333321 1111111122345543211 11111111222223333333322111111122211110 Q ss_pred h-------hhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 309 A-------NSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 309 p-------~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) | .+++- .-+-|+.|+|+ .++.|.+++++---= T Consensus 296 ~~~~~~~~~~~~~--~~~r~~~r~d~-~v~~~~a~~~l~~~~ 334 (338) T protein:vir:78 296 PTPQTVSMWQTNQ--IAILIEVTFGW-LLGDKQAFVKFVDDE 334 (338) T ss_pred ccccchhhhhcCc--EEEEEEEEecc-EeecccceEEEeccc Confidence 0 22221 22345777765 466788887753322 No 57 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=97.05 E-value=0.00019 Score=40.95 Aligned_cols=303 Identities=8% Similarity=-0.031 Sum_probs=132.0 Q ss_pred CCceee-ecCCchHHHHHHHHhhhccchhhhh--hhhhhhhhHHHHHHhhhhhhhcccccccch---hhcceecCCCCce Q lcl|NC_019525. 1 MKKFVI-RNSKGEKILLNAQEAKIAGVIQRLC--NDLGFEIDVTTLTTLMKKIIEQKFFEISPA---DYMPIRVGEGAWS 74 (343) Q Consensus 1 ~~~~~~-~~~~~~~~~~~a~~~~~~~~~~~~~--~d~~~~f~~~qL~~i~~~iye~~~~~l~~~---~~~pv~~~~~~w~ 74 (343) +.+.+. .+..++....-.............. .+.| .+.+. +.+...|++...+...-+ ..+|+.+ +.+. 
T Consensus 92 ~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~g-~~~iP--~~~~~~ii~~~~~~~~l~~~~~~~~~~~--~~~~ 166 (415) T protein:vir:94 92 LGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSG-FVVIP--EEIVTDILKLKEVEFNLDKYVTVKRVTN--GSGK 166 (415) T ss_pred HHhhhhhhhhhHHHHHHHHHHhhhhhhhhhhccccccc-cccCc--HHHHHHHHHHHHhhhhhhhhcceeeccC--Ccee Confidence 000000 0001110000000000000111100 1112 22222 223334444433333333 3444432 2233 Q ss_pred eEEEeeeccchhhhhhcccccCCcccccccccc-ccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHH Q lcl|NC_019525. 75 TMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVD-TGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKN 153 (343) Q Consensus 75 ~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd-~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a 153 (343) ..+......+.+.+++.+ .++|..+ ..+.+....++.++.-+.+|.+=|+.+ . .+|.+--.....++ T Consensus 167 ~~~~~~~~~~~~~~v~Eg--------~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds---~-~~~~~~i~~~l~~~ 234 (415) T protein:vir:94 167 YPVVRQSEVAALEKVEEL--------EENPELAVKPFFQLAYDINTHRGYFRISREAIEDA---K-VNVLQELKLWMART 234 (415) T ss_pred EEEEeecCCccceecccc--------ccccccccccceeeEeeheeeeeechhhHHHHhhc---h-HHHHHHHHHHHHHH Confidence 233333333445555432 3566543 468899999999998888887644432 2 46777777888888 Q ss_pred HHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHH Q lcl|NC_019525. 154 WDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTG 233 (343) Q Consensus 154 ~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~ 233 (343) +...+|+..+.|........++.+......... .-+..+.+.|++ ++..+.. .. ..+..++|-|+.|.. 
T Consensus 235 ~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~----~~~~~~~~~i~~----~~~~~~~-~~--~~~~~~vmn~~~~~~ 303 (415) T protein:vir:94 235 IAATRNKAIIDVITKGSTGSTSSGFEKEGKKLE----VKKAKSLDDIKD----AINLNVK-PN--YEHNVAIVSQTMFAK 303 (415) T ss_pred HHHHHHHHHhhccccCccccccccccccccccc----cccccchHHHHH----HHHhhhh-hc--cCCCEEEEcHHHHHH Confidence 888899999988544333333333222111111 112344555544 4444322 11 246789999999999 Q ss_pred HhccccCCCcchhhh-hHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEe-cCCchhhcchhh Q lcl|NC_019525. 234 LAGAASADFPIKSTK-QVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMD-IPVDYTSTLANS 311 (343) Q Consensus 234 L~~~~~s~~~~~tl~-~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~-iP~~~~~l~p~~ 311 (343) |..- -++.+.-+. .-..+.....-.|.|+.+.+... .+..++..+++-+-+.-++-+. -.+.+... ... T Consensus 304 l~~l--kd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~------~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~-~~~ 374 (415) T protein:vir:94 304 LDKM--KDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEV------LGQKGNNTLIIGNLKDAIVLFDRSQYQASWT-DYM 374 (415) T ss_pred HHHh--hccCCCeeeccCcCCCCCceecceeeEEecccc------cCCCCccEEEEEehhccEEEEeecceEEEEe-ccc Confidence 9653 343333222 11111111222466665544221 1122223333322221122111 11111111 111 Q ss_pred cCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 
312 VNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 312 ~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) ...-.+ -.+.|++ +.+.+|.+++++.+.= T Consensus 375 ~~~~~~--r~~~r~d-~~~~~~~a~~~~~~~~ 403 (415) T protein:vir:94 375 HFGECL--MIAVRQD-CRILDYKSAIVIEYDD 403 (415) T ss_pred cCceEE--EEEEEec-cEEeccccEEEEEEec Confidence 111222 3466775 5666799999998765 No 58 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=96.99 E-value=0.00012 Score=42.12 Aligned_cols=301 Identities=11% Similarity=0.023 Sum_probs=131.3 Q ss_pred CCceeeecCCc---hHHHHHHHHhh-----------hccchhh-hhhhhhhhhhHHHHHHhhhhhhhccccccc-chhhc Q lcl|NC_019525. 1 MKKFVIRNSKG---EKILLNAQEAK-----------IAGVIQR-LCNDLGFEIDVTTLTTLMKKIIEQKFFEIS-PADYM 64 (343) Q Consensus 1 ~~~~~~~~~~~---~~~~~~a~~~~-----------~~~~~~~-~~~d~~~~f~~~qL~~i~~~iye~~~~~l~-~~~~~ 64 (343) +.+.- +.... ......+...+ ....... .....+ .+...+ .....|.+. .++.. .+.+. T Consensus 71 ~~~~~-~~~~~~~~~~~~~~~~~~r~g~~~~~~~~~~~~~~~~~t~~~~g-~~~~~~--~~~~~i~~~-~~~~~~l~~~~ 145 (392) T protein:vir:13 71 LSGLQ-GSGSGAQRSADHDDDAVLRAGNLGEARSFEFAPEKRDGTKAGNP-NVLSRT--LYGQLIAQA-VERSAIMRGGA 145 (392) T ss_pred hcccC-CcccchhhhhhHHHHHHHhccchhhhHHHHhhhhhhcccccCCC-cccccc--chHHHHHHH-Hhhhhhhhhcc Confidence 11000 00000 00000000000 0000000 011111 122211 111112111 11111 11111 Q ss_pred ceecCCCCceeEEEeeeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHH Q lcl|NC_019525. 65 PIRVGEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLIT 144 (343) Q Consensus 65 pv~~~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~ 144 (343) .+....+.-...+...+..+.+.+++ .. ..+|..+..+++.....+.++.-+.+|.+=|+.+. 
.+|.+ T Consensus 146 ~~~~~~~~~~~~~~~~~~~~~a~~v~-------E~-~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~----~~l~~ 213 (392) T protein:vir:13 146 STFTTSDANPMDFTVITGRATAGIVG-------ET-AEIPESYPATTQRSMGGFKYGFASVVSYEFATDQV----LDLVG 213 (392) T ss_pred eeeecCCCceeEEEEEcCCcceeeec-------cc-ccccccccceeeEEeeeeeEEeeehhHHHHHhcch----HHHHH Confidence 11111111111121222223344443 22 36888888999999999999988888877666432 46777 Q ss_pred HHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeE Q lcl|NC_019525. 145 ALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKF 224 (343) Q Consensus 145 ~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl 224 (343) --......++...+|...++|+-.. .-.|+++++.+..... .+..-...+.+.+++-++.+.... . ....+ T Consensus 214 ~i~~~l~~~i~~~~d~~~l~G~Gt~-~p~Gil~~~~~~~~~~-~~~~~~~~~~d~l~~~~~~l~~~~------~-~~a~~ 284 (392) T protein:vir:13 214 FLVSDAGPAIGDAMGRHFLTGTGTG-QPRGILTDATGANAAF-GEADADSKVSDALIDLFHEVPSAY------R-KNAKF 284 (392) T ss_pred HHHHHHHHHHHHHHHHHHhcccCCc-cccccccccccccccc-cccccccccHHHHHHHHHhhhhhh------h-cCCEE Confidence 7777888888888999999995432 3569998876432111 111122345666655444432221 1 13357 Q ss_pred EeCHHHHHHHhccccCCCcchhhhh-HHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecCCc Q lcl|NC_019525. 225 TIPESDYTGLAGAASADFPIKSTKQ-VLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPVD 303 (343) Q Consensus 225 ~lp~~~~~~L~~~~~s~~~~~tl~~-~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~ 303 (343) +|.++.+..|..- -++.+.-++. -+....+..-.|.|+.+.. ... .+.+++-+-+.=.+...-.+. 
T Consensus 285 v~n~~~~~~l~~l--kd~~G~~l~~~~~~~g~~~~l~G~Pv~~~~-----~~~------~~~i~~Gdf~~~~i~~~~~~~ 351 (392) T protein:vir:13 285 VVNDLRAAQMRKL--KDANGQYLWQSALTVGAPDTFNGKVVETDD-----GMP------ADKVLFADLSKYRVRFAGSLR 351 (392) T ss_pred EEcHHHHHHHHHh--hccCCceeecCCcCCCCCceecceeeEEcC-----CCC------CCcEEEeeccceeEEeecceE Confidence 8889999988643 2433332221 0111111112355543321 111 122222221110111111111 Q ss_pred hhhcc-h-hhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 304 YTSTL-A-NSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 304 ~~~l~-p-~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) +.... + ...+ ...+-++.|++|. ++.|.+++.+.+-. T Consensus 352 i~~~~~~~~~~~--~~~~r~~~r~d~~-~~~~~A~~~~~~~~ 390 (392) T protein:vir:13 352 VDRSVDAKFSTD--QIVYRFLQRADGL-LVDARGAKVLTVTP 390 (392) T ss_pred EEeeccccccCC--cEEEEEEEEeccE-EecccceEEEEeec Confidence 11111 1 1122 2334467888866 88899999999888 No 59 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=96.96 E-value=0.00021 Score=40.79 Aligned_cols=287 Identities=9% Similarity=-0.017 Sum_probs=136.5 Q ss_pred hhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceec-CCCCceeEEEeeeccchhhhhhcccccCCcccccccc Q lcl|NC_019525. 27 IQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRV-GEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAA 105 (343) Q Consensus 27 ~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~-~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~ 105 (343) +.......|....-.++ ...|++.-.+.-.-+++.++.. ..+ .-.+.....-+.|.+++.+ ..+|. 
T Consensus 1 Ma~~~~~~gg~~vP~~~---~~~ii~~l~~~s~i~~l~~~i~~~~~--~~~ip~~~~~~~a~wv~Eg--------~~~~~ 67 (315) T protein:vir:80 1 MADDFLSAGKLELPGSM---IGAVRDRAIDSGVLAKLSPEQPTIFG--PVKGAVFSGVPRAKIVGEG--------EVKPS 67 (315) T ss_pred CCCCcCCcCceEcchHH---HHHHHHHHHhhchhhhhcceeecCCC--ceEEEEEeCCcceEEeeCC--------ccccc Confidence 11111222322222322 2334443233322333333221 111 1233334444555666543 46888 Q ss_pred ccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeeccc--cceeeeeecCCcee Q lcl|NC_019525. 106 VDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDN--ANVKGLLTQTGNVV 183 (343) Q Consensus 106 vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~--~g~~GLlN~p~v~~ 183 (343) -+..+++.....++++.-..+|.+=|+.....---.|.+.-.+...+++.+.+|+..++|.... .+..|+.+.-+.+. T Consensus 68 s~~~f~~v~l~~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~ 147 (315) T protein:vir:80 68 ASVDVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTK 147 (315) T ss_pred cccceeeeEeeeeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCcccccccccccccc Confidence 8889999999999998887777765543221100137777788888899999999999995422 22334443322111 Q ss_pred ecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCCCcchhhhhHHHhhc----chhc Q lcl|NC_019525. 184 NNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTKQVLEDTF----KEIT 259 (343) Q Consensus 184 ~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~~l~~n~----~~~~ 259 (343) . .........+|+.+++..+..... ..++.++|-|+.+..|.+-+..+.. -+...++.... 
+..- T Consensus 148 ~--------~~~~~~~~~~d~~~~~~~~~~~~~--~~~~~~imn~~~~~~L~~l~~~~g~-~~~g~~~~~~~~~g~~~tl 216 (315) T protein:vir:80 148 N--------IVDATDSATADLVKAVGLIAGAGL--QVPNGVALDPAFSFALSTEVYPKGS-PLAGQPMYPAAGFAGLDNW 216 (315) T ss_pred c--------eeeccccchHHHHHHHHHHhhccC--ccceEEEEcHHHHHHHHHHhhccCC-cccccccccccccCCCcee Confidence 1 111122334566666666543322 2456799999999998654433221 11122222111 1112 Q ss_pred CCcceEEeechhhhhcccccCCCccEEEEEEcC------cceEEEecCCchhhc-----chhhcCCceEEeceeeeeccE Q lcl|NC_019525. 260 RNSSFEILPCVYADKITAQVPAVAKRYALYNDN------EDSLRMDIPVDYTST-----LANSVNNFQFQNAAYGQFTGV 328 (343) Q Consensus 260 ~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d------~~~v~~~iP~~~~~l-----~p~~~~~l~~~v~~~~r~GGv 328 (343) .|.|+.+-. .+......+.+.+..++..+-+ .+.+++++- +.... --.+.+-.. +-++.|+| . T Consensus 217 ~G~PV~~~~--~~~~~~~~~~~~~~~~~~GDfs~~~~g~~~~~~i~i~-~~~~~~~~~~~~~~~~~v~--~r~~~r~~-~ 290 (315) T protein:vir:80 217 RGLNVGASS--TVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELI-EYGDPDQTGRDLKGHNEVM--VRAEAVLY-V 290 (315) T ss_pred cceeeEecC--cCCcccccccccccEEEEeecccEEEEEecCeeEEEe-ccccccCcccchhhcCcEE--EEEEEEec-c Confidence 355543211 1211111222223334433322 223333321 00000 002232223 34566665 4 Q ss_pred EEEcCceEEeeecCC Q lcl|NC_019525. 329 LAYRPKELLYLDIPV 343 (343) Q Consensus 329 ~v~yP~a~~Y~D~~~ 343 (343) .++.|++++++-... T Consensus 291 ~v~~~~a~~~l~~~~ 305 (315) T protein:vir:80 291 AIESLDSFAVVKEKA 305 (315) T ss_pred eeecccceEEEeecc Confidence 688999999988665 No 60 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=96.95 E-value=0.00024 Score=40.41 Aligned_cols=275 Identities=10% Similarity=-0.013 Sum_probs=138.3 Q ss_pred hccchhhhh--hhhhhhhhHHHHHHhhhhhhhcccccc---cchhhcceecCCCCceeEEEeeeccchhhhhhcccccCC Q lcl|NC_019525. 
23 IAGVIQRLC--NDLGFEIDVTTLTTLMKKIIEQKFFEI---SPADYMPIRVGEGAWSTMLTTYRSFSLAEDFATGIIDTG 97 (343) Q Consensus 23 ~~~~~~~~~--~d~~~~f~~~qL~~i~~~iye~~~~~l---~~~~~~pv~~~~~~w~~~~~~~~~vg~a~~ia~g~~~~g 97 (343) +....+... .+.+..+-. .+...|++.-.+.- ...+.+|+. .+ ...+...+. ..|.+++.+ T Consensus 1 ~g~~a~~~~~~~~~~~~iP~----~~~~~ii~~~~~~s~l~~~~~~~~~~--~~--~~~~~~~~~-~~a~~v~E~----- 66 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPI----NISEQIITGVKNGSAAMKLAKAVPMT--KP--EEEFTFMSG-VGAFWVDEA----- 66 (299) T ss_pred CCcCCCcccccCCCceecch----hHHHHHHHHHHhcchhhhhceeeecC--CC--cEEEEEEcC-CceeeeecC----- Confidence 222221111 122222211 12223333322222 223344442 22 222322222 234455422 Q ss_pred ccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeee Q lcl|NC_019525. 98 NSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLT 177 (343) Q Consensus 98 ~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN 177 (343) .++|.-+..+++.....+.++.-+.+|.+=|+.+. .++.+.-.....+++.+.+|+..++|+.+.+ -.|+++ T Consensus 67 ---~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~----~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~-~~gil~ 138 (299) T protein:vir:41 67 ---ERIQTSKPTFTKAKMRSKKMGVIIPTTKENLNYSV----TNFFSLMQAEIVEAFYKKFDQAVFTGVESPY-NWNILK 138 (299) T ss_pred ---ccccccccceeEEEEeeEEEEEeehhhHHHHhcCH----HHHHHHHHHHHHHHHHHHHHHHHhhcccCcc-cccccc Confidence 46888888999999999999999999886665332 3688888888889999999999999965543 358887 Q ss_pred cCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCCCcchhhhh-HHHhhcc Q lcl|NC_019525. 178 QTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTKQ-VLEDTFK 256 (343) Q Consensus 178 ~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~-~l~~n~~ 256 (343) ......+.+. ..+.+ .+||.+++..+...- ..+..++|-+..|..|.+-+ +..+.-+.. -...... 
T Consensus 139 ~~~~~~~~~~----~~~~~----~~~l~~~~~~l~~~~---~~~~~~v~n~~~~~~L~~lk--d~~G~~l~~~~~~~~~~ 205 (299) T protein:vir:41 139 SATDASNLVE----ETANK----YDDLNEAIGLIEAED---LEPNGIATIRKQRVKYRSTK--DGNGMPIFNTATSNGVD 205 (299) T ss_pred cccccceeec----ccccc----HHHHHHHHHhhhccc---CCcCEEEEcHHHHHHHHHhh--ccCCceeecCCcCCCCc Confidence 6543322221 11223 456677776654322 25678999999999997543 333332221 1111111 Q ss_pred hhcCCcceEEeechhhhhcccccCCCccEEEEEEcCc------ceEEEecCCchhhc----------chhhcCCceEEec Q lcl|NC_019525. 257 EITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNE------DSLRMDIPVDYTST----------LANSVNNFQFQNA 320 (343) Q Consensus 257 ~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~------~~v~~~iP~~~~~l----------~p~~~~~l~~~v~ 320 (343) . -.|.|+-+.+ .... +. ++-.+++-+-+. +.+.+++--+.... --++++ ...+- T Consensus 206 ~-l~G~PV~~~~-----~~~~-~~-~~~~~~~gdfs~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~r 275 (299) T protein:vir:41 206 D-VLGLPIAYTP-----KYTF-GD-KDISELVGDWNQAYYGILRGVEYEILTEATLTTVADETGKPLNLAERD--MAAIK 275 (299) T ss_pred e-ecceeeEEec-----ccCC-CC-CceEEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcC--cEEEE Confidence 1 1255543322 2221 11 112222222211 11222221110000 012332 23345 Q ss_pred eeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 321 AYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 321 ~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) ++.|+|+ .++.|++++-+-... T Consensus 276 ~~~~~d~-~v~~~~A~~~l~~~a 297 (299) T protein:vir:41 276 ATFEVGF-MVVKDEAFSAVQPKA 297 (299) T ss_pred EEEEecc-EEecccceEEEEecc Confidence 6778855 566699999988777 No 61 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=96.82 E-value=0.00013 Score=41.87 Aligned_cols=321 Identities=12% Similarity=0.052 Sum_probs=136.9 Q ss_pred CCceeeecCCc--------------------hHHHHHHHHhhhcc----------chhh-hhhhhhhhhhHHHHHHhhhh Q lcl|NC_019525. 
1 MKKFVIRNSKG--------------------EKILLNAQEAKIAG----------VIQR-LCNDLGFEIDVTTLTTLMKK 49 (343) Q Consensus 1 ~~~~~~~~~~~--------------------~~~~~~a~~~~~~~----------~~~~-~~~d~~~~f~~~qL~~i~~~ 49 (343) ..+-+.+...+ +.+.-......... ..+. ...+.+..+++.+ +.+-.. T Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lv~~-~~~~~~ 176 (477) T protein:vir:84 98 NEALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYRDLDRNGGTGGYAVPP-LWMMNR 176 (477) T ss_pred ccchhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhccccccCCCcceeecc-chhHHH Confidence 00000000000 00000000000000 0000 0011111111111 112223 Q ss_pred hhhcccccccchhhc---ceecCCCCceeEEEeee-ccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEe Q lcl|NC_019525. 50 IIEQKFFEISPADYM---PIRVGEGAWSTMLTTYR-SFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIG 125 (343) Q Consensus 50 iye~~~~~l~~~~~~---pv~~~~~~w~~~~~~~~-~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ 125 (343) |++...+...-++++ |+....+ .-.+-..+ ....+.+++.|.. ..+.+.|..+..++..+.+.+..+.-+. T Consensus 177 ii~~l~~~~~i~~~~~~~~~~~~~~--~~~ip~~~~~~~~a~~~~Eg~~---~~~~~~~~s~~~f~~i~~~~~k~~~~~~ 251 (477) T protein:vir:84 177 FIELARAGRTYANLCPTEPLPGGTS--SINIPKILTGTSTAIQAADNAA---LTAPSAHEVDLTDGFVQANVKTIAGQQG 251 (477) T ss_pred HHHHhhhcchHHHhhceeeecCCcc--eeEEEEEecCcceeeeeccCcc---cccccccccccceeeEEEeeeeEEeeeH Confidence 444322332223333 3322211 11111111 1111223322211 1124578888889999999999888887 Q ss_pred ecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeeccc-CCcccccCCHHHHHHHH Q lcl|NC_019525. 126 WTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTF-LTKSIKSMTPAELKVLC 204 (343) Q Consensus 126 ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~-~~~~w~~kT~~eIl~Di 204 (343) +|.+=|+.+. .+|.+--......++...+|...++|+-......|++|.+++...+.+ ...+|+. 
.+.+..+| T Consensus 252 iS~ell~ds~----~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~t~~~--~~~~~~~i 325 (477) T protein:vir:84 252 IAIQLLDQAA----VSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGITQVTATSAGSALEK--HQIIYQKI 325 (477) T ss_pred HHHHHHhccc----hhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeeccccccccccccccchhh--HHHHHHHH Confidence 7776666432 468888888888899999999999995333246799999987543332 1223322 23344444 Q ss_pred HHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCCCcchhhhh--------H--H----HhhcchhcCCcceEEeech Q lcl|NC_019525. 205 AGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTKQ--------V--L----EDTFKEITRNSSFEILPCV 270 (343) Q Consensus 205 n~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~--------~--l----~~n~~~~~~g~~l~I~~~~ 270 (343) .+++..+- +.+..-+..++|-|..|..|..-+ ++.+.-|.. + + .+.....-.|.|.-+.+- T Consensus 326 ~~~~~~~~--~~~~~~~~~~v~~~~~~~~l~~lk--d~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~pVv~s~~- 400 (477) T protein:vir:84 326 ADAIQRVH--TSRFLEPEVIVMHPRRWASFHAIF--AGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHGLPVVTDPT- 400 (477) T ss_pred HHHHhhcc--ccccCCccEEEEcHHHHHHHHHhh--ccCCCeeeecCcccccccccccccccccccchhcccceEecCc- Confidence 44444432 333334557888899898885432 333332221 0 0 000000113555533221 Q ss_pred hhhhcccccCCCccEEEEEEcCcceEEEecCCchhhcchhhcCC--ceEEeceeeeeccEEEEcCceEEeeec-----CC Q lcl|NC_019525. 271 YADKITAQVPAVAKRYALYNDNEDSLRMDIPVDYTSTLANSVNN--FQFQNAAYGQFTGVLAYRPKELLYLDI-----PV 343 (343) Q Consensus 271 ~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~~~l~p~~~~~--l~~~v~~~~r~GGv~v~yP~a~~Y~D~-----~~ 343 (343) +.. +.|.++....++|-+-.+++-++--+.....+-..... ..|.+ ++.+..+-+|+|+|++-+=. 
|- T Consensus 401 -~p~--~~~~~~d~~~i~~gd~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v--~~~~~~~~~r~~~afv~~t~~~~~~~~ 475 (477) T protein:vir:84 401 -LPT--TLGTGTDQDVIHVLRASDLALFESSVRMRALQETRAENLSVLLQV--YGYLAFTAARFPQSVVEIGGTALTAPT 475 (477) T ss_pred -ccc--cccccCCcceEEEEEeceEEEEeeceeEEeccccccccceeeeee--hhhhhhhhhccccceEEeecccccccc Confidence 110 11222222334454444444332222222222111111 22333 22344467788998873221 11 No 62 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=96.81 E-value=0.00032 Score=39.74 Aligned_cols=303 Identities=8% Similarity=-0.021 Sum_probs=132.8 Q ss_pred CCc-eeeecCC-----------c----hHHHHHHHHhhh-----c-cchhhh--hhhhhhhhhHHHHHHhhhhhhhcccc Q lcl|NC_019525. 1 MKK-FVIRNSK-----------G----EKILLNAQEAKI-----A-GVIQRL--CNDLGFEIDVTTLTTLMKKIIEQKFF 56 (343) Q Consensus 1 ~~~-~~~~~~~-----------~----~~~~~~a~~~~~-----~-~~~~~~--~~d~~~~f~~~qL~~i~~~iye~~~~ 56 (343) .++ .-....+ + +....++..... . ...... ..+.|. +.+. +.+...|.+...+ T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~-~~iP--~~~~~~ii~~~~~ 147 (415) T protein:vir:46 71 NQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGF-VVIP--EEIVTDILKLKEV 147 (415) T ss_pred cccccccchhhhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCc-cccc--HHHHHHHHHHHHh Confidence 000 0000000 0 000000000000 0 000000 112222 2222 2223334443333 Q ss_pred cccchh---hcceecCCCCceeEEEeeeccchhhhhhcccccCCcccccccccc-ccccceeeeeEEEEeeEeecHHHHH Q lcl|NC_019525. 57 EISPAD---YMPIRVGEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVD-TGVDAVSIKTYKWAKTIGWTLPELA 132 (343) Q Consensus 57 ~l~~~~---~~pv~~~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd-~~~~~~~~~v~~~g~~~~ys~~EL~ 132 (343) ...-+. 
.+|+.+ +.+...+......+.+.+++.| ..+|..+ ..++......+.++.-+.+|.+=|+ T Consensus 148 ~~~l~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~v~Eg--------~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ 217 (415) T protein:vir:46 148 EFNLDKYVTVKRVTN--GSGKYPVVRQSEVAALEKVEEL--------EENPELAVKPFFQLAYDINTHRGYFRISREAIE 217 (415) T ss_pred hhhhhhhcceeeccC--CceeEEEEEecCCcceeecccc--------cccccccccceeeEEeeeeeeEeeehhhHHHHh Confidence 333333 344432 2232222222223344444432 2466544 4688999999999998888876554 Q ss_pred HHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHH Q lcl|NC_019525. 133 EAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYR 212 (343) Q Consensus 133 ~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~ 212 (343) . +. .+|.+--.....+++...+|+.++.|+.......++......+.... .-...|.++|++-++.+.... T Consensus 218 d---s~-~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~~----~~~~~~~~~i~~~~~~~~~~~- 288 (415) T protein:vir:46 218 D---AK-VNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLE----VKKAKSLDDIKDAINLNVKPN- 288 (415) T ss_pred h---ch-HHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccceec----cccccchHHHHHHHHhhhhhc- Confidence 3 32 46788788888888888999999999644333333333222211111 112234455554444443221 Q ss_pred hcCCceecCCeEEeCHHHHHHHhccccCCCcchhhh-hHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEc Q lcl|NC_019525. 213 QGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTK-QVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYND 291 (343) Q Consensus 213 ~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~-~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~ 291 (343) ..+..++|-++.|..|..-+ ++.+.-+. 
.-+.+.....-.|.|+.+.+-.+ .+.+|+..+++-+- T Consensus 289 ------~~~~~~v~n~~~~~~L~~lk--d~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~------~~~~~~~~~~~gd~ 354 (415) T protein:vir:46 289 ------YEHNVAIVSQTMFAKLDKMK--DKLGNYLIQPDVKEKTQQRLLGAKIEILPDEV------LGQKGNNTLIIGNL 354 (415) T ss_pred ------cCCCEEEEcHHHHHHHHHhh--ccCCCeeeccCcCCCCCccccceeeEEecccc------ccCCCccEEEEEeh Confidence 24678999999999996533 43333222 11112222223466665544221 11223333333332 Q ss_pred CcceEEEe-cCCchhhcchhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 292 NEDSLRMD-IPVDYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 292 d~~~v~~~-iP~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) +.-++-+. -.+.+... ..... ....-++.|++ +.+..|.+++++++-= T Consensus 355 ~~~~~~~~~~~~~v~~~-~~~~~--~~~~~~~~r~d-~~v~~~~a~~~~~~~~ 403 (415) T protein:vir:46 355 KDAIVLFDRSQYQASWT-DYMHF--GECLMIAVRQD-CRILDYKSAIVIEYDD 403 (415) T ss_pred hccEEEEeecceEEEee-ccccC--ceEEEEEEEec-cEEeccccEEEEEeec Confidence 21122111 11111111 11121 12223567875 5667899999988754 No 63 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=96.81 E-value=0.00032 Score=39.74 Aligned_cols=303 Identities=8% Similarity=-0.021 Sum_probs=132.8 Q ss_pred CCc-eeeecCC-----------c----hHHHHHHHHhhh-----c-cchhhh--hhhhhhhhhHHHHHHhhhhhhhcccc Q lcl|NC_019525. 1 MKK-FVIRNSK-----------G----EKILLNAQEAKI-----A-GVIQRL--CNDLGFEIDVTTLTTLMKKIIEQKFF 56 (343) Q Consensus 1 ~~~-~~~~~~~-----------~----~~~~~~a~~~~~-----~-~~~~~~--~~d~~~~f~~~qL~~i~~~iye~~~~ 56 (343) .++ .-....+ + +....++..... . ...... ..+.|. +.+. 
+.+...|.+...+ T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~-~~iP--~~~~~~ii~~~~~ 147 (415) T protein:vir:47 71 NQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGF-VVIP--EEIVTDILKLKEV 147 (415) T ss_pred cccccccchhhhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCc-cccc--HHHHHHHHHHHHh Confidence 000 0000000 0 000000000000 0 000000 112222 2222 2223334443333 Q ss_pred cccchh---hcceecCCCCceeEEEeeeccchhhhhhcccccCCcccccccccc-ccccceeeeeEEEEeeEeecHHHHH Q lcl|NC_019525. 57 EISPAD---YMPIRVGEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVD-TGVDAVSIKTYKWAKTIGWTLPELA 132 (343) Q Consensus 57 ~l~~~~---~~pv~~~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd-~~~~~~~~~v~~~g~~~~ys~~EL~ 132 (343) ...-+. .+|+.+ +.+...+......+.+.+++.| ..+|..+ ..++......+.++.-+.+|.+=|+ T Consensus 148 ~~~l~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~v~Eg--------~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ 217 (415) T protein:vir:47 148 EFNLDKYVTVKRVTN--GSGKYPVVRQSEVAALEKVEEL--------EENPELAVKPFFQLAYDINTHRGYFRISREAIE 217 (415) T ss_pred hhhhhhhcceeeccC--CceeEEEEEecCCcceeecccc--------cccccccccceeeEEeeeeeeEeeehhhHHHHh Confidence 333333 344432 2232222222223344444432 2466544 4688999999999998888876554 Q ss_pred HHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHH Q lcl|NC_019525. 133 EAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYR 212 (343) Q Consensus 133 ~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~ 212 (343) . +. .+|.+--.....+++...+|+.++.|+.......++......+.... .-...|.++|++-++.+.... 
T Consensus 218 d---s~-~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~~----~~~~~~~~~i~~~~~~~~~~~- 288 (415) T protein:vir:47 218 D---AK-VNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKLE----VKKAKSLDDIKDAINLNVKPN- 288 (415) T ss_pred h---ch-HHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccceec----cccccchHHHHHHHHhhhhhc- Confidence 3 32 46788788888888888999999999644333333333222211111 112234455554444443221 Q ss_pred hcCCceecCCeEEeCHHHHHHHhccccCCCcchhhh-hHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEc Q lcl|NC_019525. 213 QGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTK-QVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYND 291 (343) Q Consensus 213 ~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~-~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~ 291 (343) ..+..++|-++.|..|..-+ ++.+.-+. .-+.+.....-.|.|+.+.+-.+ .+.+|+..+++-+- T Consensus 289 ------~~~~~~v~n~~~~~~L~~lk--d~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~------~~~~~~~~~~~gd~ 354 (415) T protein:vir:47 289 ------YEHNVAIVSQTMFAKLDKMK--DKLGNYLIQPDVKEKTQQRLLGAKIEILPDEV------LGQKGNNTLIIGNL 354 (415) T ss_pred ------cCCCEEEEcHHHHHHHHHhh--ccCCCeeeccCcCCCCCccccceeeEEecccc------ccCCCccEEEEEeh Confidence 24678999999999996533 43333222 11112222223466665544221 11223333333332 Q ss_pred CcceEEEe-cCCchhhcchhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 292 NEDSLRMD-IPVDYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 292 d~~~v~~~-iP~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) +.-++-+. -.+.+... ..... 
....-++.|++ +.+..|.+++++++-= T Consensus 355 ~~~~~~~~~~~~~v~~~-~~~~~--~~~~~~~~r~d-~~v~~~~a~~~~~~~~ 403 (415) T protein:vir:47 355 KDAIVLFDRSQYQASWT-DYMHF--GECLMIAVRQD-CRILDYKSAIVIEYDD 403 (415) T ss_pred hccEEEEeecceEEEee-ccccC--ceEEEEEEEec-cEEeccccEEEEEeec Confidence 21122111 11111111 11121 12223567875 5667899999988754 No 64 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=96.81 E-value=0.00032 Score=39.72 Aligned_cols=303 Identities=8% Similarity=-0.047 Sum_probs=133.0 Q ss_pred CCceeeecCCchHHHHHHHHhhhccchhhh--hhhhhhhhhHHHHHHhhhhhhhcccccccc---hhhcceecCCCCcee Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAKIAGVIQRL--CNDLGFEIDVTTLTTLMKKIIEQKFFEISP---ADYMPIRVGEGAWST 75 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~--~~d~~~~f~~~qL~~i~~~iye~~~~~l~~---~~~~pv~~~~~~w~~ 75 (343) +..+-..+..++....-............. ..+.| .+.+.+ .+...|++...+...- ...+|+.+ +.+.. T Consensus 93 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-g~~iP~--~~~~~ii~~~~~~~~l~~~~~~~~~~~--~~~~~ 167 (415) T protein:vir:81 93 GISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSG-FVVIPE--EIVTDILKLKEVEFNLDKYVTVKRVTN--GSGKY 167 (415) T ss_pred hhhhhhhhhHHHHHHHHHHHHhhhhhhhhcccccccc-ccccch--HHHHHHHHHHHhhhhhhhheeeeeccC--CceeE Confidence 000000000000000000000000000000 01112 233332 2333444443333333 33444422 22332 Q ss_pred EEEeeeccchhhhhhcccccCCcccccccccc-ccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHH Q lcl|NC_019525. 76 MLTTYRSFSLAEDFATGIIDTGNSNGKLAAVD-TGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNW 154 (343) Q Consensus 76 ~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd-~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~ 154 (343) .+........+.+++.+ .++|..+ ..++..+..++.++.-+.+|.+=|+. +. 
.+|.+--.....+++ T Consensus 168 ~~~~~~~~~~~~~v~E~--------~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~d---s~-~~l~~~i~~~l~~~~ 235 (415) T protein:vir:81 168 PVVRQSEVAALEKVEEL--------EENPELAVKPFFQLAYDINTHRGYFRISREAIED---AK-VNVLQELKLWMARTI 235 (415) T ss_pred EEEeecCCccceeeccc--------cccCcccccceeeEEeeeeeeEeeehhhHHHHhh---ch-HHHHHHHHHHHHHHH Confidence 22222233344444322 3566554 47889999999999888877664443 22 467777777777888 Q ss_pred HhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHH Q lcl|NC_019525. 155 DLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGL 234 (343) Q Consensus 155 ~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L 234 (343) .+.+|+..+.|+....+..++.+....+...+. -+..+.+.|+ +++..+... + ..+..++|-++.|..| T Consensus 236 ~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~----~~~~~~~~i~----~~~~~~~~~--~-~~~~~~v~n~~~~~~l 304 (415) T protein:vir:81 236 AATRNKAIIDVITKGSTGSTSSGFEKEGKKLEV----KKAKSLDDIK----DAINLNVKP--N-YEHNVAIVSQTMFAKL 304 (415) T ss_pred HHHHHHHHhhccccCcccccccccccccccccc----ccccchhHHH----HHHHhhhhh--c-cCCCEEEEcHHHHHHH Confidence 888899999886443333344433322221111 1223444444 444443221 1 2467899999999999 Q ss_pred hccccCCCcchhhh-hHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEe-cCCchhhcchhhc Q lcl|NC_019525. 235 AGAASADFPIKSTK-QVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMD-IPVDYTSTLANSV 312 (343) Q Consensus 235 ~~~~~s~~~~~tl~-~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~-iP~~~~~l~p~~~ 312 (343) ..- -++.+.-+. .=+.+.....-.|.|+.+.+... .+.+++..+++-+-+.-++-+. ..+.+... +... 
T Consensus 305 ~~l--kd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~------~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~-~~~~ 375 (415) T protein:vir:81 305 DKM--KDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEV------LGQKGNNTLIIGNLKDAIVLFDRSQYQASWT-DYMH 375 (415) T ss_pred HHh--hccCCceeeccCcCCCCCceecceeeEEecccc------cCCCCccEEEEEehhccEEEEeecceEEEEe-cccc Confidence 643 343333221 11111122223466765554321 1122333333322221121111 11111111 2222 Q ss_pred CCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 313 NNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 313 ~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) . ....-++.|++ +.++.|.+++++++.= T Consensus 376 ~--~~~~~~~~r~d-~~v~~~~a~~~~~~~~ 403 (415) T protein:vir:81 376 F--GECLMIAVRQD-CRILDYKSAIVIEYDD 403 (415) T ss_pred C--ceEEEEEEEec-cEEeccccEEEEEEec Confidence 1 22233567885 5566799999998876 No 65 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=96.81 E-value=0.00032 Score=39.72 Aligned_cols=303 Identities=8% Similarity=-0.047 Sum_probs=133.0 Q ss_pred CCceeeecCCchHHHHHHHHhhhccchhhh--hhhhhhhhhHHHHHHhhhhhhhcccccccc---hhhcceecCCCCcee Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAKIAGVIQRL--CNDLGFEIDVTTLTTLMKKIIEQKFFEISP---ADYMPIRVGEGAWST 75 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~--~~d~~~~f~~~qL~~i~~~iye~~~~~l~~---~~~~pv~~~~~~w~~ 75 (343) +..+-..+..++....-............. ..+.| .+.+.+ .+...|++...+...- ...+|+.+ +.+.. 
T Consensus 93 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-g~~iP~--~~~~~ii~~~~~~~~l~~~~~~~~~~~--~~~~~ 167 (415) T protein:vir:98 93 GISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSG-FVVIPE--EIVTDILKLKEVEFNLDKYVTVKRVTN--GSGKY 167 (415) T ss_pred hhhhhhhhhHHHHHHHHHHHHhhhhhhhhcccccccc-ccccch--HHHHHHHHHHHhhhhhhhheeeeeccC--CceeE Confidence 000000000000000000000000000000 01112 233332 2333444443333333 33444422 22332 Q ss_pred EEEeeeccchhhhhhcccccCCcccccccccc-ccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHH Q lcl|NC_019525. 76 MLTTYRSFSLAEDFATGIIDTGNSNGKLAAVD-TGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNW 154 (343) Q Consensus 76 ~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd-~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~ 154 (343) .+........+.+++.+ .++|..+ ..++..+..++.++.-+.+|.+=|+. +. .+|.+--.....+++ T Consensus 168 ~~~~~~~~~~~~~v~E~--------~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~d---s~-~~l~~~i~~~l~~~~ 235 (415) T protein:vir:98 168 PVVRQSEVAALEKVEEL--------EENPELAVKPFFQLAYDINTHRGYFRISREAIED---AK-VNVLQELKLWMARTI 235 (415) T ss_pred EEEeecCCccceeeccc--------cccCcccccceeeEEeeeeeeEeeehhhHHHHhh---ch-HHHHHHHHHHHHHHH Confidence 22222233344444322 3566554 47889999999999888877664443 22 467777777777888 Q ss_pred HhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHH Q lcl|NC_019525. 155 DLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGL 234 (343) Q Consensus 155 ~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L 234 (343) .+.+|+..+.|+....+..++.+....+...+. -+..+.+.|+ +++..+... 
+ ..+..++|-++.|..| T Consensus 236 ~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~----~~~~~~~~i~----~~~~~~~~~--~-~~~~~~v~n~~~~~~l 304 (415) T protein:vir:98 236 AATRNKAIIDVITKGSTGSTSSGFEKEGKKLEV----KKAKSLDDIK----DAINLNVKP--N-YEHNVAIVSQTMFAKL 304 (415) T ss_pred HHHHHHHHhhccccCcccccccccccccccccc----ccccchhHHH----HHHHhhhhh--c-cCCCEEEEcHHHHHHH Confidence 888899999886443333344433322221111 1223444444 444443221 1 2467899999999999 Q ss_pred hccccCCCcchhhh-hHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEe-cCCchhhcchhhc Q lcl|NC_019525. 235 AGAASADFPIKSTK-QVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMD-IPVDYTSTLANSV 312 (343) Q Consensus 235 ~~~~~s~~~~~tl~-~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~-iP~~~~~l~p~~~ 312 (343) ..- -++.+.-+. .=+.+.....-.|.|+.+.+... .+.+++..+++-+-+.-++-+. ..+.+... +... T Consensus 305 ~~l--kd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~------~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~-~~~~ 375 (415) T protein:vir:98 305 DKM--KDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEV------LGQKGNNTLIIGNLKDAIVLFDRSQYQASWT-DYMH 375 (415) T ss_pred HHh--hccCCceeeccCcCCCCCceecceeeEEecccc------cCCCCccEEEEEehhccEEEEeecceEEEEe-cccc Confidence 643 343333221 11111122223466765554321 1122333333322221121111 11111111 2222 Q ss_pred CCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 313 NNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 313 ~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) . 
....-++.|++ +.++.|.+++++++.= T Consensus 376 ~--~~~~~~~~r~d-~~v~~~~a~~~~~~~~ 403 (415) T protein:vir:98 376 F--GECLMIAVRQD-CRILDYKSAIVIEYDD 403 (415) T ss_pred C--ceEEEEEEEec-cEEeccccEEEEEEec Confidence 1 22233567885 5566799999998876 No 66 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=96.81 E-value=0.00032 Score=39.72 Aligned_cols=303 Identities=8% Similarity=-0.047 Sum_probs=133.0 Q ss_pred CCceeeecCCchHHHHHHHHhhhccchhhh--hhhhhhhhhHHHHHHhhhhhhhcccccccc---hhhcceecCCCCcee Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAKIAGVIQRL--CNDLGFEIDVTTLTTLMKKIIEQKFFEISP---ADYMPIRVGEGAWST 75 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~--~~d~~~~f~~~qL~~i~~~iye~~~~~l~~---~~~~pv~~~~~~w~~ 75 (343) +..+-..+..++....-............. ..+.| .+.+.+ .+...|++...+...- ...+|+.+ +.+.. T Consensus 93 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-g~~iP~--~~~~~ii~~~~~~~~l~~~~~~~~~~~--~~~~~ 167 (415) T protein:vir:79 93 GISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSG-FVVIPE--EIVTDILKLKEVEFNLDKYVTVKRVTN--GSGKY 167 (415) T ss_pred hhhhhhhhhHHHHHHHHHHHHhhhhhhhhcccccccc-ccccch--HHHHHHHHHHHhhhhhhhheeeeeccC--CceeE Confidence 000000000000000000000000000000 01112 233332 2333444443333333 33444422 22332 Q ss_pred EEEeeeccchhhhhhcccccCCcccccccccc-ccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHH Q lcl|NC_019525. 76 MLTTYRSFSLAEDFATGIIDTGNSNGKLAAVD-TGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNW 154 (343) Q Consensus 76 ~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd-~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~ 154 (343) .+........+.+++.+ .++|..+ ..++..+..++.++.-+.+|.+=|+. +. 
.+|.+--.....+++ T Consensus 168 ~~~~~~~~~~~~~v~E~--------~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~d---s~-~~l~~~i~~~l~~~~ 235 (415) T protein:vir:79 168 PVVRQSEVAALEKVEEL--------EENPELAVKPFFQLAYDINTHRGYFRISREAIED---AK-VNVLQELKLWMARTI 235 (415) T ss_pred EEEeecCCccceeeccc--------cccCcccccceeeEEeeeeeeEeeehhhHHHHhh---ch-HHHHHHHHHHHHHHH Confidence 22222233344444322 3566554 47889999999999888877664443 22 467777777777888 Q ss_pred HhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHH Q lcl|NC_019525. 155 DLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGL 234 (343) Q Consensus 155 ~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L 234 (343) .+.+|+..+.|+....+..++.+....+...+. -+..+.+.|+ +++..+... + ..+..++|-++.|..| T Consensus 236 ~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~----~~~~~~~~i~----~~~~~~~~~--~-~~~~~~v~n~~~~~~l 304 (415) T protein:vir:79 236 AATRNKAIIDVITKGSTGSTSSGFEKEGKKLEV----KKAKSLDDIK----DAINLNVKP--N-YEHNVAIVSQTMFAKL 304 (415) T ss_pred HHHHHHHHhhccccCcccccccccccccccccc----ccccchhHHH----HHHHhhhhh--c-cCCCEEEEcHHHHHHH Confidence 888899999886443333344433322221111 1223444444 444443221 1 2467899999999999 Q ss_pred hccccCCCcchhhh-hHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEe-cCCchhhcchhhc Q lcl|NC_019525. 235 AGAASADFPIKSTK-QVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMD-IPVDYTSTLANSV 312 (343) Q Consensus 235 ~~~~~s~~~~~tl~-~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~-iP~~~~~l~p~~~ 312 (343) ..- -++.+.-+. .=+.+.....-.|.|+.+.+... .+.+++..+++-+-+.-++-+. ..+.+... +... 
T Consensus 305 ~~l--kd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~------~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~-~~~~ 375 (415) T protein:vir:79 305 DKM--KDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEV------LGQKGNNTLIIGNLKDAIVLFDRSQYQASWT-DYMH 375 (415) T ss_pred HHh--hccCCceeeccCcCCCCCceecceeeEEecccc------cCCCCccEEEEEehhccEEEEeecceEEEEe-cccc Confidence 643 343333221 11111122223466765554321 1122333333322221121111 11111111 2222 Q ss_pred CCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 313 NNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 313 ~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) . ....-++.|++ +.++.|.+++++++.= T Consensus 376 ~--~~~~~~~~r~d-~~v~~~~a~~~~~~~~ 403 (415) T protein:vir:79 376 F--GECLMIAVRQD-CRILDYKSAIVIEYDD 403 (415) T ss_pred C--ceEEEEEEEec-cEEeccccEEEEEEec Confidence 1 22233567885 5566799999998876 No 67 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=96.78 E-value=0.00035 Score=39.56 Aligned_cols=302 Identities=9% Similarity=-0.009 Sum_probs=139.4 Q ss_pred eeeecCCc-hHHHHHHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccc---cchhhcceecCCCCceeEEEe Q lcl|NC_019525. 4 FVIRNSKG-EKILLNAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEI---SPADYMPIRVGEGAWSTMLTT 79 (343) Q Consensus 4 ~~~~~~~~-~~~~~~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l---~~~~~~pv~~~~~~w~~~~~~ 79 (343) |..-.+++ |++-.+.+++.. ......|. ....++ -..|++.-.++- ...+.+|+. .+. ..+.. 
T Consensus 1 ~~~~~~r~~~~~~~~e~~a~~-----~~~~~~g~-~ip~~~---~~~ii~~~~~~s~i~~~~~~~~~~--~~~--~~~p~ 67 (326) T protein:vir:42 1 MAVNPDRTTPFLGVNDPKVAQ-----TGDSMFEG-YLEPEQ---AQDYFAEAEKISIVQQFAQKIPMG--TTG--QKIPH 67 (326) T ss_pred CCCCccchhhhcCcchhhhee-----ccccCCcc-eechhh---HHHHHHHHHhcchhhhhcceeecc--CCc--eEEEE Confidence 33333332 443332222211 11111122 222222 122333322222 223334432 222 22323 Q ss_pred eeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhh Q lcl|NC_019525. 80 YRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQ 159 (343) Q Consensus 80 ~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n 159 (343) .+.-+.|.+++.| ..+|..+..+++.....+.++.-+.+|.+-|+.+. .++.+--.....+++...+| T Consensus 68 ~~~~~~a~~v~Eg--------~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~s~----~~~~~~i~~~l~~a~~~~~d 135 (326) T protein:vir:42 68 WTGDVSASWIGEG--------DMKPITKGNMTSQTIAPHKIATIFVASAETVRANP----ANYLGTMRTKVATAFAMAFD 135 (326) T ss_pred EeCCcceEEecCC--------ccccccccceeEEEEeeEEEEEeehhhHHHHhcCH----HHHHHHHHHHHHHHHHHHHH Confidence 3333444454322 46888899999999999999999988886665432 46788888888888889999 Q ss_pred heEeeeeccccceeeeeecCCce-eecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccc Q lcl|NC_019525. 160 EIAFVGMKDNANVKGLLTQTGNV-VNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAA 238 (343) Q Consensus 160 ~v~~~G~~~~~g~~GLlN~p~v~-~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~ 238 (343) +..++|+.+. .-.|++|.+... .....++...+..+..++.. 
...+..+ .........++|-+..+..|.+-+ T Consensus 136 ~a~l~G~gs~-~p~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~---~~~~~~~a~~v~n~~~~~~L~~lk 209 (326) T protein:vir:42 136 NAAINGTDSP-FPTFLAQTTKEVSLVDPDGTGSNADLTVYDAVA--VNALSLL---VNAGKKWTHTLLDDITEPILNGAK 209 (326) T ss_pred HHhhcccCCC-ccccccccccccceeecccccccccchhHHHHH--HHHHhhh---hhhccCccEEEEeHHHHHHHHHhh Confidence 9999996543 345777766432 22222233334444444321 1122221 111223557899999999997533 Q ss_pred cCCCcchhhh-hHHHhhcchhcCCcceEEeechhhhhcccc----cCCCccEEEEEEcCcceEEEecCCchh-------- Q lcl|NC_019525. 239 SADFPIKSTK-QVLEDTFKEITRNSSFEILPCVYADKITAQ----VPAVAKRYALYNDNEDSLRMDIPVDYT-------- 305 (343) Q Consensus 239 ~s~~~~~tl~-~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~----g~gg~drmv~Y~~d~~~v~~~iP~~~~-------- 305 (343) ++.+.-+. .-+.+.......+.++.=.|+......... -.|.-.++++.. .+-+.+.+-.+.. T Consensus 210 --d~~G~~l~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~Gd~s~~~~~~--~~~~~v~~~~e~~~~~~~~~~ 285 (326) T protein:vir:42 210 --DKSGRPLFIESTYTEENSPFRLGRIVARPTILSDHVASGTVVGYQGDFRQLVWGQ--VGGLSFDVTDQATLNLGTPQA 285 (326) T ss_pred --ccCCceeeccccccCccccccCceeeeeeEEEcCCCCCCceEEEEeecceEEEEE--ecceEEEEeecceeeeccccc Confidence 33232221 111111111001112222222211111110 001112232222 2333333211110 Q ss_pred --hcchhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 306 --STLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 306 --~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) ...-++++. 
..+-+..|+ +..+..|.+++++-.-+ T Consensus 286 ~~~~~~~~~d~--~~~r~~~~~-d~~v~~~~a~~~l~~~~ 322 (326) T protein:vir:42 286 PNFVSLWQHNL--VAVRVEAEY-AFHCNDKDAFVKLTNVD 322 (326) T ss_pred ccchhhhhcCc--EEEEEEEEe-ccEEecccceEEEeecc Confidence 001123322 333457777 55778999998876655 No 68 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=96.77 E-value=0.00014 Score=41.79 Aligned_cols=308 Identities=11% Similarity=0.040 Sum_probs=139.3 Q ss_pred CCceeee--cCCch-----HHHHHHHHhhhccc-----hhhh--hhhhhhhhhHHHHHHhhhhhhhcccccccchhhcce Q lcl|NC_019525. 1 MKKFVIR--NSKGE-----KILLNAQEAKIAGV-----IQRL--CNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPI 66 (343) Q Consensus 1 ~~~~~~~--~~~~~-----~~~~~a~~~~~~~~-----~~~~--~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv 66 (343) +.+.+.. ..+++ ...-.+...-+... .++. ..+.| .|++.+ .+...|++...+...-+.+..+ T Consensus 102 ~~~~~~~~~~~~~~~~~~~~e~r~a~~~~l~~~~~~~e~~a~~~~t~~G-G~lvP~--~~~~~Ii~~l~~~~~i~~~~~~ 178 (434) T protein:vir:62 102 ISASIAAALSTKGHRTNKETEIRSVFANYIVGNIDEKEARALGLVTGNG-SVTIPD--FLSKEIITYAQEENFLRRLGTG 178 (434) T ss_pred HHHHHHhhhhhccccchHHHHHHHHHHHHhccccchhhhhhhccccccc-ceecch--hhHHHHHHhhhhhhhhhhhcce Confidence 1111100 00010 01111110000000 0000 11112 244443 2444455543333333333332 Q ss_pred ecCCCCceeEEEeeeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHH Q lcl|NC_019525. 67 RVGEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITAL 146 (343) Q Consensus 67 ~~~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k 146 (343) ....+. ..+.....-+.+.++.. 
+...+++|.-+..+++.....+..+.-+.+|.+=|+.+ + ++|.+-- T Consensus 179 ~~~~~~--~~~p~~~~~~~a~~~~~-----~~e~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds---~-~~l~~~i 247 (434) T protein:vir:62 179 VKTKEN--IKYPVLVKKAEAQGHKN-----ERTNNEMPETDIEFDEIELSPTEFDALATVTKKLLART---G-LPIEQIV 247 (434) T ss_pred eccCCc--eEEEEEecCCcccceec-----ccccccccccccceeeEEeeheeeEeehhhHHHHHhcc---h-HHHHHHH Confidence 111111 12222222222322211 12234688888899999999999999888887766543 2 4688888 Q ss_pred HHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEe Q lcl|NC_019525. 147 EKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTI 226 (343) Q Consensus 147 ~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~l 226 (343) ......++...++...++|+.......|+++.+.++..+. .+.+.+.|++.+..+..... ..-.++| T Consensus 248 ~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~~~~~~~------~~~~~d~l~~l~~~l~~~~~-------~~a~~v~ 314 (434) T protein:vir:62 248 MDELKKAYVRKETQYMVNGDEANNINDGALAKKAVEFKTD------EKNLYDALVKMKNTPVKEVR-------KKARWVL 314 (434) T ss_pred HHHHHHHHHHHHHHHHhccCCCCccccceeeccccccccc------ccchhhHHHHHHhhcchhhh-------cCCEEEE Confidence 8888888888999999999654444568888877654321 12345555544444332221 1124688 Q ss_pred CHHHHHHHhccccCCCcchhhhh-HHH--hhcchhcCCcceEEeechhhhhcccccCCCccEEEEE-EcCcceEEEec-C Q lcl|NC_019525. 227 PESDYTGLAGAASADFPIKSTKQ-VLE--DTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALY-NDNEDSLRMDI-P 301 (343) Q Consensus 227 p~~~~~~L~~~~~s~~~~~tl~~-~l~--~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y-~~d~~~v~~~i-P 301 (343) -+..|..|..- -++.+.-|+. ... .-.+..-.|.|+.+.. .... +.+++...++| +-+.=++.... 
+ T Consensus 315 n~~~~~~L~~l--kd~~G~~l~~~~~~~~~g~~~tl~G~pV~~~~-----~~~~-~~~~~~~~i~~Gdfs~~~i~~~~g~ 386 (434) T protein:vir:62 315 NTAALTKIETM--KTDDGFPLLRPFNQAEGGIGYTLLGFPVEEED-----AIDI-PDSPDTPVFYFGDFSKFYIQDVIGS 386 (434) T ss_pred cHHHHHHHHHh--hccCCCEeeccCCCccCCCCceecceeeEEec-----CccC-ccCCCceEEEEeeccceEEEEeece Confidence 88889988643 3433333321 111 0111122466654432 2221 12222222333 22211111111 2 Q ss_pred CchhhcchhhcCCceEEeceeeeeccEEEEcCceEEeee----cCC Q lcl|NC_019525. 302 VDYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLD----IPV 343 (343) Q Consensus 302 ~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D----~~~ 343 (343) +.+..+.-........-.-++.|+.|-.++.|.+..-+= .|. T Consensus 387 ~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~~ 432 (434) T protein:vir:62 387 LEVQKLVELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAPT 432 (434) T ss_pred eEEEeehhhhcccCceEEEEEeeecceeecCcccceEEEEEeccCC Confidence 333332111111122224558899888888888665432 233 No 69 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=96.69 E-value=0.00041 Score=39.18 Aligned_cols=302 Identities=13% Similarity=0.069 Sum_probs=133.2 Q ss_pred CCceeeecCCchHHHHHHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcc---cccccchhhcceecCCCCceeEE Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQK---FFEISPADYMPIRVGEGAWSTML 77 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~---~~~l~~~~~~pv~~~~~~w~~~~ 77 (343) ++.+-....+|.+.+.+..+.-...........+| .+++. +.+...|++.. .+=+...+.+|+.. 
....+ T Consensus 58 ~~~~~~~~~~~~~~l~~~~r~~~~~~~~~~~~~~g-g~lvP--~~~~~~I~~~~~~~s~i~~~~~~~~~~~----~~~~i 130 (390) T protein:vir:40 58 MNDNNVLASRGANALTSDESKYYNEVIAGNGFAGV-TALLP--PTVFERVFEDLTVEHPLLSKINFVNTTA----TTEWI 130 (390) T ss_pred HHHHHHHHhcCchhccHHHHHHHHHHHhccCcccC-ccccc--HHHHHHHHHHHHhhhhhhhhceeeecCC----ceeEE Confidence 00000000111111111111101111111111112 22222 11222233321 12233344444321 22223 Q ss_pred EeeeccchhhhhhcccccCCccccccc-cccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHh Q lcl|NC_019525. 78 TTYRSFSLAEDFATGIIDTGNSNGKLA-AVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDL 156 (343) Q Consensus 78 ~~~~~vg~a~~ia~g~~~~g~~a~Dip-~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~ 156 (343) -.....+.|.++..+ +.+| .-+..+++.....+..+.-+..|.+=|+.+. .+|.+--.....+++.. T Consensus 131 ~~~~~~~~a~~~~E~--------~~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~ds~----~~l~~~i~~~la~~i~~ 198 (390) T protein:vir:40 131 ISVGDVATAWWGPLC--------AEIKEVLDNGFDKIQTGMYKLSAYIPVCNAMLDLGP----SWLDQYVRTILGEAMAL 198 (390) T ss_pred EEEcCCcceeeeccc--------cccCccccccceeeEeeeeeEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHH Confidence 333334455554432 2454 3466788888889998888888866666442 46888888888888999 Q ss_pred hhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHH-HHHHh Q lcl|NC_019525. 157 GIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESD-YTGLA 235 (343) Q Consensus 157 ~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~-~~~L~ 235 (343) .+|+-.+.|+... .-.|++|.+...........+..+.|.+.+.+.+..++..+.........--.++|-+.. +.+|. 
T Consensus 199 ~~~~a~l~G~G~~-~P~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~ 277 (390) T protein:vir:40 199 GLEAGIVNGSGKD-QPIGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIY 277 (390) T ss_pred HHHhhhhcccCCC-ccceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHH Confidence 9999999995433 245999987654333322333445566666666666666554444322222234554443 34442 Q ss_pred c-cccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecCCchhhcc--hhhc Q lcl|NC_019525. 236 G-AASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPVDYTSTL--ANSV 312 (343) Q Consensus 236 ~-~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~~~l~--p~~~ 312 (343) . +..-+..+.-+. ... ..|.|+-+ ...+.. +..-.|.-.+.+++++ .-+++. ... -..+ T Consensus 278 ~~~~~~d~~G~~v~----~~~---~~g~pvv~--~~~~p~-~~i~~Gd~s~~~i~~~--~~~~v~------~~~~~~f~~ 339 (390) T protein:vir:40 278 AATSYMTPQGVWVT----GIL---PVPLEIVQ--SVAVPV-GKAVAGRAKDYFMGIG--SEQVIR------TSTEYRLLD 339 (390) T ss_pred HHhhccCCCCcccc----ccC---CCceeEEE--cCCCCC-CcEEEEeeceEEEEee--cceEEE------ecchhhhhc Confidence 1 112222222221 111 12444422 222211 1111111223333332 222222 111 0112 Q ss_pred CCceEEeceeeeeccEEEEcCceEEeee---------cCC Q lcl|NC_019525. 313 NNFQFQNAAYGQFTGVLAYRPKELLYLD---------IPV 343 (343) Q Consensus 313 ~~l~~~v~~~~r~GGv~v~yP~a~~Y~D---------~~~ 343 (343) +...+ -.+.|++|..+ .|++++.+. 
+|+ T Consensus 340 ~~~~~--r~~~r~dg~v~-~~~A~~~l~~~~~~~~~~~~~ 376 (390) T protein:vir:40 340 DETLY--YAKQYANGRPK-DNSSFLVFDITGLEGSPAIDV 376 (390) T ss_pred CcEEE--EEEEEeCCEEe-cccceEEEEeeccCCCCCCCc Confidence 22233 34778887654 599999884 333 No 70 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=96.66 E-value=0.00043 Score=39.03 Aligned_cols=306 Identities=9% Similarity=0.003 Sum_probs=138.9 Q ss_pred CCceeeecCCchHHHHHHHHh-----hhccchhh-hhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCce Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEA-----KIAGVIQR-LCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWS 74 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~-----~~~~~~~~-~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~ 74 (343) +..--+.... +...-++... .....+.. ...+.| +++. +.+...|++...+.-..+++.++.+... .. T Consensus 100 ~~~~~~~~~~-~~~~~~af~~~l~~~e~~~al~~~t~~~gG--~lvP--~~~~~~ii~~~~~~s~l~~l~~~~~~~~-~~ 173 (425) T protein:vir:10 100 MGANGVKPLR-DPEYTEAFKAHVKRGDVQAALNKGEDSEGG--YLTP--IEWDRTITNKLVLISPMRQLCRVQPVSK-AG 173 (425) T ss_pred cccccccccc-cHHHHHHHHHHhhhhhhHHHhhcCcCCCCc--eecc--HhHHHHHHHHHHhhhhhhhhceeeeccC-Cc Confidence 0000000000 0111111100 00001111 112222 2333 2233445554333333334333321111 11 Q ss_pred eEEEeeeccchhhhhhcccccCCccccccccccc-cccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHH Q lcl|NC_019525. 75 TMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVDT-GVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKN 153 (343) Q Consensus 75 ~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~-~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a 153 (343) ..+-....-..+.+++ .. ..+|..+. .+.+.....+.++.-+.+|.+=|+.+. 
.+|.+.-......+ T Consensus 174 ~~~~~~~~~~~a~wv~-------E~-~~~~~~~~~~f~~v~~~~~k~~~~i~iS~ell~ds~----~~l~~~i~~~la~a 241 (425) T protein:vir:10 174 FSKLFNMGGTTSGWVG-------EA-SQRPQTNAATFQPLSFASGEIYANPAATQQILDDAE----IDLESWLATEVQTE 241 (425) T ss_pred eEEEEEcCCcceeeec-------cc-cccccccccccceeeeeheeeEeehHhHHHHHhcch----hHHHHHHHHHHHHH Confidence 1111111222333433 22 34666653 688888888898888888876665432 46888888888888 Q ss_pred HHhhhhheEeeeeccccceeeeeecCCceeecccCCc---------ccccCCHHHHHHHHHHHHHHHHhcCCceecCCeE Q lcl|NC_019525. 154 WDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTK---------SIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKF 224 (343) Q Consensus 154 ~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~---------~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl 224 (343) +...+|+..++|+-. ....|++|++........... .-...+.++|++-++.+.... . ....+ T Consensus 242 i~~~~d~~~l~G~G~-~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~------~-~~a~~ 313 (425) T protein:vir:10 242 FAKQEGKAFLAGDGT-NKPNGLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPSAF------T-GNARF 313 (425) T ss_pred HHHHHHhhhhcccCC-CCcceeeeccccccccccccccccccccccccccccHHHHHHHHhhhhhhh------c-cCCEE Confidence 889999999999543 346799998875443221111 112334555554444332222 1 23367 Q ss_pred EeCHHHHHHHhccccCCCcchhhhh-HHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecCCc Q lcl|NC_019525. 225 TIPESDYTGLAGAASADFPIKSTKQ-VLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPVD 303 (343) Q Consensus 225 ~lp~~~~~~L~~~~~s~~~~~tl~~-~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~ 303 (343) +|-|+.|..|..- -+..+.-|+. =+.+..+..-.|.|+.+.. ..-..+. +++.+++-+-+.-+. +--... 
T Consensus 314 vmn~~~~~~L~~l--kD~~G~~l~~~~~~~g~~~~l~G~PV~~~~-----~~p~~~~-~~~~i~~Gd~~~~~~-i~~~~~ 384 (425) T protein:vir:10 314 AMNRNTQRQVRKL--KDGQGNYLWQPSYVAGQPATLAGYPVTEVP-----DMPDVAA-NSTPILFGDFQQTYL-IIDRIG 384 (425) T ss_pred EEchHHHHHHHHh--hcCCCceeeccCccCCCCceecceeeEEec-----CcCCccC-CccEEEEEehhccEE-EEEecc Confidence 8999999998643 2433333321 1111111122355554432 2212222 334333322222111 111111 Q ss_pred hhhcc-hhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 304 YTSTL-ANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 304 ~~~l~-p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) ++.+. +.-..+ ...+-.+.|++|..+ .|++++.+-+++ T Consensus 385 ~~v~~d~~~~~~-~~~~~~~~r~d~~v~-~~~A~~~l~~~a 423 (425) T protein:vir:10 385 VRVLRDPYTAKP-YVLFYTTKRVGGGLL-NPEPMRAMKVAA 423 (425) T ss_pred eEEEecccccCC-cEEEEEEEEeccEee-cccceEEEEeec Confidence 11111 111112 233345678777655 499999999999 No 71 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=96.36 E-value=0.0006 Score=38.24 Aligned_cols=306 Identities=12% Similarity=0.044 Sum_probs=129.4 Q ss_pred CCceeeecCCc-h-HHHHHHHHhhhc--------------cchhh--hhhhhhhhhhHHHHHHhhhhhhhcccccccc-- Q lcl|NC_019525. 1 MKKFVIRNSKG-E-KILLNAQEAKIA--------------GVIQR--LCNDLGFEIDVTTLTTLMKKIIEQKFFEISP-- 60 (343) Q Consensus 1 ~~~~~~~~~~~-~-~~~~~a~~~~~~--------------~~~~~--~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~-- 60 (343) -++.+.+.... + ....++...-+. ..... 
...+....+++.+ .+...|++...+.-.- T Consensus 72 ~~~~~~~~~~~~~~~~~~~a~~~~l~~~~~~~~~~e~~~~~~~~a~~~~~~~~gg~liP~--~~~~~ii~~~~~~~~l~~ 149 (409) T protein:vir:45 72 QRQNLDPENNSQQDEKRAQVFDKWMRHGASELTSEERKALRELRAQGVAQDEKGGYTVPE--TFLAKVVEKMKSYGGIAS 149 (409) T ss_pred hcccCCCCCcchhhHHHHHHHHHHHHhhhhhccHHHHHHHHHHhhccCccCcCCceeccH--hHHHHHHHHHHhhhhhhh Confidence 00000000000 0 000000000000 00000 0111111233322 1223344433222222 Q ss_pred -hhhcceecCCCCceeEEEeeecc-chhhhhhcccccCCccccccccccccccceeeeeEEEEee-EeecHHHHHHHHHh Q lcl|NC_019525. 61 -ADYMPIRVGEGAWSTMLTTYRSF-SLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKT-IGWTLPELAEAMKS 137 (343) Q Consensus 61 -~~~~pv~~~~~~w~~~~~~~~~v-g~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~-~~ys~~EL~~A~~~ 137 (343) .+.+|+.... ...+...+.. ..+.+++.+ ..+|.-+..+.......+..... +.+|.+=|+.+. T Consensus 150 ~~~~~~~~~~~---~~~~~~~~~~~~~~~~v~E~--------~~~~~~~~~f~~~~l~~~k~~~~~i~is~ell~ds~-- 216 (409) T protein:vir:45 150 VAQILTTSDGR---TMEWATADGTSEVGVLLGEN--------EEAGEEDTDFGMGSLGALKMTSKIIRVSNELLQDSA-- 216 (409) T ss_pred hceeeecCCCc---eEEEEeeccCcccccccccc--------ccccccccccceeeeeeeeeeeeehhhhHHHHhccH-- Confidence 2334432111 1111112211 233444332 24677777777777665554333 456665555432 Q ss_pred cCCCcHHHHHHHHHHHHHhhhhheEeeeecc--ccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcC Q lcl|NC_019525. 138 GNWDLITALEKSRKKNWDLGIQEIAFVGMKD--NANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGC 215 (343) Q Consensus 138 Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~--~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s 215 (343) .+|.+--......++...+++..++|+.. .....|+++++.....++. -...|.++|++-++.+......+. 
T Consensus 217 --~~l~~~i~~~la~a~~~~~~~a~l~G~G~~~~~~p~Gil~~~~~~~~~~~----~~~~~~d~i~~l~~~l~~~~~~~a 290 (409) T protein:vir:45 217 --IDMEAYLARRIAERIGRGEARYLIQGTGAGTPKQPKGLAASVTGTTQTAA----ANAVKWQEILALKHSIDPAYRRGP 290 (409) T ss_pred --HHHHHHHHHHHHHHHHHHHHHHhhccCCCCCccccceeeecccccccccc----ccccchHHHHHHHHhhhhhhccCC Confidence 36777777777888888889999999532 2246799988764332221 123456666666665544433332 Q ss_pred CceecCCeEEeCHHHHHHHhccccCCCcchhhhh-HHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEE-EcCc Q lcl|NC_019525. 216 DYTAMPNKFTIPESDYTGLAGAASADFPIKSTKQ-VLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALY-NDNE 293 (343) Q Consensus 216 ~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~-~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y-~~d~ 293 (343) .+ .+++.+..|..|..- .++.+..++. -..+..+..-.|.|+.+.. ++. ..+. +++. |+| +-+. T Consensus 291 ~~-----~~~~n~~~~~~l~~l--kd~~G~~i~~~~~~~~~~~~l~G~PV~~~~--~~p---~~~~-~~~~-i~~Gd~~~ 356 (409) T protein:vir:45 291 KF-----RLAFNDNTLKLISEM--EDGQGRPLWLPDIVGVAPASVLNVPYVIDQ--EID---DIGA-GKKF-MFCGDFDR 356 (409) T ss_pred eE-----EEEECHHHHHHHHHh--hcCCCceeeccCcCCCCCceecceeeEEec--CcC---CccC-CccE-EEEeehhh Confidence 22 256677888887532 3444433321 1111111222355554321 122 1222 2233 333 2222 Q ss_pred ceEEEecCCchhhcc-hhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 294 DSLRMDIPVDYTSTL-ANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 294 ~~v~~~iP~~~~~l~-p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) =.+...-++....+. +.-..+ ...+-++.|+++. 
+..|.+++.+-+.- T Consensus 357 ~~i~~~~~~~~~~~~d~~~~~~-~~~~~~~~r~d~~-~~~~~A~~~l~~k~ 405 (409) T protein:vir:45 357 FIIRRVRYMILKRLVERYAEYD-QTGFLAFHRFDCI-LEDTSAIKALVGKG 405 (409) T ss_pred hheeeccceEEEEeecccccCC-cEEEEEEEEeccE-eechhheEEEEecc Confidence 111111122222111 111112 2223456788766 88899999888755 No 72 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=96.25 E-value=0.00082 Score=37.49 Aligned_cols=289 Identities=11% Similarity=0.024 Sum_probs=129.8 Q ss_pred CCceeeecCCc-hHHHHHHHHhhhcc-chh-------hhhhhhhhhhhHHHHHHhhhhhhhccccccc---chhhcceec Q lcl|NC_019525. 1 MKKFVIRNSKG-EKILLNAQEAKIAG-VIQ-------RLCNDLGFEIDVTTLTTLMKKIIEQKFFEIS---PADYMPIRV 68 (343) Q Consensus 1 ~~~~~~~~~~~-~~~~~~a~~~~~~~-~~~-------~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~---~~~~~pv~~ 68 (343) -++....+.+- +....++...-+.. ... ....+.| +++.+ .+...|++...+.-. ....+|+.+ T Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg--~~iP~--~~~~~ii~~~~~~~~l~~~~~~~~~~~ 150 (397) T protein:vir:49 75 EKKPLTKNEEEVKANFVKDFKNLVRGRYQNLLDSKTDGSGSDAG--LTIPQ--DIRTAINTLVRQFDSLQEYVNVENVTT 150 (397) T ss_pred ccccccchhhHHHHHHHHHHHHHhhcchhhHHHhhhccCCccCc--ceecH--HHHHHHHHHHHhhhhHhhhcceeeccC Confidence 01111111100 00011111100000 000 0112222 22221 222334443223322 334455443 Q ss_pred CCCCceeEEEeeec-cchhhhhhcccccCCcccccccccc-ccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHH Q lcl|NC_019525. 69 GEGAWSTMLTTYRS-FSLAEDFATGIIDTGNSNGKLAAVD-TGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITAL 146 (343) Q Consensus 69 ~~~~w~~~~~~~~~-vg~a~~ia~g~~~~g~~a~Dip~vd-~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k 146 (343) .. +...+..... .+.|.+++.+ ..+|.-+ ..+++.+...+.++.-+.+|.+=|+.+. 
++|.+-- T Consensus 151 ~~--~~~~~~~~~~~~~~a~~v~E~--------~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~----~~l~~~i 216 (397) T protein:vir:49 151 LT--GSRVYEKWADITGLAKLDDEG--------GQIGQNDDPKLSLIRYAIKRYAGISTVTNSLLADSA----ENILAWL 216 (397) T ss_pred Cc--ceEEEEeeccCCcceeeeccc--------cccccccccceeeeEeeeeeeEeehhhHHHHHhhhh----HHHHHHH Confidence 32 2222322222 2456666533 2455544 3578888888998888887776555432 4677777 Q ss_pred HHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEe Q lcl|NC_019525. 147 EKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTI 226 (343) Q Consensus 147 ~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~l 226 (343) ...+.+++...+|+-.+.|+.... |. . ...|.+. |.+++..+... + ..+..++| T Consensus 217 ~~~l~~~~~~~~d~ail~G~g~~~--------~~------~-----~~~~~d~----i~~~~~~l~~~--~-~~~a~~v~ 270 (397) T protein:vir:49 217 SGWIAKKVVVTRNKAILEAIGTLP--------NK------P-----TLAKWDD----IIDLQAKVDPA--I-KQTSLFLT 270 (397) T ss_pred HHHHHHHHHHHHHHHHHhcccccc--------cc------c-----cccCHHH----HHHHHHhhhhh--h-cCCCEEEE Confidence 788888888888998999943211 10 0 1123344 44444444322 2 24568999 Q ss_pred CHHHHHHHhccccCCCcchhhhh-HHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEe--cCCc Q lcl|NC_019525. 227 PESDYTGLAGAASADFPIKSTKQ-VLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMD--IPVD 303 (343) Q Consensus 227 p~~~~~~L~~~~~s~~~~~tl~~-~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~--iP~~ 303 (343) .|..|..|..-+ ++.+.-++. -+.+.....-.|.|+.+....++. .+.+++.. ++|-.=.+.+.+- -.+. 
T Consensus 271 n~~~~~~l~~lk--d~~g~~l~~~~~~~g~~~~l~G~pV~~~~~~~~~----~~~~~~~~-~~~gd~~~~~~~~~~~~~~ 343 (397) T protein:vir:49 271 NTSGFTALKKVK--NAMGDYLMERDVKSPTGYSIDGFVVKEISDRFLP----NGTGGAMP-LYFGDLKQAVTLFDRQHLS 343 (397) T ss_pred cHHHHHHHHHhh--ccCCceeecccccCCCCceecceeeEEecccccc----cccCCcee-EEEeeccceEEEEeecccE Confidence 999999997543 433333221 111222222346665544332221 11222222 3333222222111 1112 Q ss_pred hhhcc----hhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 304 YTSTL----ANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 304 ~~~l~----p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) +.... -..++ ...+-++.|+++. ++.|.+++++.+-- T Consensus 344 i~~~~~~~~~~~~~--~~~~~~~~r~d~~-~~~~~a~~~~~~~~ 384 (397) T protein:vir:49 344 LLSTNIGGGAFETD--TTKVRVIDRFDVV-STDTEAFVPASFKA 384 (397) T ss_pred EEEeccccchhhcC--eeeEEEEEeeccE-EecccceEEEEecc Confidence 22111 01232 2223468889886 68899999998655 No 73 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=96.18 E-value=0.00091 Score=37.27 Aligned_cols=288 Identities=9% Similarity=-0.093 Sum_probs=132.6 Q ss_pred HHHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEeeeccchhhhhhccccc Q lcl|NC_019525. 16 LNAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTTYRSFSLAEDFATGIID 95 (343) Q Consensus 16 ~~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~~~~vg~a~~ia~g~~~ 95 (343) |..+.. ........+.|......++ ...|++...+.-.-+++.++..-. 
.....+-..+.-+.|++++.+ T Consensus 1 ma~~~~---~~~~~~~t~~gg~lip~~~---~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~E~--- 70 (304) T protein:vir:10 1 MATPTY---TPGNVILSDFKNGVIPAEQ---GTLIMKDIMANSAIMKLAKNEPMT-AQKKKFTYLAKGVGAYWVSET--- 70 (304) T ss_pred Cccccc---ccccccccCCCceecchhH---HHHHHHHHHhccchhhhcceeecc-CCceEEEEEeCCcceEEeecC--- Confidence 111111 1111111222222222322 233444333333333444432211 122233334434445555432 Q ss_pred CCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeeccccceeee Q lcl|NC_019525. 96 TGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGL 175 (343) Q Consensus 96 ~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GL 175 (343) ..+|..+..+++.....++++.-+.+|.+=|+.+. .+|.+.-.+...+++...+|+.+++|+.... -.|. T Consensus 71 -----~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~-~~~~ 140 (304) T protein:vir:10 71 -----ERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWTA----KDFFNEVKPLIAEAFYKAFDQAVIFGTKSPY-NTST 140 (304) T ss_pred -----cccccccceeeEEEEEEEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHhhheeccCCCc-cccc Confidence 35788888899999999999998888876555332 4688888888889999999999999965432 2343 Q ss_pred eecCCceeecc-cCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCCCcchhhhhHHHhh Q lcl|NC_019525. 176 LTQTGNVVNNT-FLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTKQVLEDT 254 (343) Q Consensus 176 lN~p~v~~~~a-~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~~l~~n 254 (343) +....++..+. ..+....+. 
..+||.+++..+...- ..+..++|-++.|..|..-+ ++.+.-++ ..+ T Consensus 141 ~~~~~~~~~~~~~~~~~~~~~----~~~~i~~~~~~l~~~~---~~~~~~v~~~~~~~~L~~lk--d~~G~~l~---~~~ 208 (304) T protein:vir:10 141 SGKPLVEGAEEKGNVVTDTNN----LYVDLSALMATIEDEE---LDPNGVLTTRSFRSKMRNAL--DANDRPLF---DAN 208 (304) T ss_pred ccccccccccccccccccccc----hHHHHHHHHHHhhhcc---CCcCEEEEcHHHHHHHHHhh--ccCCcEee---cCC Confidence 33333222111 111122223 3556666666654322 34668999999999996433 43333221 111 Q ss_pred cchhcCCcceEEeechhhhhccccc---CCCccEEEEEEcCcceEEEecCCchh--hcc----------hhhcCCceEEe Q lcl|NC_019525. 255 FKEITRNSSFEILPCVYADKITAQV---PAVAKRYALYNDNEDSLRMDIPVDYT--STL----------ANSVNNFQFQN 319 (343) Q Consensus 255 ~~~~~~g~~l~I~~~~~~~~~~~~g---~gg~drmv~Y~~d~~~v~~~iP~~~~--~l~----------p~~~~~l~~~v 319 (343) . ..-.|.|+.+.+ .+....+.+ .|..+.+++..+ +-+++.+--... ... ..+.+- ..+ T Consensus 209 ~-~~l~G~PV~~~~--~~~~~~~~~~~~~gd~~~~~~~~~--~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~--~~~ 281 (304) T protein:vir:10 209 G-NEIMGLPLSYTG--ADVYDKKKSLALMGDWDYARYGIL--QGIEYAISEDATLTTLQASDASGQPVSLFERDM--FAL 281 (304) T ss_pred C-ccccceeeEEec--ccccCCCCcEEEEEehhhEEEEEe--cceEEEEeecceeeeecccccCccchhhhhcCc--EEE Confidence 1 112366654332 111100000 001111111111 111221100000 000 012221 223 Q ss_pred ceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 320 AAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 320 ~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) -+++|+++..++ |++++.+=..= T Consensus 282 r~~~r~~~~v~~-~~a~~~l~~a~ 304 (304) T protein:vir:10 282 RATMHIAYMNVK-PEAFATLKPTE 304 (304) T ss_pred EEEEEeccEeec-ccceEEEEecC Confidence 457778776555 88876542211 No 74 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=96.18 E-value=0.00091 Score=37.27 Aligned_cols=288 Identities=9% Similarity=-0.093 Sum_probs=132.6 Q ss_pred HHHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEeeeccchhhhhhccccc Q lcl|NC_019525. 
16 LNAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTTYRSFSLAEDFATGIID 95 (343) Q Consensus 16 ~~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~~~~vg~a~~ia~g~~~ 95 (343) |..+.. ........+.|......++ ...|++...+.-.-+++.++..-. .....+-..+.-+.|++++.+ T Consensus 1 ma~~~~---~~~~~~~t~~gg~lip~~~---~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~ip~~~~~~~a~~v~E~--- 70 (304) T protein:vir:94 1 MATPTY---TPGNVILSDFKNGVIPAEQ---GTLIMKDIMANSAIMKLAKNEPMT-AQKKKFTYLAKGVGAYWVSET--- 70 (304) T ss_pred Cccccc---ccccccccCCCceecchhH---HHHHHHHHHhccchhhhcceeecc-CCceEEEEEeCCcceEEeecC--- Confidence 111111 1111111222222222322 233444333333333444432211 122233334434445555432 Q ss_pred CCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeeccccceeee Q lcl|NC_019525. 96 TGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGL 175 (343) Q Consensus 96 ~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GL 175 (343) ..+|..+..+++.....++++.-+.+|.+=|+.+. .+|.+.-.+...+++...+|+.+++|+.... -.|. T Consensus 71 -----~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~-~~~~ 140 (304) T protein:vir:94 71 -----ERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWTA----KDFFNEVKPLIAEAFYKAFDQAVIFGTKSPY-NTST 140 (304) T ss_pred -----cccccccceeeEEEEEEEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHhhheeccCCCc-cccc Confidence 35788888899999999999998888876555332 4688888888889999999999999965432 2343 Q ss_pred eecCCceeecc-cCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCCCcchhhhhHHHhh Q lcl|NC_019525. 176 LTQTGNVVNNT-FLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTKQVLEDT 254 (343) Q Consensus 176 lN~p~v~~~~a-~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~~l~~n 254 (343) +....++..+. ..+....+. 
..+||.+++..+...- ..+..++|-++.|..|..-+ ++.+.-++ ..+ T Consensus 141 ~~~~~~~~~~~~~~~~~~~~~----~~~~i~~~~~~l~~~~---~~~~~~v~~~~~~~~L~~lk--d~~G~~l~---~~~ 208 (304) T protein:vir:94 141 SGKPLVEGAEEKGNVVTDTNN----LYVDLSALMATIEDEE---LDPNGVLTTRSFRSKMRNAL--DANDRPLF---DAN 208 (304) T ss_pred ccccccccccccccccccccc----hHHHHHHHHHHhhhcc---CCcCEEEEcHHHHHHHHHhh--ccCCcEee---cCC Confidence 33333222111 111122223 3556666666654322 34668999999999996433 43333221 111 Q ss_pred cchhcCCcceEEeechhhhhccccc---CCCccEEEEEEcCcceEEEecCCchh--hcc----------hhhcCCceEEe Q lcl|NC_019525. 255 FKEITRNSSFEILPCVYADKITAQV---PAVAKRYALYNDNEDSLRMDIPVDYT--STL----------ANSVNNFQFQN 319 (343) Q Consensus 255 ~~~~~~g~~l~I~~~~~~~~~~~~g---~gg~drmv~Y~~d~~~v~~~iP~~~~--~l~----------p~~~~~l~~~v 319 (343) . ..-.|.|+.+.+ .+....+.+ .|..+.+++..+ +-+++.+--... ... ..+.+- ..+ T Consensus 209 ~-~~l~G~PV~~~~--~~~~~~~~~~~~~gd~~~~~~~~~--~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~--~~~ 281 (304) T protein:vir:94 209 G-NEIMGLPLSYTG--ADVYDKKKSLALMGDWDYARYGIL--QGIEYAISEDATLTTLQASDASGQPVSLFERDM--FAL 281 (304) T ss_pred C-ccccceeeEEec--ccccCCCCcEEEEEehhhEEEEEe--cceEEEEeecceeeeecccccCccchhhhhcCc--EEE Confidence 1 112366654332 111100000 001111111111 111221100000 000 012221 223 Q ss_pred ceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 320 AAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 320 ~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) -+++|+++..++ |++++.+=..= T Consensus 282 r~~~r~~~~v~~-~~a~~~l~~a~ 304 (304) T protein:vir:94 282 RATMHIAYMNVK-PEAFATLKPTE 304 (304) T ss_pred EEEEEeccEeec-ccceEEEEecC Confidence 457778776555 88876542211 No 75 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=96.08 E-value=0.0008 Score=37.57 Aligned_cols=302 Identities=12% Similarity=0.107 Sum_probs=126.5 Q ss_pred CCceeeecCCchHHHH------HHHH----------hhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhc Q lcl|NC_019525. 
1 MKKFVIRNSKGEKILL------NAQE----------AKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYM 64 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~------~a~~----------~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~ 64 (343) .+++...+.+ +.+.. +... .............++..+++.+ .+...|++...+.-.-+.++ T Consensus 96 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~--~~~~~Ii~~l~~~~~i~~~~ 172 (425) T protein:vir:95 96 SRQKMQGSKG-DVVEMNRLQVREMLKTGEYYKRSEVVEFYEKFRNLRAVAGGELTIPE--VVVNRIMDIMGDYTTLYPLV 172 (425) T ss_pred hhhhhhhhhh-hHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhcccccCceeccH--HHHHHHHHHHHhhhhHHHhh Confidence 1222111111 10000 0000 0000011111112222333332 23444555432222222333 Q ss_pred ceecCCCCceeEEEeeeccchhhhhhcccccCCccccccccccc-cccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcH Q lcl|NC_019525. 65 PIRVGEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVDT-GVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLI 143 (343) Q Consensus 65 pv~~~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~-~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~ 143 (343) .+.+..+ ...+-.....+.|.+++.+ ..+|.-+. .+++.+...+.++.-+.+|-+=|..+. .+|. T Consensus 173 ~~~~~~g--~~~ip~~~~~~~a~~v~E~--------~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~----~~l~ 238 (425) T protein:vir:95 173 DKIRVKG--TTRILVDTDTSPATWIEQS--------GALPTGDVGTIASIDFDGFKVGKVTFVDNYLLQDSI----INLD 238 (425) T ss_pred ceeecCc--eeEEEEecCCccccccccc--------cccccccccccceeeeeheeeeeeehhhHHHHhccH----HHHH Confidence 2221111 1122222334555555543 24666664 478888888888887777776555432 3578 Q ss_pred HHHHHHHHHHHHhhhhheEeeeeccccc-eeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCC Q lcl|NC_019525. 144 TALEKSRKKNWDLGIQEIAFVGMKDNAN-VKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPN 222 (343) Q Consensus 144 ~~k~~aar~a~~~~~n~v~~~G~~~~~g-~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~ 222 (343) +--......++...+|+..+.|+....+ -.|++++..... .... .-.+.|.++ +.+++..+..... 
...+ T Consensus 239 ~~i~~~l~~~i~~~~d~~il~G~G~~~~~p~Gil~~~~~~~--~~~~-~~~~~~~~~----~~~~~~~~~~~~~--~~~~ 309 (425) T protein:vir:95 239 DYVTKKIARAIAKALDLAIVKGTGAANKQPLGIIPSLPPEN--QVTV-EADNNLLKN----LVKQIGLIDTGDD--SVGE 309 (425) T ss_pred HHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccc--cccc-ccccchHHH----HHHHHHhhhhhcc--ccCc Confidence 8888888888899999999999533222 358887633211 1111 112334444 4444444332222 1122 Q ss_pred -eEEeCHH-HHHHHhc-cccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhccccc--CCCccEEEEEEcCcceEE Q lcl|NC_019525. 223 -KFTIPES-DYTGLAG-AASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQV--PAVAKRYALYNDNEDSLR 297 (343) Q Consensus 223 -tl~lp~~-~~~~L~~-~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g--~gg~drmv~Y~~d~~~v~ 297 (343) .++|-+. .|..|.. +..-+..+.-++.. .+.....-.|.|+ .++....... -|.-..+++..+ .-+. T Consensus 310 ~~~v~~~~~~~~~l~~l~~~kd~~g~~i~~~-~~~~~~~l~G~pv-----v~~~~~~~~~i~~Gd~~~~~~~~~--~~~~ 381 (425) T protein:vir:95 310 IVAVMKRSTYYNRLVEFSIQVDSNGNVVGKL-PNLRTPDLLGLRV-----VFNNFLDDDTVLFGEFEQYTLVER--ENIT 381 (425) T ss_pred eEEEEeChHHHHHHHHHHhhcCCCCceeecc-CCCCCccccceee-----EEcCcCCCccEEEEecccEEEEee--cceE Confidence 2344444 4554532 22334444433211 1100000124443 2222222110 011111222222 2222 Q ss_pred EecCCchhhcchhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 298 MDIPVDYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 298 ~~iP~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) +.+-.... 
..++...+ -++.|++| .++.|+++++++|.- T Consensus 382 i~~~~~~~----f~~~~~~~--~~~~r~d~-~~~~~~a~~~~~i~~ 420 (425) T protein:vir:95 382 IDSSTHVK----FTEDQTAF--RGKGRFDG-KPVKPEAFVLVTITD 420 (425) T ss_pred EEeecccc----cccCceEE--EEEEeeCc-EeecccceEEEEecC Confidence 22211100 11222223 34667775 788899999999855 No 76 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=95.98 E-value=0.0012 Score=36.65 Aligned_cols=269 Identities=10% Similarity=-0.015 Sum_probs=129.4 Q ss_pred HHHHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccc---cchhhcceecCCCCceeEEEeee-ccchhhhhh Q lcl|NC_019525. 15 LLNAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEI---SPADYMPIRVGEGAWSTMLTTYR-SFSLAEDFA 90 (343) Q Consensus 15 ~~~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l---~~~~~~pv~~~~~~w~~~~~~~~-~vg~a~~ia 90 (343) +++++.. .....|..+.-.++ ...|++...+.- ...+.+|+.+..+ ...+.... ..+.|++++ T Consensus 1 ~l~~~~~--------~t~~~gg~liP~~~---~~~Ii~~~~~~~~l~~~~~~~~~~~~~g--~~~~~~~~~~~~~a~~v~ 67 (293) T protein:vir:48 1 MLDSKTD--------HSGSDAGLTIPQDI---RTAINTLVRQYDSLQEYVNVENVTTLTG--SRVYEKWTDITGLANIDD 67 (293) T ss_pred Cceeecc--------cccCcCceEechhH---HHHHHHHHHhhhhhhhhceeeeccCCcc--eEEEEeecCCCcceeeec Confidence 3333332 11122322333332 233444322222 2334455544332 22232222 234566665 Q ss_pred cccccCCccccccccc-cccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeeccc Q lcl|NC_019525. 91 TGIIDTGNSNGKLAAV-DTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDN 169 (343) Q Consensus 91 ~g~~~~g~~a~Dip~v-d~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~ 169 (343) .+ ..+|.. +..+++.+...+..+.-+.+|.+=|+.+. .+|.+.-.+...+++...+|+-.+.|..+. 
T Consensus 68 Eg--------~~~~~~~~~~~~~i~l~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~la~~~~~~~~~~i~~g~~~~ 135 (293) T protein:vir:48 68 EA--------GKIADIDDPKLSLIKYTIKRYAGISTVTNSLLADSA----ENILAWLSGWIAKKVVVTRNKAILGVVDKL 135 (293) T ss_pred CC--------cccccccccceeEEEEeeeEEEEeehhhHHHHhhhh----HHHHHHHHHHHHHHHHHHHHhHHhhccccc Confidence 43 246654 35788899999999998888877776543 467777777778888888887666663221 Q ss_pred cceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCCCcchhhhh Q lcl|NC_019525. 170 ANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTKQ 249 (343) Q Consensus 170 ~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~ 249 (343) +......|.++|++-+ ..+... + .....++|.++.|..|..-+ ++.+.-++. T Consensus 136 -------------------~~~~~~~~~d~i~~~~----~~l~~~--~-~~~a~~vmn~~~~~~L~~lk--d~~g~~l~~ 187 (293) T protein:vir:48 136 -------------------PTKPTLTKWDDIIDLE----AKVDPA--I-KQTSFFLTNTSGFTALKKVK--NALGDYLME 187 (293) T ss_pred -------------------cccccccCHHHHHHHH----Hhhhhh--h-cCCCEEEEcHHHHHHHHHhh--ccCCceEee Confidence 0112234455555544 433221 2 12347899999999996533 333332221 Q ss_pred -HHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEe-cCCchhhcc----hhhcCCceEEeceee Q lcl|NC_019525. 250 -VLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMD-IPVDYTSTL----ANSVNNFQFQNAAYG 323 (343) Q Consensus 250 -~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~-iP~~~~~l~----p~~~~~l~~~v~~~~ 323 (343) -+.+.....-.|.|+.+.....+. .. ..++..+++.+-+.-++-+. -.+.+.... -.+++ ...+-+++ T Consensus 188 ~~~~~~~~~~l~G~Pv~~~~~~~~~---~~-~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~--~~~~r~~~ 261 (293) T protein:vir:48 188 RDVKSPTGYSIAGFAVKEISDRWLP---NA-SSGVMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETD--TTKVRVID 261 (293) T ss_pred cCcCCCCCceecceeeEEecccccC---Cc-cCCceEEEEEeccceEEEEEecceEEEEecccchhhhcC--eEEEEEEE Confidence 111222222246665543322211 11 12223333222221111110 011111111 11232 22344578 Q ss_pred eeccEEEEcCceEEeeecCC Q lcl|NC_019525. 
324 QFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 324 r~GGv~v~yP~a~~Y~D~~~ 343 (343) |+++. +++|.+++.+.+-- T Consensus 262 r~d~~-~~~~~a~~~l~~~~ 280 (293) T protein:vir:48 262 RFDVV-ATDTEAFVPASFKA 280 (293) T ss_pred eeCcE-EecccceEEEEeec Confidence 88886 57899999988655 No 77 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=95.97 E-value=0.0012 Score=36.65 Aligned_cols=298 Identities=9% Similarity=-0.034 Sum_probs=130.7 Q ss_pred CCceeee---cCCchHHHHHHHHhhh---------------ccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchh Q lcl|NC_019525. 1 MKKFVIR---NSKGEKILLNAQEAKI---------------AGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPAD 62 (343) Q Consensus 1 ~~~~~~~---~~~~~~~~~~a~~~~~---------------~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~ 62 (343) ++..... +.+.+.....+..... ...........| .+++.+ +.+...+++...+....++ T Consensus 313 i~a~a~~~~~~a~~~~e~a~~~a~~~G~~arg~~~~~~~l~~ra~~~~t~~~g-g~lvp~-~~~~~~iie~lr~~s~i~~ 390 (632) T protein:vir:96 313 INAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTAGKG-GELVAT-ELLSEEFIDILRNKAIIGQ 390 (632) T ss_pred HHhhhccchhhhhhhhHHHHHHHHhhhhhhhhhhhhHHHHHHhhhhccccccc-cccccc-ccchHHHHHHHhhcchhhh Confidence 1111111 1111111111100000 000000011111 122211 1111223333222222222 Q ss_pred ----hcceecCCCCceeEEEeeeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhc Q lcl|NC_019525. 63 ----YMPIRVGEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSG 138 (343) Q Consensus 63 ----~~pv~~~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~G 138 (343) .+|.. .+. -.+-....-+.+.+++.+ ..+|.-+..+++.+...+.++.-+.+|.+=|..+ . 
T Consensus 391 l~~~~~~~~--~g~--~~ip~~~~~~~a~wv~E~--------~~~~~s~~~f~~i~l~~~k~~~~v~iS~ell~ds---~ 455 (632) T protein:vir:96 391 MGARMLPGL--VGD--VDIPKKTSGANFYWIGED--------EDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQS---S 455 (632) T ss_pred hcceEeecC--Ccc--eEEEEEeCCceeEeecCC--------ccccccccceeeEEeeeeEEEEehhhHHHHHhcc---c Confidence 22321 111 112112222333344322 3577778889999999999998888887666643 2 Q ss_pred CCCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCce Q lcl|NC_019525. 139 NWDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYT 218 (343) Q Consensus 139 r~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~ 218 (343) .++++--......++...+|..+++|+-......|++|..++...+.+ ....|.+. +.++...+...... T Consensus 456 -~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~----~~~~~~~~----i~~~~~~i~~~~~~- 525 (632) T protein:vir:96 456 -IHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYP----AGGVDWAS----VVDMETKISTFNAD- 525 (632) T ss_pred -hHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccceeeecccccceecc----cccCCHHH----HHHHHHHHhhcccc- Confidence 467777788888888899999999995433336799999887543221 12233333 33343333322211 Q ss_pred ecCCeEEeCHHHHHHHhccccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEE Q lcl|NC_019525. 219 AMPNKFTIPESDYTGLAGAASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRM 298 (343) Q Consensus 219 ~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~ 298 (343) ..+...+|-+..+..|......+..+.-|+ +.+ .-.|.|..+... +. .++.-.|.-..+++... 
.-+.+ T Consensus 526 ~~~~~~~~~~~~~~~l~~~~l~d~~G~~i~---~~~---~l~G~pv~~s~~--ip-~~~~~~gd~s~~~i~~~--~~~~i 594 (632) T protein:vir:96 526 AGRLAYLTSVTQRGAAKKAQVFDNTGERIW---QNN---EVNGYRAEASNQ--IP-ADTWIFGDWSQIVIAMW--GVLDL 594 (632) T ss_pred cCccEEEEchhHHHHHHHHhccCCCCceee---cCC---eecccceEeccc--cc-cCcEEEeecceEEEEEe--cceEE Confidence 112346777777777765444444333332 121 224666543221 11 11111122222332222 22333 Q ss_pred ecCCchhhcchhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 299 DIPVDYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 299 ~iP~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) .+ ...... ..+...+ -++.+++ +.+++|+++++.-.-- T Consensus 595 ~~--~~~~~~--~~~~v~~--~~~~~~d-~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 595 KV--DPYTKA--ASDGLVL--RVFQDVD-AGVRRKEAFCIAKKGA 632 (632) T ss_pred EE--cccccc--ccCceEE--EEEeecC-ceeechhhhhheeecC Confidence 22 111111 1222222 3455554 4788888887766666 No 78 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=95.72 E-value=0.0016 Score=35.96 Aligned_cols=306 Identities=10% Similarity=0.012 Sum_probs=139.8 Q ss_pred eeeecC-CchHHHHHHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEeeec Q lcl|NC_019525. 4 FVIRNS-KGEKILLNAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTTYRS 82 (343) Q Consensus 4 ~~~~~~-~~~~~~~~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~~~~ 82 (343) +...|+ ++.++. +..++ ++ ....+|+..--.+++.+-..+.|. -| .++...|.+....+... .+. 
T Consensus 1 ~~~~~~~~~~~~~-~~~k~-~t-----~~d~~Gg~l~P~~~~~~i~~~~e~-s~---~l~~~~vi~~~~~~~~~---i~~ 66 (315) T protein:vir:41 1 MLTIEDIRGGKPF-EIVPK-ID-----VPDLGRGVLSVDRFGEFVKAVRDS-AV---IIPEARIDNALKSYEKD---ISR 66 (315) T ss_pred CcccchhhcCChh-hhhhh-cC-----CcCCCCceechHHHHHHHHHHHhh-hh---hhhhceeeecccccccc---ccc Confidence 332222 111111 11111 11 111234444444555544445442 12 23333332111112222 233 Q ss_pred cchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheE Q lcl|NC_019525. 83 FSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIA 162 (343) Q Consensus 83 vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~ 162 (343) .|....+.+|..+.|+. ++.+..+..+.......+....-...|.+.|+.++ -| .++.+.-.+...+++...++... T Consensus 67 ~g~~~~~~~g~~~~~~~-~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~-~~-~~~e~~l~~~~a~~~a~~~~~~~ 143 (315) T protein:vir:41 67 LSLVLDVGPGRDETGQK-LAPPESTAEVKTNTLYMREMVTKVVIHEDAIEDNI-EG-KAFEQKIVTLLGEGISYVLEKYY 143 (315) T ss_pred cccCcccccccccccCc-CCCCCCccccceeeeceeeeeeeccccHHHHHhhh-cc-ccHHHHHHHHHHHHHHHHHHHHh Confidence 44444444454455554 35666777888888888888877788888888654 34 58999999999999999999999 Q ss_pred eeeeccc-----cceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhcc Q lcl|NC_019525. 163 FVGMKDN-----ANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGA 237 (343) Q Consensus 163 ~~G~~~~-----~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~ 237 (343) ++|+... +-..|++++....+.+...+..-.+. +.+.+.|+...+..-....+. 
--.++|..+.+..+-.- T Consensus 144 ~nGdg~s~~p~~~~~~G~l~~a~~~~~~~~~~~~a~~~-~~d~l~~l~~sl~~~yr~~~~---~~~~imn~~t~~~~rkl 219 (315) T protein:vir:41 144 LHGDTSSSDPLLRMSDGWLKLASEKLTESDVDPEAEDW-PMNLFDTMIESLPTPYRNNLP---NMKFYVTWDIYRAYRDA 219 (315) T ss_pred hccCCcCcCccccccccceecccccccccccccccccc-cHHHHHHHHHhcChHHhhcCC---ceEEEEcHHHHHHHHHH Confidence 9995321 11358888776544333222222222 334444554444433332221 11467777777665322 Q ss_pred ccCCCcchhhhh-HHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecCCchhhcc-hhhcCCc Q lcl|NC_019525. 238 ASADFPIKSTKQ-VLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPVDYTSTL-ANSVNNF 315 (343) Q Consensus 238 ~~s~~~~~tl~~-~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~~~l~-p~~~~~l 315 (343) + +...+-++. .+....+..--|.|+.+.+ .....+.+. . .+++.. ++++-+-+-...+..+ +... .- T Consensus 220 k--~~~g~~lw~~~~~~g~~~tl~G~PV~~~~-----~m~~~~~~~-~-~ilf~d-~~nl~~~~~~~i~i~~~~~a~-~~ 288 (315) T protein:vir:41 220 L--KGRETGLGDQALTGANSILYDGRPVQYVP-----ALEALNDGK-S-RALFVV-PTQLVYGFWRNIKVVPDYDAE-MR 288 (315) T ss_pred h--ccCCCccccchhhcCCCceecccceEecc-----cccccCCCC-c-cEEEec-ccceEEEeccccEEEeeecCC-CC Confidence 2 222333322 1222112222355554433 333322221 1 223322 3433322222222211 0111 11 Q ss_pred eEEeceeeeeccE-EEEcCceEEeeec Q lcl|NC_019525. 
316 QFQNAAYGQFTGV-LAYRPKELLYLDI 341 (343) Q Consensus 316 ~~~v~~~~r~GGv-~v~yP~a~~Y~D~ 341 (343) .+..-...|+++- .+.--.++.++=| T Consensus 289 ~~~~~~~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 289 LTKYVASLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred ceEEEEEEEeceeEEeccceeEeeeeC Confidence 1222234566653 3333345566655 No 79 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=95.57 E-value=0.0018 Score=35.60 Aligned_cols=301 Identities=10% Similarity=-0.006 Sum_probs=140.0 Q ss_pred chHHHHHHHHhhhccchhhhhhhhhhhhhH-HHHHHhhhhhhhcccccccchhhccee-cCCCCceeEEEeeeccchhhh Q lcl|NC_019525. 11 GEKILLNAQEAKIAGVIQRLCNDLGFEIDV-TTLTTLMKKIIEQKFFEISPADYMPIR-VGEGAWSTMLTTYRSFSLAED 88 (343) Q Consensus 11 ~~~~~~~a~~~~~~~~~~~~~~d~~~~f~~-~qL~~i~~~iye~~~~~l~~~~~~pv~-~~~~~w~~~~~~~~~vg~a~~ 88 (343) =| ++-+. ..|...+--...+ |+ .++ .|+..+-..+.+ +=..+++..|. +.. .....+. .+|.... T Consensus 1 ~~-~~~~~--~~~~k~it~~d~~-gG-~L~P~~~~~~i~~l~e----~s~i~~~a~vi~t~~-s~~~~i~---~i~~g~~ 67 (314) T protein:vir:41 1 MD-FLNKP--FQITPKIDVPDLG-KG-ILAVQRFGEFVREVRE----NSAIIKDARVLNALK-SYEVDIS---RISLGVE 67 (314) T ss_pred Cc-hhhhH--HHhhcccccccCC-Cc-eeChHHHHHHHHHHHh----ccchhhheeeecccC-ccceeec---ccccCcc Confidence 00 11111 1123333222222 32 343 333332222332 22233333332 211 1222221 2333444 Q ss_pred hhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeecc Q lcl|NC_019525. 89 FATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKD 168 (343) Q Consensus 89 ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~ 168 (343) +.++..|.|+. 
...|..+..++......+++..-+..|.+.|+.++ -| .+|.+.-.....+.+...+....++|+.+ T Consensus 68 ~~~~~~~~~~~-~~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a-~~-~~le~~i~~~~Ae~~g~~~~~~~~nGdg~ 144 (314) T protein:vir:41 68 LEPGRNTSGTK-VAPTADEVTVSTNTLEMKELVTKVVLEDEALEDNI-EQ-SAFEQTITSLLASGVTYDLECFFLHADSS 144 (314) T ss_pred cccccccccCC-ccCCcccccccceeeeeEEEEEeecccHHHHHhhh-ch-hhHHHHHHHHHHHHHHHHHHHHhhccccC Confidence 44444555544 35678888899999999999999999999998765 33 47999999888889999999999999632 Q ss_pred c-------cceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCC Q lcl|NC_019525. 169 N-------ANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASAD 241 (343) Q Consensus 169 ~-------~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~ 241 (343) . +-..|+++.....+.+.++ -+.+.+++.+.|+-..+...+...+.. -..+|.+..+..+-. ...+ T Consensus 145 ~~s~~~~~~~p~G~l~~a~~~~~~~~~---~~~~~~~~~~~~l~~sl~~~yr~~~~~---~~~~m~~~t~~~~r~-~l~~ 217 (314) T protein:vir:41 145 LTTGRELYRINDGWMKLAGNQYTDAEP---EDENWPLNLFDGMMDELDTRYLQLKPR---MKFYVSNEIYNGYRK-QLLV 217 (314) T ss_pred CcCcccchhcchhhhhhcccceeecCc---cccccHHHHHHHHHHhcCchhhcCCCc---eEEEecHHHHHHHHH-HHhc Confidence 1 1134888776554433321 223455666665555554444332211 146777777665532 2222 Q ss_pred Ccchhhhh-HHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecCCchhhcchhhcCCceEEec Q lcl|NC_019525. 242 FPIKSTKQ-VLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPVDYTSTLANSVNNFQFQNA 320 (343) Q Consensus 242 ~~~~tl~~-~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~~~l~p~~~~~l~~~v~ 320 (343) . .+.+.+ .+....+-.--|.|+.+.+ .....+. +++ .+++. 
|++++-+.+.+..+..+-.....-.+..- T Consensus 218 ~-~~~l~~~~~~~~~~~~l~G~PV~~~~-----~~~~~~~-~~~-~i~fg-d~~nlv~~~~~~ir~~~~~~a~~~~~~~~ 288 (314) T protein:vir:41 218 R-ETGLGDSALIGATGLQYDGIPIQYVP-----ALDALGD-DKA-RALLT-VPTNLVYGFWRNIRIEPKRDAAMRRTEYI 288 (314) T ss_pred c-CCcccchhhhCCCCceecceeeEecc-----cccccCC-CCc-eEEEe-chhheEEEeeceeEEeecccCcCCeEEEE Confidence 2 222222 1111111112255554433 2222222 222 23333 46666555555554432111111122222 Q ss_pred eeeeeccEEEEcCc-eEEeeecCC Q lcl|NC_019525. 321 AYGQFTGVLAYRPK-ELLYLDIPV 343 (343) Q Consensus 321 ~~~r~GGv~v~yP~-a~~Y~D~~~ 343 (343) ...|++....-.+. +.+++.-+= T Consensus 289 ~~~r~d~~~~~~~aa~~~~~~~~~ 312 (314) T protein:vir:41 289 ASLRADCNYEDENAAVAAVIDMSS 312 (314) T ss_pred EEEEeceEEEEcCcEEEEEeeccC Confidence 34444433333322 233333333 No 80 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=95.35 E-value=0.0022 Score=35.12 Aligned_cols=292 Identities=9% Similarity=-0.002 Sum_probs=132.6 Q ss_pred CCceeee--cCCc---hHHHHHHHHhhh-------cc-chhh---hhhhhhhhhhHHHHHHhhhhhhhcccccccchhh- Q lcl|NC_019525. 1 MKKFVIR--NSKG---EKILLNAQEAKI-------AG-VIQR---LCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADY- 63 (343) Q Consensus 1 ~~~~~~~--~~~~---~~~~~~a~~~~~-------~~-~~~~---~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~- 63 (343) +++-... +... .....++..... .. ..++ .....| .+.+. +.+...|++...+....+.+ T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~g-g~~vP--~~~~~~Ii~~~~~~~~l~~~~ 150 (408) T protein:vir:74 74 MREEEKGPLNKSENELKDKFVKDFVNMVRNPMAFLNTVSSKTETSGSDSAA-GLTIP--QDIRTMINTLVRQYDSLQQYV 150 (408) T ss_pred ccccccccccchhhhhHHHHHHHHHHHHhcchhhhhhhhhhhhcccccCCC-ceeec--hhHhhHHHHHHhhhcchhhhc Confidence 1000000 0000 000111100000 00 0000 111112 12222 23334555554444444444 Q ss_pred --cceecCCCCceeEEEeeecc-chhhhhhcccccCCccccccccc-cccccceeeeeEEEEeeEeecHHHHHHHHHhcC Q lcl|NC_019525. 
64 --MPIRVGEGAWSTMLTTYRSF-SLAEDFATGIIDTGNSNGKLAAV-DTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGN 139 (343) Q Consensus 64 --~pv~~~~~~w~~~~~~~~~v-g~a~~ia~g~~~~g~~a~Dip~v-d~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr 139 (343) +|+.+. .+...+...... +.+.+++. . .++|.. +..+++.....+.++.-+.+|.+=|+-+. T Consensus 151 ~~~~~~~~--~~~~~~~~~~~~~~~~~~v~E-------~-~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~---- 216 (408) T protein:vir:74 151 RVESVSTS--SGSRVYEKWTDVTPLKAMDEE-------D-GKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTA---- 216 (408) T ss_pred ceeeccCC--cceEEEEeecCCccccccccc-------c-cccccccccceeeEEeeeeeEEeeehhHHHHHhhch---- Confidence 444322 222222222222 22333332 2 356653 46889999999999999888877665432 Q ss_pred CCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCcee Q lcl|NC_019525. 140 WDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTA 219 (343) Q Consensus 140 ~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~ 219 (343) .+|.+--.....+++...+|+..+.|+.... | .-...|.+.+.+.++..+..... T Consensus 217 ~~l~~~i~~~l~~~~~~~~d~~il~G~G~~~--------~-----------~~~~~~~~~i~~~~~~~l~~~~~------ 271 (408) T protein:vir:74 217 ENILAWLSSWIAKKVVVTRNQAIIAAMGTVP--------K-----------KPTIANFDDVITMINTSVDPAII------ 271 (408) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcccccc--------c-----------ccccccHHHHHHHHHHhhhhhhc------ Confidence 3577777777788888888888888842211 0 01123456666555544433221 Q ss_pred cCCeEEeCHHHHHHHhccccCCCcchhhhh-HHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEE Q lcl|NC_019525. 220 MPNKFTIPESDYTGLAGAASADFPIKSTKQ-VLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRM 298 (343) Q Consensus 220 ~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~-~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~ 298 (343) ....++|.|..|..|.+-+ ++.+.-++. =+.......-.|.|+.+.+-.++. 
+.+ +++..+++.+-+.-++-+ T Consensus 272 ~~a~~v~n~~~~~~l~~lk--d~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~---~~~-~~~~~i~~gd~~~~~~~~ 345 (408) T protein:vir:74 272 ATSSLLTNQSGLNKLALVK--TAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLP---NSG-STVYPLYYGDMSQAITLF 345 (408) T ss_pred CCCEEEEcHHHHHHHHHhh--cCCCceEeccCcCCCCCceecceeeEEecCcccc---ccc-CCcceEEEEehhccEEEE Confidence 2346899999999997533 443433321 011122222346666554332222 221 122333333322211111 Q ss_pred -ecCCchhhc----chhhcCCceEEeceeeeeccEEEEcCceEEeeec-CC Q lcl|NC_019525. 299 -DIPVDYTST----LANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDI-PV 343 (343) Q Consensus 299 -~iP~~~~~l----~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~-~~ 343 (343) .-.+.+.+. ....++ ...+-+++|+++. ++.|.+++.+.+ || T Consensus 346 ~~~~~~i~~~~~~~~~f~~~--~~~~r~~~r~d~~-~~~~~a~~~~~~~~~ 393 (408) T protein:vir:74 346 DRENMSLLPTNIGAGAFETD--TTKIRVIDRFDVK-ATDSEALVAGSFTAI 393 (408) T ss_pred EecceEEEEeccccchhhcc--eeeEEEEEeeCcE-EecccceEEEEeecc Confidence 011111111 112232 2334578899886 777999999998 33 No 81 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=94.56 E-value=0.0023 Score=35.01 Aligned_cols=286 Identities=12% Similarity=0.030 Sum_probs=118.5 Q ss_pred CCc--------------------eeeecC---CchHHHHHHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhccccc Q lcl|NC_019525. 1 MKK--------------------FVIRNS---KGEKILLNAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFE 57 (343) Q Consensus 1 ~~~--------------------~~~~~~---~~~~~~~~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~ 57 (343) +.. |+...- .+.+...++... ...+. ...+++.-+++.+ .+...|++...+. 
T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~--~~al~-~~t~s~gG~~IP~--~~~~~Ii~~~~~~ 145 (387) T protein:vir:93 71 VKDTGEAYQSLNDHEKMVKAKAEFYRHAILPNEFEKPSMEAQRL--LHALP-TGNDSGGDKLLPK--TLSKEIVSEPFAK 145 (387) T ss_pred hhhccccCCCcchhhHHHHHHHHHHHHHhhhhhhhhhhhhhHHH--HHhhc-cCcCCCCceeech--hHHHHHHHHHHhh Confidence 110 100000 001111111100 00000 1112222233332 2233444432222 Q ss_pred ccchhhcceecCCCCceeEEEe-eeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHH Q lcl|NC_019525. 58 ISPADYMPIRVGEGAWSTMLTT-YRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMK 136 (343) Q Consensus 58 l~~~~~~pv~~~~~~w~~~~~~-~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~ 136 (343) -.-+.+..+.+..+ ..+.. ....+.+.+++.+ ...|..+..+++.....+.++.-+.+|.+=|+- T Consensus 146 ~~l~~~~~v~~~~~---~~~p~~~~~~~~a~~v~E~--------~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~D--- 211 (387) T protein:vir:93 146 NQLREKARLTNIKG---LEIPRVSYTLDDDDFITDV--------ETAKELKLKGDTVKFTTNKFKVFAAISDTVIHG--- 211 (387) T ss_pred chhhhheeeeecCC---ceEEEEeecCCccccccCc--------ccccccccccceeeeeheeeeeechhhHHHHhh--- Confidence 22233444332222 11111 1122334444432 246667778888999999998888888554443 Q ss_pred hcCCCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCC Q lcl|NC_019525. 137 SGNWDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCD 216 (343) Q Consensus 137 ~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~ 216 (343) ++ .+|.+--......++...++..+|.+.+....-.|.++++.++..+. 
..+.|.|.+-++.+-.....+ T Consensus 212 s~-~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~l~~~~~~~v~~-------~~~~d~i~~~~~~l~~~~~~~-- 281 (387) T protein:vir:93 212 SD-VDLVNWVENALQSGLAAKERKDALAVSPKSGLDHMSFYNGSVKEVEG-------ADMYDAIINALADLHEDYRDN-- 281 (387) T ss_pred hH-HHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccc-------cchHHHHHHHHhccChhhhcC-- Confidence 22 35666666666666666667766654333212358888776643221 123455554444433332222 Q ss_pred ceecCCeEEeCHHHHHHHhccccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceE Q lcl|NC_019525. 217 YTAMPNKFTIPESDYTGLAGAASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSL 296 (343) Q Consensus 217 ~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v 296 (343) ..++|-+..|..+.... .+.+. .+ +.. .+..--|.|+.+.. .....-.|.=.++.+ ..+ T Consensus 282 -----a~~~mn~~t~~~~~~~~-~d~~~-~~---~~~-~~~~llG~PV~~~~-----~~~~~~~GDf~~~~~---~~~-- 340 (387) T protein:vir:93 282 -----ATIYMRYADYVKIISVL-SNGTT-NF---FDT-PAEKVFGKPVVFTD-----AAVKPIVGDFNYFGI---NYD-- 340 (387) T ss_pred -----CEEEEechHHHHHHHHH-hcCCC-cc---ccc-CCccccccceEEec-----CCCceeeeehhhhhe---ehh-- Confidence 13555555554443332 33222 11 111 11112356654432 111111111111111 001 Q ss_pred EEecCCchhhcchhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 297 RMDIPVDYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 297 ~~~iP~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) .+-+.+.. +...+.. ..-+..|++|.. 
+.|++++++.+-- T Consensus 341 ----~~~~~~~~-~~~~~~~-~~~~~~r~d~~v-~~~eA~~~l~~k~ 380 (387) T protein:vir:93 341 ----GTTYDTDK-DVKKGEY-LFVLTAWYDQQR-TLDSAFRIAKAKE 380 (387) T ss_pred ----hheeeecc-cccCCce-eEEEEeeeCcee-echhheEEEEeec Confidence 11222221 1122222 223567999885 4699999999855 No 82 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=94.54 E-value=0.0041 Score=33.65 Aligned_cols=289 Identities=10% Similarity=-0.003 Sum_probs=129.6 Q ss_pred CCce-eeecCCchHHHHHHHHh---hhccc----hh----hhhhhhhhhhhHHHHHHhhhhhhhcccccccchh---hcc Q lcl|NC_019525. 1 MKKF-VIRNSKGEKILLNAQEA---KIAGV----IQ----RLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPAD---YMP 65 (343) Q Consensus 1 ~~~~-~~~~~~~~~~~~~a~~~---~~~~~----~~----~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~---~~p 65 (343) .... -....+.+....+-.+. .+... .. ....+.|..+- +.+...|++...+...-+. .+| T Consensus 72 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~iP----~~~~~~ii~~~~~~~~l~~~~~~~~ 147 (397) T protein:vir:48 72 SEEEKKPLTKSEEEVKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIP----QDIQTAIHTLVRQYDSLQEYVNVEN 147 (397) T ss_pred hhhccccccchhhHHHHHHHHHHHHHHhhhhhHHHHHhhccCCcccccccc----HHHHHHHHHHHHHHHHHHhhhceee Confidence 0000 00000101110000000 00000 00 01122232222 1223445554333333333 344 Q ss_pred eecCCCCceeEEEee-eccchhhhhhcccccCCcccccccccc-ccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcH Q lcl|NC_019525. 66 IRVGEGAWSTMLTTY-RSFSLAEDFATGIIDTGNSNGKLAAVD-TGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLI 143 (343) Q Consensus 66 v~~~~~~w~~~~~~~-~~vg~a~~ia~g~~~~g~~a~Dip~vd-~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~ 143 (343) +.+.. ....+..+ +.-+.+++++.+ ..+|..+ ..+++.+...+.++.-+.+|.+=|+.+. .+|. 
T Consensus 148 ~~~~~--~~~~~~~~~~~~~~a~~v~E~--------~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~----~~l~ 213 (397) T protein:vir:48 148 VTTLT--GSRVYEKWADITGLAKLDDEA--------GSIGTNDDPKLYPIRYAIKRYAGISTVTNSLLADSA----ENIL 213 (397) T ss_pred ccCCc--ceEEEEeecCCCcceeeeccc--------cccccccccceeeEEeeheeeeeehhhHHHHHhhch----HHHH Confidence 43222 22122222 222445565533 3466543 5788889999999988888877665432 3677 Q ss_pred HHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCe Q lcl|NC_019525. 144 TALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNK 223 (343) Q Consensus 144 ~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~t 223 (343) +--.+...+++...+|+..+.|........ ...|.+.|.+.+..+-.. + ..+.. T Consensus 214 ~~v~~~l~~~~~~~~d~~il~G~g~~~~~~-------------------~~~~~d~i~~~~~~l~~~------~-~~~a~ 267 (397) T protein:vir:48 214 AWLSGWIAKKVVVTRNKAILEAIATLPTKP-------------------TLTKWDDIIDLQAKVDPA------I-KQTSF 267 (397) T ss_pred HHHHHHHHHHHHHHHHHHHhhccccccccc-------------------ccccHHHHHHHHHHhhhh------h-cCCCE Confidence 777777778888888888898853321100 112344554443333222 1 23568 Q ss_pred EEeCHHHHHHHhccccCCCcchhhhh-HHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEE---- Q lcl|NC_019525. 224 FTIPESDYTGLAGAASADFPIKSTKQ-VLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRM---- 298 (343) Q Consensus 224 l~lp~~~~~~L~~~~~s~~~~~tl~~-~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~---- 298 (343) ++|.|..|..|..-+ ++.+.-+.. -+.......-.|.|+.+....++. .+..++..+++-+-+ +.+.+ T Consensus 268 ~v~n~~~~~~L~~lk--d~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~----~~~~~~~~~~~gd~~-~~~~~~~~~ 340 (397) T protein:vir:48 268 FLTNTSGFTALKKVK--NAFGDYLMERDVKSPTGYSIDGFAVKEVADRWLA----NASSGAMPLYFGDLK-QAVTLFDRQ 340 (397) T ss_pred EEECHHHHHHHHHhh--cCCCceeeccCcCCCCCceeccceeEEecccccC----CcCCCceEEEEEecc-ceEEEEeec Confidence 899999999996543 433333221 112222222357777665433322 112233444433322 22211 Q ss_pred ecCCchhhcch--hhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 
299 DIPVDYTSTLA--NSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 299 ~iP~~~~~l~p--~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) .+.+......- ...+ ...+-++.|+++ .++.|.+++-+.+-= T Consensus 341 ~~~i~~~~~~~~~~~~~--~~~~r~~~r~d~-~~~~~~a~~~~~~~~ 384 (397) T protein:vir:48 341 QMSLLSTNIGGGAFETD--TTKIRVIDRFDV-VATDTESFVPASFKA 384 (397) T ss_pred ceEEEEeccchhhhhcC--ceeEEEEeeecc-EEecccceEEEEecc Confidence 11111111111 1222 223446778877 457899998887644 No 83 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=94.35 E-value=0.0046 Score=33.37 Aligned_cols=301 Identities=10% Similarity=0.014 Sum_probs=136.3 Q ss_pred CCceeeecCCchHHHHHHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhc---ccccccchhhcceecCCCCceeEE Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQ---KFFEISPADYMPIRVGEGAWSTML 77 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~---~~~~l~~~~~~pv~~~~~~w~~~~ 77 (343) +..++..+.+.+.+-.+-++. .+.... ...+.+.-+++.+ .+..+|++. ..|=+...+..|+ .+ .. .+ T Consensus 54 ~~~~~~~~~~~~~lt~ee~~~-~~~~~~-~~~~~~gg~lvP~--~~~~~I~~~l~~~s~i~~~~~v~~~---~~-~~-~i 124 (377) T protein:vir:96 54 MERMFDLRDKNRELTAEEIKF-FNDIDK-NVGGKDKFKLLPE--ETMVQVFDDLVAEHPLLKVINFKNT---SL-RL-KA 124 (377) T ss_pred HHHHHHhccCCcccCHHHHHH-HHHHHh-cCCCCCCceecCH--HHHHHHHHHHHhhhhhhhhceeEec---CC-ce-EE Confidence 111111111111111100000 000111 1122222233332 122333332 1222333333443 11 12 22 Q ss_pred EeeeccchhhhhhcccccCCccccccc-cccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHh Q lcl|NC_019525. 78 TTYRSFSLAEDFATGIIDTGNSNGKLA-AVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDL 156 (343) Q Consensus 78 ~~~~~vg~a~~ia~g~~~~g~~a~Dip-~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~ 156 (343) -..+..+.|.++.-+ +.++ ..+..+++...+.+..+.-..+|.+=|+-+. .+|++--.....+++.. 
T Consensus 125 ~~~~~~~~a~wv~e~--------~~~~~~~~~~f~~i~l~~~kl~~~~~is~~ll~ds~----~~le~~i~~~l~~~~~~ 192 (377) T protein:vir:96 125 LTAETSGTAVWGDIF--------GEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGP----KWLKQFITEQLKEAIAV 192 (377) T ss_pred EEecCCcceeEeecc--------cccccccCccceeEeeeeeeEEeechhhHHHhhcch----hhHHHHHHHHHHHHHHH Confidence 233344555554322 2343 4567888999999998887777766665332 47888888888888999 Q ss_pred hhhheEeeeeccccceeeeeecCCceeeccc---CCc----------ccccCCHHHHHHHHHHHHHHHHhcCCce----e Q lcl|NC_019525. 157 GIQEIAFVGMKDNANVKGLLTQTGNVVNNTF---LTK----------SIKSMTPAELKVLCAGIIDVYRQGCDYT----A 219 (343) Q Consensus 157 ~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~---~~~----------~w~~kT~~eIl~Din~~l~~v~~~s~~~----~ 219 (343) .+++-.++|+-.. .-.|++|++........ +.. .....+++.+.+.+..++..+..+..+. . T Consensus 193 ~~~~a~i~G~G~~-~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~ 271 (377) T protein:vir:96 193 ALELAIVKGNGLL-QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIA 271 (377) T ss_pred HHhhceEeccCCC-cceeeeeccccccccccccccccceeeccccccccccCChhHHHHHHHHHHHhhcccccccccccc Confidence 9999999995443 35699998865432111 111 1123567777776666666654432211 1 Q ss_pred cCCeEEeCHHHHHHHhccccC-CCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEE Q lcl|NC_019525. 220 MPNKFTIPESDYTGLAGAASA-DFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRM 298 (343) Q Consensus 220 ~p~tl~lp~~~~~~L~~~~~s-~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~ 298 (343) ..-.++|-+..|..+...+.. +..+. +-..=|.|+.+...-.+. .+..-.|--.++++.++. 
-+++ T Consensus 272 ~~a~~~mn~~t~~~~~~~~~~~~~~G~----------~~~~l~~p~~v~~s~~~p-~~~i~fgdf~~Y~i~~r~--~~~i 338 (377) T protein:vir:96 272 GQVKLLLNPEDRWTLEAKFTSRNQFGE----------YVTVLPHGITILESLAVE-TGKAIAFVANRYDAFMAT--ASTI 338 (377) T ss_pred CceEEEEchhhHHhccccccccCCCCC----------ceeccCCCceEEecCCCC-cccEEEEEcCcEEEEEec--ccEE Confidence 112467777766544211111 10000 000002222222111111 011111222335554442 3333 Q ss_pred ecCCchhhcchhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 299 DIPVDYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 299 ~iP~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) +.- +. .....+-..| -..+|++|- ++-|++++.+||-+ T Consensus 339 ~~~-~~---~~~~~d~~~f--~~~~r~dG~-~~d~~a~~vl~l~~ 376 (377) T protein:vir:96 339 EEY-DQ---TFAMEDLQLY--LTKNYFYGK-AKDNHTAALLTLAG 376 (377) T ss_pred Eee-hh---hhhhcCCeEE--EEEEEEcCE-EecCCcEEEEEEec Confidence 211 11 1112322234 347788885 57899999999999 No 84 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=94.31 E-value=0.0048 Score=33.31 Aligned_cols=289 Identities=12% Similarity=0.029 Sum_probs=126.0 Q ss_pred CCceeee--cCCchHHHHHHHHhhhc-----------------cchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccch Q lcl|NC_019525. 1 MKKFVIR--NSKGEKILLNAQEAKIA-----------------GVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPA 61 (343) Q Consensus 1 ~~~~~~~--~~~~~~~~~~a~~~~~~-----------------~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~ 61 (343) ..+-... +...++...++...-+. ..+.......| .+++. 
+.+...|++...+.-.-+ T Consensus 78 ~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~g-g~lvP--~~~~~~ii~~~~~~~~l~ 154 (397) T protein:vir:12 78 EGQRSQGQGNEERQQQYSKAFLKGLRGKRLTDEERDLLDSPEFRAMSGINDEDG-GILIP--EDIGRQIHEFKRQFEPLE 154 (397) T ss_pred cccccccchhhHHHHHHHHHHHHHHhccCCcHHHHHHHhhhhhhhccccccccC-cccCc--hhHHHHHHHhhhhhhhHH Confidence 0000000 00001111111100000 00001111112 12222 223344444433333333 Q ss_pred h---hcceecCCCCceeEEEeeeccchhhhhhcccccCCcccccccccc-ccccceeeeeEEEEeeEeecHHHHHHHHHh Q lcl|NC_019525. 62 D---YMPIRVGEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVD-TGVDAVSIKTYKWAKTIGWTLPELAEAMKS 137 (343) Q Consensus 62 ~---~~pv~~~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd-~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~ 137 (343) . .+|+.+..+ ...+......+.|.+++.| ..+|..+ ..+++.....+.++.-+.+|.+=|+.+. T Consensus 155 ~~~~~~~~~~~~~--~~~~~~~~~~~~a~~v~Eg--------~~~~~~~~~~~~~v~~~~~k~~~~~~is~e~l~ds~-- 222 (397) T protein:vir:12 155 QYVTVEPVTTRSG--TRLLEKNADMVPFSPVEEL--------GNLPEIDQPRFTKVSYSIIDYGGIMTLSNSMLNDSD-- 222 (397) T ss_pred hhcceeeccCCce--eEEEEEecCCcceeeeccc--------ccccccccccceeEEeeheeeEeeehhhHHHHhhch-- Confidence 3 344433222 2222222233445555433 2455443 4688888888999988888877655332 Q ss_pred cCCCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCc Q lcl|NC_019525. 138 GNWDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDY 217 (343) Q Consensus 138 Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~ 217 (343) .+|.+--.....+++...+|..++.|+...+ -.|. .|.++|.+-++..+.. 
.+ T Consensus 223 --~~l~~~i~~~l~~~~~~~~d~~il~G~g~~~-~~g~-------------------~~~~~i~~~~~~~l~~-----~~ 275 (397) T protein:vir:12 223 --QAIMTYVAKWFAKKSVVTRNNLILAAIASLK-KVDI-------------------DGLDGIKKALNVTLDP-----MV 275 (397) T ss_pred --HHHHHHHHHHHHHHHHHHHHHHHHhcccccc-cccc-------------------ccHHHHHHHHhhccch-----hh Confidence 4577777777778888888888999853211 1111 2334444434322221 11 Q ss_pred eecCCeEEeCHHHHHHHhccccCCCcchhhhh-HHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceE Q lcl|NC_019525. 218 TAMPNKFTIPESDYTGLAGAASADFPIKSTKQ-VLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSL 296 (343) Q Consensus 218 ~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~-~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v 296 (343) .....++|-|..|..|..-+ ++.+.-++. -+.+.....-.|.|+....- .-....+++..+++-+- .+.+ T Consensus 276 -~~~a~~~~n~~~~~~L~~lk--d~~G~~l~~~~~~~g~~~~l~G~pv~~~~~-----~~~~~~~~~~~~~~gd~-~~~~ 346 (397) T protein:vir:12 276 -APGSIVLTNQDGYDWLDTLK--DGTGRYLLQPDPTNPTKKLLDGRPVVPFTN-----RVLKTQKGKAPLIIGNL-KEAI 346 (397) T ss_pred -hCCCEEEEcHHHHHHHHHhh--ccCCceeecccccCCCCccccceeeEEecc-----cccccCCCccEEEEEeh-hceE Confidence 12346889999999996432 333332211 01111111224666543221 00111122233332222 2222 Q ss_pred EEec--CCchhhc--c--hhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 297 RMDI--PVDYTST--L--ANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 297 ~~~i--P~~~~~l--~--p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) .+.. .+.+... . 
-.+.+ ...+-++.|+++ .++.|.+++.+++.+ T Consensus 347 ~~~~~~~~~i~~~~~~~~~f~~~--~~~~r~~~r~d~-~~~~~~a~~~~~~t~ 396 (397) T protein:vir:12 347 VLFDREQQSIASTDTGAGAFETN--STKVRGIEREDV-RKWDEDAVVFGQITV 396 (397) T ss_pred EEEeecceEEEEeccccchhhcC--ceEEEEEEeecc-EEecccceEEEEEee Confidence 2211 1111111 1 11222 223446778877 568999999999999 No 85 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=94.28 E-value=0.0049 Score=33.27 Aligned_cols=293 Identities=11% Similarity=0.019 Sum_probs=120.1 Q ss_pred CCceeeecCCchHHHHHHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcc---cccccchhhcceecCCCCceeEE Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQK---FFEISPADYMPIRVGEGAWSTML 77 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~---~~~l~~~~~~pv~~~~~~w~~~~ 77 (343) ...++. +.+|++.+.+-+..-.+........+.| +++. +.+..+|++.- -|=+...+..|+ .+ .. .+ T Consensus 59 ~~~~~~-~~~g~~~lt~~e~~~~~~~~~~~~~~gg--~lvP--~~~~~~I~~~l~~~s~l~~~~~v~~~---~~-~~-~i 128 (383) T protein:vir:78 59 ADAYIS-ASRTDKNITNEEIKFFNDINKEVGYKEE--TLLP--QTVVDEIFEDLTTEHPFLASIGMRTT---GL-RT-KF 128 (383) T ss_pred HHHHHH-hcCChhhhhHHHHHHHHHHhccCCCCCc--cccC--HHHHHHHHHHHHhhccceeeeeeEec---CC-ce-EE Confidence 111111 2222222222111100001111112223 2222 12223333321 111222222332 11 12 23 Q ss_pred EeeeccchhhhhhcccccCCccccccc-cccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHh Q lcl|NC_019525. 78 TTYRSFSLAEDFATGIIDTGNSNGKLA-AVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDL 156 (343) Q Consensus 78 ~~~~~vg~a~~ia~g~~~~g~~a~Dip-~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~ 156 (343) -..+..+.|.++..+ +.++ ..+..+++.....+..+.-...|.+=|+-+. .+|+.--.....+++.. 
T Consensus 129 ~~~~~~~~a~w~~e~--------~~~~~~~~~~f~~i~l~~~kl~~~i~is~ell~Ds~----~~ie~~i~~~l~~~~a~ 196 (383) T protein:vir:78 129 LKSETSGVAVWGKIF--------GEIKGQLDATFSDEESIQNKLTAFVVVPKDLEKFGP----AWVKRFVVTQIEEAFAV 196 (383) T ss_pred EEEcCCcceEEeecc--------cccccccCcceeeEeecceeeEeeccchHHHhhccH----HHHHHHHHHHHHHHHHH Confidence 333344455444321 2343 3456788888888888876666655554332 46888888888888889 Q ss_pred hhhheEeeeeccccceeeeeecCCceeecccCCcc-c---ccCCHHHHHHHHHHHHHHHHhcCCce-------ecCC-eE Q lcl|NC_019525. 157 GIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKS-I---KSMTPAELKVLCAGIIDVYRQGCDYT-------AMPN-KF 224 (343) Q Consensus 157 ~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~-w---~~kT~~eIl~Din~~l~~v~~~s~~~-------~~p~-tl 224 (343) .+|+-.+.|+-. ..-.|++++.+.....+.+... | ...|.+.+...++ .+..+..+-.+. ...+ +. T Consensus 197 ~~~~a~i~G~G~-~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~l~~~~~~~~~~~~~~~~~~~~~~~~ 274 (383) T protein:vir:78 197 ALESAYIVGDGN-DKPIGLNRKVGKGSTVVDGVYAEKAATGTLTFANPKTTVN-ELTDVYKYHSVKENGHPLNVAGKVTL 274 (383) T ss_pred HHhhheEeccCC-CCceeeeeccCCcccccccccccccccchhhhhhhHHHHH-HHHHHHhccchhcccchhhhcCceEE Confidence 999999999533 3356999876532221111111 1 1223333332222 222222221111 0111 23 Q ss_pred EeCHHHHHHHhccccCCCcchhhhhHHHhhcchhcCCcceEEee-------chhhhhcccccCCCccEEEEEEcCcceEE Q lcl|NC_019525. 225 TIPESDYTGLAGAASADFPIKSTKQVLEDTFKEITRNSSFEILP-------CVYADKITAQVPAVAKRYALYNDNEDSLR 297 (343) Q Consensus 225 ~lp~~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~-------~~~~~~~~~~g~gg~drmv~Y~~d~~~v~ 297 (343) ++-+..|..+...... ...+|.+.++.| ...+.. +..--|--.++++.++. -++ T Consensus 275 ~~n~~~~~~~~~~~~~----------------~~~~G~~~t~l~~~~~iv~s~~~p~-~~iifgdfs~Y~i~~r~--~~~ 335 (383) T protein:vir:78 275 LVNPTDAWDVKKQYTS----------------LNANGVYVTALPFNLNIIESLFVPE-KKAISYVAERYDALIGG--PLD 335 (383) T ss_pred EEcCcchhhhccchhc----------------cCCCCceeeecCCCceEEecCCCCc-ccEEEeeccceEEEecc--cce Confidence 3344333222100000 001233222222 111111 11111222334554442 233 Q ss_pred EecCCchhhcchhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 
298 MDIPVDYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 298 ~~iP~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) +.. .+. ....++-..| -...|++| .++.|++++++||-+ T Consensus 336 i~~-~~~---~~f~~d~~~f--~~~~r~dG-~~~~~~A~~vl~~~~ 374 (383) T protein:vir:78 336 IGT-YDQ---TLAIEDLNLY--AAKQFAYG-KAKDDKAAAVWTLNI 374 (383) T ss_pred EEe-cch---hhhhcCceEE--EEEEEEcC-EEecCCeEEEEEEEe Confidence 321 111 1112222233 34667877 788999999999988 No 86 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=93.86 E-value=0.0062 Score=32.71 Aligned_cols=291 Identities=10% Similarity=-0.009 Sum_probs=131.4 Q ss_pred CCceeeecCC----chHHHHHH-HHh---hh-ccchh-----hhhhhhhhhhhHHHHHHhhhhhhhcccccccc---hhh Q lcl|NC_019525. 1 MKKFVIRNSK----GEKILLNA-QEA---KI-AGVIQ-----RLCNDLGFEIDVTTLTTLMKKIIEQKFFEISP---ADY 63 (343) Q Consensus 1 ~~~~~~~~~~----~~~~~~~a-~~~---~~-~~~~~-----~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~---~~~ 63 (343) ..+....+.+ ++...... +++ -+ ..... .........+++. +.+...|++...+.-.- .+. T Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP--~~~~~~ii~~~~~~~~l~~~~~~ 145 (397) T protein:vir:49 68 VANMSEEEKKPLTKSEEEVKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIP--QDIQTAIHTLVSQYDSLQEYVNV 145 (397) T ss_pred hhccccccccccccchhHHHHHHHHHHHHHHhcchhHHHHHhhccccccCccccc--HhHHHHHHHHHHhhhhHHhhhce Confidence 1111111111 11111111 000 00 00000 0011111123332 22334455543333333 344 Q ss_pred cceecCCCCceeEEEeee-ccchhhhhhcccccCCcccccccc-ccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCC Q lcl|NC_019525. 64 MPIRVGEGAWSTMLTTYR-SFSLAEDFATGIIDTGNSNGKLAA-VDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWD 141 (343) Q Consensus 64 ~pv~~~~~~w~~~~~~~~-~vg~a~~ia~g~~~~g~~a~Dip~-vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~ 141 (343) +|+.+.. +...+.... .-+.|.+++.| ..+|. -...+++....++.++.-+.+|.+=|+.+. 
.+ T Consensus 146 ~~~~~~~--~~~~~~~~~~~~~~a~~v~E~--------~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~----~~ 211 (397) T protein:vir:49 146 ENVTTLT--GSRVYEKWTDITGLANIDDEA--------GKIADVDDPKLSLIKYTIKRYAGISTVTNSLLADSA----EN 211 (397) T ss_pred eecccCc--cceEEEeeccCCcceeeecCc--------cccccccccceeeEEeeeeeEEeeehhHHHHHhhhH----HH Confidence 4543332 222232222 22456666543 24664 356789999999999988888766554332 35 Q ss_pred cHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecC Q lcl|NC_019525. 142 LITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMP 221 (343) Q Consensus 142 L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p 221 (343) |.+--.....+++...+|+-.+.|+..... .++ ..+.+.|. +++..+...- ... T Consensus 212 l~~~i~~~l~~~~~~~~d~ai~~G~g~~~~--------------~~~-----~~~~d~i~----~~~~~l~~~~---~~~ 265 (397) T protein:vir:49 212 ILAWLSGWIAKKVVVTRNKAILEAIAALPT--------------KPT-----LTKWDDII----DLEAKVDPAI---KQT 265 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccccc--------------ccc-----cccHHHHH----HHHHhhhhhh---cCC Confidence 777777777788888888888888432110 001 12334444 4444433221 234 Q ss_pred CeEEeCHHHHHHHhccccCCCcchhhhh-HHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEe- Q lcl|NC_019525. 222 NKFTIPESDYTGLAGAASADFPIKSTKQ-VLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMD- 299 (343) Q Consensus 222 ~tl~lp~~~~~~L~~~~~s~~~~~tl~~-~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~- 299 (343) ..++|.++.|..|..- -++.+.-++. =+.......-.|.|+.+....++. .+..++. 
.++|-+=.+.+.+- T Consensus 266 a~~vmn~~~~~~l~~l--kd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~----~~~~~~~-~i~~gd~~~~~~~~~ 338 (397) T protein:vir:49 266 SFFLTNTSGFTALKKV--KNALGDYLMERDVKSPTGYSIDGFAVKEVADRWLA----NGTGGAM-PLYFGDLKQAVTLFD 338 (397) T ss_pred CEEEEcHHHHHHHHHh--hcCCCceeeccCcCCCCCceecceeeEEecccccc----cccCCce-eEEEeeccceEEEEe Confidence 5789999999999653 3444433321 011222222347776654433322 1112222 23332222222211 Q ss_pred -cCCchhhc--c--hhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 300 -IPVDYTST--L--ANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 300 -iP~~~~~l--~--p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) -.+.+... . ....+ . ..+-++.|+++ .++.|.+++.+.+-- T Consensus 339 ~~~~~i~~~~~~~~~~~~~-~-~~~r~~~r~d~-~~~~~~a~~~~~~~~ 384 (397) T protein:vir:49 339 RQHMSLLSTNIGGGAFETD-T-TKVRVIDRFDV-VATDTEAFVPASFKA 384 (397) T ss_pred ecceEEEEeccccchhhcC-c-eeEEEEeeeCc-EEecccceEEEEeec Confidence 12222211 1 12232 2 22345777777 678889999988755 No 87 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=93.60 E-value=0.007 Score=32.39 Aligned_cols=295 Identities=11% Similarity=0.033 Sum_probs=123.0 Q ss_pred CCceeeecCCchHHH-HHHHHhhhccchhh-hhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEE Q lcl|NC_019525. 1 MKKFVIRNSKGEKIL-LNAQEAKIAGVIQR-LCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLT 78 (343) Q Consensus 1 ~~~~~~~~~~~~~~~-~~a~~~~~~~~~~~-~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~ 78 (343) ...++.+- +|.+.+ .+.++ ....+.. ...+.|. ++. +.+...|++.-...-.-+.+..+.+..+ .. 
.+- T Consensus 62 ~~~~~~~~-r~~~~l~~ee~~--~~~~~~~~t~~~gG~--liP--~~~~~~Ii~~l~~~s~i~~~~~v~~~~~-~~-~i~ 132 (395) T protein:vir:95 62 VDNGILAK-RSQDPLTSEERK--FFNDINYDVGYTDEK--ILP--ETVVERVFDDLQKDHPLLSKINFQNAGI-KT-RVI 132 (395) T ss_pred HHHHHHhh-cCccccchHHHH--HHHHHhhccCCCCce--ecc--HHHHHHHHHHHHhhhhhhhhceeEecCC-ce-EEE Confidence 22222111 111111 11111 1111111 1222232 222 1223333332111111222333322221 11 222 Q ss_pred eeeccchhhhhhcccccCCcccccc-ccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhh Q lcl|NC_019525. 79 TYRSFSLAEDFATGIIDTGNSNGKL-AAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLG 157 (343) Q Consensus 79 ~~~~vg~a~~ia~g~~~~g~~a~Di-p~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~ 157 (343) ..+..+.|.++.. .+.+ +..+..+++.....+..+.-...|.+=|+-+. .+|++--.+...+++... T Consensus 133 ~~~~~~~a~w~~e--------~~~~~~~~~~~f~~i~l~~~kl~~~~~iS~ell~ds~----~~ie~~i~~~la~~ia~~ 200 (395) T protein:vir:95 133 KADPAGQAVWGKV--------FGEIKGQLDAAFREENFTQYKLTCFVVLPDDLSTFGP----AWIERFVRTQIQEAISVA 200 (395) T ss_pred EecCCcceEEeec--------ccccCccccccceeeeeceeeEEEeecccHHHHhcch----hHHHHHHHHHHHHHHHHH Confidence 3334444444321 1223 34566788888888888877777766665332 478888888999999999 Q ss_pred hhheEeeeeccccc-eeeeeecCCceeecccCCcccc-cCCHHHHHHHHHHHHHHHHhcCCc-------eecCCeEEeCH Q lcl|NC_019525. 158 IQEIAFVGMKDNAN-VKGLLTQTGNVVNNTFLTKSIK-SMTPAELKVLCAGIIDVYRQGCDY-------TAMPNKFTIPE 228 (343) Q Consensus 158 ~n~v~~~G~~~~~g-~~GLlN~p~v~~~~a~~~~~w~-~kT~~eIl~Din~~l~~v~~~s~~-------~~~p~tl~lp~ 228 (343) +|+-.+.|+-..++ =.|++|+......... ...++ ..|.+.+...+..+.......+.. 
...--+.+|.+ T Consensus 201 ~~~a~i~G~G~~~~qP~Gil~~~~~~~~~~~-~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~ 279 (395) T protein:vir:95 201 LESAIINGGGAAKTQPVGLMKDVNTNSGAVT-DKASSGTLTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNP 279 (395) T ss_pred HhhheeeccCCCCcCceeeeecccccccccc-cccccchhhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcc Confidence 99999999532221 3599998765322111 11111 223333433333333322222111 00111355666 Q ss_pred HHHHHHhccccCCCcchhhhhHHHhhcchhcCCcceEEee-------chhhhhcccccCCCccEEEEEEcCcceEEEecC Q lcl|NC_019525. 229 SDYTGLAGAASADFPIKSTKQVLEDTFKEITRNSSFEILP-------CVYADKITAQVPAVAKRYALYNDNEDSLRMDIP 301 (343) Q Consensus 229 ~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~-------~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP 301 (343) ..+..+..... +.-..|.+.++++ ...+.. +..--|.-.+++++++ .-+++..- T Consensus 280 ~t~~~~~g~~~----------------~~~~~G~~~~~lg~g~~v~~~~~~p~-~~i~fgdfs~y~i~~r--~~~~i~~~ 340 (395) T protein:vir:95 280 RDSWDVQARYT----------------YLTANGGFVTVLPYNVTIITSEFVPE-GKLVAFVTDRYNAVRG--GGLTVKKF 340 (395) T ss_pred hhhhhcCCcce----------------eccCCCcceeccCCcceEEEcCCCCC-CcEEEEecccEEEEEe--cceEEEec Confidence 55543321110 0001233333221 111110 0011111123444443 22222211 Q ss_pred CchhhcchhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 302 VDYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 302 ~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) +. 
....++...| -...|++| .++-|+++++++|-+ T Consensus 341 -~~---~~~~~d~~~f--~~~~r~dg-~~~~~~A~~~l~i~~ 375 (395) T protein:vir:95 341 -DQ---TLALEDAVLF--TAKTFAYG-QPDDNKASAVYDLKV 375 (395) T ss_pred -cc---hhhhCCcEEE--EEEEEECC-EEeccccEEEEEeec Confidence 10 1111222223 45778876 567799999999977 No 88 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=93.40 E-value=0.0077 Score=32.17 Aligned_cols=309 Identities=8% Similarity=0.005 Sum_probs=134.3 Q ss_pred CCceeee----cCCchHHHHHHHH-------------h-hhcc----chhhhh--hhhhhhhhHHHHHHhhhhhhhcccc Q lcl|NC_019525. 1 MKKFVIR----NSKGEKILLNAQE-------------A-KIAG----VIQRLC--NDLGFEIDVTTLTTLMKKIIEQKFF 56 (343) Q Consensus 1 ~~~~~~~----~~~~~~~~~~a~~-------------~-~~~~----~~~~~~--~d~~~~f~~~qL~~i~~~iye~~~~ 56 (343) =+.-+.. ..+|+.....++. . .... ...... ...|. +++.+ .+...|++...+ T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg-~liP~--~~~~~ii~~l~~ 152 (428) T protein:vir:10 76 GPAVIVKAEPKQYTGAGMTRMVMSIAAAQGNLQDAAKFASDELNDQSVSMAISTAAGSGG-VLIPQ--NIHSEVIELLRD 152 (428) T ss_pred ccccccccccchhhhHHHHHHHHHHHHhhhhHHHHHHHhhhhhhhhhHhhhhcccccCCc-cccch--hHHHHHHHHHhh Confidence 0000000 0111111000000 0 0000 000000 01111 22221 223345544222 Q ss_pred cccchh----hcceecCCCCceeEEEeeeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHH Q lcl|NC_019525. 57 EISPAD----YMPIRVGEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELA 132 (343) Q Consensus 57 ~l~~~~----~~pv~~~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~ 132 (343) ...-+. 
.+|..+ + ...+-....-+.|.+++.| ..+|.-+..+++.+...+.++.-+.+|.+=|+ T Consensus 153 ~~~l~~~~~~~~~~~~--g--~~~~p~~~~~~~a~~v~Eg--------~~~~~~~~~f~~i~~~~~k~~~~v~is~ell~ 220 (428) T protein:vir:10 153 RTIVRKLGARSIPLPN--G--NMSLPRLAGGATASYTGEN--------QDAKVSEARFDDVKLTAKTMIAMVPISNALIG 220 (428) T ss_pred hchhhhhcceeeecCC--c--ceEEEEEeCCcceeeeccC--------ccccccccceeeEEeeeEEEEEeehhhHHHHh Confidence 222222 222211 1 1122222222334444322 36788888899999999999999988888776 Q ss_pred HHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHH Q lcl|NC_019525. 133 EAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYR 212 (343) Q Consensus 133 ~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~ 212 (343) .+. .+|.+--...+..++...+|+..++|+.......|++|....+.... .+..-+..+.+.+...++.+..... T Consensus 221 ds~----~~l~~~i~~~l~~ai~~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 295 (428) T protein:vir:10 221 RAG----FNVEQLVLQDILTAISVREDKAFMRDDGTGDTPIGMKARATQWNRLL-PWAADAAVNLDTIDTYLDSIILMSM 295 (428) T ss_pred hhh----HHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccc-cccccccccHHHHHHHHHHHHHhhh Confidence 542 35777777888888888889999999644333579998765432111 0111233444444444444433222 Q ss_pred hcCCceecCCeEEeCHHHHHHHhccccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcC Q lcl|NC_019525. 213 QGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDN 292 (343) Q Consensus 213 ~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d 292 (343) .... .......+|-+..|..|..-+ +..+.-++ .......-.|.|+.+.. .+.. +. 
+.+++...++|- | T Consensus 296 ~~~~-~~~~~~~v~n~~~~~~L~~lk--d~~G~~i~---~~~~~g~l~G~pv~~~~--~~p~-~~-~~~~~~~~i~~g-d 364 (428) T protein:vir:10 296 DGNS-NMISSGWGMSNRTYMKLFGLR--DGNGNKVY---PEMAQGMLKGYPIQRTS--AIPA-NL-GEGGKESEIYFA-D 364 (428) T ss_pred cccc-ccccCEEEEcHHHHHHHHHhh--ccCCceec---cCCCCCeeeceeeEEec--cccc-cc-cCCCccceEEEE-e Confidence 2222 234567889999999986543 43333332 11111112366654432 1111 11 112222222221 2 Q ss_pred cceEEEec--CCchhhcc-------------hhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 293 EDSLRMDI--PVDYTSTL-------------ANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 293 ~~~v~~~i--P~~~~~l~-------------p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) ...+-+-. .+.+.... -.+.+.. .+-+++|++ +.++.|++++++.-== T Consensus 365 ~s~~~i~~~~~i~i~~~~~~~~~~~~~~~~~~f~~~~~--~~R~~~r~d-~~v~~p~a~~~~t~~~ 427 (428) T protein:vir:10 365 FNDVVIGEDGNMKVDFSKEASYIDTDGKLVSAFSRNQS--LIRVVTEHD-IGFRHPEGLVLGTGVL 427 (428) T ss_pred cceEEEEEecceEEEeecccccccccccccchhhcchh--heeeeeeeC-ceeeccceEEEEeccC Confidence 22222211 11111110 1122211 233566654 5677888887764433 No 89 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=92.89 E-value=0.0096 Score=31.65 Aligned_cols=290 Identities=9% Similarity=-0.012 Sum_probs=131.1 Q ss_pred CCceeeecCCc--hHHHHHHHHhhhccc-----------hhhhh-hhhhhhhhHHHHHHhhhhhhhcccccccch---hh Q lcl|NC_019525. 1 MKKFVIRNSKG--EKILLNAQEAKIAGV-----------IQRLC-NDLGFEIDVTTLTTLMKKIIEQKFFEISPA---DY 63 (343) Q Consensus 1 ~~~~~~~~~~~--~~~~~~a~~~~~~~~-----------~~~~~-~d~~~~f~~~qL~~i~~~iye~~~~~l~~~---~~ 63 (343) .++........ +....++...-+... ..... .+.| +++. +.+...|++...+....+ .. 
T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~t~~~gg--~~iP--~~~~~~ii~~~~~~~~l~~~~~~ 152 (404) T protein:vir:39 77 EEKGPLNKSEYELKDKFVKEFVNMVRNPMAFLNTVSSKTETSGSDSAAG--LTIP--QDIRTMINTLVRQYDSLQQYVRV 152 (404) T ss_pred ccccccccchhhhHHHHHHHHHHHHhcchhhhhhhhhhhhhcccccCCc--eecc--HHHHHHHHHHHHhhhhHHhhcce Confidence 11111111110 111111111111000 00001 1122 2222 223334544433333333 44 Q ss_pred cceecCCCCceeEEEeee-ccchhhhhhcccccCCcccccccc-ccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCC Q lcl|NC_019525. 64 MPIRVGEGAWSTMLTTYR-SFSLAEDFATGIIDTGNSNGKLAA-VDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWD 141 (343) Q Consensus 64 ~pv~~~~~~w~~~~~~~~-~vg~a~~ia~g~~~~g~~a~Dip~-vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~ 141 (343) +|+.+..+.+. +.... .-+.|.+++.+ ..+|. -+..+++....++.++.-+.+|.+=|+.+ . .+ T Consensus 153 ~~~~~~~~~~~--~~~~~~~~~~a~~v~Eg--------~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds---~-~~ 218 (404) T protein:vir:39 153 ESVSTSNGSRV--YEKWTDVTPLTVMDAED--------GKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDT---A-EN 218 (404) T ss_pred eeccCCcceEE--EEeecCCccceeeecCc--------cccccccccceeeEEeeeeeEEeeehhHHHHHhhc---h-HH Confidence 55544333222 21111 22345555433 24664 34678999999999998888887655433 2 46 Q ss_pred cHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecC Q lcl|NC_019525. 142 LITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMP 221 (343) Q Consensus 142 L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p 221 (343) |.+--.....+++...+|+..+.|+..... .-...+.+.+.+.++..+..... .. 
T Consensus 219 l~~~i~~~l~~~~~~~~d~~il~g~g~~~~-------------------~~~~~~~~~i~~~~~~~~~~~~~------~~ 273 (404) T protein:vir:39 219 ILAWLSSWIAKKVVVTRNQAIIAAMGTVPK-------------------KPTIAKFDDVITMINTSVDPAII------AT 273 (404) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccccc-------------------ccccccHHHHHHHHHHhhhhhhc------cC Confidence 777777788888888888888888432110 01123455555555543332211 22 Q ss_pred CeEEeCHHHHHHHhccccCCCcchhhhh-HHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEe- Q lcl|NC_019525. 222 NKFTIPESDYTGLAGAASADFPIKSTKQ-VLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMD- 299 (343) Q Consensus 222 ~tl~lp~~~~~~L~~~~~s~~~~~tl~~-~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~- 299 (343) ..++|-|+.|..|..- -++.+..+.. -+.......-.|.|+.+..-.++ .+. +.++..+++.+-+ +.+.+. T Consensus 274 a~~v~n~~~~~~L~~l--kd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~---~~~-~~~~~~~~~gd~~-~~~~~~~ 346 (404) T protein:vir:39 274 SSLLTNQSGLNKLALV--KTAEGKYLLEPDPTKPNSYLIKGKKVIVVADRWL---PNS-GSTVYPLYYGDMS-QAITLFD 346 (404) T ss_pred CEEEEcHHHHHHHHHh--hccCCceeeccCcCCCCcceecceeEEEeccccc---Ccc-CCCccEEEEEecc-ccEEEEe Confidence 3689999999999753 3443433321 11111112224666554322211 111 1122333333322 222221 Q ss_pred ---cCCchhhc--chhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 300 ---IPVDYTST--LANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 300 ---iP~~~~~l--~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) +.+..... ...+++ ...+-++.|+++ .++.|.+++.+.+.. 
T Consensus 347 ~~~~~i~~~~~~~~~~~~~--~~~~r~~~r~d~-~~~~~~a~~~~~~~~ 392 (404) T protein:vir:39 347 RENMSLLPTNIGAGAFETD--TTKIRVIDRFDV-KTTDSEALVAGSFTA 392 (404) T ss_pred ecceEEEEeccchhhhhhc--eeeEEEEeeecc-EEecccceEEEEeec Confidence 11111111 112232 223446888885 677899999999766 No 90 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=92.62 E-value=0.011 Score=31.40 Aligned_cols=291 Identities=11% Similarity=0.018 Sum_probs=130.7 Q ss_pred CCcee-eecCCchH----HHHHHHHhhhcc--------c---hhhhhhhhhhhhhHHHHHHhhhhhhhcccccccc---h Q lcl|NC_019525. 1 MKKFV-IRNSKGEK----ILLNAQEAKIAG--------V---IQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISP---A 61 (343) Q Consensus 1 ~~~~~-~~~~~~~~----~~~~a~~~~~~~--------~---~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~---~ 61 (343) ++.-. .+...+++ ....+....+.. . +.......| .+++. +.+...|++...+.... . T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~~~g-g~~vP--~~~~~~Ii~~~~~~~~l~~~~ 150 (408) T protein:vir:10 74 MREEEKGPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAA-GLTIP--QDIRTMINTLVRQYDSLQQYV 150 (408) T ss_pred cccccccccccchhhhHHHHHHHHHHHhhcchhhhhhhhhhhhhcccccCC-ceecc--HhHHHHHHHHHHhhchhhhhc Confidence 11000 00000000 011111100000 0 000111112 23333 22334455543333333 3 Q ss_pred hhcceecCCCCceeEEEeeec-cchhhhhhcccccCCcccccccccc-ccccceeeeeEEEEeeEeecHHHHHHHHHhcC Q lcl|NC_019525. 62 DYMPIRVGEGAWSTMLTTYRS-FSLAEDFATGIIDTGNSNGKLAAVD-TGVDAVSIKTYKWAKTIGWTLPELAEAMKSGN 139 (343) Q Consensus 62 ~~~pv~~~~~~w~~~~~~~~~-vg~a~~ia~g~~~~g~~a~Dip~vd-~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr 139 (343) +.+|+.+. .+...+..... .+.|.+++.+ ..+|..+ ..+++.+...+.++.-+.+|.+=|+-+ . 
T Consensus 151 ~~~~~~~~--~~~~~~~~~~~~~~~a~~v~E~--------~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds---~- 216 (408) T protein:vir:10 151 RVESVSTS--NGSRVYEKWTDVTPLTVMDAED--------GKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDT---A- 216 (408) T ss_pred ceeeccCC--cceEEEeeccccccceeeecCc--------cccccccCcceeeEEeeeeeEEeeehhHHHHHhhc---h- Confidence 44555333 23322222222 2334444432 2466544 468999999999998888887766543 2 Q ss_pred CCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCcee Q lcl|NC_019525. 140 WDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTA 219 (343) Q Consensus 140 ~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~ 219 (343) .+|.+--.....+++...+|+-.+.|...... .-...|.++|++.++..+.. .+. T Consensus 217 ~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~~-------------------~~~~~~~~~l~~~~~~~~~~-----~~~- 271 (408) T protein:vir:10 217 ENILAWLSSWIAKKVVVTRNQAIIEVMKAAPK-------------------KPTIAKFDDVITMINTAVDP-----AII- 271 (408) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccccccc-------------------ccccccHHHHHHHHHHhhhh-----hhc- Confidence 46777777778888888888888888432110 01123456665555543332 221 Q ss_pred cCCeEEeCHHHHHHHhccccCCCcchhhhh-HHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEE Q lcl|NC_019525. 220 MPNKFTIPESDYTGLAGAASADFPIKSTKQ-VLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRM 298 (343) Q Consensus 220 ~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~-~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~ 298 (343) ....++|-++.|..|..- -++.+..++. -+.+..+..-.|.|+.+....++ ++.+. ++.. 
++|-+=.+.+.+ T Consensus 272 ~~a~~v~n~~~~~~l~~l--kd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~---~~~~~-~~~~-i~~gd~~~~~~~ 344 (408) T protein:vir:10 272 ATSSLLTNQSGLNKLALV--KTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWL---PNTGS-TVYP-LYYGDMSQAITL 344 (408) T ss_pred cCCEEEEcHHHHHHHHHh--hccCCceEeccCcCCCCCceecceeeEEeccccc---CccCC-CceE-EEEEehhccEEE Confidence 224689999999999643 3444444432 11222222335777665443222 22211 1122 222221222222 Q ss_pred e--cCCchhhc----chhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 299 D--IPVDYTST----LANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 299 ~--iP~~~~~l----~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) - -.+.+... ...+.+. ..+-++.|+++. ++.|.+++++.+.- T Consensus 345 ~~~~~~~v~~~~~~~~~f~~~~--~~~r~~~r~d~~-v~~~~a~~~~~~~~ 392 (408) T protein:vir:10 345 FDRENMSLLPTNIGAGAFETDT--TKIRVIDRFDVK-ATDSEALVAGSFSA 392 (408) T ss_pred EEecceEEEEcccccchhhcCc--eEEEEEEeeccE-EeccccEEEEEeec Confidence 1 12222211 1122322 233457788774 56699999999644 No 91 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=91.80 E-value=0.014 Score=30.72 Aligned_cols=285 Identities=11% Similarity=0.030 Sum_probs=118.3 Q ss_pred CCceeeecCCchHHHHHHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEee Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTTY 80 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~~ 80 (343) ++.+...+.. .+.+.++... ...+.......| -+++.+ .+...|++...+.-.-+.+..+.+..+ ..+... 
T Consensus 95 ~r~~~~~~~~-~~~~~~~~~~--~~a~~~~~~~~g-G~lIP~--~~~~~Ii~~~~~~~~l~~~~~~~~~~~---~~~p~~ 165 (387) T protein:vir:96 95 YRHAILPNEF-EKPSMEAQRL--LHALPTGNDSGG-DKLLPK--TLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRV 165 (387) T ss_pred HHHHHhhhhH-HHHHHHHHHH--HhhhccCCCCCC-ceeech--hHHHHHHHHHHhhchhhhhceeeecCC---ceeeee Confidence 1111100000 1111111100 001111111112 233332 234455554333322333333322221 111111 Q ss_pred e-ccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhh Q lcl|NC_019525. 81 R-SFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQ 159 (343) Q Consensus 81 ~-~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n 159 (343) + ..+.|.+++-| ...|.-+..+++.....+.++.-+.+|.+=|+-+. .+|.+--.....+++...++ T Consensus 166 ~~~~~~a~~v~Eg--------~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~----~~l~~~i~~~la~~~~~~e~ 233 (387) T protein:vir:96 166 SYTLDDDDFITDV--------ETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSD----VDLVNWVENALQSGLAAKER 233 (387) T ss_pred eccCCcccccccc--------ccccccccccceeeechheeeeechhhHHHHhhhH----HHHHHHHHHHHHHHHHHHHH Confidence 1 12334444322 25677778889999999999888888866555332 35666555555555555566 Q ss_pred heEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhcccc Q lcl|NC_019525. 160 EIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAAS 239 (343) Q Consensus 160 ~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~ 239 (343) +.+|.+.+....-.|.+++++++..+. ..+.++|.+-++.+-.....+ ..++|-+..|..+.... 
T Consensus 234 ~~~~~~g~g~g~~~g~~~~~~~~~~~~-------~~~~d~i~~~~~~l~~~y~~n-------a~~imn~~t~~~~~~~~- 298 (387) T protein:vir:96 234 KDALAVSPKSGLEHMSFYNGSVKEVEG-------ADMYDAIINALADLHEDYRDN-------ATIYMRYADYVKIISVL- 298 (387) T ss_pred HhHhhcCCCccccceeeeccccccccc-------cchHHHHHHHHhccChhhhcC-------CEEEEechHHHHHHHHH- Confidence 666644333212347777776644221 223444444444333322221 23455445555443332 Q ss_pred CCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecCCchhhcchhhcCCceEEe Q lcl|NC_019525. 240 ADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPVDYTSTLANSVNNFQFQN 319 (343) Q Consensus 240 s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~~~l~p~~~~~l~~~v 319 (343) .+.+. .++ . ..+..--|.|+.+.. ... .+++-+-+.=++.+. .+-+.+.. ....+ .... T Consensus 299 ~~~~~-~~~---~-~~~~~llG~PV~~~~-----~~~--------~~~~GDf~~~~~~~~-~~~~~~~~-~~~~~-~~~~ 357 (387) T protein:vir:96 299 SNGTT-NFF---D-TPAEKVFGKPVVFTD-----AAV--------KPIVGDFNYFGINYD-GTTYDTDK-DVKKG-EYLF 357 (387) T ss_pred hcCCC-ccc---c-cCCccccccceEEec-----CCC--------ceeeechhhhhhhhh-hhhheecc-cccCC-ceEE Confidence 33322 221 1 111122366665432 111 122211111011000 11111111 11112 2233 Q ss_pred ceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 320 AAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 320 ~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) -++.|++|..+ .|++++++.|.- T Consensus 358 ~~~~r~Dg~v~-~~~A~~~l~~ka 380 (387) T protein:vir:96 358 VLTAWYDQQRT-LDSAFRIAKAKE 380 (387) T ss_pred EEEEEeCcEee-chhheEEEEeec Confidence 45778888765 699999999955 No 92 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=91.80 E-value=0.014 Score=30.72 Aligned_cols=285 Identities=11% Similarity=0.030 Sum_probs=118.3 Q ss_pred CCceeeecCCchHHHHHHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEee Q lcl|NC_019525. 
1 MKKFVIRNSKGEKILLNAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTTY 80 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~~ 80 (343) ++.+...+.. .+.+.++... ...+.......| -+++.+ .+...|++...+.-.-+.+..+.+..+ ..+... T Consensus 95 ~r~~~~~~~~-~~~~~~~~~~--~~a~~~~~~~~g-G~lIP~--~~~~~Ii~~~~~~~~l~~~~~~~~~~~---~~~p~~ 165 (387) T protein:vir:26 95 YRHAILPNEF-EKPSMEAQRL--LHALPTGNDSGG-DKLLPK--TLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRV 165 (387) T ss_pred HHHHHhhhhH-HHHHHHHHHH--HhhhccCCCCCC-ceeech--hHHHHHHHHHHhhchhhhhceeeecCC---ceeeee Confidence 1111100000 1111111100 001111111112 233332 234455554333322333333322221 111111 Q ss_pred e-ccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhh Q lcl|NC_019525. 81 R-SFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQ 159 (343) Q Consensus 81 ~-~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n 159 (343) + ..+.|.+++-| ...|.-+..+++.....+.++.-+.+|.+=|+-+. .+|.+--.....+++...++ T Consensus 166 ~~~~~~a~~v~Eg--------~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~----~~l~~~i~~~la~~~~~~e~ 233 (387) T protein:vir:26 166 SYTLDDDDFITDV--------ETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSD----VDLVNWVENALQSGLAAKER 233 (387) T ss_pred eccCCcccccccc--------ccccccccccceeeechheeeeechhhHHHHhhhH----HHHHHHHHHHHHHHHHHHHH Confidence 1 12334444322 25677778889999999999888888866555332 35666555555555555566 Q ss_pred heEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhcccc Q lcl|NC_019525. 160 EIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAAS 239 (343) Q Consensus 160 ~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~ 239 (343) +.+|.+.+....-.|.+++++++..+. ..+.++|.+-++.+-.....+ ..++|-+..|..+.... 
T Consensus 234 ~~~~~~g~g~g~~~g~~~~~~~~~~~~-------~~~~d~i~~~~~~l~~~y~~n-------a~~imn~~t~~~~~~~~- 298 (387) T protein:vir:26 234 KDALAVSPKSGLEHMSFYNGSVKEVEG-------ADMYDAIINALADLHEDYRDN-------ATIYMRYADYVKIISVL- 298 (387) T ss_pred HhHhhcCCCccccceeeeccccccccc-------cchHHHHHHHHhccChhhhcC-------CEEEEechHHHHHHHHH- Confidence 666644333212347777776644221 223444444444333322221 23455445555443332 Q ss_pred CCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecCCchhhcchhhcCCceEEe Q lcl|NC_019525. 240 ADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPVDYTSTLANSVNNFQFQN 319 (343) Q Consensus 240 s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~~~l~p~~~~~l~~~v 319 (343) .+.+. .++ . ..+..--|.|+.+.. ... .+++-+-+.=++.+. .+-+.+.. ....+ .... T Consensus 299 ~~~~~-~~~---~-~~~~~llG~PV~~~~-----~~~--------~~~~GDf~~~~~~~~-~~~~~~~~-~~~~~-~~~~ 357 (387) T protein:vir:26 299 SNGTT-NFF---D-TPAEKVFGKPVVFTD-----AAV--------KPIVGDFNYFGINYD-GTTYDTDK-DVKKG-EYLF 357 (387) T ss_pred hcCCC-ccc---c-cCCccccccceEEec-----CCC--------ceeeechhhhhhhhh-hhhheecc-cccCC-ceEE Confidence 33322 221 1 111122366665432 111 122211111011000 11111111 11112 2233 Q ss_pred ceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 320 AAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 320 ~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) -++.|++|..+ .|++++++.|.- T Consensus 358 ~~~~r~Dg~v~-~~~A~~~l~~ka 380 (387) T protein:vir:26 358 VLTAWYDQQRT-LDSAFRIAKAKE 380 (387) T ss_pred EEEEEeCcEee-chhheEEEEeec Confidence 45778888765 699999999955 No 93 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=91.80 E-value=0.014 Score=30.72 Aligned_cols=285 Identities=11% Similarity=0.030 Sum_probs=118.3 Q ss_pred CCceeeecCCchHHHHHHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEee Q lcl|NC_019525. 
1 MKKFVIRNSKGEKILLNAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTTY 80 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~~ 80 (343) ++.+...+.. .+.+.++... ...+.......| -+++.+ .+...|++...+.-.-+.+..+.+..+ ..+... T Consensus 95 ~r~~~~~~~~-~~~~~~~~~~--~~a~~~~~~~~g-G~lIP~--~~~~~Ii~~~~~~~~l~~~~~~~~~~~---~~~p~~ 165 (387) T protein:vir:94 95 YRHAILPNEF-EKPSMEAQRL--LHALPTGNDSGG-DKLLPK--TLSKEIVSEPFAKNQLREKARLTNIKG---LEIPRV 165 (387) T ss_pred HHHHHhhhhH-HHHHHHHHHH--HhhhccCCCCCC-ceeech--hHHHHHHHHHHhhchhhhhceeeecCC---ceeeee Confidence 1111100000 1111111100 001111111112 233332 234455554333322333333322221 111111 Q ss_pred e-ccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhh Q lcl|NC_019525. 81 R-SFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQ 159 (343) Q Consensus 81 ~-~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n 159 (343) + ..+.|.+++-| ...|.-+..+++.....+.++.-+.+|.+=|+-+. .+|.+--.....+++...++ T Consensus 166 ~~~~~~a~~v~Eg--------~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~----~~l~~~i~~~la~~~~~~e~ 233 (387) T protein:vir:94 166 SYTLDDDDFITDV--------ETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSD----VDLVNWVENALQSGLAAKER 233 (387) T ss_pred eccCCcccccccc--------ccccccccccceeeechheeeeechhhHHHHhhhH----HHHHHHHHHHHHHHHHHHHH Confidence 1 12334444322 25677778889999999999888888866555332 35666555555555555566 Q ss_pred heEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhcccc Q lcl|NC_019525. 160 EIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAAS 239 (343) Q Consensus 160 ~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~ 239 (343) +.+|.+.+....-.|.+++++++..+. ..+.++|.+-++.+-.....+ ..++|-+..|..+.... 
T Consensus 234 ~~~~~~g~g~g~~~g~~~~~~~~~~~~-------~~~~d~i~~~~~~l~~~y~~n-------a~~imn~~t~~~~~~~~- 298 (387) T protein:vir:94 234 KDALAVSPKSGLEHMSFYNGSVKEVEG-------ADMYDAIINALADLHEDYRDN-------ATIYMRYADYVKIISVL- 298 (387) T ss_pred HhHhhcCCCccccceeeeccccccccc-------cchHHHHHHHHhccChhhhcC-------CEEEEechHHHHHHHHH- Confidence 666644333212347777776644221 223444444444333322221 23455445555443332 Q ss_pred CCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecCCchhhcchhhcCCceEEe Q lcl|NC_019525. 240 ADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPVDYTSTLANSVNNFQFQN 319 (343) Q Consensus 240 s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~~~l~p~~~~~l~~~v 319 (343) .+.+. .++ . ..+..--|.|+.+.. ... .+++-+-+.=++.+. .+-+.+.. ....+ .... T Consensus 299 ~~~~~-~~~---~-~~~~~llG~PV~~~~-----~~~--------~~~~GDf~~~~~~~~-~~~~~~~~-~~~~~-~~~~ 357 (387) T protein:vir:94 299 SNGTT-NFF---D-TPAEKVFGKPVVFTD-----AAV--------KPIVGDFNYFGINYD-GTTYDTDK-DVKKG-EYLF 357 (387) T ss_pred hcCCC-ccc---c-cCCccccccceEEec-----CCC--------ceeeechhhhhhhhh-hhhheecc-cccCC-ceEE Confidence 33322 221 1 111122366665432 111 122211111011000 11111111 11112 2233 Q ss_pred ceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 320 AAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 320 ~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) -++.|++|..+ .|++++++.|.- T Consensus 358 ~~~~r~Dg~v~-~~~A~~~l~~ka 380 (387) T protein:vir:94 358 VLTAWYDQQRT-LDSAFRIAKAKE 380 (387) T ss_pred EEEEEeCcEee-chhheEEEEeec Confidence 45778888765 699999999955 No 94 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=91.77 E-value=0.014 Score=30.69 Aligned_cols=288 Identities=11% Similarity=0.034 Sum_probs=123.4 Q ss_pred CCceee--ec-CCchHHHHHHHHhhhcc----------------chhhh--hhhhhhhhhHHHHHHhhhhhhhccccccc Q lcl|NC_019525. 
1 MKKFVI--RN-SKGEKILLNAQEAKIAG----------------VIQRL--CNDLGFEIDVTTLTTLMKKIIEQKFFEIS 59 (343) Q Consensus 1 ~~~~~~--~~-~~~~~~~~~a~~~~~~~----------------~~~~~--~~d~~~~f~~~qL~~i~~~iye~~~~~l~ 59 (343) ...... .+ ...|+. .++....... ...++ ..+.+.-|++. +.+.+.|++.....-. T Consensus 36 ~~~~~~~~~~~~~~~~~-~~~~~~~~r~~~~~~~~~~~~~~~~~~~~al~~~~~~~gG~lIP--~~~~~~Ii~~l~~~s~ 112 (352) T protein:vir:78 36 VKDKGEAYQSLNDNEKL-VKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLP--KTLSKEIVSEPFAKNQ 112 (352) T ss_pred hhhccccccccchhhhH-HHHHHHHHHHHhhhhHHHHHHhhHHHHHHHhccCCCCCCceecc--HhHHHHHHHHHHhhcc Confidence 000000 00 011111 1110000000 00011 01122234444 2334445553322223 Q ss_pred chhhcceecCCCCceeEEEeee-ccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhc Q lcl|NC_019525. 60 PADYMPIRVGEGAWSTMLTTYR-SFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSG 138 (343) Q Consensus 60 ~~~~~pv~~~~~~w~~~~~~~~-~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~G 138 (343) -+.+..+.+..+ . .+...+ ..+.|.+++.+ ..+|..+..+++....++.++.-+.+|.+=|+-+. T Consensus 113 l~~~~~v~~~~~-~--~~p~~~~~~~~a~~v~E~--------~~~~~~~~~f~~v~~~~~k~~~~i~is~ell~Ds~--- 178 (352) T protein:vir:78 113 LREKARLTNIKG-L--EIPRVSYTLDDDDFITDV--------ETAKELKLKGDTVKFTTNKFKVFAAISDTVIHGSD--- 178 (352) T ss_pred hhhheeeEecCC-c--eEEEEecCCCcccccccc--------cccccccccceeeeecceeEEeechhhHHHHhhhh--- Confidence 344444433222 1 111111 12345555432 35777788899999999999888888877665432 Q ss_pred CCCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCce Q lcl|NC_019525. 
139 NWDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYT 218 (343) Q Consensus 139 r~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~ 218 (343) .+|.+--.....+++...++..+|...+....-.|.+++++++..++ ..+.|.|.+.++.+......+ T Consensus 179 -~~l~~~i~~~la~~~~~~e~~~~~~~g~g~~~~~g~l~~~~~~~~t~-------~~~~d~i~~~~~~l~~~~~~~---- 246 (352) T protein:vir:78 179 -VDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEG-------ANMYDAIINALADLHEDYRDN---- 246 (352) T ss_pred -HHHHHHHHHHHHHHHHHHHHHhhhhcCCCCcccccceeccccccccc-------cchHHHHHHHHhccChhhhcC---- Confidence 35666665666666655556665533232212357777777654221 123455555555433332221 Q ss_pred ecCCeEEeCHHHHHHHhccccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEE Q lcl|NC_019525. 219 AMPNKFTIPESDYTGLAGAASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRM 298 (343) Q Consensus 219 ~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~ 298 (343) ..++|-+..|..|...+ .+.+ ..++ . ..+..--|.|+.+.. .... +++-+-+.=++.+ T Consensus 247 ---a~~~mn~~t~~~l~~~~-~~~~-~~~~---~-~~~~~llG~PV~~~~-----~~~~--------~~~Gdf~~~~~~~ 304 (352) T protein:vir:78 247 ---ATIYMRYADYVKIISVL-SNGT-TNFF---D-TPAEKVFGKPVVFTD-----AAVK--------PIVGDFNYFGINY 304 (352) T ss_pred ---CEEEEehHHHHHHHHHH-hccC-Cccc---c-cCCccccccceEEec-----CCCc--------eeEeehhhhhhhh Confidence 24566556665554332 3333 2222 1 111112366665432 1111 1111111000000 Q ss_pred ecCCchhhcchhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 299 DIPVDYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 299 ~iP~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) . .+-+.++.- ...+ ....-+.+|++|. 
++.|++++++-+.- T Consensus 305 ~-~~~~~~~~~-~~~g-~~~f~~~~r~Dg~-~~~~eA~~~l~~~a 345 (352) T protein:vir:78 305 D-GTTYDTDKD-VKKG-EYLFVLTAWYDQQ-RTLDSAFRIAKAKE 345 (352) T ss_pred h-hheeeeecc-ccCC-eeEEEEEeeeCce-eechhheEEEEeec Confidence 0 111111111 1122 2333457899988 56699999998877 No 95 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=91.72 E-value=0.015 Score=30.66 Aligned_cols=289 Identities=9% Similarity=-0.020 Sum_probs=122.1 Q ss_pred CCceeeecCCchHHHHHHHHhh---hcc-------chhhh-hhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceec- Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAK---IAG-------VIQRL-CNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRV- 68 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~---~~~-------~~~~~-~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~- 68 (343) ..+.-......++.......+. +.. ..... ..+.| +++. +.+...|++...+.-.-+.+.++.. T Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg--~~vP--~~~~~~ii~~~~~~~~l~~~~~~~~~ 150 (394) T protein:vir:10 75 DNAQPNGTDLKKKPIDAKKKAINDFIHSHGKVIDNAAGHVTSTEAG--VLIP--EEIIYDPTAEVNSVVDLSTLVTKTPV 150 (394) T ss_pred hhhcccccchhhhHHHHHHHHHHHHHhccchhhhhhhcccccccCc--eecc--HHHHHHHHHHHHhhhhhhhhceeeec Confidence 0111101111011111111000 000 00001 11222 3333 2334455555444444444443321 Q ss_pred CCCCceeEEEeee-ccchhhhhhcccccCCcccccccc-ccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHH Q lcl|NC_019525. 69 GEGAWSTMLTTYR-SFSLAEDFATGIIDTGNSNGKLAA-VDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITAL 146 (343) Q Consensus 69 ~~~~w~~~~~~~~-~vg~a~~ia~g~~~~g~~a~Dip~-vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k 146 (343) ..+.+ .+.... .-+.+.+++.+ ...|. -+..+++.+..++.++.-+.+|.+=|+.+. 
.+|.+-- T Consensus 151 ~~~~~--~~~~~~~~~~~~~~~~E~--------~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~----~~l~~~i 216 (394) T protein:vir:10 151 TTPKG--TYPILKRATDRFSSVAEL--------AENPALAEPEFEQVDWSVSTYRGAIPLSEEAIADSA----VDLTSLV 216 (394) T ss_pred cCCce--EEEEEecCCCcccccccc--------ccccccccccceeEEeeeeeeEeeehhHHHHHhhhh----HHHHHHH Confidence 11211 222211 22344444432 24554 345788999999999988888887777543 3566766 Q ss_pred HHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEe Q lcl|NC_019525. 147 EKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTI 226 (343) Q Consensus 147 ~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~l 226 (343) .+...+++...+|+.+..|... |..+ +. -...+.+.|.+.++..+... + ...++| T Consensus 217 ~~~la~~~~~~~~~~il~g~g~--~~~~----------~~-----~~~~~~d~l~~~~~~~~~~~-----~---~a~~vm 271 (394) T protein:vir:10 217 GQSINEKSVNTYNAMIAPVLQS--FTAK----------AT-----TTDTLVDSLKHILNVDLDPA-----Y---SRALVV 271 (394) T ss_pred HHHHHHHHHHHHHHHHhhcccc--cccc----------cc-----cccccHHHHHHHHHhhhhhh-----c---cCEEEe Confidence 7777777777778777666321 1110 00 11234455554444333322 1 236899 Q ss_pred CHHHHHHHhccccCCCcchhhhhH-HHhh----cchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecC Q lcl|NC_019525. 227 PESDYTGLAGAASADFPIKSTKQV-LEDT----FKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIP 301 (343) Q Consensus 227 p~~~~~~L~~~~~s~~~~~tl~~~-l~~n----~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP 301 (343) .++.|..|..- .++.+.-++.- +.+. .+..-.|.|+.+.+..++ +. 
.+++..+++-+-+ +.+.+-.- T Consensus 272 n~~~~~~l~~l--kd~~G~~i~~~~~~~~~~~~~~~~L~G~PV~~~~~~~~---~~--~~~~~~i~~gd~s-~~~~~~~~ 343 (394) T protein:vir:10 272 TQSLFNTLDTL--KDKNGRYLLHDASDSITDGTAKGTVLGVPVYVVGDALL---GS--AAGDQKAFVGDLK-RGVLFADR 343 (394) T ss_pred cHHHHHHHHHh--hccCCCeeeeccccccccCCcccccccceeEEeccccc---CC--CCCceEEEEeecc-ccEEEEee Confidence 99999999643 34433332211 1110 011123566554332111 11 1122222222222 11111110 Q ss_pred Cchhhc-chhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 302 VDYTST-LANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 302 ~~~~~l-~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) ...+.. ....... ..+-.+.|++| .++.|.+++++-+.- T Consensus 344 ~~~~v~~~~~~~~~--~~~~~~~r~d~-~~~~~~ai~~~~~~~ 383 (394) T protein:vir:10 344 QQVTLAWEDSKIYG--RYLGAAFRFGV-KQADSNAGYFVTNTD 383 (394) T ss_pred cceEEEEecccccc--eeEEEEEEecc-EEeccccEEEEEeec Confidence 111111 1111111 12345678886 566699998876644 No 96 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=91.22 E-value=0.017 Score=30.30 Aligned_cols=309 Identities=11% Similarity=0.124 Sum_probs=126.0 Q ss_pred CCceeeecCC---chHHHHHHHHh---hhccchhhhhhhhhhhhhHHHHHHhhhhhhhcc---cccccchhhcceecCCC Q lcl|NC_019525. 1 MKKFVIRNSK---GEKILLNAQEA---KIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQK---FFEISPADYMPIRVGEG 71 (343) Q Consensus 1 ~~~~~~~~~~---~~~~~~~a~~~---~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~---~~~l~~~~~~pv~~~~~ 71 (343) ++++...... +.+.+-+.... .............+....+.+ .+-..|.+.. .|=++..+..|+. 
+ T Consensus 116 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~vP~--~~~~~i~~~l~~~~~l~~~~~v~~~~---g 190 (466) T protein:vir:80 116 MKGFFRNMPYEQRAALIARSEVKEFLAQVRTLAQQKRAVSGAELTIPD--VMLELLRDNMHRYSKLISKVRLRPLK---G 190 (466) T ss_pred HHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhccccccccH--HHHHHHHHhhhhhhhhhhheeeeecC---c Confidence 1111111000 00000000000 000011111112222222222 1112232221 1111222233321 1 Q ss_pred CceeEEEeeeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHH Q lcl|NC_019525. 72 AWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRK 151 (343) Q Consensus 72 ~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar 151 (343) .. .+......+.|.+++ .. .++|..+..+++....++.++.-+..|.+=|+-+ + .+|.+--..... T Consensus 191 -~~-~~~~~~~~~~a~wv~-------E~-~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds---~-~~l~~~i~~~la 256 (466) T protein:vir:80 191 -TA-RQNIAGAIPEGVWTE-------AV-ANLNELSLSFSQIEVDGYKVGGFIPIPNSTLEDS---D-LNLADEILDAIG 256 (466) T ss_pred -ee-EeeeecCCcceeecc-------cc-cccccccccccceeecceeeeeehhhhHHHHhcc---h-HHHHHHHHHHHH Confidence 11 111122222333332 22 4688888899999999999988777776666533 2 368888888888 Q ss_pred HHHHhhhhheEeeeeccccceeeeeecCCceeec---ccCCcccccCCHHHHHHH----------HHHHHHHHHhcCCce Q lcl|NC_019525. 152 KNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNN---TFLTKSIKSMTPAELKVL----------CAGIIDVYRQGCDYT 218 (343) Q Consensus 152 ~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~---a~~~~~w~~kT~~eIl~D----------in~~l~~v~~~s~~~ 218 (343) .++...+|+..+.|+-..+ -.|++|..+..... ....+.+.+.++..+... +.+++..+...-... 
T Consensus 257 ~~~~~~~~~ail~G~G~~~-P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 335 (466) T protein:vir:80 257 QAIGFALDKAILYGTGTKM-PVGIVTRLAQTTQPPNWGTKAPAWTNLSTTNLLKIDPTGKSAEEFFSELVLKLSKARANY 335 (466) T ss_pred HHHHHHHhhheeeccCCCC-cceeeecccccccccccccccccccccchhhhhhhhhhccchhhHHHHHHHHHHhhhccc Confidence 8888899999999954433 45999987543211 111223333343333221 111111111111112 Q ss_pred ecCCeE-EeCHHHHHHHhccccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEE Q lcl|NC_019525. 219 AMPNKF-TIPESDYTGLAGAASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLR 297 (343) Q Consensus 219 ~~p~tl-~lp~~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~ 297 (343) ..|..+ ++-+..+..|........+...+.....+..+ -.|.|+.+-+ .+.. +..-.|-...++++++ .-++ T Consensus 336 ~~~~~~w~~~~~~~~~l~~~~~~~~~~g~~~~~~~~~~~--i~G~pvv~s~--~~~~-~~~~~g~~~~y~i~~r--~~~~ 408 (466) T protein:vir:80 336 SNGMKFWAMSSNTHAVLMSKAITFNSAGALVASLNNTMP--IVGGDIVILD--FIPD-NDIIGGYGSLYLLAER--ADIK 408 (466) T ss_pred cCCceeEEecchhHHHhhcccccccCCccccccCCCccc--ccccceeecC--ccCc-cceeeeccccEEEEee--cceE Confidence 234443 33444455554333221111111111111111 1244432211 1111 1111122233444443 2233 Q ss_pred EecCCchhhcchhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 298 MDIPVDYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 298 ~~iP~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) +...-. . -...+...|. 
.++|++|- ++.|++++++|+.= T Consensus 409 i~~~~~-~---~f~~d~~~~r--~~~r~dg~-~~~~~afv~~~~~~ 447 (466) T protein:vir:80 409 LAQSEH-V---RFIEDQTVFK--GTARYDGK-PVFGEGFVAVNIAN 447 (466) T ss_pred EEechh-h---hhhcCcEEEE--EEEEEccE-EeccCceEEEEecC Confidence 332111 1 1123333343 47787654 57999999999765 No 97 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=90.96 E-value=0.018 Score=30.12 Aligned_cols=293 Identities=12% Similarity=0.023 Sum_probs=126.4 Q ss_pred CCceeeecCCchHHHHHHHHhh-------------hcc-chhh---hhhhhhhhhhHHHHHHhhhhhhhcccccccc--- Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAK-------------IAG-VIQR---LCNDLGFEIDVTTLTTLMKKIIEQKFFEISP--- 60 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~-------------~~~-~~~~---~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~--- 60 (343) -+.--.+..+++....++.... ... ..++ .....+..+++.+ .+...|++...+.-.- T Consensus 62 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~--~~~~~ii~~~~~~s~l~~~ 139 (392) T protein:vir:10 62 GREVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQ--DIQTQINELARSFDALEQY 139 (392) T ss_pred cccccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecch--hHHHHHHHHHHhhhhhhhh Confidence 0000001111111111110000 000 0000 0011111233322 2233444443333222 Q ss_pred hhhcceecCCCCceeEEEeeeccchhhhhhcccccCCcccccccccc-ccccceeeeeEEEEeeEeecHHHHHHHHHhcC Q lcl|NC_019525. 61 ADYMPIRVGEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVD-TGVDAVSIKTYKWAKTIGWTLPELAEAMKSGN 139 (343) Q Consensus 61 ~~~~pv~~~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd-~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr 139 (343) .+.+|+.+. .+...+......+.+.+++.+ ..+|..+ ..+++.....+.++.-+.+|.+=|+.+. 
T Consensus 140 ~~~~~~~~~--~~~~~~~~~~~~~~a~~v~E~--------~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~---- 205 (392) T protein:vir:10 140 VTVEPVRTR--SGSRVLEKNSDMIPFAEITEM--------GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSD---- 205 (392) T ss_pred ceeeeccCC--ceeEEEEeecCCccceeeccc--------ccccccccccceeEEeeeeeEEEeehhhHHHHhhhH---- Confidence 333444322 222222222222344455433 2455443 4788889999999999999987776542 Q ss_pred CCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCcee Q lcl|NC_019525. 140 WDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTA 219 (343) Q Consensus 140 ~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~ 219 (343) .+|.+--.+...+++...+|..++.|.... .- . ...+.+.|.+.++..+.. .+. T Consensus 206 ~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~-~~--------------~-----~~~~~d~i~~~~~~~l~~-----~~~- 259 (392) T protein:vir:10 206 QNILKYVTKWLGKKSKVTRNVLILGVIEKL-TK--------------Q-----AIKSLDDIKDVLNVKLDP-----AIS- 259 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccccc-cc--------------c-----CccCHHHHHHHHHHhhhh-----hhc- Confidence 357777777788888888888877774221 10 0 112345555555433322 111 Q ss_pred cCCeEEeCHHHHHHHhccccCCCcchhhhh-HHHhhcchhcCCcceEE-eechhhhhcccccC-CCccEEEEEEcCcceE Q lcl|NC_019525. 220 MPNKFTIPESDYTGLAGAASADFPIKSTKQ-VLEDTFKEITRNSSFEI-LPCVYADKITAQVP-AVAKRYALYNDNEDSL 296 (343) Q Consensus 220 ~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~-~l~~n~~~~~~g~~l~I-~~~~~~~~~~~~g~-gg~drmv~Y~~d~~~v 296 (343) ....++|.|+.|..|.+- -++.+.-|+. -+.+.....-.|.|.-+ .+.. -....+. 
.++..++.-+-+.-++ T Consensus 260 ~~a~~vm~~~~~~~L~~l--kd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~---~~~~~~~~~~~~~~~~gdfs~~~~ 334 (392) T protein:vir:10 260 PNAILLTNQDGFNYLDKL--KDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNR---FLKSKGTTAKKAPLIIGDLKEAIV 334 (392) T ss_pred cCCEEEEcHHHHHHHHHh--hccCCCeEeecCccCCccccccCcccEEEeccc---ccCCCcccCCceEEEEEehhceEE Confidence 224689999999999643 3433332221 11111111112333221 1111 1111122 2333333222221111 Q ss_pred -EEecCCchhhcc----hhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 297 -RMDIPVDYTSTL----ANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 297 -~~~iP~~~~~l~----p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) ...-.+.+.... -.+.+ ...+-++.|+|+ .++.|.+++.+.+.. T Consensus 335 i~~~~~~~~~~~~~~~~~f~~~--~~~~r~~~r~d~-~v~~~~a~~~l~~~~ 383 (392) T protein:vir:10 335 LFKREDMELASTDVGGKAFTRN--TLDLRAIQRDDV-QMWDNEAAVYGEIDL 383 (392) T ss_pred EEeecceEEEEeccccchhhcC--ceEEEEEEeecc-EEecccceEEEEecc Confidence 111122222211 11222 233557888886 678899999999977 No 98 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=90.96 E-value=0.018 Score=30.12 Aligned_cols=293 Identities=12% Similarity=0.023 Sum_probs=126.4 Q ss_pred CCceeeecCCchHHHHHHHHhh-------------hcc-chhh---hhhhhhhhhhHHHHHHhhhhhhhcccccccc--- Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAK-------------IAG-VIQR---LCNDLGFEIDVTTLTTLMKKIIEQKFFEISP--- 60 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~-------------~~~-~~~~---~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~--- 60 (343) -+.--.+..+++....++.... ... 
..++ .....+..+++.+ .+...|++...+.-.- T Consensus 62 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~--~~~~~ii~~~~~~s~l~~~ 139 (392) T protein:vir:10 62 GREVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQ--DIQTQINELARSFDALEQY 139 (392) T ss_pred cccccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecch--hHHHHHHHHHHhhhhhhhh Confidence 0000001111111111110000 000 0000 0011111233322 2233444443333222 Q ss_pred hhhcceecCCCCceeEEEeeeccchhhhhhcccccCCcccccccccc-ccccceeeeeEEEEeeEeecHHHHHHHHHhcC Q lcl|NC_019525. 61 ADYMPIRVGEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVD-TGVDAVSIKTYKWAKTIGWTLPELAEAMKSGN 139 (343) Q Consensus 61 ~~~~pv~~~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd-~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr 139 (343) .+.+|+.+. .+...+......+.+.+++.+ ..+|..+ ..+++.....+.++.-+.+|.+=|+.+. T Consensus 140 ~~~~~~~~~--~~~~~~~~~~~~~~a~~v~E~--------~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~---- 205 (392) T protein:vir:10 140 VTVEPVRTR--SGSRVLEKNSDMIPFAEITEM--------GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSD---- 205 (392) T ss_pred ceeeeccCC--ceeEEEEeecCCccceeeccc--------ccccccccccceeEEeeeeeEEEeehhhHHHHhhhH---- Confidence 333444322 222222222222344455433 2455443 4788889999999999999987776542 Q ss_pred CCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCcee Q lcl|NC_019525. 140 WDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTA 219 (343) Q Consensus 140 ~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~ 219 (343) .+|.+--.+...+++...+|..++.|.... .- . ...+.+.|.+.++..+.. .+. 
T Consensus 206 ~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~-~~--------------~-----~~~~~d~i~~~~~~~l~~-----~~~- 259 (392) T protein:vir:10 206 QNILKYVTKWLGKKSKVTRNVLILGVIEKL-TK--------------Q-----AIKSLDDIKDVLNVKLDP-----AIS- 259 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccccc-cc--------------c-----CccCHHHHHHHHHHhhhh-----hhc- Confidence 357777777788888888888877774221 10 0 112345555555433322 111 Q ss_pred cCCeEEeCHHHHHHHhccccCCCcchhhhh-HHHhhcchhcCCcceEE-eechhhhhcccccC-CCccEEEEEEcCcceE Q lcl|NC_019525. 220 MPNKFTIPESDYTGLAGAASADFPIKSTKQ-VLEDTFKEITRNSSFEI-LPCVYADKITAQVP-AVAKRYALYNDNEDSL 296 (343) Q Consensus 220 ~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~-~l~~n~~~~~~g~~l~I-~~~~~~~~~~~~g~-gg~drmv~Y~~d~~~v 296 (343) ....++|.|+.|..|.+- -++.+.-|+. -+.+.....-.|.|.-+ .+.. -....+. .++..++.-+-+.-++ T Consensus 260 ~~a~~vm~~~~~~~L~~l--kd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~---~~~~~~~~~~~~~~~~gdfs~~~~ 334 (392) T protein:vir:10 260 PNAILLTNQDGFNYLDKL--KDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNR---FLKSKGTTAKKAPLIIGDLKEAIV 334 (392) T ss_pred cCCEEEEcHHHHHHHHHh--hccCCCeEeecCccCCccccccCcccEEEeccc---ccCCCcccCCceEEEEEehhceEE Confidence 224689999999999643 3433332221 11111111112333221 1111 1111122 2333333222221111 Q ss_pred -EEecCCchhhcc----hhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 297 -RMDIPVDYTSTL----ANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 297 -~~~iP~~~~~l~----p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) ...-.+.+.... -.+.+ ...+-++.|+|+ .++.|.+++.+.+.. 
T Consensus 335 i~~~~~~~~~~~~~~~~~f~~~--~~~~r~~~r~d~-~v~~~~a~~~l~~~~ 383 (392) T protein:vir:10 335 LFKREDMELASTDVGGKAFTRN--TLDLRAIQRDDV-QMWDNEAAVYGEIDL 383 (392) T ss_pred EEeecceEEEEeccccchhhcC--ceEEEEEEeecc-EEecccceEEEEecc Confidence 111122222211 11222 233557888886 678899999999977 No 99 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=90.96 E-value=0.018 Score=30.12 Aligned_cols=293 Identities=12% Similarity=0.023 Sum_probs=126.4 Q ss_pred CCceeeecCCchHHHHHHHHhh-------------hcc-chhh---hhhhhhhhhhHHHHHHhhhhhhhcccccccc--- Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAK-------------IAG-VIQR---LCNDLGFEIDVTTLTTLMKKIIEQKFFEISP--- 60 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~-------------~~~-~~~~---~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~--- 60 (343) -+.--.+..+++....++.... ... ..++ .....+..+++.+ .+...|++...+.-.- T Consensus 62 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~--~~~~~ii~~~~~~s~l~~~ 139 (392) T protein:vir:10 62 GREVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQ--DIQTQINELARSFDALEQY 139 (392) T ss_pred cccccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecch--hHHHHHHHHHHhhhhhhhh Confidence 0000001111111111110000 000 0000 0011111233322 2233444443333222 Q ss_pred hhhcceecCCCCceeEEEeeeccchhhhhhcccccCCcccccccccc-ccccceeeeeEEEEeeEeecHHHHHHHHHhcC Q lcl|NC_019525. 61 ADYMPIRVGEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVD-TGVDAVSIKTYKWAKTIGWTLPELAEAMKSGN 139 (343) Q Consensus 61 ~~~~pv~~~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd-~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr 139 (343) .+.+|+.+. .+...+......+.+.+++.+ ..+|..+ ..+++.....+.++.-+.+|.+=|+.+. 
T Consensus 140 ~~~~~~~~~--~~~~~~~~~~~~~~a~~v~E~--------~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~---- 205 (392) T protein:vir:10 140 VTVEPVRTR--SGSRVLEKNSDMIPFAEITEM--------GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSD---- 205 (392) T ss_pred ceeeeccCC--ceeEEEEeecCCccceeeccc--------ccccccccccceeEEeeeeeEEEeehhhHHHHhhhH---- Confidence 333444322 222222222222344455433 2455443 4788889999999999999987776542 Q ss_pred CCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCcee Q lcl|NC_019525. 140 WDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTA 219 (343) Q Consensus 140 ~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~ 219 (343) .+|.+--.+...+++...+|..++.|.... .- . ...+.+.|.+.++..+.. .+. T Consensus 206 ~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~-~~--------------~-----~~~~~d~i~~~~~~~l~~-----~~~- 259 (392) T protein:vir:10 206 QNILKYVTKWLGKKSKVTRNVLILGVIEKL-TK--------------Q-----AIKSLDDIKDVLNVKLDP-----AIS- 259 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccccc-cc--------------c-----CccCHHHHHHHHHHhhhh-----hhc- Confidence 357777777788888888888877774221 10 0 112345555555433322 111 Q ss_pred cCCeEEeCHHHHHHHhccccCCCcchhhhh-HHHhhcchhcCCcceEE-eechhhhhcccccC-CCccEEEEEEcCcceE Q lcl|NC_019525. 220 MPNKFTIPESDYTGLAGAASADFPIKSTKQ-VLEDTFKEITRNSSFEI-LPCVYADKITAQVP-AVAKRYALYNDNEDSL 296 (343) Q Consensus 220 ~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~-~l~~n~~~~~~g~~l~I-~~~~~~~~~~~~g~-gg~drmv~Y~~d~~~v 296 (343) ....++|.|+.|..|.+- -++.+.-|+. -+.+.....-.|.|.-+ .+.. -....+. 
.++..++.-+-+.-++ T Consensus 260 ~~a~~vm~~~~~~~L~~l--kd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~---~~~~~~~~~~~~~~~~gdfs~~~~ 334 (392) T protein:vir:10 260 PNAILLTNQDGFNYLDKL--KDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNR---FLKSKGTTAKKAPLIIGDLKEAIV 334 (392) T ss_pred cCCEEEEcHHHHHHHHHh--hccCCCeEeecCccCCccccccCcccEEEeccc---ccCCCcccCCceEEEEEehhceEE Confidence 224689999999999643 3433332221 11111111112333221 1111 1111122 2333333222221111 Q ss_pred -EEecCCchhhcc----hhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 297 -RMDIPVDYTSTL----ANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 297 -~~~iP~~~~~l~----p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) ...-.+.+.... -.+.+ ...+-++.|+|+ .++.|.+++.+.+.. T Consensus 335 i~~~~~~~~~~~~~~~~~f~~~--~~~~r~~~r~d~-~v~~~~a~~~l~~~~ 383 (392) T protein:vir:10 335 LFKREDMELASTDVGGKAFTRN--TLDLRAIQRDDV-QMWDNEAAVYGEIDL 383 (392) T ss_pred EEeecceEEEEeccccchhhcC--ceEEEEEEeecc-EEecccceEEEEecc Confidence 111122222211 11222 233557888886 678899999999977 No 100 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=90.96 E-value=0.018 Score=30.12 Aligned_cols=293 Identities=12% Similarity=0.023 Sum_probs=126.4 Q ss_pred CCceeeecCCchHHHHHHHHhh-------------hcc-chhh---hhhhhhhhhhHHHHHHhhhhhhhcccccccc--- Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAK-------------IAG-VIQR---LCNDLGFEIDVTTLTTLMKKIIEQKFFEISP--- 60 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~-------------~~~-~~~~---~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~--- 60 (343) -+.--.+..+++....++.... ... 
..++ .....+..+++.+ .+...|++...+.-.- T Consensus 62 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~--~~~~~ii~~~~~~s~l~~~ 139 (392) T protein:vir:10 62 GREVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQ--DIQTQINELARSFDALEQY 139 (392) T ss_pred cccccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecch--hHHHHHHHHHHhhhhhhhh Confidence 0000001111111111110000 000 0000 0011111233322 2233444443333222 Q ss_pred hhhcceecCCCCceeEEEeeeccchhhhhhcccccCCcccccccccc-ccccceeeeeEEEEeeEeecHHHHHHHHHhcC Q lcl|NC_019525. 61 ADYMPIRVGEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVD-TGVDAVSIKTYKWAKTIGWTLPELAEAMKSGN 139 (343) Q Consensus 61 ~~~~pv~~~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd-~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr 139 (343) .+.+|+.+. .+...+......+.+.+++.+ ..+|..+ ..+++.....+.++.-+.+|.+=|+.+. T Consensus 140 ~~~~~~~~~--~~~~~~~~~~~~~~a~~v~E~--------~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~---- 205 (392) T protein:vir:10 140 VTVEPVRTR--SGSRVLEKNSDMIPFAEITEM--------GEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSD---- 205 (392) T ss_pred ceeeeccCC--ceeEEEEeecCCccceeeccc--------ccccccccccceeEEeeeeeEEEeehhhHHHHhhhH---- Confidence 333444322 222222222222344455433 2455443 4788889999999999999987776542 Q ss_pred CCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCcee Q lcl|NC_019525. 140 WDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTA 219 (343) Q Consensus 140 ~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~ 219 (343) .+|.+--.+...+++...+|..++.|.... .- . ...+.+.|.+.++..+.. .+. 
T Consensus 206 ~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~-~~--------------~-----~~~~~d~i~~~~~~~l~~-----~~~- 259 (392) T protein:vir:10 206 QNILKYVTKWLGKKSKVTRNVLILGVIEKL-TK--------------Q-----AIKSLDDIKDVLNVKLDP-----AIS- 259 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccccc-cc--------------c-----CccCHHHHHHHHHHhhhh-----hhc- Confidence 357777777788888888888877774221 10 0 112345555555433322 111 Q ss_pred cCCeEEeCHHHHHHHhccccCCCcchhhhh-HHHhhcchhcCCcceEE-eechhhhhcccccC-CCccEEEEEEcCcceE Q lcl|NC_019525. 220 MPNKFTIPESDYTGLAGAASADFPIKSTKQ-VLEDTFKEITRNSSFEI-LPCVYADKITAQVP-AVAKRYALYNDNEDSL 296 (343) Q Consensus 220 ~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~-~l~~n~~~~~~g~~l~I-~~~~~~~~~~~~g~-gg~drmv~Y~~d~~~v 296 (343) ....++|.|+.|..|.+- -++.+.-|+. -+.+.....-.|.|.-+ .+.. -....+. .++..++.-+-+.-++ T Consensus 260 ~~a~~vm~~~~~~~L~~l--kd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~---~~~~~~~~~~~~~~~~gdfs~~~~ 334 (392) T protein:vir:10 260 PNAILLTNQDGFNYLDKL--KDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNR---FLKSKGTTAKKAPLIIGDLKEAIV 334 (392) T ss_pred cCCEEEEcHHHHHHHHHh--hccCCCeEeecCccCCccccccCcccEEEeccc---ccCCCcccCCceEEEEEehhceEE Confidence 224689999999999643 3433332221 11111111112333221 1111 1111122 2333333222221111 Q ss_pred -EEecCCchhhcc----hhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 297 -RMDIPVDYTSTL----ANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 297 -~~~iP~~~~~l~----p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) ...-.+.+.... -.+.+ ...+-++.|+|+ .++.|.+++.+.+.. 
T Consensus 335 i~~~~~~~~~~~~~~~~~f~~~--~~~~r~~~r~d~-~v~~~~a~~~l~~~~ 383 (392) T protein:vir:10 335 LFKREDMELASTDVGGKAFTRN--TLDLRAIQRDDV-QMWDNEAAVYGEIDL 383 (392) T ss_pred EEeecceEEEEeccccchhhcC--ceEEEEEEeecc-EEecccceEEEEecc Confidence 111122222211 11222 233557888886 678899999999977 No 101 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=90.63 E-value=0.02 Score=29.92 Aligned_cols=289 Identities=10% Similarity=0.019 Sum_probs=117.0 Q ss_pred CCceee--ecCCchHHHHHHHHh-------------------hhccchhhhhhhhhhhhhHHHHHHhhhhhhhccccccc Q lcl|NC_019525. 1 MKKFVI--RNSKGEKILLNAQEA-------------------KIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEIS 59 (343) Q Consensus 1 ~~~~~~--~~~~~~~~~~~a~~~-------------------~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~ 59 (343) +..--. .+...+.-..++... .....+.. ..+.+.-+++.+ .+...|++.....-. T Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~-~t~~~GG~lIP~--~~~~~Ii~~~~~~~~ 162 (402) T protein:vir:93 86 VKDKGEAYQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPT-GNDSGGDKLLPK--TLSKEIVSEPFAKNQ 162 (402) T ss_pred hhhccccCCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHhHHHHHhhhcc-CCCcCCccccch--hHHHHHHHhHHhhhh Confidence 100000 000101111111000 00001111 111111233332 234445544322222 Q ss_pred chhhcceecCCCCceeEEEeeeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcC Q lcl|NC_019525. 60 PADYMPIRVGEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGN 139 (343) Q Consensus 60 ~~~~~pv~~~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr 139 (343) -+.+..+.+..+.....+. ...+.|.+++-+ ..+|..+..+++.+..++.++.-+.+|.+=|+-+. 
T Consensus 163 l~~~~~v~~~~~~~~p~~~--~~~~~a~~v~Eg--------~~~~~~~~~f~~i~~~~~k~~~~i~iS~ell~Ds~---- 228 (402) T protein:vir:93 163 LREKARLTNIKGLEIPRVS--YTLDDDDFITDV--------ETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSD---- 228 (402) T ss_pred hhhhceeeecCCceeeeee--ccCCcccccccc--------ccccccccccceeeecceeeeeechhhHHHHhhhH---- Confidence 2333333322221111111 112334444432 24677777889999999999888888866555332 Q ss_pred CCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCcee Q lcl|NC_019525. 140 WDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTA 219 (343) Q Consensus 140 ~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~ 219 (343) .+|.+--.....+++...+++.+|.+.+....-.|.+++++++..+. ..+.| +|.+++..+.. .|.. T Consensus 229 ~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~~~~~~~~~~~~-------~~~~d----~l~~~~~~l~~--~y~~ 295 (402) T protein:vir:93 229 VDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEG-------ADMYD----AIINALADLHE--DYRD 295 (402) T ss_pred HHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccc-------cchHH----HHHHHHhccCh--hhhc Confidence 35666555555555555566655544333212347777776644221 11234 44444443322 2221 Q ss_pred cCCeEEeCHHHHHHHhccccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEe Q lcl|NC_019525. 220 MPNKFTIPESDYTGLAGAASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMD 299 (343) Q Consensus 220 ~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~ 299 (343) ...++|-+..|..|...+ .+.+. .++ . ..+..--|.|+.+.. .... +++-+-+.-++-+. T Consensus 296 -na~~imn~~t~~~~~~~~-~d~~~-~~~---~-~~~~~llG~PV~~t~-----~~~~--------i~~GDf~~~~~~~~ 355 (402) T protein:vir:93 296 -NATIYMRYADYVKIISVL-SNGTT-NFF---D-TPAEKVFGKPVVFTD-----AAVK--------PIVGDFNYFGINYD 355 (402) T ss_pred -CCEEEEechHHHHHHHHH-hcCCC-ccc---c-cCCccccccceEEec-----CCCc--------eeeechhhhhhhhh Confidence 124566555554443332 33322 221 1 111122366665432 1111 12111111011010 Q ss_pred cCCchhhcchhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 
300 IPVDYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 300 iP~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) .+.+.+.. ....+ ....-+..|++|..+ .|++++++.+.- T Consensus 356 -~~~~~~~~-~~~~~-~~~~~~~~r~Dg~v~-~~~A~~~l~ik~ 395 (402) T protein:vir:93 356 -GTTYDTDK-DVKKG-EYLFVLTAWYDQQRT-LDSAFRIAKAKE 395 (402) T ss_pred -hhhhhhhh-cccCC-ceEEEEEEEeCcEEe-chhheEEEEeec Confidence 11111111 11112 233345779988765 599999999955 No 102 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=88.53 E-value=0.032 Score=28.80 Aligned_cols=284 Identities=12% Similarity=-0.005 Sum_probs=115.6 Q ss_pred CCceeeecCCchHH--HHHHHHhhhccchhhh--hhhhhhhhhHHHHHHhhhhhhhccccc---ccchhhcceecCCCCc Q lcl|NC_019525. 1 MKKFVIRNSKGEKI--LLNAQEAKIAGVIQRL--CNDLGFEIDVTTLTTLMKKIIEQKFFE---ISPADYMPIRVGEGAW 73 (343) Q Consensus 1 ~~~~~~~~~~~~~~--~~~a~~~~~~~~~~~~--~~d~~~~f~~~qL~~i~~~iye~~~~~---l~~~~~~pv~~~~~~w 73 (343) +.++...+...... ................ ..+.+ +.+. +.+...|.+. ... ....+.+|+.+.. + T Consensus 103 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~vp--~~~~~~i~~~-~~~~~l~~~~~~~~~~~~~--~ 175 (397) T protein:vir:96 103 MKKFKVTEEELAEKRSAINAFVKSKGAEKRDGFTSVEGG--ALIP--QELLQPQLEP-KDIVDLSKYVRSVPVNSAS--G 175 (397) T ss_pred HHHHhhhhHHHHHHHHHHHHHHHhhhhhhhhcccccccc--cchh--HHHHHHHHHh-hhhhhHHHhhhhccccccc--e Confidence 22222111111000 0000000000000000 11111 1111 2233334332 111 2233333432221 1 Q ss_pred eeEEEeee-ccchhhhhhcccccCCcccccccc-ccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHH Q lcl|NC_019525. 74 STMLTTYR-SFSLAEDFATGIIDTGNSNGKLAA-VDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRK 151 (343) Q Consensus 74 ~~~~~~~~-~vg~a~~ia~g~~~~g~~a~Dip~-vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar 151 (343) . +...+ .-+.+.++..+ ...|. -+..+++.+..++.++.-+.+|.+=|+.+. .+|.+--..... 
T Consensus 176 ~--~~~~~~~~~~~~~~~E~--------~~~~~~~~~~~~~i~~~~~~~~~~~~~s~ell~ds~----~~l~~~i~~~l~ 241 (397) T protein:vir:96 176 K--FPVISKSGSKMATVQQL--------EKNPQLANPKMVEIDYSVATRRGYIPISQEMIDDAS----YDVTGLIADEIQ 241 (397) T ss_pred e--EEEEeccCCcccccccc--------ccccccccccccceeecHhHhhcchhhHHHHHhhhH----HHHHHHHHHHHH Confidence 1 11111 11222233221 23443 456778888888887777777766655443 245555555566 Q ss_pred HHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHH Q lcl|NC_019525. 152 KNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDY 231 (343) Q Consensus 152 ~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~ 231 (343) +++...+|...+.|.... +.. ...|.|+|.+-++..+... + ...++|.|+.| T Consensus 242 ~~~~~~~~~~i~~g~g~~---------------~~~-----~~~~~d~~~~~~~~~~~~~-----~---~a~~v~n~~~~ 293 (397) T protein:vir:96 242 DQSLNTKNADIAAVLKTA---------------TAK-----SVVGVDGLKDLINKEIKKV-----Y---DVKLFISASMY 293 (397) T ss_pred HHHHHHHHHHHhhccccc---------------ccc-----cccchHHHHHHHHHhhhhh-----c---CcEEEEcHHHH Confidence 666666666666663211 000 1234455554444333221 1 24699999999 Q ss_pred HHHhccccCCCcchhhhh-HHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecCCchhhcchh Q lcl|NC_019525. 232 TGLAGAASADFPIKSTKQ-VLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPVDYTSTLAN 310 (343) Q Consensus 232 ~~L~~~~~s~~~~~tl~~-~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~~~l~p~ 310 (343) ..|..- .++.+..+.. -+.+.....-.|.|+...+ .....+..++..+++-+-+.-++ +-.-..++...-. T Consensus 294 ~~l~~l--kd~~G~~~~~~~~~~~~~~~l~G~pv~~~~-----~~~~~~~~~~~~~~~gd~~~~~~-~~~~~~~~~~~~~ 365 (397) T protein:vir:96 294 SELDKL--KDKNGRYLLQDSITAASGKQLLGKEVVVLD-----DDVIGKSVGNVVGFIGDAKAFAS-FFDRKQVSVSWVD 365 (397) T ss_pred HHHHHh--hccCCCeEeccCccCCCcccccccceEEec-----ccccCCCCCceEEEEeehhcceE-eEeecceEEEEec Confidence 999643 3444443321 1112222222466654332 11112222333444333222121 1111111111111 Q ss_pred hcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 
311 SVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 311 ~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) ... ...-+-.++|++| .++.|.+++++-+++ T Consensus 366 ~~~-~~~~~~~~~r~d~-~~~~~~a~~~~~~~~ 396 (397) T protein:vir:96 366 NNI-YGQLLAGIIRYDV-KATDKKAGFYVTFTI 396 (397) T ss_pred ccc-cceeEEEEEEEcc-EEecccceEEEEeec Confidence 111 1122234678888 667999999999999 No 103 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=88.45 E-value=0.032 Score=28.76 Aligned_cols=303 Identities=9% Similarity=-0.037 Sum_probs=125.6 Q ss_pred CCceeeecCCchHHHHHHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEee Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTTY 80 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~~ 80 (343) +..++....+ .+.+-+.++.-.+........+.| +++. +.+..+|++.-.+.=.-+++..|.+..+ .. .+... T Consensus 52 ~~~~~~~~~~-~~~lt~~e~~~~~~~~~~~~~~gg--~lvP--~~~~~~I~~~l~~~s~i~~~~~v~~~~~-~~-~i~~~ 124 (381) T protein:vir:10 52 AERVSSLPKS-AQSLSANQRSFFMDINKNVNYKEE--KLLP--EETIDRIFEDLTTNHPLLADLGIKNAGL-RL-KFLKS 124 (381) T ss_pred HHHHHHhccC-cccccHHHHHHHHHHhcccCCCCc--eecC--HHHHHHHHHHHHhhccceeheeeEecCc-ce-EEEEe Confidence 1111111111 111100000000000111112333 2332 2233344443111111122223222221 12 23333 Q ss_pred eccchhhhhhcccccCCccccccc-cccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhh Q lcl|NC_019525. 81 RSFSLAEDFATGIIDTGNSNGKLA-AVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQ 159 (343) Q Consensus 81 ~~vg~a~~ia~g~~~~g~~a~Dip-~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n 159 (343) +..+.|.++.-+ +.++ ..+..+.+.....+..+.-...|.+=|+-+. 
.+|++--.....+++...++ T Consensus 125 ~~~~~a~w~~e~--------~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~----~~ie~~i~~~la~~~a~~~~ 192 (381) T protein:vir:10 125 ETSGVAVWGKIY--------GEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGP----AWIERFVRVQIEEAFAVALE 192 (381) T ss_pred cCCcceeeeccc--------ccccccccccceeeeecceeEEeechhhHHHhhcCH----HHHHHHHHHHHHHHHHHHhh Confidence 344555554422 2343 3456788888888998877777765555332 36777777888888888899 Q ss_pred heEeeeeccccceeeeeecCCceeecccCCcc-------cccCCHHHHHHHHHHHHHHHHhcCCc---eecCC-eEEeCH Q lcl|NC_019525. 160 EIAFVGMKDNANVKGLLTQTGNVVNNTFLTKS-------IKSMTPAELKVLCAGIIDVYRQGCDY---TAMPN-KFTIPE 228 (343) Q Consensus 160 ~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~-------w~~kT~~eIl~Din~~l~~v~~~s~~---~~~p~-tl~lp~ 228 (343) +-.+.|+-.. .-.|+++++......+.+... ....++.-+.+.+..++...-..-+. ....+ .++|-+ T Consensus 193 ~a~i~G~G~~-qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~ 271 (381) T protein:vir:10 193 TAFLKGTGKD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNP 271 (381) T ss_pred heeEeccCCC-CceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEcc Confidence 9999995333 346999986543222221111 11222222333444443333221111 11122 345666 Q ss_pred HHHHHHhccccC-CCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecCCchhhc Q lcl|NC_019525. 229 SDYTGLAGAASA-DFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPVDYTST 307 (343) Q Consensus 229 ~~~~~L~~~~~s-~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~~~l 307 (343) ..+..|...... +.++.-+ ..-+.+++|.....+.. +..-.|--.++++.++. -+++..- + . 
T Consensus 272 ~t~~~l~~~~~~~~~~G~~v----------~~l~~g~~vv~s~~~p~-~~iifgDfs~Y~i~~r~--~~~i~~~-~---~ 334 (381) T protein:vir:10 272 SDAFEVQAQYTHLNANGVYV----------TALPFNLNVIESTVQEA-GKVLTYVKGLYDGYLAG--GINVQKF-K---E 334 (381) T ss_pred ccHHhhccccccCCCCCcee----------ecCCCCceEEecCCCCc-CcEEEEecccEEEEEec--ccEEEee-c---h Confidence 665555321111 1111100 00112233322221111 11111222234444432 2222211 1 1 Q ss_pred chhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 308 LANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 308 ~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) ....++-..|. ...|++|- ++.|++++++|+-+ T Consensus 335 ~~~~~d~~~f~--a~~r~dg~-~~~~~A~~v~~l~~ 367 (381) T protein:vir:10 335 TLALDDMDLYT--AKQFAYGK-AKDNKVAAVWKLDL 367 (381) T ss_pred hHhhcCCeEEE--EEEEEcCE-EecCceEEEEEEEe Confidence 11223323343 47777774 57899999999988 No 104 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=88.45 E-value=0.032 Score=28.76 Aligned_cols=303 Identities=9% Similarity=-0.037 Sum_probs=125.6 Q ss_pred CCceeeecCCchHHHHHHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEee Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTTY 80 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~~ 80 (343) +..++....+ .+.+-+.++.-.+........+.| +++. +.+..+|++.-.+.=.-+++..|.+..+ .. .+... 
T Consensus 52 ~~~~~~~~~~-~~~lt~~e~~~~~~~~~~~~~~gg--~lvP--~~~~~~I~~~l~~~s~i~~~~~v~~~~~-~~-~i~~~ 124 (381) T protein:vir:95 52 AERVSSLPKS-AQSLSANQRSFFMDINKNVNYKEE--KLLP--EETIDRIFEDLTTNHPLLADLGIKNAGL-RL-KFLKS 124 (381) T ss_pred HHHHHHhccC-cccccHHHHHHHHHHhcccCCCCc--eecC--HHHHHHHHHHHHhhccceeheeeEecCc-ce-EEEEe Confidence 1111111111 111100000000000111112333 2332 2233344443111111122223222221 12 23333 Q ss_pred eccchhhhhhcccccCCccccccc-cccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhh Q lcl|NC_019525. 81 RSFSLAEDFATGIIDTGNSNGKLA-AVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQ 159 (343) Q Consensus 81 ~~vg~a~~ia~g~~~~g~~a~Dip-~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n 159 (343) +..+.|.++.-+ +.++ ..+..+.+.....+..+.-...|.+=|+-+. .+|++--.....+++...++ T Consensus 125 ~~~~~a~w~~e~--------~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~----~~ie~~i~~~la~~~a~~~~ 192 (381) T protein:vir:95 125 ETSGVAVWGKIY--------GEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGP----AWIERFVRVQIEEAFAVALE 192 (381) T ss_pred cCCcceeeeccc--------ccccccccccceeeeecceeEEeechhhHHHhhcCH----HHHHHHHHHHHHHHHHHHhh Confidence 344555554422 2343 3456788888888998877777765555332 36777777888888888899 Q ss_pred heEeeeeccccceeeeeecCCceeecccCCcc-------cccCCHHHHHHHHHHHHHHHHhcCCc---eecCC-eEEeCH Q lcl|NC_019525. 160 EIAFVGMKDNANVKGLLTQTGNVVNNTFLTKS-------IKSMTPAELKVLCAGIIDVYRQGCDY---TAMPN-KFTIPE 228 (343) Q Consensus 160 ~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~-------w~~kT~~eIl~Din~~l~~v~~~s~~---~~~p~-tl~lp~ 228 (343) +-.+.|+-.. .-.|+++++......+.+... ....++.-+.+.+..++...-..-+. 
....+ .++|-+ T Consensus 193 ~a~i~G~G~~-qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~ 271 (381) T protein:vir:95 193 TAFLKGTGKD-QPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNP 271 (381) T ss_pred heeEeccCCC-CceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEcc Confidence 9999995333 346999986543222221111 11222222333444443333221111 11122 345666 Q ss_pred HHHHHHhccccC-CCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecCCchhhc Q lcl|NC_019525. 229 SDYTGLAGAASA-DFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPVDYTST 307 (343) Q Consensus 229 ~~~~~L~~~~~s-~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~~~l 307 (343) ..+..|...... +.++.-+ ..-+.+++|.....+.. +..-.|--.++++.++. -+++..- + . T Consensus 272 ~t~~~l~~~~~~~~~~G~~v----------~~l~~g~~vv~s~~~p~-~~iifgDfs~Y~i~~r~--~~~i~~~-~---~ 334 (381) T protein:vir:95 272 SDAFEVQAQYTHLNANGVYV----------TALPFNLNVIESTVQEA-GKVLTYVKGLYDGYLAG--GINVQKF-K---E 334 (381) T ss_pred ccHHhhccccccCCCCCcee----------ecCCCCceEEecCCCCc-CcEEEEecccEEEEEec--ccEEEee-c---h Confidence 665555321111 1111100 00112233322221111 11111222234444432 2222211 1 1 Q ss_pred chhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 308 LANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 308 ~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) ....++-..|. 
...|++|- ++.|++++++|+-+ T Consensus 335 ~~~~~d~~~f~--a~~r~dg~-~~~~~A~~v~~l~~ 367 (381) T protein:vir:95 335 TLALDDMDLYT--AKQFAYGK-AKDNKVAAVWKLDL 367 (381) T ss_pred hHhhcCCeEEE--EEEEEcCE-EecCceEEEEEEEe Confidence 11223323343 47777774 57899999999988 No 105 >protein:vir:107882 Length: 307 # NCBI annotation: gp34 # Family: family:all:908 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024707;genbank:gi:48696944;genbank:GeneID:2845970 Probab=87.94 E-value=0.035 Score=28.53 Aligned_cols=274 Identities=14% Similarity=0.008 Sum_probs=105.9 Q ss_pred hhhhhhhhhhhHH-HHHHhhhhhhhcccccccchhhcceecCCCCceeEEEeeeccc--hhhh-hhcccccCCccccccc Q lcl|NC_019525. 29 RLCNDLGFEIDVT-TLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTTYRSFS--LAED-FATGIIDTGNSNGKLA 104 (343) Q Consensus 29 ~~~~d~~~~f~~~-qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~~~~vg--~a~~-ia~g~~~~g~~a~Dip 104 (343) +.+.... |.+. .|+.+=- -| +.+++.+..+||+....-+.. .|..|+... ..+- .+.++. .- T Consensus 1 m~~~~~~--~~~dp~LT~~A~-gy--~n~~~ia~~l~P~vpv~~~~~-k~~~f~~eaF~~~~t~r~~~~~--------~~ 66 (307) T protein:vir:10 1 MGRLSKL--RIVDPVLTNLAI-GY--TNAEFIGQSLMPVVEVEKEGG-KIPKFGKESFRLYKTERALRAR--------SN 66 (307) T ss_pred CCCCCCC--cccChhHHHHHH-hh--cchhhhhhhcCCccccccccc-ceeeECcccccchhhhcccCCC--------cc Confidence 2222211 2221 3444332 12 235678888888643322222 122222111 1110 011110 11 Q ss_pred cccccc-cceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHH----HHHhhhhheEeeeeccccceeeeeecC Q lcl|NC_019525. 105 AVDTGV-DAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKK----NWDLGIQEIAFVGMKDNANVKGLLTQT 179 (343) Q Consensus 105 ~vd~~~-~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~----a~~~~~n~v~~~G~~~~~g~~GLlN~p 179 (343) +++-.. +.....+..-+..+ -++. +++..++ .++.++..+.++. ..|...-++++... . 
.| T Consensus 67 ~v~~~~~~~~~~~~~~~~L~~--~id~-r~~~~~~-~~~~~~av~~l~d~I~l~~E~~~A~l~~~~~-~---------y~ 132 (307) T protein:vir:10 67 RMNPEDLGSIDIVLDEHDLEY--PIDY-REDQESA-FPLEQAAVQTATEAIQLRREKMVADLAQNPN-S---------YA 132 (307) T ss_pred eeecccccccccccccccccc--cCCh-hhcCCCC-CCHHHHHHHHHHHHHHHHHHHHHHHHhcCcc-c---------cC Confidence 111110 11111111111111 1111 2222222 3444444433333 33333344444431 1 11 Q ss_pred CceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhcc-----ccC--CCcchhhhhHHH Q lcl|NC_019525. 180 GNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGA-----ASA--DFPIKSTKQVLE 252 (343) Q Consensus 180 ~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~-----~~s--~~~~~tl~~~l~ 252 (343) .--....+|+..|.+++. +++.||.+.+.++...++. .||+++|....|..|.+- ++. ..+..|.. .|+ T Consensus 133 ~~~k~tLsGt~~Wsd~~s-DPi~di~~~~~ai~~~~g~--~Pn~~vlg~~a~~al~~hp~i~e~lk~~~~g~it~~-~la 208 (307) T protein:vir:10 133 GGNKKQLSATEKFTAAGS-DPVGVIEDGKEAIRTKIGR--RPNTMVIGASAYKTLKAHPQLIEKIKYSMKGIVTVD-LLK 208 (307) T ss_pred CCceEEeccccccCCCCC-CcHHHHHHHHHHHHhhhCC--ccceEEeCHHHHHHHhcCHHHHHHhCCccccccCHH-HHH Confidence 111122345678988764 5899999999999998885 699999999999988541 011 11112211 122 Q ss_pred hhcchhcCCcceEEeechhhhhcc--cccCCCccEEEEEEcCc---ceEEEecCCchhhcchhhcCCceEEeceeeeecc Q lcl|NC_019525. 253 DTFKEITRNSSFEILPCVYADKIT--AQVPAVAKRYALYNDNE---DSLRMDIPVDYTSTLANSVNNFQFQNAAYGQFTG 327 (343) Q Consensus 253 ~n~~~~~~g~~l~I~~~~~~~~~~--~~g~gg~drmv~Y~~d~---~~v~~~iP~~~~~l~p~~~~~l~~~v~~~~r~GG 327 (343) +-+ ....+.+....+ +... ....-|.+-.++|.... +.-.+..| .+--..++.+..+..++++ .|| T Consensus 209 ~ll----~v~~i~vg~a~~-~~~~~~~~~iw~~~~vl~yv~~~~~~~~~~~~ep---sfGyT~~~~g~~~~d~~~~-~~~ 279 (307) T protein:vir:10 209 EIF----EVENIAVGEAIY-ADDKDRFTDIWGANIVLAYVPLQRGGQQRTPYEP---SYGYTLRKKGNPVVDTRIE-DGK 279 (307) T ss_pred HHh----CceeEEEeeeee-eccCCccceeCCCceEEEecccccCCCCCccccc---ccceeEEEcCCeEeeceec-CCc Confidence 111 111122111111 0000 00011345555664221 11011111 1111112334555566666 566 Q ss_pred EEEEcCc-----eEEeeecCC Q lcl|NC_019525. 
328 VLAYRPK-----ELLYLDIPV 343 (343) Q Consensus 328 v~v~yP~-----a~~Y~D~~~ 343 (343) +++++-. -+.+-|+-. T Consensus 280 ~~~~r~~~~~~~~i~~~~~G~ 300 (307) T protein:vir:10 280 LELVRSTDIFRPYLLGADAGY 300 (307) T ss_pred eeEEeccccccceeecccccc Confidence 6555322 344444433 No 106 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=87.66 E-value=0.037 Score=28.42 Aligned_cols=289 Identities=8% Similarity=-0.062 Sum_probs=117.9 Q ss_pred CCceeeecCCchHHHHHH---HHhhh---cc-chhhh---hhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCC Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNA---QEAKI---AG-VIQRL---CNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGE 70 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a---~~~~~---~~-~~~~~---~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~ 70 (343) .+.-...+..++...... ...+. .. ...+. ..+.+ ..+ .+.+...|++.......-+.++++.+. T Consensus 72 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~i--p~~~~~~ii~~~~~~~~i~~~~~~~~~- 146 (379) T protein:vir:10 72 AKSEDKSDSLVKSITENFNDIKEVRNGKSIQVKAVGDMTLPVNLT--GAQ--PKDYNFDVVLNPSQMLNVSDIVGAVSI- 146 (379) T ss_pred ccccccchhHHHHHHHHHHhHHHHHhhhhhhhhhhcccccCCCCc--ccc--chhhhhHHHHhHHhhhhHHhhceeeec- Confidence 110001111111111110 00000 00 00000 01111 111 122333344443333333333332211 Q ss_pred CCceeEEEe--eecc--chhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHH Q lcl|NC_019525. 71 GAWSTMLTT--YRSF--SLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITAL 146 (343) Q Consensus 71 ~~w~~~~~~--~~~v--g~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k 146 (343) ....+.+ .... +.+.+++ . .+.+|..+..+++....++.++.-+.+|.+=|+-+.. 
|.+-- T Consensus 147 --~~~~~~~~~~~~~~~~~~~~v~-------E-g~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~D~~~-----l~~~i 211 (379) T protein:vir:10 147 --SGGTYTFVRENGAGEGAIGAQV-------E-GATKGQKDYDISMIDVNTDFIAGFTRYSKKMANNLPF-----LTSFI 211 (379) T ss_pred --cCCceEEEEeecCCCccccccc-------C-CccccccccceeeeEeeeeeEEeeehhhHHHHhhHHH-----HHHHH Confidence 1222222 1111 1222232 2 2478888999999999999999988888765554322 44444 Q ss_pred HHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEe Q lcl|NC_019525. 147 EKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTI 226 (343) Q Consensus 147 ~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~l 226 (343) .....+++...+|...+.|... .+..+.+ +.. ...+ +++|.+++..+... + ..++.++| T Consensus 212 ~~~la~~~~~~~~~~~~~g~~~-~~~~~~~--------~~~-----~~~~----~d~i~~~~~~~~~~-~--~~~~~~vm 270 (379) T protein:vir:10 212 PNALRRDYAKAENAAFNAVLAA-NATASTE--------IIT-----NKNK----VEMLINEIAKQENL-D--FPVTAIVL 270 (379) T ss_pred HHHHHHHHHHHHHHHHhccccc-ccccccc--------ccc-----Cccc----HHHHHHHHHhhhhc-c--CCCCEEEE Confidence 4444555555555544444211 1111110 010 0112 34556655554332 2 35678999 Q ss_pred CHHHHHHHhccccCCCcchhhhh--HH-HhhcchhcCCcceEEeechhhhhcccccCCC-ccEEEEEEcCcceEEEecCC Q lcl|NC_019525. 227 PESDYTGLAGAASADFPIKSTKQ--VL-EDTFKEITRNSSFEILPCVYADKITAQVPAV-AKRYALYNDNEDSLRMDIPV 302 (343) Q Consensus 227 p~~~~~~L~~~~~s~~~~~tl~~--~l-~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg-~drmv~Y~~d~~~v~~~iP~ 302 (343) -|..|..|..-+ ++.+.-++. +. +...+..-.|.|+-+ ..++. .+..-.|. +...+++++ -+.+++-. 
T Consensus 271 n~~~~~~l~~lk--d~~G~~l~~~~~~~~~~~~~~l~G~pvv~--s~~~~-ag~~~~gdf~~~~~~~~~---~~~i~~~~ 342 (379) T protein:vir:10 271 RPTDYYDILVTQ--KSVGAGYGLPGVVTQDNGVLRINGIPLFR--ATWLA-ANKYYVGDWTRVTKVTTE---GLSLEFSE 342 (379) T ss_pred cHHHHHHHHHhh--ccCCceeccCCccCCCCCcceecceeeEe--cCCCC-CCceEEeecccEEEEEEe---ceEEEEee Confidence 999999986443 433333321 00 000111122555432 22222 11111111 111222222 22232211 Q ss_pred chhhcchhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 303 DYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 303 ~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) .. ....+++ ...+-++.|+|+ .++.|.++++++++= T Consensus 343 ~~--~~~f~~~--~~~~r~~~R~~~-~v~~p~a~v~~~~~~ 378 (379) T protein:vir:10 343 VE--GTNFVKN--NITARIEAQVAL-AVEQPAALIFGDFTA 378 (379) T ss_pred cc--cccccCC--cEEEEEEEEecc-EEecCccEEEEEecC Confidence 10 1113343 233446788865 556799999999987 No 107 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=87.58 E-value=0.038 Score=28.38 Aligned_cols=295 Identities=9% Similarity=-0.013 Sum_probs=130.7 Q ss_pred CCceeeecCCchHHHHHHHHhh-hccchhhhh---hhhhhhhhHHHHHHhhhhhhhcccccccchhhcceec-CCCCcee Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAK-IAGVIQRLC---NDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRV-GEGAWST 75 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~-~~~~~~~~~---~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~-~~~~w~~ 75 (343) .+.-..+....+.-..++...- .......+. ...| .+.+. +.+...|++...+...-+.++++.. ..+.... 
T Consensus 61 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~t~~~g-g~~vP--~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~ 137 (371) T protein:vir:81 61 DKEPLKPTVQVKENEVEAFVNHIRTRFRNAMSEGSNQDG-GYTVP--QDIQTRINELRESKDALQNLITVEPVTTLSGSR 137 (371) T ss_pred cccccccchhhHHHHHHHHHHHHHHHHHHhhccCCCccC-ceeec--HhHHHHHHHHHHhhhhhhhhceeeeccCCceeE Confidence 1111111111111111111000 011111111 1112 12222 1233445555444444444444321 1111222 Q ss_pred EEEeeeccchhhhhhcccccCCcccccccc-ccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHH Q lcl|NC_019525. 76 MLTTYRSFSLAEDFATGIIDTGNSNGKLAA-VDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNW 154 (343) Q Consensus 76 ~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~-vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~ 154 (343) .+......+.+.+++.| +.+|. -+..+.+.+...+.++.-+.+|.+=|+.+. .+|.+--.+...+++ T Consensus 138 ~~~~~~~~~~a~~v~Eg--------~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~a~ 205 (371) T protein:vir:81 138 VFKKRSQQTGFVEVAEG--------AAIGEKATPQFTLLQYQVKKYAGFFRVTNELLNDST----EAIVNTLVRWIGDES 205 (371) T ss_pred EEEeecCCcceeeeccc--------cccccccccceeeEEeeeeEEEEeehhhHHHHhhhh----HHHHHHHHHHHHHHH Confidence 22222233445555533 35664 346889999999999999988887776443 356666777777777 Q ss_pred HhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHH Q lcl|NC_019525. 155 DLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGL 234 (343) Q Consensus 155 ~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L 234 (343) ...+|...+.|..... -.| ..+.+.|...++..+..... 
....++|-|+.|..| T Consensus 206 ~~~~~~~i~~g~g~~~-~~~-------------------~~~~~~i~~~~~~~l~~~~~------~~a~~vmn~~~~~~L 259 (371) T protein:vir:81 206 RVTRNGLIINVLNTKA-KTA-------------------IADLDGLKQIINVQLDPVFR------STSSVIVNQDAFNWL 259 (371) T ss_pred HHHHHHHHHhhccccc-ccc-------------------cccHHHHHHHHHhhcchhhh------cCCEEEEcHHHHHHH Confidence 7888888888842210 011 13455555555544322211 234688999999999 Q ss_pred hccccCCCcchhhhh-HHHhhcchhcCCcceEEeechhhhhc-ccccCC-CccEEEEEEcCcceEEEec--CCchhhcc- Q lcl|NC_019525. 235 AGAASADFPIKSTKQ-VLEDTFKEITRNSSFEILPCVYADKI-TAQVPA-VAKRYALYNDNEDSLRMDI--PVDYTSTL- 308 (343) Q Consensus 235 ~~~~~s~~~~~tl~~-~l~~n~~~~~~g~~l~I~~~~~~~~~-~~~g~g-g~drmv~Y~~d~~~v~~~i--P~~~~~l~- 308 (343) ..-+ ++.+.-++. =+....+..-.|.|+.+...-- .+. ...+.+ +...+++-+-. +.+.+-. .+...... T Consensus 260 ~~lk--d~~g~~l~~~~~~~~~~~~l~G~pV~~~~~~~-~~~~~~~~~~~~~~~i~~Gd~~-~~~~~~~~~~~~i~~~~~ 335 (371) T protein:vir:81 260 DTLK--DQNGQYLLQPSISSPTGRQLLGLPVVIVSNKV-LANRVDGGTGAQFAPIIVGDLK-EAVVMFDRQRTEIMSSNV 335 (371) T ss_pred HHhh--ccCCCeeeecccCCCCCceecceeEEEecccc-cCccccccccCCcceEEEEehh-ceEEEEeecceEEEEecc Confidence 6543 333322211 0111112222466665432110 010 011111 12222222211 1122111 11111111 Q ss_pred ---hhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 309 ---ANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 309 ---p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) .++++ ...+-++.|+++ .++.|.+++.+++.. 
T Consensus 336 ~~~~f~~~--~v~~~~~~r~d~-~~~~~~a~~~~~~~~ 370 (371) T protein:vir:81 336 AMDAFETD--ATLWRAIERMDV-KMRDDEAFVFGEVQL 370 (371) T ss_pred ccchhhcC--ceEEEEEEeecc-EEecccceEEEEEec Confidence 12232 233456778877 567799999999999 No 108 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=86.66 E-value=0.044 Score=28.02 Aligned_cols=297 Identities=11% Similarity=0.063 Sum_probs=135.6 Q ss_pred CCceeeecCCchHHHHHH-HHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEe Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNA-QEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTT 79 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a-~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~ 79 (343) |. .|.+.+= +.+.....+.....+.|+.......+.+-..+.+. -+=|...+.+|++... ..+. T Consensus 1 ~~---------~k~~~~~l~~~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~-s~~l~~i~v~~v~~~~----~~i~- 65 (321) T protein:vir:31 1 MA---------SRTINNDLSRITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEE-TPLLDAIRTETVGAKK----TRIP- 65 (321) T ss_pred Cc---------hHHHHHHHHHHHHhccccccccCCcceeCHHHHHHHHHHHHHh-hhhhhhceeeeccCcc----eeee- Confidence 21 1222221 11111223333345566666555556666666654 2334455555553221 1121 Q ss_pred eeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhh Q lcl|NC_019525. 
80 YRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQ 159 (343) Q Consensus 80 ~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n 159 (343) ..+.+..+...+ .......+.-+..+++.....++...-...|.+-|+..+ .| .++.+.-.....+++...++ T Consensus 66 --~~~~~~~~~~~~---~e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a-~~-~d~e~~i~~~ia~~~a~~~~ 138 (321) T protein:vir:31 66 --TLNIGERHRRPQ---DEGEWNENESDVSTGTIDISTEKATVAWDLPREVVQENP-EG-EALADRILNLMTDAWSADVE 138 (321) T ss_pred --eeccCCcccccc---cccccccccccceeeeeeeeeEEEEeehhccHHHHHhhh-cc-hhHHHHHHHHHHHHHHHHHH Confidence 112221111100 011223444555677777888888887778888777654 34 47999999999999999999 Q ss_pred heEeeeeccccce------eeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCC-eEEeCHHHHH Q lcl|NC_019525. 160 EIAFVGMKDNANV------KGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPN-KFTIPESDYT 232 (343) Q Consensus 160 ~v~~~G~~~~~g~------~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~-tl~lp~~~~~ 232 (343) +++|+|+... .- .|+++.+.-.+.+. +..-.+.+.+ .+.|+...|... |...++ ..+|.+..+. T Consensus 139 ~~~~nGd~~~-~~~~~~~n~G~l~~a~~~~~~~--~~~~~~~~~d-~l~~l~~~l~~~-----yr~~~~~v~im~~~~~~ 209 (321) T protein:vir:31 139 DLAANGDEDA-EDSFENQNDGFITVAEGDVETI--DAADDILDND-LVIRTIAGLDSK-----YRARMNPALIVSEDQLL 209 (321) T ss_pred hheeeccccC-CCcccccchhhhhhhccccccc--cccccccCHH-HHHHHHHhccHh-----HhcCCCeEEEechHHHH Confidence 9999995321 11 36655432111111 1111122333 334444443322 222333 3567766655 Q ss_pred HHhccccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEec--CCchhhc--c Q lcl|NC_019525. 233 GLAGAASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDI--PVDYTST--L 308 (343) Q Consensus 233 ~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~i--P~~~~~l--~ 308 (343) .+....... ++-.....+.......-.|.|+.+.|- .. .+.+++- |.+++.+-+ -+..++. . 
T Consensus 210 ~~~~~l~~~-~~~~~~~~l~~~~~~tl~G~pvv~~~~--mP---------~~~il~t--~~~nl~~~~~~~~~~~~~~~~ 275 (321) T protein:vir:31 210 SYHYTLTDR-DTPLGDNVIMGEADVNPFSFPIIGSGL--WP---------DDKAMFT--DPQNLIYALYRDLEIDVLTES 275 (321) T ss_pred HHHHHHhcC-CCccccchhhccccccccceeEEEcCC--CC---------CCcEEEe--ccccEEEEEeeccEEEEeecC Confidence 443322222 221222223322222234777655431 11 1223332 234433222 2222221 1 Q ss_pred h-hhcCCceEEeceeeeeccEEEEcCceEEee-ecCC Q lcl|NC_019525. 309 A-NSVNNFQFQNAAYGQFTGVLAYRPKELLYL-DIPV 343 (343) Q Consensus 309 p-~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~-D~~~ 343 (343) + ......++...+..+++.+. .-+.+++.+ |+|+ T Consensus 276 ~~~~~~~~~~~~~~~~~~~~~v-e~~~a~a~~~~i~~ 311 (321) T protein:vir:31 276 DKVSERDLHARYFMRGDDDFAI-ENTEAVVLAEGLGD 311 (321) T ss_pred ccccccceeeEeeeeeecceeE-eccccEEEEecCCc Confidence 1 11223455555555666554 445555555 5888 No 109 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=83.73 E-value=0.066 Score=27.05 Aligned_cols=289 Identities=9% Similarity=-0.029 Sum_probs=112.3 Q ss_pred CCce---eeecCCchHHHHH----HHHhhh-----ccchhh-hhhhhhhhhhHHHHHHhhhhhhhcccccccch---hhc Q lcl|NC_019525. 1 MKKF---VIRNSKGEKILLN----AQEAKI-----AGVIQR-LCNDLGFEIDVTTLTTLMKKIIEQKFFEISPA---DYM 64 (343) Q Consensus 1 ~~~~---~~~~~~~~~~~~~----a~~~~~-----~~~~~~-~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~---~~~ 64 (343) .+.. -..+...+..... +....+ ....+. .....| .+++. 
+.+...|++...+.-.-+ +.+ T Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ra~~t~~~g-g~liP--~~~~~~Ii~~~~~~~~l~~l~~~~ 151 (421) T protein:vir:13 75 TNFTGGRVIINGDSKEEKRSLQLSAMSKTIRGIQLSEEERDIMSSTNN-GAVIP--QEFVNEFEKLKEGYPSLKEHCHVI 151 (421) T ss_pred hcccccccccccchhHHHHHHHHHHHHHhhhccchhHHHhhccccCCc-ceecc--hhhHHHHHHHHHhhhhhhhhceee Confidence 0000 0000110111100 100000 000011 111112 22333 222333444333332333 334 Q ss_pred ceecCCCCceeEEEeeeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHH Q lcl|NC_019525. 65 PIRVGEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLIT 144 (343) Q Consensus 65 pv~~~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~ 144 (343) |+.+.. ....+.......-+.+++ .. .++|.-+..+.+....++.++.-+.+|.+=|+.+. .+|.+ T Consensus 152 ~~~~~~--~~~~~~~~~~~~~~~~~~-------E~-~~~~~s~~~f~~i~~~~~k~~~~v~iS~ell~ds~----~~l~~ 217 (421) T protein:vir:13 152 PVNRNA--GKMPVRAGASVDKLANLA-------KD-TELVKAMLKTQPMAYDIDDYGLLAPIDNSLLEDSE----INFLE 217 (421) T ss_pred eccCCc--eEEEEeecCCccceeecc-------cc-ccccccccceeEEEeeeeeeEeehhhhHHHHhhhH----HHHHH Confidence 443221 211111111111122222 22 35777788889999999999988888876555432 23444 Q ss_pred HHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeE Q lcl|NC_019525. 145 ALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKF 224 (343) Q Consensus 145 ~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl 224 (343) --...+.+++...+|.-+. 
+ ...|+++.+++ .+.+.|.+-++.+....+ .+..+ T Consensus 218 ~i~~~la~~~~~~~~~~i~-~-----~~~g~~~~~~~-------------~~~d~i~~~~~~l~~~~~-------~~a~~ 271 (421) T protein:vir:13 218 FVNEEFAEFAVNTENAEIV-K-----QAKAVLAEETI-------------NDYAGLVKTINSLVPNAR-------KRAII 271 (421) T ss_pred HHHHHHHHHHHHHhhhhHh-h-----hhhhccccccc-------------cchHHHHHHHHHhhhhhc-------CCCEE Confidence 4444444444444443211 1 13344433221 234555555444433221 34678 Q ss_pred EeCHHHHHHHhccccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEec--CC Q lcl|NC_019525. 225 TIPESDYTGLAGAASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDI--PV 302 (343) Q Consensus 225 ~lp~~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~i--P~ 302 (343) +|.+..|..|..- -++.+.-+..-..+.....-.|.|+.+.+-.+ .+.++...+++.+- .+.+.+-. .+ T Consensus 272 v~n~~~~~~l~~l--kd~~G~~i~~~~~~~~~~tl~G~pV~~~~~~~------~~~~~~~~~~~gd~-~~~~~~~~~~~~ 342 (421) T protein:vir:13 272 VTNSDGRAYLDGL--MDKQGRPLLKELSDGGDLVFKGRPVIELEESI------FDVGDETKFIVSDF-KTLIKFMDRKQY 342 (421) T ss_pred EEcHHHHHHHHHh--hcCCCceeecCcCCCCCceecceeeEEecccc------ccCCCceEEEEEec-cccEEEEEecce Confidence 9999999999643 34444333221111111122355554333111 11222222332222 22222211 12 Q ss_pred chhhc--chhhcCCceEEeceeeeeccEEEE----------cCceEEe-eecCC Q lcl|NC_019525. 303 DYTST--LANSVNNFQFQNAAYGQFTGVLAY----------RPKELLY-LDIPV 343 (343) Q Consensus 303 ~~~~l--~p~~~~~l~~~v~~~~r~GGv~v~----------yP~a~~Y-~D~~~ 343 (343) .+... .-..++ ...+-++.|+++..+. .|.+++. .|.|. 
T Consensus 343 ~v~~~~~~~f~~~--~~~~r~~~r~d~~~~~~~a~~~~~~~~~~a~v~~~~~~~ 394 (421) T protein:vir:13 343 LIDQSKEAGYTKN--ETIARIIERFDVNSPLDKSSDAEKIRKFGVIVKLQEVLK 394 (421) T ss_pred EEEeecccccccC--eeEEEEEeeecceeecchhhheeeecccceeeccccccC Confidence 22211 111222 2234457788777543 3333333 36666 No 110 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=83.68 E-value=0.066 Score=27.04 Aligned_cols=257 Identities=9% Similarity=0.010 Sum_probs=107.5 Q ss_pred hcc-eecCCCCceeEEEeeeccchhh--hhhcccccCCccccccc--cccccccceeeeeEEEEeeEeecHHHHHHHHHh Q lcl|NC_019525. 63 YMP-IRVGEGAWSTMLTTYRSFSLAE--DFATGIIDTGNSNGKLA--AVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKS 137 (343) Q Consensus 63 ~~p-v~~~~~~w~~~~~~~~~vg~a~--~ia~g~~~~g~~a~Dip--~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~ 137 (343) ++. |+ + ..++. ++..|..+ ....| . +|. .-+..-.+....|-. ..-+..-++++..+|.. T Consensus 1 ~vr~i~-~----g~s~~-~~~iG~~~~~~~~~G-----~---~l~~~~~~~~~~e~~itID~-~l~~~~~VdDiD~~qa~ 65 (324) T protein:vir:99 1 MTRTIT-S----GKSAQ-FPVMGRTKARYLKQG-----Q---SLDDGREDIKHTEKVITIDG-LLTTDVLIYDIEDAMNH 65 (324) T ss_pred Ceeeee-c----CceEE-EeeeeeeEeccccCC-----C---CcCCCcCCcCcccEEEEecc-hhhhhhhhhhHHHHhcC Confidence 111 11 1 11221 11122221 11111 1 111 111122222222111 01122346777777653 Q ss_pred cCCCcHHHHHHHHHHHHHhhhhheEe-----eee-ccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHH Q lcl|NC_019525. 138 GNWDLITALEKSRKKNWDLGIQEIAF-----VGM-KDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVY 211 (343) Q Consensus 138 Gr~~L~~~k~~aar~a~~~~~n~v~~-----~G~-~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v 211 (343) .++-+.-.+.+-.++.+..|+.++ ... 
.+...-.+.....+......+++..-+..+++.+.+-|.++...+ T Consensus 66 --~Dlr~e~s~~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~L 143 (324) T protein:vir:99 66 --YDVRSEYSTQMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAF 143 (324) T ss_pred --ccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHH Confidence 466666666666667777775443 110 110111122222232333334444555678888888888777776 Q ss_pred HhcCCceecCCeEEeCHHHHHHHhccccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhc-cc------------- Q lcl|NC_019525. 212 RQGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKI-TA------------- 277 (343) Q Consensus 212 ~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~-~~------------- 277 (343) ....=-. ..-.++++|++|..|...+.-......-..-+.+-....-.|. .|..+..+... ++ T Consensus 144 de~~VP~-~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf--~V~~Sn~lp~~~~t~~~~a~~~~~~~~ 220 (324) T protein:vir:99 144 AKKYIPA-GDRTFYTDPDTYSAILAALMPNAANYAALIDPETGNIRNVMGF--EVVETPHMTAQMVTNPTDAFDGTGHIF 220 (324) T ss_pred hhcCCCC-CCCEEEeChHHHHHHhhcccccccccccccceecceEEEEece--EEEecCCcccccccccccccccccccc Confidence 5543221 1236999999999886554332221100011111100000111 11111111110 00 Q ss_pred ------------ccCCCccEEEEEEcCcc-eEEEecCCchhhcchhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 278 ------------QVPAVAKRYALYNDNED-SLRMDIPVDYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 278 ------------~g~gg~drmv~Y~~d~~-~v~~~iP~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) .+..++-+.+++.++.= .++. +++..+..-...+ ..++.+.++. . 
|..+.+|++++.+-+|- T Consensus 221 ~~~~~~~~~~ky~~d~~~~~gl~~~~~a~~tv~~-~~~~~e~~~~~~~-~~d~i~~~~a-~-G~~~lRPe~a~~v~l~~ 295 (324) T protein:vir:99 221 PATGDSTTTGKMTVGADNVVGLFVHRSAVATLKL-KDMALERARRPEY-QADQIIAKYA-M-GHGGLRPEAVGAIIFED 295 (324) T ss_pred ccccccccccccccccCceeEEEEehhheEEEee-ecceecceechhh-HHHhhhhhhh-h-cCcccccceEEEEEEcc Confidence 01112334466655532 2222 2322222222222 3455565553 3 77888999999988877 No 111 >protein:vir:79078 Length: 307 # NCBI annotation: gp8 # Family: family:all:908 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111208;genbank:gi:134288798;genbank:GeneID:4960752 Probab=83.36 E-value=0.069 Score=26.95 Aligned_cols=278 Identities=14% Similarity=0.037 Sum_probs=103.4 Q ss_pred hhhhhhhhhhhH-HHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEeeeccchhhhhhcccccCCcccccccccc Q lcl|NC_019525. 29 RLCNDLGFEIDV-TTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVD 107 (343) Q Consensus 29 ~~~~d~~~~f~~-~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd 107 (343) +.+.... |.+ ..|+.+=. -|. .+++.+..+||.....-.-. .|..|+....... .-....+..++.+... T Consensus 1 m~~~~~~--~~~dp~LT~~A~-gy~--n~~~Iad~lfP~vpV~~~~~-k~~~f~~e~f~~~--~t~ra~~~~~~~v~~~- 71 (307) T protein:vir:79 1 MGRLSKL--RIVDPVLTNLAI-GYT--NAEFIGQTLMPVVEVEKEGG-KIPKFGKESFRLY--QTERALRAKSNRMNPE- 71 (307) T ss_pred CCCCCCC--cccCHHHHHHHh-hcc--chhhhhhhcCCccccccccc-ceeeecccccccc--ccccccCCCcceeeee- Confidence 2222222 222 13443332 222 46688888888543221111 1222221110000 0000000111111111 Q ss_pred ccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHH----HHHHHHhhhhheEeeeeccccceeeeeecCCcee Q lcl|NC_019525. 108 TGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKS----RKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVV 183 (343) Q Consensus 108 ~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~a----ar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~ 183 (343) +.+.....+..-+.. +-++. +...-. .+++.+++.+. +.+..|...-+++|.+. . .+.-.. 
T Consensus 72 -~~~~~~~~~~~~~l~--~~id~-r~~~~~-~~~~~~~Av~~l~d~I~l~~E~~~A~l~~~~~-~---------y~~~~k 136 (307) T protein:vir:79 72 -DIDSVDVNLDEHDLE--YPIDY-REDQES-AFPLEQAAVQTATDAIQLRREKMIADLSQNPS-S---------YAAGNK 136 (307) T ss_pred -ccccccccccccchh--hcccc-hhcCCC-CCCHHHHHHHHHHHHHHhHHHHHHHHHhcccc-c---------cCCCce Confidence 111112222222221 11111 111111 24444443333 33444444445555441 1 111111 Q ss_pred ecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhcc-----ccC--CCcchhhhhHHHhhcc Q lcl|NC_019525. 184 NNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGA-----ASA--DFPIKSTKQVLEDTFK 256 (343) Q Consensus 184 ~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~-----~~s--~~~~~tl~~~l~~n~~ 256 (343) .+.+|+..|.+++ .+++.||.+.+.++...++. .||+++|....|..|.+- ++. ..+..|. +.|++-. T Consensus 137 ~tLsgt~~Wsd~~-sDPi~di~~~~~ai~~~~g~--~Pn~~vlg~~a~~~l~~h~~i~~~lk~~~~g~it~-~~la~l~- 211 (307) T protein:vir:79 137 KQLSATEKFTAAN-SDPVGVIEDGKEAIRTKIGR--RPNTMVIGASAYKTLKAHPQLIEKIKYSMKGIVTV-DLLKEIF- 211 (307) T ss_pred EEEccCcccCCCC-CCcHHHHHHHHHHHHHhhCC--ccceEEeCHHHHHHHhcCHHHHHHhcCccccccCH-HHHHHHh- Confidence 2234566798876 45899999999999998885 699999999999988541 111 1112222 1222211 Q ss_pred hhcCCcceEEeechhhhhc-ccccCCCccEEEEEEcCc-ceE--EEecC-CchhhcchhhcCCceEEeceeeeeccEEEE Q lcl|NC_019525. 257 EITRNSSFEILPCVYADKI-TAQVPAVAKRYALYNDNE-DSL--RMDIP-VDYTSTLANSVNNFQFQNAAYGQFTGVLAY 331 (343) Q Consensus 257 ~~~~g~~l~I~~~~~~~~~-~~~g~gg~drmv~Y~~d~-~~v--~~~iP-~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~ 331 (343) ....+-+....+-... .....-|.+-.++|.... .+- .+.-| .-++. +..+.....++++ .|+++++ T Consensus 212 ---~v~~V~vg~a~y~~~~~~~~~iw~~~~~l~y~~~~~~~~~~~~~~ps~Gyt~----~~~g~~~~d~~~~-~~~~~~v 283 (307) T protein:vir:79 212 ---EVENIAVGEAIYADDKDRFTDIWGANIVLAYVPLQRGGQQRTPYEPSYGYTL----RKKGNPVVDTRIE-DGKLELV 283 (307) T ss_pred ---CceeEEEeeeeeecccccchhcCCCceEEEecccccCCCCCcccccccceeE----EecCceEEecccC-CCceeEE Confidence 1111222111110000 000012345555554221 000 00001 11111 2222333344444 3555553 Q ss_pred cCc-----eEEeeecCC Q lcl|NC_019525. 
332 RPK-----ELLYLDIPV 343 (343) Q Consensus 332 yP~-----a~~Y~D~~~ 343 (343) +-. -+..-|+-. T Consensus 284 rv~~~~~~~i~~~~~G~ 300 (307) T protein:vir:79 284 RATDIFRPYLLGADAGY 300 (307) T ss_pred eecccccceeeccccch Confidence 221 233333222 No 112 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=83.00 E-value=0.072 Score=26.84 Aligned_cols=290 Identities=10% Similarity=-0.001 Sum_probs=126.9 Q ss_pred CCc-----ee--eecCC-chHHHHHHHHhhhccchh--hhhhhhhhhhhHHHHHHhhhhhhhcccccccchhh---ccee Q lcl|NC_019525. 1 MKK-----FV--IRNSK-GEKILLNAQEAKIAGVIQ--RLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADY---MPIR 67 (343) Q Consensus 1 ~~~-----~~--~~~~~-~~~~~~~a~~~~~~~~~~--~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~---~pv~ 67 (343) +++ .. .++.. ..+-..++.......... .....+|. +++. +.+...|++...+.-.-+.+ +|+. T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg-~~vP--~~~~~~ii~~~~~~~~l~~~~~~~~~~ 147 (395) T protein:vir:38 71 LNAEPVNKKPLPVKDGKPDAQAMKNQFVKDFKNLVTSGTTGTGNAG-LTIP--EDIQLQIRTLTRSFTSLESLANVENVT 147 (395) T ss_pred hhhccccccccchhhhhHHHHHHHHHHHHHHHHHHhhccCccCCCc-eecc--hhHhhHHHHHHHhhcchhhhcceeecc Confidence 000 00 00000 000011110000000000 00111222 2222 12233455544333333333 4544 Q ss_pred cCCCCceeEEEeee-ccchhhhhhcccccCCccccccccc-cccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHH Q lcl|NC_019525. 68 VGEGAWSTMLTTYR-SFSLAEDFATGIIDTGNSNGKLAAV-DTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITA 145 (343) Q Consensus 68 ~~~~~w~~~~~~~~-~vg~a~~ia~g~~~~g~~a~Dip~v-d~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~ 145 (343) +..+. ..+.... .-+.|.+++.+ ..+|.. ...+++.....+.++.-+.+|.+=|+. 
++ .+|.+- T Consensus 148 ~~~~~--~~~~~~~~~~~~a~~v~E~--------~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~d---s~-~~l~~~ 213 (395) T protein:vir:38 148 TSHGS--RVYEKLADITPLKDLDDES--------ALIGDNDDPELTVVKYLIHRYAGITTVTNTLLKD---TV-DNIIQW 213 (395) T ss_pred CCcce--EEEEeeccCCccccccccc--------cccccccccceeeEEeeeeeeEeehhhHHHHHhh---hH-HHHHHH Confidence 33322 2221111 12344455432 245644 357888888999988888877654443 22 357777 Q ss_pred HHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEE Q lcl|NC_019525. 146 LEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFT 225 (343) Q Consensus 146 k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~ 225 (343) -.....+++...+|+.+++|.....+..| ..+.+.|.+-++..+.... .....++ T Consensus 214 i~~~la~~~~~~~~~~il~g~g~~~~~~~-------------------~~~~~~i~~~~~~~l~~~~------~~~a~~v 268 (395) T protein:vir:38 214 LVNWAAKKDVVTRNAKILEVMGKAPKKPT-------------------ISQFDNIKDLENNTLDPAI------ESTSSFI 268 (395) T ss_pred HHHHHHHHHHHHHHHHHhhcccccccccc-------------------cccHHHHHHHHHHhhhhhh------cCCCEEE Confidence 77778888888888888888533221111 1234455544443332211 1233689 Q ss_pred eCHHHHHHHhccccCCCcchhhhh-HHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEe--cCC Q lcl|NC_019525. 226 IPESDYTGLAGAASADFPIKSTKQ-VLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMD--IPV 302 (343) Q Consensus 226 lp~~~~~~L~~~~~s~~~~~tl~~-~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~--iP~ 302 (343) |.|+.|..|..-+ ++.+.-++. -+.+.....-.|.|+.+.. 
.....+.+++..++ |-+-.+.+.+- -.+ T Consensus 269 ~n~~~~~~L~~lk--d~~G~~l~~~~~~~~~~~~l~G~pV~~~~-----~~~~~~~~~~~~i~-~gd~~~~~~i~~~~~~ 340 (395) T protein:vir:38 269 TNQSGYNILSKVK--DADGRYLMQPDVTSPDKYLIDGKPVIRIA-----DKWLPDVSGSHPLY-FGDLKQGITLFDRQQM 340 (395) T ss_pred EcHHHHHHHHHhh--ccCCceeeccCcCCCCcceeccceeEEec-----ccccCcCCCcceEE-EEeccccEEEEEecce Confidence 9999999996543 433433321 1112222222355544332 11111122333333 32222222221 122 Q ss_pred chhhcc----hhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 303 DYTSTL----ANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 303 ~~~~l~----p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) ...... -.+++ ...+-++.|+++. +..|.+++.+++-. T Consensus 341 ~i~~~~~~~~~~~~~--~~~~r~~~r~d~~-~~~~~a~~~~~~~~ 382 (395) T protein:vir:38 341 QIDTTNVGAGSFEHD--TTKLRFIDRFDVQ-LIDDGAFAAASFKT 382 (395) T ss_pred EEEEeccccchhhcC--ceEEEEEEeeccE-EecccceEEEEeec Confidence 222111 01222 2334568888875 55599999999876 No 113 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=80.53 E-value=0.094 Score=26.21 Aligned_cols=281 Identities=11% Similarity=0.019 Sum_probs=114.9 Q ss_pred CCceeeecCCc-------------------------hHHHHHHHH--hhhccchhhhhhhhhhhhhHHHHHHhhhhhhhc Q lcl|NC_019525. 1 MKKFVIRNSKG-------------------------EKILLNAQE--AKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQ 53 (343) Q Consensus 1 ~~~~~~~~~~~-------------------------~~~~~~a~~--~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~ 53 (343) -+.-...+... ......+.. ..............|. +++.+ .+...|++. 
T Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg-~liP~--~~~~~ii~~ 151 (394) T protein:vir:97 75 IGGKEVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAK-PVSSE--EILYTPARE 151 (394) T ss_pred ccccccchhhHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHhhhhhhhhcccccccccc-ccChH--HHHHHHHHH Confidence 00000000000 000000000 0000000011111121 22222 233345544 Q ss_pred cccccc---chhhcceecCCCCceeEEEeee-ccchhhhhhcccccCCccccccccc-cccccceeeeeEEEEeeEeecH Q lcl|NC_019525. 54 KFFEIS---PADYMPIRVGEGAWSTMLTTYR-SFSLAEDFATGIIDTGNSNGKLAAV-DTGVDAVSIKTYKWAKTIGWTL 128 (343) Q Consensus 54 ~~~~l~---~~~~~pv~~~~~~w~~~~~~~~-~vg~a~~ia~g~~~~g~~a~Dip~v-d~~~~~~~~~v~~~g~~~~ys~ 128 (343) ..+.-. ..+.+|+.+..+.|. ..+ .-+.+.+++.+ ...|.. +..+++.+...+.++.-+.+|. T Consensus 152 ~~~~~~l~~~~~~~~~~~~~~~~~----~~~~~~~~~~~v~E~--------~~~~~~~~~~~~~v~l~~~k~~~~i~is~ 219 (394) T protein:vir:97 152 VKTVVDLKPFTTVYQAKKASGKYP----VLQRATTKMVTVAEL--------EKNPALAKPDFKDVAWNIDTYRGAIPLSQ 219 (394) T ss_pred hhhhhhhhhhceeeeccCcceEEE----EEecCCCccceeccc--------ccccccccccceeEEeehhheeeehhhHH Confidence 333322 334445433322221 221 11233344322 245543 3578888888888888777777 Q ss_pred HHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHH Q lcl|NC_019525. 129 PELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGII 208 (343) Q Consensus 129 ~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l 208 (343) +=|+.+. .+|.+--.....+++...+|.....|.... ++ -...+.++|.+.++..+ T Consensus 220 ell~ds~----~~~~~~i~~~la~~~~~~~~~~i~~g~~~~-------------------~~-~~~~~~~~~~~~~~~~~ 275 (394) T protein:vir:97 220 ESIDDAD----VDLVGIVSESISQIKVNTTNDAIAKVLKSF-------------------TT-KTVKNLDEIKALLNGGF 275 (394) T ss_pred HHHhhhh----HHHHHHHHHHHHHHHHHHHHHHHhhccccc-------------------cc-cccccHHHHHHHHHhhh Confidence 6555332 245555555555555556665555552110 00 11234555555555443 Q ss_pred HHHHhcCCceecCCeEEeCHHHHHHHhccccCCCcchhhhh-HHHhhcchhcCCcceEEeechhhhhcccccCCCccE-E Q lcl|NC_019525. 
209 DVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTKQ-VLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKR-Y 286 (343) Q Consensus 209 ~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~-~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~dr-m 286 (343) .... ...++|.|..|..|..-+ ++.+.-+.. -+.+.....-.|.|+.+.+..++ +.+..-.|.-.. . T Consensus 276 ~~~~--------~a~~v~n~~~~~~l~~lk--d~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~-~~~~~~~gd~~~~~ 344 (394) T protein:vir:97 276 DPAY--------NVSLIVSQSFYQTLDTLK--DGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVL-GANKAFIGDFKRGV 344 (394) T ss_pred hhhh--------CCEEEEcHHHHHHHHHhh--ccCCCeeeecCcCCCCCceeccceeEEeccccc-CCccEEEeeccccE Confidence 2211 246899999999996533 433332221 11111122234667655432111 100000111111 2 Q ss_pred EEEEcCcceEEEecCCchhhcchhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 287 ALYNDNEDSLRMDIPVDYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 287 v~Y~~d~~~v~~~iP~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) .++++.. +.+ ... ......-. +-+++|++| .+..|.+++.+++.- T Consensus 345 ~~~~~~~--~~~------~~~-~~~~~~~~--~~~~~r~d~-~v~~~~a~~~~~~~~ 389 (394) T protein:vir:97 345 LFADRKD--LGL------RWA-DNEIYGQY--LQAVLRFGV-SKVDDKAGYYVTFTP 389 (394) T ss_pred EEEEecc--eEE------EEe-ccccccee--EEEEEEEcc-EEecccceEEEEecc Confidence 2332211 111 111 11111122 345788876 566899999999855 No 114 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=76.95 E-value=0.13 Score=25.44 Aligned_cols=290 Identities=10% Similarity=0.024 Sum_probs=114.1 Q ss_pred CCceeeecC------CchHHHHHHHHhhh-cc---chhhhh-hhhhhhhhHHHHHHhhhhhhhcc-cccc-cchhhccee Q lcl|NC_019525. 1 MKKFVIRNS------KGEKILLNAQEAKI-AG---VIQRLC-NDLGFEIDVTTLTTLMKKIIEQK-FFEI-SPADYMPIR 67 (343) Q Consensus 1 ~~~~~~~~~------~~~~~~~~a~~~~~-~~---~~~~~~-~d~~~~f~~~qL~~i~~~iye~~-~~~l-~~~~~~pv~ 67 (343) +......+. ..+.....+....+ .. ...... .+.|. ....++. ..|.+.+ ...| ...+.+|+. 
T Consensus 120 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~g~-lvp~~~~---~~i~~~~~~~~l~~~~~~~~~~ 195 (437) T protein:vir:10 120 RDAGGLQDMKLKVGGEIADKKVTAFADYLKTGEVRDVTGIALKDGKV-IIPETIL---TPEKEVHQFPRLGSLVRTESVT 195 (437) T ss_pred HhHHHHhHHHHHHHHHHHHhhhhhhHHHHHhhhhhhhhhcccccccc-cchHHHH---HHHHHhhhhhhhhhcceeEeec Confidence 100000000 00000000000000 00 011111 12222 2222222 2222211 1111 112223332 Q ss_pred cCCCCceeEEEee-eccchhhhhhcccccCCcccccccccc-ccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHH Q lcl|NC_019525. 68 VGEGAWSTMLTTY-RSFSLAEDFATGIIDTGNSNGKLAAVD-TGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITA 145 (343) Q Consensus 68 ~~~~~w~~~~~~~-~~vg~a~~ia~g~~~~g~~a~Dip~vd-~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~ 145 (343) +. .+. +... +..+.+.++..+ ..+|..+ ..+++.+...+.++.-+.+|.+=|+-+. .+|.+- T Consensus 196 ~~--~~~--~~~~~~~~~~~~~~~e~--------~~~~e~~~~~~~~v~~~~~k~~~~~~is~ell~ds~----~~~~~~ 259 (437) T protein:vir:10 196 TT--TGK--LPIFNNSTDLLTAHTEY--------GQTTKNATPVITPILWDLKTYTGGYVFSQELISDSS----YDWQAE 259 (437) T ss_pred cC--cee--eEEeecccccccccccc--------ccccccccccceeeeeehhheeeehhhhHHHHhhhH----HHHHHH Confidence 22 111 1111 122333333322 2345333 4678888888888887778876555432 356666 Q ss_pred HHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEE Q lcl|NC_019525. 146 LEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFT 225 (343) Q Consensus 146 k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~ 225 (343) -......++...+|.-.+.|... |.. . ...+.+.+++.+-++..+... +. 
....++ T Consensus 260 i~~~l~~~~~~~~~~~i~~g~g~--~~~------~----------~~~~~~~~~~~~~~~~~l~~~-----~~-~~~~~~ 315 (437) T protein:vir:10 260 LQSRLIELRDNTDDSLIITALTD--GIK------K----------TTSTYLLGDLKKVLNVTLKPQ-----DS-AAASIV 315 (437) T ss_pred HHHHHHHHHHHHHHHHHhhhhcc--ccc------c----------cccccchhhHHHHHHhhhhhh-----hh-cCCEEE Confidence 66677777777788877877321 111 1 112233444544444333222 21 123689 Q ss_pred eCHHHHHHHhccccCCCcchhhh-hHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecCCch Q lcl|NC_019525. 226 IPESDYTGLAGAASADFPIKSTK-QVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPVDY 304 (343) Q Consensus 226 lp~~~~~~L~~~~~s~~~~~tl~-~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~ 304 (343) |.+..|..|..- -++.+..++ .-+.+..+..-.|.|+.+.....+. +.+ .|+..+++-+-+ +.+.+-.-..+ T Consensus 316 ~~~~~~~~l~~l--kd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~---~~~-~~~~~~~~gd~~-~~~~~~~r~~~ 388 (437) T protein:vir:10 316 MSQSAYNLFDMA--TDAMGRPLLQPNVTAATGYTLLGKTVVIVDDKLFP---SAS-AGDVNIVVAPLK-KAVINFKLTEI 388 (437) T ss_pred EcHHHHHHHHHh--hccCCCeeeccCccCCCCcccccceeEEecccccC---CcC-CCceEEEEeecc-ccEEEEeeece Confidence 999999998643 244443332 1112222222346776654432211 111 122222222222 22211110111 Q ss_pred hhc-chhhcCCceEEeceeeeeccEEEEcCceEEee--ecCC Q lcl|NC_019525. 305 TST-LANSVNNFQFQNAAYGQFTGVLAYRPKELLYL--DIPV 343 (343) Q Consensus 305 ~~l-~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~--D~~~ 343 (343) +.. .+ .+......+-.+.|++|. +..|.+++++ ++|. 
T Consensus 389 ~~~~~~-~~~~~~~~~~~~~r~d~~-~~~~~a~~~l~~~~~~ 428 (437) T protein:vir:10 389 TGQFQD-TYDIWYKQLGIFLRQNVV-QASKDLIVNLTGKLKA 428 (437) T ss_pred EEEEec-ccccccceeeEEEEEccE-EecccceEEEEeeccc Confidence 110 01 011112223345677555 4569998875 4555 No 115 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=74.45 E-value=0.16 Score=24.97 Aligned_cols=300 Identities=12% Similarity=0.031 Sum_probs=118.0 Q ss_pred CCceee-ecCCchHHHHHHHHh--------------------hhccchhhhhh-hhhhhhhHHHHHHhhhhhhhcccccc Q lcl|NC_019525. 1 MKKFVI-RNSKGEKILLNAQEA--------------------KIAGVIQRLCN-DLGFEIDVTTLTTLMKKIIEQKFFEI 58 (343) Q Consensus 1 ~~~~~~-~~~~~~~~~~~a~~~--------------------~~~~~~~~~~~-d~~~~f~~~qL~~i~~~iye~~~~~l 58 (343) .+.|-. .++-.+.++..++.. -.+........ +.|..+-..-...|-..+.+. -|=+ T Consensus 32 ~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~ee~~~~~~~~~~~~~~~gg~~vP~~~~~~I~~~l~~~-s~i~ 110 (377) T protein:vir:98 32 EKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDKNVGGKDKFKLLPEETMVQVFDDLVAE-HPLL 110 (377) T ss_pred HHHHHHHHHhHHHHHHHHHHHHHHHHHHhccCCcccCHHHHHHHHHHHhccCCCCCccccCHHHHHHHHHHHHHh-hhhh Confidence 000000 000001111110000 00001111111 112222111122222222221 2223 Q ss_pred cchhhcceecCCCCceeEEEeeeccchhhhhhcccccCCccccccc-cccccccceeeeeEEEEeeEeecHHHHHHHHHh Q lcl|NC_019525. 59 SPADYMPIRVGEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLA-AVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKS 137 (343) Q Consensus 59 ~~~~~~pv~~~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip-~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~ 137 (343) ...+..|+ .+ .. .+-..+..+.|.++..+ +.++ ..+..+.+...+.+..+.-..+|.+=|+-+. 
T Consensus 111 ~~~~v~~~---~~-~~-~~~~~~~~~~a~w~~e~--------~~~~~~~~~~f~~i~l~~~kl~a~~~is~elL~ds~-- 175 (377) T protein:vir:98 111 KVINFKNT---SL-RL-KALTAETSGTAVWGDIF--------GEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGP-- 175 (377) T ss_pred hheeeEec---Cc-ce-EEEEecCCcceeEeecc--------cccCcccCccceeEeecceeEEeeecccHHhhhccH-- Confidence 33333333 11 12 23333444555554321 2233 4556788888888888877777766665433 Q ss_pred cCCCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCH---HHHHHHHHHHHH----- Q lcl|NC_019525. 138 GNWDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTP---AELKVLCAGIID----- 209 (343) Q Consensus 138 Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~---~eIl~Din~~l~----- 209 (343) .+|.+--.+...+++...+++-.++|+-. ..-.|++|++....... .+.+.+.|. .+-+.|+...+. T Consensus 176 --~~ie~~i~~~la~~~a~~~~~a~i~G~G~-~qP~Gil~~~~~~~~~~--~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 250 (377) T protein:vir:98 176 --KWIKQFITEQLKEAIAVALELAIVKGDGL-LQPVGLLKDLSQPTVDQ--STGRDITTYKTDKEAIADLSDLTPDNAPK 250 (377) T ss_pred --hHHHHHHHHHHHHHHHHHHhhceEeccCC-Ccceeeeeccccccccc--ccccccccccchhhhHhhhhhhchhHHHH Confidence 46888888888888889999999999533 33569999875432211 111111111 112222221111 Q ss_pred -HHHhcCCc---------eecCCe-EEeCHHHHHHHh-ccccCCCcc--hhhhhHHHhhcchhcCCcceEEeechhhhhc Q lcl|NC_019525. 210 -VYRQGCDY---------TAMPNK-FTIPESDYTGLA-GAASADFPI--KSTKQVLEDTFKEITRNSSFEILPCVYADKI 275 (343) Q Consensus 210 -~v~~~s~~---------~~~p~t-l~lp~~~~~~L~-~~~~s~~~~--~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~ 275 (343) ..|.-... ...-+. +++.|..+..+. .....+.+. .+++ |.|+++....... . 
T Consensus 251 ~a~~~m~~~t~~~~~klkd~~G~~i~~~n~~~~~~~~p~~~~~~~~G~~~t~l------------g~p~~vv~s~~~p-~ 317 (377) T protein:vir:98 251 KLVPVMKHLSVNDKKRPLKIAGQVKLILNPEDRWALEAQFTSRNQFGEYVTVL------------PHGITILESLAVE-T 317 (377) T ss_pred HHHHHHHHHHHHHHhhhhccCCceEEEecccchhhccccccccCCCCcccccc------------CCCceEEecCCCC-c Confidence 11110000 001111 222232222221 000000000 0111 1222222111111 0 Q ss_pred ccccCCCccEEEEEEcCcceEEEecCCchhhcchhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 276 TAQVPAVAKRYALYNDNEDSLRMDIPVDYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 276 ~~~g~gg~drmv~Y~~d~~~v~~~iP~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) +.+-.|--.+++++++. -++++.- +.. ....+...|. ...|++| .++.|++++.+||-. T Consensus 318 ~~i~fgdf~~Y~i~~r~--~~~i~~~-~~~---~~~~d~~~f~--~~~r~dg-~~~~~~a~~vl~i~~ 376 (377) T protein:vir:98 318 GKAIAFVANRYDAFMAT--ASTIEEY-DQT---FAMEDLQLYL--TKNYFYG-KAKDNHTAALLTLAG 376 (377) T ss_pred ccEEEEEecceeEEeec--ceEEEee-chh---hhhcCceEEE--EEEEEcC-EEeccCcEEEEEEec Confidence 11111223345665553 2333211 111 1123223343 3677877 788999999999999 No 116 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=71.67 E-value=0.19 Score=24.51 Aligned_cols=298 Identities=9% Similarity=-0.010 Sum_probs=123.2 Q ss_pred HHHHHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeEEEe--eeccchhhhhhc Q lcl|NC_019525. 14 ILLNAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTT--YRSFSLAEDFAT 91 (343) Q Consensus 14 ~~~~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~--~~~vg~a~~ia~ 91 (343) .+| .+.+++++.+.. +..-+|. .|...-=.-+|+++--.|...--.+-...+....+.+.. +..++.... 
T Consensus 1 ~~~---~~~~~~~~~Ms~-~i~~~fv-~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 72 (322) T protein:vir:10 1 MKL---NAIMSMLPLIAG-DIDQAFV-QTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLASMDPDAVKRKRS--- 72 (322) T ss_pred Ccc---cceeeeeeeeec-hhhhHHH-HHHHHHHHHHHHHhhhhhhcccccccccccccceeecccccccccccccc--- Confidence 111 223444222222 2232343 333222222444444444433222221111111111111 111222211 Q ss_pred ccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeec--cc Q lcl|NC_019525. 92 GIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMK--DN 169 (343) Q Consensus 92 g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~--~~ 169 (343) + ...++.+-|+|..+.....+...+..+..++ -++++..+++. .+..+.-++.+--++..+.|++.+-|-- .. T Consensus 73 ~-~~~~d~~~dtp~~~~~~~~r~~~~~d~~~~~--~VDd~D~~k~~--~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~ 147 (322) T protein:vir:10 73 R-QQSADGTYPTPVNNKPFAKRRTNVDTYDTGH--VVEQEDISQML--LDPNSALITSQAYAMARKTDDLIIAGAWKPAS 147 (322) T ss_pred c-ccccCcccCCCccccccceEEEeecccccce--ecchHHHHHhh--cCchHHHHHHHHHHhhhHHHHHHHhhhhcccc Confidence 1 0112344478988888888888888887664 46677777765 3677777777777887888886655321 11 Q ss_pred cceee-eeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccc-cCCCcchhh Q lcl|NC_019525. 170 ANVKG-LLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAA-SADFPIKST 247 (343) Q Consensus 170 ~g~~G-LlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~-~s~~~~~tl 247 (343) .+..| .+..|.-+.... + ....|.+ .|.++...+....=-..-+-.++++|++|..|..-. ..+.+=... 
T Consensus 148 ~~~~gt~v~~~ss~~i~~-g---~~g~t~~----kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~ 219 (322) T protein:vir:10 148 IKGTGQPVEFLATQEIGD-G---TKPISFD----YVTEITERFLENEIEPEVSKVIVIGPTQARKLLQITEATSADYTSA 219 (322) T ss_pred ccccccccccCCCccccc-C---ccchhHH----HHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhcccc Confidence 11111 111111000000 0 0112222 244443333333211111125999999999886321 111110111 Q ss_pred hhHHHhhcchhcCCcceEEeechh--------------hhhcccccCCCccEEEEEEcCcceEEE--ecCCchhhcchhh Q lcl|NC_019525. 248 KQVLEDTFKEITRNSSFEILPCVY--------------ADKITAQVPAVAKRYALYNDNEDSLRM--DIPVDYTSTLANS 311 (343) Q Consensus 248 ~~~l~~n~~~~~~g~~l~I~~~~~--------------~~~~~~~g~gg~drmv~Y~~d~~~v~~--~iP~~~~~l~p~~ 311 (343) ..++. +|+-=.+.+..| ..+......+.+.+.++|.++.=..-. ++..... ..+ T Consensus 220 ~~l~~-------~G~ig~~lGf~~i~s~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~dv~~~i~---~~~ 289 (322) T protein:vir:10 220 MDLQS-------KGIITNWMGYTWIVSTRLDKFDPTQWGMAAEDGPQGDEIWCIAMTDMALGYHSCKDIWTKVA---EDP 289 (322) T ss_pred hhhhh-------cCeeeeeeeEEEEEeccCCccccccccccccCCCCccceeEEEEecCceeEEEeeeeeEEee---ccC Confidence 11111 232223333333 222222333445667889886433322 1222221 111 Q ss_pred cCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 
312 VNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 312 ~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) .......|...+-+|.+.+ .|+-++-.|.== T Consensus 290 ~~~~a~~I~~~~~~Ga~ri-~~~gVv~i~~~e 320 (322) T protein:vir:10 290 SASFAWRIYSAFTADCVRV-EDEHIFKLRLKN 320 (322) T ss_pred CcchhhhhhhhhhhCceEe-ccCcEEEEEEec Confidence 1122333544555666555 455443333311 No 117 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=64.08 E-value=0.31 Score=23.40 Aligned_cols=286 Identities=9% Similarity=-0.063 Sum_probs=118.7 Q ss_pred CCcee------------eecCCchHHHHHHHHh----------------hhccchhhh-hhhhhhhhhHHHHHHhhhhhh Q lcl|NC_019525. 1 MKKFV------------IRNSKGEKILLNAQEA----------------KIAGVIQRL-CNDLGFEIDVTTLTTLMKKII 51 (343) Q Consensus 1 ~~~~~------------~~~~~~~~~~~~a~~~----------------~~~~~~~~~-~~d~~~~f~~~qL~~i~~~iy 51 (343) -.+-. ....+++....+.... ......... ....| .+++. +.+...|+ T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-g~~vP--~~~~~~ii 155 (400) T protein:vir:38 79 SGKKPDHPEEHSYRDALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNAGVKAADA-ASTIP--ETISNTPQ 155 (400) T ss_pred ccccccchhhhhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhcccccCC-ccccc--HHHHHHHH Confidence 00000 0000000000000000 000000110 11111 12333 22334444 Q ss_pred hcccccccchhhcceec-CCCCceeEEEeee-ccchhhhhhcccccCCcccccccc-ccccccceeeeeEEEEeeEeecH Q lcl|NC_019525. 52 EQKFFEISPADYMPIRV-GEGAWSTMLTTYR-SFSLAEDFATGIIDTGNSNGKLAA-VDTGVDAVSIKTYKWAKTIGWTL 128 (343) Q Consensus 52 e~~~~~l~~~~~~pv~~-~~~~w~~~~~~~~-~vg~a~~ia~g~~~~g~~a~Dip~-vd~~~~~~~~~v~~~g~~~~ys~ 128 (343) +...+.-..+.++++.+ ..+.. .+...+ ..+.+.++..+ ...|. -+..+++.+...+.++.-+.+|. 
T Consensus 156 ~~~~~~~~l~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~E~--------~~~~~~~~~~f~~i~~~~~k~~~~~~is~ 225 (400) T protein:vir:38 156 RELQTVVDLKPFTNVFQASTQKG--TYPTVANATTKMVTVAEL--------EKNPAMAKPEFKPVNWSVETYRQALPVSQ 225 (400) T ss_pred HHHHhhhhhhhcceeEeccCcce--EEEEEecCCCcccccccc--------ccccccccccceeeEeehhheeeehhhHH Confidence 44333333333333321 11111 222211 22444444432 13443 34577888888888888777777 Q ss_pred HHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHH Q lcl|NC_019525. 129 PELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGII 208 (343) Q Consensus 129 ~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l 208 (343) +=|+.+ . .+|.+--......++...+|..++.|..... . -...|.++|.+-++..+ T Consensus 226 ell~ds---~-~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~~---------------~-----~~~~~~~~~~~~~~~~~ 281 (400) T protein:vir:38 226 ESIDDS---A-IDLVGLIAQNGQQIKVNTTNGAVATLLKGFT---------------A-----KTISSVDDLKHINNVDL 281 (400) T ss_pred HHHhhh---H-HHHHHHHHHHHHHHHHHHHHHhhhhcccccc---------------c-----cccccHHHHHHHHHhhh Confidence 544433 2 3577766666777777777877777732110 0 01223444444444333 Q ss_pred HHHHhcCCceecCCeEEeCHHHHHHHhccccCCCcchhhhh-HHHhhcchhcCCcceEEeechhhhhcccccCCCccEEE Q lcl|NC_019525. 209 DVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTKQ-VLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYA 287 (343) Q Consensus 209 ~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~-~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv 287 (343) ... ....++|-|+.|..|..- -++.+.-+.. =+.+.....-.|.|+.+.+-. -. 
+..|+..++ T Consensus 282 ~~~--------~~a~~v~~~~~~~~l~~l--kd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~-----~~-~~~g~~~~~ 345 (400) T protein:vir:38 282 DPA--------YSRVIIASQSFYNFLDTV--KDGNGRYLLQDSILTPSGKSVLGMPIAVVSDD-----TL-GAAGEAHAF 345 (400) T ss_pred hhh--------hCcEEEEcHHHHHHHHHh--hccCCCeeeecCcCCCCccccccceeEEeccc-----cc-CCCCceEEE Confidence 222 134789999999999643 3443333321 111112222246665443321 11 122333333 Q ss_pred EEEcCcceEEEe-cCCchhhcchhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 288 LYNDNEDSLRMD-IPVDYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 288 ~Y~~d~~~v~~~-iP~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) +.+-+.-++-+. --+.+... ......-. +-+++|+||..+. |.+++++-+.- T Consensus 346 ~gd~s~~~~~~~~~~~~~~~~-~~~~~~~~--~~~~~r~d~~~~~-~~a~~~l~~~~ 398 (400) T protein:vir:38 346 LGDIKRAILFANRADFMVRWV-DDQIYGQF--LQAGMRFGVSVAD-EKAGYFLTYTP 398 (400) T ss_pred EEeccccEEEEeecceEEEEe-ccccccee--EEEEEEeccEEec-ccceEEEEeec Confidence 223222122221 01111111 11122222 3457888877664 99999998877 No 118 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=62.37 E-value=0.34 Score=23.18 Aligned_cols=277 Identities=10% Similarity=-0.009 Sum_probs=114.3 Q ss_pred HHhhhccchhhhhhhhhhhhhHHHH----HHhhhhhhhcccccccchhhcceecCCCCceeEEEeeec-----cchhhhh Q lcl|NC_019525. 19 QEAKIAGVIQRLCNDLGFEIDVTTL----TTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTTYRS-----FSLAEDF 89 (343) Q Consensus 19 ~~~~~~~~~~~~~~d~~~~f~~~qL----~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~~~~-----vg~a~~i 89 (343) +.++ ++.++.... ..+..++| +.|..++.+.--+.+-+..+|- +.+.. -+..+.++.. 
-+-++.+ T Consensus 1 ~~~~-~~i~s~~~~---~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~-~~~a~-~~~~v~f~~~~p~~~~~d~e~V 74 (318) T protein:vir:10 1 MTAP-TGIVSVSDG---PAITVRELVGNPLWIPTALKKMMVNQFISESLFR-NGGAN-PNGVVAYNEGNPSFLEDDVADV 74 (318) T ss_pred CCCC-CcceeeecC---CceehHHhhCCchhHHHHHHHHHhccchhhhhhh-ccccc-ccceeEEEecccccccCcHhhc Confidence 2221 222222221 12222222 1233333333233344444444 21111 1112222211 1334444 Q ss_pred hcccccCCccccccccccccccceee-eeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeecc Q lcl|NC_019525. 90 ATGIIDTGNSNGKLAAVDTGVDAVSI-KTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKD 168 (343) Q Consensus 90 a~g~~~~g~~a~Dip~vd~~~~~~~~-~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~ 168 (343) +.|+ -+|.++...+.... .+.+||.++.+|.+.+..- +++.-.+....+.+....+.|+.+|- T Consensus 75 aEgg--------EiP~~~~~~G~~~ia~~~K~G~~~~vS~Em~~~n----~~~~v~r~~~~l~Nti~r~~d~~a~d---- 138 (318) T protein:vir:10 75 AEFG--------EIPVSAGARGLPRTAFAVKKALGVRVSKEMIDEN----RVGAVNDQMLQLRNTFIRANDRSAKA---- 138 (318) T ss_pred cCcc--------cccccCCCCCchhhhhhehhccceeccHHHHhhc----ChhHHHHHHHHHHHHHHHHHHHHHHH---- Confidence 4442 47888777755555 5579999999998766532 24555666666666666666665442 Q ss_pred ccceeeeeecCCceeecccCCcc-cccCCHH-----HHH----HHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccc Q lcl|NC_019525. 169 NANVKGLLTQTGNVVNNTFLTKS-IKSMTPA-----ELK----VLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAA 238 (343) Q Consensus 169 ~~g~~GLlN~p~v~~~~a~~~~~-w~~kT~~-----eIl----~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~ 238 (343) .|.+++++-..++++|. |...+.| |.. .|++.+-..-. 
..++-..||+|+|-|.++..|..- T Consensus 139 ------al~sa~t~~~~~s~~w~~~~~~~~d~~~A~e~v~~a~~~~~~a~~~~~-~~~~GY~pdtIVlhP~~~~~l~~n- 210 (318) T protein:vir:10 139 ------LLQSPIVPTLAVPTAWDNGGKVRTDIAIAIEQISTAAPTAYPAGVGSS-DEYFGFIPDTIVMHYALLPILMDN- 210 (318) T ss_pred ------HHhccccccccCCcCCCCcccccccchhhhhhhhhhhhhhhhhhhhhh-hhccCccceeeEECHHHHHHHhcc- Confidence 23444433323332322 2111111 111 12222211111 123334799999999999999522 Q ss_pred cCCCcchhhhh-HHHhhc-chhcC-------Cc--ceEEeechhhhhcccccCCCccEEEEEEcCcceEEEe---cCCch Q lcl|NC_019525. 239 SADFPIKSTKQ-VLEDTF-KEITR-------NS--SFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMD---IPVDY 304 (343) Q Consensus 239 ~s~~~~~tl~~-~l~~n~-~~~~~-------g~--~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~---iP~~~ 304 (343) .-+.. |..+++ .+... |+ -|++...+.+. +++..+..+ .++.+- .|+.. T Consensus 211 ------~~~~~~y~~~a~~~~~~~~~tg~~~g~~lGl~vi~s~~~p---------~~~alvlq~--g~vG~~~d~~pl~~ 273 (318) T protein:vir:10 211 ------ENFMKVYERNANYVSTAPDWTGNFPGSVMGLNVIRSRTFP---------IDRVLIMER--GTVGFYSDTRPLQF 273 (318) T ss_pred ------hhhhhhhhccchhhhhcccccccccceeeceEEeecCccC---------CCeeEEEec--CCcceeecccccee Confidence 11222 222322 12111 11 13333333222 244444443 444443 33332 Q ss_pred hhcchh-----hcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 
305 TSTLAN-----SVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 305 ~~l~p~-----~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) +-+-++ .-.+..|.+-+ .|.-..-|..|+++...===+ T Consensus 274 t~~~~egg~~~g~~~~s~~~~~-~~~~~~~V~~PkA~~~itgi~ 316 (318) T protein:vir:10 274 TALYPEGNGPNGGPTESYRADA-SHKRALAVDQPKAALWLTGIV 316 (318) T ss_pred eecccCCCCCCCCcchhhheeh-heeeeeeeeCcceeEEEeecc Confidence 222111 00122333322 234444555555544322111 No 119 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=55.72 E-value=0.47 Score=22.37 Aligned_cols=306 Identities=10% Similarity=0.003 Sum_probs=117.6 Q ss_pred CCceeeecCCchHHHHHHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcce---ecCCCCceeEE Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPI---RVGEGAWSTML 77 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv---~~~~~~w~~~~ 77 (343) |-....++-+|.|--+.-|-- +-.+.++.-.++.-... +.+...|+|.--++-.-.+++|. +.....|-+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~p--~l~m~alTLaea~~l~~---d~~~~~VIE~l~~~s~iL~~lpf~~ve~~~~~~~r~~ 75 (330) T protein:vir:94 1 MVRICTPPLRGRWRTLTHQFP--ELKMPTVTLAESAKLSQ---DHLVSGLIETIVEVNPLYEMMPFTEIEGNALAYNREN 75 (330) T ss_pred CceecCCccccceeehhcccc--ccchhhhhhhHHhhcCc---hhhHHHHHHhhhccchHHhhcccccccCCcceeeeee Confidence 777778888886654442211 11111222112211111 23344455543333333455553 33322222211 Q ss_pred EeeeccchhhhhhcccccCCcccccccc-ccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHH--HHHHHHHHH Q lcl|NC_019525. 78 TTYRSFSLAEDFATGIIDTGNSNGKLAA-VDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITA--LEKSRKKNW 154 (343) Q Consensus 78 ~~~~~vg~a~~ia~g~~~~g~~a~Dip~-vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~--k~~aar~a~ 154 (343) . .+-+.+..-+ .-+|. -...+.+.+..+..++.. ..-+=+-|...|+ +-+.+ .....-++. 
T Consensus 76 ~----lp~a~~r~~n--------~~~~~~~~~Tf~q~t~~l~~l~~~---~~Vd~~iadl~g~-~~d~~~~q~~~~ieal 139 (330) T protein:vir:94 76 V----LGDVQFLAVG--------GTITAKNPATFTKVTSELTTLIGD---AEVNGLIQATRSD-FMDQTSVQVASKAKSI 139 (330) T ss_pred c----CCcceeeecc--------ccccccCcceeeeeeechhhhhhh---HHHHHHHHHhcCC-HHHHHHHHHHHHHHHH Confidence 1 1222221100 00111 111223333333333322 1111122333343 22322 233333455 Q ss_pred HhhhhheEeeeeccccceeeeeecCCc-eeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHH Q lcl|NC_019525. 155 DLGIQEIAFVGMKDNANVKGLLTQTGN-VVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTG 233 (343) Q Consensus 155 ~~~~n~v~~~G~~~~~g~~GLlN~p~v-~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~ 233 (343) .++..+..++|+.....+.||+++-.- ....+.++ -...| ++|+.+++..+|...+ .|+.|+|....... T Consensus 140 ~~~~e~~linGDs~~~~F~GL~~~~~~~q~i~tg~~--gg~~T----~d~LDeLl~~v~~~~g---~~~~~l~n~a~~r~ 210 (330) T protein:vir:94 140 GRQYQASMITGDGTGNSFQGMMGLVAASQTISAGAN--GGTLT----FELLDQLLDLVKDKDG---QVDYLMSSFAMRRK 210 (330) T ss_pred HHHHHHHhhccCCCCccccchhhcCCcccEEecCCC--CCCCC----HHHHHHHHHHhcCCCC---CCcEEEechhHHHH Confidence 566677789996554568899753211 11112111 12233 5788999999987655 68899987776665 Q ss_pred HhccccCCCcchhhhh----HHHhhcchhcCCcceEEeechhhh-hcccccCCCccEEEEEEcCc-----ceEEEecCCc Q lcl|NC_019525. 234 LAGAASADFPIKSTKQ----VLEDTFKEITRNSSFEILPCVYAD-KITAQVPAVAKRYALYNDNE-----DSLRMDIPVD 303 (343) Q Consensus 234 L~~~~~s~~~~~tl~~----~l~~n~~~~~~g~~l~I~~~~~~~-~~~~~g~gg~drmv~Y~~d~-----~~v~~~iP~~ 303 (343) |..-.... +...+.. .+-.-.. .-+|.| |.++-+.. +.+....+|+..+++.+-.. -++.++-+-. 
T Consensus 211 I~a~~R~~-~~~~v~~~~~~~~G~~v~-~~~GvP--i~~~d~ip~~~~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~ 286 (330) T protein:vir:94 211 YFSLLRAL-GGAAIGEVMTLPSGRQIP-TYRGVP--WFVNDFIPSNMTQGTATNATAIFAGTFDDGSNKYGIAGLTARGS 286 (330) T ss_pred HHHHHHhc-cCCCCCCcccccCCCEEe-eeCCeE--EEecccccCCCCcccCCCceeEEEEeecccccccceEeecCCCC Confidence 54322211 1111111 1111111 124665 34443322 22222235667776666332 3344432210 Q ss_pred h---hhcc--hhhcCCceEEeceeeeeccEEEEcCceEEee-ecCC Q lcl|NC_019525. 304 Y---TSTL--ANSVNNFQFQNAAYGQFTGVLAYRPKELLYL-DIPV 343 (343) Q Consensus 304 ~---~~l~--p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~-D~~~ 343 (343) + -+.+ -+.+.-..+.|-.| =|+.+.-|.+++-+ +|-+ T Consensus 287 ~glsVr~~G~~~~k~v~~~~v~~y---~~~av~~~~a~~~L~~V~~ 329 (330) T protein:vir:94 287 AGLRVQNVGAKENADETITRVKMY---CGFANFSQLGLAAIKGLIP 329 (330) T ss_pred CcceeeeCCCccccceeeEEEEEe---eeeEEechhheeeeccccC Confidence 0 0000 00111111222111 12222223322221 1222 No 120 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=39.87 E-value=1 Score=20.59 Aligned_cols=301 Identities=9% Similarity=-0.019 Sum_probs=119.1 Q ss_pred CCce---ee------------------ecCCchHHHHHHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcc---cc Q lcl|NC_019525. 1 MKKF---VI------------------RNSKGEKILLNAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQK---FF 56 (343) Q Consensus 1 ~~~~---~~------------------~~~~~~~~~~~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~---~~ 56 (343) .+.| .. ...+|.+.+-+.+..-.+........+.| +++. 
+.+..+|++.- -| T Consensus 30 ~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~l~~~e~~~~~~~~~~t~~~Gg--~lvP--~~~~~~I~~~l~~~sp 105 (381) T protein:vir:10 30 NELYGDMINQLFEETKLQAKAEAERVSSLPKSAQTLSANQRNFFMDINKSVGYKEE--KLLP--EETIDRIFEDLTTNHP 105 (381) T ss_pred HHHHHHHHHhhhhhHHHHHHHHHHHHHHhcccccccCHHHHHHHHHHhhcCCCCCc--eecC--HHHHHHHHHHHHhhcc Confidence 0000 00 00000000000000000000001111222 2332 22333444421 11 Q ss_pred cccchhhcceecCCCCceeEEEeeeccchhhhhhcccccCCccccccc-cccccccceeeeeEEEEeeEeecHHHHHHHH Q lcl|NC_019525. 57 EISPADYMPIRVGEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLA-AVDTGVDAVSIKTYKWAKTIGWTLPELAEAM 135 (343) Q Consensus 57 ~l~~~~~~pv~~~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip-~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~ 135 (343) =+...+.+|+ ++ .. .+...+.-+.|.++.. .+.++ ..+..+++...+.+..+.-...|.+=|+-+. T Consensus 106 ir~~a~v~~~--~~--~~-~i~~~~~~~~a~W~~e--------~~~~~~~~~~~f~~i~l~~~kl~a~i~is~elL~Ds~ 172 (381) T protein:vir:10 106 LLADLGIKNA--GL--RL-KFLKSETSGVAVWGKI--------YGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGP 172 (381) T ss_pred eeeeeeeEec--Cc--ce-EEEeecCCcceEEeec--------ccccccccCccceeEeecceeEEeeccccHHHHhccH Confidence 1222233332 11 12 2222333344544332 12343 4456788888899998877776765555432 Q ss_pred HhcCCCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCC-c---ccccCCHHHHH---HHHHHHH Q lcl|NC_019525. 136 KSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLT-K---SIKSMTPAELK---VLCAGII 208 (343) Q Consensus 136 ~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~-~---~w~~kT~~eIl---~Din~~l 208 (343) ++|+.--.....+++...+++-...|+-.. .-.|++++++-...-+.+. + ...+.|...+. 
+.+..++ T Consensus 173 ----~~le~~i~~~la~~~a~~~~~afi~GdG~~-qP~Gil~~~~~~~~~~~g~~~~~~~~~~~t~~~~~~~~~~l~~~~ 247 (381) T protein:vir:10 173 ----AWIERFVRVQIEEAFAVALETAFLKGTGKD-QPIGLNRQVQKGVSVTDGAYPEKEEQGTLTFANPRATVNELTQVF 247 (381) T ss_pred ----HHHHHHHHHHHHHHHHHHhhceeEecccCC-CceeeeecCCccccccccccccccccccccccchhhHHHHHHHHH Confidence 478888888888899999999888995433 3469998654321111111 1 11122333322 2233322 Q ss_pred HHHHhc--CCc-eecCC-eEEeCHHHHHHHhcccc-CCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCc Q lcl|NC_019525. 209 DVYRQG--CDY-TAMPN-KFTIPESDYTGLAGAAS-ADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVA 283 (343) Q Consensus 209 ~~v~~~--s~~-~~~p~-tl~lp~~~~~~L~~~~~-s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~ 283 (343) ...... ... ....+ +++|-+..+..|..... -+..+.-+. .. +.++.|.....+. .+..--|-- T Consensus 248 ~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~~~~~~~G~~v~--------~l--p~g~~vv~~~~~p-~~~i~fGDf 316 (381) T protein:vir:10 248 KYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVT--------AL--PFNLNVIESTVQE-AGKVLTYVK 316 (381) T ss_pred HhhhhhhccccccccCceEEEEchhhHHhhccccccCCCCCceee--------cC--CCCceeEEcCCCC-cCcEEEEEc Confidence 222111 110 01111 35666666655532111 111111110 00 0111121111111 011111112 Q ss_pred cEEEEEEcCcceEEEecCCchhhcchhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 284 KRYALYNDNEDSLRMDIPVDYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 284 drmv~Y~~d~~~v~~~iP~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) .++++.++.. +++..- +. ....++-..|. 
...|++| .++.|++++++|+-+ T Consensus 317 s~Y~i~~r~~--~~i~~~-~~---~~~~~d~~~f~--a~~r~dG-~~~~~~A~~v~~l~~ 367 (381) T protein:vir:10 317 GLYDGYLAGG--INVQKF-KE---TLALDDMDLYT--AKQFAYG-KAKDNKVAAVWKLDL 367 (381) T ss_pred ccEEEEEecc--cEEEee-ch---hhhhcCceEEE--EEEEEcC-EEecCCcEEEEEEee Confidence 2344444432 222110 11 11123222343 4667777 467899999999976 No 121 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=38.75 E-value=1.1 Score=20.46 Aligned_cols=263 Identities=12% Similarity=0.044 Sum_probs=112.8 Q ss_pred HHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecC-CCCceeEEE--eeeccchhhhhhccccc Q lcl|NC_019525. 19 QEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVG-EGAWSTMLT--TYRSFSLAEDFATGIID 95 (343) Q Consensus 19 ~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~-~~~w~~~~~--~~~~vg~a~~ia~g~~~ 95 (343) |+...+ ..+ ...+.+ .+...+.+.-...+....+.-++.. .+.-+.++. .++..|-+++++.| T Consensus 1 MA~~~T--------~~~-~~~iPe--v~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg--- 66 (272) T protein:vir:98 1 MAVGTT--------KMA-QMLDPE--VLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEG--- 66 (272) T ss_pred CCCccc--------cch-heechH--HHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCC--- Confidence 110000 001 011111 1111122211111222222222211 011122332 34445667666543 Q ss_pred CCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeeccccceeee Q lcl|NC_019525. 
96 TGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGL 175 (343) Q Consensus 96 ~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GL 175 (343) +++|.-+...++....++..+..+.+|-+++.++ + .++.+.-.+.+-+++...+|.-.+-- +.|- T Consensus 67 -----~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~s---~-~d~~~~~~~~~~~~~a~~~d~~i~~~------~~~a 131 (272) T protein:vir:98 67 -----EAIPMTQLGFKKTTMTIKKAGKGVEITDEAILSG---Y-GDPVGQAAKQIVEAIDHKVDADVLDA------LSKS 131 (272) T ss_pred -----CcccccccccceEEEEeeeeeeeeeecHHHHhhc---c-ccHHHHHHHHHHHHHHHHHHHHHHHH------hccc Confidence 4789999999999999999988877776655432 2 35777777777777777776543311 1111 Q ss_pred eecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCC-Cc-chhhhhHHHh Q lcl|NC_019525. 176 LTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASAD-FP-IKSTKQVLED 253 (343) Q Consensus 176 lN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~-~~-~~tl~~~l~~ 253 (343) .+ +.. ...| .++|.+++..+-.. ...+..+++.|..|..|..-...+ .. +......+.+ T Consensus 132 ~~--------~~~----~~~t----~d~i~da~~~l~~~---~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~ 192 (272) T protein:vir:98 132 TQ--------TVE----ATAT----VDGVSKALDIFNDE---DDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVS 192 (272) T ss_pred cc--------ccc----cccC----HHHHHHHHHHHhcc---CCCccEEEEcHHHHHHHHHhcccccccccccccccccc Confidence 00 000 1123 34555565555322 234678999999999885322111 11 1111122222 Q ss_pred hcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecCCchhhcchhhcCCceEEeceeeeeccEEEEcC Q lcl|NC_019525. 254 TFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPVDYTSTLANSVNNFQFQNAAYGQFTGVLAYRP 333 (343) Q Consensus 254 n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP 333 (343) .....-.|.++-+ ...+. .++.-..++..+..+.+.. +.++ ..+. 
+.+ ....+-...++| +.+..| T Consensus 193 g~ig~i~G~~Vi~--s~~~p-~~t~~~~~~~a~~~~~~~~--~~ve----~~r~---~~~-~~~~i~~~~~~~-~~v~~~ 258 (272) T protein:vir:98 193 GVYGEVLGVQIVR--SRKCP-KGTAYMVRKGALRIMLKRN--TMVE----TDRD---ITK-AINQIVANKHYG-VYLYKA 258 (272) T ss_pred ccchhhcCeeEEE--cCCCC-cceEEEEcCCeEEEEecCC--ceee----eccc---ccc-ceeEEEEEEEEE-EEEEcC Confidence 1111113444322 11111 0111111222222222211 1111 1111 121 233343444454 889999 Q ss_pred ceEEeeecCC Q lcl|NC_019525. 334 KELLYLDIPV 343 (343) Q Consensus 334 ~a~~Y~D~~~ 343 (343) ..++.+=+-- T Consensus 259 ~~vv~~t~~~ 268 (272) T protein:vir:98 259 EKAVKITLKD 268 (272) T ss_pred CceEEEEecc Confidence 8877665544 No 122 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=38.75 E-value=1.1 Score=20.46 Aligned_cols=263 Identities=12% Similarity=0.044 Sum_probs=112.8 Q ss_pred HHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecC-CCCceeEEE--eeeccchhhhhhccccc Q lcl|NC_019525. 19 QEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVG-EGAWSTMLT--TYRSFSLAEDFATGIID 95 (343) Q Consensus 19 ~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~-~~~w~~~~~--~~~~vg~a~~ia~g~~~ 95 (343) |+...+ ..+ ...+.+ .+...+.+.-...+....+.-++.. .+.-+.++. .++..|-+++++.| T Consensus 1 MA~~~T--------~~~-~~~iPe--v~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg--- 66 (272) T protein:vir:30 1 MAVGTT--------KMA-QMLDPE--VLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAEDVAEG--- 66 (272) T ss_pred CCCccc--------cch-heechH--HHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEecCCCCcccccCC--- Confidence 110000 001 011111 1111122211111222222222211 011122332 34445667666543 Q ss_pred CCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeeccccceeee Q lcl|NC_019525. 
96 TGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGL 175 (343) Q Consensus 96 ~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GL 175 (343) +++|.-+...++....++..+..+.+|-+++.++ + .++.+.-.+.+-+++...+|.-.+-- +.|- T Consensus 67 -----~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~s---~-~d~~~~~~~~~~~~~a~~~d~~i~~~------~~~a 131 (272) T protein:vir:30 67 -----EAIPMTQLGFKKTTMTIKKAGKGVEITDEAILSG---Y-GDPVGQAAKQIVEAIDHKVDADVLDA------LSKS 131 (272) T ss_pred -----CcccccccccceEEEEeeeeeeeeeecHHHHhhc---c-ccHHHHHHHHHHHHHHHHHHHHHHHH------hccc Confidence 4789999999999999999988877776655432 2 35777777777777777776543311 1111 Q ss_pred eecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCC-Cc-chhhhhHHHh Q lcl|NC_019525. 176 LTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASAD-FP-IKSTKQVLED 253 (343) Q Consensus 176 lN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~-~~-~~tl~~~l~~ 253 (343) .+ +.. ...| .++|.+++..+-.. ...+..+++.|..|..|..-...+ .. +......+.+ T Consensus 132 ~~--------~~~----~~~t----~d~i~da~~~l~~~---~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~ 192 (272) T protein:vir:30 132 TQ--------TVE----ATAT----VDGVSKALDIFNDE---DDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVVS 192 (272) T ss_pred cc--------ccc----cccC----HHHHHHHHHHHhcc---CCCccEEEEcHHHHHHHHHhcccccccccccccccccc Confidence 00 000 1123 34555565555322 234678999999999885322111 11 1111122222 Q ss_pred hcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecCCchhhcchhhcCCceEEeceeeeeccEEEEcC Q lcl|NC_019525. 254 TFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPVDYTSTLANSVNNFQFQNAAYGQFTGVLAYRP 333 (343) Q Consensus 254 n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP 333 (343) .....-.|.++-+ ...+. .++.-..++..+..+.+.. +.++ ..+. 
+.+ ....+-...++| +.+..| T Consensus 193 g~ig~i~G~~Vi~--s~~~p-~~t~~~~~~~a~~~~~~~~--~~ve----~~r~---~~~-~~~~i~~~~~~~-~~v~~~ 258 (272) T protein:vir:30 193 GVYGEVLGVQIVR--SRKCP-KGTAYMVRKGALRIMLKRN--TMVE----TDRD---ITK-AINQIVANKHYG-VYLYKA 258 (272) T ss_pred ccchhhcCeeEEE--cCCCC-cceEEEEcCCeEEEEecCC--ceee----eccc---ccc-ceeEEEEEEEEE-EEEEcC Confidence 1111113444322 11111 0111111222222222211 1111 1111 121 233343444454 889999 Q ss_pred ceEEeeecCC Q lcl|NC_019525. 334 KELLYLDIPV 343 (343) Q Consensus 334 ~a~~Y~D~~~ 343 (343) ..++.+=+-- T Consensus 259 ~~vv~~t~~~ 268 (272) T protein:vir:30 259 EKAVKITLKD 268 (272) T ss_pred CceEEEEecc Confidence 8877665544 No 123 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=38.64 E-value=1.1 Score=20.45 Aligned_cols=289 Identities=10% Similarity=0.015 Sum_probs=118.9 Q ss_pred CCceeeecCCchHHHHHHHHhhhcc----------chhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchh---hccee Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAKIAG----------VIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPAD---YMPIR 67 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~~~~----------~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~---~~pv~ 67 (343) .+.....+...+.-..+.+...... .+.......| .+.+.+ .+.+.|++...+...-+. .+|+. T Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~~~~~~t~~~g-g~~vP~--~~~~~i~~~~~~~~~l~~~~~~~~~~ 149 (389) T protein:vir:10 73 KDDGSKKGTDLSKKPIDAKKKAINDFIHSHGKVIDATSKVTSTEA-GVLIPE--EIIYDPTAEVNSVVDLSTLVTKTPVT 149 (389) T ss_pred cccccccccccchhHHHHHHHHHHHHhhcchhhhhhhcccccCCc-ceeehH--HHHHHHHHHHHhhhhHHhhcceeecc Confidence 1111111111111111111000000 0111111222 233332 223344444333333333 34443 Q ss_pred cCCCCceeEEEeee-ccchhhhhhcccccCCccccccc-cccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHH Q lcl|NC_019525. 
68 VGEGAWSTMLTTYR-SFSLAEDFATGIIDTGNSNGKLA-AVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITA 145 (343) Q Consensus 68 ~~~~~w~~~~~~~~-~vg~a~~ia~g~~~~g~~a~Dip-~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~ 145 (343) +... .+.... ..+.+.+++.+ ...| .-+..+.+....++.++.-+.+|.+=|+.+. .+|.+. T Consensus 150 ~~~~----~~~~~~~~~~~~~~~~E~--------~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~----~~l~~~ 213 (389) T protein:vir:10 150 TPKG----TYPILKRATDRFSSVAEL--------AENPKLAEPEFNKVDWSVATYRGAIPLSEEAIADSA----VDLTAL 213 (389) T ss_pred CCee----EEEEEecCCCcccccccc--------ccccccccccceeeeeeheeeEeeehhhHHHHhhhh----HHHHHH Confidence 2211 121111 22333333322 1344 3356788888889999888888877665432 356666 Q ss_pred HHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEE Q lcl|NC_019525. 146 LEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFT 225 (343) Q Consensus 146 k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~ 225 (343) -.+...+++...+|..+..|... +. + . ..-...+.+++.+-++..+... + ...++ T Consensus 214 i~~~la~~~~~~~~~~i~~g~~~--~~---------~----~--~~~~~~~~d~l~~~~~~~~~~~-----~---~a~~~ 268 (389) T protein:vir:10 214 VGQSIKEKSVNTYNAMIAPVLQS--FT---------A----K--KTTTDTLVDSLKHILNVDLDPA-----Y---SRALV 268 (389) T ss_pred HHHHHHHHHHHHHHHHHhhhhcc--cc---------c----c--cccccccHHHHHHHHHhhhhhh-----h---CcEEE Confidence 66777777777777766655311 10 0 0 0111234454444343332221 1 24789 Q ss_pred eCHHHHHHHhccccCCCcchhhhh-HHHh----hcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEec Q lcl|NC_019525. 226 IPESDYTGLAGAASADFPIKSTKQ-VLED----TFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDI 300 (343) Q Consensus 226 lp~~~~~~L~~~~~s~~~~~tl~~-~l~~----n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~i 300 (343) |.++.|..|.+-+ ++.+.-++. -+.+ ..+..-.|.|+.+.+..+. ...++.-.+++-+-+ +.+.+-. 
T Consensus 269 ~n~~~~~~L~~lk--d~~G~~i~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~-----~~~~~~~~~~~gd~~-~~~~~~~ 340 (389) T protein:vir:10 269 VTQSLFNTLDTLK--DKNGRYLLHDASDSITDGTAKGTILGVPVYVVGDTLL-----GSLAGDQKAFVGDLK-RGVLFTD 340 (389) T ss_pred ecHHHHHHHHHhh--ccCCCeeeecCcccccccccccccccceeEEeccccc-----CCCCCceEEEEeecc-ccEEEEe Confidence 9999999997543 333322211 0011 0111224666654432221 111222222222222 1111111 Q ss_pred CCchhhcchhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 301 PVDYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 301 P~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) ...++...-.... .....-.+.|++|. +..|.+++++.+.= T Consensus 341 ~~~~~i~~~~~~~-~~~~~~~~~r~d~~-~~~~~a~~~~~~~~ 381 (389) T protein:vir:10 341 RQQVTLAWEDSKI-YGKYLGAAFRFGVQ-KADSKAGYFVTNTD 381 (389) T ss_pred ecceEEEeecccc-ccceEEEEEEeccE-EecccceEEEEeec Confidence 1111111111111 11222345788887 57799999888653 No 124 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=37.79 E-value=1.1 Score=20.36 Aligned_cols=277 Identities=6% Similarity=-0.077 Sum_probs=117.1 Q ss_pred hhhhHHHHHHhhhhhhhccccccc------chhhcceecCC---CCceeEEEeeec-c-chhhhhhcccccCCccccccc Q lcl|NC_019525. 36 FEIDVTTLTTLMKKIIEQKFFEIS------PADYMPIRVGE---GAWSTMLTTYRS-F-SLAEDFATGIIDTGNSNGKLA 104 (343) Q Consensus 36 ~~f~~~qL~~i~~~iye~~~~~l~------~~~~~pv~~~~---~~w~~~~~~~~~-v-g~a~~ia~g~~~~g~~a~Dip 104 (343) |+........-+..-. .++|. -+..=|+-+-. ..-...+.+..+ . ..++ .+. 
..+ .|.| T Consensus 1 ma~~~~~~~t~~~~g~---~~dl~~~I~~isp~dTPf~S~i~~~~a~~~~~~W~~d~l~~~~~-~~~-----~EG-~da~ 70 (317) T protein:vir:88 1 MATPTNAVSTVEINGK---REDLIDIIYNIAPYDTPFMSAIGKGVATAITHEWQTDELRQPGK-NTR-----VEG-EDAT 70 (317) T ss_pred CCccccceEeeeeeee---eechhhhheecCCccCcceeeecCceecccEEEEEeeecCCccc-ccc-----ccC-cccc Confidence 3333222222111111 11111 01111111100 011111222110 0 1111 110 011 1334 Q ss_pred cccccccceeeee---EEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeecc--------cccee Q lcl|NC_019525. 105 AVDTGVDAVSIKT---YKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKD--------NANVK 173 (343) Q Consensus 105 ~vd~~~~~~~~~v---~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~--------~~g~~ 173 (343) .....-......+ +.=..++..|.+-... .|+-++.+....-+-..+..+++...+.|... -..+- T Consensus 71 ~~~~~~r~~~~N~tQIf~k~v~VSgTa~av~~---~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~ 147 (317) T protein:vir:88 71 IKAGSFTTMLNNYCQISDETLQVTGTADRVKK---AGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMA 147 (317) T ss_pred cccccCCEEeccEEEEEEeEEEEeehhhhhhh---cCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhh Confidence 4333333333333 3334555566655533 45435444433334444555566666666321 11245 Q ss_pred eeeec--CC-ce-ee----cccCCcccccCCHHHHH-HHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCCCcc Q lcl|NC_019525. 174 GLLTQ--TG-NV-VN----NTFLTKSIKSMTPAELK-VLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFPI 244 (343) Q Consensus 174 GLlN~--p~-v~-~~----~a~~~~~w~~kT~~eIl-~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~ 244 (343) ||++. ++ +. .. ...++..|...|+..+. ++|++++..+|...+ .|++|+++|.+-..|+.- ..+... 
T Consensus 148 Gl~~~i~t~~~~~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~Gg---~~~~i~v~a~~k~~i~~~-~~~~~~ 223 (317) T protein:vir:88 148 NIFAYYKTNGSLGANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNGG---QANSIQTSSSIKKAISKN-MKGRAT 223 (317) T ss_pred hHHHHhccCceeccCccccccCCCccccccccccccHHHHHHHHHHHHhcCC---CCCEEEeChHHHHHHHHH-hcCCce Confidence 76653 22 11 11 11123335555555544 459999999999876 578999999988888633 111111 Q ss_pred hhhhhHHHhh------cchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecCCchhhcchhhcCCceEE Q lcl|NC_019525. 245 KSTKQVLEDT------FKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPVDYTSTLANSVNNFQFQ 318 (343) Q Consensus 245 ~tl~~~l~~n------~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~~~l~p~~~~~l~~~ 318 (343) .+...-..+ ..+....-.++|.+-+|+. .+.|+++ |++++++.. +...+..+..+.+...+ T Consensus 224 -~i~~~~~~~~~g~~v~~~~tdfG~v~ii~~r~lp---------~~~~~~~--D~~~~~l~~-Lr~~~~e~laKtGd~~k 290 (317) T protein:vir:88 224 -EITLDASDNRIAQTVDVYESDFGKYTIRANRWFH---------ENTLFVF--DPKMHSLCY-LRPFFQHELAKTGDSEK 290 (317) T ss_pred -eEEEcccCeEEEEEEEEEEeCCeEEEEEeCCCCC---------CCeEEEE--cccccceee-cccceeeccCCCcccce Confidence 111000000 1112233457888888764 2556665 567777653 22122223333222222 Q ss_pred eceeeeeccEEEEcCceEEee-ecCC Q lcl|NC_019525. 319 NAAYGQFTGVLAYRPKELLYL-DIPV 343 (343) Q Consensus 319 v~~~~r~GGv~v~yP~a~~Y~-D~~~ 343 (343) .-.+ -=.+++++-|.+.+-. |+-- T Consensus 291 ~~i~-~E~tLe~~N~~a~a~i~~l~~ 315 (317) T protein:vir:88 291 RQLL-VEYTFRVNNEKSGALIRDVVA 315 (317) T ss_pred eEEE-EEEEEEEcCccceeEEEEecc Confidence 2222 2347888888865443 3322 No 125 >protein:vir:98480 Length: 348 # NCBI annotation: ORFp38 # Family: family:all:1083 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958280;genbank:gi:41057254;uniprot:Q38595;genbank:GeneID:2732864 Probab=36.30 E-value=1.2 Score=20.19 Aligned_cols=278 Identities=10% Similarity=0.068 Sum_probs=102.3 Q ss_pred hccchhhhhhhhhhhhhHHHHHHhhhhhh-hcccccccchhhcceecCCCCceeEEEeee----ccchhhhhhcccccCC Q lcl|NC_019525. 
23 IAGVIQRLCNDLGFEIDVTTLTTLMKKII-EQKFFEISPADYMPIRVGEGAWSTMLTTYR----SFSLAEDFATGIIDTG 97 (343) Q Consensus 23 ~~~~~~~~~~d~~~~f~~~qL~~i~~~iy-e~~~~~l~~~~~~pv~~~~~~w~~~~~~~~----~vg~a~~ia~g~~~~g 97 (343) ++..++ .| -|...+|..+=..+. ..+.+.+-..++||..... .-.+.+.. ..-+|.+++-+. T Consensus 1 M~~~~~---~d---~~~~~~l~~~i~~~~~~~~~~~~l~~~~fp~~~~~---~~~~~~~~~~~~~~~~a~~~~~~~---- 67 (348) T protein:vir:98 1 MSWTLD---TE---FIEPTQLTGLIREALRDLQVNRFRLARWLPNVDVD---DITFEFLRGGGGLAETASYRSWDT---- 67 (348) T ss_pred Ccchhh---hh---ccCHHHHHHHHHHHhhccCcchhhHHhcCCCcccc---ceEEEEEeccCCceeeeeeecCCC---- Confidence 111111 11 245566665544433 2334557778899943211 11222211 111234433210 Q ss_pred cccccccccc-ccccceeeeeEEEEeeEeecHHHHHHHHHhc-----C--CCcHHHHHHHHHHHHHhhhhheEeeeeccc Q lcl|NC_019525. 98 NSNGKLAAVD-TGVDAVSIKTYKWAKTIGWTLPELAEAMKSG-----N--WDLITALEKSRKKNWDLGIQEIAFVGMKDN 169 (343) Q Consensus 98 ~~a~Dip~vd-~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~G-----r--~~L~~~k~~aar~a~~~~~n~v~~~G~~~~ 169 (343) -.|... ..+...+..+-.++..+..+..|+...+++- + ...-.+..+++++..|.-.-+..+.|.=.. T Consensus 68 ----~~~~~~r~g~~~~~~~~~~i~~~~~i~~~d~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~ 143 (348) T protein:vir:98 68 ----ESKIGRREGLAKVMGELPPISEKIPLNEYDRLRLRKLSRDEALPFIARDAQRLARNIGARFEVARGSALVNATVPV 143 (348) T ss_pred ----ccceeecccceeeeeeccccccccccCHHHHHHhcCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEE Confidence 112222 2345555556667777777777775432110 0 001111233333333333333444441000 Q ss_pred --cceeee-eecCCceeecccCCccccc-CCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCCCcch Q lcl|NC_019525. 170 --ANVKGL-LTQTGNVVNNTFLTKSIKS-MTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFPIK 245 (343) Q Consensus 170 --~g~~GL-lN~p~v~~~~a~~~~~w~~-kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~ 245 (343) .+. .. +..|.- .+..++..|.+ .|+ .++.||.+.+..+...+|. .|++++|.+..|..|.+- . 
T Consensus 144 ~g~~~-~vDyg~~~~--~~~t~~~~Ws~~~~a-dp~~di~~~~~~~~~~~G~--~p~~~vm~~~~~~~l~~~-------~ 210 (348) T protein:vir:98 144 TELQQ-TVDFGRIGS--HSVVAAVLWSVHATA-TPISDLESWVATYEDTNGQ--SPGVILMPKAAVSHMRQC-------E 210 (348) T ss_pred ecCce-EEccccCcc--cccccccccCCCCCC-CHHHHHHHHHHHHHHccCC--cceEEEeCHHHHHHHhcC-------H Confidence 011 00 111211 12234567864 444 4889999999999887775 499999999999998421 1 Q ss_pred hhhhHHHhhc---------------chhcCC-cceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecCC------- Q lcl|NC_019525. 246 STKQVLEDTF---------------KEITRN-SSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPV------- 302 (343) Q Consensus 246 tl~~~l~~n~---------------~~~~~g-~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~------- 302 (343) .+.+.+.... .....| ..+.+-...+ .. .|.+.+++ . .+.+.| +|- T Consensus 211 ~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~i~~~d~~~----~~--~g~~~~~~--p--~~~i~l-~p~~~~~~~~ 279 (348) T protein:vir:98 211 EVIRQVFPLAPSGTAPMVSVEQLNTVLSSMGLPPIEVYDAKV----AV--DGVSTRIT--P--ANAIAL-LPEPGATDAA 279 (348) T ss_pred HHHHHHhccCccccccccCHHHHHHHHHhhCCeEEEEeeeEE----Ec--CCceecee--c--CCeEEE-EecCCccccc Confidence 1111111100 000111 2222211111 11 11222221 0 011111 110 Q ss_pred -----chhhc--chhhcCCceEEeceeeeeccEE-----------------------EEcCceEEeeecCC Q lcl|NC_019525. 303 -----DYTST--LANSVNNFQFQNAAYGQFTGVL-----------------------AYRPKELLYLDIPV 343 (343) Q Consensus 303 -----~~~~l--~p~~~~~l~~~v~~~~r~GGv~-----------------------v~yP~a~~Y~D~~~ 343 (343) -.+.. +++.. .......+.. -.|+. 
+..|.++..++|=- T Consensus 280 ~~~~~G~t~~G~~~e~~-~~~~~~~~~~-~~~i~~~~~~~~dP~~~~~~~~s~~lPv~~~~~~~~~a~Vl~ 348 (348) T protein:vir:98 280 QPTELGATLLGTTAESL-EDDYALAPGE-QPGIVAATWKTKDPVRLWTHAAAVGIPVLREPNLTFKAQVLA 348 (348) T ss_pred ccccccceecccchhhh-ccccccceec-cCceeeeeeeecCCcEEEEEEeeeeeccccCCCcEEEEEEeC Confidence 00000 00111 0111111111 01110 01111111111100 No 126 >protein:vir:106590 Length: 349 # NCBI annotation: putative major head protein # Family: family:all:1083 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958585;genbank:gi:41179245;genbank:GeneID:2717126 Probab=31.63 E-value=1.5 Score=19.64 Aligned_cols=287 Identities=13% Similarity=0.063 Sum_probs=100.5 Q ss_pred CCceeeecCCchHHHHHHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceec-CCCCceeEEEe Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRV-GEGAWSTMLTT 79 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~-~~~~w~~~~~~ 79 (343) || |.. +-++ .+......---|...+|..+-+. .+.+.+-..++||... ..-+|. .+.. T Consensus 1 ~~-----~~~---~~~~---------~~~~~~~~~d~~~~~~l~~~~~~---~~~~~~l~~~~Fp~~~~~~~~~~-~~~~ 59 (349) T protein:vir:10 1 MK-----NQK---LQLD---------LQRFATPILDMFSQNTVLDYTRN---RQYPEMLGDTLFPAVKVPTLEVD-ILKA 59 (349) T ss_pred CC-----cch---hhHH---------HHHHHHHhhcccCHHHHHHHHHh---cCcchhhHhhcCCccccccceeE-EEee Confidence 22 222 1111 11111111113555555554442 2345677788999432 221221 1111 Q ss_pred e-eccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHH-----------HH Q lcl|NC_019525. 80 Y-RSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITA-----------LE 147 (343) Q Consensus 80 ~-~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~-----------k~ 147 (343) . ...-+|.+++-+. -.|..+.+....+..+-.++..+..+..|+...+..++-..... .. 
T Consensus 60 ~~~~~~~a~~v~~~~--------~~~~~~r~~~~~~~~~p~ik~~~~i~e~dl~~~~~~~~~~~~~~~~~~i~~d~~~l~ 131 (349) T protein:vir:10 60 GSRVPTIASVSAFDA--------EAEIGTREASKMTAELAYVKRKMQITEEMLIKLQSPRNTAEENYLKQYVFDDIDAMV 131 (349) T ss_pred ccCcceeeeeecCCC--------CcceecccceeEEeeccccccccccCHHHHHHHhhccCcchHHHHHHHHHHHHHHHH Confidence 1 1112244333210 12233333333334444556677778888876655543111111 11 Q ss_pred HHHHHHHHhhhhheEeeeecc--ccceeeeeecCCceee---cccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCC Q lcl|NC_019525. 148 KSRKKNWDLGIQEIAFVGMKD--NANVKGLLTQTGNVVN---NTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPN 222 (343) Q Consensus 148 ~aar~a~~~~~n~v~~~G~~~--~~g~~GLlN~p~v~~~---~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~ 222 (343) +++++-.|...-+..+.|.=. ..|+. +. =+++.. +.+++..|.+.++ .+++||.+.+..+ +. .|+ T Consensus 132 ~~i~~r~E~m~~q~l~~Gki~~~~~g~~--vD-~g~~~~~~~~lt~~~~Ws~~~a-dpi~Di~~~~~~~----g~--~p~ 201 (349) T protein:vir:10 132 QAVKARGEKMTMEMFATGKITDKKNGIA--ID-YGVPKKHQETLSGTKTWDKSDA-SIIDNLQDWSDSL----DV--TPT 201 (349) T ss_pred HHHHHHHHHHHHHHHhCCeeEEcCCcEE--Ee-cccCccceeEecCcccCCCCCC-CHHHHHHHHHHHh----CC--Ccc Confidence 222222222223333344100 00111 11 011111 1335667988765 5888998876654 33 589 Q ss_pred eEEeCHHHHHHHhccccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhh-cccccCCCccEEEEEEcC----cc--- Q lcl|NC_019525. 223 KFTIPESDYTGLAGAASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADK-ITAQVPAVAKRYALYNDN----ED--- 294 (343) Q Consensus 223 tl~lp~~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~-~~~~g~gg~drmv~Y~~d----~~--- 294 (343) +++|.+..|..|.+ +..+.+.+..++ .+. +.+...++. ++. .|+ =...+|+.. .. 
T Consensus 202 ~~vm~~~~~~~l~~-------~~~i~~~~~~~~----~~~---~~~~~~~~~~l~~--~~~-~~i~~yd~~y~d~~~~~~ 264 (349) T protein:vir:10 202 RALTSKKVLRILMR-------STEIKEAIFGKD----TGR---VVGQADLDQWMTA--QGL-PIIRAYDGKYRDEDSRGN 264 (349) T ss_pred EEEeCHHHHHHHhc-------CHHHHHHhcccc----ccc---ccCHHHHHHHHHh--cCC-ceEEEEeeEEEeecCCCc Confidence 99999999999853 111222221110 000 000000000 000 011 022333210 00 Q ss_pred -eEEEecCCchh-hcchhhcCCceEEeceee---eeccEEEEcCceEE---e-e-ecCC Q lcl|NC_019525. 295 -SLRMDIPVDYT-STLANSVNNFQFQNAAYG---QFTGVLAYRPKELL---Y-L-DIPV 343 (343) Q Consensus 295 -~v~~~iP~~~~-~l~p~~~~~l~~~v~~~~---r~GGv~v~yP~a~~---Y-~-D~~~ 343 (343) .-+-.+|-..- ++++-.-+...|-....+ ..|.+.+.-+.... + . +=|+ T Consensus 265 ~t~~~~~p~~~v~l~~~~~~G~~~yG~~~e~~~~~~g~~~~~~~~~~~~~~~~~~~dP~ 323 (349) T protein:vir:10 265 LTTNSYFPEDRIVLFNDEVPGQKIYGPTPEENRLISSNAQVSNVGNIMAKIYETSEDPI 323 (349) T ss_pred eeecccccCCeEEEecCCCceeEEeeccchhhhhcccccceeeccceEEEeeeecCCCc Confidence 00000111111 111111111222111100 11222222221111 1 1 1254 No 127 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=27.97 E-value=1.8 Score=19.19 Aligned_cols=274 Identities=10% Similarity=0.002 Sum_probs=123.0 Q ss_pred hccchhhhhhhhhhhhhHH-HHHHhhhhhhhcccccccchhhcceecCCCCceeEEEeee---ccchhhh-hhcccccCC Q lcl|NC_019525. 23 IAGVIQRLCNDLGFEIDVT-TLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTTYR---SFSLAED-FATGIIDTG 97 (343) Q Consensus 23 ~~~~~~~~~~d~~~~f~~~-qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~~~---~vg~a~~-ia~g~~~~g 97 (343) |+. + .|... .|+.+=. =| +.+++.+.++||.....-+-. .|..|+ .+...+. 
.+-+ T Consensus 1 ~~~---------~-~~~~dp~LT~~A~-gy--~n~~~Ia~~l~P~vpV~~~~~-~~~~f~~~e~F~~~~t~r~~~----- 61 (309) T protein:vir:99 1 MSN---------A-PFPIDPELTAIAI-AY--RNGRMISDEVLPRVPVGKQEF-KFWKYDLAQGFTVPETLVGRK----- 61 (309) T ss_pred CCC---------C-CcCcCHhHHHHHh-hc--cChhhhhhhcCCccccCcccc-ceeeechhhcccccchhhccC----- Confidence 121 1 12221 2333322 12 245677788888643322111 111121 1111111 1111 Q ss_pred ccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhh----hheEeeeecccccee Q lcl|NC_019525. 98 NSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGI----QEIAFVGMKDNANVK 173 (343) Q Consensus 98 ~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~----n~v~~~G~~~~~g~~ 173 (343) .+.-+|+....+....+..-+..+.....|..+|. . +.++.+...+.++..++.+. -++++.- T Consensus 62 ---~~~~~v~~~~~~~~~~~~~~~L~~~i~~~~~~~a~-~-~~d~~~~Av~~l~~~i~l~rE~~~A~lv~~~-------- 128 (309) T protein:vir:99 62 ---SKPNEVEFSATDETGSTEDHGLDAPVPQADIDNAP-T-NYNPLGHATEQTTNLILLDREARTSKLVFSP-------- 128 (309) T ss_pred ---CCcceEeecccCceeeecccceeecCCchhhhhcc-C-CCCHHHHHHHHHHHHHHHHHHHHHHHHhcCh-------- Confidence 13346666777777778888888888888888765 2 35777666655554443333 2332222 Q ss_pred eeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhcc-----ccC-CCc--ch Q lcl|NC_019525. 174 GLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGA-----ASA-DFP--IK 245 (343) Q Consensus 174 GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~-----~~s-~~~--~~ 245 (343) -|.|.=-..+-+|+..|.++++| ++.||.+.+..+ + .+||+++|....|..|.+- ++. ... .. 
T Consensus 129 --a~y~~~~k~~Lsgt~~wsd~~SD-Pi~~i~~~~~~~----g--~~PN~~vlg~~~~~~l~~hp~i~~~ik~~~~~~g~ 199 (309) T protein:vir:99 129 --NSYAAGNKTTLSGADQWSDPTSN-PLPVITDALDSV----I--LRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGM 199 (309) T ss_pred --hhcCCCceEEecCccccCCCCCC-cHHHHHHHHHhh----C--CCcceEEechHHHHHHhhCHHHHHHhcCCCccccc Confidence 11121011223445678886654 788888888775 3 4899999999999887431 111 111 11 Q ss_pred hhhhHHHhhcchhcCCcceEEeechhhh-hccccc----CCCccEEEEEEcCcceEEEecCCchhhcchh-hcCCceEEe Q lcl|NC_019525. 246 STKQVLEDTFKEITRNSSFEILPCVYAD-KITAQV----PAVAKRYALYNDNEDSLRMDIPVDYTSTLAN-SVNNFQFQN 319 (343) Q Consensus 246 tl~~~l~~n~~~~~~g~~l~I~~~~~~~-~~~~~g----~gg~drmv~Y~~d~~~v~~~iP~~~~~l~p~-~~~~l~~~v 319 (343) .-.+.|++-+- ...+-+....+.. ..+..+ .-|.+..++|.....- .+.-| .|-....+ .+....+.. T Consensus 200 it~~~la~l~~----ve~V~vg~a~~n~a~~g~~~~~~~iwg~~~~L~y~~~~~~-~~~~p-s~G~t~~~~~r~~g~~~d 273 (309) T protein:vir:99 200 VPMAFLQELLE----LDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLAD-TRNGT-TFGLTAQWGDRVSGSIAD 273 (309) T ss_pred cCHHHHHHHhC----cceEEeecceeeccccccccccccccCCcEEEEEcCCCCC-Ccccc-cccceeecccccCCceee Confidence 22333443221 1122221111110 001111 1144556666554321 11112 12211111 133345667 Q ss_pred ceeeeeccEEEEcCce----EEeeecCC Q lcl|NC_019525. 320 AAYGQFTGVLAYRPKE----LLYLDIPV 343 (343) Q Consensus 320 ~~~~r~GGv~v~yP~a----~~Y~D~~~ 343 (343) |++..-||-.||.-.. +..-|+-. T Consensus 274 ~~~~~~g~~~vr~~~~~k~~i~~~d~G~ 301 (309) T protein:vir:99 274 PNIGLRGGQRVRVGESVKELVTAPDLGF 301 (309) T ss_pred eeeccCCceEEEEeccccchhcchhcch Confidence 8887777744442110 11111111 No 128 >protein:vir:6378 Length: 346 # NCBI annotation: capsid protein E # Family: family:all:1021 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918991;genbank:gi:34610166;genbank:GeneID:2559600 Probab=24.60 E-value=2.2 Score=18.75 Aligned_cols=284 Identities=10% Similarity=-0.011 Sum_probs=98.7 Q ss_pred hhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCceeE--EEeee-ccchhhhhhcccccCCcccccccccccccc Q lcl|NC_019525. 
35 GFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAWSTM--LTTYR-SFSLAEDFATGIIDTGNSNGKLAAVDTGVD 111 (343) Q Consensus 35 ~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~--~~~~~-~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~ 111 (343) --.|..++|+..=+++ +.+.+-..++||-....+ .+. +...+ ...+|.+++.+. . ..+.-...+. T Consensus 1 ~d~f~~~~l~~~i~~~---p~~~~l~~~~fp~~~~~~--t~~i~i~~~~g~~~la~~v~~~~-----~--~~~~~~~g~~ 68 (346) T protein:vir:63 1 MEIFDTLTLAGVIQSG---PALSMYWQGFYPNEITFD--TDEILFDLVFKDKKLAPFVAPNV-----Q--GRVIAARGYT 68 (346) T ss_pred CCccCHHHHHHHHHhc---CCccchhhhcCccccccc--cceEEEEEecCceeeeeeecCCC-----C--cceeccccee Confidence 2257778876544432 344555667777321111 122 22222 234454444321 1 1111111121 Q ss_pred ceeeeeEEEEeeEeecHHHHHHHHH-----hcCCCcHH-------HHHHHHHHHHHhhhhheE----eeeec--ccccee Q lcl|NC_019525. 112 AVSIKTYKWAKTIGWTLPELAEAMK-----SGNWDLIT-------ALEKSRKKNWDLGIQEIA----FVGMK--DNANVK 173 (343) Q Consensus 112 ~~~~~v~~~g~~~~ys~~EL~~A~~-----~Gr~~L~~-------~k~~aar~a~~~~~n~v~----~~G~~--~~~g~~ 173 (343) ......-.++-...++-.|+..-+. .|+..... ++....++.++..+..++ ..|.= +..+.. T Consensus 69 ~~~~~~p~i~~~~~i~~~d~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~i~~~~E~m~~~al~~gki~~~g~~~~ 148 (346) T protein:vir:63 69 TKTFRPAYVKPKDVINPNRTLKRRAGEQPIIGGMSLQERFQAVVADSQLEQRQRIENRIEWMCAMATIYGYVDVVGEAFP 148 (346) T ss_pred eeEeecCccCccceeCHHHHHHHhhhhhhccCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCEEEeeCCcee Confidence 1122223345566677777754221 12211221 222233333332222221 11210 000111 Q ss_pred ee-eecCCceee---cccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCCCcchhhhh Q lcl|NC_019525. 174 GL-LTQTGNVVN---NTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTKQ 249 (343) Q Consensus 174 GL-lN~p~v~~~---~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~ 249 (343) -+ ++ =+++.. +.+++..|.+.|++ ++.||.+.+..+...++. 
.|++++|.++.|..|.+- ..+.+ T Consensus 149 ~~~vd-fg~~~~~~~~lt~~~~W~~~~ad-p~~di~~~~~~~~~~~g~--~~~~~i~~~~~~~~l~~~-------~~v~~ 217 (346) T protein:vir:63 149 MQRVD-FGRDPALTVQLTGGAAWDQATSD-PLGNIQTMRTTAWKKSNS--TITRLTMGLDAWSLFSQK-------PAVVE 217 (346) T ss_pred EEEEe-eCCCccceeeecccccCCCCCCC-HHHHHHHHHHHHHHccCC--ceEEEEECHHHHHHHhcC-------HHHHH Confidence 11 11 122211 12345679877665 899999999999888776 588999999999988521 12222 Q ss_pred HHHhhcchhcCCcc---eEE-eechhhhhcc-cccCCCccEEEEEEc----CcceEEEecCCchhhc-chhhcCCceEEe Q lcl|NC_019525. 250 VLEDTFKEITRNSS---FEI-LPCVYADKIT-AQVPAVAKRYALYND----NEDSLRMDIPVDYTST-LANSVNNFQFQN 319 (343) Q Consensus 250 ~l~~n~~~~~~g~~---l~I-~~~~~~~~~~-~~g~gg~drmv~Y~~----d~~~v~~~iP~~~~~l-~p~~~~~l~~~v 319 (343) .+..+......... +.. ..+.+..... ....+|-+ .+.|+. +....+--+|-..-.+ ++...+.+.|-. T Consensus 218 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~gi~-i~~y~~~y~d~~G~~~~~ip~~~v~~~p~~~~g~~~yg~ 296 (346) T protein:vir:63 218 LLNLFYKGSTSDFNRSRLDDGSPVQYQGTIGGYNGMGTLE-LYTYHDTYTGDDNTEQEILGSYDVVGTGPGLQGTQCFGA 296 (346) T ss_pred HHhhhccccccccchhhcccchhhhhhhhHhhhhccCCeE-EEEeccEEEcCCCceeccccCCeEEEEecCCcceEEEee Confidence 22111100000000 000 0000000000 00000000 111110 0000010011111001 111111111111 Q ss_pred ceeeeeccEE-EEcCceEEeeecCC Q lcl|NC_019525. 320 AAYGQFTGVL-AYRPKELLYLDIPV 343 (343) Q Consensus 320 ~~~~r~GGv~-v~yP~a~~Y~D~~~ 343 (343) +.....+.+. -+||....--| |+ T Consensus 297 ~~d~~~~~~~~~~~~~~~~~~d-p~ 320 (346) T protein:vir:63 297 IMDFKNGLVPTRMFPKMWEEED-PS 320 (346) T ss_pred ccccccCcccceeeeEEEEecC-CC Confidence 1001111000 02333222212 22 No 129 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=23.17 E-value=2.3 Score=18.56 Aligned_cols=303 Identities=10% Similarity=-0.027 Sum_probs=100.9 Q ss_pred CCceeeecCCch-----HHHHHHHHhhhccchhhhhhhhhhhhhH----HHH---------------------------- Q lcl|NC_019525. 
1 MKKFVIRNSKGE-----KILLNAQEAKIAGVIQRLCNDLGFEIDV----TTL---------------------------- 43 (343) Q Consensus 1 ~~~~~~~~~~~~-----~~~~~a~~~~~~~~~~~~~~d~~~~f~~----~qL---------------------------- 43 (343) .++......+-+ .+...++..++...-.......+..|.. ... T Consensus 172 ~~~~~~~~~e~~~~l~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 251 (517) T protein:vir:97 172 ENGGDNAALKTVSELAANLMKQRESEKILGVEALKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPA 251 (517) T ss_pred HHHHHHHHHhhhhhhhhhHHHHHHhhhhcccccccccchhhHHHHHHHHHHHHHHhcccccccceeeeeccccccccccc Confidence 000000000000 0000000000000000000000000000 000 Q ss_pred -HHhhhhhhhcccccccchhhcceecCCCCceeEEEeeeccchhhhhhcccccCCccccccccccccccceeeeeEEEEe Q lcl|NC_019525. 44 -TTLMKKIIEQKFFEISPADYMPIRVGEGAWSTMLTTYRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAK 122 (343) Q Consensus 44 -~~i~~~iye~~~~~l~~~~~~pv~~~~~~w~~~~~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~ 122 (343) ..+-..+...-.+.-....++++... ..-........+.|..+. ++ ...|.-+..+...+.+++.++. T Consensus 252 p~~~~~~i~~~~~~~~~i~~~~~~~~i---~~~~~~~~~~~~~a~~~~-------eG-~~kp~s~~tf~~~~~~~~~ia~ 320 (517) T protein:vir:97 252 PAGILKRIQDAVNDEGSLLPFIRHENL---PTLVVGGDNALTQGTGHT-------TG-TDKTESNITLQTRVLTPQYVYK 320 (517) T ss_pred chHHHHHHHHhhhhhccceeeeeeccc---cceeeecccccceeeeee-------cC-CcccccccceeeEEeeHhhhhh Confidence 00000000000000000111111000 000000000111111111 11 2356666677777777777655 Q ss_pred eEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHHhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHH Q lcl|NC_019525. 123 TIGWTLPELAEAMKSGNWDLITALEKSRKKNWDLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKV 202 (343) Q Consensus 123 ~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~ 202 (343) -...|.+-|+-+..----.|.+--..-.+.++...+++...+|+-...+..|.++..+.... .+.-.+.+..+++. 
T Consensus 321 ~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~GdGtg~~~~gi~~~a~~~~~----~~~~~~~~~~d~i~ 396 (517) T protein:vir:97 321 YIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWA----TNVTGTTNIQELLE 396 (517) T ss_pred hhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCccccccccccccccc----ccccccchHHHHHH Confidence 54455554443321000015555566777888888899999995332334455543221110 11111222333333 Q ss_pred HHHHHHHHHHhcCCceecCCeEEeCHHHHHHHhccccCCCcchhhhhHH-HhhcchhcCCcceEEeechhhhhcccccCC Q lcl|NC_019525. 203 LCAGIIDVYRQGCDYTAMPNKFTIPESDYTGLAGAASADFPIKSTKQVL-EDTFKEITRNSSFEILPCVYADKITAQVPA 281 (343) Q Consensus 203 Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L~~~~~s~~~~~tl~~~l-~~n~~~~~~g~~l~I~~~~~~~~~~~~g~g 281 (343) .+..++.. . ....++|-+..|..|...+ |+.+.-|+.-. ........-|. -++.|..|... -..+. T Consensus 397 ~l~~a~~~---a-----~~a~~vmn~~t~~~I~klK--D~~G~Yl~~~~~~~~~~~~l~G~-~~~~~~~~~~~-~~~~~- 463 (517) T protein:vir:97 397 KLSVATPK---A-----ADSTLVIHRNDLAAIRFLK--DKNGNYVFPVGVSNQTIATHFGF-NRLVQSVAVDE-KTAVS- 463 (517) T ss_pred HHHHHhhh---c-----cCCEEEECHHHHHHHHHhh--cCCCCeeccCcCCcccccccCCc-cccccccccCc-eeEee- Confidence 33332221 1 1346899999999996544 54444443111 00000000011 01122222110 00000 Q ss_pred CccEEEEEEcCcceEEEecCCchhhcchhhcCCceEEeceeeeeccEEEEcCceEEeeec--CC Q lcl|NC_019525. 282 VAKRYALYNDNEDSLRMDIPVDYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDI--PV 343 (343) Q Consensus 282 g~drmv~Y~~d~~~v~~~iP~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~--~~ 343 (343) ..+-.++-...-++++. |-+ ..+...|. ...|+|| .|+.|++++|.-. 
|| T Consensus 464 ~~~y~i~~~~g~~~~~~-----fd~----~~n~~~f~--~~~~~~g-~i~~~~r~a~~~~~p~~ 515 (517) T protein:vir:97 464 LSGYVTNGSRGMEFEQG-----TIL----VENNKEYL--FEMPISG-SLEYKGTTAYGTYTPPV 515 (517) T ss_pred ccccEEEeecceeeeee-----eec----ccCceeEe--eeeeecc-ccccccceEEEEEcCCC Confidence 00111111111111111 111 11222232 2467777 7888998887654 33 No 130 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=20.96 E-value=2.7 Score=18.24 Aligned_cols=285 Identities=11% Similarity=0.026 Sum_probs=104.1 Q ss_pred CCceeeecCCchHHHHHHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcc---eecCCCCceeEE Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMP---IRVGEGAWSTML 77 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~p---v~~~~~~w~~~~ 77 (343) || ++.-.++--... +.+...|+|.--.+-.-.+.+| |+.+...|-+.- T Consensus 1 mp--------------------------altLaea~k~~~---d~l~~~ViE~~~~~s~lL~~LpF~~veg~~~~ynR~~ 51 (310) T protein:vir:97 1 MA--------------------------SVTLAESAKLAQ---DELVAGVIENIITVNRMFDVLPFDSIEGNSLAYNREN 51 (310) T ss_pred Cc--------------------------ccchHHHhhcCc---chHHHHHHHHHhccchHHHhCCcccccCCcceeeEee Confidence 22 111111111111 1112223332111111223344 433322222222 Q ss_pred EeeeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHH--HHH-hcCCCcHHH--HHHHHHH Q lcl|NC_019525. 78 TTYRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAE--AMK-SGNWDLITA--LEKSRKK 152 (343) Q Consensus 78 ~~~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~--A~~-~Gr~~L~~~--k~~aar~ 152 (343) + ...++.+..- ... ++ ...|.....+.+.+..+..++-.+ |+-. ++. .++ +-+.. 
.....-+ T Consensus 52 ~-~~~~~~~~v~---~~~-~~--~g~~~~~~t~~~~~~~L~i~~g~~-----~Vd~~i~dl~~~~-~~dq~~~Ql~~~ie 118 (310) T protein:vir:97 52 V-LGDVIMAGVG---TTF-SG--AGAGKAAATFTKVNSNLTTIMGDA-----EVNGLIQATRSGD-GNDQTAVQIASKAK 118 (310) T ss_pred c-cCCccccccc---ccc-cC--CCccccccccceeeeeeeeeeehh-----hhhhHHHhhhcCC-hHHHHHHHHHHHHH Confidence 1 1122322211 000 01 122333333444444455544433 3322 121 222 22221 1223333 Q ss_pred HHHhhhhheEeeeeccccceeeeeecCCc-eeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHH-- Q lcl|NC_019525. 153 NWDLGIQEIAFVGMKDNANVKGLLTQTGN-VVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPES-- 229 (343) Q Consensus 153 a~~~~~n~v~~~G~~~~~g~~GLlN~p~v-~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~-- 229 (343) ++.++.++..++|+.+...+.||+.+-.- ....+. ..-...| ++|+.++++.+|..-+ .|+.|++-|. T Consensus 119 a~~~~~e~~lINGD~a~n~F~GL~~~~~~~q~i~~~--~~gg~~t----~d~LDeLl~~v~~~~g---~p~~~l~~~~~~ 189 (310) T protein:vir:97 119 SAGRKYQDQLINGNGAGNEFAGLIQLCASGQKATTG--ATGSAIS----FAILDELMDLVVDKDG---QVDYLTMHARTL 189 (310) T ss_pred HHHHHHHHHhhccccCCCcccchhhcCCccceeecC--CCCCCCC----HHHHHHHHHHHhcCCC---CCCEEEecHHHH Confidence 44455567788997656678899875321 111111 0112234 4789999999987655 4788999996 Q ss_pred -HHHHHhccccC-CCcchhhhhHHHhhcchhcCCcceEEeechhhhhccc-ccCCCccEEEEEEcCcc-----eEEEecC Q lcl|NC_019525. 230 -DYTGLAGAASA-DFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITA-QVPAVAKRYALYNDNED-----SLRMDIP 301 (343) Q Consensus 230 -~~~~L~~~~~s-~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~-~g~gg~drmv~Y~~d~~-----~v~~~iP 301 (343) ++.-+.+.... .....+. ..+-+-.. .-+|.|| .++-+...-+. 
...+|+..+.+.+-..+ ++.++.+ T Consensus 190 r~i~A~~R~~~~~g~~~~~~-~~~G~~v~-~~~GiPi--~~~d~ip~~~~~~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~ 265 (310) T protein:vir:97 190 RSYKALLRALGGASINEVVE-LPSGAEVP-AYSGTPI--FRNDYIPTNQTKGGTTGCTTIFAGTLDDGSRTHGIAGLTAT 265 (310) T ss_pred HHHHHHHHHhcCCCCCCccc-cCCCCEEe-eeCCeEE--EEeCccCCCccccccCCceeEEEEeeCccccccceeccccC Confidence 45555432110 0000111 11111111 1245553 34433222111 12346777777776654 2222111 Q ss_pred C----chhhcc-hhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 302 V----DYTSTL-ANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 302 ~----~~~~l~-p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) - ..++.- -+.+.-..+.|-. .=|+.+.-|.+++-+.==- T Consensus 266 ~~~glsVr~~G~~~~~~v~~~~V~~---Y~~~av~~~~A~a~L~~V~ 309 (310) T protein:vir:97 266 QAAGIQVVDVGESEDSDEHIWRVKW---YCGLALFSEKGLACADGIT 309 (310) T ss_pred CccceeEEeCCcccCCcceeEEEEE---eeeEEEecccceeeecccc Confidence 0 000000 0011111122211 1233333333333221111 No 131 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=20.75 E-value=2.7 Score=18.20 Aligned_cols=286 Identities=10% Similarity=-0.002 Sum_probs=115.4 Q ss_pred CCceeeecCCchHHHHHHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCc--eeEEE Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAW--STMLT 78 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w--~~~~~ 78 (343) |.|-++ |+-| +++++-|--.-.. - +-.++ .|...-..++++.+....+..-+-++ ...+| +.++. 
T Consensus 1 ~~~~~~-~~~~-~~~~~~~~~~~~~------~-~~nt~---~l~~k~~~~LD~~~~~~~~s~~~~~N-~~~e~~gg~tVk 67 (319) T protein:vir:97 1 MNKTIK-NATG-MLKLNLQHFANKS------V-EPGQT---LLKNKHVGILERVTAVNAYSTPALIS-NDAIFMEGRSFT 67 (319) T ss_pred CCcccc-cccc-eeEeehhhhhccC------C-CcchH---HHHHHHHHHHHHHHHHhhhhhhcccC-cceEeccCcEEE Confidence 766543 4443 3444433211000 0 11111 12222222223222222222211111 11122 44443 Q ss_pred e--eeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHH--HHHHHHHH Q lcl|NC_019525. 79 T--YRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITAL--EKSRKKNW 154 (343) Q Consensus 79 ~--~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k--~~aar~a~ 154 (343) . .+.+|+++.=-+++...| ++..++....+-+ .-++.+.++++..++..+ .|.... .+-+|... T Consensus 68 Ip~i~~~gl~DY~R~~g~~~g---------~vt~~~~t~tidq-dR~~~F~VD~~D~~Etn~--~l~a~~i~~~~~~~~v 135 (319) T protein:vir:97 68 VMKGDTTELKDYKRNATNEFD---------HPKIEETTYFLDQ-EKYWGRFVDALDRKDTEG--NIDINYVVARQGAEVV 135 (319) T ss_pred EeeecccccccccCCCCcccC---------CcccceeEEEeec-ccccccccchhhHhhhhc--hhhHHHHHHHHHHHHh Confidence 3 345566654222222211 2233444433333 557778889988776443 454333 33444444 Q ss_pred HhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHH Q lcl|NC_019525. 
155 DLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGL 234 (343) Q Consensus 155 ~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L 234 (343) .-.+|...|--..+ +.+ +..-...|++.+++.|.++...+....-- ..-.|.++|..|..| T Consensus 136 ~PEiDay~~skla~---------~a~--------~~~~~~~t~~n~y~~i~~a~~~Lde~~VP--~~Rvl~Vtp~~~~~L 196 (319) T protein:vir:97 136 APYLDNLRFATLAR---------NKA--------KHLTVGTGSDAQYDAVLDVSVELDEIKAP--ENRVLFVSPTFYKGI 196 (319) T ss_pred hhhhhHHHHHHHHh---------hcc--------cccccccCHHHHHHHHHHHHHHHHhcCCC--CCcEEEeCHHHHHHH Confidence 44445332221111 000 11112357788899999988888665321 234689999999999 Q ss_pred hcc-ccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecCCchhhcchhhcC Q lcl|NC_019525. 235 AGA-ASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPVDYTSTLANSVN 313 (343) Q Consensus 235 ~~~-~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~~~l~p~~~~ 313 (343) ..- +.....+ +....+.+.....-.|.++--.|...+++.+- -.+..+.+++-..-+.+++. .|.|.. T Consensus 197 ~~~~~f~~~~~-~~~~~~~~g~Vg~idG~~Vi~vps~~~k~in~--i~~h~~A~~~~~k~~~~~~~--------~p~~~~ 265 (319) T protein:vir:97 197 KKFVIALPQGD-TRQQVLGKGVQGELDGFVIVKVPTKLLQGLQA--IAVVGEVLASPIQADLAKTN--------SNIPGM 265 (319) T ss_pred Hhhhhhhcccc-ccccceeeeeceeecCeEEEEecccccccceE--EEEcCCeeeeeeeeeeeecc--------CCCccc Confidence 432 1111111 01111111111111234332222211111000 00111222222222222221 122221 Q ss_pred CceEEeceeeeeccEEEEcCc-eEEeeecCC Q lcl|NC_019525. 314 NFQFQNAAYGQFTGVLAYRPK-ELLYLDIPV 343 (343) Q Consensus 314 ~l~~~v~~~~r~GGv~v~yP~-a~~Y~D~~~ 343 (343) . .+.+-+ -++.|+.|..|+ ..+|+..+. 
T Consensus 266 ~-a~~v~g-r~y~d~~V~~~k~~~Iy~~~~~ 294 (319) T protein:vir:97 266 F-GTLAEQ-LLYTGAFVPEHLQKYIFTIGGT 294 (319) T ss_pred c-ceeeee-eeeeeeEEeccccceEEEeecC Confidence 1 233433 468899999988 455875555 No 132 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=20.75 E-value=2.7 Score=18.20 Aligned_cols=286 Identities=10% Similarity=-0.002 Sum_probs=115.4 Q ss_pred CCceeeecCCchHHHHHHHHhhhccchhhhhhhhhhhhhHHHHHHhhhhhhhcccccccchhhcceecCCCCc--eeEEE Q lcl|NC_019525. 1 MKKFVIRNSKGEKILLNAQEAKIAGVIQRLCNDLGFEIDVTTLTTLMKKIIEQKFFEISPADYMPIRVGEGAW--STMLT 78 (343) Q Consensus 1 ~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~d~~~~f~~~qL~~i~~~iye~~~~~l~~~~~~pv~~~~~~w--~~~~~ 78 (343) |.|-++ |+-| +++++-|--.-.. - +-.++ .|...-..++++.+....+..-+-++ ...+| +.++. T Consensus 1 ~~~~~~-~~~~-~~~~~~~~~~~~~------~-~~nt~---~l~~k~~~~LD~~~~~~~~s~~~~~N-~~~e~~gg~tVk 67 (319) T protein:vir:94 1 MNKTIK-NATG-MLKLNLQHFANKS------V-EPGQT---LLKNKHVGILERVTAVNAYSTPALIS-NDAIFMEGRSFT 67 (319) T ss_pred CCcccc-cccc-eeEeehhhhhccC------C-CcchH---HHHHHHHHHHHHHHHHhhhhhhcccC-cceEeccCcEEE Confidence 766543 4443 3444433211000 0 11111 12222222223222222222211111 11122 44443 Q ss_pred e--eeccchhhhhhcccccCCccccccccccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHH--HHHHHHHH Q lcl|NC_019525. 79 T--YRSFSLAEDFATGIIDTGNSNGKLAAVDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITAL--EKSRKKNW 154 (343) Q Consensus 79 ~--~~~vg~a~~ia~g~~~~g~~a~Dip~vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k--~~aar~a~ 154 (343) . .+.+|+++.=-+++...| ++..++....+-+ .-++.+.++++..++..+ .|.... .+-+|... 
T Consensus 68 Ip~i~~~gl~DY~R~~g~~~g---------~vt~~~~t~tidq-dR~~~F~VD~~D~~Etn~--~l~a~~i~~~~~~~~v 135 (319) T protein:vir:94 68 VMKGDTTELKDYKRNATNEFD---------HPKIEETTYFLDQ-EKYWGRFVDALDRKDTEG--NIDINYVVARQGAEVV 135 (319) T ss_pred EeeecccccccccCCCCcccC---------CcccceeEEEeec-ccccccccchhhHhhhhc--hhhHHHHHHHHHHHHh Confidence 3 345566654222222211 2233444433333 557778889988776443 454333 33444444 Q ss_pred HhhhhheEeeeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCCeEEeCHHHHHHH Q lcl|NC_019525. 155 DLGIQEIAFVGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPNKFTIPESDYTGL 234 (343) Q Consensus 155 ~~~~n~v~~~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~tl~lp~~~~~~L 234 (343) .-.+|...|--..+ +.+ +..-...|++.+++.|.++...+....-- ..-.|.++|..|..| T Consensus 136 ~PEiDay~~skla~---------~a~--------~~~~~~~t~~n~y~~i~~a~~~Lde~~VP--~~Rvl~Vtp~~~~~L 196 (319) T protein:vir:94 136 APYLDNLRFATLAR---------NKA--------KHLTVGTGSDAQYDAVLDVSVELDEIKAP--ENRVLFVSPTFYKGI 196 (319) T ss_pred hhhhhHHHHHHHHh---------hcc--------cccccccCHHHHHHHHHHHHHHHHhcCCC--CCcEEEeCHHHHHHH Confidence 44445332221111 000 11112357788899999988888665321 234689999999999 Q ss_pred hcc-ccCCCcchhhhhHHHhhcchhcCCcceEEeechhhhhcccccCCCccEEEEEEcCcceEEEecCCchhhcchhhcC Q lcl|NC_019525. 235 AGA-ASADFPIKSTKQVLEDTFKEITRNSSFEILPCVYADKITAQVPAVAKRYALYNDNEDSLRMDIPVDYTSTLANSVN 313 (343) Q Consensus 235 ~~~-~~s~~~~~tl~~~l~~n~~~~~~g~~l~I~~~~~~~~~~~~g~gg~drmv~Y~~d~~~v~~~iP~~~~~l~p~~~~ 313 (343) ..- +.....+ +....+.+.....-.|.++--.|...+++.+- -.+..+.+++-..-+.+++. .|.|.. T Consensus 197 ~~~~~f~~~~~-~~~~~~~~g~Vg~idG~~Vi~vps~~~k~in~--i~~h~~A~~~~~k~~~~~~~--------~p~~~~ 265 (319) T protein:vir:94 197 KKFVIALPQGD-TRQQVLGKGVQGELDGFVIVKVPTKLLQGLQA--IAVVGEVLASPIQADLAKTN--------SNIPGM 265 (319) T ss_pred Hhhhhhhcccc-ccccceeeeeceeecCeEEEEecccccccceE--EEEcCCeeeeeeeeeeeecc--------CCCccc Confidence 432 1111111 01111111111111234332222211111000 00111222222222222221 122221 Q ss_pred CceEEeceeeeeccEEEEcCc-eEEeeecCC Q lcl|NC_019525. 
314 NFQFQNAAYGQFTGVLAYRPK-ELLYLDIPV 343 (343) Q Consensus 314 ~l~~~v~~~~r~GGv~v~yP~-a~~Y~D~~~ 343 (343) . .+.+-+ -++.|+.|..|+ ..+|+..+. T Consensus 266 ~-a~~v~g-r~y~d~~V~~~k~~~Iy~~~~~ 294 (319) T protein:vir:94 266 F-GTLAEQ-LLYTGAFVPEHLQKYIFTIGGT 294 (319) T ss_pred c-ceeeee-eeeeeeEEeccccceEEEeecC Confidence 1 233433 468899999988 455875555 No 133 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=20.63 E-value=2.7 Score=18.19 Aligned_cols=290 Identities=10% Similarity=0.021 Sum_probs=109.5 Q ss_pred eecCCchHHHHHHHHhhhccchhhhhhhh--hhhhhHHHHHHhhhhhhhccccccc----chhhcceecCCCCceeEEEe Q lcl|NC_019525. 6 IRNSKGEKILLNAQEAKIAGVIQRLCNDL--GFEIDVTTLTTLMKKIIEQKFFEIS----PADYMPIRVGEGAWSTMLTT 79 (343) Q Consensus 6 ~~~~~~~~~~~~a~~~~~~~~~~~~~~d~--~~~f~~~qL~~i~~~iye~~~~~l~----~~~~~pv~~~~~~w~~~~~~ 79 (343) --|..+..+..+ +..+..+ ..+.++ +.+.=|. ...+. .+.++.+.+.. -+.++.. T Consensus 1 m~~~~~~~~~t~----------~g~~~~~~d~~al~i------k~f~~eV-~~~f~~~s~~~~~~~~r~i~--~G~sv~i 61 (347) T protein:vir:94 1 MANVPGQKIGTD----------QGKGKSSSDALALFL------KVFAGEV-LTAFTRRSVTADKHIVRTIQ--NGKSAQF 61 (347) T ss_pred CCCCCccccccc----------cccCCccccHHHHHH------HHHhHHH-HHHHHHHHhhhccccccccc--ccceEEE Confidence 111111111100 0111111 112221 1111111 11111 12223333221 1233322 Q ss_pred eeccchh--hhhhcccccCCcccccccc--ccccccceeeeeEEEEeeEeecHHHHHHHHHhcCCCcHHHHHHHHHHHHH Q lcl|NC_019525. 80 YRSFSLA--EDFATGIIDTGNSNGKLAA--VDTGVDAVSIKTYKWAKTIGWTLPELAEAMKSGNWDLITALEKSRKKNWD 155 (343) Q Consensus 80 ~~~vg~a--~~ia~g~~~~g~~a~Dip~--vd~~~~~~~~~v~~~g~~~~ys~~EL~~A~~~Gr~~L~~~k~~aar~a~~ 155 (343) ..+|.. +....|. +|+. .+..-.+....|-.+- -+..-++++..+|.. .++-+.-.+.+-.++. 
T Consensus 62 -~~iG~~tv~~~t~G~--------~l~~~~~~~~~~e~~itID~~~-~~~~~VddiD~~q~~--~D~~~~~~~~~g~aLa 129 (347) T protein:vir:94 62 -PVMGRTSGVYLAPGE--------RLSDKRKGIKHTEKVITIDGLL-TADVMIFDIEDAMNH--YDVAGEYSNQLGEALA 129 (347) T ss_pred -ecccceeeeeecCCC--------CcCCCCCCCCcceEEEEecchh-hhhHHhhhHHHHhcC--cchHHHHHHHHHHHHH Confidence 222222 2222121 2211 1222233333333322 112347788888754 4677777777777777 Q ss_pred hhhhheEe---------eeeccccceeeeeecCCceeecccCCcccccCCHHHHHHHHHHHHHHHHhcCCceecCC---e Q lcl|NC_019525. 156 LGIQEIAF---------VGMKDNANVKGLLTQTGNVVNNTFLTKSIKSMTPAELKVLCAGIIDVYRQGCDYTAMPN---K 223 (343) Q Consensus 156 ~~~n~v~~---------~G~~~~~g~~GLlN~p~v~~~~a~~~~~w~~kT~~eIl~Din~~l~~v~~~s~~~~~p~---t 223 (343) +..|+.++ .+ .++....|+.. +++......+...-..++++.+.+-|.++...+.... .|. . T Consensus 130 ~~~D~~i~~~~~~~aa~~~-~~~~~~~g~~~-~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~----VP~~~R~ 203 (347) T protein:vir:94 130 IAADGAVLAEMAILCNLPA-ASNENIAGLGT-ASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNY----VPAGDRY 203 (347) T ss_pred HHHHHHHHHHHHHHhcccc-ccccccCCCcc-cceeeccccccccchhhhHHHHHHHHHHHHHHHhhcC----CCCCCcE Confidence 77776442 22 22223344432 2221111122223345667777776666665554332 244 7 Q ss_pred EEeCHHHHHHHhccccCCCcchhhhhHHHhhcchhcCCcce-----EEeechhhhh--ccc-------ccCCCc------ Q lcl|NC_019525. 224 FTIPESDYTGLAGAASADFPIKSTKQVLEDTFKEITRNSSF-----EILPCVYADK--ITA-------QVPAVA------ 283 (343) Q Consensus 224 l~lp~~~~~~L~~~~~s~~~~~tl~~~l~~n~~~~~~g~~l-----~I~~~~~~~~--~~~-------~g~gg~------ 283 (343) ++++|.+|..|...+.-.. ..+..+. ...+|.=. +|..+..+.. .++ ....|. 
T Consensus 204 ~vv~P~~~~~Ll~~~~~~~-----~~~~~~~--~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~ 276 (347) T protein:vir:94 204 FYTTPDNYSAILAALMPNA-----ANYAALI--DPETGNIRNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPA 276 (347) T ss_pred EEeCHHHHHHHhccchhhh-----hhccccc--cccccceEEEeceEEEecCcccccccccccccCcceecCcccccccc Confidence 9999999998854332111 1111111 11122211 1222221110 000 000111 Q ss_pred ------------cEEEEEEcCcc-eEEEecCCchhhcchhhcCCceEEeceeeeeccEEEEcCceEEeeecCC Q lcl|NC_019525. 284 ------------KRYALYNDNED-SLRMDIPVDYTSTLANSVNNFQFQNAAYGQFTGVLAYRPKELLYLDIPV 343 (343) Q Consensus 284 ------------drmv~Y~~d~~-~v~~~iP~~~~~l~p~~~~~l~~~v~~~~r~GGv~v~yP~a~~Y~D~~~ 343 (343) -..++|.++.= .++. +++..+..-++.+. ..+ +-+.. .=|..+.+|++++-+-.+. T Consensus 277 ~~~~~~~~~~~~~~~l~~h~~A~~~v~~-~~~~~e~~r~~~~~-~d~-i~~~~-~~G~~~~rP~~a~~~~~~~ 345 (347) T protein:vir:94 277 TASSDVKVTMDNVVGLFSHRSAVGTVKL-RDLALERDRDVDAQ-GDL-IVGKY-AMGHGGLRPEAAGALVFSP 345 (347) T ss_pred cchhhhcccccceeEEEeehhhhhhhhc-ccccccchhchhhH-HHH-hhhhh-hhcCcccccceeEEEEecC Confidence 12233322210 1111 12222222222221 122 22332 3378899999999998888 Done!