Query         lcl|Aclame:protein:vir:94895|NCBI_annot:putative structural protein|genbank:acc:YP_762517;genbank:gi:115304216;genbank:GeneID:5141208
Match_columns 393
No_of_seqs    8 out of 12
Neff          2.7
Searched_HMMs 1612
Date          Sun Dec 1 23:36:51 2013
Command       /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_49 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_49_vs_rec_db.hhr

 No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
  1 protein:vir:1663 Length: 393 # 100.0 4E-182 2E-185 1015.4 21.6 393 1-393 1-393 (393)
  2 protein:vir:93966 Length: 400 100.0 6E-178 4E-181 992.1 22.5 393 1-393 8-400 (400)
  3 protein:vir:93858 Length: 400 100.0 9E-166 5E-169 925.5 27.9 393 1-393 8-400 (400)
  4 protein:vir:861 Length: 318 # 100.0 1E-142 7E-146 798.9 16.5 318 76-393 1-318 (318)
  5 protein:vir:94870 Length: 318 100.0 2E-129 1E-132 726.7 16.7 318 76-393 1-318 (318)
  6 protein:vir:97397 Length: 517 100.0 2.6E-33 1.6E-36 199.2 20.3 360 1-393 124-512 (517)
  7 protein:vir:4074 Length: 480 # 99.9 1.2E-23 7.4E-27 146.3 16.1 345 1-393 96-475 (480)
  8 protein:vir:9704 Length: 394 # 99.7 1.1E-18 6.6E-22 119.1 23.4 350 1-393 1-388 (394)
  9 protein:vir:81070 Length: 390 99.7 6.8E-19 4.2E-22 120.2 19.0 364 1-393 1-390 (390)
 10 protein:vir:79987 Length: 415 99.7 5.9E-18 3.7E-21 115.0 23.4 366 1-393 1-402 (415)
 11 protein:vir:81100 Length: 415 99.7 5.9E-18 3.7E-21 115.0 23.4 366 1-393 1-402 (415)
 12 protein:vir:98339 Length: 415 99.7 5.9E-18 3.7E-21 115.0 23.4 366 1-393 1-402 (415)
 13 protein:vir:191 Length: 385 # 99.7 2E-18 1.3E-21 117.6 20.6 359 1-393 1-382 (385)
 14 protein:vir:1886 Length: 385 # 99.7 2E-18 1.3E-21 117.6 20.6 359 1-393 1-382 (385)
 15 protein:vir:97053 Length: 390 99.7 2.7E-18 1.7E-21 116.9 20.2 364 1-393 1-390 (390)
 16 protein:vir:10364 Length: 390 99.7 6.2E-18 3.8E-21 114.9 21.5 367 1-393 1-390 (390)
 17 protein:vir:7409 Length: 408 # 99.7 2.8E-18 1.7E-21 116.8 19.5 353 1-393 1-391 (408)
 18 protein:vir:100884 Length: 389 99.7 2.3E-18 1.4E-21 117.3 18.8 347 6-393 1-380 (389)
 19 protein:vir:4600 Length: 415 # 99.7 1.4E-17 8.7E-21 113.0 23.0 364 1-393 1-402 (415)
 20 protein:vir:4700 Length: 415 # 99.7 1.4E-17 8.7E-21 113.0 23.0 364 1-393 1-402 (415)
 21 protein:vir:100172 Length: 394 99.7 3.5E-18 2.2E-21 116.3 19.5 346 1-393 1-382 (394)
 22 protein:vir:95376 Length: 425 99.7 2.5E-17 1.6E-20 111.6 23.7 371 1-393 14-419 (425)
 23 protein:vir:4953 Length: 397 # 99.7 2.6E-18 1.6E-21 117.0 18.3 341 1-393 1-383 (397)
 24 protein:vir:4997 Length: 397 # 99.7 5.7E-18 3.5E-21 115.1 19.6 338 1-393 1-383 (397)
 25 protein:vir:4830 Length: 397 # 99.7 1.2E-17 7.3E-21 113.4 20.1 345 1-393 1-383 (397)
 26 protein:vir:102119 Length: 404 99.7 2.2E-17 1.4E-20 111.9 21.0 365 1-393 1-398 (404)
 27 protein:vir:9410 Length: 415 # 99.7 7.7E-17 4.8E-20 108.9 23.9 366 1-393 1-402 (415)
 28 protein:vir:3991 Length: 404 # 99.7 2E-17 1.2E-20 112.1 19.7 349 1-393 2-391 (404)
 29 protein:vir:3845 Length: 395 # 99.7 3E-17 1.8E-20 111.2 20.1 340 1-393 1-381 (395)
 30 protein:vir:4511 Length: 409 # 99.6 1.4E-16 8.5E-20 107.5 23.7 364 1-393 1-404 (409)
 31 protein:vir:962 Length: 397 # 99.6 1.2E-16 7.2E-20 107.9 23.3 350 1-393 1-395 (397)
 32 protein:vir:3870 Length: 400 # 99.6 2.4E-16 1.5E-19 106.2 24.8 355 1-393 1-397 (400)
 33 protein:vir:1268 Length: 397 # 99.6 2.3E-16 1.4E-19 106.3 24.4 352 1-393 3-395 (397)
 34 protein:vir:4339 Length: 395 # 99.6 6.3E-17 3.9E-20 109.4 20.5 363 1-393 1-393 (395)
 35 protein:vir:1025 Length: 408 # 99.6 5.7E-17 3.6E-20 109.6 20.0 352 1-393 1-391 (408)
 36 protein:vir:102873 Length: 392 99.6 8.4E-17 5.2E-20 108.7 20.4 340 1-393 1-382 (392)
 37 protein:vir:107593 Length: 392 99.6 8.4E-17 5.2E-20 108.7 20.4 340 1-393 1-382 (392)
 38 protein:vir:105004 Length: 392 99.6 8.4E-17 5.2E-20 108.7 20.4 340 1-393 1-382 (392)
 39 protein:vir:102082 Length: 392 99.6 8.4E-17 5.2E-20 108.7 20.4 340 1-393 1-382 (392)
 40 protein:vir:8102 Length: 543 # 99.6 5E-16 3.1E-19 104.5 23.1 367 1-393 118-540 (543)
 41 protein:vir:100135 Length: 418 99.6 6.5E-16 4E-19 103.8 21.3 361 1-393 4-413 (418)
 42 protein:vir:81160 Length: 371 99.6 4.7E-16 2.9E-19 104.6 20.1 327 1-393 1-369 (371)
 43 protein:vir:4456 Length: 401 # 99.6 3.3E-15 2E-18 100.0 24.6 363 1-393 1-399 (401)
 44 protein:vir:1084 Length: 437 # 99.6 2.8E-15 1.7E-18 100.4 23.9 354 1-393 1-425 (437)
 45 protein:vir:1383 Length: 421 # 99.5 3.2E-15 2E-18 100.0 20.0 345 1-393 1-390 (421)
 46 protein:vir:100247 Length: 425 99.5 5E-15 3.1E-18 99.0 20.5 354 1-393 1-422 (425)
 47 protein:vir:485 Length: 407 # 99.5 9.7E-15 6E-18 97.4 21.4 362 3-393 1-398 (407)
 48 protein:vir:1433 Length: 435 # 99.5 4.2E-15 2.6E-18 99.4 19.3 365 1-393 1-431 (435)
 49 protein:vir:4092 Length: 390 # 99.5 3.7E-15 2.3E-18 99.7 18.5 349 1-393 1-366 (390)
 50 protein:vir:105038 Length: 428 99.5 1.2E-14 7.4E-18 96.9 20.1 369 1-393 1-424 (428)
 51 protein:vir:80376 Length: 435 99.5 2.3E-14 1.4E-17 95.4 20.6 365 1-393 1-431 (435)
 52 protein:vir:1328 Length: 392 # 99.4 3.1E-14 1.9E-17 94.6 20.0 363 1-393 1-389 (392)
 53 protein:vir:6212 Length: 434 # 99.4 1.8E-13 1.1E-16 90.5 24.0 365 1-393 1-427 (434)
 54 protein:vir:9361 Length: 402 # 99.4 1.7E-13 1E-16 90.7 23.0 360 1-393 13-394 (402)
 55 protein:vir:104256 Length: 458 99.4 1.3E-13 8E-17 91.2 22.0 366 1-393 12-456 (458)
 56 protein:vir:2685 Length: 387 # 99.4 2.3E-13 1.4E-16 89.9 23.3 360 1-393 1-379 (387)
 57 protein:vir:94424 Length: 387 99.4 2.3E-13 1.4E-16 89.9 23.3 360 1-393 1-379 (387)
 58 protein:vir:96978 Length: 387 99.4 2.3E-13 1.4E-16 89.9 23.3 360 1-393 1-379 (387)
 59 protein:vir:94673 Length: 419 99.4 7.6E-14 4.7E-17 92.5 20.5 367 1-393 1-415 (419)
 60 protein:vir:80128 Length: 466 99.4 4.2E-13 2.6E-16 88.4 24.3 369 1-393 7-446 (466)
 61 protein:vir:93881 Length: 387 99.4 3.2E-13 2E-16 89.1 22.1 357 1-393 1-379 (387)
 62 protein:vir:101607 Length: 379 99.4 2.7E-13 1.7E-16 89.5 20.9 345 1-393 1-377 (379)
 63 protein:vir:6242 Length: 390 # 99.4 2.4E-13 1.5E-16 89.7 20.1 351 1-393 1-387 (390)
 64 protein:vir:81227 Length: 413 99.4 2.6E-13 1.6E-16 89.6 19.5 354 1-393 1-408 (413)
 65 protein:vir:9643 Length: 377 # 99.2 3.3E-12 2E-15 83.5 19.0 342 1-393 1-375 (377)
 66 protein:vir:101650 Length: 497 99.2 5.1E-12 3.2E-15 82.5 19.7 371 1-393 1-491 (497)
 67 protein:vir:7855 Length: 497 # 99.2 5.1E-12 3.2E-15 82.5 19.7 371 1-393 1-491 (497)
 68 protein:vir:9309 Length: 324 # 99.2 3.6E-13 2.3E-16 88.8 11.3 276 93-393 1-313 (324)
 69 protein:vir:98635 Length: 377 99.2 4E-12 2.5E-15 83.1 16.9 341 1-393 1-375 (377)
 70 protein:vir:99749 Length: 324 99.2 3.3E-13 2E-16 89.0 10.7 277 57-393 1-313 (324)
 71 protein:vir:78640 Length: 352 99.1 1.6E-11 1E-14 79.7 19.0 330 32-393 1-344 (352)
 72 protein:vir:8420 Length: 477 # 99.1 3.6E-11 2.2E-14 77.9 20.0 372 1-393 1-467 (477)
 73 protein:vir:97148 Length: 324 99.1 1E-12 6.3E-16 86.3 11.2 277 93-393 1-313 (324)
 74 protein:vir:78830 Length: 324 99.1 1.5E-12 9.1E-16 85.5 11.6 277 93-393 1-313 (324)
 75 protein:vir:96392 Length: 324 99.1 1.5E-12 9.1E-16 85.5 11.6 277 93-393 1-313 (324)
 76 protein:vir:103955 Length: 324 99.1 2.3E-12 1.4E-15 84.4 11.0 283 66-393 1-313 (324)
 77 protein:vir:105905 Length: 304 99.1 1.5E-12 9.5E-16 85.4 9.5 269 94-393 1-303 (304)
 78 protein:vir:94142 Length: 304 99.1 1.5E-12 9.5E-16 85.4 9.5 269 94-393 1-303 (304)
 79 protein:vir:95963 Length: 395 99.1 1.6E-10 1E-13 74.3 20.6 352 3-393 1-374 (395)
 80 protein:vir:9509 Length: 381 # 99.0 1.4E-10 8.6E-14 74.6 19.1 340 1-393 1-366 (381)
 81 protein:vir:101291 Length: 381 99.0 1.4E-10 8.6E-14 74.6 19.1 340 1-393 1-366 (381)
 82 protein:vir:96223 Length: 324 99.0 8E-12 5E-15 81.4 11.1 277 93-393 1-313 (324)
 83 protein:vir:100632 Length: 381 98.9 6.7E-10 4.1E-13 70.9 19.8 338 1-393 3-366 (381)
 84 protein:vir:41 Length: 299 # N 98.9 2.8E-11 1.7E-14 78.5 11.3 264 94-393 1-296 (299)
 85 protein:vir:2430 Length: 318 # 98.9 2.3E-11 1.4E-14 78.9 10.9 275 94-393 1-311 (318)
 86 protein:vir:4226 Length: 326 # 98.9 1.4E-11 8.6E-15 80.1 9.2 282 93-393 1-321 (326)
 87 protein:vir:78350 Length: 383 98.9 7.5E-10 4.6E-13 70.6 18.6 343 1-393 1-373 (383)
 88 protein:vir:7771 Length: 330 # 98.9 1.4E-11 8.5E-15 80.1 8.7 273 94-393 1-321 (330)
 89 protein:vir:78223 Length: 333 98.8 1.1E-10 6.7E-14 75.2 10.1 278 104-393 1-330 (333)
 90 protein:vir:2504 Length: 305 # 98.8 1.5E-10 9.2E-14 74.5 10.7 262 112-393 1-296 (305)
 91 protein:vir:95763 Length: 297 98.7 1.8E-10 1.1E-13 74.0 8.8 264 94-393 1-294 (297)
 92 protein:vir:78523 Length: 338 98.6 7.6E-10 4.7E-13 70.6 9.7 278 104-393 1-333 (338)
 93 protein:vir:2344 Length: 397 # 98.6 2.2E-09 1.4E-12 68.1 11.8 268 97-393 1-304 (397)
 94 protein:vir:1638 Length: 298 # 98.5 1.9E-09 1.2E-12 68.3 10.2 265 110-393 1-297 (298)
 95 protein:vir:4856 Length: 293 # 98.5 2.1E-09 1.3E-12 68.1 9.6 249 109-393 1-279 (293)
 96 protein:vir:104085 Length: 320 98.4 4.9E-09 3E-12 66.2 10.7 278 94-393 1-315 (320)
 97 protein:vir:9574 Length: 300 # 98.4 4.2E-09 2.6E-12 66.5 9.6 265 110-393 1-298 (300)
 98 protein:vir:9759 Length: 303 # 98.4 7E-09 4.3E-12 65.3 9.7 264 113-393 1-301 (303)
 99 protein:vir:99920 Length: 311 98.3 6.3E-09 3.9E-12 65.5 8.0 262 110-393 1-309 (311)
100 protein:vir:93616 Length: 645 98.3 3E-07 1.9E-10 56.4 17.1 364 1-393 193-637 (645)
101 protein:vir:94771 Length: 298 98.2 3.2E-08 2E-11 61.7 10.9 262 116-393 1-297 (298)
102 protein:vir:8187 Length: 311 # 98.2 2.5E-08 1.5E-11 62.3 8.9 264 107-393 1-308 (311)
103 protein:vir:5739 Length: 366 # 97.8 6.9E-07 4.3E-10 54.4 10.2 316 24-393 1-364 (366)
104 protein:vir:80684 Length: 315 97.4 4.3E-06 2.7E-09 50.0 10.4 274 110-393 1-304 (315)
105 protein:vir:96762 Length: 632 97.1 0.00013 8.2E-08 41.9 15.2 357 1-393 219-631 (632)
106 protein:vir:3158 Length: 321 # 97.1 3.3E-05 2E-08 45.2 11.8 281 97-393 1-310 (321)
107 protein:vir:4197 Length: 314 # 95.7 0.0008 4.9E-07 37.6 11.5 288 99-393 1-309 (314)
108 protein:vir:9820 Length: 272 # 95.6 0.00071 4.4E-07 37.8 10.8 247 94-393 1-267 (272)
109 protein:vir:3033 Length: 272 # 95.6 0.00071 4.4E-07 37.8 10.8 247 94-393 1-267 (272)
110 protein:vir:4159 Length: 315 # 94.8 0.00032 2E-07 39.7 6.7 272 92-392 1-315 (315)
111 protein:vir:93742 Length: 274 79.4 0.1 6.5E-05 26.0 10.3 247 94-393 1-268 (274)
112 protein:vir:97031 Length: 402 78.5 0.062 3.8E-05 27.2 7.0 267 94-393 1-295 (402)
113 protein:vir:3613 Length: 272 # 75.5 0.15 9E-05 25.2 13.3 248 94-393 1-270 (272)
114 protein:vir:94933 Length: 330 72.3 0.18 0.00011 24.6 13.3 285 94-393 1-328 (330)
115 protein:vir:96123 Length: 274 60.2 0.38 0.00023 22.9 10.1 247 94-393 1-268 (274)
116 protein:vir:8885 Length: 347 # 47.2 0.71 0.00044 21.4 8.5 286 93-393 1-344 (347)
117 protein:vir:80180 Length: 381 40.3 0.98 0.00061 20.6 9.0 280 93-393 1-329 (381)
118 protein:vir:103323 Length: 364 40.2 0.98 0.00061 20.6 7.7 272 94-393 1-337 (364)
119 protein:vir:108295 Length: 711 32.2 1.4 0.0009 19.7 6.3 92 1-103 612-711 (711)
120 protein:vir:94576 Length: 347 32.1 1.4 0.0009 19.7 8.6 267 93-393 1-313 (347)
121 protein:vir:97255 Length: 310 27.2 1.9 0.0012 19.1 11.1 262 94-393 1-309 (310)
122 protein:vir:80213 Length: 334 24.3 2.2 0.0014 18.7 5.8 266 93-393 1-330 (334)
123 protein:vir:7019 Length: 401 # 22.8 2.4 0.0015 18.5 6.4 270 94-393 1-292 (401)
124 protein:vir:97433 Length: 274 22.0 2.5 0.0016 18.4 10.3 247 94-393 1-268 (274)
125 protein:vir:94494 Length: 274 22.0 2.5 0.0016 18.4 10.3 247 94-393 1-268 (274)
126 protein:vir:94711 Length: 347 21.5 2.6 0.0016 18.3 6.7 275 93-393 1-344 (347)

No 1
>protein:vir:1663 Length: 393 # NCBI annotation: unknown # Family: family:all:2417 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044952;genbank:gi:9629659;genbank:GeneID:1261309
Probab=100.00 E-value=3.5e-182 Score=1015.37 Aligned_cols=393 Identities=99% Similarity=1.287 Sum_probs=392.6

Q ss_pred             CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcchhHHHHHH
Q lcl|Aclame:pro    1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMTNFI 80 (393)
Q Consensus         1
~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~Ek~K~k~emtEfL 80 (393) ||||||+|+|+|||+||++++|++|||++|+|++|+++|||++|||+|+|+++.||+++|||||+++|+||||++|++|| T Consensus 1 mnkpdliekqnrlaelkennvslksqisgfevknaiedl~K~~ELe~TlSe~~iEI~k~en~LN~~eE~~KGK~kMt~~i 80 (393) T protein:vir:16 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMTNFI 80 (393) T ss_pred CCCcchhhhhhhhhhhhhcccchhhhccchhhhhhhhhchhHHHHHHhHhhcchhhhhhhhhhhhhhhcchhhHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEee Q lcl|Aclame:pro 81 ESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSF 160 (393) Q Consensus 81 kTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l 160 (393) +|++|+.+||++||+|+|+++.++||.++|+|+||+++|++++||+|++++|++||++|||||++|||+|+|+|+++++| T Consensus 81 esq~A~~eF~~vL~~N~G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V~~s~ 160 (393) T protein:vir:16 81 ESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSF 160 (393) T ss_pred hhHHHHHHHHHHHhccCCchhhhhhhhhhHhhcCcceeccchhccHHHHHHHHHhhhccCcceeeeeeccchhhhHHhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCC Q lcl|Aclame:pro 161 DSSNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTN 240 (393) Q Consensus 161 ~na~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~ 240 (393) |++++|+||++|++|++|.++|.++||+|+|||++|+|||++++++|+|++||||+|+||+|+||+|||+||+|+|||++ T Consensus 161 ~s~~eAq~HkdGqTK~eqa~~~~~~Tl~~~~VY~~~S~Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N 240 (393) T protein:vir:16 161 DSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTN 240 (393) T ss_pred 
hhhhhhhhhccCCccccceeeeeeechhHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhhheecCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhceeeecCCCceEEEecchhhhHhhhhhccccccceeeecCCCe Q lcl|Aclame:pro 241 GFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTE 320 (393) Q Consensus 241 ~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~ 320 (393) +++++++++|+++|++||++++++|+|||+|+++||+||+||+|||||||++++||+||+||||||++|||+||+++|++ T Consensus 241 ~f~~~DK~advK~I~k~Ttkaksagktpfadaieeavdfvrptagrrylivktedrkalldelrqatananvriknddte 320 (393) T protein:vir:16 241 GFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTE 320 (393) T ss_pred CccchhhHHHHHHHHHHhhhhhhcCCCchhHHHHHHHhhhccCCCceEEEEeccchHHHHHHHHhhhccCceeeeccchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeeecccchhhcccchhceeeeecccceecccceeeecceeEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 321 IASEVGVDEIIVYTGSKAVKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 321 ~~~~v~~~~~~~~tG~k~~~ptv~vD~k~~~~~~~~~~~~s~~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) ++++|||+++++|||+|+++|||+|||+||||||+++++++|+|+|||||||||+||||||++||+|++|+|| T Consensus 321 iasevgvdeiivytgskalkptvlvdqkyhidmqdltkvdafewktnsnmilvetltsghvetynagavitvs 393 (393) T protein:vir:16 321 IASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) T ss_pred hhhhcCcceeeeeeccccccceeeeccccccchhhhhhhhhheeccCCceEEEeecccCcceeeccceeEeeC Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:93966 Length: 400 # NCBI annotation: structural protein # Family: family:all:2417 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764320;genbank:gi:115315634;genbank:GeneID:5176553 Probab=100.00 E-value=6.3e-178 Score=992.08 Aligned_cols=393 Identities=99% 
Similarity=1.284 Sum_probs=392.5 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcchhHHHHHH Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMTNFI 80 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~Ek~K~k~emtEfL 80 (393) |||||++|||++|+++||+++++||+++|++|++|+++|||++|||+|+|+++.||+++|||||+++|+||||++|++|| T Consensus 8 ~~K~~l~EK~~~~a~~~E~~~~LKS~~~G~evknaiedl~K~~EL~~TlS~~~iEI~~~en~LNa~~E~~KGK~kMt~~i 87 (400) T protein:vir:93 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMTNFI 87 (400) T ss_pred cccchHHHHHHHHhhhhhhhhhhhhhhhcchhhhhhhhchhHHHHHHhHhhcchhhhhhhhhhhhhhhhhhhhHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEee Q lcl|Aclame:pro 81 ESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSF 160 (393) Q Consensus 81 kTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l 160 (393) +|++|+.+||++||+|+|+++.++||.++|+|+||+++|++++||+|++++|++||+||||||++|||+|+|+|+++++| T Consensus 88 ~sq~A~~eF~~vL~~N~G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V~~s~ 167 (400) T protein:vir:93 88 ESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSF 167 (400) T ss_pred hhHHHHHHHHHHHhccCCchhhhhhhhhhHhhcCcceeccchhccHHHHHHHHHhhhccCcceeeeeeccchhhhHHhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCC Q lcl|Aclame:pro 161 DSSNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTN 240 (393) Q Consensus 161 ~na~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~ 240 (393) |++++|+||++|++|++|.++|.++||+|+|||++|+|||++++++|+|++||||+|+||+|+||+|||+||+|+|||++ T 
Consensus 168 ~s~~~Aq~HkdGqTK~eqa~~~~~~Tl~~~~VY~~~S~Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N 247 (400) T protein:vir:93 168 DSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTN 247 (400) T ss_pred hhhhhhhhhccCCccccceeeeeeechhHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhhheecCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhceeeecCCCceEEEecchhhhHhhhhhccccccceeeecCCCe Q lcl|Aclame:pro 241 GFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTE 320 (393) Q Consensus 241 ~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~ 320 (393) +++++++++|+++|++||++++++|+|||+|+++||+||+||+|||||||++++||+||+||||||++||.+||+++|.+ T Consensus 248 ~f~~~DK~advK~I~~~Ttkaksagktpfadaieeavdfvrptagrrylivktedrkalldelrqatanahvriknddae 327 (400) T protein:vir:93 248 GFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKTEDRKALLDELRQATANAHVRIKNDDAE 327 (400) T ss_pred CccchhhHHHHHHHHHHhhhhhhcCCCchhHHHHHHHhhhccCCCceEEEEeccchHHHHHHHHhhccccceEeecchhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeeecccchhhcccchhceeeeecccceecccceeeecceeEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 321 IASEVGVDEIIVYTGSKAVKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 321 ~~~~v~~~~~~~~tG~k~~~ptv~vD~k~~~~~~~~~~~~s~~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) ++++|||+++++|||+|+++|||+|||+||||||+++++++|+|+|||||||||+||||||++||+|++|+|| T Consensus 328 iasevgvdeiivytgskalkptvlvdqkyhidmqdltkvdafewktnsnmilvetltsghvetynagavitvs 400 (400) T protein:vir:93 328 IASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred hhhhcCcceeeeeeccccccceeeeccccccchhhhhhhhhheeccCCceEEEeecccCcceeeccceeEeeC Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999 No 3 >protein:vir:93858 Length: 400 # NCBI annotation: putative 
structural protein # Family: family:all:2417 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764266;genbank:gi:115315579;genbank:GeneID:5141552 Probab=100.00 E-value=8.7e-166 Score=925.51 Aligned_cols=393 Identities=99% Similarity=1.286 Sum_probs=382.5 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcchhHHHHHH Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMTNFI 80 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~Ek~K~k~emtEfL 80 (393) |||||++|||++|+++|++++++||++++++|++|+++|||++|||+|+|+|++||||+|||||+++|+|||||+||||| T Consensus 8 ~~k~~~~ek~~~~~~~~e~~~~lks~~~g~~~~~~~~~~~k~~el~kT~Sel~~ei~k~e~eln~~~E~~Kgk~~mtefL 87 (400) T protein:vir:93 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMTNFI 87 (400) T ss_pred cccchHHHHHHHHhhhhhhhhhhhhhhhccchhhhhhhchhHHHHHHHHHHhHHHHHHHhhhhhhhhhhcccchhHHHhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEee Q lcl|Aclame:pro 81 ESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSF 160 (393) Q Consensus 81 kTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l 160 (393) +|++|+++||++||+|||++|+++||+|+|+|+||++||+|+|||+|||+|||+||++++++|++|||+|+|++++..++ T Consensus 88 kT~~A~~~fa~~l~~nsg~sd~knaW~A~l~E~gvt~td~n~iLP~~il~aIq~al~~~~~~~~f~~v~n~p~l~V~~~~ 167 (400) T protein:vir:93 88 ESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSF 167 (400) T ss_pred hhHHHHHHHHHHHHhhcCCcchhhhhhhhhhhcccccCCchhhcchHHHHHHHHhhhccCCcccceeeecCCceeeecch Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCC Q lcl|Aclame:pro 161 
DSSNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTN 240 (393) Q Consensus 161 ~na~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~ 240 (393) +++++|||||+|++|++|+++|++|+|+|+|||+||+|+|++++++|+||+||+|||+|||||||||||+||+++|||++ T Consensus 168 dt~~qa~gHk~G~~K~eq~~tl~~rtL~P~~VYk~~~la~~~~~~~~tygaL~nYVm~EL~q~vI~k~Ve~Aii~GdG~N 247 (400) T protein:vir:93 168 DSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTN 247 (400) T ss_pred hhhcccceeccCCcccceeeeeeeeccCHHHHHHHhhhhhhhhhccccHHHHHHHHHHHHHHHHHHHHhhhheeeccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhceeeecCCCceEEEecchhhhHhhhhhccccccceeeecCCCe Q lcl|Aclame:pro 241 GFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTE 320 (393) Q Consensus 241 ~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~ 320 (393) ++...++++|++.|++++..+...++|++.|.+++++||++|+++|+++|||++|++|++++||++..||+++..++|.. 
T Consensus 248 gf~~~dk~t~Ik~I~~dt~kt~~a~~~~~qdl~E~~~d~~~~~aad~~~Iv~s~d~~A~L~~lk~a~~~a~f~~~n~d~~ 327 (400) T protein:vir:93 248 GFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTE 327 (400) T ss_pred ccCCCcchhhhhhhhhhhhhhhhcCCccHHHHHHHHHhhhhhccCCceeEEeccchHHHHHHhcCCcceeeeeeccccch Confidence 99999999999999999666555555555555555579999999999999999999999999999999999999999999 Q ss_pred EEEeeecccchhhcccchhceeeeecccceecccceeeecceeEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 321 IASEVGVDEIIVYTGSKAVKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 321 ~~~~v~~~~~~~~tG~k~~~ptv~vD~k~~~~~~~~~~~~s~~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) |+..+||+++.+|||.+..+|+|+||++||||||||++++||+|+||||||||||||+||+++||+|||++|| T Consensus 328 IA~~fGv~~Lv~~Tr~~~~kp~V~VDek~~i~~~~~~t~~sf~~~tNs~~ilvetlv~Gsi~~~N~~ay~~v~ 400 (400) T protein:vir:93 328 IASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred hhhhcccceeeeeccCCCCCceeeeehhhhccccCceeccceeeeeccceEEeeeeeccceecccceeeEeeC Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999 No 4 >protein:vir:861 Length: 318 # NCBI annotation: putative minor structural protein # Family: family:all:2417 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047120;genbank:gi:9630573;genbank:GeneID:1261764 Probab=100.00 E-value=1.1e-142 Score=798.91 Aligned_cols=318 Identities=98% Similarity=1.283 Sum_probs=317.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeeccccee Q lcl|Aclame:pro 76 MTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALL 155 (393) Q Consensus 76 mtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a 155 (393) |++||+|++|+.+||++||+|+|++|.++||+++|+|+||+++|++++||+|++++|++||+||||||++|||+|+|+|+ T Consensus 1 mtn~iesq~A~~eF~~vL~~N~G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~ 80 (318) T protein:vir:86 1 
MTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALL 80 (318) T ss_pred CcchhhhhHHHHHHHHHHhccCCchhhhhhhhhhhhhcCceeeccchhccHHHHHHHHHhhhccCcceeeeeeccchhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeeccccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceee Q lcl|Aclame:pro 156 VSRSFDSSNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVE 235 (393) Q Consensus 156 ~~i~l~na~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~ 235 (393) ++++||++-+||||++|++|++|.++|.++||+|+|||++|+|||++++++|+|++||||+|+||+|+||+|||+||+|+ T Consensus 81 V~~s~~s~AeAq~HkdGqTK~eqa~~~~~~Tl~~~~VY~~~S~Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~ 160 (318) T protein:vir:86 81 VSRSFDSSAEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVE 160 (318) T ss_pred hhhhhhhhhhhhhhccCCccccceeeeeeechhHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhhee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhceeeecCCCceEEEecchhhhHhhhhhccccccceeee Q lcl|Aclame:pro 236 GDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIK 315 (393) Q Consensus 236 gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~ 315 (393) |||.|+++++++++|+++|++||++++++|.|||+.+++||+||+||+|||||||++.+||+||+||||||++||.+||+ T Consensus 161 GDG~N~f~~~DK~advK~I~k~Ttkaksagttpfanaieeavdfvrptagrrylivkaedrkalldelrqatanahvrik 240 (318) T protein:vir:86 161 GDGSNGFKSIDKEADVKKIKKITTKAKSAGTTPFANAIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQATANAHVRIK 240 (318) T ss_pred ecCCCCccchhhHHHHHHHHHHhhhhhccCCCchhhHHHHHHhhhccCCCceEEEEeecchHHHHHHHHhhcccceeEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCeEEEeeecccchhhcccchhceeeeecccceecccceeeecceeEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 316 
NDDTEIASEVGVDEIIVYTGSKAVKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 316 ~~~~~~~~~v~~~~~~~~tG~k~~~ptv~vD~k~~~~~~~~~~~~s~~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) ++|++++++|||+++++|||+|+++|||+||||||||||+++++++|+|+|||||||||+|+||||++||+|++|+|| T Consensus 241 nddteiasevgvdeiivytgskalkptvlvdqkyhidmqdltkvdafewktnsnmilvetltsghvetynagavitvs 318 (318) T protein:vir:86 241 NDDTEIASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 318 (318) T ss_pred ccchhhhhhcCcceeeeeeccccccceeeeccceecchhhhhhhhcceeccCCceEEEeecccCcceeecCceeEEeC Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999999 No 5 >protein:vir:94870 Length: 318 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762518;genbank:gi:115304217;genbank:GeneID:5141183 Probab=100.00 E-value=1.6e-129 Score=726.70 Aligned_cols=318 Identities=100% Similarity=1.300 Sum_probs=317.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeeccccee Q lcl|Aclame:pro 76 MTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALL 155 (393) Q Consensus 76 mtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a 155 (393) |++|+++++|+.+|+++|..|+|++++++||+++|+|+||+++|++++||++++++|++|+.++++||.+|||+|.+++. 
T Consensus 1 mtnfiesqnavteffdvlkknsgkseiknawnaklaengvtitdttfqlprklvesintallntnpvfkvfhvtnvgall 80 (318) T protein:vir:94 1 MTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALL 80 (318) T ss_pred CccchhhhhhHHHHHHHHhcccChhhhhhhhhhhhhhCCceeecchhhhHHHHHHhhhhhhccCCcceeeeeehhhhhee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEeeccccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceee Q lcl|Aclame:pro 156 VSRSFDSSNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVE 235 (393) Q Consensus 156 ~~i~l~na~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~ 235 (393) ++++++++|++|.||+|++|++|..+|.++++.|.||||+|+|++++++|+|+|++|||+++.||.|+++++||+.|+|. T Consensus 81 vsrsfdssneaqvhkdgqtkteqaatltidtlepvmvyklqslaervkrlqmsyselynlivaeltqaivnkivdlalve 160 (318) T protein:vir:94 81 VSRSFDSSNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVE 160 (318) T ss_pred eeccccccchhhhhcccccccccceeeeecccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHhhhhheeeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhceeeecCCCceEEEecchhhhHhhhhhccccccceeee Q lcl|Aclame:pro 236 GDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIK 315 (393) Q Consensus 236 gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~ 315 (393) |||+++++++++++|+++||+||++++++|+|||+|+++||+||+||+|||||||++++||+||+||||||++|||+||+ T Consensus 161 gdgtngfksidkeadvkkikkittkaksagktpfadaieeavdfvrptagrrylivktedrkalldelrqatananvrik 240 (318) T protein:vir:94 161 GDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIK 240 (318) T ss_pred cCCcchhhhhchhhhHHHHHHhhhhhhhcCCCchhHHHHHHHhhhccCCCceEEEEeccchHHHHHHHHhhhcccceEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred 
cCCCeEEEeeecccchhhcccchhceeeeecccceecccceeeecceeEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 316 NDDTEIASEVGVDEIIVYTGSKAVKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 316 ~~~~~~~~~v~~~~~~~~tG~k~~~ptv~vD~k~~~~~~~~~~~~s~~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) ++|++++++|||+++++|||+|+|+|+|+||||||||||+++++++|+|+|||||||||+++||||++||+|++|+|| T Consensus 241 nddteiasevgvdeiivytgskavkptvlvdqkyhidmqdltkvdafewktnsnmilvetltsghvetynagavitvs 318 (318) T protein:vir:94 241 NDDTEIASEVGVDEIIVYTGSKAVKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 318 (318) T ss_pred ccchhhhhhcCcceeEEeeccccccceeEeccceecchhhhhhhhceeeccCCceEEEEecccCcceeecCceeEEeC Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999999 No 6 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=100.00 E-value=2.6e-33 Score=199.23 Aligned_cols=360 Identities=21% Similarity=0.253 Sum_probs=265.1 Q ss_pred CCcch-----hhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHH-HHHHHHHhhh------- Q lcl|Aclame:pro 1 MNKPD-----LIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEII-KIENELNAQE------- 67 (393) Q Consensus 1 ~~k~d-----~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~-k~enel~~~~------- 67 (393) +|.-- ++.++.++.++++....+++.++.. +++.++..+++++++++.+.+.+.. +.+++++... 
T Consensus 124 a~~~a~I~~vke~~~~e~~~~~~~~a~~ee~~e~~--~k~~el~a~l~~~~~~~~~~~~e~~~~l~a~~~~~~~~~~~~~ 201 (517) T protein:vir:97 124 SNKNAVVTYFREEKKKEENKMTFDQNLMQELLDAK--KLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILG 201 (517) T ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh--hhHHHHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHhhhhcc Confidence 32211 2334556666666666666655543 3445555677777777666555443 2333333222 Q ss_pred -hhhcchhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccce Q lcl|Aclame:pro 68 -EKPKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVF 146 (393) Q Consensus 68 -Ek~K~k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~f 146 (393) .+....+...+|++++.+...++.. ....+.+..|...+.+.++.+ ...|..++..|.+.+....++...+ T Consensus 202 ~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~----~~~p~~~~~~i~~~~~~~~~i~~~~ 273 (517) T protein:vir:97 202 VEALKVTPEATEFLKTREAEVAYMSA----SLTKDPKAAWTAELKERGISG----MPAPAGILKRIQDAVNDEGSLLPFI 273 (517) T ss_pred cccccccchhhHHHHHHHHHHHHHHh----cccccccceeeeecccccccc----cccchHHHHHHHHhhhhhccceeee Confidence 2333444566888888877766433 233455677888888887744 2468889999999999888898877 Q ss_pred eeecccceeEEEeecc-ccccceecccchhhhhhhhhhhhhccHHHHHHHHHH-HHHHHhhcC-chhHHHHHHHHHHHHH Q lcl|Aclame:pro 147 HVTNVGALLVSRSFDS-SNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSL-AERVKRLQM-SYSELYNLIVAELTQA 223 (393) Q Consensus 147 hV~n~~~~a~~i~l~n-a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~L-ad~~k~l~g-~ygalvnyvm~ELaq~ 223 (393) .+++++.. 
.+.+++ ...+++|..|+.|.++.++|...++.|.++|.+.++ .+++.+.-- ...+|.+|++++|+.+ T Consensus 274 ~~~~i~~~--~~~~~~~~~~a~~~~eG~~kp~s~~tf~~~~~~~~~ia~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~ 351 (517) T protein:vir:97 274 RHENLPTL--VVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDM 351 (517) T ss_pred eeccccce--eeecccccceeeeeecCCcccccccceeeEEeeHhhhhhhhhhhHHHHHHhhhccHHHHHHHHHHHHHHH Confidence 78877654 555554 458899999999999999999999999999999887 333333211 1246899999999999 Q ss_pred HHHHHHhcceeeccCC--CccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhceeeecCCCceEEEecchhhhHhh Q lcl|Aclame:pro 224 IVNKIVDLALVEGDGT--NGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKTEDRKALLD 301 (393) Q Consensus 224 fI~Rav~rAvv~gDG~--~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald~a~~~a~~~~l~i~~~d~~a~~~ 301 (393) |- +..++++++|||+ +.+-+.+..++.+ +..+..|...+++...|..+.+.+.+..+++++.|+++|+- T Consensus 352 l~-~~ee~a~l~GdGtg~~~~gi~~~a~~~~--------~~~~~~~~~~~d~i~~l~~a~~~a~~a~~vmn~~t~~~I~k 422 (517) T protein:vir:97 352 VI-MAVNRAIIMGGVTGVSETQIYPVVGDAW--------ATNVTGTTNIQELLEKLSVATPKAADSTLVIHRNDLAAIRF 422 (517) T ss_pred HH-HHHHHHHhcccCCCcccccccccccccc--------cccccccchHHHHHHHHHHHhhhccCCEEEECHHHHHHHHH Confidence 99 7999999999886 4555666655544 23344466666777788767777788899999999999998 Q ss_pred hhhccccccceeeecCCCeEEEeeecccchhhc--ccchhceeeeecccceecccceeeecc--------eeEeecCceE Q lcl|Aclame:pro 302 ELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKAVKPTVLVDQKYHIDMQDLTKVDA--------FEWKTNSNMI 371 (393) Q Consensus 302 ~~~~~~~~a~~~l~~~~~~~~~~v~~~~~~~~t--G~k~~~ptv~vD~k~~~~~~~~~~~~s--------~~~~~ns~~i 371 (393) |||.+|+|-++.++++-..+| |..-+.|.+.+|+...++++||..++. 
|.+++|+.++ T Consensus 423 ------------lKD~~G~Yl~~~~~~~~~~~~l~G~~~~~~~~~~~~~~~~~~~~y~i~~~~g~~~~~~fd~~~n~~~f 490 (517) T protein:vir:97 423 ------------LKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDEKTAVSLSGYVTNGSRGMEFEQGTILVENNKEY 490 (517) T ss_pred ------------hhcCCCCeeccCcCCcccccccCCccccccccccCceeEeeccccEEEeecceeeeeeeecccCceeE Confidence 999999999999888887777 766678999999998888899877665 5557899999 Q ss_pred EEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 372 LVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 372 ~~~~~~~g~~~~~n~~~~~~v~ 393 (393) +++++++|+|..+++.++.++. T Consensus 491 ~~~~~~~g~i~~~~r~a~~~~~ 512 (517) T protein:vir:97 491 LFEMPISGSLEYKGTTAYGTYT 512 (517) T ss_pred eeeeeeccccccccceEEEEEc Confidence 9999999999999999999888 No 7 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=99.85 E-value=1.2e-23 Score=146.27 Aligned_cols=345 Identities=17% Similarity=0.214 Sum_probs=178.8 Q ss_pred CCcchhhH---------HHHHHHHHHHhhHHHHhhhhhhhhhh----HHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|Aclame:pro 1 MNKPDLIE---------KQNRLAELKENNVSLKSQISGFEVKN----AIEDLPKVQELEKTLSENSIEIIKIENELNAQE 67 (393) Q Consensus 1 ~~k~d~~e---------kq~eLa~lK~~~~~~~s~i~~~~v~~----a~~~~skieelektis~l~aEi~k~enel~~~~ 67 (393) ..+.||-| .+++...+|+.....+.+....+.++ +.....+..+++..+.+++.+++..+....... 
T Consensus 96 ~~~~~l~EvS~v~~pa~~~a~v~~vks~~~~~e~~~~~~e~~e~~~e~~e~~~~~~el~akl~el~k~~ee~k~~~~~~~ 175 (480) T protein:vir:40 96 YKDVTITEVSLTPLPSNKGAKVTKVREENKGEQEQMGANETQEIMKQAIEAGVKVRELEAKVEELNKEREELKKEREASI 175 (480) T ss_pred EEEEEEEEeEEeecccchhhhhhhhhhhhhhhhhhhhhHHHHHHHHhhhhhhhhhhhHHHHHHHHHhHHHHHhhhhhhhc Confidence 11111111 13444444443322222211111111 111122333333333333333332221111111 Q ss_pred hhhcchhHHH--HHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccch-hhhHhhcchhHHHHHHHHHHhhCcccc Q lcl|Aclame:pro 68 EKPKGKDKMT--NFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTI-TDTTFQLPRKLVESINTALLNTNPVFK 144 (393) Q Consensus 68 Ek~K~k~emt--EfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~-qd~~eiLP~~ii~AIe~A~ed~d~vl~ 144 (393) .. ..++.. ..+++ ...|.+-+.+ ......+.+....+ .+-.-.+|+.+...+--......+.+. T Consensus 176 ~~--~~~~~~~~~e~r~---~~~~~~~~~e--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 242 (480) T protein:vir:40 176 PS--EKPEDAERKFMRE---LGSKMAEMPE--------QGFLREFANGADLNVVNSLGSITSKYARKSGIYDGAMKARFQ 242 (480) T ss_pred cc--cchhhhhhHHHHH---HHHHhccchh--------hhhhhhhhhhccccccccccccccchhhheeechhhhhhhhh Confidence 10 011110 11111 1111111100 01111111211100 000012232211111000111111111 Q ss_pred ceeeecccceeEEEeecc-ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHH Q lcl|Aclame:pro 145 VFHVTNVGALLVSRSFDS-SNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQA 223 (393) Q Consensus 145 ~fhV~n~~~~a~~i~l~n-a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~ 223 (393) ...+... +..+ -.-+.+|..+++++.+. 
+..+.+.|..+|+++.+++.-..+-....+|.+|+.+||+.+ T Consensus 243 ~~~~~~~-------g~~~~~~~~e~~~~~~~~~~~~--~~~~~~~~~~v~~l~~~~k~t~~lLDDa~~l~~~i~~~l~~~ 313 (480) T protein:vir:40 243 GLTLAED-------GVDDTFISGTFKAGTDKNKSQT--ATKRSLRPQMAEAYLQMDKATVRGVNDSGALSEYVMSEMVNR 313 (480) T ss_pred cceeeec-------cccceeeeeeeecccccccccc--cccchhhHHHHHHHHHhHHHHHHHhhhhHHHHHHHHHHHHHH Confidence 1111111 1112 23567888888877664 567888999999999996655443332337999999999999 Q ss_pred HHHHHHhcceeec--cCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhh--ceeeecCCCc-eEEEecchhhh Q lcl|Aclame:pro 224 IVNKIVDLALVEG--DGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAV--DFVRPTAGRR-YLIVKTEDRKA 298 (393) Q Consensus 224 fI~Rav~rAvv~g--DG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eal--d~a~~~a~~~-~l~i~~~d~~a 298 (393) |- ++.+.|+++| ||.+.+..++..++.+ +.+.+++++...| ....+-..+. .+++|+.++++ T Consensus 314 ~~-~~ee~a~l~G~g~g~~~~~g~~~~~~~~------------~~~~~~~d~id~L~~al~~~y~~~a~~~vmn~~t~~~ 380 (480) T protein:vir:40 314 VI-QKVEYNMILGSVDGSNGFYGLKTATDGW------------TKQIEYTDLFEGITDAVAECSISDAITIVMSPQTFAE 380 (480) T ss_pred HH-HHHHHHhhccCCCCccccccceeecccc------------cccchhHHHHHHHHHhhhHHhhCCCCEEEECHHHHHH Confidence 99 7999999999 7777778887766533 1122344555444 2333333334 68899999999 Q ss_pred HhhhhhccccccceeeecCCCeEEEeeecccchhhc--ccchh---------ceeeeeccccee--cccceeeecceeEe Q lcl|Aclame:pro 299 LLDELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKAV---------KPTVLVDQKYHI--DMQDLTKVDAFEWK 365 (393) Q Consensus 299 ~~~~~~~~~~~a~~~l~~~~~~~~~~v~~~~~~~~t--G~k~~---------~ptv~vD~k~~~--~~~~~~~~~s~~~~ 365 (393) |+- |||.+|+|-+.-++..-...| |.-.| .|++..+.+|++ |+ +.+..+.|.|+ T Consensus 381 I~k------------lKD~~G~Yi~q~~~~~~~~~~llG~pvv~~~~~~~~~~~~~~~~~~~~~~~d~-~~~~~~~~~~~ 447 (480) T protein:vir:40 381 LRK------------AKGTDGHSRFNELATKEQIAQSFGAVNLETRVWMPKDEVAVYNHDEYVLIGDL-NVENYNDFDLR 447 (480) T ss_pred HHH------------hhcCCCCeeccCcccccCcceecccceeeeeccccCCcceeeeCCccEEEEec-ccceecccccc Confidence 998 999999998755443333222 44322 267888887766 65 57889999999 Q ss_pred ecCceEEEeeeecccccccCcceeeeeC Q 
lcl|Aclame:pro 366 TNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 366 ~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) +|+..++.|++++|+|..||+++|++.- T Consensus 448 ~~~~~~~~e~~v~g~~~~~~~~~~~~~~ 475 (480) T protein:vir:40 448 YNVEQWLSETLVGGSIRGKNRSAYLKKK 475 (480) T ss_pred cchhhhhhhhhhceeeEccccEEEEEec Confidence 9999999999999999999999998876 No 8 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=99.74 E-value=1.1e-18 Score=119.11 Aligned_cols=350 Identities=13% Similarity=0.067 Sum_probs=204.3 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhh----HHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhh-hhcch-- Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKN----AIEDLPKVQELEKTLSENSIEIIKIENELNAQEE-KPKGK-- 73 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~----a~~~~skieelektis~l~aEi~k~enel~~~~E-k~K~k-- 73 (393) |.|-.++|.+..|+++++.+...+..+...-.++ .....++++.+++.+.+++.+++..+...+...+ ....+ T Consensus 1 M~~~~l~el~~~l~e~~~~i~~~~~e~~~~~~~~~~~~~~~l~~eie~l~~ei~~l~~~~~~~e~~~e~~~~~~~~~~~~ 80 (394) T protein:vir:97 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEV 80 (394) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Confidence 9999999999999999988877666554433222 2222344444555555444444433333222221 11111 Q ss_pred --------hHHHHHHHHHHHHHHHHHHHHHccCChh----HHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCc Q lcl|Aclame:pro 74 --------DKMTNFIESQNAVTEFFDVLKKNSGKSE----IKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNP 141 (393) Q Consensus 74 --------~emtEfLkTkqA~~dya~ll~~nqg~ke----~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~ 141 (393) ..+.+|++.+.....-. 
.......+ ............|++..+.-..+|..+...|-+.+.++.+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~liP~~~~~~ii~~~~~~~~ 157 (394) T protein:vir:97 81 TQEEKTYRESVNDFIRSKGKIVNDS---LRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVD 157 (394) T ss_pred chhhHHHHHHHHHHHHHHHHHhhhh---hhhhhHHHHHHHHHhhhhhhhhccccccccccccChHHHHHHHHHHhhhhhh Confidence 12233444333222111 11111111 1111222223336665555557999999999999999888 Q ss_pred cccceeeecccceeEEEee-cc-ccccceecccchhhh-hhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHH Q lcl|Aclame:pro 142 VFKVFHVTNVGALLVSRSF-DS-SNEAQVHKDGQTKTE-QAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVA 218 (393) Q Consensus 142 vl~~fhV~n~~~~a~~i~l-~n-a~~a~GHk~ga~Kk~-q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ 218 (393) +.+..++.+.+.....+.. +. ...+..+.-|.++++ ...+|...++.|..++.+..+.+-+.+-.+ -++.+|+++ T Consensus 158 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~i~is~ell~ds~--~~~~~~i~~ 235 (394) T protein:vir:97 158 LKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDAD--VDLVGIVSE 235 (394) T ss_pred hhhhceeeeccCcceEEEEEecCCCccceecccccccccccccceeEEeehhheeeehhhHHHHHhhhh--HHHHHHHHH Confidence 8887777666654433332 22 234445666677765 568999999999999988877443333222 368999999 Q ss_pred HHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhceeeecCCCceEEEecchhhh Q lcl|Aclame:pro 219 ELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKTEDRKA 298 (393) Q Consensus 219 ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald~a~~~a~~~~l~i~~~d~~a 298 (393) +|++.+. ++.+.+++.|+|.... 
+.+.+.+++..++....+.+.+..+++|+.+..+ T Consensus 236 ~la~~~~-~~~~~~i~~g~~~~~~----------------------~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~ 292 (394) T protein:vir:97 236 SISQIKV-NTTNDAIAKVLKSFTT----------------------KTVKNLDEIKALLNGGFDPAYNVSLIVSQSFYQT 292 (394) T ss_pred HHHHHHH-HHHHHHHhhccccccc----------------------cccccHHHHHHHHHhhhhhhhCCEEEEcHHHHHH Confidence 9999999 7999999998876321 1234566777777544445566778999999988 Q ss_pred HhhhhhccccccceeeecCCCeEEEeeeccc----chhhcccchhc---------eeeeeccc-cee--cccceeeecce Q lcl|Aclame:pro 299 LLDELRQATANANVRIKNDDTEIASEVGVDE----IIVYTGSKAVK---------PTVLVDQK-YHI--DMQDLTKVDAF 362 (393) Q Consensus 299 ~~~~~~~~~~~a~~~l~~~~~~~~~~v~~~~----~~~~tG~k~~~---------ptv~vD~k-~~~--~~~~~~~~~s~ 362 (393) |+. |++.+|.+.+...+.+ ..+ |...++ +.++-|-. +|. +.+|++--.+. T Consensus 293 l~~------------lkd~~G~~i~~~~~~~~~~~~l~--G~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~ 358 (394) T protein:vir:97 293 LDT------------LKDGNGRYLLQDDITAVSGKVLL--GKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWAD 358 (394) T ss_pred HHH------------hhccCCCeeeecCcCCCCCceec--cceeEEecccccCCccEEEeeccccEEEEEecceEEEEec Confidence 877 7788888765332211 112 322111 22333321 111 23333322222 Q ss_pred eEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 363 EWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 363 ~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) ...+ ...+.+..+..|.+.-|.+-+.++++ T Consensus 359 ~~~~-~~~~~~~~r~d~~v~~~~a~~~~~~~ 388 (394) T protein:vir:97 359 NEIY-GQYLQAVLRFGVSKVDDKAGYYVTFT 388 (394) T ss_pred cccc-ceeEEEEEEEccEEecccceEEEEec Confidence 2222 33567788889999999998888888 No 9 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=99.72 E-value=6.8e-19 Score=120.17 Aligned_cols=364 Identities=13% Similarity=0.090 Sum_probs=210.2 Q ss_pred CCcchhhH-HHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhh-----hhcchh Q lcl|Aclame:pro 1 
MNKPDLIE-KQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE-----KPKGKD 74 (393) Q Consensus 1 ~~k~d~~e-kq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~E-----k~K~k~ 74 (393) |.+ +.+ .+.++.++++.+.++-++...-.-..+.. ..++++++..+.+++++|+..+..+..... ....+. T Consensus 1 m~~--l~~~l~~~~~~~~~~~~~~~e~~~~~~~~~~e~-~~~~~~l~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~ 77 (390) T protein:vir:81 1 MTD--ITSKLEATLANVTDSLRAFGERAVRDGELNASA-RSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVS 77 (390) T ss_pred ChH--HHHHHHHHHHHHHHHHHHHHHHHHhhcCcCHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Confidence 433 333 35566666666655444332221111222 367888888888888888765544444331 111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHccCC--hhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeeccc Q lcl|Aclame:pro 75 KMTNFIESQNAVTEFFDVLKKNSGK--SEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVG 152 (393) Q Consensus 75 emtEfLkTkqA~~dya~ll~~nqg~--ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~ 152 (393) +.+......+...|+.......+. -+++..+...... 
...+-..++|..++..|-..++++.++++...+...+ T Consensus 78 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~ 153 (390) T protein:vir:81 78 -VGDMFVASEQFQASAGRWNDRSARATMNIKAALNTASTD---AAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTD 153 (390) T ss_pred -chhhhhhhHHHHHHHHHHhhhhhhhhhHHHHHHHhhccc---cccCCcceechhhhHHHHHHHhhhhhhhhhcceeecc Confidence 222233334444555444443332 2333333322110 0111112688888888888899888888765655444 Q ss_pred ceeEEEeecc--ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 153 ALLVSRSFDS--SNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVD 230 (393) Q Consensus 153 ~~a~~i~l~n--a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~ 230 (393) .....+-..+ ...+..+--|.++.+...+|...++.|..++....+.+-+-+ .+ .++.+|++++|+.++- |+++ T Consensus 154 ~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~--d~-~~~~~~i~~~l~~~~~-~~~d 229 (390) T protein:vir:81 154 SALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILS--DA-PQLASYMNNRLIRGLK-VKED 229 (390) T ss_pred CCceEEEEEecCCcceeeecCCcccccccceeeEEEEeeeEEEEeehhhHHHHH--hH-HHHHHHHHHHHHHHHH-HHHH Confidence 4433333333 345555677889999999999999999998888777444333 22 4689999999999999 7999 Q ss_pred cceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhhhhccccc Q lcl|Aclame:pro 231 LALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQATAN 309 (393) Q Consensus 231 rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~ 309 (393) +++++|||++.. ..-+... .-....+...+.+...|.+..++ .+.........+++|+.++.+|+. 
T Consensus 230 ~a~l~G~g~~~~-~~Gi~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~-------- 296 (390) T protein:vir:81 230 AEILRGTGANDG-LLGLIPQ----ATTYAAPTTIAGATRVDQLRLAMLQASLAEYNPSGIVINPIDWAAIEL-------- 296 (390) T ss_pred HHHHhcCCCCCc-ccceeec----ccccccccccccchhHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHH-------- Confidence 999999998531 0001111 00111223344456678888777 333333344478999999998887 Q ss_pred cceeeecCCCeEEEeeecccc--hhhcccchhc-------eeeeecccc--ee-cccceeeecce---eEeecCceEEEe Q lcl|Aclame:pro 310 ANVRIKNDDTEIASEVGVDEI--IVYTGSKAVK-------PTVLVDQKY--HI-DMQDLTKVDAF---EWKTNSNMILVE 374 (393) Q Consensus 310 a~~~l~~~~~~~~~~v~~~~~--~~~tG~k~~~-------ptv~vD~k~--~~-~~~~~~~~~s~---~~~~ns~~i~~~ 374 (393) |++.+|.+-+.-..+.. ++ -|-..+. +.++.|-+. .+ +-+|++-.-+. .|.+|+-.+.+. T Consensus 297 ----lkd~~G~~l~~~~~~~~~~~l-~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~v~~r~~ 371 (390) T protein:vir:81 297 ----AKDANNQYLIGNARGTLTPTL-WGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVGEDFQRNMITVLAE 371 (390) T ss_pred ----hhcCCCceeecCcccccCcee-cceeeEEcCCCCCCcEEEEehhceEEEEEecceEEEEecccchhhcCcEEEEEE Confidence 77777776553322111 11 1333222 223444432 22 33444432222 234455567788 Q ss_pred eeecccccccCcceeeeeC Q lcl|Aclame:pro 375 TLTSGHVETYNAGAVITVS 393 (393) Q Consensus 375 ~~~~g~~~~~n~~~~~~v~ 393 (393) .+..|.+.-|++-+++++| T Consensus 372 ~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 372 ERLALVVYRPEALISGSFA 390 (390) T ss_pred EeeccEEecccceEEEEeC Confidence 9999999999999999999 No 10 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=99.71 E-value=5.9e-18 Score=115.04 Aligned_cols=366 Identities=13% Similarity=0.076 Sum_probs=208.5 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhh--------hhcc Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE--------KPKG 72 (393) Q Consensus 1 
~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~E--------k~K~ 72 (393) |+.. +|.+.+|.++++.....++.....-. .+++.+.+++++.+++++++|.+.++.++.+++ .+.. T Consensus 1 mk~~--~el~~~l~el~~~~~~~~~e~~~~l~---~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 75 (415) T protein:vir:79 1 MKTK--EELQSEISDIKRQIDLKVKYATRALN---NDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSV 75 (415) T ss_pred CchH--HHHHHHHHHHHHHHHHHHHHHHHHhc---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccc Confidence 7764 45556677766655544443333222 222345666777777777777655555444331 1111 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhC------ccchhhhHhhcchhHHHHHHHHHHhhCccccce Q lcl|Aclame:pro 73 KDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAEN------GVTITDTTFQLPRKLVESINTALLNTNPVFKVF 146 (393) Q Consensus 73 k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ek------gV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~f 146 (393) ........+...........+...+......++|...+... ++.+.+-..++|..+...|-+.++++.++++.. T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~ 155 (415) T protein:vir:79 76 EVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYV 155 (415) T ss_pred ccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhhe Confidence 11111111222222222233344444556677777766554 222223333799999999999999999998877 Q ss_pred eeecccceeEE--Eeeccc-cccceecccchhhhh-hhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHH Q lcl|Aclame:pro 147 HVTNVGALLVS--RSFDSS-NEAQVHKDGQTKTEQ-AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQ 222 (393) Q Consensus 147 hV~n~~~~a~~--i~l~na-~~a~GHk~ga~Kk~q-~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq 222 (393) .|.+.+..... +.-.+. ..+..+.-|.+.++. ..+|...++.|..++.+..+.+-+.+-.. -++.+|++++|+. 
T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~ 233 (415) T protein:vir:79 156 TVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMAR 233 (415) T ss_pred eeeeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhch--HHHHHHHHHHHHH Confidence 77666544323 322332 333344556666654 46899999999999988777444333222 3579999999999 Q ss_pred HHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhc-eeeecCCCceEEEecchhhhHhh Q lcl|Aclame:pro 223 AIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVD-FVRPTAGRRYLIVKTEDRKALLD 301 (393) Q Consensus 223 ~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald-~a~~~a~~~~l~i~~~d~~a~~~ 301 (393) .+. ++++.+++.|+|...... ...+... ....+..+.+...|++..++. +..+-...-.+++|+.++.+|+. T Consensus 234 ~~~-~~~~~~il~g~g~g~~~~-~~~~~~~-----~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~ 306 (415) T protein:vir:79 234 TIA-ATRNKAIIDVITKGSTGS-TSSGFEK-----EGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDK 306 (415) T ss_pred HHH-HHHHHHHhhccccCcccc-ccccccc-----cccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHH Confidence 998 799999999998733111 1111111 112334445667788888883 33333445578899999988876 Q ss_pred hhhccccccceeeecCCCeEEEeeecccchhhc--ccchh------------ceeeeeccc-ce-e-cccceeeecceeE Q lcl|Aclame:pro 302 ELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKAV------------KPTVLVDQK-YH-I-DMQDLTKVDAFEW 364 (393) Q Consensus 302 ~~~~~~~~a~~~l~~~~~~~~~~v~~~~~~~~t--G~k~~------------~ptv~vD~k-~~-~-~~~~~~~~~s~~~ 364 (393) |++.+|.+.+.-.+.+-.-.| |...+ .|.++-|-+ +| + +.+|++-.-+. . 
T Consensus 307 ------------lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~-~ 373 (415) T protein:vir:79 307 ------------MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-Y 373 (415) T ss_pred ------------hhccCCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEec-c Confidence 677777765532221111101 22211 133444533 22 2 33444322111 1 Q ss_pred eecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 365 KTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 365 ~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) ..++..+.+..+..|.+.-|++-++++++ T Consensus 374 ~~~~~~~~~~~r~d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:79 374 MHFGECLMIAVRQDCRILDYKSAIVIEYD 402 (415) T ss_pred ccCceEEEEEEEeccEEeccccEEEEEEe Confidence 23344566778899999999999999888 No 11 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=99.71 E-value=5.9e-18 Score=115.04 Aligned_cols=366 Identities=13% Similarity=0.076 Sum_probs=208.5 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhh--------hhcc Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE--------KPKG 72 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~E--------k~K~ 72 (393) |+.. +|.+.+|.++++.....++.....-. .+++.+.+++++.+++++++|.+.++.++.+++ .+.. 
T Consensus 1 mk~~--~el~~~l~el~~~~~~~~~e~~~~l~---~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 75 (415) T protein:vir:81 1 MKTK--EELQSEISDIKRQIDLKVKYATRALN---NDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSV 75 (415) T ss_pred CchH--HHHHHHHHHHHHHHHHHHHHHHHHhc---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccc Confidence 7764 45556677766655544443333222 222345666777777777777655555444331 1111 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhC------ccchhhhHhhcchhHHHHHHHHHHhhCccccce Q lcl|Aclame:pro 73 KDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAEN------GVTITDTTFQLPRKLVESINTALLNTNPVFKVF 146 (393) Q Consensus 73 k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ek------gV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~f 146 (393) ........+...........+...+......++|...+... ++.+.+-..++|..+...|-+.++++.++++.. T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~ 155 (415) T protein:vir:81 76 EVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYV 155 (415) T ss_pred ccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhhe Confidence 11111111222222222233344444556677777766554 222223333799999999999999999998877 Q ss_pred eeecccceeEE--Eeeccc-cccceecccchhhhh-hhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHH Q lcl|Aclame:pro 147 HVTNVGALLVS--RSFDSS-NEAQVHKDGQTKTEQ-AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQ 222 (393) Q Consensus 147 hV~n~~~~a~~--i~l~na-~~a~GHk~ga~Kk~q-~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq 222 (393) .|.+.+..... +.-.+. ..+..+.-|.+.++. ..+|...++.|..++.+..+.+-+.+-.. -++.+|++++|+. 
T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~ 233 (415) T protein:vir:81 156 TVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMAR 233 (415) T ss_pred eeeeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhch--HHHHHHHHHHHHH Confidence 77666544323 322332 333344556666654 46899999999999988777444333222 3579999999999 Q ss_pred HHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhc-eeeecCCCceEEEecchhhhHhh Q lcl|Aclame:pro 223 AIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVD-FVRPTAGRRYLIVKTEDRKALLD 301 (393) Q Consensus 223 ~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald-~a~~~a~~~~l~i~~~d~~a~~~ 301 (393) .+. ++++.+++.|+|...... ...+... ....+..+.+...|++..++. +..+-...-.+++|+.++.+|+. T Consensus 234 ~~~-~~~~~~il~g~g~g~~~~-~~~~~~~-----~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~ 306 (415) T protein:vir:81 234 TIA-ATRNKAIIDVITKGSTGS-TSSGFEK-----EGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDK 306 (415) T ss_pred HHH-HHHHHHHhhccccCcccc-ccccccc-----cccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHH Confidence 998 799999999998733111 1111111 112334445667788888883 33333445578899999988876 Q ss_pred hhhccccccceeeecCCCeEEEeeecccchhhc--ccchh------------ceeeeeccc-ce-e-cccceeeecceeE Q lcl|Aclame:pro 302 ELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKAV------------KPTVLVDQK-YH-I-DMQDLTKVDAFEW 364 (393) Q Consensus 302 ~~~~~~~~a~~~l~~~~~~~~~~v~~~~~~~~t--G~k~~------------~ptv~vD~k-~~-~-~~~~~~~~~s~~~ 364 (393) |++.+|.+.+.-.+.+-.-.| |...+ .|.++-|-+ +| + +.+|++-.-+. . 
T Consensus 307 ------------lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~-~ 373 (415) T protein:vir:81 307 ------------MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-Y 373 (415) T ss_pred ------------hhccCCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEec-c Confidence 677777765532221111101 22211 133444533 22 2 33444322111 1 Q ss_pred eecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 365 KTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 365 ~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) ..++..+.+..+..|.+.-|++-++++++ T Consensus 374 ~~~~~~~~~~~r~d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:81 374 MHFGECLMIAVRQDCRILDYKSAIVIEYD 402 (415) T ss_pred ccCceEEEEEEEeccEEeccccEEEEEEe Confidence 23344566778899999999999999888 No 12 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=99.71 E-value=5.9e-18 Score=115.04 Aligned_cols=366 Identities=13% Similarity=0.076 Sum_probs=208.5 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhh--------hhcc Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE--------KPKG 72 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~E--------k~K~ 72 (393) |+.. +|.+.+|.++++.....++.....-. .+++.+.+++++.+++++++|.+.++.++.+++ .+.. 
T Consensus 1 mk~~--~el~~~l~el~~~~~~~~~e~~~~l~---~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 75 (415) T protein:vir:98 1 MKTK--EELQSEISDIKRQIDLKVKYATRALN---NDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSV 75 (415) T ss_pred CchH--HHHHHHHHHHHHHHHHHHHHHHHHhc---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccc Confidence 7764 45556677766655544443333222 222345666777777777777655555444331 1111 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhC------ccchhhhHhhcchhHHHHHHHHHHhhCccccce Q lcl|Aclame:pro 73 KDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAEN------GVTITDTTFQLPRKLVESINTALLNTNPVFKVF 146 (393) Q Consensus 73 k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ek------gV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~f 146 (393) ........+...........+...+......++|...+... ++.+.+-..++|..+...|-+.++++.++++.. T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~ 155 (415) T protein:vir:98 76 EVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYV 155 (415) T ss_pred ccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhhe Confidence 11111111222222222233344444556677777766554 222223333799999999999999999998877 Q ss_pred eeecccceeEE--Eeeccc-cccceecccchhhhh-hhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHH Q lcl|Aclame:pro 147 HVTNVGALLVS--RSFDSS-NEAQVHKDGQTKTEQ-AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQ 222 (393) Q Consensus 147 hV~n~~~~a~~--i~l~na-~~a~GHk~ga~Kk~q-~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq 222 (393) .|.+.+..... +.-.+. ..+..+.-|.+.++. ..+|...++.|..++.+..+.+-+.+-.. -++.+|++++|+. 
T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~ 233 (415) T protein:vir:98 156 TVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMAR 233 (415) T ss_pred eeeeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhch--HHHHHHHHHHHHH Confidence 77666544323 322332 333344556666654 46899999999999988777444333222 3579999999999 Q ss_pred HHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhc-eeeecCCCceEEEecchhhhHhh Q lcl|Aclame:pro 223 AIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVD-FVRPTAGRRYLIVKTEDRKALLD 301 (393) Q Consensus 223 ~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald-~a~~~a~~~~l~i~~~d~~a~~~ 301 (393) .+. ++++.+++.|+|...... ...+... ....+..+.+...|++..++. +..+-...-.+++|+.++.+|+. T Consensus 234 ~~~-~~~~~~il~g~g~g~~~~-~~~~~~~-----~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~ 306 (415) T protein:vir:98 234 TIA-ATRNKAIIDVITKGSTGS-TSSGFEK-----EGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDK 306 (415) T ss_pred HHH-HHHHHHHhhccccCcccc-ccccccc-----cccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHH Confidence 998 799999999998733111 1111111 112334445667788888883 33333445578899999988876 Q ss_pred hhhccccccceeeecCCCeEEEeeecccchhhc--ccchh------------ceeeeeccc-ce-e-cccceeeecceeE Q lcl|Aclame:pro 302 ELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKAV------------KPTVLVDQK-YH-I-DMQDLTKVDAFEW 364 (393) Q Consensus 302 ~~~~~~~~a~~~l~~~~~~~~~~v~~~~~~~~t--G~k~~------------~ptv~vD~k-~~-~-~~~~~~~~~s~~~ 364 (393) |++.+|.+.+.-.+.+-.-.| |...+ .|.++-|-+ +| + +.+|++-.-+. . 
T Consensus 307 ------------lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~-~ 373 (415) T protein:vir:98 307 ------------MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-Y 373 (415) T ss_pred ------------hhccCCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEec-c Confidence 677777765532221111101 22211 133444533 22 2 33444322111 1 Q ss_pred eecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 365 KTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 365 ~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) ..++..+.+..+..|.+.-|++-++++++ T Consensus 374 ~~~~~~~~~~~r~d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:98 374 MHFGECLMIAVRQDCRILDYKSAIVIEYD 402 (415) T ss_pred ccCceEEEEEEEeccEEeccccEEEEEEe Confidence 23344566778899999999999999888 No 13 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=99.71 E-value=2e-18 Score=117.58 Aligned_cols=359 Identities=17% Similarity=0.115 Sum_probs=206.4 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcchhHH--HH Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKM--TN 78 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~Ek~K~k~em--tE 78 (393) |+| +.|...+++++.+.+.++....++ ++++.. ..+++++..+.++.+++++.+..+...+.+....... .. 
T Consensus 1 M~~--l~el~~~~~~~~~e~~~l~~~~~~-e~~~~~---~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (385) T protein:vir:19 1 MSE--LALIQKAIEESQQKMTQLFDAQKA-EIESTG---QVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEK 74 (385) T ss_pred ChH--HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchh Confidence 987 667777788888777766544332 222222 3455666666666666665444444333221111111 01 Q ss_pred HHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEE Q lcl|Aclame:pro 79 FIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSR 158 (393) Q Consensus 79 fLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i 158 (393) -...+.+..++.+.+....+........ ..+ .....+....+|..+...|-..+.++.++++.+.+.+.+.....+ T Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~---~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 150 (385) T protein:vir:19 75 KSFSERAAEELIKSWDGKQGTFGAKTFN-KSL---GSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEY 150 (385) T ss_pred hhhHHHHHHHHHHHHHHhhccchhhHHH-hhh---ccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEE Confidence 1122334444444443333321111111 011 111111122578778888889999999999876665554443333 Q ss_pred eecc--ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeec Q lcl|Aclame:pro 159 SFDS--SNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG 236 (393) Q Consensus 159 ~l~n--a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~g 236 (393) -..+ ...++.+.-|.++.+...+|...++.|..++....+.+-+.+ ++ ..+.+|++++|+.++. 
++++.++..| T Consensus 151 ~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~-d~--~~l~~~i~~~la~a~~-~~~d~~~l~G 226 (385) T protein:vir:19 151 VREEVFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQVMD-DA--PMLQSYINNRLMYGLA-LKEEGQLLNG 226 (385) T ss_pred EEEecCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHHHHh-hH--HHHHHHHHHHHHHHHH-HHHHHHHHhc Confidence 3222 345555667889999999999999999999988777554433 22 5699999999999988 7999999999 Q ss_pred cCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhhhhccccccceeee Q lcl|Aclame:pro 237 DGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIK 315 (393) Q Consensus 237 DG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~ 315 (393) ||++.. ..-+ ....-.++.+...++....|.|..++ .+-........+++|+.++.+|+. ++ T Consensus 227 ~g~~~~--~~Gi---~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~------------lk 289 (385) T protein:vir:19 227 DGTGDN--LEGL---NKVATAYDTSLNATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIAL------------LK 289 (385) T ss_pred cCCCCc--cccc---ccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHH------------hh Confidence 998531 0000 00000011112223445668888877 444455566788999999999887 77 Q ss_pred cCCCeEEEeee---cccchhhcccchhce--------eeeeccc--cee-cccceeeecc----eeEeecCceEEEeeee Q lcl|Aclame:pro 316 NDDTEIASEVG---VDEIIVYTGSKAVKP--------TVLVDQK--YHI-DMQDLTKVDA----FEWKTNSNMILVETLT 377 (393) Q Consensus 316 ~~~~~~~~~v~---~~~~~~~tG~k~~~p--------tv~vD~k--~~~-~~~~~~~~~s----~~~~~ns~~i~~~~~~ 377 (393) +.+|.+.++-. .+...+ |- .|+. .++.|-+ |.+ +-+|++-.-+ --|..|.-.|.++.+. 
T Consensus 290 d~~G~~l~~~~~~~~~~~l~--G~-pV~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~ 366 (385) T protein:vir:19 290 DNEGRYIFGGPQAFTSNIMW--GL-PVVPTKAQAAGTFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERL 366 (385) T ss_pred cCCCceeccCcccCCCceec--ce-eeEEcCcCCCCcEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEee Confidence 88888766431 112222 42 2222 2333432 222 3333321111 1133455566777899 Q ss_pred cccccccCcceeeeeC Q lcl|Aclame:pro 378 SGHVETYNAGAVITVS 393 (393) Q Consensus 378 ~g~~~~~n~~~~~~v~ 393 (393) .|.+.-|++-++++++ T Consensus 367 ~~~v~~~~a~~~~~~~ 382 (385) T protein:vir:19 367 ALAHYRPTAIIKGTFS 382 (385) T ss_pred ccEEecccceEEEEec Confidence 9999999999999999 No 14 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=99.71 E-value=2e-18 Score=117.58 Aligned_cols=359 Identities=17% Similarity=0.115 Sum_probs=206.4 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcchhHH--HH Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKM--TN 78 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~Ek~K~k~em--tE 78 (393) |+| +.|...+++++.+.+.++....++ ++++.. ..+++++..+.++.+++++.+..+...+.+....... .. 
T Consensus 1 M~~--l~el~~~~~~~~~e~~~l~~~~~~-e~~~~~---~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (385) T protein:vir:18 1 MSE--LALIQKAIEESQQKMTQLFDAQKA-EIESTG---QVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEK 74 (385) T ss_pred ChH--HHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchh Confidence 987 667777788888777766544332 222222 3455666666666666665444444333221111111 01 Q ss_pred HHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEE Q lcl|Aclame:pro 79 FIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSR 158 (393) Q Consensus 79 fLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i 158 (393) -...+.+..++.+.+....+........ ..+ .....+....+|..+...|-..+.++.++++.+.+.+.+.....+ T Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~---~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 150 (385) T protein:vir:18 75 KSFSERAAEELIKSWDGKQGTFGAKTFN-KSL---GSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEY 150 (385) T ss_pred hhhHHHHHHHHHHHHHHhhccchhhHHH-hhh---ccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEE Confidence 1122334444444443333321111111 011 111111122578778888889999999999876665554443333 Q ss_pred eecc--ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeec Q lcl|Aclame:pro 159 SFDS--SNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG 236 (393) Q Consensus 159 ~l~n--a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~g 236 (393) -..+ ...++.+.-|.++.+...+|...++.|..++....+.+-+.+ ++ ..+.+|++++|+.++. 
++++.++..| T Consensus 151 ~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~-d~--~~l~~~i~~~la~a~~-~~~d~~~l~G 226 (385) T protein:vir:18 151 VREEVFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQVMD-DA--PMLQSYINNRLMYGLA-LKEEGQLLNG 226 (385) T ss_pred EEEecCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHHHHh-hH--HHHHHHHHHHHHHHHH-HHHHHHHHhc Confidence 3222 345555667889999999999999999999988777554433 22 5699999999999988 7999999999 Q ss_pred cCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhhhhccccccceeee Q lcl|Aclame:pro 237 DGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIK 315 (393) Q Consensus 237 DG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~ 315 (393) ||++.. ..-+ ....-.++.+...++....|.|..++ .+-........+++|+.++.+|+. ++ T Consensus 227 ~g~~~~--~~Gi---~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~------------lk 289 (385) T protein:vir:18 227 DGTGDN--LEGL---NKVATAYDTSLNATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIAL------------LK 289 (385) T ss_pred cCCCCc--cccc---ccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHH------------hh Confidence 998531 0000 00000011112223445668888877 444455566788999999999887 77 Q ss_pred cCCCeEEEeee---cccchhhcccchhce--------eeeeccc--cee-cccceeeecc----eeEeecCceEEEeeee Q lcl|Aclame:pro 316 NDDTEIASEVG---VDEIIVYTGSKAVKP--------TVLVDQK--YHI-DMQDLTKVDA----FEWKTNSNMILVETLT 377 (393) Q Consensus 316 ~~~~~~~~~v~---~~~~~~~tG~k~~~p--------tv~vD~k--~~~-~~~~~~~~~s----~~~~~ns~~i~~~~~~ 377 (393) +.+|.+.++-. .+...+ |- .|+. .++.|-+ |.+ +-+|++-.-+ --|..|.-.|.++.+. 
T Consensus 290 d~~G~~l~~~~~~~~~~~l~--G~-pV~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~ 366 (385) T protein:vir:18 290 DNEGRYIFGGPQAFTSNIMW--GL-PVVPTKAQAAGTFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERL 366 (385) T ss_pred cCCCceeccCcccCCCceec--ce-eeEEcCcCCCCcEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEee Confidence 88888766431 112222 42 2222 2333432 222 3333321111 1133455566777899 Q ss_pred cccccccCcceeeeeC Q lcl|Aclame:pro 378 SGHVETYNAGAVITVS 393 (393) Q Consensus 378 ~g~~~~~n~~~~~~v~ 393 (393) .|.+.-|++-++++++ T Consensus 367 ~~~v~~~~a~~~~~~~ 382 (385) T protein:vir:18 367 ALAHYRPTAIIKGTFS 382 (385) T ss_pred ccEEecccceEEEEec Confidence 9999999999999999 No 15 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=99.70 E-value=2.7e-18 Score=116.86 Aligned_cols=364 Identities=13% Similarity=0.075 Sum_probs=210.5 Q ss_pred CCcchhh-HHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhh-----hcchh Q lcl|Aclame:pro 1 MNKPDLI-EKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEK-----PKGKD 74 (393) Q Consensus 1 ~~k~d~~-ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~Ek-----~K~k~ 74 (393) |. |+. +-+.++.++.+.+..+.++...-.-..+ +...++++++.++.+++++|++.+.+++....+ ...+. 
T Consensus 1 m~--~~~~~l~~~~~~~~~~~~~~~e~~~~~~~~~~-e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~ 77 (390) T protein:vir:97 1 MT--DITAKLEATLANVTDSLKAFGERAVRDGELNA-SARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVS 77 (390) T ss_pred Ch--HHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Confidence 32 222 2344555555555544444332211112 223678889999999999998877666654321 11111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccch--hhhHhhcchhHHHHHHHHHHhhCccccceeeeccc Q lcl|Aclame:pro 75 KMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTI--TDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVG 152 (393) Q Consensus 75 emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~--qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~ 152 (393) ..+..........|........+. ....+.+.+. .+.++ .+--.++|..++..|-..++++.++++.+.+...+ T Consensus 78 -~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~-~~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~ 153 (390) T protein:vir:97 78 -VGDMFVASEQFQASTGRWNDRSAR--ATMNIKAALN-TASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTD 153 (390) T ss_pred -chhhhhhhHHHHHHHHHhhhhhhh--hhhHHHHHHH-hhhcccccccccccchhhhHHHHHHHhhhhhhHhhcceeecc Confidence 123334444445554444333332 1111222221 12211 22223789888899999999999988866655554 Q ss_pred ceeEEEeecc--ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 153 ALLVSRSFDS--SNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVD 230 (393) Q Consensus 153 ~~a~~i~l~n--a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~ 230 (393) .....+-..+ ...+..+.-|+++.+...+|...++.|..++.+..+.+-+. 
+.+ .++.+|++++|+.++- +.++ T Consensus 154 ~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell--~ds-~~l~~~i~~~la~a~~-~~~d 229 (390) T protein:vir:97 154 SALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQIL--SDA-PQLASYMNNRLIRGLK-VKED 229 (390) T ss_pred CCceEEEEEecCCcceeeecCCccccccccceeEEEEeeeeEEEeehhhHHHH--HhH-HHHHHHHHHHHHHHHH-HHHH Confidence 4433343332 23444456688999999999999999999888877744332 223 4689999999999999 7999 Q ss_pred cceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhceeee-cCCCceEEEecchhhhHhhhhhccccc Q lcl|Aclame:pro 231 LALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRP-TAGRRYLIVKTEDRKALLDELRQATAN 309 (393) Q Consensus 231 rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald~a~~-~a~~~~l~i~~~d~~a~~~~~~~~~~~ 309 (393) ++++.|||++.. ..-+.... -.++.++..+++...|.+..++.-+.+ -.....+++|+.++.+|+. T Consensus 230 ~a~l~G~g~~~~-p~Gi~~~~----~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~-------- 296 (390) T protein:vir:97 230 AEILRGTGANDG-LLGLIPQA----TTYAAPTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIEL-------- 296 (390) T ss_pred HHHhhcCCCCcc-ccceeecc----ccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHH-------- Confidence 999999998541 00111111 111222333445566778777733333 3345568889999888886 Q ss_pred cceeeecCCCeEEEeeeccc--chhhcccchhc-------eeeeecccc-e-e-cccceeeecce---eEeecCceEEEe Q lcl|Aclame:pro 310 ANVRIKNDDTEIASEVGVDE--IIVYTGSKAVK-------PTVLVDQKY-H-I-DMQDLTKVDAF---EWKTNSNMILVE 374 (393) Q Consensus 310 a~~~l~~~~~~~~~~v~~~~--~~~~tG~k~~~-------ptv~vD~k~-~-~-~~~~~~~~~s~---~~~~ns~~i~~~ 374 (393) |++.+|.+-+.-+.+. -++ -|-..++ +.++.|-+. + + +-+|++-.-+. 
.|..|.-.+.++ T Consensus 297 ----lkd~~G~~l~~~~~~~~~~~l-~G~pV~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~f~~~~~~~r~~ 371 (390) T protein:vir:97 297 ----AKDANNQYLIGNARGTLTPTL-WGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVNDDFQRNMVTVLAE 371 (390) T ss_pred ----hhcCCCceeecCccCCCCcee-cceeeEEcCCCCCCcEEEEeccceEEEEEecceEEEEeecccccccCcEEEEEE Confidence 7777777665432221 111 0322221 234445432 2 2 33444432222 244555567788 Q ss_pred eeecccccccCcceeeeeC Q lcl|Aclame:pro 375 TLTSGHVETYNAGAVITVS 393 (393) Q Consensus 375 ~~~~g~~~~~n~~~~~~v~ 393 (393) .+..|.+..|++-+++++| T Consensus 372 ~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 372 ERLALVVYRPEALITGSFA 390 (390) T ss_pred EeeccEEeccccEEEEEeC Confidence 8999999999999999999 No 16 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=99.70 E-value=6.2e-18 Score=114.94 Aligned_cols=367 Identities=13% Similarity=0.071 Sum_probs=211.1 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh----cchhHH Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKP----KGKDKM 76 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~Ek~----K~k~em 76 (393) |-+. +++.++++.++++.+..+.++...-....+. +..++++++.++++++++|++.+..++...... 
.....+ T Consensus 1 m~e~-~~~l~~~~~~~~~~~~~~~e~~~~~~~~~~e-~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 78 (390) T protein:vir:10 1 MTDI-TSKLEATLANVTDSLRAFGERAVRDGELNAS-ARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSV 78 (390) T ss_pred ChHH-HHHHHHHHHHHHHHHHHHHHHHHhhcccCHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccch Confidence 3222 1223555666665555544443322111122 236788889999999999887666555544211 111123 Q ss_pred HHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchh-hhHhhcchhHHHHHHHHHHhhCccccceeeeccccee Q lcl|Aclame:pro 77 TNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTIT-DTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALL 155 (393) Q Consensus 77 tEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~q-d~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a 155 (393) .+......+...|+.......+.. ...+.+........++ +-...+|..++..|-+.++++.++++.+.+.+.+... T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 156 (390) T protein:vir:10 79 GDLFVASEQFQASAGRWNDRSARA--TMNIKAALNTASTDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGRTDSAL 156 (390) T ss_pred hhhhhhhHHHHHHHHhhhhhhhhh--hhHHHHHHHhhhcccccccccccchhHHHHHHHHHHhhchhhhhcceeeccCCc Confidence 344455555555554444333321 1122222222211111 1222678888888888889889999866666655443 Q ss_pred EEEeecc--ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcce Q lcl|Aclame:pro 156 VSRSFDS--SNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLAL 233 (393) Q Consensus 156 ~~i~l~n--a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAv 233 (393) ..+-..+ ...+.-+.-|+++++...+|...++.|..++....+.+.+.+ .+ .++.+|++++|+..+- |.+++++ T Consensus 157 ~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~--d~-~~l~~~i~~~l~~~~~-~~~~~~i 232 (390) T protein:vir:10 157 IEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILS--DA-PQLASYMNNRLIRGLK-VKEDAEI 232 (390) T ss_pred eEEEEEecCCcceeeecCCccccccccceeEEEEeeEEEEEeehhhHHHHH--hH-HHHHHHHHHHHHHHHH-HHHHHHH Confidence 3333322 234444566889999999999999999988877777544432 23 3799999999999999 7999999 Q ss_pred 
eeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhhhhccccccce Q lcl|Aclame:pro 234 VEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQATANANV 312 (393) Q Consensus 234 v~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~ 312 (393) +.|||++.. ..-+..+.. .+..++..+++...|.+..++ .+.........+++|+.++.+|+. T Consensus 233 l~G~G~~~~-p~Gi~~~~~----~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~----------- 296 (390) T protein:vir:10 233 LRGTGANDG-LLGLIPQAT----TYAAPTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIEL----------- 296 (390) T ss_pred hhcCCCCcc-ccccccccc----cccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHH----------- Confidence 999998531 111111110 111122334445567787777 333344445578889999888876 Q ss_pred eeecCCCeEEEeeecccc--hhhcccchhc-------eeeeecccc--ee-cccceeeecce---eEeecCceEEEeeee Q lcl|Aclame:pro 313 RIKNDDTEIASEVGVDEI--IVYTGSKAVK-------PTVLVDQKY--HI-DMQDLTKVDAF---EWKTNSNMILVETLT 377 (393) Q Consensus 313 ~l~~~~~~~~~~v~~~~~--~~~tG~k~~~-------ptv~vD~k~--~~-~~~~~~~~~s~---~~~~ns~~i~~~~~~ 377 (393) |++.+|.+-+.-.++.. ++ -|-..+. +.++.|-+. .+ +-+|+.-.-+. .|.+|.-.+.++.+. 
T Consensus 297 -lkd~~g~~l~~~~~~~~~~~l-~G~pv~~~~~~p~~~~~~gdf~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~~~r~ 374 (390) T protein:vir:10 297 -AKDANNQYLIGNARGTLTPTL-WGLPVVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERL 374 (390) T ss_pred -hhcCCCceeecCCcCcCCcee-cceeeEEcCCCCCCcEEEEeccceEEEEEecceEEEEeecccccccCcEEEEEEEee Confidence 78888887665433221 11 0222111 233344332 12 33443322111 133444466677899 Q ss_pred cccccccCcceeeeeC Q lcl|Aclame:pro 378 SGHVETYNAGAVITVS 393 (393) Q Consensus 378 ~g~~~~~n~~~~~~v~ 393 (393) .|.+.-|++-+++++| T Consensus 375 d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 375 ALVVYRPEALISGSFA 390 (390) T ss_pred ccEEeccccEEEEEeC Confidence 9999999999999999 No 17 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=99.70 E-value=2.8e-18 Score=116.85 Aligned_cols=353 Identities=12% Similarity=0.093 Sum_probs=197.6 Q ss_pred CCc-chhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcc----hhH Q lcl|Aclame:pro 1 MNK-PDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKG----KDK 75 (393) Q Consensus 1 ~~k-~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~Ek~K~----k~e 75 (393) ||+ .+++|..++++++++....+..++.. ..+++......++|+.+.++++..+++..++.+....+.+.. ... 
T Consensus 1 m~~~m~i~el~~~~~~~~~~~~~~~~e~~~-~~~~~~~~~e~i~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (408) T protein:vir:74 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINM-ALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEK 79 (408) T ss_pred CChhhhHHHHHHHHHHHHHHHHHHHHHHHH-HHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 774 48888899999999999988888865 344444344456666666666666665544444332211000 000 Q ss_pred HHHHHHHHHHHHHHHHHHHHc-cCChhHHHHHHHHHHhC-ccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccc Q lcl|Aclame:pro 76 MTNFIESQNAVTEFFDVLKKN-SGKSEIKNAWNAKLAEN-GVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGA 153 (393) Q Consensus 76 mtEfLkTkqA~~dya~ll~~n-qg~ke~k~AW~a~L~ek-gV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~ 153 (393) ...-....+....|.+-+... .+...... ......- ..+..+--.++|..+...|-+.++++.++++..++.+.+. T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~a~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~ 157 (408) T protein:vir:74 80 GPLNKSENELKDKFVKDFVNMVRNPMAFLN--TVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVST 157 (408) T ss_pred ccccchhhhhHHHHHHHHHHHHhcchhhhh--hhhhhhhcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeeccC Confidence 001111222222222211111 11111100 0111111 1112222336899999999999999999988777766655 Q ss_pred eeEEEee--ccc--c-ccceecccchhhh-hhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 154 LLVSRSF--DSS--N-EAQVHKDGQTKTE-QAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNK 227 (393) Q Consensus 154 ~a~~i~l--~na--~-~a~GHk~ga~Kk~-q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~R 227 (393) ....+.. .+. . ..|+. -|+++.+ ...+|...++.|..++....+.+.+.+ .+.-++.+|++++|+.++. 
+ T Consensus 158 ~~~~~~~~~~~~~~~~~~~v~-E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~l~~~~~-~ 233 (408) T protein:vir:74 158 SSGSRVYEKWTDVTPLKAMDE-EDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLK--DTAENILAWLSSWIAKKVV-V 233 (408) T ss_pred CcceEEEEeecCCcccccccc-cccccccccccceeeEEeeeeeEEeeehhHHHHHh--hchHHHHHHHHHHHHHHHH-H Confidence 4433222 222 2 22333 2344454 568999999999999988777444333 2224689999999999998 7 Q ss_pred HHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhc-eeee-cCCCceEEEecchhhhHhhhhhc Q lcl|Aclame:pro 228 IVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVD-FVRP-TAGRRYLIVKTEDRKALLDELRQ 305 (393) Q Consensus 228 av~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald-~a~~-~a~~~~l~i~~~d~~a~~~~~~~ 305 (393) .++.++++|||+... .+.+.+.+++..++. ...+ -..+..+++|+.++.+|+- T Consensus 234 ~~d~~il~G~G~~~~---------------------~~~~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~l~~---- 288 (408) T protein:vir:74 234 TRNQAIIAAMGTVPK---------------------KPTIANFDDVITMINTSVDPAIIATSSLLTNQSGLNKLAL---- 288 (408) T ss_pred HHHHHHhhccccccc---------------------ccccccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHH---- Confidence 999999999998431 233445677776662 1111 1234567889999888886 Q ss_pred cccccceeeecCCCeEEEeeecccchhhc--ccchh------ceee--------eecccc-ee--cccceeeecce---- Q lcl|Aclame:pro 306 ATANANVRIKNDDTEIASEVGVDEIIVYT--GSKAV------KPTV--------LVDQKY-HI--DMQDLTKVDAF---- 362 (393) Q Consensus 306 ~~~~a~~~l~~~~~~~~~~v~~~~~~~~t--G~k~~------~ptv--------~vD~k~-~~--~~~~~~~~~s~---- 362 (393) |++.+|.+-+...+.+-.-.| |...+ .|++ +-|-+. ++ +-+|+.-.-+. 
T Consensus 289 --------lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~ 360 (408) T protein:vir:74 289 --------VKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAG 360 (408) T ss_pred --------hhcCCCceEeccCcCCCCCceecceeeEEecCcccccccCCcceEEEEehhccEEEEEecceEEEEeccccc Confidence 777777776543222211111 33221 1222 223221 11 22333221111 Q ss_pred eEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 363 EWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 363 ~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) .|..|+-.+.++....|.+--|++-+++++. T Consensus 361 ~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~ 391 (408) T protein:vir:74 361 AFETDTTKIRVIDRFDVKATDSEALVAGSFT 391 (408) T ss_pred hhhcceeeEEEEEeeCcEEecccceEEEEee Confidence 1334556677888889999999988888876 No 18 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=99.69 E-value=2.3e-18 Score=117.26 Aligned_cols=347 Identities=13% Similarity=0.068 Sum_probs=198.1 Q ss_pred hhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcchhHHHHHHHHHHH Q lcl|Aclame:pro 6 LIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMTNFIESQNA 85 (393) Q Consensus 6 ~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~Ek~K~k~emtEfLkTkqA 85 (393) +++.+..|.+++....+++.+|+....++... ..++++++..++++.+++...+.+....+.+..... ........ 
T Consensus 1 meeL~~~~~~~~~~~~e~~~~l~~~~~~~~~~-~e~~~~l~~ei~~~~~~~~~l~~~~~~~~~~~~~~~---~~~~~~~~ 76 (389) T protein:vir:10 1 MDKLQTLFNDVSAKCADLNAQLNAKLQDENAS-VDDFQKIKDDLTAAKARRDAINDQIKALEAEKPAEP---KTEPKDDG 76 (389) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhHhhh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh---hccccccc Confidence 66677778888888888888877655443222 255666666666665555443333332221100000 00000000 Q ss_pred HHHHHHHHHHccCChhHHHHHHHHHHhC--------ccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEE Q lcl|Aclame:pro 86 VTEFFDVLKKNSGKSEIKNAWNAKLAEN--------GVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVS 157 (393) Q Consensus 86 ~~dya~ll~~nqg~ke~k~AW~a~L~ek--------gV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~ 157 (393) ...-.. .........+++|++.|... +.+..|-...+|..+...|-+.+.++.++++..++.+....... T Consensus 77 ~~~~~~--~~~~~~~~~~~~~~~~lr~~~~~~~~~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~ 154 (389) T protein:vir:10 77 SKKGTD--LSKKPIDAKKKAINDFIHSHGKVIDATSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGT 154 (389) T ss_pred cccccc--cchhHHHHHHHHHHHHhhcchhhhhhhcccccCCcceeehHHHHHHHHHHHHhhhhHHhhcceeeccCCeeE Confidence 000000 00000011122333333211 23333444579999999999999999999887777666544333 Q ss_pred Eee--ccccccceecccchhh-hhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhccee Q lcl|Aclame:pro 158 RSF--DSSNEAQVHKDGQTKT-EQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALV 234 (393) Q Consensus 158 i~l--~na~~a~GHk~ga~Kk-~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv 234 (393) +.. .....+..+.-|.++. ....+|...++.|..++....+-+.+.+ .+.-++.+|++++|+..+. 
+..+.+++ T Consensus 155 ~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~la~~~~-~~~~~~i~ 231 (389) T protein:vir:10 155 YPILKRATDRFSSVAELAENPKLAEPEFNKVDWSVATYRGAIPLSEEAIA--DSAVDLTALVGQSIKEKSV-NTYNAMIA 231 (389) T ss_pred EEEEecCCCccccccccccccccccccceeeeeeheeeEeeehhhHHHHh--hhhHHHHHHHHHHHHHHHH-HHHHHHHh Confidence 322 2233333455556565 4678999999999999888777444332 2223679999999999999 79999999 Q ss_pred eccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhceeeecCCCceEEEecchhhhHhhhhhccccccceee Q lcl|Aclame:pro 235 EGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRI 314 (393) Q Consensus 235 ~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l 314 (393) .|+|...+ ...+.....|++...+...-+.+.+..+++|+.++.+|+- | T Consensus 232 ~g~~~~~~-------------------~~~~~~~~~d~l~~~~~~~~~~~~~a~~~~n~~~~~~L~~------------l 280 (389) T protein:vir:10 232 PVLQSFTA-------------------KKTTTDTLVDSLKHILNVDLDPAYSRALVVTQSLFNTLDT------------L 280 (389) T ss_pred hhhccccc-------------------ccccccccHHHHHHHHHhhhhhhhCcEEEecHHHHHHHHH------------h Confidence 99876321 1122334567777666544455567889999999999987 8 Q ss_pred ecCCCeEEEeeecccchhhc------ccchhc------e-------eeeeccc--cee-cccceeeecceeEeecCceEE Q lcl|Aclame:pro 315 KNDDTEIASEVGVDEIIVYT------GSKAVK------P-------TVLVDQK--YHI-DMQDLTKVDAFEWKTNSNMIL 372 (393) Q Consensus 315 ~~~~~~~~~~v~~~~~~~~t------G~k~~~------p-------tv~vD~k--~~~-~~~~~~~~~s~~~~~ns~~i~ 372 (393) ++.+|.+-+..++.+.+... |-..++ | .++-|-+ |.+ +-+|++-..+.. .+....+. 
T Consensus 281 kd~~G~~i~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~-~~~~~~~~ 359 (389) T protein:vir:10 281 KDKNGRYLLHDASDSITDGTAKGTILGVPVYVVGDTLLGSLAGDQKAFVGDLKRGVLFTDRQQVTLAWEDS-KIYGKYLG 359 (389) T ss_pred hccCCCeeeecCcccccccccccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEeecc-ccccceEE Confidence 88888887755554432211 222111 1 1223433 222 334444332222 22334556 Q ss_pred EeeeecccccccCcceeeeeC Q lcl|Aclame:pro 373 VETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 373 ~~~~~~g~~~~~n~~~~~~v~ 393 (393) +.....|-+..|++.++++++ T Consensus 360 ~~~r~d~~~~~~~a~~~~~~~ 380 (389) T protein:vir:10 360 AAFRFGVQKADSKAGYFVTNT 380 (389) T ss_pred EEEEeccEEecccceEEEEee Confidence 666788889999998888888 No 19 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=99.69 E-value=1.4e-17 Score=112.98 Aligned_cols=364 Identities=13% Similarity=0.096 Sum_probs=203.3 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcc-------- Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKG-------- 72 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~Ek~K~-------- 72 (393) |.+ ++|...+|.+++++.......+...-.+ +++.+.+.+++.++++.++|.+.+..++..+++... 
T Consensus 1 mk~--~~em~~~l~el~~~~~~~~~e~~~~~~~---~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 75 (415) T protein:vir:46 1 MKT--KEELQSEISDIKRQIDLKVKYATRALNN---DELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSV 75 (415) T ss_pred Cch--HHHHHHHHHHHHHHHHHHHHHHHHHhch---hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc Confidence 654 4566667777777655555444432222 223345555555666666555444444333321111 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhC-----c-cchhhhHhhcchhHHHHHHHHHHhhCccccce Q lcl|Aclame:pro 73 KDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAEN-----G-VTITDTTFQLPRKLVESINTALLNTNPVFKVF 146 (393) Q Consensus 73 k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ek-----g-V~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~f 146 (393) ...-................+..........++|...+... + +.+.+--.++|..+...|-+.+.++.++++.. T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~ 155 (415) T protein:vir:46 76 EVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYV 155 (415) T ss_pred ccchhhhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhc Confidence 10110111111111112222223333344555555544332 2 22223333799999999999999999998876 Q ss_pred eeecccceeE--EEeeccc-cccceecccchhhh-hhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHH Q lcl|Aclame:pro 147 HVTNVGALLV--SRSFDSS-NEAQVHKDGQTKTE-QAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQ 222 (393) Q Consensus 147 hV~n~~~~a~--~i~l~na-~~a~GHk~ga~Kk~-q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq 222 (393) .+.+.+.+.. .+.-.+. ..+..+.-|.++.+ ...+|...++.|..++....+.+.+.+-.. -++.+|++++|+. 
T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~ 233 (415) T protein:vir:46 156 TVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMAR 233 (415) T ss_pred ceeeccCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhch--HHHHHHHHHHHHH Confidence 6666554432 2222332 23334455666665 457999999999999888777544443222 4679999999999 Q ss_pred HHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhh Q lcl|Aclame:pro 223 AIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLD 301 (393) Q Consensus 223 ~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~ 301 (393) ++- |+++.++..|||+..... .... . ......+..+.+...+++..++ .+..+-.+...+++|+.++.+|+. T Consensus 234 ~i~-~~~d~~il~g~g~g~~~~--~~~~-~---~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~ 306 (415) T protein:vir:46 234 TIA-ATRNKAIIDVITKGSTGS--TSSG-F---EKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDK 306 (415) T ss_pred HHH-HHHHHHHhhccccCCccc--cccc-c---ccccceeccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHH Confidence 998 799999999998743211 1111 0 1112233444556678888777 444555666789999999988876 Q ss_pred hhhccccccceeeecCCCeEEEeeeccc----chhhcccchhc------------eeeeecccc-ee--cccceeeecce Q lcl|Aclame:pro 302 ELRQATANANVRIKNDDTEIASEVGVDE----IIVYTGSKAVK------------PTVLVDQKY-HI--DMQDLTKVDAF 362 (393) Q Consensus 302 ~~~~~~~~a~~~l~~~~~~~~~~v~~~~----~~~~tG~k~~~------------ptv~vD~k~-~~--~~~~~~~~~s~ 362 (393) |++.+|.+.+.-.+.+ ... |...++ +.++-|-+. ++ +.+|++-. .. 
T Consensus 307 ------------lkd~~G~~i~~~~~~~~~~~~l~--G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~-~~ 371 (415) T protein:vir:46 307 ------------MKDKLGNYLIQPDVKEKTQQRLL--GAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQAS-WT 371 (415) T ss_pred ------------hhccCCCeeeccCcCCCCCcccc--ceeeEEeccccccCCCccEEEEEehhccEEEEeecceEEE-ee Confidence 6777777654221111 111 322211 223334332 22 33444322 11 Q ss_pred eEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 363 EWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 363 ~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) .-.+++..+.+..+..|.+.-|++-++++++ T Consensus 372 ~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:46 372 DYMHFGECLMIAVRQDCRILDYKSAIVIEYD 402 (415) T ss_pred ccccCceEEEEEEEeccEEeccccEEEEEee Confidence 1234455677788889999999999999988 No 20 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=99.69 E-value=1.4e-17 Score=112.98 Aligned_cols=364 Identities=13% Similarity=0.096 Sum_probs=203.3 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcc-------- Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKG-------- 72 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~Ek~K~-------- 72 (393) |.+ ++|...+|.+++++.......+...-.+ +++.+.+.+++.++++.++|.+.+..++..+++... 
T Consensus 1 mk~--~~em~~~l~el~~~~~~~~~e~~~~~~~---~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 75 (415) T protein:vir:47 1 MKT--KEELQSEISDIKRQIDLKVKYATRALNN---DELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSV 75 (415) T ss_pred Cch--HHHHHHHHHHHHHHHHHHHHHHHHHhch---hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc Confidence 654 4566667777777655555444432222 223345555555666666555444444333321111 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhC-----c-cchhhhHhhcchhHHHHHHHHHHhhCccccce Q lcl|Aclame:pro 73 KDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAEN-----G-VTITDTTFQLPRKLVESINTALLNTNPVFKVF 146 (393) Q Consensus 73 k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ek-----g-V~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~f 146 (393) ...-................+..........++|...+... + +.+.+--.++|..+...|-+.+.++.++++.. T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~ 155 (415) T protein:vir:47 76 EVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYV 155 (415) T ss_pred ccchhhhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhc Confidence 10110111111111112222223333344555555544332 2 22223333799999999999999999998876 Q ss_pred eeecccceeE--EEeeccc-cccceecccchhhh-hhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHH Q lcl|Aclame:pro 147 HVTNVGALLV--SRSFDSS-NEAQVHKDGQTKTE-QAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQ 222 (393) Q Consensus 147 hV~n~~~~a~--~i~l~na-~~a~GHk~ga~Kk~-q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq 222 (393) .+.+.+.+.. .+.-.+. ..+..+.-|.++.+ ...+|...++.|..++....+.+.+.+-.. -++.+|++++|+. 
T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~ 233 (415) T protein:vir:47 156 TVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMAR 233 (415) T ss_pred ceeeccCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhch--HHHHHHHHHHHHH Confidence 6666554432 2222332 23334455666665 457999999999999888777544443222 4679999999999 Q ss_pred HHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhh Q lcl|Aclame:pro 223 AIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLD 301 (393) Q Consensus 223 ~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~ 301 (393) ++- |+++.++..|||+..... .... . ......+..+.+...+++..++ .+..+-.+...+++|+.++.+|+. T Consensus 234 ~i~-~~~d~~il~g~g~g~~~~--~~~~-~---~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~ 306 (415) T protein:vir:47 234 TIA-ATRNKAIIDVITKGSTGS--TSSG-F---EKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDK 306 (415) T ss_pred HHH-HHHHHHHhhccccCCccc--cccc-c---ccccceeccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHH Confidence 998 799999999998743211 1111 0 1112233444556678888777 444555666789999999988876 Q ss_pred hhhccccccceeeecCCCeEEEeeeccc----chhhcccchhc------------eeeeecccc-ee--cccceeeecce Q lcl|Aclame:pro 302 ELRQATANANVRIKNDDTEIASEVGVDE----IIVYTGSKAVK------------PTVLVDQKY-HI--DMQDLTKVDAF 362 (393) Q Consensus 302 ~~~~~~~~a~~~l~~~~~~~~~~v~~~~----~~~~tG~k~~~------------ptv~vD~k~-~~--~~~~~~~~~s~ 362 (393) |++.+|.+.+.-.+.+ ... |...++ +.++-|-+. ++ +.+|++-. .. 
T Consensus 307 ------------lkd~~G~~i~~~~~~~~~~~~l~--G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~-~~ 371 (415) T protein:vir:47 307 ------------MKDKLGNYLIQPDVKEKTQQRLL--GAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQAS-WT 371 (415) T ss_pred ------------hhccCCCeeeccCcCCCCCcccc--ceeeEEeccccccCCCccEEEEEehhccEEEEeecceEEE-ee Confidence 6777777654221111 111 322211 223334332 22 33444322 11 Q ss_pred eEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 363 EWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 363 ~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) .-.+++..+.+..+..|.+.-|++-++++++ T Consensus 372 ~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:47 372 DYMHFGECLMIAVRQDCRILDYKSAIVIEYD 402 (415) T ss_pred ccccCceEEEEEEEeccEEeccccEEEEEee Confidence 1234455677788889999999999999988 No 21 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=99.69 E-value=3.5e-18 Score=116.30 Aligned_cols=346 Identities=14% Similarity=0.096 Sum_probs=192.0 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcchhHHHHHH Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMTNFI 80 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~Ek~K~k~emtEfL 80 (393) |+| .+..+.+++....+++.+++.... +......++++++..++.+..+++..+.++..+++..+......... 
T Consensus 1 M~~-----l~~l~~~~~~~~~e~~~~~~~~~~-~~~~~~ee~~~~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~ 74 (394) T protein:vir:10 1 MDK-----LQTLFNEVSAKCADLNAQLNAKLQ-DENASVDDFQKIKDDLTAAKARRDAINDQIKDLEAENKANSDPDKPV 74 (394) T ss_pred ChH-----HHHHHHHHHHHHHHHHHHHHHHHh-hhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhh Confidence 655 344444444444555555543211 12222344555666666666666555444444333222221111111 Q ss_pred HHHHHHHHHHHHHHHccCChhHHHHHHHHHHhC---------ccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecc Q lcl|Aclame:pro 81 ESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAEN---------GVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNV 151 (393) Q Consensus 81 kTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ek---------gV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~ 151 (393) ..++.... + .........+++|.+-|... +....|-...+|..+...|-+.+.++.++++...+.+. T Consensus 75 ~~~~~~~~--~--~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~ 150 (394) T protein:vir:10 75 DNAQPNGT--D--LKKKPIDAKKKAINDFIHSHGKVIDNAAGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPV 150 (394) T ss_pred hhhccccc--c--hhhhHHHHHHHHHHHHHhccchhhhhhhcccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeec Confidence 11100000 0 00000012233444433322 23333444579999999999999999998887666666 Q ss_pred cceeEEEee-c-c-ccccceecccchhhh-hhhhhhhhhccHHHHHHHHHHH-HHHHhhcCchhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 152 GALLVSRSF-D-S-SNEAQVHKDGQTKTE-QAATLTIDTLEPVMVYKLQSLA-ERVKRLQMSYSELYNLIVAELTQAIVN 226 (393) Q Consensus 152 ~~~a~~i~l-~-n-a~~a~GHk~ga~Kk~-q~~~le~~ti~p~~VYkkq~La-d~~k~l~g~ygalvnyvm~ELaq~fI~ 226 (393) +.....+.. + . ...+|. .-|.++.+ +...|...++.|..+|.+..+. +++.+. .-++.+|++++|++.+. 
T Consensus 151 ~~~~~~~~~~~~~~~~~~~~-~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds---~~~l~~~i~~~la~~~~- 225 (394) T protein:vir:10 151 TTPKGTYPILKRATDRFSSV-AELAENPALAEPEFEQVDWSVSTYRGAIPLSEEAIADS---AVDLTSLVGQSINEKSV- 225 (394) T ss_pred cCCceEEEEEecCCCccccc-cccccccccccccceeEEeeeeeeEeeehhHHHHHhhh---hHHHHHHHHHHHHHHHH- Confidence 554433332 2 2 233443 33445554 6789999999999998887773 333332 24679999999999999 Q ss_pred HHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhceeeecCCCceEEEecchhhhHhhhhhcc Q lcl|Aclame:pro 227 KIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKTEDRKALLDELRQA 306 (393) Q Consensus 227 Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald~a~~~a~~~~l~i~~~d~~a~~~~~~~~ 306 (393) +..+.++..|+|+-.+ ...+.....|++...++..-+.+.+..+++|+.++.+|+. T Consensus 226 ~~~~~~il~g~g~~~~-------------------~~~~~~~~~d~l~~~~~~~~~~~~~a~~vmn~~~~~~l~~----- 281 (394) T protein:vir:10 226 NTYNAMIAPVLQSFTA-------------------KATTTDTLVDSLKHILNVDLDPAYSRALVVTQSLFNTLDT----- 281 (394) T ss_pred HHHHHHHhhccccccc-------------------ccccccccHHHHHHHHHhhhhhhccCEEEecHHHHHHHHH----- Confidence 7999999999886321 1122344567777777555556677899999999999886 Q ss_pred ccccceeeecCCCeEEEeeecccchhhcccc--hhceeeee-----------------ccc-cee--cccceeeecceeE Q lcl|Aclame:pro 307 TANANVRIKNDDTEIASEVGVDEIIVYTGSK--AVKPTVLV-----------------DQK-YHI--DMQDLTKVDAFEW 364 (393) Q Consensus 307 ~~~a~~~l~~~~~~~~~~v~~~~~~~~tG~k--~~~ptv~v-----------------D~k-~~~--~~~~~~~~~s~~~ 364 (393) |++.+|.+.+..++.+.+--.+.. .=+|.+++ |-+ +++ +.+|++-.-+... 
T Consensus 282 -------lkd~~G~~i~~~~~~~~~~~~~~~~L~G~PV~~~~~~~~~~~~~~~~i~~gd~s~~~~~~~~~~~~v~~~~~~ 354 (394) T protein:vir:10 282 -------LKDKNGRYLLHDASDSITDGTAKGTVLGVPVYVVGDALLGSAAGDQKAFVGDLKRGVLFADRQQVTLAWEDSK 354 (394) T ss_pred -------hhccCCCeeeeccccccccCCcccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEEeccc Confidence 888888876654443322100000 11233333 322 122 2334332222222 Q ss_pred eecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 365 KTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 365 ~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) .+. ..|.+.....|.+.-+++-++++++ T Consensus 355 ~~~-~~~~~~~r~d~~~~~~~ai~~~~~~ 382 (394) T protein:vir:10 355 IYG-RYLGAAFRFGVKQADSNAGYFVTNT 382 (394) T ss_pred ccc-eeEEEEEEeccEEeccccEEEEEee Confidence 222 3466677888999999999999988 No 22 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=99.69 E-value=2.5e-17 Score=111.59 Aligned_cols=371 Identities=15% Similarity=0.170 Sum_probs=199.1 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhH----HhhhhHH----HHHHHHHHHHHHHHHHHHHHHHhhhhhhcc Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNA----IEDLPKV----QELEKTLSENSIEIIKIENELNAQEEKPKG 72 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a----~~~~ski----eelektis~l~aEi~k~enel~~~~Ek~K~ 72 (393) -.+..+.|.++++.++.++..+++.+++....+.. .+++.++ .+++..+++++.++...+++++.+... +. 
T Consensus 14 ~~~~~l~el~~~~~el~~~~~el~~~~e~ak~eee~~~l~~ei~~le~e~~~l~~~~~~le~~~~~~~~~l~~~~~~-~~ 92 (425) T protein:vir:95 14 QRKAALDELVKREQELQAKAAELEQAIEEAQTEEEVSAVEEEVAKLEDERNELNEKKSKLEGEIAQLEDELEQINSK-QP 92 (425) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-cc Confidence 11344556666777777777777666665433221 1122222 223333333333333344444443221 11 Q ss_pred hhHHHHHHHHHH------HHHHHHHHHHHccC--ChhHHHHHHHHHHhC-ccchhhhHhhcchhHHHHHHHHHHhhCccc Q lcl|Aclame:pro 73 KDKMTNFIESQN------AVTEFFDVLKKNSG--KSEIKNAWNAKLAEN-GVTITDTTFQLPRKLVESINTALLNTNPVF 143 (393) Q Consensus 73 k~emtEfLkTkq------A~~dya~ll~~nqg--~ke~k~AW~a~L~ek-gV~~qd~~eiLP~~ii~AIe~A~ed~d~vl 143 (393) +.+..++....+ ....++..+ ..+. ...--..+...+.+. ++ .+...++|.-+...|-+.+.++.+++ T Consensus 93 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~--~~gg~~vP~~~~~~Ii~~l~~~~~i~ 169 (425) T protein:vir:95 93 SNQSRQKMQGSKGDVVEMNRLQVREML-KTGEYYKRSEVVEFYEKFRNLRAV--AGGELTIPEVVVNRIMDIMGDYTTLY 169 (425) T ss_pred chhhhhhhhhhhhhHHHHHHHHHHHHH-hhhhhhhhhHHHHHHHHHHhhccc--ccCceeccHHHHHHHHHHHHhhhhHH Confidence 222222222111 111122222 1111 111112233333322 22 33444789999999999999999999 Q ss_pred cceeeecccceeEEEeecc-ccccceecccchhhhhhh-hhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHH Q lcl|Aclame:pro 144 KVFHVTNVGALLVSRSFDS-SNEAQVHKDGQTKTEQAA-TLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELT 221 (393) Q Consensus 144 ~~fhV~n~~~~a~~i~l~n-a~~a~GHk~ga~Kk~q~~-~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELa 221 (393) +...+.+.... ..+-... ...+.-+.-|.+..++.. 
+|...++.|..++.+..+.+.+-+ .+..++.+||.++|+ T Consensus 170 ~~~~~~~~~g~-~~ip~~~~~~~a~~v~E~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~l~ 246 (425) T protein:vir:95 170 PLVDKIRVKGT-TRILVDTDTSPATWIEQSGALPTGDVGTIASIDFDGFKVGKVTFVDNYLLQ--DSIINLDDYVTKKIA 246 (425) T ss_pred HhhceeecCce-eEEEEecCCccccccccccccccccccccceeeeeheeeeeeehhhHHHHh--ccHHHHHHHHHHHHH Confidence 97776665433 2332223 345555667777777765 799999999988887777444433 333578999999999 Q ss_pred HHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhce---eeecCCCceEEEecchhhh Q lcl|Aclame:pro 222 QAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDF---VRPTAGRRYLIVKTEDRKA 298 (393) Q Consensus 222 q~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald~---a~~~a~~~~l~i~~~d~~a 298 (393) .++. ++++.+++.|||+......-+..++.... .++..+.+...+.+...+.- +....++-..++++.|... T Consensus 247 ~~i~-~~~d~~il~G~G~~~~~p~Gil~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~ 321 (425) T protein:vir:95 247 RAIA-KALDLAIVKGTGAANKQPLGIIPSLPPEN----QVTVEADNNLLKNLVKQIGLIDTGDDSVGEIVAVMKRSTYYN 321 (425) T ss_pred HHHH-HHHHHHhhccCCCCccccceeeccccccc----ccccccccchHHHHHHHHHhhhhhccccCceEEEEeChHHHH Confidence 9999 79999999999974211111111111111 12233455566677666532 2223345567788887544 Q ss_pred HhhhhhccccccceeeecCCCeEEE--eeecccchhhcccchhc-------eeeeeccccee--cccceeeecceeEeec Q lcl|Aclame:pro 299 LLDELRQATANANVRIKNDDTEIAS--EVGVDEIIVYTGSKAVK-------PTVLVDQKYHI--DMQDLTKVDAFEWKTN 367 (393) Q Consensus 299 ~~~~~~~~~~~a~~~l~~~~~~~~~--~v~~~~~~~~tG~k~~~-------ptv~vD~k~~~--~~~~~~~~~s~~~~~n 367 (393) .+..|+ .++|++|.|.. +.+.....+ |...+. +.++-|-++|+ +.+|++---+..-.|. 
T Consensus 322 ~l~~l~--------~~kd~~g~~i~~~~~~~~~~l~--G~pvv~~~~~~~~~i~~Gd~~~~~~~~~~~~~i~~~~~~~f~ 391 (425) T protein:vir:95 322 RLVEFS--------IQVDSNGNVVGKLPNLRTPDLL--GLRVVFNNFLDDDTVLFGEFEQYTLVERENITIDSSTHVKFT 391 (425) T ss_pred HHHHHH--------hhcCCCCceeeccCCCCCcccc--ceeeEEcCcCCCccEEEEecccEEEEeecceEEEeecccccc Confidence 332222 27788887653 223222222 433222 33444554443 4455554444444555 Q ss_pred Cce--EEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 368 SNM--ILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 368 s~~--i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) .+. |.+..+..|-+..|++-++.+|. T Consensus 392 ~~~~~~~~~~r~d~~~~~~~a~~~~~i~ 419 (425) T protein:vir:95 392 EDQTAFRGKGRFDGKPVKPEAFVLVTIT 419 (425) T ss_pred cCceEEEEEEeeCcEeecccceEEEEec Confidence 554 44556788999999999999988 No 23 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=99.69 E-value=2.6e-18 Score=117.00 Aligned_cols=341 Identities=11% Similarity=0.086 Sum_probs=198.5 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcchhHHH--H Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMT--N 78 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~Ek~K~k~emt--E 78 (393) |. .++|..++++++++....+.++++....++. ....++++++..++++..+++.....++...+......... . 
T Consensus 1 Mk--~~~el~~~~~~~~~~~~~l~~~~~~~~~~~~-~~~ee~~~~~~~i~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ 77 (397) T protein:vir:49 1 MK--TSNELHDLWVAQGDKVENLNEKLNVAMLDDS-VSAEELQAIKNERDTAKMKRDMFKEQYTEARANEVANMSEEEKK 77 (397) T ss_pred Cc--hHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-cCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Confidence 75 5667778888888888888887776543332 22345666677777766666543333322111000000000 0 Q ss_pred HH--HHHHHHHHHHHHHHHccCChhHHHHHHHHHHhC---------ccchhhhHhhcchhHHHHHHHHHHhhCcccccee Q lcl|Aclame:pro 79 FI--ESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAEN---------GVTITDTTFQLPRKLVESINTALLNTNPVFKVFH 147 (393) Q Consensus 79 fL--kTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ek---------gV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fh 147 (393) .+ ...+.. .+..++|...|... +.+..+--.++|..+...|-+.++++.++++... T Consensus 78 ~~~~~~~~~~-------------~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~ 144 (397) T protein:vir:49 78 PLTKSEEEVK-------------AGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVN 144 (397) T ss_pred ccccchhHHH-------------HHHHHHHHHHHhcchhHHHHHhhccccccCcccccHhHHHHHHHHHHhhhhHHhhhc Confidence 00 000001 12223333333221 1222222336899999999999999999888767 Q ss_pred eeccccee--EEEee-ccc-cccceecccchhhh-hhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHH Q lcl|Aclame:pro 148 VTNVGALL--VSRSF-DSS-NEAQVHKDGQTKTE-QAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQ 222 (393) Q Consensus 148 V~n~~~~a--~~i~l-~na-~~a~GHk~ga~Kk~-q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq 222 (393) +.+.+... ..+.. .+. ..+..+--|.++.+ ...+|...++.|..+|....+.+.+.+ .+.-++.+|++++|+. 
T Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~l~~ 222 (397) T protein:vir:49 145 VENVTTLTGSRVYEKWTDITGLANIDDEAGKIADVDDPKLSLIKYTIKRYAGISTVTNSLLA--DSAENILAWLSGWIAK 222 (397) T ss_pred eeecccCccceEEEeeccCCcceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHh--hhHHHHHHHHHHHHHH Confidence 76665432 22222 222 33555666778776 468999999999999988777544433 2224679999999999 Q ss_pred HHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhh Q lcl|Aclame:pro 223 AIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLD 301 (393) Q Consensus 223 ~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~ 301 (393) .+. |.++.+++.|||+... .+.+...|++..++ ++...-..+..+++|+.++.+|+. T Consensus 223 ~~~-~~~d~ai~~G~g~~~~---------------------~~~~~~~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~l~~ 280 (397) T protein:vir:49 223 KVV-VTRNKAILEAIAALPT---------------------KPTLTKWDDIIDLEAKVDPAIKQTSFFLTNTSGFTALKK 280 (397) T ss_pred HHH-HHHHHHHHhhcccccc---------------------ccccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHH Confidence 998 7999999999998432 22334556676666 443334456678889999988886 Q ss_pred hhhccccccceeeecCCCeEEEeeecccchhhc--ccchh------ce--------eeeecccc-ee--cccceeeec-c Q lcl|Aclame:pro 302 ELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKAV------KP--------TVLVDQKY-HI--DMQDLTKVD-A 361 (393) Q Consensus 302 ~~~~~~~~a~~~l~~~~~~~~~~v~~~~~~~~t--G~k~~------~p--------tv~vD~k~-~~--~~~~~~~~~-s 361 (393) |++.+|.+.+...+..-.-.| |-..+ .| .++-|-+. ++ +-+|++-.- . 
T Consensus 281 ------------lkd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~ 348 (397) T protein:vir:49 281 ------------VKNALGDYLMERDVKSPTGYSIDGFAVKEVADRWLANGTGGAMPLYFGDLKQAVTLFDRQHMSLLSTN 348 (397) T ss_pred ------------hhcCCCceeeccCcCCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecceEEEEec Confidence 778888876643222111101 32111 12 22223332 11 233433221 1 Q ss_pred ee---EeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 362 FE---WKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 362 ~~---~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) .. |..|+-.+.++.+..|-+.-|++-++++++ T Consensus 349 ~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~ 383 (397) T protein:vir:49 349 IGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFK 383 (397) T ss_pred cccchhhcCceeEEEEeeeCcEEecccceEEEEee Confidence 11 334445567778889999999988888887 No 24 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=99.68 E-value=5.7e-18 Score=115.13 Aligned_cols=338 Identities=12% Similarity=0.084 Sum_probs=195.6 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcchhHHH--H Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMT--N 78 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~Ek~K~k~emt--E 78 (393) |.+ ++|..++++++++.++++..+++....++.... .+++.++..++.+.++++....++....+.+....... . 
T Consensus 1 Mk~--~~eL~~~~~~~~~~~~~l~~~~~~~~~~~~~~~-ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (397) T protein:vir:49 1 MKT--SNELHDLWIAQGDKVENLNEKLNVAMLDDSVSA-EELQAIKNERDTAKMKRDLFKEQYTEARANEVANMSEEEKK 77 (397) T ss_pred Cch--HHHHHHHHHHHHHHHHHHHHHHHHHHhcchhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Confidence 654 566778889999888888888877655443222 45667777777777666554444433332111100000 0 Q ss_pred H--HHHHHHHHHHHHHHHHccCChhHHHHHHHHHHh--------C-ccchhhhHhhcchhHHHHHHHHHHhhCcccccee Q lcl|Aclame:pro 79 F--IESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAE--------N-GVTITDTTFQLPRKLVESINTALLNTNPVFKVFH 147 (393) Q Consensus 79 f--LkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~e--------k-gV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fh 147 (393) . -..++.. .+..++|...|.. . ..++.+--..+|..+...|-+.++++.++++... T Consensus 78 ~~~~~~~~~~-------------~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~ 144 (397) T protein:vir:49 78 PLTKNEEEVK-------------ANFVKDFKNLVRGRYQNLLDSKTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVN 144 (397) T ss_pred cccchhhHHH-------------HHHHHHHHHHhhcchhhHHHhhhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcc Confidence 0 0000000 1222233332221 1 1112222236899999999999999898877666 Q ss_pred eecccceeE--EEeec-c-c-cccceecccchhhhh-hhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHH Q lcl|Aclame:pro 148 VTNVGALLV--SRSFD-S-S-NEAQVHKDGQTKTEQ-AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELT 221 (393) Q Consensus 148 V~n~~~~a~--~i~l~-n-a-~~a~GHk~ga~Kk~q-~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELa 221 (393) |.+.+...- .+... + . ...|++- |.++++. ..+|...++.|..++.+..+.+.+.+-. 
.-++.+|++++|+ T Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~a~~v~E-~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds--~~~l~~~i~~~l~ 221 (397) T protein:vir:49 145 VENVTTLTGSRVYEKWADITGLAKLDDE-GGQIGQNDDPKLSLIRYAIKRYAGISTVTNSLLADS--AENILAWLSGWIA 221 (397) T ss_pred eeeccCCcceEEEEeeccCCcceeeecc-ccccccccccceeeeEeeeeeeEeehhhHHHHHhhh--hHHHHHHHHHHHH Confidence 666654432 22222 2 2 2345544 4444444 4589999999999998888854443322 2467999999999 Q ss_pred HHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHh Q lcl|Aclame:pro 222 QAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALL 300 (393) Q Consensus 222 q~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~ 300 (393) ..+- |.++.+++.|||+... .+...+.|++..++ ++..--.....+++|+.++.+|+ T Consensus 222 ~~~~-~~~d~ail~G~g~~~~---------------------~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~ 279 (397) T protein:vir:49 222 KKVV-VTRNKAILEAIGTLPN---------------------KPTLAKWDDIIDLQAKVDPAIKQTSLFLTNTSGFTALK 279 (397) T ss_pred HHHH-HHHHHHHHhccccccc---------------------cccccCHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHH Confidence 9998 7999999999998432 22344567777666 33322334457888999988888 Q ss_pred hhhhccccccceeeecCCCeEEEeeeccc----chhhcccchhc------e--------eeeecccc-e-e-cccceeee Q lcl|Aclame:pro 301 DELRQATANANVRIKNDDTEIASEVGVDE----IIVYTGSKAVK------P--------TVLVDQKY-H-I-DMQDLTKV 359 (393) Q Consensus 301 ~~~~~~~~~a~~~l~~~~~~~~~~v~~~~----~~~~tG~k~~~------p--------tv~vD~k~-~-~-~~~~~~~~ 359 (393) . |++.+|.+-+...+.+ ... |...++ | .++.|-+. + + +-+|++-. 
T Consensus 280 ~------------lkd~~g~~l~~~~~~~g~~~~l~--G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~ 345 (397) T protein:vir:49 280 K------------VKNAMGDYLMERDVKSPTGYSID--GFVVKEISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQHLSLL 345 (397) T ss_pred H------------hhccCCceeecccccCCCCceec--ceeeEEecccccccccCCceeEEEeeccceEEEEeecccEEE Confidence 7 7888888765332221 222 332111 2 22333332 1 1 22333222 Q ss_pred cc-e-e--EeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 360 DA-F-E--WKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 360 ~s-~-~--~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) -+ . + |..++-.+.++.+..|.+.-|++-++++++ T Consensus 346 ~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~ 383 (397) T protein:vir:49 346 STNIGGGAFETDTTKVRVIDRFDVVSTDTEAFVPASFK 383 (397) T ss_pred EeccccchhhcCeeeEEEEEeeccEEecccceEEEEec Confidence 11 1 1 334455577788889999999998889887 No 25 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=99.67 E-value=1.2e-17 Score=113.39 Aligned_cols=345 Identities=12% Similarity=0.100 Sum_probs=193.9 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhh--------hhcc Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE--------KPKG 72 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~E--------k~K~ 72 (393) |.+ ++|.++++.+++.....++.+++....+++... .+++.++..+..++.+++..+........ ..+. 
T Consensus 1 Mk~--~~el~~~~~~~~~~i~~~~~~~~~~~~~~~~~~-ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (397) T protein:vir:48 1 MKT--SNELHDLWVAQGDKVENLNEKLNVAMLDDSVTA-EELQAIKNERDTAKMKRDMFKEQYTEARANEVVNMSEEEKK 77 (397) T ss_pred Cch--HHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhccc Confidence 654 455688999999999999999888877665433 56677777777777666543222221111 0000 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhC-ccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecc Q lcl|Aclame:pro 73 KDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAEN-GVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNV 151 (393) Q Consensus 73 k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ek-gV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~ 151 (393) ..+..+.-........|...+.. .....+..+ ..++++--.++|..+...|-+.++++.++++...+.+. T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~ 148 (397) T protein:vir:48 78 PLTKSEEEVKAGFVKDFKNLVRG---------RYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQEYVNVENV 148 (397) T ss_pred cccchhhHHHHHHHHHHHHHHhh---------hhhHHHHHhhccCCccccccccHHHHHHHHHHHHHHHHHHhhhceeec Confidence 00000111111111111111111 011111111 12222333378999999999999999999887666555 Q ss_pred cceeE--EEee-ccc-cccceecccchhhh-hhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 152 GALLV--SRSF-DSS-NEAQVHKDGQTKTE-QAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVN 226 (393) Q Consensus 152 ~~~a~--~i~l-~na-~~a~GHk~ga~Kk~-q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~ 226 (393) +.... .+.. .+. ..+..+.-|..+.+ ...+|...++.|..++.+.++.+.+.+- +.-++.+|++++|+..+. 
T Consensus 149 ~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~d--s~~~l~~~v~~~l~~~~~- 225 (397) T protein:vir:48 149 TTLTGSRVYEKWADITGLAKLDDEAGSIGTNDDPKLYPIRYAIKRYAGISTVTNSLLAD--SAENILAWLSGWIAKKVV- 225 (397) T ss_pred cCCcceEEEEeecCCCcceeeeccccccccccccceeeEEeeheeeeeehhhHHHHHhh--chHHHHHHHHHHHHHHHH- Confidence 54322 2222 222 23444444555544 4579999999999999988885544332 224689999999999998 Q ss_pred HHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhhhhc Q lcl|Aclame:pro 227 KIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQ 305 (393) Q Consensus 227 Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~ 305 (393) +.++.+++.|||+... .+.....|++...+ ++-..-...-.+++|+.++.+|+- T Consensus 226 ~~~d~~il~G~g~~~~---------------------~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~---- 280 (397) T protein:vir:48 226 VTRNKAILEAIATLPT---------------------KPTLTKWDDIIDLQAKVDPAIKQTSFFLTNTSGFTALKK---- 280 (397) T ss_pred HHHHHHHhhccccccc---------------------ccccccHHHHHHHHHHhhhhhcCCCEEEECHHHHHHHHH---- Confidence 7999999999998542 12233455555544 222222334577889888888876 Q ss_pred cccccceeeecCCCeEEEeeecccchhhc--ccchh--------------ceeeeeccccee---cccceeeecce-e-- Q lcl|Aclame:pro 306 ATANANVRIKNDDTEIASEVGVDEIIVYT--GSKAV--------------KPTVLVDQKYHI---DMQDLTKVDAF-E-- 363 (393) Q Consensus 306 ~~~~a~~~l~~~~~~~~~~v~~~~~~~~t--G~k~~--------------~ptv~vD~k~~~---~~~~~~~~~s~-~-- 363 (393) |++.+|.+.+...+.+-+-.| |...+ .+.++-|-+.++ +.+|++-.-+. . 
T Consensus 281 --------lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~ 352 (397) T protein:vir:48 281 --------VKNAFGDYLMERDVKSPTGYSIDGFAVKEVADRWLANASSGAMPLYFGDLKQAVTLFDRQQMSLLSTNIGGG 352 (397) T ss_pred --------hhcCCCceeeccCcCCCCCceeccceeEEecccccCCcCCCceEEEEEeccceEEEEeecceEEEEeccchh Confidence 778888776643322211111 32111 122333433211 33343322221 1 Q ss_pred -EeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 364 -WKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 364 -~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) |..++-.+.+..+..|.+.-|++-+.+++. T Consensus 353 ~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~ 383 (397) T protein:vir:48 353 AFETDTTKIRVIDRFDVVATDTESFVPASFK 383 (397) T ss_pred hhhcCceeEEEEeeeccEEecccceEEEEec Confidence 334444555666678888888777777776 No 26 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=99.66 E-value=2.2e-17 Score=111.86 Aligned_cols=365 Identities=15% Similarity=0.137 Sum_probs=189.9 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcchhHHHHHH Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMTNFI 80 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~Ek~K~k~emtEfL 80 (393) |+| -+.|.+.+++++++.+.++..+.+. +.++..+...++++|++.++.++.. ...++.+.....+......-.... 
T Consensus 1 M~k-~l~el~~~~~~~~~e~~~~~~~~~~-~~ee~~~~~~e~~~l~~~i~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~ 77 (404) T protein:vir:10 1 MSK-ELRELLNQLDSKNKELNSLLNKDGV-TAEELNKTSNEIDILQAKIEAQKRK-ENIENNFNEDNVKSLNTGKEENVI 77 (404) T ss_pred CcH-HHHHHHHHHHHHHHHHHHHHhhcCC-CHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhhhhccccccccchhhH Confidence 998 4667777777777765554332211 1122222223344444443322211 112222222111111110000011 Q ss_pred HHHHHHHHHHHHHHHccCChhHH---HHHHHHHHhCcc-chhhhHhhcchhHHHHHHHHHHhhCccccceeeeccccee- Q lcl|Aclame:pro 81 ESQNAVTEFFDVLKKNSGKSEIK---NAWNAKLAENGV-TITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALL- 155 (393) Q Consensus 81 kTkqA~~dya~ll~~nqg~ke~k---~AW~a~L~ekgV-~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a- 155 (393) ....+ +.+-..++..+..-+ ....+....-+. ++.+--.++|..+...|-+.+.++.++++.+.+.+.+... T Consensus 78 ~~~~~---~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~g 154 (404) T protein:vir:10 78 YNGAL---FVRAIADNLLKQKNQRGLNLSEKEINAISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEPVFTRSG 154 (404) T ss_pred HHHHH---HHHHHHHHHHHHHHhhhhcchhhHHhhhccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceeeccCCcc Confidence 11111 111111111110000 011111111111 1123333689999999999999999998877777665432 Q ss_pred -EEEeecc-ccccceecccchhhhh--hhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 156 -VSRSFDS-SNEAQVHKDGQTKTEQ--AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDL 231 (393) Q Consensus 156 -~~i~l~n-a~~a~GHk~ga~Kk~q--~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~r 231 (393) ..+...+ ...+..+.-|.++..+ +.+|...++.|..++...++.+-+. +.+.-++.+|++++|+.++- +.++. 
T Consensus 155 ~~~~~~~~~~~~~~~v~e~~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell--~ds~~~l~~~i~~~la~~~~-~~~~~ 231 (404) T protein:vir:10 155 SRTYEKRSKQKPMKPLSENQQIPTNGDNGKLERFNFKLKDLADFMSIPNDLL--KFADKSLEDWIINWFVDKVR-ITRNA 231 (404) T ss_pred ceEEEEecCCcceeeccccccccccccccceeeeEeeheeeEeeehhhHHHH--hhcHHHHHHHHHHHHHHHHH-HHHHH Confidence 2222222 2344446666665554 5789999999998888877744332 22224789999999999999 79999 Q ss_pred ceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhceeeecCC--CceEEEecchhhhHhhhhhccccc Q lcl|Aclame:pro 232 ALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAG--RRYLIVKTEDRKALLDELRQATAN 309 (393) Q Consensus 232 Avv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald~a~~~a~--~~~l~i~~~d~~a~~~~~~~~~~~ 309 (393) +++.|||++. ..........+ .+...+.....+++..++...-+.+. +..+++|+.++.+|+- T Consensus 232 ~il~G~g~~~--~~~gi~~~~~~-----~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~-------- 296 (404) T protein:vir:10 232 EILYGAGGDE--HATGIMTANKF-----KKITLPKSPALKDFKKCKNVELLNVFKATSSWIVNQDGFNYLDS-------- 296 (404) T ss_pred HHhhcCCCCC--cccceeecccc-----ceeeccccccHHHHHHHHHhhhhccccCCCEEEEcHHHHHHHHH-------- Confidence 9999999743 11111111111 12334455677888887754433332 3467889999998887 Q ss_pred cceeeecCCCeEEEeeecccchhhc--ccchh-ce------------eeeecccce--e-cccceeeecc----eeEeec Q lcl|Aclame:pro 310 ANVRIKNDDTEIASEVGVDEIIVYT--GSKAV-KP------------TVLVDQKYH--I-DMQDLTKVDA----FEWKTN 367 (393) Q Consensus 310 a~~~l~~~~~~~~~~v~~~~~~~~t--G~k~~-~p------------tv~vD~k~~--~-~~~~~~~~~s----~~~~~n 367 (393) |++++|.+.+...+.+-...| |-..+ .| .++-|-+.+ + +-.|++-.-+ ..|.+| T Consensus 297 ----lkd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~~~~~ 372 (404) T protein:vir:10 297 ----LEDKTGRPYLQPDPKDPTQYRFLGLPVIELPNDLLLSTESAIPVLLGDTKEAYKYVSDGAYELATTNIGAGAFETN 372 (404) T ss_pred ----hhccCCceeeccCcCCCCCccccceeeEEecccccCCCCCccEEEEEeccccEEEEEecceEEEEeccccchhhcC Confidence 778888877654333322211 43221 11 122222211 1 1122222111 113345 Q ss_pred CceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 368 
SNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 368 s~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) +-.|.++.+..|-+.-|++-++++++ T Consensus 373 ~~~~~~~~r~d~~v~~~~a~~~~~~~ 398 (404) T protein:vir:10 373 TTKARIIMRIDGNVKDSEALLIAEIP 398 (404) T ss_pred ceEEEEEEeeccEEecccceEEEEee Confidence 55678888899999999999999988 No 27 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=99.66 E-value=7.7e-17 Score=108.93 Aligned_cols=366 Identities=13% Similarity=0.079 Sum_probs=201.3 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhh-------hhcch Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE-------KPKGK 73 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~E-------k~K~k 73 (393) |... +|.+.+|.++++.....+..+...-.+ +++.+++.++..+.+++++|++.+.++...++ .++.. T Consensus 1 mk~~--~el~~~l~el~~~~~~~~~~~~~~~~~---~~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 75 (415) T protein:vir:94 1 MKTK--EELQSEISDIKRQIDLKVKYATRALNN---DELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSV 75 (415) T ss_pred CChH--HHHHHHHHHHHHHHHHHHHHHHHHhch---hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc Confidence 6643 444556666655544444333322222 22334556666666666666554444433332 11111 Q ss_pred hHHHHHHHHHH-HHHHHHHHHHHccCChhHHHHHHHHHHhC-----c-cchhhhHhhcchhHHHHHHHHHHhhCccccce Q lcl|Aclame:pro 74 DKMTNFIESQN-AVTEFFDVLKKNSGKSEIKNAWNAKLAEN-----G-VTITDTTFQLPRKLVESINTALLNTNPVFKVF 146 (393) Q Consensus 74 ~emtEfLkTkq-A~~dya~ll~~nqg~ke~k~AW~a~L~ek-----g-V~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~f 146 (393) ....+-...+. ....+..-+..........++|...+... + ..+.+--..+|..+...|.+.++++.++++.. 
T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~ 155 (415) T protein:vir:94 76 EVNEASTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYV 155 (415) T ss_pred cccchhhHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhhhhhhhhhccccccccccCcHHHHHHHHHHHHhhhhhhhhc Confidence 11111111111 11111112222233345556665544332 2 22223334799999999999999999998866 Q ss_pred eeecccceeEE--Eeecc-ccccceecccchhhhh-hhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHH Q lcl|Aclame:pro 147 HVTNVGALLVS--RSFDS-SNEAQVHKDGQTKTEQ-AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQ 222 (393) Q Consensus 147 hV~n~~~~a~~--i~l~n-a~~a~GHk~ga~Kk~q-~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq 222 (393) .+.+.+..... +...+ ...+..+.-|.++++. ...|...++.|..++....+.+-+.+ .+.-++.+|++++|+. T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~--ds~~~~~~~i~~~l~~ 233 (415) T protein:vir:94 156 TVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIE--DAKVNVLQELKLWMAR 233 (415) T ss_pred ceeeccCCceeEEEEeecCCccceeccccccccccccccceeeEeeheeeeeechhhHHHHh--hchHHHHHHHHHHHHH Confidence 66666544322 22233 2344455566777654 46899999999999998777443333 2224679999999999 Q ss_pred HHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhh Q lcl|Aclame:pro 223 AIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLD 301 (393) Q Consensus 223 ~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~ 301 (393) .+. ++++.+++.|+|+....... .+... ...+...+.+...|++..++ .+..+-.....+++|+.+..+|+. 
T Consensus 234 ~~~-~~~~~~il~g~g~g~~~~~~-~~~~~-----~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~ 306 (415) T protein:vir:94 234 TIA-ATRNKAIIDVITKGSTGSTS-SGFEK-----EGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDK 306 (415) T ss_pred HHH-HHHHHHHhhccccCcccccc-ccccc-----cccccccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHH Confidence 998 79999999998873321111 11111 11223344456678888888 444444456688999999988876 Q ss_pred hhhccccccceeeecCCCeEEEeeecccchhhc--ccc------------hhceeeeeccc-cee--cccceeeecceeE Q lcl|Aclame:pro 302 ELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSK------------AVKPTVLVDQK-YHI--DMQDLTKVDAFEW 364 (393) Q Consensus 302 ~~~~~~~~a~~~l~~~~~~~~~~v~~~~~~~~t--G~k------------~~~ptv~vD~k-~~~--~~~~~~~~~s~~~ 364 (393) +++.+|.+.+.-.+.+-...+ |.. --+|.++.|-+ +++ +-+|++-..+- - T Consensus 307 ------------lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~-~ 373 (415) T protein:vir:94 307 ------------MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD-Y 373 (415) T ss_pred ------------hhccCCCeeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEec-c Confidence 666666655422211110000 221 11234445533 232 33444322111 1 Q ss_pred eecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 365 KTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 365 ~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) .+++..+.+..+..|.+.-|++-++++++ T Consensus 374 ~~~~~~~r~~~r~d~~~~~~~a~~~~~~~ 402 (415) T protein:vir:94 374 MHFGECLMIAVRQDCRILDYKSAIVIEYD 402 (415) T ss_pred ccCceEEEEEEEeccEEeccccEEEEEEe Confidence 33445577788899999999999999888 No 28 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=99.65 E-value=2e-17 Score=112.14 Aligned_cols=349 Identities=11% Similarity=0.065 Sum_probs=197.4 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhh-----------h Q lcl|Aclame:pro 1 
MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE-----------K 69 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~E-----------k 69 (393) --|.+++|.+.++++++.....+..+++..-.++. ....+++++.+.+.++..++++.++.++...+ . T Consensus 2 ~~~m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (404) T protein:vir:39 2 GVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDN-FSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (404) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc Confidence 23668899999999999998888887765433221 11134445555555555555443333333221 0 Q ss_pred hcchhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeee Q lcl|Aclame:pro 70 PKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVT 149 (393) Q Consensus 70 ~K~k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~ 149 (393) ......+...-...++...| +... .......+.+-.. ..++.+--.++|..+...|-+.++++.++++...+. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~---~~~~---~~~~~~~e~~a~~-~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~ 153 (404) T protein:vir:39 81 PLNKSEYELKDKFVKEFVNM---VRNP---MAFLNTVSSKTET-SGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVE 153 (404) T ss_pred ccccchhhhHHHHHHHHHHH---Hhcc---hhhhhhhhhhhhh-cccccCCceeccHHHHHHHHHHHHhhhhHHhhccee Confidence 01111122222222222222 2111 1112222222111 112222223689999999999999999998877766 Q ss_pred ccccee--EEEeeccc--cccceecccchhhh-hhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHH Q lcl|Aclame:pro 150 NVGALL--VSRSFDSS--NEAQVHKDGQTKTE-QAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAI 224 (393) Q Consensus 150 n~~~~a--~~i~l~na--~~a~GHk~ga~Kk~-q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~f 224 (393) +.+... ..+...+. 
..+.-+--|.++.+ ...+|...++.|..++....+.+.+.+- +.-++.+|++++|+.++ T Consensus 154 ~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~l~~~~ 231 (404) T protein:vir:39 154 SVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKD--TAENILAWLSSWIAKKV 231 (404) T ss_pred eccCCcceEEEEeecCCccceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhh--chHHHHHHHHHHHHHHH Confidence 655432 22222222 23334555667776 5689999999999999887775544432 22568999999999999 Q ss_pred HHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhceeeec--CCCceEEEecchhhhHhhh Q lcl|Aclame:pro 225 VNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPT--AGRRYLIVKTEDRKALLDE 302 (393) Q Consensus 225 I~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald~a~~~--a~~~~l~i~~~d~~a~~~~ 302 (393) . +.++.+++.|||+.. ..+.+...+++..++.-.-+. ..+..+++|+.++.+|+- T Consensus 232 ~-~~~d~~il~g~g~~~---------------------~~~~~~~~~~i~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~- 288 (404) T protein:vir:39 232 V-VTRNQAIIAAMGTVP---------------------KKPTIAKFDDVITMINTSVDPAIIATSSLLTNQSGLNKLAL- 288 (404) T ss_pred H-HHHHHHHHhcccccc---------------------cccccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHH- Confidence 9 799999999999842 223344456666666321111 234578899999988886 Q ss_pred hhccccccceeeecCCCeEEEeeecccchhhc--ccchhc------e--------eeeecccc-ee--cccceee-ecce Q lcl|Aclame:pro 303 LRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKAVK------P--------TVLVDQKY-HI--DMQDLTK-VDAF 362 (393) Q Consensus 303 ~~~~~~~a~~~l~~~~~~~~~~v~~~~~~~~t--G~k~~~------p--------tv~vD~k~-~~--~~~~~~~-~~s~ 362 (393) |++.+|.+-+..++.+-.-.+ |...++ | .++.|-+. ++ +-+|++. 
.+.+ T Consensus 289 -----------lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~ 357 (404) T protein:vir:39 289 -----------VKTAEGKYLLEPDPTKPNSYLIKGKKVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNI 357 (404) T ss_pred -----------hhccCCceeeccCcCCCCcceecceeEEEecccccCccCCCccEEEEEeccccEEEEeecceEEEEecc Confidence 667777665433222111101 322111 2 23333321 11 2234332 1122 Q ss_pred e---EeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 363 E---WKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 363 ~---~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) . |..|+-.+.++....|.+.-|++-+++++. T Consensus 358 ~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~ 391 (404) T protein:vir:39 358 GAGAFETDTTKIRVIDRFDVKTTDSEALVAGSFT 391 (404) T ss_pred chhhhhhceeeEEEEeeeccEEecccceEEEEee Confidence 2 345556677888899999999999888877 No 29 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=99.65 E-value=3e-17 Score=111.20 Aligned_cols=340 Identities=12% Similarity=0.117 Sum_probs=185.5 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHH----hhhhHHHHHHHHHHHHHHHHHHHHHHHHh----hhhhhcc Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAI----EDLPKVQELEKTLSENSIEIIKIENELNA----QEEKPKG 72 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~----~~~skieelektis~l~aEi~k~enel~~----~~Ek~K~ 72 (393) ||. +|.+++++++.+++.++.+++.....+... ....+++.+++.+..+++.....+..... ....+.. 
T Consensus 1 M~~---~eL~~~~~~~~~~~~~l~e~~~~~~~~~~~~~~~~~~ee~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (395) T protein:vir:38 1 MNI---NQLKDAFDMAGQKVQDLEDKRAQFAIDLGNDASSHSVDDINKLNASLKNAKMAQELAKSAYEDARANLNAEPVN 77 (395) T ss_pred CCH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccc Confidence 664 556677778888888888888776654221 11223333333333333332211110000 0000000 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhC---ccch-hhhHhhcchhHHHHHHHHHHhhCccccceee Q lcl|Aclame:pro 73 KDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAEN---GVTI-TDTTFQLPRKLVESINTALLNTNPVFKVFHV 148 (393) Q Consensus 73 k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ek---gV~~-qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV 148 (393) ...+ .. +.... ..+++.+.|...+... +.++ .+--.++|..+..-|-+.++++.++++..++ T Consensus 78 ~~~~-~~---~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~ 143 (395) T protein:vir:38 78 KKPL-PV---KDGKP----------DAQAMKNQFVKDFKNLVTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANV 143 (395) T ss_pred cccc-ch---hhhhH----------HHHHHHHHHHHHHHHHHhhccCccCCCceecchhHhhHHHHHHHhhcchhhhcce Confidence 0000 00 00000 0123444444444332 2222 2222379999999999999999988886665 Q ss_pred ecccce--eEEE-ee-ccccccceecccchhhhh-hhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHH Q lcl|Aclame:pro 149 TNVGAL--LVSR-SF-DSSNEAQVHKDGQTKTEQ-AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQA 223 (393) Q Consensus 149 ~n~~~~--a~~i-~l-~na~~a~GHk~ga~Kk~q-~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~ 223 (393) .+.+.. ...+ .. .....+.-+.-|.++.+. 
..+|...++.|..++.+..+.+.+.+-.+ -++.+|++++|+++ T Consensus 144 ~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~la~~ 221 (395) T protein:vir:38 144 ENVTTSHGSRVYEKLADITPLKDLDDESALIGDNDDPELTVVKYLIHRYAGITTVTNTLLKDTV--DNIIQWLVNWAAKK 221 (395) T ss_pred eeccCCcceEEEEeeccCCccccccccccccccccccceeeEEeeeeeeEeehhhHHHHHhhhH--HHHHHHHHHHHHHH Confidence 444322 2222 22 223344445566676654 57999999999988888777554443222 46899999999999 Q ss_pred HHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhceeeec--CCCceEEEecchhhhHhh Q lcl|Aclame:pro 224 IVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPT--AGRRYLIVKTEDRKALLD 301 (393) Q Consensus 224 fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald~a~~~--a~~~~l~i~~~d~~a~~~ 301 (393) +. |..+.+++.|+|+... .+...+.|++..++...-+. ..+..+++|+.++.+|+. T Consensus 222 ~~-~~~~~~il~g~g~~~~---------------------~~~~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~L~~ 279 (395) T protein:vir:38 222 DV-VTRNAKILEVMGKAPK---------------------KPTISQFDNIKDLENNTLDPAIESTSSFITNQSGYNILSK 279 (395) T ss_pred HH-HHHHHHHhhccccccc---------------------ccccccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHH Confidence 99 7999999999997431 12234456666666322222 244568889999888876 Q ss_pred hhhccccccceeeecCCCeEEEeeecccchhhc--ccchhc-------------eeeeecccc-ee--cccceeeecce- Q lcl|Aclame:pro 302 ELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKAVK-------------PTVLVDQKY-HI--DMQDLTKVDAF- 362 (393) Q Consensus 302 ~~~~~~~~a~~~l~~~~~~~~~~v~~~~~~~~t--G~k~~~-------------ptv~vD~k~-~~--~~~~~~~~~s~- 362 (393) |++.+|.+-+...+.+-.-+| |...++ +.++-|-+. ++ +-+|++---+. 
T Consensus 280 ------------lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~i~~gd~~~~~~i~~~~~~~i~~~~~ 347 (395) T protein:vir:38 280 ------------VKDADGRYLMQPDVTSPDKYLIDGKPVIRIADKWLPDVSGSHPLYFGDLKQGITLFDRQQMQIDTTNV 347 (395) T ss_pred ------------hhccCCceeeccCcCCCCcceeccceeEEecccccCcCCCcceEEEEeccccEEEEEecceEEEEecc Confidence 777777766533322211111 332211 122333221 21 22333221111 Q ss_pred ---eEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 363 ---EWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 363 ---~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) .|..|+-.+.++.+..|-+.-|++-+++++. T Consensus 348 ~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~ 381 (395) T protein:vir:38 348 GAGSFEHDTTKLRFIDRFDVQLIDDGAFAAASFK 381 (395) T ss_pred ccchhhcCceEEEEEEeeccEEecccceEEEEee Confidence 1445555666777788999999999998887 No 30 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=99.65 E-value=1.4e-16 Score=107.54 Aligned_cols=364 Identities=11% Similarity=0.085 Sum_probs=191.5 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHH-------Hhhhh--hhc Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENEL-------NAQEE--KPK 71 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel-------~~~~E--k~K 71 (393) ||.-. .+.+.+++...+.++..+.+.-...+ ++..+++++++.++.++.+|...+... 
....+ +++ T Consensus 1 M~l~e---L~e~r~~l~~e~~~l~~k~~~~~~t~--e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~ 75 (409) T protein:vir:45 1 MKLHE---LKQKRNTIATDMRALNEKIGDNAWTE--EQRTEWNKAKSELEALDERIAREEELRRQDQAYIESNEEEQRQN 75 (409) T ss_pred CCHHH---HHHHHHHHHHHHHHHHHHhhcCCCCH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhccc Confidence 88554 44555566666666666554322211 122456666666666666664322211 11111 010 Q ss_pred --chhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeee Q lcl|Aclame:pro 72 --GKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVT 149 (393) Q Consensus 72 --~k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~ 149 (393) ......+--+.+++...|.+- +......+.+++..+.-........+--.++|..+...|-+.+.++.++++..++. T Consensus 76 ~~~~~~~~~~~~~~~a~~~~l~~-~~~~~~~~e~~~~~~~~a~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~ 154 (409) T protein:vir:45 76 LDPENNSQQDEKRAQVFDKWMRH-GASELTSEERKALRELRAQGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQIL 154 (409) T ss_pred CCCCCcchhhHHHHHHHHHHHHh-hhhhccHHHHHHHHHHhhccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhceee Confidence 011111111222333333111 12223345666655432222222222223799999999999999999998877766 Q ss_pred cccceeEEEee--cc-ccccceecccchhhhhhhhhhhhhccHHHHHH-HHHHH-HHHHhhcCchhHHHHHHHHHHHHHH Q lcl|Aclame:pro 150 NVGALLVSRSF--DS-SNEAQVHKDGQTKTEQAATLTIDTLEPVMVYK-LQSLA-ERVKRLQMSYSELYNLIVAELTQAI 224 (393) Q Consensus 150 n~~~~a~~i~l--~n-a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYk-kq~La-d~~k~l~g~ygalvnyvm~ELaq~f 224 (393) +......+.-+ .. ...+.-+--|.++.++...|...++.|..+|- .-.+. 
+++.+ +.-++.+|++++|+.++ T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~f~~~~l~~~k~~~~~i~is~ell~d---s~~~l~~~i~~~la~a~ 231 (409) T protein:vir:45 155 TTSDGRTMEWATADGTSEVGVLLGENEEAGEEDTDFGMGSLGALKMTSKIIRVSNELLQD---SAIDMEAYLARRIAERI 231 (409) T ss_pred ecCCCceEEEEeeccCccccccccccccccccccccceeeeeeeeeeeeehhhhHHHHhc---cHHHHHHHHHHHHHHHH Confidence 65443322222 22 34555566778899999999999999876653 23342 33333 22478999999999999 Q ss_pred HHHHHhcceeeccCCCccccch-hhhhhhhhhhhhhhhhccCCCCCHHHHHhhhceeee---cCCCceEEEecchhhhHh Q lcl|Aclame:pro 225 VNKIVDLALVEGDGTNGFKSID-KEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRP---TAGRRYLIVKTEDRKALL 300 (393) Q Consensus 225 I~Rav~rAvv~gDG~~~t~~~~-~e~D~~~ik~it~~at~~~~t~~~dal~Eald~a~~---~a~~~~l~i~~~d~~a~~ 300 (393) . +.++.++++|||+.....+- +-..+.. ...+..+++...|++...+.-..+ ..+.-++++++.+..+|+ T Consensus 232 ~-~~~~~a~l~G~G~~~~~~p~Gil~~~~~-----~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a~~~~~~n~~~~~~l~ 305 (409) T protein:vir:45 232 G-RGEARYLIQGTGAGTPKQPKGLAASVTG-----TTQTAAANAVKWQEILALKHSIDPAYRRGPKFRLAFNDNTLKLIS 305 (409) T ss_pred H-HHHHHHhhccCCCCCccccceeeecccc-----ccccccccccchHHHHHHHHhhhhhhccCCeEEEEECHHHHHHHH Confidence 9 79999999999985321111 1111110 112333445566777766532211 222234566888887777 Q ss_pred hhhhccccccceeeecCCCeEEEeeecc----cchhhcccchhc-----------eeee-eccc-ceecccceeeec-ce Q lcl|Aclame:pro 301 DELRQATANANVRIKNDDTEIASEVGVD----EIIVYTGSKAVK-----------PTVL-VDQK-YHIDMQDLTKVD-AF 362 (393) Q Consensus 301 ~~~~~~~~~a~~~l~~~~~~~~~~v~~~----~~~~~tG~k~~~-----------ptv~-vD~k-~~~~~~~~~~~~-s~ 362 (393) . |++++|.+-+.-.+. ...+ |...+. ++++ -|-+ |.+...+-.++. +. 
T Consensus 306 ~------------lkd~~G~~i~~~~~~~~~~~~l~--G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~i~~~~~~~~~~~~ 371 (409) T protein:vir:45 306 E------------MEDGQGRPLWLPDIVGVAPASVL--NVPYVIDQEIDDIGAGKKFMFCGDFDRFIIRRVRYMILKRLV 371 (409) T ss_pred H------------hhcCCCceeeccCcCCCCCceec--ceeeEEecCcCCccCCccEEEEeehhhhheeeccceEEEEee Confidence 6 777777765432111 1111 321111 1122 2432 222222222121 11 Q ss_pred eEeecCc--eEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 363 EWKTNSN--MILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 363 ~~~~ns~--~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) ..-+..+ .|.+..+..|.+.-|++-.++++. T Consensus 372 d~~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~k 404 (409) T protein:vir:45 372 ERYAEYDQTGFLAFHRFDCILEDTSAIKALVGK 404 (409) T ss_pred cccccCCcEEEEEEEEeccEeechhheEEEEec Confidence 2223334 466777889999999988888886 No 31 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=99.65 E-value=1.2e-16 Score=107.95 Aligned_cols=350 Identities=13% Similarity=0.063 Sum_probs=184.0 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhh--------------hH-HhhhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVK--------------NA-IEDLPKVQELEKTLSENSIEIIKIENELNA 65 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~--------------~a-~~~~skieelektis~l~aEi~k~enel~~ 65 (393) |.+-=+ ..+.++.++++++.++.++.+.++-+ .. .+.-.+++++++.+.+++.+|.+.+.+... 
T Consensus 1 m~~k~~-~l~~~~~el~~~l~eL~e~~~~l~~~~~el~~~~ee~~~~e~~~~~~~~~~~l~~~i~~l~~~i~~~~~~~~~ 79 (397) T protein:vir:96 1 MALKQL-ILNKQIKERSSEIDKLLSQRSDLEKQENDLERALEEAKTDEEISTVSDSADDLEKQVKDLDEKIAELQKEKQD 79 (397) T ss_pred CcHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 544321 12233444444444444443333321 10 011234555555555555555443333333 Q ss_pred hhhhhc---chhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhC------ccchhhhHhhcchhHHHHHHHHH Q lcl|Aclame:pro 66 QEEKPK---GKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAEN------GVTITDTTFQLPRKLVESINTAL 136 (393) Q Consensus 66 ~~Ek~K---~k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ek------gV~~qd~~eiLP~~ii~AIe~A~ 136 (393) +.+..+ ........-..+.+...+. .......+.+.+....+... +....+-...+|..+...|.+ + T Consensus 80 l~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~-~ 155 (397) T protein:vir:96 80 LEDELAKAADPTDQKPKDGEKRKMKKFK---VTEEELAEKRSAINAFVKSKGAEKRDGFTSVEGGALIPQELLQPQLE-P 155 (397) T ss_pred HHHHHHhhhhhhhhhhHHHHHHHHHHHh---hhhHHHHHHHHHHHHHHHhhhhhhhhcccccccccchhHHHHHHHHH-h Confidence 221100 0000001111111111111 11111112333333333322 333334444788888888877 4 Q ss_pred HhhCccccceeeecccceeEEEee-c--cccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHH Q lcl|Aclame:pro 137 LNTNPVFKVFHVTNVGALLVSRSF-D--SSNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELY 213 (393) Q Consensus 137 ed~d~vl~~fhV~n~~~~a~~i~l-~--na~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalv 213 (393) .++..+++...+.+.+.....+.. + +...+|..-.+........+|...++.|..++....+...+. +.+.-++. 
T Consensus 156 ~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~~~~~~~~~s~ell--~ds~~~l~ 233 (397) T protein:vir:96 156 KDIVDLSKYVRSVPVNSASGKFPVISKSGSKMATVQQLEKNPQLANPKMVEIDYSVATRRGYIPISQEMI--DDASYDVT 233 (397) T ss_pred hhhhhHHHhhhhccccccceeEEEEeccCCccccccccccccccccccccceeecHhHhhcchhhHHHHH--hhhHHHHH Confidence 556778776666555443332222 2 233444433333333467899999999999998877744332 23334689 Q ss_pred HHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhceeeecCCCceEEEec Q lcl|Aclame:pro 214 NLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKT 293 (393) Q Consensus 214 nyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald~a~~~a~~~~l~i~~ 293 (393) +|+.++|+..+. ++.+.++..|+|... ..++...|++..++....+.+.+..+++|. T Consensus 234 ~~i~~~l~~~~~-~~~~~~i~~g~g~~~----------------------~~~~~~~d~~~~~~~~~~~~~~~a~~v~n~ 290 (397) T protein:vir:96 234 GLIADEIQDQSL-NTKNADIAAVLKTAT----------------------AKSVVGVDGLKDLINKEIKKVYDVKLFISA 290 (397) T ss_pred HHHHHHHHHHHH-HHHHHHHhhcccccc----------------------cccccchHHHHHHHHHhhhhhcCcEEEEcH Confidence 999999999999 799999999998632 122445677777776556666778899999 Q ss_pred chhhhHhhhhhccccccceeeecCCCeEEEeeecccchhhc--ccchhc-------------eeeeecccc-ee--cccc Q lcl|Aclame:pro 294 EDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKAVK-------------PTVLVDQKY-HI--DMQD 355 (393) Q Consensus 294 ~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~v~~~~~~~~t--G~k~~~-------------ptv~vD~k~-~~--~~~~ 355 (393) .++.+|+. |++.+|.|.+.-.+.+-.-+| |...++ +.++-|-+. 
|+ +.+| T Consensus 291 ~~~~~l~~------------lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~ 358 (397) T protein:vir:96 291 SMYSELDK------------LKDKNGRYLLQDSITAASGKQLLGKEVVVLDDDVIGKSVGNVVGFIGDAKAFASFFDRKQ 358 (397) T ss_pred HHHHHHHH------------hhccCCCeEeccCccCCCcccccccceEEecccccCCCCCceEEEEeehhcceEeEeecc Confidence 99999987 788888877632222111111 322221 112224342 21 3334 Q ss_pred eeeecceeEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 356 LTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 356 ~~~~~s~~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) ++...+-...+ +..|.+.....|-+-.|++.++++|. T Consensus 359 ~~~~~~~~~~~-~~~~~~~~r~d~~~~~~~a~~~~~~~ 395 (397) T protein:vir:96 359 VSVSWVDNNIY-GQLLAGIIRYDVKATDKKAGFYVTFT 395 (397) T ss_pred eEEEEeccccc-ceeEEEEEEEccEEecccceEEEEee Confidence 33332222222 34466777889999999999999888 No 32 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=99.65 E-value=2.4e-16 Score=106.23 Aligned_cols=355 Identities=12% Similarity=0.066 Sum_probs=195.9 Q ss_pred CCcch-hhHHHHHHHHHHHhhHHHHhhhhhhh----hhh----HHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhh---- Q lcl|Aclame:pro 1 MNKPD-LIEKQNRLAELKENNVSLKSQISGFE----VKN----AIEDLPKVQELEKTLSENSIEIIKIENELNAQE---- 67 (393) Q Consensus 1 ~~k~d-~~ekq~eLa~lK~~~~~~~s~i~~~~----v~~----a~~~~skieelektis~l~aEi~k~enel~~~~---- 67 (393) ||--. +.+.+++|.++++....++..+++.. .+. ..+...++++++..++++...+...+....... 
T Consensus 1 ~~l~e~i~e~~~~l~el~~~~~~~~~e~r~~~e~~~~~~~~~~~~e~~~~~~~l~~ei~~l~e~~~~~~~~~~~~~~~~~ 80 (400) T protein:vir:38 1 MTLDEKLAAVKKQLDEKRSALPAMKTELRSLLEGEDSEENLKKAEGVRAKYDKAGKEIKDLEEKRDLYEAALKGNEQSSG 80 (400) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 66433 34455666666666665555554321 111 111134455555555555555433222221111 Q ss_pred ------hhhcchhHHHHHHHHHHHHHHHHHHHHHccCC-hhHHHHHHH--HHHhCccchhhhHhhcchhHHHHHHHHHHh Q lcl|Aclame:pro 68 ------EKPKGKDKMTNFIESQNAVTEFFDVLKKNSGK-SEIKNAWNA--KLAENGVTITDTTFQLPRKLVESINTALLN 138 (393) Q Consensus 68 ------Ek~K~k~emtEfLkTkqA~~dya~ll~~nqg~-ke~k~AW~a--~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed 138 (393) +....+..+..+.+....-.+-........+. ........+ .....|+...+-..++|..+...|...+++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~ 160 (400) T protein:vir:38 81 KKPDHPEEHSYRDALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNAGVKAADAASTIPETISNTPQRELQT 160 (400) T ss_pred ccccchhhhhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhcccccCCcccccHHHHHHHHHHHHh Confidence 11122223333444333332222222111111 011111111 122335545554457999999999999999 Q ss_pred hCccccceeeecccceeEEEee-c-cccccceecccchhhh-hhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHH Q lcl|Aclame:pro 139 TNPVFKVFHVTNVGALLVSRSF-D-SSNEAQVHKDGQTKTE-QAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNL 215 (393) Q Consensus 139 ~d~vl~~fhV~n~~~~a~~i~l-~-na~~a~GHk~ga~Kk~-q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvny 215 (393) +..+++.+.+.+.+.....+-. . 
....+..+.-|.++++ +...|...++.|..++.+..+.+.+.+-.+ -++.+| T Consensus 161 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~--~~~~~~ 238 (400) T protein:vir:38 161 VVDLKPFTNVFQASTQKGTYPTVANATTKMVTVAELEKNPAMAKPEFKPVNWSVETYRQALPVSQESIDDSA--IDLVGL 238 (400) T ss_pred hhhhhhcceeEeccCcceEEEEEecCCCccccccccccccccccccceeeEeehhheeeehhhHHHHHhhhH--HHHHHH Confidence 9999887776655544332222 2 2333334445555554 678999999999999998888554443222 467999 Q ss_pred HHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhceeeecCCCceEEEecch Q lcl|Aclame:pro 216 IVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKTED 295 (393) Q Consensus 216 vm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald~a~~~a~~~~l~i~~~d 295 (393) ++++|+..+. ++.+.++..|+|.... ++....+++..++....+.+.+..+++|+.+ T Consensus 239 i~~~l~~~~~-~~~~~~i~~~~~~~~~----------------------~~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~ 295 (400) T protein:vir:38 239 IAQNGQQIKV-NTTNGAVATLLKGFTA----------------------KTISSVDDLKHINNVDLDPAYSRVIIASQSF 295 (400) T ss_pred HHHHHHHHHH-HHHHHhhhhccccccc----------------------cccccHHHHHHHHHhhhhhhhCcEEEEcHHH Confidence 9999999999 6999999999886321 1233345566666544455667788999999 Q ss_pred hhhHhhhhhccccccceeeecCCCeEEEeeecccchhhc--ccchhc------------eeeeecccc-ee--cccceee Q lcl|Aclame:pro 296 RKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKAVK------------PTVLVDQKY-HI--DMQDLTK 358 (393) Q Consensus 296 ~~a~~~~~~~~~~~a~~~l~~~~~~~~~~v~~~~~~~~t--G~k~~~------------ptv~vD~k~-~~--~~~~~~~ 358 (393) +.+|+. +++.+|.|.+...+.+-.-.| |...++ +.++-|-.. 
++ +.+|++- T Consensus 296 ~~~l~~------------lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~g~~~~~~gd~s~~~~~~~~~~~~~ 363 (400) T protein:vir:38 296 YNFLDT------------VKDGNGRYLLQDSILTPSGKSVLGMPIAVVSDDTLGAAGEAHAFLGDIKRAILFANRADFMV 363 (400) T ss_pred HHHHHH------------hhccCCCeeeecCcCCCCccccccceeEEecccccCCCCceEEEEEeccccEEEEeecceEE Confidence 988887 788888877643332211111 322221 112223221 11 2334332 Q ss_pred ecceeEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 359 VDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 359 ~~s~~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) -.+.. .+++..+.+..+..|.+--+++-++++++ T Consensus 364 ~~~~~-~~~~~~~~~~~r~d~~~~~~~a~~~l~~~ 397 (400) T protein:vir:38 364 RWVDD-QIYGQFLQAGMRFGVSVADEKAGYFLTYT 397 (400) T ss_pred EEecc-cccceeEEEEEEeccEEecccceEEEEee Confidence 22222 23344677788889999999999999998 No 33 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=99.64 E-value=2.3e-16 Score=106.33 Aligned_cols=352 Identities=12% Similarity=0.033 Sum_probs=190.9 Q ss_pred CCcc-hhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHh----hhhh---hcc Q lcl|Aclame:pro 1 MNKP-DLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNA----QEEK---PKG 72 (393) Q Consensus 1 ~~k~-d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~----~~Ek---~K~ 72 (393) |+.. .+.|..+++++++..+.++. ..-.++++.+...+++.+...+.++....+....++.. ..+. ... 
T Consensus 3 ~~m~k~l~el~~~~~~~~~~~~~~~---~~~~~ee~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (397) T protein:vir:12 3 MQMSKKEIALRQQFTEKKQQADKAL---QEGNTDEARALLDEVKQLKNQIELMTEGRSLDVPDLPGGVNFVPEQERNPEG 79 (397) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHh---hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhcc Confidence 3322 45666666666665554432 22222222222233333333333322222111111111 0100 000 Q ss_pred hh--HHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCcc---chhhhHhhcchhHHHHHHHHHHhhCcccccee Q lcl|Aclame:pro 73 KD--KMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGV---TITDTTFQLPRKLVESINTALLNTNPVFKVFH 147 (393) Q Consensus 73 k~--emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV---~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fh 147 (393) .. .....-.+..-...|++-+... ...+..+.|...+..+.. ..++--.++|..+...|-+.+.++.++++... T Consensus 80 ~~~~~~~~~~~~~~~~~a~~~~~~~~-~~~~~~~~~~~~~~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~~~~~l~~~~~ 158 (397) T protein:vir:12 80 QRSQGQGNEERQQQYSKAFLKGLRGK-RLTDEERDLLDSPEFRAMSGINDEDGGILIPEDIGRQIHEFKRQFEPLEQYVT 158 (397) T ss_pred cccccchhhHHHHHHHHHHHHHHhcc-CCcHHHHHHHhhhhhhhccccccccCcccCchhHHHHHHHhhhhhhhHHhhcc Confidence 00 0000111111112222223322 223556666666555533 22333347899999999999999999988666 Q ss_pred eecccc--eeEEEeecc-ccccceecccchhhhh-hhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHH Q lcl|Aclame:pro 148 VTNVGA--LLVSRSFDS-SNEAQVHKDGQTKTEQ-AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQA 223 (393) Q Consensus 148 V~n~~~--~a~~i~l~n-a~~a~GHk~ga~Kk~q-~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~ 223 (393) +.+.+. +...+..++ ...+..|.-|.++++. ..+|...++.|..++....+.+.+.+-.+ -++.+|++++|+.. 
T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~is~e~l~ds~--~~l~~~i~~~l~~~ 236 (397) T protein:vir:12 159 VEPVTTRSGTRLLEKNADMVPFSPVEELGNLPEIDQPRFTKVSYSIIDYGGIMTLSNSMLNDSD--QAIMTYVAKWFAKK 236 (397) T ss_pred eeeccCCceeEEEEEecCCcceeeecccccccccccccceeEEeeheeeEeeehhhHHHHhhch--HHHHHHHHHHHHHH Confidence 666553 333444444 3567777888888765 57999999999999888777554443222 46899999999999 Q ss_pred HHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhh--ceeeecCCCceEEEecchhhhHhh Q lcl|Aclame:pro 224 IVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAV--DFVRPTAGRRYLIVKTEDRKALLD 301 (393) Q Consensus 224 fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eal--d~a~~~a~~~~l~i~~~d~~a~~~ 301 (393) +- |+++.+++.|||..-. .+....+++..++ .+......+..+++++.++.+|+. T Consensus 237 ~~-~~~d~~il~G~g~~~~----------------------~g~~~~~~i~~~~~~~l~~~~~~~a~~~~n~~~~~~L~~ 293 (397) T protein:vir:12 237 SV-VTRNNLILAAIASLKK----------------------VDIDGLDGIKKALNVTLDPMVAPGSIVLTNQDGYDWLDT 293 (397) T ss_pred HH-HHHHHHHHhccccccc----------------------cccccHHHHHHHHhhccchhhhCCCEEEEcHHHHHHHHH Confidence 98 7999999999997421 1123345666665 333444556678899999888876 Q ss_pred hhhccccccceeeecCCCeEEEeeecccchhhc--ccchhc-------------eeeeeccc-cee--cccceeee--cc Q lcl|Aclame:pro 302 ELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKAVK-------------PTVLVDQK-YHI--DMQDLTKV--DA 361 (393) Q Consensus 302 ~~~~~~~~a~~~l~~~~~~~~~~v~~~~~~~~t--G~k~~~-------------ptv~vD~k-~~~--~~~~~~~~--~s 361 (393) |++.+|.+.+.-.+.+-.-.| |...++ |.++-|-+ +++ +.+|++-. +. 
T Consensus 294 ------------lkd~~G~~l~~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~ 361 (397) T protein:vir:12 294 ------------LKDGTGRYLLQPDPTNPTKKLLDGRPVVPFTNRVLKTQKGKAPLIIGNLKEAIVLFDREQQSIASTDT 361 (397) T ss_pred ------------hhccCCceeecccccCCCCccccceeeEEecccccccCCCccEEEEEehhceEEEEeecceEEEEecc Confidence 777888776533222211111 332211 12222322 121 22232211 11 Q ss_pred --eeEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 362 --FEWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 362 --~~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) ..|..|+-.+.++.+..|.+--|++-++++++ T Consensus 362 ~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~t 395 (397) T protein:vir:12 362 GAGAFETNSTKVRGIEREDVRKWDEDAVVFGQIT 395 (397) T ss_pred ccchhhcCceEEEEEEeeccEEecccceEEEEEe Confidence 11344555666777788888888888889998 No 34 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=99.64 E-value=6.3e-17 Score=109.42 Aligned_cols=363 Identities=15% Similarity=0.079 Sum_probs=188.9 Q ss_pred CCcch--hhHHHHHHHHHHHhhHHHHhhhhhhhhhhH------HhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcc Q lcl|Aclame:pro 1 MNKPD--LIEKQNRLAELKENNVSLKSQISGFEVKNA------IEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKG 72 (393) Q Consensus 1 ~~k~d--~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a------~~~~skieelektis~l~aEi~k~enel~~~~Ek~K~ 72 (393) |...+ ++|.++++.+++. +.++..+..+-+.. .++-.++++++..+.++++++...+....... +... 
T Consensus 1 m~~~~k~l~el~~~~~~~~~---~~~~~~e~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 76 (395) T protein:vir:43 1 MSDFEKQIGELNASLKQVGD---QIKSQAEQVNTQIANFGEMNKETRAKVDELLTAQGELQARLSAAEQAMLANE-KRDG 76 (395) T ss_pred ChhHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-cccc Confidence 54222 3333333333333 23333332211110 01113334444444444444433322222211 1111 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHh--hcchhHHHHHHHHHHhhCccccceeeec Q lcl|Aclame:pro 73 KDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTF--QLPRKLVESINTALLNTNPVFKVFHVTN 150 (393) Q Consensus 73 k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~e--iLP~~ii~AIe~A~ed~d~vl~~fhV~n 150 (393) ...............+|.+.+. ...+..+...+..+...+++.+. ++|..+...|-+.++++.++++.+.+.+ T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~ 151 (395) T protein:vir:43 77 GEEAPKTAGQMVAESLKEQGVT-----SSLRGSHRVSMPRSAITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGT 151 (395) T ss_pred ccchhhhHHHHHHHHHHHHHHH-----HHhhhhhhhhhhhhhhcccCCCCccccchhhHHHHHHHHHhhhhHHhhcccee Confidence 1122111112122222211111 12233333334444444444333 6888888889999999999988666655 Q ss_pred ccceeEEEeecc--ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 151 VGALLVSRSFDS--SNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKI 228 (393) Q Consensus 151 ~~~~a~~i~l~n--a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Ra 228 (393) .+.....+-..+ ...+..+.-|.++.+...+|...++.|..++....+.+.+.+ .+ +++.+|++++|+.++. +. 
T Consensus 152 ~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~--d~-~~l~~~v~~~la~a~~-~~ 227 (395) T protein:vir:43 152 TESNSVEYVRETGFVNNAAPVSEGTQKPYSDLTFELENAPVRTIAHLFKASRQILD--DA-SALQSYIDARARYGLM-LV 227 (395) T ss_pred cCCCceEEEEEecCCCceeeecCCccccccccceeEEEEeeeeEEEeehhhHHHHH--hH-HHHHHHHHHHHHHHHH-HH Confidence 554433333332 234444566889999999999999999999988778555433 22 4689999999999998 79 Q ss_pred HhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhhhhccc Q lcl|Aclame:pro 229 VDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQAT 307 (393) Q Consensus 229 v~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~ 307 (393) ++.++++|||+... ..-+...... ...+ .++..+.....|.+..++ ++.........+++|+.+..+|+. T Consensus 228 ~d~~~l~G~g~~~~-~~Gi~~~~~~-~~~~-~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~------ 298 (395) T protein:vir:43 228 EECQLLYGNGTGAN-LHGIIPQAQA-YAPP-SGVVVTAEQRIDRIRLAILQAQLAEFPASGIVLNPIDWALIEL------ 298 (395) T ss_pred HHHHHHhccCCCCc-cccccccccc-cccc-cccccccchhHHHHHHHHHhhccccCCCcEEEEcHHHHHHHHH------ Confidence 99999999998431 0011111110 0000 011122223466777776 444444455678999999888876 Q ss_pred cccceeeecCCCeEEEeeecccc--hhhcccchhcee--------eeecccc--ee-cccceeeecce----eEeecCce Q lcl|Aclame:pro 308 ANANVRIKNDDTEIASEVGVDEI--IVYTGSKAVKPT--------VLVDQKY--HI-DMQDLTKVDAF----EWKTNSNM 370 (393) Q Consensus 308 ~~a~~~l~~~~~~~~~~v~~~~~--~~~tG~k~~~pt--------v~vD~k~--~~-~~~~~~~~~s~----~~~~ns~~ 370 (393) +++.+|.+.++-..+.. ++ -|- .|+.+ ++.|-+. .+ +-.|++-.-+. .|..|+-. 
T Consensus 299 ------lkd~~G~~i~~~~~~~~~~~l-~G~-pVv~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~ 370 (395) T protein:vir:43 299 ------NKDAENRYIIGSPQNGTTPTL-WRL-PVVETQAITQDEFLTGAFSLGAQIFDRMDIEVLVSTENDKDFENNMVT 370 (395) T ss_pred ------hhccCCceeccccccCCCcee-cce-eeEEcCCCCCCcEEEEeccceEEEEEecceEEEEeccccchhhcCcEE Confidence 66777776654221111 11 032 22222 2233322 11 22233222111 23344556 Q ss_pred EEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 371 ILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 371 i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) +.++.+..|.+--|++-++++|+ T Consensus 371 ~r~~~r~d~~v~~~~a~~~~~~t 393 (395) T protein:vir:43 371 IRAEERLAFAVYRPEAFVTGSLT 393 (395) T ss_pred EEEEEeeccEEecccceEEEEec Confidence 67778899999999999999998 No 35 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=99.63 E-value=5.7e-17 Score=109.63 Aligned_cols=352 Identities=11% Similarity=0.067 Sum_probs=190.9 Q ss_pred CC-cchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhh------hhcch Q lcl|Aclame:pro 1 MN-KPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE------KPKGK 73 (393) Q Consensus 1 ~~-k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~E------k~K~k 73 (393) |+ |.++.|.++++.+++..+..+..++... 
.++......++++++..++++.+++++.+.++....+ ++..+ T Consensus 1 m~~~m~l~el~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (408) T protein:vir:10 1 MGVKLTVNQLNEAWIASGDKVTDFNDQINMA-LNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEK 79 (408) T ss_pred CCccccHHHHHHHHHHHHHHHHHHHHHHHHH-hhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 55 3467889999999999888888777542 2222222234445555555554444433333322211 00000 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHcc-CChhHHHHHHHHHHhCcc-chhhhHhhcchhHHHHHHHHHHhhCccccceeeecc Q lcl|Aclame:pro 74 DKMTNFIESQNAVTEFFDVLKKNS-GKSEIKNAWNAKLAENGV-TITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNV 151 (393) Q Consensus 74 ~emtEfLkTkqA~~dya~ll~~nq-g~ke~k~AW~a~L~ekgV-~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~ 151 (393) .. .-...+.....|.+-++..- +..... .....+.... ...|--..+|..+..-|-..++++.++++...+.+. T Consensus 80 ~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 155 (408) T protein:vir:10 80 GP--LNKSENELKDKFVKDFVNMVRNPMAFM--NTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESV 155 (408) T ss_pred cc--cccchhhhHHHHHHHHHHHhhcchhhh--hhhhhhhhhcccccCCceeccHhHHHHHHHHHHhhchhhhhcceeec Confidence 00 00111111122211111110 111111 1111111111 112222368999988899999999999887666665 Q ss_pred cceeE--EEeeccc--cccceecccchhhhh-hhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 152 GALLV--SRSFDSS--NEAQVHKDGQTKTEQ-AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVN 226 (393) Q Consensus 152 ~~~a~--~i~l~na--~~a~GHk~ga~Kk~q-~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~ 226 (393) +.... .+...+. ..+..+--|+++.+. ..+|...++.+..++....+-+.+.+ .+.-++.+|++++|+..+. 
T Consensus 156 ~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~l~~~~~- 232 (408) T protein:vir:10 156 STSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLK--DTAENILAWLSSWIAKKVV- 232 (408) T ss_pred cCCcceEEEeeccccccceeeecCccccccccCcceeeEEeeeeeEEeeehhHHHHHh--hchHHHHHHHHHHHHHHHH- Confidence 43322 2222322 223233345666664 46899999999998888777554433 2335789999999999999 Q ss_pred HHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhce-eeec-CCCceEEEecchhhhHhhhhh Q lcl|Aclame:pro 227 KIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDF-VRPT-AGRRYLIVKTEDRKALLDELR 304 (393) Q Consensus 227 Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald~-a~~~-a~~~~l~i~~~d~~a~~~~~~ 304 (393) +.++.+++.|||+.. ..+.....|++..++.. ..+. ..+..+++|+.++.+|+. T Consensus 233 ~~~~~~il~g~g~~~---------------------~~~~~~~~~~l~~~~~~~~~~~~~~~a~~v~n~~~~~~l~~--- 288 (408) T protein:vir:10 233 VTRNQAIIEVMKAAP---------------------KKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLAL--- 288 (408) T ss_pred HHHHHHHhhcccccc---------------------cccccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHH--- Confidence 799999999999732 12223346677666621 1111 223467889999988887 Q ss_pred ccccccceeeecCCCeEEEeeecccchhhc--ccchhc------ee--------eeecccc-ee--cccceeeeccee-- Q lcl|Aclame:pro 305 QATANANVRIKNDDTEIASEVGVDEIIVYT--GSKAVK------PT--------VLVDQKY-HI--DMQDLTKVDAFE-- 363 (393) Q Consensus 305 ~~~~~a~~~l~~~~~~~~~~v~~~~~~~~t--G~k~~~------pt--------v~vD~k~-~~--~~~~~~~~~s~~-- 363 (393) +++.+|.+.+.-++.+-.-+| |...++ |. ++-|-+. |+ +-+|++..-+.. 
T Consensus 289 ---------lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~ 359 (408) T protein:vir:10 289 ---------VKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA 359 (408) T ss_pred ---------hhccCCceEeccCcCCCCCceecceeeEEecccccCccCCCceEEEEEehhccEEEEEecceEEEEccccc Confidence 778888776543322211111 332221 22 2224331 11 223333222111 Q ss_pred --EeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 364 --WKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 364 --~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) |.+|+-.+.++.+..|-+--|++-++++++ T Consensus 360 ~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~ 391 (408) T protein:vir:10 360 GAFETDTTKIRVIDRFDVKATDSEALVAGSFS 391 (408) T ss_pred chhhcCceEEEEEEeeccEEeccccEEEEEee Confidence 344556677777788999889888888877 No 36 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=99.63 E-value=8.4e-17 Score=108.71 Aligned_cols=340 Identities=12% Similarity=0.078 Sum_probs=192.7 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHH--HH-----HHhhhhhhcch Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIE--NE-----LNAQEEKPKGK 73 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~e--ne-----l~~~~Ek~K~k 73 (393) |+|- |.|++++|+++++.+.++-.. +++.+++.+...+..|+++|+..+ .+ .....+..+.. 
T Consensus 1 M~k~-l~el~~~~~~~~~e~~~~~~~----------~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 69 (392) T protein:vir:10 1 MSKE-LRELLAKLEGKKEEVRSLMGE----------DKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRN 69 (392) T ss_pred CcHH-HHHHHHHHHHHHHHHHHHhhH----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccC Confidence 9876 778888888887766554221 222445555555666666664321 11 11111111111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccc---hhhhHhhcchhHHHHHHHHHHhhCccccceeeec Q lcl|Aclame:pro 74 DKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVT---ITDTTFQLPRKLVESINTALLNTNPVFKVFHVTN 150 (393) Q Consensus 74 ~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~---~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n 150 (393) ....-+.+.+ +++.+...+.+.+.+.........+... +.+--.++|..+..-|-+.+.++.++++...+.+ T Consensus 70 --~~~~~~~~~~---~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~ 144 (392) T protein:vir:10 70 --VDGEMEYRDV---FMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEP 144 (392) T ss_pred --ccchHHHHHH---HHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeee Confidence 1111122222 3344544444445544444444444332 2233337899998889999998888887667777 Q ss_pred cccee--EEEeecc-ccccceecccchhhhh-hhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 151 VGALL--VSRSFDS-SNEAQVHKDGQTKTEQ-AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVN 226 (393) Q Consensus 151 ~~~~a--~~i~l~n-a~~a~GHk~ga~Kk~q-~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~ 226 (393) .+... ..+...+ ...+..+.-|+++.+. 
..+|...++.|..+|...++-+.+.+ .+.-++.+|++++|+..+- T Consensus 145 ~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~l~~~i~- 221 (392) T protein:vir:10 145 VRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQ--DSDQNILKYVTKWLGKKSK- 221 (392) T ss_pred ccCCceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHh--hhHHHHHHHHHHHHHHHHH- Confidence 65443 2233333 3344456667777765 57999999999999988888444432 2224689999999999998 Q ss_pred HHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhc--eeeecCCCceEEEecchhhhHhhhhh Q lcl|Aclame:pro 227 KIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVD--FVRPTAGRRYLIVKTEDRKALLDELR 304 (393) Q Consensus 227 Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald--~a~~~a~~~~l~i~~~d~~a~~~~~~ 304 (393) +..+.+++.|+|+... +.....+++..++. ....-..+..+++|+.++.+|+- T Consensus 222 ~~~d~~~~~g~g~~~~----------------------~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~--- 276 (392) T protein:vir:10 222 VTRNVLILGVIEKLTK----------------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDK--- 276 (392) T ss_pred HHHHHHHhhccccccc----------------------cCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHH--- Confidence 7999999999997431 11233455555552 22222345678999999999876 Q ss_pred ccccccceeeecCCCeEEEeeecccchhhc--ccchhc---------eeeeeccccee--cccc---------eeeecce Q lcl|Aclame:pro 305 QATANANVRIKNDDTEIASEVGVDEIIVYT--GSKAVK---------PTVLVDQKYHI--DMQD---------LTKVDAF 362 (393) Q Consensus 305 ~~~~~a~~~l~~~~~~~~~~v~~~~~~~~t--G~k~~~---------ptv~vD~k~~~--~~~~---------~~~~~s~ 362 (393) |++.+|.+.+...+.+-.-+| |...|+ |.+..+....+ |+|. ++-.-+. 
T Consensus 277 ---------lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~ 347 (392) T protein:vir:10 277 ---------LKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTD 347 (392) T ss_pred ---------hhccCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEec Confidence 788888877643332211111 332221 11111111111 3232 2211110 Q ss_pred --e--EeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 363 --E--WKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 363 --~--~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) . |..|+-.+.++.+..|-+.-+++-+.+++. T Consensus 348 ~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~ 382 (392) T protein:vir:10 348 VGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEID 382 (392) T ss_pred cccchhhcCceEEEEEEeeccEEecccceEEEEec Confidence 1 223444466777788888888888887776 No 37 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=99.63 E-value=8.4e-17 Score=108.71 Aligned_cols=340 Identities=12% Similarity=0.078 Sum_probs=192.7 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHH--HH-----HHhhhhhhcch Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIE--NE-----LNAQEEKPKGK 73 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~e--ne-----l~~~~Ek~K~k 73 (393) |+|- |.|++++|+++++.+.++-.. +++.+++.+...+..|+++|+..+ .+ .....+..+.. 
T Consensus 1 M~k~-l~el~~~~~~~~~e~~~~~~~----------~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 69 (392) T protein:vir:10 1 MSKE-LRELLAKLEGKKEEVRSLMGE----------DKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRN 69 (392) T ss_pred CcHH-HHHHHHHHHHHHHHHHHHhhH----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccC Confidence 9876 778888888887766554221 222445555555666666664321 11 11111111111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccc---hhhhHhhcchhHHHHHHHHHHhhCccccceeeec Q lcl|Aclame:pro 74 DKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVT---ITDTTFQLPRKLVESINTALLNTNPVFKVFHVTN 150 (393) Q Consensus 74 ~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~---~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n 150 (393) ....-+.+.+ +++.+...+.+.+.+.........+... +.+--.++|..+..-|-+.+.++.++++...+.+ T Consensus 70 --~~~~~~~~~~---~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~ 144 (392) T protein:vir:10 70 --VDGEMEYRDV---FMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEP 144 (392) T ss_pred --ccchHHHHHH---HHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeee Confidence 1111122222 3344544444445544444444444332 2233337899998889999998888887667777 Q ss_pred cccee--EEEeecc-ccccceecccchhhhh-hhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 151 VGALL--VSRSFDS-SNEAQVHKDGQTKTEQ-AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVN 226 (393) Q Consensus 151 ~~~~a--~~i~l~n-a~~a~GHk~ga~Kk~q-~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~ 226 (393) .+... ..+...+ ...+..+.-|+++.+. 
..+|...++.|..+|...++-+.+.+ .+.-++.+|++++|+..+- T Consensus 145 ~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~l~~~i~- 221 (392) T protein:vir:10 145 VRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQ--DSDQNILKYVTKWLGKKSK- 221 (392) T ss_pred ccCCceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHh--hhHHHHHHHHHHHHHHHHH- Confidence 65443 2233333 3344456667777765 57999999999999988888444432 2224689999999999998 Q ss_pred HHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhc--eeeecCCCceEEEecchhhhHhhhhh Q lcl|Aclame:pro 227 KIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVD--FVRPTAGRRYLIVKTEDRKALLDELR 304 (393) Q Consensus 227 Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald--~a~~~a~~~~l~i~~~d~~a~~~~~~ 304 (393) +..+.+++.|+|+... +.....+++..++. ....-..+..+++|+.++.+|+- T Consensus 222 ~~~d~~~~~g~g~~~~----------------------~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~--- 276 (392) T protein:vir:10 222 VTRNVLILGVIEKLTK----------------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDK--- 276 (392) T ss_pred HHHHHHHhhccccccc----------------------cCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHH--- Confidence 7999999999997431 11233455555552 22222345678999999999876 Q ss_pred ccccccceeeecCCCeEEEeeecccchhhc--ccchhc---------eeeeeccccee--cccc---------eeeecce Q lcl|Aclame:pro 305 QATANANVRIKNDDTEIASEVGVDEIIVYT--GSKAVK---------PTVLVDQKYHI--DMQD---------LTKVDAF 362 (393) Q Consensus 305 ~~~~~a~~~l~~~~~~~~~~v~~~~~~~~t--G~k~~~---------ptv~vD~k~~~--~~~~---------~~~~~s~ 362 (393) |++.+|.+.+...+.+-.-+| |...|+ |.+..+....+ |+|. ++-.-+. 
T Consensus 277 ---------lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~ 347 (392) T protein:vir:10 277 ---------LKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTD 347 (392) T ss_pred ---------hhccCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEec Confidence 788888877643332211111 332221 11111111111 3232 2211110 Q ss_pred --e--EeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 363 --E--WKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 363 --~--~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) . |..|+-.+.++.+..|-+.-+++-+.+++. T Consensus 348 ~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~ 382 (392) T protein:vir:10 348 VGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEID 382 (392) T ss_pred cccchhhcCceEEEEEEeeccEEecccceEEEEec Confidence 1 223444466777788888888888887776 No 38 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=99.63 E-value=8.4e-17 Score=108.71 Aligned_cols=340 Identities=12% Similarity=0.078 Sum_probs=192.7 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHH--HH-----HHhhhhhhcch Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIE--NE-----LNAQEEKPKGK 73 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~e--ne-----l~~~~Ek~K~k 73 (393) |+|- |.|++++|+++++.+.++-.. +++.+++.+...+..|+++|+..+ .+ .....+..+.. 
T Consensus 1 M~k~-l~el~~~~~~~~~e~~~~~~~----------~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 69 (392) T protein:vir:10 1 MSKE-LRELLAKLEGKKEEVRSLMGE----------DKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRN 69 (392) T ss_pred CcHH-HHHHHHHHHHHHHHHHHHhhH----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccC Confidence 9876 778888888887766554221 222445555555666666664321 11 11111111111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccc---hhhhHhhcchhHHHHHHHHHHhhCccccceeeec Q lcl|Aclame:pro 74 DKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVT---ITDTTFQLPRKLVESINTALLNTNPVFKVFHVTN 150 (393) Q Consensus 74 ~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~---~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n 150 (393) ....-+.+.+ +++.+...+.+.+.+.........+... +.+--.++|..+..-|-+.+.++.++++...+.+ T Consensus 70 --~~~~~~~~~~---~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~ 144 (392) T protein:vir:10 70 --VDGEMEYRDV---FMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEP 144 (392) T ss_pred --ccchHHHHHH---HHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeee Confidence 1111122222 3344544444445544444444444332 2233337899998889999998888887667777 Q ss_pred cccee--EEEeecc-ccccceecccchhhhh-hhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 151 VGALL--VSRSFDS-SNEAQVHKDGQTKTEQ-AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVN 226 (393) Q Consensus 151 ~~~~a--~~i~l~n-a~~a~GHk~ga~Kk~q-~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~ 226 (393) .+... ..+...+ ...+..+.-|+++.+. 
..+|...++.|..+|...++-+.+.+ .+.-++.+|++++|+..+- T Consensus 145 ~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~l~~~i~- 221 (392) T protein:vir:10 145 VRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQ--DSDQNILKYVTKWLGKKSK- 221 (392) T ss_pred ccCCceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHh--hhHHHHHHHHHHHHHHHHH- Confidence 65443 2233333 3344456667777765 57999999999999988888444432 2224689999999999998 Q ss_pred HHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhc--eeeecCCCceEEEecchhhhHhhhhh Q lcl|Aclame:pro 227 KIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVD--FVRPTAGRRYLIVKTEDRKALLDELR 304 (393) Q Consensus 227 Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald--~a~~~a~~~~l~i~~~d~~a~~~~~~ 304 (393) +..+.+++.|+|+... +.....+++..++. ....-..+..+++|+.++.+|+- T Consensus 222 ~~~d~~~~~g~g~~~~----------------------~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~--- 276 (392) T protein:vir:10 222 VTRNVLILGVIEKLTK----------------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDK--- 276 (392) T ss_pred HHHHHHHhhccccccc----------------------cCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHH--- Confidence 7999999999997431 11233455555552 22222345678999999999876 Q ss_pred ccccccceeeecCCCeEEEeeecccchhhc--ccchhc---------eeeeeccccee--cccc---------eeeecce Q lcl|Aclame:pro 305 QATANANVRIKNDDTEIASEVGVDEIIVYT--GSKAVK---------PTVLVDQKYHI--DMQD---------LTKVDAF 362 (393) Q Consensus 305 ~~~~~a~~~l~~~~~~~~~~v~~~~~~~~t--G~k~~~---------ptv~vD~k~~~--~~~~---------~~~~~s~ 362 (393) |++.+|.+.+...+.+-.-+| |...|+ |.+..+....+ |+|. ++-.-+. 
T Consensus 277 ---------lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~ 347 (392) T protein:vir:10 277 ---------LKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTD 347 (392) T ss_pred ---------hhccCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEec Confidence 788888877643332211111 332221 11111111111 3232 2211110 Q ss_pred --e--EeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 363 --E--WKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 363 --~--~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) . |..|+-.+.++.+..|-+.-+++-+.+++. T Consensus 348 ~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~ 382 (392) T protein:vir:10 348 VGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEID 382 (392) T ss_pred cccchhhcCceEEEEEEeeccEEecccceEEEEec Confidence 1 223444466777788888888888887776 No 39 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=99.63 E-value=8.4e-17 Score=108.71 Aligned_cols=340 Identities=12% Similarity=0.078 Sum_probs=192.7 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHH--HH-----HHhhhhhhcch Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIE--NE-----LNAQEEKPKGK 73 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~e--ne-----l~~~~Ek~K~k 73 (393) |+|- |.|++++|+++++.+.++-.. +++.+++.+...+..|+++|+..+ .+ .....+..+.. 
T Consensus 1 M~k~-l~el~~~~~~~~~e~~~~~~~----------~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 69 (392) T protein:vir:10 1 MSKE-LRELLAKLEGKKEEVRSLMGE----------DKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRN 69 (392) T ss_pred CcHH-HHHHHHHHHHHHHHHHHHhhH----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccC Confidence 9876 778888888887766554221 222445555555666666664321 11 11111111111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccc---hhhhHhhcchhHHHHHHHHHHhhCccccceeeec Q lcl|Aclame:pro 74 DKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVT---ITDTTFQLPRKLVESINTALLNTNPVFKVFHVTN 150 (393) Q Consensus 74 ~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~---~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n 150 (393) ....-+.+.+ +++.+...+.+.+.+.........+... +.+--.++|..+..-|-+.+.++.++++...+.+ T Consensus 70 --~~~~~~~~~~---~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~ 144 (392) T protein:vir:10 70 --VDGEMEYRDV---FMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEP 144 (392) T ss_pred --ccchHHHHHH---HHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeee Confidence 1111122222 3344544444445544444444444332 2233337899998889999998888887667777 Q ss_pred cccee--EEEeecc-ccccceecccchhhhh-hhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 151 VGALL--VSRSFDS-SNEAQVHKDGQTKTEQ-AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVN 226 (393) Q Consensus 151 ~~~~a--~~i~l~n-a~~a~GHk~ga~Kk~q-~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~ 226 (393) .+... ..+...+ ...+..+.-|+++.+. 
..+|...++.|..+|...++-+.+.+ .+.-++.+|++++|+..+- T Consensus 145 ~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~l~~~i~- 221 (392) T protein:vir:10 145 VRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQ--DSDQNILKYVTKWLGKKSK- 221 (392) T ss_pred ccCCceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHh--hhHHHHHHHHHHHHHHHHH- Confidence 65443 2233333 3344456667777765 57999999999999988888444432 2224689999999999998 Q ss_pred HHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhc--eeeecCCCceEEEecchhhhHhhhhh Q lcl|Aclame:pro 227 KIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVD--FVRPTAGRRYLIVKTEDRKALLDELR 304 (393) Q Consensus 227 Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald--~a~~~a~~~~l~i~~~d~~a~~~~~~ 304 (393) +..+.+++.|+|+... +.....+++..++. ....-..+..+++|+.++.+|+- T Consensus 222 ~~~d~~~~~g~g~~~~----------------------~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~--- 276 (392) T protein:vir:10 222 VTRNVLILGVIEKLTK----------------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDK--- 276 (392) T ss_pred HHHHHHHhhccccccc----------------------cCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHH--- Confidence 7999999999997431 11233455555552 22222345678999999999876 Q ss_pred ccccccceeeecCCCeEEEeeecccchhhc--ccchhc---------eeeeeccccee--cccc---------eeeecce Q lcl|Aclame:pro 305 QATANANVRIKNDDTEIASEVGVDEIIVYT--GSKAVK---------PTVLVDQKYHI--DMQD---------LTKVDAF 362 (393) Q Consensus 305 ~~~~~a~~~l~~~~~~~~~~v~~~~~~~~t--G~k~~~---------ptv~vD~k~~~--~~~~---------~~~~~s~ 362 (393) |++.+|.+.+...+.+-.-+| |...|+ |.+..+....+ |+|. ++-.-+. 
T Consensus 277 ---------lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~ 347 (392) T protein:vir:10 277 ---------LKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTD 347 (392) T ss_pred ---------hhccCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEec Confidence 788888877643332211111 332221 11111111111 3232 2211110 Q ss_pred --e--EeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 363 --E--WKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 363 --~--~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) . |..|+-.+.++.+..|-+.-+++-+.+++. T Consensus 348 ~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~ 382 (392) T protein:vir:10 348 VGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEID 382 (392) T ss_pred cccchhhcCceEEEEEEeeccEEecccceEEEEec Confidence 1 223444466777788888888888887776 No 40 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=99.61 E-value=5e-16 Score=104.46 Aligned_cols=367 Identities=8% Similarity=0.012 Sum_probs=173.4 Q ss_pred CCc------ch------hhH---HHHHHHHHHHhhHHHHhhhhh----hhh---hhHHhhhhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MNK------PD------LIE---KQNRLAELKENNVSLKSQISG----FEV---KNAIEDLPKVQELEKTLSENSIEIIK 58 (393) Q Consensus 1 ~~k------~d------~~e---kq~eLa~lK~~~~~~~s~i~~----~~v---~~a~~~~skieelektis~l~aEi~k 58 (393) +.+ ++ .++ .-.++.++.++......++++ .+- +...++...+++++..+.++...+.+ T Consensus 118 ~e~r~e~~a~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~e~k~~~e~~~~e~~e~~~~~~~~~e~l~~~~e~~~~~~~~ 197 (543) T protein:vir:81 118 RRMRVEAGSSQGGRGDYDRDAILEPDSIEDCRFRDPWNLSEMRTFGRDAEEVKGELRARALSAIEKMQGASDNVRAAATK 197 (543) T ss_pred HHhhhhhhhHHHhhHHHHHhhhccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 000 00 000 000111111111111111111 100 11112233445555555554444433 Q ss_pred HHHHHHhhhhhhcchhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHH---HhCccchhhhHhhcchhHHH-HHHH Q lcl|Aclame:pro 59 
IENELNAQEEKPKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKL---AENGVTITDTTFQLPRKLVE-SINT 134 (393) Q Consensus 59 ~enel~~~~Ek~K~k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L---~ekgV~~qd~~eiLP~~ii~-AIe~ 134 (393) .+..+.....+........+-...+.+...| +...... .....-...+ ...+++.++.-.++|..+.. -|.. T Consensus 198 ~~~~~d~~e~~~~~~~~~~~~~~~~~a~~~~---~~~~~~~-~l~~~e~~~~~~~~~~~~t~~~gg~lip~~~~~~ii~~ 273 (543) T protein:vir:81 198 IIERFDDEDSTLARQCLATSSPAYLRAWSKM---ARNPHAA-ILTEEEKRAINEVRAMGLTKADGGYLVPFQLDPTVIIT 273 (543) T ss_pred HHHHHHHHHHHHhhhhhhhhhhhhhhHHHHH---HHhhHHH-HhhhhhhhhhhhhhhcccccccCcccCchhhhhHHHHH Confidence 3322222111100000000111111121111 2211111 1111111112 22345444444478877664 3677 Q ss_pred HHHhhCccccceeeecccceeEEEeecc-ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHH Q lcl|Aclame:pro 135 ALLNTNPVFKVFHVTNVGALLVSRSFDS-SNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELY 213 (393) Q Consensus 135 A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalv 213 (393) +++++.++.....|.... +-..+-..+ ...+.-+.-|.+++....+|...++.|..++.+..+.+-+. +.++ ++. T Consensus 274 ~~~~~~~l~~~~~~~~~~-g~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell--~d~~-~~~ 349 (543) T protein:vir:81 274 SNGSLNDIRRFARQVVAT-GDVWHGVSSAAVQWSWDAEFEEVSDDSPEFGQPEIPVKKAQGFVPISIEAL--QDEA-NVT 349 (543) T ss_pred HHhhhchhhhhcccccCC-cceEEEEecCCcceeecccCccccccccccceeeeeeeeeEeeehhhHHHH--hccH-HHH Confidence 888767666554444332 222232222 23333345677888999999999999999998877744333 3334 799 Q ss_pred HHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEe Q lcl|Aclame:pro 214 NLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVK 292 (393) Q Consensus 214 nyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~ 292 (393) +||++.|+.++. +.++.++++|||++. ...-+...... 
......+..+.+...+++...+ .+...-..+..++++ T Consensus 350 ~~i~~~l~~~~~-~~~d~ail~G~Gt~~-~p~Gi~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n 425 (543) T protein:vir:81 350 ETVALLFAEGKD-ELEAVTLTTGTGQGN-QPTGIVTALAG--TAAEIAPVTAETFALADVYAVYEQLAARHRRQGAWLAN 425 (543) T ss_pred HHHHHHHHHHHH-HHHHHHHhccCCCCc-ccccchhhccc--ccccccccccccccHHHHHHHHHhhhccccCCcEEEEc Confidence 999999999998 799999999999852 11111110000 0011112223344455555554 333333445678889 Q ss_pred cchhhhHhhhhhccccccceeeecCCCeEEEeee---cccchhhcccchhc-----------------eeeeeccccee- Q lcl|Aclame:pro 293 TEDRKALLDELRQATANANVRIKNDDTEIASEVG---VDEIIVYTGSKAVK-----------------PTVLVDQKYHI- 351 (393) Q Consensus 293 ~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~v~---~~~~~~~tG~k~~~-----------------ptv~vD~k~~~- 351 (393) +.++.+|+. +++++|.+.+.-. .+...+ |-..++ |.++.|-+.++ T Consensus 426 ~~~~~~l~~------------lkd~~G~~l~~~~~~g~~~~l~--G~pv~~~~~~~~~~~~~~~~~~~~i~~gd~~~~~i 491 (543) T protein:vir:81 426 NLIYNKIRQ------------FDTQGGAGLWTTIGNGEPSQLL--GRPVGEAEAMDANWNTSASADNFVLLYGNFQNYVI 491 (543) T ss_pred HHHHHHHHH------------hhcCCCceeccCcCCCCCcccc--ceeeEEeccccccccccccCCcceEEEeeccceeE Confidence 988888887 7777777665321 111111 321111 22223333222 Q ss_pred -cccceee-e-----cceeEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 352 -DMQDLTK-V-----DAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 352 -~~~~~~~-~-----~s~~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) +-+|++- . 
..+.+.+|+-.|.++.+..|-+.-+++-++++++ T Consensus 492 ~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~ 540 (543) T protein:vir:81 492 ADRIGMTVEFIPHLFGTNRRPNGSRGWFAYYRMGADVVNPNAFRLLNVE 540 (543) T ss_pred EeecccEEEEeccccccchhhcCceEEEEEEeeccEeecccceEEEEec Confidence 1123211 1 1223344566788889999999999998888888 No 41 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=99.59 E-value=6.5e-16 Score=103.85 Aligned_cols=361 Identities=12% Similarity=0.088 Sum_probs=184.8 Q ss_pred CCcc-hhhHHH-------HHHHHHHHhhHHHHhhhhhhhhhhHHhhhh----HHHHHHHHHHHHHHHHHHHHHHHHhhhh Q lcl|Aclame:pro 1 MNKP-DLIEKQ-------NRLAELKENNVSLKSQISGFEVKNAIEDLP----KVQELEKTLSENSIEIIKIENELNAQEE 68 (393) Q Consensus 1 ~~k~-d~~ekq-------~eLa~lK~~~~~~~s~i~~~~v~~a~~~~s----kieelektis~l~aEi~k~enel~~~~E 68 (393) |+++ ++.+++ .+|.+++...+++..++++. .+++..++. ..+|++..++++.+++...+..+..+++ T Consensus 4 ~~~~~~~~~~~~~~~el~~~~~e~~~~l~~~~~e~~~~-~e~~~~e~~~~~~~~~e~~~~~~~l~~~~~~l~~~~~~~e~ 82 (418) T protein:vir:10 4 MNEPRQFGRKSGGDSHPEQVLETVTKELKRIGDEVKSA-GEKALAEAKRAGDLGVETKATVDELLIKQGELQARLLEAEQ 82 (418) T ss_pred chhHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5554 333333 33444444444444444332 122111111 1223333333333333322222211111 Q ss_pred ------------hhcchhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhC----ccchhhhHhhcchhHHHHH Q lcl|Aclame:pro 69 ------------KPKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAEN----GVTITDTTFQLPRKLVESI 132 (393) Q Consensus 69 ------------k~K~k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ek----gV~~qd~~eiLP~~ii~AI 132 (393) .++...+. ..+.... +-++..-......+.....+.+. 
+....+...++|..+...| T Consensus 83 ~~~~~~~~~~~~~~~~~~~~---~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lvp~~~~~~i 155 (418) T protein:vir:10 83 KLARGGGSAELETPKTLGQL---VTESEEM----KGMDGSARKSVRVRVDRKSIMNVPATVGSGVSGSNSLVVADRQAGI 155 (418) T ss_pred HHhhcccccccchhhhhhHH---hhhHHHH----HHHHHHHhhhhhhhhHHHHHHHhhhhccCCCCCCccccchhHHHHH Confidence 11111111 1111111 11111111111122222222221 2222333348999998888 Q ss_pred HHHHHhhCccccceeeecccceeEEEeeccc--cccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchh Q lcl|Aclame:pro 133 NTALLNTNPVFKVFHVTNVGALLVSRSFDSS--NEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYS 210 (393) Q Consensus 133 e~A~ed~d~vl~~fhV~n~~~~a~~i~l~na--~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~yg 210 (393) -..+.++..+++.+.+.+.+.....+-..+. ..+.-+--|+++.+...+|...++.|..++....+.+-+.+ ++ + T Consensus 156 i~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~-ds--~ 232 (418) T protein:vir:10 156 IAPPQRKMTIRDLLMPGQTSSSSIEYTVETGFTNNAAAVAEGAQKPTSDLKFNLKNQPVRTIAHLFKASRQILD-DA--P 232 (418) T ss_pred HHHHhhhhhHHhhcceeeccCCceeEEEEecCCCceeeeccCccccccccceeeEEEeeeeEEEeehhhHHHHH-hH--H Confidence 8999998888886555555444333333222 23333456778899999999999999998887777444333 22 4 Q ss_pred HHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceE Q lcl|Aclame:pro 211 ELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYL 289 (393) Q Consensus 211 alvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l 289 (393) ++.+|++++|+..+. ++++.++++|||++.. |. 
| +..-.-..+.+...+.....+++..++ .+..+......+ T Consensus 233 ~l~~~i~~~l~~a~~-~~~d~a~l~G~g~~~~---p~-G-i~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 306 (418) T protein:vir:10 233 ALQSYIDGRARYGLQ-LTEEGQILKGDGTGAN---IL-G-ILPQASAFMPSITLANATPIDKIRLALLQAVLAEFPATGI 306 (418) T ss_pred HHHHHHHHHHHHHHH-HHHHHHHhccCCCCcc---cc-c-cccccccccccccccccccHHHHHHHHHhhccccCCCCEE Confidence 799999999999999 7999999999998531 10 1 000001111222333345568888887 343344444468 Q ss_pred EEecchhhhHhhhhhccccccceeeecCCCeEEEeeecc---cchhhcccchhc--------eeeeecccc-e-e-cccc Q lcl|Aclame:pro 290 IVKTEDRKALLDELRQATANANVRIKNDDTEIASEVGVD---EIIVYTGSKAVK--------PTVLVDQKY-H-I-DMQD 355 (393) Q Consensus 290 ~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~v~~~---~~~~~tG~k~~~--------ptv~vD~k~-~-~-~~~~ 355 (393) ++++.+..+|+. +++++|.+.++-.++ ...+ |- .|+ +.++.|-+. + + +-.| T Consensus 307 v~n~~~~~~L~~------------lkd~~G~~i~~~~~~~~~~~l~--G~-pV~~~~~~p~~~~~~gd~s~~~~~~~~~~ 371 (418) T protein:vir:10 307 VLNPIDWASIEL------------TKDSQGRYIVGNPVNGTTPRLW--NL-PVVETQAMTANEFLVGAFSMAAQIFDRME 371 (418) T ss_pred EEcHHHHHHHHH------------hhcCCCceeccccccCCCceec--ce-eeEEcCCCCCCcEEEeeccceEEEEEecc Confidence 888888888776 778888777642111 1111 32 222 123334332 1 1 2233 Q ss_pred eeeecce----eEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 356 LTKVDAF----EWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 356 ~~~~~s~----~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) ++-.-+. -|.+|.-.+.++.+..|.+.-|.+-+++++. 
T Consensus 372 ~~i~~~~~~~~~f~~~~~~~r~~~~~d~~~~~~~a~~~~~~~ 413 (418) T protein:vir:10 372 IEVLLSTENVDDFEKNMVSIRAEERLALAVYRPESFVTGALV 413 (418) T ss_pred eEEEEecccchhhhcCceEEEEEEeeccEEecccceEEEEec Confidence 3322111 1334445566777889999999888888888 No 42 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=99.58 E-value=4.7e-16 Score=104.63 Aligned_cols=327 Identities=14% Similarity=0.125 Sum_probs=184.7 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhh----hcchhHH Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEK----PKGKDKM 76 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~Ek----~K~k~em 76 (393) |+| ++.+...+++.+++.+.++-. .+++.++++++..+..++.+|...+.......+. ...+... T Consensus 1 M~k-~l~~l~e~~~~~~~e~~~~~~----------~~~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 69 (371) T protein:vir:81 1 MPK-ELRELLEQINNKKEEARKLLA----------ENKIEEAKKLKEEIVALQEKFDVAKELYEEQKQTIEDKEPLKPTV 69 (371) T ss_pred CcH-HHHHHHHHHHHHHHHHHHHhh----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccch Confidence 998 577766666666665544321 1222345566666666666665433333322211 1111111 Q ss_pred HHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccce-- Q lcl|Aclame:pro 77 TNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGAL-- 154 (393) Q Consensus 77 tEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~-- 154 (393) .. ..+....|++-+.. .+.+++ . .|. ..+--.++|..+..-|-..+.++.++++.+.+.+.+.. 
T Consensus 70 ~~---~~~~~~~~~~~l~~-----~~~~a~----~-~~t-~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~ 135 (371) T protein:vir:81 70 QV---KENEVEAFVNHIRT-----RFRNAM----S-EGS-NQDGGYTVPQDIQTRINELRESKDALQNLITVEPVTTLSG 135 (371) T ss_pred hh---HHHHHHHHHHHHHH-----HHHHhh----c-cCC-CccCceeecHhHHHHHHHHHHhhhhhhhhceeeeccCCce Confidence 00 00111122211211 111121 1 111 11222368999999999999999999887666665543 Q ss_pred eEEEeeccc-cccceecccchhhh-hhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 155 LVSRSFDSS-NEAQVHKDGQTKTE-QAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLA 232 (393) Q Consensus 155 a~~i~l~na-~~a~GHk~ga~Kk~-q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rA 232 (393) -..+..... ..+..+.-|+++.+ ...+|...++.|..++...++.+.+.+ .+.-++.+|++++|+.++. |+.+.+ T Consensus 136 ~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~l~~a~~-~~~~~~ 212 (371) T protein:vir:81 136 SRVFKKRSQQTGFVEVAEGAAIGEKATPQFTLLQYQVKKYAGFFRVTNELLN--DSTEAIVNTLVRWIGDESR-VTRNGL 212 (371) T ss_pred eEEEEeecCCcceeeeccccccccccccceeeEEeeeeEEEEeehhhHHHHh--hhhHHHHHHHHHHHHHHHH-HHHHHH Confidence 233333343 45656777788776 468999999999999888777444433 2224789999999999999 799999 Q ss_pred eeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhce--eeecCCCceEEEecchhhhHhhhhhcccccc Q lcl|Aclame:pro 233 LVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDF--VRPTAGRRYLIVKTEDRKALLDELRQATANA 310 (393) Q Consensus 233 vv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald~--a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a 310 (393) ++.|||+.... +....+++...+.. ......+..+++|+.++.+|+. 
T Consensus 213 i~~g~g~~~~~----------------------~~~~~~~i~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~--------- 261 (371) T protein:vir:81 213 IINVLNTKAKT----------------------AIADLDGLKQIINVQLDPVFRSTSSVIVNQDAFNWLDT--------- 261 (371) T ss_pred HHhhccccccc----------------------ccccHHHHHHHHHhhcchhhhcCCEEEEcHHHHHHHHH--------- Confidence 99999984321 12233444444421 1222345678999999988886 Q ss_pred ceeeecCCCeEEEeeecccchhhc--ccchhceeeeec-----------------------cccee---cccceeeecce Q lcl|Aclame:pro 311 NVRIKNDDTEIASEVGVDEIIVYT--GSKAVKPTVLVD-----------------------QKYHI---DMQDLTKVDAF 362 (393) Q Consensus 311 ~~~l~~~~~~~~~~v~~~~~~~~t--G~k~~~ptv~vD-----------------------~k~~~---~~~~~~~~~s~ 362 (393) |++.+|.+-+...+..-.-.| | +|.+++| -+.++ +..|++-.-+. T Consensus 262 ---lkd~~g~~l~~~~~~~~~~~~l~G----~pV~~~~~~~~~~~~~~~~~~~~~~i~~Gd~~~~~~~~~~~~~~i~~~~ 334 (371) T protein:vir:81 262 ---LKDQNGQYLLQPSISSPTGRQLLG----LPVVIVSNKVLANRVDGGTGAQFAPIIVGDLKEAVVMFDRQRTEIMSSN 334 (371) T ss_pred ---hhccCCCeeeecccCCCCCceecc----eeEEEecccccCccccccccCCcceEEEEehhceEEEEeecceEEEEec Confidence 777777765532221110000 2 2333332 22111 22232211111 Q ss_pred ----eEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 363 ----EWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 363 ----~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) .|.+|+-.+.++....|-+.-|++-++++++ T Consensus 335 ~~~~~f~~~~v~~~~~~r~d~~~~~~~a~~~~~~~ 369 (371) T protein:vir:81 335 VAMDAFETDATLWRAIERMDVKMRDDEAFVFGEVQ 369 (371) T ss_pred cccchhhcCceEEEEEEeeccEEecccceEEEEEe Confidence 1345666777888889999999999999998 No 43 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=99.58 E-value=3.3e-15 Score=99.98 Aligned_cols=363 Identities=17% Similarity=0.197 Sum_probs=185.7 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhh----hhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcch-hH Q lcl|Aclame:pro 1 
MNKPDLIEKQNRLAELKENNVSLKSQI----SGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGK-DK 75 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i----~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~Ek~K~k-~e 75 (393) |.- ++.+..+.+.+|+++..+++... ++++-+.+ +.-.+++.++..+.+++..+...+.+.....+...+. .. T Consensus 1 m~~-~lk~l~~~~~el~~~~~~~k~~~~~~~~~~e~~~~-~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (401) T protein:vir:44 1 MAV-DIKDVEQVAQELQQKFDDFKAKNDKRVEAIEQEKG-KLAGQVETLNGKLSELENLKSDLEKELLELKRPARGAQNK 78 (401) T ss_pred CCc-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Confidence 543 34455555555555554444332 22221111 1123445555555555544443332222222211111 01 Q ss_pred HHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeeccccee Q lcl|Aclame:pro 76 MTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALL 155 (393) Q Consensus 76 mtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a 155 (393) .. -..+++...| +. +.-..++...+...|....- .+--..+|..+..-|-+.+.++.++++..++.+..... T Consensus 79 ~~--~e~~~a~~~~---lr-~~~~~~~~~~e~~a~~~~~~--~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 150 (401) T protein:vir:44 79 VA--AEHKDAFVGF---LR-KGREDGLRDLERKALQVGTD--EDGGYAVPEELDRSILSLLKDEVVMRQEATVITVGGSD 150 (401) T ss_pred hh--HHHHHHHHHH---Hh-hhhhhhhHHHHHHHhhcCCC--CCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCCCc Confidence 10 1123444444 22 11112444444433432211 11122689988888889999989888876765554443 Q ss_pred EEEeecc--ccccceecccchhhhhh-hhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 156 VSRSFDS--SNEAQVHKDGQTKTEQA-ATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLA 232 (393) Q Consensus 156 ~~i~l~n--a~~a~GHk~ga~Kk~q~-~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rA 232 (393) ..+-... ...+|. --|.++.+.. 
.+|...++.+..++....+.+-+ ++.+.-++.+|++++|+.++- +.++.+ T Consensus 151 ~~~~~~~~~~~a~wv-~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~el--l~ds~~~l~~~i~~~la~ai~-~~~~~~ 226 (401) T protein:vir:44 151 YKKLVNLGGTASGWV-GETDTRSQTATSRLGLIEPFMGEIYGNPQATQKM--LDDAFFNVEAWINSELATEFA-EQEEIA 226 (401) T ss_pred eEEEEecCCccceee-ccccccCccccccceeeeeehhheeeehhhhHHH--HhcchHHHHHHHHHHHHHHHH-HHHHhh Confidence 3333222 233342 2234445433 47888888888777776663332 223335789999999999998 799999 Q ss_pred eeeccCCCcccc-----c-hhhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhhhhc Q lcl|Aclame:pro 233 LVEGDGTNGFKS-----I-DKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQ 305 (393) Q Consensus 233 vv~gDG~~~t~~-----~-~~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~ 305 (393) +++|||++...- . ........+..+....+..+.+...|++...+ .+...-..+..+++++.++.+|+. T Consensus 227 ~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~---- 302 (401) T protein:vir:44 227 FTTGDGTKKPKGFLAYESTEESDKARAFGKLQHIVSGEATAVTADAIIKLIYTLRKAHRTGAKFMMNNNSLFAIRL---- 302 (401) T ss_pred hhccCCCCccceeeccccccccccccccccccccccccccccCHHHHHHHHHhcchhhhcCCEEEEcHHHHHHHHH---- Confidence 999999853110 0 00011111122222223334445577887766 333222356678999999888876 Q ss_pred cccccceeeecCCCeEEEeeec--c--cchhhcccchhc------------eeeeecccc--ee-cccceeeecceeEee Q lcl|Aclame:pro 306 ATANANVRIKNDDTEIASEVGV--D--EIIVYTGSKAVK------------PTVLVDQKY--HI-DMQDLTKVDAFEWKT 366 (393) Q Consensus 306 ~~~~a~~~l~~~~~~~~~~v~~--~--~~~~~tG~k~~~------------ptv~vD~k~--~~-~~~~~~~~~s~~~~~ 366 (393) |+|++|.+-+--.+ | ...+ |...+. 
|.++-|-++ .+ +-.|++...+..+ T Consensus 303 --------lkd~~G~~l~~~~~~~g~~~~l~--G~PVv~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~~~~~~~~-- 370 (401) T protein:vir:44 303 --------LKDTEGNYLWRPGLELGQPSSLA--GYGIAENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRDPYT-- 370 (401) T ss_pred --------hhccCCceeecCCcCCCCCceec--ceeeEEecCcCCccCCccEEEEeehhccEEEEEecceEEeeeccc-- Confidence 66666665442111 1 1111 322211 223345332 22 4455554433333 Q ss_pred cCceE--EEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 367 NSNMI--LVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 367 ns~~i--~~~~~~~g~~~~~n~~~~~~v~ 393 (393) ++|++ .+..+..|-+-.+++-..++++ T Consensus 371 ~~~~v~~~a~~r~d~~~~~~~a~~~l~~~ 399 (401) T protein:vir:44 371 NKPFVGFYTTKRTGGMLVDSQAIKLLKIA 399 (401) T ss_pred cCCcEEEEEEEEeccEEecccceEEEEee Confidence 34554 4555688889999999999998 No 44 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=99.58 E-value=2.8e-15 Score=100.37 Aligned_cols=354 Identities=12% Similarity=0.054 Sum_probs=176.6 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhh------hhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhh------- Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEV------KNAIEDLPKVQELEKTLSENSIEIIKIENELNAQE------- 67 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v------~~a~~~~skieelektis~l~aEi~k~enel~~~~------- 67 (393) |. ++|.|.+|+++++........+....- +...+...+++++.+.++++..++.+.+....... 
T Consensus 1 Mk---i~elk~el~~~~~el~~~~~elr~~~~~~~~~~~el~~~~~e~~~~~~ei~el~~~l~~~~~~~~~~~e~~~~~~ 77 (437) T protein:vir:10 1 MK---IEKLKKDLATKTAELNTKKAEIRSFTESEDKTIDEVKAGMTEIKEKEDEIKEIRSNIEVLEQASALKVEEKRDDS 77 (437) T ss_pred CC---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 66 566666666655544443333322211 11222234455555555555555532111111100 Q ss_pred -----------------hhhcchhHHHHHHH-HHHHHHHHHHHHHH----------ccCChhHHHHHHHHHHhC------ Q lcl|Aclame:pro 68 -----------------EKPKGKDKMTNFIE-SQNAVTEFFDVLKK----------NSGKSEIKNAWNAKLAEN------ 113 (393) Q Consensus 68 -----------------Ek~K~k~emtEfLk-TkqA~~dya~ll~~----------nqg~ke~k~AW~a~L~ek------ 113 (393) +..+...+..+-+. .+++..+....... ......-.+++...+... T Consensus 78 ~~~~~e~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~ 157 (437) T protein:vir:10 78 DLVAPELEENSADNEEDDPEKLKTETKSEAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIADKKVTAFADYLKTGEVRDVT 157 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHHhhhhhhHHHHHhhhhhhhh Confidence 00000000000000 00001110000000 000011122233333221 Q ss_pred ccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEee--ccc-cccceecccchhhhhhhhhhhhhccHH Q lcl|Aclame:pro 114 GVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSF--DSS-NEAQVHKDGQTKTEQAATLTIDTLEPV 190 (393) Q Consensus 114 gV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l--~na-~~a~GHk~ga~Kk~q~~~le~~ti~p~ 190 (393) .....+.-.++|..+...|... ..+..+.....+.+.+.....+.. .+. ..+|+.-.+..++....+|...++.|. 
T Consensus 158 ~~~~~~~g~lvp~~~~~~i~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~e~~~~~~~~v~~~~~ 236 (437) T protein:vir:10 158 GIALKDGKVIIPETILTPEKEV-HQFPRLGSLVRTESVTTTTGKLPIFNNSTDLLTAHTEYGQTTKNATPVITPILWDLK 236 (437) T ss_pred hcccccccccchHHHHHHHHHh-hhhhhhhhcceeEeeccCceeeEEeeccccccccccccccccccccccceeeeeehh Confidence 1222333337888877777764 446667665666665555444333 222 344444444444456678999999999 Q ss_pred HHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCH Q lcl|Aclame:pro 191 MVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFA 270 (393) Q Consensus 191 ~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~ 270 (393) .+|....+...+.+ .+.-++.+|++++|+.++. ++.+.++++|||+..+ ....+... T Consensus 237 k~~~~~~is~ell~--ds~~~~~~~i~~~l~~~~~-~~~~~~i~~g~g~~~~--------------------~~~~~~~~ 293 (437) T protein:vir:10 237 TYTGGYVFSQELIS--DSSYDWQAELQSRLIELRD-NTDDSLIITALTDGIK--------------------KTTSTYLL 293 (437) T ss_pred heeeehhhhHHHHh--hhHHHHHHHHHHHHHHHHH-HHHHHHHhhhhccccc--------------------ccccccch Confidence 99988777444433 2224689999999999999 7999999999987321 11224445 Q ss_pred HHHHhhhceeeec--CCCceEEEecchhhhHhhhhhccccccceeeecCCCeEEEeeecccchhhc--ccchhc------ Q lcl|Aclame:pro 271 DAIEEAVDFVRPT--AGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKAVK------ 340 (393) Q Consensus 271 dal~Eald~a~~~--a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~v~~~~~~~~t--G~k~~~------ 340 (393) +++..++..+-+. ..+.+.++|+.++.+|+. |++.+|.|.+...+++-.-+| |...+. 
T Consensus 294 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~------------lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~ 361 (437) T protein:vir:10 294 GDLKKVLNVTLKPQDSAAASIVMSQSAYNLFDM------------ATDAMGRPLLQPNVTAATGYTLLGKTVVIVDDKLF 361 (437) T ss_pred hhHHHHHHhhhhhhhhcCCEEEEcHHHHHHHHH------------hhccCCCeeeccCccCCCCcccccceeEEeccccc Confidence 5666665432222 245678899999988877 788888877643333211111 422211 Q ss_pred e--------eeeeccc-cee--cccceeeecceeEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 341 P--------TVLVDQK-YHI--DMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 341 p--------tv~vD~k-~~~--~~~~~~~~~s~~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) | .++-|-+ +|+ +.+|++.-.+--..+.+..+.+.....|.+-.|++.++++.- T Consensus 362 ~~~~~~~~~~~~gd~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~ 425 (437) T protein:vir:10 362 PSASAGDVNIVVAPLKKAVINFKLTEITGQFQDTYDIWYKQLGIFLRQNVVQASKDLIVNLTGK 425 (437) T ss_pred CCcCCCceEEEEeeccccEEEEeeeceEEEEecccccccceeeEEEEEccEEecccceEEEEee Confidence 2 2233422 222 233443322212334455666666678888888887777643 No 45 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=99.52 E-value=3.2e-15 Score=100.03 Aligned_cols=345 Identities=11% Similarity=0.084 Sum_probs=190.2 Q ss_pred CCcc-hhhHHHHHHHHHHHhhHHHHhhhhhhhhhh----HHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh----- Q lcl|Aclame:pro 1 MNKP-DLIEKQNRLAELKENNVSLKSQISGFEVKN----AIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKP----- 70 (393) Q Consensus 1 ~~k~-d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~----a~~~~skieelektis~l~aEi~k~enel~~~~Ek~----- 70 (393) ||-. .+.+-++++++|++++..+..++.+...+. 
+.+...++++++..+..+..+++..+..+......+ T Consensus 1 Mn~~e~lkel~~~~~el~~~~~~~~~~~~~~~~e~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (421) T protein:vir:13 1 MNLFERLKELRAKKKELEEKRCGIVEEIRSLAKEKKEEEARSKALEREKIEARMEIIEEEIESVMTAIDEERKNTNFTGG 80 (421) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Confidence 9844 466678889999999888877776654322 223344556666666655555544333333222100 Q ss_pred --cchhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceee Q lcl|Aclame:pro 71 --KGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHV 148 (393) Q Consensus 71 --K~k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV 148 (393) ...... .-.+.......|++-+.......+. .+++.+.+--.++|..+...|...++++..+++.+.+ T Consensus 81 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~---------ra~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~ 150 (421) T protein:vir:13 81 RVIINGDS-KEEKRSLQLSAMSKTIRGIQLSEEE---------RDIMSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHV 150 (421) T ss_pred ccccccch-hHHHHHHHHHHHHHhhhccchhHHH---------hhccccCCcceecchhhHHHHHHHHHhhhhhhhhcee Confidence 000000 0011111112222222211111111 1245555555579999999999999999999887777 Q ss_pred ecccceeEEEee--cc-ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 149 TNVGALLVSRSF--DS-SNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIV 225 (393) Q Consensus 149 ~n~~~~a~~i~l--~n-a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI 225 (393) .+.+.....+.- .. ......+--|..+.+...+|...++.|..++.+..+.+-+. +.+.-++.+|++++|+.++. 
T Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~s~~~f~~i~~~~~k~~~~v~iS~ell--~ds~~~l~~~i~~~la~~~~ 228 (421) T protein:vir:13 151 IPVNRNAGKMPVRAGASVDKLANLAKDTELVKAMLKTQPMAYDIDDYGLLAPIDNSLL--EDSEINFLEFVNEEFAEFAV 228 (421) T ss_pred eeccCCceEEEEeecCCccceeeccccccccccccceeEEEeeeeeeEeehhhhHHHH--hhhHHHHHHHHHHHHHHHHH Confidence 666655444332 22 23344466677888889999999999998888877744332 22224689999999999988 Q ss_pred HHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhc-eeeecCCCceEEEecchhhhHhhhhh Q lcl|Aclame:pro 226 NKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVD-FVRPTAGRRYLIVKTEDRKALLDELR 304 (393) Q Consensus 226 ~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald-~a~~~a~~~~l~i~~~d~~a~~~~~~ 304 (393) +.++.+++.. ++-+.+ .+...+.|++..++. +...-..+..+++++.++.+|+. T Consensus 229 -~~~~~~i~~~-----------------~~g~~~----~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~--- 283 (421) T protein:vir:13 229 -NTENAEIVKQ-----------------AKAVLA----EETINDYAGLVKTINSLVPNARKRAIIVTNSDGRAYLDG--- 283 (421) T ss_pred -HHhhhhHhhh-----------------hhhccc----cccccchHHHHHHHHHhhhhhcCCCEEEEcHHHHHHHHH--- Confidence 5777665421 111211 122345677877773 32223345678899999888876 Q ss_pred ccccccceeeecCCCeEEEee---ecccchhhcccchh------------ceeeeecccc-e-e-cccceeeecceeEee Q lcl|Aclame:pro 305 QATANANVRIKNDDTEIASEV---GVDEIIVYTGSKAV------------KPTVLVDQKY-H-I-DMQDLTKVDAFEWKT 366 (393) Q Consensus 305 ~~~~~a~~~l~~~~~~~~~~v---~~~~~~~~tG~k~~------------~ptv~vD~k~-~-~-~~~~~~~~~s~~~~~ 366 (393) |++.+|.|.+.- |.+...+ |...+ .+.++-|-+. 
| + +-+|++-..+..-.| T Consensus 284 ---------lkd~~G~~i~~~~~~~~~~tl~--G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~f 352 (421) T protein:vir:13 284 ---------LMDKQGRPLLKELSDGGDLVFK--GRPVIELEESIFDVGDETKFIVSDFKTLIKFMDRKQYLIDQSKEAGY 352 (421) T ss_pred ---------hhcCCCceeecCcCCCCCceec--ceeeEEeccccccCCCceEEEEEeccccEEEEEecceEEEeeccccc Confidence 777777776632 1111111 32221 1234455442 2 2 446665544444445 Q ss_pred cCceEEEee--eecccccccCcc---------eeeeeC Q lcl|Aclame:pro 367 NSNMILVET--LTSGHVETYNAG---------AVITVS 393 (393) Q Consensus 367 ns~~i~~~~--~~~g~~~~~n~~---------~~~~v~ 393 (393) ..|++.++. ...|-+-.+++. ++++-. T Consensus 353 ~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~a~v~~~ 390 (421) T protein:vir:13 353 TKNETIARIIERFDVNSPLDKSSDAEKIRKFGVIVKLQ 390 (421) T ss_pred ccCeeEEEEEeeecceeecchhhheeeecccceeeccc Confidence 566555444 444555444442 222221 No 46 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=99.52 E-value=5e-15 Score=99.00 Aligned_cols=354 Identities=14% Similarity=0.166 Sum_probs=177.3 Q ss_pred CCc--------------------chhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHH------HHHHHHHHHH Q lcl|Aclame:pro 1 MNK--------------------PDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQE------LEKTLSENSI 54 (393) Q Consensus 1 ~~k--------------------~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skiee------lektis~l~a 54 (393) ||| --++.++...+++++.++++...++.+.-+.+. .+..++. 
....++++.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~l~e~ra~~~~e~~~l~~~~~~~~~~~k~~~~~-~~~~~~~~~~~~e~~~~~~~~~~ 79 (425) T protein:vir:10 1 MSKKLLIAVLTAALTGPVGAVPRGIISVRAEGPTEVKALIENLQKAFHDFKAEHTK-QLDAVKAGLPTSDALAKVDKVSA 79 (425) T ss_pred CchhHHHHhhHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHhhhccHHHHHHHHHHHH Confidence 444 333344444455555555554444443221110 0111110 1111222222 Q ss_pred HHHHHHHHHHhhhh---hhcchhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHh--------CccchhhhHhh Q lcl|Aclame:pro 55 EIIKIENELNAQEE---KPKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAE--------NGVTITDTTFQ 123 (393) Q Consensus 55 Ei~k~enel~~~~E---k~K~k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~e--------kgV~~qd~~ei 123 (393) +++..+.++..... +.+..-.-.+.+++ .+.+++|...|.. .|. ..+--.+ T Consensus 80 ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------~~~~~af~~~l~~~e~~~al~~~t-~~~gG~l 141 (425) T protein:vir:10 80 DLEALQAAVDEANIKIAAAQMGANGVKPLRD-----------------PEYTEAFKAHVKRGDVQAALNKGE-DSEGGYL 141 (425) T ss_pred HHHHHHHHHHHHHHHHHhhhccccccccccc-----------------HHHHHHHHHHhhhhhhHHHhhcCc-CCCCcee Confidence 22222222111100 00000000011111 1334444433321 121 1222237 Q ss_pred cchhHHHHHHHHHHhhCccccceeeecccceeEEEeecc--ccccceecccchhhhhh-hhhhhhhccHHHHHHHHHHHH Q lcl|Aclame:pro 124 LPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS--SNEAQVHKDGQTKTEQA-ATLTIDTLEPVMVYKLQSLAE 200 (393) Q Consensus 124 LP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n--a~~a~GHk~ga~Kk~q~-~~le~~ti~p~~VYkkq~Lad 200 (393) +|..+..-|-..++++.++++...+.+.+.....+-..+ .+..|. .-|..+.+.. 
.+|...++.|..++-+..+.+ T Consensus 142 vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~~~~~~~~~a~wv-~E~~~~~~~~~~~f~~v~~~~~k~~~~i~iS~ 220 (425) T protein:vir:10 142 TPIEWDRTITNKLVLISPMRQLCRVQPVSKAGFSKLFNMGGTTSGWV-GEASQRPQTNAATFQPLSFASGEIYANPAATQ 220 (425) T ss_pred ccHhHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEcCCcceeee-ccccccccccccccceeeeeheeeEeehHhHH Confidence 899998889999999889888767666555544444433 233343 2334555554 479999999988877766644 Q ss_pred HHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhh--------hhhhhhhhhccCCCCCHHH Q lcl|Aclame:pro 201 RVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKK--------IKKITTKAKSAGKTPFADA 272 (393) Q Consensus 201 ~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~--------ik~it~~at~~~~t~~~da 272 (393) .+.+ .+.-++.+||+++|+.++- +.++.++++|||++... -+..+... ...+....+..+.+...|+ T Consensus 221 ell~--ds~~~l~~~i~~~la~ai~-~~~d~~~l~G~G~~~p~--Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 295 (425) T protein:vir:10 221 QILD--DAEIDLESWLATEVQTEFA-KQEGKAFLAGDGTNKPN--GLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDG 295 (425) T ss_pred HHHh--cchhHHHHHHHHHHHHHHH-HHHHhhhhcccCCCCcc--eeeeccccccccccccccccccccccccccccHHH Confidence 3333 2224679999999999998 79999999999974210 01111000 0000001122234455677 Q ss_pred HHhhh-ceeeecCCCceEEEecchhhhHhhhhhccccccceeeecCCCeEEEeeecc----cchhhcccchh-------- Q lcl|Aclame:pro 273 IEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASEVGVD----EIIVYTGSKAV-------- 339 (393) Q Consensus 273 l~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~v~~~----~~~~~tG~k~~-------- 339 (393) |...+ ++...-.++.++++++.++.+|+- |+|.+|.+-+.-.+. 
...+ |-..+ T Consensus 296 l~~l~~~l~~~~~~~a~~vmn~~~~~~L~~------------lkD~~G~~l~~~~~~~g~~~~l~--G~PV~~~~~~p~~ 361 (425) T protein:vir:10 296 IIDLVYDLPSAFTGNARFAMNRNTQRQVRK------------LKDGQGNYLWQPSYVAGQPATLA--GYPVTEVPDMPDV 361 (425) T ss_pred HHHHHhhhhhhhccCCEEEEchHHHHHHHH------------hhcCCCceeeccCccCCCCceec--ceeeEEecCcCCc Confidence 76655 333333456678999998888776 677777765421111 1111 32111 Q ss_pred ----ceeeeecccce--e-cccceeeecceeEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 340 ----KPTVLVDQKYH--I-DMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 340 ----~ptv~vD~k~~--~-~~~~~~~~~s~~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) .|.++.|-+.+ + +-+|++....--+..+.-.|....+..|.|--|++-++++++ T Consensus 362 ~~~~~~i~~Gd~~~~~~i~~~~~~~v~~d~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~ 422 (425) T protein:vir:10 362 AANSTPILFGDFQQTYLIIDRIGVRVLRDPYTAKPYVLFYTTKRVGGGLLNPEPMRAMKVA 422 (425) T ss_pred cCCccEEEEEehhccEEEEEecceEEEecccccCCcEEEEEEEEeccEeecccceEEEEee Confidence 23444454332 2 445555433333333333455666788999999999999998 No 47 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=99.51 E-value=9.7e-15 Score=97.40 Aligned_cols=362 Identities=16% Similarity=0.178 Sum_probs=167.0 Q ss_pred cchhhHHHHHHHHHHHhhHHHHh----hhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhh---hhhcchhH Q lcl|Aclame:pro 3 KPDLIEKQNRLAELKENNVSLKS----QISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQE---EKPKGKDK 75 (393) Q Consensus 3 k~d~~ekq~eLa~lK~~~~~~~s----~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~---Ek~K~k~e 75 (393) =.|+.+.++++.++......++. ++++++-+.. ....+++.++..+.++.......+.++.... ...+.. . 
T Consensus 1 l~~~k~l~~~i~e~~~~~~~~k~~~~~~~~~~e~~~~-~l~~~~e~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~-~ 78 (407) T protein:vir:48 1 MADVKDVEQVAQELQRKFDDFKEKNDKRIDAIEQEKG-KLAGEVETLNGKLAELENLKSDLEAELAEVKRPAGGTQNK-V 78 (407) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc-h Confidence 12233333333333222222221 1221110000 0012222222222222222211111111100 000111 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeeccccee Q lcl|Aclame:pro 76 MTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALL 155 (393) Q Consensus 76 mtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a 155 (393) ..+ .++|...| +.... ..++...+...|..- . ..|--.++|..+..-|-..++++.++++..++....... T Consensus 79 ~~e---~~~a~~~~---l~~g~-~~~~~~~e~~a~~~~-t-~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~ 149 (407) T protein:vir:48 79 ASE---HKEAFIGF---MRKGR-EDGLRELERKALQVG-N-DEDGGYAIPEELDRTILTLLKDEVVMRQEATVITLGGSD 149 (407) T ss_pred hhH---HHHHHHHH---Hhccc-hhhhhHHHHHhhhcc-c-CCCCcccccHhHHHHHHHHHHhhhhhhhhceeeecCCCc Confidence 112 23333333 22111 123433333333221 1 112223689999999999999999998866654444443 Q ss_pred EEEeecc--ccccceecccchhhhhh-hhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcc Q lcl|Aclame:pro 156 VSRSFDS--SNEAQVHKDGQTKTEQA-ATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLA 232 (393) Q Consensus 156 ~~i~l~n--a~~a~GHk~ga~Kk~q~-~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rA 232 (393) ..+-... ..-+| ..-|..+.++. 
-+|...++.+..++....+.+.+.+ .+..++.+|++++|+.++- +.++.+ T Consensus 150 ~~~~~~~~~~~a~~-v~E~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~l~~~i~-~~~~~a 225 (407) T protein:vir:48 150 YKKLVNLGGTTSGW-VGETDARPETATSKLGLIEPFMGEIYGNPQATQKMLD--DAFFNVEDWINSELALEFA-EQEEIA 225 (407) T ss_pred eEEEEecCCcceee-ecccccccccccccceeEEeeeeeeEeehhhHHHHHh--cchHHHHHHHHHHHHHHHH-HHHHhh Confidence 3433322 23333 23344555554 4788888888777666555333322 2335679999999999998 799999 Q ss_pred eeeccCCCccc---cchhhh--h-hhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhhhhc Q lcl|Aclame:pro 233 LVEGDGTNGFK---SIDKEA--D-VKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQ 305 (393) Q Consensus 233 vv~gDG~~~t~---~~~~e~--D-~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~ 305 (393) +++|||++... ..+... + ...+..+....+..+.+-..|+|...+ .....-..+..+++++.++..|+- T Consensus 226 ~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a~~v~n~~~~~~L~~---- 301 (407) T protein:vir:48 226 FTSGDGSKKPKGFLAYESTDEDDKTRAFGKLQHIASGAASGVTADAIIKLIYTLRKAHRSGAKFMMNNSSLFAIRL---- 301 (407) T ss_pred hhccCCCCccceeeecccccccccccccccccccccccccccChHHHHHHHHhhchhhhcCCEEEEcHHHHHHHHH---- Confidence 99999984210 000000 0 000111111223334444567776555 222222345578888888877766 Q ss_pred cccccceeeecCCCeEEEeeec--cc--chhhcccchhc----e--------eeeecccc--ee-cccceeeecceeEee Q lcl|Aclame:pro 306 ATANANVRIKNDDTEIASEVGV--DE--IIVYTGSKAVK----P--------TVLVDQKY--HI-DMQDLTKVDAFEWKT 366 (393) Q Consensus 306 ~~~~a~~~l~~~~~~~~~~v~~--~~--~~~~tG~k~~~----p--------tv~vD~k~--~~-~~~~~~~~~s~~~~~ 366 (393) |++++|.+.+--++ |. ..+ |-..+. | .++-|-++ .+ +-.|++....--+.. 
T Consensus 302 --------lkD~~Gr~l~~~~~~~g~~~~l~--G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~i~~d~~~~~ 371 (407) T protein:vir:48 302 --------LKDNDGNYLWRPGIELGQPSSLA--GYGIVENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRDPYTNK 371 (407) T ss_pred --------hhccCCceeeccCcCCCCCceec--ceeeEEecCcCCccCCccEEEEEeccccEEEEEeeceEEEeeccccC Confidence 77888876542111 11 111 322111 1 11223221 11 223333222212233 Q ss_pred cCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 367 NSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 367 ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) +.-.|.+.....|-+.-+.+-++++|+ T Consensus 372 ~~~~~~~~~r~d~~v~~~~a~~~l~~~ 398 (407) T protein:vir:48 372 PFVGFYTTKRTGGMLVDSQAIKLMKIG 398 (407) T ss_pred CcEEEEEEEEeccEEecccceEEEEee Confidence 444455666788888888888888887 No 48 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=99.51 E-value=4.2e-15 Score=99.40 Aligned_cols=365 Identities=12% Similarity=0.084 Sum_probs=181.6 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHh-hhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhh---------- Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKS-QISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEK---------- 69 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s-~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~Ek---------- 69 (393) ||.-+|.|+.+++. +.+..+.. +-++-.+. +.+ ..+++++++.+.+|+.+|+..+. 
++...++ T Consensus 1 M~i~eL~e~r~~~~---~~~~~l~~~~~e~~~lt-~ee-~~~~~~l~~ei~~l~~~I~~~e~-~~~~~~~~~~~~~~~~~ 74 (435) T protein:vir:14 1 MNVNELRRERAAVN---QRVQALAQIEVGGTALS-VEQ-QAEFDQLSSKFSELTAQIERAEA-AERMAAAAAVPVDPNPT 74 (435) T ss_pred CCHHHHHHHHHHHH---HHHHHHHHHHhccCCCC-HHH-HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHhhcccccchhh Confidence 88777776665554 33333322 22222222 222 36889999999999998865332 1111100 Q ss_pred ---------hcchhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHH-------HhCccch-hhh--HhhcchhHHH Q lcl|Aclame:pro 70 ---------PKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKL-------AENGVTI-TDT--TFQLPRKLVE 130 (393) Q Consensus 70 ---------~K~k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L-------~ekgV~~-qd~--~eiLP~~ii~ 130 (393) ...+....++-. .+...|.+-+....| +.+.++.... ..+.+.+ ++. -.++|..+.. T Consensus 75 ~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~ 150 (435) T protein:vir:14 75 AVAAPAAAPVHAQPKALEVKG--AKMARMVRALAAARG--DAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSS 150 (435) T ss_pred hhhhccccccccccchhhhhH--HHHHHHHHHHHhhcc--hhhHHHHHHHhhhhhhhhhhhcccCCcCCCccccchhHHH Confidence 001101111100 111223333333333 2222222222 2222211 111 1268988777 Q ss_pred HHHHHHHhhCccccc-eeeecccceeEEEeec-c-ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcC Q lcl|Aclame:pro 131 SINTALLNTNPVFKV-FHVTNVGALLVSRSFD-S-SNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQM 207 (393) Q Consensus 131 AIe~A~ed~d~vl~~-fhV~n~~~~a~~i~l~-n-a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g 207 (393) .|-+.+.++..++.. .++.....+...+.-. . 
...+| .--|..+++...+|...++.|..++....+.+.+-+-.+ T Consensus 151 ~ii~~l~~~~~i~~~~~~~~~~~~~~~~~p~~~~~~~a~~-v~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~ 229 (435) T protein:vir:14 151 EVIELLRPKSVVRKLGARTLPLSNGNITIPRLKGGAIVGY-IGADTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAG 229 (435) T ss_pred HHHHHHhhhchhhhhcceeeecCCCceEEEEEeCCcceee-eccCccccccccceeEEEeeeEEEEEeehhhHHHHHhhc Confidence 777777776766553 2232222222233222 2 23333 345667888889999999999888888767444332222 Q ss_pred chhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhh---hceeeecC Q lcl|Aclame:pro 208 SYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEA---VDFVRPTA 284 (393) Q Consensus 208 ~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Ea---ld~a~~~a 284 (393) ....+-+|++++|+..+- |.++.+++.|||+... +.-+-.......+++.+...+.+...+++... +.-+.+.. T Consensus 230 ~~~~l~~~i~~~l~~ai~-~~~d~a~l~G~G~~~~--p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~ 306 (435) T protein:vir:14 230 VNPNVDQIVVGDLTAAIG-AREDKAFIRDDGTANT--PKGLRFWALPSNVITASDASTLQKIETDLGKVILALENADANL 306 (435) T ss_pred cCHHHHHHHHHHHHHHHH-HHHHHHhhccCCCCcc--ccceeecccccceeccccccchhhHHHHHHHHHHHhhhccccc Confidence 223578999999999999 7999999999998420 00010000011111111111111111222222 22233333 Q ss_pred CCceEEEecchhhhHhhhhhccccccceeeecCCCeEEEeeecccchhhcccch---------------hceeeeecccc Q lcl|Aclame:pro 285 GRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKA---------------VKPTVLVDQKY 349 (393) Q Consensus 285 ~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~v~~~~~~~~tG~k~---------------~~ptv~vD~k~ 349 (393) .+..+++++.++.+|+. |++++|.+-++-..+.... |... ..+.++.|-+. 
T Consensus 307 ~~~~~v~n~~~~~~L~~------------lkd~~G~~l~~~~~~g~l~--G~Pv~~~~~~p~~~~~~~~~~~i~~gd~s~ 372 (435) T protein:vir:14 307 TQPGWIMAPRTFRFLEG------------LRDGNGNKVYPELANGMLK--GYPVGKTTQVPINLGETGKESEIYFTDFGD 372 (435) T ss_pred cCCEEEEcHHHHHHHHH------------hhccCCceeccCCCCCeee--cceeEeeccccccccCCCccceEEEeeccc Confidence 45567889999988876 6666666655422222211 2111 11233344333 Q ss_pred ee--cccceeeeccee-------------EeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 350 HI--DMQDLTKVDAFE-------------WKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 350 ~~--~~~~~~~~~s~~-------------~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) ++ +-+|++-.-+.. |..|+-.|.++.+..|.+--|.+-++++=+ T Consensus 373 ~~i~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~ 431 (435) T protein:vir:14 373 VFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGV 431 (435) T ss_pred EEEEEecccEEEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecC Confidence 32 222222211111 556778888888999988888876666655 No 49 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=99.50 E-value=3.7e-15 Score=99.67 Aligned_cols=349 Identities=14% Similarity=0.092 Sum_probs=178.9 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcchhHHHHHH Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMTNFI 80 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~Ek~K~k~emtEfL 80 (393) |.. |.|+.++..+++. .+...++ ...+.++....+.++ ....+++.....+ ... 
T Consensus 1 ik~--L~e~~~e~~e~~~---~~~~~~~---------~~~~~~e~~~~~~~~---~~~~~~~~~~~~~---------~~~ 54 (390) T protein:vir:40 1 MNN--LDKKDSETLNIST---AFLNAIK---------EGATEAEQVTAFTNM---AEQIQNNIIAQAR---------KEV 54 (390) T ss_pred Cch--HHHHHHHHHHHHH---HHHHHHh---------hhhhHHHHHHHHHHH---HHHHHHHHHHHHH---------HHH Confidence 322 1122222222221 1111111 101111111111111 1111111111100 000 Q ss_pred HHHHHHHHHHHHHH--HccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEE Q lcl|Aclame:pro 81 ESQNAVTEFFDVLK--KNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSR 158 (393) Q Consensus 81 kTkqA~~dya~ll~--~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i 158 (393) + ....+.+-... ......+.++.|++.+...+. +|-..++|..+...|-+.++++.++++.+++.+.......+ T Consensus 55 ~--~~~~~~~~~~~~~~~~l~~~~r~~~~~~~~~~~~--~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~i 130 (390) T protein:vir:40 55 N--REMNDNNVLASRGANALTSDESKYYNEVIAGNGF--AGVTALLPPTVFERVFEDLTVEHPLLSKINFVNTTATTEWI 130 (390) T ss_pred H--HHHHHHHHHHhcCchhccHHHHHHHHHHHhccCc--ccCcccccHHHHHHHHHHHHhhhhhhhhceeeecCCceeEE Confidence 0 01111111111 111235778888887766555 55566899999999999999999999987877776665544 Q ss_pred eecc--ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeec Q lcl|Aclame:pro 159 SFDS--SNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG 236 (393) Q Consensus 159 ~l~n--a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~g 236 (393) --.+ ....|+.-.+..++....+|...++.|..+|.+..+.+.+.+- +.-++.+|++++|+.++- +.++.+++.| T Consensus 131 ~~~~~~~~a~~~~E~~~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~d--s~~~l~~~i~~~la~~i~-~~~~~a~l~G 207 (390) T protein:vir:40 131 ISVGDVATAWWGPLCAEIKEVLDNGFDKIQTGMYKLSAYIPVCNAMLDL--GPSWLDQYVRTILGEAMA-LGLEAGIVNG 207 (390) T ss_pred EEEcCCcceeeeccccccCccccccceeeEeeeeeEEEeehhhHHHHhc--chHHHHHHHHHHHHHHHH-HHHHhhhhcc Confidence 4433 3466776656667777889999999999888887774333332 224579999999999999 7999999999 Q ss_pred 
cCCCccccchhhhhh---hhhhhhhhhhhccCCC---CCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhhhhccccc Q lcl|Aclame:pro 237 DGTNGFKSIDKEADV---KKIKKITTKAKSAGKT---PFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQATAN 309 (393) Q Consensus 237 DG~~~t~~~~~e~D~---~~ik~it~~at~~~~t---~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~ 309 (393) ||++.. .-+..+. ..-.....++..++.- ...+.+.-++ +-...-.++.++++++.+....+..+| T Consensus 208 ~G~~~P--~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~~----- 280 (390) T protein:vir:40 208 SGKDQP--IGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAAT----- 280 (390) T ss_pred cCCCcc--ceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHHh----- Confidence 997421 0011110 0000001111111110 0111223333 333344567788998888655555443 Q ss_pred cceeeecCCCeEEEeeecccchhhcccchh--ceeeeeccccee--cccceeeecceeEee--cCceEEEeeeecccccc Q lcl|Aclame:pro 310 ANVRIKNDDTEIASEVGVDEIIVYTGSKAV--KPTVLVDQKYHI--DMQDLTKVDAFEWKT--NSNMILVETLTSGHVET 383 (393) Q Consensus 310 a~~~l~~~~~~~~~~v~~~~~~~~tG~k~~--~ptv~vD~k~~~--~~~~~~~~~s~~~~~--ns~~i~~~~~~~g~~~~ 383 (393) .+++.+|.|..+...-...++. 
+..| ...++.|-+.|+ +-+|++---+-+-.| ++-.+.+.....|.+.- T Consensus 281 ---~~~d~~G~~v~~~~~~g~pvv~-~~~~p~~~i~~Gd~s~~~i~~~~~~~v~~~~~~~f~~~~~~~r~~~r~dg~v~~ 356 (390) T protein:vir:40 281 ---SYMTPQGVWVTGILPVPLEIVQ-SVAVPVGKAVAGRAKDYFMGIGSEQVIRTSTEYRLLDDETLYYAKQYANGRPKD 356 (390) T ss_pred ---hccCCCCccccccCCCceeEEE-cCCCCCCcEEEEeeceEEEEeecceEEEecchhhhhcCcEEEEEEEEeCCEEec Confidence 3788888876544322221110 0000 023344444433 334443322222233 44445666778888888 Q ss_pred cCcceeeeeC Q lcl|Aclame:pro 384 YNAGAVITVS 393 (393) Q Consensus 384 ~n~~~~~~v~ 393 (393) +++-.+..++ T Consensus 357 ~~A~~~l~~~ 366 (390) T protein:vir:40 357 NSSFLVFDIT 366 (390) T ss_pred ccceEEEEee Confidence 8888888777 No 50 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=99.48 E-value=1.2e-14 Score=96.91 Aligned_cols=369 Identities=12% Similarity=0.088 Sum_probs=172.9 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhh--h-----h--c Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE--K-----P--K 71 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~E--k-----~--K 71 (393) |+| +.|.+.+.+++.+.+..+..+...=+.- -.++..+++++++.+++|+.+|+..+.+-+..++ + . 
+ T Consensus 1 M~k--l~~L~e~r~~l~~~~~~l~~~~~e~~~l-t~ee~~~~~~l~~e~~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~ 77 (428) T protein:vir:10 1 MPQ--IEELRRQRAGINEQIQALATIEATNGTL-TAEQLTEFAGLQQQFTDISAKMDRMEATERAAALVAKPVKATQHGP 77 (428) T ss_pred Cch--HHHHHHHHHHHHHHHHHHHHHHhccCCC-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhchhhcc Confidence 988 4555555566655555555432111100 1123467889999999999998753322222111 0 0 0 Q ss_pred chhHHHHHHHHHH-HHHHHHHHHHHccCC-hhHHHHHHHHH----HhCccchhhh--HhhcchhHHHHHHHHHHhhCccc Q lcl|Aclame:pro 72 GKDKMTNFIESQN-AVTEFFDVLKKNSGK-SEIKNAWNAKL----AENGVTITDT--TFQLPRKLVESINTALLNTNPVF 143 (393) Q Consensus 72 ~k~emtEfLkTkq-A~~dya~ll~~nqg~-ke~k~AW~a~L----~ekgV~~qd~--~eiLP~~ii~AIe~A~ed~d~vl 143 (393) ......+....+. ....++.-+....|. +.....+.... ....+.+... -.++|..+..-|=.-++++..++ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~liP~~~~~~ii~~l~~~~~l~ 157 (428) T protein:vir:10 78 AVIVKAEPKQYTGAGMTRMVMSIAAAQGNLQDAAKFASDELNDQSVSMAISTAAGSGGVLIPQNIHSEVIELLRDRTIVR 157 (428) T ss_pred ccccccccchhhhHHHHHHHHHHHHhhhhHHHHHHHhhhhhhhhhHhhhhcccccCCccccchhHHHHHHHHHhhhchhh Confidence 0000001111111 111122222222221 12222221111 1111111111 12578777666656667666665 Q ss_pred cc-eeeecccceeEEEeecc--ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHH Q lcl|Aclame:pro 144 KV-FHVTNVGALLVSRSFDS--SNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAEL 220 (393) Q Consensus 144 ~~-fhV~n~~~~a~~i~l~n--a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~EL 220 (393) +. 
.++-.-..+-..+.-.+ ....| ..-|.++++...+|...++.|..++....+.+.+-+ .+--++.+|++++| T Consensus 158 ~~~~~~~~~~~g~~~~p~~~~~~~a~~-v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~is~ell~--ds~~~l~~~i~~~l 234 (428) T protein:vir:10 158 KLGARSIPLPNGNMSLPRLAGGATASY-TGENQDAKVSEARFDDVKLTAKTMIAMVPISNALIG--RAGFNVEQLVLQDI 234 (428) T ss_pred hhcceeeecCCcceEEEEEeCCcceee-eccCccccccccceeeEEeeeEEEEEeehhhHHHHh--hhhHHHHHHHHHHH Confidence 53 22221111212222122 22333 345788899999999999999888887777554432 22246799999999 Q ss_pred HHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCC-CCHHHHHhhh-ce---eeecCCCceEEEecch Q lcl|Aclame:pro 221 TQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKT-PFADAIEEAV-DF---VRPTAGRRYLIVKTED 295 (393) Q Consensus 221 aq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t-~~~dal~Eal-d~---a~~~a~~~~l~i~~~d 295 (393) +.++- +.++.+++.|||++. ..--+.........+++++...+.+ .+.+....++ .+ ..+-......+++..+ T Consensus 235 ~~ai~-~~~d~~~l~G~G~~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~ 312 (428) T protein:vir:10 235 LTAIS-VREDKAFMRDDGTGD-TPIGMKARATQWNRLLPWAADAAVNLDTIDTYLDSIILMSMDGNSNMISSGWGMSNRT 312 (428) T ss_pred HHHHH-HHHHHHHhccCCCCc-cccccccccccccccccccccccccHHHHHHHHHHHHHhhhccccccccCEEEEcHHH Confidence 99999 799999999999742 1111111211112222222222222 2333444444 11 2223334567888888 Q ss_pred hhhHhhhhhccccccceeeecCCCeEEEeeecccchhhcccchh---------------ceeeeeccccee--cccceee Q lcl|Aclame:pro 296 RKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKAV---------------KPTVLVDQKYHI--DMQDLTK 358 (393) Q Consensus 296 ~~a~~~~~~~~~~~a~~~l~~~~~~~~~~v~~~~~~~~tG~k~~---------------~ptv~vD~k~~~--~~~~~~~ 358 (393) +.+|+- |++++|.+.++-..+.... 
|...+ .+.++.|-..++ +-++++- T Consensus 313 ~~~L~~------------lkd~~G~~i~~~~~~g~l~--G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~i~i 378 (428) T protein:vir:10 313 YMKLFG------------LRDGNGNKVYPEMAQGMLK--GYPIQRTSAIPANLGEGGKESEIYFADFNDVVIGEDGNMKV 378 (428) T ss_pred HHHHHH------------hhccCCceeccCCCCCeee--ceeeEEeccccccccCCCccceEEEEecceEEEEEecceEE Confidence 888766 6666766654321111111 22211 123344444333 2233332 Q ss_pred eccee-------------EeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 359 VDAFE-------------WKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 359 ~~s~~-------------~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) .-+.. |..|+-.|.++....+.+ ++..++..+. T Consensus 379 ~~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v--~~p~a~~~~t 424 (428) T protein:vir:10 379 DFSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGF--RHPEGLVLGT 424 (428) T ss_pred EeecccccccccccccchhhcchhheeeeeeeCcee--eccceEEEEe Confidence 21111 233344444444444444 4444444444 No 51 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=99.47 E-value=2.3e-14 Score=95.37 Aligned_cols=365 Identities=12% Similarity=0.094 Sum_probs=180.2 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHh-hhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhh--hh------hc Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKS-QISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQE--EK------PK 71 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s-~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~--Ek------~K 71 (393) ||--.|.|+.+++ .+.+.++.. +-++-.+.. ++.+++++++..+++++.+|...+....... ++ +. 
T Consensus 1 M~l~eL~~~r~~~---~~~~~~l~~~~~e~~~l~~--ee~~~~~~l~~ei~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~ 75 (435) T protein:vir:80 1 MNVNELRRERAAV---NQRVQALAQIEVGGTALSV--EQQAEFDQLSSKFNELTAQIERAEAAERMAAAAAVPVDPNPAA 75 (435) T ss_pred CCHHHHHHHHHHH---HHHHHHHHHHHhccCCCCH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccchhhh Confidence 8776666655544 444444322 222222221 2236788899999999988875432111111 10 00 Q ss_pred chhHHHH-------HHHHHH-HHHHHHHHHHHccCChhHHHHHHHHHH-------hCccch---hhhHhhcchhHHHHHH Q lcl|Aclame:pro 72 GKDKMTN-------FIESQN-AVTEFFDVLKKNSGKSEIKNAWNAKLA-------ENGVTI---TDTTFQLPRKLVESIN 133 (393) Q Consensus 72 ~k~emtE-------fLkTkq-A~~dya~ll~~nqg~ke~k~AW~a~L~-------ekgV~~---qd~~eiLP~~ii~AIe 133 (393) ....... -...+. +...|++-+...++ +...+....+. .+.+.+ .+--.++|..+..-|- T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lvP~~~~~~ii 153 (435) T protein:vir:80 76 VTASAAAPVYAQPKAPEVKGAKMARMVRALAAARG--DAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVI 153 (435) T ss_pred hccccccccccccchhhhhHHHHHHHHHHHHhccc--hhHHHHHHHHhhhhhhhhhhhhcccCCCCCccccchhHHHHHH Confidence 0000000 000111 11122222222222 22222221111 111111 1111267888777777 Q ss_pred HHHHhhCccccceeeeccc--ceeEEEee-ccc-cccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCch Q lcl|Aclame:pro 134 TALLNTNPVFKVFHVTNVG--ALLVSRSF-DSS-NEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSY 209 (393) Q Consensus 134 ~A~ed~d~vl~~fhV~n~~--~~a~~i~l-~na-~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~y 209 (393) +.++.+.++++ +.+...| .+...+.- ... ...|. 
--|..+++...+|...++.|..++....+.+.+-+-.+.- T Consensus 154 ~~l~~~~~i~~-~~~~~v~~~~~~~~~p~~~~~~~a~~v-~E~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~ 231 (435) T protein:vir:80 154 ELLRPKSVVRK-LGARTLPLSNGNITIPRLKGGAIVGYI-GADTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVN 231 (435) T ss_pred HHHhhhchhhh-ccceeeecCCCceEEEEEeCCcceeee-ccCccccccccceeeEEEeeEEEEEeehhhHHHHHhhccc Confidence 77776666655 3222222 22223322 222 23332 3466788889999999999998888877744443222221 Q ss_pred hHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCC--HHHHHhhh---ceeeecC Q lcl|Aclame:pro 210 SELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPF--ADAIEEAV---DFVRPTA 284 (393) Q Consensus 210 galvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~--~dal~Eal---d~a~~~a 284 (393) .++.+|++++|+.++- ++.+.+++.|||++.. .--+..+.. +...+ ....+.|.. ..++..++ .-+.+.. T Consensus 232 ~~l~~~i~~~l~~a~~-~~~d~a~l~G~G~~~~-p~Gi~~~~~-~~~~~--~~~~~~~~~~~~~d~~~~~~~~~~~~~~~ 306 (435) T protein:vir:80 232 PNVDQIVVGDLTAAIG-AREDKAFIRDDGTANT-PKGLRFWAL-PGNVI--TASDGSTLQKIETDLGKAILALENADANL 306 (435) T ss_pred HHHHHHHHHHHHHHHH-HHHHHHhhccCCCCCc-ccceeeccc-cccee--ecccccchhhHHHHHHHHHHHhhcccccc Confidence 3578999999999999 7999999999997421 000111100 00111 111111111 11233333 2222333 Q ss_pred CCceEEEecchhhhHhhhhhccccccceeeecCCCeEEEeeecccchhhcccc---------------hhceeeeecccc Q lcl|Aclame:pro 285 GRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSK---------------AVKPTVLVDQKY 349 (393) Q Consensus 285 ~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~v~~~~~~~~tG~k---------------~~~ptv~vD~k~ 349 (393) .+..+++++.++.+|+- |++++|.+-++-..++..+ |-. 
-..+.++.|-.+ T Consensus 307 ~~~~~vmn~~~~~~L~~------------lkd~~G~~l~~~~~~~~l~--G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~ 372 (435) T protein:vir:80 307 TQPGWIMAPRTFRFLEG------------LRDGNGNKVYPELANGMLK--GYPVGKTTQVPINLGEAGKESEIYFTDFGD 372 (435) T ss_pred ccCEEEEcHHHHHHHHh------------hhccCCceeccCCCCCeEe--eeeeEEeccccccccCCCCcceEEEEEccc Confidence 45567889999888766 6666666655432222221 211 011334444443 Q ss_pred ee--cccceeeecce-------------eEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 350 HI--DMQDLTKVDAF-------------EWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 350 ~~--~~~~~~~~~s~-------------~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) ++ +-+|++---+. -|..|+..|.++....|.+--|++-++++=. T Consensus 373 ~~i~~~~~~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~ 431 (435) T protein:vir:80 373 VFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGV 431 (435) T ss_pred EEEEeecceEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEecc Confidence 33 22222211111 1667888899999999999999888877755 No 52 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=99.45 E-value=3.1e-14 Score=94.64 Aligned_cols=363 Identities=13% Similarity=0.090 Sum_probs=184.4 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhh------hhhcchh Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQE------EKPKGKD 74 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~------Ek~K~k~ 74 (393) |++.=+.+...+.+++...+.++..+.+.....+ ++..+.++++..+.+++.+|++.+....... ..++... 
T Consensus 1 m~~~~l~~l~e~r~~~~~e~~~l~~~~~~~~~~~--e~~~~~~~l~~e~~~l~~~i~~~~e~~~~~~~~~~~~~~~~~~~ 78 (392) T protein:vir:13 1 MDATTLSANFEARERATAELRSLTDEFAGKEMTA--EAREKEERLLTAVADFDGRIKRGIDAIKATDAVTSLLSGLQGSG 78 (392) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcc Confidence 8888777777777777666666655554433222 2234566777777777777654222111111 1111110 Q ss_pred HHHHHHHHHHHHHHHHHHHHHccCC-hhHHHHHHHHHHhCccchhhhHhhcchhHHHH-HHHHHHhhCccccce-eeecc Q lcl|Aclame:pro 75 KMTNFIESQNAVTEFFDVLKKNSGK-SEIKNAWNAKLAENGVTITDTTFQLPRKLVES-INTALLNTNPVFKVF-HVTNV 151 (393) Q Consensus 75 emtEfLkTkqA~~dya~ll~~nqg~-ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~A-Ie~A~ed~d~vl~~f-hV~n~ 151 (393) .- ..+.+..++...+. .|. .+.+.. .-....++.+......++|+.+... |...+. ...++..+ ++... T Consensus 79 ~~----~~~~~~~~~~~~~r--~g~~~~~~~~-~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~-~~~~l~~~~~~~~~ 150 (392) T protein:vir:13 79 SG----AQRSADHDDDAVLR--AGNLGEARSF-EFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVE-RSAIMRGGASTFTT 150 (392) T ss_pred cc----hhhhhhHHHHHHHh--ccchhhhHHH-HhhhhhhcccccCCCccccccchHHHHHHHHh-hhhhhhhcceeeec Confidence 00 00111111211111 111 111111 1122223222222233678776555 444555 45555433 33222 Q ss_pred ---cceeEEEeeccccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 152 ---GALLVSRSFDSSNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKI 228 (393) Q Consensus 152 ---~~~a~~i~l~na~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Ra 228 (393) ..+-..+.-.....+| ..-|.++.+...+|...++.|..++-...+.+-.-+ .+.-++.+||.++|+..+- +. 
T Consensus 151 ~~~~~~~~~~~~~~~~a~~-v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~l~~~i~-~~ 226 (392) T protein:vir:13 151 SDANPMDFTVITGRATAGI-VGETAEIPESYPATTQRSMGGFKYGFASVVSYEFAT--DQVLDLVGFLVSDAGPAIG-DA 226 (392) T ss_pred CCCceeEEEEEcCCcceee-ecccccccccccceeeEEeeeeeEEeeehhHHHHHh--cchHHHHHHHHHHHHHHHH-HH Confidence 1222211112234445 467778889999999999999887777555333322 2223679999999999998 79 Q ss_pred HhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhhhhccc Q lcl|Aclame:pro 229 VDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQAT 307 (393) Q Consensus 229 v~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~ 307 (393) .+.++++|||++.. .-+......... ...+..+.+...|+|...+ .+-..-..+...++++.++.+|+- T Consensus 227 ~d~~~l~G~Gt~~p--~Gil~~~~~~~~--~~~~~~~~~~~~d~l~~~~~~l~~~~~~~a~~v~n~~~~~~l~~------ 296 (392) T protein:vir:13 227 MGRHFLTGTGTGQP--RGILTDATGANA--AFGEADADSKVSDALIDLFHEVPSAYRKNAKFVVNDLRAAQMRK------ 296 (392) T ss_pred HHHHHhcccCCccc--cccccccccccc--cccccccccccHHHHHHHHHhhhhhhhcCCEEEEcHHHHHHHHH------ Confidence 99999999998521 111111110000 0111223445567776655 222222345567889999888776 Q ss_pred cccceeeecCCCeEEEeeecccchhhc--ccchhc-------eeeeecccce-e-cccceeeecceeEeecCc--eEEEe Q lcl|Aclame:pro 308 ANANVRIKNDDTEIASEVGVDEIIVYT--GSKAVK-------PTVLVDQKYH-I-DMQDLTKVDAFEWKTNSN--MILVE 374 (393) Q Consensus 308 ~~a~~~l~~~~~~~~~~v~~~~~~~~t--G~k~~~-------ptv~vD~k~~-~-~~~~~~~~~s~~~~~ns~--~i~~~ 374 (393) |++.+|.+-+--++..-+-+| |.-.+. +.++-|-+.| + +-+|++--.+..-.|..+ .+.+. 
T Consensus 297 ------lkd~~G~~l~~~~~~~g~~~~l~G~Pv~~~~~~~~~~i~~Gdf~~~~i~~~~~~~i~~~~~~~~~~~~~~~r~~ 370 (392) T protein:vir:13 297 ------LKDANGQYLWQSALTVGAPDTFNGKVVETDDGMPADKVLFADLSKYRVRFAGSLRVDRSVDAKFSTDQIVYRFL 370 (392) T ss_pred ------hhccCCceeecCCcCCCCCceecceeeEEcCCCCCCcEEEeeccceeEEeecceEEEeeccccccCCcEEEEEE Confidence 778888765422211111111 322111 2233343222 2 334443332333334444 44566 Q ss_pred eeecccccccCcceeeeeC Q lcl|Aclame:pro 375 TLTSGHVETYNAGAVITVS 393 (393) Q Consensus 375 ~~~~g~~~~~n~~~~~~v~ 393 (393) .+..|.+.-|++..+++|+ T Consensus 371 ~r~d~~~~~~~A~~~~~~~ 389 (392) T protein:vir:13 371 QRADGLLVDARGAKVLTVT 389 (392) T ss_pred EEeccEEecccceEEEEee Confidence 6778888889998888888 No 53 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=99.45 E-value=1.8e-13 Score=90.50 Aligned_cols=365 Identities=17% Similarity=0.178 Sum_probs=179.6 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhh--HHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhh--------hh Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKN--AIEDLPKVQELEKTLSENSIEIIKIENELNAQEE--------KP 70 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~--a~~~~skieelektis~l~aEi~k~enel~~~~E--------k~ 70 (393) ||--.+.| .+.+.+++...++++++..-++.. 
..+.-.+++++.+.++.+..+|.+.+.......+ .+ T Consensus 1 M~l~el~~--~~~~~~~~~~a~l~~~~~~~~~~~ee~~~~~~e~~~l~~~~~~l~~~i~~le~~~~~~~~~~~~~~~~~~ 78 (434) T protein:vir:62 1 MNLKEILN--ASLTRTKSRLAELQGKVEKNEVRSEELAAVKAEVEQLTKEIQTISEELAKLEEKEKEEDPAKKKDDDPEK 78 (434) T ss_pred CCHHHHHH--HHHHHHHHHHHHHHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcchhhh Confidence 77433322 233345666666666666655522 2222356777777777777766442221111110 00 Q ss_pred cchhHHHHH-----HHHHHHHHHHHHHHH---HccC-----ChhHHHHHHHHHHhC---------ccchhhhHhhcchhH Q lcl|Aclame:pro 71 KGKDKMTNF-----IESQNAVTEFFDVLK---KNSG-----KSEIKNAWNAKLAEN---------GVTITDTTFQLPRKL 128 (393) Q Consensus 71 K~k~emtEf-----LkTkqA~~dya~ll~---~nqg-----~ke~k~AW~a~L~ek---------gV~~qd~~eiLP~~i 128 (393) ...+...+. ..+.+.-..+.+.+. ...+ ..+.+++|...|... ++.+.+--.++|..+ T Consensus 79 ~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~r~a~~~~l~~~~~~~e~~a~~~~t~~GG~lvP~~~ 158 (434) T protein:vir:62 79 KEDPTAKENPNEKTELSEEQRSAISASIAAALSTKGHRTNKETEIRSVFANYIVGNIDEKEARALGLVTGNGSVTIPDFL 158 (434) T ss_pred hcchhhhcchhhhHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHHHHHHhccccchhhhhhhcccccccceecchhh Confidence 000000000 000011111112121 1111 136677776655421 222222223689888 Q ss_pred HHHHHHHHHhhCccccceeeecccceeEEEe-ecccccccee---cccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 129 VESINTALLNTNPVFKVFHVTNVGALLVSRS-FDSSNEAQVH---KDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKR 204 (393) Q Consensus 129 i~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~-l~na~~a~GH---k~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~ 204 (393) ...|-..++++.++.+..++.+.... 
..+- +.....+.++ ..|..++....+|...++.+..++.+..+-+.+.+ T Consensus 159 ~~~Ii~~l~~~~~i~~~~~~~~~~~~-~~~p~~~~~~~a~~~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ 237 (434) T protein:vir:62 159 SKEIITYAQEENFLRRLGTGVKTKEN-IKYPVLVKKAEAQGHKNERTNNEMPETDIEFDEIELSPTEFDALATVTKKLLA 237 (434) T ss_pred HHHHHHhhhhhhhhhhhcceeccCCc-eEEEEEecCCcccceecccccccccccccceeeEEeeheeeEeehhhHHHHHh Confidence 88888999998888775554333221 1121 1111222222 33557777888999999999888887666333332 Q ss_pred hcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeec Q lcl|Aclame:pro 205 LQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPT 283 (393) Q Consensus 205 l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~ 283 (393) - +.-++.+|+.++|+..+. ++.+.++++|||++... ... ..-..++ ...+.+...|+|.... ++...- T Consensus 238 d--s~~~l~~~i~~~la~~~~-~~~d~~~l~G~G~~~~~--~g~---~~~~~~~---~~~~~~~~~d~l~~l~~~l~~~~ 306 (434) T protein:vir:62 238 R--TGLPIEQIVMDELKKAYV-RKETQYMVNGDEANNIN--DGA---LAKKAVE---FKTDEKNLYDALVKMKNTPVKEV 306 (434) T ss_pred c--chHHHHHHHHHHHHHHHH-HHHHHHHhccCCCCccc--cce---eeccccc---ccccccchhhHHHHHHhhcchhh Confidence 2 223579999999999999 79999999999985411 011 0001111 1223344567776655 443333 Q ss_pred CCCceEEEecchhhhHhhhhhccccccceeeecCCCeEEEee------ecccchhhcccchhc------------eeee- Q lcl|Aclame:pro 284 AGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASEV------GVDEIIVYTGSKAVK------------PTVL- 344 (393) Q Consensus 284 a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~v------~~~~~~~~tG~k~~~------------ptv~- 344 (393) ..+...++++.++.+|+- |+|++|.+.+.- |.+...+ |...+. 
|.++ T Consensus 307 ~~~a~~v~n~~~~~~L~~------------lkd~~G~~l~~~~~~~~~g~~~tl~--G~pV~~~~~~~~~~~~~~~~i~~ 372 (434) T protein:vir:62 307 RKKARWVLNTAALTKIET------------MKTDDGFPLLRPFNQAEGGIGYTLL--GFPVEEEDAIDIPDSPDTPVFYF 372 (434) T ss_pred hcCCEEEEcHHHHHHHHH------------hhccCCCEeeccCCCccCCCCceec--ceeeEEecCccCccCCCceEEEE Confidence 345677889999998876 788888876531 1112222 322111 1121 Q ss_pred eccccee--cccceeee---cceeEeecCceEEEeeeecccccc-cCcceeeeeC Q lcl|Aclame:pro 345 VDQKYHI--DMQDLTKV---DAFEWKTNSNMILVETLTSGHVET-YNAGAVITVS 393 (393) Q Consensus 345 vD~k~~~--~~~~~~~~---~s~~~~~ns~~i~~~~~~~g~~~~-~n~~~~~~v~ 393 (393) -|-+.|+ +.+|..++ ....+.+++=.+.++....|.+-. |=+..+..+- T Consensus 373 Gdfs~~~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~ 427 (434) T protein:vir:62 373 GDFSKFYIQDVIGSLEVQKLVELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYV 427 (434) T ss_pred eeccceEEEEeeceeEEEeehhhhcccCceEEEEEeeecceeecCcccceEEEEE Confidence 1332221 22221111 111222233235555666666431 3333332221 No 54 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=99.43 E-value=1.7e-13 Score=90.66 Aligned_cols=360 Identities=14% Similarity=0.088 Sum_probs=189.8 Q ss_pred CCcch-hhHHHHHHHHHHHhhHHHHhhhhhhhh------hhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhh---hhh Q lcl|Aclame:pro 1 MNKPD-LIEKQNRLAELKENNVSLKSQISGFEV------KNAIEDLPKVQELEKTLSENSIEIIKIENELNAQE---EKP 70 (393) Q Consensus 1 ~~k~d-~~ekq~eLa~lK~~~~~~~s~i~~~~v------~~a~~~~skieelektis~l~aEi~k~enel~~~~---Ek~ 70 (393) .|+.+ +.|.+.+|.++.+...+++.++..... ++..+.-.+++.++..+.++..+++..+.++.... 
+.+ T Consensus 13 g~~mk~l~el~~~~~e~~~~~~~~~~el~~~~~~~~~~~ee~~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~ 92 (402) T protein:vir:93 13 GNEMPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEA 92 (402) T ss_pred CCCChHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccc Confidence 46666 468888888887776666655532211 22222223445555555555555544444433322 111 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHH-HccCChhHHHHHHHHHHhCcc-chhhhHhhcchhHHHHHHHHHHhhCccccceee Q lcl|Aclame:pro 71 KGKDKMTNFIESQNAVTEFFDVLK-KNSGKSEIKNAWNAKLAENGV-TITDTTFQLPRKLVESINTALLNTNPVFKVFHV 148 (393) Q Consensus 71 K~k~emtEfLkTkqA~~dya~ll~-~nqg~ke~k~AW~a~L~ekgV-~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV 148 (393) ...+.+. -+..++-.+|++-.+ ..... .....+......... ...+--.++|..+...|-..+.+++++....+| T Consensus 93 ~~~~~~~--~~~~~~~~~~~r~~~~~~~~~-~~~~~~~~~~~a~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l~~~~~v 169 (402) T protein:vir:93 93 YQSLSDN--EKMVKAKAEFYRHAILPNEFE-KPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARL 169 (402) T ss_pred CCCCchh--HHHHHHHHHHHHHHHhhhhHH-HHHHhHHHHHhhhccCCCcCCccccchhHHHHHHHhHHhhhhhhhhcee Confidence 1111111 111122223322111 11111 111111211111111 111223378999888888999999999887777 Q ss_pred ecccceeEEEeecc-ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHH-HHHHhhcCchhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 149 TNVGALLVSRSFDS-SNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLA-ERVKRLQMSYSELYNLIVAELTQAIVN 226 (393) Q Consensus 149 ~n~~~~a~~i~l~n-a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~La-d~~k~l~g~ygalvnyvm~ELaq~fI~ 226 (393) .+.+...... +.. ...+..+.-|++.++...+|...++.+..+|.+..+. ++..+ +..++.+||+++|+..|. 
T Consensus 170 ~~~~~~~~p~-~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~i~iS~ell~D---s~~~l~~~i~~~la~~~~- 244 (402) T protein:vir:93 170 TNIKGLEIPR-VSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHG---SDVDLVNWVENALQSGLA- 244 (402) T ss_pred eecCCceeee-eeccCCccccccccccccccccccceeeecceeeeeechhhHHHHhh---hHHHHHHHHHHHHHHHHH- Confidence 7766543322 222 2234556778889999999999999998888775552 33333 334679999999999998 Q ss_pred HHH-hcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhhhh Q lcl|Aclame:pro 227 KIV-DLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELR 304 (393) Q Consensus 227 Rav-~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~ 304 (393) ++. +.++..|+|+....- +.. +...+.++++...|+|..++ ++...-..+..+++++.+..+++.-+. T Consensus 245 ~~e~~~~~~~g~g~g~p~g------~~~----~~~~~~~~~~~~~d~l~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~ 314 (402) T protein:vir:93 245 AKERKDALAVSPKSGLEHM------SFY----NGSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLS 314 (402) T ss_pred HHHHHhHhhcCCCccccce------eee----ccccccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHh Confidence 454 456666777642110 010 01122233455568888877 443333456678888888888877543 Q ss_pred ccccccceeeecCCCeEEEeeecccchhhcccc-----hhceeeeecccce-ecccceeeecceeEeecCceEEEeeeec Q lcl|Aclame:pro 305 QATANANVRIKNDDTEIASEVGVDEIIVYTGSK-----AVKPTVLVDQKYH-IDMQDLTKVDAFEWKTNSNMILVETLTS 378 (393) Q Consensus 305 ~~~~~a~~~l~~~~~~~~~~v~~~~~~~~tG~k-----~~~ptv~vD~k~~-~~~~~~~~~~s~~~~~ns~~i~~~~~~~ 378 (393) .. 
++.+- .|.++..+ |.- .+.|.++=|-+++ +..++...--.....++.-.+....++- T Consensus 315 d~-----------~~~~~--~~~~~~ll--G~PV~~t~~~~~i~~GDf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~D 379 (402) T protein:vir:93 315 NG-----------TTNFF--DTPAEKVF--GKPVVFTDAAVKPIVGDFNYFGINYDGTTYDTDKDVKKGEYLFVLTAWYD 379 (402) T ss_pred cC-----------CCccc--ccCCcccc--ccceEEecCCCceeeechhhhhhhhhhhhhhhhhcccCCceEEEEEEEeC Confidence 32 22221 12222222 321 1234555555543 3222222111122234455566777888 Q ss_pred ccccccCcceeeeeC Q lcl|Aclame:pro 379 GHVETYNAGAVITVS 393 (393) Q Consensus 379 g~~~~~n~~~~~~v~ 393 (393) |-|.-+++-.+.++. T Consensus 380 g~v~~~~A~~~l~ik 394 (402) T protein:vir:93 380 QQRTLDSAFRIAKAK 394 (402) T ss_pred cEEechhheEEEEee Confidence 999888888777775 No 55 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=99.43 E-value=1.3e-13 Score=91.25 Aligned_cols=366 Identities=13% Similarity=0.102 Sum_probs=156.8 Q ss_pred CCcchhhHHHH---------HHHHH-HHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHh----- Q lcl|Aclame:pro 1 MNKPDLIEKQN---------RLAEL-KENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNA----- 65 (393) Q Consensus 1 ~~k~d~~ekq~---------eLa~l-K~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~----- 65 (393) |.+-++-+.|. 
++..+ |+..++...++.....+...+...++++....+..+..++.+ ..++.+ T Consensus 12 ~~~~e~a~~~~~~~~~~k~~e~~~~~ke~~~~~l~~~~e~~~k~~~E~~~~le~~~ee~k~l~ee~~~-~~~~~a~~~e~ 90 (458) T protein:vir:10 12 LGLGDLAKSLEGLTAAQKAQEAERMRKEQEEKELARMNDLVSKAVGEDRKRLEEALELVKSLDEKSKK-SNELFAQTVEK 90 (458) T ss_pred hchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHH Confidence 33333222221 11111 111111111111111110001111111111111111111100 000000 Q ss_pred ----hhhhhcchhHHHHHHHHHHH---------------------HH---HHHHHHHHccCC--hhHHHHHHHHHHhCcc Q lcl|Aclame:pro 66 ----QEEKPKGKDKMTNFIESQNA---------------------VT---EFFDVLKKNSGK--SEIKNAWNAKLAENGV 115 (393) Q Consensus 66 ----~~Ek~K~k~emtEfLkTkqA---------------------~~---dya~ll~~nqg~--ke~k~AW~a~L~ekgV 115 (393) +.+.++ +..+.+.+... .. .|...+. ..+. .+....+...+.. +. T Consensus 91 ~~~~~~~~~~---~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~-~~~~~~~~~~~~~~~a~~~-~~ 165 (458) T protein:vir:10 91 QQETIVGLQD---EIKSLLTAREGRSFVGDSVAKALYGTQENFEDEVEKLVLLSYVM-EKGVFETEHGQRHLKAVNQ-SS 165 (458) T ss_pred HHHHHHHHHH---HHHHHHHHHHhhhhhhhhhhccchhhhhhHHHHHHHHHHHHHHH-hhccchhhhhhhhhhhhhh-cc Confidence 000000 00000000000 00 0000000 0111 0111111111111 12 Q ss_pred chhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEeecc--ccccceecccchh------hhhhhhhhhhhc Q lcl|Aclame:pro 116 TITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS--SNEAQVHKDGQTK------TEQAATLTIDTL 187 (393) Q Consensus 116 ~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n--a~~a~GHk~ga~K------k~q~~~le~~ti 187 (393) .+.+....+|..+..-|-+.+.+++++++.+.+...+.....+...+ ....|+. 
-|..+ +..+.+|...++ T Consensus 166 ~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~v~-e~~~~~~~~~~~~~~~~~~~i~~ 244 (458) T protein:vir:10 166 SVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSKILTMLVEPDAGKATWVA-ASTYGTDTTTGEEVKGALKEIHF 244 (458) T ss_pred cCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCcceEEEEecCCcceeecc-cccccccccccccccccceeeEe Confidence 23344557888888888888888888887666666655544433333 2223322 22222 344567888899 Q ss_pred cHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccc---cchhhhhhhhhhhhhhhhhcc Q lcl|Aclame:pro 188 EPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFK---SIDKEADVKKIKKITTKAKSA 264 (393) Q Consensus 188 ~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~---~~~~e~D~~~ik~it~~at~~ 264 (393) .|..++....+.+.+ ++.+.-++.+|+.++|+.++- |.++.++++|||++... ..+..+... -++..+... T Consensus 245 ~~~k~~~~v~is~el--l~ds~~~~~~~i~~~l~~~i~-~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~---~~~~~~~~~ 318 (458) T protein:vir:10 245 STYKLAAKSFITDET--EEDAIFSLLPLLRKRLIEAHA-VSIEEAFMTGDGSGKPKGLLTLASEDSAK---VVTEAKADG 318 (458) T ss_pred eeeeEEeeehhhHHH--HhcchHHHHHHHHHHHHHHHH-HHHHHHhhcCCCCCccceeeecccccccc---eeecccccc Confidence 988777766663332 232324689999999999999 79999999999984211 111111111 111111122 Q ss_pred CCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhhhhccccccceeeecCCCeEEEeeecccchhhcccch---hc Q lcl|Aclame:pro 265 GKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKA---VK 340 (393) Q Consensus 265 ~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~v~~~~~~~~tG~k~---~~ 340 (393) +.+-..|+|..++ ++...-.++..+++|+.++.+|+. |++.+|.+.......+... .|... 
=+ T Consensus 319 ~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~------------lkd~~G~~i~~~~~~~~~~-~~~~~~l~G~ 385 (458) T protein:vir:10 319 SVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDL------------LEDEEWQDVAQVGNDSVKL-QGQVGRIYGL 385 (458) T ss_pred cccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHh------------hcccCCceeeccccccccc-cCcCceecce Confidence 2233467777766 333333345678999999888776 7777777654332221111 01000 01 Q ss_pred eeeee---------------ccc-ce-e-cccceeee-cceeEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 341 PTVLV---------------DQK-YH-I-DMQDLTKV-DAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 341 ptv~v---------------D~k-~~-~-~~~~~~~~-~s~~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) |.++. |-+ +| + +-+|++-. +.| ..++.=.+..+.-+.+.+--|++-++.++| T Consensus 386 pv~~~~~~p~~~~~~~~~~~~f~~~~~~~~~~~~~v~~d~~-~~~~~~~~~~~~r~~~~v~~~~a~v~~~~a 456 (458) T protein:vir:10 386 PVVVSEYFPAKANSAEFAVIVYKDNFVMPRQRAVTVERERQ-AGKQRDAYYVTQRVNLQRYFANGVVSGTYA 456 (458) T ss_pred eeEEccccccccCCcceEEEEecccEEEEEeeceEEEeecc-cCCCceEEEEEEEecceEecccceEEEeec Confidence 22222 221 11 1 22333211 111 112222345555667888888888888888 No 56 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=99.43 E-value=2.3e-13 Score=89.87 Aligned_cols=360 Identities=14% Similarity=0.106 Sum_probs=183.5 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhh------hhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhh-hhcch Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGF------EVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE-KPKGK 73 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~------~v~~a~~~~skieelektis~l~aEi~k~enel~~~~E-k~K~k 73 (393) |.| +.|.+++|.++..+..+++.++... +.++..+.-.+++.++..+.++..+++..+.++..... ..+.. 
T Consensus 1 Mk~--l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~ 78 (387) T protein:vir:26 1 MPT--LYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAY 78 (387) T ss_pred Cch--HHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccC Confidence 766 4566666766666665555554321 11222222223334444444444444333333322211 00000 Q ss_pred hHHHHHHHHHHHHHHHHHHHHH-ccCChhHHHHHHHHHHh--CccchhhhHhhcchhHHHHHHHHHHhhCccccceeeec Q lcl|Aclame:pro 74 DKMTNFIESQNAVTEFFDVLKK-NSGKSEIKNAWNAKLAE--NGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTN 150 (393) Q Consensus 74 ~emtEfLkTkqA~~dya~ll~~-nqg~ke~k~AW~a~L~e--kgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n 150 (393) ....+=-+...+-.+|.+-.+. ....+ ....+...... .|. ..+--.++|..+..-|-..+++++++.....|.+ T Consensus 79 ~~~~~~~~~~~~~~~~~r~~~~~~~~~~-~~~~~~~~~~a~~~~~-~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~ 156 (387) T protein:vir:26 79 QSLSDNEKMVKAKAEFYRHAILPNEFEK-PSMEAQRLLHALPTGN-DSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTN 156 (387) T ss_pred CCCchhHHHHHHHHHHHHHHHhhhhHHH-HHHHHHHHHhhhccCC-CCCCceeechhHHHHHHHHHHhhchhhhhceeee Confidence 0000001111222223221111 11110 01111111111 111 1111237899988888899999999887666666 Q ss_pred ccceeEEEeeccccccceecccchhhhhhhhhhhhhccHHHHHHHHHHH-HHHHhhcCchhHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 151 VGALLVSRSFDSSNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLA-ERVKRLQMSYSELYNLIVAELTQAIVNKIV 229 (393) Q Consensus 151 ~~~~a~~i~l~na~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~La-d~~k~l~g~ygalvnyvm~ELaq~fI~Rav 229 (393) .+.......-.+...+..+.-|.+.++...+|...++.+..++.+..+. ++..+. -.++.+||+++|+..+. ++. 
T Consensus 157 ~~~~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds---~~~l~~~i~~~la~~~~-~~e 232 (387) T protein:vir:26 157 IKGLEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGS---DVDLVNWVENALQSGLA-AKE 232 (387) T ss_pred cCCceeeeeeccCCccccccccccccccccccceeeechheeeeechhhHHHHhhh---HHHHHHHHHHHHHHHHH-HHH Confidence 6544322211122345567788899999999999999998888776663 333332 34679999999999998 453 Q ss_pred -hcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhhhhccc Q lcl|Aclame:pro 230 -DLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQAT 307 (393) Q Consensus 230 -~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~ 307 (393) +.++..|+|+....- +.. +...+.++++...|+|..++ ++...-..+..+++++.+...++..+... T Consensus 233 ~~~~~~~g~g~g~~~g------~~~----~~~~~~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~- 301 (387) T protein:vir:26 233 RKDALAVSPKSGLEHM------SFY----NGSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNG- 301 (387) T ss_pred HHhHhhcCCCccccce------eee----ccccccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcC- Confidence 456667777632100 000 01122334455678888887 44433345667888888888877754432 Q ss_pred cccceeeecCCCeEEEeeecccchhhcccch-----hceeeeecccc-eecccceeeecceeEeecCceEEEeeeecccc Q lcl|Aclame:pro 308 ANANVRIKNDDTEIASEVGVDEIIVYTGSKA-----VKPTVLVDQKY-HIDMQDLTKVDAFEWKTNSNMILVETLTSGHV 381 (393) Q Consensus 308 ~~a~~~l~~~~~~~~~~v~~~~~~~~tG~k~-----~~ptv~vD~k~-~~~~~~~~~~~s~~~~~ns~~i~~~~~~~g~~ 381 (393) ++.+- .|.++..+ |... 
+.|.++-|-++ |+..+|..-.-+....+..-.+.+..+.-|.+ T Consensus 302 ----------~~~~~--~~~~~~ll--G~PV~~~~~~~~~~~GDf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v 367 (387) T protein:vir:26 302 ----------TTNFF--DTPAEKVF--GKPVVFTDAAVKPIVGDFNYFGINYDGTTYDTDKDVKKGEYLFVLTAWYDQQR 367 (387) T ss_pred ----------CCccc--ccCCcccc--ccceEEecCCCceeeechhhhhhhhhhhhheecccccCCceEEEEEEEeCcEe Confidence 22221 12222222 3211 22444445443 33333332222222334455677777889999 Q ss_pred cccCcceeeeeC Q lcl|Aclame:pro 382 ETYNAGAVITVS 393 (393) Q Consensus 382 ~~~n~~~~~~v~ 393 (393) .-+++-.+..+. T Consensus 368 ~~~~A~~~l~~k 379 (387) T protein:vir:26 368 TLDSAFRIAKAK 379 (387) T ss_pred echhheEEEEee Confidence 988888888886 No 57 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=99.43 E-value=2.3e-13 Score=89.87 Aligned_cols=360 Identities=14% Similarity=0.106 Sum_probs=183.5 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhh------hhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhh-hhcch Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGF------EVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE-KPKGK 73 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~------~v~~a~~~~skieelektis~l~aEi~k~enel~~~~E-k~K~k 73 (393) |.| +.|.+++|.++..+..+++.++... +.++..+.-.+++.++..+.++..+++..+.++..... ..+.. 
T Consensus 1 Mk~--l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~ 78 (387) T protein:vir:94 1 MPT--LYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAY 78 (387) T ss_pred Cch--HHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccC Confidence 766 4566666766666665555554321 11222222223334444444444444333333322211 00000 Q ss_pred hHHHHHHHHHHHHHHHHHHHHH-ccCChhHHHHHHHHHHh--CccchhhhHhhcchhHHHHHHHHHHhhCccccceeeec Q lcl|Aclame:pro 74 DKMTNFIESQNAVTEFFDVLKK-NSGKSEIKNAWNAKLAE--NGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTN 150 (393) Q Consensus 74 ~emtEfLkTkqA~~dya~ll~~-nqg~ke~k~AW~a~L~e--kgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n 150 (393) ....+=-+...+-.+|.+-.+. ....+ ....+...... .|. ..+--.++|..+..-|-..+++++++.....|.+ T Consensus 79 ~~~~~~~~~~~~~~~~~r~~~~~~~~~~-~~~~~~~~~~a~~~~~-~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~ 156 (387) T protein:vir:94 79 QSLSDNEKMVKAKAEFYRHAILPNEFEK-PSMEAQRLLHALPTGN-DSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTN 156 (387) T ss_pred CCCchhHHHHHHHHHHHHHHHhhhhHHH-HHHHHHHHHhhhccCC-CCCCceeechhHHHHHHHHHHhhchhhhhceeee Confidence 0000001111222223221111 11110 01111111111 111 1111237899988888899999999887666666 Q ss_pred ccceeEEEeeccccccceecccchhhhhhhhhhhhhccHHHHHHHHHHH-HHHHhhcCchhHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 151 VGALLVSRSFDSSNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLA-ERVKRLQMSYSELYNLIVAELTQAIVNKIV 229 (393) Q Consensus 151 ~~~~a~~i~l~na~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~La-d~~k~l~g~ygalvnyvm~ELaq~fI~Rav 229 (393) .+.......-.+...+..+.-|.+.++...+|...++.+..++.+..+. ++..+. -.++.+||+++|+..+. ++. 
T Consensus 157 ~~~~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds---~~~l~~~i~~~la~~~~-~~e 232 (387) T protein:vir:94 157 IKGLEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGS---DVDLVNWVENALQSGLA-AKE 232 (387) T ss_pred cCCceeeeeeccCCccccccccccccccccccceeeechheeeeechhhHHHHhhh---HHHHHHHHHHHHHHHHH-HHH Confidence 6544322211122345567788899999999999999998888776663 333332 34679999999999998 453 Q ss_pred -hcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhhhhccc Q lcl|Aclame:pro 230 -DLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQAT 307 (393) Q Consensus 230 -~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~ 307 (393) +.++..|+|+....- +.. +...+.++++...|+|..++ ++...-..+..+++++.+...++..+... T Consensus 233 ~~~~~~~g~g~g~~~g------~~~----~~~~~~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~- 301 (387) T protein:vir:94 233 RKDALAVSPKSGLEHM------SFY----NGSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNG- 301 (387) T ss_pred HHhHhhcCCCccccce------eee----ccccccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcC- Confidence 456667777632100 000 01122334455678888887 44433345667888888888877754432 Q ss_pred cccceeeecCCCeEEEeeecccchhhcccch-----hceeeeecccc-eecccceeeecceeEeecCceEEEeeeecccc Q lcl|Aclame:pro 308 ANANVRIKNDDTEIASEVGVDEIIVYTGSKA-----VKPTVLVDQKY-HIDMQDLTKVDAFEWKTNSNMILVETLTSGHV 381 (393) Q Consensus 308 ~~a~~~l~~~~~~~~~~v~~~~~~~~tG~k~-----~~ptv~vD~k~-~~~~~~~~~~~s~~~~~ns~~i~~~~~~~g~~ 381 (393) ++.+- .|.++..+ |... 
+.|.++-|-++ |+..+|..-.-+....+..-.+.+..+.-|.+ T Consensus 302 ----------~~~~~--~~~~~~ll--G~PV~~~~~~~~~~~GDf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v 367 (387) T protein:vir:94 302 ----------TTNFF--DTPAEKVF--GKPVVFTDAAVKPIVGDFNYFGINYDGTTYDTDKDVKKGEYLFVLTAWYDQQR 367 (387) T ss_pred ----------CCccc--ccCCcccc--ccceEEecCCCceeeechhhhhhhhhhhhheecccccCCceEEEEEEEeCcEe Confidence 22221 12222222 3211 22444445443 33333332222222334455677777889999 Q ss_pred cccCcceeeeeC Q lcl|Aclame:pro 382 ETYNAGAVITVS 393 (393) Q Consensus 382 ~~~n~~~~~~v~ 393 (393) .-+++-.+..+. T Consensus 368 ~~~~A~~~l~~k 379 (387) T protein:vir:94 368 TLDSAFRIAKAK 379 (387) T ss_pred echhheEEEEee Confidence 988888888886 No 58 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=99.43 E-value=2.3e-13 Score=89.87 Aligned_cols=360 Identities=14% Similarity=0.106 Sum_probs=183.5 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhh------hhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhh-hhcch Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGF------EVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE-KPKGK 73 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~------~v~~a~~~~skieelektis~l~aEi~k~enel~~~~E-k~K~k 73 (393) |.| +.|.+++|.++..+..+++.++... +.++..+.-.+++.++..+.++..+++..+.++..... ..+.. 
T Consensus 1 Mk~--l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~ 78 (387) T protein:vir:96 1 MPT--LYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAY 78 (387) T ss_pred Cch--HHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccC Confidence 766 4566666766666665555554321 11222222223334444444444444333333322211 00000 Q ss_pred hHHHHHHHHHHHHHHHHHHHHH-ccCChhHHHHHHHHHHh--CccchhhhHhhcchhHHHHHHHHHHhhCccccceeeec Q lcl|Aclame:pro 74 DKMTNFIESQNAVTEFFDVLKK-NSGKSEIKNAWNAKLAE--NGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTN 150 (393) Q Consensus 74 ~emtEfLkTkqA~~dya~ll~~-nqg~ke~k~AW~a~L~e--kgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n 150 (393) ....+=-+...+-.+|.+-.+. ....+ ....+...... .|. ..+--.++|..+..-|-..+++++++.....|.+ T Consensus 79 ~~~~~~~~~~~~~~~~~r~~~~~~~~~~-~~~~~~~~~~a~~~~~-~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~ 156 (387) T protein:vir:96 79 QSLSDNEKMVKAKAEFYRHAILPNEFEK-PSMEAQRLLHALPTGN-DSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTN 156 (387) T ss_pred CCCchhHHHHHHHHHHHHHHHhhhhHHH-HHHHHHHHHhhhccCC-CCCCceeechhHHHHHHHHHHhhchhhhhceeee Confidence 0000001111222223221111 11110 01111111111 111 1111237899988888899999999887666666 Q ss_pred ccceeEEEeeccccccceecccchhhhhhhhhhhhhccHHHHHHHHHHH-HHHHhhcCchhHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 151 VGALLVSRSFDSSNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLA-ERVKRLQMSYSELYNLIVAELTQAIVNKIV 229 (393) Q Consensus 151 ~~~~a~~i~l~na~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~La-d~~k~l~g~ygalvnyvm~ELaq~fI~Rav 229 (393) .+.......-.+...+..+.-|.+.++...+|...++.+..++.+..+. ++..+. -.++.+||+++|+..+. ++. 
T Consensus 157 ~~~~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds---~~~l~~~i~~~la~~~~-~~e 232 (387) T protein:vir:96 157 IKGLEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGS---DVDLVNWVENALQSGLA-AKE 232 (387) T ss_pred cCCceeeeeeccCCccccccccccccccccccceeeechheeeeechhhHHHHhhh---HHHHHHHHHHHHHHHHH-HHH Confidence 6544322211122345567788899999999999999998888776663 333332 34679999999999998 453 Q ss_pred -hcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhhhhccc Q lcl|Aclame:pro 230 -DLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQAT 307 (393) Q Consensus 230 -~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~ 307 (393) +.++..|+|+....- +.. +...+.++++...|+|..++ ++...-..+..+++++.+...++..+... T Consensus 233 ~~~~~~~g~g~g~~~g------~~~----~~~~~~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~- 301 (387) T protein:vir:96 233 RKDALAVSPKSGLEHM------SFY----NGSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNG- 301 (387) T ss_pred HHhHhhcCCCccccce------eee----ccccccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcC- Confidence 456667777632100 000 01122334455678888887 44433345667888888888877754432 Q ss_pred cccceeeecCCCeEEEeeecccchhhcccch-----hceeeeecccc-eecccceeeecceeEeecCceEEEeeeecccc Q lcl|Aclame:pro 308 ANANVRIKNDDTEIASEVGVDEIIVYTGSKA-----VKPTVLVDQKY-HIDMQDLTKVDAFEWKTNSNMILVETLTSGHV 381 (393) Q Consensus 308 ~~a~~~l~~~~~~~~~~v~~~~~~~~tG~k~-----~~ptv~vD~k~-~~~~~~~~~~~s~~~~~ns~~i~~~~~~~g~~ 381 (393) ++.+- .|.++..+ |... 
+.|.++-|-++ |+..+|..-.-+....+..-.+.+..+.-|.+ T Consensus 302 ----------~~~~~--~~~~~~ll--G~PV~~~~~~~~~~~GDf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v 367 (387) T protein:vir:96 302 ----------TTNFF--DTPAEKVF--GKPVVFTDAAVKPIVGDFNYFGINYDGTTYDTDKDVKKGEYLFVLTAWYDQQR 367 (387) T ss_pred ----------CCccc--ccCCcccc--ccceEEecCCCceeeechhhhhhhhhhhhheecccccCCceEEEEEEEeCcEe Confidence 22221 12222222 3211 22444445443 33333332222222334455677777889999 Q ss_pred cccCcceeeeeC Q lcl|Aclame:pro 382 ETYNAGAVITVS 393 (393) Q Consensus 382 ~~~n~~~~~~v~ 393 (393) .-+++-.+..+. T Consensus 368 ~~~~A~~~l~~k 379 (387) T protein:vir:96 368 TLDSAFRIAKAK 379 (387) T ss_pred echhheEEEEee Confidence 988888888886 No 59 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=99.42 E-value=7.6e-14 Score=92.50 Aligned_cols=367 Identities=16% Similarity=0.103 Sum_probs=172.7 Q ss_pred CCcc-hhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHh-------hhh---- Q lcl|Aclame:pro 1 MNKP-DLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNA-------QEE---- 68 (393) Q Consensus 1 ~~k~-d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~-------~~E---- 68 (393) |++. .++|.+++|.+........+.+.... .++.. ...++++..++++..++...+..... ... 
T Consensus 1 m~~~~~lee~~a~l~~~~~~~~~~~~~~~~~-~~e~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (419) T protein:vir:94 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEI-VAEAR---GLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPA 76 (419) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc Confidence 7764 45556666655444443333332221 11111 11233333333333333221111111 100 Q ss_pred -hhcchhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhC----ccchhhhHhhcchhHHHHHHHHHHhhCccc Q lcl|Aclame:pro 69 -KPKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAEN----GVTITDTTFQLPRKLVESINTALLNTNPVF 143 (393) Q Consensus 69 -k~K~k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ek----gV~~qd~~eiLP~~ii~AIe~A~ed~d~vl 143 (393) ....+... +..........+...........+++......+... |.......-+.|.-+-.-|....+.+..+. T Consensus 77 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~ 155 (419) T protein:vir:94 77 EAGTFRSLA-QRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVA 155 (419) T ss_pred ccccccchh-hhhhhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHhhhhhhh Confidence 00011111 111111111111100111111123333333332222 111122212445444444455555433333 Q ss_pred cceeeecccceeEEEee--------cc-ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHH Q lcl|Aclame:pro 144 KVFHVTNVGALLVSRSF--------DS-SNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYN 214 (393) Q Consensus 144 ~~fhV~n~~~~a~~i~l--------~n-a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvn 214 (393) ..+++.......+.+-- .. 
.+.+.-+--|+++++.+.+|...++.|..++.+..+.+.+-+ ++ +.+.+ T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~-d~--~~l~~ 232 (419) T protein:vir:94 156 DLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAAD-DN--SQLMG 232 (419) T ss_pred hcceeeeccCCceeeeeeccccccccccCcccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHH-hH--HHHHH Confidence 33343332222111111 11 134455567889999999999999999999888777443333 22 46899 Q ss_pred HHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhh--hhccCCCCCHHHHHhhh-ceeeecCCCceEEE Q lcl|Aclame:pro 215 LIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTK--AKSAGKTPFADAIEEAV-DFVRPTAGRRYLIV 291 (393) Q Consensus 215 yvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~--at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i 291 (393) |++.+|+..+. ++++.++++|||+....-.-.... +..+.+. ....+.+...+.|..++ .+..+......+++ T Consensus 233 ~i~~~la~a~~-~~~d~aii~G~G~~~p~Gi~~~~~---~~~~~~~~~~~~~t~~~~~~~l~~~~~~~~~~~~~~~~~v~ 308 (419) T protein:vir:94 233 YIQGRLTYGLR-FLRDRQLLNGNGSTEMQGILTTPG---IGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVV 308 (419) T ss_pred HHHHHHHHHHH-HHHHHHHHhccCcccccceecccc---cccccccccccccccchhHHHHHHHHHhhhhccCCCCEEEE Confidence 99999999999 799999999999853211111111 1112221 11233444568888888 33333344457888 Q ss_pred ecchhhhHhhhhhccccccceeeecCCCe-EEEeeecc----cchhhcccchhc-------eeeeecccce--e-cccce Q lcl|Aclame:pro 292 KTEDRKALLDELRQATANANVRIKNDDTE-IASEVGVD----EIIVYTGSKAVK-------PTVLVDQKYH--I-DMQDL 356 (393) Q Consensus 292 ~~~d~~a~~~~~~~~~~~a~~~l~~~~~~-~~~~v~~~----~~~~~tG~k~~~-------ptv~vD~k~~--~-~~~~~ 356 (393) ++.++..|+. +++++|. +-....+. ...+ |-..+. 
+.++.|-+++ + +-+|+ T Consensus 309 n~~~~~~l~~------------~k~~~~~~~~~~~~~~~~~~~~l~--G~pV~~~~~~~~~~~~~gd~~~~~~~~~~~~~ 374 (419) T protein:vir:94 309 HPQDWESIEL------------DQAPGSGVFRVIANVQGEATPRIW--GLNVVSTVAIAQGTALVGGFRQGATLWSRQGI 374 (419) T ss_pred cHHHHHHHHH------------HhhcCCCceeecCCcccCCCcccc--ceeeEEcCCCCCccEEEeeccceEEEEEecce Confidence 9999888876 4443222 21111111 1111 221111 2334454432 1 33344 Q ss_pred eeecce----eEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 357 TKVDAF----EWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 357 ~~~~s~----~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) +-.-+. -|..|+-.+.++.+..|.+.-|++-++++++ T Consensus 375 ~v~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~~ 415 (419) T protein:vir:94 375 TVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFA 415 (419) T ss_pred EEEEeccccchhhcCcEEEEEEEeeccEEeccccEEEEEec Confidence 322111 1456666788999999999999999999998 No 60 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=99.42 E-value=4.2e-13 Score=88.41 Aligned_cols=369 Identities=14% Similarity=0.161 Sum_probs=172.8 Q ss_pred CCcchhhHHHHHHHHHHHhhHH-------HHhhhhhhhhhhH----Hhhh----hHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVS-------LKSQISGFEVKNA----IEDL----PKVQELEKTLSENSIEIIKIENELNA 65 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~-------~~s~i~~~~v~~a----~~~~----skieelektis~l~aEi~k~enel~~ 65 (393) |=..-+++++.+|.+|.+.... +..+|+....+.+ .+++ ..+.+++..+.++..||...+++++. 
T Consensus 7 ~l~~~~~~~~~~l~el~e~~~~l~k~~~el~~~l~ea~~~ee~~~~ee~i~~l~~~~~el~e~~~~l~~ei~~le~el~e 86 (466) T protein:vir:80 7 MLAKKIEQRKAALAELLEQEKALQKRSEELEAAIDEANTDEEIAVVEDEINKLEGEKTELEEKKSKLEGEIKELENELEQ 86 (466) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2122234444444444444333 3333322221111 1111 22334455555566666554554443 Q ss_pred hh---hhhcchhHHHHH-----HHHH-HHHHHHHHHHHHccC-----ChhHHHHHHHHH---Hh-CccchhhhHhhcchh Q lcl|Aclame:pro 66 QE---EKPKGKDKMTNF-----IESQ-NAVTEFFDVLKKNSG-----KSEIKNAWNAKL---AE-NGVTITDTTFQLPRK 127 (393) Q Consensus 66 ~~---Ek~K~k~emtEf-----LkTk-qA~~dya~ll~~nqg-----~ke~k~AW~a~L---~e-kgV~~qd~~eiLP~~ 127 (393) +. +....++...+. ++.. .....+.+-+...+. .++.+..|.+.- .+ .+. .+...++|.- T Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~g~~~~vP~~ 164 (466) T protein:vir:80 87 LNNKEPKNNSEPAQVSGARTQQFVGGETRMKGFFRNMPYEQRAALIARSEVKEFLAQVRTLAQQKRAV--SGAELTIPDV 164 (466) T ss_pred HHHhhhccCchhHHHHhhhhhHHhhHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhh--ccccccccHH Confidence 22 222222222111 1111 111112111111111 112233222211 11 122 2222379999 Q ss_pred HHHHHHHHHHhhCccccceeeecccceeEEEeecc--ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 128 LVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS--SNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRL 205 (393) Q Consensus 128 ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n--a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l 205 (393) ++..|.+.+.+++++++...|.+..... .+...+ ....|+. 
-|.++++...+|...++.+..++-+..+.+.+-+ T Consensus 165 ~~~~i~~~l~~~~~l~~~~~v~~~~g~~-~~~~~~~~~~a~wv~-E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~- 241 (466) T protein:vir:80 165 MLELLRDNMHRYSKLISKVRLRPLKGTA-RQNIAGAIPEGVWTE-AVANLNELSLSFSQIEVDGYKVGGFIPIPNSTLE- 241 (466) T ss_pred HHHHHHHhhhhhhhhhhheeeeecCcee-EeeeecCCcceeecc-cccccccccccccceeecceeeeeehhhhHHHHh- Confidence 9999999999999999966666665432 222233 2355554 5667777788899999998888887777444433 Q ss_pred cCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhh-------------hhhhhhh------hhhccCC Q lcl|Aclame:pro 206 QMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVK-------------KIKKITT------KAKSAGK 266 (393) Q Consensus 206 ~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~-------------~ik~it~------~at~~~~ 266 (393) .+.-++..|++.+|+..|. ++.+.++++|||+.... -+..+.- .+..++. ....... T Consensus 242 -ds~~~l~~~i~~~la~~~~-~~~~~ail~G~G~~~P~--Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (466) T protein:vir:80 242 -DSDLNLADEILDAIGQAIG-FALDKAILYGTGTKMPV--GIVTRLAQTTQPPNWGTKAPAWTNLSTTNLLKIDPTGKSA 317 (466) T ss_pred -cchHHHHHHHHHHHHHHHH-HHHhhheeeccCCCCcc--eeeecccccccccccccccccccccchhhhhhhhhhccch Confidence 2335689999999999999 79999999999984210 0100000 0000000 0000000 Q ss_pred CCCHHHHHhhhceeeecCCC--ceEEEecchhhhHhhhhhccccccceeee---cCCCeEEEeeecccchhhcccchhce Q lcl|Aclame:pro 267 TPFADAIEEAVDFVRPTAGR--RYLIVKTEDRKALLDELRQATANANVRIK---NDDTEIASEVGVDEIIVYTGSKAVKP 341 (393) Q Consensus 267 t~~~dal~Eald~a~~~a~~--~~l~i~~~d~~a~~~~~~~~~~~a~~~l~---~~~~~~~~~v~~~~~~~~tG~k~~~p 341 (393) ..+...+.-++....+..++ .+.+.+......|+. ++ +.+|.|....+-+...+ |.. 
|++ T Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~------------~~~~~~~~g~~~~~~~~~~~i~--G~p-vv~ 382 (466) T protein:vir:80 318 EEFFSELVLKLSKARANYSNGMKFWAMSSNTHAVLMS------------KAITFNSAGALVASLNNTMPIV--GGD-IVI 382 (466) T ss_pred hhHHHHHHHHHHhhhccccCCceeEEecchhHHHhhc------------ccccccCCccccccCCCccccc--ccc-eee Confidence 01111111122222222222 223333333333322 22 44556655443322222 432 222 Q ss_pred e-------eeec-cccee--cccceeeecceeEee--cCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 342 T-------VLVD-QKYHI--DMQDLTKVDAFEWKT--NSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 342 t-------v~vD-~k~~~--~~~~~~~~~s~~~~~--ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) + ++++ -++|+ +-+|++-..+..-.| ++-.+.+..+..|-+.-+++-.+.+++ T Consensus 383 s~~~~~~~~~~g~~~~y~i~~r~~~~i~~~~~~~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~ 446 (466) T protein:vir:80 383 LDFIPDNDIIGGYGSLYLLAERADIKLAQSEHVRFIEDQTVFKGTARYDGKPVFGEGFVAVNIA 446 (466) T ss_pred cCccCccceeeeccccEEEEeecceEEEechhhhhhcCcEEEEEEEEEccEEeccCceEEEEec Confidence 2 2223 22222 455655444444444 444567777888999888888888888 No 61 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=99.39 E-value=3.2e-13 Score=89.09 Aligned_cols=357 Identities=15% Similarity=0.112 Sum_probs=177.0 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhh------hhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh-cch Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEV------KNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKP-KGK 73 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v------~~a~~~~skieelektis~l~aEi~k~enel~~~~Ek~-K~k 73 (393) |.|. .|.+.+|.++++....++.++..... ++..+...+++.++..++.+..+++..+.+.....+.. +.. 
T Consensus 1 Mk~l--~el~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~ 78 (387) T protein:vir:93 1 MPTL--YELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVKDIEEKEKAKVKDTGEAY 78 (387) T ss_pred CchH--HHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccC Confidence 8775 34455555555555444444332221 22222234445555555555555544333332221110 100 Q ss_pred hHHHHHHHHHHHHHHHHHHH-HHccCChhHHHHHHHHHHhCccch-hhhHhhcchhHHHHHHHHHHhhCccccceeeecc Q lcl|Aclame:pro 74 DKMTNFIESQNAVTEFFDVL-KKNSGKSEIKNAWNAKLAENGVTI-TDTTFQLPRKLVESINTALLNTNPVFKVFHVTNV 151 (393) Q Consensus 74 ~emtEfLkTkqA~~dya~ll-~~nqg~ke~k~AW~a~L~ekgV~~-qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~ 151 (393) ..+.+=-+..++-..|.+-. ...++. +....+.......+..+ .+--.++|..+...|-..+.+++++.+...|.+. T Consensus 79 ~~~~~~~~~~~~~~~~~r~~~~~~~~~-~~~~~~~~~~~al~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~ 157 (387) T protein:vir:93 79 QSLNDHEKMVKAKAEFYRHAILPNEFE-KPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNI 157 (387) T ss_pred CCcchhhHHHHHHHHHHHHHhhhhhhh-hhhhhhHHHHHhhccCcCCCCceeechhHHHHHHHHHHhhchhhhheeeeec Confidence 00100001111111221111 111111 11122222222222110 1112278999988899999999998887777766 Q ss_pred cceeEEEeeccccccceecccchhhhhhhhhhhhhccHHHHHHHHHHH-HHHHhhcCchhHHHHHHHHHHHHHHHHHHH- Q lcl|Aclame:pro 152 GALLVSRSFDSSNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLA-ERVKRLQMSYSELYNLIVAELTQAIVNKIV- 229 (393) Q Consensus 152 ~~~a~~i~l~na~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~La-d~~k~l~g~ygalvnyvm~ELaq~fI~Rav- 229 (393) +.......--+...+..+.-|++.+++..+|...++.+..++.+..+. +++ +.+..++.+|++++|+..|. +.. 
T Consensus 158 ~~~~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell---~Ds~~~l~~~i~~~la~~~~-~~e~ 233 (387) T protein:vir:93 158 KGLEIPRVSYTLDDDDFITDVETAKELKLKGDTVKFTTNKFKVFAAISDTVI---HGSDVDLVNWVENALQSGLA-AKER 233 (387) T ss_pred CCceEEEEeecCCccccccCcccccccccccceeeeeheeeeeechhhHHHH---hhhHHHHHHHHHHHHHHHHH-HHHH Confidence 654332211122334456678888999999999999988887776663 333 33334689999999999998 454 Q ss_pred hcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhhhhcccc Q lcl|Aclame:pro 230 DLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQATA 308 (393) Q Consensus 230 ~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~ 308 (393) ..++..|+|++...- .... .+.+.++.+...|+|..++ ++...-..+..+++++.+..+++.-+ T Consensus 234 ~~~~~~g~g~g~p~g--~l~~--------~~~~~v~~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~~~~~~----- 298 (387) T protein:vir:93 234 KDALAVSPKSGLDHM--SFYN--------GSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVL----- 298 (387) T ss_pred HhHhhcCCCccccce--eeec--------cccccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHH----- Confidence 446677787743111 0000 1112234455578888877 44333345567788888887777643 Q ss_pred ccceeeecCCCeEEEeeecccchhhcccchhceeeeeccc---------ce-ecccceeeecceeEeecCceEEEeeeec Q lcl|Aclame:pro 309 NANVRIKNDDTEIASEVGVDEIIVYTGSKAVKPTVLVDQK---------YH-IDMQDLTKVDAFEWKTNSNMILVETLTS 378 (393) Q Consensus 309 ~a~~~l~~~~~~~~~~v~~~~~~~~tG~k~~~ptv~vD~k---------~~-~~~~~~~~~~s~~~~~ns~~i~~~~~~~ 378 (393) ++++|.+- .|.++..+ | +|.++.|-+ ++ +...+..-.-.....+..-.+..+...- T Consensus 299 ------~d~~~~~~--~~~~~~ll--G----~PV~~~~~~~~~~~GDf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~d 364 (387) T protein:vir:93 299 ------SNGTTNFF--DTPAEKVF--G----KPVVFTDAAVKPIVGDFNYFGINYDGTTYDTDKDVKKGEYLFVLTAWYD 364 (387) T ss_pred ------hcCCCccc--ccCCcccc--c----cceEEecCCCceeeeehhhhheehhhheeeecccccCCceeEEEEeeeC Confidence 33344332 23333333 4 244444332 22 2111111100011112233345566778 Q ss_pred ccccccCcceeeeeC Q lcl|Aclame:pro 379 GHVETYNAGAVITVS 393 (393) Q 
Consensus 379 g~~~~~n~~~~~~v~ 393 (393) |-|--+++-.+.++. T Consensus 365 ~~v~~~eA~~~l~~k 379 (387) T protein:vir:93 365 QQRTLDSAFRIAKAK 379 (387) T ss_pred ceeechhheEEEEee Confidence 888878877777775 No 62 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=99.38 E-value=2.7e-13 Score=89.52 Aligned_cols=345 Identities=12% Similarity=0.085 Sum_probs=184.9 Q ss_pred CCcchhhHHHHH-HHHHHHhhHHHHhhhhhhhhhhHHhh-----hhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcchh Q lcl|Aclame:pro 1 MNKPDLIEKQNR-LAELKENNVSLKSQISGFEVKNAIED-----LPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKD 74 (393) Q Consensus 1 ~~k~d~~ekq~e-La~lK~~~~~~~s~i~~~~v~~a~~~-----~skieelektis~l~aEi~k~enel~~~~Ek~K~k~ 74 (393) |+.-+++++-.+ +++++++......++.+ ..+...++ -.+++++++.+.++++.+++.+.... ...++.. T Consensus 1 m~~~e~~~~~~~~~~~l~~~~~~~~~e~~~-~~e~~~~~~~~~~~~~~~e~~~~~~~l~~~~~~~e~~~~---~~~~~~~ 76 (379) T protein:vir:10 1 MEALEIKVALEAIKGQVDSKSSAQALEVKG-LIEALEAKMTSEKDLAVNELKSDMAALQAHADKLDVKLK---EKAKSED 76 (379) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hcccccc Confidence 776666555333 35555444333222221 01111111 13345555556665555554333222 2223333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhC--ccchhhhHhhcchhHHHHHHHHHHhhCccccceeeeccc Q lcl|Aclame:pro 75 KMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAEN--GVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVG 152 (393) Q Consensus 75 emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ek--gV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~ 152 (393) ...++.+........ .. .++..=.....++ +..+++.....|..+..-|-..++++.++.+..++.... 
T Consensus 77 ~~~~~~~~~~~~~~~---~~------~~~~~~~~~~~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~ 147 (379) T protein:vir:10 77 KSDSLVKSITENFND---IK------EVRNGKSIQVKAVGDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSIS 147 (379) T ss_pred cchhHHHHHHHHHHh---HH------HHHhhhhhhhhhhcccccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeecc Confidence 333333332221111 00 0111000112223 344445555678777777777777778887766654444 Q ss_pred ceeEEEeecc-cc-ccc-eecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 153 ALLVSRSFDS-SN-EAQ-VHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIV 229 (393) Q Consensus 153 ~~a~~i~l~n-a~-~a~-GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav 229 (393) ...+.+--.+ .. .++ ..--|.++.....+|...++.|..++....+.+.+- +.+ ..+.+|+..+|+..+- +.. T Consensus 148 ~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell--~D~-~~l~~~i~~~la~~~~-~~~ 223 (379) T protein:vir:10 148 GGTYTFVRENGAGEGAIGAQVEGATKGQKDYDISMIDVNTDFIAGFTRYSKKMA--NNL-PFLTSFIPNALRRDYA-KAE 223 (379) T ss_pred CCceEEEEeecCCCcccccccCCccccccccceeeeEeeeeeEEeeehhhHHHH--hhH-HHHHHHHHHHHHHHHH-HHH Confidence 4433333222 21 121 123466888888999999999998888866744442 222 4599999999999998 799 Q ss_pred hcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhhhhcccc Q lcl|Aclame:pro 230 DLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQATA 308 (393) Q Consensus 230 ~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~ 308 (393) +.+++.|+|...+... 
...++....|.+..++ ++.........+++++.+..+|+- T Consensus 224 ~~~~~~g~~~~~~~~~----------------~~~~~~~~~d~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~------- 280 (379) T protein:vir:10 224 NAAFNAVLAANATAST----------------EIITNKNKVEMLINEIAKQENLDFPVTAIVLRPTDYYDILV------- 280 (379) T ss_pred HHHHhccccccccccc----------------ccccCcccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHH------- Confidence 9999999987643221 1122334456777766 333333344468889999888876 Q ss_pred ccceeeecCCCeEEEeeecccchhhcc------cchhceeeeeccccee--cccceeeec------------ceeEeecC Q lcl|Aclame:pro 309 NANVRIKNDDTEIASEVGVDEIIVYTG------SKAVKPTVLVDQKYHI--DMQDLTKVD------------AFEWKTNS 368 (393) Q Consensus 309 ~a~~~l~~~~~~~~~~v~~~~~~~~tG------~k~~~ptv~vD~k~~~--~~~~~~~~~------------s~~~~~ns 368 (393) |++.+|.+-...++ ...+| +..|+.+-.++....+ |.+.+..+. ...|.+|. T Consensus 281 -----lkd~~G~~l~~~~~---~~~~~~~~~l~G~pvv~s~~~~ag~~~~gdf~~~~~~~~~~~~i~~~~~~~~~f~~~~ 352 (379) T protein:vir:10 281 -----TQKSVGAGYGLPGV---VTQDNGVLRINGIPLFRATWLAANKYYVGDWTRVTKVTTEGLSLEFSEVEGTNFVKNN 352 (379) T ss_pred -----hhccCCceeccCCc---cCCCCCcceecceeeEecCCCCCCceEEeecccEEEEEEeceEEEEeecccccccCCc Confidence 88888887543222 11111 2233333333322222 444432221 11244555 Q ss_pred ceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 369 NMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 369 ~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) -.|.++.-+.+.|--|.+-++++++ T Consensus 353 ~~~r~~~R~~~~v~~p~a~v~~~~~ 377 (379) T protein:vir:10 353 ITARIEAQVALAVEQPAALIFGDFT 377 (379) T ss_pred EEEEEEEEeccEEecCccEEEEEec Confidence 5566667888999999999999988 No 63 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=99.37 E-value=2.4e-13 Score=89.75 Aligned_cols=351 Identities=13% Similarity=0.085 Sum_probs=180.3 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhh------------ Q lcl|Aclame:pro 1 
MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE------------ 68 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~E------------ 68 (393) |.++-+.+.+.+.+++...+.++......-... .++..++++++..+++++.+|.....+.....+ T Consensus 1 m~~~~l~~l~e~r~~~~~e~~~L~~~~~~~~lt--~e~~~~~~~l~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 78 (390) T protein:vir:62 1 MDATTLSANFEARERATAELRTLTDEFAGKEMT--DEAREKEERLITAVSDYDARIKRGIEAIKAIDPVTSLLSGLQGSG 78 (390) T ss_pred CChhHHHHHHHHHHHHHHHHHHHHHHhhccccc--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 999988888888887777777666554432211 123367777888888887777543222222111 Q ss_pred ---hhcchhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHH-HHHHHHHhhCcccc Q lcl|Aclame:pro 69 ---KPKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVE-SINTALLNTNPVFK 144 (393) Q Consensus 69 ---k~K~k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~-AIe~A~ed~d~vl~ 144 (393) ..+.......|+++-... +.+.. .-...+.+.+.......+|.++.. -|.+.++. .++|. T Consensus 79 ~~~~~~~~~~~~~~~r~~~~~--------------~~r~~-~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~-~~~l~ 142 (390) T protein:vir:62 79 SGAQRSADVDDDATLRAGNLG--------------EARSF-EFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVER-SAIMR 142 (390) T ss_pred ccchhhcchHHHHHHhhhhhh--------------hhHHH-HhhhhhhcccccCCCccccccchHHHHHHHHhh-hhhhh Confidence 000011111222221111 11110 011111212222222356766543 35666665 44443 Q ss_pred ce-eeecccce-eEEEeecc--ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHH Q lcl|Aclame:pro 145 VF-HVTNVGAL-LVSRSFDS--SNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAEL 220 (393) Q Consensus 145 ~f-hV~n~~~~-a~~i~l~n--a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~EL 220 (393) .+ +|...... 
.+.+...+ ..-+| ..-|++.+++..+|...++.|..++.+..+.+-+.+- +.-++..||+++| T Consensus 143 ~~~~~~~~~~~~~~~~p~~~~~~~a~w-v~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~l 219 (390) T protein:vir:62 143 GGATTFTTSDANPLDFTVITGRSSASI-VGETAEIPESYPATAQRSMGGFKYGFASVVSYEFATD--QVLDLVGFLVSDA 219 (390) T ss_pred hcceeeecCCCceeEEEEEcCCcceee-ecccccccccccceeeeEeeeeeEEeehHHHHHHHhh--hhHHHHHHHHHHH Confidence 22 44433221 12232222 23333 4556788889999999999998888776663333221 2236789999999 Q ss_pred HHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhH Q lcl|Aclame:pro 221 TQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKAL 299 (393) Q Consensus 221 aq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~ 299 (393) +.++- +.++.++++|||.|.- +....-. ...+.+. ..+.+-+.|+|...+ +.-..-..+...++|+.....| T Consensus 220 ~~~i~-~~~d~~~l~G~G~p~G----i~~~~~~-~~~~~~~-~~~~~~~~~~l~~~~~~l~~~~~~~a~~vmn~~~~~~L 292 (390) T protein:vir:62 220 GPAIG-DAMGRHFITGTGQPRG----ILTDASP-ATATFLA-TDTDSKVSDALIDLFHEVPSAYRANAKYVVNDLRAAQM 292 (390) T ss_pred HHHHH-HHHHhhhhccCCcccc----ccccccc-cccceec-ccccccchHHHHHHHHhhhhhhhcCCEEEEchHHHHHH Confidence 99998 7999999999997521 1111100 0001111 111122345554433 2211122345678888888777 Q ss_pred hhhhhccccccceeeecCCCeEEEeeecc----cchhhcccchhc-------eeeeeccccee--cccceeeecceeEee Q lcl|Aclame:pro 300 LDELRQATANANVRIKNDDTEIASEVGVD----EIIVYTGSKAVK-------PTVLVDQKYHI--DMQDLTKVDAFEWKT 366 (393) Q Consensus 300 ~~~~~~~~~~a~~~l~~~~~~~~~~v~~~----~~~~~tG~k~~~-------ptv~vD~k~~~--~~~~~~~~~s~~~~~ 366 (393) +. |++++|.+-+.-++. .+.+ |.-.+. 
|.++-|-+.|+ +-+|++--.+....| T Consensus 293 ~~------------lkd~~g~~l~~~~~~~g~~~~l~--G~Pv~~~~~~p~~~i~~gd~s~~~i~~~~~~~v~~~~~~~~ 358 (390) T protein:vir:62 293 RK------------LKDANGQYLWQSGLTVGAPSLFN--GKVVETDDGMPADKILFADLSKYRVRFAGSLRVDRSVDAKF 358 (390) T ss_pred HH------------hhccCCCeeecCCcCCCccceec--ccceEEecCCCCccEEEeeccceeEEeecceEEEeeccccc Confidence 65 777788765422221 1111 321111 22233332222 223333333344445 Q ss_pred cCceEE--EeeeecccccccCcceeeeeC Q lcl|Aclame:pro 367 NSNMIL--VETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 367 ns~~i~--~~~~~~g~~~~~n~~~~~~v~ 393 (393) ..+++. +.....|-+.-|++-.+++|. T Consensus 359 ~~~~~~~~~~~r~d~~~~~~~A~~~l~~~ 387 (390) T protein:vir:62 359 STDQIVYRFLQRADGLLVDARGAKVLTVT 387 (390) T ss_pred cCCcEEEEEEEEeCcEeechhheEEEEee Confidence 666664 455578888888888888888 No 64 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=99.36 E-value=2.6e-13 Score=89.58 Aligned_cols=354 Identities=13% Similarity=0.057 Sum_probs=173.1 Q ss_pred CCc----chhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHH---H-HHHHHhhhhh--- Q lcl|Aclame:pro 1 MNK----PDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIK---I-ENELNAQEEK--- 69 (393) Q Consensus 1 ~~k----~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k---~-enel~~~~Ek--- 69 (393) |-| ....+++++.+|++....+++..++..+ ++...+..++..+.. . 
++.......+ T Consensus 1 ~~ke~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 68 (413) T protein:vir:81 1 MVKEAGDAPTNAQVAEIAEVKSMVEQFKADEDAKR------------ERAKSVKANQDFLRELQEATAGSVDSEKSGELT 68 (413) T ss_pred ChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------HHHHHHHHHHHHHHHHHHHHHhHHhHHHhhhHh Confidence 433 2233344445555444444433322221 111111111111110 0 0000000000 Q ss_pred --hcchhHH--------HHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhh Q lcl|Aclame:pro 70 --PKGKDKM--------TNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNT 139 (393) Q Consensus 70 --~K~k~em--------tEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~ 139 (393) ......+ .+..+... ..............-+. ++..... ..+-...+....+|..+...|-..++++ T Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~-~~~~~~~-~~~~~~~~~~~~vp~~~~~~ii~~~~~~ 145 (413) T protein:vir:81 69 RKGEGYKSIGEFFAKRAGDQIKQQA-GGAQLNYSVGEYVAPRV-KAASDPA-STATLTDEFQGGYGTTWNRNIIYRRREK 145 (413) T ss_pred hhhhhhhhhhhhhhhhhhhHHHHHH-HHHHhhhhhhhhhhhHH-Hhhhhhh-hhcccccccccccchhhHHHHHHHHhhh Confidence 0000000 00000000 00000000000001011 1122111 1233344566688988888888889988 Q ss_pred CccccceeeecccceeEEEeecc----c-cccceecccchhhhhhh-hhhhhhccHHHHHHHHHHHHHHHhhcCchhHHH Q lcl|Aclame:pro 140 NPVFKVFHVTNVGALLVSRSFDS----S-NEAQVHKDGQTKTEQAA-TLTIDTLEPVMVYKLQSLAERVKRLQMSYSELY 213 (393) Q Consensus 140 d~vl~~fhV~n~~~~a~~i~l~n----a-~~a~GHk~ga~Kk~q~~-~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalv 213 (393) .++.+.+.+-+.+.....+.... . ..+.-+.-|.++.+... .|...++.|..++.+..+.+-+.+ .+ +.+. 
T Consensus 146 ~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~--ds-~~l~ 222 (413) T protein:vir:81 146 LVVADLMDNLTMTNTTIKYLMEKANRVVEGGFKTVAEGGKKPYMRFADFDIVTESLSKIAGLTKITDEMIE--DY-DFLV 222 (413) T ss_pred hhHHhhcceeeccCCceeEEEeccccccccccceecCcccccccCcccceeeEeeeeeEEEeehhhHHHHH--HH-HHHH Confidence 88887767666655443332221 1 23344556777777764 799999999888777666333322 22 3589 Q ss_pred HHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhceeeec-CCC-ceEEE Q lcl|Aclame:pro 214 NLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPT-AGR-RYLIV 291 (393) Q Consensus 214 nyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald~a~~~-a~~-~~l~i 291 (393) +|++++|+.++. |+++.++++|||+... ..-+.....+. + .+..+.....+.+..++.-+... +.+ ..+++ T Consensus 223 ~~i~~~la~~~~-~~~d~~~l~G~G~~~~--~~Gi~~~~~~~---~-~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vm 295 (413) T protein:vir:81 223 SYINARLLEELA-IEEERQLLLGDGTGNN--LTGLLKRDGIQ---T-LAVSNKDELADSIYKAMTNISLATPFQADALVI 295 (413) T ss_pred HHHHHHHHHHHH-HHHHHHHhccCCCCCc--ccccccccccc---c-ccccccchhHHHHHHHHHHhhhhccCCCcEEEE Confidence 999999999998 7999999999998531 11111111111 1 11222334466676776333322 222 24888 Q ss_pred ecchhhhHhhhhhccccccceeeecCCCeEEEeeecccch----------hhcccchhc--------eeeeecccc-ee- Q lcl|Aclame:pro 292 KTEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEII----------VYTGSKAVK--------PTVLVDQKY-HI- 351 (393) Q Consensus 292 ~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~v~~~~~~----------~~tG~k~~~--------ptv~vD~k~-~~- 351 (393) |+.++.+|+. |++.+|.+-+...+.... +. |- .|+ +.++.|-+. |. 
T Consensus 296 n~~~~~~l~~------------lkd~~G~~l~~~~~~~~~~~~~~~~~~~l~-G~-pv~~s~~~~~~~~~~gd~~~~~~~ 361 (413) T protein:vir:81 296 NPLDYQELRL------------AKDANGQYYGGGVFQGQYGSGGIMLDPAPW-GL-RTVQSQVVPVGKPVVGAFRSAASV 361 (413) T ss_pred cHHHHHHHHH------------hhccCCceeccccccccccccccccCceec-ce-eeEEcCCCCcccEEEEecccEEEE Confidence 9999999887 888888876643332211 10 21 122 233344432 11 Q ss_pred -cccceeeeccee----EeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 352 -DMQDLTKVDAFE----WKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 352 -~~~~~~~~~s~~----~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) +-.|++-.-+.. |.+|+-.+.++.+..|.+.-|++-++++++ T Consensus 362 ~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~ 408 (413) T protein:vir:81 362 LRKGGVRIDSTNTNVDDFENNLITVRAEERVGLMVTFPEAIVQLDVA 408 (413) T ss_pred EEecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEec Confidence 223332211111 334445666777899999999999999988 No 65 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=99.23 E-value=3.3e-12 Score=83.54 Aligned_cols=342 Identities=18% Similarity=0.190 Sum_probs=175.8 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcchhHHHHHH Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMTNFI 80 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~Ek~K~k~emtEfL 80 (393) |... 
..++.+++++...+..+++.- ..++++ .+.++.....++.++.+ ..++ ++.+++ T Consensus 1 M~i~-----~~~~~~~~e~~~~l~~~~~~~--~~~e~~---~~~~~~~~~~~~~~~~~-------~~~~-----e~~~~~ 58 (377) T protein:vir:96 1 MAIN-----LKELPKYREAVAELSAKISAG--ATPEEQ---EKLFEAAFTTMGDEILA-------KNEE-----EMERMF 58 (377) T ss_pred CCcc-----HHHHHHHHHHHHHHHHHHhhc--ccHHHH---HHHHHHHHHHHHHHHHH-------HHHH-----HHHHHH Confidence 5442 122333333333333333321 111111 11122222333333322 1111 111111 Q ss_pred HHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEee Q lcl|Aclame:pro 81 ESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSF 160 (393) Q Consensus 81 kTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l 160 (393) -.+ ......+.+-++...+-+...+. .+--.++|..++..|-+.+.+++++++...|.+.+... .+-. T Consensus 59 ~~~---------~~~~~lt~ee~~~~~~~~~~~~~--~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~~-~i~~ 126 (377) T protein:vir:96 59 DLR---------DKNRELTAEEIKFFNDIDKNVGG--KDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLRL-KALT 126 (377) T ss_pred Hhc---------cCCcccCHHHHHHHHHHHhcCCC--CCCceecCHHHHHHHHHHHHhhhhhhhhceeEecCCce-EEEE Confidence 000 00111123344444444433332 34444899999999999999999999988887776542 3333 Q ss_pred -ccc-cccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccC Q lcl|Aclame:pro 161 -DSS-NEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDG 238 (393) Q Consensus 161 -~na-~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG 238 (393) .+. 
...||--.+.-+.....+|...++.+..+|.+..+..-+ |+.++-++..|+.++|+..|- ++.+.|+++||| T Consensus 127 ~~~~~~a~wv~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~~l--l~ds~~~le~~i~~~l~~~~~-~~~~~a~i~G~G 203 (377) T protein:vir:96 127 AETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDA--LKFGPKWLKQFITEQLKEAIA-VALELAIVKGNG 203 (377) T ss_pred ecCCcceeEeecccccccccCccceeEeeeeeeEEeechhhHHH--hhcchhhHHHHHHHHHHHHHH-HHHhhceEeccC Confidence 222 455664444556666789999999998887775553333 444557789999999999999 799999999999 Q ss_pred CCccccchhhhhhhhhhhhhhhh----------hccCCC--CCHHHHHh-------hh---ceeeecC--CCceEEEecc Q lcl|Aclame:pro 239 TNGFKSIDKEADVKKIKKITTKA----------KSAGKT--PFADAIEE-------AV---DFVRPTA--GRRYLIVKTE 294 (393) Q Consensus 239 ~~~t~~~~~e~D~~~ik~it~~a----------t~~~~t--~~~dal~E-------al---d~a~~~a--~~~~l~i~~~ 294 (393) ... ..-+..+......-.+.+ +..++. .+.+.+.. ++ +..+|.. ++-+.++++. T Consensus 204 ~~~--P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~ 281 (377) T protein:vir:96 204 LLQ--PVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPE 281 (377) T ss_pred CCc--ceeeeeccccccccccccccccceeeccccccccccCChhHHHHHHHHHHHhhccccccccccccCceEEEEchh Confidence 742 111111110000000000 001111 12222222 22 1222222 2345667766 Q ss_pred hhhhHhhhhhccccccceeeecCCCeEEEeeecccchhhcccchhce---eeeeccccee--cccc--eeeecceeEeec Q lcl|Aclame:pro 295 DRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKAVKP---TVLVDQKYHI--DMQD--LTKVDAFEWKTN 367 (393) Q Consensus 295 d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~v~~~~~~~~tG~k~~~p---tv~vD~k~~~--~~~~--~~~~~s~~~~~n 367 (393) +...++ .+....+.+|.|+.-+|.+-..+- +..| | .++-|-+.|+ +-+| +...+...+... 
T Consensus 282 t~~~~~---------~~~~~~~~~G~~~~~l~~p~~v~~--s~~~-p~~~i~fgdf~~Y~i~~r~~~~i~~~~~~~~~~d 349 (377) T protein:vir:96 282 DRWTLE---------AKFTSRNQFGEYVTVLPHGITILE--SLAV-ETGKAIAFVANRYDAFMATASTIEEYDQTFAMED 349 (377) T ss_pred hHHhcc---------ccccccCCCCCceeccCCCceEEe--cCCC-CcccEEEEEcCcEEEEEecccEEEeehhhhhhcC Confidence 655442 122344556777765554422221 1111 2 4455555554 3333 334444555566 Q ss_pred CceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 368 SNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 368 s~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) +-.+....+.-|.+.-+++-.|.+|+ T Consensus 350 ~~~f~~~~r~dG~~~d~~a~~vl~l~ 375 (377) T protein:vir:96 350 LQLYLTKNYFYGKAKDNHTAALLTLA 375 (377) T ss_pred CeEEEEEEEEcCEEecCCcEEEEEEe Confidence 77788888999999999999999999 No 66 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=99.22 E-value=5.1e-12 Score=82.50 Aligned_cols=371 Identities=14% Similarity=0.115 Sum_probs=168.9 Q ss_pred CC-cchhhHHHHHH---------------HHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MN-KPDLIEKQNRL---------------AELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELN 64 (393) Q Consensus 1 ~~-k~d~~ekq~eL---------------a~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~ 64 (393) |. +++++.+..++ +|+++.+.+...+++.++- ..+...+.+++.+.+.++++++...++++. 
T Consensus 1 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~aE~~~~~~~~~~~~~~l~~--~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~ 78 (497) T protein:vir:10 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQA--EVEAHERAQEMLKSLGGADAAKDGLDNDIP 78 (497) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 43 33444443333 2333333333333333322 111112223333334444444433222222 Q ss_pred hhh--------hhhcchhHHHHHHHHHHHHHHHHHHH---HHccCC-------hhHHHHHHHHHH------hCcc-chhh Q lcl|Aclame:pro 65 AQE--------EKPKGKDKMTNFIESQNAVTEFFDVL---KKNSGK-------SEIKNAWNAKLA------ENGV-TITD 119 (393) Q Consensus 65 ~~~--------Ek~K~k~emtEfLkTkqA~~dya~ll---~~nqg~-------ke~k~AW~a~L~------ekgV-~~qd 119 (393) ..+ +..+....+.+-++........-... ..+... .+...+...-+. .... +..+ T Consensus 79 ~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (497) T protein:vir:10 79 EVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGT 158 (497) T ss_pred HHHhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcc Confidence 211 11111112222222111111110000 000000 111211111111 1111 1111 Q ss_pred hHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEeeccc---cccceecccchhhhhhhhhhhhhccHHHHHHHH Q lcl|Aclame:pro 120 TTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDSS---NEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQ 196 (393) Q Consensus 120 ~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~na---~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq 196 (393) --..+|..+..-|-..++++..|.+.+.+.......+.+-.++. ..+|. --|.++.+...+|...++.|..|+-+. 
T Consensus 159 gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv-~E~~~~~~s~~~f~~i~~~~~k~a~~~ 237 (497) T protein:vir:10 159 FAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAV-AEAGTYPFSSEEFARVYEQVGKVANAL 237 (497) T ss_pred cccccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceee-ccCcccccccccceeeEeeeeeeEeec Confidence 12268887777777777878888776555444444445444432 23343 356688888999999999999888887 Q ss_pred HHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhh-----------------h---- Q lcl|Aclame:pro 197 SLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKI-----------------K---- 255 (393) Q Consensus 197 ~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~i-----------------k---- 255 (393) .+.+.+-+ + ++ .+.+||.++|+..+- +.++.+++.|||+.....+--......+ . T Consensus 238 ~iS~ell~-d-~~-~l~~~i~~~l~~~i~-~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (497) T protein:vir:10 238 TITDEGLR-D-AP-ELFNFVQGRLLEGIQ-RKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPAD 313 (497) T ss_pred HhHHHHHH-h-HH-HHHHHHHHHHHHHHH-HHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcc Confidence 77444333 2 23 489999999999999 7999999999998431111100000000 0 Q ss_pred -------------hhhhhh--------------hccCCCCCHHHHHhhh-ceeeecC-CCceEEEecchhhhHhhhhhcc Q lcl|Aclame:pro 256 -------------KITTKA--------------KSAGKTPFADAIEEAV-DFVRPTA-GRRYLIVKTEDRKALLDELRQA 306 (393) Q Consensus 256 -------------~it~~a--------------t~~~~t~~~dal~Eal-d~a~~~a-~~~~l~i~~~d~~a~~~~~~~~ 306 (393) .+...+ .........+.+.-++ .+.+.-. 
....+++|+.|..+|+- T Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~----- 388 (497) T protein:vir:10 314 GTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRL----- 388 (497) T ss_pred cccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHH----- Confidence 000000 0000001111122222 1111111 11257888888888877 Q ss_pred ccccceeeecCCCeEEEee----------ecccchhhcccchhce--------eeeeccc--ce-e-cccceeeecc--- Q lcl|Aclame:pro 307 TANANVRIKNDDTEIASEV----------GVDEIIVYTGSKAVKP--------TVLVDQK--YH-I-DMQDLTKVDA--- 361 (393) Q Consensus 307 ~~~a~~~l~~~~~~~~~~v----------~~~~~~~~tG~k~~~p--------tv~vD~k--~~-~-~~~~~~~~~s--- 361 (393) |+|.+|.|-+.= +.+...+ | ..|+- .++-|-+ ++ | +-.|++-..+ T Consensus 389 -------lkd~~G~~i~~~~~~~~~~~~~~~~~~l~--G-~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~ 458 (497) T protein:vir:10 389 -------TKDANGQYMGGNFFGNAYGNPVNGGKNIW--G-VPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSN 458 (497) T ss_pred -------hhcCCCceeccCcccccccccccCCceee--c-eeeEecCCCCCCceEEeecccceEEEEEecccEEEeeccc Confidence 899998875421 1111111 2 11110 0122221 11 1 2233322111 Q ss_pred -eeEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 362 -FEWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 362 -~~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) .-|.+|+=.|.++....|-|.-|.+=+++++. 
T Consensus 459 ~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~ 491 (497) T protein:vir:10 459 GTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLK 491 (497) T ss_pred chhhhcCcEEEEEEEeecceeeccccEEEEEec Confidence 22556666677778889999988888888887 No 67 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=99.22 E-value=5.1e-12 Score=82.50 Aligned_cols=371 Identities=14% Similarity=0.115 Sum_probs=168.9 Q ss_pred CC-cchhhHHHHHH---------------HHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MN-KPDLIEKQNRL---------------AELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELN 64 (393) Q Consensus 1 ~~-k~d~~ekq~eL---------------a~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~ 64 (393) |. +++++.+..++ +|+++.+.+...+++.++- ..+...+.+++.+.+.++++++...++++. T Consensus 1 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~aE~~~~~~~~~~~~~~l~~--~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~ 78 (497) T protein:vir:78 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQA--EVEAHERAQEMLKSLGGADAAKDGLDNDIP 78 (497) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 43 33444443333 2333333333333333322 111112223333334444444433222222 Q ss_pred hhh--------hhhcchhHHHHHHHHHHHHHHHHHHH---HHccCC-------hhHHHHHHHHHH------hCcc-chhh Q lcl|Aclame:pro 65 AQE--------EKPKGKDKMTNFIESQNAVTEFFDVL---KKNSGK-------SEIKNAWNAKLA------ENGV-TITD 119 (393) Q Consensus 65 ~~~--------Ek~K~k~emtEfLkTkqA~~dya~ll---~~nqg~-------ke~k~AW~a~L~------ekgV-~~qd 119 (393) ..+ +..+....+.+-++........-... ..+... .+...+...-+. .... 
+..+ T Consensus 79 ~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (497) T protein:vir:78 79 EVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGT 158 (497) T ss_pred HHHhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcc Confidence 211 11111112222222111111110000 000000 111211111111 1111 1111 Q ss_pred hHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEeeccc---cccceecccchhhhhhhhhhhhhccHHHHHHHH Q lcl|Aclame:pro 120 TTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDSS---NEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQ 196 (393) Q Consensus 120 ~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~na---~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq 196 (393) --..+|..+..-|-..++++..|.+.+.+.......+.+-.++. ..+|. --|.++.+...+|...++.|..|+-+. T Consensus 159 gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv-~E~~~~~~s~~~f~~i~~~~~k~a~~~ 237 (497) T protein:vir:78 159 FAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAV-AEAGTYPFSSEEFARVYEQVGKVANAL 237 (497) T ss_pred cccccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceee-ccCcccccccccceeeEeeeeeeEeec Confidence 12268887777777777878888776555444444445444432 23343 356688888999999999999888887 Q ss_pred HHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhh-----------------h---- Q lcl|Aclame:pro 197 SLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKI-----------------K---- 255 (393) Q Consensus 197 ~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~i-----------------k---- 255 (393) .+.+.+-+ + ++ .+.+||.++|+..+- +.++.+++.|||+.....+--......+ . 
T Consensus 238 ~iS~ell~-d-~~-~l~~~i~~~l~~~i~-~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (497) T protein:vir:78 238 TITDEGLR-D-AP-ELFNFVQGRLLEGIQ-RKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPAD 313 (497) T ss_pred HhHHHHHH-h-HH-HHHHHHHHHHHHHHH-HHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcc Confidence 77444333 2 23 489999999999999 7999999999998431111100000000 0 Q ss_pred -------------hhhhhh--------------hccCCCCCHHHHHhhh-ceeeecC-CCceEEEecchhhhHhhhhhcc Q lcl|Aclame:pro 256 -------------KITTKA--------------KSAGKTPFADAIEEAV-DFVRPTA-GRRYLIVKTEDRKALLDELRQA 306 (393) Q Consensus 256 -------------~it~~a--------------t~~~~t~~~dal~Eal-d~a~~~a-~~~~l~i~~~d~~a~~~~~~~~ 306 (393) .+...+ .........+.+.-++ .+.+.-. ....+++|+.|..+|+- T Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~----- 388 (497) T protein:vir:78 314 GTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRL----- 388 (497) T ss_pred cccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHH----- Confidence 000000 0000001111122222 1111111 11257888888888877 Q ss_pred ccccceeeecCCCeEEEee----------ecccchhhcccchhce--------eeeeccc--ce-e-cccceeeecc--- Q lcl|Aclame:pro 307 TANANVRIKNDDTEIASEV----------GVDEIIVYTGSKAVKP--------TVLVDQK--YH-I-DMQDLTKVDA--- 361 (393) Q Consensus 307 ~~~a~~~l~~~~~~~~~~v----------~~~~~~~~tG~k~~~p--------tv~vD~k--~~-~-~~~~~~~~~s--- 361 (393) |+|.+|.|-+.= +.+...+ | ..|+- .++-|-+ ++ | +-.|++-..+ T Consensus 389 -------lkd~~G~~i~~~~~~~~~~~~~~~~~~l~--G-~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~ 458 (497) T protein:vir:78 389 -------TKDANGQYMGGNFFGNAYGNPVNGGKNIW--G-VPVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNSN 458 (497) T ss_pred -------hhcCCCceeccCcccccccccccCCceee--c-eeeEecCCCCCCceEEeecccceEEEEEecccEEEeeccc Confidence 899998875421 1111111 2 11110 0122221 11 1 2233322111 Q ss_pred -eeEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 362 -FEWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 362 
-~~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) .-|.+|+=.|.++....|-|.-|.+=+++++. T Consensus 459 ~~~f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~ 491 (497) T protein:vir:78 459 GTDFVDGKVTVRAEERLGLLVYRPSAFQLIQLK 491 (497) T ss_pred chhhhcCcEEEEEEEeecceeeccccEEEEEec Confidence 22556666677778889999988888888887 No 68 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=99.18 E-value=3.6e-13 Score=88.78 Aligned_cols=276 Identities=14% Similarity=0.098 Sum_probs=164.0 Q ss_pred HHHccCCh-hHHHHHHHHHHhC------ccchhh-hHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEeecc-c Q lcl|Aclame:pro 93 LKKNSGKS-EIKNAWNAKLAEN------GVTITD-TTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-S 163 (393) Q Consensus 93 l~~nqg~k-e~k~AW~a~L~ek------gV~~qd-~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a 163 (393) |++.+-.+ +++ .|...+.+. .+...+ -.-.+|..+..-|-..+.++.++++.+.+.+.+.....+-..+ . T Consensus 1 ~~~~~~~~~~~~-~f~~~~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~ 79 (324) T protein:vir:93 1 MEQTQKLKLNLQ-HFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADK 79 (324) T ss_pred CchhHHHHHHHH-HHHHhhhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecC Confidence 22222222 222 222222211 111112 1226899999988888898899988777666655444443333 3 Q ss_pred cccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccc Q lcl|Aclame:pro 164 NEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFK 243 (393) Q Consensus 164 ~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~ 243 (393) ..+.-+.-|.++.+...+|...++.|..++....+.+-+.+ .+.-++.+|++++|++++- |+++.+++.|||.+..- T Consensus 80 ~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~l~~aia-~~~d~a~l~G~g~~~~~ 156 (324) T protein:vir:93 80 PGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLN--YTYSQFFEEMKPMIAEAFY-KKFDEAGILNQGNNPFG 156 (324) T ss_pred 
cceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHh--cchHHHHHHHHHHHHHHHH-HHHHHHHhcCCCCCCcC Confidence 34444567888899999999999999998888777443333 2235779999999999999 79999999999975321 Q ss_pred cchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhhhhccccccceeeecCCCeEE Q lcl|Aclame:pro 244 SIDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIA 322 (393) Q Consensus 244 ~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~ 322 (393) . .... ......+....+...+++..++ .+.........+++++.++.+|+. +++.+|.+- T Consensus 157 ~--~~~~-----~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~------------l~d~~G~~~ 217 (324) T protein:vir:93 157 K--SIAQ-----SIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRK------------IVDPETKER 217 (324) T ss_pred c--cccc-----cccccceeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHH------------hhCCCCCee Confidence 1 1111 1111223334566778888877 333334455678899999998876 566666655 Q ss_pred Eeeecccchhhcccchhc---------eeeeeccccee--cccceeeecce----------------eEeecCceEEEee Q lcl|Aclame:pro 323 SEVGVDEIIVYTGSKAVK---------PTVLVDQKYHI--DMQDLTKVDAF----------------EWKTNSNMILVET 375 (393) Q Consensus 323 ~~v~~~~~~~~tG~k~~~---------ptv~vD~k~~~--~~~~~~~~~s~----------------~~~~ns~~i~~~~ 375 (393) +.-+.+.... |-..+. +.++.|-.+++ +-+|++---+. -|..|+-.|.++. 
T Consensus 218 ~~~~~~~~l~--G~PVv~~~~~~~~~~~i~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~ 295 (324) T protein:vir:93 218 IYDRNSDSLD--GLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATM 295 (324) T ss_pred ecCCCCCccc--ceeeEeecCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEE Confidence 4433332222 311110 12233433332 22222111111 1566777788888 Q ss_pred eecccccccCcceeeeeC Q lcl|Aclame:pro 376 LTSGHVETYNAGAVITVS 393 (393) Q Consensus 376 ~~~g~~~~~n~~~~~~v~ 393 (393) +..+.+.-|++-++++-+ T Consensus 296 r~d~~v~~~~a~~~l~~a 313 (324) T protein:vir:93 296 HVALHIADDKAFAKLVPA 313 (324) T ss_pred EeccEEecccceEEEecc Confidence 899999999887777765 No 69 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=99.18 E-value=4e-12 Score=83.09 Aligned_cols=341 Identities=16% Similarity=0.175 Sum_probs=174.9 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcchhHHHHHH Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMTNFI 80 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~Ek~K~k~emtEfL 80 (393) |... +.+|.++++++..+...++.-. ..+|.++..+++... .+++..... 
..++.+.+ T Consensus 1 M~i~-----~k~~~~~~~~~~~l~~~~~~~~---------~~ee~~~~~~~~~~~---~~~~~~~~~-----~~e~~~~~ 58 (377) T protein:vir:98 1 MAIN-----LKELPKYREAVAELSAKISAGA---------TSEEQEKLFEAAFTT---MGDEILAKN-----EEEMERMF 58 (377) T ss_pred CCCc-----HHHHHHHHHHHHHHHHHHHhhh---------hhHHHHHHHHHHHHh---HHHHHHHHH-----HHHHHHHH Confidence 6652 2335555555555544433211 011112222111111 111111100 11111111 Q ss_pred HHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEee Q lcl|Aclame:pro 81 ESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSF 160 (393) Q Consensus 81 kTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l 160 (393) ..+ ......+.+-++.|++-+...+. .|....+|..++..|-..+.+++++++...|.+..... .+-. T Consensus 59 ~~~---------~~~~~lt~ee~~~~~~~~~~~~~--~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~~-~~~~ 126 (377) T protein:vir:98 59 DLR---------DKNRELTAEEIKFFNDIDKNVGG--KDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSLRL-KALT 126 (377) T ss_pred Hhc---------cCCcccCHHHHHHHHHHHhccCC--CCCccccCHHHHHHHHHHHHHhhhhhhheeeEecCcce-EEEE Confidence 000 01122234566777766554444 44455899999999999999999999988777776442 3333 Q ss_pred cc-c-cccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccC Q lcl|Aclame:pro 161 DS-S-NEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDG 238 (393) Q Consensus 161 ~n-a-~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG 238 (393) .+ . 
...||--.+..++.....|...++.+..+|-+-.+..-+ |+.++-++-.|++++|+..|- ++.+.|+++||| T Consensus 127 ~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~a~~~is~el--L~ds~~~ie~~i~~~la~~~a-~~~~~a~i~G~G 203 (377) T protein:vir:98 127 AETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDA--LKFGPKWIKQFITEQLKEAIA-VALELAIVKGDG 203 (377) T ss_pred ecCCcceeEeecccccCcccCccceeEeecceeEEeeecccHHh--hhccHhHHHHHHHHHHHHHHH-HHHhhceEeccC Confidence 33 3 345655445555566778999999998888775553322 444456789999999999999 799999999999 Q ss_pred CCccccchhhhhhhhhhhhhhhhhccCCC-CCHHHHHhhhceeeecCCCceEEEecchhhhHhhhhhccccccceeeecC Q lcl|Aclame:pro 239 TNGFKSIDKEADVKKIKKITTKAKSAGKT-PFADAIEEAVDFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKND 317 (393) Q Consensus 239 ~~~t~~~~~e~D~~~ik~it~~at~~~~t-~~~dal~Eald~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~ 317 (393) +.. ..-+..++.....-.+++...+.. ++.+++.. +.++- +...++...+-+...|.++-.+|++. T Consensus 204 ~~q--P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~-l~~~~----------~~~~~~~a~~~m~~~t~~~~~klkd~ 270 (377) T protein:vir:98 204 LLQ--PVGLLKDLSQPTVDQSTGRDITTYKTDKEAIAD-LSDLT----------PDNAPKKLVPVMKHLSVNDKKRPLKI 270 (377) T ss_pred CCc--ceeeeecccccccccccccccccccchhhhHhh-hhhhc----------hhHHHHHHHHHHHHHHHHHHhhhhcc Confidence 742 111111111000000111111111 11222222 22222 22233334444455555666667777 Q ss_pred CCeEEEeeeccc------------------chhhcc-------cchh--ceeeeeccccee--ccccee--eecceeEee Q lcl|Aclame:pro 318 DTEIASEVGVDE------------------IIVYTG-------SKAV--KPTVLVDQKYHI--DMQDLT--KVDAFEWKT 366 (393) Q Consensus 318 ~~~~~~~v~~~~------------------~~~~tG-------~k~~--~ptv~vD~k~~~--~~~~~~--~~~s~~~~~ 366 (393) +|++..-+..++ ..+ | +..| ...++.|-++|+ +-+|++ ..+...+.. 
T Consensus 271 ~G~~i~~~n~~~~~~~~p~~~~~~~~G~~~t~l--g~p~~vv~s~~~p~~~i~fgdf~~Y~i~~r~~~~i~~~~~~~~~~ 348 (377) T protein:vir:98 271 AGQVKLILNPEDRWALEAQFTSRNQFGEYVTVL--PHGITILESLAVETGKAIAFVANRYDAFMATASTIEEYDQTFAME 348 (377) T ss_pred CCceEEEecccchhhccccccccCCCCcccccc--CCCceEEecCCCCcccEEEEEecceeEEeecceEEEeechhhhhc Confidence 777665322110 111 1 1111 013344444444 222322 222233334 Q ss_pred cCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 367 NSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 367 ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) .+-.+.+..+.-|.+.-+++-.+++|+ T Consensus 349 d~~~f~~~~r~dg~~~~~~a~~vl~i~ 375 (377) T protein:vir:98 349 DLQLYLTKNYFYGKAKDNHTAALLTLA 375 (377) T ss_pred CceEEEEEEEEcCEEeccCcEEEEEEe Confidence 455567777899999999999999999 No 70 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=99.17 E-value=3.3e-13 Score=89.03 Aligned_cols=277 Identities=14% Similarity=0.086 Sum_probs=163.2 Q ss_pred HHHHHHHHhhhhhhcchhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhC------ccc-hhhhHhhcchhHH Q lcl|Aclame:pro 57 IKIENELNAQEEKPKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAEN------GVT-ITDTTFQLPRKLV 129 (393) Q Consensus 57 ~k~enel~~~~Ek~K~k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ek------gV~-~qd~~eiLP~~ii 129 (393) || +.+-.+.--+.|...+... .+. .++..-.+|..+. 
T Consensus 1 ~~------------------------------------k~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~lip~~~~ 44 (324) T protein:vir:99 1 ME------------------------------------QTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFT 44 (324) T ss_pred CC------------------------------------CchHhhHHHHHHHHHhhhhhhccccceeccCCCcceechhHH Confidence 11 1111111111122222111 111 1122226899998 Q ss_pred HHHHHHHHhhCccccceeeecccceeEEEeecc-ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCc Q lcl|Aclame:pro 130 ESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-SNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMS 208 (393) Q Consensus 130 ~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ 208 (393) .-|-..++++.++++.+++...+.....+--.+ ...+.-..-|.++.+...+|...++.|..++...++.+-+.+ .+ T Consensus 45 ~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~--ds 122 (324) T protein:vir:99 45 TPILQEVMENSKIMRLGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLN--YT 122 (324) T ss_pred HHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHh--cc Confidence 888888898999988877776665554443333 344555567888999999999999999988888777443332 22 Q ss_pred hhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhceeee-cCCCc Q lcl|Aclame:pro 209 YSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRP-TAGRR 287 (393) Q Consensus 209 ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald~a~~-~a~~~ 287 (393) .-++.+|+.++|++++. |+++.+++.|||.+..- .+-...+ ....+..+.+...+++..++.-..+ ..... 
T Consensus 123 ~~~l~~~i~~~l~~ai~-~~~d~~~l~G~g~~~~~----~~~~~~~---~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~ 194 (324) T protein:vir:99 123 YSQFFEEMKPMIAEAFY-KKFDEAGILNQGNNPFG----KSIAQSI---EKTNKVIKGDFTQDNIIDLEALLEDDELEAN 194 (324) T ss_pred hHHHHHHHHHHHHHHHH-HHHHHHhhhcCCCCccC----ccccccc---cccceeccccCCHHHHHHHHHhhhhccCCCC Confidence 35789999999999999 79999999999985311 1111111 1122334456677888888733333 34555 Q ss_pred eEEEecchhhhHhhhhhccccccceeeecCCCeEEEeeecccchhhcccchhc---------eeeeeccccee--cccce Q lcl|Aclame:pro 288 YLIVKTEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKAVK---------PTVLVDQKYHI--DMQDL 356 (393) Q Consensus 288 ~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~v~~~~~~~~tG~k~~~---------ptv~vD~k~~~--~~~~~ 356 (393) .+++++.++..|+. |++.+|.+.+.-+.++... |--.+. +.++.|-++++ .-+|+ T Consensus 195 ~~v~n~~~~~~L~~------------l~d~~g~~~~~~~~~~~l~--G~PVv~~~~~~~~~~~~i~gd~~~~~~~~~~~~ 260 (324) T protein:vir:99 195 AFISKTQNRSLLRK------------IVDPETKERIYDRNSDTLD--GLPVVNLKSSNLKRGELITGDFDKLIYGIPQLI 260 (324) T ss_pred EEEEcHHHHHHHHH------------hhcCCCceeecCCCCcccc--ceeEEeecCCCCCcceEEEEecccEEEEEecCc Confidence 78999999888875 5666666555444333322 321110 12333443332 12222 Q ss_pred eeecce----------------eEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 357 TKVDAF----------------EWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 357 ~~~~s~----------------~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) +---+. 
-|..|+-.+.++.+..+.+..+++-++++.+ T Consensus 261 ~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a 313 (324) T protein:vir:99 261 EYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPA 313 (324) T ss_pred EEEEeecccccccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEec Confidence 110000 0445566667778888999988888887777 No 71 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=99.15 E-value=1.6e-11 Score=79.73 Aligned_cols=330 Identities=13% Similarity=0.092 Sum_probs=160.4 Q ss_pred hhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhh-hcchhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHH Q lcl|Aclame:pro 32 VKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEK-PKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKL 110 (393) Q Consensus 32 v~~a~~~~skieelektis~l~aEi~k~enel~~~~Ek-~K~k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L 110 (393) +|+..+...+++++++.++.+..++++.+.++....+. .+......+--+..++..+| +........+.+.-.... T Consensus 1 ~eei~~l~~~~~~l~~~~~~l~~~~d~~e~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~r~~~~~~~~~~~~~~~~ 77 (352) T protein:vir:78 1 MEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQSLNDNEKLVKAKAEF---YRHAILPNEFEKPSMEAQ 77 (352) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccchhhhHHHHHHHH---HHHHhhhhHHHHHHhhHH Confidence 44433444445555555555555555444443332211 00111111111111111111 111111112222111111 Q ss_pred HhC---cc-chhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEE-Eeeccccccceecccchhhhhhhhhhhh Q lcl|Aclame:pro 111 AEN---GV-TITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVS-RSFDSSNEAQVHKDGQTKTEQAATLTID 185 (393) Q Consensus 111 ~ek---gV-~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~-i~l~na~~a~GHk~ga~Kk~q~~~le~~ 185 (393) ... +. ...+--.++|..+..-|-..+.++.++.+...|.+....... +...+....|. .-|+.+++...+|... 
T Consensus 78 ~~~~al~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~~~~~~p~~~~~~~~a~~v-~E~~~~~~~~~~f~~v 156 (352) T protein:vir:78 78 RLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGLEIPRVSYTLDDDDFI-TDVETAKELKLKGDTV 156 (352) T ss_pred HHHHHhccCCCCCCceeccHhHHHHHHHHHHhhcchhhheeeEecCCceEEEEecCCCccccc-ccccccccccccceee Confidence 111 11 122333479999888888889999999887777776544322 22222344454 3466788888999999 Q ss_pred hccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHH-HhcceeeccCCCccccchhhhhhhhhhhhhhhhhcc Q lcl|Aclame:pro 186 TLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKI-VDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSA 264 (393) Q Consensus 186 ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Ra-v~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~ 264 (393) ++.|..++-...+.+.+ |+.+..++.+|++++|++.+. ++ ...++..|||++... ..... ..+ +.+ T Consensus 157 ~~~~~k~~~~i~is~el--l~Ds~~~l~~~i~~~la~~~~-~~e~~~~~~~g~g~~~~~--g~l~~-~~~-------~~~ 223 (352) T protein:vir:78 157 KFTTNKFKVFAAISDTV--IHGSDVDLVNWVENALQSGLA-AKERKDALAVSPKSGLEH--MSFYN-GSV-------KEV 223 (352) T ss_pred eecceeEEeechhhHHH--HhhhhHHHHHHHHHHHHHHHH-HHHHHhhhhcCCCCcccc--cceec-ccc-------ccc Confidence 99987666655552222 233335789999999999998 45 344667777764311 11111 111 122 Q ss_pred CCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhhhhccccccceeeecCCCeEEEeeecccchhhcccc-----h Q lcl|Aclame:pro 265 GKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSK-----A 338 (393) Q Consensus 265 ~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~v~~~~~~~~tG~k-----~ 338 (393) +++...|+|..++ +....-..+...++++.+..+++.-+.. +|..-. .|..+..+ |-. . 
T Consensus 224 t~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~l~~~~~~------------~~~~~~-~~~~~~ll--G~PV~~~~~ 288 (352) T protein:vir:78 224 EGANMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSN------------GTTNFF-DTPAEKVF--GKPVVFTDA 288 (352) T ss_pred cccchHHHHHHHHhccChhhhcCCEEEEehHHHHHHHHHHhc------------cCCccc-ccCCcccc--ccceEEecC Confidence 3344578888877 4333333456678888888887763332 222111 12222222 311 1 Q ss_pred hceeeeeccccee-cccceeeecceeEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 339 VKPTVLVDQKYHI-DMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 339 ~~ptv~vD~k~~~-~~~~~~~~~s~~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) +.|.++=|=+++. ..+++..-......++.-.+....+..|-+--|++=.+.+++ T Consensus 289 ~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~g~~~f~~~~r~Dg~~~~~eA~~~l~~~ 344 (352) T protein:vir:78 289 AVKPIVGDFNYFGINYDGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKAK 344 (352) T ss_pred CCceeEeehhhhhhhhhhheeeeeccccCCeeEEEEEeeeCceeechhheEEEEee Confidence 1222333333322 111211100011112233445567778888888877777776 No 72 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=99.13 E-value=3.6e-11 Score=77.86 Aligned_cols=372 Identities=13% Similarity=0.107 Sum_probs=153.8 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhh-h-HHhh---hhHHHHHHHHHHHHHHHHHHHH---H---HHHhhh-- Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVK-N-AIED---LPKVQELEKTLSENSIEIIKIE---N---ELNAQE-- 67 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~-~-a~~~---~skieelektis~l~aEi~k~e---n---el~~~~-- 67 (393) |+| .++|....|.+|+++...+..+++++-=+ + +..+ -.+.++.+..+.++.++|.+.+ . +++... 
T Consensus 1 ~~k-~~eem~~~i~eL~e~r~~l~~e~~~l~d~ak~e~~~~~~~~e~~e~~a~~~el~~ei~~le~~~~~~~~~~~~~~~ 79 (477) T protein:vir:84 1 MEK-HLEELRALRAAAVEAVATLKAERQAIADGAKAEERAALSADETAEFRAKSASIKAELDKVEDLDEQIRELESEIER 79 (477) T ss_pred Cch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 776 56677777888888888877666654211 0 1000 1122233333333333332111 1 111100 Q ss_pred -------hhh-------------cchhHHHHHHHHH----------HHHHHHHHHHHHccCChhHHHHHHHHHHhCccch Q lcl|Aclame:pro 68 -------EKP-------------KGKDKMTNFIESQ----------NAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTI 117 (393) Q Consensus 68 -------Ek~-------------K~k~emtEfLkTk----------qA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~ 117 (393) +++ -......+|++.. .+.....+.+....+..+...........+...+ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (477) T protein:vir:84 80 SGKLEAETKTVRKATVEVNEALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYRDLDR 159 (477) T ss_pred hhcchhhhhhhcccccccccchhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhccccc Confidence 000 0000111222111 1111111111111111122221111112222222 Q ss_pred hhhH--hhcchh-HHHHHHHHHHhhCccccceeeeccccee--EEEee-cc-ccccceecccc-----hhhhhhhhhhhh Q lcl|Aclame:pro 118 TDTT--FQLPRK-LVESINTALLNTNPVFKVFHVTNVGALL--VSRSF-DS-SNEAQVHKDGQ-----TKTEQAATLTID 185 (393) Q Consensus 118 qd~~--eiLP~~-ii~AIe~A~ed~d~vl~~fhV~n~~~~a--~~i~l-~n-a~~a~GHk~ga-----~Kk~q~~~le~~ 185 (393) .+.. .++|.. +..-|-+.++++..+.+.+.+-..+... +.+-. ++ ...++.+--|+ .|.+...+|... 
T Consensus 160 ~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~f~~i 239 (477) T protein:vir:84 160 NGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLTDGFV 239 (477) T ss_pred cCCCcceeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccCcccccccccccccceeeE Confidence 2221 134443 3455666777666666644544333332 22222 22 34555555554 456677889999 Q ss_pred hccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccC Q lcl|Aclame:pro 186 TLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAG 265 (393) Q Consensus 186 ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~ 265 (393) ++.|..++-+..+-+.+-+-.+ -++.+|+.++|+.++- +.++.+++.|||+... .--+... ..+..++. +..+ T Consensus 240 ~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~~~~-~~~d~~~l~G~Gt~~~-p~Gi~~~-~~~~~~~~--~~~~ 312 (477) T protein:vir:84 240 QANVKTIAGQQGIAIQLLDQAA--VSVDEFVFRDLAADYA-NKLNVQVISGTGSNNQ-VVGVRAT-AGITQVTA--TSAG 312 (477) T ss_pred EEeeeeEEeeeHHHHHHHhccc--hhHHHHHHHHHHHHHH-HHHHHHHhccCCCCCc-cceeeec-cccccccc--cccc Confidence 9999888777666333322222 3578999999999999 7999999999997431 0000000 00111111 1111 Q ss_pred CC-CCH----HHHHhhhceeeecC--CCceEEEecchhhhHhhhhhccccccceeeecCCCeEEEeeec---ccchhhc- Q lcl|Aclame:pro 266 KT-PFA----DAIEEAVDFVRPTA--GRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASEVGV---DEIIVYT- 334 (393) Q Consensus 266 ~t-~~~----dal~Eald~a~~~a--~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~v~~---~~~~~~t- 334 (393) .| ... +.+..++.-..+.. ....+++|..+.++|+. |++.+|.+-+.... 
+...+.+ T Consensus 313 ~t~~~~~~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~------------lkd~~G~~l~~~~~~~~~~~~~~~~ 380 (477) T protein:vir:84 313 SALEKHQIIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHA------------IFAGDDRPLIVPSGPGFNNLGVLTE 380 (477) T ss_pred cchhhHHHHHHHHHHHHhhccccccCCccEEEEcHHHHHHHHH------------hhccCCCeeeecCcccccccccccc Confidence 11 112 22333332222221 22367888888888887 88888876543211 1111000 Q ss_pred -------ccchhceeeeecccc-----------ee--ccc-------ceeeecceeEeecCceEEEe--eeecccccccC Q lcl|Aclame:pro 335 -------GSKAVKPTVLVDQKY-----------HI--DMQ-------DLTKVDAFEWKTNSNMILVE--TLTSGHVETYN 385 (393) Q Consensus 335 -------G~k~~~ptv~vD~k~-----------~~--~~~-------~~~~~~s~~~~~ns~~i~~~--~~~~g~~~~~n 385 (393) |.=.=+|.+..+.-. .+ +.+ |+....+..--...+...++ .|...-.. .. T Consensus 381 ~~~~~~~~~l~G~pVv~s~~~p~~~~~~~d~~~i~~gd~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~-r~ 459 (477) T protein:vir:84 381 VASQRVVGQMHGLPVVTDPTLPTTLGTGTDQDVIHVLRASDLALFESSVRMRALQETRAENLSVLLQVYGYLAFTAA-RF 459 (477) T ss_pred cccccccchhcccceEecCcccccccccCCcceEEEEEeceEEEEeeceeEEeccccccccceeeeeehhhhhhhhh-cc Confidence 000012333222110 00 222 22222222222222222221 11221111 11 Q ss_pred cceeeeeC Q lcl|Aclame:pro 386 AGAVITVS 393 (393) Q Consensus 386 ~~~~~~v~ 393 (393) ..+++.+. T Consensus 460 ~~afv~~t 467 (477) T protein:vir:84 460 PQSVVEIG 467 (477) T ss_pred ccceEEee Confidence 22223333 No 73 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=99.12 E-value=1e-12 Score=86.33 Aligned_cols=277 Identities=14% Similarity=0.084 Sum_probs=164.6 Q ss_pred HHHccCCh-hHHHHHHHHHH-----hCccc-hhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEeecc-cc Q lcl|Aclame:pro 93 LKKNSGKS-EIKNAWNAKLA-----ENGVT-ITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-SN 164 (393) Q Consensus 93 l~~nqg~k-e~k~AW~a~L~-----ekgV~-~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~ 164 (393) |+.++-.+ +.++-|...+. ..++. 
.++.-..+|..+..-|-..++++.++++.+++.+.+.....+--.+ .. T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~ 80 (324) T protein:vir:97 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) T ss_pred CccchhHHHHHHHHHHhhhhhhhhccccccccCCCcceechhHHHHHHHHHHhhcchhhhcceeeccCCceEEEEEecCc Confidence 33333322 22222221111 11221 1222337899998888888898899888777777665544443333 34 Q ss_pred ccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCcccc Q lcl|Aclame:pro 165 EAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKS 244 (393) Q Consensus 165 ~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~ 244 (393) .+.-+.-|.++.+...+|...++.|..++....+.+-+.+ .+.-++.+|+.++|++++- |.++.+++.|||.+.. T Consensus 81 ~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~--ds~~~l~~~i~~~l~~aia-~~~d~a~l~G~g~~~~-- 155 (324) T protein:vir:97 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLN--YTYSQFFEEMKPMIAEAFY-KKFDEAGILNQGNNPF-- 155 (324) T ss_pred ceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHh--cchHHHHHHHHHHHHHHHH-HHHHHHhhccCCCCcc-- Confidence 4555567888999999999999999988888777443222 2235779999999999999 7999999999997531 Q ss_pred chhhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhhhhccccccceeeecCCCeEEE Q lcl|Aclame:pro 245 IDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIAS 323 (393) Q Consensus 245 ~~~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~ 323 (393) ....... + .......+.+...+++..+. .+........++++++.++..|+. 
+++.+|.+.+ T Consensus 156 ~~gi~~~--~---~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~------------lkd~~g~~~~ 218 (324) T protein:vir:97 156 GKSIAQS--I---EKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRK------------IVDPETKERI 218 (324) T ss_pred Ccccccc--c---cccceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHH------------hhcCCCceee Confidence 1111110 1 11112334456678888777 343334455588999999888876 6777776665 Q ss_pred eeecccchhhcccchhc---------eeeeeccccee--cccceeeecce----------------eEeecCceEEEeee Q lcl|Aclame:pro 324 EVGVDEIIVYTGSKAVK---------PTVLVDQKYHI--DMQDLTKVDAF----------------EWKTNSNMILVETL 376 (393) Q Consensus 324 ~v~~~~~~~~tG~k~~~---------ptv~vD~k~~~--~~~~~~~~~s~----------------~~~~ns~~i~~~~~ 376 (393) .-+.++... |-..+. ..++.|-+.++ +-+|++---+. -|..|+-.+.++.+ T Consensus 219 ~~~~~~tl~--G~PV~~~~~~~~~~~~~~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r 296 (324) T protein:vir:97 219 YDRNSDTLD--GLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMH 296 (324) T ss_pred cCCCCcccc--ceeeEeecCCCCCcceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEE Confidence 444444333 322111 12223333322 11222110000 14456666777788 Q ss_pred ecccccccCcceeeeeC Q lcl|Aclame:pro 377 TSGHVETYNAGAVITVS 393 (393) Q Consensus 377 ~~g~~~~~n~~~~~~v~ 393 (393) ..|.+..+++-++++.. 
T Consensus 297 ~d~~v~~~~a~~~l~~~ 313 (324) T protein:vir:97 297 VALHIADDKAFAKLVPA 313 (324) T ss_pred eccEEecccceEEEEec Confidence 88888888887777776 No 74 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=99.11 E-value=1.5e-12 Score=85.47 Aligned_cols=277 Identities=14% Similarity=0.080 Sum_probs=161.7 Q ss_pred HHHccCChhHHHHHHHHHHh------Cccc-hhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEeecc-cc Q lcl|Aclame:pro 93 LKKNSGKSEIKNAWNAKLAE------NGVT-ITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-SN 164 (393) Q Consensus 93 l~~nqg~ke~k~AW~a~L~e------kgV~-~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~ 164 (393) |+..+-.+.-.+.|..++.. .++. ..+...++|..+...|-+.++++.++++-+.+.+.+.....+--.+ .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~ 80 (324) T protein:vir:78 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) T ss_pred CCcchhhhHHHHHHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCc Confidence 22222222111222222111 1221 2223337999998888888898898888776666655444443333 33 Q ss_pred ccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCcccc Q lcl|Aclame:pro 165 EAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKS 244 (393) Q Consensus 165 ~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~ 244 (393) .+.-+--|.++.+...+|...++.|..++....+.+-+-+ .+.-++.+|+.++|++++. |+++.+++.|+|.+.... 
T Consensus 81 ~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~--ds~~~l~~~i~~~la~ai~-~~~d~a~l~G~g~~~~~~ 157 (324) T protein:vir:78 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLN--YTYSQFFEEMKPMIAEAFY-KKFDEAGILNQGNNPFGK 157 (324) T ss_pred ceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHh--cchHHHHHHHHHHHHHHHH-HHHHHHHhccCCCCCcCc Confidence 4444566888999999999999999888877777433222 2235789999999999999 899999999999754211 Q ss_pred chhhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhhhhccccccceeeecCCCeEEE Q lcl|Aclame:pro 245 IDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIAS 323 (393) Q Consensus 245 ~~~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~ 323 (393) .... . .....+..+.+...+.+..++ .+.........+++++.++.+|+. +++++|.+.+ T Consensus 158 --gi~~--~---~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~------------l~d~~G~~~~ 218 (324) T protein:vir:78 158 --SIAQ--S---IEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRK------------IVDPETKERI 218 (324) T ss_pred --cccc--c---ccccceeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHH------------hhccCCCeee Confidence 1111 1 111222334566678888777 343334455578999999988876 5555665544 Q ss_pred eeecccchhhcccchhc---------eeeeeccccee--cccceeeecc----------------eeEeecCceEEEeee Q lcl|Aclame:pro 324 EVGVDEIIVYTGSKAVK---------PTVLVDQKYHI--DMQDLTKVDA----------------FEWKTNSNMILVETL 376 (393) Q Consensus 324 ~v~~~~~~~~tG~k~~~---------ptv~vD~k~~~--~~~~~~~~~s----------------~~~~~ns~~i~~~~~ 376 (393) .=+.+.... |-..+. 
+.++.|-++++ .-++++---+ .-|..|+-.+.++.+ T Consensus 219 ~~~~~~~l~--G~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r 296 (324) T protein:vir:78 219 YDRNSDSLD--GLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMH 296 (324) T ss_pred cCCCCCccc--ceeeEeeCCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEE Confidence 323332222 221111 22333443332 1122111000 014566777778888 Q ss_pred ecccccccCcceeeeeC Q lcl|Aclame:pro 377 TSGHVETYNAGAVITVS 393 (393) Q Consensus 377 ~~g~~~~~n~~~~~~v~ 393 (393) ..+.+..|++-++++-+ T Consensus 297 ~d~~v~~~~A~~~l~~a 313 (324) T protein:vir:78 297 VALHIADDKAFAKLVPA 313 (324) T ss_pred EccEEecccceEEEecc Confidence 99999998887777765 No 75 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=99.11 E-value=1.5e-12 Score=85.47 Aligned_cols=277 Identities=14% Similarity=0.080 Sum_probs=161.7 Q ss_pred HHHccCChhHHHHHHHHHHh------Cccc-hhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEeecc-cc Q lcl|Aclame:pro 93 LKKNSGKSEIKNAWNAKLAE------NGVT-ITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-SN 164 (393) Q Consensus 93 l~~nqg~ke~k~AW~a~L~e------kgV~-~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~ 164 (393) |+..+-.+.-.+.|..++.. .++. ..+...++|..+...|-+.++++.++++-+.+.+.+.....+--.+ .. 
T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~ 80 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) T ss_pred CCcchhhhHHHHHHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCc Confidence 22222222111222222111 1221 2223337999998888888898898888776666655444443333 33 Q ss_pred ccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCcccc Q lcl|Aclame:pro 165 EAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKS 244 (393) Q Consensus 165 ~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~ 244 (393) .+.-+--|.++.+...+|...++.|..++....+.+-+-+ .+.-++.+|+.++|++++. |+++.+++.|+|.+.... T Consensus 81 ~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~--ds~~~l~~~i~~~la~ai~-~~~d~a~l~G~g~~~~~~ 157 (324) T protein:vir:96 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLN--YTYSQFFEEMKPMIAEAFY-KKFDEAGILNQGNNPFGK 157 (324) T ss_pred ceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHh--cchHHHHHHHHHHHHHHHH-HHHHHHHhccCCCCCcCc Confidence 4444566888999999999999999888877777433222 2235789999999999999 899999999999754211 Q ss_pred chhhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhhhhccccccceeeecCCCeEEE Q lcl|Aclame:pro 245 IDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIAS 323 (393) Q Consensus 245 ~~~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~ 323 (393) .... . .....+..+.+...+.+..++ .+.........+++++.++.+|+. 
+++++|.+.+ T Consensus 158 --gi~~--~---~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~------------l~d~~G~~~~ 218 (324) T protein:vir:96 158 --SIAQ--S---IEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRK------------IVDPETKERI 218 (324) T ss_pred --cccc--c---ccccceeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHH------------hhccCCCeee Confidence 1111 1 111222334566678888777 343334455578999999988876 5555665544 Q ss_pred eeecccchhhcccchhc---------eeeeeccccee--cccceeeecc----------------eeEeecCceEEEeee Q lcl|Aclame:pro 324 EVGVDEIIVYTGSKAVK---------PTVLVDQKYHI--DMQDLTKVDA----------------FEWKTNSNMILVETL 376 (393) Q Consensus 324 ~v~~~~~~~~tG~k~~~---------ptv~vD~k~~~--~~~~~~~~~s----------------~~~~~ns~~i~~~~~ 376 (393) .=+.+.... |-..+. +.++.|-++++ .-++++---+ .-|..|+-.+.++.+ T Consensus 219 ~~~~~~~l~--G~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r 296 (324) T protein:vir:96 219 YDRNSDSLD--GLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMH 296 (324) T ss_pred cCCCCCccc--ceeeEeeCCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEE Confidence 323332222 221111 22333443332 1122111000 014566777778888 Q ss_pred ecccccccCcceeeeeC Q lcl|Aclame:pro 377 TSGHVETYNAGAVITVS 393 (393) Q Consensus 377 ~~g~~~~~n~~~~~~v~ 393 (393) ..+.+..|++-++++-+ T Consensus 297 ~d~~v~~~~A~~~l~~a 313 (324) T protein:vir:96 297 VALHIADDKAFAKLVPA 313 (324) T ss_pred EccEEecccceEEEecc Confidence 99999998887777765 No 76 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=99.07 E-value=2.3e-12 Score=84.44 Aligned_cols=283 Identities=15% Similarity=0.107 Sum_probs=159.8 Q ss_pred hhhhhcchhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccc-hhhhHhhcchhHHHHHHHHHHhhCcccc Q lcl|Aclame:pro 66 QEEKPKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVT-ITDTTFQLPRKLVESINTALLNTNPVFK 144 (393) Q Consensus 66 
~~Ek~K~k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~-~qd~~eiLP~~ii~AIe~A~ed~d~vl~ 144 (393) .+..+|.+-+...|.....+.. - +..-.+. .++..-.+|..+..-|-..++++.++++ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~----------~-----------~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~ 59 (324) T protein:vir:10 1 MEQTQKLKLNLQHFASNNVKPQ----------V-----------FNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQ 59 (324) T ss_pred CCCchHHHHHHHHHHHHhhccc----------e-----------ecccceeccCCCcceechhHHHHHHHHHHhhchhhh Confidence 1111111111111111110000 0 0001111 1122226999999988888998999998 Q ss_pred ceeeecccceeEEEeecc-ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHH Q lcl|Aclame:pro 145 VFHVTNVGALLVSRSFDS-SNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQA 223 (393) Q Consensus 145 ~fhV~n~~~~a~~i~l~n-a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~ 223 (393) .+++.+.+.....+--.+ ...+.-+.-|.++.+...+|...++.|..++...++.+-+.+ .+.-++.+|++++|+++ T Consensus 60 ~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~l~~a 137 (324) T protein:vir:10 60 LGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLN--YTYSQFFEEMKPMIAEA 137 (324) T ss_pred hcceeeccCCceEEEEEeCCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHh--cchHHHHHHHHHHHHHH Confidence 888777665544443332 345555667888999999999999999988888777443332 22357899999999999 Q ss_pred HHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhceee-ecCCCceEEEecchhhhHhhh Q lcl|Aclame:pro 224 IVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVR-PTAGRRYLIVKTEDRKALLDE 302 (393) Q Consensus 224 fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald~a~-~~a~~~~l~i~~~d~~a~~~~ 302 (393) +- |+++.+++.|||.+..-. ... .. +....+..+++...+++..++.-.. .......+++++.++..|+. 
T Consensus 138 i~-~~~d~a~l~G~g~~~~~~--~i~--~~---~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~- 208 (324) T protein:vir:10 138 FY-KKFDEAGILNQGNNPFGK--SIA--QS---IEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRK- 208 (324) T ss_pred HH-HHHHHHhhhcCCCCccCc--ccc--cc---ccccceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHH- Confidence 99 799999999999853111 010 11 1111223345666788888873333 33455578889999888776 Q ss_pred hhccccccceeeecCCCeEEEeeecccchhhcccchhc---------eeeeeccccee--cccceeee-------cc--- Q lcl|Aclame:pro 303 LRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKAVK---------PTVLVDQKYHI--DMQDLTKV-------DA--- 361 (393) Q Consensus 303 ~~~~~~~a~~~l~~~~~~~~~~v~~~~~~~~tG~k~~~---------ptv~vD~k~~~--~~~~~~~~-------~s--- 361 (393) |++.+|.+.+.-+.++..+ |-..+. ..++.|-..++ .-+|++-- .. T Consensus 209 -----------l~d~~g~~~~~~~~~~~l~--G~PV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~ 275 (324) T protein:vir:10 209 -----------IVDPETKERIYDRNSDTLD--GLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKN 275 (324) T ss_pred -----------hhccCCceeecCCCCcccc--ceeEEeecCCCCCcceEEEEecccEEEEEecCcEEEEeeccccccccc Confidence 5666666554433333322 321110 12222332222 11111100 00 Q ss_pred ------eeEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 362 ------FEWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 362 ------~~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) .-|..|+--+.++.+..|.+..+++=++++.+ T Consensus 276 ~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a 313 (324) T protein:vir:10 276 EDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPA 313 (324) T ss_pred ccccchhhhhcCcEEEEEEEEEccEEecccceEEEEec Confidence 01445555566677788888888877777766 No 77 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=99.05 E-value=1.5e-12 Score=85.35 Aligned_cols=269 Identities=10% Similarity=0.048 Sum_probs=161.5 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhh-HhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEeecc-ccccceecc Q 
lcl|Aclame:pro 94 KKNSGKSEIKNAWNAKLAENGVTITDT-TFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-SNEAQVHKD 171 (393) Q Consensus 94 ~~nqg~ke~k~AW~a~L~ekgV~~qd~-~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~ 171 (393) |..+.- ....+..++. -..+|+.+...|-..+.++.++++..++.+.+.....+-..+ ...+.-+.- T Consensus 1 ma~~~~-----------~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E 69 (304) T protein:vir:10 1 MATPTY-----------TPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSE 69 (304) T ss_pred Cccccc-----------ccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeec Confidence 222221 1122322222 226999999999999999999999888777766555543333 345555677 Q ss_pred cchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhh Q lcl|Aclame:pro 172 GQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADV 251 (393) Q Consensus 172 ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~ 251 (393) |+++.++..+|...++.|..++....+.+-+.. .+.-++.+|++++|+.++- |+++.+++.|||.+.....-..+-+ T Consensus 70 ~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~l~~~ia-~~~d~~~l~G~g~~~~~~~~~~~~~ 146 (304) T protein:vir:10 70 TERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLK--WTAKDFFNEVKPLIAEAFY-KAFDQAVIFGTKSPYNTSTSGKPLV 146 (304) T ss_pred CcccccccceeeEEEEEEEEEEEeehhhHHHHh--cchHHHHHHHHHHHHHHHH-HHHHhhheeccCCCccccccccccc Confidence 889999999999999999888888777443322 2335789999999999999 7999999999998542221111211 Q ss_pred hhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhhhhccccccceeeecCCCeEEEeeecccc Q lcl|Aclame:pro 252 KKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEI 330 (393) Q Consensus 252 ~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~v~~~~~ 330 (393) .......++ ..+.+..-++|..++ .+.........+++|+.++.+|+. 
+++.+|.+-+....+.+ T Consensus 147 ~~~~~~~~~--~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~------------lkd~~G~~l~~~~~~~l 212 (304) T protein:vir:10 147 EGAEEKGNV--VTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRN------------ALDANDRPLFDANGNEI 212 (304) T ss_pred ccccccccc--cccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHH------------hhccCCcEeecCCCccc Confidence 111111111 123344567777776 333334455678889999988876 45555554432222211 Q ss_pred hhhcccchhc-----------eeeeeccccee--cccceeeecce------------------eEeecCceEEEeeeecc Q lcl|Aclame:pro 331 IVYTGSKAVK-----------PTVLVDQKYHI--DMQDLTKVDAF------------------EWKTNSNMILVETLTSG 379 (393) Q Consensus 331 ~~~tG~k~~~-----------ptv~vD~k~~~--~~~~~~~~~s~------------------~~~~ns~~i~~~~~~~g 379 (393) + |-..+. +..+.|-+.++ ..+|++---+. -|.+|+-.+.++.+..+ T Consensus 213 -~--G~PV~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~ 289 (304) T protein:vir:10 213 -M--GLPLSYTGADVYDKKKSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAY 289 (304) T ss_pred -c--ceeeEEecccccCCCCcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEecc Confidence 1 211110 12233433332 22232210011 14556666677788899 Q ss_pred cccccCcceeeeeC Q lcl|Aclame:pro 380 HVETYNAGAVITVS 393 (393) Q Consensus 380 ~~~~~n~~~~~~v~ 393 (393) -+..|++-++++.+ T Consensus 290 ~v~~~~a~~~l~~a 303 (304) T protein:vir:10 290 MNVKPEAFATLKPT 303 (304) T ss_pred EeecccceEEEEec Confidence 99999998888888 No 78 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=99.05 E-value=1.5e-12 Score=85.35 Aligned_cols=269 Identities=10% Similarity=0.048 Sum_probs=161.5 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhh-HhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEeecc-ccccceecc Q lcl|Aclame:pro 94 KKNSGKSEIKNAWNAKLAENGVTITDT-TFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-SNEAQVHKD 171 (393) Q Consensus 94 
~~nqg~ke~k~AW~a~L~ekgV~~qd~-~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~ 171 (393) |..+.- ....+..++. -..+|+.+...|-..+.++.++++..++.+.+.....+-..+ ...+.-+.- T Consensus 1 ma~~~~-----------~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E 69 (304) T protein:vir:94 1 MATPTY-----------TPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSE 69 (304) T ss_pred Cccccc-----------ccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeec Confidence 222221 1122322222 226999999999999999999999888777766555543333 345555677 Q ss_pred cchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhh Q lcl|Aclame:pro 172 GQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADV 251 (393) Q Consensus 172 ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~ 251 (393) |+++.++..+|...++.|..++....+.+-+.. .+.-++.+|++++|+.++- |+++.+++.|||.+.....-..+-+ T Consensus 70 ~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~l~~~ia-~~~d~~~l~G~g~~~~~~~~~~~~~ 146 (304) T protein:vir:94 70 TERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLK--WTAKDFFNEVKPLIAEAFY-KAFDQAVIFGTKSPYNTSTSGKPLV 146 (304) T ss_pred CcccccccceeeEEEEEEEEEEEeehhhHHHHh--cchHHHHHHHHHHHHHHHH-HHHHhhheeccCCCccccccccccc Confidence 889999999999999999888888777443322 2335789999999999999 7999999999998542221111211 Q ss_pred hhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhhhhccccccceeeecCCCeEEEeeecccc Q lcl|Aclame:pro 252 KKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEI 330 (393) Q Consensus 252 ~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~v~~~~~ 330 (393) .......++ ..+.+..-++|..++ .+.........+++|+.++.+|+. 
+++.+|.+-+....+.+ T Consensus 147 ~~~~~~~~~--~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~------------lkd~~G~~l~~~~~~~l 212 (304) T protein:vir:94 147 EGAEEKGNV--VTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRN------------ALDANDRPLFDANGNEI 212 (304) T ss_pred ccccccccc--cccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHH------------hhccCCcEeecCCCccc Confidence 111111111 123344567777776 333334455678889999988876 45555554432222211 Q ss_pred hhhcccchhc-----------eeeeeccccee--cccceeeecce------------------eEeecCceEEEeeeecc Q lcl|Aclame:pro 331 IVYTGSKAVK-----------PTVLVDQKYHI--DMQDLTKVDAF------------------EWKTNSNMILVETLTSG 379 (393) Q Consensus 331 ~~~tG~k~~~-----------ptv~vD~k~~~--~~~~~~~~~s~------------------~~~~ns~~i~~~~~~~g 379 (393) + |-..+. +..+.|-+.++ ..+|++---+. -|.+|+-.+.++.+..+ T Consensus 213 -~--G~PV~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~ 289 (304) T protein:vir:94 213 -M--GLPLSYTGADVYDKKKSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAY 289 (304) T ss_pred -c--ceeeEEecccccCCCCcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEecc Confidence 1 211110 12233433332 22232210011 14556666677788899 Q ss_pred cccccCcceeeeeC Q lcl|Aclame:pro 380 HVETYNAGAVITVS 393 (393) Q Consensus 380 ~~~~~n~~~~~~v~ 393 (393) -+..|++-++++.+ T Consensus 290 ~v~~~~a~~~l~~a 303 (304) T protein:vir:94 290 MNVKPEAFATLKPT 303 (304) T ss_pred EeecccceEEEEec Confidence 99999998888888 No 79 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=99.05 E-value=1.6e-10 Score=74.27 Aligned_cols=352 Identities=14% Similarity=0.140 Sum_probs=169.7 Q ss_pred cchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcchhHHHHHHHH Q lcl|Aclame:pro 3 KPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMTNFIES 82 (393) Q Consensus 3 
k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~Ek~K~k~emtEfLkT 82 (393) =.|+-+.+.+++++.+.+.+++..++.-.- .++ ....+...+...++++..... .++...... T Consensus 1 mt~~~~~~e~~~~~~e~~~~~~~~~~~~~~---------~e~---~~~~~~~~~~~~~~~~~~~~~-----~e~~~~~~~ 63 (395) T protein:vir:95 1 MADMKQNNVKLKNYHEHKKQFANLVQNGAS---------DEE---QSKAFGAMFDALSNDLQEEIT-----AEINNRVVD 63 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHhhhhh---------HHH---HHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHH Confidence 234555555555555554444433221100 011 111111111112222211111 111111111 Q ss_pred HHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEee-c Q lcl|Aclame:pro 83 QNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSF-D 161 (393) Q Consensus 83 kqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l-~ 161 (393) +..... ...+....+.++.+++. .+.+. .+.-.++|..+...|-..+.++.++++..+|.+.+... .+.. . T Consensus 64 --~~~~~~--r~~~~l~~ee~~~~~~~-~~~t~--~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~~~~-~i~~~~ 135 (395) T protein:vir:95 64 --NGILAK--RSQDPLTSEERKFFNDI-NYDVG--YTDEKILPETVVERVFDDLQKDHPLLSKINFQNAGIKT-RVIKAD 135 (395) T ss_pred --HHHHhh--cCccccchHHHHHHHHH-hhccC--CCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCce-EEEEec Confidence 011000 01112234556666543 33222 23334799999999999999999999988887776543 3333 3 Q ss_pred cc-cccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCC Q lcl|Aclame:pro 162 SS-NEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTN 240 (393) Q Consensus 162 na-~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~ 240 (393) .. ...||--.+..++....+|...++.+..+|.+..+...+ |+-+.-.+-+|+.++|++.|- ++++.|++.|||+. 
T Consensus 136 ~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~iS~el--l~ds~~~ie~~i~~~la~~ia-~~~~~a~i~G~G~~ 212 (395) T protein:vir:95 136 PAGQAVWGKVFGEIKGQLDAAFREENFTQYKLTCFVVLPDDL--STFGPAWIERFVRTQIQEAIS-VALESAIINGGGAA 212 (395) T ss_pred CCcceEEeecccccCccccccceeeeeceeeEEEeecccHHH--HhcchhHHHHHHHHHHHHHHH-HHHhhheeeccCCC Confidence 33 455544445556667889999999988877765553333 233335678999999999999 79999999999985 Q ss_pred ccccchhhhhhhhhhhhhhhhhccCCCCC------HHHHHhhhceee--------ecCCCceEEEecchhhhHhhhhhcc Q lcl|Aclame:pro 241 GFKSIDKEADVKKIKKITTKAKSAGKTPF------ADAIEEAVDFVR--------PTAGRRYLIVKTEDRKALLDELRQA 306 (393) Q Consensus 241 ~t~~~~~e~D~~~ik~it~~at~~~~t~~------~dal~Eald~a~--------~~a~~~~l~i~~~d~~a~~~~~~~~ 306 (393) ..+-.-+..+......-.+.....+.+.+ .+.+...+-... .-.++.+.+.++.++..+.- T Consensus 213 ~~qP~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~~~~g----- 287 (395) T protein:vir:95 213 KTQPVGLMKDVNTNSGAVTDKASSGTLTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSWDVQA----- 287 (395) T ss_pred CcCceeeeecccccccccccccccchhhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhhhcCC----- Confidence 32111111111100000000000000111 111222111110 11233455666655543321 Q ss_pred ccccceeeecCCCeEEEeeecccchhhcccchhc--eeeeeccccee--cccce--eeecceeEeecCceEEEeeeeccc Q lcl|Aclame:pro 307 TANANVRIKNDDTEIASEVGVDEIIVYTGSKAVK--PTVLVDQKYHI--DMQDL--TKVDAFEWKTNSNMILVETLTSGH 380 (393) Q Consensus 307 ~~~a~~~l~~~~~~~~~~v~~~~~~~~tG~k~~~--ptv~vD~k~~~--~~~~~--~~~~s~~~~~ns~~i~~~~~~~g~ 380 (393) +--..+.+|.++..+|.|-..+. +..|- ..++-|-+.|+ +-+|+ .......+-..+-.+....++-|. 
T Consensus 288 ----~~~~~~~~G~~~~~lg~g~~v~~--~~~~p~~~i~fgdfs~y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dg~ 361 (395) T protein:vir:95 288 ----RYTYLTANGGFVTVLPYNVTIIT--SEFVPEGKLVAFVTDRYNAVRGGGLTVKKFDQTLALEDAVLFTAKTFAYGQ 361 (395) T ss_pred ----cceeccCCCcceeccCCcceEEE--cCCCCCCcEEEEecccEEEEEecceEEEeccchhhhCCcEEEEEEEEECCE Confidence 11112345666655544421111 11111 24455555544 33332 222222233355557778889999 Q ss_pred ccccCcceeeeeC Q lcl|Aclame:pro 381 VETYNAGAVITVS 393 (393) Q Consensus 381 ~~~~n~~~~~~v~ 393 (393) +.-+|+-.|.+|+ T Consensus 362 ~~~~~A~~~l~i~ 374 (395) T protein:vir:95 362 PDDNKASAVYDLK 374 (395) T ss_pred EeccccEEEEEee Confidence 9999999998888 No 80 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=99.02 E-value=1.4e-10 Score=74.62 Aligned_cols=340 Identities=11% Similarity=0.100 Sum_probs=168.3 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcchhHHHHHH Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMTNFI 80 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~Ek~K~k~emtEfL 80 (393) |.. +..+++. ++..++...|+. .++.+...++. 
...+..+..++ +.+.+.++.++ T Consensus 1 m~i----k~~~~~~---~~~~e~~~~~~~---~~~~~~~~~~~--~~~~~~~~~~~------------~~~~~~e~~~~- 55 (381) T protein:vir:95 1 MTI----NLSETFA---NAKNEFINAVNN---GEPQERQNELY--GDMINQLFEET------------KLQAKAEAERV- 55 (381) T ss_pred Cch----hhHHHHH---HHHHHHHHHHhh---hhhhHHHHHHH--HHHHHhhhhhH------------HHHHHHHHHHH- Confidence 321 0111111 111111111110 00000000000 00011111111 00111122111 Q ss_pred HHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEee Q lcl|Aclame:pro 81 ESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSF 160 (393) Q Consensus 81 kTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l 160 (393) ..+.+ ..+....+.++.+++- .. |. +.+--.++|..+...|-+.+.+++++++..+|.+.+... .+-. T Consensus 56 ------~~~~~--~~~~lt~~e~~~~~~~-~~-~~-~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~~-~i~~ 123 (381) T protein:vir:95 56 ------SSLPK--SAQSLSANQRSFFMDI-NK-NV-NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRL-KFLK 123 (381) T ss_pred ------HHhcc--CcccccHHHHHHHHHH-hc-cc-CCCCceecCHHHHHHHHHHHHhhccceeheeeEecCcce-EEEE Confidence 11100 1111233555555442 21 21 233345899999999999999999999988888877543 3333 Q ss_pred -ccc-cccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccC Q lcl|Aclame:pro 161 -DSS-NEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDG 238 (393) Q Consensus 161 -~na-~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG 238 (393) .+. 
.-.||--.+..+.....+|...++.+..+|..-.+..-+ |+.++-++-+|+.++|++.|- ++.+.|++.||| T Consensus 124 ~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~el--L~Ds~~~ie~~i~~~la~~~a-~~~~~a~i~G~G 200 (381) T protein:vir:95 124 SETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDL--NDFGPAWIERFVRVQIEEAFA-VALETAFLKGTG 200 (381) T ss_pred ecCCcceeeecccccccccccccceeeeecceeEEeechhhHHH--hhcCHHHHHHHHHHHHHHHHH-HHhhheeEeccC Confidence 233 344544444445555778999999988888776664433 444556889999999999999 799999999999 Q ss_pred CCccccchhhhhhhhhhhh----------hhhhhccCCCCCHHH---HHhhhcee-----eecCCCceEEEecchhhhHh Q lcl|Aclame:pro 239 TNGFKSIDKEADVKKIKKI----------TTKAKSAGKTPFADA---IEEAVDFV-----RPTAGRRYLIVKTEDRKALL 300 (393) Q Consensus 239 ~~~t~~~~~e~D~~~ik~i----------t~~at~~~~t~~~da---l~Eald~a-----~~~a~~~~l~i~~~d~~a~~ 300 (393) +.-. .-+..++...... ..+.+........+. +..+++.. +.-.++-+.++++.+...|+ T Consensus 201 ~~qP--~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~ 278 (381) T protein:vir:95 201 KDQP--IGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQ 278 (381) T ss_pred CCCc--eeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhc Confidence 8321 1111111000000 000111111111223 33344321 12233445677888877665 Q ss_pred hhhhccccccceeeecCCCeEEEeeecccchhhcccchhc--eeeeeccccee--cccce--eeecceeEeecCceEEEe Q lcl|Aclame:pro 301 DELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKAVK--PTVLVDQKYHI--DMQDL--TKVDAFEWKTNSNMILVE 374 (393) Q Consensus 301 ~~~~~~~~~a~~~l~~~~~~~~~~v~~~~~~~~tG~k~~~--ptv~vD~k~~~--~~~~~--~~~~s~~~~~ns~~i~~~ 374 (393) .-. -..+.+|.|+..++.+-..+. +..|- ..++-|-++|+ +-+|+ ...+...+-..+-.+.+. 
T Consensus 279 ~~~---------~~~~~~G~~v~~l~~g~~vv~--s~~~p~~~iifgDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~ 347 (381) T protein:vir:95 279 AQY---------THLNANGVYVTALPFNLNVIE--STVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAK 347 (381) T ss_pred ccc---------ccCCCCCceeecCCCCceEEe--cCCCCcCcEEEEecccEEEEEecccEEEeechhHhhcCCeEEEEE Confidence 310 023456777766655533331 11221 15556655554 33333 333334455556667778 Q ss_pred eeecccccccCcceeeeeC Q lcl|Aclame:pro 375 TLTSGHVETYNAGAVITVS 393 (393) Q Consensus 375 ~~~~g~~~~~n~~~~~~v~ 393 (393) .+.-|.+.-+++.+|.+++ T Consensus 348 ~r~dg~~~~~~A~~v~~l~ 366 (381) T protein:vir:95 348 QFAYGKAKDNKVAAVWKLD 366 (381) T ss_pred EEEcCEEecCceEEEEEEE Confidence 8889999999999998887 No 81 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=99.02 E-value=1.4e-10 Score=74.62 Aligned_cols=340 Identities=11% Similarity=0.100 Sum_probs=168.3 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcchhHHHHHH Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMTNFI 80 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~Ek~K~k~emtEfL 80 (393) |.. +..+++. ++..++...|+. .++.+...++. 
...+..+..++ +.+.+.++.++ T Consensus 1 m~i----k~~~~~~---~~~~e~~~~~~~---~~~~~~~~~~~--~~~~~~~~~~~------------~~~~~~e~~~~- 55 (381) T protein:vir:10 1 MTI----NLSETFA---NAKNEFINAVNN---GEPQERQNELY--GDMINQLFEET------------KLQAKAEAERV- 55 (381) T ss_pred Cch----hhHHHHH---HHHHHHHHHHhh---hhhhHHHHHHH--HHHHHhhhhhH------------HHHHHHHHHHH- Confidence 321 0111111 111111111110 00000000000 00011111111 00111122111 Q ss_pred HHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEee Q lcl|Aclame:pro 81 ESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSF 160 (393) Q Consensus 81 kTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l 160 (393) ..+.+ ..+....+.++.+++- .. |. +.+--.++|..+...|-+.+.+++++++..+|.+.+... .+-. T Consensus 56 ------~~~~~--~~~~lt~~e~~~~~~~-~~-~~-~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~~-~i~~ 123 (381) T protein:vir:10 56 ------SSLPK--SAQSLSANQRSFFMDI-NK-NV-NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRL-KFLK 123 (381) T ss_pred ------HHhcc--CcccccHHHHHHHHHH-hc-cc-CCCCceecCHHHHHHHHHHHHhhccceeheeeEecCcce-EEEE Confidence 11100 1111233555555442 21 21 233345899999999999999999999988888877543 3333 Q ss_pred -ccc-cccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccC Q lcl|Aclame:pro 161 -DSS-NEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDG 238 (393) Q Consensus 161 -~na-~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG 238 (393) .+. 
.-.||--.+..+.....+|...++.+..+|..-.+..-+ |+.++-++-+|+.++|++.|- ++.+.|++.||| T Consensus 124 ~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~el--L~Ds~~~ie~~i~~~la~~~a-~~~~~a~i~G~G 200 (381) T protein:vir:10 124 SETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDL--NDFGPAWIERFVRVQIEEAFA-VALETAFLKGTG 200 (381) T ss_pred ecCCcceeeecccccccccccccceeeeecceeEEeechhhHHH--hhcCHHHHHHHHHHHHHHHHH-HHhhheeEeccC Confidence 233 344544444445555778999999988888776664433 444556889999999999999 799999999999 Q ss_pred CCccccchhhhhhhhhhhh----------hhhhhccCCCCCHHH---HHhhhcee-----eecCCCceEEEecchhhhHh Q lcl|Aclame:pro 239 TNGFKSIDKEADVKKIKKI----------TTKAKSAGKTPFADA---IEEAVDFV-----RPTAGRRYLIVKTEDRKALL 300 (393) Q Consensus 239 ~~~t~~~~~e~D~~~ik~i----------t~~at~~~~t~~~da---l~Eald~a-----~~~a~~~~l~i~~~d~~a~~ 300 (393) +.-. .-+..++...... ..+.+........+. +..+++.. +.-.++-+.++++.+...|+ T Consensus 201 ~~qP--~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~ 278 (381) T protein:vir:10 201 KDQP--IGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQ 278 (381) T ss_pred CCCc--eeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhc Confidence 8321 1111111000000 000111111111223 33344321 12233445677888877665 Q ss_pred hhhhccccccceeeecCCCeEEEeeecccchhhcccchhc--eeeeeccccee--cccce--eeecceeEeecCceEEEe Q lcl|Aclame:pro 301 DELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKAVK--PTVLVDQKYHI--DMQDL--TKVDAFEWKTNSNMILVE 374 (393) Q Consensus 301 ~~~~~~~~~a~~~l~~~~~~~~~~v~~~~~~~~tG~k~~~--ptv~vD~k~~~--~~~~~--~~~~s~~~~~ns~~i~~~ 374 (393) .-. -..+.+|.|+..++.+-..+. +..|- ..++-|-++|+ +-+|+ ...+...+-..+-.+.+. 
T Consensus 279 ~~~---------~~~~~~G~~v~~l~~g~~vv~--s~~~p~~~iifgDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~ 347 (381) T protein:vir:10 279 AQY---------THLNANGVYVTALPFNLNVIE--STVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAK 347 (381) T ss_pred ccc---------ccCCCCCceeecCCCCceEEe--cCCCCcCcEEEEecccEEEEEecccEEEeechhHhhcCCeEEEEE Confidence 310 023456777766655533331 11221 15556655554 33333 333334455556667778 Q ss_pred eeecccccccCcceeeeeC Q lcl|Aclame:pro 375 TLTSGHVETYNAGAVITVS 393 (393) Q Consensus 375 ~~~~g~~~~~n~~~~~~v~ 393 (393) .+.-|.+.-+++.+|.+++ T Consensus 348 ~r~dg~~~~~~A~~v~~l~ 366 (381) T protein:vir:10 348 QFAYGKAKDNKVAAVWKLD 366 (381) T ss_pred EEEcCEEecCceEEEEEEE Confidence 8889999999999998887 No 82 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=98.99 E-value=8e-12 Score=81.41 Aligned_cols=277 Identities=14% Similarity=0.092 Sum_probs=158.7 Q ss_pred HHHccCCh-hHHHHHHHHHHh-----Cccc-hhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEeec-ccc Q lcl|Aclame:pro 93 LKKNSGKS-EIKNAWNAKLAE-----NGVT-ITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFD-SSN 164 (393) Q Consensus 93 l~~nqg~k-e~k~AW~a~L~e-----kgV~-~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~-na~ 164 (393) |++.+-.+ +.++-+...+.. -++. .++..-++|..+..-|-..+.++.++++.+.+...+.....+--. ... 
T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~ 80 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) T ss_pred CCcchhhhHHHHHHHHhhhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCc Confidence 11111111 222211111110 1111 122223789988888888888888888866665555443343222 223 Q ss_pred ccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCcccc Q lcl|Aclame:pro 165 EAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKS 244 (393) Q Consensus 165 ~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~ 244 (393) .+.-.--|.++.+...+|...++.|..++...++.+-+.+ .+..++.+|+.++|++++- |+++.+++.|||.+..-. T Consensus 81 ~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~--ds~~~l~~~i~~~l~~aia-~~~d~~~l~G~g~~~~~~ 157 (324) T protein:vir:96 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLN--YTYSQFFEEMKPMIAEAFY-KKFDEAGILNQGNNPFGK 157 (324) T ss_pred ceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHh--cchHHHHHHHHHHHHHHHH-HHHHHHhhhcCCCCCcCc Confidence 4444566788899999999999999988888777443332 3335789999999999999 799999999999753211 Q ss_pred chhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhc-eeeecCCCceEEEecchhhhHhhhhhccccccceeeecCCCeEEE Q lcl|Aclame:pro 245 IDKEADVKKIKKITTKAKSAGKTPFADAIEEAVD-FVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIAS 323 (393) Q Consensus 245 ~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald-~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~ 323 (393) ... ..+ ....+..+++...++|..++. +.........+++++.++.+|+. 
+++.+|.+-+ T Consensus 158 --~~~--~~~---~~~~~~~~~~~~~~~i~~~~~~i~~~~~~~~~~i~n~~~~~~L~~------------lkd~~G~~~~ 218 (324) T protein:vir:96 158 --SIA--QSI---KKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRK------------IVDPETKERI 218 (324) T ss_pred --ccc--ccc---cccceecccccchHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHH------------hhCCCCCeee Confidence 111 111 111223344566788888773 32333444578999999998886 5566665554 Q ss_pred eeecccchhhcccchhc---------eeeeeccccee--cccceeeecce----------------eEeecCceEEEeee Q lcl|Aclame:pro 324 EVGVDEIIVYTGSKAVK---------PTVLVDQKYHI--DMQDLTKVDAF----------------EWKTNSNMILVETL 376 (393) Q Consensus 324 ~v~~~~~~~~tG~k~~~---------ptv~vD~k~~~--~~~~~~~~~s~----------------~~~~ns~~i~~~~~ 376 (393) .-+.+.... |-..+. +.++.|-.+++ .-++++---+. -|..|+-.+.++.+ T Consensus 219 ~~~~~~~l~--G~PV~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~~~r 296 (324) T protein:vir:96 219 YDRNSDSLD--GLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMH 296 (324) T ss_pred cCCCCCccc--ceeeEeecCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEE Confidence 323222222 322111 12233433332 12222111110 14556666778888 Q ss_pred ecccccccCcceeeeeC Q lcl|Aclame:pro 377 TSGHVETYNAGAVITVS 393 (393) Q Consensus 377 ~~g~~~~~n~~~~~~v~ 393 (393) ..+.+..+++-++++-+ T Consensus 297 ~d~~v~~~~a~~~l~~a 313 (324) T protein:vir:96 297 VALHIADDKAFAKLVPA 313 (324) T ss_pred eccEEecccceEEEecc Confidence 88888888887777766 No 83 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=98.94 E-value=6.7e-10 Score=70.89 Aligned_cols=338 Identities=12% Similarity=0.114 Sum_probs=165.9 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcchhHHHHHH Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMTNFI 80 (393) Q Consensus 1 
~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~Ek~K~k~emtEfL 80 (393) |+..+ + ++++..++...++.-+. +..+-...+.+.+.+.+ ++ +.+...++.+ T Consensus 3 ~kl~~------~---~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~---~~------------~~~~~~e~~~-- 54 (381) T protein:vir:10 3 INLSE------T---FANAKNEFINAVNNGEP--QERQNELYGDMINQLFE---ET------------KLQAKAEAER-- 54 (381) T ss_pred hhHHH------H---HHHHHHHHHHHHHhhhH--HHHHHHHHHHHHHhhhh---hH------------HHHHHHHHHH-- Confidence 22111 1 11111111111110000 00000000011111100 00 0001111111 Q ss_pred HHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEee Q lcl|Aclame:pro 81 ESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSF 160 (393) Q Consensus 81 kTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l 160 (393) + ....+ -.+....+-++.+++ +...+- .+-..++|..+...|-+.+.+++++++...|.+.+...-+.-. T Consensus 55 ----~-~~~~~--~~~~l~~~e~~~~~~-~~~~t~--~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~~~~~~~~i~~~ 124 (381) T protein:vir:10 55 ----V-SSLPK--SAQTLSANQRNFFMD-INKSVG--YKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKS 124 (381) T ss_pred ----H-HHhcc--cccccCHHHHHHHHH-HhhcCC--CCCceecCHHHHHHHHHHHHhhcceeeeeeeEecCcceEEEee Confidence 0 00000 111112344444443 222211 1223479999999999999999999998888777654322222 Q ss_pred cc-ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCC Q lcl|Aclame:pro 161 DS-SNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGT 239 (393) Q Consensus 161 ~n-a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~ 239 (393) .. 
....||.-.+..+.....+|...++.+..+|..-.+..-+ |+-++-++-+|+.++|+..|- ++.+.|++.|||+ T Consensus 125 ~~~~~a~W~~e~~~~~~~~~~~f~~i~l~~~kl~a~i~is~el--L~Ds~~~le~~i~~~la~~~a-~~~~~afi~GdG~ 201 (381) T protein:vir:10 125 ETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDL--NDFGPAWIERFVRVQIEEAFA-VALETAFLKGTGK 201 (381) T ss_pred cCCcceEEeecccccccccCccceeEeecceeEEeeccccHHH--HhccHHHHHHHHHHHHHHHHH-HHhhceeEecccC Confidence 33 2456777666666666789999999988888776663333 344446788999999999999 7999999999998 Q ss_pred Cccccchhhhhhhhhhhhhhhh-----hccCCCCC------HH---HHHhhhce-----eeecCCCceEEEecchhhhHh Q lcl|Aclame:pro 240 NGFKSIDKEADVKKIKKITTKA-----KSAGKTPF------AD---AIEEAVDF-----VRPTAGRRYLIVKTEDRKALL 300 (393) Q Consensus 240 ~~t~~~~~e~D~~~ik~it~~a-----t~~~~t~~------~d---al~Eald~-----a~~~a~~~~l~i~~~d~~a~~ 300 (393) .-. .-+..++-... ..+.+ ++.+..++ .+ ++...++. .++..++.++++++.+...|+ T Consensus 202 ~qP--~Gil~~~~~~~-~~~~g~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~ 278 (381) T protein:vir:10 202 DQP--IGLNRQVQKGV-SVTDGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQ 278 (381) T ss_pred CCc--eeeeecCCccc-cccccccccccccccccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhc Confidence 421 11111100000 00000 00100011 11 11122211 112334556677777777665 Q ss_pred hhhhccccccceeeecCCCeEEEeeecccchhhcccchh--ceeeeeccccee--cccc--eeeecceeEeecCceEEEe Q lcl|Aclame:pro 301 DELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKAV--KPTVLVDQKYHI--DMQD--LTKVDAFEWKTNSNMILVE 374 (393) Q Consensus 301 ~~~~~~~~~a~~~l~~~~~~~~~~v~~~~~~~~tG~k~~--~ptv~vD~k~~~--~~~~--~~~~~s~~~~~ns~~i~~~ 374 (393) --. -..+.+|.|+...+.|-..+.+ ..| ...++-|-++|+ +-+| +...+...+-.++-.+... 
T Consensus 279 ~~~---------~~~~~~G~~v~~lp~g~~vv~~--~~~p~~~i~fGDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~ 347 (381) T protein:vir:10 279 AQY---------THLNANGVYVTALPFNLNVIES--TVQEAGKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAK 347 (381) T ss_pred ccc---------ccCCCCCceeecCCCCceeEEc--CCCCcCcEEEEEcccEEEEEecccEEEeechhhhhcCceEEEEE Confidence 311 1235567776554444322211 111 125666766655 3333 3344444455566677788 Q ss_pred eeecccccccCcceeeeeC Q lcl|Aclame:pro 375 TLTSGHVETYNAGAVITVS 393 (393) Q Consensus 375 ~~~~g~~~~~n~~~~~~v~ 393 (393) .+.-|.+.-+++..|.+++ T Consensus 348 ~r~dG~~~~~~A~~v~~l~ 366 (381) T protein:vir:10 348 QFAYGKAKDNKVAAVWKLD 366 (381) T ss_pred EEEcCEEecCCcEEEEEEe Confidence 8899999999999997777 No 84 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=98.91 E-value=2.8e-11 Score=78.47 Aligned_cols=264 Identities=13% Similarity=0.016 Sum_probs=158.1 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEeeccccccceecccc Q lcl|Aclame:pro 94 KKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDSSNEAQVHKDGQ 173 (393) Q Consensus 94 ~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~na~~a~GHk~ga 173 (393) |..+.- .+-...+---.+|..+...|-+.+.++..+++..++.+.+.....+-..+...+.=+.-|. 
T Consensus 1 ~g~~a~-------------~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~ 67 (299) T protein:vir:41 1 MGFNPD-------------TTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKPEEEFTFMSGVGAFWVDEAE 67 (299) T ss_pred CCcCCC-------------cccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecCCCcEEEEEEcCCceeeeecCc Confidence 111100 0011111112689998888888899888888877777766665444334434455567888 Q ss_pred hhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhh Q lcl|Aclame:pro 174 TKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKK 253 (393) Q Consensus 174 ~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ 253 (393) ++.++..+|...++.|..++....+.+-+.+ .++-++.+|+++.|+.++- |+++.+++.|||.+.... .... T Consensus 68 ~~~~~~~~f~~v~l~~~k~~~~~~is~ell~--ds~~~~~~~i~~~l~~a~~-~~~d~a~l~G~g~~~~~g--il~~--- 139 (299) T protein:vir:41 68 RIQTSKPTFTKAKMRSKKMGVIIPTTKENLN--YSVTNFFSLMQAEIVEAFY-KKFDQAVFTGVESPYNWN--ILKS--- 139 (299) T ss_pred cccccccceeEEEEeeEEEEEeehhhHHHHh--cCHHHHHHHHHHHHHHHHH-HHHHHHHhhcccCccccc--cccc--- Confidence 9999999999999999999888777444333 3335789999999999999 799999999999864321 1111 Q ss_pred hhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhhhhccccccceeeecCCCeEEEeeeccc--c Q lcl|Aclame:pro 254 IKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASEVGVDE--I 330 (393) Q Consensus 254 ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~v~~~~--~ 330 (393) ..... .+..+.+...+++..++ .+.........+++++.++.+|+. 
|++.+|.+-+.-.+.+ - T Consensus 140 ~~~~~--~~~~~~~~~~~~l~~~~~~l~~~~~~~~~~v~n~~~~~~L~~------------lkd~~G~~l~~~~~~~~~~ 205 (299) T protein:vir:41 140 ATDAS--NLVEETANKYDDLNEAIGLIEAEDLEPNGIATIRKQRVKYRS------------TKDGNGMPIFNTATSNGVD 205 (299) T ss_pred ccccc--eeeccccccHHHHHHHHHhhhcccCCcCEEEEcHHHHHHHHH------------hhccCCceeecCCcCCCCc Confidence 11111 12233445678898888 444444445679999999998887 6677776554211111 0 Q ss_pred hhhcccch-----------hceeeeeccccee--cccceeeeccee----------------EeecCceEEEeeeecccc Q lcl|Aclame:pro 331 IVYTGSKA-----------VKPTVLVDQKYHI--DMQDLTKVDAFE----------------WKTNSNMILVETLTSGHV 381 (393) Q Consensus 331 ~~~tG~k~-----------~~ptv~vD~k~~~--~~~~~~~~~s~~----------------~~~ns~~i~~~~~~~g~~ 381 (393) ++. |-.. ..+..+.|-+.++ .-++++---+.+ |..|+-.|.++.+..+-+ T Consensus 206 ~l~-G~PV~~~~~~~~~~~~~~~~~gdfs~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v 284 (299) T protein:vir:41 206 DVL-GLPIAYTPKYTFGDKDISELVGDWNQAYYGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMV 284 (299) T ss_pred eec-ceeeEEecccCCCCCceEEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEE Confidence 110 2111 1122333333222 111111110110 333444456667888888 Q ss_pred cccCcceeeeeC Q lcl|Aclame:pro 382 ETYNAGAVITVS 393 (393) Q Consensus 382 ~~~n~~~~~~v~ 393 (393) ..|++-+.++.+ T Consensus 285 ~~~~A~~~l~~~ 296 (299) T protein:vir:41 285 VKDEAFSAVQPK 296 (299) T ss_pred ecccceEEEEec Confidence 888888888877 No 85 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=98.91 E-value=2.3e-11 Score=78.88 Aligned_cols=275 Identities=15% Similarity=0.081 Sum_probs=163.8 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEeecc-ccccceeccc Q lcl|Aclame:pro 94 KKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-SNEAQVHKDG 172 (393) Q Consensus 94 
~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~g 172 (393) +... .--.+|+..+...+- ++.-..+|..+..-|-+.++++.++++...+...+.....+-..+ ...+.-+.-| T Consensus 1 ~~~~---~~~~~e~~~~~~~~~--~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg 75 (318) T protein:vir:24 1 MAAG---TAFAVDHAQIAQTGD--TMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWVGDVSAQWIGEG 75 (318) T ss_pred CCCC---CCCCHHHHHhhcccC--cccceeechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCcceEEecCC Confidence 2222 122347766665543 444447999988888888888888888666666555555554433 3345555668 Q ss_pred chhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhh Q lcl|Aclame:pro 173 QTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVK 252 (393) Q Consensus 173 a~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~ 252 (393) .+++++..+|+..++.|..++....+.+.+.+ .+..++.+|++++|+..+- +.++.++++|||.+.....- ... T Consensus 76 ~~~~~~~~~f~~i~~~~~k~~~~~~iS~e~l~--ds~~~~~~~i~~~l~~~~~-~~~d~a~l~G~g~~~~~~~~--~~~- 149 (318) T protein:vir:24 76 DMKPITKGNMTSQTIAPHKIATIFVASAETVR--ANPANYLGTMRTKVATAFA-MAFDGAAMHGTDSPFPTYIG--QTT- 149 (318) T ss_pred ccccccccceeEEEEeeEEEEEeehhhHHHhh--cChHHHHHHHHHHHHHHHH-HHHHHhhhcccCCCCCcccc--ccc- Confidence 88999999999999999988887777443322 2224689999999999999 79999999999986422221 111 Q ss_pred hhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhhhhccccccceeeecCCCeEEEeeecccch Q lcl|Aclame:pro 253 KIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEII 331 (393) Q Consensus 253 ~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~v~~~~~~ 331 (393) ..++......+.+...+.+..++ ...........+++++.++.+|+. +++.+|.+-+.=.+.+-. 
T Consensus 150 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~------------lkd~~G~~l~~~~~~~~~ 215 (318) T protein:vir:24 150 --KAISIADTTGATTVYDQVAVNGLSLLVNDGKKWTHTLLDDITEPILNG------------AKDQNGRPLFIESTYGEA 215 (318) T ss_pred --ccccccccccccchHHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHH------------hhccCCceeecCccccCc Confidence 11222222223333334445555 333333445678889999888876 667766654321111111 Q ss_pred h--hcccchh-ceeeee-------------ccccee--cccceeeecce----------------eEeecCceEEEeeee Q lcl|Aclame:pro 332 V--YTGSKAV-KPTVLV-------------DQKYHI--DMQDLTKVDAF----------------EWKTNSNMILVETLT 377 (393) Q Consensus 332 ~--~tG~k~~-~ptv~v-------------D~k~~~--~~~~~~~~~s~----------------~~~~ns~~i~~~~~~ 377 (393) . ..|.... .|.++. |-..++ ..+|+.---+. -|..|+-.|.++.+. T Consensus 216 ~~~~~~~~i~g~pv~~~~~~~~~~~~~~~gdfs~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~ 295 (318) T protein:vir:24 216 ASPFRSGRIVARPTILSDHVVEGTTVGFMGDFSQLIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEY 295 (318) T ss_pred cccccCceEEEEeeEEeCCCCCCccEEEEeecceEEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEE Confidence 1 0121211 133322 222221 12222111111 055666777888999 Q ss_pred cccccccCcceeeeeC Q lcl|Aclame:pro 378 SGHVETYNAGAVITVS 393 (393) Q Consensus 378 ~g~~~~~n~~~~~~v~ 393 (393) .+.+.-|++-++++.. 
T Consensus 296 d~~v~~~~a~~~i~~~ 311 (318) T protein:vir:24 296 AFHCNDAEAFVALTNV 311 (318) T ss_pred ccEEecccceEEEEee Confidence 9999999887787776 No 86 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=98.90 E-value=1.4e-11 Score=80.11 Aligned_cols=282 Identities=12% Similarity=0.055 Sum_probs=153.1 Q ss_pred HHHccC-ChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEeecc-ccccceec Q lcl|Aclame:pro 93 LKKNSG-KSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-SNEAQVHK 170 (393) Q Consensus 93 l~~nqg-~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk 170 (393) |.-|-. ..++-.+- ..+.....+.+--..+|..+..-|=+.++++.++++..++...+.....+-..+ ...+.-+. T Consensus 1 ~~~~~~r~~~~~~~~--e~~a~~~~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~a~~v~ 78 (326) T protein:vir:42 1 MAVNPDRTTPFLGVN--DPKVAQTGDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMGTTGQKIPHWTGDVSASWIG 78 (326) T ss_pred CCCCccchhhhcCcc--hhhheeccccCCcceechhhHHHHHHHHHhcchhhhhcceeeccCCceEEEEEeCCcceEEec Confidence 111111 11221110 011111211221226999999988888998898888666666555444443333 33444567 Q ss_pred ccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhh Q lcl|Aclame:pro 171 DGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEAD 250 (393) Q Consensus 171 ~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D 250 (393) -|.++.+...+|...++.|..++....+.+.+ ++.+.-++.+|++++|++++- |..+.++++|||.+...... .. 
T Consensus 79 Eg~~~~~~~~~f~~i~~~~~k~~~~v~iS~el--l~~s~~~~~~~i~~~l~~a~~-~~~d~a~l~G~gs~~p~gi~--~~ 153 (326) T protein:vir:42 79 EGDMKPITKGNMTSQTIAPHKIATIFVASAET--VRANPANYLGTMRTKVATAFA-MAFDNAAINGTDSPFPTFLA--QT 153 (326) T ss_pred CCccccccccceeEEEEeeEEEEEeehhhHHH--HhcCHHHHHHHHHHHHHHHHH-HHHHHHhhcccCCCcccccc--cc Confidence 78899999999999999999888887774433 233346789999999999999 79999999999986532211 11 Q ss_pred hhhhhhhhhhhhccCC--CCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhhhhccccccceeeecCCCeEEEeeec Q lcl|Aclame:pro 251 VKKIKKITTKAKSAGK--TPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASEVGV 327 (393) Q Consensus 251 ~~~ik~it~~at~~~~--t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~v~~ 327 (393) .....-....++.... ++....+...+ ..+........+++|+.++.+|+. |++.+|.+-+.-.+ T Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~------------lkd~~G~~l~~~~~ 221 (326) T protein:vir:42 154 TKEVSLVDPDGTGSNADLTVYDAVAVNALSLLVNAGKKWTHTLLDDITEPILNG------------AKDKSGRPLFIEST 221 (326) T ss_pred ccccceeecccccccccchhHHHHHHHHHhhhhhhccCccEEEEeHHHHHHHHH------------hhccCCceeecccc Confidence 1111111111222221 22222233333 444445556678899999988886 56666654433221 Q ss_pred ccchhh--cccch-hceeeee-------------cccce-e-cccceeee--c--cee------------EeecCceEEE Q lcl|Aclame:pro 328 DEIIVY--TGSKA-VKPTVLV-------------DQKYH-I-DMQDLTKV--D--AFE------------WKTNSNMILV 373 (393) Q Consensus 328 ~~~~~~--tG~k~-~~ptv~v-------------D~k~~-~-~~~~~~~~--~--s~~------------~~~ns~~i~~ 373 (393) .+.... .|... -+|.++. |-+.+ + .-+|++-- + .+. 
|..|+-.|.+ T Consensus 222 ~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~Gd~s~~~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~ 301 (326) T protein:vir:42 222 YTEENSPFRLGRIVARPTILSDHVASGTVVGYQGDFRQLVWGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRV 301 (326) T ss_pred ccCccccccCceeeeeeEEEcCCCCCCceEEEEeecceEEEEEecceEEEEeecceeeecccccccchhhhhcCcEEEEE Confidence 111100 01111 1233322 21111 1 11122100 0 000 3345556678 Q ss_pred eeeecccccccCcceeeeeC Q lcl|Aclame:pro 374 ETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 374 ~~~~~g~~~~~n~~~~~~v~ 393 (393) +.+..+.+..+.+-++++.. T Consensus 302 ~~~~d~~v~~~~a~~~l~~~ 321 (326) T protein:vir:42 302 EAEYAFHCNDKDAFVKLTNV 321 (326) T ss_pred EEEeccEEecccceEEEeec Confidence 88889999888877677665 No 87 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=98.89 E-value=7.5e-10 Score=70.61 Aligned_cols=343 Identities=15% Similarity=0.142 Sum_probs=166.6 Q ss_pred CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcchhHHHHHH Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMTNFI 80 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~Ek~K~k~emtEfL 80 (393) |+ ++.+++++++.+....+.+.++.-+. +..+...++++ +..++.++ .. +. 
+.++.+ T Consensus 1 M~----~kl~~~~~~~~e~~~~l~~~~~~~~~--~~~~~~~~~~~---~~~~~~~~-------~~--~~---~~~~~~-- 57 (383) T protein:vir:78 1 MT----IKLKNNLANYEEKRTAFVNAVKNEDT--QEIQNKAYVEM---VDAMAADI-------ME--QA---KKEARQ-- 57 (383) T ss_pred Cc----hhHHHHHHHHHHHHHHHHHHHhccCh--HHHHHHHHHHH---HHHHHHHH-------HH--HH---HHHHHH-- Confidence 87 34555666666665555444332111 11111111111 11111111 10 00 001111 Q ss_pred HHHHHHHHHHHHHHHccC----ChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeE Q lcl|Aclame:pro 81 ESQNAVTEFFDVLKKNSG----KSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLV 156 (393) Q Consensus 81 kTkqA~~dya~ll~~nqg----~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~ 156 (393) ...+.. ....| ..+-++.+++ +...+ +.|--.++|..+...|-..+.+++++++...|.+..... T Consensus 58 -~~~~~~------~~~~g~~~lt~~e~~~~~~-~~~~~--~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~~~~~- 126 (383) T protein:vir:78 58 -EADAYI------SASRTDKNITNEEIKFFND-INKEV--GYKEETLLPQTVVDEIFEDLTTEHPFLASIGMRTTGLRT- 126 (383) T ss_pred -HHHHHH------HhcCChhhhhHHHHHHHHH-HhccC--CCCCccccCHHHHHHHHHHHHhhccceeeeeeEecCCce- Confidence 111111 11111 1222333322 22222 234445899999999999999999999977777766543 Q ss_pred EEee-cc-ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhccee Q lcl|Aclame:pro 157 SRSF-DS-SNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALV 234 (393) Q Consensus 157 ~i~l-~n-a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv 234 (393) .+-. 
.+ ....||.-.+.-++....+|...++.+..+|.+-.+..-+ |+-+.-++-+|+.++|+..|- ++.+.|++ T Consensus 127 ~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~i~is~el--l~Ds~~~ie~~i~~~l~~~~a-~~~~~a~i 203 (383) T protein:vir:78 127 KFLKSETSGVAVWGKIFGEIKGQLDATFSDEESIQNKLTAFVVVPKDL--EKFGPAWVKRFVVTQIEEAFA-VALESAYI 203 (383) T ss_pred EEEEEcCCcceEEeecccccccccCcceeeEeecceeeEeeccchHHH--hhccHHHHHHHHHHHHHHHHH-HHHhhheE Confidence 3333 23 3466877666666667789999999988777776654333 333345788999999999999 79999999 Q ss_pred eccCCCccccchhhhhhhhhhhhhh----hhhccCCCCCHH--HHHhhh-------ceee----ecCCCc-eEEEecchh Q lcl|Aclame:pro 235 EGDGTNGFKSIDKEADVKKIKKITT----KAKSAGKTPFAD--AIEEAV-------DFVR----PTAGRR-YLIVKTEDR 296 (393) Q Consensus 235 ~gDG~~~t~~~~~e~D~~~ik~it~----~at~~~~t~~~d--al~Eal-------d~a~----~~a~~~-~l~i~~~d~ 296 (393) .|||+.- ..-+..++-.....+. ..+..+.+.+.+ .+...| .|.. ..+.+. ..+.++.|. T Consensus 204 ~G~G~~q--P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~ 281 (383) T protein:vir:78 204 VGDGNDK--PIGLNRKVGKGSTVVDGVYAEKAATGTLTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDA 281 (383) T ss_pred eccCCCC--ceeeeeccCCcccccccccccccccchhhhhhhHHHHHHHHHHHhccchhcccchhhhcCceEEEEcCcch Confidence 9999742 1111111100000000 001111111111 111111 1110 011111 223333322 Q ss_pred hhHhhhhhccccccceeeecCCCeEEEeeecccchhhcccchhc--eeeeeccccee--ccccee--eecceeEeecCce Q lcl|Aclame:pro 297 KALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKAVK--PTVLVDQKYHI--DMQDLT--KVDAFEWKTNSNM 370 (393) Q Consensus 297 ~a~~~~~~~~~~~a~~~l~~~~~~~~~~v~~~~~~~~tG~k~~~--ptv~vD~k~~~--~~~~~~--~~~s~~~~~ns~~ 370 (393) -.+. . ..-..+.+|.|+-.+|.+-..+- +..|- -.++-|-++|+ +.+|+. ..+...+-..+-. 
T Consensus 282 ~~~~-----~----~~~~~~~~G~~~t~l~~~~~iv~--s~~~p~~~iifgdfs~Y~i~~r~~~~i~~~~~~~f~~d~~~ 350 (383) T protein:vir:78 282 WDVK-----K----QYTSLNANGVYVTALPFNLNIIE--SLFVPEKKAISYVAERYDALIGGPLDIGTYDQTLAIEDLNL 350 (383) T ss_pred hhhc-----c----chhccCCCCceeeecCCCceEEe--cCCCCcccEEEeeccceEEEecccceEEecchhhhhcCceE Confidence 1110 0 00123456777765554322221 11110 13455545444 334433 2333334445566 Q ss_pred EEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 371 ILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 371 i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) +....+.-|.+.-+++-.|.+++ T Consensus 351 f~~~~r~dG~~~~~~A~~vl~~~ 373 (383) T protein:vir:78 351 YAAKQFAYGKAKDDKAAAVWTLN 373 (383) T ss_pred EEEEEEEcCEEecCCeEEEEEEE Confidence 67777889999999998898888 No 88 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=98.88 E-value=1.4e-11 Score=80.14 Aligned_cols=273 Identities=15% Similarity=0.087 Sum_probs=152.2 Q ss_pred HHccCChhHHHHHHHHHHhCccch-hhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEeec-cccccceecc Q lcl|Aclame:pro 94 KKNSGKSEIKNAWNAKLAENGVTI-TDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFD-SSNEAQVHKD 171 (393) Q Consensus 94 ~~nqg~ke~k~AW~a~L~ekgV~~-qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~-na~~a~GHk~ 171 (393) |..+.. ..-.+.. .+-...+|..+...|-+.++++..+++...+...+.....+--. 
....+.-..- T Consensus 1 m~~~~~-----------~a~~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E 69 (330) T protein:vir:77 1 MAGSTV-----------PSTQVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGPTGISIPHWTGAVSASWTGE 69 (330) T ss_pred Cccccc-----------chhhccccCCCcceechhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceeEecC Confidence 111111 1111221 22223688777777777788878877755544443332232222 2333444566 Q ss_pred cchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhh Q lcl|Aclame:pro 172 GQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADV 251 (393) Q Consensus 172 ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~ 251 (393) |.++++...+|...++.|..++.+..+.+-+.+ .+.-++.+|++++|+.++- +.++.++++|||.+.. ..-...+. T Consensus 70 g~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~--ds~~~~~~~i~~~l~~ai~-~~~~~~~l~G~g~~~~-~~g~~~~~ 145 (330) T protein:vir:77 70 AERKPITKGSFGKQELEPVKITTIFAESAEVVR--LNPLNYLNTMRTKIAEAIA-LKFDAAAIHGIDKPSA-FKGYLAET 145 (330) T ss_pred CCccccccceeeEEEEeEEEEEEeehhhHHHHh--cchHHHHHHHHHHHHHHHH-HHHHHHhhcccCCCCc-cccccccc Confidence 889999999999999999988888777433322 2335789999999999999 7999999999997431 11111111 Q ss_pred hhhhhhhh---hhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhhhhccccccceeeecCCCeEEEeeec Q lcl|Aclame:pro 252 KKIKKITT---KAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASEVGV 327 (393) Q Consensus 252 ~~ik~it~---~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~v~~ 327 (393) ........ .......+...+++..++ .+.+.......+++|+.++.+|+. +++.+|.+-+.-+. 
T Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~------------lkd~~G~~l~~~~~ 213 (330) T protein:vir:77 146 TKVVSLADTNLTTASGPQGNAYLAVNNALSLLVNSGKKWTGTLLDNVTEPILNT------------AVDGNGRPLFVEST 213 (330) T ss_pred cccceeecccccccccccchhHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHH------------HhccCCceeecCcc Confidence 11111110 011111222346676666 444455556678999999988887 66767665543221 Q ss_pred ccchh--hcccchh-ceee-----------------eeccccee--cccceee--------------------ecceeEe Q lcl|Aclame:pro 328 DEIIV--YTGSKAV-KPTV-----------------LVDQKYHI--DMQDLTK--------------------VDAFEWK 365 (393) Q Consensus 328 ~~~~~--~tG~k~~-~ptv-----------------~vD~k~~~--~~~~~~~--------------------~~s~~~~ 365 (393) ..-.. ..|...+ +|.+ +.|-..++ +-.|++- ..---|. T Consensus 214 ~~~~~~~~~~~~l~G~PV~~~~~~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~ 293 (330) T protein:vir:77 214 YTEQVGAIREGRILGRPTYVADNVVNGTVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQ 293 (330) T ss_pred ccccccccCCceecceeeEEeccccCCCCCCccEEEEEecceEEEEEecCcEEEEeecceeeecccccccccccccchhh Confidence 11111 0011111 2322 33322222 1112111 0011155 Q ss_pred ecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 366 TNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 366 ~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) .|+-.+.++.+..|.+.-|++-++++.. 
T Consensus 294 ~~~~~~r~~~r~d~~v~~~~a~~~i~~~ 321 (330) T protein:vir:77 294 HNMVAVRCEAEFAFMVNDKDAFVKLTDQ 321 (330) T ss_pred cCcEEEEEEEEeccEEecccceEEEEec Confidence 6667778888999999999888888877 No 89 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=98.77 E-value=1.1e-10 Score=75.22 Aligned_cols=278 Identities=16% Similarity=0.129 Sum_probs=156.6 Q ss_pred HHHHHHHHhC--ccchhhh-----HhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEeeccc--cccc---e--- Q lcl|Aclame:pro 104 NAWNAKLAEN--GVTITDT-----TFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDSS--NEAQ---V--- 168 (393) Q Consensus 104 ~AW~a~L~ek--gV~~qd~-----~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~na--~~a~---G--- 168 (393) =||+.+|... |+..+.. ...+|..+...|-+.+.++.++++...+...+.....+-..+. .-.| | T Consensus 1 ~a~l~el~~~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~eg~~~ 80 (333) T protein:vir:78 1 MATLNELLPNSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISYGETIIPTTVKRPEVGQVGVGTSN 80 (333) T ss_pred CchhHHhhhhcccccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCceeEeecCcccc Confidence 5666666543 3322211 1168999999999999998988887666655544344433321 2222 1 Q ss_pred -ecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchh Q lcl|Aclame:pro 169 -HKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDK 247 (393) Q Consensus 169 -Hk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~ 247 (393) ...+..++.+..+|...++.|..++...++.+-+. 
+.+.-++..|+.++|++.+- |.++-++++|||........- T Consensus 81 ~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell--~~s~~~~~~~i~~~la~ai~-~~~d~~~l~G~g~~~~~~~~g 157 (333) T protein:vir:78 81 EQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFA--RMNPSGLYTKLQGDLAYAIG-RGIDLAVFHGKSPLTGSALQG 157 (333) T ss_pred cccccccccccccceeEEEEeeEEEEEeehhhHHHH--hcCHHHHHHHHHHHHHHHHH-HHHHHHHhcccCCCCCccccc Confidence 23356788889999999999877777766644332 33335789999999999999 799999999999743221111 Q ss_pred hhhhhhhhhhh-hhhhccCCCCCHHHHHhhhceeeecCC--CceEEEecchhhhHhhhhhccccccceeeecCCCeEEEe Q lcl|Aclame:pro 248 EADVKKIKKIT-TKAKSAGKTPFADAIEEAVDFVRPTAG--RRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASE 324 (393) Q Consensus 248 e~D~~~ik~it-~~at~~~~t~~~dal~Eald~a~~~a~--~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~ 324 (393) ......+...+ ......+++.+.+++..++.-...... ...+++|+.++..|+. ++ .++|.+|.+-+. T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~-~~--------~~~d~~G~~i~~ 228 (333) T protein:vir:78 158 IDTDNVIANTTNVDYLQETGDPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLR-AQ--------AYRDANGNVDPS 228 (333) T ss_pred ccccccccccccccccccccchhHHHHHHHHHhhccccccCceEEEEcchHHHHHHH-Hh--------hhcCCCCceeec Confidence 11000000000 111223345567788888754433322 2357778877776653 11 244555554432 Q ss_pred eecccchhhc--ccchhc----------------eeeeeccccee--cccce-------eee---cc---eeEeecCceE Q lcl|Aclame:pro 325 VGVDEIIVYT--GSKAVK----------------PTVLVDQKYHI--DMQDL-------TKV---DA---FEWKTNSNMI 371 (393) Q Consensus 325 v~~~~~~~~t--G~k~~~----------------ptv~vD~k~~~--~~~~~-------~~~---~s---~~~~~ns~~i 371 (393) -.+-...-.| |...++ +.++.|-+.++ .-+|+ .+. +. 
.-|.+++-.+ T Consensus 229 ~~~~~~~~~~l~G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~ 308 (333) T protein:vir:78 229 RINLAAQTGDVLGLPAQFGRAVGGDLGAAVDSKTRIIGGDFSQLKFGFADEIRIKMSDTATLTDSGSATVSMWQTNQIAI 308 (333) T ss_pred CccccCCCceeeceeeEEccccCCCccccCCCccEEEEEecccEEEEEeeccEEEEeccccccccccceeehhhcCcEEE Confidence 1110000000 322221 23444544443 11222 111 11 1245555566 Q ss_pred EEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 372 LVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 372 ~~~~~~~g~~~~~n~~~~~~v~ 393 (393) .++.+..+.+..+++-++++-+ T Consensus 309 r~~~r~d~~v~~~~a~~~l~~~ 330 (333) T protein:vir:78 309 LIEVTFGWLLGDKQAFVKFVDD 330 (333) T ss_pred EEEEEEccEEecccceEEEecc Confidence 6777889999999888888877 No 90 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=98.76 E-value=1.5e-10 Score=74.48 Aligned_cols=262 Identities=12% Similarity=0.037 Sum_probs=143.6 Q ss_pred hCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEeecc-ccccceecccc-----hhhhhhhhhhhh Q lcl|Aclame:pro 112 ENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-SNEAQVHKDGQ-----TKTEQAATLTID 185 (393) Q Consensus 112 ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~ga-----~Kk~q~~~le~~ 185 (393) =+....++--.++|..+..-|-+.++++.++++...+.+.+.....+-..+ ...+.-+.-|+ ++.....+|... 
T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~~s~~~f~~i 80 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEEEEEeCCcceEEeecccccccccccccccceeeE Confidence 122222444447999999999999998898888777766655544443322 22333233333 466678889999 Q ss_pred hccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccc-cchhhhhhhhhhhhhhhhhcc Q lcl|Aclame:pro 186 TLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFK-SIDKEADVKKIKKITTKAKSA 264 (393) Q Consensus 186 ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~-~~~~e~D~~~ik~it~~at~~ 264 (393) ++.|..++....+.+.+.+ .+.-++.+|+.++|+..+- |..+.++++|||.+... .............-. +.. T Consensus 81 ~~~~~k~~~~~~is~ell~--ds~~~~~~~i~~~l~~~~a-~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~---~~~ 154 (305) T protein:vir:25 81 TLVAEEIAVIIPVHENVID--DATVAVLTEVAELGGQAIG-KKLDQAVIFGTDKPASWVSPALIPAAVTAGQAV---EVV 154 (305) T ss_pred EeeeEEEEEeehhhHHHHh--cchHHHHHHHHHHHHHHHH-HHHhhhheeccCCCCCccccccccccccccccc---ccc Confidence 9999888877666443333 3335789999999999999 79999999999975421 222222211111111 111 Q ss_pred CCCCCH----HHHHhhh-ceeeecCCCceEEEecchhhhHhhhhhccccccceeeecCCCeEEE--------eeecccch Q lcl|Aclame:pro 265 GKTPFA----DAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIAS--------EVGVDEII 331 (393) Q Consensus 265 ~~t~~~----dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~--------~v~~~~~~ 331 (393) ..+... +++..++ .+.........+++|+.++.+|+- |++.+|.+.+ ||.+.+.. 
T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~------------lkd~~G~~i~~~~~l~G~Pv~~~~~~ 222 (305) T protein:vir:25 155 GGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVAN------------IRDANGNPVFRDDSFAGFRTFFNRNG 222 (305) T ss_pred ccchhhhHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHH------------hhccCCceeecCCcccccceEEcCcc Confidence 112222 2233333 222222223347888888888765 6666666543 33333221 Q ss_pred hhcccchhceeeeeccccee--cccceeeec----cee--------EeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 332 VYTGSKAVKPTVLVDQKYHI--DMQDLTKVD----AFE--------WKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 332 ~~tG~k~~~ptv~vD~k~~~--~~~~~~~~~----s~~--------~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) -..+.+ .|.++.|-+.++ .-+|++--- +|. |..|+-.|.++....+.+.-|-+.+.++.. T Consensus 223 ~~~~~~--~~~~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~ 296 (305) T protein:vir:25 223 AWDADA--AIEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKT 296 (305) T ss_pred CCCCCc--cEEEEEecceEEEEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEccc Confidence 111111 133444544333 222321100 111 334444555666677777777766666654 No 91 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=98.68 E-value=1.8e-10 Score=74.01 Aligned_cols=264 Identities=13% Similarity=0.084 Sum_probs=156.9 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEee--ccccccceecc Q lcl|Aclame:pro 94 KKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSF--DSSNEAQVHKD 171 (393) Q Consensus 94 ~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l--~na~~a~GHk~ 171 (393) |..++-+.... 
...++-..++|+.+...|-+.+.++..+++...+.+.+...-..-+ .....+.-+.- T Consensus 1 m~~~~~~~~~~----------~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E 70 (297) T protein:vir:95 1 MTVQTFNPENV----------LVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGISAYWVNE 70 (297) T ss_pred CCccccccccc----------cccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCceeEEeec Confidence 44443321110 1112222368999999988889988999997777666543322323 22345666778 Q ss_pred cchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhh Q lcl|Aclame:pro 172 GQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADV 251 (393) Q Consensus 172 ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~ 251 (393) |+++++...+|...++.|..++....+.+-. ++.+.-++.+|++++|+.++- ++++.+++.|||.+........ T Consensus 71 g~~~~~~~~~f~~v~l~~~k~~~~~~is~el--l~ds~~~l~~~i~~~la~ai~-~~~d~a~l~G~g~~~~~gi~~~--- 144 (297) T protein:vir:95 71 TEKIKTDKPEVVPVTLKAHKLGIILVTSREA--LNYTWKKFFEDMKPQIVEAFY-KKIDEAGLLGHDTPFANSVAKA--- 144 (297) T ss_pred CccccccccceeEEEEeeEEEEEeehhhHHH--HhcCHHHHHHHHHHHHHHHHH-HHHHHHHhcccCCccccccccc--- Confidence 8899999999999999998888877763322 222235679999999999999 7999999999998653322111 Q ss_pred hhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhhhhccccccceeeecCCCeEEEeeecccc Q lcl|Aclame:pro 252 KKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEI 330 (393) Q Consensus 252 ~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~v~~~~~ 330 (393) .....+..+.+...+++..++ .+.........+++++.++.+|+. |++.+|.+-+....+ . 
T Consensus 145 -----~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~------------l~d~~G~~i~~~~~~-~ 206 (297) T protein:vir:95 145 -----AKDANKVIGGPINYDNILKLQDALYDADVEPNAFVSKIQNRSALRE------------ARDGNKVSIYDKAAN-T 206 (297) T ss_pred -----ccccceecccccCHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHH------------hhccCCceeecCCCC-c Confidence 111122234456678888877 333333344567778888888876 555555543321111 1 Q ss_pred hhhcccchh---------ceeeeeccccee--cccceee-------ecce---------eEeecCceEEEeeeecccccc Q lcl|Aclame:pro 331 IVYTGSKAV---------KPTVLVDQKYHI--DMQDLTK-------VDAF---------EWKTNSNMILVETLTSGHVET 383 (393) Q Consensus 331 ~~~tG~k~~---------~ptv~vD~k~~~--~~~~~~~-------~~s~---------~~~~ns~~i~~~~~~~g~~~~ 383 (393) .. |.-.+ -+.++.|-+.++ .-+|++- .... -|..|+-.+.++.+..+-+.. T Consensus 207 l~--G~Pv~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~ 284 (297) T protein:vir:95 207 ID--GITTVDLKSARFEKGDLLAGDFDNLIYGVPYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITK 284 (297) T ss_pred cc--ceeeEeecCCCCCCceEEEEecccEEEEEecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeec Confidence 11 21111 022334433332 1112111 0000 144556666777888999988 Q ss_pred cCcceeeeeC Q lcl|Aclame:pro 384 YNAGAVITVS 393 (393) Q Consensus 384 ~n~~~~~~v~ 393 (393) |++=+.++.+ T Consensus 285 ~~a~~~l~~a 294 (297) T protein:vir:95 285 TDAFAKLTPA 294 (297) T ss_pred ccceEEEeec Confidence 8888888887 No 92 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=98.59 E-value=7.6e-10 Score=70.58 Aligned_cols=278 Identities=17% Similarity=0.128 Sum_probs=153.6 Q ss_pred HHHHHHHHhC--ccchhh-----hHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEeeccc-c--------ccc Q lcl|Aclame:pro 104 NAWNAKLAEN--GVTITD-----TTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDSS-N--------EAQ 167 (393) Q Consensus 104 ~AW~a~L~ek--gV~~qd-----~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~na-~--------~a~ 
167 (393) -||+..|... |...+. .-.++|+.+..-|-+.+++...+++.+++.+.+.....+--.+. . .+. T Consensus 1 ~~~~~e~~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~~~~~~ 80 (338) T protein:vir:78 1 MATLNELAPNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPISYGETIIPTTVKRPEVGQVGVGTSN 80 (338) T ss_pred CcchHHhhhhhcccccccceecccccccchHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCccceeecccccc Confidence 5666666655 322221 12279999999988888988999887777766655443332221 1 111 Q ss_pred eecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCcc-ccch Q lcl|Aclame:pro 168 VHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGF-KSID 246 (393) Q Consensus 168 GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t-~~~~ 246 (393) -..-|+++.+...+|...++.|..++...++.+.+.+- +.-++.+|++++|++.+- |.++.+++.|||...- .... T Consensus 81 ~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~d--s~~~~~~~i~~~la~a~~-~~~d~~~l~G~g~~~~~~~~g 157 (338) T protein:vir:78 81 EQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARM--NPSGLYTKLQADLAYAIG-RGIDLAVFHGKSPLTGSALQG 157 (338) T ss_pred cccccccccccccceeEEEEEEEEEEEeehhhHHHHhc--CHHHHHHHHHHHHHHHHH-HHHHHHhhcccCCCccccccc Confidence 12346788899999999999998888887775544332 335689999999999999 7999999999996321 1111 Q ss_pred hhhhhhhhhhhhhhhhccCCCCCHHHHHhhhceee--ecCCCceEEEecchhhhHhhhhhccccccceeeecCCCeEEEe Q lcl|Aclame:pro 247 KEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVR--PTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASE 324 (393) Q Consensus 247 ~e~D~~~ik~it~~at~~~~t~~~dal~Eald~a~--~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~ 324 (393) ...+.....-.+......+.....|++..++.-.. .......+++++.++.+|+. 
+| +++|.+|.+-++ T Consensus 158 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~-~~--------~l~d~~g~~l~~ 228 (338) T protein:vir:78 158 IDTNNVIVNTTNVDYLQTGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLR-SQ--------AYRDANGNVDPT 228 (338) T ss_pred cccccccccccccccccccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHH-Hh--------hhccCCCceeec Confidence 11211111100111111222334567776664332 22234457788777776643 11 255666655432 Q ss_pred eecccchhhc--ccchhc----------------eeeeeccccee--cccceeeecce----------------eEeecC Q lcl|Aclame:pro 325 VGVDEIIVYT--GSKAVK----------------PTVLVDQKYHI--DMQDLTKVDAF----------------EWKTNS 368 (393) Q Consensus 325 v~~~~~~~~t--G~k~~~----------------ptv~vD~k~~~--~~~~~~~~~s~----------------~~~~ns 368 (393) -....-.-+| |-..++ +.++.|-..++ +-+|++---+. -|..|+ T Consensus 229 ~~~~~~~~~~l~G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 308 (338) T protein:vir:78 229 RINLAASAGDLLGLPVQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRVKMSDTATLTDNTSPTPQTVSMWQTNQ 308 (338) T ss_pred ccccCCCCceeeeeeEEEccccCccccccCCcccEEEEEecceEEEEeecccEEEEeecccccccccccccchhhhhcCc Confidence 1110000000 221111 12233433332 11222110000 033444 Q ss_pred ceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 369 NMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 369 ~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) --+.++.+..|-+.-|++-++++-+ T Consensus 309 ~~~r~~~r~d~~v~~~~a~~~l~~~ 333 (338) T protein:vir:78 309 IAILIEVTFGWLLGDKQAFVKFVDD 333 (338) T ss_pred EEEEEEEEeccEeecccceEEEecc Confidence 5566777888999988887777777 No 93 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=98.57 E-value=2.2e-09 Score=68.05 Aligned_cols=268 Identities=16% Similarity=0.108 Sum_probs=153.2 Q ss_pred cCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEeecc-ccccceecccchh Q lcl|Aclame:pro 97 SGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-SNEAQVHKDGQTK 175 (393) Q 
Consensus 97 qg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~ga~K 175 (393) -|. ++........+- ++....||..+...|=..++++.++++.+.+.+.+.....+-..+ ...+.-+--|..+ T Consensus 1 ~g~----~~e~~~~~~~~t--~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~ 74 (397) T protein:vir:23 1 MGF----SADHSQIAQTKD--TMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGIVIPHWTGDVSAQWIGEGDMK 74 (397) T ss_pred CCc----CHHHHHHhhccC--CCCccccchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceEEecCCccc Confidence 222 122333333322 222335777766666666677888888777666665544443333 3344555678889 Q ss_pred hhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhh Q lcl|Aclame:pro 176 TEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIK 255 (393) Q Consensus 176 k~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik 255 (393) ++....|...++.|..++....+.+-+- +.+.-++.+|++++|+..+- |.++.+++.|||.+.. .....+ T Consensus 75 ~~s~~~f~~v~l~~~k~~~~v~iS~ell--~ds~~~l~~~i~~~l~~aia-~~~d~a~l~G~gt~~~--~~~~~~----- 144 (397) T protein:vir:23 75 PITKGNMTKRDVHPAKIATIFVASAETV--RANPANYLGTMRTKVATAIA-MAFDNAALHGTNAPSA--FQGYLD----- 144 (397) T ss_pred cccccceeEEEEeeEEEEEeehhhHHHH--hcchHHHHHHHHHHHHHHHH-HHHHHHHhhcccCCcc--cccccc----- Confidence 9999999999999988777766643322 23335789999999999999 7999999999998541 111111 Q ss_pred hhhhhhhccCCCCCHHHHHhhhc-eeeecCCCceEEEecchhhhHhhhhhccccccceeeecCCCeEEEeeecccchhhc Q lcl|Aclame:pro 256 KITTKAKSAGKTPFADAIEEAVD-FVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYT 334 (393) Q Consensus 256 ~it~~at~~~~t~~~dal~Eald-~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~v~~~~~~~~t 334 (393) .+........+...+.+..+++ +...-.....+++++.++.+|+- +++.+|.+.+.-...+..... 
T Consensus 145 -~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~------------lkd~~G~~i~~~~~~~~~~~~ 211 (397) T protein:vir:23 145 -QSNKTQSISPNAYQGLGVSGLTKLVTDGKKWTHTLLDDTVEPVLNG------------SVDANGRPLFVESTYESLTTP 211 (397) T ss_pred -cccceeeecccchhHHHHHHHHhhhhcccCCCEEEEcHHHHHHHHH------------hhccCCceeeccccccccccc Confidence 1111222334445556666662 22223334567888888888876 777777776543332221111 Q ss_pred --ccchh-ceeee-------------eccccee--ccccee--eeccee--------------EeecCceEEEeeeeccc Q lcl|Aclame:pro 335 --GSKAV-KPTVL-------------VDQKYHI--DMQDLT--KVDAFE--------------WKTNSNMILVETLTSGH 380 (393) Q Consensus 335 --G~k~~-~ptv~-------------vD~k~~~--~~~~~~--~~~s~~--------------~~~ns~~i~~~~~~~g~ 380 (393) ++..+ +|.++ -|-+.++ ..+|++ ..+... |..|+-.+.++.+..+. T Consensus 212 ~~~~tl~G~Pv~~s~~~~~g~~~~~~gDfs~~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~ 291 (397) T protein:vir:23 212 FREGRILGRPTILSDHVAEGDVVGYAGDFSQIIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLL 291 (397) T ss_pred ccCceeeeeeEEEeCCCCCCceEEEEeecceEEEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccc Confidence 11111 13322 2333221 222221 111111 34445556666778888 Q ss_pred ccccCcceeeeeC Q lcl|Aclame:pro 381 VETYNAGAVITVS 393 (393) Q Consensus 381 ~~~~n~~~~~~v~ 393 (393) +.-+++-+++... 
T Consensus 292 v~~~~a~~~~~~~ 304 (397) T protein:vir:23 292 INDVNAFVKLTFD 304 (397) T ss_pred eecccceEEEeec Confidence 8888887777765 No 94 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=98.52 E-value=1.9e-09 Score=68.34 Aligned_cols=265 Identities=14% Similarity=0.064 Sum_probs=145.3 Q ss_pred HHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEeecc-ccccceecccchhhhhhhhhhhhhcc Q lcl|Aclame:pro 110 LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-SNEAQVHKDGQTKTEQAATLTIDTLE 188 (393) Q Consensus 110 L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~ga~Kk~q~~~le~~ti~ 188 (393) ++..|= .++|..+..-|-..++++..+.+...+...+.....+-..+ .-.+.-+--|+++++...+|...++. T Consensus 1 ma~~gG------~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~f~~v~l~ 74 (298) T protein:vir:16 1 MVLNKG------TLFDPTLVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMV 74 (298) T ss_pred CcccCc------ceechhHHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcceEEecCCccccccccceeEEEEe Confidence 333321 24787777777777777777777666555554444444433 33455556677999999999999999 Q ss_pred HHHHHHHHHH-HHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCC Q lcl|Aclame:pro 189 PVMVYKLQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKT 267 (393) Q Consensus 189 p~~VYkkq~L-ad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t 267 (393) |..++....+ .+++.....++-++.+|+.++|+..+- |.++.++..|+|...-...+..+...-....+........+ T Consensus 75 ~~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~-~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (298) T protein:vir:16 75 PIKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVA-RGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGI 153 (298) T ss_pred eeeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHH-HHHHHHhhccccCCCCccccccccccccccccccccccccc Confidence 9888887777 344444555566889999999999998 
89999999994322111122222111011111111111112 Q ss_pred CC-HHHHHhhhceeee-cCCCceEEEecchhhhHhhhhhccccccceeeecCCCeEEEe---------------eecccc Q lcl|Aclame:pro 268 PF-ADAIEEAVDFVRP-TAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASE---------------VGVDEI 330 (393) Q Consensus 268 ~~-~dal~Eald~a~~-~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~---------------v~~~~~ 330 (393) .. .+++..++.-..+ ......+++|+.++.+|+. |++.+|.+-++ |-+++. T Consensus 154 ~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~------------lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~ 221 (298) T protein:vir:16 154 ADPNGAIENAVELLTGVDADVTGIAINPSFRSALAK------------QKDLQDNALFPELKWGATPDTINGLPVDVNKT 221 (298) T ss_pred ccHHHHHHHHHHHhhhcCCCccEEEEcHHHHHHHHH------------hhccCCCeeecCcccCCCCceecceeeEEecc Confidence 22 2344454432222 2223358889999988877 66666665542 222221 Q ss_pred hhhcccchhceeeeeccc--ceec-ccceee-ecce---------eEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 331 IVYTGSKAVKPTVLVDQK--YHID-MQDLTK-VDAF---------EWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 331 ~~~tG~k~~~ptv~vD~k--~~~~-~~~~~~-~~s~---------~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) .-..+...-.+.++-|-+ +.+. .++++- +... 
-|.+|+-.+.++.+..+-+..|++=++++-+ T Consensus 222 v~~~~~~~~~~~~~GDfs~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~a 297 (298) T protein:vir:16 222 VSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEA 297 (298) T ss_pred cccccCCCccEEEEeeccceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeec Confidence 110000000112222322 1121 111110 0000 1344555566677888888888888888777 No 95 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=98.49 E-value=2.1e-09 Score=68.12 Aligned_cols=249 Identities=14% Similarity=0.103 Sum_probs=145.8 Q ss_pred HHHhCccch-hhhHhhcchhHHHHHHHHHHhhCccccceeeeccccee--EEEeeccc--cccceecccchhhhh-hhhh Q lcl|Aclame:pro 109 KLAENGVTI-TDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALL--VSRSFDSS--NEAQVHKDGQTKTEQ-AATL 182 (393) Q Consensus 109 ~L~ekgV~~-qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a--~~i~l~na--~~a~GHk~ga~Kk~q-~~~l 182 (393) -|......+ .+--..+|..+...|-+.++++.++.+...+.+.+... ..+...+. ..+.-+--|.++++. ..+| T Consensus 1 ~l~~~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~ 80 (293) T protein:vir:48 1 MLDSKTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADIDDPKL 80 (293) T ss_pred CceeecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCcccccccccce Confidence 333332211 12112689999999999999988887766655544332 23332221 222223345566654 5799 Q ss_pred hhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 183 TIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAK 262 (393) Q Consensus 183 e~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at 262 (393) ...++.|..++....+-+.+.+ .+.-++.+|++++|+..+- |..+.+++.|+|...+. 
T Consensus 81 ~~i~l~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~la~~~~-~~~~~~i~~g~~~~~~~------------------- 138 (293) T protein:vir:48 81 SLIKYTIKRYAGISTVTNSLLA--DSAENILAWLSGWIAKKVV-VTRNKAILGVVDKLPTK------------------- 138 (293) T ss_pred eEEEEeeeEEEEeehhhHHHHh--hhhHHHHHHHHHHHHHHHH-HHHHhHHhhcccccccc------------------- Confidence 9999999999988777443332 2224689999999999998 79999999999875532 Q ss_pred ccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhhhhccccccceeeecCCCeEEEeeecccchhhc--ccchh Q lcl|Aclame:pro 263 SAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKAV 339 (393) Q Consensus 263 ~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~v~~~~~~~~t--G~k~~ 339 (393) +++...|+|..++ .+...-..+..+++|+.++..|+. |++.+|.+-+.-.+.+-.-.| |...+ T Consensus 139 --~~~~~~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~L~~------------lkd~~g~~l~~~~~~~~~~~~l~G~Pv~ 204 (293) T protein:vir:48 139 --PTLTKWDDIIDLEAKVDPAIKQTSFFLTNTSGFTALKK------------VKNALGDYLMERDVKSPTGYSIAGFAVK 204 (293) T ss_pred --ccccCHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHH------------hhccCCceEeecCcCCCCCceecceeeE Confidence 2233456666665 232223345568889999888876 677777755433222211100 22111 Q ss_pred --------------ceeeeecccce--e-cccceeeecce----eEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 340 --------------KPTVLVDQKYH--I-DMQDLTKVDAF----EWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 340 --------------~ptv~vD~k~~--~-~~~~~~~~~s~----~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) .+.++.|-+.+ + +.+|++-.-+. 
-|..++-.+.++.+..|-+.-+++-+.++++ T Consensus 205 ~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~ 279 (293) T protein:vir:48 205 EISDRWLPNASSGVMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFK 279 (293) T ss_pred EecccccCCccCCceEEEEEeccceEEEEEecceEEEEecccchhhhcCeEEEEEEEeeCcEEecccceEEEEee Confidence 12223332211 1 22333222111 1344555577777888888888887777877 No 96 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=98.45 E-value=4.9e-09 Score=66.16 Aligned_cols=278 Identities=14% Similarity=0.078 Sum_probs=149.5 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEeecc-ccccceeccc Q lcl|Aclame:pro 94 KKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-SNEAQVHKDG 172 (393) Q Consensus 94 ~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~g 172 (393) |..+.+-+.. ...+...+- ++--..+|..+..-|-+.+.++..+++...+.+.+.....+-..+ ...+.-+.-| T Consensus 1 ~~~~~~~~~~---~~~~~~t~~--~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~ 75 (320) T protein:vir:10 1 MAAGTAFQVD---HAQIAQTGD--TMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWIGDVSAQWIGEG 75 (320) T ss_pred CCCCccCCHH---HHHhhcccc--ccccccccHHHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEecCC Confidence 3333221222 222333321 222225898888888888888888888666666554444443333 3344445678 Q ss_pred chhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhh Q lcl|Aclame:pro 173 QTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVK 252 (393) Q Consensus 173 a~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~ 252 (393) +++.+...+|...++.|..++....+-+-+- +.+.-++.+|+.++|++++- |.++.+++.|||.+............ 
T Consensus 76 ~~~~~~~~~f~~v~~~~~k~~~~~~is~ell--~ds~~~l~~~i~~~l~~a~a-~~~d~a~l~G~g~~~~~~~~~~~~~~ 152 (320) T protein:vir:10 76 DMKPITKGNMTSQNIAPHKIATIFVASAETV--RANPANYLGTMRTKVATAFA-MAFDSAALNGTDSPFPTYLAQTTKSV 152 (320) T ss_pred ccccccccceeEEEEeeEEEEEeehhhHHHH--hcChHHHHHHHHHHHHHHHH-HHHHHHhhcccCCCCCcccccccccc Confidence 8899999999999999988877766633322 22235789999999999999 79999999999975422221111111 Q ss_pred hhhhhhhhhhccCCCCCHH-HHHhhh-ceeeecCCCceEEEecchhhhHhhhhhccccccceeeecCCCeEEEeeec--c Q lcl|Aclame:pro 253 KIKKITTKAKSAGKTPFAD-AIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASEVGV--D 328 (393) Q Consensus 253 ~ik~it~~at~~~~t~~~d-al~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~v~~--~ 328 (393) .. .++...........+ .+..++ .......+...+++|+.++.+|+. |++.+|.+-+.-.+ + T Consensus 153 ~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~------------lkd~~G~~l~~~~~~~~ 218 (320) T protein:vir:10 153 SL--ADPGGATASDLTAYDAVAVNGLSLLVNAKKKWTHTLLDDIVEPILNG------------AKDKNGRPLFIESTYTD 218 (320) T ss_pred cc--eecccccccccccHHHHHHHHHhhhhcccCCCcEEEEcHHHHHHHHH------------hhccCCceeeccccccC Confidence 00 001111111111222 333333 222333456688999999988886 66666665543211 1 Q ss_pred cchhhcccchh-cee-------------eeeccccee--cccceeeecce----e------------EeecCceEEEeee Q lcl|Aclame:pro 329 EIIVYTGSKAV-KPT-------------VLVDQKYHI--DMQDLTKVDAF----E------------WKTNSNMILVETL 376 (393) Q Consensus 329 ~~~~~tG~k~~-~pt-------------v~vD~k~~~--~~~~~~~~~s~----~------------~~~ns~~i~~~~~ 376 (393) ...-..|...+ +|. ++.|-.+++ ...|++---+. . 
|..|+--|.++.+ T Consensus 219 ~~~~~~~~~i~g~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~ 298 (320) T protein:vir:10 219 ENSPFRAGRIVSRPTILSDHVADGTTVGYMGDFRNVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAE 298 (320) T ss_pred ccccccCceeeeeeeEecCCCCCCceEEEEeecceEEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEe Confidence 11111111111 111 122332222 11222110000 0 3344445566678 Q ss_pred ecccccccCcceeeeeC Q lcl|Aclame:pro 377 TSGHVETYNAGAVITVS 393 (393) Q Consensus 377 ~~g~~~~~n~~~~~~v~ 393 (393) ..+.+.-|.+-++++.. T Consensus 299 ~d~~v~~~~a~~~l~~~ 315 (320) T protein:vir:10 299 YAFHNNDKDAFVKLTNV 315 (320) T ss_pred eccEEecccceEEEEec Confidence 88888888777777644 No 97 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=98.41 E-value=4.2e-09 Score=66.49 Aligned_cols=265 Identities=14% Similarity=0.058 Sum_probs=142.8 Q ss_pred HHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEeecc-ccccceecccchhhhhhhhhhhhhcc Q lcl|Aclame:pro 110 LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-SNEAQVHKDGQTKTEQAATLTIDTLE 188 (393) Q Consensus 110 L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~ga~Kk~q~~~le~~ti~ 188 (393) +++.- ++.-.++|..+..-|-..+.++..+.....+...+.....+--.+ .-.+.=.--|.++++...+|...++. 
T Consensus 1 ma~~t---~~~G~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~ 77 (300) T protein:vir:95 1 MSEAQ---LSKGNLFNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQREFVFDFDSDIDIVAENGKKTHGGVSLDPVTIV 77 (300) T ss_pred Ccccc---cCCcceechhhHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcceEEeeCCcccccccccceeeEee Confidence 22111 111115788777777777777777766555554443333332222 22333334578999999999999999 Q ss_pred HHHHHHHHHH-HHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCC Q lcl|Aclame:pro 189 PVMVYKLQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKT 267 (393) Q Consensus 189 p~~VYkkq~L-ad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t 267 (393) |..++....+ .++.......+-++.+|+.++|++++- |..+.++..|+|..--......+....- ...+.....+.+ T Consensus 78 ~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia-~~~d~~~l~G~~~~~g~~~~~~~~~~~~-~~~~~~~~~~~~ 155 (300) T protein:vir:95 78 PLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLA-RGLDIMSIHGINPRTKQASTIIGDNCFD-KKVTQTVPFKDT 155 (300) T ss_pred eEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHH-HHHHHhhhhcccCCCCCCcccccccccc-cccceeeccccc Confidence 9888777666 344444456677899999999999999 7999999999532111111111211100 111111122234 Q ss_pred CCHHHHHhhhc-eeeecCCCceEEEecchhhhHhhhhhccccccceeeecCCCeEEEe-eecc---cchhhcccchhc-- Q lcl|Aclame:pro 268 PFADAIEEAVD-FVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASE-VGVD---EIIVYTGSKAVK-- 340 (393) Q Consensus 268 ~~~dal~Eald-~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~-v~~~---~~~~~tG~k~~~-- 340 (393) ...+++..++. +.........+++|+.++.+|+. |++.+|.+-++ ...+ .... 
|-..++ T Consensus 156 ~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~------------lkd~~G~~i~~~~~~~~~~~~l~--G~Pv~~s~ 221 (300) T protein:vir:95 156 NPDESMEDAVGMIDGSERDITGAILDPIFTTALSK------------MKNAEGGKLYPELAWGGVPDAIN--GLAVDKNR 221 (300) T ss_pred chHHHHHHHHHHhhhcCCCccEEEECHHHHHHHHH------------hhccCCCeeccCccccCCCceec--ceeeEEec Confidence 44566777663 32223333468899999988877 77777766542 1111 1111 322111 Q ss_pred ----------eeee-ecccce--ec-cccee-eecce---------eEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 341 ----------PTVL-VDQKYH--ID-MQDLT-KVDAF---------EWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 341 ----------ptv~-vD~k~~--~~-~~~~~-~~~s~---------~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) ..++ -|-..+ +. .++++ +++.+ -|.+|+-.+.++....+-+..|++=+.++-. T Consensus 222 ~v~~~~~~~~~~~~~GDf~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~ 298 (300) T protein:vir:95 222 TVSYSQTDPKNTAIVGDFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKT 298 (300) T ss_pred CCCCCCCCCccEEEEeeccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecceeecccceEEEecC Confidence 1112 242211 11 11110 00000 1445555556666777788877777776666 No 98 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=98.36 E-value=7e-09 Score=65.30 Aligned_cols=264 Identities=13% Similarity=0.098 Sum_probs=143.3 Q ss_pred CccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEeecc-ccccceecccchhhhhhhhhhhhhccHHH Q lcl|Aclame:pro 113 NGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-SNEAQVHKDGQTKTEQAATLTIDTLEPVM 191 (393) Q Consensus 113 kgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~ga~Kk~q~~~le~~ti~p~~ 191 (393) -|+. ++--.++|..+...|-..++++..+++...+...+.....+...+ ...+.=+.-|.+++..+.+|...++.|.. 
T Consensus 1 m~t~-t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~~v~l~~~k 79 (303) T protein:vir:97 1 MGTE-TSKASLFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEFTFTLDSDIDVVAENGKKTHGGLSLEPVTIVPIK 79 (303) T ss_pred Cccc-CCCCeEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEecCcceEEeecCccccccccceeeEEeeeEE Confidence 2211 111227898888888888888788887766766665555554433 23444455678999999999999999988 Q ss_pred HHHHHHHHH-HHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCc-cccchhhhhhhhhhhhhhhhhcc-CCCC Q lcl|Aclame:pro 192 VYKLQSLAE-RVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNG-FKSIDKEADVKKIKKITTKAKSA-GKTP 268 (393) Q Consensus 192 VYkkq~Lad-~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~-t~~~~~e~D~~~ik~it~~at~~-~~t~ 268 (393) ++...++-+ ++.....+.-.+.+|+.++|+.++- |.++.++..|+|... +...+ .+... ....++.+... ++.. T Consensus 80 l~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~-~~ld~a~l~G~~~~~g~~~~~-~~~~~-~~~~~~~~~~~~~~~~ 156 (303) T protein:vir:97 80 VEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLA-RGIDLMAMHGINPRTKKASDV-IGTNH-FDSKVTQVVKFTESED 156 (303) T ss_pred EEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHH-HHHHhhhhcccccCCcccccc-ccccc-cccccccccccccccc Confidence 888877733 3333344456789999999999999 799999999964311 11111 11100 00111111111 2233 Q ss_pred CHHHHHhhhceee-ecCCCceEEEecchhhhHhhhhhccccccceeeecCCCeEE----------------Eeeecccch Q lcl|Aclame:pro 269 FADAIEEAVDFVR-PTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIA----------------SEVGVDEII 331 (393) Q Consensus 269 ~~dal~Eald~a~-~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~----------------~~v~~~~~~ 331 (393) ..+++..++.-.. .......+++|+.++.+|+. |++.+|.+- .||-+++.. 
T Consensus 157 ~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~------------lkd~~g~~~~~~~~~~~~~~~~l~G~Pv~~s~~v 224 (303) T protein:vir:97 157 ADANIEAAVNLIQGAEGVVTGLAMDTEFSTALAK------------VTNGEMGPKMYPELAWGANPDSINGLKSSVNTTV 224 (303) T ss_pred hHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHH------------hhccCCCeEEecCccCCCCCceecceeeEEeccc Confidence 3466666663322 23334458899999998886 455555432 232222210 Q ss_pred hhccc---chhceeeeeccc--ceecc-ccee-eeccee---------EeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 332 VYTGS---KAVKPTVLVDQK--YHIDM-QDLT-KVDAFE---------WKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 332 ~~tG~---k~~~ptv~vD~k--~~~~~-~~~~-~~~s~~---------~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) = ++. ..-.+.++.|=. +++.. ++++ ....+. |..|+=.+.++.+..+-+.-|++=++++=+ T Consensus 225 ~-~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~ 301 (303) T protein:vir:97 225 G-AGADEAESKDLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKG 301 (303) T ss_pred C-CccccCCCccEEEEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCC Confidence 0 000 000112333322 22211 1111 011110 333333444566777777777666666555 No 99 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=98.30 E-value=6.3e-09 Score=65.54 Aligned_cols=262 Identities=14% Similarity=0.029 Sum_probs=137.1 Q ss_pred HHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEeecc-ccccceecccchhhhhhhhhhhhhcc Q lcl|Aclame:pro 110 LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-SNEAQVHKDGQTKTEQAATLTIDTLE 188 (393) Q Consensus 110 L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~ga~Kk~q~~~le~~ti~ 188 (393) ++ . .+++.-..+|+.+..-|-..++++..+.+..++...+.....+-..+ ...+.-.--|.++.+...+|...++. 
T Consensus 1 Ma--t-~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v~l~ 77 (311) T protein:vir:99 1 MA--T-FGTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNEDIITFNGRPKAEFVGEGQQKSSTTGEFDFVTST 77 (311) T ss_pred Cc--e-ecCCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCceeEEeecCcccccccceeeEEEEe Confidence 11 1 11222236898888888888888888877666666555444543333 22333345577888899999999999 Q ss_pred HHHHHHHHHH-HHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhh--hccC Q lcl|Aclame:pro 189 PVMVYKLQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKA--KSAG 265 (393) Q Consensus 189 p~~VYkkq~L-ad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~a--t~~~ 265 (393) |..++..-.+ .+++...+-+.-++.+|+.++|++.+- +..+.++..|||...-....-..- .+...+... +..+ T Consensus 78 ~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~-~~~d~~~l~G~g~~~g~~~~g~~~--~~~~~~~~~~~~~~~ 154 (311) T protein:vir:99 78 PKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALA-RALDLGLYHRINPLTGTVIPGWSN--YLGAASKRVELTADT 154 (311) T ss_pred eEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHH-HHHHHHhhcccCcccCcccccccc--ccccccceeeccccc Confidence 9877777666 333333333345689999999999999 799999999988532111111100 000011111 1111 Q ss_pred CCCCHHHHHhhhce---eeecCCCceEEEecchhhhHhhhhhccccccceeeecCCCeEEE---------------eeec Q lcl|Aclame:pro 266 KTPFADAIEEAVDF---VRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIAS---------------EVGV 327 (393) Q Consensus 266 ~t~~~dal~Eald~---a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~---------------~v~~ 327 (393) .+...+.+.-++.. 
+.-.....-+++|+.++.+|+- |++.+|.+.+ ||.+ T Consensus 155 ~~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~------------lkd~~G~~l~~~~~~~~~~~~l~G~Pv~~ 222 (311) T protein:vir:99 155 IANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLST------------ARYTDGRKKFPELGLGIGVSSFEGIDASV 222 (311) T ss_pred cchhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHh------------hhccCCCeeecCcccCCCCceecceeeEe Confidence 12222233323321 1112222337889888888876 5666665443 3333 Q ss_pred ccchh------------hcccchhceeeeecccce--e-cccceeeeccee----------EeecCceEEEeeeeccccc Q lcl|Aclame:pro 328 DEIIV------------YTGSKAVKPTVLVDQKYH--I-DMQDLTKVDAFE----------WKTNSNMILVETLTSGHVE 382 (393) Q Consensus 328 ~~~~~------------~tG~k~~~ptv~vD~k~~--~-~~~~~~~~~s~~----------~~~ns~~i~~~~~~~g~~~ 382 (393) ++..- +.|.+ .+.++-|-... + ..++++ +.... |..|+-.+.++....|.+. T Consensus 223 s~~i~~~~~~~~~~~~~~~~~~--~~~~~Gdf~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~ 299 (311) T protein:vir:99 223 SDTVNGGDEADPDDEDLDAARA--VRGIVGDFANGIHWGVQRDIP-VELIKYGDPDGQGDLKRHNQIALRLEIVYGWYVF 299 (311) T ss_pred ecccccccccccccchhhccCc--ceEEEeeccccEEEEEecCce-EEEeecCCCCcchhhhhcCcEEEEEEEeecceec Confidence 22211 00000 01222232211 1 111111 11111 4445555566788888886 Q ss_pred ccCcceeeeeC Q lcl|Aclame:pro 383 TYNAGAVITVS 393 (393) Q Consensus 383 ~~n~~~~~~v~ 393 (393) .+ +.+++.-+ T Consensus 300 ~~-~~v~~~~~ 309 (311) T protein:vir:99 300 TD-RFVVIENA 309 (311) T ss_pred Ch-hHeeeecc Confidence 64 45554444 No 100 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=98.29 E-value=3e-07 Score=56.35 Aligned_cols=364 Identities=15% Similarity=0.126 Sum_probs=150.5 Q ss_pred CCcch-hhHHHHHHHHHHHhhHHHHhhhhhhh-hhhHHhhhhHHHHHHHHHHHHHHHHHHHHH-HHHhhh------hhhc Q lcl|Aclame:pro 1 MNKPD-LIEKQNRLAELKENNVSLKSQISGFE-VKNAIEDLPKVQELEKTLSENSIEIIKIEN-ELNAQE------EKPK 71 (393) 
Q Consensus 1 ~~k~d-~~ekq~eLa~lK~~~~~~~s~i~~~~-v~~a~~~~skieelektis~l~aEi~k~en-el~~~~------Ek~K 71 (393) |.-.+ +.+.+.+.+++-+.+..+..+...-. .-++.+ ..++++++..+..++.+|...+. +..... +... T Consensus 193 ~~~~e~i~~l~~~ra~~~~~~~~l~~~a~~~g~~l~aee-~~~~d~l~aei~~l~~~i~r~e~~e~~~a~~a~pv~~~~~ 271 (645) T protein:vir:93 193 MNIGEQIKSFENKRAALAASLEEVMTKAAEEGRTLDVEE-EEHYDNTAAEIRQVDAHLKRLRELEAGKAATAQPVKQAGN 271 (645) T ss_pred cchhhhhhhhhHHHHHHHHHhhhhhhhHhhhccccCHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc Confidence 11111 12233333333333333222211100 011222 36777777777777777653222 111100 0000 Q ss_pred ch---------hHHHHHHHHHHHHHHHHHHHHHccCCh----hHH-HHH---------HHHHHhCccchhhhH----hhc Q lcl|Aclame:pro 72 GK---------DKMTNFIESQNAVTEFFDVLKKNSGKS----EIK-NAW---------NAKLAENGVTITDTT----FQL 124 (393) Q Consensus 72 ~k---------~emtEfLkTkqA~~dya~ll~~nqg~k----e~k-~AW---------~a~L~ekgV~~qd~~----eiL 124 (393) +. +...+=.........|++-|....|.- ++. ..| ..+-...|.++ +.. .+. T Consensus 272 ~~~~~~~~~~~~~~~~~~~kg~~f~~~~~al~~~~g~~~~a~e~a~~~~~~~~~~~~~~~~a~~~~~~~-~~~~~Gg~~v 350 (645) T protein:vir:93 272 GNVAAVASAPVIRVEQKLDKGIGFARFAKSLAAAKGVRSEALEVARRQYPDDSRLHHVLKSAVGAGTTT-DPQWAGSLSE 350 (645) T ss_pred cccccccccccccchhhhhhhhhHHHHHHHHHhcccchhHHHHHHHhhcccchhhhhhhhhhhhccccc-cccccCCccC Confidence 00 000000001111122333333333320 111 111 11111233321 211 145 Q ss_pred chhHHHHHHHHHHhhCcccccee------eecccceeEEEeecc--ccccceecccchhhhhhhhhhhhhccHHHHHHHH Q lcl|Aclame:pro 125 PRKLVESINTALLNTNPVFKVFH------VTNVGALLVSRSFDS--SNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQ 196 (393) Q Consensus 125 P~~ii~AIe~A~ed~d~vl~~fh------V~n~~~~a~~i~l~n--a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq 196 (393) |..+..-|=+.+.+. .++..+- ....| +-+.+--+. ....|. --|.++..+..+|...++.|..++-.. 
T Consensus 351 p~~~~~~ii~~l~~~-svv~~l~~~~~~~~~~~~-~~~~ip~~t~~~~a~wv-~Eg~~~~~s~~~f~~v~l~~~kla~~~ 427 (645) T protein:vir:93 351 YQEYAQDFIDYLRPQ-TIIGRFGQGGIPALRQVP-FNIRVHAQVSGGAAGWV-GEGKTKPLTKFDFESITFSHAKVSAIA 427 (645) T ss_pred chhhHHHHHHhhhhh-hhHHhhcccccccccccc-CceeeeeeecCcceEEe-ccCccccccccceeEEEEeeEEEEEee Confidence 544333333344432 2222221 01111 111222222 234454 357788999999999999998888776 Q ss_pred HHH-HHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHh Q lcl|Aclame:pro 197 SLA-ERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEE 275 (393) Q Consensus 197 ~La-d~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~E 275 (393) .+- +++.+.. -++.+|+.++|+.++- +.++.|++.|||...+-..|. + +. -..+++..+++. ..++.. T Consensus 428 ~iS~ell~ds~---~~~~~~i~~~l~~aia-~~~d~a~l~g~g~~~~~~~p~-g-i~----~~~~~~~~~~~~-~~d~~~ 496 (645) T protein:vir:93 428 VLTEELIRFSS---PAADALVRNALAEAVV-ARLDTDFVDPKKAAVADVSPA-S-IT----HDVKGTASSGNP-DADAEA 496 (645) T ss_pred hhHHHHHhhch---HHHHHHHHHHHHHHHH-HHHHHHhhcCCCcccCCcccc-c-ee----ccccccccccch-HHHHHH Confidence 663 3334332 3568999999999999 799999999998754333332 1 00 011122222222 233333 Q ss_pred hh---ceeeecCCCceEEEecchhhhHhhhhhccccccceeeecCCCeEEEe-eecccchhhcccchhc----ee--eee Q lcl|Aclame:pro 276 AV---DFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASE-VGVDEIIVYTGSKAVK----PT--VLV 345 (393) Q Consensus 276 al---d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~-v~~~~~~~~tG~k~~~----pt--v~v 345 (393) .+ .-+.........++|+.+..+|+. |++.+|.+.++ ++..+-++ -|--.+. |. 
.++ T Consensus 497 ~~~~~~~a~~~~~~a~~vmn~~~~~~L~~------------lkd~~G~~~~~~~~~~~~tL-~G~PV~~s~~vp~~~~~g 563 (645) T protein:vir:93 497 AFGQFVAANLQPTGAVWLMSSTNALALSM------------RKNALGQKEYPDMTLLGGSF-QGLPVIVSQYVGDQLVLV 563 (645) T ss_pred HHHHHHhcCCCccccEEEEcHHHHHHHHh------------ccccCCceeecCCCCCCcee-eceeeEEeccCCcceeEe Confidence 33 212222234467788888888876 66777765543 11111111 0211111 21 222 Q ss_pred ccccee---------cccceee-------------eccee----EeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 346 DQKYHI---------DMQDLTK-------------VDAFE----WKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 346 D~k~~~---------~~~~~~~-------------~~s~~----~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) |....+ +.+..-+ .+... |..|+=-|.++....+-+-.|.+-++++=. T Consensus 564 d~s~~~ig~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt~~ 637 (645) T protein:vir:93 564 NAPDIYLADDGGVAVDMSREASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVITGV 637 (645) T ss_pred ccccEEEEEecceEEEeecceeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEEEecc Confidence 322111 1111101 01111 333333444445556666555555555533 No 101 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=98.25 E-value=3.2e-08 Score=61.68 Aligned_cols=262 Identities=14% Similarity=0.070 Sum_probs=144.8 Q ss_pred chhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEeecc-ccccceecccchhhhhhhhhhhhhccHHHHHH Q lcl|Aclame:pro 116 TITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-SNEAQVHKDGQTKTEQAATLTIDTLEPVMVYK 194 (393) Q Consensus 116 ~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYk 194 (393) -.++.-.++|..+...|-..++++..+++..++...+.....+--.+ .-.+.-+--|+++.+...+|...++.|..++. 
T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~ 80 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceEEeeCCccccccccceeEEEEeeeEEEE Confidence 22333346898888888788888888888777666655544443322 33444466788999999999999999988877 Q ss_pred HHHHHH-HHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeec----cCCCccccchhhhhhhhhhhhhhhhhccCCCCC Q lcl|Aclame:pro 195 LQSLAE-RVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG----DGTNGFKSIDKEADVKKIKKITTKAKSAGKTPF 269 (393) Q Consensus 195 kq~Lad-~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~g----DG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~ 269 (393) .-.+-+ ++.........+..|+.++|+..+- |.++.++..| +|.+..... .-+..-........+ ...+.. T Consensus 81 ~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~-~~~d~~~l~G~~~~~g~~~~~~~-~~~~~~~~~~~~~~~--~~~~~~ 156 (298) T protein:vir:94 81 GARISDEFMYASDEEKINILQAFNDGFAKKVA-RGIDLMAFHGVNPRLGTASAVIG-TNHFDSKVTQKVEAP--RGIADP 156 (298) T ss_pred eeehhHHHhccCCccHHHHHHHHHHHHHHHHH-HHHHHHhhcccccCCCccccccc-ccccccccccccccc--cccccH Confidence 766633 3333333455788999999999998 8999999998 333221111 011111011011111 111222 Q ss_pred HHHHHhhhceee-ecCCCceEEEecchhhhHhhhhhccccccceeeecCCCeEEE---------------eeecccchhh Q lcl|Aclame:pro 270 ADAIEEAVDFVR-PTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIAS---------------EVGVDEIIVY 333 (393) Q Consensus 270 ~dal~Eald~a~-~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~---------------~v~~~~~~~~ 333 (393) .+++..++.-.. .......+++|+.++.+|+. |++.+|.+.+ ||-+++..-. 
T Consensus 157 ~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~------------lkd~~G~~l~~~~~~~~~~~tl~G~PV~~~~~v~~ 224 (298) T protein:vir:94 157 NGAIENAVELLTGVDADVTGIAINPSFRSALAK------------QKDLQGNALFPELKWGATPDTINGLPVDVNKTVSD 224 (298) T ss_pred HHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHH------------hhccCCCeeecCcccCCCCceecceeeEEeccccc Confidence 345666663332 23344468899999988877 5555555443 3333332111 Q ss_pred cccchhceeeeecccce--e-cccceee-ecce---------eEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 334 TGSKAVKPTVLVDQKYH--I-DMQDLTK-VDAF---------EWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 334 tG~k~~~ptv~vD~k~~--~-~~~~~~~-~~s~---------~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) .+...-.+.++-|-+.. + ..++++- +..+ -|.+|+=.+.++.+..+.+..|++=++++-+ T Consensus 225 ~~~~~~~~~~~Gdfs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~ 297 (298) T protein:vir:94 225 MSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEA 297 (298) T ss_pred ccCCCccEEEEeeccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEec Confidence 11111112223343321 1 1122210 1100 1233334455566778888888877777766 No 102 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=98.18 E-value=2.5e-08 Score=62.31 Aligned_cols=264 Identities=16% Similarity=0.039 Sum_probs=142.3 Q ss_pred HHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEeecc-ccccceecccchhhhhhhhhhhh Q lcl|Aclame:pro 107 NAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-SNEAQVHKDGQTKTEQAATLTID 185 (393) Q Consensus 107 ~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~ga~Kk~q~~~le~~ 185 (393) -+-+..-| .++|..+...|=+.++++.++.+...+.+.+...+.+--.+ ...+.=+--|+++.+...+|... 
T Consensus 1 mat~~~gg-------~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v 73 (311) T protein:vir:81 1 MVALATGT-------FQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPV 73 (311) T ss_pred CceecCCc-------eEcchhHHHHHHHHHHhcchhhhhcceeecCCCceEEEEEeCCceeEEeecCcccccccceeeEE Confidence 11111112 26898888888888888788887666766665555554433 33444456788999999999999 Q ss_pred hccHHHHHHHHHHH-HHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCcc-ccchhhhhhhhhhhhhhhhhc Q lcl|Aclame:pro 186 TLEPVMVYKLQSLA-ERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGF-KSIDKEADVKKIKKITTKAKS 263 (393) Q Consensus 186 ti~p~~VYkkq~La-d~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t-~~~~~e~D~~~ik~it~~at~ 263 (393) ++.|..++.+.++- +++.......-.+.+|+.++|+..+- |..+.++..|+|.... ........+.......+.+ . T Consensus 74 ~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~-~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~-~ 151 (311) T protein:vir:81 74 TAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALG-RALDLIGIHGINPLTGAALSGSPAKILDTTNIVELT-T 151 (311) T ss_pred EEeeEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHH-HHHHHhhhccccCCCCcccccccccccccceeeeec-c Confidence 99998887776663 33433334445689999999999987 7999999999642111 1111112221111111111 1 Q ss_pred cCCCCCHH-HHHhhhceeeecCCCc-eEEEecchhhhHhhhhhccccccceeeecCCCeEEEe---------------ee Q lcl|Aclame:pro 264 AGKTPFAD-AIEEAVDFVRPTAGRR-YLIVKTEDRKALLDELRQATANANVRIKNDDTEIASE---------------VG 326 (393) Q Consensus 264 ~~~t~~~d-al~Eald~a~~~a~~~-~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~---------------v~ 326 (393) .+ +...+ .+..+++.......++ .+++|+.++.+|+. |++.+|.+.+. |. 
T Consensus 152 ~~-~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~------------lkd~~G~~l~~~~~~~~~~~tl~G~Pv~ 218 (311) T protein:vir:81 152 GT-SATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLAT------------QRDSQGRKLYPELGFGTDVASFAGLNAA 218 (311) T ss_pred cc-cchHHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHHh------------hhccCCCeeecCccccCCCceecceeEE Confidence 11 11222 2333333322222233 38889999999987 77777766542 22 Q ss_pred cccchh-------------hcccchhceeeeeccccee--cccceeeeccee---------EeecCceEEEeeeeccccc Q lcl|Aclame:pro 327 VDEIIV-------------YTGSKAVKPTVLVDQKYHI--DMQDLTKVDAFE---------WKTNSNMILVETLTSGHVE 382 (393) Q Consensus 327 ~~~~~~-------------~tG~k~~~ptv~vD~k~~~--~~~~~~~~~s~~---------~~~ns~~i~~~~~~~g~~~ 382 (393) +.+..- +++.+. .+.++.|-..++ .-++++---+.. |..|+-.+.++.+..+.+- T Consensus 219 ~~~~i~~~~~~~~~~~~~~~~~~~~-~~~~~gDfs~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~ 297 (311) T protein:vir:81 219 VSDTVRGGPEAVTASTGVYRTTNPN-VKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIM 297 (311) T ss_pred ecccccccccccccccchhcccCCc-cEEEEEecccEEEEEeccceEEEeccCCCCcchhhhhcCcEEEEEEEEeccEee Confidence 211110 000000 011223322211 111111100111 3333334445577888888 Q ss_pred ccCcceeeeeC Q lcl|Aclame:pro 383 TYNAGAVITVS 393 (393) Q Consensus 383 ~~n~~~~~~v~ 393 (393) -|++-++++-+ T Consensus 298 ~~~a~~~l~~a 308 (311) T protein:vir:81 298 STDAFAVVRDA 308 (311) T ss_pred cccceEEEEee Confidence 88887787777 No 103 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=97.77 E-value=6.9e-07 Score=54.35 Aligned_cols=316 Identities=13% Similarity=0.071 Sum_probs=144.4 Q ss_pred HhhhhhhhhhhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcchhHHHHHHHHHHHHHHHHHHHHHccCC--hh Q lcl|Aclame:pro 24 KSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMTNFIESQNAVTEFFDVLKKNSGK--SE 101 (393) Q Consensus 24 
~s~i~~~~v~~a~~~~skieelektis~l~aEi~k~enel~~~~Ek~K~k~emtEfLkTkqA~~dya~ll~~nqg~--ke 101 (393) -...+++.+++.. ..+.-+.+ -.+++..+..+..|.+. ++...|. .. T Consensus 1 ~a~~~a~~~~~~~--------------~~~~~~~~-------~~~~~~kg~~~~~~~~a----------~a~~~g~~~~a 49 (366) T protein:vir:57 1 MAAAVAVPVKAHS--------------VAPGIIIK-------EELQQYKGAGMTRMVMS----------IAAGKGNLADA 49 (366) T ss_pred Ccccccccccccc--------------cccccccc-------cccccccchhHHHHHHH----------HHhcccchhHH Confidence 1111111111100 00000000 00111122233333321 2222222 01 Q ss_pred H---HHHHHHHHHhCccchhh--hHhhcchhHHHHHHHHHHhhCccccceeeecc--cceeEEEeecc--ccccceeccc Q lcl|Aclame:pro 102 I---KNAWNAKLAENGVTITD--TTFQLPRKLVESINTALLNTNPVFKVFHVTNV--GALLVSRSFDS--SNEAQVHKDG 172 (393) Q Consensus 102 ~---k~AW~a~L~ekgV~~qd--~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~--~~~a~~i~l~n--a~~a~GHk~g 172 (393) . ...+.+.-..+.+.++. --.++|..+..-|-+.++++.. +..+....+ ..+.+.+--.+ ...+| ..-| T Consensus 50 ~~~a~~~~~~~~~~~a~~~~~~~Gg~lvP~~~~~~ii~~l~~~s~-l~~lg~~~v~~~~g~~~~p~~t~~~~a~w-v~E~ 127 (366) T protein:vir:57 50 AKFAATELGDTGLSMAISTAAGSGGALIPQNMQNEVIELLRDRTV-VRILGARSIPLPNGNLSMPRLSGGATAGY-VGEG 127 (366) T ss_pred HHHHHHhhcchhhhhhccccccCCccccchhHHHHHHHHHhhhcc-hhhhceeeeecCCCceEEEEEeCCcceee-eccC Confidence 1 11111111112222111 1125798887777778886544 444432222 23323332222 23344 3567 Q ss_pred chhhhhhhhhhhhhccHHHHHHHHHHHHH-HHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhh Q lcl|Aclame:pro 173 QTKTEQAATLTIDTLEPVMVYKLQSLAER-VKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADV 251 (393) Q Consensus 173 a~Kk~q~~~le~~ti~p~~VYkkq~Lad~-~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~ 251 (393) +++.+...+|...++.|..++....+.+. +.+.. -++.+|+.++|+..+- |..+.+++.|||+.. +..-+.... 
T Consensus 128 ~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~---~~~~~~i~~~l~~a~~-~~~d~a~l~G~G~~~-~p~Gi~~~~ 202 (366) T protein:vir:57 128 KDVVATGATFDDVKLSAKTMIALVPVSNQLIGRAG---FNVEQLLLGDILSAIA-TREDKAFLRDDGTGD-TPKGMKAVA 202 (366) T ss_pred ccccccccceeEEEEeeEEEEEeehhhHHHHhhhh---HHHHHHHHHHHHHHHH-HHHHHHhhccCCCCc-cccceeecc Confidence 78888999999999999888877777433 33332 3578999999999999 799999999999732 100011110 Q ss_pred hhhhhh--hhhhhccCCCCCHHHHHhhhc----eeeecCCCceEEEecchhhhHhhhhhccccccceeeecCCCeEEEee Q lcl|Aclame:pro 252 KKIKKI--TTKAKSAGKTPFADAIEEAVD----FVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASEV 325 (393) Q Consensus 252 ~~ik~i--t~~at~~~~t~~~dal~Eald----~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~v 325 (393) . +... +.+++.. .....+++...++ -+....+....++++.++.+|+. |++.+|.+.++- T Consensus 203 ~-~~~~~~~~~~t~~-~~~~~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~------------lkd~~G~~l~~~ 268 (366) T protein:vir:57 203 T-AANRLVAWTGTAI-NLTTIDEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLFG------------LRDGNGNKVYPE 268 (366) T ss_pred c-cccceeecccccc-chhhHHHHHHHHHHhhhccccccccCEEEecHHHHHHHHh------------hhccCCceeccC Confidence 0 0000 1111111 1112333444442 22334456667888888888776 566666655432 Q ss_pred ecccchhhcccchh---------------ceeeeeccccee--cccceeeecc-------------eeEeecCceEEEee Q lcl|Aclame:pro 326 GVDEIIVYTGSKAV---------------KPTVLVDQKYHI--DMQDLTKVDA-------------FEWKTNSNMILVET 375 (393) Q Consensus 326 ~~~~~~~~tG~k~~---------------~ptv~vD~k~~~--~~~~~~~~~s-------------~~~~~ns~~i~~~~ 375 (393) ..+.+.. |.-.+ .+..+.|-..++ +-.|++---+ .-|..|+-.|.++. 
T Consensus 269 ~~~g~l~--G~Pvv~s~~ip~~~~~~~~~~~i~~gdfs~~~i~~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~ 346 (366) T protein:vir:57 269 MSQGILK--GYPIQRTSAIPANLGDDGNESEIYFCDFNDVVIGEDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVVT 346 (366) T ss_pred CCCCeec--ceeeEEccccccccccCCCccEEEEEecceEEEEEecceEEEEeeccccccccccchhhhhcCceeEEeee Confidence 1111111 21111 122233433322 1112111000 12445666677777 Q ss_pred eecccccccCcceeeeeC Q lcl|Aclame:pro 376 LTSGHVETYNAGAVITVS 393 (393) Q Consensus 376 ~~~g~~~~~n~~~~~~v~ 393 (393) +..+.+.-|.+=++++=. T Consensus 347 ~~d~~v~~~~a~~~lt~~ 364 (366) T protein:vir:57 347 EHDIGFRHPEGLVLGTGV 364 (366) T ss_pred eeCcEeeccccEEEEecc Confidence 788888777665555555 No 104 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=97.44 E-value=4.3e-06 Score=50.02 Aligned_cols=274 Identities=14% Similarity=0.083 Sum_probs=138.7 Q ss_pred HHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEEEeecc-ccccceecccchhhhhhhhhhhhhcc Q lcl|Aclame:pro 110 LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-SNEAQVHKDGQTKTEQAATLTIDTLE 188 (393) Q Consensus 110 L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~ga~Kk~q~~~le~~ti~ 188 (393) +++. . +.+--..+|.-+..-|=..++++.++.+..++-..+.....+--.+ ...+.=+--|.+++.+..+|...++. 
T Consensus 1 Ma~~-~-~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~ 78 (315) T protein:vir:80 1 MADD-F-LSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQ 78 (315) T ss_pred CCCC-c-CCcCceEcchHHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeCCccccccccceeeeEee Confidence 2111 0 1111226888877777777877777776555555444434433333 23444455677889999999999999 Q ss_pred HHHHHHHHHH-HHHHHhhcCc-hhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCC Q lcl|Aclame:pro 189 PVMVYKLQSL-AERVKRLQMS-YSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGK 266 (393) Q Consensus 189 p~~VYkkq~L-ad~~k~l~g~-ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~ 266 (393) |..++....+ .++..+.... .+.|-+|+.++|+.++- |+++.|+.+|+|...-....-.... +.++++.... T Consensus 79 ~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~-~~~d~a~~~G~~~~~~~~~~~~~~~-----~~~~~~~~~~ 152 (315) T protein:vir:80 79 PIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIG-RAVDLIAFHGIDPATGKAASAVHTS-----LNKTKNIVDA 152 (315) T ss_pred eeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHH-HHHhhheeeccCCCCCccccccccc-----cccccceeec Confidence 9877776666 3344443332 22356889999999988 7999999999764221111111221 1111111111 Q ss_pred CCC-HHHHHhh---hceeeecCCCceEEEecchhhhHhhhhhccc-ccccee-ee----c--CCCeEEEeeecccchhh- Q lcl|Aclame:pro 267 TPF-ADAIEEA---VDFVRPTAGRRYLIVKTEDRKALLDELRQAT-ANANVR-IK----N--DDTEIASEVGVDEIIVY- 333 (393) Q Consensus 267 t~~-~dal~Ea---ld~a~~~a~~~~l~i~~~d~~a~~~~~~~~~-~~a~~~-l~----~--~~~~~~~~v~~~~~~~~- 333 (393) +.. .+++..+ +.-+.... ....+.|+..+.+|+- |+... ...|.. +- . .+.=+-.||-+++..-. 
T Consensus 153 ~~~~~~d~~~~~~~~~~~~~~~-~~~~imn~~~~~~L~~-l~~~~g~~~~g~~~~~~~~~g~~~tl~G~PV~~~~~~~~~ 230 (315) T protein:vir:80 153 TDSATADLVKAVGLIAGAGLQV-PNGVALDPAFSFALST-EVYPKGSPLAGQPMYPAAGFAGLDNWRGLNVGASSTVSGA 230 (315) T ss_pred cccchHHHHHHHHHHhhccCcc-ceEEEEcHHHHHHHHH-HhhccCCcccccccccccccCCCceecceeeEecCcCCcc Confidence 111 1233333 32222222 2347788888888865 22111 111111 00 0 01122233333221100 Q ss_pred --cccchhceeeeecccce-e--------cccceeeecc---eeEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 334 --TGSKAVKPTVLVDQKYH-I--------DMQDLTKVDA---FEWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 334 --tG~k~~~ptv~vD~k~~-~--------~~~~~~~~~s---~~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) ++..-..|.++-|-+.+ + .++...+-+. --|.+|+-.+.++.++.|.|.-|++=++++.+ T Consensus 231 ~~~~~~~~~~~~~GDfs~~~~g~~~~~~i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~ 304 (315) T protein:vir:80 231 PEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEK 304 (315) T ss_pred cccccccccEEEEeecccEEEEEecCeeEEEeccccccCcccchhhcCcEEEEEEEEecceeecccceEEEeec Confidence 00001112233343321 1 1111111010 01667777888889999999999998888866 No 105 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=97.10 E-value=0.00013 Score=41.85 Aligned_cols=357 Identities=14% Similarity=0.092 Sum_probs=134.5 Q ss_pred CCcchh-hHHHHHHHHHHHhhHHHHhhhhhhh-hhhHHhhhhHHHHHHHHHH-HHHHH----HHHH-HHHHH---hhh-- Q lcl|Aclame:pro 1 MNKPDL-IEKQNRLAELKENNVSLKSQISGFE-VKNAIEDLPKVQELEKTLS-ENSIE----IIKI-ENELN---AQE-- 67 (393) Q Consensus 1 ~~k~d~-~ekq~eLa~lK~~~~~~~s~i~~~~-v~~a~~~~skieelektis-~l~aE----i~k~-enel~---~~~-- 67 (393) -+..+. .+.+.+.+++-.- ..+.+.-+ ..+++..--.+++.+..+- .++.+ ..+. ...+. .+. 
T Consensus 219 a~~~~~~~~E~~r~~eI~~l----~~~~~~~~~~~~ai~~g~sld~~ra~~ld~l~~~~~a~~~~~~a~~~~~~~~~~~~ 294 (632) T protein:vir:96 219 ANENDILSRERTRISEITAI----GQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSA 294 (632) T ss_pred hhhhhhhhhhHHHHHHHHHH----HHHhhhhhhHHHHHhccccHHHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhhhhh Confidence 111111 1122233333211 11111000 1111111111222222111 11110 0000 00000 000 Q ss_pred ------hhhcchhHHHHHHHHH------HH--HHHHHHHHHHccCCh--hHHHHHHHHHHhC-ccchhhhH--hhcc--- Q lcl|Aclame:pro 68 ------EKPKGKDKMTNFIESQ------NA--VTEFFDVLKKNSGKS--EIKNAWNAKLAEN-GVTITDTT--FQLP--- 125 (393) Q Consensus 68 ------Ek~K~k~emtEfLkTk------qA--~~dya~ll~~nqg~k--e~k~AW~a~L~ek-gV~~qd~~--eiLP--- 125 (393) ++.-....|...++.. .+ ...++.-++...|.. .+.... +.|..+ ...+++.. .++| T Consensus 295 ~~i~~~~re~~~~~l~rai~a~a~~~~~~a~~~~e~a~~~a~~~G~~arg~~~~~-~~l~~ra~~~~t~~~gg~lvp~~~ 373 (632) T protein:vir:96 295 RDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPH-EVLVQRQLEKKTAGKGGELVATEL 373 (632) T ss_pred hhhhhhHHHHHHHHHHHHHHhhhccchhhhhhhhHHHHHHHHhhhhhhhhhhhhH-HHHHHhhhhccccccccccccccc Confidence 0000001111111100 00 001111122222221 010111 122222 12222221 1344 Q ss_pred --hhHHHHHHHHHHhhCccccceeeecccc--eeEEEeecc--ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHH Q lcl|Aclame:pro 126 --RKLVESINTALLNTNPVFKVFHVTNVGA--LLVSRSFDS--SNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLA 199 (393) Q Consensus 126 --~~ii~AIe~A~ed~d~vl~~fhV~n~~~--~a~~i~l~n--a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~La 199 (393) .++|+.+ ++ ..++..+.+...|. +.+.+--.+ ...+| ..-|.+++....+|...++.|..++....+. 
T Consensus 374 ~~~~iie~l----r~-~s~i~~l~~~~~~~~~g~~~ip~~~~~~~a~w-v~E~~~~~~s~~~f~~i~l~~~k~~~~v~iS 447 (632) T protein:vir:96 374 LSEEFIDIL----RN-KAIIGQMGARMLPGLVGDVDIPKKTSGANFYW-IGEDEDVQDSDFDFTTLSFSPKTIAGAVPVT 447 (632) T ss_pred chHHHHHHH----hh-cchhhhhcceEeecCCcceEEEEEeCCceeEe-ecCCccccccccceeeEEeeeeEEEEehhhH Confidence 4455444 43 44555454333332 222222222 23333 3456678889999999999999888877774 Q ss_pred HHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHH---hh Q lcl|Aclame:pro 200 ERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIE---EA 276 (393) Q Consensus 200 d~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~---Ea 276 (393) ..+-+-.+ -++.+||.++|+.++- +.++.+++.|||+... +.-+--...+..++. .+.+...+.+. .+ T Consensus 448 ~ell~ds~--~~~~~~i~~~l~~a~~-~~~d~a~l~G~G~~~~--p~Gi~~~~~~~~~~~----~~~~~~~~~i~~~~~~ 518 (632) T protein:vir:96 448 RKLRKQSS--IHVENLIREDLIEGIG-VALDLAMLTGTGLAND--PVGLLNMTGVPALTY----PAGGVDWASVVDMETK 518 (632) T ss_pred HHHHhccc--hHHHHHHHHHHHHHHH-HHHHHHhhcccCCCCc--cceeeecccccceec----ccccCCHHHHHHHHHH Confidence 44332222 3679999999999999 7999999999997431 100100000111111 11222233333 33 Q ss_pred hceeeecCCCceEEEecchhhhHhhhhhccccccceeeecCCCeEEE--------eeecccchhhcccchhceeeeeccc Q lcl|Aclame:pro 277 VDFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIAS--------EVGVDEIIVYTGSKAVKPTVLVDQK 348 (393) Q Consensus 277 ld~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~--------~v~~~~~~~~tG~k~~~ptv~vD~k 348 (393) +.-+....++-..+.+...+.+++- .+|++++|.+.. ||.+++..- .. -.++.|-. 
T Consensus 519 i~~~~~~~~~~~~~~~~~~~~~l~~----------~~l~d~~G~~i~~~~~l~G~pv~~s~~ip--~~----~~~~gd~s 582 (632) T protein:vir:96 519 ISTFNADAGRLAYLTSVTQRGAAKK----------AQVFDNTGERIWQNNEVNGYRAEASNQIP--AD----TWIFGDWS 582 (632) T ss_pred HhhcccccCccEEEEchhHHHHHHH----------HhccCCCCceeecCCeecccceEeccccc--cC----cEEEeecc Confidence 3323223333344554444444432 225555565543 222222211 00 12344543 Q ss_pred cee--cccceeeecceeEeecCceEEE--eeeecccccccCcceeeeeC Q lcl|Aclame:pro 349 YHI--DMQDLTKVDAFEWKTNSNMILV--ETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 349 ~~~--~~~~~~~~~s~~~~~ns~~i~~--~~~~~g~~~~~n~~~~~~v~ 393 (393) .++ +.+|+.-.-+..-.+.+|.+.+ .....+-+..+.+=++...+ T Consensus 583 ~~~i~~~~~~~i~~~~~~~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~ 631 (632) T protein:vir:96 583 QIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKG 631 (632) T ss_pred eEEEEEecceEEEEccccccccCceEEEEEeecCceeechhhhhheeec Confidence 332 2233332222222334444444 44455555555555555555 No 106 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=97.09 E-value=3.3e-05 Score=45.18 Aligned_cols=281 Identities=16% Similarity=0.103 Sum_probs=127.9 Q ss_pred cCChhHHHHHHHHHHhCccchhhhH--hhcchhHHHHHHHHHHhhCccccceeeecccceeEEEeeccccccceec---c Q lcl|Aclame:pro 97 SGKSEIKNAWNAKLAENGVTITDTT--FQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDSSNEAQVHK---D 171 (393) Q Consensus 97 qg~ke~k~AW~a~L~ekgV~~qd~~--eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~na~~a~GHk---~ 171 (393) =+.+.|.+-=.+.++.+++...|+. +.+|.++-..|-.++.+..++|+.+.|.....+.-.+..-+.....+.. . 
T Consensus 1 ~~~k~~~~~l~~~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~~~~~~~~~~~~~e~ 80 (321) T protein:vir:31 1 MASRTINNDLSRITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKTRIPTLNIGERHRRPQDEG 80 (321) T ss_pred CchHHHHHHHHHHHHhccccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcceeeeeeccCCccccccccc Confidence 1123344432333444455444444 3789888777777777788999988866655443333221111111111 1 Q ss_pred cchhhhhhhhhhhhhccHHHHHHHHHHH-HHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhh Q lcl|Aclame:pro 172 GQTKTEQAATLTIDTLEPVMVYKLQSLA-ERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEAD 250 (393) Q Consensus 172 ga~Kk~q~~~le~~ti~p~~VYkkq~La-d~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D 250 (393) +........+|..+.+....++-.-.+. +...+ +....++-+|+++.+++.|= +.++++.++|||.....-. ..-+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d-~a~~~d~e~~i~~~ia~~~a-~~~~~~~~nGd~~~~~~~~-~~n~ 157 (321) T protein:vir:31 81 EWNENESDVSTGTIDISTEKATVAWDLPREVVQE-NPEGEALADRILNLMTDAWS-ADVEDLAANGDEDAEDSFE-NQND 157 (321) T ss_pred ccccccccceeeeeeeeeEEEEeehhccHHHHHh-hhcchhHHHHHHHHHHHHHH-HHHHhheeeccccCCCccc-ccch Confidence 1122233445666655554443332221 22222 21124689999999999998 7999999999997332100 1111 Q ss_pred hh--hhhhhhhhhhccCCCCCHHHHHhhh---ceeeecCCCceEEEecchhhhHhhhhhccccccceeeecCCCeEEEee Q lcl|Aclame:pro 251 VK--KIKKITTKAKSAGKTPFADAIEEAV---DFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTEIASEV 325 (393) Q Consensus 251 ~~--~ik~it~~at~~~~t~~~dal~Eal---d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~~~~~~~~~~v 325 (393) =| .++.-..+.+..+.+...|.+...+ +-+--..++-..++++..+.++++.|. +.++.+.-+. 
T Consensus 158 G~l~~a~~~~~~~~~~~~~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~-----------~~~~~~~~~~ 226 (321) T protein:vir:31 158 GFITVAEGDVETIDAADDILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLT-----------DRDTPLGDNV 226 (321) T ss_pred hhhhhhccccccccccccccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHHh-----------cCCCccccch Confidence 11 1111111111122333445554444 222112234466787777777877543 3333332222 Q ss_pred ec--ccchhhcccchhceeeeeccccee--ccccee----eecceeEeecC-----ceEEEeeeecc----cccccCcce Q lcl|Aclame:pro 326 GV--DEIIVYTGSKAVKPTVLVDQKYHI--DMQDLT----KVDAFEWKTNS-----NMILVETLTSG----HVETYNAGA 388 (393) Q Consensus 326 ~~--~~~~~~tG~k~~~ptv~vD~k~~~--~~~~~~----~~~s~~~~~ns-----~~i~~~~~~~g----~~~~~n~~~ 388 (393) .. ...++ |+.-++.+-.+.....+ +++.|. ...++....+. .-+-++.+.++ -|+-|.+.+ T Consensus 227 l~~~~~~tl--~G~pvv~~~~mP~~~il~t~~~nl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~~~a~a 304 (321) T protein:vir:31 227 IMGEADVNP--FSFPIIGSGLWPDDKAMFTDPQNLIYALYRDLEIDVLTESDKVSERDLHARYFMRGDDDFAIENTEAVV 304 (321) T ss_pred hhccccccc--cceeEEEcCCCCCCcEEEeccccEEEEEeeccEEEEeecCccccccceeeEeeeeeecceeEeccccEE Confidence 22 22233 33333322111111111 332221 11122222111 12344444433 346666766 Q ss_pred eee-eC Q lcl|Aclame:pro 389 VIT-VS 393 (393) Q Consensus 389 ~~~-v~ 393 (393) +++ +- T Consensus 305 ~~~~i~ 310 (321) T protein:vir:31 305 LAEGLG 310 (321) T ss_pred EEecCC Confidence 665 22 No 107 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=95.67 E-value=0.0008 Score=37.57 Aligned_cols=288 Identities=10% Similarity=0.036 Sum_probs=125.1 Q ss_pred ChhHHHHHHHHHHhCccchhhhHh--hcchhHHHHHHHHHHhhCccccceeeec-ccceeEEE---eeccc--cccceec Q lcl|Aclame:pro 99 KSEIKNAWNAKLAENGVTITDTTF--QLPRKLVESINTALLNTNPVFKVFHVTN-VGALLVSR---SFDSS--NEAQVHK 170 (393) Q Consensus 99 
~ke~k~AW~a~L~ekgV~~qd~~e--iLP~~ii~AIe~A~ed~d~vl~~fhV~n-~~~~a~~i---~l~na--~~a~GHk 170 (393) -.++++.-+ ..++++..|.-- +.|... ..+-+.+.++.++++..+|.+ .......+ +.... .....-. T Consensus 1 ~~~~~~~~~---~~k~it~~d~~gG~L~P~~~-~~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~~~~~~~ 76 (314) T protein:vir:41 1 MDFLNKPFQ---ITPKIDVPDLGKGILAVQRF-GEFVREVRENSAIIKDARVLNALKSYEVDISRISLGVELEPGRNTSG 76 (314) T ss_pred CchhhhHHH---hhcccccccCCCceeChHHH-HHHHHHHHhccchhhheeeecccCccceeecccccCccccccccccc Confidence 112222222 122333333322 456543 445566777899999777543 22221111 11110 1111111 Q ss_pred ccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccc-hhhh Q lcl|Aclame:pro 171 DGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSI-DKEA 249 (393) Q Consensus 171 ~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~-~~e~ 249 (393) .+.+-+++..+|-...|....+.-.-++.+-.-+-+....++=+|+++.+++.|= |.++.+.++|||...+..+ ...- T Consensus 77 ~~~~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g-~~~~~~~~nGdg~~~s~~~~~~~p 155 (314) T protein:vir:41 77 TKVAPTADEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVT-YDLECFFLHADSSLTTGRELYRIN 155 (314) T ss_pred CCccCCcccccccceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHH-HHHHHHhhccccCCcCcccchhcc Confidence 2223455666777777766444433333222222222113678999999999988 7999999999996432211 0111 Q ss_pred hhhhhhh--hhhhhhccCCCCCHHHHHhhh-ceeee----cCCCceEEEecchhhhHhhhhhccccccceee--ecCCCe Q lcl|Aclame:pro 250 DVKKIKK--ITTKAKSAGKTPFADAIEEAV-DFVRP----TAGRRYLIVKTEDRKALLDELRQATANANVRI--KNDDTE 320 (393) Q Consensus 250 D~~~ik~--it~~at~~~~t~~~dal~Eal-d~a~~----~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l--~~~~~~ 320 (393) |=| +++ ..++..+.....+.+++.-++ .-..+ .+++...+++++.+.++|..|..--...|-.. ...+.. 
T Consensus 156 ~G~-l~~a~~~~~~~~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~~~~l~~~~~~~~~~~~ 234 (314) T protein:vir:41 156 DGW-MKLAGNQYTDAEPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVRETGLGDSALIGATGLQ 234 (314) T ss_pred hhh-hhhcccceeecCccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHhccCCcccchhhhCCCCce Confidence 111 111 111111222233445444333 22222 23455788899999999886542222222111 111111 Q ss_pred E-EEeeecccchhhcccchhceeeeeccccee--cccceeeecceeEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 321 I-ASEVGVDEIIVYTGSKAVKPTVLVDQKYHI--DMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 321 ~-~~~v~~~~~~~~tG~k~~~ptv~vD~k~~~--~~~~~~~~~s~~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) + ..||...+--=..|+. -.|+.+.|-+.++ ....+.-...+-.+..+=-+..+..+-..++-+++++...+- T Consensus 235 l~G~PV~~~~~~~~~~~~-~~~i~fgd~~nlv~~~~~~ir~~~~~~a~~~~~~~~~~~r~d~~~~~~~aa~~~~~~ 309 (314) T protein:vir:41 235 YDGIPIQYVPALDALGDD-KARALLTVPTNLVYGFWRNIRIEPKRDAAMRRTEYIASLRADCNYEDENAAVAAVID 309 (314) T ss_pred ecceeeEecccccccCCC-CceEEEechhheEEEeeceeEEeecccCcCCeEEEEEEEEeceEEEEcCcEEEEEee Confidence 1 2222111100000111 1344555554433 111121111111111122233344455566667777666665 No 108 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=95.55 E-value=0.00071 Score=37.84 Aligned_cols=247 Identities=13% Similarity=0.084 Sum_probs=124.7 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCcccccee-eecc-c--ce-eEEEee-ccccccc Q lcl|Aclame:pro 94 KKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFH-VTNV-G--AL-LVSRSF-DSSNEAQ 167 (393) Q Consensus 94 ~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fh-V~n~-~--~~-a~~i~l-~na~~a~ 167 (393) |.+..+ +.-.-+.|+-+-..|...+.+. .+|..+- +.+. + .+ .+.+-. .....+. 
T Consensus 1 MA~~~T------------------~~~~~~iPev~s~~v~~~~~~~-~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~ 61 (272) T protein:vir:98 1 MAVGTT------------------KMAQMLDPEVLADMIDAEVGKA-IRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAE 61 (272) T ss_pred CCCccc------------------cchheechHHHHHHHHHHHHHH-hhhhccccccccccCCCCCEEEEEEecCCCCcc Confidence 332111 0001133433333333333321 1222111 1110 0 01 111111 1111222 Q ss_pred eecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchh Q lcl|Aclame:pro 168 VHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDK 247 (393) Q Consensus 168 GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~ 247 (393) --.-|.+.+.+..++....+.+..+++.-++-|......+ +++++++.+.++.++- |.++.++...- T Consensus 62 ~v~eg~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~s~--~d~~~~~~~~~~~~~a-~~~d~~i~~~~---------- 128 (272) T protein:vir:98 62 DVAEGEAIPMTQLGFKKTTMTIKKAGKGVEITDEAILSGY--GDPVGQAAKQIVEAID-HKVDADVLDAL---------- 128 (272) T ss_pred cccCCCcccccccccceEEEEeeeeeeeeeecHHHHhhcc--ccHHHHHHHHHHHHHH-HHHHHHHHHHh---------- Confidence 2334667788888999999999887776666666665555 5689999999999987 67776654321 Q ss_pred hhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhh-hhc---cccccceeeecCCCeEE Q lcl|Aclame:pro 248 EADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDE-LRQ---ATANANVRIKNDDTEIA 322 (393) Q Consensus 248 e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~-~~~---~~~~a~~~l~~~~~~~~ 322 (393) .......+.+.+.|++..++ -|.......+++|+|+.+...|+.- +-. 
++.-.+- T Consensus 129 ----------~~a~~~~~~~~t~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~---------- 188 (272) T protein:vir:98 129 ----------SKSTQTVEATATVDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGAN---------- 188 (272) T ss_pred ----------cccccccccccCHHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhcccccccccccccc---------- Confidence 11112223344567777776 3444456678999999998888641 111 1110000 Q ss_pred Eeeecccchhhcccchhc----e---eeeecccc--eecccceeeecceeEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 323 SEVGVDEIIVYTGSKAVK----P---TVLVDQKY--HIDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 323 ~~v~~~~~~~~tG~k~~~----p---tv~vD~k~--~~~~~~~~~~~s~~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) .+.-|.+-=+-|...++ | +++++... +....+++.-..|....-+..|-++.+...|+.-|.+.+.+++. T Consensus 189 -~~~~g~ig~i~G~~Vi~s~~~p~~t~~~~~~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~ 267 (272) T protein:vir:98 189 -RVVSGVYGEVLGVQIVRSRKCPKGTAYMVRKGALRIMLKRNTMVETDRDITKAINQIVANKHYGVYLYKAEKAVKITLK 267 (272) T ss_pred -ccccccchhhcCeeEEEcCCCCcceEEEEcCCeEEEEecCCceeeeccccccceeEEEEEEEEEEEEEcCCceEEEEec Confidence 01111110011433222 1 12222211 01233444444455556678888888888999999999999998 No 109 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=95.55 E-value=0.00071 Score=37.84 Aligned_cols=247 Identities=13% Similarity=0.084 Sum_probs=124.7 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCcccccee-eecc-c--ce-eEEEee-ccccccc Q lcl|Aclame:pro 94 KKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFH-VTNV-G--AL-LVSRSF-DSSNEAQ 167 (393) Q Consensus 94 ~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fh-V~n~-~--~~-a~~i~l-~na~~a~ 167 (393) |.+..+ +.-.-+.|+-+-..|...+.+. .+|..+- +.+. + .+ .+.+-. .....+. 
T Consensus 1 MA~~~T------------------~~~~~~iPev~s~~v~~~~~~~-~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~ 61 (272) T protein:vir:30 1 MAVGTT------------------KMAQMLDPEVLADMIDAEVGKA-IRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAE 61 (272) T ss_pred CCCccc------------------cchheechHHHHHHHHHHHHHH-hhhhccccccccccCCCCCEEEEEEecCCCCcc Confidence 332111 0001133433333333333321 1222111 1110 0 01 111111 1111222 Q ss_pred eecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchh Q lcl|Aclame:pro 168 VHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDK 247 (393) Q Consensus 168 GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~ 247 (393) --.-|.+.+.+..++....+.+..+++.-++-|......+ +++++++.+.++.++- |.++.++...- T Consensus 62 ~v~eg~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~s~--~d~~~~~~~~~~~~~a-~~~d~~i~~~~---------- 128 (272) T protein:vir:30 62 DVAEGEAIPMTQLGFKKTTMTIKKAGKGVEITDEAILSGY--GDPVGQAAKQIVEAID-HKVDADVLDAL---------- 128 (272) T ss_pred cccCCCcccccccccceEEEEeeeeeeeeeecHHHHhhcc--ccHHHHHHHHHHHHHH-HHHHHHHHHHh---------- Confidence 2334667788888999999999887776666666665555 5689999999999987 67776654321 Q ss_pred hhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhh-hhc---cccccceeeecCCCeEE Q lcl|Aclame:pro 248 EADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDE-LRQ---ATANANVRIKNDDTEIA 322 (393) Q Consensus 248 e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~-~~~---~~~~a~~~l~~~~~~~~ 322 (393) .......+.+.+.|++..++ -|.......+++|+|+.+...|+.- +-. 
++.-.+- T Consensus 129 ----------~~a~~~~~~~~t~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~---------- 188 (272) T protein:vir:30 129 ----------SKSTQTVEATATVDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGAN---------- 188 (272) T ss_pred ----------cccccccccccCHHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhcccccccccccccc---------- Confidence 11112223344567777776 3444456678999999998888641 111 1110000 Q ss_pred Eeeecccchhhcccchhc----e---eeeecccc--eecccceeeecceeEeecCceEEEeeeecccccccCcceeeeeC Q lcl|Aclame:pro 323 SEVGVDEIIVYTGSKAVK----P---TVLVDQKY--HIDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) Q Consensus 323 ~~v~~~~~~~~tG~k~~~----p---tv~vD~k~--~~~~~~~~~~~s~~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v~ 393 (393) .+.-|.+-=+-|...++ | +++++... +....+++.-..|....-+..|-++.+...|+.-|.+.+.+++. T Consensus 189 -~~~~g~ig~i~G~~Vi~s~~~p~~t~~~~~~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~ 267 (272) T protein:vir:30 189 -RVVSGVYGEVLGVQIVRSRKCPKGTAYMVRKGALRIMLKRNTMVETDRDITKAINQIVANKHYGVYLYKAEKAVKITLK 267 (272) T ss_pred -ccccccchhhcCeeEEEcCCCCcceEEEEcCCeEEEEecCCceeeeccccccceeEEEEEEEEEEEEEcCCceEEEEec Confidence 01111110011433222 1 12222211 01233444444455556678888888888999999999999998 No 110 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=94.79 E-value=0.00032 Score=39.71 Aligned_cols=272 Identities=14% Similarity=0.068 Sum_probs=122.3 Q ss_pred HH----HHccCChhHHHHHHHHHHhCccchhhhHh-hcchhHHHHHHHHHHhhCccccceeeecc---cceeE-EEeecc Q lcl|Aclame:pro 92 VL----KKNSGKSEIKNAWNAKLAENGVTITDTTF-QLPRKLVESINTALLNTNPVFKVFHVTNV---GALLV-SRSFDS 162 (393) Q Consensus 92 ll----~~nqg~ke~k~AW~a~L~ekgV~~qd~~e-iLP~~ii~AIe~A~ed~d~vl~~fhV~n~---~~~a~-~i~l~n 162 (393) +| +.++.- ...++.-++ .|..- .|+......+-+.+.++.++++..+|-+. ...-+ .++... 
T Consensus 1 ~~~~~~~~~~~~-------~~~~k~~t~--~d~~Gg~l~P~~~~~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~ 71 (315) T protein:vir:41 1 MLTIEDIRGGKP-------FEIVPKIDV--PDLGRGVLSVDRFGEFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVL 71 (315) T ss_pred CcccchhhcCCh-------hhhhhhcCC--cCCCCceechHHHHHHHHHHHhhhhhhhhceeeeccccccccccccccCc Confidence 22 111111 112222233 33322 45555566676777778899987775321 11110 111111 Q ss_pred --ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCC Q lcl|Aclame:pro 163 --SNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTN 240 (393) Q Consensus 163 --a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~ 240 (393) ..++..-..+.+.+++..+|....+.+..++-+-.+.+-.-+-+...-++-+|++.++++.|= |..+.+.++|||.. T Consensus 72 ~~~~g~~~~~~~~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a-~~~~~~~~nGdg~s 150 (315) T protein:vir:41 72 DVGPGRDETGQKLAPPESTAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGIS-YVLEKYYLHGDTSS 150 (315) T ss_pred ccccccccccCcCCCCCCccccceeeeceeeeeeeccccHHHHHhhhccccHHHHHHHHHHHHHH-HHHHHHhhccCCcC Confidence 112222223335566778888888887776655333222222122113678999999999998 79999999999963 Q ss_pred ccccchhhhhhhhhhhhhhh--hhcc--CCCCC-HH---HHHhhh-ceeeecCCCceEEEecchhhhHhhhhhccccccc Q lcl|Aclame:pro 241 GFKSIDKEADVKKIKKITTK--AKSA--GKTPF-AD---AIEEAV-DFVRPTAGRRYLIVKTEDRKALLDELRQATANAN 311 (393) Q Consensus 241 ~t~~~~~e~D~~~ik~it~~--at~~--~~t~~-~d---al~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~ 311 (393) ... ....-|=| ++..+.. ++.. ..+.+ .| +|..+| .-.+-.+.+-..++++..+.++|. 
T Consensus 151 ~~p-~~~~~~G~-l~~a~~~~~~~~~~~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rk---------- 218 (315) T protein:vir:41 151 SDP-LLRMSDGW-LKLASEKLTESDVDPEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRD---------- 218 (315) T ss_pred cCc-cccccccc-eecccccccccccccccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHH---------- Confidence 110 00000101 0111111 1111 11122 23 333333 122233445678899999999888 Q ss_pred eeeecCCCeEE--Eeeeccc-chhhcccchhceeeeecc-------ccee---cccceeeecceeE------eecCc--e Q lcl|Aclame:pro 312 VRIKNDDTEIA--SEVGVDE-IIVYTGSKAVKPTVLVDQ-------KYHI---DMQDLTKVDAFEW------KTNSN--M 370 (393) Q Consensus 312 ~~l~~~~~~~~--~~v~~~~-~~~~tG~k~~~ptv~vD~-------k~~~---~~~~~~~~~s~~~------~~ns~--~ 370 (393) +++.+|.+- ..+..|+ .++ -| +|.+.++. +..| |++.|.-...+.+ .++++ - T Consensus 219 --lk~~~g~~lw~~~~~~g~~~tl-~G----~PV~~~~~m~~~~~~~~~ilf~d~~nl~~~~~~~i~i~~~~~a~~~~~~ 291 (315) T protein:vir:41 219 --ALKGRETGLGDQALTGANSILY-DG----RPVQYVPALEALNDGKSRALFVVPTQLVYGFWRNIKVVPDYDAEMRLTK 291 (315) T ss_pred --HhccCCCccccchhhcCCCcee-cc----cceEecccccccCCCCccEEEecccceEEEeccccEEEeeecCCCCceE Confidence 666555432 2222222 222 12 22222221 1111 3333221111111 11222 2 Q ss_pred EEEeeeecccccccCc--ceeeee Q lcl|Aclame:pro 371 ILVETLTSGHVETYNA--GAVITV 392 (393) Q Consensus 371 i~~~~~~~g~~~~~n~--~~~~~v 392 (393) |..+..+-|.+.-.|. 
..+++| T Consensus 292 ~~~~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 292 YVASLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred EEEEEEeceeEEeccceeEeeeeC Confidence 3333445555555665 345555 No 111 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=79.44 E-value=0.1 Score=25.96 Aligned_cols=247 Identities=14% Similarity=0.083 Sum_probs=111.2 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCcccccee-eecc-c--ce-eEEEeeccc-cccc Q lcl|Aclame:pro 94 KKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFH-VTNV-G--AL-LVSRSFDSS-NEAQ 167 (393) Q Consensus 94 ~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fh-V~n~-~--~~-a~~i~l~na-~~a~ 167 (393) |.|.-++ --+-+.|+-+-..+...+++. -+|.++- +.+. + ++ .+.+-.=+. ..++ T Consensus 1 ma~~~T~------------------~~~~iiPev~~~~v~~~~~~~-~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~ 61 (274) T protein:vir:93 1 MPQGITK------------------TSNQIIPEVLAPMMQAQLEKK-LRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQ 61 (274) T ss_pred CCcccee------------------hhheechHHHHHHHHHHHHhh-hhhcccccccccccCCCCCEEEEEeeccCCCcc Confidence 4443321 000023322222222222211 1111111 1110 0 01 011100011 1222 Q ss_pred eecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeec-cCCCccccch Q lcl|Aclame:pro 168 VHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG-DGTNGFKSID 246 (393) Q Consensus 168 GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~g-DG~~~t~~~~ 246 (393) -...|.+-+.+.++....++.....++-.++-|....+.+ ++++..+++.++.++- |-++..+... 
.|.+ T Consensus 62 ~~~eg~~i~~~~it~~~~~~~i~~~~~~~~i~D~~~~~~~--~d~~~~~~~~~~~~~a-~~~d~~~~~~~~~a~------ 132 (274) T protein:vir:93 62 VVAEGEKIPTDILETKKREAKIRKIAKGTSITDEALLSGY--GDPQGEQVRQHGLAHA-NKVDNDVLEALMGAK------ 132 (274) T ss_pred cccCCCcccccccccceeEEEeeeecccccccHHHHHhhc--cchHHHHHHHHHHHHH-HHHHHHHHHHHhccc------ Confidence 3344555566667777777777666655556666655554 6779999999998888 5666555432 1110 Q ss_pred hhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhh----hhccccccceeeecCCCeE Q lcl|Aclame:pro 247 KEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDE----LRQATANANVRIKNDDTEI 321 (393) Q Consensus 247 ~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~----~~~~~~~a~~~l~~~~~~~ 321 (393) .++ .+.+...+++.+|+ -|-......+++|+|..+...|+.. .-.++...+-.+. +|.+ T Consensus 133 ----------~~~----~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~--~G~i 196 (274) T protein:vir:93 133 ----------LTV----NADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIV--KGAF 196 (274) T ss_pred ----------ccc----cccccCHHHHHHHHHHhhhccCCccEEEeCHHHHHHHHhhhhhccccccccccccee--eccc Confidence 000 11122355566555 1222344678999999999998742 1122221111111 1111 Q ss_pred EEeeecccchhhcccchhc-eeeeecccc-----ee---cccceeeecceeEeecCceEEEeeeecccccccCcceeeee Q lcl|Aclame:pro 322 ASEVGVDEIIVYTGSKAVK-PTVLVDQKY-----HI---DMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITV 392 (393) Q Consensus 322 ~~~v~~~~~~~~tG~k~~~-ptv~vD~k~-----~~---~~~~~~~~~s~~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v 392 (393) .- +-|...++ +.+=.++.+ ++ ..+++..-..|..+.-+..|-++.+-...+--+++.++++. 
T Consensus 197 g~---------~~G~~Vi~s~~~p~~t~~l~~~gai~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~t~ 267 (274) T protein:vir:93 197 GE---------ALGAIIVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITK 267 (274) T ss_pred ce---------ecCeeEEEcCCCCcceEEEEeCCeEEEEecCCcccccccchhhcccEEEEEEEEEEEEEcCCceEEEee Confidence 11 11322222 000011111 11 22344444445555566777777777777777777777777 Q ss_pred C Q lcl|Aclame:pro 393 S 393 (393) Q Consensus 393 ~ 393 (393) + T Consensus 268 ~ 268 (274) T protein:vir:93 268 G 268 (274) T ss_pred C Confidence 6 No 112 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=78.55 E-value=0.062 Score=27.22 Aligned_cols=267 Identities=11% Similarity=0.028 Sum_probs=111.5 Q ss_pred HHccCChhHHHHHHHHHHhCccchh-hhHhhcc-hhHHHHHHHHHHhhCcccccee-eecccceeEEEeecc-cccccee Q lcl|Aclame:pro 94 KKNSGKSEIKNAWNAKLAENGVTIT-DTTFQLP-RKLVESINTALLNTNPVFKVFH-VTNVGALLVSRSFDS-SNEAQVH 169 (393) Q Consensus 94 ~~nqg~ke~k~AW~a~L~ekgV~~q-d~~eiLP-~~ii~AIe~A~ed~d~vl~~fh-V~n~~~~a~~i~l~n-a~~a~GH 169 (393) |.+... .=+=+| .+. |.. .|. +.-..-+.++|.. ..+|.-+| +-.+..+.-..-+.- ...+.+| T Consensus 1 Ms~~n~-~t~~~~---------~~s~~~~-al~le~f~geV~taF~~-~si~~~~~~vrti~~GkS~qf~~iG~~~a~y~ 68 (402) T protein:vir:97 1 MSTPNT-LTNVAV---------SASGEVD-SLLIEKFNGKVNEQYLK-GENILSYFDVQTVTGTNTVSNKYLGETELQVL 68 (402) T ss_pred CCCccc-cccccc---------ccccchh-hhhhhhhhhhHHHHHHH-HHhhcCcceeeeecccceEEEEEEeeeEEeee Confidence 332211 111112 111 111 334 4566778889984 77777777 544444332222222 4577899 Q ss_pred cccchh------h-hhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHH-hcceeeccCCCc Q lcl|Aclame:pro 170 KDGQTK------T-EQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIV-DLALVEGDGTNG 241 (393) Q Consensus 170 k~ga~K------k-~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav-~rAvv~gDG~~~ 241 (393) +.|+.= . 
+..++...+-+.+..||.+-..-.|...+.++|+.---|.+.+..+.+|-|.+ ..|...-.+ . T Consensus 69 ~~G~~ldg~~~~~~k~~ItID~lL~a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~--~ 146 (402) T protein:vir:97 69 APGQSPNATPTQADKNQLVIDTTVIARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKA--E 146 (402) T ss_pred ccccccCCCCcccccEEEEeCceeechhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--c Confidence 988821 1 11233344445677888775553444444566777777777777766554422 223221111 1 Q ss_pred cccchhhhhhhhhhhhhhhhhccCCCCCHHHHH-------hhhceeeecCCCceEEEecchhhhHhhhhhccccccceee Q lcl|Aclame:pro 242 FKSIDKEADVKKIKKITTKAKSAGKTPFADAIE-------EAVDFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRI 314 (393) Q Consensus 242 t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~-------Eald~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l 314 (393) ..-++-.+.-. .+.++.+..+...++++|- +.||-.....++|++++...-..+|+.- ..=-|... T Consensus 147 ~~~~~~~~~g~---s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~----~rl~n~d~ 219 (402) T protein:vir:97 147 RNKPRVKGHGF---SINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDA----DRIVDKTY 219 (402) T ss_pred cccCccccccc---ccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhc----ccccchhh Confidence 11112222222 3334444444455777554 5567777777789999988777777752 11011111 Q ss_pred e-cCCCeEEEeeecccchhhcccchhc----eeeeecccce-ecccceeeecceeEeecCceEEEeeeecccccccCcce Q lcl|Aclame:pro 315 K-NDDTEIASEVGVDEIIVYTGSKAVK----PTVLVDQKYH-IDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGA 388 (393) Q Consensus 315 ~-~~~~~~~~~v~~~~~~~~tG~k~~~----ptv~vD~k~~-~~~~~~~~~~s~~~~~ns~~i~~~~~~~g~~~~~n~~~ 388 (393) - ..+|.+. -|.+-..-|.+.|+ |+..-+...| +|-+|- -.+|... ++ ++-+.|-+-.+++-. 
T Consensus 220 ~~~~~g~~~----~G~v~~v~Gv~Vv~SnnlP~~a~~it~~~ls~a~~--G~~y~~t--~d----~t~~~~~~f~~~Av~ 287 (402) T protein:vir:97 220 TISQSGATI----NGFVLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDN--GYRYDPI--AE----MNGAVAVLFTSDALL 287 (402) T ss_pred ccccCCccc----cceeEEEeceEEEecCccccccccccccccccCCC--CccCCcC--cc----cceeEEEEEecceEE Confidence 0 1111110 01010111443333 3322111111 110000 0000000 00 000111111222221 Q ss_pred eeee---C Q lcl|Aclame:pro 389 VITV---S 393 (393) Q Consensus 389 ~~~v---~ 393 (393) ..++ + T Consensus 288 tvk~~~vT 295 (402) T protein:vir:97 288 VGRTIEVT 295 (402) T ss_pred EEEeeccc Confidence 1111 1 No 113 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=75.54 E-value=0.15 Score=25.17 Aligned_cols=248 Identities=14% Similarity=0.079 Sum_probs=116.5 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCcccccee-eeccc---ceeEEEeec-cc-cccc Q lcl|Aclame:pro 94 KKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFH-VTNVG---ALLVSRSFD-SS-NEAQ 167 (393) Q Consensus 94 ~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fh-V~n~~---~~a~~i~l~-na-~~a~ 167 (393) |.+.-+ +--+-|.|+=.=..|...++. ..+|.++= +++.- ++--+--|. +. 
..++ T Consensus 1 ma~~~T------------------~~~d~iiPev~~~~v~~~~~~-~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~ 61 (272) T protein:vir:36 1 MSKQKT------------------TLADLVNPEVLAPIVSYELNK-ALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAA 61 (272) T ss_pred CCCcce------------------ehhhhhchHHHHHHHHHHHHh-hhhhccccccccccccCCCCEEEEeeeccCcccc Confidence 322221 111114454333333333332 23333332 11110 011111111 11 2344 Q ss_pred eecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeec-cCCCccccch Q lcl|Aclame:pro 168 VHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG-DGTNGFKSID 246 (393) Q Consensus 168 GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~g-DG~~~t~~~~ 246 (393) -+..|.+.+.+.++....++.....++-.+.-|....+.+ ++.+..++++++.++- |-++..+... .|. T Consensus 62 ~~~eg~~i~~~~lt~~~~~~~i~~~~k~~~vtD~~~~~~~--~d~~~~~~~~~a~~~a-~~~d~~i~~~l~~~------- 131 (272) T protein:vir:36 62 DVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGY--GDPIGESNKQLGLSLA-NKVDDDLLSAAKTT------- 131 (272) T ss_pred ccCCCCccChhhcCCcceeEeeehhhccccccHHHHhhcc--chHHHHHHHHHHHHHH-HHHHHHHHHHhccc------- Confidence 4666777778888877777777765554555565555544 7889999999998876 5666544322 221 Q ss_pred hhhhhhhhhhhhhhhhccCCCCCHHHHHhhhc-eeeecCCCceEEEecchhhhHhhhhhccccccceee-ecCCCeEEEe Q lcl|Aclame:pro 247 KEADVKKIKKITTKAKSAGKTPFADAIEEAVD-FVRPTAGRRYLIVKTEDRKALLDELRQATANANVRI-KNDDTEIASE 324 (393) Q Consensus 247 ~e~D~~~ik~it~~at~~~~t~~~dal~Eald-~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l-~~~~~~~~~~ 324 (393) ....+..-+.|.+..|++ |-.-....+++|+|..+...||.. ++.+-.- ..+++.. 
T Consensus 132 --------------~~~~~~~~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~-----~~~~~~~~~~~~~~~--- 189 (272) T protein:vir:36 132 --------------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKD-----ANAKNIGSEVGANAL--- 189 (272) T ss_pred --------------cccccccccHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHhcc-----cccccccccccccce--- Confidence 112233445666666661 111122357999999998888652 1111111 1111110 Q ss_pred eecccchhhcccchhc----ee------eeeccccee---cccceeeecceeEeecCceEEEeeeecccccccCcceeee Q lcl|Aclame:pro 325 VGVDEIIVYTGSKAVK----PT------VLVDQKYHI---DMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVIT 391 (393) Q Consensus 325 v~~~~~~~~tG~k~~~----pt------v~vD~k~~~---~~~~~~~~~s~~~~~ns~~i~~~~~~~g~~~~~n~~~~~~ 391 (393) .-|.+--|-|...++ |. -++-.+.++ ..++++.-.-|..+..+..|-.+.+-..+|--|.+-+.++ T Consensus 190 -~~G~ig~~~G~~Vv~s~~~p~~~~~~~~~~~~~gA~~~~~~~~~~vE~~R~~~~~~d~i~~~~~y~~~v~~~~~vv~~t 268 (272) T protein:vir:36 190 -INGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNIT 268 (272) T ss_pred -eeeccceecCeeEEEeCCCCCCceeEEEEEecccceeeeecCCcccccccchhhcCcEEEEEEEEEEEEEcCccEEEEe Confidence 011110111332222 10 011111111 2244444445555556667777766667777777777777 Q ss_pred eC Q lcl|Aclame:pro 392 VS 393 (393) Q Consensus 392 v~ 393 (393) .. T Consensus 269 ~~ 270 (272) T protein:vir:36 269 FT 270 (272) T ss_pred ec Confidence 77 No 114 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=72.32 E-value=0.18 Score=24.61 Aligned_cols=285 Identities=17% Similarity=0.170 Sum_probs=116.0 Q ss_pred HHccCChhHHHHHHHH-------------HHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceee----ecccceeE Q lcl|Aclame:pro 94 KKNSGKSEIKNAWNAK-------------LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHV----TNVGALLV 156 (393) Q Consensus 94 ~~nqg~ke~k~AW~a~-------------L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV----~n~~~~a~ 156 (393) |-.--+-+..-.|+-. |++++.-.+|. 
+++.||+.+ .+++.+|..+-| .+..+|-. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~p~l~m~alTLaea~~l~~d~---~~~~VIE~l----~~~s~iL~~lpf~~ve~~~~~~~r 73 (330) T protein:vir:94 1 MVRICTPPLRGRWRTLTHQFPELKMPTVTLAESAKLSQDH---LVSGLIETI----VEVNPLYEMMPFTEIEGNALAYNR 73 (330) T ss_pred CceecCCccccceeehhccccccchhhhhhhHHhhcCchh---hHHHHHHhh----hccchHHhhcccccccCCcceeee Confidence 1111122333334322 33433322333 455555554 444555544333 33333322 Q ss_pred EEeeccccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeec Q lcl|Aclame:pro 157 SRSFDSSNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG 236 (393) Q Consensus 157 ~i~l~na~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~g 236 (393) .-.+.. .+|-.-.+.-+.....+|+..+.+...+--.-..-....+|.|.+-+...+-++....++= +..+..+++| T Consensus 74 ~~~lp~--a~~r~~n~~~~~~~~~Tf~q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~-~~~e~~linG 150 (330) T protein:vir:94 74 ENVLGD--VQFLAVGGTITAKNPATFTKVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAKSIG-RQYQASMITG 150 (330) T ss_pred eecCCc--ceeeeccccccccCcceeeeeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHH-HHHHHHhhcc Confidence 222222 2222222222222234455555543333333333444555666553333333333333443 3677899999 Q ss_pred cCCC-cc-ccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhceeeecCCCc-eEEEecchhhhHhhhhhcccccccee Q lcl|Aclame:pro 237 DGTN-GF-KSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRR-YLIVKTEDRKALLDELRQATANANVR 313 (393) Q Consensus 237 DG~~-~t-~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald~a~~~a~~~-~l~i~~~d~~a~~~~~~~~~~~a~~~ 313 (393) |+.+ .| -........ +-|+ +-+-++.++.|+|-|.||.+..+.+.+ +|+.++-.+.+|+--.|+++...-.. 
T Consensus 151 Ds~~~~F~GL~~~~~~~---q~i~--tg~~gg~~T~d~LDeLl~~v~~~~g~~~~~l~n~a~~r~I~a~~R~~~~~~v~~ 225 (330) T protein:vir:94 151 DGTGNSFQGMMGLVAAS---QTIS--AGANGGTLTFELLDQLLDLVKDKDGQVDYLMSSFAMRRKYFSLLRALGGAAIGE 225 (330) T ss_pred CCCCccccchhhcCCcc---cEEe--cCCCCCCCCHHHHHHHHHHhcCCCCCCcEEEechhHHHHHHHHHHhccCCCCCC Confidence 8662 11 111111111 1111 223456788899999999997776655 55556777788888777665433211 Q ss_pred e-ecCCCeEE-----Eeeecccchhhc-c---cchhceeeeec--ccc---ee---cccceeeecceeEe--ecCce--E Q lcl|Aclame:pro 314 I-KNDDTEIA-----SEVGVDEIIVYT-G---SKAVKPTVLVD--QKY---HI---DMQDLTKVDAFEWK--TNSNM--I 371 (393) Q Consensus 314 l-~~~~~~~~-----~~v~~~~~~~~t-G---~k~~~ptv~vD--~k~---~~---~~~~~~~~~s~~~~--~ns~~--i 371 (393) . .+..|.-+ .|+...+-+-.+ | +...-.+.+|. +.. .| .+.|.-.++-+.|- .+.+. - T Consensus 226 ~~~~~~G~~v~~~~GvPi~~~d~ip~~~~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~glsVr~~G~~~~k~v~~~ 305 (330) T protein:vir:94 226 VMTLPSGRQIPTYRGVPWFVNDFIPSNMTQGTATNATAIFAGTFDDGSNKYGIAGLTARGSAGLRVQNVGAKENADETIT 305 (330) T ss_pred cccccCCCEEeeeCCeEEEecccccCCCCcccCCCceeEEEEeecccccccceEeecCCCCCcceeeeCCCccccceeeE Confidence 1 11122211 222222211100 0 00000111111 110 00 11111111112211 12222 2 Q ss_pred EEeeeecccccccCcceeeee-C Q lcl|Aclame:pro 372 LVETLTSGHVETYNAGAVITV-S 393 (393) Q Consensus 372 ~~~~~~~g~~~~~n~~~~~~v-~ 393 (393) +|+.|.+--|..+-+.++++= . 
T Consensus 306 ~v~~y~~~av~~~~a~~~L~~V~ 328 (330) T protein:vir:94 306 RVKMYCGFANFSQLGLAAIKGLI 328 (330) T ss_pred EEEEeeeeEEechhheeeecccc Confidence 555565555555555544432 2 No 115 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=60.19 E-value=0.38 Score=22.90 Aligned_cols=247 Identities=14% Similarity=0.097 Sum_probs=110.6 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCcccccee-eecccc---e-eEEEeeccc-cccc Q lcl|Aclame:pro 94 KKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFH-VTNVGA---L-LVSRSFDSS-NEAQ 167 (393) Q Consensus 94 ~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fh-V~n~~~---~-a~~i~l~na-~~a~ 167 (393) |.+..++ -.|+ +.|+=+-..+...++. -.+|+.+- +.+.-. + .+.+-.=+. -.+. T Consensus 1 ma~~~T~----------------~~d~--i~Pev~s~~v~~~~~~-~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~ 61 (274) T protein:vir:96 1 MAQGTTK----------------VSNL--IVPEVLAPMMQAELDK-KLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQ 61 (274) T ss_pred CCccccc----------------hhhh--hhhHHHHHHHHHHHHh-hhhhcccccccccccCCCCCEEEEEeeccCCCcc Confidence 3332221 0111 3333322223333322 22333221 111000 1 011100011 1222 Q ss_pred eecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeec-cCCCccccch Q lcl|Aclame:pro 168 VHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG-DGTNGFKSID 246 (393) Q Consensus 168 GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~g-DG~~~t~~~~ 246 (393) -...|.+-+.++++.....+.-...|+..++-|....+.+ ++.+..+.+.++.++- |-++..+... 
+|.+-+ T Consensus 62 ~~~~g~~i~~~~it~~~~~~~i~~~~~~~~i~D~~~~~~~--~d~~~~~~~~~~~~~a-~~~d~~i~~~l~~a~~~---- 134 (274) T protein:vir:96 62 VIAEGEKIPVDQIGTSKREAKVRKIGKGTELTDEAVLSGF--GDPQGEAVRQHGLAIA-NKVDNDVLEALKGATLT---- 134 (274) T ss_pred ccCCCCcCchhhcccceeEEEEEeeeceeeecHHHHHhhc--chHHHHHHHHHHHHHH-HHHHHHHHHHHhcCCCC---- Confidence 2333445555556655555555554555555666655544 7789999999998877 5666554433 222110 Q ss_pred hhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhh----hhccccccceeeecCCCeE Q lcl|Aclame:pro 247 KEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDE----LRQATANANVRIKNDDTEI 321 (393) Q Consensus 247 ~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~----~~~~~~~a~~~l~~~~~~~ 321 (393) -.+.+...|.+.+|+ -|-......++||+|..+...|+.. .-+++...+-.+. +|.+ T Consensus 135 ----------------~~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~--~g~i 196 (274) T protein:vir:96 135 ----------------VEADITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIV--KGAF 196 (274) T ss_pred ----------------cCcccccHHHHHHHHHHhcccCCCceEEEeCHHHHHHHHhccccccccccccccccee--eccc Confidence 011222356666665 1222234678999999998888652 1122221111111 1111 Q ss_pred EEeeecccchhhcccchhc----e--eeeeccccee---cccceeeecceeEeecCceEEEeeeecccccccCcceeeee Q lcl|Aclame:pro 322 ASEVGVDEIIVYTGSKAVK----P--TVLVDQKYHI---DMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITV 392 (393) Q Consensus 322 ~~~v~~~~~~~~tG~k~~~----p--tv~vD~k~~~---~~~~~~~~~s~~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v 392 (393) .. +-|+..++ | +.++-.+.++ ...+++.-.-|....-+..|-.+.+-...+--|.+.++++. 
T Consensus 197 g~---------~~G~~Vi~s~~~p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~yg~~~~~~~~vv~~t~ 267 (274) T protein:vir:96 197 GE---------ALGAVIVRSNKLNKGEALLAKKGAVKLITKRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITK 267 (274) T ss_pred ce---------ecCeeEEEcCCCCcceEEEEeCcceeeeecCCcccccccchhhcccEEEEeeEEEEEEEcCccEEEEEc Confidence 11 11332222 1 0111111111 23333444445555566777777666666777778888887 Q ss_pred C Q lcl|Aclame:pro 393 S 393 (393) Q Consensus 393 ~ 393 (393) + T Consensus 268 ~ 268 (274) T protein:vir:96 268 G 268 (274) T ss_pred C Confidence 7 No 116 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=47.23 E-value=0.71 Score=21.40 Aligned_cols=286 Identities=14% Similarity=0.092 Sum_probs=106.8 Q ss_pred HHHccCCh--hHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCcccccee-eecccceeEEEeecc-ccccce Q lcl|Aclame:pro 93 LKKNSGKS--EIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFH-VTNVGALLVSRSFDS-SNEAQV 168 (393) Q Consensus 93 l~~nqg~k--e~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fh-V~n~~~~a~~i~l~n-a~~a~G 168 (393) |+..+|.- --+.-| .|.. .|.-.+.=+.-..-+..+|.. ..+|..+| +..+..+..+.-+.- ...+.+ T Consensus 1 ~a~~~~~~~~~~~~g~------~~~~-~d~~al~ie~~~geV~~~f~~-~s~~~~~~~~r~i~~G~sv~~~~iG~~~~~~ 72 (347) T protein:vir:88 1 MANATGGQQIGANQGK------GQSA-ADKLALFLKVFGGEVLTAFVR-RSVTMDKHMVRTIQNGKSASFPVMGRTKGYY 72 (347) T ss_pred CCCcccchhhhccCCC------Cccc-cchHHHHHHHHHHHHHHHHHH-HhhhhhccccccccCcceEEEeeecceeeee Confidence 33233321 112222 1221 232122225566677778884 67777777 444443332222222 345666 Q ss_pred ecccchh---------hhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCC Q lcl|Aclame:pro 169 HKDGQTK---------TEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGT 239 (393) Q Consensus 169 Hk~ga~K---------k~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~ 239 (393) |+.|+.= .+..+++-......-.||..-.. 
..--++.++|..---|.|.+-.+.+|-+.+..+- +. T Consensus 73 ~~~g~~l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~-q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a----~~ 147 (347) T protein:vir:88 73 LAPGENLDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDA-MNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLC----NL 147 (347) T ss_pred eccccCCCCCCCCCccceEEEEEechhhhhhhhhhHHHH-hhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhh----cc Confidence 6666531 12223333333455566655433 1112344445555556666555444322111110 00 Q ss_pred Cccccchhhhhhhhh-hhhhhhhhccCCCCCH----HH---HHhhhceeeecCCCceEEEecchhhhHhhhhhccccccc Q lcl|Aclame:pro 240 NGFKSIDKEADVKKI-KKITTKAKSAGKTPFA----DA---IEEAVDFVRPTAGRRYLIVKTEDRKALLDELRQATANAN 311 (393) Q Consensus 240 ~~t~~~~~e~D~~~i-k~it~~at~~~~t~~~----da---l~Eald~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~ 311 (393) +........|....+ ..+++.++.......+ ++ +.++||-..+..+.|+++|..+-..+|+.-.+-.+.+.+ T Consensus 148 ~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~ 227 (347) T protein:vir:88 148 PAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYA 227 (347) T ss_pred ccccccccCCccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhcchhhhhhhhc Confidence 000000011100000 0011111111222233 33 344677777777789999988878788875554444433 Q ss_pred eeeecCCCeEEEeeec----cc-chhhcccchhceeeee-------------cccceec------------------ccc Q lcl|Aclame:pro 312 VRIKNDDTEIASEVGV----DE-IIVYTGSKAVKPTVLV-------------DQKYHID------------------MQD 355 (393) Q Consensus 312 ~~l~~~~~~~~~~v~~----~~-~~~~tG~k~~~ptv~v-------------D~k~~~~------------------~~~ 355 (393) .-.....|.+..-.|+ ++ +.+ +++....+.-.+ -.+|..+ ..+ T Consensus 228 ~~~~~~~G~vg~i~G~~V~~s~nlp~-~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d 306 (347) T protein:vir:88 228 ALIDPETGNIRNVMGFEVIEVPHLTV-GGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKD 306 (347) T ss_pred cccchhcceeeeeccceEEEeecccc-cccccccccccccccccccccccccccccccccCcEEEEEechhhhhheeccc Confidence 2222222333222221 11 111 111111110000 0001000 111 Q ss_pred eeeecceeEeecCceEEEeeeeccc-ccccCcceeeeeC Q 
lcl|Aclame:pro 356 LTKVDAFEWKTNSNMILVETLTSGH-VETYNAGAVITVS 393 (393) Q Consensus 356 ~~~~~s~~~~~ns~~i~~~~~~~g~-~~~~n~~~~~~v~ 393 (393) +.+-..|.-+ .+....+--++-|| +-.|.+++++... T Consensus 307 ~~~e~~r~~~-~~~d~i~~~~~~G~~~~rPe~a~~~~~~ 344 (347) T protein:vir:88 307 MALERARRPE-FQADQIIGKYAMGHGGLRPEAAGALVFT 344 (347) T ss_pred ceeeeeechh-hHHHHhhhhhhhcCceeccceEEEEEeC Confidence 1111111111 11111111122222 3345666556555 No 117 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=40.28 E-value=0.98 Score=20.63 Aligned_cols=280 Identities=14% Similarity=0.007 Sum_probs=99.5 Q ss_pred HHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecc---cce-eEEEeeccccccce Q lcl|Aclame:pro 93 LKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNV---GAL-LVSRSFDSSNEAQV 168 (393) Q Consensus 93 l~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~---~~~-a~~i~l~na~~a~G 168 (393) |+-.||. -.-..+|+..++++-++|+-.-..|...|+. ..+|..+-...+ ..+ .+.|--=..-.+.. T Consensus 1 ~~~~~~~--------~~~~~~~~~~t~~~~fiPev~s~~v~~~l~~-~lv~~~l~~~~~~~~~~GdTV~ip~~g~~~a~d 71 (381) T protein:vir:80 1 MATIQGT--------GGYKGSAVDLSNVQVFIPEVWSSEVRMFRDQ-KFAALEATKKIPFEGKKGDLIHIPNISRAAVYD 71 (381) T ss_pred Cceeccc--------ccccCcccchhhHHhhhhHHHHHHHHHHHHH-hhhhhhccccccceeecCceEEeeccCcceeee Confidence 4444443 1224568888888888898888888888875 555543211111 011 11111011223444 Q ss_pred ecccchhhhhhhhhhh-------hhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceee------ Q lcl|Aclame:pro 169 HKDGQTKTEQAATLTI-------DTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVE------ 235 (393) Q Consensus 169 Hk~ga~Kk~q~~~le~-------~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~------ 235 (393) |+.|..=..+...-.. ....+..|+..-.. ...| ++...+++++...+= |.+|.++.. 
T Consensus 72 ~~~g~~i~~~~~~~~~~~itID~~~~~~~~Idd~D~~-------~~~~-D~~~~~~~~~~~aLA-~~~D~~i~~~~~~~~ 142 (381) T protein:vir:80 72 KQPQTPVNLQARTDSEFTFTVTKYKESSFMIEDIVNT-------QASY-TLRQYYTKEAGYALA-RDMDNFALAHRAVIN 142 (381) T ss_pred ecCCCcccccccCCceEEEEEeeeeecceeechHHHH-------hhcc-ChHHHHHHHHHHHHH-HHHHHHHHHHHhhcc Confidence 5544422222222221 12233333332221 1111 334445555554442 333433321 Q ss_pred cc--CCCccccchhhhhhhhhhhhhhhhhccCCCC-CHHHHHhhhceeeecCCCceEEEecchhhhHhhhhhccccccce Q lcl|Aclame:pro 236 GD--GTNGFKSIDKEADVKKIKKITTKAKSAGKTP-FADAIEEAVDFVRPTAGRRYLIVKTEDRKALLDELRQATANANV 312 (393) Q Consensus 236 gD--G~~~t~~~~~e~D~~~ik~it~~at~~~~t~-~~dal~Eald~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~ 312 (393) .. +.+++... -+++.......+.... ..|+ ...++.++||-+.....+|++||+.....+|+..-+-....... T Consensus 143 ~~~~~~~~t~~~-~i~~~~~~~~~t~~~~--~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~ 219 (381) T protein:vir:80 143 AFPSQRIYSYDT-TLGDGTVNAHLTGTPA--PLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQ 219 (381) T ss_pred cccccccccccc-cccccccccccccchh--hHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhcc Confidence 11 11111111 1111111110111000 0011 13456677776665556789999998888888643321111000 Q ss_pred eeecCCCeEE----Eeeecccchhh---cccchh--ceeeee----cccceecccc----eeeecceeEeecCceEEEee Q lcl|Aclame:pro 313 RIKNDDTEIA----SEVGVDEIIVY---TGSKAV--KPTVLV----DQKYHIDMQD----LTKVDAFEWKTNSNMILVET 375 (393) Q Consensus 313 ~l~~~~~~~~----~~v~~~~~~~~---tG~k~~--~ptv~v----D~k~~~~~~~----~~~~~s~~~~~ns~~i~~~~ 375 (393) --...+|.+. +.|..+|..=. ||.+.. .|...- ..+|.-|.+. 
+.++--+...+.+..-.+.+ T Consensus 220 ~~~l~~G~Ig~i~G~~Vv~Sn~lp~~~~t~~~~~agap~~~~~~~~~~~~~g~~s~~a~av~~~k~yd~~~~~~~~~~~~ 299 (381) T protein:vir:80 220 VKPVTSGVVGTILGMEVIVTTQIGINSLTGYVNGQGAPTQPTPGVLGSPYLPDQAGTANVVNTGSASDLAVSLSYFGLPV 299 (381) T ss_pred chhhhceeeeEEcceEEEeecccccccccceeeeccccccccccccccccccccccceeeeeeeeeeceeeeeeecccee Confidence 0001223222 22222221100 111100 111111 1111111111 11122222222222222222 Q ss_pred ee------------cccccccCcceeeeeC Q lcl|Aclame:pro 376 LT------------SGHVETYNAGAVITVS 393 (393) Q Consensus 376 ~~------------~g~~~~~n~~~~~~v~ 393 (393) +. .|.++.+++.+---|. T Consensus 300 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 329 (381) T protein:vir:80 300 FSGAGATAADGGQTLGSFGGANRWATAVVC 329 (381) T ss_pred eecceeeecCCCceeeeehhhhhhhhhccc Confidence 22 2223223332211110 No 118 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=40.15 E-value=0.98 Score=20.62 Aligned_cols=272 Identities=12% Similarity=0.065 Sum_probs=110.6 Q ss_pred HHccCChhHHHHHHHHHHhCccchh-hhHhhcc-hhHHHHHHHHHHhhCcccccee-eecccceeEEEeecc-cccccee Q lcl|Aclame:pro 94 KKNSGKSEIKNAWNAKLAENGVTIT-DTTFQLP-RKLVESINTALLNTNPVFKVFH-VTNVGALLVSRSFDS-SNEAQVH 169 (393) Q Consensus 94 ~~nqg~ke~k~AW~a~L~ekgV~~q-d~~eiLP-~~ii~AIe~A~ed~d~vl~~fh-V~n~~~~a~~i~l~n-a~~a~GH 169 (393) |.+... .=+=+| .+. |.. .|. +.-..-+.++|.. 
..+|.-+| +-.+..+.-..-+.- ...+.+| T Consensus 1 ms~~n~-~t~~~~---------~~~~~~~-al~le~f~geV~taf~~-~s~~~~~~~~rti~~gkS~q~~~iG~~~~~~~ 68 (364) T protein:vir:10 1 MSNPNV-LTQPAV---------SASGEVD-SLLIEKFNNRVHEQYLK-GENLLQWFDVQEVVGTNSVSNKYIGETELQVL 68 (364) T ss_pred CCCccc-cccccc---------ccccchh-hhhhhhhhhhHHHHHHH-HHhhcCcceeeeecccceEEeeeeeeeEEeee Confidence 332211 111112 111 111 334 4566778889984 77777777 544443322222222 3577899 Q ss_pred cccchhh-----hh--hhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHh-cceeeccCCCc Q lcl|Aclame:pro 170 KDGQTKT-----EQ--AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVD-LALVEGDGTNG 241 (393) Q Consensus 170 k~ga~Kk-----~q--~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~-rAvv~gDG~~~ 241 (393) +.|+.=- .. .++...+-..+..||..-..-.|...+.++|+.---|.+.+..+.+|-|.+- .|..+-++.+. T Consensus 69 ~~G~~ld~~~~~~~k~~itID~ll~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~ 148 (364) T protein:vir:10 69 SPGKSPDASPTEFDKNRLVVDTTVIARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRK 148 (364) T ss_pred ccCcccCCCCcccCcEEEEecceeeechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccc Confidence 9887211 11 2333333345777887754433443344667777778888877777756443 33333344322 Q ss_pred cccchhhhhhhhhhhhhhhhhccCCCCCHHHHH-------hhhceeeecCCCceEEEecchhhhHhhhhhccccccceee Q lcl|Aclame:pro 242 FKSIDKEADVKKIKKITTKAKSAGKTPFADAIE-------EAVDFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRI 314 (393) Q Consensus 242 t~~~~~e~D~~~ik~it~~at~~~~t~~~dal~-------Eald~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l 314 (393) +..-.-.|. . |+...+..+....++.|. +.||-.....+.|++++...-.-+|+.- +.=-|... 
T Consensus 149 ~~~~~~~g~--~---i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~----~~lvn~d~ 219 (364) T protein:vir:10 149 NPRVAGHGF--S---IHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDA----DRIVDKSY 219 (364) T ss_pred CCcccCCcc--e---eeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcC----Cccccccc Confidence 221111122 1 111223333345555554 4467777777889999977777777662 11111111 Q ss_pred e--cC----CCeEEEeeecccchhhcccchhceeeee------ccccee-cccceeeecceeEe---ecCceEEEeeeec Q lcl|Aclame:pro 315 K--ND----DTEIASEVGVDEIIVYTGSKAVKPTVLV------DQKYHI-DMQDLTKVDAFEWK---TNSNMILVETLTS 378 (393) Q Consensus 315 ~--~~----~~~~~~~v~~~~~~~~tG~k~~~ptv~v------D~k~~~-~~~~~~~~~s~~~~---~ns~~i~~~~~~~ 378 (393) - .+ +|.+..-.|+. ++ -++.+ |.+.- +...|- |..| +.+.|... +.+.++...--.- T Consensus 220 ~~~~~~~~~~G~v~~v~Gv~--Vv--~Sn~l-P~~~~~~~~t~~~t~h~ls~~~--~g~~y~v~~d~~~~~~~~f~~~Al 292 (364) T protein:vir:10 220 TIAASDNTVDGFVLKSWNTP--IV--PSNRF-PKLSDNTEGTGNTKHHKLSNAG--NGNRYDVTAGQTSAQAVLFTQDAL 292 (364) T ss_pred cccCCCccccceeEEEeceE--EE--ecccc-cccccccccccccccccccccc--CCcccccccccceeEEEEEecceE Confidence 0 11 12222222221 11 22222 43211 111111 1000 00111100 0011111000000 Q ss_pred ccccc------------------------------cCcceeeeeC Q lcl|Aclame:pro 379 GHVET------------------------------YNAGAVITVS 393 (393) Q Consensus 379 g~~~~------------------------------~n~~~~~~v~ 393 (393) |.+++ |-++.+++-+ T Consensus 293 ~tv~~~~~t~e~~~~~~~~~~~ida~~a~G~g~lRPeaa~~i~~~ 337 (364) T protein:vir:10 293 LVGRTISITGDIFYEKKEKTWYIDTFLAEGAIPDRWEAVAVVTAA 337 (364) T ss_pred EEEEEecceeeeeeccceeeeeeeeehcccCcccCccceEEEEec Confidence 11110 1111111111 No 119 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=32.15 E-value=1.4 Score=19.71 Aligned_cols=92 Identities=9% Similarity=0.073 Sum_probs=21.6 Q ss_pred 
CCcchhhHHHHHHHHHHHhhHHHHhhhhhhhhhhHHhh----hhHHHHHHHHHHHHHH----HHHHHHHHHHhhhhhhcc Q lcl|Aclame:pro 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIED----LPKVQELEKTLSENSI----EIIKIENELNAQEEKPKG 72 (393) Q Consensus 1 ~~k~d~~ekq~eLa~lK~~~~~~~s~i~~~~v~~a~~~----~skieelektis~l~a----Ei~k~enel~~~~Ek~K~ 72 (393) -.++.-.+.|...++...+...+..+........+..+ -.+++.++...+..++ ...+.+...... +-++. T Consensus 612 ~~~~~~~~~qq~~~e~qq~~~~~q~~~~~~q~~~~qa~ae~~~Aqae~~qa~~e~~~~q~q~~~~~~~aq~~~~-~~qq~ 690 (711) T protein:vir:10 612 LSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIEDMAQGGDV-VYQQV 690 (711) T ss_pred CcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHH Confidence 22333344444444444433333333222222211111 1222222221111111 111111111100 11111 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHccCChhHH Q lcl|Aclame:pro 73 KDKMTNFIESQNAVTEFFDVLKKNSGKSEIK 103 (393) Q Consensus 73 k~emtEfLkTkqA~~dya~ll~~nqg~ke~k 103 (393) ..++ +..+|. +...|+.-+=+ T Consensus 691 ~~~l----~~~qae------lq~~q~~~~q~ 711 (711) T protein:vir:10 691 RELV----AQALAE------ITASQANVTEQ 711 (711) T ss_pred HHHH----HHHHHH------HHHHHHHhhcC Confidence 1111 111111 12223221111 No 120 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=32.10 E-value=1.4 Score=19.70 Aligned_cols=267 Identities=12% Similarity=0.088 Sum_probs=102.9 Q ss_pred HHHccCCh--hHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCcccccee-eecccceeEEEeecc-ccccce Q lcl|Aclame:pro 93 LKKNSGKS--EIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFH-VTNVGALLVSRSFDS-SNEAQV 168 (393) Q Consensus 93 l~~nqg~k--e~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fh-V~n~~~~a~~i~l~n-a~~a~G 168 (393) |+..++.. .-+..|- |.. .|...+.=+.--.-|..+|.. 
..+|.-+| +.++..+.-+.-+.- ...+.+ T Consensus 1 ma~~~~~~~~~t~~g~~------~~~-~d~~al~ie~~~geV~~~f~~-~s~~~~~~~~rti~~G~sv~~~~iG~~~~~~ 72 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKG------MSA-GDKLALFLKVFGGEVLTAFTR-TSVTMNKHLVRSIQSGKSAQFPVLGRTKAAY 72 (347) T ss_pred CCccccccccccccccC------Ccc-cchHHHHHHHHhHHHHHHHHH-HHhhhhhhhheeccccceEEeeeccceeEee Confidence 22223222 1233332 222 232222225566677888885 57777677 555544433322222 456677 Q ss_pred ecccchh--hh-----hhhhhhhhh--ccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHH-HHhccee---- Q lcl|Aclame:pro 169 HKDGQTK--TE-----QAATLTIDT--LEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNK-IVDLALV---- 234 (393) Q Consensus 169 Hk~ga~K--k~-----q~~~le~~t--i~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~R-av~rAvv---- 234 (393) |+.|+.= +. ...++.+++ .....||..-+. .---++.+.|+.---|.|.+-.+.+|-+ +...|-. T Consensus 73 ~~~G~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~-q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~ 151 (347) T protein:vir:94 73 LQPGENLDDKRKDMKHTEKTINIDGLLTADVLIYDIEDA-MNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTAN 151 (347) T ss_pred eecCcCCCCCcCCccccceEEEEcchhhhhhhhhhHHHH-hcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 7777632 22 222233332 344455544322 0001344445555556666665544422 1111110 Q ss_pred --eccCCCccccchhhhhhhhhhhhhhhhhc-cCCCCCHHHHH---hhhceeeecCCCceEEEecchhhhHhhhhhcccc Q lcl|Aclame:pro 235 --EGDGTNGFKSIDKEADVKKIKKITTKAKS-AGKTPFADAIE---EAVDFVRPTAGRRYLIVKTEDRKALLDELRQATA 308 (393) Q Consensus 235 --~gDG~~~t~~~~~e~D~~~ik~it~~at~-~~~t~~~dal~---Eald~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~ 308 (393) ..+|.+........+. .+.+++. .......++++ +.||-+.+...+|++|+.....-+|+.-++-.+. 
T Consensus 152 ~~~~~g~~~~~~v~i~~~------~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~~ 225 (347) T protein:vir:94 152 NENIAGLGKAHVLEVGDQ------ATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILAALMPNAA 225 (347) T ss_pred ccccccCCcceeEeeecc------ccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHHhhccccc Confidence 0111111001111111 0111100 00011123343 5567777777789999998888888875555555 Q ss_pred ccceeeecCCCeEEEeeecccchhhcccchhc----eeeeecccceecccceeeec-----------ceeEeecCceEEE Q lcl|Aclame:pro 309 NANVRIKNDDTEIASEVGVDEIIVYTGSKAVK----PTVLVDQKYHIDMQDLTKVD-----------AFEWKTNSNMILV 373 (393) Q Consensus 309 ~a~~~l~~~~~~~~~~v~~~~~~~~tG~k~~~----ptv~vD~k~~~~~~~~~~~~-----------s~~~~~ns~~i~~ 373 (393) +.+.-.-...|.+..-. |++.++ |+-. + -.+....|+..-+ ++-+.+.. T Consensus 226 ~~~~~~~~~~G~V~~v~---------G~~V~~Sn~~p~~~-~-~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~----- 289 (347) T protein:vir:94 226 NYQALIDPSTGSIRNVM---------GFEVIEVPHLTAGG-A-GDNRAEEGVAPTNQKHAFPDTASGDTRVALDN----- 289 (347) T ss_pred ccccccccccceeEEee---------ceEEEEcCcccccc-C-cccccccccccccccccccccccccccccccc----- Confidence 54443333334443322 333332 2111 0 1111112211111 11111111 Q ss_pred eeeecccccccCccee-------eeeC Q lcl|Aclame:pro 374 ETLTSGHVETYNAGAV-------ITVS 393 (393) Q Consensus 374 ~~~~~g~~~~~n~~~~-------~~v~ 393 (393) +.|-+-.+++-.. +++. 
T Consensus 290 ---~~~l~~~~~A~~tv~~~~~~~e~~ 313 (347) T protein:vir:94 290 ---VVGLFNHRSAVGTVKLKDMALERA 313 (347) T ss_pred ---eEEEEechhhhhhhhhcccceeee Confidence 0111112221110 0001 No 121 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=27.20 E-value=1.9 Score=19.10 Aligned_cols=262 Identities=14% Similarity=0.131 Sum_probs=101.9 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeEE----Eeeccc-----c Q lcl|Aclame:pro 94 KKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVS----RSFDSS-----N 164 (393) Q Consensus 94 ~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fhV~n~~~~a~~----i~l~na-----~ 164 (393) |- .. -|+|++.--+|. +...|=+-|.+++.+|..+-|.++...... -.+... + T Consensus 1 mp-al----------tLaea~k~~~d~-------l~~~ViE~~~~~s~lL~~LpF~~veg~~~~ynR~~~~~~~~~~~v~ 62 (310) T protein:vir:97 1 MA-SV----------TLAESAKLAQDE-------LVAGVIENIITVNRMFDVLPFDSIEGNSLAYNRENVLGDVIMAGVG 62 (310) T ss_pred Cc-cc----------chHHHhhcCcch-------HHHHHHHHHhccchHHHhCCcccccCCcceeeEeeccCCccccccc Confidence 11 00 022222211222 333333344556777776555555443222 111111 1 Q ss_pred ccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhc-CchhHHHHHHHHHHHHHHHHHHHhcceeeccCCC-cc Q lcl|Aclame:pro 165 EAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQ-MSYSELYNLIVAELTQAIVNKIVDLALVEGDGTN-GF 242 (393) Q Consensus 165 ~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~-g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~-~t 242 (393) ...+|.- . .....+|+.++-.-..+..--..-....++. 
+.+-+.+.+-++-...++- +-.+..+++||+.+ .| T Consensus 63 ~~~~~~g-~--~~~~~t~~~~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~-~~~e~~lINGD~a~n~F 138 (310) T protein:vir:97 63 TTFSGAG-A--GKAAATFTKVNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAG-RKYQDQLINGNGAGNEF 138 (310) T ss_pred ccccCCC-c--cccccccceeeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHH-HHHHHHhhccccCCCcc Confidence 2223321 1 1222334333222222222222212234553 3232223333333334444 46778899998742 22 Q ss_pred ccchhhhhhhhhhhhhhhhhccCCCCCHHHHHhhhceeeec-CCCceEEEecchhhhHhhhhhccccccceee-ecCCCe Q lcl|Aclame:pro 243 KSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPT-AGRRYLIVKTEDRKALLDELRQATANANVRI-KNDDTE 320 (393) Q Consensus 243 ~~~~~e~D~~~ik~it~~at~~~~t~~~dal~Eald~a~~~-a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l-~~~~~~ 320 (393) .--..--|.. +.|++-+ -++.++.|+|-|-||.+.-+ +.-.+++.|+.-|.+|+--.|+++...---. .+..|. T Consensus 139 ~GL~~~~~~~--q~i~~~~--~gg~~t~d~LDeLl~~v~~~~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~~~~G~ 214 (310) T protein:vir:97 139 AGLIQLCASG--QKATTGA--TGSAISFAILDELMDLVVDKDGQVDYLTMHARTLRSYKALLRALGGASINEVVELPSGA 214 (310) T ss_pred cchhhcCCcc--ceeecCC--CCCCCCHHHHHHHHHHHhcCCCCCCEEEecHHHHHHHHHHHHHhcCCCCCCccccCCCC Confidence 1111111111 2222222 24667789999999988544 3446778888888888888887764332111 112222 Q ss_pred EEEeeecccchhhcccchhceeeeec---c---------ccee-------c--ccceee--------ecceeEe--ecCc Q lcl|Aclame:pro 321 IASEVGVDEIIVYTGSKAVKPTVLVD---Q---------KYHI-------D--MQDLTK--------VDAFEWK--TNSN 369 (393) Q Consensus 321 ~~~~v~~~~~~~~tG~k~~~ptv~vD---~---------k~~~-------~--~~~~~~--------~~s~~~~--~ns~ 369 (393) -+- +. ++ +|.+..| . .+.| + -+|+.- ++-+.|- .+.. 
T Consensus 215 ~v~-------~~--~G---iPi~~~d~ip~~~~~~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~G~~~~~~ 282 (310) T protein:vir:97 215 EVP-------AY--SG---TPIFRNDYIPTNQTKGGTTGCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDVGESEDSD 282 (310) T ss_pred EEe-------ee--CC---eEEEEeCccCCCccccccCCceeEEEEeeCccccccceeccccCCccceeEEeCCcccCCc Confidence 110 00 22 2333322 0 1111 0 122211 1111111 1112 Q ss_pred eE--EEeeeecccccccCcceeee-eC Q lcl|Aclame:pro 370 MI--LVETLTSGHVETYNAGAVIT-VS 393 (393) Q Consensus 370 ~i--~~~~~~~g~~~~~n~~~~~~-v~ 393 (393) .+ +|+.|.+=-|..+-+.++++ |. T Consensus 283 v~~~~V~~Y~~~av~~~~A~a~L~~V~ 309 (310) T protein:vir:97 283 EHIWRVKWYCGLALFSEKGLACADGIT 309 (310) T ss_pred ceeEEEEEeeeEEEecccceeeecccc Confidence 11 33444443333333333332 22 No 122 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=24.28 E-value=2.2 Score=18.71 Aligned_cols=266 Identities=13% Similarity=0.097 Sum_probs=104.1 Q ss_pred HHHccCChhHHHHHHHHHHhCccchhhhHhhcc-hhHHHHHHHHHHhhCcccccee-eecccceeEEEeec-ccccccee Q lcl|Aclame:pro 93 LKKNSGKSEIKNAWNAKLAENGVTITDTTFQLP-RKLVESINTALLNTNPVFKVFH-VTNVGALLVSRSFD-SSNEAQVH 169 (393) Q Consensus 93 l~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP-~~ii~AIe~A~ed~d~vl~~fh-V~n~~~~a~~i~l~-na~~a~GH 169 (393) |.-.+++.-.+.+|- |. ..|+ .|+ +---.-|.++|.. ..+|..+| +-.+..+.-+.-+. 
....+.+| T Consensus 1 m~~~~~~~~t~~~~~------~~-~~~~--~l~le~~~geV~~af~~-~s~~~~~~~~r~i~~G~s~~~~~iG~~~~~~~ 70 (334) T protein:vir:80 1 MTYPAANTHTRPGWG------GA-NSDV--SLHIEEHLGLVDASFMY-SSKFASWMNVRSLRGTNQLRVDRVGASTIAGR 70 (334) T ss_pred CCCCcCCCccccccc------cc-cchh--eehhhhhhhHHHHHHHH-hhhhhccceeeeccccceEEEeeecceeeeee Confidence 322333334555554 11 1111 355 4455667888885 68888788 55554432222222 24578899 Q ss_pred cccchhhhhhhhhhhh-------hccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHH-HHhcceee------ Q lcl|Aclame:pro 170 KDGQTKTEQAATLTID-------TLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNK-IVDLALVE------ 235 (393) Q Consensus 170 k~ga~Kk~q~~~le~~-------ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~R-av~rAvv~------ 235 (393) +.|+.=..|.+.-... -.....||..-.. .---++.++|+.---|.+.+..+.++-| ++..|.-. T Consensus 71 ~~g~~l~~~~~~~~~~~l~ID~~l~~~~~VddiD~~-q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~ 149 (334) T protein:vir:80 71 KAGEELVVQKNVSDKLNLTVDTVLYARHFFDKFDEW-TSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLK 149 (334) T ss_pred cCCCCCCCCCcccCceEEEEeeeeehhhhHhhHHHH-hcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc Confidence 9998444444333333 3444555555333 0011244444444455555555543323 22222211 Q ss_pred ---ccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHH-------HHHhhhceeeec---CCCceEEEecchhhhHhhh Q lcl|Aclame:pro 236 ---GDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFAD-------AIEEAVDFVRPT---AGRRYLIVKTEDRKALLDE 302 (393) Q Consensus 236 ---gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~d-------al~Eald~a~~~---a~~~~l~i~~~d~~a~~~~ 302 (393) ++|..+ ..+ .+++....-.++| +..+.||..... 
.++|+++|...-.-+|+.- T Consensus 150 ~~~~~G~~~--~~~------------~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~ 215 (334) T protein:vir:80 150 PAFHDGILL--PST------------ISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEH 215 (334) T ss_pred ccccCCcce--eec------------ccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcc Confidence 133322 111 1111111112232 233445555444 6899999988877777762 Q ss_pred hhccccccceeeec-CC-CeEEEeeecccchhhcccchhc----eeeeecccceecccc-------ee------------ Q lcl|Aclame:pro 303 LRQATANANVRIKN-DD-TEIASEVGVDEIIVYTGSKAVK----PTVLVDQKYHIDMQD-------LT------------ 357 (393) Q Consensus 303 ~~~~~~~a~~~l~~-~~-~~~~~~v~~~~~~~~tG~k~~~----ptv~vD~k~~~~~~~-------~~------------ 357 (393) =| =.|+..-+ ++ ..|+ -+.+--.-|+..|+ |+..+ +...+ ++. .+ T Consensus 216 ~r----~~n~d~~~s~~~~~~~----~g~i~~v~G~~V~~Sn~~P~~~~-t~~~~-g~~~~~~agd~t~~~~~~~~~~Al 285 (334) T protein:vir:80 216 DR----LMNVEFGAKEGGNSFV----GGRIAMLNGVRVVETPRFPQSAI-TANAL-GADFNVTDAEVRRKMITFIPSMAL 285 (334) T ss_pred cc----cccceecccccccccc----ceeEEEEeceEEEeecCCCCccc-ccccc-ccccccccccccceEEEEEeCceE Confidence 11 11111100 00 0111 01111111433333 43321 10000 000 00 Q ss_pred -eecceeE------eecCceEEEeee-eccc-ccccCcceeeeeC Q lcl|Aclame:pro 358 -KVDAFEW------KTNSNMILVETL-TSGH-VETYNAGAVITVS 393 (393) Q Consensus 358 -~~~s~~~------~~ns~~i~~~~~-~~g~-~~~~n~~~~~~v~ 393 (393) ++-...+ .-..++=.+.++ .-|| +-.|.+..++++. 
T Consensus 286 ~t~~~~~~~~e~~~~~~~~~d~i~~~~a~G~g~lRPeaa~vv~~~ 330 (334) T protein:vir:80 286 ISAQVHPVSAQFWEEKKDFGHYLDTFQSYNIGQRRPDAVAVHDIT 330 (334) T ss_pred EEEEEeecceeeeechhhHHHHHHHHHHcCCceeccceEEEEEEe Confidence 0000000 000000011111 1122 2234555566666 No 123 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=22.83 E-value=2.4 Score=18.51 Aligned_cols=270 Identities=10% Similarity=0.010 Sum_probs=108.2 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCcccccee-eecccceeEEEeecc-ccccceecc Q lcl|Aclame:pro 94 KKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFH-VTNVGALLVSRSFDS-SNEAQVHKD 171 (393) Q Consensus 94 ~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fh-V~n~~~~a~~i~l~n-a~~a~GHk~ 171 (393) |.+-.. .=+-+|. | ++ |...+.=+.-..-+.++|. +..+|.-+| +-.+..+--..-+.. ...+++|+. T Consensus 1 Ms~~n~-~t~~~~~------~-sg-~~~al~Le~f~GeV~taF~-~~si~~~~~~vRti~~gkS~qf~~~G~s~~~~~~p 70 (401) T protein:vir:70 1 MSTPNN-LTNVAVS------A-SG-EVDSLLIEKFNGKVNEQYL-KGENIMSYFDVQTVTGTNTVSNKYLGETELQVLAP 70 (401) T ss_pred CCCCcc-ccccccc------c-cc-chhHhHHhHhcchHHHHHH-HHhhhcccceeeeecccceEEEEEeeeeEeeeecC Confidence 333221 1111221 1 11 2222333455667788888 467776666 544444322222222 457889999 Q ss_pred cch------hh-hhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHH-hcceeeccCCCccc Q lcl|Aclame:pro 172 GQT------KT-EQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIV-DLALVEGDGTNGFK 243 (393) Q Consensus 172 ga~------Kk-~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav-~rAvv~gDG~~~t~ 243 (393) |+. +. +..++.-++-+....||.+...-.|..-+.++|+.-.-|.+.+..+.+|-|.+ ..|+.+.++-+. 
T Consensus 71 G~~ld~~~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~-- 148 (401) T protein:vir:70 71 GQSPAATSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRT-- 148 (401) T ss_pred CCCcCCCCcccccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc-- Confidence 982 11 22355556667888888875553443324567888888888887777665644 455544332211 Q ss_pred cchhhhhhhhhhhhhhhhhccCCCCCHHHHHhh-------hceeeecCCCceEEEecchhhhHhhhhhccccccceeee- Q lcl|Aclame:pro 244 SIDKEADVKKIKKITTKAKSAGKTPFADAIEEA-------VDFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIK- 315 (393) Q Consensus 244 ~~~~e~D~~~ik~it~~at~~~~t~~~dal~Ea-------ld~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~~a~~~l~- 315 (393) .+.-.+.-. .|+..+....-..++++|.-+ ||--....+ +++++..-+.-.++.+ ...--|.... T Consensus 149 ~p~~~~~G~---~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~-r~vvl~pp~~Ys~Ll~---~d~L~nrd~~~ 221 (401) T protein:vir:70 149 NPRVKGHGF---SINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDIS-DVAILMPWRYFNVLRD---ADRIVDKTYTI 221 (401) T ss_pred CCCcCCCce---EEeccccccccccCHHHHHHHHHHHHHHHHhcCCCcc-ceEEEcCHHHHHHHHh---cCcccchhhcc Confidence 111111111 122222222222344544322 333333345 4666644444333321 1111111111 Q ss_pred cCCCeEEEeeecccchhhcccchhc----eeeeecccceecccceeeecceeEeecCceEEEeeeecccccccCcceeee Q lcl|Aclame:pro 316 NDDTEIASEVGVDEIIVYTGSKAVK----PTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVIT 391 (393) Q Consensus 316 ~~~~~~~~~v~~~~~~~~tG~k~~~----ptv~vD~k~~~~~~~~~~~~s~~~~~ns~~i~~~~~~~g~~~~~n~~~~~~ 391 (393) +.+|.++- |.+-..-|...|+ |+.......|.. ++...+-.++. 
.-=.+-+.|-+-.+++-.+.+ T Consensus 222 s~~g~~~~----G~v~~vaGv~Vv~SnnlP~~a~~it~~~l-----s~a~~G~~y~~--~~d~s~~~~v~f~~~Av~tvk 290 (401) T protein:vir:70 222 SQSGATIQ----GFTLSSYNCPVIPSNRFPKYSQGQTHHLL-----SNEDNGYRYDP--LPAMNGAIAVLFTADALLVGR 290 (401) T ss_pred ccCCcccc----ceEEEEeceEEEeeccccccccccccccc-----cccCCCccCCC--CccccceeEEEEehhheEEEE Confidence 00111110 0000011444333 332211111111 00011111110 000111222233444444443 Q ss_pred eC Q lcl|Aclame:pro 392 VS 393 (393) Q Consensus 392 v~ 393 (393) +- T Consensus 291 ~~ 292 (401) T protein:vir:70 291 SI 292 (401) T ss_pred ee Confidence 32 No 124 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=22.01 E-value=2.5 Score=18.39 Aligned_cols=247 Identities=15% Similarity=0.075 Sum_probs=102.6 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCcccccee-eecc-c--ceeEEEeec-cc-cccc Q lcl|Aclame:pro 94 KKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFH-VTNV-G--ALLVSRSFD-SS-NEAQ 167 (393) Q Consensus 94 ~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fh-V~n~-~--~~a~~i~l~-na-~~a~ 167 (393) |.|.-++ ..|+ +.|+-.-..+...+.+. -+|..+- +++. + ++--+--|. +. 
..++ T Consensus 1 ma~~~T~----------------~~d~--iiPev~~~~v~~~~~~~-l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~ 61 (274) T protein:vir:97 1 MPQGLTK----------------TSDQ--IIPEVLAPMMQAQLEKK-LRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQ 61 (274) T ss_pred CCcccee----------------hhhe--echHHHHHHHHHhhhhh-hhhcccceecccccCCCCCEEEEeeecCCCccc Confidence 3333221 0111 33332222222222211 1122111 1111 0 011011110 11 1222 Q ss_pred eecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeec-cCCCccccch Q lcl|Aclame:pro 168 VHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG-DGTNGFKSID 246 (393) Q Consensus 168 GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~g-DG~~~t~~~~ 246 (393) -...|.+-+-+.++....++.....++-.+.-|...-+.+ ++.+..+++.++.++- |-++..+... .+.+- T Consensus 62 ~~~~g~~i~~~~lt~~~~~~~i~~~~~~~~i~D~~~~~~~--~dp~~~~~~~~a~a~a-~~vd~~~~~~l~~a~~----- 133 (274) T protein:vir:97 62 VVAEGEKIPTDILETKKREAKIRKIAKGTSITDEALLSGY--GDPQGEQVRQHGLAHA-NKVDNDVLEALMGAKL----- 133 (274) T ss_pred cccCCCcccccccccceeEEEeeeecceecccHHHHHhcc--chHHHHHHHHHHHHHH-HHHHHHHHHHHhccCc----- Confidence 3334444455555555555555444443444565555544 7778999999988887 4455443322 11100 Q ss_pred hhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhh----hhccccccceeeecCCCeE Q lcl|Aclame:pro 247 KEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDE----LRQATANANVRIKNDDTEI 321 (393) Q Consensus 247 ~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~----~~~~~~~a~~~l~~~~~~~ 321 (393) +. .+.+...+.+..|+ -|-......++|++|..+...|+.. .-++|...+..+. 
+|.+ T Consensus 134 -----------~~----~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~--~G~i 196 (274) T protein:vir:97 134 -----------TV----NADITKLNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIV--KGAF 196 (274) T ss_pred -----------cc----cccccCHHHHHHHHHHhhccCCCceEEEeCHHHHHHHHhhhhhhccccCccccccee--cccc Confidence 00 01122345666555 2222344678999999999888752 1122222211111 1222 Q ss_pred EEeeecccchhhcccchhc-eeeeeccccee--------cccceeeecceeEeecCceEEEeeeecccccccCcceeeee Q lcl|Aclame:pro 322 ASEVGVDEIIVYTGSKAVK-PTVLVDQKYHI--------DMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITV 392 (393) Q Consensus 322 ~~~v~~~~~~~~tG~k~~~-ptv~vD~k~~~--------~~~~~~~~~s~~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v 392 (393) ..-. |...++ +.+=+++.|-+ -.+++.+-..|....-+..|-.+.+-.-.+--+.+.++++. T Consensus 197 g~~~---------G~~Vi~s~~~p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~ 267 (274) T protein:vir:97 197 GEAL---------GAIIVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITK 267 (274) T ss_pred ceec---------CeeEEEcCCCCcceEEEEeCcceEeeecCCceeccccchhhcccEEEEEEEEEEEEEcCCceEEEec Confidence 1111 332222 11101221111 22334444444455555566666555556666666666666 Q ss_pred C Q lcl|Aclame:pro 393 S 393 (393) Q Consensus 393 ~ 393 (393) . T Consensus 268 ~ 268 (274) T protein:vir:97 268 G 268 (274) T ss_pred C Confidence 6 No 125 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=22.01 E-value=2.5 Score=18.39 Aligned_cols=247 Identities=15% Similarity=0.075 Sum_probs=102.6 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCcccccee-eecc-c--ceeEEEeec-cc-cccc Q lcl|Aclame:pro 94 KKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFH-VTNV-G--ALLVSRSFD-SS-NEAQ 167 (393) Q Consensus 94 ~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fh-V~n~-~--~~a~~i~l~-na-~~a~ 167 (393) |.|.-++ ..|+ +.|+-.-..+...+.+. -+|..+- +++. + ++--+--|. +. 
..++ T Consensus 1 ma~~~T~----------------~~d~--iiPev~~~~v~~~~~~~-l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~ 61 (274) T protein:vir:94 1 MPQGLTK----------------TSDQ--IIPEVLAPMMQAQLEKK-LRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQ 61 (274) T ss_pred CCcccee----------------hhhe--echHHHHHHHHHhhhhh-hhhcccceecccccCCCCCEEEEeeecCCCccc Confidence 3333221 0111 33332222222222211 1122111 1111 0 011011110 11 1222 Q ss_pred eecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeec-cCCCccccch Q lcl|Aclame:pro 168 VHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG-DGTNGFKSID 246 (393) Q Consensus 168 GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~g-DG~~~t~~~~ 246 (393) -...|.+-+-+.++....++.....++-.+.-|...-+.+ ++.+..+++.++.++- |-++..+... .+.+- T Consensus 62 ~~~~g~~i~~~~lt~~~~~~~i~~~~~~~~i~D~~~~~~~--~dp~~~~~~~~a~a~a-~~vd~~~~~~l~~a~~----- 133 (274) T protein:vir:94 62 VVAEGEKIPTDILETKKREAKIRKIAKGTSITDEALLSGY--GDPQGEQVRQHGLAHA-NKVDNDVLEALMGAKL----- 133 (274) T ss_pred cccCCCcccccccccceeEEEeeeecceecccHHHHHhcc--chHHHHHHHHHHHHHH-HHHHHHHHHHHhccCc----- Confidence 3334444455555555555555444443444565555544 7778999999988887 4455443322 11100 Q ss_pred hhhhhhhhhhhhhhhhccCCCCCHHHHHhhh-ceeeecCCCceEEEecchhhhHhhh----hhccccccceeeecCCCeE Q lcl|Aclame:pro 247 KEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKTEDRKALLDE----LRQATANANVRIKNDDTEI 321 (393) Q Consensus 247 ~e~D~~~ik~it~~at~~~~t~~~dal~Eal-d~a~~~a~~~~l~i~~~d~~a~~~~----~~~~~~~a~~~l~~~~~~~ 321 (393) +. .+.+...+.+..|+ -|-......++|++|..+...|+.. .-++|...+..+. 
+|.+ T Consensus 134 -----------~~----~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~--~G~i 196 (274) T protein:vir:94 134 -----------TV----NADITKLNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIV--KGAF 196 (274) T ss_pred -----------cc----cccccCHHHHHHHHHHhhccCCCceEEEeCHHHHHHHHhhhhhhccccCccccccee--cccc Confidence 00 01122345666555 2222344678999999999888752 1122222211111 1222 Q ss_pred EEeeecccchhhcccchhc-eeeeeccccee--------cccceeeecceeEeecCceEEEeeeecccccccCcceeeee Q lcl|Aclame:pro 322 ASEVGVDEIIVYTGSKAVK-PTVLVDQKYHI--------DMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITV 392 (393) Q Consensus 322 ~~~v~~~~~~~~tG~k~~~-ptv~vD~k~~~--------~~~~~~~~~s~~~~~ns~~i~~~~~~~g~~~~~n~~~~~~v 392 (393) ..-. |...++ +.+=+++.|-+ -.+++.+-..|....-+..|-.+.+-.-.+--+.+.++++. T Consensus 197 g~~~---------G~~Vi~s~~~p~~t~~l~~~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~ 267 (274) T protein:vir:94 197 GEAL---------GAIIVRTNKLEAGTAILAKKGAVKLILKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITK 267 (274) T ss_pred ceec---------CeeEEEcCCCCcceEEEEeCcceEeeecCCceeccccchhhcccEEEEEEEEEEEEEcCCceEEEec Confidence 1111 332222 11101221111 22334444444455555566666555556666666666666 Q ss_pred C Q lcl|Aclame:pro 393 S 393 (393) Q Consensus 393 ~ 393 (393) . T Consensus 268 ~ 268 (274) T protein:vir:94 268 G 268 (274) T ss_pred C Confidence 6 No 126 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=21.49 E-value=2.6 Score=18.31 Aligned_cols=275 Identities=16% Similarity=0.146 Sum_probs=103.5 Q ss_pred HHHccCCh-hHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCcccccee-eecccceeEEEeecc-cccccee Q lcl|Aclame:pro 93 LKKNSGKS-EIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFH-VTNVGALLVSRSFDS-SNEAQVH 169 (393) Q Consensus 93 l~~nqg~k-e~k~AW~a~L~ekgV~~qd~~eiLP~~ii~AIe~A~ed~d~vl~~fh-V~n~~~~a~~i~l~n-a~~a~GH 169 (393) |++..+++ --+..|- |. ..|.-.+.=+--+.-+..+|.. 
..+|..+| +.++..+..+.-+.- ...+.+| T Consensus 1 m~~~~~~~~~t~~g~~------~~-~~d~~al~ik~f~~eV~~~f~~-~s~~~~~~~~r~i~~G~sv~i~~iG~~tv~~~ 72 (347) T protein:vir:94 1 MANVPGQKIGTDQGKG------KS-SSDALALFLKVFAGEVLTAFTR-RSVTADKHIVRTIQNGKSAQFPVMGRTSGVYL 72 (347) T ss_pred CCCCCccccccccccC------Cc-cccHHHHHHHHHhHHHHHHHHH-HHhhhcccccccccccceEEEecccceeeeee Confidence 32222321 1122221 11 1121121114455667778884 67888777 655554443333332 4578888 Q ss_pred cccchh-------hhhhhhhhh--hhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHH--hcce----- Q lcl|Aclame:pro 170 KDGQTK-------TEQAATLTI--DTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIV--DLAL----- 233 (393) Q Consensus 170 k~ga~K-------k~q~~~le~--~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav--~rAv----- 233 (393) +.|+.= +....+|.+ .-.....||..-+. .-.-++.++|..---|.+.+-.+.+|-+.+ -.+- T Consensus 73 t~G~~l~~~~~~~~~~e~~itID~~~~~~~~VddiD~~-q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~ 151 (347) T protein:vir:94 73 APGERLSDKRKGIKHTEKVITIDGLLTADVMIFDIEDA-MNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASN 151 (347) T ss_pred cCCCCcCCCCCCCCcceEEEEecchhhhhHHhhhHHHH-hcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 887732 122222332 22335555554333 111134444555556666666665552211 0111 Q ss_pred --eeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHH---HhhhceeeecCCCceEEEecchhhhHhhhhhcccc Q lcl|Aclame:pro 234 --VEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAI---EEAVDFVRPTAGRRYLIVKTEDRKALLDELRQATA 308 (393) Q Consensus 234 --v~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~t~~~dal---~Eald~a~~~a~~~~l~i~~~d~~a~~~~~~~~~~ 308 (393) +.|+|.+........+|.... ........++| .+.||-+.+....||++|...-..+|+...+-.+. 
T Consensus 152 ~~~~g~~~~s~~~~~~~~~~~~~--------~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~ 223 (347) T protein:vir:94 152 ENIAGLGTASVLEVGKKADLDTP--------AKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAA 223 (347) T ss_pred cccCCCcccceeeccccccccch--------hhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhh Confidence 111111110000001110000 00000112333 45677777777789999988877777764332222 Q ss_pred ccceeeecCCCeEEEeeecccchhhcccchhc----ee-----eeeccccee------------------ccccee---- Q lcl|Aclame:pro 309 NANVRIKNDDTEIASEVGVDEIIVYTGSKAVK----PT-----VLVDQKYHI------------------DMQDLT---- 357 (393) Q Consensus 309 ~a~~~l~~~~~~~~~~v~~~~~~~~tG~k~~~----pt-----v~vD~k~~~------------------~~~~~~---- 357 (393) +...-.-..+|.+..-. |++.++ |+ ...+..+.+ +.+.-+ T Consensus 224 ~~~~~~~~~~G~Vg~i~---------G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~ 294 (347) T protein:vir:94 224 NYAALIDPETGNIRNVM---------GFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFS 294 (347) T ss_pred hccccccccccceEEEe---------ceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEe Confidence 11111111223332222 222222 11 011111111 110000 Q ss_pred -------------eecceeEeecCceEEEeeeeccc-ccccCcceeeeeC Q lcl|Aclame:pro 358 -------------KVDAFEWKTNSNMILVETLTSGH-VETYNAGAVITVS 393 (393) Q Consensus 358 -------------~~~s~~~~~ns~~i~~~~~~~g~-~~~~n~~~~~~v~ 393 (393) +...|.-+-......+--++-|| +-.|.++.+++.+ T Consensus 295 h~~A~~~v~~~~~~~e~~r~~~~~~d~i~~~~~~G~~~~rP~~a~~~~~~ 344 (347) T protein:vir:94 295 HRSAVGTVKLRDLALERDRDVDAQGDLIVGKYAMGHGGLRPEAAGALVFS 344 (347) T ss_pred ehhhhhhhhcccccccchhchhhHHHHhhhhhhhcCcccccceeEEEEec Confidence 00000000001111111122232 3345555556555 Done!