Query lcl|Aclame:protein:vir:93858|NCBI_annot:putative structural protein|genbank:acc:YP_764266;genbank:gi:115315579;genbank:GeneID:5141552 Match_columns 400 No_of_seqs 8 out of 12 Neff 2.6 Searched_HMMs 1612 Date Mon Dec 2 14:59:51 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_37 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_37_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:93966 Length: 400 100.0 3E-191 2E-194 1065.1 24.1 400 1-400 1-400 (400) 2 protein:vir:1663 Length: 393 # 100.0 9E-188 6E-191 1045.9 22.4 393 8-400 1-393 (393) 3 protein:vir:93858 Length: 400 100.0 1E-170 6E-174 952.6 27.4 400 1-400 1-400 (400) 4 protein:vir:861 Length: 318 # 100.0 3E-151 2E-154 845.5 17.6 318 83-400 1-318 (318) 5 protein:vir:94870 Length: 318 100.0 2E-135 1E-138 759.5 17.8 318 83-400 1-318 (318) 6 protein:vir:97397 Length: 517 100.0 9.7E-33 6E-36 196.1 20.1 361 1-400 128-512 (517) 7 protein:vir:4074 Length: 480 # 99.8 2.6E-20 1.6E-23 128.0 16.3 347 1-400 102-475 (480) 8 protein:vir:9704 Length: 394 # 99.7 4.4E-18 2.8E-21 115.7 22.8 352 8-400 1-388 (394) 9 protein:vir:4953 Length: 397 # 99.7 3.9E-18 2.4E-21 116.0 17.8 341 8-400 1-383 (397) 10 protein:vir:98339 Length: 415 99.7 3.9E-17 2.4E-20 110.6 22.5 367 8-400 1-402 (415) 11 protein:vir:81100 Length: 415 99.7 3.9E-17 2.4E-20 110.6 22.5 367 8-400 1-402 (415) 12 protein:vir:79987 Length: 415 99.7 3.9E-17 2.4E-20 110.6 22.5 367 8-400 1-402 (415) 13 protein:vir:4997 Length: 397 # 99.7 1.5E-17 9E-21 112.9 19.2 340 8-400 1-383 (397) 14 protein:vir:4830 Length: 397 # 99.6 3.3E-17 2E-20 110.9 19.8 337 8-400 1-383 (397) 15 protein:vir:7409 Length: 408 # 99.6 2.2E-17 1.4E-20 111.9 18.7 354 1-400 1-391 (408) 16 protein:vir:4600 Length: 415 # 99.6 1.3E-16 7.9E-20 107.7 21.9 363 8-400 1-402 (415) 17 protein:vir:4700 Length: 415 # 99.6 1.3E-16 7.9E-20 107.7 21.9 363 8-400 1-402 (415) 18 protein:vir:962 Length: 397 # 99.6 3.3E-16 2E-19 105.5 23.7 351 3-400 1-395 (397) 19 protein:vir:100172 Length: 394 99.6 3.8E-17 2.3E-20 110.6 18.5 346 8-400 1-382 (394) 20 protein:vir:102119 Length: 404 99.6 8.1E-17 5E-20 108.8 20.2 359 8-400 1-398 (404) 21 protein:vir:191 Length: 385 # 99.6 5.9E-17 3.7E-20 109.6 19.5 361 8-400 1-382 (385) 22 protein:vir:1886 Length: 385 # 99.6 5.9E-17 3.7E-20 109.6 19.5 361 8-400 1-382 (385) 23 protein:vir:9410 Length: 415 # 99.6 3E-16 1.8E-19 105.7 23.2 367 8-400 1-402 (415) 24 protein:vir:10364 Length: 390 99.6 1.4E-16 8.5E-20 107.6 20.7 366 3-400 1-390 (390) 25 protein:vir:4339 Length: 395 # 99.6 1.1E-16 6.7E-20 108.1 20.1 364 8-400 1-393 (395) 26 protein:vir:3991 Length: 404 # 99.6 9.4E-17 5.8E-20 108.4 18.9 350 1-400 1-391 (404) 27 protein:vir:3870 Length: 400 # 99.6 1.4E-15 8.6E-19 102.1 23.7 353 1-400 1-397 (400) 28 protein:vir:1084 Length: 437 # 99.6 1.2E-15 7.7E-19 102.3 23.0 360 1-400 1-425 (437) 29 protein:vir:4511 Length: 409 # 99.6 2.2E-15 1.4E-18 100.9 24.0 360 8-400 1-404 (409) 30 protein:vir:81070 Length: 390 99.6 1.5E-16 9.2E-20 107.4 17.5 364 8-400 1-390 (390) 31 protein:vir:95376 Length: 425 99.6 1.2E-15 7.7E-19 102.3 22.1 378 1-400 1-419 (425) 32 protein:vir:102873 Length: 392 99.6 5.8E-16 3.6E-19 104.1 20.2 340 8-400 1-382 (392) 33 protein:vir:107593 Length: 392 99.6 5.8E-16 3.6E-19 104.1 20.2 340 8-400 1-382 (392) 34 protein:vir:102082 Length: 392 99.6 5.8E-16 3.6E-19 104.1 20.2 340 8-400 1-382 (392) 35 protein:vir:105004 Length: 392 99.6 5.8E-16 3.6E-19 104.1 20.2 340 8-400 1-382 (392) 36 protein:vir:1268 Length: 397 # 99.6 2.4E-15 1.5E-18 100.7 23.4 346 1-400 3-395 (397) 37 protein:vir:100884 Length: 389 99.6 1.7E-16 1.1E-19 107.0 17.1 347 13-400 1-380 (389) 38 protein:vir:1025 Length: 408 # 99.6 3.3E-16 2.1E-19 105.4 18.5 353 1-400 1-391 (408) 39 protein:vir:97053 Length: 390 99.6 2.6E-16 1.6E-19 106.0 17.9 364 3-400 1-390 (390) 40 protein:vir:3845 Length: 395 # 99.6 6E-16 3.7E-19 104.0 19.5 340 8-400 1-381 (395) 41 protein:vir:100135 Length: 418 99.5 2.2E-15 1.3E-18 101.0 20.1 367 1-400 1-413 (418) 42 protein:vir:81160 Length: 371 99.5 1.4E-15 8.8E-19 102.0 18.5 330 8-400 1-369 (371) 43 protein:vir:8102 Length: 543 # 99.5 1.9E-14 1.2E-17 95.8 21.7 365 1-400 120-540 (543) 44 protein:vir:485 Length: 407 # 99.4 8.6E-14 5.3E-17 92.2 22.3 366 3-400 1-398 (407) 45 protein:vir:4456 Length: 401 # 99.4 8.7E-14 5.4E-17 92.2 21.7 372 1-400 1-399 (401) 46 protein:vir:1433 Length: 435 # 99.4 5.2E-14 3.2E-17 93.4 19.2 365 8-400 1-429 (435) 47 protein:vir:100247 Length: 425 99.4 9.7E-14 6E-17 91.9 20.4 363 1-400 12-422 (425) 48 protein:vir:105038 Length: 428 99.4 7.5E-14 4.7E-17 92.5 19.6 367 8-400 1-424 (428) 49 protein:vir:1328 Length: 392 # 99.4 1E-13 6.2E-17 91.9 19.9 362 8-400 1-389 (392) 50 protein:vir:6212 Length: 434 # 99.4 3.2E-13 2E-16 89.1 22.6 359 8-400 1-425 (434) 51 protein:vir:4092 Length: 390 # 99.4 6.5E-14 4E-17 92.9 18.8 343 8-400 1-366 (390) 52 protein:vir:104256 Length: 458 99.4 2.2E-13 1.3E-16 90.0 21.5 375 1-400 1-454 (458) 53 protein:vir:1383 Length: 421 # 99.4 1.7E-13 1.1E-16 90.5 20.0 341 8-400 1-390 (421) 54 protein:vir:80376 Length: 435 99.4 2.2E-13 1.4E-16 90.0 19.1 360 8-400 1-429 (435) 55 protein:vir:101607 Length: 379 99.3 3.7E-12 2.3E-15 83.3 20.4 346 8-400 1-375 (379) 56 protein:vir:93881 Length: 387 99.2 1.1E-11 6.9E-15 80.6 22.4 350 8-400 1-377 (387) 57 protein:vir:80128 Length: 466 99.2 1.7E-11 1E-14 79.7 23.4 377 1-400 1-446 (466) 58 protein:vir:81227 Length: 413 99.2 4.9E-12 3E-15 82.6 19.7 359 8-400 1-408 (413) 59 protein:vir:6242 Length: 390 # 99.2 7.9E-12 4.9E-15 81.5 19.6 352 8-400 1-387 (390) 60 protein:vir:94673 Length: 419 99.2 6.6E-12 4.1E-15 81.9 19.0 363 8-400 1-415 (419) 61 protein:vir:96978 Length: 387 99.2 4.4E-11 2.7E-14 77.4 21.6 364 8-400 1-379 (387) 62 protein:vir:94424 Length: 387 99.2 4.4E-11 2.7E-14 77.4 21.6 364 8-400 1-379 (387) 63 protein:vir:2685 Length: 387 # 99.2 4.4E-11 2.7E-14 77.4 21.6 364 8-400 1-379 (387) 64 protein:vir:101650 Length: 497 99.2 2.1E-11 1.3E-14 79.1 19.8 366 8-400 1-489 (497) 65 protein:vir:7855 Length: 497 # 99.2 2.1E-11 1.3E-14 79.1 19.8 366 8-400 1-489 (497) 66 protein:vir:9361 Length: 402 # 99.1 6.2E-11 3.8E-14 76.6 20.8 363 1-400 1-392 (402) 67 protein:vir:8420 Length: 477 # 99.0 2.8E-10 1.8E-13 72.9 20.4 365 8-400 1-467 (477) 68 protein:vir:103955 Length: 324 99.0 3.9E-12 2.4E-15 83.1 9.7 281 73-400 1-311 (324) 69 protein:vir:98635 Length: 377 99.0 6.5E-11 4.1E-14 76.4 16.1 337 1-400 1-375 (377) 70 protein:vir:99749 Length: 324 99.0 6.2E-12 3.9E-15 82.0 9.5 275 100-400 1-311 (324) 71 protein:vir:97148 Length: 324 99.0 7.6E-12 4.7E-15 81.5 9.8 277 100-400 1-313 (324) 72 protein:vir:9309 Length: 324 # 98.9 9.7E-12 6E-15 81.0 9.8 276 100-400 1-313 (324) 73 protein:vir:78640 Length: 352 98.9 3.2E-10 2E-13 72.7 16.5 312 17-400 1-342 (352) 74 protein:vir:9643 Length: 377 # 98.9 9.3E-10 5.8E-13 70.1 18.8 339 1-400 1-375 (377) 75 protein:vir:78830 Length: 324 98.9 2.9E-11 1.8E-14 78.4 10.1 277 100-400 1-313 (324) 76 protein:vir:96392 Length: 324 98.9 2.9E-11 1.8E-14 78.4 10.1 277 100-400 1-313 (324) 77 protein:vir:94142 Length: 304 98.8 1.6E-11 9.7E-15 79.8 7.7 269 101-400 1-303 (304) 78 protein:vir:105905 Length: 304 98.8 1.6E-11 9.7E-15 79.8 7.7 269 101-400 1-303 (304) 79 protein:vir:41 Length: 299 # N 98.8 5.8E-11 3.6E-14 76.7 9.1 260 101-400 1-296 (299) 80 protein:vir:4226 Length: 326 # 98.8 4.9E-11 3.1E-14 77.1 7.9 281 100-400 1-321 (326) 81 protein:vir:96223 Length: 324 98.7 1.4E-10 8.4E-14 74.7 9.5 274 100-400 1-311 (324) 82 protein:vir:7771 Length: 330 # 98.7 7.3E-11 4.5E-14 76.2 6.4 273 101-400 1-321 (330) 83 protein:vir:2430 Length: 318 # 98.7 2.8E-10 1.7E-13 73.0 9.2 275 101-400 1-311 (318) 84 protein:vir:95963 Length: 395 98.6 2E-08 1.3E-11 62.8 18.3 347 1-400 1-374 (395) 85 protein:vir:101291 Length: 381 98.6 2.7E-08 1.6E-11 62.1 18.1 331 27-400 1-366 (381) 86 protein:vir:9509 Length: 381 # 98.6 2.7E-08 1.6E-11 62.1 18.1 331 27-400 1-366 (381) 87 protein:vir:95763 Length: 297 98.6 3.3E-10 2.1E-13 72.5 7.6 264 101-400 1-292 (297) 88 protein:vir:78523 Length: 338 98.5 9.3E-10 5.8E-13 70.1 8.2 276 111-400 1-331 (338) 89 protein:vir:78223 Length: 333 98.5 1.9E-09 1.2E-12 68.3 8.9 278 111-400 1-330 (333) 90 protein:vir:100632 Length: 381 98.4 1.3E-07 8.1E-11 58.3 18.4 333 27-400 1-366 (381) 91 protein:vir:4856 Length: 293 # 98.4 2.3E-09 1.4E-12 67.9 8.8 249 116-400 1-279 (293) 92 protein:vir:9759 Length: 303 # 98.4 2.2E-09 1.4E-12 68.0 8.5 263 120-400 1-299 (303) 93 protein:vir:2504 Length: 305 # 98.4 2.7E-09 1.7E-12 67.5 8.9 260 119-400 1-294 (305) 94 protein:vir:93616 Length: 645 98.4 1.1E-07 6.9E-11 58.7 16.5 372 1-400 167-635 (645) 95 protein:vir:78350 Length: 383 98.3 4.5E-07 2.8E-10 55.4 19.1 339 8-400 1-373 (383) 96 protein:vir:104085 Length: 320 98.3 8.1E-09 5E-12 64.9 9.3 278 101-400 1-315 (320) 97 protein:vir:2344 Length: 397 # 98.3 1E-08 6.4E-12 64.4 9.1 268 104-400 1-304 (397) 98 protein:vir:99920 Length: 311 98.2 4.7E-09 2.9E-12 66.3 6.3 264 117-400 1-308 (311) 99 protein:vir:94771 Length: 298 98.2 2.3E-08 1.4E-11 62.4 8.8 260 123-400 1-295 (298) 100 protein:vir:9574 Length: 300 # 98.2 2.6E-08 1.6E-11 62.1 8.9 265 117-400 1-296 (300) 101 protein:vir:1638 Length: 298 # 98.2 3.5E-08 2.2E-11 61.5 9.2 263 117-400 1-295 (298) 102 protein:vir:8187 Length: 311 # 98.1 2.4E-08 1.5E-11 62.3 8.0 266 114-400 1-308 (311) 103 protein:vir:5739 Length: 366 # 97.6 1.3E-06 8.2E-10 52.8 9.8 313 31-400 1-362 (366) 104 protein:vir:80684 Length: 315 97.1 7.9E-06 4.9E-09 48.6 8.5 261 117-400 1-304 (315) 105 protein:vir:3158 Length: 321 # 96.9 8.4E-05 5.2E-08 42.9 12.3 281 104-400 1-310 (321) 106 protein:vir:96762 Length: 632 96.3 0.00073 4.5E-07 37.8 15.4 360 1-400 202-623 (632) 107 protein:vir:4197 Length: 314 # 95.2 0.00073 4.5E-07 37.8 9.7 276 106-400 1-309 (314) 108 protein:vir:3033 Length: 272 # 92.3 0.0041 2.5E-06 33.7 8.5 241 101-400 1-265 (272) 109 protein:vir:9820 Length: 272 # 92.3 0.0041 2.5E-06 33.7 8.5 241 101-400 1-265 (272) 110 protein:vir:94933 Length: 330 91.5 0.016 9.7E-06 30.5 11.2 276 101-400 1-319 (330) 111 protein:vir:4159 Length: 315 # 90.6 0.0063 3.9E-06 32.6 7.7 275 99-399 1-315 (315) 112 protein:vir:93742 Length: 274 87.2 0.022 1.4E-05 29.6 8.2 241 101-400 1-266 (274) 113 protein:vir:3613 Length: 272 # 76.1 0.14 8.7E-05 25.3 11.0 245 101-400 1-268 (272) 114 protein:vir:96123 Length: 274 74.7 0.16 9.6E-05 25.0 8.5 241 101-400 1-266 (274) 115 protein:vir:80213 Length: 334 61.8 0.16 0.0001 24.9 5.4 245 100-400 1-289 (334) 116 protein:vir:97433 Length: 274 61.4 0.35 0.00022 23.1 8.2 241 101-400 1-266 (274) 117 protein:vir:94494 Length: 274 61.4 0.35 0.00022 23.1 8.2 241 101-400 1-266 (274) 118 protein:vir:97255 Length: 310 49.8 0.63 0.00039 21.7 10.7 265 101-400 1-300 (310) 119 protein:vir:80180 Length: 381 44.9 0.68 0.00042 21.5 5.9 262 100-400 1-284 (381) 120 protein:vir:6324 Length: 335 # 41.0 0.95 0.00059 20.7 10.3 257 101-400 1-285 (335) 121 protein:vir:96262 Length: 274 32.6 1.4 0.00088 19.8 8.6 241 101-400 1-266 (274) 122 protein:vir:95898 Length: 274 32.6 1.4 0.00088 19.8 8.6 241 101-400 1-266 (274) 123 protein:vir:105334 Length: 276 30.5 1.6 0.00097 19.5 11.9 240 101-400 1-266 (276) 124 protein:vir:94711 Length: 347 29.3 1.4 0.00085 19.8 4.9 280 100-400 1-344 (347) 125 protein:vir:94576 Length: 347 28.4 1.8 0.0011 19.2 7.3 280 100-400 1-326 (347) 126 protein:vir:1239 Length: 274 # 27.6 1.8 0.0011 19.1 8.3 240 101-400 1-266 (274) 127 protein:vir:96833 Length: 275 24.3 2.2 0.0014 18.7 9.5 242 117-400 1-267 (275) 128 protein:vir:80930 Length: 278 22.0 2.5 0.0016 18.4 8.0 249 101-400 1-273 (278) 129 protein:vir:99675 Length: 324 21.9 2.5 0.0016 18.4 6.6 234 152-400 1-292 (324) No 1 >protein:vir:93966 Length: 400 # NCBI annotation: structural protein # Family: family:all:2417 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764320;genbank:gi:115315634;genbank:GeneID:5176553 Probab=100.00 E-value=2.9e-191 Score=1065.15 Aligned_cols=400 Identities=99% Similarity=1.286 Sum_probs=399.6 Q ss_pred CcccccccchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHhhhcch Q lcl|Aclame:pro 1 MRISKRNMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGK 80 (400) Q Consensus 1 ~~~s~~~~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kEk~K~k 80 (400) ||||+||||||+++|||++|+++||+++++|||++||+++++++++||++|||+|+|+++.||+++|||||+++|+|||| T Consensus 1 mriS~~~~~K~~l~EK~~~~a~~~E~~~~LKS~~~G~evknaiedl~K~~EL~~TlS~~~iEI~~~en~LNa~~E~~KGK 80 (400) T protein:vir:93 1 MRISKRNMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGK 80 (400) T ss_pred CcccccccccchHHHHHHHHhhhhhhhhhhhhhhhcchhhhhhhhchhHHHHHHhHhhcchhhhhhhhhhhhhhhhhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccc Q lcl|Aclame:pro 81 DKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGA 160 (400) Q Consensus 81 ~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~ 160 (400) ++|++||+|++|+.+||++||+|+|+++.++||+++|+|+||+++|++++||+|++++|++||+||||||++|||+|+|+ T Consensus 81 ~kMt~~i~sq~A~~eF~~vL~~N~G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~ 160 (400) T protein:vir:93 81 DKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGA 160 (400) T ss_pred HHHHHHHhhHHHHHHHHHHHhccCCchhhhhhhhhhHhhcCcceeccchhccHHHHHHHHHhhhccCcceeeeeeccchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeEEeeccccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcce Q lcl|Aclame:pro 161 LLVSRSFDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLAL 240 (400) Q Consensus 161 ~a~~i~l~na~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAv 240 (400) |+++++||++++|+||++|++|++|.++|.++||+|++||++|+|||++++++|+|++||||+|+||+|+||+|||+||+ T Consensus 161 ~~V~~s~~s~~~Aq~HkdGqTK~eqa~~~~~~Tl~~~~VY~~~S~Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~Al 240 (400) T protein:vir:93 161 LLVSRSFDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLAL 240 (400) T ss_pred hhHHhhhhhhhhhhhhccCCccccceeeeeeechhHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceecccccceEEEEecchhHHHHhhhhhccccccce Q lcl|Aclame:pro 241 VEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQATANANVR 320 (400) Q Consensus 241 v~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~ 320 (400) |+|||+|+++++++++|+++|++||++++++|+|||+|+++||+||+||++||||||++.+||+||+||||||++||.+| T Consensus 241 V~GDG~N~f~~~DK~advK~I~~~Ttkaksagktpfadaieeavdfvrptagrrylivktedrkalldelrqatanahvr 320 (400) T protein:vir:93 241 VEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKTEDRKALLDELRQATANAHVR 320 (400) T ss_pred heecCCCCccchhhHHHHHHHHHHhhhhhhcCCCchhHHHHHHHhhhccCCCceEEEEeccchHHHHHHHHhhccccceE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecCCcceeehhhccccceecchhhhchhhhhcccceechhhceeccceeeeecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 321 IKNDDTEIASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 321 lk~~d~~~~~~v~v~~~~~~tg~k~l~~t~~vd~k~~~~~~~~~~~~~~~~~~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) |+|+|.+|+++|||+++++|||+|+|+|||+||||||||||||+++|+|+|++||||||||||||||||||||||||||| T Consensus 321 iknddaeiasevgvdeiivytgskalkptvlvdqkyhidmqdltkvdafewktnsnmilvetltsghvetynagavitvs 400 (400) T protein:vir:93 321 IKNDDAEIASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred eecchhhhhhhcCcceeeeeeccccccceeeeccccccchhhhhhhhhheeccCCceEEEeecccCcceeeccceeEeeC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:1663 Length: 393 # NCBI annotation: unknown # Family: family:all:2417 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044952;genbank:gi:9629659;genbank:GeneID:1261309 Probab=100.00 E-value=9.4e-188 Score=1045.93 Aligned_cols=393 Identities=100% Similarity=1.290 Sum_probs=392.5 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHhhhcchhHHHHHH Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMTNFI 87 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kEk~K~k~emtEfL 87 (400) |+|||++|||++||+|||+++++||||+||+++++++++||++|||+|+|+++.||+++|||||+++|+||||++|++|| T Consensus 1 mnkpdliekqnrlaelkennvslksqisgfevknaiedl~K~~ELe~TlSe~~iEI~k~en~LN~~eE~~KGK~kMt~~i 80 (393) T protein:vir:16 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMTNFI 80 (393) T ss_pred CCCcchhhhhhhhhhhhhcccchhhhccchhhhhhhhhchhHHHHHHhHhhcchhhhhhhhhhhhhhhcchhhHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEee Q lcl|Aclame:pro 88 ESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSF 167 (400) Q Consensus 88 kTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l 167 (400) +|++|+.+||++||+|+|+++.++||+++|+|+||+++|++++||+|++++|++||+||||||++|||+|+|+|+++++| T Consensus 81 esq~A~~eF~~vL~~N~G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~V~~s~ 160 (393) T protein:vir:16 81 ESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSF 160 (393) T ss_pred hhHHHHHHHHHHHhccCCchhhhhhhhhhHhhcCcceeccchhccHHHHHHHHHhhhccCcceeeeeeccchhhhHHhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCC Q lcl|Aclame:pro 168 DSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTN 247 (400) Q Consensus 168 ~na~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~ 247 (400) |++++|+||++|++|++|.++|.++||+|+|||++|+|||++++++|+|++||||+|+||+|+||+|||+||+|+|||++ T Consensus 161 ~s~~eAq~HkdGqTK~eqa~~~~~~Tl~~~~VY~~~S~Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~GDG~N 240 (393) T protein:vir:16 161 DSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTN 240 (393) T ss_pred hhhhhhhhhccCCccccceeeeeeechhHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhhheecCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceecccccceEEEEecchhHHHHhhhhhccccccceeecCCcc Q lcl|Aclame:pro 248 GFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTE 327 (400) Q Consensus 248 ~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~ 327 (400) +++++++++|+++|++||++++++|+|||+|+++||+||+||++||||||++.+||+||+||||||++|||+||+|+|++ T Consensus 241 ~f~~~DK~advK~I~k~Ttkaksagktpfadaieeavdfvrptagrrylivktedrkalldelrqatananvriknddte 320 (393) T protein:vir:16 241 GFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIKNDDTE 320 (393) T ss_pred CccchhhHHHHHHHHHHhhhhhhcCCCchhHHHHHHHhhhccCCCceEEEEeccchHHHHHHHHhhhccCceeeeccchh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeehhhccccceecchhhhchhhhhcccceechhhceeccceeeeecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 328 IASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 328 ~~~~v~v~~~~~~tg~k~l~~t~~vd~k~~~~~~~~~~~~~~~~~~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) |+++|||+++++|||+|+|+|||+||||||||||||+++|+|+|++||||||||||||||||||||||||||| T Consensus 321 iasevgvdeiivytgskalkptvlvdqkyhidmqdltkvdafewktnsnmilvetltsghvetynagavitvs 393 (393) T protein:vir:16 321 IASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 393 (393) T ss_pred hhhhcCcceeeeeeccccccceeeeccccccchhhhhhhhhheeccCCceEEEeecccCcceeeccceeEeeC Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999 No 3 >protein:vir:93858 Length: 400 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764266;genbank:gi:115315579;genbank:GeneID:5141552 Probab=100.00 E-value=1e-170 Score=952.56 Aligned_cols=400 Identities=100% Similarity=1.291 Sum_probs=389.9 Q ss_pred CcccccccchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHhhhcch Q lcl|Aclame:pro 1 MRISKRNMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGK 80 (400) Q Consensus 1 ~~~s~~~~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kEk~K~k 80 (400) ||||+|||||||++|||++|+++||+++++|||++||+++++++++||++|||+|+|+|++||+|+|||||+++|+|||| T Consensus 1 ~~~s~~~~~k~~~~ek~~~~~~~~e~~~~lks~~~g~~~~~~~~~~~k~~el~kT~Sel~~ei~k~e~eln~~~E~~Kgk 80 (400) T protein:vir:93 1 MRISKRNMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGK 80 (400) T ss_pred CcccccccccchHHHHHHHHhhhhhhhhhhhhhhhccchhhhhhhchhHHHHHHHHHHhHHHHHHHhhhhhhhhhhcccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccc Q lcl|Aclame:pro 81 DKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGA 160 (400) Q Consensus 81 ~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~ 160 (400) |+|||||+|++|+++||++||+|||++|+++||+|+|+|+||++||+|+|||+|||+|||+||++++++|++|||+|+|+ T Consensus 81 ~~mtefLkT~~A~~~fa~~l~~nsg~sd~knaW~A~l~E~gvt~td~n~iLP~~il~aIq~al~~~~~~~~f~~v~n~p~ 160 (400) T protein:vir:93 81 DKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGA 160 (400) T ss_pred hhHHHhhhhHHHHHHHHHHHHhhcCCcchhhhhhhhhhhcccccCCchhhcchHHHHHHHHhhhccCCcccceeeecCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeEEeeccccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcce Q lcl|Aclame:pro 161 LLVSRSFDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLAL 240 (400) Q Consensus 161 ~a~~i~l~na~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAv 240 (400) +++..+++++++|||||+|++|++|+++|++|+|+|+|||+||+|+|++++++|+||+||+|||+|||||||||||+||+ T Consensus 161 l~V~~~~dt~~qa~gHk~G~~K~eq~~tl~~rtL~P~~VYk~~~la~~~~~~~~tygaL~nYVm~EL~q~vI~k~Ve~Ai 240 (400) T protein:vir:93 161 LLVSRSFDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLAL 240 (400) T ss_pred eeeecchhhhcccceeccCCcccceeeeeeeeccCHHHHHHHhhhhhhhhhccccHHHHHHHHHHHHHHHHHHHHhhhhe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceecccccceEEEEecchhHHHHhhhhhccccccce Q lcl|Aclame:pro 241 VEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQATANANVR 320 (400) Q Consensus 241 v~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~ 320 (400) ++|||+|++...++++|++.|++++..+...++|++.|.+++++||++|+++|+++|+|++|++|++++||+|..+||++ T Consensus 241 i~GdG~Ngf~~~dk~t~Ik~I~~dt~kt~~a~~~~~qdl~E~~~d~~~~~aad~~~Iv~s~d~~A~L~~lk~a~~~a~f~ 320 (400) T protein:vir:93 241 VEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQATANANVR 320 (400) T ss_pred eecccccccCCCcchhhhhhhhhhhhhhhhcCCccHHHHHHHHHhhhhhccCCceeEEeccchHHHHHHhcCCcceeeee Confidence 99999999999999999999999966655555555555555558999999999999999999999999999999999999 Q ss_pred eecCCcceeehhhccccceecchhhhchhhhhcccceechhhceeccceeeeecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 321 IKNDDTEIASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 321 lk~~d~~~~~~v~v~~~~~~tg~k~l~~t~~vd~k~~~~~~~~~~~~~~~~~~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) +.+.|..|++.+||+++++|||+++++|+|+|||+||||||||+++++|+|+|||||||||||++||+++||+|++++|| T Consensus 321 ~~n~d~~IA~~fGv~~Lv~~Tr~~~~kp~V~VDek~~i~~~~~~t~~sf~~~tNs~~ilvetlv~Gsi~~~N~~ay~~v~ 400 (400) T protein:vir:93 321 IKNDDTEIASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred eccccchhhhhcccceeeeeccCCCCCceeeeehhhhccccCceeccceeeeeccceEEeeeeeccceecccceeeEeeC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 No 4 >protein:vir:861 Length: 318 # NCBI annotation: putative minor structural protein # Family: family:all:2417 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047120;genbank:gi:9630573;genbank:GeneID:1261764 Probab=100.00 E-value=3.4e-151 Score=845.48 Aligned_cols=318 Identities=98% Similarity=1.283 Sum_probs=317.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeeccccee Q lcl|Aclame:pro 83 MTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALL 162 (400) Q Consensus 83 mtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a 162 (400) |++||+|++|+.+||++||+|+|++|.++||.++|+|+||+++|++++||+|++++|++||+||||||++|||+|+|+|+ T Consensus 1 mtn~iesq~A~~eF~~vL~~N~G~S~~k~AW~A~L~E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~~~~~ 80 (318) T protein:vir:86 1 MTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALL 80 (318) T ss_pred CcchhhhhHHHHHHHHHHhccCCchhhhhhhhhhhhhcCceeeccchhccHHHHHHHHHhhhccCcceeeeeeccchhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEeeccccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceee Q lcl|Aclame:pro 163 VSRSFDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVE 242 (400) Q Consensus 163 ~~i~l~na~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~ 242 (400) ++++||+.-+||||++|++|++|.++|.++||+|++||++|+|||++++++|+|++||||+|+||+|+||+|||+||+|+ T Consensus 81 V~~s~~s~AeAq~HkdGqTK~eqa~~~~~~Tl~~~~VY~~~S~Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk~Vd~AlV~ 160 (318) T protein:vir:86 81 VSRSFDSSAEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVE 160 (318) T ss_pred hhhhhhhhhhhhhhccCCccccceeeeeeechhHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhhee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceecccccceEEEEecchhHHHHhhhhhccccccceee Q lcl|Aclame:pro 243 GDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQATANANVRIK 322 (400) Q Consensus 243 gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk 322 (400) |||.|+++++++++|+++|++||+++++++.|||+.+++|++||+||++||||||++++||+||+||||||++||.+||| T Consensus 161 GDG~N~f~~~DK~advK~I~k~Ttkaksagttpfanaieeavdfvrptagrrylivkaedrkalldelrqatanahvrik 240 (318) T protein:vir:86 161 GDGSNGFKSIDKEADVKKIKKITTKAKSAGTTPFANAIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQATANAHVRIK 240 (318) T ss_pred ecCCCCccchhhHHHHHHHHHHhhhhhccCCCchhhHHHHHHhhhccCCCceEEEEeecchHHHHHHHHhhcccceeEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCcceeehhhccccceecchhhhchhhhhcccceechhhceeccceeeeecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 323 NDDTEIASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 323 ~~d~~~~~~v~v~~~~~~tg~k~l~~t~~vd~k~~~~~~~~~~~~~~~~~~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) |+|++|+++|||+++++|||+|+|+|||+||||||||||||+++|+|+|++|||||||||||||||||||+|+||||| T Consensus 241 nddteiasevgvdeiivytgskalkptvlvdqkyhidmqdltkvdafewktnsnmilvetltsghvetynagavitvs 318 (318) T protein:vir:86 241 NDDTEIASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 318 (318) T ss_pred ccchhhhhhcCcceeeeeeccccccceeeeccceecchhhhhhhhcceeccCCceEEEeecccCcceeecCceeEEeC Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999999 No 5 >protein:vir:94870 Length: 318 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762518;genbank:gi:115304217;genbank:GeneID:5141183 Probab=100.00 E-value=1.7e-135 Score=759.50 Aligned_cols=318 Identities=99% Similarity=1.292 Sum_probs=317.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeeccccee Q lcl|Aclame:pro 83 MTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALL 162 (400) Q Consensus 83 mtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a 162 (400) |++|+++++|+.+|+++|..|+|++++++||-++|+++||+++|++++||++++++|++|+.++++||.+|||+|.+++. T Consensus 1 mtnfiesqnavteffdvlkknsgkseiknawnaklaengvtitdttfqlprklvesintallntnpvfkvfhvtnvgall 80 (318) T protein:vir:94 1 MTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALL 80 (318) T ss_pred CccchhhhhhHHHHHHHHhcccChhhhhhhhhhhhhhCCceeecchhhhHHHHHHhhhhhhccCCcceeeeeehhhhhee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eEEeeccccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceee Q lcl|Aclame:pro 163 VSRSFDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVE 242 (400) Q Consensus 163 ~~i~l~na~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~ 242 (400) +++++++.|++|.|++|++|++|..+|.++++.|.||||+|+|++++++|+|+|++|||+++.||.|+++++||+.|+|. T Consensus 81 vsrsfdssneaqvhkdgqtkteqaatltidtlepvmvyklqslaervkrlqmsyselynlivaeltqaivnkivdlalve 160 (318) T protein:vir:94 81 VSRSFDSSNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVE 160 (318) T ss_pred eeccccccchhhhhcccccccccceeeeecccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHhhhhheeeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceecccccceEEEEecchhHHHHhhhhhccccccceee Q lcl|Aclame:pro 243 GDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQATANANVRIK 322 (400) Q Consensus 243 gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk 322 (400) |||+++++++++++|+++||+||++++++++|||+|+++||+||+||++||||||++.+||+||+||||||++|||+||| T Consensus 161 gdgtngfksidkeadvkkikkittkaksagktpfadaieeavdfvrptagrrylivktedrkalldelrqatananvrik 240 (318) T protein:vir:94 161 GDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKTEDRKALLDELRQATANANVRIK 240 (318) T ss_pred cCCcchhhhhchhhhHHHHHHhhhhhhhcCCCchhHHHHHHHhhhccCCCceEEEEeccchHHHHHHHHhhhcccceEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCcceeehhhccccceecchhhhchhhhhcccceechhhceeccceeeeecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 323 NDDTEIASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 323 ~~d~~~~~~v~v~~~~~~tg~k~l~~t~~vd~k~~~~~~~~~~~~~~~~~~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) |+|++|+++|||+++++|||+|+.+|||+||||||||||||+++|+|+|++||||||||||||||||||||||||||| T Consensus 241 nddteiasevgvdeiivytgskavkptvlvdqkyhidmqdltkvdafewktnsnmilvetltsghvetynagavitvs 318 (318) T protein:vir:94 241 NDDTEIASEVGVDEIIVYTGSKAVKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 318 (318) T ss_pred ccchhhhhhcCcceeEEeeccccccceeEeccceecchhhhhhhhceeeccCCceEEEEecccCcceeecCceeEEeC Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999999 No 6 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=100.00 E-value=9.7e-33 Score=196.09 Aligned_cols=361 Identities=20% Similarity=0.251 Sum_probs=258.0 Q ss_pred CcccccccchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHH-HHHHHHHhhH----- Q lcl|Aclame:pro 1 MRISKRNMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEII-KIENELNAQE----- 74 (400) Q Consensus 1 ~~~s~~~~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~-K~enEl~~~k----- 74 (400) -+|..++ +.++.++...++..+.++...+.. ++..+...+++++++++.+.+.+.. +..++++... T Consensus 128 a~I~~vk------e~~~~e~~~~~~~~a~~ee~~e~~--~k~~el~a~l~~~~~~~~~~~~e~~~~l~a~~~~~~~~~~~ 199 (517) T protein:vir:97 128 AVVTYFR------EEKKKEENKMTFDQNLMQELLDAK--KLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKI 199 (517) T ss_pred hhhhhhh------hhhhhhhhhhhhhhhhhhhhhhhh--hhHHHHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHhhhh Confidence 2222221 112233334444433333333322 2333334556666665555544432 2333333222 Q ss_pred ---hhhcchhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCcccc Q lcl|Aclame:pro 75 ---EKPKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFK 151 (400) Q Consensus 75 ---Ek~K~k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~ 151 (400) .+....+...+|++++.+...++.. ....+....|...+.+.++.+ ...|..++..|.+.+....++.. T Consensus 200 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~----~~~p~~~~~~i~~~~~~~~~i~~ 271 (517) T protein:vir:97 200 LGVEALKVTPEATEFLKTREAEVAYMSA----SLTKDPKAAWTAELKERGISG----MPAPAGILKRIQDAVNDEGSLLP 271 (517) T ss_pred cccccccccchhhHHHHHHHHHHHHHHh----cccccccceeeeecccccccc----cccchHHHHHHHHhhhhhcccee Confidence 2333444566888888877766433 233455678888888887744 24688899999999998888988 Q ss_pred ceeeecccceeeEEeecc-ccccceecccchhhhhhhhhhhhhccHHHHHHHHHH-HHHHHhhcC-chhHHHHHHHHHHH Q lcl|Aclame:pro 152 VFHVTNVGALLVSRSFDS-ANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSL-AERVKRLQM-SYSELYNLIVAELT 228 (400) Q Consensus 152 ~fhV~n~~~~a~~i~l~n-a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~L-ad~~k~l~g-~ygalvnyvm~ELa 228 (400) .+.+++++.. .+.+++ ...+++|..|+.|.+++++|...++.|.++|.+.++ .+++.+.-- ...+|.+|++++|+ T Consensus 272 ~~~~~~i~~~--~~~~~~~~~~a~~~~eG~~kp~s~~tf~~~~~~~~~ia~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~ 349 (517) T protein:vir:97 272 FIRHENLPTL--VVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLP 349 (517) T ss_pred eeeeccccce--eeecccccceeeeeecCCcccccccceeeEEeeHhhhhhhhhhhHHHHHHhhhccHHHHHHHHHHHHH Confidence 7778877654 555544 468899999999999999999999999999999887 333333211 13468999999999 Q ss_pred HHHHHHHHhcceeeccCC--CccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceecccccceEEEEecchhHHH Q lcl|Aclame:pro 229 QAIVNKIVDLALVEGDGT--NGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKAEDRKAL 306 (400) Q Consensus 229 q~fI~Rav~rAvv~gDG~--~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~~~~~~l~~~~~d~~a~ 306 (400) .+|- +..++++++|||+ +.+-+.+..++.+ +..+..|...+++...|.-+-+.+....+++++.|+++| T Consensus 350 ~~l~-~~ee~a~l~GdGtg~~~~gi~~~a~~~~--------~~~~~~~~~~~d~i~~l~~a~~~a~~a~~vmn~~t~~~I 420 (517) T protein:vir:97 350 DMVI-MAVNRAIIMGGVTGVSETQIYPVVGDAW--------ATNVTGTTNIQELLEKLSVATPKAADSTLVIHRNDLAAI 420 (517) T ss_pred HHHH-HHHHHHHhcccCCCcccccccccccccc--------cccccccchHHHHHHHHHHHhhhccCCEEEECHHHHHHH Confidence 9999 7999999999886 4556666655544 233444566667777776666666778899999999999 Q ss_pred HhhhhhccccccceeecCCcceeehhhccccceec--chhhhchhhhhcccceechhhceeccceee--------eecCc Q lcl|Aclame:pro 307 LDELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKALKPTVLVDQKYHIDMQDLTKVDAFEW--------KTNSN 376 (400) Q Consensus 307 ~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l~~t~~vd~k~~~~~~~~~~~~~~~~--------~~~~~ 376 (400) +- |||.+|+|.++.++..-..+| |..-+.|.+.+|++..++++||+.+|.+++ .+|+- T Consensus 421 ~k------------lKD~~G~Yl~~~~~~~~~~~~l~G~~~~~~~~~~~~~~~~~~~~y~i~~~~g~~~~~~fd~~~n~~ 488 (517) T protein:vir:97 421 RF------------LKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDEKTAVSLSGYVTNGSRGMEFEQGTILVENNK 488 (517) T ss_pred HH------------hhcCCCCeeccCcCCcccccccCCccccccccccCceeEeeccccEEEeecceeeeeeeecccCce Confidence 98 999999999999998888888 777788999999999999999998887664 56899 Q ss_pred eEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 377 MILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 377 ~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) ++++|+.++|+|...++.+..++. T Consensus 489 ~f~~~~~~~g~i~~~~r~a~~~~~ 512 (517) T protein:vir:97 489 EYLFEMPISGSLEYKGTTAYGTYT 512 (517) T ss_pred eEeeeeeeccccccccceEEEEEc Confidence 999999999999999999998888 No 7 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=99.75 E-value=2.6e-20 Score=127.97 Aligned_cols=347 Identities=17% Similarity=0.190 Sum_probs=170.7 Q ss_pred Cccc--ccccchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhh----hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhH Q lcl|Aclame:pro 1 MRIS--KRNMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKN----AIEDLPKVQELEKTLSENSIEIIKIENELNAQE 74 (400) Q Consensus 1 ~~~s--~~~~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~----~~~~~skieElektis~l~aEi~K~enEl~~~k 74 (400) +-+| -...++...+ ..+|+.......+...-+.++ +...+.+..+++..+.+++.+++..+....+.. T Consensus 102 ~EvS~v~~pa~~~a~v------~~vks~~~~~e~~~~~~e~~e~~~e~~e~~~~~~el~akl~el~k~~ee~k~~~~~~~ 175 (480) T protein:vir:40 102 TEVSLTPLPSNKGAKV------TKVREENKGEQEQMGANETQEIMKQAIEAGVKVRELEAKVEELNKEREELKKEREASI 175 (480) T ss_pred EEeEEeecccchhhhh------hhhhhhhhhhhhhhhhHHHHHHHHhhhhhhhhhhhHHHHHHHHHhHHHHHhhhhhhhc Confidence 1111 1111222221 122221111111100000000 111122233333333333333332221111111 Q ss_pred hhhcchhHHH--HHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccch-hhhHhhcchhHHHHHHHHHHhhCcccc Q lcl|Aclame:pro 75 EKPKGKDKMT--NFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTI-TDTTFQLPRKLVESINTALLNTNPVFK 151 (400) Q Consensus 75 Ek~K~k~emt--EfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~-qd~~eiLP~~iI~AIe~A~ed~d~vl~ 151 (400) .. ..++.. ..+++ ...|.+-+.+ ......+.+...++ .+-.-.+|+.+...+--......+.+. T Consensus 176 ~~--~~~~~~~~~e~r~---~~~~~~~~~e--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 242 (480) T protein:vir:40 176 PS--EKPEDAERKFMRE---LGSKMAEMPE--------QGFLREFANGADLNVVNSLGSITSKYARKSGIYDGAMKARFQ 242 (480) T ss_pred cc--cchhhhhhHHHHH---HHHHhccchh--------hhhhhhhhhhccccccccccccccchhhheeechhhhhhhhh Confidence 11 011110 11111 1112111110 01111111111100 000012232211111000111111111 Q ss_pred ceeeecccceeeEEeecc-ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHH Q lcl|Aclame:pro 152 VFHVTNVGALLVSRSFDS-ANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQA 230 (400) Q Consensus 152 ~fhV~n~~~~a~~i~l~n-a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~ 230 (400) ...+... +..+ ..-+.+|..+++++.+. +..+.+.|..+|+++.+++.-..+-....+|.+|+.+||+.+ T Consensus 243 ~~~~~~~-------g~~~~~~~~e~~~~~~~~~~~~--~~~~~~~~~~v~~l~~~~k~t~~lLDDa~~l~~~i~~~l~~~ 313 (480) T protein:vir:40 243 GLTLAED-------GVDDTFISGTFKAGTDKNKSQT--ATKRSLRPQMAEAYLQMDKATVRGVNDSGALSEYVMSEMVNR 313 (480) T ss_pred cceeeec-------cccceeeeeeeecccccccccc--cccchhhHHHHHHHHHhHHHHHHHhhhhHHHHHHHHHHHHHH Confidence 1111111 1111 23567888888877664 567888999999999996655443332337999999999999 Q ss_pred HHHHHHhcceeec--cCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceeccccc--ce-EEEEecchhHH Q lcl|Aclame:pro 231 IVNKIVDLALVEG--DGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAG--RR-YLIVKAEDRKA 305 (400) Q Consensus 231 fI~Rav~rAvv~g--DG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~~~--~~-~l~~~~~d~~a 305 (400) |- ++.+.|+++| ||.+.+..++..++.+ +.+.+++++...|-.+-+.+. .. .+|+|+.++++ T Consensus 314 ~~-~~ee~a~l~G~g~g~~~~~g~~~~~~~~------------~~~~~~~d~id~L~~al~~~y~~~a~~~vmn~~t~~~ 380 (480) T protein:vir:40 314 VI-QKVEYNMILGSVDGSNGFYGLKTATDGW------------TKQIEYTDLFEGITDAVAECSISDAITIVMSPQTFAE 380 (480) T ss_pred HH-HHHHHHhhccCCCCccccccceeecccc------------cccchhHHHHHHHHHhhhHHhhCCCCEEEECHHHHHH Confidence 99 7999999999 7777777887766533 122334555555533334332 33 58899999999 Q ss_pred HHhhhhhccccccceeecCCcceeehhhccccceec--chhh---------hchhhhhcccc-eechhhceeccceeeee Q lcl|Aclame:pro 306 LLDELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKA---------LKPTVLVDQKY-HIDMQDLTKVDAFEWKT 373 (400) Q Consensus 306 ~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~---------l~~t~~vd~k~-~~~~~~~~~~~~~~~~~ 373 (400) |+- |||.||+|.+.-++..-...| |.-. -.|++..+.+| .+--.+....+.|.+.+ T Consensus 381 I~k------------lKD~~G~Yi~q~~~~~~~~~~llG~pvv~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~ 448 (480) T protein:vir:40 381 LRK------------AKGTDGHSRFNELATKEQIAQSFGAVNLETRVWMPKDEVAVYNHDEYVLIGDLNVENYNDFDLRY 448 (480) T ss_pred HHH------------hhcCCCCeeccCcccccCcceecccceeeeeccccCCcceeeeCCccEEEEecccceeccccccc Confidence 998 999999999865555444333 4321 23456666665 33223455677899999 Q ss_pred cCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 374 NSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 374 ~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) ++.++|.|+..+|+|..+++++.+..- T Consensus 449 ~~~~~~~e~~v~g~~~~~~~~~~~~~~ 475 (480) T protein:vir:40 449 NVEQWLSETLVGGSIRGKNRSAYLKKK 475 (480) T ss_pred chhhhhhhhhhceeeEccccEEEEEec Confidence 999999999999999999887776654 No 8 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=99.71 E-value=4.4e-18 Score=115.71 Aligned_cols=352 Identities=12% Similarity=0.083 Sum_probs=194.1 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhh----hhHHHHHHHHHHHHHHHHHHHHHHHHhhHh-hhcch-- Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIED----LPKVQELEKTLSENSIEIIKIENELNAQEE-KPKGK-- 80 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~----~skieElektis~l~aEi~K~enEl~~~kE-k~K~k-- 80 (400) |-+..++|..+.++++++.+...+..+...-.++..++ .++++.+++.+.+++.+++..+...+...+ ....+ T Consensus 1 M~~~~l~el~~~l~e~~~~i~~~~~e~~~~~~~~~~~~~~~l~~eie~l~~ei~~l~~~~~~~e~~~e~~~~~~~~~~~~ 80 (394) T protein:vir:97 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEV 80 (394) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Confidence 88889999999999999988877766544322221111 234444444444444444333332222111 11111 Q ss_pred --------hHHHHHHHHHHHHHHHHHHHHHccCChh----HHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCc Q lcl|Aclame:pro 81 --------DKMTNFIESQNAVTEFFDVLKKNSGKSE----IKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNP 148 (400) Q Consensus 81 --------~emtEfLkTkqA~~dya~ll~~nqg~ke----~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~ 148 (400) ..+.+|++.+.....-.. ......+ ............|++..+.-..+|..+...|-+.+.++.+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~liP~~~~~~ii~~~~~~~~ 157 (394) T protein:vir:97 81 TQEEKTYRESVNDFIRSKGKIVNDSL---RFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVD 157 (394) T ss_pred chhhHHHHHHHHHHHHHHHHHhhhhh---hhhhHHHHHHHHHhhhhhhhhccccccccccccChHHHHHHHHHHhhhhhh Confidence 122334443332221111 1111111 1112222233336665555557999999999999999888 Q ss_pred cccceeeecccceeeEEee-cc-ccccceecccchhhh-hhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHH Q lcl|Aclame:pro 149 VFKVFHVTNVGALLVSRSF-DS-ANEAQVHKDGQTKTE-QAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVA 225 (400) Q Consensus 149 vl~~fhV~n~~~~a~~i~l-~n-a~~a~GHk~ga~Kk~-q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ 225 (400) +.+..++.+.+.....+.. +. ...+..+.-|.++++ ...+|...++.|..++.+..+.+-+.+-.+ -++.+|+++ T Consensus 158 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~i~is~ell~ds~--~~~~~~i~~ 235 (394) T protein:vir:97 158 LKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDAD--VDLVGIVSE 235 (394) T ss_pred hhhhceeeeccCcceEEEEEecCCCccceecccccccccccccceeEEeehhheeeehhhHHHHHhhhh--HHHHHHHHH Confidence 8887777666654433332 22 234444666677765 568999999999999988777443333222 368999999 Q ss_pred HHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceecccccceEEEEecchhHH Q lcl|Aclame:pro 226 ELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKAEDRKA 305 (400) Q Consensus 226 ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~~~~~~l~~~~~d~~a 305 (400) +|++.+. ++.+.+++.|+|.... +.+.+.|++..++....+.+....+++|+.+..+ T Consensus 236 ~la~~~~-~~~~~~i~~g~~~~~~----------------------~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~ 292 (394) T protein:vir:97 236 SISQIKV-NTTNDAIAKVLKSFTT----------------------KTVKNLDEIKALLNGGFDPAYNVSLIVSQSFYQT 292 (394) T ss_pred HHHHHHH-HHHHHHHhhccccccc----------------------cccccHHHHHHHHHhhhhhhhCCEEEEcHHHHHH Confidence 9999999 7999999998876321 1234556777777544444555678999999999 Q ss_pred HHhhhhhccccccceeecCCcceeehhhccccceec--chhhh-chhhhhcccc--eechhh-ceeccc----eee---- Q lcl|Aclame:pro 306 LLDELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKAL-KPTVLVDQKY--HIDMQD-LTKVDA----FEW---- 371 (400) Q Consensus 306 ~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l-~~t~~vd~k~--~~~~~~-~~~~~~----~~~---- 371 (400) |+. |+|.+|.+.+..++.+-.-.| |..-. .|...++.+. --|++. |+-.+. ..| T Consensus 293 l~~------------lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~~ 360 (394) T protein:vir:97 293 LDT------------LKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNE 360 (394) T ss_pred HHH------------hhccCCCeeeecCcCCCCCceeccceeEEecccccCCccEEEeeccccEEEEEecceEEEEeccc Confidence 887 889999988754433221111 43221 2233333332 234433 221111 111 Q ss_pred eecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 372 KTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 372 ~~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) .+. ..+.++.--.|-|---.+-..++++ T Consensus 361 ~~~-~~~~~~~r~d~~v~~~~a~~~~~~~ 388 (394) T protein:vir:97 361 IYG-QYLQAVLRFGVSKVDDKAGYYVTFT 388 (394) T ss_pred ccc-eeEEEEEEEccEEecccceEEEEec Confidence 111 1223333334444444444445555 No 9 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=99.67 E-value=3.9e-18 Score=116.03 Aligned_cols=341 Identities=9% Similarity=0.047 Sum_probs=192.6 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHhhhcchhHHHH-- Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMTN-- 85 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kEk~K~k~emtE-- 85 (400) |+ .++|-.+.++++++.+..++.+++....+ ...+..++++++..++++..+++.....++...+.......... T Consensus 1 Mk--~~~el~~~~~~~~~~~~~l~~~~~~~~~~-~~~~~ee~~~~~~~i~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ 77 (397) T protein:vir:49 1 MK--TSNELHDLWVAQGDKVENLNEKLNVAMLD-DSVSAEELQAIKNERDTAKMKRDMFKEQYTEARANEVANMSEEEKK 77 (397) T ss_pred Cc--hHHHHHHHHHHHHHHHHHHHHHHHHHHhh-hhcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Confidence 66 34455566666666776666666654322 22233456666677776666665433333321110000000000 Q ss_pred HH--HHHHHHHHHHHHHHHccCChhHHHHHHHHHHhC---------ccchhhhHhhcchhHHHHHHHHHHhhCcccccee Q lcl|Aclame:pro 86 FI--ESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAEN---------GVTITDTTFQLPRKLVESINTALLNTNPVFKVFH 154 (400) Q Consensus 86 fL--kTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ek---------gV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fh 154 (400) .+ ...+.. .+..++|...|... +.+..+--.++|..+...|-+.+.++.++++... T Consensus 78 ~~~~~~~~~~-------------~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~ 144 (397) T protein:vir:49 78 PLTKSEEEVK-------------AGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVN 144 (397) T ss_pred ccccchhHHH-------------HHHHHHHHHHHhcchhHHHHHhhccccccCcccccHhHHHHHHHHHHhhhhHHhhhc Confidence 00 000001 12223333333221 1222222336899999999999999999888767 Q ss_pred eeccccee--eEEee-ccc-cccceecccchhhh-hhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHH Q lcl|Aclame:pro 155 VTNVGALL--VSRSF-DSA-NEAQVHKDGQTKTE-QAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQ 229 (400) Q Consensus 155 V~n~~~~a--~~i~l-~na-~~a~GHk~ga~Kk~-q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq 229 (400) +.+.+... ..+.. .+. ..+..+--|+++.+ ...+|...++.|..+|....+.+.+.+ .+.-++.+|++++|+. T Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~l~~ 222 (397) T protein:vir:49 145 VENVTTLTGSRVYEKWTDITGLANIDDEAGKIADVDDPKLSLIKYTIKRYAGISTVTNSLLA--DSAENILAWLSGWIAK 222 (397) T ss_pred eeecccCccceEEEeeccCCcceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHh--hhHHHHHHHHHHHHHH Confidence 76665432 22222 222 23455666777776 468999999999999988777544433 2224679999999999 Q ss_pred HHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhc-eecccccceEEEEecchhHHHHh Q lcl|Aclame:pro 230 AIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVD-FVRPTAGRRYLIVKAEDRKALLD 308 (400) Q Consensus 230 ~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald-~~~~~~~~~~l~~~~~d~~a~~~ 308 (400) .+. |.++.+++.|||+... .+.+...|++..++. +...-.....+++++.++.+|+. T Consensus 223 ~~~-~~~d~ai~~G~g~~~~---------------------~~~~~~~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~l~~ 280 (397) T protein:vir:49 223 KVV-VTRNKAILEAIAALPT---------------------KPTLTKWDDIIDLEAKVDPAIKQTSFFLTNTSGFTALKK 280 (397) T ss_pred HHH-HHHHHHHHhhcccccc---------------------ccccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHH Confidence 998 7999999999998432 122344566666652 22222334578899999998887 Q ss_pred hhhhccccccceeecCCcceeehhhccccceec--ch------hhhchhhhhccccee--chhhceecc-cee------- Q lcl|Aclame:pro 309 ELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GS------KALKPTVLVDQKYHI--DMQDLTKVD-AFE------- 370 (400) Q Consensus 309 ~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~------k~l~~t~~vd~k~~~--~~~~~~~~~-~~~------- 370 (400) |+|.+|++.+..++..-+-+| |. ....|+...+....+ |++.+.... ..+ T Consensus 281 ------------lkd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~ 348 (397) T protein:vir:49 281 ------------VKNALGDYLMERDVKSPTGYSIDGFAVKEVADRWLANGTGGAMPLYFGDLKQAVTLFDRQHMSLLSTN 348 (397) T ss_pred ------------hhcCCCceeeccCcCCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecceEEEEec Confidence 899999998865543322111 43 233455444443222 666533221 111 Q ss_pred ---eeecCc--eEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 371 ---WKTNSN--MILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 371 ---~~~~~~--~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) +.|..| .+.++.--.|-+--.++-.+++++ T Consensus 349 ~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~ 383 (397) T protein:vir:49 349 IGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFK 383 (397) T ss_pred cccchhhcCceeEEEEeeeCcEEecccceEEEEee Confidence 123333 355666666666666666666666 No 10 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=99.67 E-value=3.9e-17 Score=110.56 Aligned_cols=367 Identities=13% Similarity=0.082 Sum_probs=199.8 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHh--------hhcc Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE--------KPKG 79 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kE--------k~K~ 79 (400) ||+ ++|-.++|+++++.....++.+...-.++ +..+.+++++.+++++++|.+.++.++.+++ .+.. T Consensus 1 mk~--~~el~~~l~el~~~~~~~~~e~~~~l~~~---~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 75 (415) T protein:vir:98 1 MKT--KEELQSEISDIKRQIDLKVKYATRALNND---ELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSV 75 (415) T ss_pred Cch--HHHHHHHHHHHHHHHHHHHHHHHHHhchH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccc Confidence 554 34445566666666655555444332222 2345666777777777777655555443331 1111 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhC------ccchhhhHhhcchhHHHHHHHHHHhhCccccce Q lcl|Aclame:pro 80 KDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAEN------GVTITDTTFQLPRKLVESINTALLNTNPVFKVF 153 (400) Q Consensus 80 k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ek------gV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~f 153 (400) ........+...........+...+......++|...+... ++.+.+-..++|..+...|-+.++++.++++.. T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~ 155 (415) T protein:vir:98 76 EVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYV 155 (415) T ss_pred ccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhhe Confidence 11111111122222222233344444556677777766554 222223333799999999999999999998877 Q ss_pred eeecccceee--EEeeccc-cccceecccchhhhh-hhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHH Q lcl|Aclame:pro 154 HVTNVGALLV--SRSFDSA-NEAQVHKDGQTKTEQ-AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQ 229 (400) Q Consensus 154 hV~n~~~~a~--~i~l~na-~~a~GHk~ga~Kk~q-~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq 229 (400) .|.+.+.... .+.-.+. ..+..+.-|.+.++. ..+|...++.|..++.+..+.+-+.+-.. -++.+|++++|+. T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~ 233 (415) T protein:vir:98 156 TVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMAR 233 (415) T ss_pred eeeeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhch--HHHHHHHHHHHHH Confidence 7766654432 2322332 233344556666654 46899999999999988777444333212 3579999999999 Q ss_pred HHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceecc-cccceEEEEecchhHHHHh Q lcl|Aclame:pro 230 AIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRP-TAGRRYLIVKAEDRKALLD 308 (400) Q Consensus 230 ~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~-~~~~~~l~~~~~d~~a~~~ 308 (400) .+. ++++.+++.|+|...... ...+... ...++..+.+...|++..++.-..+ -.....+++|+.++.+|+. T Consensus 234 ~~~-~~~~~~il~g~g~g~~~~-~~~~~~~-----~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~ 306 (415) T protein:vir:98 234 TIA-ATRNKAIIDVITKGSTGS-TSSGFEK-----EGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDK 306 (415) T ss_pred HHH-HHHHHHHhhccccCcccc-ccccccc-----cccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHH Confidence 998 799999999998733111 1111111 1123344556677888888844333 2334468999999999887 Q ss_pred hhhhccccccceeecCCcceeehhhccccceec--chhhhc-hhhhhcc-----cceechhhceec-cceeee------- Q lcl|Aclame:pro 309 ELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKALK-PTVLVDQ-----KYHIDMQDLTKV-DAFEWK------- 372 (400) Q Consensus 309 ~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l~-~t~~vd~-----k~~~~~~~~~~~-~~~~~~------- 372 (400) ++|.+|++.+.-++.+-+.+| |..-.+ |...+.. -+--|++++... +.-++. T Consensus 307 ------------lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~~ 374 (415) T protein:vir:98 307 ------------MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM 374 (415) T ss_pred ------------hhccCCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEeccc Confidence 899999988755443332222 332111 1111100 011244442211 111111 Q ss_pred ecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 373 TNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 373 ~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) ..+--+.++..-.|.|--.++-.+++++ T Consensus 375 ~~~~~~~~~~r~d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:98 375 HFGECLMIAVRQDCRILDYKSAIVIEYD 402 (415) T ss_pred cCceEEEEEEEeccEEeccccEEEEEEe Confidence 0111133444445666555555566665 No 11 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=99.67 E-value=3.9e-17 Score=110.56 Aligned_cols=367 Identities=13% Similarity=0.082 Sum_probs=199.8 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHh--------hhcc Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE--------KPKG 79 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kE--------k~K~ 79 (400) ||+ ++|-.++|+++++.....++.+...-.++ +..+.+++++.+++++++|.+.++.++.+++ .+.. T Consensus 1 mk~--~~el~~~l~el~~~~~~~~~e~~~~l~~~---~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 75 (415) T protein:vir:81 1 MKT--KEELQSEISDIKRQIDLKVKYATRALNND---ELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSV 75 (415) T ss_pred Cch--HHHHHHHHHHHHHHHHHHHHHHHHHhchH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccc Confidence 554 34445566666666655555444332222 2345666777777777777655555443331 1111 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhC------ccchhhhHhhcchhHHHHHHHHHHhhCccccce Q lcl|Aclame:pro 80 KDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAEN------GVTITDTTFQLPRKLVESINTALLNTNPVFKVF 153 (400) Q Consensus 80 k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ek------gV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~f 153 (400) ........+...........+...+......++|...+... ++.+.+-..++|..+...|-+.++++.++++.. T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~ 155 (415) T protein:vir:81 76 EVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYV 155 (415) T ss_pred ccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhhe Confidence 11111111122222222233344444556677777766554 222223333799999999999999999998877 Q ss_pred eeecccceee--EEeeccc-cccceecccchhhhh-hhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHH Q lcl|Aclame:pro 154 HVTNVGALLV--SRSFDSA-NEAQVHKDGQTKTEQ-AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQ 229 (400) Q Consensus 154 hV~n~~~~a~--~i~l~na-~~a~GHk~ga~Kk~q-~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq 229 (400) .|.+.+.... .+.-.+. ..+..+.-|.+.++. ..+|...++.|..++.+..+.+-+.+-.. -++.+|++++|+. T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~ 233 (415) T protein:vir:81 156 TVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMAR 233 (415) T ss_pred eeeeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhch--HHHHHHHHHHHHH Confidence 7766654432 2322332 233344556666654 46899999999999988777444333212 3579999999999 Q ss_pred HHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceecc-cccceEEEEecchhHHHHh Q lcl|Aclame:pro 230 AIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRP-TAGRRYLIVKAEDRKALLD 308 (400) Q Consensus 230 ~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~-~~~~~~l~~~~~d~~a~~~ 308 (400) .+. ++++.+++.|+|...... ...+... ...++..+.+...|++..++.-..+ -.....+++|+.++.+|+. T Consensus 234 ~~~-~~~~~~il~g~g~g~~~~-~~~~~~~-----~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~ 306 (415) T protein:vir:81 234 TIA-ATRNKAIIDVITKGSTGS-TSSGFEK-----EGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDK 306 (415) T ss_pred HHH-HHHHHHHhhccccCcccc-ccccccc-----cccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHH Confidence 998 799999999998733111 1111111 1123344556677888888844333 2334468999999999887 Q ss_pred hhhhccccccceeecCCcceeehhhccccceec--chhhhc-hhhhhcc-----cceechhhceec-cceeee------- Q lcl|Aclame:pro 309 ELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKALK-PTVLVDQ-----KYHIDMQDLTKV-DAFEWK------- 372 (400) Q Consensus 309 ~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l~-~t~~vd~-----k~~~~~~~~~~~-~~~~~~------- 372 (400) ++|.+|++.+.-++.+-+.+| |..-.+ |...+.. -+--|++++... +.-++. T Consensus 307 ------------lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~~ 374 (415) T protein:vir:81 307 ------------MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM 374 (415) T ss_pred ------------hhccCCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEeccc Confidence 899999988755443332222 332111 1111100 011244442211 111111 Q ss_pred ecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 373 TNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 373 ~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) ..+--+.++..-.|.|--.++-.+++++ T Consensus 375 ~~~~~~~~~~r~d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:81 375 HFGECLMIAVRQDCRILDYKSAIVIEYD 402 (415) T ss_pred cCceEEEEEEEeccEEeccccEEEEEEe Confidence 0111133444445666555555566665 No 12 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=99.67 E-value=3.9e-17 Score=110.56 Aligned_cols=367 Identities=13% Similarity=0.082 Sum_probs=199.8 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHh--------hhcc Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE--------KPKG 79 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kE--------k~K~ 79 (400) ||+ ++|-.++|+++++.....++.+...-.++ +..+.+++++.+++++++|.+.++.++.+++ .+.. T Consensus 1 mk~--~~el~~~l~el~~~~~~~~~e~~~~l~~~---~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 75 (415) T protein:vir:79 1 MKT--KEELQSEISDIKRQIDLKVKYATRALNND---ELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSV 75 (415) T ss_pred Cch--HHHHHHHHHHHHHHHHHHHHHHHHHhchH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccc Confidence 554 34445566666666655555444332222 2345666777777777777655555443331 1111 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhC------ccchhhhHhhcchhHHHHHHHHHHhhCccccce Q lcl|Aclame:pro 80 KDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAEN------GVTITDTTFQLPRKLVESINTALLNTNPVFKVF 153 (400) Q Consensus 80 k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ek------gV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~f 153 (400) ........+...........+...+......++|...+... ++.+.+-..++|..+...|-+.++++.++++.. T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~ 155 (415) T protein:vir:79 76 EVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYV 155 (415) T ss_pred ccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhhe Confidence 11111111122222222233344444556677777766554 222223333799999999999999999998877 Q ss_pred eeecccceee--EEeeccc-cccceecccchhhhh-hhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHH Q lcl|Aclame:pro 154 HVTNVGALLV--SRSFDSA-NEAQVHKDGQTKTEQ-AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQ 229 (400) Q Consensus 154 hV~n~~~~a~--~i~l~na-~~a~GHk~ga~Kk~q-~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq 229 (400) .|.+.+.... .+.-.+. ..+..+.-|.+.++. ..+|...++.|..++.+..+.+-+.+-.. -++.+|++++|+. T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~ 233 (415) T protein:vir:79 156 TVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMAR 233 (415) T ss_pred eeeeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhch--HHHHHHHHHHHHH Confidence 7766654432 2322332 233344556666654 46899999999999988777444333212 3579999999999 Q ss_pred HHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceecc-cccceEEEEecchhHHHHh Q lcl|Aclame:pro 230 AIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRP-TAGRRYLIVKAEDRKALLD 308 (400) Q Consensus 230 ~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~-~~~~~~l~~~~~d~~a~~~ 308 (400) .+. ++++.+++.|+|...... ...+... ...++..+.+...|++..++.-..+ -.....+++|+.++.+|+. T Consensus 234 ~~~-~~~~~~il~g~g~g~~~~-~~~~~~~-----~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~ 306 (415) T protein:vir:79 234 TIA-ATRNKAIIDVITKGSTGS-TSSGFEK-----EGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDK 306 (415) T ss_pred HHH-HHHHHHHhhccccCcccc-ccccccc-----cccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHH Confidence 998 799999999998733111 1111111 1123344556677888888844333 2334468999999999887 Q ss_pred hhhhccccccceeecCCcceeehhhccccceec--chhhhc-hhhhhcc-----cceechhhceec-cceeee------- Q lcl|Aclame:pro 309 ELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKALK-PTVLVDQ-----KYHIDMQDLTKV-DAFEWK------- 372 (400) Q Consensus 309 ~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l~-~t~~vd~-----k~~~~~~~~~~~-~~~~~~------- 372 (400) ++|.+|++.+.-++.+-+.+| |..-.+ |...+.. -+--|++++... +.-++. T Consensus 307 ------------lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~~ 374 (415) T protein:vir:79 307 ------------MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM 374 (415) T ss_pred ------------hhccCCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEeccc Confidence 899999988755443332222 332111 1111100 011244442211 111111 Q ss_pred ecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 373 TNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 373 ~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) ..+--+.++..-.|.|--.++-.+++++ T Consensus 375 ~~~~~~~~~~r~d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:79 375 HFGECLMIAVRQDCRILDYKSAIVIEYD 402 (415) T ss_pred cCceEEEEEEEeccEEeccccEEEEEEe Confidence 0111133444445666555555566665 No 13 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=99.66 E-value=1.5e-17 Score=112.90 Aligned_cols=340 Identities=11% Similarity=0.066 Sum_probs=185.3 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHhhhcchhHHH--H Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMT--N 85 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kEk~K~k~emt--E 85 (400) |++ +++-.++++++++.++++...++....++... ..+++.++..++.+.++++....++....+.+....... . T Consensus 1 Mk~--~~eL~~~~~~~~~~~~~l~~~~~~~~~~~~~~-~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (397) T protein:vir:49 1 MKT--SNELHDLWIAQGDKVENLNEKLNVAMLDDSVS-AEELQAIKNERDTAKMKRDLFKEQYTEARANEVANMSEEEKK 77 (397) T ss_pred Cch--HHHHHHHHHHHHHHHHHHHHHHHHHHhcchhh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Confidence 653 33444555666666666665555443222221 245666666666666666554444333332111100000 0 Q ss_pred H--HHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhC---------ccchhhhHhhcchhHHHHHHHHHHhhCcccccee Q lcl|Aclame:pro 86 F--IESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAEN---------GVTITDTTFQLPRKLVESINTALLNTNPVFKVFH 154 (400) Q Consensus 86 f--LkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ek---------gV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fh 154 (400) . -..++.. .+..++|...|... ..++.+--..+|..+...|-+.++++.++++... T Consensus 78 ~~~~~~~~~~-------------~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~ 144 (397) T protein:vir:49 78 PLTKNEEEVK-------------ANFVKDFKNLVRGRYQNLLDSKTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVN 144 (397) T ss_pred cccchhhHHH-------------HHHHHHHHHHhhcchhhHHHhhhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcc Confidence 0 0000000 12222333322211 1112222236899999999999999898877666 Q ss_pred eecccceee--EEeec-c--ccccceecccchhhhh-hhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHH Q lcl|Aclame:pro 155 VTNVGALLV--SRSFD-S--ANEAQVHKDGQTKTEQ-AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELT 228 (400) Q Consensus 155 V~n~~~~a~--~i~l~-n--a~~a~GHk~ga~Kk~q-~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELa 228 (400) |.+.+...- .+... + ....|++- |.++++. ..+|...++.|..++.+..+.+.+.+- +.-++.+|++++|+ T Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~a~~v~E-~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~l~ 221 (397) T protein:vir:49 145 VENVTTLTGSRVYEKWADITGLAKLDDE-GGQIGQNDDPKLSLIRYAIKRYAGISTVTNSLLAD--SAENILAWLSGWIA 221 (397) T ss_pred eeeccCCcceEEEEeeccCCcceeeecc-ccccccccccceeeeEeeeeeeEeehhhHHHHHhh--hhHHHHHHHHHHHH Confidence 666654432 22222 2 23345544 4444444 458999999999999888885443332 22467999999999 Q ss_pred HHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceeccc-ccceEEEEecchhHHHH Q lcl|Aclame:pro 229 QAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPT-AGRRYLIVKAEDRKALL 307 (400) Q Consensus 229 q~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~-~~~~~l~~~~~d~~a~~ 307 (400) ..+- |+++.+++.|||+... .+...+.|++..++.-+.|- .....+++++.++.+|+ T Consensus 222 ~~~~-~~~d~ail~G~g~~~~---------------------~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~ 279 (397) T protein:vir:49 222 KKVV-VTRNKAILEAIGTLPN---------------------KPTLAKWDDIIDLQAKVDPAIKQTSLFLTNTSGFTALK 279 (397) T ss_pred HHHH-HHHHHHHHhccccccc---------------------cccccCHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHH Confidence 9998 7999999999998432 22334556676665222221 22347889999999988 Q ss_pred hhhhhccccccceeecCCcceeehhhccccceec--chhhh------chhhhhccc--ceechhhce-eccce------- Q lcl|Aclame:pro 308 DELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKAL------KPTVLVDQK--YHIDMQDLT-KVDAF------- 369 (400) Q Consensus 308 ~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l------~~t~~vd~k--~~~~~~~~~-~~~~~------- 369 (400) . |+|.+|++.+..++..-+-.| |..-. .|++..+.. +-.|++.+. -.+.. T Consensus 280 ~------------lkd~~g~~l~~~~~~~g~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~ 347 (397) T protein:vir:49 280 K------------VKNAMGDYLMERDVKSPTGYSIDGFVVKEISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQHLSLLST 347 (397) T ss_pred H------------hhccCCceeecccccCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecccEEEEe Confidence 8 899999988754443322111 44221 233322221 222555422 22211 Q ss_pred ---eeeecCc--eEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 370 ---EWKTNSN--MILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 370 ---~~~~~~~--~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) ++.|..| .+.++.--.|-+-..++-.++++. T Consensus 348 ~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~ 383 (397) T protein:vir:49 348 NIGGGAFETDTTKVRVIDRFDVVSTDTEAFVPASFK 383 (397) T ss_pred ccccchhhcCeeeEEEEEeeccEEecccceEEEEec Confidence 1123344 456677777777666666677766 No 14 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=99.64 E-value=3.3e-17 Score=110.94 Aligned_cols=337 Identities=9% Similarity=0.041 Sum_probs=183.8 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHh--------hhcc Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE--------KPKG 79 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kE--------k~K~ 79 (400) |+. +++-.+++.++++.+..+..+++.....+.... .+++.++..+..++.+++..+........ ..+. T Consensus 1 Mk~--~~el~~~~~~~~~~i~~~~~~~~~~~~~~~~~~-ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (397) T protein:vir:48 1 MKT--SNELHDLWVAQGDKVENLNEKLNVAMLDDSVTA-EELQAIKNERDTAKMKRDMFKEQYTEARANEVVNMSEEEKK 77 (397) T ss_pred Cch--HHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhccc Confidence 543 344456666777777777777766654443222 45666666666666665432222221111 0000 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhC---------ccchhhhHhhcchhHHHHHHHHHHhhCccc Q lcl|Aclame:pro 80 KDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAEN---------GVTITDTTFQLPRKLVESINTALLNTNPVF 150 (400) Q Consensus 80 k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ek---------gV~~qd~~eiLP~~iI~AIe~A~ed~d~vl 150 (400) ..+..+.-.+. +..++|.+.|... ..++++--.++|..+...|-+.++++.+++ T Consensus 78 ~~~~~~~~~~~-----------------~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~ 140 (397) T protein:vir:48 78 PLTKSEEEVKA-----------------GFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQ 140 (397) T ss_pred cccchhhHHHH-----------------HHHHHHHHHHhhhhhHHHHHhhccCCccccccccHHHHHHHHHHHHHHHHHH Confidence 00000111111 1222222222221 122223333789999999999999999998 Q ss_pred cceeeecccceee--EEee-ccc-cccceecccchhhh-hhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHH Q lcl|Aclame:pro 151 KVFHVTNVGALLV--SRSF-DSA-NEAQVHKDGQTKTE-QAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVA 225 (400) Q Consensus 151 ~~fhV~n~~~~a~--~i~l-~na-~~a~GHk~ga~Kk~-q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ 225 (400) +...+.+.+.... .+.. .+. ..+..+.-|..+.+ ...+|...++.|..++....+.+.+.+- +.-++.+|+++ T Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~d--s~~~l~~~v~~ 218 (397) T protein:vir:48 141 EYVNVENVTTLTGSRVYEKWADITGLAKLDDEAGSIGTNDDPKLYPIRYAIKRYAGISTVTNSLLAD--SAENILAWLSG 218 (397) T ss_pred hhhceeeccCCcceEEEEeecCCCcceeeeccccccccccccceeeEEeeheeeeeehhhHHHHHhh--chHHHHHHHHH Confidence 8766655554322 2222 222 23344444555544 4579999999999999988885544332 22468999999 Q ss_pred HHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhh-ceecccccceEEEEecchhH Q lcl|Aclame:pro 226 ELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKAEDRK 304 (400) Q Consensus 226 ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eal-d~~~~~~~~~~l~~~~~d~~ 304 (400) +|+..+. +.++.+++.|||+... .+.....|++...+ +.-..-.....+++|+.++. T Consensus 219 ~l~~~~~-~~~d~~il~G~g~~~~---------------------~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~ 276 (397) T protein:vir:48 219 WIAKKVV-VTRNKAILEAIATLPT---------------------KPTLTKWDDIIDLQAKVDPAIKQTSFFLTNTSGFT 276 (397) T ss_pred HHHHHHH-HHHHHHHhhccccccc---------------------ccccccHHHHHHHHHHhhhhhcCCCEEEECHHHHH Confidence 9999998 7999999999998542 12233445555444 11111122347889999999 Q ss_pred HHHhhhhhccccccceeecCCcceeehhhccccceec--ch------hhhchhhhhccccee--chhhceeccce-e--- Q lcl|Aclame:pro 305 ALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GS------KALKPTVLVDQKYHI--DMQDLTKVDAF-E--- 370 (400) Q Consensus 305 a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~------k~l~~t~~vd~k~~~--~~~~~~~~~~~-~--- 370 (400) +|+- |+|.+|.+.+..++..-+-.| |. ....|+...+....+ |++.+...+.+ + T Consensus 277 ~L~~------------lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i 344 (397) T protein:vir:48 277 ALKK------------VKNAFGDYLMERDVKSPTGYSIDGFAVKEVADRWLANASSGAMPLYFGDLKQAVTLFDRQQMSL 344 (397) T ss_pred HHHH------------hhcCCCceeeccCcCCCCCceeccceeEEecccccCCcCCCceEEEEEeccceEEEEeecceEE Confidence 8887 899999998865544432222 43 223344444443333 66654332211 1 Q ss_pred ------e-eecCceE--EEEecccccceeeccceeEeeC Q lcl|Aclame:pro 371 ------W-KTNSNMI--LVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 371 ------~-~~~~~~i--lve~~~~~~~~~~~~~~~~~~~ 400 (400) | .|..|++ .++.--.|.+---++-.++++. T Consensus 345 ~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~ 383 (397) T protein:vir:48 345 LSTNIGGGAFETDTTKIRVIDRFDVVATDTESFVPASFK 383 (397) T ss_pred EEeccchhhhhcCceeEEEEeeeccEEecccceEEEEec Confidence 1 1334433 3444445554333333344443 No 15 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=99.64 E-value=2.2e-17 Score=111.88 Aligned_cols=354 Identities=12% Similarity=0.079 Sum_probs=191.1 Q ss_pred CcccccccchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHhhhcc- Q lcl|Aclame:pro 1 MRISKRNMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKG- 79 (400) Q Consensus 1 ~~~s~~~~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kEk~K~- 79 (400) |-+ +-+++|-.++++++.+.+..++.++... .++.......++|+.+.++++..+++..++.+....+.+.. T Consensus 1 m~~------~m~i~el~~~~~~~~~~~~~~~~e~~~~-~~~~~~~~e~i~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (408) T protein:vir:74 1 MGV------KLTVNQLNEAWIASGDKVTDFNDQINMA-LNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVN 73 (408) T ss_pred CCh------hhhHHHHHHHHHHHHHHHHHHHHHHHHH-HhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 322 1256666677777777777777776653 22233333456666666666666665544444332211000 Q ss_pred ---hhHHHHHHHHHHHHHHHHHHHHHc-cCChhHHHHHHHHHHhC-ccchhhhHhhcchhHHHHHHHHHHhhCcccccee Q lcl|Aclame:pro 80 ---KDKMTNFIESQNAVTEFFDVLKKN-SGKSEIKNAWSAKLAEN-GVTITDTTFQLPRKLVESINTALLNTNPVFKVFH 154 (400) Q Consensus 80 ---k~emtEfLkTkqA~~dya~ll~~n-qg~ke~k~AW~a~L~ek-gV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fh 154 (400) ......-....+....|.+-+... .+...... ......- ..+..+--.++|..+...|-+.++++.++++..+ T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~a~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~ 151 (408) T protein:vir:74 74 MREEEKGPLNKSENELKDKFVKDFVNMVRNPMAFLN--TVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVR 151 (408) T ss_pred ccccccccccchhhhhHHHHHHHHHHHHhcchhhhh--hhhhhhhcccccCCCceeechhHhhHHHHHHhhhcchhhhcc Confidence 000001111222222222211111 11111100 0111110 1112222336899999999999999999988777 Q ss_pred eecccceeeEEee--cc--cc-ccceecccchhhh-hhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHH Q lcl|Aclame:pro 155 VTNVGALLVSRSF--DS--AN-EAQVHKDGQTKTE-QAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELT 228 (400) Q Consensus 155 V~n~~~~a~~i~l--~n--a~-~a~GHk~ga~Kk~-q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELa 228 (400) +.+.+.....+.. .+ .. ..|+. -|+++.+ ...+|...++.|..++....+.+.+.+ .+.-++.+|++++|+ T Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~~~~v~-E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~l~ 228 (408) T protein:vir:74 152 VESVSTSSGSRVYEKWTDVTPLKAMDE-EDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLK--DTAENILAWLSSWIA 228 (408) T ss_pred eeeccCCcceEEEEeecCCcccccccc-cccccccccccceeeEEeeeeeEEeeehhHHHHHh--hchHHHHHHHHHHHH Confidence 7666554433222 22 12 22333 2344454 568999999999999988777444333 222468999999999 Q ss_pred HHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhc-eeccc-ccceEEEEecchhHHH Q lcl|Aclame:pro 229 QAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVD-FVRPT-AGRRYLIVKAEDRKAL 306 (400) Q Consensus 229 q~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald-~~~~~-~~~~~l~~~~~d~~a~ 306 (400) .++. +.++.++++|||+... .+.+.+.+++..++. ...|. .+..++++|+.++.+| T Consensus 229 ~~~~-~~~d~~il~G~G~~~~---------------------~~~~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~l 286 (408) T protein:vir:74 229 KKVV-VTRNQAIIAAMGTVPK---------------------KPTIANFDDVITMINTSVDPAIIATSSLLTNQSGLNKL 286 (408) T ss_pred HHHH-HHHHHHHhhccccccc---------------------ccccccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHH Confidence 9998 7999999999998431 233445667776663 22222 1234688899999999 Q ss_pred HhhhhhccccccceeecCCcceeehhhccccceec--chh------hhchhhhhccccee--chhhceecc-cee----- Q lcl|Aclame:pro 307 LDELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSK------ALKPTVLVDQKYHI--DMQDLTKVD-AFE----- 370 (400) Q Consensus 307 ~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k------~l~~t~~vd~k~~~--~~~~~~~~~-~~~----- 370 (400) +- ++|.+|.+.+..++.+-+-+| |.. ...|++..+....+ |++.+.... ..+ T Consensus 287 ~~------------lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~ 354 (408) T protein:vir:74 287 AL------------VKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLP 354 (408) T ss_pred HH------------hhcCCCceEeccCcCCCCCceecceeeEEecCcccccccCCcceEEEEehhccEEEEEecceEEEE Confidence 87 889999988765443322222 543 23455443333222 666533221 111 Q ss_pred -------eeecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 371 -------WKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 371 -------~~~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) |..++-.+.++.--.|-+--.++-.++++. T Consensus 355 ~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~ 391 (408) T protein:vir:74 355 TNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFT 391 (408) T ss_pred eccccchhhcceeeEEEEEeeCcEEecccceEEEEee Confidence 222333355555555555555555556554 No 16 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=99.63 E-value=1.3e-16 Score=107.72 Aligned_cols=363 Identities=13% Similarity=0.094 Sum_probs=190.8 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHhhhcc-------- Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKG-------- 79 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kEk~K~-------- 79 (400) ||+. +|-..+|.++++.+...+..+...-.+.. +.+.+.+++.++++.++|.+.+..++..+++... T Consensus 1 mk~~--~em~~~l~el~~~~~~~~~e~~~~~~~~~---~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 75 (415) T protein:vir:46 1 MKTK--EELQSEISDIKRQIDLKVKYATRALNNDE---LEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSV 75 (415) T ss_pred CchH--HHHHHHHHHHHHHHHHHHHHHHHHhchhh---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc Confidence 5543 33344566666655555554443322221 3345556666666666665544444433321111 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhC-----cc-chhhhHhhcchhHHHHHHHHHHhhCccccce Q lcl|Aclame:pro 80 KDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAEN-----GV-TITDTTFQLPRKLVESINTALLNTNPVFKVF 153 (400) Q Consensus 80 k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ek-----gV-~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~f 153 (400) ...-................+..........++|...+... +. .+.+--.++|..+...|-+.+.++.++++.. T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~ 155 (415) T protein:vir:46 76 EVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYV 155 (415) T ss_pred ccchhhhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhc Confidence 10110111111111112222222233344555555544332 22 2223333799999999999999999998876 Q ss_pred eeecccceee--EEeeccc-cccceecccchhhh-hhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHH Q lcl|Aclame:pro 154 HVTNVGALLV--SRSFDSA-NEAQVHKDGQTKTE-QAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQ 229 (400) Q Consensus 154 hV~n~~~~a~--~i~l~na-~~a~GHk~ga~Kk~-q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq 229 (400) .+.+.+.... .+...+. ..+..+.-|.++.+ ...+|...++.|..++....+.+.+.+-.. -++.+|++++|+. T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~ 233 (415) T protein:vir:46 156 TVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMAR 233 (415) T ss_pred ceeeccCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhch--HHHHHHHHHHHHH Confidence 6666554432 2222332 23334455666655 457999999999999888777444433222 4679999999999 Q ss_pred HHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhh-ceecccccceEEEEecchhHHHHh Q lcl|Aclame:pro 230 AIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKAEDRKALLD 308 (400) Q Consensus 230 ~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eal-d~~~~~~~~~~l~~~~~d~~a~~~ 308 (400) ++- |+++.++..|||+..... .... . ......+..+.+...+++..++ .+..+-.+...+++|+.++.+|+. T Consensus 234 ~i~-~~~d~~il~g~g~g~~~~--~~~~-~---~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~ 306 (415) T protein:vir:46 234 TIA-ATRNKAIIDVITKGSTGS--TSSG-F---EKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDK 306 (415) T ss_pred HHH-HHHHHHHhhccccCCccc--cccc-c---ccccceeccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHH Confidence 998 799999999998743211 1111 0 1112233445556677888777 344455556689999999998876 Q ss_pred hhhhccccccceeecCCcceeehhhccccceec--chhhhchhhhhccc----------ceechhhceeccc-e----ee Q lcl|Aclame:pro 309 ELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKALKPTVLVDQK----------YHIDMQDLTKVDA-F----EW 371 (400) Q Consensus 309 ~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l~~t~~vd~k----------~~~~~~~~~~~~~-~----~~ 371 (400) |+|.+|++.+.-.+.+-+..| |... ++.|.- +--|++.+..... - .| T Consensus 307 ------------lkd~~G~~i~~~~~~~~~~~~l~G~pV----~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~ 370 (415) T protein:vir:46 307 ------------MKDKLGNYLIQPDVKEKTQQRLLGAKI----EILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASW 370 (415) T ss_pred ------------hhccCCCeeeccCcCCCCCccccceee----EEeccccccCCCccEEEEEehhccEEEEeecceEEEe Confidence 888899988743332221111 3321 111110 1123333221110 0 00 Q ss_pred ---eecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 372 ---KTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 372 ---~~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) ..++-.+.++.--.|.|-..++-.+++++ T Consensus 371 ~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:46 371 TDYMHFGECLMIAVRQDCRILDYKSAIVIEYD 402 (415) T ss_pred eccccCceEEEEEEEeccEEeccccEEEEEee Confidence 11122233444445555555555556655 No 17 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=99.63 E-value=1.3e-16 Score=107.72 Aligned_cols=363 Identities=13% Similarity=0.094 Sum_probs=190.8 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHhhhcc-------- Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKG-------- 79 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kEk~K~-------- 79 (400) ||+. +|-..+|.++++.+...+..+...-.+.. +.+.+.+++.++++.++|.+.+..++..+++... T Consensus 1 mk~~--~em~~~l~el~~~~~~~~~e~~~~~~~~~---~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 75 (415) T protein:vir:47 1 MKTK--EELQSEISDIKRQIDLKVKYATRALNNDE---LEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSV 75 (415) T ss_pred CchH--HHHHHHHHHHHHHHHHHHHHHHHHhchhh---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc Confidence 5543 33344566666655555554443322221 3345556666666666665544444433321111 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhC-----cc-chhhhHhhcchhHHHHHHHHHHhhCccccce Q lcl|Aclame:pro 80 KDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAEN-----GV-TITDTTFQLPRKLVESINTALLNTNPVFKVF 153 (400) Q Consensus 80 k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ek-----gV-~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~f 153 (400) ...-................+..........++|...+... +. .+.+--.++|..+...|-+.+.++.++++.. T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~ 155 (415) T protein:vir:47 76 EVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYV 155 (415) T ss_pred ccchhhhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhc Confidence 10110111111111112222222233344555555544332 22 2223333799999999999999999998876 Q ss_pred eeecccceee--EEeeccc-cccceecccchhhh-hhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHH Q lcl|Aclame:pro 154 HVTNVGALLV--SRSFDSA-NEAQVHKDGQTKTE-QAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQ 229 (400) Q Consensus 154 hV~n~~~~a~--~i~l~na-~~a~GHk~ga~Kk~-q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq 229 (400) .+.+.+.... .+...+. ..+..+.-|.++.+ ...+|...++.|..++....+.+.+.+-.. -++.+|++++|+. T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~ 233 (415) T protein:vir:47 156 TVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK--VNVLQELKLWMAR 233 (415) T ss_pred ceeeccCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhch--HHHHHHHHHHHHH Confidence 6666554432 2222332 23334455666655 457999999999999888777444433222 4679999999999 Q ss_pred HHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhh-ceecccccceEEEEecchhHHHHh Q lcl|Aclame:pro 230 AIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKAEDRKALLD 308 (400) Q Consensus 230 ~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eal-d~~~~~~~~~~l~~~~~d~~a~~~ 308 (400) ++- |+++.++..|||+..... .... . ......+..+.+...+++..++ .+..+-.+...+++|+.++.+|+. T Consensus 234 ~i~-~~~d~~il~g~g~g~~~~--~~~~-~---~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~ 306 (415) T protein:vir:47 234 TIA-ATRNKAIIDVITKGSTGS--TSSG-F---EKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDK 306 (415) T ss_pred HHH-HHHHHHHhhccccCCccc--cccc-c---ccccceeccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHH Confidence 998 799999999998743211 1111 0 1112233445556677888777 344455556689999999998876 Q ss_pred hhhhccccccceeecCCcceeehhhccccceec--chhhhchhhhhccc----------ceechhhceeccc-e----ee Q lcl|Aclame:pro 309 ELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKALKPTVLVDQK----------YHIDMQDLTKVDA-F----EW 371 (400) Q Consensus 309 ~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l~~t~~vd~k----------~~~~~~~~~~~~~-~----~~ 371 (400) |+|.+|++.+.-.+.+-+..| |... ++.|.- +--|++.+..... - .| T Consensus 307 ------------lkd~~G~~i~~~~~~~~~~~~l~G~pV----~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~ 370 (415) T protein:vir:47 307 ------------MKDKLGNYLIQPDVKEKTQQRLLGAKI----EILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASW 370 (415) T ss_pred ------------hhccCCCeeeccCcCCCCCccccceee----EEeccccccCCCccEEEEEehhccEEEEeecceEEEe Confidence 888899988743332221111 3321 111110 1123333221110 0 00 Q ss_pred ---eecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 372 ---KTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 372 ---~~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) ..++-.+.++.--.|.|-..++-.+++++ T Consensus 371 ~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:47 371 TDYMHFGECLMIAVRQDCRILDYKSAIVIEYD 402 (415) T ss_pred eccccCceEEEEEEEeccEEeccccEEEEEee Confidence 11122233444445555555555556655 No 18 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=99.63 E-value=3.3e-16 Score=105.48 Aligned_cols=351 Identities=11% Similarity=0.084 Sum_probs=170.3 Q ss_pred ccccccchh-hHHHHHHHHHHHHHHHHHhhhhhhcchhh-------h-hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 3 ISKRNMNKP-DLIEKQNRLAELKENNVSLKSQISGFEVK-------N-AIEDLPKVQELEKTLSENSIEIIKIENELNAQ 73 (400) Q Consensus 3 ~s~~~~~k~-~~eekq~~lA~lKe~~~~~Ks~i~~~~~~-------~-~~~~~skieElektis~l~aEi~K~enEl~~~ 73 (400) +++++++-. .+.+..+++.+|++....+..+...+... . ..+...+++++++.+.+++.+|.+.+.+...+ T Consensus 1 m~~k~~~l~~~~~el~~~l~eL~e~~~~l~~~~~el~~~~ee~~~~e~~~~~~~~~~~l~~~i~~l~~~i~~~~~~~~~l 80 (397) T protein:vir:96 1 MALKQLILNKQIKERSSEIDKLLSQRSDLEKQENDLERALEEAKTDEEISTVSDSADDLEKQVKDLDEKIAELQKEKQDL 80 (397) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 233222100 11122222333333333333333332210 0 11112345566666666666665444333332 Q ss_pred Hhhhc---chhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhC------ccchhhhHhhcchhHHHHHHHHHH Q lcl|Aclame:pro 74 EEKPK---GKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAEN------GVTITDTTFQLPRKLVESINTALL 144 (400) Q Consensus 74 kEk~K---~k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ek------gV~~qd~~eiLP~~iI~AIe~A~e 144 (400) .+..+ ........-..+.+...+. .......+.+.+....+... +....+-...+|..+...|.+ +. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~-~~ 156 (397) T protein:vir:96 81 EDELAKAADPTDQKPKDGEKRKMKKFK---VTEEELAEKRSAINAFVKSKGAEKRDGFTSVEGGALIPQELLQPQLE-PK 156 (397) T ss_pred HHHHHhhhhhhhhhhHHHHHHHHHHHh---hhhHHHHHHHHHHHHHHHhhhhhhhhcccccccccchhHHHHHHHHH-hh Confidence 21110 0000001111111111111 11111112233333333222 333334444788888888877 45 Q ss_pred hhCccccceeeecccceeeEEee---ccccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHH Q lcl|Aclame:pro 145 NTNPVFKVFHVTNVGALLVSRSF---DSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYN 221 (400) Q Consensus 145 d~d~vl~~fhV~n~~~~a~~i~l---~na~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvn 221 (400) ++..+++...+.+.+.....+.. .+...+|..-.+........+|...++.|..+|....+...+. +.+.-++.+ T Consensus 157 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~~~~~~~~~s~ell--~ds~~~l~~ 234 (397) T protein:vir:96 157 DIVDLSKYVRSVPVNSASGKFPVISKSGSKMATVQQLEKNPQLANPKMVEIDYSVATRRGYIPISQEMI--DDASYDVTG 234 (397) T ss_pred hhhhHHHhhhhccccccceeEEEEeccCCccccccccccccccccccccceeecHhHhhcchhhHHHHH--hhhHHHHHH Confidence 56777776666555443332222 2233445433333333467899999999999998877744332 233346899 Q ss_pred HHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceecccccceEEEEecc Q lcl|Aclame:pro 222 LIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKAE 301 (400) Q Consensus 222 yvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~~~~~~l~~~~~ 301 (400) |+.++|+..+. ++.+.++..|+|... ..++...|++..++....+.+...++++|+. T Consensus 235 ~i~~~l~~~~~-~~~~~~i~~g~g~~~----------------------~~~~~~~d~~~~~~~~~~~~~~~a~~v~n~~ 291 (397) T protein:vir:96 235 LIADEIQDQSL-NTKNADIAAVLKTAT----------------------AKSVVGVDGLKDLINKEIKKVYDVKLFISAS 291 (397) T ss_pred HHHHHHHHHHH-HHHHHHHhhcccccc----------------------cccccchHHHHHHHHHhhhhhcCcEEEEcHH Confidence 99999999999 799999999998632 1224456777777765566666778999999 Q ss_pred hhHHHHhhhhhccccccceeecCCcceeehhhccccceec--chhhhc-----hhhhhccccee--chhhceeccceeee Q lcl|Aclame:pro 302 DRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKALK-----PTVLVDQKYHI--DMQDLTKVDAFEWK 372 (400) Q Consensus 302 d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l~-----~t~~vd~k~~~--~~~~~~~~~~~~~~ 372 (400) ++.+|+. |+|.+|+|.+.-++..-+.+| |....+ |....+....+ |++.+..... T Consensus 292 ~~~~l~~------------lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~---- 355 (397) T protein:vir:96 292 MYSELDK------------LKDKNGRYLLQDSITAASGKQLLGKEVVVLDDDVIGKSVGNVVGFIGDAKAFASFFD---- 355 (397) T ss_pred HHHHHHH------------hhccCCCeEeccCccCCCcccccccceEEecccccCCCCCceEEEEeehhcceEeEe---- Confidence 9999987 899999998744443322222 332221 11111111122 5554322211 Q ss_pred ecCceEEEEecc--------------cccceeeccceeEeeC Q lcl|Aclame:pro 373 TNSNMILVETLT--------------SGHVETYNAGAVITVS 400 (400) Q Consensus 373 ~~~~~ilve~~~--------------~~~~~~~~~~~~~~~~ 400 (400) .+.+-|.+.+ .|-|-.=++..+++++ T Consensus 356 --~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~ 395 (397) T protein:vir:96 356 --RKQVSVSWVDNNIYGQLLAGIIRYDVKATDKKAGFYVTFT 395 (397) T ss_pred --ecceEEEEecccccceeEEEEEEEccEEecccceEEEEee Confidence 1122222222 2222222233334444 No 19 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=99.63 E-value=3.8e-17 Score=110.63 Aligned_cols=346 Identities=13% Similarity=0.087 Sum_probs=177.4 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHhhhcchhHHHHHH Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMTNFI 87 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kEk~K~k~emtEfL 87 (400) |++ -++.+.++++.+.+++.+++... ++......++++++..++.+..+++..+.++..+++..+......... T Consensus 1 M~~-----l~~l~~~~~~~~~e~~~~~~~~~-~~~~~~~ee~~~~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~ 74 (394) T protein:vir:10 1 MDK-----LQTLFNEVSAKCADLNAQLNAKL-QDENASVDDFQKIKDDLTAAKARRDAINDQIKDLEAENKANSDPDKPV 74 (394) T ss_pred ChH-----HHHHHHHHHHHHHHHHHHHHHHH-hhhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhh Confidence 433 12222233333333344333221 111122344555556666666666555444443333222211111111 Q ss_pred HHHHHHHHHHHHHHHccCChhHHHHHHHHHHhC---------ccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecc Q lcl|Aclame:pro 88 ESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAEN---------GVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNV 158 (400) Q Consensus 88 kTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ek---------gV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~ 158 (400) ..++.... + .........+++|.+-|... +....|-...+|..+...|-+.+.++.++++...+.+. T Consensus 75 ~~~~~~~~--~--~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~ 150 (394) T protein:vir:10 75 DNAQPNGT--D--LKKKPIDAKKKAINDFIHSHGKVIDNAAGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPV 150 (394) T ss_pred hhhccccc--c--hhhhHHHHHHHHHHHHHhccchhhhhhhcccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeec Confidence 11100000 0 00000012223333333322 23334444579999999999999999998887666666 Q ss_pred cceeeEEee-c-c-ccccceecccchhhh-hhhhhhhhhccHHHHHHHHHHH-HHHHhhcCchhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 159 GALLVSRSF-D-S-ANEAQVHKDGQTKTE-QAATLTIDTLEPVMVYKLQSLA-ERVKRLQMSYSELYNLIVAELTQAIVN 233 (400) Q Consensus 159 ~~~a~~i~l-~-n-a~~a~GHk~ga~Kk~-q~~~le~~ti~p~~VYkkq~La-d~~k~l~g~ygalvnyvm~ELaq~fI~ 233 (400) +.....+.. + . ...+|.. -|.++.+ +...|...++.|..+|.+..+. +++.+. .-++.+|++++|++.+. T Consensus 151 ~~~~~~~~~~~~~~~~~~~~~-E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds---~~~l~~~i~~~la~~~~- 225 (394) T protein:vir:10 151 TTPKGTYPILKRATDRFSSVA-ELAENPALAEPEFEQVDWSVSTYRGAIPLSEEAIADS---AVDLTSLVGQSINEKSV- 225 (394) T ss_pred cCCceEEEEEecCCCcccccc-ccccccccccccceeEEeeeeeeEeeehhHHHHHhhh---hHHHHHHHHHHHHHHHH- Confidence 554433322 2 2 2334433 3445554 6789999999999998887773 333332 24679999999999999 Q ss_pred HHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceecccccceEEEEecchhHHHHhhhhhc Q lcl|Aclame:pro 234 KIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQA 313 (400) Q Consensus 234 Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~~~~~~l~~~~~d~~a~~~~~~~~ 313 (400) +..+.++..|+|+-.+ ...+.....|++...+....+.+...++|+|+.++.+|+. T Consensus 226 ~~~~~~il~g~g~~~~-------------------~~~~~~~~~d~l~~~~~~~~~~~~~a~~vmn~~~~~~l~~----- 281 (394) T protein:vir:10 226 NTYNAMIAPVLQSFTA-------------------KATTTDTLVDSLKHILNVDLDPAYSRALVVTQSLFNTLDT----- 281 (394) T ss_pred HHHHHHHhhccccccc-------------------ccccccccHHHHHHHHHhhhhhhccCEEEecHHHHHHHHH----- Confidence 7999999999886321 1122344567787777655556667789999999999887 Q ss_pred cccccceeecCCcceeehhhccccceecchh--hhchhhhhccc-----------ceechhhceecc---------ceee Q lcl|Aclame:pro 314 TANANVRIKNDDTEIASEVGVDEIIVYTGSK--ALKPTVLVDQK-----------YHIDMQDLTKVD---------AFEW 371 (400) Q Consensus 314 ~~~an~~lk~~d~~~~~~v~v~~~~~~tg~k--~l~~t~~vd~k-----------~~~~~~~~~~~~---------~~~~ 371 (400) |+|.+|.+.+..++...+-..+.. .=+|.+++|.. +--|++++.... +..- T Consensus 282 -------lkd~~G~~i~~~~~~~~~~~~~~~~L~G~PV~~~~~~~~~~~~~~~~i~~gd~s~~~~~~~~~~~~v~~~~~~ 354 (394) T protein:vir:10 282 -------LKDKNGRYLLHDASDSITDGTAKGTVLGVPVYVVGDALLGSAAGDQKAFVGDLKRGVLFADRQQVTLAWEDSK 354 (394) T ss_pred -------hhccCCCeeeeccccccccCCcccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEEeccc Confidence 899999998766554432211101 11233333321 111334321111 1111 Q ss_pred eecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 372 KTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 372 ~~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) .|..+ |.+..--.|-|---++-.+++++ T Consensus 355 ~~~~~-~~~~~r~d~~~~~~~ai~~~~~~ 382 (394) T protein:vir:10 355 IYGRY-LGAAFRFGVKQADSNAGYFVTNT 382 (394) T ss_pred cccee-EEEEEEeccEEeccccEEEEEee Confidence 12222 22222333433333444455555 No 20 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=99.63 E-value=8.1e-17 Score=108.82 Aligned_cols=359 Identities=16% Similarity=0.129 Sum_probs=189.4 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHH------HHHHHHhhHhhhcchh Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIK------IENELNAQEEKPKGKD 81 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K------~enEl~~~kEk~K~k~ 81 (400) |+| .+.+-+++++.+++++..+..+.+.- . .+++.+++.++.|+++|+. .++.+.....+..... T Consensus 1 M~k-~l~el~~~~~~~~~e~~~~~~~~~~~-~-------ee~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 71 (404) T protein:vir:10 1 MSK-ELRELLNQLDSKNKELNSLLNKDGVT-A-------EELNKTSNEIDILQAKIEAQKRKENIENNFNEDNVKSLNTG 71 (404) T ss_pred CcH-HHHHHHHHHHHHHHHHHHHHhhcCCC-H-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc Confidence 887 46777777777777665553332211 1 1233334444444444431 1122211111111110 Q ss_pred HHHHHHHHHHHHHHHHHHHHHccCChh---HHHHHHHHHHhCcc-chhhhHhhcchhHHHHHHHHHHhhCccccceeeec Q lcl|Aclame:pro 82 KMTNFIESQNAVTEFFDVLKKNSGKSE---IKNAWSAKLAENGV-TITDTTFQLPRKLVESINTALLNTNPVFKVFHVTN 157 (400) Q Consensus 82 emtEfLkTkqA~~dya~ll~~nqg~ke---~k~AW~a~L~ekgV-~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n 157 (400) .-........+ +.+-..++..+.. ......+....-.. ++.+--.++|..+...|-+.+.++.++++.+.+.+ T Consensus 72 ~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~l~~~~~ 148 (404) T protein:vir:10 72 KEENVIYNGAL---FVRAIADNLLKQKNQRGLNLSEKEINAISENIDEDGGYAVPEDIQTKINTRLKDTTDLYNMVDYEP 148 (404) T ss_pred cchhhHHHHHH---HHHHHHHHHHHHHHhhhhcchhhHHhhhccccCCCCceeechhHHHHHHHHHhhhhhHhhhhceee Confidence 00001111111 1111111111100 00011111111111 11233336899999999999999999988777776 Q ss_pred cccee--eEEeecc-ccccceecccchhhhh--hhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 158 VGALL--VSRSFDS-ANEAQVHKDGQTKTEQ--AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIV 232 (400) Q Consensus 158 ~~~~a--~~i~l~n-a~~a~GHk~ga~Kk~q--~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI 232 (400) .+... ..+...+ ...+..+.-|.++..+ +.+|...++.|..++...++.+-+. +.+.-++.+|++++|+.++- T Consensus 149 ~~~~~g~~~~~~~~~~~~~~~v~e~~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell--~ds~~~l~~~i~~~la~~~~ 226 (404) T protein:vir:10 149 VFTRSGSRTYEKRSKQKPMKPLSENQQIPTNGDNGKLERFNFKLKDLADFMSIPNDLL--KFADKSLEDWIINWFVDKVR 226 (404) T ss_pred ccCCccceEEEEecCCcceeeccccccccccccccceeeeEeeheeeEeeehhhHHHH--hhcHHHHHHHHHHHHHHHHH Confidence 65432 2222222 2344445666665554 5789999999998888877744332 22224789999999999999 Q ss_pred HHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceecccccc--eEEEEecchhHHHHhhh Q lcl|Aclame:pro 233 NKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGR--RYLIVKAEDRKALLDEL 310 (400) Q Consensus 233 ~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~~~~--~~l~~~~~d~~a~~~~~ 310 (400) +.++.+++.|||++. ....+.....+ .+...+.....+++..++...-+.+.+ .++++|+.++.+|+- T Consensus 227 -~~~~~~il~G~g~~~--~~~gi~~~~~~-----~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~-- 296 (404) T protein:vir:10 227 -ITRNAEILYGAGGDE--HATGIMTANKF-----KKITLPKSPALKDFKKCKNVELLNVFKATSSWIVNQDGFNYLDS-- 296 (404) T ss_pred -HHHHHHHhhcCCCCC--cccceeecccc-----ceeeccccccHHHHHHHHHhhhhccccCCCEEEEcHHHHHHHHH-- Confidence 799999999999743 11111111111 123344556778888887654444432 368899999999987 Q ss_pred hhccccccceeecCCcceeehhhccccceec--chhhh-chhhhhc----c--cceechhhceecc---c---------- Q lcl|Aclame:pro 311 RQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKAL-KPTVLVD----Q--KYHIDMQDLTKVD---A---------- 368 (400) Q Consensus 311 ~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l-~~t~~vd----~--k~~~~~~~~~~~~---~---------- 368 (400) ++|.+|++.+.-++.+-..+| |..-. .|+...+ . -+--|++++..+. . T Consensus 297 ----------lkd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~ 366 (404) T protein:vir:10 297 ----------LEDKTGRPYLQPDPKDPTQYRFLGLPVIELPNDLLLSTESAIPVLLGDTKEAYKYVSDGAYELATTNIGA 366 (404) T ss_pred ----------hhccCCceeeccCcCCCCCccccceeeEEecccccCCCCCccEEEEEeccccEEEEEecceEEEEecccc Confidence 889999988765544443333 54321 1111110 0 1223455422211 1 Q ss_pred eeeeecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 369 FEWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 369 ~~~~~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) ..|..++-.|.++.--.|-|--.++-+++++. T Consensus 367 ~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~ 398 (404) T protein:vir:10 367 GAFETNTTKARIIMRIDGNVKDSEALLIAEIP 398 (404) T ss_pred chhhcCceEEEEEEeeccEEecccceEEEEee Confidence 11233334466777777777777777777777 No 21 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=99.63 E-value=5.9e-17 Score=109.56 Aligned_cols=361 Identities=16% Similarity=0.081 Sum_probs=193.1 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHhhhcchhHH--HH Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKM--TN 85 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kEk~K~k~em--tE 85 (400) |+| +.+-.++++++.+++.++....++ +.++. ...+++++..+.++.+++++.+..+.....+....... .. T Consensus 1 M~~--l~el~~~~~~~~~e~~~l~~~~~~-e~~~~---~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (385) T protein:vir:19 1 MSE--LALIQKAIEESQQKMTQLFDAQKA-EIEST---GQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEK 74 (385) T ss_pred ChH--HHHHHHHHHHHHHHHHHHHHHHHH-HHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchh Confidence 554 444444455555555444322211 11111 13445566666666666655444444333221111111 01 Q ss_pred HHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEE Q lcl|Aclame:pro 86 FIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSR 165 (400) Q Consensus 86 fLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i 165 (400) -...+.+..++.+.+....+........ ..+ .....+....+|..+...|-..+.++.++++.+.+.+.+.....+ T Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~---~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 150 (385) T protein:vir:19 75 KSFSERAAEELIKSWDGKQGTFGAKTFN-KSL---GSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEY 150 (385) T ss_pred hhhHHHHHHHHHHHHHHhhccchhhHHH-hhh---ccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEE Confidence 1122333444444443333221111111 011 111111122578778888889999999999876665554443333 Q ss_pred eecc--ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeec Q lcl|Aclame:pro 166 SFDS--ANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG 243 (400) Q Consensus 166 ~l~n--a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~g 243 (400) -..+ ...++.+.-|.++.+...+|...++.|..++....+.+-+.+ .+ ..+.+|++++|+.++. ++++.++..| T Consensus 151 ~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~--d~-~~l~~~i~~~la~a~~-~~~d~~~l~G 226 (385) T protein:vir:19 151 VREEVFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQVMD--DA-PMLQSYINNRLMYGLA-LKEEGQLLNG 226 (385) T ss_pred EEEecCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHHHHh--hH-HHHHHHHHHHHHHHHH-HHHHHHHHhc Confidence 3222 345555667889999999999999999999988777554433 22 4699999999999988 7999999999 Q ss_pred cCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceec-ccccceEEEEecchhHHHHhhhhhccccccceee Q lcl|Aclame:pro 244 DGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVR-PTAGRRYLIVKAEDRKALLDELRQATANANVRIK 322 (400) Q Consensus 244 DG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~-~~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk 322 (400) ||++.. ..-+ ....-.++.+...++....|.|..++.... +-.....+++|+.++.+|+. ++ T Consensus 227 ~g~~~~--~~Gi---~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~------------lk 289 (385) T protein:vir:19 227 DGTGDN--LEGL---NKVATAYDTSLNATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIAL------------LK 289 (385) T ss_pred cCCCCc--cccc---ccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHH------------hh Confidence 998531 0000 000000111122234456788888774443 34445689999999999987 88 Q ss_pred cCCcceeehhhccccc-eecchhhhchhhhhccc--ceechhh-ceeccc------------eeeeecCceEEEEecccc Q lcl|Aclame:pro 323 NDDTEIASEVGVDEII-VYTGSKALKPTVLVDQK--YHIDMQD-LTKVDA------------FEWKTNSNMILVETLTSG 386 (400) Q Consensus 323 ~~d~~~~~~v~v~~~~-~~tg~k~l~~t~~vd~k--~~~~~~~-~~~~~~------------~~~~~~~~~ilve~~~~~ 386 (400) |.+|.+.++...+... ..-|- ..+.+..++.. +-.|++. |+-.+. -.|..++-.|.++.--.| T Consensus 290 d~~G~~l~~~~~~~~~~~l~G~-pV~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~ 368 (385) T protein:vir:19 290 DNEGRYIFGGPQAFTSNIMWGL-PVVPTKAQAAGTFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLAL 368 (385) T ss_pred cCCCceeccCcccCCCceecce-eeEEcCcCCCCcEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeecc Confidence 9999988764221110 01142 22333333332 1223332 221111 113334445566666777 Q ss_pred cceeeccceeEeeC Q lcl|Aclame:pro 387 HVETYNAGAVITVS 400 (400) Q Consensus 387 ~~~~~~~~~~~~~~ 400 (400) +|-.-++-++++++ T Consensus 369 ~v~~~~a~~~~~~~ 382 (385) T protein:vir:19 369 AHYRPTAIIKGTFS 382 (385) T ss_pred EEecccceEEEEec Confidence 77666666666666 No 22 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=99.63 E-value=5.9e-17 Score=109.56 Aligned_cols=361 Identities=16% Similarity=0.081 Sum_probs=193.1 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHhhhcchhHH--HH Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKM--TN 85 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kEk~K~k~em--tE 85 (400) |+| +.+-.++++++.+++.++....++ +.++. ...+++++..+.++.+++++.+..+.....+....... .. T Consensus 1 M~~--l~el~~~~~~~~~e~~~l~~~~~~-e~~~~---~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (385) T protein:vir:18 1 MSE--LALIQKAIEESQQKMTQLFDAQKA-EIEST---GQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEK 74 (385) T ss_pred ChH--HHHHHHHHHHHHHHHHHHHHHHHH-HHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchh Confidence 554 444444455555555444322211 11111 13445566666666666655444444333221111111 01 Q ss_pred HHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEE Q lcl|Aclame:pro 86 FIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSR 165 (400) Q Consensus 86 fLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i 165 (400) -...+.+..++.+.+....+........ ..+ .....+....+|..+...|-..+.++.++++.+.+.+.+.....+ T Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~---~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 150 (385) T protein:vir:18 75 KSFSERAAEELIKSWDGKQGTFGAKTFN-KSL---GSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLLAQGRTSSNALEY 150 (385) T ss_pred hhhHHHHHHHHHHHHHHhhccchhhHHH-hhh---ccccccCCceecchhhhHHHHHhhhccchhhhcceecccCcceEE Confidence 1122333444444443333221111111 011 111111122578778888889999999999876665554443333 Q ss_pred eecc--ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeec Q lcl|Aclame:pro 166 SFDS--ANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG 243 (400) Q Consensus 166 ~l~n--a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~g 243 (400) -..+ ...++.+.-|.++.+...+|...++.|..++....+.+-+.+ .+ ..+.+|++++|+.++. ++++.++..| T Consensus 151 ~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~--d~-~~l~~~i~~~la~a~~-~~~d~~~l~G 226 (385) T protein:vir:18 151 VREEVFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQVMD--DA-PMLQSYINNRLMYGLA-LKEEGQLLNG 226 (385) T ss_pred EEEecCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHHHHh--hH-HHHHHHHHHHHHHHHH-HHHHHHHHhc Confidence 3222 345555667889999999999999999999988777554433 22 4699999999999988 7999999999 Q ss_pred cCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceec-ccccceEEEEecchhHHHHhhhhhccccccceee Q lcl|Aclame:pro 244 DGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVR-PTAGRRYLIVKAEDRKALLDELRQATANANVRIK 322 (400) Q Consensus 244 DG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~-~~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk 322 (400) ||++.. ..-+ ....-.++.+...++....|.|..++.... +-.....+++|+.++.+|+. ++ T Consensus 227 ~g~~~~--~~Gi---~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~------------lk 289 (385) T protein:vir:18 227 DGTGDN--LEGL---NKVATAYDTSLNATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIAL------------LK 289 (385) T ss_pred cCCCCc--cccc---ccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHH------------hh Confidence 998531 0000 000000111122234456788888774443 34445689999999999987 88 Q ss_pred cCCcceeehhhccccc-eecchhhhchhhhhccc--ceechhh-ceeccc------------eeeeecCceEEEEecccc Q lcl|Aclame:pro 323 NDDTEIASEVGVDEII-VYTGSKALKPTVLVDQK--YHIDMQD-LTKVDA------------FEWKTNSNMILVETLTSG 386 (400) Q Consensus 323 ~~d~~~~~~v~v~~~~-~~tg~k~l~~t~~vd~k--~~~~~~~-~~~~~~------------~~~~~~~~~ilve~~~~~ 386 (400) |.+|.+.++...+... ..-|- ..+.+..++.. +-.|++. |+-.+. -.|..++-.|.++.--.| T Consensus 290 d~~G~~l~~~~~~~~~~~l~G~-pV~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~ 368 (385) T protein:vir:18 290 DNEGRYIFGGPQAFTSNIMWGL-PVVPTKAQAAGTFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLAL 368 (385) T ss_pred cCCCceeccCcccCCCceecce-eeEEcCcCCCCcEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeecc Confidence 9999988764221110 01142 22333333332 1223332 221111 113334445566666777 Q ss_pred cceeeccceeEeeC Q lcl|Aclame:pro 387 HVETYNAGAVITVS 400 (400) Q Consensus 387 ~~~~~~~~~~~~~~ 400 (400) +|-.-++-++++++ T Consensus 369 ~v~~~~a~~~~~~~ 382 (385) T protein:vir:18 369 AHYRPTAIIKGTFS 382 (385) T ss_pred EEecccceEEEEec Confidence 77666666666666 No 23 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=99.63 E-value=3e-16 Score=105.72 Aligned_cols=367 Identities=14% Similarity=0.100 Sum_probs=190.5 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHh-------hhcch Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE-------KPKGK 80 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kE-------k~K~k 80 (400) ||. +++-.++|+++++.+..++..+...-.++. ..+++.++..+.+++++|+..+.++...++ .++.. T Consensus 1 mk~--~~el~~~l~el~~~~~~~~~~~~~~~~~~~---~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 75 (415) T protein:vir:94 1 MKT--KEELQSEISDIKRQIDLKVKYATRALNNDE---LEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSV 75 (415) T ss_pred CCh--HHHHHHHHHHHHHHHHHHHHHHHHHhchhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc Confidence 543 333444555555555555544443222222 234556666666777777655444443332 11111 Q ss_pred hHHHHHHHHHH-HHHHHHHHHHHccCChhHHHHHHHHHHhC-----c-cchhhhHhhcchhHHHHHHHHHHhhCccccce Q lcl|Aclame:pro 81 DKMTNFIESQN-AVTEFFDVLKKNSGKSEIKNAWSAKLAEN-----G-VTITDTTFQLPRKLVESINTALLNTNPVFKVF 153 (400) Q Consensus 81 ~emtEfLkTkq-A~~dya~ll~~nqg~ke~k~AW~a~L~ek-----g-V~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~f 153 (400) ....+-..... ....+..-+..........++|...+... + ..+.+--..+|..+...|.+.++++.++++.. T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~ 155 (415) T protein:vir:94 76 EVNEASTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYV 155 (415) T ss_pred cccchhhHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhhhhhhhhhccccccccccCcHHHHHHHHHHHHhhhhhhhhc Confidence 11111111111 11111112222233344556665544332 2 22223334799999999999999999998866 Q ss_pred eeecccceeeE--Eeecc-ccccceecccchhhhh-hhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHH Q lcl|Aclame:pro 154 HVTNVGALLVS--RSFDS-ANEAQVHKDGQTKTEQ-AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQ 229 (400) Q Consensus 154 hV~n~~~~a~~--i~l~n-a~~a~GHk~ga~Kk~q-~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq 229 (400) .+.+.+..... +...+ ...+..+.-|.++++. ...|...++.|..++....+.+-+.+ .+.-++.+|++++|+. T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~--ds~~~~~~~i~~~l~~ 233 (415) T protein:vir:94 156 TVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIE--DAKVNVLQELKLWMAR 233 (415) T ss_pred ceeeccCCceeEEEEeecCCccceeccccccccccccccceeeEeeheeeeeechhhHHHHh--hchHHHHHHHHHHHHH Confidence 66665544322 22233 2344455566677654 46899999999999998777443332 2224679999999999 Q ss_pred HHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhc-eecccccceEEEEecchhHHHHh Q lcl|Aclame:pro 230 AIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVD-FVRPTAGRRYLIVKAEDRKALLD 308 (400) Q Consensus 230 ~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald-~~~~~~~~~~l~~~~~d~~a~~~ 308 (400) .+. ++++.+++.|+|+....... .+... ...+...+.+...|++..++. +..+-.....+++|+.+..+|+. T Consensus 234 ~~~-~~~~~~il~g~g~g~~~~~~-~~~~~-----~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~ 306 (415) T protein:vir:94 234 TIA-ATRNKAIIDVITKGSTGSTS-SGFEK-----EGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDK 306 (415) T ss_pred HHH-HHHHHHHhhccccCcccccc-ccccc-----cccccccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHH Confidence 998 79999999998873321111 11111 112233444566788888883 33443445678999999999887 Q ss_pred hhhhccccccceeecCCcceeehhhccccceec--chhhhc-hhhhhcc-----cceechhhc-eeccce----ee---e Q lcl|Aclame:pro 309 ELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKALK-PTVLVDQ-----KYHIDMQDL-TKVDAF----EW---K 372 (400) Q Consensus 309 ~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l~-~t~~vd~-----k~~~~~~~~-~~~~~~----~~---~ 372 (400) ++|.+|++.+...+.+-+..+ |..-.+ |...... -+--|++++ +-.+.- .| . T Consensus 307 ------------lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~ 374 (415) T protein:vir:94 307 ------------MKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYM 374 (415) T ss_pred ------------hhccCCCeeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEeccc Confidence 888899887744433221111 322110 1000000 011234432 111100 01 1 Q ss_pred ecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 373 TNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 373 ~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) .++--+.++..-.|.+--.++-.+++++ T Consensus 375 ~~~~~~r~~~r~d~~~~~~~a~~~~~~~ 402 (415) T protein:vir:94 375 HFGECLMIAVRQDCRILDYKSAIVIEYD 402 (415) T ss_pred cCceEEEEEEEeccEEeccccEEEEEEe Confidence 1111233444445555555555555555 No 24 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=99.62 E-value=1.4e-16 Score=107.56 Aligned_cols=366 Identities=11% Similarity=0.048 Sum_probs=200.4 Q ss_pred ccccccchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHhhhc---- Q lcl|Aclame:pro 3 ISKRNMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPK---- 78 (400) Q Consensus 3 ~s~~~~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kEk~K---- 78 (400) +|++ +++-+++++++++++.....+...-.. ..-++..++++++.++++++++|++.+..++....... T Consensus 1 m~e~------~~~l~~~~~~~~~~~~~~~e~~~~~~~-~~~e~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~ 73 (390) T protein:vir:10 1 MTDI------TSKLEATLANVTDSLRAFGERAVRDGE-LNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDV 73 (390) T ss_pred ChHH------HHHHHHHHHHHHHHHHHHHHHHHhhcc-cCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 2221 122233344444444433333222111 11223467888888999999988876665554442111 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchh-hhHhhcchhHHHHHHHHHHhhCccccceeeec Q lcl|Aclame:pro 79 GKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTIT-DTTFQLPRKLVESINTALLNTNPVFKVFHVTN 157 (400) Q Consensus 79 ~k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~q-d~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n 157 (400) ....+.+......+...|+.......+.. ...+.+........++ +-...+|..++..|-+.++++.++++.+.+.+ T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~ 151 (390) T protein:vir:10 74 QHVSVGDLFVASEQFQASAGRWNDRSARA--TMNIKAALNTASTDAAGSAGALTTPNRLPGFITQPDARLTVRDLIGSGR 151 (390) T ss_pred cccchhhhhhhhHHHHHHHHhhhhhhhhh--hhHHHHHHHhhhcccccccccccchhHHHHHHHHHHhhchhhhhcceee Confidence 11123344445555555554444333321 1122222222211111 12226788888888888898899998666665 Q ss_pred ccceeeEEeecc--ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 158 VGALLVSRSFDS--ANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKI 235 (400) Q Consensus 158 ~~~~a~~i~l~n--a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Ra 235 (400) .+.....+-..+ ...+.-+.-|+++++...+|...++.|..++....+.+.+.+ .+ .++.+|++++|+..+- +. T Consensus 152 ~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~--d~-~~l~~~i~~~l~~~~~-~~ 227 (390) T protein:vir:10 152 TDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILS--DA-PQLASYMNNRLIRGLK-VK 227 (390) T ss_pred ccCCceEEEEEecCCcceeeecCCccccccccceeEEEEeeEEEEEeehhhHHHHH--hH-HHHHHHHHHHHHHHHH-HH Confidence 554433333322 234444566889999999999999999988877777444432 33 3799999999999999 79 Q ss_pred HhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceeccc-ccceEEEEecchhHHHHhhhhhcc Q lcl|Aclame:pro 236 VDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPT-AGRRYLIVKAEDRKALLDELRQAT 314 (400) Q Consensus 236 v~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~-~~~~~l~~~~~d~~a~~~~~~~~~ 314 (400) ++++++.|||++.. ..-+..+.. .+..++..+++...|.+..++.-..|. .....+++++.++.+|+. T Consensus 228 ~~~~il~G~G~~~~-p~Gi~~~~~----~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~------ 296 (390) T protein:vir:10 228 EDAEILRGTGANDG-LLGLIPQAT----TYAAPTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIEL------ 296 (390) T ss_pred HHHHHhhcCCCCcc-ccccccccc----cccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHH------ Confidence 99999999998531 111111110 111122334445567787777444443 333468899999999887 Q ss_pred ccccceeecCCcceeehhhccccceec--chhhhchhhhhc--ccceechhh-ceecc-----------ceeeeecCceE Q lcl|Aclame:pro 315 ANANVRIKNDDTEIASEVGVDEIIVYT--GSKALKPTVLVD--QKYHIDMQD-LTKVD-----------AFEWKTNSNMI 378 (400) Q Consensus 315 ~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l~~t~~vd--~k~~~~~~~-~~~~~-----------~~~~~~~~~~i 378 (400) ++|.+|.+.+.-+++... +| |.. .+.+..+. .-+-.|++. |.-.+ .-.|..+.-.+ T Consensus 297 ------lkd~~g~~l~~~~~~~~~-~~l~G~p-v~~~~~~p~~~~~~gdf~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~ 368 (390) T protein:vir:10 297 ------AKDANNQYLIGNARGTLT-PTLWGLP-VVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVNDDFQRNMVTV 368 (390) T ss_pred ------hhcCCCceeecCCcCcCC-ceeccee-eEEcCCCCCCcEEEEeccceEEEEEecceEEEEeecccccccCcEEE Confidence 889999987764443221 01 221 11111111 112233432 21111 11233333345 Q ss_pred EEEecccccceeeccceeEeeC Q lcl|Aclame:pro 379 LVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 379 lve~~~~~~~~~~~~~~~~~~~ 400 (400) .++.--.|.|-..+|-++++++ T Consensus 369 r~~~r~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 369 LAEERLALVVYRPEALISGSFA 390 (390) T ss_pred EEEEeeccEEeccccEEEEEeC Confidence 5666666766666666677777 No 25 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=99.62 E-value=1.1e-16 Score=108.11 Aligned_cols=364 Identities=15% Similarity=0.074 Sum_probs=182.0 Q ss_pred cchhhHHHHHHHH-HHHHHHHHHhhhhhhcchhhhhh------hhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHhhhcch Q lcl|Aclame:pro 8 MNKPDLIEKQNRL-AELKENNVSLKSQISGFEVKNAI------EDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGK 80 (400) Q Consensus 8 ~~k~~~eekq~~l-A~lKe~~~~~Ks~i~~~~~~~~~------~~~skieElektis~l~aEi~K~enEl~~~kEk~K~k 80 (400) |. ++++++++| +++++...+.++.++..+.+... +...++++++..+.++++++...+....... +.... T Consensus 1 m~--~~~k~l~el~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 77 (395) T protein:vir:43 1 MS--DFEKQIGELNASLKQVGDQIKSQAEQVNTQIANFGEMNKETRAKVDELLTAQGELQARLSAAEQAMLANE-KRDGG 77 (395) T ss_pred Ch--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-ccccc Confidence 32 334333333 22333333333333333221111 1123334444444444444433322222211 11111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHh--hcchhHHHHHHHHHHhhCccccceeeecc Q lcl|Aclame:pro 81 DKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTF--QLPRKLVESINTALLNTNPVFKVFHVTNV 158 (400) Q Consensus 81 ~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~e--iLP~~iI~AIe~A~ed~d~vl~~fhV~n~ 158 (400) ..............+|.+.+. ...+..+...+..+...+++.+. ++|..+...|-+.++++.++++.+.+.+. T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~ 152 (395) T protein:vir:43 78 EEAPKTAGQMVAESLKEQGVT-----SSLRGSHRVSMPRSAITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTT 152 (395) T ss_pred cchhhhHHHHHHHHHHHHHHH-----HHhhhhhhhhhhhhhhcccCCCCccccchhhHHHHHHHHHhhhhHHhhccceec Confidence 122111112122222211111 12233333334444444444333 68888888899999999999886666555 Q ss_pred cceeeEEeecc--ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 159 GALLVSRSFDS--ANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIV 236 (400) Q Consensus 159 ~~~a~~i~l~n--a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav 236 (400) +.....+-..+ ...+..+.-|+++.+...+|...++.|..++....+.+.+.+ .+ +++.+|++++|+.++. +.+ T Consensus 153 ~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~--d~-~~l~~~v~~~la~a~~-~~~ 228 (395) T protein:vir:43 153 ESNSVEYVRETGFVNNAAPVSEGTQKPYSDLTFELENAPVRTIAHLFKASRQILD--DA-SALQSYIDARARYGLM-LVE 228 (395) T ss_pred CCCceEEEEEecCCCceeeecCCccccccccceeEEEEeeeeEEEeehhhHHHHH--hH-HHHHHHHHHHHHHHHH-HHH Confidence 54433333332 234444566889999999999999999999988777555433 22 4689999999999998 799 Q ss_pred hcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceeccc-ccceEEEEecchhHHHHhhhhhccc Q lcl|Aclame:pro 237 DLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPT-AGRRYLIVKAEDRKALLDELRQATA 315 (400) Q Consensus 237 ~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~-~~~~~l~~~~~d~~a~~~~~~~~~~ 315 (400) +.++++|||+... ..-+...... ... ..++..+.....|.+..++.-..|. ....++++|+.+..+|+. T Consensus 229 d~~~l~G~g~~~~-~~Gi~~~~~~-~~~-~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~------- 298 (395) T protein:vir:43 229 ECQLLYGNGTGAN-LHGIIPQAQA-YAP-PSGVVVTAEQRIDRIRLAILQAQLAEFPASGIVLNPIDWALIEL------- 298 (395) T ss_pred HHHHHhccCCCCc-cccccccccc-ccc-ccccccccchhHHHHHHHHHhhccccCCCcEEEEcHHHHHHHHH------- Confidence 9999999998431 0011111110 000 0011122223466777776333333 334579999999998876 Q ss_pred cccceeecCCcceeehhhcccc--ceecchhhhchhhhhccc--ceechhhcee-ccc------------eeeeecCceE Q lcl|Aclame:pro 316 NANVRIKNDDTEIASEVGVDEI--IVYTGSKALKPTVLVDQK--YHIDMQDLTK-VDA------------FEWKTNSNMI 378 (400) Q Consensus 316 ~an~~lk~~d~~~~~~v~v~~~--~~~tg~k~l~~t~~vd~k--~~~~~~~~~~-~~~------------~~~~~~~~~i 378 (400) ++|.+|.+.++-..+.. ++ -|. ..+.+..++.. +..|++.+.. .+. ..|..++-.+ T Consensus 299 -----lkd~~G~~i~~~~~~~~~~~l-~G~-pVv~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~ 371 (395) T protein:vir:43 299 -----NKDAENRYIIGSPQNGTTPTL-WRL-PVVETQAITQDEFLTGAFSLGAQIFDRMDIEVLVSTENDKDFENNMVTI 371 (395) T ss_pred -----hhccCCceeccccccCCCcee-cce-eeEEcCCCCCCcEEEEeccceEEEEEecceEEEEeccccchhhcCcEEE Confidence 78888888875422221 11 142 23333333332 2235444221 110 0122222234 Q ss_pred EEEecccccceeeccceeEeeC Q lcl|Aclame:pro 379 LVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 379 lve~~~~~~~~~~~~~~~~~~~ 400 (400) .++.--.|.+---++-++++|+ T Consensus 372 r~~~r~d~~v~~~~a~~~~~~t 393 (395) T protein:vir:43 372 RAEERLAFAVYRPEAFVTGSLT 393 (395) T ss_pred EEEEeeccEEecccceEEEEec Confidence 4455555555433444444444 No 26 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=99.61 E-value=9.4e-17 Score=108.44 Aligned_cols=350 Identities=12% Similarity=0.085 Sum_probs=190.8 Q ss_pred CcccccccchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHh----- Q lcl|Aclame:pro 1 MRISKRNMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE----- 75 (400) Q Consensus 1 ~~~s~~~~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kE----- 75 (400) |-+ +| +++|-.++++++++++..+..+++..-.+. ..+..+++++.+.+.++..++++.++.++...+ T Consensus 1 ~~~---~m---~l~el~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (404) T protein:vir:39 1 MGV---KL---TVNQLNEAWIASGDKVTDFNDQINMALNDD-NFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVN 73 (404) T ss_pred CCh---HH---HHHHHHHHHHHHHHHHHHHHHHHHHHhccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 543 33 455556666666666666666655432221 111124445555555555555444333332221 Q ss_pred ------hhcchhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCcc Q lcl|Aclame:pro 76 ------KPKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPV 149 (400) Q Consensus 76 ------k~K~k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~v 149 (400) .......+...-...++...| +.. + .......+.+-... .++.+--.++|..+...|-+.++++.++ T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~--~-~~~~~~~e~~a~~~-~t~~~gg~~iP~~~~~~ii~~~~~~~~l 146 (404) T protein:vir:39 74 MREEEKGPLNKSEYELKDKFVKEFVNM---VRN--P-MAFLNTVSSKTETS-GSDSAAGLTIPQDIRTMINTLVRQYDSL 146 (404) T ss_pred cccccccccccchhhhHHHHHHHHHHH---Hhc--c-hhhhhhhhhhhhhc-ccccCCceeccHHHHHHHHHHHHhhhhH Confidence 001111122222222222222 211 1 11222222221111 1222222368999999999999999999 Q ss_pred ccceeeeccccee--eEEeeccc--cccceecccchhhh-hhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHH Q lcl|Aclame:pro 150 FKVFHVTNVGALL--VSRSFDSA--NEAQVHKDGQTKTE-QAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIV 224 (400) Q Consensus 150 l~~fhV~n~~~~a--~~i~l~na--~~a~GHk~ga~Kk~-q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm 224 (400) ++...+.+.+... ..+...+. ..+.-+--|.++.+ ...+|...++.|..++....+.+.+.+- +.-++.+|++ T Consensus 147 ~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~ 224 (404) T protein:vir:39 147 QQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKD--TAENILAWLS 224 (404) T ss_pred HhhcceeeccCCcceEEEEeecCCccceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhh--chHHHHHHHH Confidence 8877766655432 22222222 23333555667776 5689999999999999887775544432 2256899999 Q ss_pred HHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhce-eccc-ccceEEEEecch Q lcl|Aclame:pro 225 AELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDF-VRPT-AGRRYLIVKAED 302 (400) Q Consensus 225 ~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~-~~~~-~~~~~l~~~~~d 302 (400) ++|+.++. +.++.+++.|||+.. ..+.+.+.+++..++.- ..|- .+..++++|+.+ T Consensus 225 ~~l~~~~~-~~~d~~il~g~g~~~---------------------~~~~~~~~~~i~~~~~~~~~~~~~~~a~~v~n~~~ 282 (404) T protein:vir:39 225 SWIAKKVV-VTRNQAIIAAMGTVP---------------------KKPTIAKFDDVITMINTSVDPAIIATSSLLTNQSG 282 (404) T ss_pred HHHHHHHH-HHHHHHHHhcccccc---------------------cccccccHHHHHHHHHHhhhhhhccCCEEEEcHHH Confidence 99999999 799999999999842 22334445666666532 2221 123478999999 Q ss_pred hHHHHhhhhhccccccceeecCCcceeehhhccccceec--chh------hhchhhhhcc--cceechhhceeccc---- Q lcl|Aclame:pro 303 RKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSK------ALKPTVLVDQ--KYHIDMQDLTKVDA---- 368 (400) Q Consensus 303 ~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k------~l~~t~~vd~--k~~~~~~~~~~~~~---- 368 (400) +.+|+- ++|.+|++.+..++..-+..| |.. ...|+...+. -+-.|++++..... T Consensus 283 ~~~L~~------------lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~ 350 (404) T protein:vir:39 283 LNKLAL------------VKTAEGKYLLEPDPTKPNSYLIKGKKVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENM 350 (404) T ss_pred HHHHHH------------hhccCCceeeccCcCCCCcceecceeEEEecccccCccCCCccEEEEEeccccEEEEeecce Confidence 999987 788899888755543322222 432 2234333222 22335554332211 Q ss_pred -e--------eeeecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 369 -F--------EWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 369 -~--------~~~~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) . .|..++-.+.++..-.|.+--.++-.++++. T Consensus 351 ~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~ 391 (404) T protein:vir:39 351 SLLPTNIGAGAFETDTTKIRVIDRFDVKTTDSEALVAGSFT 391 (404) T ss_pred EEEEeccchhhhhhceeeEEEEeeeccEEecccceEEEEee Confidence 1 1233344456777777777777777777665 No 27 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=99.59 E-value=1.4e-15 Score=102.05 Aligned_cols=353 Identities=12% Similarity=0.093 Sum_probs=181.6 Q ss_pred CcccccccchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhh-------hh-hhhhHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 1 MRISKRNMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKN-------AI-EDLPKVQELEKTLSENSIEIIKIENELNA 72 (400) Q Consensus 1 ~~~s~~~~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~-------~~-~~~skieElektis~l~aEi~K~enEl~~ 72 (400) |-|- +-+++-+..+.++++.++.++.++.....+. .. ....++++++..++++...+...+..... T Consensus 1 ~~l~------e~i~e~~~~l~el~~~~~~~~~e~r~~~e~~~~~~~~~~~~e~~~~~~~l~~ei~~l~e~~~~~~~~~~~ 74 (400) T protein:vir:38 1 MTLD------EKLAAVKKQLDEKRSALPAMKTELRSLLEGEDSEENLKKAEGVRAKYDKAGKEIKDLEEKRDLYEAALKG 74 (400) T ss_pred CChH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3222 2233333445556666665555554432111 11 11345555655665555554432222221 Q ss_pred hH----------hhhcchhHHHHHHHHHHHHHHHHHHHHHccCC-hhHHHHHHH--HHHhCccchhhhHhhcchhHHHHH Q lcl|Aclame:pro 73 QE----------EKPKGKDKMTNFIESQNAVTEFFDVLKKNSGK-SEIKNAWSA--KLAENGVTITDTTFQLPRKLVESI 139 (400) Q Consensus 73 ~k----------Ek~K~k~emtEfLkTkqA~~dya~ll~~nqg~-ke~k~AW~a--~L~ekgV~~qd~~eiLP~~iI~AI 139 (400) .. +....+..+..+.+....-.+-........+. ........+ .....|+...+-..++|..+...| T Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~i 154 (400) T protein:vir:38 75 NEQSSGKKPDHPEEHSYRDALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNAGVKAADAASTIPETISNTP 154 (400) T ss_pred HhhcccccccchhhhhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhcccccCCcccccHHHHHHH Confidence 11 11122223333444333332222222111111 011111111 122335555554457999999999 Q ss_pred HHHHHhhCccccceeeecccceeeEEee-c-cccccceecccchhhh-hhhhhhhhhccHHHHHHHHHHHHHHHhhcCch Q lcl|Aclame:pro 140 NTALLNTNPVFKVFHVTNVGALLVSRSF-D-SANEAQVHKDGQTKTE-QAATLTIDTLEPVMVYKLQSLAERVKRLQMSY 216 (400) Q Consensus 140 e~A~ed~d~vl~~fhV~n~~~~a~~i~l-~-na~~a~GHk~ga~Kk~-q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~y 216 (400) ...++++..+++.+.+.+.+.....+-. . ....+..+.-|.++++ +...|...++.|..++.+..+.+.+.+-.+ T Consensus 155 i~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~-- 232 (400) T protein:vir:38 155 QRELQTVVDLKPFTNVFQASTQKGTYPTVANATTKMVTVAELEKNPAMAKPEFKPVNWSVETYRQALPVSQESIDDSA-- 232 (400) T ss_pred HHHHHhhhhhhhcceeEeccCcceEEEEEecCCCccccccccccccccccccceeeEeehhheeeehhhHHHHHhhhH-- Confidence 9999999999887776655544332222 2 2333334445555554 678999999999999998888554443222 Q ss_pred hHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceecccccceEE Q lcl|Aclame:pro 217 SELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYL 296 (400) Q Consensus 217 galvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~~~~~~l 296 (400) -++.+|++++|+..+. ++.+.++..|+|..... +....|++..++...-+.+...++ T Consensus 233 ~~~~~~i~~~l~~~~~-~~~~~~i~~~~~~~~~~----------------------~~~~~~~~~~~~~~~~~~~~~a~~ 289 (400) T protein:vir:38 233 IDLVGLIAQNGQQIKV-NTTNGAVATLLKGFTAK----------------------TISSVDDLKHINNVDLDPAYSRVI 289 (400) T ss_pred HHHHHHHHHHHHHHHH-HHHHHhhhhcccccccc----------------------ccccHHHHHHHHHhhhhhhhCcEE Confidence 4679999999999999 69999999998863221 223345555555544455556789 Q ss_pred EEecchhHHHHhhhhhccccccceeecCCcceeehhhccccceec--chhhhchhhhhcccc----------eechhhce Q lcl|Aclame:pro 297 IVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKALKPTVLVDQKY----------HIDMQDLT 364 (400) Q Consensus 297 ~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l~~t~~vd~k~----------~~~~~~~~ 364 (400) ++|+.++.+|+. ++|.+|.|.+..++..-.-+| |. |.+++|... --|++.+. T Consensus 290 v~~~~~~~~l~~------------lkd~~G~~i~~~~~~~~~~~~l~G~----pv~~~~~~~~~~~g~~~~~~gd~s~~~ 353 (400) T protein:vir:38 290 IASQSFYNFLDT------------VKDGNGRYLLQDSILTPSGKSVLGM----PIAVVSDDTLGAAGEAHAFLGDIKRAI 353 (400) T ss_pred EEcHHHHHHHHH------------hhccCCCeeeecCcCCCCccccccc----eeEEecccccCCCCceEEEEEeccccE Confidence 999999999887 899999998854443322222 43 333333211 11444322 Q ss_pred eccce-e----e---eecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 365 KVDAF-E----W---KTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 365 ~~~~~-~----~---~~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) ....+ + | .+...-+.+..--.|-|-.-++-..|+++ T Consensus 354 ~~~~~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~ 397 (400) T protein:vir:38 354 LFANRADFMVRWVDDQIYGQFLQAGMRFGVSVADEKAGYFLTYT 397 (400) T ss_pred EEEeecceEEEEecccccceeEEEEEEeccEEecccceEEEEee Confidence 11110 1 1 00111122222223333333344444444 No 28 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=99.59 E-value=1.2e-15 Score=102.31 Aligned_cols=360 Identities=12% Similarity=0.041 Sum_probs=166.2 Q ss_pred CcccccccchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhH------ Q lcl|Aclame:pro 1 MRISKRNMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQE------ 74 (400) Q Consensus 1 ~~~s~~~~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~k------ 74 (400) |+|.+. |..+.+++++|+...+++.++..+...-. +...+...+++++.+.++++..++.+......... T Consensus 1 Mki~el---k~el~~~~~el~~~~~elr~~~~~~~~~~-~el~~~~~e~~~~~~ei~el~~~l~~~~~~~~~~~e~~~~~ 76 (437) T protein:vir:10 1 MKIEKL---KKDLATKTAELNTKKAEIRSFTESEDKTI-DEVKAGMTEIKEKEDEIKEIRSNIEVLEQASALKVEEKRDD 76 (437) T ss_pred CCHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 555443 23344444444444444433333322211 11122233445555555555544432111111000 Q ss_pred ------------------hhhcchhHHHHHHH-HHHHHHHHHHHHHH----------ccCChhHHHHHHHHHHhC----- Q lcl|Aclame:pro 75 ------------------EKPKGKDKMTNFIE-SQNAVTEFFDVLKK----------NSGKSEIKNAWSAKLAEN----- 120 (400) Q Consensus 75 ------------------Ek~K~k~emtEfLk-TkqA~~dya~ll~~----------nqg~ke~k~AW~a~L~ek----- 120 (400) +..+...+..+-+. .+++..+....... ......-.+++...+... T Consensus 77 ~~~~~~e~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~ 156 (437) T protein:vir:10 77 SDLVAPELEENSADNEEDDPEKLKTETKSEAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIADKKVTAFADYLKTGEVRDV 156 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHHhhhhhhHHHHHhhhhhhh Confidence 00000000000000 00001110000000 000011112233333221 Q ss_pred -ccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEee--ccc-cccceecccchhhhhhhhhhhhhccH Q lcl|Aclame:pro 121 -GVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSF--DSA-NEAQVHKDGQTKTEQAATLTIDTLEP 196 (400) Q Consensus 121 -gV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l--~na-~~a~GHk~ga~Kk~q~~~le~~ti~p 196 (400) .....+.-.++|..+...|... ..+..+.....+.+.+.....+.. .+. ..+|+.-.+..++....+|...++.| T Consensus 157 ~~~~~~~~g~lvp~~~~~~i~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~e~~~~~~~~v~~~~ 235 (437) T protein:vir:10 157 TGIALKDGKVIIPETILTPEKEV-HQFPRLGSLVRTESVTTTTGKLPIFNNSTDLLTAHTEYGQTTKNATPVITPILWDL 235 (437) T ss_pred hhcccccccccchHHHHHHHHHh-hhhhhhhhcceeEeeccCceeeEEeeccccccccccccccccccccccceeeeeeh Confidence 1222333337888877777764 446667665666665555443333 222 34444444444445667899999999 Q ss_pred HHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCC Q lcl|Aclame:pro 197 VMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPF 276 (400) Q Consensus 197 ~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~ 276 (400) ..+|....+..-+.+ .+.-++.+|++++|+.++. ++.+.++++|||+..+ ....+.. T Consensus 236 ~k~~~~~~is~ell~--ds~~~~~~~i~~~l~~~~~-~~~~~~i~~g~g~~~~--------------------~~~~~~~ 292 (437) T protein:vir:10 236 KTYTGGYVFSQELIS--DSSYDWQAELQSRLIELRD-NTDDSLIITALTDGIK--------------------KTTSTYL 292 (437) T ss_pred hheeeehhhhHHHHh--hhHHHHHHHHHHHHHHHHH-HHHHHHHhhhhccccc--------------------ccccccc Confidence 999988777444433 2224689999999999999 7999999999987321 1122444 Q ss_pred HHHHHHhhceecccc--cceEEEEecchhHHHHhhhhhccccccceeecCCcceeehhhccccceec--chhh------h Q lcl|Aclame:pro 277 ADAIEEAVDFVRPTA--GRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKA------L 346 (400) Q Consensus 277 ~dal~Eald~~~~~~--~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~------l 346 (400) .+++..++...-+.+ +..++++|+.++.+|+. |+|.||.|.+.-++.+-..+| |..- . T Consensus 293 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~------------lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~ 360 (437) T protein:vir:10 293 LGDLKKVLNVTLKPQDSAAASIVMSQSAYNLFDM------------ATDAMGRPLLQPNVTAATGYTLLGKTVVIVDDKL 360 (437) T ss_pred hhhHHHHHHhhhhhhhhcCCEEEEcHHHHHHHHH------------hhccCCCeeeccCccCCCCcccccceeEEecccc Confidence 556666654332222 33478999999999887 899999998855554433222 5322 2 Q ss_pred chhhhhccccee--chhhceec-cceeee--------ecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 347 KPTVLVDQKYHI--DMQDLTKV-DAFEWK--------TNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 347 ~~t~~vd~k~~~--~~~~~~~~-~~~~~~--------~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) .|++..+....+ |++.+... +.-++. +....+.|..--.|-|-.=+|.+.||.. T Consensus 361 ~~~~~~~~~~~~~gd~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~ 425 (437) T protein:vir:10 361 FPSASAGDVNIVVAPLKKAVINFKLTEITGQFQDTYDIWYKQLGIFLRQNVVQASKDLIVNLTGK 425 (437) T ss_pred cCCcCCCceEEEEeeccccEEEEeeeceEEEEecccccccceeeEEEEEccEEecccceEEEEee Confidence 244433333334 66653322 211111 1111122221111111111111222211 No 29 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=99.58 E-value=2.2e-15 Score=100.90 Aligned_cols=360 Identities=12% Similarity=0.110 Sum_probs=183.6 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhc--chhhhhhhhhhHHHHHHHHHHHHHHHHHHHHH-------HHHhhHhh-- Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISG--FEVKNAIEDLPKVQELEKTLSENSIEIIKIEN-------ELNAQEEK-- 76 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~--~~~~~~~~~~skieElektis~l~aEi~K~en-------El~~~kEk-- 76 (400) |+...|.++.++| .+.+..+..+++. ++. ++..+++++++.++.++.+|...+. ......+. T Consensus 1 M~l~eL~e~r~~l---~~e~~~l~~k~~~~~~t~----e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~ 73 (409) T protein:vir:45 1 MKLHELKQKRNTI---ATDMRALNEKIGDNAWTE----EQRTEWNKAKSELEALDERIAREEELRRQDQAYIESNEEEQR 73 (409) T ss_pred CCHHHHHHHHHHH---HHHHHHHHHHhhcCCCCH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhc Confidence 6655555554443 3334333333322 211 1234556666666666666543221 11111111 Q ss_pred hc--chhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCcccccee Q lcl|Aclame:pro 77 PK--GKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFH 154 (400) Q Consensus 77 ~K--~k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fh 154 (400) ++ ......+--+.+++...|.+- +......+.+++..+.-........+--.++|..+...|-+.+.++.++++..+ T Consensus 74 ~~~~~~~~~~~~~~~~~a~~~~l~~-~~~~~~~~e~~~~~~~~a~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~ 152 (409) T protein:vir:45 74 QNLDPENNSQQDEKRAQVFDKWMRH-GASELTSEERKALRELRAQGVAQDEKGGYTVPETFLAKVVEKMKSYGGIASVAQ 152 (409) T ss_pred ccCCCCCcchhhHHHHHHHHHHHHh-hhhhccHHHHHHHHHHhhccCccCcCCceeccHhHHHHHHHHHHhhhhhhhhce Confidence 11 011111111222333333111 122233455666554322222222222237999999999999999999988777 Q ss_pred eecccceeeEEee--cc-ccccceecccchhhhhhhhhhhhhccHHHHHH-HHHHH-HHHHhhcCchhHHHHHHHHHHHH Q lcl|Aclame:pro 155 VTNVGALLVSRSF--DS-ANEAQVHKDGQTKTEQAATLTIDTLEPVMVYK-LQSLA-ERVKRLQMSYSELYNLIVAELTQ 229 (400) Q Consensus 155 V~n~~~~a~~i~l--~n-a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYk-kq~La-d~~k~l~g~ygalvnyvm~ELaq 229 (400) |.+...+..+.-+ .. ...+.-+--|.++.++...|...++.|..+|- .-.+. +++.+ +.-++.+|++++|+. T Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~f~~~~l~~~k~~~~~i~is~ell~d---s~~~l~~~i~~~la~ 229 (409) T protein:vir:45 153 ILTTSDGRTMEWATADGTSEVGVLLGENEEAGEEDTDFGMGSLGALKMTSKIIRVSNELLQD---SAIDMEAYLARRIAE 229 (409) T ss_pred eeecCCCceEEEEeeccCccccccccccccccccccccceeeeeeeeeeeeehhhhHHHHhc---cHHHHHHHHHHHHHH Confidence 6665443322222 22 24455566778889999999999999876653 23342 33333 224789999999999 Q ss_pred HHHHHHHhcceeeccCCCccccch-hhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceecc---cccceEEEEecchhHH Q lcl|Aclame:pro 230 AIVNKIVDLALVEGDGTNGFKSID-KEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRP---TAGRRYLIVKAEDRKA 305 (400) Q Consensus 230 ~fI~Rav~rAvv~gDG~~~t~~~~-~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~---~~~~~~l~~~~~d~~a 305 (400) ++. ++++.++++|||+.....+- +-..+.. ...+..+++...|++...+.-..| ..+.-++++++.+..+ T Consensus 230 a~~-~~~~~a~l~G~G~~~~~~p~Gil~~~~~-----~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a~~~~~~n~~~~~~ 303 (409) T protein:vir:45 230 RIG-RGEARYLIQGTGAGTPKQPKGLAASVTG-----TTQTAAANAVKWQEILALKHSIDPAYRRGPKFRLAFNDNTLKL 303 (409) T ss_pred HHH-HHHHHHhhccCCCCCccccceeeecccc-----ccccccccccchHHHHHHHHhhhhhhccCCeEEEEECHHHHHH Confidence 999 79999999999985321111 1111110 112333445566777776643333 3344456779999888 Q ss_pred HHhhhhhccccccceeecCCcceeehhhccccceec--chhhhchhhhhcc--------cc--eechhhceecccee--- Q lcl|Aclame:pro 306 LLDELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKALKPTVLVDQ--------KY--HIDMQDLTKVDAFE--- 370 (400) Q Consensus 306 ~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l~~t~~vd~--------k~--~~~~~~~~~~~~~~--- 370 (400) |+. |+|.+|++.+.-++.+-.-.| |. |-++.|. +. .-|+++|.-.+..+ T Consensus 304 l~~------------lkd~~G~~i~~~~~~~~~~~~l~G~----PV~~~~~~p~~~~~~~~i~~Gd~~~~~i~~~~~~~~ 367 (409) T protein:vir:45 304 ISE------------MEDGQGRPLWLPDIVGVAPASVLNV----PYVIDQEIDDIGAGKKFMFCGDFDRFIIRRVRYMIL 367 (409) T ss_pred HHH------------hhcCCCceeeccCcCCCCCceecce----eeEEecCcCCccCCccEEEEeehhhhheeeccceEE Confidence 877 889999987643332221111 42 2222221 11 12565554332211 Q ss_pred -----eeecCce--EEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 371 -----WKTNSNM--ILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 371 -----~~~~~~~--ilve~~~~~~~~~~~~~~~~~~~ 400 (400) .-|..|+ |.++..-.|.+---+|-.++++- T Consensus 368 ~~~~d~~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~k 404 (409) T protein:vir:45 368 KRLVERYAEYDQTGFLAFHRFDCILEDTSAIKALVGK 404 (409) T ss_pred EEeecccccCCcEEEEEEEEeccEeechhheEEEEec Confidence 1233344 33444444554444444455553 No 30 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=99.58 E-value=1.5e-16 Score=107.35 Aligned_cols=364 Identities=12% Similarity=0.071 Sum_probs=197.9 Q ss_pred cchhhHHHH-HHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHh-----hhcchh Q lcl|Aclame:pro 8 MNKPDLIEK-QNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE-----KPKGKD 81 (400) Q Consensus 8 ~~k~~~eek-q~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kE-----k~K~k~ 81 (400) |. .+.++ ++++.++++++..+-.+...-. +.-.++..++++++..+.+++++|+..+..+..... ....+. T Consensus 1 m~--~l~~~l~~~~~~~~~~~~~~~e~~~~~~-~~~~e~~~~~~~l~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~ 77 (390) T protein:vir:81 1 MT--DITSKLEATLANVTDSLRAFGERAVRDG-ELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVS 77 (390) T ss_pred Ch--HHHHHHHHHHHHHHHHHHHHHHHHHhhc-CcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Confidence 22 12222 2234444443333222111110 011222467888888888888888765544443321 111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHccCC--hhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeeccc Q lcl|Aclame:pro 82 KMTNFIESQNAVTEFFDVLKKNSGK--SEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVG 159 (400) Q Consensus 82 emtEfLkTkqA~~dya~ll~~nqg~--ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~ 159 (400) +.+......+...|+.......+. -+++..+..... ....+-..++|..++..|-..++++.++++...+...+ T Consensus 78 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~ 153 (390) T protein:vir:81 78 -VGDMFVASEQFQASAGRWNDRSARATMNIKAALNTAST---DAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTD 153 (390) T ss_pred -chhhhhhhHHHHHHHHHHhhhhhhhhhHHHHHHHhhcc---ccccCCcceechhhhHHHHHHHhhhhhhhhhcceeecc Confidence 122233334444555444443332 233333322111 00111112688888888888899888888765655444 Q ss_pred ceeeEEeecc--ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 160 ALLVSRSFDS--ANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVD 237 (400) Q Consensus 160 ~~a~~i~l~n--a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~ 237 (400) .....+-..+ ...+..+--|.++.+...+|...++.|..++....+.+-+-+ .+ .++.+|++++|+.++- |+++ T Consensus 154 ~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~--d~-~~~~~~i~~~l~~~~~-~~~d 229 (390) T protein:vir:81 154 SALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILS--DA-PQLASYMNNRLIRGLK-VKED 229 (390) T ss_pred CCceEEEEEecCCcceeeecCCcccccccceeeEEEEeeeEEEEeehhhHHHHH--hH-HHHHHHHHHHHHHHHH-HHHH Confidence 4433333332 345555677889999999999999999998888777444333 22 4689999999999999 7999 Q ss_pred cceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceecccc-cceEEEEecchhHHHHhhhhhcccc Q lcl|Aclame:pro 238 LALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTA-GRRYLIVKAEDRKALLDELRQATAN 316 (400) Q Consensus 238 rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~~-~~~~l~~~~~d~~a~~~~~~~~~~~ 316 (400) +++++|||++.. ..-+.... -....+...+.+...|.+..++.-+.|.. .-..+++|+.++.+|+. T Consensus 230 ~a~l~G~g~~~~-~~Gi~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~-------- 296 (390) T protein:vir:81 230 AEILRGTGANDG-LLGLIPQA----TTYAAPTTIAGATRVDQLRLAMLQASLAEYNPSGIVINPIDWAAIEL-------- 296 (390) T ss_pred HHHHhcCCCCCc-ccceeecc----cccccccccccchhHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHH-------- Confidence 999999998531 00011110 01112233444566788888774444433 23468999999999987 Q ss_pred ccceeecCCcceeehhhccccc-eecchhhhchhhhhccc--ceechhh-ceecc-----------ceeeeecCceEEEE Q lcl|Aclame:pro 317 ANVRIKNDDTEIASEVGVDEII-VYTGSKALKPTVLVDQK--YHIDMQD-LTKVD-----------AFEWKTNSNMILVE 381 (400) Q Consensus 317 an~~lk~~d~~~~~~v~v~~~~-~~tg~k~l~~t~~vd~k--~~~~~~~-~~~~~-----------~~~~~~~~~~ilve 381 (400) ++|.+|.+.+.-..+... ..-|-. ++.+..++.. +-.|++. |+-.+ .-.|..++=.+.++ T Consensus 297 ----lkd~~G~~l~~~~~~~~~~~l~G~p-v~~~~~~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~v~~r~~ 371 (390) T protein:vir:81 297 ----AKDANNQYLIGNARGTLTPTLWGLP-VVATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVGEDFQRNMITVLAE 371 (390) T ss_pred ----hhcCCCceeecCcccccCceeccee-eEEcCCCCCCcEEEEehhceEEEEEecceEEEEecccchhhcCcEEEEEE Confidence 888888887653322211 001332 1112222211 2334443 22111 11223333345677 Q ss_pred ecccccceeeccceeEeeC Q lcl|Aclame:pro 382 TLTSGHVETYNAGAVITVS 400 (400) Q Consensus 382 ~~~~~~~~~~~~~~~~~~~ 400 (400) .--.|.|-.-+|-+++|++ T Consensus 372 ~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 372 ERLALVVYRPEALISGSFA 390 (390) T ss_pred EeeccEEecccceEEEEeC Confidence 7777877777777788888 No 31 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=99.58 E-value=1.2e-15 Score=102.29 Aligned_cols=378 Identities=15% Similarity=0.175 Sum_probs=181.1 Q ss_pred Cccccccc------chhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhh----hhhhhH----HHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MRISKRNM------NKPDLIEKQNRLAELKENNVSLKSQISGFEVKNA----IEDLPK----VQELEKTLSENSIEIIKI 66 (400) Q Consensus 1 ~~~s~~~~------~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~----~~~~sk----ieElektis~l~aEi~K~ 66 (400) |-|....+ .++.+.|.++++.++++...+++.+++....+.+ ..++.+ ..+++..+++++.++... T Consensus 1 ~~~~~~~~~~el~~~~~~l~el~~~~~el~~~~~el~~~~e~ak~eee~~~l~~ei~~le~e~~~l~~~~~~le~~~~~~ 80 (425) T protein:vir:95 1 MALRQLMLTKKIEQRKAALDELVKREQELQAKAAELEQAIEEAQTEEEVSAVEEEVAKLEDERNELNEKKSKLEGEIAQL 80 (425) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 65555422 2333444455555555555555555443221111 111222 222333333333333334 Q ss_pred HHHHHhhHhhhcchhHHHHHHHHHH------HHHHHHHHHHHccC--ChhHHHHHHHHHHhCccchhhhHhhcchhHHHH Q lcl|Aclame:pro 67 ENELNAQEEKPKGKDKMTNFIESQN------AVTEFFDVLKKNSG--KSEIKNAWSAKLAENGVTITDTTFQLPRKLVES 138 (400) Q Consensus 67 enEl~~~kEk~K~k~emtEfLkTkq------A~~dya~ll~~nqg--~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~A 138 (400) +++++.+... +.+.+..++....+ ....++..+ ..+. ...--..+...+.+.+ .+.+...++|.-+... T Consensus 81 ~~~l~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~-~~~~gg~~vP~~~~~~ 157 (425) T protein:vir:95 81 EDELEQINSK-QPSNQSRQKMQGSKGDVVEMNRLQVREML-KTGEYYKRSEVVEFYEKFRNLR-AVAGGELTIPEVVVNR 157 (425) T ss_pred HHHHHHhhhh-ccchhhhhhhhhhhhhHHHHHHHHHHHHH-hhhhhhhhhHHHHHHHHHHhhc-ccccCceeccHHHHHH Confidence 4444433221 11222222222111 111122222 1111 1111222333333221 1233444789999999 Q ss_pred HHHHHHhhCccccceeeecccceeeEEeecc-ccccceecccchhhhhhh-hhhhhhccHHHHHHHHHHHHHHHhhcCch Q lcl|Aclame:pro 139 INTALLNTNPVFKVFHVTNVGALLVSRSFDS-ANEAQVHKDGQTKTEQAA-TLTIDTLEPVMVYKLQSLAERVKRLQMSY 216 (400) Q Consensus 139 Ie~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~ga~Kk~q~~-~le~~ti~p~~VYkkq~Lad~~k~l~g~y 216 (400) |-+.+.++.++++...+.+.... ..+-... ...+.-+.-|.+..++.. +|...++.|..++.+..+.+.+- +.+. T Consensus 158 Ii~~l~~~~~i~~~~~~~~~~g~-~~ip~~~~~~~a~~v~E~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell--~ds~ 234 (425) T protein:vir:95 158 IMDIMGDYTTLYPLVDKIRVKGT-TRILVDTDTSPATWIEQSGALPTGDVGTIASIDFDGFKVGKVTFVDNYLL--QDSI 234 (425) T ss_pred HHHHHHhhhhHHHhhceeecCce-eEEEEecCCccccccccccccccccccccceeeeeheeeeeeehhhHHHH--hccH Confidence 99999999999997776665433 2332222 345555667777777765 79999999998888777744433 3333 Q ss_pred hHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceec---ccccc Q lcl|Aclame:pro 217 SELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVR---PTAGR 293 (400) Q Consensus 217 galvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~---~~~~~ 293 (400) .++.+||.++|+.++. ++++.+++.|||+......-+..++.... .++..+.+...+.+...+.-+. .-.+. T Consensus 235 ~~l~~~i~~~l~~~i~-~~~d~~il~G~G~~~~~p~Gil~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 309 (425) T protein:vir:95 235 INLDDYVTKKIARAIA-KALDLAIVKGTGAANKQPLGIIPSLPPEN----QVTVEADNNLLKNLVKQIGLIDTGDDSVGE 309 (425) T ss_pred HHHHHHHHHHHHHHHH-HHHHHHhhccCCCCccccceeeccccccc----ccccccccchHHHHHHHHHhhhhhccccCc Confidence 5789999999999999 79999999999974211111111111111 1223345556667766653222 22345 Q ss_pred eEEEEecchhHHHHhhhhhccccccceeecCCcceeeh--hhccccceecchhhhchhhhhcccce--echhhceeccce Q lcl|Aclame:pro 294 RYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASE--VGVDEIIVYTGSKALKPTVLVDQKYH--IDMQDLTKVDAF 369 (400) Q Consensus 294 ~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~--v~v~~~~~~tg~k~l~~t~~vd~k~~--~~~~~~~~~~~~ 369 (400) .+.++++.|....+..|+ .++|.+|+|.+. .+-..+++ |.. .+.+..++...- -|++.|.-.+.- T Consensus 310 ~~~v~~~~~~~~~l~~l~--------~~kd~~g~~i~~~~~~~~~~l~--G~p-vv~~~~~~~~~i~~Gd~~~~~~~~~~ 378 (425) T protein:vir:95 310 IVAVMKRSTYYNRLVEFS--------IQVDSNGNVVGKLPNLRTPDLL--GLR-VVFNNFLDDDTVLFGEFEQYTLVERE 378 (425) T ss_pred eEEEEeChHHHHHHHHHH--------hhcCCCCceeeccCCCCCcccc--cee-eEEcCcCCCccEEEEecccEEEEeec Confidence 567888888544332222 278999998753 23222222 532 233333333322 255554333222 Q ss_pred eee--------ecCceE--EEEecccccceeeccceeEeeC Q lcl|Aclame:pro 370 EWK--------TNSNMI--LVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 370 ~~~--------~~~~~i--lve~~~~~~~~~~~~~~~~~~~ 400 (400) ++. |..|++ .+...-.|-+-.-+|-.+++|. T Consensus 379 ~~~i~~~~~~~f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~ 419 (425) T protein:vir:95 379 NITIDSSTHVKFTEDQTAFRGKGRFDGKPVKPEAFVLVTIT 419 (425) T ss_pred ceEEEeecccccccCceEEEEEEeeCcEeecccceEEEEec Confidence 111 122221 1222222222222222333333 No 32 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=99.58 E-value=5.8e-16 Score=104.13 Aligned_cols=340 Identities=12% Similarity=0.065 Sum_probs=184.4 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHH--HH-----HhhHhhhcch Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIEN--EL-----NAQEEKPKGK 80 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~en--El-----~~~kEk~K~k 80 (400) |+|. |.|.+++++.+++++..+-. . ++..+++.+...+..|+++|+..+. +. ....+..+.. T Consensus 1 M~k~-l~el~~~~~~~~~e~~~~~~------~----~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 69 (392) T protein:vir:10 1 MSKE-LRELLAKLEGKKEEVRSLMG------E----DKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRN 69 (392) T ss_pred CcHH-HHHHHHHHHHHHHHHHHHhh------H----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccC Confidence 7765 67777777777776655421 1 1224556666666677777643221 11 1111111111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccc---hhhhHhhcchhHHHHHHHHHHhhCccccceeeec Q lcl|Aclame:pro 81 DKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVT---ITDTTFQLPRKLVESINTALLNTNPVFKVFHVTN 157 (400) Q Consensus 81 ~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~---~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n 157 (400) ....-+.+.+ +++.+...+.+.+.+.........+... +.+--.++|..+..-|-+.+.++.++++...+.+ T Consensus 70 --~~~~~~~~~~---~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~ 144 (392) T protein:vir:10 70 --VDGEMEYRDV---FMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEP 144 (392) T ss_pred --ccchHHHHHH---HHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeee Confidence 1111122222 3344544444444444444444443332 2233337899998889999998888887667777 Q ss_pred cccee--eEEeecc-ccccceecccchhhhh-hhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 158 VGALL--VSRSFDS-ANEAQVHKDGQTKTEQ-AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVN 233 (400) Q Consensus 158 ~~~~a--~~i~l~n-a~~a~GHk~ga~Kk~q-~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~ 233 (400) .+... ..+...+ ...+..+.-|+++.+. ..+|...++.|..+|....+-+.+.+ .+.-++.+|++++|+..+- T Consensus 145 ~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~l~~~i~- 221 (392) T protein:vir:10 145 VRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQ--DSDQNILKYVTKWLGKKSK- 221 (392) T ss_pred ccCCceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHh--hhHHHHHHHHHHHHHHHHH- Confidence 65443 2233222 3344445667777765 57999999999999988888444332 2224689999999999998 Q ss_pred HHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceecccc--cceEEEEecchhHHHHhhhh Q lcl|Aclame:pro 234 KIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTA--GRRYLIVKAEDRKALLDELR 311 (400) Q Consensus 234 Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~~--~~~~l~~~~~d~~a~~~~~~ 311 (400) +..+.+++.|+|+... +.....|++..++.+.-+.+ +...+++|+.++.+|+- T Consensus 222 ~~~d~~~~~g~g~~~~----------------------~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~--- 276 (392) T protein:vir:10 222 VTRNVLILGVIEKLTK----------------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDK--- 276 (392) T ss_pred HHHHHHHhhccccccc----------------------cCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHH--- Confidence 7999999999997431 11233455555553322222 34568999999999987 Q ss_pred hccccccceeecCCcceeehhhccccceec--chhhhc-------h--hhhhccccee--chhhceeccc-----e---- Q lcl|Aclame:pro 312 QATANANVRIKNDDTEIASEVGVDEIIVYT--GSKALK-------P--TVLVDQKYHI--DMQDLTKVDA-----F---- 369 (400) Q Consensus 312 ~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l~-------~--t~~vd~k~~~--~~~~~~~~~~-----~---- 369 (400) |||.+|++.+.-.+..-.-+| |....+ + .+..+....+ |++.+...+- + T Consensus 277 ---------lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~ 347 (392) T protein:vir:10 277 ---------LKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTD 347 (392) T ss_pred ---------hhccCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEec Confidence 889999988755443322222 332111 1 1111111111 3343222111 0 Q ss_pred --eeeecCce--EEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 370 --EWKTNSNM--ILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 370 --~~~~~~~~--ilve~~~~~~~~~~~~~~~~~~~ 400 (400) +..|.+|+ +.++.--.|-|---++-..+++. T Consensus 348 ~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~ 382 (392) T protein:vir:10 348 VGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEID 382 (392) T ss_pred cccchhhcCceEEEEEEeeccEEecccceEEEEec Confidence 11122333 33444334433333333344443 No 33 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=99.58 E-value=5.8e-16 Score=104.13 Aligned_cols=340 Identities=12% Similarity=0.065 Sum_probs=184.4 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHH--HH-----HhhHhhhcch Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIEN--EL-----NAQEEKPKGK 80 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~en--El-----~~~kEk~K~k 80 (400) |+|. |.|.+++++.+++++..+-. . ++..+++.+...+..|+++|+..+. +. ....+..+.. T Consensus 1 M~k~-l~el~~~~~~~~~e~~~~~~------~----~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 69 (392) T protein:vir:10 1 MSKE-LRELLAKLEGKKEEVRSLMG------E----DKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRN 69 (392) T ss_pred CcHH-HHHHHHHHHHHHHHHHHHhh------H----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccC Confidence 7765 67777777777776655421 1 1224556666666677777643221 11 1111111111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccc---hhhhHhhcchhHHHHHHHHHHhhCccccceeeec Q lcl|Aclame:pro 81 DKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVT---ITDTTFQLPRKLVESINTALLNTNPVFKVFHVTN 157 (400) Q Consensus 81 ~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~---~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n 157 (400) ....-+.+.+ +++.+...+.+.+.+.........+... +.+--.++|..+..-|-+.+.++.++++...+.+ T Consensus 70 --~~~~~~~~~~---~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~ 144 (392) T protein:vir:10 70 --VDGEMEYRDV---FMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEP 144 (392) T ss_pred --ccchHHHHHH---HHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeee Confidence 1111122222 3344544444444444444444443332 2233337899998889999998888887667777 Q ss_pred cccee--eEEeecc-ccccceecccchhhhh-hhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 158 VGALL--VSRSFDS-ANEAQVHKDGQTKTEQ-AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVN 233 (400) Q Consensus 158 ~~~~a--~~i~l~n-a~~a~GHk~ga~Kk~q-~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~ 233 (400) .+... ..+...+ ...+..+.-|+++.+. ..+|...++.|..+|....+-+.+.+ .+.-++.+|++++|+..+- T Consensus 145 ~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~l~~~i~- 221 (392) T protein:vir:10 145 VRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQ--DSDQNILKYVTKWLGKKSK- 221 (392) T ss_pred ccCCceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHh--hhHHHHHHHHHHHHHHHHH- Confidence 65443 2233222 3344445667777765 57999999999999988888444332 2224689999999999998 Q ss_pred HHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceecccc--cceEEEEecchhHHHHhhhh Q lcl|Aclame:pro 234 KIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTA--GRRYLIVKAEDRKALLDELR 311 (400) Q Consensus 234 Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~~--~~~~l~~~~~d~~a~~~~~~ 311 (400) +..+.+++.|+|+... +.....|++..++.+.-+.+ +...+++|+.++.+|+- T Consensus 222 ~~~d~~~~~g~g~~~~----------------------~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~--- 276 (392) T protein:vir:10 222 VTRNVLILGVIEKLTK----------------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDK--- 276 (392) T ss_pred HHHHHHHhhccccccc----------------------cCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHH--- Confidence 7999999999997431 11233455555553322222 34568999999999987 Q ss_pred hccccccceeecCCcceeehhhccccceec--chhhhc-------h--hhhhccccee--chhhceeccc-----e---- Q lcl|Aclame:pro 312 QATANANVRIKNDDTEIASEVGVDEIIVYT--GSKALK-------P--TVLVDQKYHI--DMQDLTKVDA-----F---- 369 (400) Q Consensus 312 ~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l~-------~--t~~vd~k~~~--~~~~~~~~~~-----~---- 369 (400) |||.+|++.+.-.+..-.-+| |....+ + .+..+....+ |++.+...+- + T Consensus 277 ---------lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~ 347 (392) T protein:vir:10 277 ---------LKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTD 347 (392) T ss_pred ---------hhccCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEec Confidence 889999988755443322222 332111 1 1111111111 3343222111 0 Q ss_pred --eeeecCce--EEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 370 --EWKTNSNM--ILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 370 --~~~~~~~~--ilve~~~~~~~~~~~~~~~~~~~ 400 (400) +..|.+|+ +.++.--.|-|---++-..+++. T Consensus 348 ~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~ 382 (392) T protein:vir:10 348 VGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEID 382 (392) T ss_pred cccchhhcCceEEEEEEeeccEEecccceEEEEec Confidence 11122333 33444334433333333344443 No 34 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=99.58 E-value=5.8e-16 Score=104.13 Aligned_cols=340 Identities=12% Similarity=0.065 Sum_probs=184.4 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHH--HH-----HhhHhhhcch Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIEN--EL-----NAQEEKPKGK 80 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~en--El-----~~~kEk~K~k 80 (400) |+|. |.|.+++++.+++++..+-. . ++..+++.+...+..|+++|+..+. +. ....+..+.. T Consensus 1 M~k~-l~el~~~~~~~~~e~~~~~~------~----~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 69 (392) T protein:vir:10 1 MSKE-LRELLAKLEGKKEEVRSLMG------E----DKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRN 69 (392) T ss_pred CcHH-HHHHHHHHHHHHHHHHHHhh------H----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccC Confidence 7765 67777777777776655421 1 1224556666666677777643221 11 1111111111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccc---hhhhHhhcchhHHHHHHHHHHhhCccccceeeec Q lcl|Aclame:pro 81 DKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVT---ITDTTFQLPRKLVESINTALLNTNPVFKVFHVTN 157 (400) Q Consensus 81 ~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~---~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n 157 (400) ....-+.+.+ +++.+...+.+.+.+.........+... +.+--.++|..+..-|-+.+.++.++++...+.+ T Consensus 70 --~~~~~~~~~~---~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~ 144 (392) T protein:vir:10 70 --VDGEMEYRDV---FMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEP 144 (392) T ss_pred --ccchHHHHHH---HHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeee Confidence 1111122222 3344544444444444444444443332 2233337899998889999998888887667777 Q ss_pred cccee--eEEeecc-ccccceecccchhhhh-hhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 158 VGALL--VSRSFDS-ANEAQVHKDGQTKTEQ-AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVN 233 (400) Q Consensus 158 ~~~~a--~~i~l~n-a~~a~GHk~ga~Kk~q-~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~ 233 (400) .+... ..+...+ ...+..+.-|+++.+. ..+|...++.|..+|....+-+.+.+ .+.-++.+|++++|+..+- T Consensus 145 ~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~l~~~i~- 221 (392) T protein:vir:10 145 VRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQ--DSDQNILKYVTKWLGKKSK- 221 (392) T ss_pred ccCCceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHh--hhHHHHHHHHHHHHHHHHH- Confidence 65443 2233222 3344445667777765 57999999999999988888444332 2224689999999999998 Q ss_pred HHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceecccc--cceEEEEecchhHHHHhhhh Q lcl|Aclame:pro 234 KIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTA--GRRYLIVKAEDRKALLDELR 311 (400) Q Consensus 234 Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~~--~~~~l~~~~~d~~a~~~~~~ 311 (400) +..+.+++.|+|+... +.....|++..++.+.-+.+ +...+++|+.++.+|+- T Consensus 222 ~~~d~~~~~g~g~~~~----------------------~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~--- 276 (392) T protein:vir:10 222 VTRNVLILGVIEKLTK----------------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDK--- 276 (392) T ss_pred HHHHHHHhhccccccc----------------------cCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHH--- Confidence 7999999999997431 11233455555553322222 34568999999999987 Q ss_pred hccccccceeecCCcceeehhhccccceec--chhhhc-------h--hhhhccccee--chhhceeccc-----e---- Q lcl|Aclame:pro 312 QATANANVRIKNDDTEIASEVGVDEIIVYT--GSKALK-------P--TVLVDQKYHI--DMQDLTKVDA-----F---- 369 (400) Q Consensus 312 ~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l~-------~--t~~vd~k~~~--~~~~~~~~~~-----~---- 369 (400) |||.+|++.+.-.+..-.-+| |....+ + .+..+....+ |++.+...+- + T Consensus 277 ---------lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~ 347 (392) T protein:vir:10 277 ---------LKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTD 347 (392) T ss_pred ---------hhccCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEec Confidence 889999988755443322222 332111 1 1111111111 3343222111 0 Q ss_pred --eeeecCce--EEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 370 --EWKTNSNM--ILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 370 --~~~~~~~~--ilve~~~~~~~~~~~~~~~~~~~ 400 (400) +..|.+|+ +.++.--.|-|---++-..+++. T Consensus 348 ~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~ 382 (392) T protein:vir:10 348 VGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEID 382 (392) T ss_pred cccchhhcCceEEEEEEeeccEEecccceEEEEec Confidence 11122333 33444334433333333344443 No 35 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=99.58 E-value=5.8e-16 Score=104.13 Aligned_cols=340 Identities=12% Similarity=0.065 Sum_probs=184.4 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHH--HH-----HhhHhhhcch Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIEN--EL-----NAQEEKPKGK 80 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~en--El-----~~~kEk~K~k 80 (400) |+|. |.|.+++++.+++++..+-. . ++..+++.+...+..|+++|+..+. +. ....+..+.. T Consensus 1 M~k~-l~el~~~~~~~~~e~~~~~~------~----~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 69 (392) T protein:vir:10 1 MSKE-LRELLAKLEGKKEEVRSLMG------E----DKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNNGREVETRN 69 (392) T ss_pred CcHH-HHHHHHHHHHHHHHHHHHhh------H----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccC Confidence 7765 67777777777776655421 1 1224556666666677777643221 11 1111111111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccc---hhhhHhhcchhHHHHHHHHHHhhCccccceeeec Q lcl|Aclame:pro 81 DKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVT---ITDTTFQLPRKLVESINTALLNTNPVFKVFHVTN 157 (400) Q Consensus 81 ~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~---~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n 157 (400) ....-+.+.+ +++.+...+.+.+.+.........+... +.+--.++|..+..-|-+.+.++.++++...+.+ T Consensus 70 --~~~~~~~~~~---~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~~ 144 (392) T protein:vir:10 70 --VDGEMEYRDV---FMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQYVTVEP 144 (392) T ss_pred --ccchHHHHHH---HHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhhceeee Confidence 1111122222 3344544444444444444444443332 2233337899998889999998888887667777 Q ss_pred cccee--eEEeecc-ccccceecccchhhhh-hhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 158 VGALL--VSRSFDS-ANEAQVHKDGQTKTEQ-AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVN 233 (400) Q Consensus 158 ~~~~a--~~i~l~n-a~~a~GHk~ga~Kk~q-~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~ 233 (400) .+... ..+...+ ...+..+.-|+++.+. ..+|...++.|..+|....+-+.+.+ .+.-++.+|++++|+..+- T Consensus 145 ~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~l~~~i~- 221 (392) T protein:vir:10 145 VRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQ--DSDQNILKYVTKWLGKKSK- 221 (392) T ss_pred ccCCceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHh--hhHHHHHHHHHHHHHHHHH- Confidence 65443 2233222 3344445667777765 57999999999999988888444332 2224689999999999998 Q ss_pred HHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceecccc--cceEEEEecchhHHHHhhhh Q lcl|Aclame:pro 234 KIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTA--GRRYLIVKAEDRKALLDELR 311 (400) Q Consensus 234 Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~~--~~~~l~~~~~d~~a~~~~~~ 311 (400) +..+.+++.|+|+... +.....|++..++.+.-+.+ +...+++|+.++.+|+- T Consensus 222 ~~~d~~~~~g~g~~~~----------------------~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~--- 276 (392) T protein:vir:10 222 VTRNVLILGVIEKLTK----------------------QAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDK--- 276 (392) T ss_pred HHHHHHHhhccccccc----------------------cCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHH--- Confidence 7999999999997431 11233455555553322222 34568999999999987 Q ss_pred hccccccceeecCCcceeehhhccccceec--chhhhc-------h--hhhhccccee--chhhceeccc-----e---- Q lcl|Aclame:pro 312 QATANANVRIKNDDTEIASEVGVDEIIVYT--GSKALK-------P--TVLVDQKYHI--DMQDLTKVDA-----F---- 369 (400) Q Consensus 312 ~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l~-------~--t~~vd~k~~~--~~~~~~~~~~-----~---- 369 (400) |||.+|++.+.-.+..-.-+| |....+ + .+..+....+ |++.+...+- + T Consensus 277 ---------lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~ 347 (392) T protein:vir:10 277 ---------LKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTD 347 (392) T ss_pred ---------hhccCCCeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEec Confidence 889999988755443322222 332111 1 1111111111 3343222111 0 Q ss_pred --eeeecCce--EEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 370 --EWKTNSNM--ILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 370 --~~~~~~~~--ilve~~~~~~~~~~~~~~~~~~~ 400 (400) +..|.+|+ +.++.--.|-|---++-..+++. T Consensus 348 ~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~ 382 (392) T protein:vir:10 348 VGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEID 382 (392) T ss_pred cccchhhcCceEEEEEEeeccEEecccceEEEEec Confidence 11122333 33444334433333333344443 No 36 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=99.57 E-value=2.4e-15 Score=100.72 Aligned_cols=346 Identities=10% Similarity=0.014 Sum_probs=180.7 Q ss_pred CcccccccchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH-------h- Q lcl|Aclame:pro 1 MRISKRNMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELN-------A- 72 (400) Q Consensus 1 ~~~s~~~~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~-------~- 72 (400) |++++ . +.+..++++.+++++.++...- +..+++.+...++++..+|+..+.... . T Consensus 3 ~~m~k-----~-l~el~~~~~~~~~~~~~~~~~~----------~~ee~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~ 66 (397) T protein:vir:12 3 MQMSK-----K-EIALRQQFTEKKQQADKALQEG----------NTDEARALLDEVKQLKNQIELMTEGRSLDVPDLPGG 66 (397) T ss_pred CcHHH-----H-HHHHHHHHHHHHHHHHHHhhhh----------hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 43332 2 4444445555555544332211 112233333444444444432111111 1 Q ss_pred ---hHh---hhcchh--HHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCcc---chhhhHhhcchhHHHHHHH Q lcl|Aclame:pro 73 ---QEE---KPKGKD--KMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGV---TITDTTFQLPRKLVESINT 141 (400) Q Consensus 73 ---~kE---k~K~k~--emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV---~~qd~~eiLP~~iI~AIe~ 141 (400) ..+ ...... .....-.+..-...|++-+... ...+..+.|...+..+.. ..++--.++|..+...|-+ T Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~-~~~~~~~~~~~~~~~~a~~~~~~~~gg~lvP~~~~~~ii~ 145 (397) T protein:vir:12 67 VNFVPEQERNPEGQRSQGQGNEERQQQYSKAFLKGLRGK-RLTDEERDLLDSPEFRAMSGINDEDGGILIPEDIGRQIHE 145 (397) T ss_pred hhhhhhhhhhhcccccccchhhHHHHHHHHHHHHHHhcc-CCcHHHHHHHhhhhhhhccccccccCcccCchhHHHHHHH Confidence 000 000000 0000111111112222223322 223556666665555533 2233334789999999999 Q ss_pred HHHhhCccccceeeecccc--eeeEEeecc-ccccceecccchhhhh-hhhhhhhhccHHHHHHHHHHHHHHHhhcCchh Q lcl|Aclame:pro 142 ALLNTNPVFKVFHVTNVGA--LLVSRSFDS-ANEAQVHKDGQTKTEQ-AATLTIDTLEPVMVYKLQSLAERVKRLQMSYS 217 (400) Q Consensus 142 A~ed~d~vl~~fhV~n~~~--~a~~i~l~n-a~~a~GHk~ga~Kk~q-~~~le~~ti~p~~VYkkq~Lad~~k~l~g~yg 217 (400) .+.++.++++...+.+.+. +...+..++ ...+..|.-|.++++. ..+|...++.|..++....+.+.+.+-.+ - T Consensus 146 ~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~is~e~l~ds~--~ 223 (397) T protein:vir:12 146 FKRQFEPLEQYVTVEPVTTRSGTRLLEKNADMVPFSPVEELGNLPEIDQPRFTKVSYSIIDYGGIMTLSNSMLNDSD--Q 223 (397) T ss_pred hhhhhhhHHhhcceeeccCCceeEEEEEecCCcceeeecccccccccccccceeEEeeheeeEeeehhhHHHHhhch--H Confidence 9999999988666666553 333444333 3467777888888765 57999999999999888777554443222 4 Q ss_pred HHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhc--eecccccceE Q lcl|Aclame:pro 218 ELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVD--FVRPTAGRRY 295 (400) Q Consensus 218 alvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald--~~~~~~~~~~ 295 (400) ++.+|++++|+..+- |+++.+++.|||..-. .+....+++..++. ....-.+... T Consensus 224 ~l~~~i~~~l~~~~~-~~~d~~il~G~g~~~~----------------------~g~~~~~~i~~~~~~~l~~~~~~~a~ 280 (397) T protein:vir:12 224 AIMTYVAKWFAKKSV-VTRNNLILAAIASLKK----------------------VDIDGLDGIKKALNVTLDPMVAPGSI 280 (397) T ss_pred HHHHHHHHHHHHHHH-HHHHHHHHhccccccc----------------------cccccHHHHHHHHhhccchhhhCCCE Confidence 689999999999998 7999999999997421 11223455555552 2222233346 Q ss_pred EEEecchhHHHHhhhhhccccccceeecCCcceeehhhccccceec--chhhhc-----hhhhhccc--ceechhhceec Q lcl|Aclame:pro 296 LIVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKALK-----PTVLVDQK--YHIDMQDLTKV 366 (400) Q Consensus 296 l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l~-----~t~~vd~k--~~~~~~~~~~~ 366 (400) +++++.++.+|+. |+|.+|.+.+..++.+-+-+| |....+ |....+.. +--|++++... T Consensus 281 ~~~n~~~~~~L~~------------lkd~~G~~l~~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~ 348 (397) T protein:vir:12 281 VLTNQDGYDWLDT------------LKDGTGRYLLQPDPTNPTKKLLDGRPVVPFTNRVLKTQKGKAPLIIGNLKEAIVL 348 (397) T ss_pred EEEcHHHHHHHHH------------hhccCCceeecccccCCCCccccceeeEEecccccccCCCccEEEEEehhceEEE Confidence 8899999998887 899999988755443322222 432211 11111111 12255543321 Q ss_pred c-----ceeee------ecCce--EEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 367 D-----AFEWK------TNSNM--ILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 367 ~-----~~~~~------~~~~~--ilve~~~~~~~~~~~~~~~~~~~ 400 (400) . .+.|. |.+|+ +.++.--.|.+-.-++-.+++++ T Consensus 349 ~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~t 395 (397) T protein:vir:12 349 FDREQQSIASTDTGAGAFETNSTKVRGIEREDVRKWDEDAVVFGQIT 395 (397) T ss_pred EeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEEe Confidence 1 11111 22343 44444445554434444455666 No 37 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=99.57 E-value=1.7e-16 Score=106.98 Aligned_cols=347 Identities=12% Similarity=0.062 Sum_probs=173.2 Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHhhhcchhHHHHHHHHHHH Q lcl|Aclame:pro 13 LIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMTNFIESQNA 92 (400) Q Consensus 13 ~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kEk~K~k~emtEfLkTkqA 92 (400) +++-++.+.+++..+++++.+++....+. .....++++++..++++.+++...+.+....+.+..... ........ T Consensus 1 meeL~~~~~~~~~~~~e~~~~l~~~~~~~-~~~~e~~~~l~~ei~~~~~~~~~l~~~~~~~~~~~~~~~---~~~~~~~~ 76 (389) T protein:vir:10 1 MDKLQTLFNDVSAKCADLNAQLNAKLQDE-NASVDDFQKIKDDLTAAKARRDAINDQIKALEAEKPAEP---KTEPKDDG 76 (389) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhH-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh---hccccccc Confidence 33334444455555555555554432221 112244555555555555554333222222211000000 00000000 Q ss_pred HHHHHHHHHHccCChhHHHHHHHHHHhC--------ccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeE Q lcl|Aclame:pro 93 VTEFFDVLKKNSGKSEIKNAWSAKLAEN--------GVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVS 164 (400) Q Consensus 93 ~~dya~ll~~nqg~ke~k~AW~a~L~ek--------gV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~ 164 (400) ...-.. .........+++|.+.|... +.+..|-..++|..+...|-+.+.++.++++..++.+.+..... T Consensus 77 ~~~~~~--~~~~~~~~~~~~~~~~lr~~~~~~~~~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~ 154 (389) T protein:vir:10 77 SKKGTD--LSKKPIDAKKKAINDFIHSHGKVIDATSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGT 154 (389) T ss_pred cccccc--cchhHHHHHHHHHHHHhhcchhhhhhhcccccCCcceeehHHHHHHHHHHHHhhhhHHhhcceeeccCCeeE Confidence 000000 00000011122233322211 23334444579999999999999999999887777666544333 Q ss_pred Eee--ccccccceecccchhh-hhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhccee Q lcl|Aclame:pro 165 RSF--DSANEAQVHKDGQTKT-EQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALV 241 (400) Q Consensus 165 i~l--~na~~a~GHk~ga~Kk-~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv 241 (400) +.. .....+..+.-|.++. ....+|...++.|..++....+.+.+.+ .+.-++.+|++++|+..+. +..+.+++ T Consensus 155 ~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~la~~~~-~~~~~~i~ 231 (389) T protein:vir:10 155 YPILKRATDRFSSVAELAENPKLAEPEFNKVDWSVATYRGAIPLSEEAIA--DSAVDLTALVGQSIKEKSV-NTYNAMIA 231 (389) T ss_pred EEEEecCCCccccccccccccccccccceeeeeeheeeEeeehhhHHHHh--hhhHHHHHHHHHHHHHHHH-HHHHHHHh Confidence 322 2233333445555555 4678999999999999888777443332 2223679999999999999 79999999 Q ss_pred eccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceecccccceEEEEecchhHHHHhhhhhcccccccee Q lcl|Aclame:pro 242 EGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQATANANVRI 321 (400) Q Consensus 242 ~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~l 321 (400) .|+|...+ ...+.....|++...+...-+.+....+++|+.++.+|+- + T Consensus 232 ~g~~~~~~-------------------~~~~~~~~~d~l~~~~~~~~~~~~~a~~~~n~~~~~~L~~------------l 280 (389) T protein:vir:10 232 PVLQSFTA-------------------KKTTTDTLVDSLKHILNVDLDPAYSRALVVTQSLFNTLDT------------L 280 (389) T ss_pred hhhccccc-------------------ccccccccHHHHHHHHHhhhhhhhCcEEEecHHHHHHHHH------------h Confidence 99876321 1122334567777666544444556789999999999997 9 Q ss_pred ecCCcceeehhhccccceecchhhh--chhhhhccc-----------ceechhh---------ceeccceeeeecCceEE Q lcl|Aclame:pro 322 KNDDTEIASEVGVDEIIVYTGSKAL--KPTVLVDQK-----------YHIDMQD---------LTKVDAFEWKTNSNMIL 379 (400) Q Consensus 322 k~~d~~~~~~v~v~~~~~~tg~k~l--~~t~~vd~k-----------~~~~~~~---------~~~~~~~~~~~~~~~il 379 (400) +|.+|+|.+..++.+.+...|...| .|-+++|.. +--|++. ++---+..-.|.+++.. T Consensus 281 kd~~G~~i~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~ 360 (389) T protein:vir:10 281 KDKNGRYLLHDASDSITDGTAKGTILGVPVYVVGDTLLGSLAGDQKAFVGDLKRGVLFTDRQQVTLAWEDSKIYGKYLGA 360 (389) T ss_pred hccCCCeeeecCcccccccccccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEeeccccccceEEE Confidence 9999999886665543321111101 122222221 1113332 11111111112222222 Q ss_pred EEecccccceeeccceeEeeC Q lcl|Aclame:pro 380 VETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 380 ve~~~~~~~~~~~~~~~~~~~ 400 (400) + .--.|-|-.-+|...++++ T Consensus 361 ~-~r~d~~~~~~~a~~~~~~~ 380 (389) T protein:vir:10 361 A-FRFGVQKADSKAGYFVTNT 380 (389) T ss_pred E-EEeccEEecccceEEEEee Confidence 2 1222333223333344444 No 38 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=99.57 E-value=3.3e-16 Score=105.43 Aligned_cols=353 Identities=11% Similarity=0.065 Sum_probs=180.8 Q ss_pred CcccccccchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHh----- Q lcl|Aclame:pro 1 MRISKRNMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE----- 75 (400) Q Consensus 1 ~~~s~~~~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kE----- 75 (400) |-++ | +++|-.+++.+++.++..++.++...- ++......++++++..++++.+++++.+.++....+ T Consensus 1 m~~~---m---~l~el~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (408) T protein:vir:10 1 MGVK---L---TVNQLNEAWIASGDKVTDFNDQINMAL-NDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVN 73 (408) T ss_pred CCcc---c---cHHHHHHHHHHHHHHHHHHHHHHHHHh-hcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 4332 2 355555666666666666666554321 111111234444555555444444433333332211 Q ss_pred -hhcchhHHHHHHHHHHHHHHHHHHHHHcc-CChhHHHHHHHHHHhCc-cchhhhHhhcchhHHHHHHHHHHhhCccccc Q lcl|Aclame:pro 76 -KPKGKDKMTNFIESQNAVTEFFDVLKKNS-GKSEIKNAWSAKLAENG-VTITDTTFQLPRKLVESINTALLNTNPVFKV 152 (400) Q Consensus 76 -k~K~k~emtEfLkTkqA~~dya~ll~~nq-g~ke~k~AW~a~L~ekg-V~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~ 152 (400) ++..+.. .-...+.....|.+-++..- +..... +....+... -...|--..+|..+..-|-..++++.++++. T Consensus 74 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~ 149 (408) T protein:vir:10 74 MREEEKGP--LNKSENELKDKFVKDFVNMVRNPMAFM--NTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQY 149 (408) T ss_pred cccccccc--cccchhhhHHHHHHHHHHHhhcchhhh--hhhhhhhhhcccccCCceeccHhHHHHHHHHHHhhchhhhh Confidence 0000000 00111111122211111110 011111 111111111 1112222368999988899999999999887 Q ss_pred eeeecccceee--EEeecccc--ccceecccchhhhh-hhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHH Q lcl|Aclame:pro 153 FHVTNVGALLV--SRSFDSAN--EAQVHKDGQTKTEQ-AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAEL 227 (400) Q Consensus 153 fhV~n~~~~a~--~i~l~na~--~a~GHk~ga~Kk~q-~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~EL 227 (400) ..+.+.+.... .+...+.. .+..+--|+++.+. ..+|...++.+..++....+-+.+.+ .+.-++.+|++++| T Consensus 150 ~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~l 227 (408) T protein:vir:10 150 VRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLK--DTAENILAWLSSWI 227 (408) T ss_pred cceeeccCCcceEEEeeccccccceeeecCccccccccCcceeeEEeeeeeEEeeehhHHHHHh--hchHHHHHHHHHHH Confidence 66666543322 22223222 23233345566664 46899999999988888777554433 23357899999999 Q ss_pred HHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhce-ecccc-cceEEEEecchhHH Q lcl|Aclame:pro 228 TQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDF-VRPTA-GRRYLIVKAEDRKA 305 (400) Q Consensus 228 aq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~-~~~~~-~~~~l~~~~~d~~a 305 (400) +..+. +..+.+++.|||+.. ..+.....|++..++.. ..|.- +...+++++.++.+ T Consensus 228 ~~~~~-~~~~~~il~g~g~~~---------------------~~~~~~~~~~l~~~~~~~~~~~~~~~a~~v~n~~~~~~ 285 (408) T protein:vir:10 228 AKKVV-VTRNQAIIEVMKAAP---------------------KKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNK 285 (408) T ss_pred HHHHH-HHHHHHHhhcccccc---------------------cccccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHH Confidence 99999 799999999999732 12223345667666622 22211 22368899999999 Q ss_pred HHhhhhhccccccceeecCCcceeehhhccccceec--chhhh------chhhhhccc--ceechhhceec-ccee---- Q lcl|Aclame:pro 306 LLDELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKAL------KPTVLVDQK--YHIDMQDLTKV-DAFE---- 370 (400) Q Consensus 306 ~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l------~~t~~vd~k--~~~~~~~~~~~-~~~~---- 370 (400) |+. ++|.+|.+.+..++.+-..+| |.... .|++-.+.. +.-|++.+... +..+ T Consensus 286 l~~------------lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~ 353 (408) T protein:vir:10 286 LAL------------VKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLL 353 (408) T ss_pred HHH------------hhccCCceEeccCcCCCCCceecceeeEEecccccCccCCCceEEEEEehhccEEEEEecceEEE Confidence 888 899999998765544332222 44332 344333332 22366653322 2111 Q ss_pred --------eeecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 371 --------WKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 371 --------~~~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) |..++-.+.++.--.|-|--.++-.++++. T Consensus 354 ~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~ 391 (408) T protein:vir:10 354 PTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFS 391 (408) T ss_pred EcccccchhhcCceEEEEEEeeccEEeccccEEEEEee Confidence 222333444555555555444444455544 No 39 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=99.57 E-value=2.6e-16 Score=106.04 Aligned_cols=364 Identities=12% Similarity=0.056 Sum_probs=195.6 Q ss_pred ccccccchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHhh-----h Q lcl|Aclame:pro 3 ISKRNMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEK-----P 77 (400) Q Consensus 3 ~s~~~~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kEk-----~ 77 (400) +|+. ++ +=++++.++.+++..+-.+...-.. ...+...++++++.++.+++++|++.+.+++....+ . T Consensus 1 m~~~--~~----~l~~~~~~~~~~~~~~~e~~~~~~~-~~~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~ 73 (390) T protein:vir:97 1 MTDI--TA----KLEATLANVTDSLKAFGERAVRDGE-LNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDV 73 (390) T ss_pred ChHH--HH----HHHHHHHHHHHHHHHHHHHHHhhcC-CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 2211 11 1112233333333222222111110 112234678888899999999988777666644321 1 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccch--hhhHhhcchhHHHHHHHHHHhhCccccceee Q lcl|Aclame:pro 78 KGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTI--TDTTFQLPRKLVESINTALLNTNPVFKVFHV 155 (400) Q Consensus 78 K~k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~--qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV 155 (400) ..+. ..+..........|........+. ....+.+.+. .+.++ .+--.++|..++..|-..++++.++++.+.+ T Consensus 74 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~-~~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~ 149 (390) T protein:vir:97 74 QHVS-VGDMFVASEQFQASTGRWNDRSAR--ATMNIKAALN-TASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGS 149 (390) T ss_pred cccc-chhhhhhhHHHHHHHHHhhhhhhh--hhhHHHHHHH-hhhcccccccccccchhhhHHHHHHHhhhhhhHhhcce Confidence 1111 123333444445554444333332 1112222222 12211 2222378988889999999999998886665 Q ss_pred ecccceeeEEeecc--ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 156 TNVGALLVSRSFDS--ANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVN 233 (400) Q Consensus 156 ~n~~~~a~~i~l~n--a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~ 233 (400) ...+.....+-..+ ...+..+.-|+++.+...+|...++.|..++.+..+.+-+- +.+ .++.+|++++|+.++- T Consensus 150 ~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell--~ds-~~l~~~i~~~la~a~~- 225 (390) T protein:vir:97 150 GRTDSALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQIL--SDA-PQLASYMNNRLIRGLK- 225 (390) T ss_pred eeccCCceEEEEEecCCcceeeecCCccccccccceeEEEEeeeeEEEeehhhHHHH--HhH-HHHHHHHHHHHHHHHH- Confidence 55544433333332 23444456688999999999999999999888877744332 223 4689999999999999 Q ss_pred HHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceecccc-cceEEEEecchhHHHHhhhhh Q lcl|Aclame:pro 234 KIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTA-GRRYLIVKAEDRKALLDELRQ 312 (400) Q Consensus 234 Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~~-~~~~l~~~~~d~~a~~~~~~~ 312 (400) ++++++++.|||++.. ..-+.... -.++.++..+++...|.+..++.-+.+.. ....+++|+.++.+|+. T Consensus 226 ~~~d~a~l~G~g~~~~-p~Gi~~~~----~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~---- 296 (390) T protein:vir:97 226 VKEDAEILRGTGANDG-LLGLIPQA----TTYAAPTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIEL---- 296 (390) T ss_pred HHHHHHHhhcCCCCcc-ccceeecc----ccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHH---- Confidence 7999999999998541 00111111 11122233344556677877774444432 33468899999999887 Q ss_pred ccccccceeecCCcceeehhhcccc--ceecchhhhchhhhhcc--cceechhh-ceecc-----------ceeeeecCc Q lcl|Aclame:pro 313 ATANANVRIKNDDTEIASEVGVDEI--IVYTGSKALKPTVLVDQ--KYHIDMQD-LTKVD-----------AFEWKTNSN 376 (400) Q Consensus 313 ~~~~an~~lk~~d~~~~~~v~v~~~--~~~tg~k~l~~t~~vd~--k~~~~~~~-~~~~~-----------~~~~~~~~~ 376 (400) ++|.+|.+.++-+++.. ++. |-.. +.+..++. -+..|++. |.-.+ .-.|..++- T Consensus 297 --------lkd~~G~~l~~~~~~~~~~~l~-G~pV-~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~f~~~~~ 366 (390) T protein:vir:97 297 --------AKDANNQYLIGNARGTLTPTLW-GLPV-VATQAMAPGEFLVGAFDLAAQIFDQWDARVEIGYVNDDFQRNMV 366 (390) T ss_pred --------hhcCCCceeecCccCCCCceec-ceee-EEcCCCCCCcEEEEeccceEEEEEecceEEEEeecccccccCcE Confidence 78888888765433221 110 3211 11111111 12234432 22111 112233333 Q ss_pred eEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 377 MILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 377 ~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) .+.++.--.|.|-.-+|-++++++ T Consensus 367 ~~r~~~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 367 TVLAEERLALVVYRPEALITGSFA 390 (390) T ss_pred EEEEEEeeccEEeccccEEEEEeC Confidence 456666666766666666667777 No 40 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=99.57 E-value=6e-16 Score=104.05 Aligned_cols=340 Identities=12% Similarity=0.127 Sum_probs=171.4 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhh----hhhhhhHHHHHHHHHHHHHHHHHHHHHHHHh----hHhhhcc Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKN----AIEDLPKVQELEKTLSENSIEIIKIENELNA----QEEKPKG 79 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~----~~~~~skieElektis~l~aEi~K~enEl~~----~kEk~K~ 79 (400) |+...|.+++.++ .+.+.++..++.....+. ..+...+++.+++.+..+++.....+..... ....+.. T Consensus 1 M~~~eL~~~~~~~---~~~~~~l~e~~~~~~~~~~~~~~~~~~ee~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (395) T protein:vir:38 1 MNINQLKDAFDMA---GQKVQDLEDKRAQFAIDLGNDASSHSVDDINKLNASLKNAKMAQELAKSAYEDARANLNAEPVN 77 (395) T ss_pred CCHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccc Confidence 7766665555444 334444444444433221 1111223333333333333332211110000 0000000 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhC---ccch-hhhHhhcchhHHHHHHHHHHhhCccccceee Q lcl|Aclame:pro 80 KDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAEN---GVTI-TDTTFQLPRKLVESINTALLNTNPVFKVFHV 155 (400) Q Consensus 80 k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ek---gV~~-qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV 155 (400) ...+ .. +.... ..+++.+.|...+... +.++ .+--.++|..+..-|-+.+.++.++++..++ T Consensus 78 ~~~~-~~---~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~ 143 (395) T protein:vir:38 78 KKPL-PV---KDGKP----------DAQAMKNQFVKDFKNLVTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANV 143 (395) T ss_pred cccc-ch---hhhhH----------HHHHHHHHHHHHHHHHHhhccCccCCCceecchhHhhHHHHHHHhhcchhhhcce Confidence 0000 00 00000 0123344444444332 2222 2222379999999999999999988886665 Q ss_pred ecccce--eeEE-ee-ccccccceecccchhhhh-hhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHH Q lcl|Aclame:pro 156 TNVGAL--LVSR-SF-DSANEAQVHKDGQTKTEQ-AATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQA 230 (400) Q Consensus 156 ~n~~~~--a~~i-~l-~na~~a~GHk~ga~Kk~q-~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~ 230 (400) .+.+.. ...+ .. .....+.-+.-|+++.+. ..+|...++.|..++.+..+.+.+.+-.+ -++.+|++++|+++ T Consensus 144 ~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~la~~ 221 (395) T protein:vir:38 144 ENVTTSHGSRVYEKLADITPLKDLDDESALIGDNDDPELTVVKYLIHRYAGITTVTNTLLKDTV--DNIIQWLVNWAAKK 221 (395) T ss_pred eeccCCcceEEEEeeccCCccccccccccccccccccceeeEEeeeeeeEeehhhHHHHHhhhH--HHHHHHHHHHHHHH Confidence 444322 2222 12 223334445566676654 57999999999988888777544443222 46899999999999 Q ss_pred HHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceecccc--cceEEEEecchhHHHHh Q lcl|Aclame:pro 231 IVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTA--GRRYLIVKAEDRKALLD 308 (400) Q Consensus 231 fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~~--~~~~l~~~~~d~~a~~~ 308 (400) +. |..+.+++.|+|+... .+...+.|++..++...-+.+ +..++++|+.++.+|+. T Consensus 222 ~~-~~~~~~il~g~g~~~~---------------------~~~~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~L~~ 279 (395) T protein:vir:38 222 DV-VTRNAKILEVMGKAPK---------------------KPTISQFDNIKDLENNTLDPAIESTSSFITNQSGYNILSK 279 (395) T ss_pred HH-HHHHHHHhhccccccc---------------------ccccccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHH Confidence 99 7999999999997431 222344566666664322222 33468899999999887 Q ss_pred hhhhccccccceeecCCcceeehhhccccceec--chhhhchh-----hhhccc--ceechhh-ceeccc---------- Q lcl|Aclame:pro 309 ELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKALKPT-----VLVDQK--YHIDMQD-LTKVDA---------- 368 (400) Q Consensus 309 ~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l~~t-----~~vd~k--~~~~~~~-~~~~~~---------- 368 (400) ++|.+|.+.+...+.+-.-+| |....+-. ...+.+ +--|++. |.-.+. T Consensus 280 ------------lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~i~~gd~~~~~~i~~~~~~~i~~~~~ 347 (395) T protein:vir:38 280 ------------VKDADGRYLMQPDVTSPDKYLIDGKPVIRIADKWLPDVSGSHPLYFGDLKQGITLFDRQQMQIDTTNV 347 (395) T ss_pred ------------hhccCCceeeccCcCCCCcceeccceeEEecccccCcCCCcceEEEEeccccEEEEEecceEEEEecc Confidence 888899887654433322111 43222111 001111 2224443 221110 Q ss_pred --eeeeecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 369 --FEWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 369 --~~~~~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) ..|..++-.+.++.--.|-+---++-+++++. T Consensus 348 ~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~ 381 (395) T protein:vir:38 348 GAGSFEHDTTKLRFIDRFDVQLIDDGAFAAASFK 381 (395) T ss_pred ccchhhcCceEEEEEEeeccEEecccceEEEEee Confidence 11333333445555555555445555555555 No 41 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=99.54 E-value=2.2e-15 Score=100.98 Aligned_cols=367 Identities=11% Similarity=0.065 Sum_probs=176.7 Q ss_pred Cc-------ccccccchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhh--hh--------hhhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MR-------ISKRNMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNA--IE--------DLPKVQELEKTLSENSIEI 63 (400) Q Consensus 1 ~~-------~s~~~~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~--~~--------~~skieElektis~l~aEi 63 (400) || +++....+..+ .+++.++++.++++..++.+...+.. .+ ...+++++++.+.++...+ T Consensus 1 ~~~~~~~~~~~~~~~~~~el---~~~~~e~~~~l~~~~~e~~~~~e~~~~e~~~~~~~~~e~~~~~~~l~~~~~~l~~~~ 77 (418) T protein:vir:10 1 MSHMNEPRQFGRKSGGDSHP---EQVLETVTKELKRIGDEVKSAGEKALAEAKRAGDLGVETKATVDELLIKQGELQARL 77 (418) T ss_pred CCCchhHHHHHHHhccHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHH Confidence 32 33333333333 33445555555555555555432211 11 1123333333333333333 Q ss_pred HHHHHHHHhhH-----hhhcchhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhC----ccchhhhHhhcchh Q lcl|Aclame:pro 64 IKIENELNAQE-----EKPKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAEN----GVTITDTTFQLPRK 134 (400) Q Consensus 64 ~K~enEl~~~k-----Ek~K~k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ek----gV~~qd~~eiLP~~ 134 (400) ...+....... +.++...+. ..+.... +-++..-......+.+...+.+. +....+...++|.. T Consensus 78 ~~~e~~~~~~~~~~~~~~~~~~~~~---~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lvp~~ 150 (418) T protein:vir:10 78 LEAEQKLARGGGSAELETPKTLGQL---VTESEEM----KGMDGSARKSVRVRVDRKSIMNVPATVGSGVSGSNSLVVAD 150 (418) T ss_pred HHHHHHHhhcccccccchhhhhhHH---hhhHHHH----HHHHHHHhhhhhhhhHHHHHHHhhhhccCCCCCCccccchh Confidence 22222111111 011111111 1111111 11111111111122222222221 22223333489999 Q ss_pred HHHHHHHHHHhhCccccceeeecccceeeEEeeccc--cccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 135 LVESINTALLNTNPVFKVFHVTNVGALLVSRSFDSA--NEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRL 212 (400) Q Consensus 135 iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~na--~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l 212 (400) +...|-..+.++..+++.+.+.+.+.....+-..+. ..+.-+--|+++.+...+|...++.|..++....+.+-+.+ T Consensus 151 ~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~- 229 (418) T protein:vir:10 151 RQAGIIAPPQRKMTIRDLLMPGQTSSSSIEYTVETGFTNNAAAVAEGAQKPTSDLKFNLKNQPVRTIAHLFKASRQILD- 229 (418) T ss_pred HHHHHHHHHhhhhhHHhhcceeeccCCceeEEEEecCCCceeeeccCccccccccceeeEEEeeeeEEEeehhhHHHHH- Confidence 988888999998888886555555443333333222 23333456778899999999999999998887777443333 Q ss_pred cCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceeccccc Q lcl|Aclame:pro 213 QMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAG 292 (400) Q Consensus 213 ~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~~~ 292 (400) ++ +++.+|++++|+..+. ++++.++++|||++.. |. | +..-.-..+.+...+.....+++..++.-+.+... T Consensus 230 ds--~~l~~~i~~~l~~a~~-~~~d~a~l~G~g~~~~---p~-G-i~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~ 301 (418) T protein:vir:10 230 DA--PALQSYIDGRARYGLQ-LTEEGQILKGDGTGAN---IL-G-ILPQASAFMPSITLANATPIDKIRLALLQAVLAEF 301 (418) T ss_pred hH--HHHHHHHHHHHHHHHH-HHHHHHHhccCCCCcc---cc-c-cccccccccccccccccccHHHHHHHHHhhccccC Confidence 22 4799999999999999 7999999999998531 10 1 00000111122233334556888888744444443 Q ss_pred c-eEEEEecchhHHHHhhhhhccccccceeecCCcceeehhhccccceec--chhhhchhhhhccc--ceechhh-ceec Q lcl|Aclame:pro 293 R-RYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKALKPTVLVDQK--YHIDMQD-LTKV 366 (400) Q Consensus 293 ~-~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l~~t~~vd~k--~~~~~~~-~~~~ 366 (400) . ..+++++.+..+|+. ++|.+|.+.++-.++.+ -.| |. .++.+..+... +..|++. |.-. T Consensus 302 ~~~~~v~n~~~~~~L~~------------lkd~~G~~i~~~~~~~~-~~~l~G~-pV~~~~~~p~~~~~~gd~s~~~~~~ 367 (418) T protein:vir:10 302 PATGIVLNPIDWASIEL------------TKDSQGRYIVGNPVNGT-TPRLWNL-PVVETQAMTANEFLVGAFSMAAQIF 367 (418) T ss_pred CCCEEEEcHHHHHHHHH------------hhcCCCceeccccccCC-Cceecce-eeEEcCCCCCCcEEEeeccceEEEE Confidence 3 368889999988877 88999999886322111 111 42 22333333222 2224443 2211 Q ss_pred cce------------eeeecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 367 DAF------------EWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 367 ~~~------------~~~~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) +.- .|..+.-.+.++..-.|.+---.+-+.+++. T Consensus 368 ~~~~~~i~~~~~~~~~f~~~~~~~r~~~~~d~~~~~~~a~~~~~~~ 413 (418) T protein:vir:10 368 DRMEIEVLLSTENVDDFEKNMVSIRAEERLALAVYRPESFVTGALV 413 (418) T ss_pred EecceEEEEecccchhhhcCceEEEEEEeeccEEecccceEEEEec Confidence 100 0112222223333334433222222223333 No 42 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=99.53 E-value=1.4e-15 Score=101.98 Aligned_cols=330 Identities=14% Similarity=0.136 Sum_probs=181.5 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHhhhcchhHHHHHH Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMTNFI 87 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kEk~K~k~emtEfL 87 (400) |.| .+++..++++.+++++.++-. + ++..++++++..+..++.+|...+.......+..+......+.. T Consensus 1 M~k-~l~~l~e~~~~~~~e~~~~~~-------~---~~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 69 (371) T protein:vir:81 1 MPK-ELRELLEQINNKKEEARKLLA-------E---NKIEEAKKLKEEIVALQEKFDVAKELYEEQKQTIEDKEPLKPTV 69 (371) T ss_pred CcH-HHHHHHHHHHHHHHHHHHHhh-------H---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccch Confidence 887 588888888888877665432 1 12234566666666666666544333332222111110000000 Q ss_pred H-HHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccce--eeE Q lcl|Aclame:pro 88 E-SQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGAL--LVS 164 (400) Q Consensus 88 k-TkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~--a~~ 164 (400) . ..+....|++-+.. .+.+++. .|. ..+--.++|..+..-|-..+.++.++++.+.+.+.+.. -.. T Consensus 70 ~~~~~~~~~~~~~l~~-----~~~~a~~-----~~t-~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~ 138 (371) T protein:vir:81 70 QVKENEVEAFVNHIRT-----RFRNAMS-----EGS-NQDGGYTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRV 138 (371) T ss_pred hhHHHHHHHHHHHHHH-----HHHHhhc-----cCC-CccCceeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEE Confidence 0 00111122211211 1112211 111 11222368999999999999999999887666665543 233 Q ss_pred Eeeccc-cccceecccchhhh-hhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceee Q lcl|Aclame:pro 165 RSFDSA-NEAQVHKDGQTKTE-QAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVE 242 (400) Q Consensus 165 i~l~na-~~a~GHk~ga~Kk~-q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~ 242 (400) +..... ..+..+.-|+++.+ ...+|...++.|..++....+.+.+.+ .+.-++.+|++++|+.++. |+.+.+++. T Consensus 139 ~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~l~~a~~-~~~~~~i~~ 215 (371) T protein:vir:81 139 FKKRSQQTGFVEVAEGAAIGEKATPQFTLLQYQVKKYAGFFRVTNELLN--DSTEAIVNTLVRWIGDESR-VTRNGLIIN 215 (371) T ss_pred EEeecCCcceeeeccccccccccccceeeEEeeeeEEEEeehhhHHHHh--hhhHHHHHHHHHHHHHHHH-HHHHHHHHh Confidence 333333 45556677788776 468999999999998888777444433 2224789999999999999 799999999 Q ss_pred ccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceecc--cccceEEEEecchhHHHHhhhhhccccccce Q lcl|Aclame:pro 243 GDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRP--TAGRRYLIVKAEDRKALLDELRQATANANVR 320 (400) Q Consensus 243 gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~--~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~ 320 (400) |||+..... ..+.+++...+.+.-+ ......+++|+.++.+|+. T Consensus 216 g~g~~~~~~----------------------~~~~~~i~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~------------ 261 (371) T protein:vir:81 216 VLNTKAKTA----------------------IADLDGLKQIINVQLDPVFRSTSSVIVNQDAFNWLDT------------ 261 (371) T ss_pred hcccccccc----------------------cccHHHHHHHHHhhcchhhhcCCEEEEcHHHHHHHHH------------ Confidence 999853211 2233444444422211 1233478999999999887 Q ss_pred eecCCcceeehhhccccceec--chhhhchhhhhcc---------------c--ceechhhceecc-ce----------- Q lcl|Aclame:pro 321 IKNDDTEIASEVGVDEIIVYT--GSKALKPTVLVDQ---------------K--YHIDMQDLTKVD-AF----------- 369 (400) Q Consensus 321 lk~~d~~~~~~v~v~~~~~~t--g~k~l~~t~~vd~---------------k--~~~~~~~~~~~~-~~----------- 369 (400) ++|.+|++.+..++..-.-.| |. |-++.|. . +--|++.+.... .. T Consensus 262 lkd~~g~~l~~~~~~~~~~~~l~G~----pV~~~~~~~~~~~~~~~~~~~~~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~ 337 (371) T protein:vir:81 262 LKDQNGQYLLQPSISSPTGRQLLGL----PVVIVSNKVLANRVDGGTGAQFAPIIVGDLKEAVVMFDRQRTEIMSSNVAM 337 (371) T ss_pred hhccCCCeeeecccCCCCCceecce----eEEEecccccCccccccccCCcceEEEEehhceEEEEeecceEEEEecccc Confidence 888899888754432211111 32 2111111 1 111334322211 11 Q ss_pred -eeeecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 370 -EWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 370 -~~~~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) .|..++-.+.++.--.|-+-.-++-.++++. T Consensus 338 ~~f~~~~v~~~~~~r~d~~~~~~~a~~~~~~~ 369 (371) T protein:vir:81 338 DAFETDATLWRAIERMDVKMRDDEAFVFGEVQ 369 (371) T ss_pred chhhcCceEEEEEEeeccEEecccceEEEEEe Confidence 1333344455555556666555666666666 No 43 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=99.49 E-value=1.9e-14 Score=95.84 Aligned_cols=365 Identities=10% Similarity=0.039 Sum_probs=162.7 Q ss_pred Ccccccc-------------cchhhHHHHHHHHHHHHHHHHHhhhhhhcchhh---hhhhhhhHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MRISKRN-------------MNKPDLIEKQNRLAELKENNVSLKSQISGFEVK---NAIEDLPKVQELEKTLSENSIEII 64 (400) Q Consensus 1 ~~~s~~~-------------~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~---~~~~~~skieElektis~l~aEi~ 64 (400) +|+.... .....+++...+.+ ..+.++++..+..+.+ ...+....+++++..+.++...+. T Consensus 120 ~r~e~~a~~~~~~~~~~~~~~~~~~l~e~~~~~~---~~~~e~k~~~e~~~~e~~e~~~~~~~~~e~l~~~~e~~~~~~~ 196 (543) T protein:vir:81 120 MRVEAGSSQGGRGDYDRDAILEPDSIEDCRFRDP---WNLSEMRTFGRDAEEVKGELRARALSAIEKMQGASDNVRAAAT 196 (543) T ss_pred hhhhhhhHHHhhHHHHHhhhccCccHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1111000 00001111111111 1111111111111111 111223445555555555555444 Q ss_pred HHHHHHHhhHhhhcchhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHH---HhCccchhhhHhhcchhHHH-HHH Q lcl|Aclame:pro 65 KIENELNAQEEKPKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKL---AENGVTITDTTFQLPRKLVE-SIN 140 (400) Q Consensus 65 K~enEl~~~kEk~K~k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L---~ekgV~~qd~~eiLP~~iI~-AIe 140 (400) +.+..+.....+........+-...+.+...| +...... .....-...+ ...+++.++.-.++|..+.. -|. T Consensus 197 ~~~~~~d~~e~~~~~~~~~~~~~~~~~a~~~~---~~~~~~~-~l~~~e~~~~~~~~~~~~t~~~gg~lip~~~~~~ii~ 272 (543) T protein:vir:81 197 KIIERFDDEDSTLARQCLATSSPAYLRAWSKM---ARNPHAA-ILTEEEKRAINEVRAMGLTKADGGYLVPFQLDPTVII 272 (543) T ss_pred HHHHHHHHHHHHHhhhhhhhhhhhhhhHHHHH---HHhhHHH-HhhhhhhhhhhhhhhcccccccCcccCchhhhhHHHH Confidence 33322222111100000000111111122111 2211111 1111111112 22345444444478877664 367 Q ss_pred HHHHhhCccccceeeecccceeeEEeecc-ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHH Q lcl|Aclame:pro 141 TALLNTNPVFKVFHVTNVGALLVSRSFDS-ANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSEL 219 (400) Q Consensus 141 ~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygal 219 (400) .++.++.++.....|.... +-..+-..+ ...+.-+.-|.+++....+|...++.|..++.+..+.+-+. +.++ ++ T Consensus 273 ~~~~~~~~l~~~~~~~~~~-g~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell--~d~~-~~ 348 (543) T protein:vir:81 273 TSNGSLNDIRRFARQVVAT-GDVWHGVSSAAVQWSWDAEFEEVSDDSPEFGQPEIPVKKAQGFVPISIEAL--QDEA-NV 348 (543) T ss_pred HHHhhhchhhhhcccccCC-cceEEEEecCCcceeecccCccccccccccceeeeeeeeeEeeehhhHHHH--hccH-HH Confidence 7888767666554444332 222222222 23333345677888999999999999999998877744333 3334 79 Q ss_pred HHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceeccc-ccceEEEE Q lcl|Aclame:pro 220 YNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPT-AGRRYLIV 298 (400) Q Consensus 220 vnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~-~~~~~l~~ 298 (400) .+||++.|+.++. +.++.++++|||++. ...-+...... ......+..+.+...+++...+.-..|. .....+++ T Consensus 349 ~~~i~~~l~~~~~-~~~d~ail~G~Gt~~-~p~Gi~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~ 424 (543) T protein:vir:81 349 TETVALLFAEGKD-ELEAVTLTTGTGQGN-QPTGIVTALAG--TAAEIAPVTAETFALADVYAVYEQLAARHRRQGAWLA 424 (543) T ss_pred HHHHHHHHHHHHH-HHHHHHHhccCCCCc-ccccchhhccc--ccccccccccccccHHHHHHHHHhhhccccCCcEEEE Confidence 9999999999998 799999999999852 11111110000 0011112223344455555555322222 33456899 Q ss_pred ecchhHHHHhhhhhccccccceeecCCcceeehhhc---cccceecchhhhchhhhhcc------------c---ceech Q lcl|Aclame:pro 299 KAEDRKALLDELRQATANANVRIKNDDTEIASEVGV---DEIIVYTGSKALKPTVLVDQ------------K---YHIDM 360 (400) Q Consensus 299 ~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v---~~~~~~tg~k~l~~t~~vd~------------k---~~~~~ 360 (400) ++.++.+|+. ++|.+|.+.+.-.. +.++. |. |-++.|. + +.-|+ T Consensus 425 n~~~~~~l~~------------lkd~~G~~l~~~~~~g~~~~l~--G~----pv~~~~~~~~~~~~~~~~~~~~i~~gd~ 486 (543) T protein:vir:81 425 NNLIYNKIRQ------------FDTQGGAGLWTTIGNGEPSQLL--GR----PVGEAEAMDANWNTSASADNFVLLYGNF 486 (543) T ss_pred cHHHHHHHHH------------hhcCCCceeccCcCCCCCcccc--ce----eeEEeccccccccccccCCcceEEEeec Confidence 9999999887 88999988875321 11221 42 2222221 1 12255 Q ss_pred hhceeccceee--------------eecCceEEEEecccccceeecccee--EeeC Q lcl|Aclame:pro 361 QDLTKVDAFEW--------------KTNSNMILVETLTSGHVETYNAGAV--ITVS 400 (400) Q Consensus 361 ~~~~~~~~~~~--------------~~~~~~ilve~~~~~~~~~~~~~~~--~~~~ 400 (400) ++|.-.+..++ ..++-.|.++.--.|-| +|..|+ ++++ T Consensus 487 ~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~r~d~~v--~~~~A~~~l~~~ 540 (543) T protein:vir:81 487 QNYVIADRIGMTVEFIPHLFGTNRRPNGSRGWFAYYRMGADV--VNPNAFRLLNVE 540 (543) T ss_pred cceeEEeecccEEEEeccccccchhhcCceEEEEEEeeccEe--ecccceEEEEec Confidence 55443322221 11222344444444433 333333 3333 No 44 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=99.45 E-value=8.6e-14 Score=92.21 Aligned_cols=366 Identities=17% Similarity=0.197 Sum_probs=166.9 Q ss_pred ccccccchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhH---hhhcc Q lcl|Aclame:pro 3 ISKRNMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQE---EKPKG 79 (400) Q Consensus 3 ~s~~~~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~k---Ek~K~ 79 (400) |++.+=.+.-+++.+..+..+|++..+. +..++.+... -..+++.++..+.++.......+.++.... ...+. T Consensus 1 l~~~k~l~~~i~e~~~~~~~~k~~~~~~---~~~~e~~~~~-l~~~~e~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~ 76 (407) T protein:vir:48 1 MADVKDVEQVAQELQRKFDDFKEKNDKR---IDAIEQEKGK-LAGEVETLNGKLAELENLKSDLEAELAEVKRPAGGTQN 76 (407) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Confidence 5555544555555555555555544332 2222111100 012233333333333222222221111111 01111 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeeccc Q lcl|Aclame:pro 80 KDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVG 159 (400) Q Consensus 80 k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~ 159 (400) . ...+ .++|...| +.... ..++...+...|..- . ..|--.++|..+..-|-..++++.++++..++.... T Consensus 77 ~-~~~e---~~~a~~~~---l~~g~-~~~~~~~e~~a~~~~-t-~~~gG~~iP~~~~~~I~~~~~~~~~l~~~~~~~~~~ 146 (407) T protein:vir:48 77 K-VASE---HKEAFIGF---MRKGR-EDGLRELERKALQVG-N-DEDGGYAIPEELDRTILTLLKDEVVMRQEATVITLG 146 (407) T ss_pred c-hhhH---HHHHHHHH---Hhccc-hhhhhHHHHHhhhcc-c-CCCCcccccHhHHHHHHHHHHhhhhhhhhceeeecC Confidence 1 1112 23344444 22211 124443343333221 1 112223689999999999999999998866654444 Q ss_pred ceeeEEeecc--ccccceecccchhhhhh-hhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 160 ALLVSRSFDS--ANEAQVHKDGQTKTEQA-ATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIV 236 (400) Q Consensus 160 ~~a~~i~l~n--a~~a~GHk~ga~Kk~q~-~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav 236 (400) .....+-... ..-+| ..-|..+.++. -+|...++.+..++....+.+.+.+ .+..++.+|++++|+.++- +.+ T Consensus 147 ~~~~~~~~~~~~~~a~~-v~E~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~l~~~i~-~~~ 222 (407) T protein:vir:48 147 GSDYKKLVNLGGTTSGW-VGETDARPETATSKLGLIEPFMGEIYGNPQATQKMLD--DAFFNVEDWINSELALEFA-EQE 222 (407) T ss_pred CCceEEEEecCCcceee-ecccccccccccccceeEEeeeeeeEeehhhHHHHHh--cchHHHHHHHHHHHHHHHH-HHH Confidence 4433333322 23333 23344555554 4788888888777666555333322 2335679999999999998 799 Q ss_pred hcceeeccCCCccc---cchhhh--h-hhhhhhhhhhhhccCCCCCHHHHHHhhceeccc-ccceEEEEecchhHHHHhh Q lcl|Aclame:pro 237 DLALVEGDGTNGFK---SIDKEA--D-VKKIKKITTKAKSAGKTPFADAIEEAVDFVRPT-AGRRYLIVKAEDRKALLDE 309 (400) Q Consensus 237 ~rAvv~gDG~~~t~---~~~~e~--D-~~~ik~it~~at~~~~T~~~dal~Eald~~~~~-~~~~~l~~~~~d~~a~~~~ 309 (400) +.++++|||++... ..+... + ...+..+....+..+.+-..|+|...+.-..|. .+...+++++.++..|+- T Consensus 223 ~~a~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a~~v~n~~~~~~L~~- 301 (407) T protein:vir:48 223 EIAFTSGDGSKKPKGFLAYESTDEDDKTRAFGKLQHIASGAASGVTADAIIKLIYTLRKAHRSGAKFMMNNSSLFAIRL- 301 (407) T ss_pred HhhhhccCCCCccceeeecccccccccccccccccccccccccccChHHHHHHHHhhchhhhcCCEEEEcHHHHHHHHH- Confidence 99999999984210 000000 0 000111111223334445567776555322222 123367899999888776 Q ss_pred hhhccccccceeecCCcceeehhhccccceec--chhhh----chhhhhcccce--echh-hceeccceeee------ec Q lcl|Aclame:pro 310 LRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKAL----KPTVLVDQKYH--IDMQ-DLTKVDAFEWK------TN 374 (400) Q Consensus 310 ~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l----~~t~~vd~k~~--~~~~-~~~~~~~~~~~------~~ 374 (400) ++|.+|++.+--++..-..+| |.... .|.+.-+.+.- -|++ .|.-.+..++. +. T Consensus 302 -----------lkD~~Gr~l~~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~i~~d~~~~ 370 (407) T protein:vir:48 302 -----------LKDNDGNYLWRPGIELGQPSSLAGYGIVENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRDPYTN 370 (407) T ss_pred -----------hhccCCceeeccCcCCCCCceecceeeEEecCcCCccCCccEEEEEeccccEEEEEeeceEEEeecccc Confidence 889999987632222211111 43211 11111111111 2554 34433333322 12 Q ss_pred Cce--EEEEecccccceeecccee--EeeC Q lcl|Aclame:pro 375 SNM--ILVETLTSGHVETYNAGAV--ITVS 400 (400) Q Consensus 375 ~~~--ilve~~~~~~~~~~~~~~~--~~~~ 400 (400) .|+ |.++.--.|-| .+..|+ +++. T Consensus 371 ~~~~~~~~~~r~d~~v--~~~~a~~~l~~~ 398 (407) T protein:vir:48 371 KPFVGFYTTKRTGGML--VDSQAIKLMKIG 398 (407) T ss_pred CCcEEEEEEEEeccEE--ecccceEEEEee Confidence 222 22222222222 233333 2222 No 45 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=99.44 E-value=8.7e-14 Score=92.17 Aligned_cols=372 Identities=17% Similarity=0.206 Sum_probs=171.1 Q ss_pred CcccccccchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHhhhcch Q lcl|Aclame:pro 1 MRISKRNMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGK 80 (400) Q Consensus 1 ~~~s~~~~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kEk~K~k 80 (400) |-++-.+. ++-+.|.+....++|+...+. ++.++-+... ...+++.++..+.+++..+...+.......+...+. T Consensus 1 m~~~lk~l-~~~~~el~~~~~~~k~~~~~~---~~~~e~~~~~-l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (401) T protein:vir:44 1 MAVDIKDV-EQVAQELQQKFDDFKAKNDKR---VEAIEQEKGK-LAGQVETLNGKLSELENLKSDLEKELLELKRPARGA 75 (401) T ss_pred CCccHHHH-HHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 43331110 111222222222233332222 2222211111 123445555555555444443322222222111111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccc Q lcl|Aclame:pro 81 DKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGA 160 (400) Q Consensus 81 ~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~ 160 (400) ..-.+ -..+++...| +. +.-..++...+...|....- .+--..+|..+..-|-+.+.++.++++..++.+... T Consensus 76 ~~~~~-~e~~~a~~~~---lr-~~~~~~~~~~e~~a~~~~~~--~~GG~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~ 148 (401) T protein:vir:44 76 QNKVA-AEHKDAFVGF---LR-KGREDGLRDLERKALQVGTD--EDGGYAVPEELDRSILSLLKDEVVMRQEATVITVGG 148 (401) T ss_pred ccchh-HHHHHHHHHH---Hh-hhhhhhhHHHHHHHhhcCCC--CCCceeccHhHHHHHHHHHHhhhhhhhhceeeecCC Confidence 00000 1123444444 22 11112343333333332211 111226899888888899999898888767655544 Q ss_pred eeeEEeecc--ccccceecccchhhhhh-hhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 161 LLVSRSFDS--ANEAQVHKDGQTKTEQA-ATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVD 237 (400) Q Consensus 161 ~a~~i~l~n--a~~a~GHk~ga~Kk~q~-~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~ 237 (400) ....+-... ...+|. --|.++.+.. .+|...++.+..++....+.+-+ ++.+.-++.+|++++|+.++- +.++ T Consensus 149 ~~~~~~~~~~~~~a~wv-~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~el--l~ds~~~l~~~i~~~la~ai~-~~~~ 224 (401) T protein:vir:44 149 SDYKKLVNLGGTASGWV-GETDTRSQTATSRLGLIEPFMGEIYGNPQATQKM--LDDAFFNVEAWINSELATEFA-EQEE 224 (401) T ss_pred CceEEEEecCCccceee-ccccccCccccccceeeeeehhheeeehhhhHHH--HhcchHHHHHHHHHHHHHHHH-HHHH Confidence 333333222 233342 2234444433 47888888888777776663332 223335789999999999998 7999 Q ss_pred cceeeccCCCcccc-----c-hhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceeccc-ccceEEEEecchhHHHHhhh Q lcl|Aclame:pro 238 LALVEGDGTNGFKS-----I-DKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPT-AGRRYLIVKAEDRKALLDEL 310 (400) Q Consensus 238 rAvv~gDG~~~t~~-----~-~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~-~~~~~l~~~~~d~~a~~~~~ 310 (400) .++++|||++...- . ........+..+....+..+.+...|++...+.-..|. .+..++++++.++.+|+. T Consensus 225 ~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~-- 302 (401) T protein:vir:44 225 IAFTTGDGTKKPKGFLAYESTEESDKARAFGKLQHIVSGEATAVTADAIIKLIYTLRKAHRTGAKFMMNNNSLFAIRL-- 302 (401) T ss_pred hhhhccCCCCccceeeccccccccccccccccccccccccccccCHHHHHHHHHhcchhhhcCCEEEEcHHHHHHHHH-- Confidence 99999999853110 0 00011111122222223334445577887766433332 233468899999988887 Q ss_pred hhccccccceeecCCcceeehhhccccceec--chhhhc----hhhhhccc--ceechhh-ceeccceeee------ecC Q lcl|Aclame:pro 311 RQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKALK----PTVLVDQK--YHIDMQD-LTKVDAFEWK------TNS 375 (400) Q Consensus 311 ~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l~----~t~~vd~k--~~~~~~~-~~~~~~~~~~------~~~ 375 (400) ++|.+|++.+--++.+-..+| |..-.+ |.+.-+.+ +-.|++. |.-.+..+++ +.+ T Consensus 303 ----------lkd~~G~~l~~~~~~~g~~~~l~G~PVv~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~~~~~~~~~~ 372 (401) T protein:vir:44 303 ----------LKDTEGNYLWRPGLELGQPSSLAGYGIAENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRDPYTNK 372 (401) T ss_pred ----------hhccCCceeecCCcCCCCCceecceeeEEecCcCCccCCccEEEEeehhccEEEEEecceEEeeeccccC Confidence 888888887632222211111 433211 11100011 1135542 3333322222 223 Q ss_pred ceE--EEEecccccceeeccceeEeeC Q lcl|Aclame:pro 376 NMI--LVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 376 ~~i--lve~~~~~~~~~~~~~~~~~~~ 400 (400) |++ .+...-.|-|-.-+|..++++. T Consensus 373 ~~v~~~a~~r~d~~~~~~~a~~~l~~~ 399 (401) T protein:vir:44 373 PFVGFYTTKRTGGMLVDSQAIKLLKIA 399 (401) T ss_pred CcEEEEEEEEeccEEecccceEEEEee Confidence 332 2333333444444455555555 No 46 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=99.42 E-value=5.2e-14 Score=93.42 Aligned_cols=365 Identities=12% Similarity=0.100 Sum_probs=172.2 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH--hhHh--h------- Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELN--AQEE--K------- 76 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~--~~kE--k------- 76 (400) |+...|.++.+++.+--+.+.+.+..-..++.+ +..+++++++.+.+|+.+|+..+.... +..+ . T Consensus 1 M~i~eL~e~r~~~~~~~~~l~~~~~e~~~lt~e----e~~~~~~l~~ei~~l~~~I~~~e~~~~~~~~~~~~~~~~~~~~ 76 (435) T protein:vir:14 1 MNVNELRRERAAVNQRVQALAQIEVGGTALSVE----QQAEFDQLSSKFSELTAQIERAEAAERMAAAAAVPVDPNPTAV 76 (435) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHhccCCCCHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccchhhhh Confidence 888888887776655444444333333334332 246788999999999988865332111 1100 0 Q ss_pred -------hcchhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHH-------HhCccch-hhh--HhhcchhHHHHH Q lcl|Aclame:pro 77 -------PKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKL-------AENGVTI-TDT--TFQLPRKLVESI 139 (400) Q Consensus 77 -------~K~k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L-------~ekgV~~-qd~--~eiLP~~iI~AI 139 (400) ...+....++-. .+...|.+-+....| +.+.++.... ..+.+.+ ++. -.++|..+...| T Consensus 77 ~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~i 152 (435) T protein:vir:14 77 AAPAAAPVHAQPKALEVKG--AKMARMVRALAAARG--DAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEV 152 (435) T ss_pred hhccccccccccchhhhhH--HHHHHHHHHHHhhcc--hhhHHHHHHHhhhhhhhhhhhcccCCcCCCccccchhHHHHH Confidence 001101111100 111223333333333 2222322222 2222211 111 126898877777 Q ss_pred HHHHHhhCccccc-eeeecccceeeEEeec-c-ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCch Q lcl|Aclame:pro 140 NTALLNTNPVFKV-FHVTNVGALLVSRSFD-S-ANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSY 216 (400) Q Consensus 140 e~A~ed~d~vl~~-fhV~n~~~~a~~i~l~-n-a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~y 216 (400) -+.+.++..++.. .++.....+...+.-. . ...+| .--|..+++...+|...++.|..++....+.+.+-+-.+.. T Consensus 153 i~~l~~~~~i~~~~~~~~~~~~~~~~~p~~~~~~~a~~-v~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~ 231 (435) T protein:vir:14 153 IELLRPKSVVRKLGARTLPLSNGNITIPRLKGGAIVGY-IGADTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVN 231 (435) T ss_pred HHHHhhhchhhhhcceeeecCCCceEEEEEeCCcceee-eccCccccccccceeEEEeeeEEEEEeehhhHHHHHhhccC Confidence 7777776766553 2232222222232221 2 23333 34566788888999999999988888866744433222222 Q ss_pred hHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHh---hceecccccc Q lcl|Aclame:pro 217 SELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEA---VDFVRPTAGR 293 (400) Q Consensus 217 galvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Ea---ld~~~~~~~~ 293 (400) ..+-+|++++|+..+- |.++.+++.|||+... +.-+-.......+++.+...+.+...+++... +.-+.+.-.. T Consensus 232 ~~l~~~i~~~l~~ai~-~~~d~a~l~G~G~~~~--p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~ 308 (435) T protein:vir:14 232 PNVDQIVVGDLTAAIG-AREDKAFIRDDGTANT--PKGLRFWALPSNVITASDASTLQKIETDLGKVILALENADANLTQ 308 (435) T ss_pred HHHHHHHHHHHHHHHH-HHHHHHhhccCCCCcc--ccceeecccccceeccccccchhhHHHHHHHHHHHhhhccccccC Confidence 3578999999999999 7999999999998420 00010000011111111111111111223222 2222222234 Q ss_pred eEEEEecchhHHHHhhhhhccccccceeecCCcceeehhhccccceecchhhhc----hhhhh---c--ccceechhhce Q lcl|Aclame:pro 294 RYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKALK----PTVLV---D--QKYHIDMQDLT 364 (400) Q Consensus 294 ~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~tg~k~l~----~t~~v---d--~k~~~~~~~~~ 364 (400) ..+++++.++.+|+. ++|.+|.+.++-.-+.++. |....+ |+.+- + .=+--|++.|. T Consensus 309 ~~~v~n~~~~~~L~~------------lkd~~G~~l~~~~~~g~l~--G~Pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~ 374 (435) T protein:vir:14 309 PGWIMAPRTFRFLEG------------LRDGNGNKVYPELANGMLK--GYPVGKTTQVPINLGETGKESEIYFTDFGDVF 374 (435) T ss_pred CEEEEcHHHHHHHHH------------hhccCCceeccCCCCCeee--cceeEeeccccccccCCCccceEEEeecccEE Confidence 467889999999887 8899999988643333322 321111 11100 0 01223444433 Q ss_pred eccc---------------------eeeeecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 365 KVDA---------------------FEWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 365 ~~~~---------------------~~~~~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) -.+. ..|..++-.|.++.--.+ -..+..|+..++ T Consensus 375 i~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~--~~~~~~a~~~l~ 429 (435) T protein:vir:14 375 IGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDF--GPRHVESIAVLA 429 (435) T ss_pred EEEecccEEEEeccccccccccchhhhhhcChhheeeeeeeCc--eeecccceEEEe Confidence 1110 002222222233322222 223344444444 No 47 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=99.41 E-value=9.7e-14 Score=91.93 Aligned_cols=363 Identities=14% Similarity=0.170 Sum_probs=174.9 Q ss_pred Cccc--ccccchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHH------HHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 1 MRIS--KRNMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQ------ELEKTLSENSIEIIKIENELNA 72 (400) Q Consensus 1 ~~~s--~~~~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skie------Elektis~l~aEi~K~enEl~~ 72 (400) -+|+ ..-++|.-++.+.+..+++++.+.+++..++.+..+.+.+ +..++ |....++++.++++..+.++.. T Consensus 12 ~~~~~~~~~~~~~l~e~ra~~~~e~~~l~~~~~~~~~~~k~~~~~~-~~~~~~~~~~~e~~~~~~~~~~ei~~~~~~~~~ 90 (425) T protein:vir:10 12 AALTGPVGAVPRGIISVRAEGPTEVKALIENLQKAFHDFKAEHTKQ-LDAVKAGLPTSDALAKVDKVSADLEALQAAVDE 90 (425) T ss_pred HHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHH Confidence 1222 2224455555555555677777777666666554322110 01110 1111122333333322222211 Q ss_pred hHh---hhcchhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHH--------hCccchhhhHhhcchhHHHHHHH Q lcl|Aclame:pro 73 QEE---KPKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLA--------ENGVTITDTTFQLPRKLVESINT 141 (400) Q Consensus 73 ~kE---k~K~k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~--------ekgV~~qd~~eiLP~~iI~AIe~ 141 (400) ... +.+..-.-.+.+++ .+.+++|...|. ..|. ..+--.++|..+..-|-. T Consensus 91 ~~~~~~~~~~~~~~~~~~~~-----------------~~~~~af~~~l~~~e~~~al~~~t-~~~gG~lvP~~~~~~ii~ 152 (425) T protein:vir:10 91 ANIKIAAAQMGANGVKPLRD-----------------PEYTEAFKAHVKRGDVQAALNKGE-DSEGGYLTPIEWDRTITN 152 (425) T ss_pred HHHHHHhhhccccccccccc-----------------HHHHHHHHHHhhhhhhHHHhhcCc-CCCCceeccHhHHHHHHH Confidence 110 00000000011111 133333433322 1222 122223789999888999 Q ss_pred HHHhhCccccceeeecccceeeEEeecc--ccccceecccchhhhhh-hhhhhhhccHHHHHHHHHHHHHHHhhcCchhH Q lcl|Aclame:pro 142 ALLNTNPVFKVFHVTNVGALLVSRSFDS--ANEAQVHKDGQTKTEQA-ATLTIDTLEPVMVYKLQSLAERVKRLQMSYSE 218 (400) Q Consensus 142 A~ed~d~vl~~fhV~n~~~~a~~i~l~n--a~~a~GHk~ga~Kk~q~-~~le~~ti~p~~VYkkq~Lad~~k~l~g~yga 218 (400) .++++.++++...+.+.+.....+-..+ .+..|. .-|.++.+.. .+|...++.|..++-+..+.+.+.+ .+.-+ T Consensus 153 ~~~~~s~l~~l~~~~~~~~~~~~~~~~~~~~~a~wv-~E~~~~~~~~~~~f~~v~~~~~k~~~~i~iS~ell~--ds~~~ 229 (425) T protein:vir:10 153 KLVLISPMRQLCRVQPVSKAGFSKLFNMGGTTSGWV-GEASQRPQTNAATFQPLSFASGEIYANPAATQQILD--DAEID 229 (425) T ss_pred HHHhhhhhhhhceeeeccCCceEEEEEcCCcceeee-ccccccccccccccceeeeeheeeEeehHhHHHHHh--cchhH Confidence 9999889888767666555544443333 233443 2334555554 4799999999888777666443333 22246 Q ss_pred HHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhh--------hhhhhhhhhccCCCCCHHHHHHhh-ceecc Q lcl|Aclame:pro 219 LYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKK--------IKKITTKAKSAGKTPFADAIEEAV-DFVRP 289 (400) Q Consensus 219 lvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~--------ik~it~~at~~~~T~~~dal~Eal-d~~~~ 289 (400) +.+||+++|+.++- +.++.++++|||++... -+..++.. ...+....+..+.+...|+|...+ +.... T Consensus 230 l~~~i~~~la~ai~-~~~d~~~l~G~G~~~p~--Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~ 306 (425) T protein:vir:10 230 LESWLATEVQTEFA-KQEGKAFLAGDGTNKPN--GLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPSA 306 (425) T ss_pred HHHHHHHHHHHHHH-HHHHhhhhcccCCCCcc--eeeeccccccccccccccccccccccccccccHHHHHHHHhhhhhh Confidence 79999999999999 79999999999974210 01111000 000000112223445566776554 22222 Q ss_pred cccceEEEEecchhHHHHhhhhhccccccceeecCCcceeehhhccccceec--chhhhc----hhhhhccc--ceechh Q lcl|Aclame:pro 290 TAGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKALK----PTVLVDQK--YHIDMQ 361 (400) Q Consensus 290 ~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l~----~t~~vd~k--~~~~~~ 361 (400) -.+.-++++++.++.+|+- ++|.+|++.+.-++..-.-+| |..-.+ |.+.-+.+ +-.|++ T Consensus 307 ~~~~a~~vmn~~~~~~L~~------------lkD~~G~~l~~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~ 374 (425) T protein:vir:10 307 FTGNARFAMNRNTQRQVRK------------LKDGQGNYLWQPSYVAGQPATLAGYPVTEVPDMPDVAANSTPILFGDFQ 374 (425) T ss_pred hccCCEEEEchHHHHHHHH------------hhcCCCceeeccCccCCCCceecceeeEEecCcCCccCCccEEEEEehh Confidence 2234478999999988887 889999987633322211111 421111 22221122 123555 Q ss_pred h-ceeccceeee------ecCceE--EEEecccccceeeccceeEeeC Q lcl|Aclame:pro 362 D-LTKVDAFEWK------TNSNMI--LVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 362 ~-~~~~~~~~~~------~~~~~i--lve~~~~~~~~~~~~~~~~~~~ 400 (400) . |...+.-++. ++.|++ .....-.|.|---+|-.++++. T Consensus 375 ~~~~i~~~~~~~v~~d~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~ 422 (425) T protein:vir:10 375 QTYLIIDRIGVRVLRDPYTAKPYVLFYTTKRVGGGLLNPEPMRAMKVA 422 (425) T ss_pred ccEEEEEecceEEEecccccCCcEEEEEEEEeccEeecccceEEEEee Confidence 4 3322222211 233443 3333444555444455555555 No 48 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=99.41 E-value=7.5e-14 Score=92.54 Aligned_cols=367 Identities=13% Similarity=0.116 Sum_probs=166.1 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhh---hcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHh--h-----h Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQI---SGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE--K-----P 77 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i---~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kE--k-----~ 77 (400) |+| +.+-.++.+++.+.+..+.+.. ..++. ++..+++++++.+++|+.+|+..+.+-+..++ + . T Consensus 1 M~k--l~~L~e~r~~l~~~~~~l~~~~~e~~~lt~----ee~~~~~~l~~e~~~l~~~i~~~e~~e~~~~~~~~~~~~~~ 74 (428) T protein:vir:10 1 MPQ--IEELRRQRAGINEQIQALATIEATNGTLTA----EQLTEFAGLQQQFTDISAKMDRMEATERAAALVAKPVKATQ 74 (428) T ss_pred Cch--HHHHHHHHHHHHHHHHHHHHHHhccCCCCH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhchh Confidence 776 2222223333333333222211 11221 23467788999999999998753332221111 0 0 Q ss_pred --cchhHHHHHHHHHH-HHHHHHHHHHHccCC-hhHHHHHHHHH----HhCccchhhh--HhhcchhHHHHHHHHHHhhC Q lcl|Aclame:pro 78 --KGKDKMTNFIESQN-AVTEFFDVLKKNSGK-SEIKNAWSAKL----AENGVTITDT--TFQLPRKLVESINTALLNTN 147 (400) Q Consensus 78 --K~k~emtEfLkTkq-A~~dya~ll~~nqg~-ke~k~AW~a~L----~ekgV~~qd~--~eiLP~~iI~AIe~A~ed~d 147 (400) +..+...+....+. ....++.-+....+. +.....+.... ....+.+... -.++|..+..-|=.-++++. T Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~liP~~~~~~ii~~l~~~~ 154 (428) T protein:vir:10 75 HGPAVIVKAEPKQYTGAGMTRMVMSIAAAQGNLQDAAKFASDELNDQSVSMAISTAAGSGGVLIPQNIHSEVIELLRDRT 154 (428) T ss_pred hccccccccccchhhhHHHHHHHHHHHHhhhhHHHHHHHhhhhhhhhhHhhhhcccccCCccccchhHHHHHHHHHhhhc Confidence 00000001111111 111122222222221 12222221111 1111111111 12578777666656667666 Q ss_pred ccccc-eeeecccceeeEEeecc--ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHH Q lcl|Aclame:pro 148 PVFKV-FHVTNVGALLVSRSFDS--ANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIV 224 (400) Q Consensus 148 ~vl~~-fhV~n~~~~a~~i~l~n--a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm 224 (400) .+++. .++-.-..+-..+.-.+ ....| ..-|.++++...+|...++.|..++....+.+.+-+ .+.-++.+|++ T Consensus 155 ~l~~~~~~~~~~~~g~~~~p~~~~~~~a~~-v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~is~ell~--ds~~~l~~~i~ 231 (428) T protein:vir:10 155 IVRKLGARSIPLPNGNMSLPRLAGGATASY-TGENQDAKVSEARFDDVKLTAKTMIAMVPISNALIG--RAGFNVEQLVL 231 (428) T ss_pred hhhhhcceeeecCCcceEEEEEeCCcceee-eccCccccccccceeeEEeeeEEEEEeehhhHHHHh--hhhHHHHHHHH Confidence 66553 22211111212222111 23333 345788899999999999999888887777544432 22246799999 Q ss_pred HHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCC-CCHHHHHHhh-cee---cccccceEEEEe Q lcl|Aclame:pro 225 AELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKT-PFADAIEEAV-DFV---RPTAGRRYLIVK 299 (400) Q Consensus 225 ~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T-~~~dal~Eal-d~~---~~~~~~~~l~~~ 299 (400) ++|+.++- +.++.+++.|||++. ..--+.........+++++...+.+ .+.+....++ .+. .+-.+....+++ T Consensus 232 ~~l~~ai~-~~~d~~~l~G~G~~~-~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n 309 (428) T protein:vir:10 232 QDILTAIS-VREDKAFMRDDGTGD-TPIGMKARATQWNRLLPWAADAAVNLDTIDTYLDSIILMSMDGNSNMISSGWGMS 309 (428) T ss_pred HHHHHHHH-HHHHHHHhccCCCCc-cccccccccccccccccccccccccHHHHHHHHHHHHHhhhccccccccCEEEEc Confidence 99999999 799999999999742 1111111211112222222222222 2333444444 121 222223457889 Q ss_pred cchhHHHHhhhhhccccccceeecCCcceeehhhccccceecchhhhchhhhhcc----------cceechhhceeccce Q lcl|Aclame:pro 300 AEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKALKPTVLVDQ----------KYHIDMQDLTKVDAF 369 (400) Q Consensus 300 ~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~tg~k~l~~t~~vd~----------k~~~~~~~~~~~~~~ 369 (400) +.++.+|+- ++|.+|.+.++-.-+.++. |..-.+ +..++. =+-.|++.|.-.+.. T Consensus 310 ~~~~~~L~~------------lkd~~G~~i~~~~~~g~l~--G~pv~~-~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~ 374 (428) T protein:vir:10 310 NRTYMKLFG------------LRDGNGNKVYPEMAQGMLK--GYPIQR-TSAIPANLGEGGKESEIYFADFNDVVIGEDG 374 (428) T ss_pred HHHHHHHHH------------hhccCCceeccCCCCCeee--ceeeEE-eccccccccCCCccceEEEEecceEEEEEec Confidence 999988877 8899999887532222211 333211 111111 022233333311100 Q ss_pred ee-------------------eecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 370 EW-------------------KTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 370 ~~-------------------~~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) +. -|..||+.+=....--+-..+..|+..++ T Consensus 375 ~i~i~~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t 424 (428) T protein:vir:10 375 NMKVDFSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGT 424 (428) T ss_pred ceEEEeecccccccccccccchhhcchhheeeeeeeCceeeccceEEEEe Confidence 00 12223332222222223345566666666 No 49 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=99.40 E-value=1e-13 Score=91.85 Aligned_cols=362 Identities=14% Similarity=0.093 Sum_probs=177.7 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhH------hhhcchh Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQE------EKPKGKD 81 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~k------Ek~K~k~ 81 (400) |++..+.+..++.+.+...+..+...+++-.. .-++..+.++++..+.+++.+|++.+....... ..++... T Consensus 1 m~~~~l~~l~e~r~~~~~e~~~l~~~~~~~~~--~~e~~~~~~~l~~e~~~l~~~i~~~~e~~~~~~~~~~~~~~~~~~~ 78 (392) T protein:vir:13 1 MDATTLSANFEARERATAELRSLTDEFAGKEM--TAEAREKEERLLTAVADFDGRIKRGIDAIKATDAVTSLLSGLQGSG 78 (392) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhhcccc--cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcc Confidence 77777777666666666666555554433221 112344566677777777776653222111111 1111100 Q ss_pred HHHHHHHHHHHHHHHHHHHHHccCC-hhHHHHHHHHHHhCccchhhhHhhcchhHHHH-HHHHHHhhCccccce-eeecc Q lcl|Aclame:pro 82 KMTNFIESQNAVTEFFDVLKKNSGK-SEIKNAWSAKLAENGVTITDTTFQLPRKLVES-INTALLNTNPVFKVF-HVTNV 158 (400) Q Consensus 82 emtEfLkTkqA~~dya~ll~~nqg~-ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~A-Ie~A~ed~d~vl~~f-hV~n~ 158 (400) .- ..+.+..++...+. .|. .+.+. -.-....++.+......++|+.+... |...+. ...++..+ ++... T Consensus 79 ~~----~~~~~~~~~~~~~r--~g~~~~~~~-~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~-~~~~l~~~~~~~~~ 150 (392) T protein:vir:13 79 SG----AQRSADHDDDAVLR--AGNLGEARS-FEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVE-RSAIMRGGASTFTT 150 (392) T ss_pred cc----hhhhhhHHHHHHHh--ccchhhhHH-HHhhhhhhcccccCCCccccccchHHHHHHHHh-hhhhhhhcceeeec Confidence 00 00111111211111 111 11111 11122223222222233678776555 444555 45555433 33222 Q ss_pred ---cceeeEEeeccccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 159 ---GALLVSRSFDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKI 235 (400) Q Consensus 159 ---~~~a~~i~l~na~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Ra 235 (400) ..+-..+.-.....+| ..-|.++.+...+|...++.|..++-...+.+-.-+ .+.-++.+||.++|+..+- +. T Consensus 151 ~~~~~~~~~~~~~~~~a~~-v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~l~~~i~-~~ 226 (392) T protein:vir:13 151 SDANPMDFTVITGRATAGI-VGETAEIPESYPATTQRSMGGFKYGFASVVSYEFAT--DQVLDLVGFLVSDAGPAIG-DA 226 (392) T ss_pred CCCceeEEEEEcCCcceee-ecccccccccccceeeEEeeeeeEEeeehhHHHHHh--cchHHHHHHHHHHHHHHHH-HH Confidence 1222222222234455 467778889999999999999887777555333222 2223679999999999998 79 Q ss_pred HhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceeccc-ccceEEEEecchhHHHHhhhhhcc Q lcl|Aclame:pro 236 VDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPT-AGRRYLIVKAEDRKALLDELRQAT 314 (400) Q Consensus 236 v~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~-~~~~~l~~~~~d~~a~~~~~~~~~ 314 (400) .+.++++|||++.. .-+......... ...+..+.+...|+|...+.-..|. .+.-.+++++.++.+|+- T Consensus 227 ~d~~~l~G~Gt~~p--~Gil~~~~~~~~--~~~~~~~~~~~~d~l~~~~~~l~~~~~~~a~~v~n~~~~~~l~~------ 296 (392) T protein:vir:13 227 MGRHFLTGTGTGQP--RGILTDATGANA--AFGEADADSKVSDALIDLFHEVPSAYRKNAKFVVNDLRAAQMRK------ 296 (392) T ss_pred HHHHHhcccCCccc--cccccccccccc--cccccccccccHHHHHHHHHhhhhhhhcCCEEEEcHHHHHHHHH------ Confidence 99999999998521 111111110000 0111223445567776655222222 233467889999998887 Q ss_pred ccccceeecCCcceeehhhccccceec--chhhhchhhhhccc--ceechhhceeccceee--------eecCceE--EE Q lcl|Aclame:pro 315 ANANVRIKNDDTEIASEVGVDEIIVYT--GSKALKPTVLVDQK--YHIDMQDLTKVDAFEW--------KTNSNMI--LV 380 (400) Q Consensus 315 ~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l~~t~~vd~k--~~~~~~~~~~~~~~~~--------~~~~~~i--lv 380 (400) |+|.+|.+.+.-++..-+-+| |.-. +.+..+..+ +--|++.|+-.+..++ .|..|++ .+ T Consensus 297 ------lkd~~G~~l~~~~~~~g~~~~l~G~Pv-~~~~~~~~~~i~~Gdf~~~~i~~~~~~~i~~~~~~~~~~~~~~~r~ 369 (392) T protein:vir:13 297 ------LKDANGQYLWQSALTVGAPDTFNGKVV-ETDDGMPADKVLFADLSKYRVRFAGSLRVDRSVDAKFSTDQIVYRF 369 (392) T ss_pred ------hhccCCceeecCCcCCCCCceecceee-EEcCCCCCCcEEEeeccceeEEeecceEEEeeccccccCCcEEEEE Confidence 889999987644333322222 4221 111122111 1235555443332222 2333333 34 Q ss_pred EecccccceeeccceeEeeC Q lcl|Aclame:pro 381 ETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 381 e~~~~~~~~~~~~~~~~~~~ 400 (400) +..-.|.+---+|-.+++++ T Consensus 370 ~~r~d~~~~~~~A~~~~~~~ 389 (392) T protein:vir:13 370 LQRADGLLVDARGAKVLTVT 389 (392) T ss_pred EEEeccEEecccceEEEEee Confidence 44455555555666666666 No 50 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=99.40 E-value=3.2e-13 Score=89.11 Aligned_cols=359 Identities=15% Similarity=0.177 Sum_probs=174.0 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhcch--hhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHh--------hh Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFE--VKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE--------KP 77 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~--~~~~~~~~skieElektis~l~aEi~K~enEl~~~kE--------k~ 77 (400) |+-..+.++. .++.++..+.+++++..-+ .+...+-.++++++.+.++.+..+|.+.+.......+ .+ T Consensus 1 M~l~el~~~~--~~~~~~~~a~l~~~~~~~~~~~ee~~~~~~e~~~l~~~~~~l~~~i~~le~~~~~~~~~~~~~~~~~~ 78 (434) T protein:vir:62 1 MNLKEILNAS--LTRTKSRLAELQGKVEKNEVRSEELAAVKAEVEQLTKEIQTISEELAKLEEKEKEEDPAKKKDDDPEK 78 (434) T ss_pred CCHHHHHHHH--HHHHHHHHHHHHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcchhhh Confidence 6543333322 2234445555554444333 2222222467777777777777776443222111110 00 Q ss_pred cchhHHHHH-----HHHHHHHHHHHHHHH---HccC-----ChhHHHHHHHHHHhC---------ccchhhhHhhcchhH Q lcl|Aclame:pro 78 KGKDKMTNF-----IESQNAVTEFFDVLK---KNSG-----KSEIKNAWSAKLAEN---------GVTITDTTFQLPRKL 135 (400) Q Consensus 78 K~k~emtEf-----LkTkqA~~dya~ll~---~nqg-----~ke~k~AW~a~L~ek---------gV~~qd~~eiLP~~i 135 (400) ...+...+. ..+.+.-..+.+.+. ...+ ..+.+++|...|... ++.+.|--.++|..+ T Consensus 79 ~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~r~a~~~~l~~~~~~~e~~a~~~~t~~GG~lvP~~~ 158 (434) T protein:vir:62 79 KEDPTAKENPNEKTELSEEQRSAISASIAAALSTKGHRTNKETEIRSVFANYIVGNIDEKEARALGLVTGNGSVTIPDFL 158 (434) T ss_pred hcchhhhcchhhhHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHHHHHHhccccchhhhhhhcccccccceecchhh Confidence 000000000 000011111112121 1111 136677776655421 222222223689888 Q ss_pred HHHHHHHHHhhCccccceeeecccce-eeEEeecc--ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 136 VESINTALLNTNPVFKVFHVTNVGAL-LVSRSFDS--ANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRL 212 (400) Q Consensus 136 I~AIe~A~ed~d~vl~~fhV~n~~~~-a~~i~l~n--a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l 212 (400) ..-|-..++++.++.+..++.+.... ..-+.... +...++-..|..++....+|...++.+..++.+..+-+.+.+- T Consensus 159 ~~~Ii~~l~~~~~i~~~~~~~~~~~~~~~p~~~~~~~a~~~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~d 238 (434) T protein:vir:62 159 SKEIITYAQEENFLRRLGTGVKTKENIKYPVLVKKAEAQGHKNERTNNEMPETDIEFDEIELSPTEFDALATVTKKLLAR 238 (434) T ss_pred HHHHHHhhhhhhhhhhhcceeccCCceEEEEEecCCcccceecccccccccccccceeeEEeeheeeEeehhhHHHHHhc Confidence 88888999998888775554333221 11111111 2222222335677778889999999998888876663333221 Q ss_pred cCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhh-ceecccc Q lcl|Aclame:pro 213 QMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTA 291 (400) Q Consensus 213 ~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eal-d~~~~~~ 291 (400) +.-++.+|+.++|+..+. ++.+.++++|||++... ... ..-..++ ...+.+...|+|.... +....-. T Consensus 239 --s~~~l~~~i~~~la~~~~-~~~d~~~l~G~G~~~~~--~g~---~~~~~~~---~~~~~~~~~d~l~~l~~~l~~~~~ 307 (434) T protein:vir:62 239 --TGLPIEQIVMDELKKAYV-RKETQYMVNGDEANNIN--DGA---LAKKAVE---FKTDEKNLYDALVKMKNTPVKEVR 307 (434) T ss_pred --chHHHHHHHHHHHHHHHH-HHHHHHHhccCCCCccc--cce---eeccccc---ccccccchhhHHHHHHhhcchhhh Confidence 223579999999999999 79999999999985411 011 0001111 1223344567776555 3322222 Q ss_pred cceEEEEecchhHHHHhhhhhccccccceeecCCcceeehh------hccccceecchhhhchhhhhc------cc--ce Q lcl|Aclame:pro 292 GRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEV------GVDEIIVYTGSKALKPTVLVD------QK--YH 357 (400) Q Consensus 292 ~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v------~v~~~~~~tg~k~l~~t~~vd------~k--~~ 357 (400) +....++++.++.+|+- ++|.+|+|.+.- |...++. |..-.+ +..++ .. +. T Consensus 308 ~~a~~v~n~~~~~~L~~------------lkd~~G~~l~~~~~~~~~g~~~tl~--G~pV~~-~~~~~~~~~~~~~~i~~ 372 (434) T protein:vir:62 308 KKARWVLNTAALTKIET------------MKTDDGFPLLRPFNQAEGGIGYTLL--GFPVEE-EDAIDIPDSPDTPVFYF 372 (434) T ss_pred cCCEEEEcHHHHHHHHH------------hhccCCCEeeccCCCccCCCCceec--ceeeEE-ecCccCccCCCceEEEE Confidence 34467899999999887 899999998632 2222222 432211 11110 00 11 Q ss_pred echhhceeccceeeeecCceEEEEeccc---------ccceeeccceeEe-------eC Q lcl|Aclame:pro 358 IDMQDLTKVDAFEWKTNSNMILVETLTS---------GHVETYNAGAVIT-------VS 400 (400) Q Consensus 358 ~~~~~~~~~~~~~~~~~~~~ilve~~~~---------~~~~~~~~~~~~~-------~~ 400 (400) -|++.|.-.+- +|++.++.++. ..++.+..|-+|. ++ T Consensus 373 Gdfs~~~i~~~------~g~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~ 425 (434) T protein:vir:62 373 GDFSKFYIQDV------IGSLEVQKLVELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYK 425 (434) T ss_pred eeccceEEEEe------eceeEEEeehhhhcccCceEEEEEeeecceeecCcccceEEE Confidence 25555543322 23333333221 2223333444332 11 No 51 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=99.40 E-value=6.5e-14 Score=92.89 Aligned_cols=343 Identities=13% Similarity=0.071 Sum_probs=163.0 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHhhhcchhHHHHHH Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMTNFI 87 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kEk~K~k~emtEfL 87 (400) |+ ++.++.++..++++++... + +...+.++....+.++ ....+++.....+ ... T Consensus 1 ik--~L~e~~~e~~e~~~~~~~~------~------~~~~~~~e~~~~~~~~---~~~~~~~~~~~~~---------~~~ 54 (390) T protein:vir:40 1 MN--NLDKKDSETLNISTAFLNA------I------KEGATEAEQVTAFTNM---AEQIQNNIIAQAR---------KEV 54 (390) T ss_pred Cc--hHHHHHHHHHHHHHHHHHH------H------hhhhhHHHHHHHHHHH---HHHHHHHHHHHHH---------HHH Confidence 11 2233333333222221111 1 0000111111111111 1111111111000 000 Q ss_pred HHHHHHHHHHHHHH--HccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEE Q lcl|Aclame:pro 88 ESQNAVTEFFDVLK--KNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSR 165 (400) Q Consensus 88 kTkqA~~dya~ll~--~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i 165 (400) + ....+.+-... ......+.++.|.+.+...+. +|-..++|..+...|-+.++++.++++.+++.+.......+ T Consensus 55 ~--~~~~~~~~~~~~~~~~l~~~~r~~~~~~~~~~~~--~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~i 130 (390) T protein:vir:40 55 N--REMNDNNVLASRGANALTSDESKYYNEVIAGNGF--AGVTALLPPTVFERVFEDLTVEHPLLSKINFVNTTATTEWI 130 (390) T ss_pred H--HHHHHHHHHHhcCchhccHHHHHHHHHHHhccCc--ccCcccccHHHHHHHHHHHHhhhhhhhhceeeecCCceeEE Confidence 0 01111111111 111234778888777766554 55566899999999999999999999987877776665544 Q ss_pred eecc--ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeec Q lcl|Aclame:pro 166 SFDS--ANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG 243 (400) Q Consensus 166 ~l~n--a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~g 243 (400) --.+ ....|+.-.+..++....+|...++.|..+|.+..+.+.+-+- +.-++.+|++++|+.++- +.++.+++.| T Consensus 131 ~~~~~~~~a~~~~E~~~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~d--s~~~l~~~i~~~la~~i~-~~~~~a~l~G 207 (390) T protein:vir:40 131 ISVGDVATAWWGPLCAEIKEVLDNGFDKIQTGMYKLSAYIPVCNAMLDL--GPSWLDQYVRTILGEAMA-LGLEAGIVNG 207 (390) T ss_pred EEEcCCcceeeeccccccCccccccceeeEeeeeeEEEeehhhHHHHhc--chHHHHHHHHHHHHHHHH-HHHHhhhhcc Confidence 4433 3466776656667777889999999999888887774333332 224579999999999999 6999999999 Q ss_pred cCCCccccchhhhhh---hhhhhhhhhhhccCCCC---CHHHHHHhh-ceecccccceEEEEecchhHHHHhhhhhcccc Q lcl|Aclame:pro 244 DGTNGFKSIDKEADV---KKIKKITTKAKSAGKTP---FADAIEEAV-DFVRPTAGRRYLIVKAEDRKALLDELRQATAN 316 (400) Q Consensus 244 DG~~~t~~~~~e~D~---~~ik~it~~at~~~~T~---~~dal~Eal-d~~~~~~~~~~l~~~~~d~~a~~~~~~~~~~~ 316 (400) ||++.. .-+..+. ..-.....++..++.-. ..+.+.-++ +-..+-.+..++++++.+....+..+| T Consensus 208 ~G~~~P--~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~~~~----- 280 (390) T protein:vir:40 208 SGKDQP--IGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIYAAT----- 280 (390) T ss_pred cCCCcc--ceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHHHHh----- Confidence 997421 0011110 00000011111111110 011222232 223334566778999888655555443 Q ss_pred ccceeecCCcceeehhhccccceecchhhhchhhhhcccc--eechhhceeccceee--------eecCceE--EEEecc Q lcl|Aclame:pro 317 ANVRIKNDDTEIASEVGVDEIIVYTGSKALKPTVLVDQKY--HIDMQDLTKVDAFEW--------KTNSNMI--LVETLT 384 (400) Q Consensus 317 an~~lk~~d~~~~~~v~v~~~~~~tg~k~l~~t~~vd~k~--~~~~~~~~~~~~~~~--------~~~~~~i--lve~~~ 384 (400) .++|.+|++..+..+-...| +++..++... .-|++.|.-.+.-++ .|..|++ .+..-- T Consensus 281 ---~~~d~~G~~v~~~~~~g~pv-------v~~~~~p~~~i~~Gd~s~~~i~~~~~~~v~~~~~~~f~~~~~~~r~~~r~ 350 (390) T protein:vir:40 281 ---SYMTPQGVWVTGILPVPLEI-------VQSVAVPVGKAVAGRAKDYFMGIGSEQVIRTSTEYRLLDDETLYYAKQYA 350 (390) T ss_pred ---hccCCCCccccccCCCceeE-------EEcCCCCCCcEEEEeeceEEEEeecceEEEecchhhhhcCcEEEEEEEEe Confidence 38899999987654333333 2222222221 224444433221111 1111211 111111 Q ss_pred cccceeeccceeEeeC Q lcl|Aclame:pro 385 SGHVETYNAGAVITVS 400 (400) Q Consensus 385 ~~~~~~~~~~~~~~~~ 400 (400) .|.|---+|=.++.++ T Consensus 351 dg~v~~~~A~~~l~~~ 366 (390) T protein:vir:40 351 NGRPKDNSSFLVFDIT 366 (390) T ss_pred CCEEecccceEEEEee Confidence 2222211222222222 No 52 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=99.40 E-value=2.2e-13 Score=90.01 Aligned_cols=375 Identities=13% Similarity=0.118 Sum_probs=149.3 Q ss_pred CcccccccchhhHHHHHHHHHHHHHHHHHhhhhhhcchh--hhhhhhhhHHHHH-HHHHHHHHHHHHHHHHHHHhhH--- Q lcl|Aclame:pro 1 MRISKRNMNKPDLIEKQNRLAELKENNVSLKSQISGFEV--KNAIEDLPKVQEL-EKTLSENSIEIIKIENELNAQE--- 74 (400) Q Consensus 1 ~~~s~~~~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~--~~~~~~~skieEl-ektis~l~aEi~K~enEl~~~k--- 74 (400) |-|-..+.+. |....++|+-++.+...+...+.-.. +...+.+.+.+++ .+.+.++..+++..+.|..++. T Consensus 1 ~~~~~~~~~~---e~~~~e~a~~~~~~~~~~k~~e~~~~~ke~~~~~l~~~~e~~~k~~~E~~~~le~~~ee~k~l~ee~ 77 (458) T protein:vir:10 1 MTIDINKLKE---ELGLGDLAKSLEGLTAAQKAQEAERMRKEQEEKELARMNDLVSKAVGEDRKRLEEALELVKSLDEKS 77 (458) T ss_pred Cccchhhhhh---hhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3322211111 10011222222211111100000000 0000001111111 1112222222211111111110 Q ss_pred -----------hhh-cc----hhHHHHHHHHHHHHH-HHHHHHHHccC------ChhHHHHHHHHHHhC----------- Q lcl|Aclame:pro 75 -----------EKP-KG----KDKMTNFIESQNAVT-EFFDVLKKNSG------KSEIKNAWSAKLAEN----------- 120 (400) Q Consensus 75 -----------Ek~-K~----k~emtEfLkTkqA~~-dya~ll~~nqg------~ke~k~AW~a~L~ek----------- 120 (400) |+. +. ..+..+.+.+..... .+........+ ..-..++....+..+ T Consensus 78 ~~~~~~~a~~~e~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~ 157 (458) T protein:vir:10 78 KKSNELFAQTVEKQQETIVGLQDEIKSLLTAREGRSFVGDSVAKALYGTQENFEDEVEKLVLLSYVMEKGVFETEHGQRH 157 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccchhhhhhHHHHHHHHHHHHHHHhhccchhhhhhhh Confidence 000 00 000000000000000 00000000000 000011111122211 Q ss_pred ------ccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEeecc--ccccceecccchh------hhhh Q lcl|Aclame:pro 121 ------GVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS--ANEAQVHKDGQTK------TEQA 186 (400) Q Consensus 121 ------gV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n--a~~a~GHk~ga~K------k~q~ 186 (400) +..+.+....+|..+..-|-+.+.++.++++.+.+...+.....+...+ ....|+. -|..+ +..+ T Consensus 158 ~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~v~-e~~~~~~~~~~~~~~ 236 (458) T protein:vir:10 158 LKAVNQSSSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSKILTMLVEPDAGKATWVA-ASTYGTDTTTGEEVK 236 (458) T ss_pred hhhhhhcccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCcceEEEEecCCcceeecc-ccccccccccccccc Confidence 1223344457888888888888888888887666666555544433333 2333322 22222 3446 Q ss_pred hhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccc---cchhhhhhhhhhh Q lcl|Aclame:pro 187 ATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFK---SIDKEADVKKIKK 263 (400) Q Consensus 187 ~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~---~~~~e~D~~~ik~ 263 (400) .+|...++.|..++....+.+.+ ++.+.-++.+|+.++|+.++- ++++.++++|||++... ..+..+... - T Consensus 237 ~~~~~i~~~~~k~~~~v~is~el--l~ds~~~~~~~i~~~l~~~i~-~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~---~ 310 (458) T protein:vir:10 237 GALKEIHFSTYKLAAKSFITDET--EEDAIFSLLPLLRKRLIEAHA-VSIEEAFMTGDGSGKPKGLLTLASEDSAK---V 310 (458) T ss_pred ccceeeEeeeeeEEeeehhhHHH--HhcchHHHHHHHHHHHHHHHH-HHHHHHhhcCCCCCccceeeecccccccc---e Confidence 67888899988777766663332 233324689999999999999 79999999999984211 111111111 1 Q ss_pred hhhhhhccCCCCCHHHHHHhhceecccc-cceEEEEecchhHHHHhhhhhccccccceeecCCcceeehhhcccccee-- Q lcl|Aclame:pro 264 ITTKAKSAGKTPFADAIEEAVDFVRPTA-GRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVY-- 340 (400) Q Consensus 264 it~~at~~~~T~~~dal~Eald~~~~~~-~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~-- 340 (400) ++..+...+.+-..|+|..++.-..+.. +.-.+++|+.++.+|+. ++|.+|++.....+...... T Consensus 311 ~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~------------lkd~~G~~i~~~~~~~~~~~~~ 378 (458) T protein:vir:10 311 VTEAKADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDL------------LEDEEWQDVAQVGNDSVKLQGQ 378 (458) T ss_pred eecccccccccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHh------------hcccCCceeeccccccccccCc Confidence 1111112222334677777663222222 33468999999998877 88999988765433322110 Q ss_pred --c--chhhhchhhhhcc-----c-ceechhh-ceeccceeee------ecCce--EEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 341 --T--GSKALKPTVLVDQ-----K-YHIDMQD-LTKVDAFEWK------TNSNM--ILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 341 --t--g~k~l~~t~~vd~-----k-~~~~~~~-~~~~~~~~~~------~~~~~--ilve~~~~~~~~~~~~~~~~~~~ 400 (400) | |-.. +.+..++. . +-.|+.+ |.-.+..++. +..|+ ++.|.--. .-.|...+++..+ T Consensus 379 ~~~l~G~pv-~~~~~~p~~~~~~~~~~~~f~~~~~~~~~~~~~v~~d~~~~~~~~~~~~~~r~~--~~v~~~~a~v~~~ 454 (458) T protein:vir:10 379 VGRIYGLPV-VVSEYFPAKANSAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVN--LQRYFANGVVSGT 454 (458) T ss_pred Cceecceee-EEccccccccCCcceEEEEecccEEEEEeeceEEEeecccCCCceEEEEEEEec--ceEecccceEEEe Confidence 0 2211 11111111 0 1233322 3333222222 11222 22222211 2234444544443 No 53 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=99.38 E-value=1.7e-13 Score=90.54 Aligned_cols=341 Identities=10% Similarity=0.085 Sum_probs=171.2 Q ss_pred cchhh-HHHHHHHHHHHHHHHHHhhhhhhcchhhhhh----hhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHhhh----- Q lcl|Aclame:pro 8 MNKPD-LIEKQNRLAELKENNVSLKSQISGFEVKNAI----EDLPKVQELEKTLSENSIEIIKIENELNAQEEKP----- 77 (400) Q Consensus 8 ~~k~~-~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~----~~~skieElektis~l~aEi~K~enEl~~~kEk~----- 77 (400) |+-.. +.+=+++++++++.+......+.+...+... ....++++++..++.+..+++..+..+......+ T Consensus 1 Mn~~e~lkel~~~~~el~~~~~~~~~~~~~~~~e~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (421) T protein:vir:13 1 MNLFERLKELRAKKKELEEKRCGIVEEIRSLAKEKKEEEARSKALEREKIEARMEIIEEEIESVMTAIDEERKNTNFTGG 80 (421) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Confidence 65333 4444466777777776666655554322211 1123455555555555444443333322211100 Q ss_pred --cchhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceee Q lcl|Aclame:pro 78 --KGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHV 155 (400) Q Consensus 78 --K~k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV 155 (400) ...... .-.+.......|++-+.......+. .+++.+.+--.++|..+...|...++++.++++.+.+ T Consensus 81 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~---------ra~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~ 150 (421) T protein:vir:13 81 RVIINGDS-KEEKRSLQLSAMSKTIRGIQLSEEE---------RDIMSSTNNGAVIPQEFVNEFEKLKEGYPSLKEHCHV 150 (421) T ss_pred ccccccch-hHHHHHHHHHHHHHhhhccchhHHH---------hhccccCCcceecchhhHHHHHHHHHhhhhhhhhcee Confidence 000000 0011111111222222211111111 1245555555579999999999999999999887777 Q ss_pred ecccceeeEEee--cc-ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 156 TNVGALLVSRSF--DS-ANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIV 232 (400) Q Consensus 156 ~n~~~~a~~i~l--~n-a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI 232 (400) .+.+.....+.- .. ......+--|..+.....+|...++.|..++.+..+.+-+- +.+.-++.+|++++|+.++. T Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~s~~~f~~i~~~~~k~~~~v~iS~ell--~ds~~~l~~~i~~~la~~~~ 228 (421) T protein:vir:13 151 IPVNRNAGKMPVRAGASVDKLANLAKDTELVKAMLKTQPMAYDIDDYGLLAPIDNSLL--EDSEINFLEFVNEEFAEFAV 228 (421) T ss_pred eeccCCceEEEEeecCCccceeeccccccccccccceeEEEeeeeeeEeehhhhHHHH--hhhHHHHHHHHHHHHHHHHH Confidence 666655443322 22 23344466677888889999999999998888877744332 22224689999999999988 Q ss_pred HHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceecccc-cceEEEEecchhHHHHhhhh Q lcl|Aclame:pro 233 NKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTA-GRRYLIVKAEDRKALLDELR 311 (400) Q Consensus 233 ~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~~-~~~~l~~~~~d~~a~~~~~~ 311 (400) +.++.+++.. ++-+.+ .+.+.+.|++..++.-..+.. ....+|+++.++.+|+- T Consensus 229 -~~~~~~i~~~-----------------~~g~~~----~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~--- 283 (421) T protein:vir:13 229 -NTENAEIVKQ-----------------AKAVLA----EETINDYAGLVKTINSLVPNARKRAIIVTNSDGRAYLDG--- 283 (421) T ss_pred -HHhhhhHhhh-----------------hhhccc----cccccchHHHHHHHHHhhhhhcCCCEEEEcHHHHHHHHH--- Confidence 5777665421 111211 122345677777774444332 33578899999999887 Q ss_pred hccccccceeecCCcceeehhhc---cccceecchhhhchhhhhccc----------ceechhh---------ceeccce Q lcl|Aclame:pro 312 QATANANVRIKNDDTEIASEVGV---DEIIVYTGSKALKPTVLVDQK----------YHIDMQD---------LTKVDAF 369 (400) Q Consensus 312 ~~~~~an~~lk~~d~~~~~~v~v---~~~~~~tg~k~l~~t~~vd~k----------~~~~~~~---------~~~~~~~ 369 (400) ++|.+|.+.+.-.. +.++. |.. .+++|.- +--|++. ++---+. T Consensus 284 ---------lkd~~G~~i~~~~~~~~~~tl~--G~p----V~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~ 348 (421) T protein:vir:13 284 ---------LMDKQGRPLLKELSDGGDLVFK--GRP----VIELEESIFDVGDETKFIVSDFKTLIKFMDRKQYLIDQSK 348 (421) T ss_pred ---------hhcCCCceeecCcCCCCCceec--cee----eEEeccccccCCCceEEEEEeccccEEEEEecceEEEeec Confidence 88999998875311 11111 332 1211110 1122222 2111111 Q ss_pred eeeecCceEE--EEecccccc---------eeeccceeEeeC Q lcl|Aclame:pro 370 EWKTNSNMIL--VETLTSGHV---------ETYNAGAVITVS 400 (400) Q Consensus 370 ~~~~~~~~il--ve~~~~~~~---------~~~~~~~~~~~~ 400 (400) .-.|..|++. ++.--.|-+ ..+..|+.++.+ T Consensus 349 ~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~a~v~~~ 390 (421) T protein:vir:13 349 EAGYTKNETIARIIERFDVNSPLDKSSDAEKIRKFGVIVKLQ 390 (421) T ss_pred ccccccCeeEEEEEeeecceeecchhhheeeecccceeeccc Confidence 1123344433 222222211 112223333332 No 54 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=99.36 E-value=2.2e-13 Score=89.95 Aligned_cols=360 Identities=13% Similarity=0.109 Sum_probs=166.0 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhH--hh------hcc Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQE--EK------PKG 79 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~k--Ek------~K~ 79 (400) |+-..|.++.+++.+--+.+.+.+..-..++.+ +.+++++++..+++++.+|...+....... ++ +.. T Consensus 1 M~l~eL~~~r~~~~~~~~~l~~~~~e~~~l~~e----e~~~~~~l~~ei~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~ 76 (435) T protein:vir:80 1 MNVNELRRERAAVNQRVQALAQIEVGGTALSVE----QQAEFDQLSSKFNELTAQIERAEAAERMAAAAAVPVDPNPAAV 76 (435) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHhccCCCCHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccchhhhh Confidence 777777766655544322233222221223222 246778888889999888865432111111 10 000 Q ss_pred hhHHHH-------HHHHHH-HHHHHHHHHHHccCChhHHHHHHHHHH-------hCccch---hhhHhhcchhHHHHHHH Q lcl|Aclame:pro 80 KDKMTN-------FIESQN-AVTEFFDVLKKNSGKSEIKNAWSAKLA-------ENGVTI---TDTTFQLPRKLVESINT 141 (400) Q Consensus 80 k~emtE-------fLkTkq-A~~dya~ll~~nqg~ke~k~AW~a~L~-------ekgV~~---qd~~eiLP~~iI~AIe~ 141 (400) ...... -...+. +...|++-+...++ +...+....+. .+.+.+ .+--.++|..+..-|-+ T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lvP~~~~~~ii~ 154 (435) T protein:vir:80 77 TASAAAPVYAQPKAPEVKGAKMARMVRALAAARG--DAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIE 154 (435) T ss_pred ccccccccccccchhhhhHHHHHHHHHHHHhccc--hhHHHHHHHHhhhhhhhhhhhhcccCCCCCccccchhHHHHHHH Confidence 000000 000111 11122222222222 22222221111 111111 11112678887777777 Q ss_pred HHHhhCccccceeeeccc--ceeeEEee-cc-ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchh Q lcl|Aclame:pro 142 ALLNTNPVFKVFHVTNVG--ALLVSRSF-DS-ANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYS 217 (400) Q Consensus 142 A~ed~d~vl~~fhV~n~~--~~a~~i~l-~n-a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~yg 217 (400) .++.+.++++ +.+...| .+...+.- .. ....|. --|..+++...+|...++.|..++....+.+.+-+-.+.-. T Consensus 155 ~l~~~~~i~~-~~~~~v~~~~~~~~~p~~~~~~~a~~v-~E~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~ 232 (435) T protein:vir:80 155 LLRPKSVVRK-LGARTLPLSNGNITIPRLKGGAIVGYI-GADTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNP 232 (435) T ss_pred HHhhhchhhh-ccceeeecCCCceEEEEEeCCcceeee-ccCccccccccceeeEEEeeEEEEEeehhhHHHHHhhcccH Confidence 7776666654 3222222 22223321 22 233332 34667888899999999999988887777444432222213 Q ss_pred HHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCC--HHHHHHhh-ceec--cccc Q lcl|Aclame:pro 218 ELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPF--ADAIEEAV-DFVR--PTAG 292 (400) Q Consensus 218 alvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~--~dal~Eal-d~~~--~~~~ 292 (400) ++.+|++++|+.++- ++.+.+++.|||++.. .--+..+.. +...+ ....+.|.. ..++..++ .+.. +-.. T Consensus 233 ~l~~~i~~~l~~a~~-~~~d~a~l~G~G~~~~-p~Gi~~~~~-~~~~~--~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~ 307 (435) T protein:vir:80 233 NVDQIVVGDLTAAIG-AREDKAFIRDDGTANT-PKGLRFWAL-PGNVI--TASDGSTLQKIETDLGKAILALENADANLT 307 (435) T ss_pred HHHHHHHHHHHHHHH-HHHHHHhhccCCCCCc-ccceeeccc-cccee--ecccccchhhHHHHHHHHHHHhhccccccc Confidence 578999999999999 7999999999997421 001111100 00111 111111111 11333333 2222 2223 Q ss_pred ceEEEEecchhHHHHhhhhhccccccceeecCCcceeehhhccccceecchhhhchhhhhcc-------------cceec Q lcl|Aclame:pro 293 RRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKALKPTVLVDQ-------------KYHID 359 (400) Q Consensus 293 ~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~tg~k~l~~t~~vd~-------------k~~~~ 359 (400) ...+++++.++.+|+- ++|.+|.+.++..-+.+.. |. |.++.|. -+--| T Consensus 308 ~~~~vmn~~~~~~L~~------------lkd~~G~~l~~~~~~~~l~--G~----pv~~~~~~p~~~~~~~~~~~i~~gd 369 (435) T protein:vir:80 308 QPGWIMAPRTFRFLEG------------LRDGNGNKVYPELANGMLK--GY----PVGKTTQVPINLGEAGKESEIYFTD 369 (435) T ss_pred cCEEEEcHHHHHHHHh------------hhccCCceeccCCCCCeEe--ee----eeEEeccccccccCCCCcceEEEEE Confidence 3467899999988877 8899999988744333322 32 2221111 01123 Q ss_pred hhhceeccc---------------------eeeeecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 360 MQDLTKVDA---------------------FEWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 360 ~~~~~~~~~---------------------~~~~~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) ++.|+-.+. .-|..|+-.|.++.--.+.| .++.|+..+. T Consensus 370 ~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~--~~~~a~~~l~ 429 (435) T protein:vir:80 370 FGDVFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGP--RHVESIAVLS 429 (435) T ss_pred cccEEEEeecceEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEe--ecccceEEEe Confidence 333321100 01333333333333322222 2344444444 No 55 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=99.26 E-value=3.7e-12 Score=83.28 Aligned_cols=346 Identities=15% Similarity=0.112 Sum_probs=178.6 Q ss_pred cchhhHHHHHHHH-HHHHHHHHHhhhh----hhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHhhhcchhH Q lcl|Aclame:pro 8 MNKPDLIEKQNRL-AELKENNVSLKSQ----ISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDK 82 (400) Q Consensus 8 ~~k~~~eekq~~l-A~lKe~~~~~Ks~----i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kEk~K~k~e 82 (400) |...+++++++++ +.+++.......+ .+....+-.-.+..+.+++++.+.++++.+++.+.... ...++... T Consensus 1 m~~~e~~~~~~~~~~~l~~~~~~~~~e~~~~~e~~~~~~~~~~~~~~~e~~~~~~~l~~~~~~~e~~~~---~~~~~~~~ 77 (379) T protein:vir:10 1 MEALEIKVALEAIKGQVDSKSSAQALEVKGLIEALEAKMTSEKDLAVNELKSDMAALQAHADKLDVKLK---EKAKSEDK 77 (379) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hccccccc Confidence 7777777775555 4455444333222 22221111111123455666666666666655443322 22233333 Q ss_pred HHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhC--ccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccc Q lcl|Aclame:pro 83 MTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAEN--GVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGA 160 (400) Q Consensus 83 mtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ek--gV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~ 160 (400) ..++.+........ .. .++..=.....++ +..+++.....|..+..-|-..++++.++.+..++..... T Consensus 78 ~~~~~~~~~~~~~~---~~------~~~~~~~~~~~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~ 148 (379) T protein:vir:10 78 SDSLVKSITENFND---IK------EVRNGKSIQVKAVGDMTLPVNLTGAQPKDYNFDVVLNPSQMLNVSDIVGAVSISG 148 (379) T ss_pred chhHHHHHHHHHHh---HH------HHHhhhhhhhhhhcccccCCCCccccchhhhhHHHHhHHhhhhHHhhceeeeccC Confidence 33333332221111 00 1111101112223 2444455556787777777777777788877666544444 Q ss_pred eeeEEeecc-cc-ccc-eecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 161 LLVSRSFDS-AN-EAQ-VHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVD 237 (400) Q Consensus 161 ~a~~i~l~n-a~-~a~-GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~ 237 (400) ..+.+--.+ .. .++ ..--|.++.....+|...++.|..++....+.+.+- +.+ ..+.+|+..+|+..+- +..+ T Consensus 149 ~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell--~D~-~~l~~~i~~~la~~~~-~~~~ 224 (379) T protein:vir:10 149 GTYTFVRENGAGEGAIGAQVEGATKGQKDYDISMIDVNTDFIAGFTRYSKKMA--NNL-PFLTSFIPNALRRDYA-KAEN 224 (379) T ss_pred CceEEEEeecCCCcccccccCCccccccccceeeeEeeeeeEEeeehhhHHHH--hhH-HHHHHHHHHHHHHHHH-HHHH Confidence 433333222 21 121 123466888888999999999998888866744442 222 4599999999999998 7999 Q ss_pred cceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhh-ceecccccceEEEEecchhHHHHhhhhhcccc Q lcl|Aclame:pro 238 LALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKAEDRKALLDELRQATAN 316 (400) Q Consensus 238 rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eal-d~~~~~~~~~~l~~~~~d~~a~~~~~~~~~~~ 316 (400) .+++.|+|...+... ...++....|.+..++ ++..+-.....+++++.+..+|+- T Consensus 225 ~~~~~g~~~~~~~~~----------------~~~~~~~~~d~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~-------- 280 (379) T protein:vir:10 225 AAFNAVLAANATAST----------------EIITNKNKVEMLINEIAKQENLDFPVTAIVLRPTDYYDILV-------- 280 (379) T ss_pred HHHhccccccccccc----------------ccccCcccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHH-------- Confidence 999999987643221 1122333456677665 232232233458899999999887 Q ss_pred ccceeecCCcceeehhhccccce--ec--chhhhchhhhhccc--ceechhhceeccce------------eeeecCceE Q lcl|Aclame:pro 317 ANVRIKNDDTEIASEVGVDEIIV--YT--GSKALKPTVLVDQK--YHIDMQDLTKVDAF------------EWKTNSNMI 378 (400) Q Consensus 317 an~~lk~~d~~~~~~v~v~~~~~--~t--g~k~l~~t~~vd~k--~~~~~~~~~~~~~~------------~~~~~~~~i 378 (400) +||.+|++....++.-..- .| | ...+.+..++.. +-.|++.+...... .|..|+-.+ T Consensus 281 ----lkd~~G~~l~~~~~~~~~~~~~~l~G-~pvv~s~~~~ag~~~~gdf~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~ 355 (379) T protein:vir:10 281 ----TQKSVGAGYGLPGVVTQDNGVLRING-IPLFRATWLAANKYYVGDWTRVTKVTTEGLSLEFSEVEGTNFVKNNITA 355 (379) T ss_pred ----hhccCCceeccCCccCCCCCcceecc-eeeEecCCCCCCceEEeecccEEEEEEeceEEEEeecccccccCCcEEE Confidence 8999999876444321100 01 2 122333333332 22355554332211 133333334 Q ss_pred EEEecccccceeeccceeEeeC Q lcl|Aclame:pro 379 LVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 379 lve~~~~~~~~~~~~~~~~~~~ 400 (400) .+|.--.+.|- +..|++.++ T Consensus 356 r~~~R~~~~v~--~p~a~v~~~ 375 (379) T protein:vir:10 356 RIEAQVALAVE--QPAALIFGD 375 (379) T ss_pred EEEEEeccEEe--cCccEEEEE Confidence 44544444443 444444443 No 56 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=99.24 E-value=1.1e-11 Score=80.65 Aligned_cols=350 Identities=13% Similarity=0.075 Sum_probs=159.0 Q ss_pred cchh-----hHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHhhh-cchh Q lcl|Aclame:pro 8 MNKP-----DLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKP-KGKD 81 (400) Q Consensus 8 ~~k~-----~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kEk~-K~k~ 81 (400) ||+- .+.+..+++..+++++.++....++-. +...+...+++.++..++.+..+++..+.+..+..+.. +... T Consensus 1 Mk~l~el~~~~~e~~~~~~~~~~~~~~~~~~~~~~~-ee~~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~ 79 (387) T protein:vir:93 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDM-EDIKQLETEKAGLQQRFNIVERQVKDIEEKEKAKVKDTGEAYQ 79 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccCC Confidence 6652 233333444444444444444333321 11111223444555555555555444333332222111 0000 Q ss_pred HHHHHHHHHHHHHHHHHHH-HHccCChhHHHHHHHHHHhCccch-hhhHhhcchhHHHHHHHHHHhhCccccceeeeccc Q lcl|Aclame:pro 82 KMTNFIESQNAVTEFFDVL-KKNSGKSEIKNAWSAKLAENGVTI-TDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVG 159 (400) Q Consensus 82 emtEfLkTkqA~~dya~ll-~~nqg~ke~k~AW~a~L~ekgV~~-qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~ 159 (400) .+.+=-+..++-..|.+-. ...++. +....+......-+..+ .+--.++|..+...|-..+.+++++.+...|.+.+ T Consensus 80 ~~~~~~~~~~~~~~~~r~~~~~~~~~-~~~~~~~~~~~al~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~~~~v~~~~ 158 (387) T protein:vir:93 80 SLNDHEKMVKAKAEFYRHAILPNEFE-KPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIK 158 (387) T ss_pred CcchhhHHHHHHHHHHHHHhhhhhhh-hhhhhhHHHHHhhccCcCCCCceeechhHHHHHHHHHHhhchhhhheeeeecC Confidence 0000001111111221111 111111 11122222222222110 11122789999888999999999988877777666 Q ss_pred ceeeEEeeccccccceecccchhhhhhhhhhhhhccHHHHHHHHHHH-HHHHhhcCchhHHHHHHHHHHHHHHHHHHH-h Q lcl|Aclame:pro 160 ALLVSRSFDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLA-ERVKRLQMSYSELYNLIVAELTQAIVNKIV-D 237 (400) Q Consensus 160 ~~a~~i~l~na~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~La-d~~k~l~g~ygalvnyvm~ELaq~fI~Rav-~ 237 (400) .......--+...+..+.-|++.+++..+|...++.+..++.+..+. +++ +.+..++.+|++++|+..|. +.. . T Consensus 159 ~~~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell---~Ds~~~l~~~i~~~la~~~~-~~e~~ 234 (387) T protein:vir:93 159 GLEIPRVSYTLDDDDFITDVETAKELKLKGDTVKFTTNKFKVFAAISDTVI---HGSDVDLVNWVENALQSGLA-AKERK 234 (387) T ss_pred CceEEEEeecCCccccccCcccccccccccceeeeeheeeeeechhhHHHH---hhhHHHHHHHHHHHHHHHHH-HHHHH Confidence 54332211122334446678888889999999999988877776553 333 33334689999999999998 454 4 Q ss_pred cceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceeccc-ccceEEEEecchhHHHHhhhhhcccc Q lcl|Aclame:pro 238 LALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPT-AGRRYLIVKAEDRKALLDELRQATAN 316 (400) Q Consensus 238 rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~-~~~~~l~~~~~d~~a~~~~~~~~~~~ 316 (400) .++..|+|++...- .... .+.+.++.+...|+|..++.-..|. ...-.+++++.+..+++.- T Consensus 235 ~~~~~g~g~g~p~g--~l~~--------~~~~~v~~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~~~~~------- 297 (387) T protein:vir:93 235 DALAVSPKSGLDHM--SFYN--------GSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISV------- 297 (387) T ss_pred hHhhcCCCccccce--eeec--------cccccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHH------- Confidence 46677787743111 0000 1122334455678888877433332 2333578888887777663 Q ss_pred ccceeecCCcceeehhhccccceecchhhhchhhhhcccce---echhhceeccceeeeecCceEEEEecc---cccce- Q lcl|Aclame:pro 317 ANVRIKNDDTEIASEVGVDEIIVYTGSKALKPTVLVDQKYH---IDMQDLTKVDAFEWKTNSNMILVETLT---SGHVE- 389 (400) Q Consensus 317 an~~lk~~d~~~~~~v~v~~~~~~tg~k~l~~t~~vd~k~~---~~~~~~~~~~~~~~~~~~~~ilve~~~---~~~~~- 389 (400) ++|++|.+.. |.+.+++ |. |-++.|.... =|++.|.. --.+ ++++..+ .|.+- T Consensus 298 ----~~d~~~~~~~--~~~~~ll--G~----PV~~~~~~~~~~~GDf~~~~~-------~~~~-~~~~~~~~~~~~~~~~ 357 (387) T protein:vir:93 298 ----LSNGTTNFFD--TPAEKVF--GK----PVVFTDAAVKPIVGDFNYFGI-------NYDG-TTYDTDKDVKKGEYLF 357 (387) T ss_pred ----HhcCCCcccc--cCCcccc--cc----ceEEecCCCceeeeehhhhhe-------ehhh-heeeecccccCCceeE Confidence 3445555432 3333333 32 3233332111 12222211 1111 1111111 12111 Q ss_pred ---------eeccceeEeeC Q lcl|Aclame:pro 390 ---------TYNAGAVITVS 400 (400) Q Consensus 390 ---------~~~~~~~~~~~ 400 (400) .++..|+..+. T Consensus 358 ~~~~r~d~~v~~~eA~~~l~ 377 (387) T protein:vir:93 358 VLTAWYDQQRTLDSAFRIAK 377 (387) T ss_pred EEEeeeCceeechhheEEEE Confidence 12222222221 No 57 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=99.24 E-value=1.7e-11 Score=79.70 Aligned_cols=377 Identities=13% Similarity=0.136 Sum_probs=161.1 Q ss_pred CcccccccchhhHHHHHHHHHHHHHHHHHhhhhhhcch-------hhhhh----hh----hhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MRISKRNMNKPDLIEKQNRLAELKENNVSLKSQISGFE-------VKNAI----ED----LPKVQELEKTLSENSIEIIK 65 (400) Q Consensus 1 ~~~s~~~~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~-------~~~~~----~~----~skieElektis~l~aEi~K 65 (400) |-|---++. ..++++..+|.+|.+....++++.+... .+.+. ++ -..+.+++..+.++..||.. T Consensus 1 ~~~~~~~l~-~~~~~~~~~l~el~e~~~~l~k~~~el~~~l~ea~~~ee~~~~ee~i~~l~~~~~el~e~~~~l~~ei~~ 79 (466) T protein:vir:80 1 MALRQLMLA-KKIEQRKAALAELLEQEKALQKRSEELEAAIDEANTDEEIAVVEDEINKLEGEKTELEEKKSKLEGEIKE 79 (466) T ss_pred CchHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 433322222 2345555556666555555544443221 00000 01 12233444555555566555 Q ss_pred HHHHHHhhH---hhhcchhHHHHH-----HHHH-HHHHHHHHHHHHccC-----ChhHHHHHHHHHHh--CccchhhhHh Q lcl|Aclame:pro 66 IENELNAQE---EKPKGKDKMTNF-----IESQ-NAVTEFFDVLKKNSG-----KSEIKNAWSAKLAE--NGVTITDTTF 129 (400) Q Consensus 66 ~enEl~~~k---Ek~K~k~emtEf-----LkTk-qA~~dya~ll~~nqg-----~ke~k~AW~a~L~e--kgV~~qd~~e 129 (400) .+++++.+. +....++...+. ++.. .....+.+-+...+. .++.+..|.+.-.. .+....+... T Consensus 80 le~el~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~ 159 (466) T protein:vir:80 80 LENELEQLNNKEPKNNSEPAQVSGARTQQFVGGETRMKGFFRNMPYEQRAALIARSEVKEFLAQVRTLAQQKRAVSGAEL 159 (466) T ss_pred HHHHHHHHHHhhhccCchhHHHHhhhhhHHhhHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhccccc Confidence 454444322 222222222111 1111 111111111111111 12223333221111 1111122223 Q ss_pred hcchhHHHHHHHHHHhhCccccceeeecccceeeEEeecc--ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHH Q lcl|Aclame:pro 130 QLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS--ANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAE 207 (400) Q Consensus 130 iLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n--a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad 207 (400) ++|.-++..|.+.+.+++++++...|.+..... .+...+ ....|+. -|.++++...+|...++.+..++-+..+.+ T Consensus 160 ~vP~~~~~~i~~~l~~~~~l~~~~~v~~~~g~~-~~~~~~~~~~a~wv~-E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ 237 (466) T protein:vir:80 160 TIPDVMLELLRDNMHRYSKLISKVRLRPLKGTA-RQNIAGAIPEGVWTE-AVANLNELSLSFSQIEVDGYKVGGFIPIPN 237 (466) T ss_pred cccHHHHHHHHHhhhhhhhhhhheeeeecCcee-EeeeecCCcceeecc-cccccccccccccceeecceeeeeehhhhH Confidence 799999999999999999999966666655432 222233 2345554 556677778889999999888888777744 Q ss_pred HHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhh-------------hhhhhhh------hh Q lcl|Aclame:pro 208 RVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVK-------------KIKKITT------KA 268 (400) Q Consensus 208 ~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~-------------~ik~it~------~a 268 (400) .+-+ .+.-++..|++.+|+..|. ++.+.++++|||+.... -+..+.- .+..++. .. T Consensus 238 ell~--ds~~~l~~~i~~~la~~~~-~~~~~ail~G~G~~~P~--Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 312 (466) T protein:vir:80 238 STLE--DSDLNLADEILDAIGQAIG-FALDKAILYGTGTKMPV--GIVTRLAQTTQPPNWGTKAPAWTNLSTTNLLKIDP 312 (466) T ss_pred HHHh--cchHHHHHHHHHHHHHHHH-HHHhhheeeccCCCCcc--eeeecccccccccccccccccccccchhhhhhhhh Confidence 4433 2335689999999999999 79999999999984210 0100000 0000000 00 Q ss_pred hccCCCCCHHHHHHhhceeccc--ccceEEEEecchhHHHHhhhhhccccccceee---cCCcceeehhhccccceecch Q lcl|Aclame:pro 269 KSAGKTPFADAIEEAVDFVRPT--AGRRYLIVKAEDRKALLDELRQATANANVRIK---NDDTEIASEVGVDEIIVYTGS 343 (400) Q Consensus 269 t~~~~T~~~dal~Eald~~~~~--~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk---~~d~~~~~~v~v~~~~~~tg~ 343 (400) .......+...+.-++...+.. -+..+.+.+......|+. ++ +.+|.++...+-...++ |. T Consensus 313 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~------------~~~~~~~~g~~~~~~~~~~~i~--G~ 378 (466) T protein:vir:80 313 TGKSAEEFFSELVLKLSKARANYSNGMKFWAMSSNTHAVLMS------------KAITFNSAGALVASLNNTMPIV--GG 378 (466) T ss_pred hccchhhHHHHHHHHHHhhhccccCCceeEEecchhHHHhhc------------ccccccCCccccccCCCccccc--cc Confidence 0000011111111122222222 222333444444433333 22 55666655443222121 32 Q ss_pred hhhchhhhhcc--cceechhhceecc--------ceeeeecCce--EEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 344 KALKPTVLVDQ--KYHIDMQDLTKVD--------AFEWKTNSNM--ILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 344 k~l~~t~~vd~--k~~~~~~~~~~~~--------~~~~~~~~~~--ilve~~~~~~~~~~~~~~~~~~~ 400 (400) . .+++..+.. -+.-+.++|+-++ +.+-.|..|+ +.+..--.|-+--.+|=.+++++ T Consensus 379 p-vv~s~~~~~~~~~~g~~~~y~i~~r~~~~i~~~~~~~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~ 446 (466) T protein:vir:80 379 D-IVILDFIPDNDIIGGYGSLYLLAERADIKLAQSEHVRFIEDQTVFKGTARYDGKPVFGEGFVAVNIA 446 (466) T ss_pred c-eeecCccCccceeeeccccEEEEeecceEEEechhhhhhcCcEEEEEEEEEccEEeccCceEEEEec Confidence 1 111111111 0111122222111 1112222232 33333334444322222333344 No 58 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=99.23 E-value=4.9e-12 Score=82.59 Aligned_cols=359 Identities=14% Similarity=0.054 Sum_probs=162.4 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHH---HHHHHHhhHh-h-----hc Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIK---IENELNAQEE-K-----PK 78 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K---~enEl~~~kE-k-----~K 78 (400) |=|+......++. +..+++.++.++.+... +..++++...+..++..+.. ..+.....++ + .. T Consensus 1 ~~ke~~~~~~~~~---~~~~~e~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 (413) T protein:vir:81 1 MVKEAGDAPTNAQ---VAEIAEVKSMVEQFKAD-----EDAKRERAKSVKANQDFLRELQEATAGSVDSEKSGELTRKGE 72 (413) T ss_pred ChhhHHHHHHHHH---HHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHhHHhHHHhhhHhhhhh Confidence 4444433222211 11222222222222111 01111111111111111111 0000000000 0 00 Q ss_pred chhHH--------HHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccc Q lcl|Aclame:pro 79 GKDKM--------TNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVF 150 (400) Q Consensus 79 ~k~em--------tEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl 150 (400) ....+ .+..+... ..............-+. ++..... ..+-...+....+|..+...|-..++++.++. T Consensus 73 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~-~~~~~~~-~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~ 149 (413) T protein:vir:81 73 GYKSIGEFFAKRAGDQIKQQA-GGAQLNYSVGEYVAPRV-KAASDPA-STATLTDEFQGGYGTTWNRNIIYRRREKLVVA 149 (413) T ss_pred hhhhhhhhhhhhhhhHHHHHH-HHHHhhhhhhhhhhhHH-Hhhhhhh-hhcccccccccccchhhHHHHHHHHhhhhhHH Confidence 00000 00000000 00000000000001011 1222111 12333456666889888888888899888888 Q ss_pred cceeeecccceeeEEeecc-----ccccceecccchhhhhhh-hhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHH Q lcl|Aclame:pro 151 KVFHVTNVGALLVSRSFDS-----ANEAQVHKDGQTKTEQAA-TLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIV 224 (400) Q Consensus 151 ~~fhV~n~~~~a~~i~l~n-----a~~a~GHk~ga~Kk~q~~-~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm 224 (400) +.+.+-+.+.....+.-.. ...+.-+.-|.++.+... .|...++.|..++.+..+.+-+.+ .+ +.+.+|++ T Consensus 150 ~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~--ds-~~l~~~i~ 226 (413) T protein:vir:81 150 DLMDNLTMTNTTIKYLMEKANRVVEGGFKTVAEGGKKPYMRFADFDIVTESLSKIAGLTKITDEMIE--DY-DFLVSYIN 226 (413) T ss_pred hhcceeeccCCceeEEEeccccccccccceecCcccccccCcccceeeEeeeeeEEEeehhhHHHHH--HH-HHHHHHHH Confidence 7767666655443332221 123344556777777764 799999999888777666333322 22 35899999 Q ss_pred HHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceec-ccccc-eEEEEecch Q lcl|Aclame:pro 225 AELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVR-PTAGR-RYLIVKAED 302 (400) Q Consensus 225 ~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~-~~~~~-~~l~~~~~d 302 (400) ++|+.++. ++++.++++|||+... ..-+.....+. + .+..+.....+.+..++.-+. +.+.+ ..+++|+.+ T Consensus 227 ~~la~~~~-~~~d~~~l~G~G~~~~--~~Gi~~~~~~~---~-~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~ 299 (413) T protein:vir:81 227 ARLLEELA-IEEERQLLLGDGTGNN--LTGLLKRDGIQ---T-LAVSNKDELADSIYKAMTNISLATPFQADALVINPLD 299 (413) T ss_pred HHHHHHHH-HHHHHHHhccCCCCCc--ccccccccccc---c-ccccccchhHHHHHHHHHHhhhhccCCCcEEEEcHHH Confidence 99999998 7999999999998531 11111111111 1 112223344566666663322 22222 248999999 Q ss_pred hHHHHhhhhhccccccceeecCCcceeehhhcccccee-------c--chhhhchhhhhccc--ceechhh-ceeccc-- Q lcl|Aclame:pro 303 RKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVY-------T--GSKALKPTVLVDQK--YHIDMQD-LTKVDA-- 368 (400) Q Consensus 303 ~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~-------t--g~k~l~~t~~vd~k--~~~~~~~-~~~~~~-- 368 (400) +.+|+. |||.+|.+.+.-.+....-. | |- ..+.+..++.. +..|++. |.-.+. T Consensus 300 ~~~l~~------------lkd~~G~~l~~~~~~~~~~~~~~~~~~~l~G~-pv~~s~~~~~~~~~~gd~~~~~~~~~~~~ 366 (413) T protein:vir:81 300 YQELRL------------AKDANGQYYGGGVFQGQYGSGGIMLDPAPWGL-RTVQSQVVPVGKPVVGAFRSAASVLRKGG 366 (413) T ss_pred HHHHHH------------hhccCCceeccccccccccccccccCceecce-eeEEcCCCCcccEEEEecccEEEEEEecc Confidence 999988 99999998875443322110 0 22 11222222222 2235543 222111 Q ss_pred --ee--------eeecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 369 --FE--------WKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 369 --~~--------~~~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) .. |..++=.+.++.--.|.+---++-+++++. T Consensus 367 ~~v~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~ 408 (413) T protein:vir:81 367 VRIDSTNTNVDDFENNLITVRAEERVGLMVTFPEAIVQLDVA 408 (413) T ss_pred eEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEec Confidence 11 112222344444455555444555555555 No 59 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=99.20 E-value=7.9e-12 Score=81.46 Aligned_cols=352 Identities=13% Similarity=0.067 Sum_probs=173.6 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHh------------ Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE------------ 75 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kE------------ 75 (400) |.+..+.+.+++.+++.+++..+........ .--++..++++++..+++++.+|.....+..+..+ T Consensus 1 m~~~~l~~l~e~r~~~~~e~~~L~~~~~~~~--lt~e~~~~~~~l~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~ 78 (390) T protein:vir:62 1 MDATTLSANFEARERATAELRTLTDEFAGKE--MTDEAREKEERLITAVSDYDARIKRGIEAIKAIDPVTSLLSGLQGSG 78 (390) T ss_pred CChhHHHHHHHHHHHHHHHHHHHHHHhhccc--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc Confidence 7777777777777766666666655443321 11123466777777777777777543222222211 Q ss_pred ---hhcchhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHH-HHHHHHHhhCcccc Q lcl|Aclame:pro 76 ---KPKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVE-SINTALLNTNPVFK 151 (400) Q Consensus 76 ---k~K~k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~-AIe~A~ed~d~vl~ 151 (400) ..........|+++-... +.+.. .-...+.+.+.......+|.++.. -|.+.++. .++|. T Consensus 79 ~~~~~~~~~~~~~~~r~~~~~--------------~~r~~-~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~-~~~l~ 142 (390) T protein:vir:62 79 SGAQRSADVDDDATLRAGNLG--------------EARSF-EFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVER-SAIMR 142 (390) T ss_pred ccchhhcchHHHHHHhhhhhh--------------hhHHH-HhhhhhhcccccCCCccccccchHHHHHHHHhh-hhhhh Confidence 000011111222211111 11100 111111222222222356766543 35666665 44443 Q ss_pred ce-eeecccce-eeEEeecc--ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHH Q lcl|Aclame:pro 152 VF-HVTNVGAL-LVSRSFDS--ANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAEL 227 (400) Q Consensus 152 ~f-hV~n~~~~-a~~i~l~n--a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~EL 227 (400) .+ +|...... .+.+...+ ..-+| ..-|++.+++..+|...++.|..++-+..+.+-+.+- +.-++..||+++| T Consensus 143 ~~~~~~~~~~~~~~~~p~~~~~~~a~w-v~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~d--s~~~l~~~i~~~l 219 (390) T protein:vir:62 143 GGATTFTTSDANPLDFTVITGRSSASI-VGETAEIPESYPATAQRSMGGFKYGFASVVSYEFATD--QVLDLVGFLVSDA 219 (390) T ss_pred hcceeeecCCCceeEEEEEcCCcceee-ecccccccccccceeeeEeeeeeEEeehHHHHHHHhh--hhHHHHHHHHHHH Confidence 22 44333221 12232222 23333 4456788889999999999998888776663333221 2236789999999 Q ss_pred HHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceeccc-ccceEEEEecchhHHH Q lcl|Aclame:pro 228 TQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPT-AGRRYLIVKAEDRKAL 306 (400) Q Consensus 228 aq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~-~~~~~l~~~~~d~~a~ 306 (400) +.++- +.++.++++|||.|.- +....-. ...+.+. ..+.+-+.|+|...+.=..|. .+.-..++|+.....| T Consensus 220 ~~~i~-~~~d~~~l~G~G~p~G----i~~~~~~-~~~~~~~-~~~~~~~~~~l~~~~~~l~~~~~~~a~~vmn~~~~~~L 292 (390) T protein:vir:62 220 GPAIG-DAMGRHFITGTGQPRG----ILTDASP-ATATFLA-TDTDSKVSDALIDLFHEVPSAYRANAKYVVNDLRAAQM 292 (390) T ss_pred HHHHH-HHHHhhhhccCCcccc----ccccccc-cccceec-ccccccchHHHHHHHHhhhhhhhcCCEEEEchHHHHHH Confidence 99998 7999999999997621 1111100 0001111 111122345554433111222 1233678899988887 Q ss_pred HhhhhhccccccceeecCCcceeehhhccccceec--chhhhchhhhhc--ccceechhhceeccc--------eeeeec Q lcl|Aclame:pro 307 LDELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKALKPTVLVD--QKYHIDMQDLTKVDA--------FEWKTN 374 (400) Q Consensus 307 ~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l~~t~~vd--~k~~~~~~~~~~~~~--------~~~~~~ 374 (400) +. |+|++|+|.+.-++..-.-+| |.-.. -+..++ .-+--|++.|.-.+. .+..|. T Consensus 293 ~~------------lkd~~g~~l~~~~~~~g~~~~l~G~Pv~-~~~~~p~~~i~~gd~s~~~i~~~~~~~v~~~~~~~~~ 359 (390) T protein:vir:62 293 RK------------LKDANGQYLWQSGLTVGAPSLFNGKVVE-TDDGMPADKILFADLSKYRVRFAGSLRVDRSVDAKFS 359 (390) T ss_pred HH------------hhccCCCeeecCCcCCCccceecccceE-EecCCCCccEEEeeccceeEEeecceEEEeecccccc Confidence 76 889999987643332221111 43211 111111 112245665544332 233345 Q ss_pred CceEE--EEecccccceeeccceeEeeC Q lcl|Aclame:pro 375 SNMIL--VETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 375 ~~~il--ve~~~~~~~~~~~~~~~~~~~ 400 (400) .|++. ++.--.|-|-..+|-.+++|. T Consensus 360 ~~~~~~~~~~r~d~~~~~~~A~~~l~~~ 387 (390) T protein:vir:62 360 TDQIVYRFLQRADGLLVDARGAKVLTVT 387 (390) T ss_pred CCcEEEEEEEEeCcEeechhheEEEEee Confidence 55553 444445555555555566666 No 60 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=99.20 E-value=6.6e-12 Score=81.87 Aligned_cols=363 Identities=15% Similarity=0.110 Sum_probs=154.9 Q ss_pred cchh-hHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhh-hhHHHHHHHHHHHHHHHHHHHHHHHHh-------hHh--- Q lcl|Aclame:pro 8 MNKP-DLIEKQNRLAELKENNVSLKSQISGFEVKNAIED-LPKVQELEKTLSENSIEIIKIENELNA-------QEE--- 75 (400) Q Consensus 8 ~~k~-~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~-~skieElektis~l~aEi~K~enEl~~-------~kE--- 75 (400) |++. .++|.+++|.+..+.+...+.+.... ..+ ....++++..++++..++...+..... ... T Consensus 1 m~~~~~lee~~a~l~~~~~~~~~~~~~~~~~-----~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (419) T protein:vir:94 1 MPPTPTLEEQRAALLARLDDTSLTTEQVQEI-----VAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTP 75 (419) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc Confidence 5554 33444444433333332222222111 111 011233333333333333221111111 100 Q ss_pred --hhcchhHHHHHHHHHHHHHHHHHHHHH--ccC--ChhHHHHHHHHHHhC----ccchhhhHhhcchhHHHHHHHHHHh Q lcl|Aclame:pro 76 --KPKGKDKMTNFIESQNAVTEFFDVLKK--NSG--KSEIKNAWSAKLAEN----GVTITDTTFQLPRKLVESINTALLN 145 (400) Q Consensus 76 --k~K~k~emtEfLkTkqA~~dya~ll~~--nqg--~ke~k~AW~a~L~ek----gV~~qd~~eiLP~~iI~AIe~A~ed 145 (400) ....+... +.... .+.++..+. ..+ ..+++......+... |.......-+.|.-+-.-|....+. T Consensus 76 ~~~~~~~~~~-~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~ 150 (419) T protein:vir:94 76 AEAGTFRSLA-QRFAD----SDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDL 150 (419) T ss_pred cccccccchh-hhhhh----HHHHHHHHHhhhhhhhhHHHHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHhh Confidence 00001111 11111 111111111 111 112333322222222 1111222124454444444545444 Q ss_pred hCccccceeeecccceeeEEe--------ecc-ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCch Q lcl|Aclame:pro 146 TNPVFKVFHVTNVGALLVSRS--------FDS-ANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSY 216 (400) Q Consensus 146 ~d~vl~~fhV~n~~~~a~~i~--------l~n-a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~y 216 (400) +..+.+.+++.......+.+- ... .+.+.-+--|+++++.+.+|...++.|..++.+..+.+.+-+ ++ T Consensus 151 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~-d~-- 227 (419) T protein:vir:94 151 PLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQAAD-DN-- 227 (419) T ss_pred hhhhhhcceeeeccCCceeeeeeccccccccccCcccceecCCccccccccceeeEEeeeeeEEEeehhhHHHHH-hH-- Confidence 333333334333222211111 111 134445567889999999999999999999888777443332 22 Q ss_pred hHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhh--hhccCCCCCHHHHHHhhceecccc-cc Q lcl|Aclame:pro 217 SELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTK--AKSAGKTPFADAIEEAVDFVRPTA-GR 293 (400) Q Consensus 217 galvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~--at~~~~T~~~dal~Eald~~~~~~-~~ 293 (400) +.+.+|++.+|+..+. ++++.++++|||+....-.-.... +..+.+. ....+.+...+.|..++.-..+.. .- T Consensus 228 ~~l~~~i~~~la~a~~-~~~d~aii~G~G~~~p~Gi~~~~~---~~~~~~~~~~~~~t~~~~~~~l~~~~~~~~~~~~~~ 303 (419) T protein:vir:94 228 SQLMGYIQGRLTYGLR-FLRDRQLLNGNGSTEMQGILTTPG---IGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPP 303 (419) T ss_pred HHHHHHHHHHHHHHHH-HHHHHHHHhccCcccccceecccc---cccccccccccccccchhHHHHHHHHHhhhhccCCC Confidence 4689999999999999 799999999999853211111111 1112221 112334445788888884444433 33 Q ss_pred eEEEEecchhHHHHhhhhhccccccceeecCCcc-eeehhhccccceec--chhhhchhhhhccc--ceechhh-ceecc Q lcl|Aclame:pro 294 RYLIVKAEDRKALLDELRQATANANVRIKNDDTE-IASEVGVDEIIVYT--GSKALKPTVLVDQK--YHIDMQD-LTKVD 367 (400) Q Consensus 294 ~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~-~~~~v~v~~~~~~t--g~k~l~~t~~vd~k--~~~~~~~-~~~~~ 367 (400) ..+++++.++..|+. +++.+|. +.....+.+-..+| |. .++.+.-++.. +-.|++. |+-.+ T Consensus 304 ~~~v~n~~~~~~l~~------------~k~~~~~~~~~~~~~~~~~~~~l~G~-pV~~~~~~~~~~~~~gd~~~~~~~~~ 370 (419) T protein:vir:94 304 DGVVVHPQDWESIEL------------DQAPGSGVFRVIANVQGEATPRIWGL-NVVSTVAIAQGTALVGGFRQGATLWS 370 (419) T ss_pred CEEEEcHHHHHHHHH------------HhhcCCCceeecCCcccCCCccccce-eeEEcCCCCCccEEEeeccceEEEEE Confidence 478999999999887 5554333 22111111111111 32 11112111111 2233332 22111 Q ss_pred ce------------eeeecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 368 AF------------EWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 368 ~~------------~~~~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) .- -|..++-.+.++..-.|-|---++=++++++ T Consensus 371 ~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~~ 415 (419) T protein:vir:94 371 RQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFA 415 (419) T ss_pred ecceEEEEeccccchhhcCcEEEEEEEeeccEEeccccEEEEEec Confidence 10 1222333345555555555333343444444 No 61 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=99.15 E-value=4.4e-11 Score=77.37 Aligned_cols=364 Identities=13% Similarity=0.080 Sum_probs=154.2 Q ss_pred cch-----hhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHh-hhcchh Q lcl|Aclame:pro 8 MNK-----PDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE-KPKGKD 81 (400) Q Consensus 8 ~~k-----~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kE-k~K~k~ 81 (400) ||+ ..+.+..+++.++++++.++....+.. .+...+...+++.++..+.++..+++..+.++..... ..+... T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~-~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~ 79 (387) T protein:vir:96 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNID-MEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQ 79 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCC Confidence 654 223333333444444443333332221 1111111233444444444444444433333332221 000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHH-ccCChhHHHHHHHHHHh--CccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecc Q lcl|Aclame:pro 82 KMTNFIESQNAVTEFFDVLKK-NSGKSEIKNAWSAKLAE--NGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNV 158 (400) Q Consensus 82 emtEfLkTkqA~~dya~ll~~-nqg~ke~k~AW~a~L~e--kgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~ 158 (400) ...+=-+...+-.+|.+-.+. ....+ ....+...... .|. ..+--.++|..+..-|-..+++++++.....|.+. T Consensus 80 ~~~~~~~~~~~~~~~~r~~~~~~~~~~-~~~~~~~~~~a~~~~~-~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 157 (387) T protein:vir:96 80 SLSDNEKMVKAKAEFYRHAILPNEFEK-PSMEAQRLLHALPTGN-DSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNI 157 (387) T ss_pred CCchhHHHHHHHHHHHHHHHhhhhHHH-HHHHHHHHHhhhccCC-CCCCceeechhHHHHHHHHHHhhchhhhhceeeec Confidence 000001111222222221111 11110 01111111111 111 11112378999888888999999998876666666 Q ss_pred cceeeEEeeccccccceecccchhhhhhhhhhhhhccHHHHHHHHHHH-HHHHhhcCchhHHHHHHHHHHHHHHHHHHH- Q lcl|Aclame:pro 159 GALLVSRSFDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLA-ERVKRLQMSYSELYNLIVAELTQAIVNKIV- 236 (400) Q Consensus 159 ~~~a~~i~l~na~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~La-d~~k~l~g~ygalvnyvm~ELaq~fI~Rav- 236 (400) +.......-.+...+..+.-|.+.++...+|...++.+..++.+..+. ++..+. ..++.+||+++|+..+. ++. T Consensus 158 ~~~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds---~~~l~~~i~~~la~~~~-~~e~ 233 (387) T protein:vir:96 158 KGLEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGS---DVDLVNWVENALQSGLA-AKER 233 (387) T ss_pred CCceeeeeeccCCccccccccccccccccccceeeechheeeeechhhHHHHhhh---HHHHHHHHHHHHHHHHH-HHHH Confidence 544322211122335556788899999999999999998887776663 333333 34679999999999998 453 Q ss_pred hcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceeccc-ccceEEEEecchhHHHHhhhhhccc Q lcl|Aclame:pro 237 DLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPT-AGRRYLIVKAEDRKALLDELRQATA 315 (400) Q Consensus 237 ~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~-~~~~~l~~~~~d~~a~~~~~~~~~~ 315 (400) +.++..|+|+....- +.. +...+.++++...|+|..++.-..|. ...-..++++.+..+++..+.. T Consensus 234 ~~~~~~g~g~g~~~g------~~~----~~~~~~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~--- 300 (387) T protein:vir:96 234 KDALAVSPKSGLEHM------SFY----NGSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSN--- 300 (387) T ss_pred HhHhhcCCCccccce------eee----ccccccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhc--- Confidence 456667777632100 000 01122334455678888887323222 2334678888888887764322 Q ss_pred cccceeecCCcceeehhhccccceecchhhhchhhhhccc-ceechhhceeccceeeeecCceEEEEecccccceeeccc Q lcl|Aclame:pro 316 NANVRIKNDDTEIASEVGVDEIIVYTGSKALKPTVLVDQK-YHIDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAG 394 (400) Q Consensus 316 ~an~~lk~~d~~~~~~v~v~~~~~~tg~k~l~~t~~vd~k-~~~~~~~~~~~~~~~~~~~~~~ilve~~~~~~~~~~~~~ 394 (400) .|-.+-.+.+.- .+|--.+.| ..+.|-++-|-+ |++..++....-...-.+..-.+.+..--.|-| ++.- T Consensus 301 -~~~~~~~~~~~~----llG~PV~~~--~~~~~~~~GDf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v--~~~~ 371 (387) T protein:vir:96 301 -GTTNFFDTPAEK----VFGKPVVFT--DAAVKPIVGDFNYFGINYDGTTYDTDKDVKKGEYLFVLTAWYDQQR--TLDS 371 (387) T ss_pred -CCCcccccCCcc----ccccceEEe--cCCCceeeechhhhhhhhhhhhheecccccCCceEEEEEEEeCcEe--echh Confidence 122233222211 011111111 112222222222 112221111100011111111122222222222 1222 Q ss_pred eeE--eeC Q lcl|Aclame:pro 395 AVI--TVS 400 (400) Q Consensus 395 ~~~--~~~ 400 (400) |+. .+. T Consensus 372 A~~~l~~k 379 (387) T protein:vir:96 372 AFRIAKAK 379 (387) T ss_pred heEEEEee Confidence 221 111 No 62 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=99.15 E-value=4.4e-11 Score=77.37 Aligned_cols=364 Identities=13% Similarity=0.080 Sum_probs=154.2 Q ss_pred cch-----hhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHh-hhcchh Q lcl|Aclame:pro 8 MNK-----PDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE-KPKGKD 81 (400) Q Consensus 8 ~~k-----~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kE-k~K~k~ 81 (400) ||+ ..+.+..+++.++++++.++....+.. .+...+...+++.++..+.++..+++..+.++..... ..+... T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~-~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~ 79 (387) T protein:vir:94 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNID-MEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQ 79 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCC Confidence 654 223333333444444443333332221 1111111233444444444444444433333332221 000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHH-ccCChhHHHHHHHHHHh--CccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecc Q lcl|Aclame:pro 82 KMTNFIESQNAVTEFFDVLKK-NSGKSEIKNAWSAKLAE--NGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNV 158 (400) Q Consensus 82 emtEfLkTkqA~~dya~ll~~-nqg~ke~k~AW~a~L~e--kgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~ 158 (400) ...+=-+...+-.+|.+-.+. ....+ ....+...... .|. ..+--.++|..+..-|-..+++++++.....|.+. T Consensus 80 ~~~~~~~~~~~~~~~~r~~~~~~~~~~-~~~~~~~~~~a~~~~~-~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 157 (387) T protein:vir:94 80 SLSDNEKMVKAKAEFYRHAILPNEFEK-PSMEAQRLLHALPTGN-DSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNI 157 (387) T ss_pred CCchhHHHHHHHHHHHHHHHhhhhHHH-HHHHHHHHHhhhccCC-CCCCceeechhHHHHHHHHHHhhchhhhhceeeec Confidence 000001111222222221111 11110 01111111111 111 11112378999888888999999998876666666 Q ss_pred cceeeEEeeccccccceecccchhhhhhhhhhhhhccHHHHHHHHHHH-HHHHhhcCchhHHHHHHHHHHHHHHHHHHH- Q lcl|Aclame:pro 159 GALLVSRSFDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLA-ERVKRLQMSYSELYNLIVAELTQAIVNKIV- 236 (400) Q Consensus 159 ~~~a~~i~l~na~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~La-d~~k~l~g~ygalvnyvm~ELaq~fI~Rav- 236 (400) +.......-.+...+..+.-|.+.++...+|...++.+..++.+..+. ++..+. ..++.+||+++|+..+. ++. T Consensus 158 ~~~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds---~~~l~~~i~~~la~~~~-~~e~ 233 (387) T protein:vir:94 158 KGLEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGS---DVDLVNWVENALQSGLA-AKER 233 (387) T ss_pred CCceeeeeeccCCccccccccccccccccccceeeechheeeeechhhHHHHhhh---HHHHHHHHHHHHHHHHH-HHHH Confidence 544322211122335556788899999999999999998887776663 333333 34679999999999998 453 Q ss_pred hcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceeccc-ccceEEEEecchhHHHHhhhhhccc Q lcl|Aclame:pro 237 DLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPT-AGRRYLIVKAEDRKALLDELRQATA 315 (400) Q Consensus 237 ~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~-~~~~~l~~~~~d~~a~~~~~~~~~~ 315 (400) +.++..|+|+....- +.. +...+.++++...|+|..++.-..|. ...-..++++.+..+++..+.. T Consensus 234 ~~~~~~g~g~g~~~g------~~~----~~~~~~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~--- 300 (387) T protein:vir:94 234 KDALAVSPKSGLEHM------SFY----NGSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSN--- 300 (387) T ss_pred HhHhhcCCCccccce------eee----ccccccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhc--- Confidence 456667777632100 000 01122334455678888887323222 2334678888888887764322 Q ss_pred cccceeecCCcceeehhhccccceecchhhhchhhhhccc-ceechhhceeccceeeeecCceEEEEecccccceeeccc Q lcl|Aclame:pro 316 NANVRIKNDDTEIASEVGVDEIIVYTGSKALKPTVLVDQK-YHIDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAG 394 (400) Q Consensus 316 ~an~~lk~~d~~~~~~v~v~~~~~~tg~k~l~~t~~vd~k-~~~~~~~~~~~~~~~~~~~~~~ilve~~~~~~~~~~~~~ 394 (400) .|-.+-.+.+.- .+|--.+.| ..+.|-++-|-+ |++..++....-...-.+..-.+.+..--.|-| ++.- T Consensus 301 -~~~~~~~~~~~~----llG~PV~~~--~~~~~~~~GDf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v--~~~~ 371 (387) T protein:vir:94 301 -GTTNFFDTPAEK----VFGKPVVFT--DAAVKPIVGDFNYFGINYDGTTYDTDKDVKKGEYLFVLTAWYDQQR--TLDS 371 (387) T ss_pred -CCCcccccCCcc----ccccceEEe--cCCCceeeechhhhhhhhhhhhheecccccCCceEEEEEEEeCcEe--echh Confidence 122233222211 011111111 112222222222 112221111100011111111122222222222 1222 Q ss_pred eeE--eeC Q lcl|Aclame:pro 395 AVI--TVS 400 (400) Q Consensus 395 ~~~--~~~ 400 (400) |+. .+. T Consensus 372 A~~~l~~k 379 (387) T protein:vir:94 372 AFRIAKAK 379 (387) T ss_pred heEEEEee Confidence 221 111 No 63 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=99.15 E-value=4.4e-11 Score=77.37 Aligned_cols=364 Identities=13% Similarity=0.080 Sum_probs=154.2 Q ss_pred cch-----hhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHh-hhcchh Q lcl|Aclame:pro 8 MNK-----PDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE-KPKGKD 81 (400) Q Consensus 8 ~~k-----~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kE-k~K~k~ 81 (400) ||+ ..+.+..+++.++++++.++....+.. .+...+...+++.++..+.++..+++..+.++..... ..+... T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~-~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~ 79 (387) T protein:vir:26 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNID-MEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQ 79 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCC Confidence 654 223333333444444443333332221 1111111233444444444444444433333332221 000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHH-ccCChhHHHHHHHHHHh--CccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecc Q lcl|Aclame:pro 82 KMTNFIESQNAVTEFFDVLKK-NSGKSEIKNAWSAKLAE--NGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNV 158 (400) Q Consensus 82 emtEfLkTkqA~~dya~ll~~-nqg~ke~k~AW~a~L~e--kgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~ 158 (400) ...+=-+...+-.+|.+-.+. ....+ ....+...... .|. ..+--.++|..+..-|-..+++++++.....|.+. T Consensus 80 ~~~~~~~~~~~~~~~~r~~~~~~~~~~-~~~~~~~~~~a~~~~~-~~~gG~lIP~~~~~~Ii~~~~~~~~l~~~~~~~~~ 157 (387) T protein:vir:26 80 SLSDNEKMVKAKAEFYRHAILPNEFEK-PSMEAQRLLHALPTGN-DSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNI 157 (387) T ss_pred CCchhHHHHHHHHHHHHHHHhhhhHHH-HHHHHHHHHhhhccCC-CCCCceeechhHHHHHHHHHHhhchhhhhceeeec Confidence 000001111222222221111 11110 01111111111 111 11112378999888888999999998876666666 Q ss_pred cceeeEEeeccccccceecccchhhhhhhhhhhhhccHHHHHHHHHHH-HHHHhhcCchhHHHHHHHHHHHHHHHHHHH- Q lcl|Aclame:pro 159 GALLVSRSFDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLA-ERVKRLQMSYSELYNLIVAELTQAIVNKIV- 236 (400) Q Consensus 159 ~~~a~~i~l~na~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~La-d~~k~l~g~ygalvnyvm~ELaq~fI~Rav- 236 (400) +.......-.+...+..+.-|.+.++...+|...++.+..++.+..+. ++..+. ..++.+||+++|+..+. ++. T Consensus 158 ~~~~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds---~~~l~~~i~~~la~~~~-~~e~ 233 (387) T protein:vir:26 158 KGLEIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGS---DVDLVNWVENALQSGLA-AKER 233 (387) T ss_pred CCceeeeeeccCCccccccccccccccccccceeeechheeeeechhhHHHHhhh---HHHHHHHHHHHHHHHHH-HHHH Confidence 544322211122335556788899999999999999998887776663 333333 34679999999999998 453 Q ss_pred hcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceeccc-ccceEEEEecchhHHHHhhhhhccc Q lcl|Aclame:pro 237 DLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPT-AGRRYLIVKAEDRKALLDELRQATA 315 (400) Q Consensus 237 ~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~-~~~~~l~~~~~d~~a~~~~~~~~~~ 315 (400) +.++..|+|+....- +.. +...+.++++...|+|..++.-..|. ...-..++++.+..+++..+.. T Consensus 234 ~~~~~~g~g~g~~~g------~~~----~~~~~~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~--- 300 (387) T protein:vir:26 234 KDALAVSPKSGLEHM------SFY----NGSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSN--- 300 (387) T ss_pred HhHhhcCCCccccce------eee----ccccccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhc--- Confidence 456667777632100 000 01122334455678888887323222 2334678888888887764322 Q ss_pred cccceeecCCcceeehhhccccceecchhhhchhhhhccc-ceechhhceeccceeeeecCceEEEEecccccceeeccc Q lcl|Aclame:pro 316 NANVRIKNDDTEIASEVGVDEIIVYTGSKALKPTVLVDQK-YHIDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAG 394 (400) Q Consensus 316 ~an~~lk~~d~~~~~~v~v~~~~~~tg~k~l~~t~~vd~k-~~~~~~~~~~~~~~~~~~~~~~ilve~~~~~~~~~~~~~ 394 (400) .|-.+-.+.+.- .+|--.+.| ..+.|-++-|-+ |++..++....-...-.+..-.+.+..--.|-| ++.- T Consensus 301 -~~~~~~~~~~~~----llG~PV~~~--~~~~~~~~GDf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v--~~~~ 371 (387) T protein:vir:26 301 -GTTNFFDTPAEK----VFGKPVVFT--DAAVKPIVGDFNYFGINYDGTTYDTDKDVKKGEYLFVLTAWYDQQR--TLDS 371 (387) T ss_pred -CCCcccccCCcc----ccccceEEe--cCCCceeeechhhhhhhhhhhhheecccccCCceEEEEEEEeCcEe--echh Confidence 122233222211 011111111 112222222222 112221111100011111111122222222222 1222 Q ss_pred eeE--eeC Q lcl|Aclame:pro 395 AVI--TVS 400 (400) Q Consensus 395 ~~~--~~~ 400 (400) |+. .+. T Consensus 372 A~~~l~~k 379 (387) T protein:vir:26 372 AFRIAKAK 379 (387) T ss_pred heEEEEee Confidence 221 111 No 64 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=99.15 E-value=2.1e-11 Score=79.09 Aligned_cols=366 Identities=15% Similarity=0.151 Sum_probs=158.9 Q ss_pred cch-hhHHHHHHHHH---------------HHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 8 MNK-PDLIEKQNRLA---------------ELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELN 71 (400) Q Consensus 8 ~~k-~~~eekq~~lA---------------~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~ 71 (400) |++ ++++.+..+++ ++++.+.+...++..++.+ .+...+.+++.+.+.++++++...++++. T Consensus 1 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~aE~~~~~~~~~~~~~~l~~~--~~~~~~~~~~~~~~~~~~a~~~~~~~~~~ 78 (497) T protein:vir:10 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAE--VEAHERAQEMLKSLGGADAAKDGLDNDIP 78 (497) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 433 33344433332 2333333333333333211 11112223333444444444433333222 Q ss_pred hhH--------hhhcchhHHHHHHHHHHHHHHHHHHH---HHccCC-------hhHHHHHHHHHH------hCcc-chhh Q lcl|Aclame:pro 72 AQE--------EKPKGKDKMTNFIESQNAVTEFFDVL---KKNSGK-------SEIKNAWSAKLA------ENGV-TITD 126 (400) Q Consensus 72 ~~k--------Ek~K~k~emtEfLkTkqA~~dya~ll---~~nqg~-------ke~k~AW~a~L~------ekgV-~~qd 126 (400) ..+ +..+....+.+-++........-... ..+... .+...+...-+. .... +..+ T Consensus 79 ~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (497) T protein:vir:10 79 EVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGT 158 (497) T ss_pred HHHhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcc Confidence 211 01111111222222111111110000 000000 112222111111 1111 1111 Q ss_pred hHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEeeccc---cccceecccchhhhhhhhhhhhhccHHHHHHHH Q lcl|Aclame:pro 127 TTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDSA---NEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQ 203 (400) Q Consensus 127 ~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~na---~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq 203 (400) --..+|..+..-|-+.++++..|.+.+.+.......+.+-.++. ..+|. --|.++.+...+|...++.|..|+-+. T Consensus 159 gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv-~E~~~~~~s~~~f~~i~~~~~k~a~~~ 237 (497) T protein:vir:10 159 FAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAV-AEAGTYPFSSEEFARVYEQVGKVANAL 237 (497) T ss_pred cccccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceee-ccCcccccccccceeeEeeeeeeEeec Confidence 12268887777777777878888776555444444444444332 23443 356678888899999999999888887 Q ss_pred HHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhh-----------------hh---- Q lcl|Aclame:pro 204 SLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKK-----------------IK---- 262 (400) Q Consensus 204 ~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~-----------------ik---- 262 (400) .+.+.+-+ + ++ .+.+||.++|+..+- +.++.+++.|||+.....+--...... +. T Consensus 238 ~iS~ell~-d-~~-~l~~~i~~~l~~~i~-~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (497) T protein:vir:10 238 TITDEGLR-D-AP-ELFNFVQGRLLEGIQ-RKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPAD 313 (497) T ss_pred HhHHHHHH-h-HH-HHHHHHHHHHHHHHH-HHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcc Confidence 77443333 2 23 489999999999999 799999999999743111100000000 00 Q ss_pred -------------hhhhhh--------------hccCCCCCHHHHHHhhceeccccc--ceEEEEecchhHHHHhhhhhc Q lcl|Aclame:pro 263 -------------KITTKA--------------KSAGKTPFADAIEEAVDFVRPTAG--RRYLIVKAEDRKALLDELRQA 313 (400) Q Consensus 263 -------------~it~~a--------------t~~~~T~~~dal~Eald~~~~~~~--~~~l~~~~~d~~a~~~~~~~~ 313 (400) .+...+ .........+.+.-++.-+..... ...+++|+.|..+|+- T Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~----- 388 (497) T protein:vir:10 314 GTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRL----- 388 (497) T ss_pred cccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHH----- Confidence 000000 000000111112222211111111 1268899999999988 Q ss_pred cccccceeecCCcceeehh----------hccccceecchhhhchhhhh--------c-----------ccceechhhce Q lcl|Aclame:pro 314 TANANVRIKNDDTEIASEV----------GVDEIIVYTGSKALKPTVLV--------D-----------QKYHIDMQDLT 364 (400) Q Consensus 314 ~~~an~~lk~~d~~~~~~v----------~v~~~~~~tg~k~l~~t~~v--------d-----------~k~~~~~~~~~ 364 (400) |||.+|.|.+.= +-+.+.+ |- ..+-|..+ | +...|+++.+ T Consensus 389 -------lkd~~G~~i~~~~~~~~~~~~~~~~~~l~--G~-pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~- 457 (497) T protein:vir:10 389 -------TKDANGQYMGGNFFGNAYGNPVNGGKNIW--GV-PVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNS- 457 (497) T ss_pred -------hhcCCCceeccCcccccccccccCCceee--ce-eeEecCCCCCCceEEeecccceEEEEEecccEEEeecc- Confidence 999999987621 1111111 20 00000001 1 1111222211 Q ss_pred eccceeeeecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 365 KVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 365 ~~~~~~~~~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) +...|..|+=.|.+|.--.|-| .+..|++.++ T Consensus 458 --~~~~f~~n~v~~r~~~r~~~~v--~~p~A~~~l~ 489 (497) T protein:vir:10 458 --NGTDFVDGKVTVRAEERLGLLV--YRPSAFQLIQ 489 (497) T ss_pred --cchhhhcCcEEEEEEEeeccee--eccccEEEEE Confidence 1122444444455555444433 4556665555 No 65 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=99.15 E-value=2.1e-11 Score=79.09 Aligned_cols=366 Identities=15% Similarity=0.151 Sum_probs=158.9 Q ss_pred cch-hhHHHHHHHHH---------------HHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 8 MNK-PDLIEKQNRLA---------------ELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELN 71 (400) Q Consensus 8 ~~k-~~~eekq~~lA---------------~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~ 71 (400) |++ ++++.+..+++ ++++.+.+...++..++.+ .+...+.+++.+.+.++++++...++++. T Consensus 1 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~aE~~~~~~~~~~~~~~l~~~--~~~~~~~~~~~~~~~~~~a~~~~~~~~~~ 78 (497) T protein:vir:78 1 MPSTAQLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAE--VEAHERAQEMLKSLGGADAAKDGLDNDIP 78 (497) T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 433 33344433332 2333333333333333211 11112223333444444444433333222 Q ss_pred hhH--------hhhcchhHHHHHHHHHHHHHHHHHHH---HHccCC-------hhHHHHHHHHHH------hCcc-chhh Q lcl|Aclame:pro 72 AQE--------EKPKGKDKMTNFIESQNAVTEFFDVL---KKNSGK-------SEIKNAWSAKLA------ENGV-TITD 126 (400) Q Consensus 72 ~~k--------Ek~K~k~emtEfLkTkqA~~dya~ll---~~nqg~-------ke~k~AW~a~L~------ekgV-~~qd 126 (400) ..+ +..+....+.+-++........-... ..+... .+...+...-+. .... +..+ T Consensus 79 ~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (497) T protein:vir:78 79 EVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGT 158 (497) T ss_pred HHHhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcc Confidence 211 01111111222222111111110000 000000 112222111111 1111 1111 Q ss_pred hHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEeeccc---cccceecccchhhhhhhhhhhhhccHHHHHHHH Q lcl|Aclame:pro 127 TTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDSA---NEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQ 203 (400) Q Consensus 127 ~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~na---~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq 203 (400) --..+|..+..-|-+.++++..|.+.+.+.......+.+-.++. ..+|. --|.++.+...+|...++.|..|+-+. T Consensus 159 gg~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv-~E~~~~~~s~~~f~~i~~~~~k~a~~~ 237 (497) T protein:vir:78 159 FAPGILPTFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAV-AEAGTYPFSSEEFARVYEQVGKVANAL 237 (497) T ss_pred cccccchhhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceee-ccCcccccccccceeeEeeeeeeEeec Confidence 12268887777777777878888776555444444444444332 23443 356678888899999999999888887 Q ss_pred HHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhh-----------------hh---- Q lcl|Aclame:pro 204 SLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKK-----------------IK---- 262 (400) Q Consensus 204 ~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~-----------------ik---- 262 (400) .+.+.+-+ + ++ .+.+||.++|+..+- +.++.+++.|||+.....+--...... +. T Consensus 238 ~iS~ell~-d-~~-~l~~~i~~~l~~~i~-~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (497) T protein:vir:78 238 TITDEGLR-D-AP-ELFNFVQGRLLEGIQ-RKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPAD 313 (497) T ss_pred HhHHHHHH-h-HH-HHHHHHHHHHHHHHH-HHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcc Confidence 77443333 2 23 489999999999999 799999999999743111100000000 00 Q ss_pred -------------hhhhhh--------------hccCCCCCHHHHHHhhceeccccc--ceEEEEecchhHHHHhhhhhc Q lcl|Aclame:pro 263 -------------KITTKA--------------KSAGKTPFADAIEEAVDFVRPTAG--RRYLIVKAEDRKALLDELRQA 313 (400) Q Consensus 263 -------------~it~~a--------------t~~~~T~~~dal~Eald~~~~~~~--~~~l~~~~~d~~a~~~~~~~~ 313 (400) .+...+ .........+.+.-++.-+..... ...+++|+.|..+|+- T Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~----- 388 (497) T protein:vir:78 314 GTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRL----- 388 (497) T ss_pred cccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHH----- Confidence 000000 000000111112222211111111 1268899999999988 Q ss_pred cccccceeecCCcceeehh----------hccccceecchhhhchhhhh--------c-----------ccceechhhce Q lcl|Aclame:pro 314 TANANVRIKNDDTEIASEV----------GVDEIIVYTGSKALKPTVLV--------D-----------QKYHIDMQDLT 364 (400) Q Consensus 314 ~~~an~~lk~~d~~~~~~v----------~v~~~~~~tg~k~l~~t~~v--------d-----------~k~~~~~~~~~ 364 (400) |||.+|.|.+.= +-+.+.+ |- ..+-|..+ | +...|+++.+ T Consensus 389 -------lkd~~G~~i~~~~~~~~~~~~~~~~~~l~--G~-pV~~t~~~~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~- 457 (497) T protein:vir:78 389 -------TKDANGQYMGGNFFGNAYGNPVNGGKNIW--GV-PVVTTPLIPLGTILVGHFAPSVIQTARREGVTMQMTNS- 457 (497) T ss_pred -------hhcCCCceeccCcccccccccccCCceee--ce-eeEecCCCCCCceEEeecccceEEEEEecccEEEeecc- Confidence 999999987621 1111111 20 00000001 1 1111222211 Q ss_pred eccceeeeecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 365 KVDAFEWKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 365 ~~~~~~~~~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) +...|..|+=.|.+|.--.|-| .+..|++.++ T Consensus 458 --~~~~f~~n~v~~r~~~r~~~~v--~~p~A~~~l~ 489 (497) T protein:vir:78 458 --NGTDFVDGKVTVRAEERLGLLV--YRPSAFQLIQ 489 (497) T ss_pred --cchhhhcCcEEEEEEEeeccee--eccccEEEEE Confidence 1122444444455555444433 4556665555 No 66 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=99.12 E-value=6.2e-11 Score=76.56 Aligned_cols=363 Identities=15% Similarity=0.106 Sum_probs=161.8 Q ss_pred Cc--------ccccccchhhHHHHHHHHHHHHHHHHHhhhhhhcch------hhhhhhhhhHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MR--------ISKRNMNKPDLIEKQNRLAELKENNVSLKSQISGFE------VKNAIEDLPKVQELEKTLSENSIEIIKI 66 (400) Q Consensus 1 ~~--------~s~~~~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~------~~~~~~~~skieElektis~l~aEi~K~ 66 (400) || |+--.|++ +.|.+++|.++.+.+..++..+.... .+...+-..+++.++..+.++..+++.. T Consensus 1 ~~~~~~~~~~~~g~~mk~--l~el~~~~~e~~~~~~~~~~el~~~~~~~~~~~ee~~~~~~~~~~l~~~~~~l~~~~~~~ 78 (402) T protein:vir:93 1 MRNFKNDNELLGGNEMPT--LYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDI 78 (402) T ss_pred CcchhhhhhcCCCCCChH--HHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 54 21122322 34555555555555544444432211 1111111233444555555555555444 Q ss_pred HHHHHhhHh---hhcchhHHHHHHHHHHHHHHHHHHHH-HccCChhHHHHHHHHHHhCcc-chhhhHhhcchhHHHHHHH Q lcl|Aclame:pro 67 ENELNAQEE---KPKGKDKMTNFIESQNAVTEFFDVLK-KNSGKSEIKNAWSAKLAENGV-TITDTTFQLPRKLVESINT 141 (400) Q Consensus 67 enEl~~~kE---k~K~k~emtEfLkTkqA~~dya~ll~-~nqg~ke~k~AW~a~L~ekgV-~~qd~~eiLP~~iI~AIe~ 141 (400) +.++..... .+...+.+. -+..++-.+|++-.+ ..... .....+......... +..+--.++|..+...|-. T Consensus 79 e~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~r~~~~~~~~~-~~~~~~~~~~~a~~~~t~~~GG~lIP~~~~~~Ii~ 155 (402) T protein:vir:93 79 EEKEKAKVKDKGEAYQSLSDN--EKMVKAKAEFYRHAILPNEFE-KPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVS 155 (402) T ss_pred HHHHHhhhhhccccCCCCchh--HHHHHHHHHHHHHHHhhhhHH-HHHHhHHHHHhhhccCCCcCCccccchhHHHHHHH Confidence 433332221 111111111 111122222322111 11111 111111111111111 1112233789998888889 Q ss_pred HHHhhCccccceeeecccceeeEEeec-cccccceecccchhhhhhhhhhhhhccHHHHHHHHHHH-HHHHhhcCchhHH Q lcl|Aclame:pro 142 ALLNTNPVFKVFHVTNVGALLVSRSFD-SANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLA-ERVKRLQMSYSEL 219 (400) Q Consensus 142 A~ed~d~vl~~fhV~n~~~~a~~i~l~-na~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~La-d~~k~l~g~ygal 219 (400) .+.+++++....+|.+.+...... +. +...+..+.-|++.++...+|...++.+..+|.+..+. ++..+ +..++ T Consensus 156 ~~~~~~~l~~~~~v~~~~~~~~p~-~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~i~iS~ell~D---s~~~l 231 (402) T protein:vir:93 156 EPFAKNQLREKARLTNIKGLEIPR-VSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHG---SDVDL 231 (402) T ss_pred hHHhhhhhhhhceeeecCCceeee-eeccCCccccccccccccccccccceeeecceeeeeechhhHHHHhh---hHHHH Confidence 999999998877777766543321 12 22234456778888999999999999998888775552 33333 33467 Q ss_pred HHHHHHHHHHHHHHHHH-hcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceecc-cccceEEE Q lcl|Aclame:pro 220 YNLIVAELTQAIVNKIV-DLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRP-TAGRRYLI 297 (400) Q Consensus 220 vnyvm~ELaq~fI~Rav-~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~-~~~~~~l~ 297 (400) .+||+++|+..|. ++. +.++..|+|+....- +.. +...+.++++...|+|..++--..| -...-..+ T Consensus 232 ~~~i~~~la~~~~-~~e~~~~~~~g~g~g~p~g------~~~----~~~~~~~~~~~~~d~l~~~~~~l~~~y~~na~~i 300 (402) T protein:vir:93 232 VNWVENALQSGLA-AKERKDALAVSPKSGLEHM------SFY----NGSVKEVEGADMYDAIINALADLHEDYRDNATIY 300 (402) T ss_pred HHHHHHHHHHHHH-HHHHHhHhhcCCCccccce------eee----ccccccccccchHHHHHHHHhccChhhhcCCEEE Confidence 9999999999998 454 456666777642110 010 0112223345556888887732322 22334678 Q ss_pred EecchhHHHHhhhhhccccccceeecCCcceeehhhccccceecch-----hhhchhhhhcccc-eechhhceeccceee Q lcl|Aclame:pro 298 VKAEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGS-----KALKPTVLVDQKY-HIDMQDLTKVDAFEW 371 (400) Q Consensus 298 ~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~tg~-----k~l~~t~~vd~k~-~~~~~~~~~~~~~~~ 371 (400) +++.+..+++.-+ +|++|.+.. |...+++ |. ..+.|-++-|=++ .+..++....-.+.- T Consensus 301 mn~~t~~~~~~~~-----------~d~~~~~~~--~~~~~ll--G~PV~~t~~~~~i~~GDf~~~~~~~~~~~~~~~~~~ 365 (402) T protein:vir:93 301 MRYADYVKIISVL-----------SNGTTNFFD--TPAEKVF--GKPVVFTDAAVKPIVGDFNYFGINYDGTTYDTDKDV 365 (402) T ss_pred EechHHHHHHHHH-----------hcCCCcccc--cCCcccc--ccceEEecCCCceeeechhhhhhhhhhhhhhhhhcc Confidence 8888888877733 222332221 2222222 21 1122222223221 111111111000111 Q ss_pred eecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 372 KTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 372 ~~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) .+.+-.+++..--.|-| .|..|+..+. T Consensus 366 ~~~~~~~~~~~r~Dg~v--~~~~A~~~l~ 392 (402) T protein:vir:93 366 KKGEYLFVLTAWYDQQR--TLDSAFRIAK 392 (402) T ss_pred cCCceEEEEEEEeCcEE--echhheEEEE Confidence 11111122222222222 2333332221 No 67 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=99.01 E-value=2.8e-10 Score=72.93 Aligned_cols=365 Identities=15% Similarity=0.130 Sum_probs=149.5 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhh--hhhh---hhHHHHHHHHHHHHHHHHHHHH------HHHHhhH-- Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKN--AIED---LPKVQELEKTLSENSIEIIKIE------NELNAQE-- 74 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~--~~~~---~skieElektis~l~aEi~K~e------nEl~~~k-- 74 (400) |.| .++|--..+.+|++....+..+++++-.+. +..+ -.+.++.+..+.++.++|.+.+ .+++... T Consensus 1 ~~k-~~eem~~~i~eL~e~r~~l~~e~~~l~d~ak~e~~~~~~~~e~~e~~a~~~el~~ei~~le~~~~~~~~~~~~~~~ 79 (477) T protein:vir:84 1 MEK-HLEELRALRAAAVEAVATLKAERQAIADGAKAEERAALSADETAEFRAKSASIKAELDKVEDLDEQIRELESEIER 79 (477) T ss_pred Cch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 333 234444445566666666666665543211 1100 1122223333333333332111 1111000 Q ss_pred -------hhh-------------cchhHHHHHHHHH----------HHHHHHHHHHHHccCChhHHHHHHHHHHhCccch Q lcl|Aclame:pro 75 -------EKP-------------KGKDKMTNFIESQ----------NAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTI 124 (400) Q Consensus 75 -------Ek~-------------K~k~emtEfLkTk----------qA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~ 124 (400) +++ -......+|++.. .+.....+.+....+..+...........+...+ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (477) T protein:vir:84 80 SGKLEAETKTVRKATVEVNEALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYRDLDR 159 (477) T ss_pred hhcchhhhhhhcccccccccchhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhccccc Confidence 000 0000111222111 1111111111111111122211111111222222 Q ss_pred hhhH--hhcchh-HHHHHHHHHHhhCccccceeeeccccee--eEEee-cc-ccccceecccc-----hhhhhhhhhhhh Q lcl|Aclame:pro 125 TDTT--FQLPRK-LVESINTALLNTNPVFKVFHVTNVGALL--VSRSF-DS-ANEAQVHKDGQ-----TKTEQAATLTID 192 (400) Q Consensus 125 qd~~--eiLP~~-iI~AIe~A~ed~d~vl~~fhV~n~~~~a--~~i~l-~n-a~~a~GHk~ga-----~Kk~q~~~le~~ 192 (400) .+.. .++|.. +..-|-+.++++..+.+.+.+-..+... +.+-. ++ ...++.+--|+ .|.+...+|... T Consensus 160 ~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~f~~i 239 (477) T protein:vir:84 160 NGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLTDGFV 239 (477) T ss_pred cCCCcceeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccCcccccccccccccceeeE Confidence 2221 134443 3455666777666666644544333332 22221 22 34555555554 456677889999 Q ss_pred hccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccC Q lcl|Aclame:pro 193 TLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAG 272 (400) Q Consensus 193 ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~ 272 (400) ++.|..++-+..+-+.+-+-.+ -++.+|+.++|+.++- +.++.+++.|||+... .--+... ..+..++. +..+ T Consensus 240 ~~~~~k~~~~~~iS~ell~ds~--~~l~~~i~~~l~~~~~-~~~d~~~l~G~Gt~~~-p~Gi~~~-~~~~~~~~--~~~~ 312 (477) T protein:vir:84 240 QANVKTIAGQQGIAIQLLDQAA--VSVDEFVFRDLAADYA-NKLNVQVISGTGSNNQ-VVGVRAT-AGITQVTA--TSAG 312 (477) T ss_pred EEeeeeEEeeeHHHHHHHhccc--hhHHHHHHHHHHHHHH-HHHHHHHhccCCCCCc-cceeeec-cccccccc--cccc Confidence 9999888777666333222222 3578999999999999 7999999999997431 0000000 00111111 1111 Q ss_pred CC-C----CHHHHHHhhceecccc--cceEEEEecchhHHHHhhhhhccccccceeecCCcceeehhh---ccccceec- Q lcl|Aclame:pro 273 KT-P----FADAIEEAVDFVRPTA--GRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEVG---VDEIIVYT- 341 (400) Q Consensus 273 ~T-~----~~dal~Eald~~~~~~--~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~---v~~~~~~t- 341 (400) .| . +.+.+..++.-+.+.. .-..+++|+.+.++|+. |+|.+|.+.+..+ ++...+.+ T Consensus 313 ~t~~~~~~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~------------lkd~~G~~l~~~~~~~~~~~~~~~~ 380 (477) T protein:vir:84 313 SALEKHQIIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHA------------IFAGDDRPLIVPSGPGFNNLGVLTE 380 (477) T ss_pred cchhhHHHHHHHHHHHHhhccccccCCccEEEEcHHHHHHHHH------------hhccCCCeeeecCcccccccccccc Confidence 11 1 1223333333222222 22468889999999888 8999998876322 11111111 Q ss_pred -------chhhhchhhhhcc-------------cceechhhceeccceeeeecCceEEEEeccc-----ccc-------- Q lcl|Aclame:pro 342 -------GSKALKPTVLVDQ-------------KYHIDMQDLTKVDAFEWKTNSNMILVETLTS-----GHV-------- 388 (400) Q Consensus 342 -------g~k~l~~t~~vd~-------------k~~~~~~~~~~~~~~~~~~~~~~ilve~~~~-----~~~-------- 388 (400) |.=.=+|.+..+. -+--|+++|+- ..+|+. ++.++. +++ T Consensus 381 ~~~~~~~~~l~G~pVv~s~~~p~~~~~~~d~~~i~~gd~~~~~i-------~~~~~~-~~~~~~~~~~~~~~~~~v~~~~ 452 (477) T protein:vir:84 381 VASQRVVGQMHGLPVVTDPTLPTTLGTGTDQDVIHVLRASDLAL-------FESSVR-MRALQETRAENLSVLLQVYGYL 452 (477) T ss_pred cccccccchhcccceEecCcccccccccCCcceEEEEEeceEEE-------Eeecee-EEeccccccccceeeeeehhhh Confidence 0000112221111 01122222211 122221 222221 111 Q ss_pred ---eeeccceeEeeC Q lcl|Aclame:pro 389 ---ETYNAGAVITVS 400 (400) Q Consensus 389 ---~~~~~~~~~~~~ 400 (400) -.....+++.++ T Consensus 453 ~~~~~r~~~afv~~t 467 (477) T protein:vir:84 453 AFTAARFPQSVVEIG 467 (477) T ss_pred hhhhhccccceEEee Confidence 001234444444 No 68 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=99.00 E-value=3.9e-12 Score=83.14 Aligned_cols=281 Identities=14% Similarity=0.116 Sum_probs=149.7 Q ss_pred hHhhhcchhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccc-hhhhHhhcchhHHHHHHHHHHhhCcccc Q lcl|Aclame:pro 73 QEEKPKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVT-ITDTTFQLPRKLVESINTALLNTNPVFK 151 (400) Q Consensus 73 ~kEk~K~k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~-~qd~~eiLP~~iI~AIe~A~ed~d~vl~ 151 (400) .+..+|.+-+...|.....+.. - +..-.+. .++..-.+|..+..-|-..+.++.++++ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~----------~-----------~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~ 59 (324) T protein:vir:10 1 MEQTQKLKLNLQHFASNNVKPQ----------V-----------FNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQ 59 (324) T ss_pred CCCchHHHHHHHHHHHHhhccc----------e-----------ecccceeccCCCcceechhHHHHHHHHHHhhchhhh Confidence 1111111111111111110000 0 0001111 1122226999999988888998999998 Q ss_pred ceeeecccceeeEEeecc-ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHH Q lcl|Aclame:pro 152 VFHVTNVGALLVSRSFDS-ANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQA 230 (400) Q Consensus 152 ~fhV~n~~~~a~~i~l~n-a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~ 230 (400) .+++.+.+.....+--.+ ...+.-+.-|.++.+...+|...++.|..++...++.+-+.+ .+.-++.+|+.++|+++ T Consensus 60 ~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~l~~a 137 (324) T protein:vir:10 60 LGKYEPMEGTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLN--YTYSQFFEEMKPMIAEA 137 (324) T ss_pred hcceeeccCCceEEEEEeCCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHh--cchHHHHHHHHHHHHHH Confidence 888777665544443332 344555667888999999999999999988888777443332 23357899999999999 Q ss_pred HHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceeccc-ccceEEEEecchhHHHHhh Q lcl|Aclame:pro 231 IVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPT-AGRRYLIVKAEDRKALLDE 309 (400) Q Consensus 231 fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~-~~~~~l~~~~~d~~a~~~~ 309 (400) +- |+++.+++.|||.+..-. ... .. +....+..+.+...+++..++.-..+. .....+++++.++..|+. T Consensus 138 i~-~~~d~a~l~G~g~~~~~~--~i~--~~---~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~- 208 (324) T protein:vir:10 138 FY-KKFDEAGILNQGNNPFGK--SIA--QS---IEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRK- 208 (324) T ss_pred HH-HHHHHHhhhcCCCCccCc--ccc--cc---ccccceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHH- Confidence 99 799999999999863111 011 11 111122334566678888887444443 344478999999998876 Q ss_pred hhhccccccceeecCCcceeehhhccccceecchhhh-chhhhhcccc--eechhhceecccee--e------------- Q lcl|Aclame:pro 310 LRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKAL-KPTVLVDQKY--HIDMQDLTKVDAFE--W------------- 371 (400) Q Consensus 310 ~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~tg~k~l-~~t~~vd~k~--~~~~~~~~~~~~~~--~------------- 371 (400) ++|.+|.+.+.-+.+.+.. |-... .|+.-++... -.|++.+.-....+ + T Consensus 209 -----------l~d~~g~~~~~~~~~~~l~--G~PV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~ 275 (324) T protein:vir:10 209 -----------IVDPETKERIYDRNSDTLD--GLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKN 275 (324) T ss_pred -----------hhccCCceeecCCCCcccc--ceeEEeecCCCCCcceEEEEecccEEEEEecCcEEEEeeccccccccc Confidence 7788887776554444332 32110 0111111111 11333332111111 0 Q ss_pred -------eecCceEEE--EecccccceeeccceeEeeC Q lcl|Aclame:pro 372 -------KTNSNMILV--ETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 372 -------~~~~~~ilv--e~~~~~~~~~~~~~~~~~~~ 400 (400) -|.+|++.+ +.--.+.| .+..|+..+. T Consensus 276 ~~~~~~~~~~~~~~~~r~~~r~d~~v--~~~~A~~~l~ 311 (324) T protein:vir:10 276 EDGTPVNLFEQDMVALRATMHVALHI--ADDKAFAKLV 311 (324) T ss_pred ccccchhhhhcCcEEEEEEEEEccEE--ecccceEEEE Confidence 123344333 22222222 2333333333 No 69 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=98.99 E-value=6.5e-11 Score=76.42 Aligned_cols=337 Identities=16% Similarity=0.190 Sum_probs=153.6 Q ss_pred CcccccccchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHH----HHHHHHHHHHHHHHHhhHhh Q lcl|Aclame:pro 1 MRISKRNMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTL----SENSIEIIKIENELNAQEEK 76 (400) Q Consensus 1 ~~~s~~~~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElekti----s~l~aEi~K~enEl~~~kEk 76 (400) |-|+..+ +.++++.+.++...++.- . ..++.++.. ..++.++.+ .. T Consensus 1 M~i~~k~------------~~~~~~~~~~l~~~~~~~---~------~~ee~~~~~~~~~~~~~~~~~~-------~~-- 50 (377) T protein:vir:98 1 MAINLKE------------LPKYREAVAELSAKISAG---A------TSEEQEKLFEAAFTTMGDEILA-------KN-- 50 (377) T ss_pred CCCcHHH------------HHHHHHHHHHHHHHHHhh---h------hhHHHHHHHHHHHHhHHHHHHH-------HH-- Confidence 5553322 222222222222211110 0 001111111 111222211 00 Q ss_pred hcchhHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeee Q lcl|Aclame:pro 77 PKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVT 156 (400) Q Consensus 77 ~K~k~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~ 156 (400) ..++.+.+..+ ......+.+-++.|.+-+...+. .|....+|..++..|-..+.+++++++...|. T Consensus 51 ---~~e~~~~~~~~---------~~~~~lt~ee~~~~~~~~~~~~~--~~gg~~vP~~~~~~I~~~l~~~s~i~~~~~v~ 116 (377) T protein:vir:98 51 ---EEEMERMFDLR---------DKNRELTAEEIKFFNDIDKNVGG--KDKFKLLPEETMVQVFDDLVAEHPLLKVINFK 116 (377) T ss_pred ---HHHHHHHHHhc---------cCCcccCHHHHHHHHHHHhccCC--CCCccccCHHHHHHHHHHHHHhhhhhhheeeE Confidence 01111111000 01122234556666665544433 44455899999999999999999999988777 Q ss_pred cccceeeEEeecc--ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 157 NVGALLVSRSFDS--ANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNK 234 (400) Q Consensus 157 n~~~~a~~i~l~n--a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~R 234 (400) +..... .+-..+ ....||--.+..++.....|...++.+..+|-+-.+..-+ |+-++-++-.|+.++|+..|- + T Consensus 117 ~~~~~~-~~~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~a~~~is~el--L~ds~~~ie~~i~~~la~~~a-~ 192 (377) T protein:vir:98 117 NTSLRL-KALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDA--LKFGPKWIKQFITEQLKEAIA-V 192 (377) T ss_pred ecCcce-EEEEecCCcceeEeecccccCcccCccceeEeecceeEEeeecccHHh--hhccHhHHHHHHHHHHHHHHH-H Confidence 776442 333322 3445655445555566778999999998888775553322 444456789999999999999 7 Q ss_pred HHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCC-CHHHHHHhhceecccccceEEEEecchhHHHHhhhhhc Q lcl|Aclame:pro 235 IVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTP-FADAIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQA 313 (400) Q Consensus 235 av~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~-~~dal~Eald~~~~~~~~~~l~~~~~d~~a~~~~~~~~ 313 (400) +.+.|+++|||+.. ..-+..++.....-.+++...+..+ +.+++. .+.|+-|. ..++...+-+... T Consensus 193 ~~~~a~i~G~G~~q--P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~-~l~~~~~~----------~~~~~a~~~m~~~ 259 (377) T protein:vir:98 193 ALELAIVKGDGLLQ--PVGLLKDLSQPTVDQSTGRDITTYKTDKEAIA-DLSDLTPD----------NAPKKLVPVMKHL 259 (377) T ss_pred HHhhceEeccCCCc--ceeeeecccccccccccccccccccchhhhHh-hhhhhchh----------HHHHHHHHHHHHH Confidence 99999999999742 1111121110000011111111111 122322 22333332 2233333434444 Q ss_pred cccccceeecCCcceeehhhccc------------------cceecchhh-hchhhhhccc--ceechhhceeccceeee Q lcl|Aclame:pro 314 TANANVRIKNDDTEIASEVGVDE------------------IIVYTGSKA-LKPTVLVDQK--YHIDMQDLTKVDAFEWK 372 (400) Q Consensus 314 ~~~an~~lk~~d~~~~~~v~v~~------------------~~~~tg~k~-l~~t~~vd~k--~~~~~~~~~~~~~~~~~ 372 (400) +.++=.++||.+|++...+..+. +++ |..- ++.+..+... ..-|.+.|.-++..++. T Consensus 260 t~~~~~klkd~~G~~i~~~n~~~~~~~~p~~~~~~~~G~~~t~l--g~p~~vv~s~~~p~~~i~fgdf~~Y~i~~r~~~~ 337 (377) T protein:vir:98 260 SVNDKKRPLKIAGQVKLILNPEDRWALEAQFTSRNQFGEYVTVL--PHGITILESLAVETGKAIAFVANRYDAFMATAST 337 (377) T ss_pred HHHHHhhhhccCCceEEEecccchhhccccccccCCCCcccccc--CCCceEEecCCCCcccEEEEEecceeEEeecceE Confidence 44555568888888876322110 111 1100 1111112111 22344555544443333 Q ss_pred ecC--------ceEEEEec--ccccceeeccceeEeeC Q lcl|Aclame:pro 373 TNS--------NMILVETL--TSGHVETYNAGAVITVS 400 (400) Q Consensus 373 ~~~--------~~ilve~~--~~~~~~~~~~~~~~~~~ 400 (400) +.. +.+++=.. -.|.+-.=+|=.|++++ T Consensus 338 i~~~~~~~~~~d~~~f~~~~r~dg~~~~~~a~~vl~i~ 375 (377) T protein:vir:98 338 IEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLA 375 (377) T ss_pred EEeechhhhhcCceEEEEEEEEcCEEeccCcEEEEEEe Confidence 221 22222111 11222112222355555 No 70 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=98.96 E-value=6.2e-12 Score=82.03 Aligned_cols=275 Identities=13% Similarity=0.095 Sum_probs=150.8 Q ss_pred HHHccCChhHHHHHHHHHHhC------ccc-hhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEeecc-cc Q lcl|Aclame:pro 100 LKKNSGKSEIKNAWSAKLAEN------GVT-ITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-AN 171 (400) Q Consensus 100 l~~nqg~ke~k~AW~a~L~ek------gV~-~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~ 171 (400) |++.+-.+.--+.|...+... .+. .++..-.+|..+..-|-..++++.++++.+++...+.....+--.+ .. T Consensus 1 ~~k~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~ 80 (324) T protein:vir:99 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEPMEGTEKKFTFWADKP 80 (324) T ss_pred CCCchHhhHHHHHHHHHhhhhhhccccceeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCc Confidence 111111111111122222111 111 1122226899998888888898999988777776665544443333 33 Q ss_pred ccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCcccc Q lcl|Aclame:pro 172 EAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKS 251 (400) Q Consensus 172 ~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~ 251 (400) .+.-..-|.++.+...+|...++.|..++...++.+-+.+ .+..++.+|+.++|++++. |+++.+++.|||.+..- T Consensus 81 ~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~l~~ai~-~~~d~~~l~G~g~~~~~- 156 (324) T protein:vir:99 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLN--YTYSQFFEEMKPMIAEAFY-KKFDEAGILNQGNNPFG- 156 (324) T ss_pred ceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHh--cchHHHHHHHHHHHHHHHH-HHHHHHhhhcCCCCccC- Confidence 4555567888999999999999999988888777443322 2235789999999999999 79999999999985311 Q ss_pred chhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceeccc-ccceEEEEecchhHHHHhhhhhccccccceeecCCcceee Q lcl|Aclame:pro 252 IDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPT-AGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIAS 330 (400) Q Consensus 252 ~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~-~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~ 330 (400) .+-...+ ....+..+.+...+++..++.-..+. .....+++++.++..|+. ++|.+|++.+ T Consensus 157 ---~~~~~~~---~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~------------l~d~~g~~~~ 218 (324) T protein:vir:99 157 ---KSIAQSI---EKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRK------------IVDPETKERI 218 (324) T ss_pred ---ccccccc---cccceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHH------------hhcCCCceee Confidence 1111111 11223344566778888887444443 234478999999998876 7888888777 Q ss_pred hhhccccceecchhhh-chhhhhcccce--echhhceeccceee----------------------eecCceE--EEEec Q lcl|Aclame:pro 331 EVGVDEIIVYTGSKAL-KPTVLVDQKYH--IDMQDLTKVDAFEW----------------------KTNSNMI--LVETL 383 (400) Q Consensus 331 ~v~v~~~~~~tg~k~l-~~t~~vd~k~~--~~~~~~~~~~~~~~----------------------~~~~~~i--lve~~ 383 (400) .-+.+.+.. |-... .|+.-++.... .|++.+.-....+. -|.+|++ .++.- T Consensus 219 ~~~~~~~l~--G~PVv~~~~~~~~~~~~i~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r 296 (324) T protein:vir:99 219 YDRNSDTLD--GLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMH 296 (324) T ss_pred cCCCCcccc--ceeEEeecCCCCCcceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEE Confidence 655544332 32110 01111111111 13333221111110 0233333 33333 Q ss_pred ccccceeeccceeEeeC Q lcl|Aclame:pro 384 TSGHVETYNAGAVITVS 400 (400) Q Consensus 384 ~~~~~~~~~~~~~~~~~ 400 (400) -.+.|. |..|+..+. T Consensus 297 ~d~~v~--~~~a~~~lt 311 (324) T protein:vir:99 297 VALHIA--DDKAFAKLV 311 (324) T ss_pred EccEEe--cccceEEEE Confidence 333332 333333222 No 71 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=98.96 E-value=7.6e-12 Score=81.54 Aligned_cols=277 Identities=13% Similarity=0.087 Sum_probs=153.9 Q ss_pred HHHccCChhHHHHHHHHHH------hCccc-hhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEeecc-cc Q lcl|Aclame:pro 100 LKKNSGKSEIKNAWSAKLA------ENGVT-ITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-AN 171 (400) Q Consensus 100 l~~nqg~ke~k~AW~a~L~------ekgV~-~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~ 171 (400) |..++-.+.-.+.|...+. ..++. .++.-..+|..+..-|-..++++.++++.+++.+.+.....+--.+ .. T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ip~~~~~~ 80 (324) T protein:vir:97 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) T ss_pred CccchhHHHHHHHHHHhhhhhhhhccccccccCCCcceechhHHHHHHHHHHhhcchhhhcceeeccCCceEEEEEecCc Confidence 3333332211122221111 11221 1222337899998888888898899888777777665544443333 33 Q ss_pred ccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCcccc Q lcl|Aclame:pro 172 EAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKS 251 (400) Q Consensus 172 ~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~ 251 (400) .+.-+.-|.++.+...+|...++.|..++....+.+-+.+ .+.-++.+|+.++|++++- |.++.+++.|||.+.. T Consensus 81 ~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~--ds~~~l~~~i~~~l~~aia-~~~d~a~l~G~g~~~~-- 155 (324) T protein:vir:97 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLN--YTYSQFFEEMKPMIAEAFY-KKFDEAGILNQGNNPF-- 155 (324) T ss_pred ceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHh--cchHHHHHHHHHHHHHHHH-HHHHHHhhccCCCCcc-- Confidence 4545567788999999999999999988888777433222 2235779999999999999 7999999999997531 Q ss_pred chhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhc-eecccccceEEEEecchhHHHHhhhhhccccccceeecCCcceee Q lcl|Aclame:pro 252 IDKEADVKKIKKITTKAKSAGKTPFADAIEEAVD-FVRPTAGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIAS 330 (400) Q Consensus 252 ~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald-~~~~~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~ 330 (400) ....... + .......+.+...+++..+.. +........++++++.++..|+. ++|.+|++.+ T Consensus 156 ~~gi~~~--~---~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~------------lkd~~g~~~~ 218 (324) T protein:vir:97 156 GKSIAQS--I---EKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRK------------IVDPETKERI 218 (324) T ss_pred Ccccccc--c---cccceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHH------------hhcCCCceee Confidence 1111110 1 111123344556778887763 32223344578999999998877 8888888877 Q ss_pred hhhccccceecchhhhc-hhhhhcc--cceechhhceecccee------------------------eeecCceEEEEec Q lcl|Aclame:pro 331 EVGVDEIIVYTGSKALK-PTVLVDQ--KYHIDMQDLTKVDAFE------------------------WKTNSNMILVETL 383 (400) Q Consensus 331 ~v~v~~~~~~tg~k~l~-~t~~vd~--k~~~~~~~~~~~~~~~------------------------~~~~~~~ilve~~ 383 (400) .-+.+.+.. |..... |.+-++. -+-.|++.+.-.+..+ |..++-.+.++.- T Consensus 219 ~~~~~~tl~--G~PV~~~~~~~~~~~~~~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r 296 (324) T protein:vir:97 219 YDRNSDTLD--GLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMH 296 (324) T ss_pred cCCCCcccc--ceeeEeecCCCCCcceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEE Confidence 655555443 432110 1111111 1122333332111111 2223333334444 Q ss_pred ccccceeeccceeEeeC Q lcl|Aclame:pro 384 TSGHVETYNAGAVITVS 400 (400) Q Consensus 384 ~~~~~~~~~~~~~~~~~ 400 (400) -.+.+-.-+|=++|+.- T Consensus 297 ~d~~v~~~~a~~~l~~~ 313 (324) T protein:vir:97 297 VALHIADDKAFAKLVPA 313 (324) T ss_pred eccEEecccceEEEEec Confidence 44444433333333332 No 72 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=98.94 E-value=9.7e-12 Score=80.96 Aligned_cols=276 Identities=13% Similarity=0.118 Sum_probs=149.5 Q ss_pred HHHccCCh-hHHHHHHHHHHhC------ccchhh-hHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEeecc-c Q lcl|Aclame:pro 100 LKKNSGKS-EIKNAWSAKLAEN------GVTITD-TTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-A 170 (400) Q Consensus 100 l~~nqg~k-e~k~AW~a~L~ek------gV~~qd-~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a 170 (400) |++.+-.+ +++ .|...+.+. .+...+ -.-.+|..+..-|-+.+.++.++++...+.+.+.....+--.+ . T Consensus 1 ~~~~~~~~~~~~-~f~~~~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~ 79 (324) T protein:vir:93 1 MEQTQKLKLNLQ-HFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADK 79 (324) T ss_pred CchhHHHHHHHH-HHHHhhhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecC Confidence 22222222 222 222222211 111112 1226899999988888898899988777666655444443332 3 Q ss_pred cccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccc Q lcl|Aclame:pro 171 NEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFK 250 (400) Q Consensus 171 ~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~ 250 (400) ..+.-..-|.++.+...+|...++.|..++....+.+-+.+ .+.-++.+|++++|++++- |+++.+++.|||.+..- T Consensus 80 ~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~l~~aia-~~~d~a~l~G~g~~~~~ 156 (324) T protein:vir:93 80 PGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLN--YTYSQFFEEMKPMIAEAFY-KKFDEAGILNQGNNPFG 156 (324) T ss_pred cceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHh--cchHHHHHHHHHHHHHHHH-HHHHHHHhcCCCCCCcC Confidence 34444567788889999999999999998888777443333 2235779999999999999 79999999999975321 Q ss_pred cchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceecc-cccceEEEEecchhHHHHhhhhhccccccceeecCCccee Q lcl|Aclame:pro 251 SIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRP-TAGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIA 329 (400) Q Consensus 251 ~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~-~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~ 329 (400) . .... ....+.+....+...+++..++.-..+ -.....+++++.++.+|+. ++|.+|.+. T Consensus 157 ~--~~~~-----~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~------------l~d~~G~~~ 217 (324) T protein:vir:93 157 K--SIAQ-----SIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRK------------IVDPETKER 217 (324) T ss_pred c--cccc-----cccccceeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHH------------hhCCCCCee Confidence 1 1111 111122333456677888887743333 2344578999999999887 788888877 Q ss_pred ehhhccccceecchhhhc-hhhhhccc--ceechhhceeccce------------------------eeeecCceEEEEe Q lcl|Aclame:pro 330 SEVGVDEIIVYTGSKALK-PTVLVDQK--YHIDMQDLTKVDAF------------------------EWKTNSNMILVET 382 (400) Q Consensus 330 ~~v~v~~~~~~tg~k~l~-~t~~vd~k--~~~~~~~~~~~~~~------------------------~~~~~~~~ilve~ 382 (400) +.-+.+.+.. |-.-.+ +....+.. +..|.+.+.-.... -|..++--+.++. T Consensus 218 ~~~~~~~~l~--G~PVv~~~~~~~~~~~i~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~ 295 (324) T protein:vir:93 218 IYDRNSDSLD--GLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATM 295 (324) T ss_pred ecCCCCCccc--ceeeEeecCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEE Confidence 6544333322 211000 00000000 11122221100000 0233333333444 Q ss_pred cccccceeeccceeEeeC Q lcl|Aclame:pro 383 LTSGHVETYNAGAVITVS 400 (400) Q Consensus 383 ~~~~~~~~~~~~~~~~~~ 400 (400) --.+.|---+|=++|+-- T Consensus 296 r~d~~v~~~~a~~~l~~a 313 (324) T protein:vir:93 296 HVALHIADDKAFAKLVPA 313 (324) T ss_pred EeccEEecccceEEEecc Confidence 334443333333333322 No 73 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=98.89 E-value=3.2e-10 Score=72.67 Aligned_cols=312 Identities=15% Similarity=0.121 Sum_probs=139.0 Q ss_pred HHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHh-----------hhcchhHHHH Q lcl|Aclame:pro 17 QNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE-----------KPKGKDKMTN 85 (400) Q Consensus 17 q~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kE-----------k~K~k~emtE 85 (400) ++++.+++++ ++++++.++.+..++++.+.+..+..+ ..+..-...+ T Consensus 1 ~eei~~l~~~----------------------~~~l~~~~~~l~~~~d~~e~e~~~~~~~~~~~~~~~~~~~~~~~~~~~ 58 (352) T protein:vir:78 1 MEDIKQLETE----------------------KAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQSLNDNEKLVKAKAE 58 (352) T ss_pred ChhHHHHHHH----------------------HHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccchhhhHHHHHHH Confidence 2333333332 233333333333333332222221111 0000001122 Q ss_pred HHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeE- Q lcl|Aclame:pro 86 FIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVS- 164 (400) Q Consensus 86 fLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~- 164 (400) |++.......+...+.. .+.-...|.+ | ...+--.++|..+..-|-..+.++.++.+...|.+....... T Consensus 59 ~~r~~~~~~~~~~~~~~-------~~~~~~al~~-~-~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~~v~~~~~~~~p~ 129 (352) T protein:vir:78 59 FYRHAILPNEFEKPSME-------AQRLLHALPT-G-NDSGGDKLLPKTLSKEIVSEPFAKNQLREKARLTNIKGLEIPR 129 (352) T ss_pred HHHHHhhhhHHHHHHhh-------HHHHHHHhcc-C-CCCCCceeccHhHHHHHHHHHHhhcchhhheeeEecCCceEEE Confidence 33222222211111100 0001111111 1 122333479999888888889999999887777766544322 Q ss_pred EeeccccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHH-Hhcceeec Q lcl|Aclame:pro 165 RSFDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKI-VDLALVEG 243 (400) Q Consensus 165 i~l~na~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Ra-v~rAvv~g 243 (400) +...+....|. .-|+.+++...+|...++.|..++-...+.+.+ |+.+..++.+|++++|++.+. ++ ...++..| T Consensus 130 ~~~~~~~a~~v-~E~~~~~~~~~~f~~v~~~~~k~~~~i~is~el--l~Ds~~~l~~~i~~~la~~~~-~~e~~~~~~~g 205 (352) T protein:vir:78 130 VSYTLDDDDFI-TDVETAKELKLKGDTVKFTTNKFKVFAAISDTV--IHGSDVDLVNWVENALQSGLA-AKERKDALAVS 205 (352) T ss_pred EecCCCccccc-ccccccccccccceeeeecceeEEeechhhHHH--HhhhhHHHHHHHHHHHHHHHH-HHHHHhhhhcC Confidence 22233344454 346677888899999999987666655552222 233335789999999999998 45 34466777 Q ss_pred cCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceeccc-ccceEEEEecchhHHHHhhhhhccccccceee Q lcl|Aclame:pro 244 DGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPT-AGRRYLIVKAEDRKALLDELRQATANANVRIK 322 (400) Q Consensus 244 DG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~-~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk 322 (400) ||++... ..... ..+ +.++++...|+|..++.-..|. ...-..++++.+..+++. ++ T Consensus 206 ~g~~~~~--g~l~~-~~~-------~~~t~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~l~~------------~~ 263 (352) T protein:vir:78 206 PKSGLEH--MSFYN-GSV-------KEVEGANMYDAIINALADLHEDYRDNATIYMRYADYVKIIS------------VL 263 (352) T ss_pred CCCcccc--cceec-ccc-------ccccccchHHHHHHHHhccChhhhcCCEEEEehHHHHHHHH------------HH Confidence 7764311 11111 111 1233444578888777433332 234557788888888776 32 Q ss_pred cCCcceeehhhccccceecchhhhchhhhhcccce---echhhceeccceeeeecCceEEEEecc---cccce----e-- Q lcl|Aclame:pro 323 NDDTEIASEVGVDEIIVYTGSKALKPTVLVDQKYH---IDMQDLTKVDAFEWKTNSNMILVETLT---SGHVE----T-- 390 (400) Q Consensus 323 ~~d~~~~~~v~v~~~~~~tg~k~l~~t~~vd~k~~---~~~~~~~~~~~~~~~~~~~~ilve~~~---~~~~~----~-- 390 (400) +++|.+.. .|...+++ |- |-+..|.... =|++.|.. --++ ++++.++ .|.+- . T Consensus 264 ~~~~~~~~-~~~~~~ll--G~----PV~~~~~~~~~~~Gdf~~~~~-------~~~~-~~~~~~~~~~~g~~~f~~~~r~ 328 (352) T protein:vir:78 264 SNGTTNFF-DTPAEKVF--GK----PVVFTDAAVKPIVGDFNYFGI-------NYDG-TTYDTDKDVKKGEYLFVLTAWY 328 (352) T ss_pred hccCCccc-ccCCcccc--cc----ceEEecCCCceeEeehhhhhh-------hhhh-heeeeeccccCCeeEEEEEeee Confidence 32332222 12222222 31 2222221111 12222211 1111 1222221 12111 1 Q ss_pred ----eccceeEeeC Q lcl|Aclame:pro 391 ----YNAGAVITVS 400 (400) Q Consensus 391 ----~~~~~~~~~~ 400 (400) ++..|+..+. T Consensus 329 Dg~~~~~eA~~~l~ 342 (352) T protein:vir:78 329 DQQRTLDSAFRIAK 342 (352) T ss_pred CceeechhheEEEE Confidence 2333333332 No 74 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=98.88 E-value=9.3e-10 Score=70.10 Aligned_cols=339 Identities=18% Similarity=0.194 Sum_probs=157.8 Q ss_pred CcccccccchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHhhhcch Q lcl|Aclame:pro 1 MRISKRNMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGK 80 (400) Q Consensus 1 ~~~s~~~~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kEk~K~k 80 (400) |.|+.-++ +++.++ .+++.+++... .++.++ .+.++.....++.++.++ .++ T Consensus 1 M~i~~~~~--~~~~e~---~~~l~~~~~~~---------~~~e~~---~~~~~~~~~~~~~~~~~~-------~~~---- 52 (377) T protein:vir:96 1 MAINLKEL--PKYREA---VAELSAKISAG---------ATPEEQ---EKLFEAAFTTMGDEILAK-------NEE---- 52 (377) T ss_pred CCccHHHH--HHHHHH---HHHHHHHHhhc---------ccHHHH---HHHHHHHHHHHHHHHHHH-------HHH---- Confidence 76644331 223222 22222222211 001111 111122223333333221 110 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccc Q lcl|Aclame:pro 81 DKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGA 160 (400) Q Consensus 81 ~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~ 160 (400) ++.+++-.+ ......+.+-++...+-+...+. .+--.++|..++..|-+.+.+++++++...|.+.+. T Consensus 53 -e~~~~~~~~---------~~~~~lt~ee~~~~~~~~~~~~~--~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~ 120 (377) T protein:vir:96 53 -EMERMFDLR---------DKNRELTAEEIKFFNDIDKNVGG--KDKFKLLPEETMVQVFDDLVAEHPLLKVINFKNTSL 120 (377) T ss_pred -HHHHHHHhc---------cCCcccCHHHHHHHHHHHhcCCC--CCCceecCHHHHHHHHHHHHhhhhhhhhceeEecCC Confidence 111111000 00111122333444443332222 333448999999999999999999999888877764 Q ss_pred eeeEEee-cc-ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 161 LLVSRSF-DS-ANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDL 238 (400) Q Consensus 161 ~a~~i~l-~n-a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~r 238 (400) .. .+-. .+ ....||--.+.-+.....+|...++.+..+|.+..+..-+ |+.++-++..|+.++|+..|- ++.+. T Consensus 121 ~~-~i~~~~~~~~a~wv~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~~l--l~ds~~~le~~i~~~l~~~~~-~~~~~ 196 (377) T protein:vir:96 121 RL-KALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDA--LKFGPKWLKQFITEQLKEAIA-VALEL 196 (377) T ss_pred ce-EEEEecCCcceeEeecccccccccCccceeEeeeeeeEEeechhhHHH--hhcchhhHHHHHHHHHHHHHH-HHHhh Confidence 42 2322 22 3455665444556666789999999998877775553333 444557789999999999999 79999 Q ss_pred ceeeccCCCccccchhhhhhhhhhhhhhhh----------hccCCC--CCHHHHH-------Hhh---ceecccc--cce Q lcl|Aclame:pro 239 ALVEGDGTNGFKSIDKEADVKKIKKITTKA----------KSAGKT--PFADAIE-------EAV---DFVRPTA--GRR 294 (400) Q Consensus 239 Avv~gDG~~~t~~~~~e~D~~~ik~it~~a----------t~~~~T--~~~dal~-------Eal---d~~~~~~--~~~ 294 (400) |+++|||... ..-+..+......-.+.+ +..++. .+.+.+. .++ +..+|.. +.- T Consensus 197 a~i~G~G~~~--P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a 274 (377) T protein:vir:96 197 AIVKGNGLLQ--PVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQV 274 (377) T ss_pred ceEeccCCCc--ceeeeeccccccccccccccccceeeccccccccccCChhHHHHHHHHHHHhhccccccccccccCce Confidence 9999999742 111111110000000000 000110 1222222 222 1223332 233 Q ss_pred EEEEecchhHHHHhhhhhccccccceeecCCcceeehhhccccceecchhhhchhhhhcccceechhhceeccceeeeec Q lcl|Aclame:pro 295 YLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTN 374 (400) Q Consensus 295 ~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~tg~k~l~~t~~vd~k~~~~~~~~~~~~~~~~~~~ 374 (400) +.++++.+...++ .+....+.+|.++.-.|.+-..+- +.+ +|. +.-.--|.+.|+-++..++.+. T Consensus 275 ~~~mn~~t~~~~~---------~~~~~~~~~G~~~~~l~~p~~v~~--s~~-~p~---~~i~fgdf~~Y~i~~r~~~~i~ 339 (377) T protein:vir:96 275 KLLLNPEDRWTLE---------AKFTSRNQFGEYVTVLPHGITILE--SLA-VET---GKAIAFVANRYDAFMATASTIE 339 (377) T ss_pred EEEEchhhHHhcc---------ccccccCCCCCceeccCCCceEEe--cCC-CCc---ccEEEEEcCcEEEEEecccEEE Confidence 5667777655442 334466678888876665533331 222 232 1123345555555544333321 Q ss_pred --------CceEEE--EecccccceeeccceeEeeC Q lcl|Aclame:pro 375 --------SNMILV--ETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 375 --------~~~ilv--e~~~~~~~~~~~~~~~~~~~ 400 (400) .+++++ =.--.|.+-.=++=.|++++ T Consensus 340 ~~~~~~~~~d~~~f~~~~r~dG~~~d~~a~~vl~l~ 375 (377) T protein:vir:96 340 EYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLA 375 (377) T ss_pred eehhhhhhcCCeEEEEEEEEcCEEecCCcEEEEEEe Confidence 222211 11112222222333355566 No 75 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=98.87 E-value=2.9e-11 Score=78.36 Aligned_cols=277 Identities=13% Similarity=0.093 Sum_probs=148.7 Q ss_pred HHHccCChhHHHHHHHHHHh------Cccc-hhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEeecc-cc Q lcl|Aclame:pro 100 LKKNSGKSEIKNAWSAKLAE------NGVT-ITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-AN 171 (400) Q Consensus 100 l~~nqg~ke~k~AW~a~L~e------kgV~-~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~ 171 (400) |+..+-.+.-.+.|..++.. .++. ..+...++|..+...|-+.++++.++++-+.+.+.+.....+--.+ .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~ 80 (324) T protein:vir:78 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) T ss_pred CCcchhhhHHHHHHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCc Confidence 22222222112222222211 1221 2223337999998888888898898888776666655444433332 23 Q ss_pred ccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCcccc Q lcl|Aclame:pro 172 EAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKS 251 (400) Q Consensus 172 ~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~ 251 (400) .+.-+--|.++.+...+|...++.|..++....+.+-+-+ .+.-++.+|+.++|++++. |+++.+++.|+|.+.... T Consensus 81 ~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~--ds~~~l~~~i~~~la~ai~-~~~d~a~l~G~g~~~~~~ 157 (324) T protein:vir:78 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLN--YTYSQFFEEMKPMIAEAFY-KKFDEAGILNQGNNPFGK 157 (324) T ss_pred ceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHh--cchHHHHHHHHHHHHHHHH-HHHHHHHhccCCCCCcCc Confidence 4444566888999999999999999888877777433222 2235789999999999999 899999999999754211 Q ss_pred chhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhc-eecccccceEEEEecchhHHHHhhhhhccccccceeecCCcceee Q lcl|Aclame:pro 252 IDKEADVKKIKKITTKAKSAGKTPFADAIEEAVD-FVRPTAGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIAS 330 (400) Q Consensus 252 ~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald-~~~~~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~ 330 (400) .... . .....+..+.+...+.+..++. +...-.....+++++.++.+|+. ++|.+|.+.. T Consensus 158 --gi~~--~---~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~------------l~d~~G~~~~ 218 (324) T protein:vir:78 158 --SIAQ--S---IEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRK------------IVDPETKERI 218 (324) T ss_pred --cccc--c---ccccceeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHH------------hhccCCCeee Confidence 1111 1 1112223345666777887773 32323344579999999999887 7778887765 Q ss_pred hhhccccceecchhhhc-hhhhhccc--ceechhhceeccc------------------------eeeeecCceEEEEec Q lcl|Aclame:pro 331 EVGVDEIIVYTGSKALK-PTVLVDQK--YHIDMQDLTKVDA------------------------FEWKTNSNMILVETL 383 (400) Q Consensus 331 ~v~v~~~~~~tg~k~l~-~t~~vd~k--~~~~~~~~~~~~~------------------------~~~~~~~~~ilve~~ 383 (400) .-+.+.+.. |..... |.+.++.. +-.|++.+.-... .-|..++-.+.++.- T Consensus 219 ~~~~~~~l~--G~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r 296 (324) T protein:vir:78 219 YDRNSDSLD--GLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMH 296 (324) T ss_pred cCCCCCccc--ceeeEeeCCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEE Confidence 544433322 321100 11111110 1112222111000 013333333344444 Q ss_pred ccccceeeccceeEeeC Q lcl|Aclame:pro 384 TSGHVETYNAGAVITVS 400 (400) Q Consensus 384 ~~~~~~~~~~~~~~~~~ 400 (400) -.+.|-.-+|=++++-- T Consensus 297 ~d~~v~~~~A~~~l~~a 313 (324) T protein:vir:78 297 VALHIADDKAFAKLVPA 313 (324) T ss_pred EccEEecccceEEEecc Confidence 44444332322233221 No 76 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=98.87 E-value=2.9e-11 Score=78.36 Aligned_cols=277 Identities=13% Similarity=0.093 Sum_probs=148.7 Q ss_pred HHHccCChhHHHHHHHHHHh------Cccc-hhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEeecc-cc Q lcl|Aclame:pro 100 LKKNSGKSEIKNAWSAKLAE------NGVT-ITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-AN 171 (400) Q Consensus 100 l~~nqg~ke~k~AW~a~L~e------kgV~-~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~ 171 (400) |+..+-.+.-.+.|..++.. .++. ..+...++|..+...|-+.++++.++++-+.+.+.+.....+--.+ .. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~ 80 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADKP 80 (324) T ss_pred CCcchhhhHHHHHHHHHhhhhhhhccccccccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCc Confidence 22222222112222222211 1221 2223337999998888888898898888776666655444433332 23 Q ss_pred ccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCcccc Q lcl|Aclame:pro 172 EAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKS 251 (400) Q Consensus 172 ~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~ 251 (400) .+.-+--|.++.+...+|...++.|..++....+.+-+-+ .+.-++.+|+.++|++++. |+++.+++.|+|.+.... T Consensus 81 ~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~--ds~~~l~~~i~~~la~ai~-~~~d~a~l~G~g~~~~~~ 157 (324) T protein:vir:96 81 GAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLN--YTYSQFFEEMKPMIAEAFY-KKFDEAGILNQGNNPFGK 157 (324) T ss_pred ceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHh--cchHHHHHHHHHHHHHHHH-HHHHHHHhccCCCCCcCc Confidence 4444566888999999999999999888877777433222 2235789999999999999 899999999999754211 Q ss_pred chhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhc-eecccccceEEEEecchhHHHHhhhhhccccccceeecCCcceee Q lcl|Aclame:pro 252 IDKEADVKKIKKITTKAKSAGKTPFADAIEEAVD-FVRPTAGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIAS 330 (400) Q Consensus 252 ~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald-~~~~~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~ 330 (400) .... . .....+..+.+...+.+..++. +...-.....+++++.++.+|+. ++|.+|.+.. T Consensus 158 --gi~~--~---~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~------------l~d~~G~~~~ 218 (324) T protein:vir:96 158 --SIAQ--S---IEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRK------------IVDPETKERI 218 (324) T ss_pred --cccc--c---ccccceeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHH------------hhccCCCeee Confidence 1111 1 1112223345666777887773 32323344579999999999887 7778887765 Q ss_pred hhhccccceecchhhhc-hhhhhccc--ceechhhceeccc------------------------eeeeecCceEEEEec Q lcl|Aclame:pro 331 EVGVDEIIVYTGSKALK-PTVLVDQK--YHIDMQDLTKVDA------------------------FEWKTNSNMILVETL 383 (400) Q Consensus 331 ~v~v~~~~~~tg~k~l~-~t~~vd~k--~~~~~~~~~~~~~------------------------~~~~~~~~~ilve~~ 383 (400) .-+.+.+.. |..... |.+.++.. +-.|++.+.-... .-|..++-.+.++.- T Consensus 219 ~~~~~~~l~--G~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r 296 (324) T protein:vir:96 219 YDRNSDSLD--GLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMH 296 (324) T ss_pred cCCCCCccc--ceeeEeeCCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEE Confidence 544433322 321100 11111110 1112222111000 013333333344444 Q ss_pred ccccceeeccceeEeeC Q lcl|Aclame:pro 384 TSGHVETYNAGAVITVS 400 (400) Q Consensus 384 ~~~~~~~~~~~~~~~~~ 400 (400) -.+.|-.-+|=++++-- T Consensus 297 ~d~~v~~~~A~~~l~~a 313 (324) T protein:vir:96 297 VALHIADDKAFAKLVPA 313 (324) T ss_pred EccEEecccceEEEecc Confidence 44444332322233221 No 77 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=98.84 E-value=1.6e-11 Score=79.83 Aligned_cols=269 Identities=9% Similarity=0.031 Sum_probs=148.2 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhh-HhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEeecc-ccccceecc Q lcl|Aclame:pro 101 KKNSGKSEIKNAWSAKLAENGVTITDT-TFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-ANEAQVHKD 178 (400) Q Consensus 101 ~~nqg~ke~k~AW~a~L~ekgV~~qd~-~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~ 178 (400) |..+.- ....+..++. -..+|+.+...|-..+.++.++++..++.+.+.....+-..+ ...+.-+.- T Consensus 1 ma~~~~-----------~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E 69 (304) T protein:vir:94 1 MATPTY-----------TPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSE 69 (304) T ss_pred Cccccc-----------ccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeec Confidence 222221 1122322222 226999999999999999999999888777766554543333 345555677 Q ss_pred cchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhh Q lcl|Aclame:pro 179 GQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADV 258 (400) Q Consensus 179 ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~ 258 (400) |+++.++..+|...++.|..++....+.+-+.. .+.-++.+|++++|+.++- |+++.+++.|||.+.....-..+-+ T Consensus 70 ~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~l~~~ia-~~~d~~~l~G~g~~~~~~~~~~~~~ 146 (304) T protein:vir:94 70 TERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLK--WTAKDFFNEVKPLIAEAFY-KAFDQAVIFGTKSPYNTSTSGKPLV 146 (304) T ss_pred CcccccccceeeEEEEEEEEEEEeehhhHHHHh--cchHHHHHHHHHHHHHHHH-HHHHhhheeccCCCccccccccccc Confidence 888999999999999999888888777443322 2335789999999999999 7999999999998542221111211 Q ss_pred hhhhhhhhhhhccCCCCCHHHHHHhhceecc-cccceEEEEecchhHHHHhhhhhccccccceeecCCcceeehhhcccc Q lcl|Aclame:pro 259 KKIKKITTKAKSAGKTPFADAIEEAVDFVRP-TAGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEI 337 (400) Q Consensus 259 ~~ik~it~~at~~~~T~~~dal~Eald~~~~-~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~ 337 (400) .......++ ..+.+..-++|..++.-..+ -.....+++|+.++.+|+. ++|.+|.+.+.-+.+. T Consensus 147 ~~~~~~~~~--~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~------------lkd~~G~~l~~~~~~~- 211 (304) T protein:vir:94 147 EGAEEKGNV--VTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRN------------ALDANDRPLFDANGNE- 211 (304) T ss_pred ccccccccc--cccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHH------------hhccCCcEeecCCCcc- Confidence 111111111 12334456777777643333 2344568899999998886 6677777665333221 Q ss_pred ceecchhhh-chhhhhcc----cceechhhceecccee--------------------------eeecCceEEEEecccc Q lcl|Aclame:pro 338 IVYTGSKAL-KPTVLVDQ----KYHIDMQDLTKVDAFE--------------------------WKTNSNMILVETLTSG 386 (400) Q Consensus 338 ~~~tg~k~l-~~t~~vd~----k~~~~~~~~~~~~~~~--------------------------~~~~~~~ilve~~~~~ 386 (400) .. |-... .+.+..|- -+-.|.+.+.-....+ |+.++-.+.+|.--.+ T Consensus 212 l~--G~PV~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~ 289 (304) T protein:vir:94 212 IM--GLPLSYTGADVYDKKKSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAY 289 (304) T ss_pred cc--ceeeEEecccccCCCCcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEecc Confidence 11 21110 00000010 0111333222111100 2333333334444444 Q ss_pred cceeeccceeEeeC Q lcl|Aclame:pro 387 HVETYNAGAVITVS 400 (400) Q Consensus 387 ~~~~~~~~~~~~~~ 400 (400) -|..-+|-++++.. T Consensus 290 ~v~~~~a~~~l~~a 303 (304) T protein:vir:94 290 MNVKPEAFATLKPT 303 (304) T ss_pred EeecccceEEEEec Confidence 44433333344444 No 78 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=98.84 E-value=1.6e-11 Score=79.83 Aligned_cols=269 Identities=9% Similarity=0.031 Sum_probs=148.2 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhh-HhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEeecc-ccccceecc Q lcl|Aclame:pro 101 KKNSGKSEIKNAWSAKLAENGVTITDT-TFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-ANEAQVHKD 178 (400) Q Consensus 101 ~~nqg~ke~k~AW~a~L~ekgV~~qd~-~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~ 178 (400) |..+.- ....+..++. -..+|+.+...|-..+.++.++++..++.+.+.....+-..+ ...+.-+.- T Consensus 1 ma~~~~-----------~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~E 69 (304) T protein:vir:10 1 MATPTY-----------TPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLAKGVGAYWVSE 69 (304) T ss_pred Cccccc-----------ccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEeec Confidence 222221 1122322222 226999999999999999999999888777766554543333 345555677 Q ss_pred cchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhh Q lcl|Aclame:pro 179 GQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADV 258 (400) Q Consensus 179 ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~ 258 (400) |+++.++..+|...++.|..++....+.+-+.. .+.-++.+|++++|+.++- |+++.+++.|||.+.....-..+-+ T Consensus 70 ~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~l~~~ia-~~~d~~~l~G~g~~~~~~~~~~~~~ 146 (304) T protein:vir:10 70 TERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLK--WTAKDFFNEVKPLIAEAFY-KAFDQAVIFGTKSPYNTSTSGKPLV 146 (304) T ss_pred CcccccccceeeEEEEEEEEEEEeehhhHHHHh--cchHHHHHHHHHHHHHHHH-HHHHhhheeccCCCccccccccccc Confidence 888999999999999999888888777443322 2335789999999999999 7999999999998542221111211 Q ss_pred hhhhhhhhhhhccCCCCCHHHHHHhhceecc-cccceEEEEecchhHHHHhhhhhccccccceeecCCcceeehhhcccc Q lcl|Aclame:pro 259 KKIKKITTKAKSAGKTPFADAIEEAVDFVRP-TAGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEI 337 (400) Q Consensus 259 ~~ik~it~~at~~~~T~~~dal~Eald~~~~-~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~ 337 (400) .......++ ..+.+..-++|..++.-..+ -.....+++|+.++.+|+. ++|.+|.+.+.-+.+. T Consensus 147 ~~~~~~~~~--~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~------------lkd~~G~~l~~~~~~~- 211 (304) T protein:vir:10 147 EGAEEKGNV--VTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRN------------ALDANDRPLFDANGNE- 211 (304) T ss_pred ccccccccc--cccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHH------------hhccCCcEeecCCCcc- Confidence 111111111 12334456777777643333 2344568899999998886 6677777665333221 Q ss_pred ceecchhhh-chhhhhcc----cceechhhceecccee--------------------------eeecCceEEEEecccc Q lcl|Aclame:pro 338 IVYTGSKAL-KPTVLVDQ----KYHIDMQDLTKVDAFE--------------------------WKTNSNMILVETLTSG 386 (400) Q Consensus 338 ~~~tg~k~l-~~t~~vd~----k~~~~~~~~~~~~~~~--------------------------~~~~~~~ilve~~~~~ 386 (400) .. |-... .+.+..|- -+-.|.+.+.-....+ |+.++-.+.+|.--.+ T Consensus 212 l~--G~PV~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~ 289 (304) T protein:vir:10 212 IM--GLPLSYTGADVYDKKKSLALMGDWDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAY 289 (304) T ss_pred cc--ceeeEEecccccCCCCcEEEEEehhhEEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEecc Confidence 11 21110 00000010 0111333222111100 2333333334444444 Q ss_pred cceeeccceeEeeC Q lcl|Aclame:pro 387 HVETYNAGAVITVS 400 (400) Q Consensus 387 ~~~~~~~~~~~~~~ 400 (400) -|..-+|-++++.. T Consensus 290 ~v~~~~a~~~l~~a 303 (304) T protein:vir:10 290 MNVKPEAFATLKPT 303 (304) T ss_pred EeecccceEEEEec Confidence 44433333344444 No 79 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=98.79 E-value=5.8e-11 Score=76.72 Aligned_cols=260 Identities=13% Similarity=0.029 Sum_probs=150.2 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEeeccccccceecccc Q lcl|Aclame:pro 101 KKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDSANEAQVHKDGQ 180 (400) Q Consensus 101 ~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~na~~a~GHk~ga 180 (400) |..+.- .+-+..+---.+|..+...|-+.+.++.++++..++.+.+.....+-..+...+.=+.-|. T Consensus 1 ~g~~a~-------------~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~ 67 (299) T protein:vir:41 1 MGFNPD-------------TTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKPEEEFTFMSGVGAFWVDEAE 67 (299) T ss_pred CCcCCC-------------cccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecCCCcEEEEEEcCCceeeeecCc Confidence 111100 0011111112689998888888899888888877777766654444333434444567788 Q ss_pred hhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhh Q lcl|Aclame:pro 181 TKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKK 260 (400) Q Consensus 181 ~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ 260 (400) ++.++..+|...++.|..++....+.+-+.+ .++-++.+|+++.|+.++- |+++.+++.|||.+.... .... T Consensus 68 ~~~~~~~~f~~v~l~~~k~~~~~~is~ell~--ds~~~~~~~i~~~l~~a~~-~~~d~a~l~G~g~~~~~g--il~~--- 139 (299) T protein:vir:41 68 RIQTSKPTFTKAKMRSKKMGVIIPTTKENLN--YSVTNFFSLMQAEIVEAFY-KKFDQAVFTGVESPYNWN--ILKS--- 139 (299) T ss_pred cccccccceeEEEEeeEEEEEeehhhHHHHh--cCHHHHHHHHHHHHHHHHH-HHHHHHHhhcccCccccc--cccc--- Confidence 9999999999999999999888777444333 3345789999999999999 799999999999864321 1111 Q ss_pred hhhhhhhhhccCCCCCHHHHHHhhc-eecccccceEEEEecchhHHHHhhhhhccccccceeecCCcceeehhhcccc-- Q lcl|Aclame:pro 261 IKKITTKAKSAGKTPFADAIEEAVD-FVRPTAGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEI-- 337 (400) Q Consensus 261 ik~it~~at~~~~T~~~dal~Eald-~~~~~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~-- 337 (400) ..... .+..+.+...+++..++. +...-.....+++++.++.+|+. ++|.+|++.+.-.+... T Consensus 140 ~~~~~--~~~~~~~~~~~~l~~~~~~l~~~~~~~~~~v~n~~~~~~L~~------------lkd~~G~~l~~~~~~~~~~ 205 (299) T protein:vir:41 140 ATDAS--NLVEETANKYDDLNEAIGLIEAEDLEPNGIATIRKQRVKYRS------------TKDGNGMPIFNTATSNGVD 205 (299) T ss_pred ccccc--eeeccccccHHHHHHHHHhhhcccCCcCEEEEcHHHHHHHHH------------hhccCCceeecCCcCCCCc Confidence 11111 122334456788888884 33333334569999999999887 78888887663322111 Q ss_pred ceecchhhhchhhhhcc---------cceechhhceeccceeee----------------------ecCceEE--EEecc Q lcl|Aclame:pro 338 IVYTGSKALKPTVLVDQ---------KYHIDMQDLTKVDAFEWK----------------------TNSNMIL--VETLT 384 (400) Q Consensus 338 ~~~tg~k~l~~t~~vd~---------k~~~~~~~~~~~~~~~~~----------------------~~~~~il--ve~~~ 384 (400) ++. | .|-++.|. -+-.|++.+.-....++. |..|++. ++.-- T Consensus 206 ~l~-G----~PV~~~~~~~~~~~~~~~~~gdfs~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~ 280 (299) T protein:vir:41 206 DVL-G----LPIAYTPKYTFGDKDISELVGDWNQAYYGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEV 280 (299) T ss_pred eec-c----eeeEEecccCCCCCceEEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEe Confidence 110 3 23222221 122334433221111110 2333433 33333 Q ss_pred cccceeeccceeEeeC Q lcl|Aclame:pro 385 SGHVETYNAGAVITVS 400 (400) Q Consensus 385 ~~~~~~~~~~~~~~~~ 400 (400) .+-+..-+|=++++.. T Consensus 281 d~~v~~~~A~~~l~~~ 296 (299) T protein:vir:41 281 GFMVVKDEAFSAVQPK 296 (299) T ss_pred ccEEecccceEEEEec Confidence 4444334444444444 No 80 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=98.76 E-value=4.9e-11 Score=77.08 Aligned_cols=281 Identities=10% Similarity=0.008 Sum_probs=143.8 Q ss_pred HHHccC-ChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEee-ccccccceec Q lcl|Aclame:pro 100 LKKNSG-KSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSF-DSANEAQVHK 177 (400) Q Consensus 100 l~~nqg-~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l-~na~~a~GHk 177 (400) |.-|-. ..++-.+- ..+...+.+.+--..+|..+..-|=+.++++.++++..++...+.....+-. .+...+.-+. T Consensus 1 ~~~~~~r~~~~~~~~--e~~a~~~~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~a~~v~ 78 (326) T protein:vir:42 1 MAVNPDRTTPFLGVN--DPKVAQTGDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMGTTGQKIPHWTGDVSASWIG 78 (326) T ss_pred CCCCccchhhhcCcc--hhhheeccccCCcceechhhHHHHHHHHHhcchhhhhcceeeccCCceEEEEEeCCcceEEec Confidence 111111 11221110 0011112112212269999999888889988988886666665554444433 2333444457 Q ss_pred ccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhh Q lcl|Aclame:pro 178 DGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEAD 257 (400) Q Consensus 178 ~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D 257 (400) -|.++.+...+|...++.|..++....+.+.+ ++.+.-++.+|++++|++++- |..+.++++|||.+...... .. T Consensus 79 Eg~~~~~~~~~f~~i~~~~~k~~~~v~iS~el--l~~s~~~~~~~i~~~l~~a~~-~~~d~a~l~G~gs~~p~gi~--~~ 153 (326) T protein:vir:42 79 EGDMKPITKGNMTSQTIAPHKIATIFVASAET--VRANPANYLGTMRTKVATAFA-MAFDNAAINGTDSPFPTFLA--QT 153 (326) T ss_pred CCccccccccceeEEEEeeEEEEEeehhhHHH--HhcCHHHHHHHHHHHHHHHHH-HHHHHHhhcccCCCcccccc--cc Confidence 78899999999999999999888887774433 233346789999999999999 79999999999986532211 11 Q ss_pred hhhhhhhhhhhhccCC--CCCHHHHHHhh-ceecccccceEEEEecchhHHHHhhhhhccccccceeecCCcceeehhhc Q lcl|Aclame:pro 258 VKKIKKITTKAKSAGK--TPFADAIEEAV-DFVRPTAGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGV 334 (400) Q Consensus 258 ~~~ik~it~~at~~~~--T~~~dal~Eal-d~~~~~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v 334 (400) .....-....++.... ++....+...+ ..+..-.....+++|+.++.+|+. ++|.+|.+.+.-++ T Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~------------lkd~~G~~l~~~~~ 221 (326) T protein:vir:42 154 TKEVSLVDPDGTGSNADLTVYDAVAVNALSLLVNAGKKWTHTLLDDITEPILNG------------AKDKSGRPLFIEST 221 (326) T ss_pred ccccceeecccccccccchhHHHHHHHHHhhhhhhccCccEEEEeHHHHHHHHH------------hhccCCceeecccc Confidence 1111111111222221 22222233333 333333344568899999999887 78888887765433 Q ss_pred ccccee-------cchhhhchhhhhc-cccee---chhhceeccceeee----------------------ecCceE--E Q lcl|Aclame:pro 335 DEIIVY-------TGSKALKPTVLVD-QKYHI---DMQDLTKVDAFEWK----------------------TNSNMI--L 379 (400) Q Consensus 335 ~~~~~~-------tg~k~l~~t~~vd-~k~~~---~~~~~~~~~~~~~~----------------------~~~~~i--l 379 (400) .+.... -|....+ +..++ .+..+ |++.+.-.+..++. |..|++ . T Consensus 222 ~~~~~~~~~~~~l~G~pv~~-~~~~~~~~~~~~~Gd~s~~~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r 300 (326) T protein:vir:42 222 YTEENSPFRLGRIVARPTIL-SDHVASGTVVGYQGDFRQLVWGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVR 300 (326) T ss_pred ccCccccccCceeeeeeEEE-cCCCCCCceEEEEeecceEEEEEecceEEEEeecceeeecccccccchhhhhcCcEEEE Confidence 221110 0221111 11111 12211 33333211111110 222332 3 Q ss_pred EEecccccceeeccceeEeeC Q lcl|Aclame:pro 380 VETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 380 ve~~~~~~~~~~~~~~~~~~~ 400 (400) ++..-.+.|..-.|=+.|+.- T Consensus 301 ~~~~~d~~v~~~~a~~~l~~~ 321 (326) T protein:vir:42 301 VEAEYAFHCNDKDAFVKLTNV 321 (326) T ss_pred EEEEeccEEecccceEEEeec Confidence 344344444333333333322 No 81 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=98.73 E-value=1.4e-10 Score=74.68 Aligned_cols=274 Identities=13% Similarity=0.124 Sum_probs=143.3 Q ss_pred HHHccCCh-hHHHHHHHHHHh------Cccc-hhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEeec-cc Q lcl|Aclame:pro 100 LKKNSGKS-EIKNAWSAKLAE------NGVT-ITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFD-SA 170 (400) Q Consensus 100 l~~nqg~k-e~k~AW~a~L~e------kgV~-~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~-na 170 (400) |++++-.+ +.++ |...+.. -++. .++..-++|..+..-|-..+.++.++++.+.+...+.....+--. .. T Consensus 1 ~~~~~~~~~~~~~-f~~~~~~~~~~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~ 79 (324) T protein:vir:96 1 MEQTQKLKLNLQH-FASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFWADK 79 (324) T ss_pred CCcchhhhHHHHH-HHHhhhhhhhcccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecC Confidence 11111111 1111 1111111 1111 122223689988888888888888888866665555443333222 22 Q ss_pred cccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccc Q lcl|Aclame:pro 171 NEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFK 250 (400) Q Consensus 171 ~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~ 250 (400) ..+.-.--|.++.+...+|...++.|..++...++.+-+.+ .+..++.+|+.++|++++- |+++.+++.|||.+..- T Consensus 80 ~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~--ds~~~l~~~i~~~l~~aia-~~~d~~~l~G~g~~~~~ 156 (324) T protein:vir:96 80 PGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLN--YTYSQFFEEMKPMIAEAFY-KKFDEAGILNQGNNPFG 156 (324) T ss_pred cceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHh--cchHHHHHHHHHHHHHHHH-HHHHHHhhhcCCCCCcC Confidence 33444566788899999999999999988888777443322 3335789999999999999 79999999999975321 Q ss_pred cchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceeccc-ccceEEEEecchhHHHHhhhhhccccccceeecCCccee Q lcl|Aclame:pro 251 SIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPT-AGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIA 329 (400) Q Consensus 251 ~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~-~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~ 329 (400) . ... ..+ ....+..+++...++|..++.-..+. .....+++++.++.+|+. ++|.+|.+. T Consensus 157 ~--~~~--~~~---~~~~~~~~~~~~~~~i~~~~~~i~~~~~~~~~~i~n~~~~~~L~~------------lkd~~G~~~ 217 (324) T protein:vir:96 157 K--SIA--QSI---KKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRK------------IVDPETKER 217 (324) T ss_pred c--ccc--ccc---cccceecccccchHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHH------------hhCCCCCee Confidence 1 111 111 11122334456678888877433332 333468999999999887 788888876 Q ss_pred ehhhccccceecchhhhc-hhhhhcccc--eechhhceecccee--e--------------------eecCceE--EEEe Q lcl|Aclame:pro 330 SEVGVDEIIVYTGSKALK-PTVLVDQKY--HIDMQDLTKVDAFE--W--------------------KTNSNMI--LVET 382 (400) Q Consensus 330 ~~v~v~~~~~~tg~k~l~-~t~~vd~k~--~~~~~~~~~~~~~~--~--------------------~~~~~~i--lve~ 382 (400) +.-+.+.+.. |....+ |....+... -.|++.+.-....+ . -|.+|++ .++. T Consensus 218 ~~~~~~~~l~--G~PV~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~~~ 295 (324) T protein:vir:96 218 IYDRNSDSLD--GLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATM 295 (324) T ss_pred ecCCCCCccc--ceeeEeecCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEE Confidence 6533333222 322110 111111100 11222211100000 0 0222332 2333 Q ss_pred cccccceeeccceeEeeC Q lcl|Aclame:pro 383 LTSGHVETYNAGAVITVS 400 (400) Q Consensus 383 ~~~~~~~~~~~~~~~~~~ 400 (400) --.+.| .+..|+..+. T Consensus 296 r~d~~v--~~~~a~~~l~ 311 (324) T protein:vir:96 296 HVALHI--ADDKAFAKLV 311 (324) T ss_pred EeccEE--ecccceEEEe Confidence 222222 2233332222 No 82 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=98.68 E-value=7.3e-11 Score=76.15 Aligned_cols=273 Identities=15% Similarity=0.098 Sum_probs=141.0 Q ss_pred HHccCChhHHHHHHHHHHhCccch-hhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEeec-cccccceecc Q lcl|Aclame:pro 101 KKNSGKSEIKNAWSAKLAENGVTI-TDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFD-SANEAQVHKD 178 (400) Q Consensus 101 ~~nqg~ke~k~AW~a~L~ekgV~~-qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~-na~~a~GHk~ 178 (400) |..+. ...-.+.. .+-...+|..+...|-+.++++..+++...+...+.....+--. ....+.-..- T Consensus 1 m~~~~-----------~~a~~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E 69 (330) T protein:vir:77 1 MAGST-----------VPSTQVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGPTGISIPHWTGAVSASWTGE 69 (330) T ss_pred Ccccc-----------cchhhccccCCCcceechhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceeEecC Confidence 11111 11111222 22223688777777777788878877755544443332232222 2233444566 Q ss_pred cchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhh Q lcl|Aclame:pro 179 GQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADV 258 (400) Q Consensus 179 ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~ 258 (400) |.++++...+|...++.|..++.+..+.+-+.+ .+.-++.+|++++|+.++- +.++.++++|||.+.. ..-...+. T Consensus 70 g~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~--ds~~~~~~~i~~~l~~ai~-~~~~~~~l~G~g~~~~-~~g~~~~~ 145 (330) T protein:vir:77 70 AERKPITKGSFGKQELEPVKITTIFAESAEVVR--LNPLNYLNTMRTKIAEAIA-LKFDAAAIHGIDKPSA-FKGYLAET 145 (330) T ss_pred CCccccccceeeEEEEeEEEEEEeehhhHHHHh--cchHHHHHHHHHHHHHHHH-HHHHHHhhcccCCCCc-cccccccc Confidence 788999999999999999988888777433222 2335789999999999999 7999999999997431 11111111 Q ss_pred hhhhhhhh---hhhccCCCCCHHHHHHhhc-eecccccceEEEEecchhHHHHhhhhhccccccceeecCCcceeehhhc Q lcl|Aclame:pro 259 KKIKKITT---KAKSAGKTPFADAIEEAVD-FVRPTAGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGV 334 (400) Q Consensus 259 ~~ik~it~---~at~~~~T~~~dal~Eald-~~~~~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v 334 (400) ........ .......+...+++..++. ..+.......+++|+.++.+|+. +||.+|.+.+.-++ T Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~------------lkd~~G~~l~~~~~ 213 (330) T protein:vir:77 146 TKVVSLADTNLTTASGPQGNAYLAVNNALSLLVNSGKKWTGTLLDNVTEPILNT------------AVDGNGRPLFVEST 213 (330) T ss_pred cccceeecccccccccccchhHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHH------------HhccCCceeecCcc Confidence 11111110 0111112223466666663 33333444578999999999988 88889888764332 Q ss_pred cccceec--chhhh-chhhhhcc-----------cceechhhceecccee----------------------------ee Q lcl|Aclame:pro 335 DEIIVYT--GSKAL-KPTVLVDQ-----------KYHIDMQDLTKVDAFE----------------------------WK 372 (400) Q Consensus 335 ~~~~~~t--g~k~l-~~t~~vd~-----------k~~~~~~~~~~~~~~~----------------------------~~ 372 (400) ....... |..-| +|-++.|. =+..|++.|.-.+..+ |. T Consensus 214 ~~~~~~~~~~~~l~G~PV~~~~~~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~ 293 (330) T protein:vir:77 214 YTEQVGAIREGRILGRPTYVADNVVNGTVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQ 293 (330) T ss_pred ccccccccCCceecceeeEEeccccCCCCCCccEEEEEecceEEEEEecCcEEEEeecceeeecccccccccccccchhh Confidence 2211110 00000 23222222 1223444443222211 22 Q ss_pred ecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 373 TNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 373 ~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) .++-.+.++.--.+-|-.-+|=++|+.- T Consensus 294 ~~~~~~r~~~r~d~~v~~~~a~~~i~~~ 321 (330) T protein:vir:77 294 HNMVAVRCEAEFAFMVNDKDAFVKLTDQ 321 (330) T ss_pred cCcEEEEEEEEeccEEecccceEEEEec Confidence 2233333444334433322222233222 No 83 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=98.66 E-value=2.8e-10 Score=72.99 Aligned_cols=275 Identities=14% Similarity=0.069 Sum_probs=152.0 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEeecc-ccccceeccc Q lcl|Aclame:pro 101 KKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-ANEAQVHKDG 179 (400) Q Consensus 101 ~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~g 179 (400) +.... .-.+|...+...+- ++.-..+|..+..-|-+.+.++.++++-..+...+.....+-..+ ...+.-+.-| T Consensus 1 ~~~~~---~~~~e~~~~~~~~~--~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~~~~a~~v~Eg 75 (318) T protein:vir:24 1 MAAGT---AFAVDHAQIAQTGD--TMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWVGDVSAQWIGEG 75 (318) T ss_pred CCCCC---CCCHHHHHhhcccC--cccceeechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCcceEEecCC Confidence 22221 12346666655543 444447999988888888888888888666666555554554433 2345555668 Q ss_pred chhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhh Q lcl|Aclame:pro 180 QTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVK 259 (400) Q Consensus 180 a~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~ 259 (400) .+++++..+|+..++.|..++....+.+.+-+ .+..++.+|++++|+..+- +.++.++++|||.+.....- ... T Consensus 76 ~~~~~~~~~f~~i~~~~~k~~~~~~iS~e~l~--ds~~~~~~~i~~~l~~~~~-~~~d~a~l~G~g~~~~~~~~--~~~- 149 (318) T protein:vir:24 76 DMKPITKGNMTSQTIAPHKIATIFVASAETVR--ANPANYLGTMRTKVATAFA-MAFDGAAMHGTDSPFPTYIG--QTT- 149 (318) T ss_pred ccccccccceeEEEEeeEEEEEeehhhHHHhh--cChHHHHHHHHHHHHHHHH-HHHHHhhhcccCCCCCcccc--ccc- Confidence 88999999999999999988887777443322 2224689999999999999 79999999999986432221 111 Q ss_pred hhhhhhhhhhccCCCCCHHHHHHhhceecccc-cceEEEEecchhHHHHhhhhhccccccceeecCCcceeehhhccccc Q lcl|Aclame:pro 260 KIKKITTKAKSAGKTPFADAIEEAVDFVRPTA-GRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEII 338 (400) Q Consensus 260 ~ik~it~~at~~~~T~~~dal~Eald~~~~~~-~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~ 338 (400) ..++......+.+...+.+..++.-..+.. ....+++++.++.+|+. ++|.+|.+.+.=.+..-. T Consensus 150 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~------------lkd~~G~~l~~~~~~~~~ 215 (318) T protein:vir:24 150 --KAISIADTTGATTVYDQVAVNGLSLLVNDGKKWTHTLLDDITEPILNG------------AKDQNGRPLFIESTYGEA 215 (318) T ss_pred --ccccccccccccchHHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHH------------hhccCCceeecCccccCc Confidence 112222222233333344444443222222 23468999999998887 888888876432221111 Q ss_pred eec--chhhh-chhhh---hcc-cc---eechhhceecccee------------------------eeecCceEEEEecc Q lcl|Aclame:pro 339 VYT--GSKAL-KPTVL---VDQ-KY---HIDMQDLTKVDAFE------------------------WKTNSNMILVETLT 384 (400) Q Consensus 339 ~~t--g~k~l-~~t~~---vd~-k~---~~~~~~~~~~~~~~------------------------~~~~~~~ilve~~~ 384 (400) ..+ |..-+ .|.++ ++. +. -.|.+.+.-....+ |..++-.|.++.-- T Consensus 216 ~~~~~~~~i~g~pv~~~~~~~~~~~~~~~gdfs~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~ 295 (318) T protein:vir:24 216 ASPFRSGRIVARPTILSDHVVEGTTVGFMGDFSQLIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEY 295 (318) T ss_pred cccccCceEEEEeeEEeCCCCCCccEEEEeecceEEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEE Confidence 110 11110 11111 111 11 11333332111100 33333334455555 Q ss_pred cccceeeccceeEeeC Q lcl|Aclame:pro 385 SGHVETYNAGAVITVS 400 (400) Q Consensus 385 ~~~~~~~~~~~~~~~~ 400 (400) .+.|.--.+=++|+.. T Consensus 296 d~~v~~~~a~~~i~~~ 311 (318) T protein:vir:24 296 AFHCNDAEAFVALTNV 311 (318) T ss_pred ccEEecccceEEEEee Confidence 5555444443344432 No 84 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=98.62 E-value=2e-08 Score=62.77 Aligned_cols=347 Identities=13% Similarity=0.131 Sum_probs=146.3 Q ss_pred CcccccccchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHhhhcch Q lcl|Aclame:pro 1 MRISKRNMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGK 80 (400) Q Consensus 1 ~~~s~~~~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kEk~K~k 80 (400) |-+ .-+..+++..++|...+++..++.. ...++ ....+...+...++++..... T Consensus 1 mt~---------~~~~~e~~~~~~e~~~~~~~~~~~~---------~~~e~---~~~~~~~~~~~~~~~~~~~~~----- 54 (395) T protein:vir:95 1 MAD---------MKQNNVKLKNYHEHKKQFANLVQNG---------ASDEE---QSKAFGAMFDALSNDLQEEIT----- 54 (395) T ss_pred Chh---------HHHHHHHHHHHHHHHHHHHHHHhhh---------hhHHH---HHHHHHHHHHHHHHHHHHHHH----- Confidence 211 1122222222222222222111111 00011 111111111112222111111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccc Q lcl|Aclame:pro 81 DKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGA 160 (400) Q Consensus 81 ~emtEfLkTkqA~~dya~ll~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~ 160 (400) .++...... +..... ...+....+.++.+.+ +...+. .+.-.++|..+...|-..+.++.++++..+|.+.+. T Consensus 55 ~e~~~~~~~--~~~~~~--r~~~~l~~ee~~~~~~-~~~~t~--~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~~ 127 (395) T protein:vir:95 55 AEINNRVVD--NGILAK--RSQDPLTSEERKFFND-INYDVG--YTDEKILPETVVERVFDDLQKDHPLLSKINFQNAGI 127 (395) T ss_pred HHHHHHHHH--HHHHhh--cCccccchHHHHHHHH-HhhccC--CCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCC Confidence 111111111 001000 0111123455555544 333222 233347999999999999999999999888877765 Q ss_pred eeeEEee-ccc-cccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 161 LLVSRSF-DSA-NEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDL 238 (400) Q Consensus 161 ~a~~i~l-~na-~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~r 238 (400) .. .+.. ... ...||--.+..++....+|...++.+..+|.+......+ |+-+.-++-+|+.++|++.|- ++++. T Consensus 128 ~~-~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~iS~el--l~ds~~~ie~~i~~~la~~ia-~~~~~ 203 (395) T protein:vir:95 128 KT-RVIKADPAGQAVWGKVFGEIKGQLDAAFREENFTQYKLTCFVVLPDDL--STFGPAWIERFVRTQIQEAIS-VALES 203 (395) T ss_pred ce-EEEEecCCcceEEeecccccCccccccceeeeeceeeEEEeecccHHH--HhcchhHHHHHHHHHHHHHHH-HHHhh Confidence 43 3333 333 455544445556667889999999988877765553333 233335678999999999999 79999 Q ss_pred ceeeccCCCccccchhhhhhhhhhhhhhhhhccCCCCC-------HHHHHHhh---cee-----cccccceEEEEecchh Q lcl|Aclame:pro 239 ALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPF-------ADAIEEAV---DFV-----RPTAGRRYLIVKAEDR 303 (400) Q Consensus 239 Avv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~-------~dal~Eal---d~~-----~~~~~~~~l~~~~~d~ 303 (400) |+++|||+...+-.-+..+......-.+... .+.+.. .+.+...+ .+- ..-.+..+.++++.+. T Consensus 204 a~i~G~G~~~~qP~Gil~~~~~~~~~~~~~~-~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~ 282 (395) T protein:vir:95 204 AIINGGGAAKTQPVGLMKDVNTNSGAVTDKA-SSGTLTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDS 282 (395) T ss_pred heeeccCCCCcCceeeeeccccccccccccc-ccchhhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhh Confidence 9999999853211111111110000000000 011111 11222211 111 0111233556666665 Q ss_pred HHHHhhhhhccccccceeecCCcceeehhhccccceecchhhhchhhhhcccceechhhceeccceeeee--------cC Q lcl|Aclame:pro 304 KALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKT--------NS 375 (400) Q Consensus 304 ~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~tg~k~l~~t~~vd~k~~~~~~~~~~~~~~~~~~--------~~ 375 (400) ..+. .+--..+.+|+++...|.+-.++. +.+| |.. .-.--|++.|+-.+..++.. -. T Consensus 283 ~~~~---------g~~~~~~~~G~~~~~lg~g~~v~~--~~~~-p~~---~i~fgdfs~y~i~~r~~~~i~~~~~~~~~~ 347 (395) T protein:vir:95 283 WDVQ---------ARYTYLTANGGFVTVLPYNVTIIT--SEFV-PEG---KLVAFVTDRYNAVRGGGLTVKKFDQTLALE 347 (395) T ss_pred hhcC---------CcceeccCCCcceeccCCcceEEE--cCCC-CCC---cEEEEecccEEEEEecceEEEeccchhhhC Confidence 4332 222234456777665554422221 1122 210 01223455554443333222 11 Q ss_pred ceE--EEEecccccceeeccceeEeeC Q lcl|Aclame:pro 376 NMI--LVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 376 ~~i--lve~~~~~~~~~~~~~~~~~~~ 400 (400) +++ .+-.--.|-+---+|=.|.+|+ T Consensus 348 d~~~f~~~~r~dg~~~~~~A~~~l~i~ 374 (395) T protein:vir:95 348 DAVLFTAKTFAYGQPDDNKASAVYDLK 374 (395) T ss_pred CcEEEEEEEEECCEEeccccEEEEEee Confidence 111 1111112222222333344444 No 85 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=98.58 E-value=2.7e-08 Score=62.12 Aligned_cols=331 Identities=11% Similarity=0.089 Sum_probs=151.0 Q ss_pred HHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHH--HHHHHHHHHH-HHhhHhhh--cchhHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 27 NVSLKSQISGFEVKNAIEDLPKVQELEKTLSEN--SIEIIKIENE-LNAQEEKP--KGKDKMTNFIESQNAVTEFFDVLK 101 (400) Q Consensus 27 ~~~~Ks~i~~~~~~~~~~~~skieElektis~l--~aEi~K~enE-l~~~kEk~--K~k~emtEfLkTkqA~~dya~ll~ 101 (400) |.- ..+++..+ +-.|+...+++. +.+..+..++ ++++.+.. +.+.++.++ ..+.+ . T Consensus 1 m~i--------k~~~~~~~--~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~-------~~~~~--~ 61 (381) T protein:vir:10 1 MTI--------NLSETFAN--AKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAEAERV-------SSLPK--S 61 (381) T ss_pred Cch--------hhHHHHHH--HHHHHHHHHhhhhhhHHHHHHHHHHHHhhhhhHHHHHHHHHHHH-------HHhcc--C Confidence 110 00000000 111111111100 0000111111 11111111 111122111 11100 1 Q ss_pred HccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEee-ccc-cccceeccc Q lcl|Aclame:pro 102 KNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSF-DSA-NEAQVHKDG 179 (400) Q Consensus 102 ~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l-~na-~~a~GHk~g 179 (400) .+....+.++.+.+ +.. |. +.|--.++|..+...|-+.+.+++++++..+|.+.+... .+-. .+. .-.||--.+ T Consensus 62 ~~~lt~~e~~~~~~-~~~-~~-~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~~-~i~~~~~~~~a~w~~e~~ 137 (381) T protein:vir:10 62 AQSLSANQRSFFMD-INK-NV-NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRL-KFLKSETSGVAVWGKIYG 137 (381) T ss_pred cccccHHHHHHHHH-Hhc-cc-CCCCceecCHHHHHHHHHHHHhhccceeheeeEecCcce-EEEEecCCcceeeecccc Confidence 11123344554433 222 21 234445899999999999999999999988888877543 3322 222 344554444 Q ss_pred chhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhh Q lcl|Aclame:pro 180 QTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVK 259 (400) Q Consensus 180 a~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~ 259 (400) ..+.....+|...++.+..+|..-.+..-+ |+.++-++-+|+..+|++.|- ++.+.|++.|||+.-. .-+..++. T Consensus 138 ~~~~~~~~~f~~i~l~~~kl~~~~~is~el--L~Ds~~~ie~~i~~~la~~~a-~~~~~a~i~G~G~~qP--~Gil~~~~ 212 (381) T protein:vir:10 138 EIKGQLDAAFSEETAIQNKLTAFVVLPKDL--NDFGPAWIERFVRVQIEEAFA-VALETAFLKGTGKDQP--IGLNRQVQ 212 (381) T ss_pred cccccccccceeeeecceeEEeechhhHHH--hhcCHHHHHHHHHHHHHHHHH-HHhhheeEeccCCCCc--eeeeeccC Confidence 445555778999999988888776664333 444556889999999999999 7999999999998321 11111110 Q ss_pred hhhhh----------hhhhhccCCCCCHHH---HHHhhcee-----cccccceEEEEecchhHHHHhhhhhcccccccee Q lcl|Aclame:pro 260 KIKKI----------TTKAKSAGKTPFADA---IEEAVDFV-----RPTAGRRYLIVKAEDRKALLDELRQATANANVRI 321 (400) Q Consensus 260 ~ik~i----------t~~at~~~~T~~~da---l~Eald~~-----~~~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~l 321 (400) ..... ..+.+........+. +..++++. ++-.+.-+.++++.+...|+.-. -. T Consensus 213 ~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~---------~~ 283 (381) T protein:vir:10 213 KGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY---------TH 283 (381) T ss_pred cccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhcccc---------cc Confidence 00000 000111111111222 33344332 11223345678888877765310 13 Q ss_pred ecCCcceeehhhccccceecchhhhchhhhhcccceechhhceeccceeeee--------cCceEEEEecccccceeec- Q lcl|Aclame:pro 322 KNDDTEIASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKT--------NSNMILVETLTSGHVETYN- 392 (400) Q Consensus 322 k~~d~~~~~~v~v~~~~~~tg~k~l~~t~~vd~k~~~~~~~~~~~~~~~~~~--------~~~~ilve~~~~~~~~~~~- 392 (400) .+.+|.|+...+.+-.++. +..| |. +.=..-|++.|+-.+..++.+ -.+++.+=.....-.-.++ T Consensus 284 ~~~~G~~v~~l~~g~~vv~--s~~~-p~---~~iifgDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dg~~~~~ 357 (381) T protein:vir:10 284 LNANGVYVTALPFNLNVIE--STVQ-EA---GKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDN 357 (381) T ss_pred CCCCCceeecCCCCceEEe--cCCC-Cc---CcEEEEecccEEEEEecccEEEeechhHhhcCCeEEEEEEEEcCEEecC Confidence 4568888876665544432 2222 21 111233455555444433322 2222222222111112232 Q ss_pred -cceeEeeC Q lcl|Aclame:pro 393 -AGAVITVS 400 (400) Q Consensus 393 -~~~~~~~~ 400 (400) |=.|++++ T Consensus 358 ~A~~v~~l~ 366 (381) T protein:vir:10 358 KVAAVWKLD 366 (381) T ss_pred ceEEEEEEE Confidence 22344444 No 86 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=98.58 E-value=2.7e-08 Score=62.12 Aligned_cols=331 Identities=11% Similarity=0.089 Sum_probs=151.0 Q ss_pred HHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHH--HHHHHHHHHH-HHhhHhhh--cchhHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 27 NVSLKSQISGFEVKNAIEDLPKVQELEKTLSEN--SIEIIKIENE-LNAQEEKP--KGKDKMTNFIESQNAVTEFFDVLK 101 (400) Q Consensus 27 ~~~~Ks~i~~~~~~~~~~~~skieElektis~l--~aEi~K~enE-l~~~kEk~--K~k~emtEfLkTkqA~~dya~ll~ 101 (400) |.- ..+++..+ +-.|+...+++. +.+..+..++ ++++.+.. +.+.++.++ ..+.+ . T Consensus 1 m~i--------k~~~~~~~--~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~-------~~~~~--~ 61 (381) T protein:vir:95 1 MTI--------NLSETFAN--AKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQAKAEAERV-------SSLPK--S 61 (381) T ss_pred Cch--------hhHHHHHH--HHHHHHHHHhhhhhhHHHHHHHHHHHHhhhhhHHHHHHHHHHHH-------HHhcc--C Confidence 110 00000000 111111111100 0000111111 11111111 111122111 11100 1 Q ss_pred HccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEee-ccc-cccceeccc Q lcl|Aclame:pro 102 KNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSF-DSA-NEAQVHKDG 179 (400) Q Consensus 102 ~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l-~na-~~a~GHk~g 179 (400) .+....+.++.+.+ +.. |. +.|--.++|..+...|-+.+.+++++++..+|.+.+... .+-. .+. .-.||--.+ T Consensus 62 ~~~lt~~e~~~~~~-~~~-~~-~~~gg~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~~~~-~i~~~~~~~~a~w~~e~~ 137 (381) T protein:vir:95 62 AQSLSANQRSFFMD-INK-NV-NYKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRL-KFLKSETSGVAVWGKIYG 137 (381) T ss_pred cccccHHHHHHHHH-Hhc-cc-CCCCceecCHHHHHHHHHHHHhhccceeheeeEecCcce-EEEEecCCcceeeecccc Confidence 11123344554433 222 21 234445899999999999999999999988888877543 3322 222 344554444 Q ss_pred chhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhh Q lcl|Aclame:pro 180 QTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVK 259 (400) Q Consensus 180 a~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~ 259 (400) ..+.....+|...++.+..+|..-.+..-+ |+.++-++-+|+..+|++.|- ++.+.|++.|||+.-. .-+..++. T Consensus 138 ~~~~~~~~~f~~i~l~~~kl~~~~~is~el--L~Ds~~~ie~~i~~~la~~~a-~~~~~a~i~G~G~~qP--~Gil~~~~ 212 (381) T protein:vir:95 138 EIKGQLDAAFSEETAIQNKLTAFVVLPKDL--NDFGPAWIERFVRVQIEEAFA-VALETAFLKGTGKDQP--IGLNRQVQ 212 (381) T ss_pred cccccccccceeeeecceeEEeechhhHHH--hhcCHHHHHHHHHHHHHHHHH-HHhhheeEeccCCCCc--eeeeeccC Confidence 445555778999999988888776664333 444556889999999999999 7999999999998321 11111110 Q ss_pred hhhhh----------hhhhhccCCCCCHHH---HHHhhcee-----cccccceEEEEecchhHHHHhhhhhcccccccee Q lcl|Aclame:pro 260 KIKKI----------TTKAKSAGKTPFADA---IEEAVDFV-----RPTAGRRYLIVKAEDRKALLDELRQATANANVRI 321 (400) Q Consensus 260 ~ik~i----------t~~at~~~~T~~~da---l~Eald~~-----~~~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~l 321 (400) ..... ..+.+........+. +..++++. ++-.+.-+.++++.+...|+.-. -. T Consensus 213 ~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~---------~~ 283 (381) T protein:vir:95 213 KGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQY---------TH 283 (381) T ss_pred cccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhcccc---------cc Confidence 00000 000111111111222 33344332 11223345678888877765310 13 Q ss_pred ecCCcceeehhhccccceecchhhhchhhhhcccceechhhceeccceeeee--------cCceEEEEecccccceeec- Q lcl|Aclame:pro 322 KNDDTEIASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKT--------NSNMILVETLTSGHVETYN- 392 (400) Q Consensus 322 k~~d~~~~~~v~v~~~~~~tg~k~l~~t~~vd~k~~~~~~~~~~~~~~~~~~--------~~~~ilve~~~~~~~~~~~- 392 (400) .+.+|.|+...+.+-.++. +..| |. +.=..-|++.|+-.+..++.+ -.+++.+=.....-.-.++ T Consensus 284 ~~~~G~~v~~l~~g~~vv~--s~~~-p~---~~iifgDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dg~~~~~ 357 (381) T protein:vir:95 284 LNANGVYVTALPFNLNVIE--STVQ-EA---GKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDN 357 (381) T ss_pred CCCCCceeecCCCCceEEe--cCCC-Cc---CcEEEEecccEEEEEecccEEEeechhHhhcCCeEEEEEEEEcCEEecC Confidence 4568888876665544432 2222 21 111233455555444433322 2222222222111112232 Q ss_pred -cceeEeeC Q lcl|Aclame:pro 393 -AGAVITVS 400 (400) Q Consensus 393 -~~~~~~~~ 400 (400) |=.|++++ T Consensus 358 ~A~~v~~l~ 366 (381) T protein:vir:95 358 KVAAVWKLD 366 (381) T ss_pred ceEEEEEEE Confidence 22344444 No 87 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=98.58 E-value=3.3e-10 Score=72.54 Aligned_cols=264 Identities=12% Similarity=0.097 Sum_probs=144.5 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEee--ccccccceecc Q lcl|Aclame:pro 101 KKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSF--DSANEAQVHKD 178 (400) Q Consensus 101 ~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l--~na~~a~GHk~ 178 (400) |..++-+.... ..+++-..++|+.+...|-+.+.++.++++...+.+.+...-..-+ .....+.-+.- T Consensus 1 m~~~~~~~~~~----------~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E 70 (297) T protein:vir:95 1 MTVQTFNPENV----------LVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGISAYWVNE 70 (297) T ss_pred CCccccccccc----------cccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCceeEEeec Confidence 44443321111 1112222368999999988889888999997777665543322333 22345666778 Q ss_pred cchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhh Q lcl|Aclame:pro 179 GQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADV 258 (400) Q Consensus 179 ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~ 258 (400) |+++++...+|...++.|..++....+.+-. ++.+.-++.+|++++|+.++- ++++.+++.|||.+........ T Consensus 71 g~~~~~~~~~f~~v~l~~~k~~~~~~is~el--l~ds~~~l~~~i~~~la~ai~-~~~d~a~l~G~g~~~~~gi~~~--- 144 (297) T protein:vir:95 71 TEKIKTDKPEVVPVTLKAHKLGIILVTSREA--LNYTWKKFFEDMKPQIVEAFY-KKIDEAGLLGHDTPFANSVAKA--- 144 (297) T ss_pred CccccccccceeEEEEeeEEEEEeehhhHHH--HhcCHHHHHHHHHHHHHHHHH-HHHHHHHhcccCCccccccccc--- Confidence 8899999999999999998888877763322 222235679999999999999 7999999999998654322111 Q ss_pred hhhhhhhhhhhccCCCCCHHHHHHhhceecccc-cceEEEEecchhHHHHhhhhhccccccceeecCCcceeehhhcccc Q lcl|Aclame:pro 259 KKIKKITTKAKSAGKTPFADAIEEAVDFVRPTA-GRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEI 337 (400) Q Consensus 259 ~~ik~it~~at~~~~T~~~dal~Eald~~~~~~-~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~ 337 (400) .+...+..+.+...+++..++.-..+.. ....+++++.++.+|+. |+|.+|.+.+.-. +.. T Consensus 145 -----~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~------------l~d~~G~~i~~~~-~~~ 206 (297) T protein:vir:95 145 -----AKDANKVIGGPINYDNILKLQDALYDADVEPNAFVSKIQNRSALRE------------ARDGNKVSIYDKA-ANT 206 (297) T ss_pred -----ccccceecccccCHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHH------------hhccCCceeecCC-CCc Confidence 1111222344566788888773333322 33468889999888876 6666776654211 111 Q ss_pred ceecchhhh-chhhhhcccc--eechhhceeccceeee----------------------ecCceEEEEecccccceeec Q lcl|Aclame:pro 338 IVYTGSKAL-KPTVLVDQKY--HIDMQDLTKVDAFEWK----------------------TNSNMILVETLTSGHVETYN 392 (400) Q Consensus 338 ~~~tg~k~l-~~t~~vd~k~--~~~~~~~~~~~~~~~~----------------------~~~~~ilve~~~~~~~~~~~ 392 (400) .. |.-.. .|+..++... ..|++.+.-....+.. |.+|++-+-....--....| T Consensus 207 l~--G~Pv~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~ 284 (297) T protein:vir:95 207 ID--GITTVDLKSARFEKGDLLAGDFDNLIYGVPYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITK 284 (297) T ss_pred cc--ceeeEeecCCCCCCceEEEEecccEEEEEecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeec Confidence 11 22110 0111122211 1344433221111100 23343333222211222233 Q ss_pred cceeEeeC Q lcl|Aclame:pro 393 AGAVITVS 400 (400) Q Consensus 393 ~~~~~~~~ 400 (400) ..++..+. T Consensus 285 ~~a~~~l~ 292 (297) T protein:vir:95 285 TDAFAKLT 292 (297) T ss_pred ccceEEEe Confidence 33333332 No 88 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=98.51 E-value=9.3e-10 Score=70.09 Aligned_cols=276 Identities=17% Similarity=0.135 Sum_probs=144.4 Q ss_pred HHHHHHHHhC--ccchhh-----hHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEeeccc--c-------ccc Q lcl|Aclame:pro 111 NAWSAKLAEN--GVTITD-----TTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDSA--N-------EAQ 174 (400) Q Consensus 111 ~AW~a~L~ek--gV~~qd-----~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~na--~-------~a~ 174 (400) -||+..|... |...+. .-.++|+.+..-|-+.+++...+++.+++.+.+.....+--.+. . .+. T Consensus 1 ~~~~~e~~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~~~~~~ 80 (338) T protein:vir:78 1 MATLNELAPNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPISYGETIIPTTVKRPEVGQVGVGTSN 80 (338) T ss_pred CcchHHhhhhhcccccccceecccccccchHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCccceeecccccc Confidence 5667666655 322221 12279999999888888988999887777766654443332221 1 111 Q ss_pred eecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCcc-ccch Q lcl|Aclame:pro 175 VHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGF-KSID 253 (400) Q Consensus 175 GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t-~~~~ 253 (400) -..-|++++....+|...++.|..++...++.+.+.+- +.-++.+|++++|++.+- |..+.+++.|||...- .... T Consensus 81 ~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~d--s~~~~~~~i~~~la~a~~-~~~d~~~l~G~g~~~~~~~~g 157 (338) T protein:vir:78 81 EQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARM--NPSGLYTKLQADLAYAIG-RGIDLAVFHGKSPLTGSALQG 157 (338) T ss_pred cccccccccccccceeEEEEEEEEEEEeehhhHHHHhc--CHHHHHHHHHHHHHHHHH-HHHHHHhhcccCCCccccccc Confidence 12346788889999999999998888887775543332 335689999999999999 7999999999996321 1111 Q ss_pred hhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceecccc--cceEEEEecchhHHHHhhhhhccccccceeecCCcceeeh Q lcl|Aclame:pro 254 KEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTA--GRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASE 331 (400) Q Consensus 254 ~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~~--~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~ 331 (400) ...+.....-.+......+.....|++..++.-..... ....+++++.++.+|+. +| .++|.+|++.++ T Consensus 158 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~-~~--------~l~d~~g~~l~~ 228 (338) T protein:vir:78 158 IDTNNVIVNTTNVDYLQTGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLR-SQ--------AYRDANGNVDPT 228 (338) T ss_pred cccccccccccccccccccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHH-Hh--------hhccCCCceeec Confidence 11221111100111111222344567777764433222 33468888888777643 11 278888888764 Q ss_pred hhccccceec--chhhhc----hhh---hhcccc---eechhhceeccceeee----------------------ecCce Q lcl|Aclame:pro 332 VGVDEIIVYT--GSKALK----PTV---LVDQKY---HIDMQDLTKVDAFEWK----------------------TNSNM 377 (400) Q Consensus 332 v~v~~~~~~t--g~k~l~----~t~---~vd~k~---~~~~~~~~~~~~~~~~----------------------~~~~~ 377 (400) -......-+| |-...+ |.. ..+.+. --|++.|.-.+..++. |..|+ T Consensus 229 ~~~~~~~~~~l~G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 308 (338) T protein:vir:78 229 RINLAASAGDLLGLPVQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRVKMSDTATLTDNTSPTPQTVSMWQTNQ 308 (338) T ss_pred ccccCCCCceeeeeeEEEccccCccccccCCcccEEEEEecceEEEEeecccEEEEeecccccccccccccchhhhhcCc Confidence 3322111111 433221 211 111222 1244333322211111 12233 Q ss_pred EEE--EecccccceeeccceeEeeC Q lcl|Aclame:pro 378 ILV--ETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 378 ilv--e~~~~~~~~~~~~~~~~~~~ 400 (400) |.+ |.--.+- -.+..|++.+. T Consensus 309 ~~~r~~~r~d~~--v~~~~a~~~l~ 331 (338) T protein:vir:78 309 IAILIEVTFGWL--LGDKQAFVKFV 331 (338) T ss_pred EEEEEEEEeccE--eecccceEEEe Confidence 322 2111111 12222222222 No 89 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=98.46 E-value=1.9e-09 Score=68.34 Aligned_cols=278 Identities=18% Similarity=0.162 Sum_probs=151.6 Q ss_pred HHHHHHHHhC--ccchhhh-----HhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEeecc--ccccc---e--- Q lcl|Aclame:pro 111 NAWSAKLAEN--GVTITDT-----TFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS--ANEAQ---V--- 175 (400) Q Consensus 111 ~AW~a~L~ek--gV~~qd~-----~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n--a~~a~---G--- 175 (400) =||+.+|... |+..+.. ...+|..+...|-+.+.++.++++...+...+.....+-..+ ..-.| | T Consensus 1 ~a~l~el~~~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~eg~~~ 80 (333) T protein:vir:78 1 MATLNELLPNSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISYGETIIPTTVKRPEVGQVGVGTSN 80 (333) T ss_pred CchhHHhhhhcccccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCceeEeecCcccc Confidence 5666666543 4322211 116899999999999999888888766665554433433322 12222 1 Q ss_pred -ecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchh Q lcl|Aclame:pro 176 -HKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDK 254 (400) Q Consensus 176 -Hk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~ 254 (400) ...+..++.+..+|...++.|..++...++.+-+. +.+.-++..|+.++|++.+- |.++-++++|||........- T Consensus 81 ~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell--~~s~~~~~~~i~~~la~ai~-~~~d~~~l~G~g~~~~~~~~g 157 (333) T protein:vir:78 81 EQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFA--RMNPSGLYTKLQGDLAYAIG-RGIDLAVFHGKSPLTGSALQG 157 (333) T ss_pred cccccccccccccceeEEEEeeEEEEEeehhhHHHH--hcCHHHHHHHHHHHHHHHHH-HHHHHHHhcccCCCCCccccc Confidence 23356788889999999999877777666644332 33335789999999999999 799999999999743221111 Q ss_pred hhhhhhhhhhh-hhhhccCCCCCHHHHHHhhceeccccc--ceEEEEecchhHHHHhhhhhccccccceeecCCcceeeh Q lcl|Aclame:pro 255 EADVKKIKKIT-TKAKSAGKTPFADAIEEAVDFVRPTAG--RRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASE 331 (400) Q Consensus 255 e~D~~~ik~it-~~at~~~~T~~~dal~Eald~~~~~~~--~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~ 331 (400) ......+...+ ......+++.+.+++..++.-...... ...+++|+.+...|+. + ..++|.+|.+.++ T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~-~--------~~~~d~~G~~i~~ 228 (333) T protein:vir:78 158 IDTDNVIANTTNVDYLQETGDPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLR-A--------QAYRDANGNVDPS 228 (333) T ss_pred ccccccccccccccccccccchhHHHHHHHHHhhccccccCceEEEEcchHHHHHHH-H--------hhhcCCCCceeec Confidence 11000000000 111223345567788887754433322 2368888888777654 1 1367788887765 Q ss_pred hhccccceec--chhhhc----hhhh---hccc------------------ceechhhceeccc------eeeeecCceE Q lcl|Aclame:pro 332 VGVDEIIVYT--GSKALK----PTVL---VDQK------------------YHIDMQDLTKVDA------FEWKTNSNMI 378 (400) Q Consensus 332 v~v~~~~~~t--g~k~l~----~t~~---vd~k------------------~~~~~~~~~~~~~------~~~~~~~~~i 378 (400) -.+-...-+| |....+ |... .+.+ ..|+++++.+... ..|..++=.+ T Consensus 229 ~~~~~~~~~~l~G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~ 308 (333) T protein:vir:78 229 RINLAAQTGDVLGLPAQFGRAVGGDLGAAVDSKTRIIGGDFSQLKFGFADEIRIKMSDTATLTDSGSATVSMWQTNQIAI 308 (333) T ss_pred CccccCCCceeeceeeEEccccCCCccccCCCccEEEEEecccEEEEEeeccEEEEeccccccccccceeehhhcCcEEE Confidence 3322111111 433322 2110 1111 1222222221111 1233333344 Q ss_pred EEEecccccceeeccceeEeeC Q lcl|Aclame:pro 379 LVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 379 lve~~~~~~~~~~~~~~~~~~~ 400 (400) .++.--.+.|..-+|=++|+.. T Consensus 309 r~~~r~d~~v~~~~a~~~l~~~ 330 (333) T protein:vir:78 309 LIEVTFGWLLGDKQAFVKFVDD 330 (333) T ss_pred EEEEEEccEEecccceEEEecc Confidence 5566666666555555555544 No 90 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=98.44 E-value=1.3e-07 Score=58.32 Aligned_cols=333 Identities=12% Similarity=0.097 Sum_probs=146.9 Q ss_pred HHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHH-HHHH-HHHH-HHHHhhHhhhcchhHHHHHHHHHHHHHHHHHHHHHc Q lcl|Aclame:pro 27 NVSLKSQISGFEVKNAIEDLPKVQELEKTLSEN-SIEI-IKIE-NELNAQEEKPKGKDKMTNFIESQNAVTEFFDVLKKN 103 (400) Q Consensus 27 ~~~~Ks~i~~~~~~~~~~~~skieElektis~l-~aEi-~K~e-nEl~~~kEk~K~k~emtEfLkTkqA~~dya~ll~~n 103 (400) |.-+.. ++.++ +-+++.+.+.+. ..+. .+.. ...++..+..+.+ .-.|+. ++ ....+ -.+ T Consensus 1 m~~kl~--------~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~e~~---~~-~~~~~--~~~ 63 (381) T protein:vir:10 1 MTINLS--------ETFAN--AKNEFINAVNNGEPQERQNELYGDMINQLFEETKLQ-AKAEAE---RV-SSLPK--SAQ 63 (381) T ss_pred CchhHH--------HHHHH--HHHHHHHHHHhhhHHHHHHHHHHHHHHhhhhhHHHH-HHHHHH---HH-HHhcc--ccc Confidence 111100 11111 111111111110 0000 0111 1111111111000 011110 01 00000 111 Q ss_pred cCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEeecc-ccccceecccchh Q lcl|Aclame:pro 104 SGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-ANEAQVHKDGQTK 182 (400) Q Consensus 104 qg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~ga~K 182 (400) ....+-++.+.+ +...+- .+-..++|..+...|-+.+.+++++++...|.+.+...-+.-... ....||.-.+..+ T Consensus 64 ~l~~~e~~~~~~-~~~~t~--~~Gg~lvP~~~~~~I~~~l~~~spir~~a~v~~~~~~~~i~~~~~~~~a~W~~e~~~~~ 140 (381) T protein:vir:10 64 TLSANQRNFFMD-INKSVG--YKEEKLLPEETIDRIFEDLTTNHPLLADLGIKNAGLRLKFLKSETSGVAVWGKIYGEIK 140 (381) T ss_pred ccCHHHHHHHHH-HhhcCC--CCCceecCHHHHHHHHHHHHhhcceeeeeeeEecCcceEEEeecCCcceEEeecccccc Confidence 112344444433 322211 122347999999999999999999999888877765432222222 3456777666666 Q ss_pred hhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhh Q lcl|Aclame:pro 183 TEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIK 262 (400) Q Consensus 183 k~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik 262 (400) .....+|...++.+..+|..-.+..-+ |+-++-++-+|+..+|+..|- ++.+.|++.|||+.-. .-+..++-... T Consensus 141 ~~~~~~f~~i~l~~~kl~a~i~is~el--L~Ds~~~le~~i~~~la~~~a-~~~~~afi~GdG~~qP--~Gil~~~~~~~ 215 (381) T protein:vir:10 141 GQLDAAFSEETAIQNKLTAFVVLPKDL--NDFGPAWIERFVRVQIEEAFA-VALETAFLKGTGKDQP--IGLNRQVQKGV 215 (381) T ss_pred cccCccceeEeecceeEEeeccccHHH--HhccHHHHHHHHHHHHHHHHH-HHhhceeEecccCCCc--eeeeecCCccc Confidence 666789999999988888776663333 344446788999999999999 7999999999998421 11111100000 Q ss_pred hhhhhh-----hccCCCCC------HH---HHHHhhcee-----cccccceEEEEecchhHHHHhhhhhccccccceeec Q lcl|Aclame:pro 263 KITTKA-----KSAGKTPF------AD---AIEEAVDFV-----RPTAGRRYLIVKAEDRKALLDELRQATANANVRIKN 323 (400) Q Consensus 263 ~it~~a-----t~~~~T~~------~d---al~Eald~~-----~~~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~ 323 (400) ..+.+ ++.+..++ .+ ++...++.. ++-.+..++++++.+...++- ..-..+ T Consensus 216 -~~~~g~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~---------~~~~~~ 285 (381) T protein:vir:10 216 -SVTDGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQA---------QYTHLN 285 (381) T ss_pred -cccccccccccccccccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhcc---------ccccCC Confidence 00000 00000011 11 112222111 112233466788888777653 111457 Q ss_pred CCcceeehhhccccceecchhhhchhhhhcccceechhhceeccceeeee--------cCceEEEEec--ccccceeecc Q lcl|Aclame:pro 324 DDTEIASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKT--------NSNMILVETL--TSGHVETYNA 393 (400) Q Consensus 324 ~d~~~~~~v~v~~~~~~tg~k~l~~t~~vd~k~~~~~~~~~~~~~~~~~~--------~~~~ilve~~--~~~~~~~~~~ 393 (400) .+|+|+...+.+-.++.+ .. +|. +.=.--|.+.|+-.+..++.+ -.+++.+=.. -.|.+-.=+| T Consensus 286 ~~G~~v~~lp~g~~vv~~--~~-~p~---~~i~fGDfs~Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dG~~~~~~A 359 (381) T protein:vir:10 286 ANGVYVTALPFNLNVIES--TV-QEA---GKVLTYVKGLYDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKV 359 (381) T ss_pred CCCceeecCCCCceeEEc--CC-CCc---CcEEEEEcccEEEEEecccEEEeechhhhhcCceEEEEEEEEcCEEecCCc Confidence 789888655544333321 22 231 111223444444444333322 2222222111 1122211122 Q ss_pred ceeEeeC Q lcl|Aclame:pro 394 GAVITVS 400 (400) Q Consensus 394 ~~~~~~~ 400 (400) =.|++++ T Consensus 360 ~~v~~l~ 366 (381) T protein:vir:10 360 AAVWKLD 366 (381) T ss_pred EEEEEEe Confidence 2354444 No 91 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=98.44 E-value=2.3e-09 Score=67.94 Aligned_cols=249 Identities=14% Similarity=0.091 Sum_probs=139.8 Q ss_pred HHHhCccch-hhhHhhcchhHHHHHHHHHHhhCccccceeeeccccee--eEEeeccc--cccceecccchhhhh-hhhh Q lcl|Aclame:pro 116 KLAENGVTI-TDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALL--VSRSFDSA--NEAQVHKDGQTKTEQ-AATL 189 (400) Q Consensus 116 ~L~ekgV~~-qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a--~~i~l~na--~~a~GHk~ga~Kk~q-~~~l 189 (400) -|......+ .+--..+|..+...|-+.++++.++.+...+.+.+... ..+...+. ..+.-.--|.++++. ..+| T Consensus 1 ~l~~~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~ 80 (293) T protein:vir:48 1 MLDSKTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADIDDPKL 80 (293) T ss_pred CceeecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCcccccccccce Confidence 333322211 12112689999999999999988887766655544332 23332221 122223335566654 5789 Q ss_pred hhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 190 TIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAK 269 (400) Q Consensus 190 e~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at 269 (400) ...++.|..++....+-+.+.+ .+.-++.+|++++|+..+- |..+.+++.|+|...+. T Consensus 81 ~~i~l~~~k~~~~~~iS~ell~--ds~~~l~~~i~~~la~~~~-~~~~~~i~~g~~~~~~~------------------- 138 (293) T protein:vir:48 81 SLIKYTIKRYAGISTVTNSLLA--DSAENILAWLSGWIAKKVV-VTRNKAILGVVDKLPTK------------------- 138 (293) T ss_pred eEEEEeeeEEEEeehhhHHHHh--hhhHHHHHHHHHHHHHHHH-HHHHhHHhhcccccccc------------------- Confidence 9999999999888777433332 2224689999999999998 79999999999875532 Q ss_pred ccCCCCCHHHHHHhhceeccc-ccceEEEEecchhHHHHhhhhhccccccceeecCCcceeehhhccccceec--chhhh Q lcl|Aclame:pro 270 SAGKTPFADAIEEAVDFVRPT-AGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKAL 346 (400) Q Consensus 270 ~~~~T~~~dal~Eald~~~~~-~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l 346 (400) +++...|+|..++.=..|. .....+++|+.++..|+. ++|.+|++.+.-++.+-.-.| |..-. T Consensus 139 --~~~~~~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~L~~------------lkd~~g~~l~~~~~~~~~~~~l~G~Pv~ 204 (293) T protein:vir:48 139 --PTLTKWDDIIDLEAKVDPAIKQTSFFLTNTSGFTALKK------------VKNALGDYLMERDVKSPTGYSIAGFAVK 204 (293) T ss_pred --ccccCHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHH------------hhccCCceEeecCcCCCCCceecceeeE Confidence 1233456666655222222 223468889999988887 889999988765544432222 43221 Q ss_pred ------chhhhhccc--ceechhhcee-ccce----------eeeecCce--EEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 347 ------KPTVLVDQK--YHIDMQDLTK-VDAF----------EWKTNSNM--ILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 347 ------~~t~~vd~k--~~~~~~~~~~-~~~~----------~~~~~~~~--ilve~~~~~~~~~~~~~~~~~~~ 400 (400) .|.+..+.. +--|++++.. .+.. +..|..|+ +.++.--.|-+---++-.+++++ T Consensus 205 ~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~ 279 (293) T protein:vir:48 205 EISDRWLPNASSGVMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFK 279 (293) T ss_pred EecccccCCccCCceEEEEEeccceEEEEEecceEEEEecccchhhhcCeEEEEEEEeeCcEEecccceEEEEee Confidence 122221111 1124443221 1111 11233333 34444444444444444455555 No 92 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=98.43 E-value=2.2e-09 Score=68.01 Aligned_cols=263 Identities=14% Similarity=0.061 Sum_probs=135.9 Q ss_pred CccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEeecc-ccccceecccchhhhhhhhhhhhhccHHH Q lcl|Aclame:pro 120 NGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-ANEAQVHKDGQTKTEQAATLTIDTLEPVM 198 (400) Q Consensus 120 kgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~ga~Kk~q~~~le~~ti~p~~ 198 (400) -|+. ++--.++|..+...|-..++++..+++...+...+.....+...+ ...+.=+.-|.+++..+.+|...++.|.. T Consensus 1 m~t~-t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~~v~l~~~k 79 (303) T protein:vir:97 1 MGTE-TSKASLFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEFTFTLDSDIDVVAENGKKTHGGLSLEPVTIVPIK 79 (303) T ss_pred Cccc-CCCCeEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEecCcceEEeecCccccccccceeeEEeeeEE Confidence 2211 111227898888888888888788887766666665545553333 23344455678999999999999999988 Q ss_pred HHHHHHHHH-HHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCc-cccchhhhhhhhhhhhhhhhhcc-CCCC Q lcl|Aclame:pro 199 VYKLQSLAE-RVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNG-FKSIDKEADVKKIKKITTKAKSA-GKTP 275 (400) Q Consensus 199 VYkkq~Lad-~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~-t~~~~~e~D~~~ik~it~~at~~-~~T~ 275 (400) ++...++-+ ++.....+.-.+.+|+.++|+.++- |.++.++.+|+|... +...+ .+... ....++.+... ++.. T Consensus 80 l~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~-~~ld~a~l~G~~~~~g~~~~~-~~~~~-~~~~~~~~~~~~~~~~ 156 (303) T protein:vir:97 80 VEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLA-RGIDLMAMHGINPRTKKASDV-IGTNH-FDSKVTQVVKFTESED 156 (303) T ss_pred EEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHH-HHHHhhhhcccccCCcccccc-ccccc-cccccccccccccccc Confidence 888877733 3333344456789999999999999 799999999964311 11111 11100 00111111111 2233 Q ss_pred CHHHHHHhhceecc-cccceEEEEecchhHHHHhhhhhccccccceeecCCcceeehh--hccc---cceecchhhhc-- Q lcl|Aclame:pro 276 FADAIEEAVDFVRP-TAGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEV--GVDE---IIVYTGSKALK-- 347 (400) Q Consensus 276 ~~dal~Eald~~~~-~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v--~v~~---~~~~tg~k~l~-- 347 (400) ..+++..++.-..+ -.....+++|+.++.+|+. ++|.+|.+..-- ..+. .+. |....+ T Consensus 157 ~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~------------lkd~~g~~~~~~~~~~~~~~~~l~--G~Pv~~s~ 222 (303) T protein:vir:97 157 ADANIEAAVNLIQGAEGVVTGLAMDTEFSTALAK------------VTNGEMGPKMYPELAWGANPDSIN--GLKSSVNT 222 (303) T ss_pred hHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHH------------hhccCCCeEEecCccCCCCCceec--ceeeEEec Confidence 34666666643222 2233458999999999987 777777655421 1111 111 422211 Q ss_pred --hh-----------hhhcc--cceechh--------hceecccee-eeecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 348 --PT-----------VLVDQ--KYHIDMQ--------DLTKVDAFE-WKTNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 348 --~t-----------~~vd~--k~~~~~~--------~~~~~~~~~-~~~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) |. ++.|= .|.+.+. ++..-|..+ =-|.+||+.+-....--.--.++.|++.+- T Consensus 223 ~v~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~ 299 (303) T protein:vir:97 223 TVGAGADEAESKDLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVT 299 (303) T ss_pred ccCCccccCCCccEEEEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEee Confidence 10 11111 1111111 111000000 003344444422211112223344444333 No 93 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=98.43 E-value=2.7e-09 Score=67.53 Aligned_cols=260 Identities=13% Similarity=0.069 Sum_probs=131.5 Q ss_pred hCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEeecc-ccccceecccc-----hhhhhhhhhhhh Q lcl|Aclame:pro 119 ENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-ANEAQVHKDGQ-----TKTEQAATLTID 192 (400) Q Consensus 119 ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~ga-----~Kk~q~~~le~~ 192 (400) =+....++--.++|..+..-|-+.++++.++++...+.+.+.....+-..+ ...+.-+.-|+ ++.....+|... T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~~s~~~f~~i 80 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLATLPEADWVGESATDPKGVKPTSKVTWANR 80 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEEEEEeCCcceEEeecccccccccccccccceeeE Confidence 122222444447999999999999998898888777666655444443322 22233233333 466678889999 Q ss_pred hccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccc-cchhhhhhhhhhhhhhhhhcc Q lcl|Aclame:pro 193 TLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFK-SIDKEADVKKIKKITTKAKSA 271 (400) Q Consensus 193 ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~-~~~~e~D~~~ik~it~~at~~ 271 (400) ++.|..++....+.+.+.+ .+.-++.+|+.++|+..+- |..+.++++|||.+... .............-. +.. T Consensus 81 ~~~~~k~~~~~~is~ell~--ds~~~~~~~i~~~l~~~~a-~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~---~~~ 154 (305) T protein:vir:25 81 TLVAEEIAVIIPVHENVID--DATVAVLTEVAELGGQAIG-KKLDQAVIFGTDKPASWVSPALIPAAVTAGQAV---EVV 154 (305) T ss_pred EeeeEEEEEeehhhHHHHh--cchHHHHHHHHHHHHHHHH-HHHhhhheeccCCCCCccccccccccccccccc---ccc Confidence 9999888877666443333 3335789999999999999 79999999999975421 222222211111111 111 Q ss_pred CCCCC----HHHHHHhhcee-cccccceEEEEecchhHHHHhhhhhccccccceeecCCcceeeh--------hhccccc Q lcl|Aclame:pro 272 GKTPF----ADAIEEAVDFV-RPTAGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASE--------VGVDEII 338 (400) Q Consensus 272 ~~T~~----~dal~Eald~~-~~~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~--------v~v~~~~ 338 (400) ..+.. .+++..++... .......-+++|+.++.+|+- ++|.+|.+.+. |-+++.. T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~------------lkd~~G~~i~~~~~l~G~Pv~~~~~~ 222 (305) T protein:vir:25 155 GGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVAN------------IRDANGNPVFRDDSFAGFRTFFNRNG 222 (305) T ss_pred ccchhhhHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHH------------hhccCCceeecCCcccccceEEcCcc Confidence 11222 22333333222 222222347889999888876 78888887653 2222211 Q ss_pred eecchhhhchhhhhc---------ccceechhhcee--ccceee-eecCceEE--EEecccccceeeccceeEeeC Q lcl|Aclame:pro 339 VYTGSKALKPTVLVD---------QKYHIDMQDLTK--VDAFEW-KTNSNMIL--VETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 339 ~~tg~k~l~~t~~vd---------~k~~~~~~~~~~--~~~~~~-~~~~~~il--ve~~~~~~~~~~~~~~~~~~~ 400 (400) -..+.+. +-.+.| +-..|++++..+ .+.... -|.+||+. ++.-..++ ..|..+++.+. T Consensus 223 ~~~~~~~--~~~~gd~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~~~--v~~p~a~v~~~ 294 (305) T protein:vir:25 223 AWDADAA--IEVIADSSRVKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYV--LGVSATAQGAN 294 (305) T ss_pred CCCCCcc--EEEEEecceEEEEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEEeecce--eeCcccEEEEc Confidence 0000000 000111 111112221110 000000 13333333 23222222 34444444443 No 94 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=98.37 E-value=1.1e-07 Score=58.72 Aligned_cols=372 Identities=13% Similarity=0.136 Sum_probs=152.3 Q ss_pred Cccc-------------ccccchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhh-------hhhhhHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MRIS-------------KRNMNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNA-------IEDLPKVQELEKTLSENS 60 (400) Q Consensus 1 ~~~s-------------~~~~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~-------~~~~skieElektis~l~ 60 (400) +... ....+......-.+.+..+++....+..++.++..+.+ -.+..++++++..+..++ T Consensus 167 ~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~e~i~~l~~~ra~~~~~~~~l~~~a~~~g~~l~aee~~~~d~l~aei~~l~ 246 (645) T protein:vir:93 167 KPVVKIASSAGAAAQSTTVFHKEKTIMNIGEQIKSFENKRAALAASLEEVMTKAAEEGRTLDVEEEEHYDNTAAEIRQVD 246 (645) T ss_pred cchhhhhhhhcchhhccccccccccccchhhhhhhhhHHHHHHHHHhhhhhhhHhhhccccCHHHHHHHHHHHHHHHHHH Confidence 0000 00000000000111233333333333333333321111 122366777777777777 Q ss_pred HHHHHHHH-HHHhhH------hhhcch---------hHHHHHHHHHHHHHHHHHHHHHccCCh----hHHH-H------- Q lcl|Aclame:pro 61 IEIIKIEN-ELNAQE------EKPKGK---------DKMTNFIESQNAVTEFFDVLKKNSGKS----EIKN-A------- 112 (400) Q Consensus 61 aEi~K~en-El~~~k------Ek~K~k---------~emtEfLkTkqA~~dya~ll~~nqg~k----e~k~-A------- 112 (400) .+|...+. +..... +...+. +...+=.........|++-|....|.- ++.+ . T Consensus 247 ~~i~r~e~~e~~~a~~a~pv~~~~~~~~~~~~~~~~~~~~~~~~kg~~f~~~~~al~~~~g~~~~a~e~a~~~~~~~~~~ 326 (645) T protein:vir:93 247 AHLKRLRELEAGKAATAQPVKQAGNGNVAAVASAPVIRVEQKLDKGIGFARFAKSLAAAKGVRSEALEVARRQYPDDSRL 326 (645) T ss_pred HHHHHHHHHHHHHHhcccccccccccccccccccccccchhhhhhhhhHHHHHHHHHhcccchhHHHHHHHhhcccchhh Confidence 66643222 111000 000000 000000001111122333333333320 1111 1 Q ss_pred --HHHHHHhCccchhhhH----hhcchhHHHHHHHHHHhhCcccccee------eecccceeeEEeecc--ccccceecc Q lcl|Aclame:pro 113 --WSAKLAENGVTITDTT----FQLPRKLVESINTALLNTNPVFKVFH------VTNVGALLVSRSFDS--ANEAQVHKD 178 (400) Q Consensus 113 --W~a~L~ekgV~~qd~~----eiLP~~iI~AIe~A~ed~d~vl~~fh------V~n~~~~a~~i~l~n--a~~a~GHk~ 178 (400) +..+-...|.++ +.. .+.|..+..-|=+.+.+. .++..+- ....| +-+.+--++ ....|. -- T Consensus 327 ~~~~~~a~~~~~~~-~~~~~Gg~~vp~~~~~~ii~~l~~~-svv~~l~~~~~~~~~~~~-~~~~ip~~t~~~~a~wv-~E 402 (645) T protein:vir:93 327 HHVLKSAVGAGTTT-DPQWAGSLSEYQEYAQDFIDYLRPQ-TIIGRFGQGGIPALRQVP-FNIRVHAQVSGGAAGWV-GE 402 (645) T ss_pred hhhhhhhhhccccc-cccccCCccCchhhHHHHHHhhhhh-hhHHhhcccccccccccc-CceeeeeeecCcceEEe-cc Confidence 111111233321 211 145544333333344432 2222211 01111 111222222 344554 35 Q ss_pred cchhhhhhhhhhhhhccHHHHHHHHHHH-HHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhh Q lcl|Aclame:pro 179 GQTKTEQAATLTIDTLEPVMVYKLQSLA-ERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEAD 257 (400) Q Consensus 179 ga~Kk~q~~~le~~ti~p~~VYkkq~La-d~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D 257 (400) |.++..+..+|...++.|..++-...+- +++.+.. -++.+|+.++|+.++- +.++.|++.|||...+-..|. + T Consensus 403 g~~~~~s~~~f~~v~l~~~kla~~~~iS~ell~ds~---~~~~~~i~~~l~~aia-~~~d~a~l~g~g~~~~~~~p~-g- 476 (645) T protein:vir:93 403 GKTKPLTKFDFESITFSHAKVSAIAVLTEELIRFSS---PAADALVRNALAEAVV-ARLDTDFVDPKKAAVADVSPA-S- 476 (645) T ss_pred CccccccccceeEEEEeeEEEEEeehhHHHHHhhch---HHHHHHHHHHHHHHHH-HHHHHHhhcCCCcccCCcccc-c- Confidence 7788999999999999998888776662 3334332 3568999999999999 799999999998754333332 1 Q ss_pred hhhhhhhhhhhhccCCCCCHHHHHHhh-ce--ecccccceEEEEecchhHHHHhhhhhccccccceeecCCcceeeh-hh Q lcl|Aclame:pro 258 VKKIKKITTKAKSAGKTPFADAIEEAV-DF--VRPTAGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASE-VG 333 (400) Q Consensus 258 ~~~ik~it~~at~~~~T~~~dal~Eal-d~--~~~~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~-v~ 333 (400) +. -..+++..+++. ..++...+ .+ +.......+.++++.+..+|+. ++|.+|.+.+| ++ T Consensus 477 i~----~~~~~~~~~~~~-~~d~~~~~~~~~~a~~~~~~a~~vmn~~~~~~L~~------------lkd~~G~~~~~~~~ 539 (645) T protein:vir:93 477 IT----HDVKGTASSGNP-DADAEAAFGQFVAANLQPTGAVWLMSSTNALALSM------------RKNALGQKEYPDMT 539 (645) T ss_pred ee----ccccccccccch-HHHHHHHHHHHHhcCCCccccEEEEcHHHHHHHHh------------ccccCCceeecCCC Confidence 00 011122222232 23333333 12 1111122467889999988877 88889988764 22 Q ss_pred ccccceecchhhh----chhh--hhc---------ccceechhhceec-------------cceee--eecCceEEEEec Q lcl|Aclame:pro 334 VDEIIVYTGSKAL----KPTV--LVD---------QKYHIDMQDLTKV-------------DAFEW--KTNSNMILVETL 383 (400) Q Consensus 334 v~~~~~~tg~k~l----~~t~--~vd---------~k~~~~~~~~~~~-------------~~~~~--~~~~~~ilve~~ 383 (400) ...-++. |-... +|.. +.| .-..|++++..+. +.... -|.+||+.+=.. T Consensus 540 ~~~~tL~-G~PV~~s~~vp~~~~~gd~s~~~ig~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~ 618 (645) T protein:vir:93 540 LLGGSFQ-GLPVIVSQYVGDQLVLVNAPDIYLADDGGVAVDMSREASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAE 618 (645) T ss_pred CCCceee-ceeeEEeccCCcceeEeccccEEEEEecceEEEeecceeEEEeecccccccccccccchhHhhcCceEEEEE Confidence 1111110 21111 1221 111 1112222211111 00000 034555554443 Q ss_pred ccccceeeccceeEeeC Q lcl|Aclame:pro 384 TSGHVETYNAGAVITVS 400 (400) Q Consensus 384 ~~~~~~~~~~~~~~~~~ 400 (400) ..-..--++..|+..+. T Consensus 619 ~r~d~~~~~p~a~~~lt 635 (645) T protein:vir:93 619 RWINWRRRRTAAVAVIT 635 (645) T ss_pred EEEcceeeCccceEEEe Confidence 33333345566655554 No 95 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=98.34 E-value=4.5e-07 Score=55.37 Aligned_cols=339 Identities=13% Similarity=0.135 Sum_probs=144.8 Q ss_pred cchhhHHHHHHHHHHHHHHHHHhhhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHhhhcchhHHHHHH Q lcl|Aclame:pro 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEKPKGKDKMTNFI 87 (400) Q Consensus 8 ~~k~~~eekq~~lA~lKe~~~~~Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kEk~K~k~emtEfL 87 (400) |... +.+. ++.++|+..++...++.-+. +.++....+++ +..++.++.+ +. +.++.+ T Consensus 1 M~~k-l~~~---~~~~~e~~~~l~~~~~~~~~--~~~~~~~~~~~---~~~~~~~~~~---------~~---~~~~~~-- 57 (383) T protein:vir:78 1 MTIK-LKNN---LANYEEKRTAFVNAVKNEDT--QEIQNKAYVEM---VDAMAADIME---------QA---KKEARQ-- 57 (383) T ss_pred Cchh-HHHH---HHHHHHHHHHHHHHHhccCh--HHHHHHHHHHH---HHHHHHHHHH---------HH---HHHHHH-- Confidence 4322 2222 23333333333222221100 00110111111 1111111110 00 001111 Q ss_pred HHHHHHHHHHHHHHHccCC----hhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceee Q lcl|Aclame:pro 88 ESQNAVTEFFDVLKKNSGK----SEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLV 163 (400) Q Consensus 88 kTkqA~~dya~ll~~nqg~----ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~ 163 (400) ...+.. ....|- .+-++.+. .+...+ +.|--.++|..+...|-..+.+++++++...|.+..... T Consensus 58 -~~~~~~------~~~~g~~~lt~~e~~~~~-~~~~~~--~~~gg~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~~~~~- 126 (383) T protein:vir:78 58 -EADAYI------SASRTDKNITNEEIKFFN-DINKEV--GYKEETLLPQTVVDEIFEDLTTEHPFLASIGMRTTGLRT- 126 (383) T ss_pred -HHHHHH------HhcCChhhhhHHHHHHHH-HHhccC--CCCCccccCHHHHHHHHHHHHhhccceeeeeeEecCCce- Confidence 111111 111111 12222222 122222 234445899999999999999999999977777766543 Q ss_pred EEee-cc-ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhccee Q lcl|Aclame:pro 164 SRSF-DS-ANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALV 241 (400) Q Consensus 164 ~i~l-~n-a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv 241 (400) .+-. .+ ....||.-.+.-++....+|...++.+..+|.+-.+..-+ |+-+.-++-+|+.++|+..|- ++.+.|++ T Consensus 127 ~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~i~is~el--l~Ds~~~ie~~i~~~l~~~~a-~~~~~a~i 203 (383) T protein:vir:78 127 KFLKSETSGVAVWGKIFGEIKGQLDATFSDEESIQNKLTAFVVVPKDL--EKFGPAWVKRFVVTQIEEAFA-VALESAYI 203 (383) T ss_pred EEEEEcCCcceEEeecccccccccCcceeeEeecceeeEeeccchHHH--hhccHHHHHHHHHHHHHHHHH-HHHhhheE Confidence 3333 22 3566877666666667789999999988777775553333 333345788999999999999 79999999 Q ss_pred eccCCCccccchhhhhhhhhhhhhh----hhhccCCCCCHH--HHHHhh-------ceec----ccccce-EEEEecchh Q lcl|Aclame:pro 242 EGDGTNGFKSIDKEADVKKIKKITT----KAKSAGKTPFAD--AIEEAV-------DFVR----PTAGRR-YLIVKAEDR 303 (400) Q Consensus 242 ~gDG~~~t~~~~~e~D~~~ik~it~----~at~~~~T~~~d--al~Eal-------d~~~----~~~~~~-~l~~~~~d~ 303 (400) .|||+.- ..-+..++-.....+. ..+..+.+.+.+ .+...| +|.. ..+-++ ..++++.|. T Consensus 204 ~G~G~~q--P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~ 281 (383) T protein:vir:78 204 VGDGNDK--PIGLNRKVGKGSTVVDGVYAEKAATGTLTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDA 281 (383) T ss_pred eccCCCC--ceeeeeccCCcccccccccccccccchhhhhhhHHHHHHHHHHHhccchhcccchhhhcCceEEEEcCcch Confidence 9999742 1111111100000000 001111111111 111111 1111 011111 234444332 Q ss_pred HHHHhhhhhccccccceeecCCcceeehhhccccceecchhhhchhhhhcccceechhhceeccceeeee--------cC Q lcl|Aclame:pro 304 KALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKT--------NS 375 (400) Q Consensus 304 ~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~tg~k~l~~t~~vd~k~~~~~~~~~~~~~~~~~~--------~~ 375 (400) -.+.- -.-..+.+|.++...|.+-.++- +..| |.. .=.--|.+.|.-.+..++.+ -. T Consensus 282 ~~~~~---------~~~~~~~~G~~~t~l~~~~~iv~--s~~~-p~~---~iifgdfs~Y~i~~r~~~~i~~~~~~~f~~ 346 (383) T protein:vir:78 282 WDVKK---------QYTSLNANGVYVTALPFNLNIIE--SLFV-PEK---KAISYVAERYDALIGGPLDIGTYDQTLAIE 346 (383) T ss_pred hhhcc---------chhccCCCCceeeecCCCceEEe--cCCC-Ccc---cEEEeeccceEEEecccceEEecchhhhhc Confidence 11110 01134567887765554433321 1111 211 01122444444443333222 22 Q ss_pred ceEEEEecccccceeeccce--eEeeC Q lcl|Aclame:pro 376 NMILVETLTSGHVETYNAGA--VITVS 400 (400) Q Consensus 376 ~~ilve~~~~~~~~~~~~~~--~~~~~ 400 (400) +++.+=...-.--..+|..| |++++ T Consensus 347 d~~~f~~~~r~dG~~~~~~A~~vl~~~ 373 (383) T protein:vir:78 347 DLNLYAAKQFAYGKAKDDKAAAVWTLN 373 (383) T ss_pred CceEEEEEEEEcCEEecCCeEEEEEEE Confidence 22222222222223344444 54555 No 96 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=98.33 E-value=8.1e-09 Score=64.95 Aligned_cols=278 Identities=11% Similarity=0.015 Sum_probs=140.1 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEeecc-ccccceeccc Q lcl|Aclame:pro 101 KKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-ANEAQVHKDG 179 (400) Q Consensus 101 ~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~g 179 (400) |..+.+-+.. ...+...+- ++--..+|..+..-|-+.+.++..+++...+.+.+.....+-..+ ...+.-+.-| T Consensus 1 ~~~~~~~~~~---~~~~~~t~~--~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~E~ 75 (320) T protein:vir:10 1 MAAGTAFQVD---HAQIAQTGD--TMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWIGDVSAQWIGEG 75 (320) T ss_pred CCCCccCCHH---HHHhhcccc--ccccccccHHHHHHHHHHHHhccchhhhcceeeccCCceEEEEEeCCcceEEecCC Confidence 3333221222 222333321 222225898888888888888888888666666554444443333 3344445678 Q ss_pred chhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhh Q lcl|Aclame:pro 180 QTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVK 259 (400) Q Consensus 180 a~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~ 259 (400) +++.+...+|...++.|..++....+-+-+- +.+.-++.+|+.++|++++- |.++.+++.|||.+............ T Consensus 76 ~~~~~~~~~f~~v~~~~~k~~~~~~is~ell--~ds~~~l~~~i~~~l~~a~a-~~~d~a~l~G~g~~~~~~~~~~~~~~ 152 (320) T protein:vir:10 76 DMKPITKGNMTSQNIAPHKIATIFVASAETV--RANPANYLGTMRTKVATAFA-MAFDSAALNGTDSPFPTYLAQTTKSV 152 (320) T ss_pred ccccccccceeEEEEeeEEEEEeehhhHHHH--hcChHHHHHHHHHHHHHHHH-HHHHHHhhcccCCCCCcccccccccc Confidence 8899999999999999988877766633322 22335789999999999999 79999999999975422222111111 Q ss_pred hhhhhhhhhhccCCCCCHH-HHHHhhc-eecccccceEEEEecchhHHHHhhhhhccccccceeecCCcceeehhhcccc Q lcl|Aclame:pro 260 KIKKITTKAKSAGKTPFAD-AIEEAVD-FVRPTAGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEI 337 (400) Q Consensus 260 ~ik~it~~at~~~~T~~~d-al~Eald-~~~~~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~ 337 (400) .. .++...........+ .+..++. ....-....++++|+.++.+|+. ++|.+|.+.+.-.+... T Consensus 153 ~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~------------lkd~~G~~l~~~~~~~~ 218 (320) T protein:vir:10 153 SL--ADPGGATASDLTAYDAVAVNGLSLLVNAKKKWTHTLLDDIVEPILNG------------AKDKNGRPLFIESTYTD 218 (320) T ss_pred cc--eecccccccccccHHHHHHHHHhhhhcccCCCcEEEEcHHHHHHHHH------------hhccCCceeeccccccC Confidence 00 001111111111222 2333331 11122334578999999999987 78888887664322111 Q ss_pred --ceecchhh----hchhhhhcc-cce---echhhceeccceee----------------------eecCceE--EEEec Q lcl|Aclame:pro 338 --IVYTGSKA----LKPTVLVDQ-KYH---IDMQDLTKVDAFEW----------------------KTNSNMI--LVETL 383 (400) Q Consensus 338 --~~~tg~k~----l~~t~~vd~-k~~---~~~~~~~~~~~~~~----------------------~~~~~~i--lve~~ 383 (400) .-..|-.- .+++..++. ++. .|++.+.-....++ -|.+|++ .++.- T Consensus 219 ~~~~~~~~~i~g~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~ 298 (320) T protein:vir:10 219 ENSPFRAGRIVSRPTILSDHVADGTTVGYMGDFRNVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAE 298 (320) T ss_pred ccccccCceeeeeeeEecCCCCCCceEEEEeecceEEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEe Confidence 00001111 112222221 111 23333321111110 0223333 33333 Q ss_pred ccccceeeccceeEeeC Q lcl|Aclame:pro 384 TSGHVETYNAGAVITVS 400 (400) Q Consensus 384 ~~~~~~~~~~~~~~~~~ 400 (400) -.+.|.--.|=++|+.- T Consensus 299 ~d~~v~~~~a~~~l~~~ 315 (320) T protein:vir:10 299 YAFHNNDKDAFVKLTNV 315 (320) T ss_pred eccEEecccceEEEEec Confidence 44444333333333321 No 97 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=98.29 E-value=1e-08 Score=64.37 Aligned_cols=268 Identities=15% Similarity=0.076 Sum_probs=139.3 Q ss_pred cCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEeecc-ccccceecccchh Q lcl|Aclame:pro 104 SGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-ANEAQVHKDGQTK 182 (400) Q Consensus 104 qg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~ga~K 182 (400) -|. ++........+- ++....||..+...|=..++++.++++.+.+.+.+.....+-..+ ...+.-+--|..+ T Consensus 1 ~g~----~~e~~~~~~~~t--~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~wv~Eg~~~ 74 (397) T protein:vir:23 1 MGF----SADHSQIAQTKD--TMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGIVIPHWTGDVSAQWIGEGDMK 74 (397) T ss_pred CCc----CHHHHHHhhccC--CCCccccchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcCCcceEEecCCccc Confidence 222 111222233322 222335777766666666677788888777666665544443333 3344445678889 Q ss_pred hhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhh Q lcl|Aclame:pro 183 TEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIK 262 (400) Q Consensus 183 k~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik 262 (400) ++....|...++.|..++....+.+-+- +.+.-++.+|++++|+..+- |.++.+++.|||.+.. .....+ T Consensus 75 ~~s~~~f~~v~l~~~k~~~~v~iS~ell--~ds~~~l~~~i~~~l~~aia-~~~d~a~l~G~gt~~~--~~~~~~----- 144 (397) T protein:vir:23 75 PITKGNMTKRDVHPAKIATIFVASAETV--RANPANYLGTMRTKVATAIA-MAFDNAALHGTNAPSA--FQGYLD----- 144 (397) T ss_pred cccccceeEEEEeeEEEEEeehhhHHHH--hcchHHHHHHHHHHHHHHHH-HHHHHHHhhcccCCcc--cccccc----- Confidence 9999999999999988777766633322 23335789999999999999 7999999999998641 111111 Q ss_pred hhhhhhhccCCCCCHHHHHHhhceeccccc-ceEEEEecchhHHHHhhhhhccccccceeecCCcceeehhhccccceec Q lcl|Aclame:pro 263 KITTKAKSAGKTPFADAIEEAVDFVRPTAG-RRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYT 341 (400) Q Consensus 263 ~it~~at~~~~T~~~dal~Eald~~~~~~~-~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t 341 (400) .+........+...+.+..++.-..+... ...+++++.++.+|+- ++|.+|++.+.-......... T Consensus 145 -~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~------------lkd~~G~~i~~~~~~~~~~~~ 211 (397) T protein:vir:23 145 -QSNKTQSISPNAYQGLGVSGLTKLVTDGKKWTHTLLDDTVEPVLNG------------SVDANGRPLFVESTYESLTTP 211 (397) T ss_pred -cccceeeecccchhHHHHHHHHhhhhcccCCCEEEEcHHHHHHHHH------------hhccCCceeeccccccccccc Confidence 11112223334455556665532222222 2468899999988887 888889887754443322211 Q ss_pred ch-hhh--chhhhhcc----cc---eechhhceecccee------------------------eeecCceEEEEeccccc Q lcl|Aclame:pro 342 GS-KAL--KPTVLVDQ----KY---HIDMQDLTKVDAFE------------------------WKTNSNMILVETLTSGH 387 (400) Q Consensus 342 g~-k~l--~~t~~vd~----k~---~~~~~~~~~~~~~~------------------------~~~~~~~ilve~~~~~~ 387 (400) +. ..+ .|-++.|. +. --|++.+.-....+ |..++-.+.++.--.+- T Consensus 212 ~~~~tl~G~Pv~~s~~~~~g~~~~~~gDfs~~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~ 291 (397) T protein:vir:23 212 FREGRILGRPTILSDHVAEGDVVGYAGDFSQIIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLL 291 (397) T ss_pred ccCceeeeeeEEEeCCCCCCceEEEEeecceEEEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccc Confidence 10 011 12221111 11 11333222111111 12222222233322222 Q ss_pred ceeeccceeEeeC Q lcl|Aclame:pro 388 VETYNAGAVITVS 400 (400) Q Consensus 388 ~~~~~~~~~~~~~ 400 (400) |-.-++-..+... T Consensus 292 v~~~~a~~~~~~~ 304 (397) T protein:vir:23 292 INDVNAFVKLTFD 304 (397) T ss_pred eecccceEEEeec Confidence 2222221222211 No 98 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=98.25 E-value=4.7e-09 Score=66.25 Aligned_cols=264 Identities=14% Similarity=-0.001 Sum_probs=132.4 Q ss_pred HHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEeecc-ccccceecccchhhhhhhhhhhhhcc Q lcl|Aclame:pro 117 LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-ANEAQVHKDGQTKTEQAATLTIDTLE 195 (400) Q Consensus 117 L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~ga~Kk~q~~~le~~ti~ 195 (400) ++ . .+++.-..+|+.+..-|-..++++..+.+..++...+.....+-..+ ...+.-.--|.++.+...+|...++. T Consensus 1 Ma--t-~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v~l~ 77 (311) T protein:vir:99 1 MA--T-FGTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNEDIITFNGRPKAEFVGEGQQKSSTTGEFDFVTST 77 (311) T ss_pred Cc--e-ecCCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeCCceeEEeecCcccccccceeeEEEEe Confidence 11 1 11222236898888888888888888877666665555444443332 22333345577888899999999999 Q ss_pred HHHHHHHHHH-HHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhh--hccC Q lcl|Aclame:pro 196 PVMVYKLQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKA--KSAG 272 (400) Q Consensus 196 p~~VYkkq~L-ad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~a--t~~~ 272 (400) |..++..-.+ .+++...+-+.-++.+|+.++|++.+- +..+.++..|||...-....-..- .+...+... +..+ T Consensus 78 ~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~-~~~d~~~l~G~g~~~g~~~~g~~~--~~~~~~~~~~~~~~~ 154 (311) T protein:vir:99 78 PKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALA-RALDLGLYHRINPLTGTVIPGWSN--YLGAASKRVELTADT 154 (311) T ss_pred eEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHH-HHHHHHhhcccCcccCcccccccc--ccccccceeeccccc Confidence 9877777666 333333333345689999999999999 799999999988532111111100 000011111 1111 Q ss_pred CCCCHHHHHHhhce---ecccccceEEEEecchhHHHHhhhhhccccccceeecCCcceeehhhccccce--ecchhhhc Q lcl|Aclame:pro 273 KTPFADAIEEAVDF---VRPTAGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIV--YTGSKALK 347 (400) Q Consensus 273 ~T~~~dal~Eald~---~~~~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~--~tg~k~l~ 347 (400) .+...+.+.-++.- +.-......+++|+.++.+|+- ++|.+|.+.+.-....... .-|....+ T Consensus 155 ~~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~------------lkd~~G~~l~~~~~~~~~~~~l~G~Pv~~ 222 (311) T protein:vir:99 155 IANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLST------------ARYTDGRKKFPELGLGIGVSSFEGIDASV 222 (311) T ss_pred cchhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHh------------hhccCCCeeecCcccCCCCceecceeeEe Confidence 12222233333322 1112222348999999998877 7888888775321111000 01322221 Q ss_pred hh--------------hhhccc---ceechhhceec----------------cceeeeecCceE--EEEecccccceeec Q lcl|Aclame:pro 348 PT--------------VLVDQK---YHIDMQDLTKV----------------DAFEWKTNSNMI--LVETLTSGHVETYN 392 (400) Q Consensus 348 ~t--------------~~vd~k---~~~~~~~~~~~----------------~~~~~~~~~~~i--lve~~~~~~~~~~~ 392 (400) +. +....+ +--|.+.++.. +....-|..||| .+|.--.+.|- + T Consensus 223 s~~i~~~~~~~~~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~--~ 300 (311) T protein:vir:99 223 SDTVNGGDEADPDDEDLDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYGWYVF--T 300 (311) T ss_pred ecccccccccccccchhhccCcceEEEeeccccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEeecceec--C Confidence 11 000000 00122221111 111111333444 44555555552 2 Q ss_pred cceeEeeC Q lcl|Aclame:pro 393 AGAVITVS 400 (400) Q Consensus 393 ~~~~~~~~ 400 (400) .++|.... T Consensus 301 ~~~v~~~~ 308 (311) T protein:vir:99 301 DRFVVIEN 308 (311) T ss_pred hhHeeeec Confidence 33332222 No 99 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=98.18 E-value=2.3e-08 Score=62.42 Aligned_cols=260 Identities=15% Similarity=0.059 Sum_probs=134.9 Q ss_pred chhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEeecc-ccccceecccchhhhhhhhhhhhhccHHHHHH Q lcl|Aclame:pro 123 TITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-ANEAQVHKDGQTKTEQAATLTIDTLEPVMVYK 201 (400) Q Consensus 123 ~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYk 201 (400) -.++.-.++|..+...|-..++++..+++..++...+.....+--.+ .-.+.-+--|+++.+...+|...++.|..++. T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~ 80 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMVPIKVEY 80 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCcceEEeeCCccccccccceeEEEEeeeEEEE Confidence 22333446898888888788888888888777666655444443322 23344466788999999999999999988877 Q ss_pred HHHHH-HHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeec----cCCCccccchhhhhhhhhhhhhhhhhccCCCCC Q lcl|Aclame:pro 202 LQSLA-ERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG----DGTNGFKSIDKEADVKKIKKITTKAKSAGKTPF 276 (400) Q Consensus 202 kq~La-d~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~g----DG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~ 276 (400) .-.+- +++.........+..|+.++|+..+- |.++.++..| +|.+..... .-+..-........+ ...+.. T Consensus 81 ~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~-~~~d~~~l~G~~~~~g~~~~~~~-~~~~~~~~~~~~~~~--~~~~~~ 156 (298) T protein:vir:94 81 GARISDEFMYASDEEKINILQAFNDGFAKKVA-RGIDLMAFHGVNPRLGTASAVIG-TNHFDSKVTQKVEAP--RGIADP 156 (298) T ss_pred eeehhHHHhccCCccHHHHHHHHHHHHHHHHH-HHHHHHhhcccccCCCccccccc-ccccccccccccccc--cccccH Confidence 76663 33333334455788999999999998 8999999998 333221111 011111011011111 111222 Q ss_pred HHHHHHhhceecc-cccceEEEEecchhHHHHhhhhhccccccceeecCCcceeehhhccccceec--chhhh----chh Q lcl|Aclame:pro 277 ADAIEEAVDFVRP-TAGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKAL----KPT 349 (400) Q Consensus 277 ~dal~Eald~~~~-~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l----~~t 349 (400) .+++..++.-..+ -.....+++|+.++.+|+. ++|.+|.+.+.=+.-.-..+| |.... +|. T Consensus 157 ~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~------------lkd~~G~~l~~~~~~~~~~~tl~G~PV~~~~~v~~ 224 (298) T protein:vir:94 157 NGAIENAVELLTGVDADVTGIAINPSFRSALAK------------QKDLQGNALFPELKWGATPDTINGLPVDVNKTVSD 224 (298) T ss_pred HHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHH------------hhccCCCeeecCcccCCCCceecceeeEEeccccc Confidence 3456666643333 2233469999999999987 788888877532111111111 32111 010 Q ss_pred hhhccc---ceechhhceeccc---eee--------------eecCceEEE--EecccccceeeccceeEeeC Q lcl|Aclame:pro 350 VLVDQK---YHIDMQDLTKVDA---FEW--------------KTNSNMILV--ETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 350 ~~vd~k---~~~~~~~~~~~~~---~~~--------------~~~~~~ilv--e~~~~~~~~~~~~~~~~~~~ 400 (400) ..-..+ +.-|++.+...+- ..+ -|.+|+|.+ |.--.+.+. ++.|++.+. T Consensus 225 ~~~~~~~~~~~Gdfs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~--~~~a~~~l~ 295 (298) T protein:vir:94 225 MSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGIL--DATKFARVT 295 (298) T ss_pred ccCCCccEEEEeeccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccEee--cccceEEEE Confidence 000000 1112222111110 000 123344333 333333333 333444343 No 100 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=98.17 E-value=2.6e-08 Score=62.15 Aligned_cols=265 Identities=13% Similarity=0.052 Sum_probs=135.0 Q ss_pred HHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEeecc-ccccceecccchhhhhhhhhhhhhcc Q lcl|Aclame:pro 117 LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-ANEAQVHKDGQTKTEQAATLTIDTLE 195 (400) Q Consensus 117 L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~ga~Kk~q~~~le~~ti~ 195 (400) +++.- ++.--++|..+..-|-..+.++..+.....+...+.....+--.+ .-.+.=.--|.++++...+|...++. T Consensus 1 ma~~t---~~~G~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~ 77 (300) T protein:vir:95 1 MSEAQ---LSKGNLFNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQREFVFDFDSDIDIVAENGKKTHGGVSLDPVTIV 77 (300) T ss_pred Ccccc---cCCcceechhhHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcceEEeeCCcccccccccceeeEee Confidence 22111 111115788777777777777777765455554443333322222 22232334578899999999999999 Q ss_pred HHHHHHHHHH-HHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCC Q lcl|Aclame:pro 196 PVMVYKLQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKT 274 (400) Q Consensus 196 p~~VYkkq~L-ad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T 274 (400) |..++....+ .++.......+-++.+|+.++|++++- |..+.++..|+|..--......+....- ...+.....+.+ T Consensus 78 ~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia-~~~d~~~l~G~~~~~g~~~~~~~~~~~~-~~~~~~~~~~~~ 155 (300) T protein:vir:95 78 PLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLA-RGLDIMSIHGINPRTKQASTIIGDNCFD-KKVTQTVPFKDT 155 (300) T ss_pred eEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHH-HHHHHhhhhcccCCCCCCcccccccccc-cccceeeccccc Confidence 9888777666 334444456677899999999999999 7999999999532111111111211100 111111122234 Q ss_pred CCHHHHHHhhceeccc-ccceEEEEecchhHHHHhhhhhccccccceeecCCcceeehhhc----cccceecchhhh--- Q lcl|Aclame:pro 275 PFADAIEEAVDFVRPT-AGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGV----DEIIVYTGSKAL--- 346 (400) Q Consensus 275 ~~~dal~Eald~~~~~-~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v----~~~~~~tg~k~l--- 346 (400) ...+++..++.-.... ..-..+++|+.++.+|+. ++|.+|.+.++-.. ..+.. |.... T Consensus 156 ~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~------------lkd~~G~~i~~~~~~~~~~~~l~--G~Pv~~s~ 221 (300) T protein:vir:95 156 NPDESMEDAVGMIDGSERDITGAILDPIFTTALSK------------MKNAEGGKLYPELAWGGVPDAIN--GLAVDKNR 221 (300) T ss_pred chHHHHHHHHHHhhhcCCCccEEEECHHHHHHHHH------------hhccCCCeeccCccccCCCceec--ceeeEEec Confidence 4456676666322221 222358999999999987 88999988763211 11111 32211 Q ss_pred -chhhhhccc---ceechhhceeccce---eee--------------ecCceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 347 -KPTVLVDQK---YHIDMQDLTKVDAF---EWK--------------TNSNMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 347 -~~t~~vd~k---~~~~~~~~~~~~~~---~~~--------------~~~~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) +|+..-+.+ +.-|++.+...+-+ +++ |.+||+.+=....--....+..|++.+. T Consensus 222 ~v~~~~~~~~~~~~~GDf~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~ 296 (300) T protein:vir:95 222 TVSYSQTDPKNTAIVGDFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIV 296 (300) T ss_pred CCCCCCCCCccEEEEeeccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecceeecccceEEEe Confidence 111100001 11123221111110 000 3344443333222223334455555554 No 101 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=98.15 E-value=3.5e-08 Score=61.47 Aligned_cols=263 Identities=14% Similarity=0.031 Sum_probs=132.1 Q ss_pred HHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEeecc-ccccceecccchhhhhhhhhhhhhcc Q lcl|Aclame:pro 117 LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-ANEAQVHKDGQTKTEQAATLTIDTLE 195 (400) Q Consensus 117 L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~ga~Kk~q~~~le~~ti~ 195 (400) ++..|= .++|..+..-|-..++++..+.+...+...+.....+-..+ .-.+.-+--|+++++...+|...++. T Consensus 1 ma~~gG------~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~~~~a~~v~E~~~~~~~~~~f~~v~l~ 74 (298) T protein:vir:16 1 MVLNKG------TLFDPTLVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTMV 74 (298) T ss_pred CcccCc------ceechhHHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEecCcceEEecCCccccccccceeEEEEe Confidence 333321 24787777777777777777777666555554444443333 33455556677999999999999999 Q ss_pred HHHHHHHHHH-HHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCCC Q lcl|Aclame:pro 196 PVMVYKLQSL-AERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKT 274 (400) Q Consensus 196 p~~VYkkq~L-ad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T 274 (400) |..++....+ .+++.....++-++.+|+.++|+..+- |.++.++..|+|...-...+..+...-....+........+ T Consensus 75 ~~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~-~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~ 153 (298) T protein:vir:16 75 PIKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVA-RGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGI 153 (298) T ss_pred eeeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHH-HHHHHHhhccccCCCCccccccccccccccccccccccccc Confidence 9888887777 344444555566889999999999998 89999999994322111222222111011111111111112 Q ss_pred CC-HHHHHHhhceeccc-ccceEEEEecchhHHHHhhhhhccccccceeecCCcceeehhhccccceec--chhhh---- Q lcl|Aclame:pro 275 PF-ADAIEEAVDFVRPT-AGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKAL---- 346 (400) Q Consensus 275 ~~-~dal~Eald~~~~~-~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l---- 346 (400) .. .+++..++.-..+. ..-..+++|+.++.+|+. ++|.+|++.++=+...-...| |-... T Consensus 154 ~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~------------lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~ 221 (298) T protein:vir:16 154 ADPNGAIENAVELLTGVDADVTGIAINPSFRSALAK------------QKDLQDNALFPELKWGATPDTINGLPVDVNKT 221 (298) T ss_pred ccHHHHHHHHHHHhhhcCCCccEEEEcHHHHHHHHH------------hhccCCCeeecCcccCCCCceecceeeEEecc Confidence 21 23455554322221 112248889999999987 788888887642211111111 32111 Q ss_pred chhhhhccc---ceechhhceecccee---e--------------eecCceEE--EEecccccceeeccceeEeeC Q lcl|Aclame:pro 347 KPTVLVDQK---YHIDMQDLTKVDAFE---W--------------KTNSNMIL--VETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 347 ~~t~~vd~k---~~~~~~~~~~~~~~~---~--------------~~~~~~il--ve~~~~~~~~~~~~~~~~~~~ 400 (400) +|+..-..+ +.-|++.+...+-++ + -|.+|||- +|.--.+- ..++.|++.+. T Consensus 222 v~~~~~~~~~~~~~GDfs~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~--v~~~~a~~~l~ 295 (298) T protein:vir:16 222 VSDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWG--ILDATKFARVT 295 (298) T ss_pred cccccCCCccEEEEeeccceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccE--eecccceEEEe Confidence 111000001 111222211111000 0 02223322 22222221 22333333333 No 102 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=98.13 E-value=2.4e-08 Score=62.34 Aligned_cols=266 Identities=15% Similarity=0.033 Sum_probs=134.0 Q ss_pred HHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEeecc-ccccceecccchhhhhhhhhhhh Q lcl|Aclame:pro 114 SAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-ANEAQVHKDGQTKTEQAATLTID 192 (400) Q Consensus 114 ~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~ga~Kk~q~~~le~~ 192 (400) -+-+..-| .++|..+...|=+.++++.++.+...+.+.+...+.+--.+ ...+.=+.-|+++.+...+|... T Consensus 1 mat~~~gg-------~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v 73 (311) T protein:vir:81 1 MVALATGT-------FQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTAPPRGEVVGEGAQKSESTATFAPV 73 (311) T ss_pred CceecCCc-------eEcchhHHHHHHHHHHhcchhhhhcceeecCCCceEEEEEeCCceeEEeecCcccccccceeeEE Confidence 11111112 26898888888888888788887666766665555554333 33343356788999999999999 Q ss_pred hccHHHHHHHHHHH-HHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCcc-ccchhhhhhhhhhhhhhhhhc Q lcl|Aclame:pro 193 TLEPVMVYKLQSLA-ERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGF-KSIDKEADVKKIKKITTKAKS 270 (400) Q Consensus 193 ti~p~~VYkkq~La-d~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t-~~~~~e~D~~~ik~it~~at~ 270 (400) ++.|..++.+.++. +++.......-.+.+|+.++|+..+- |..+.++..|+|.... ........+.......+.+ . T Consensus 74 ~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~-~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~-~ 151 (311) T protein:vir:81 74 TAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALG-RALDLIGIHGINPLTGAALSGSPAKILDTTNIVELT-T 151 (311) T ss_pred EEeeEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHH-HHHHHhhhccccCCCCcccccccccccccceeeeec-c Confidence 99998887776663 33433334445689999999999987 7999999999652111 1111112221111111111 1 Q ss_pred cCCCCCHHHHHHhhceecccc-cceEEEEecchhHHHHhhhhhccccccceeecCCcceeehhhccccceec--chhhhc Q lcl|Aclame:pro 271 AGKTPFADAIEEAVDFVRPTA-GRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYT--GSKALK 347 (400) Q Consensus 271 ~~~T~~~dal~Eald~~~~~~-~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~t--g~k~l~ 347 (400) .+.+..-..+..+++-...-. .-..+++|+.++.+|+. |+|.+|++.+.-....-...| |....+ T Consensus 152 ~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~------------lkd~~G~~l~~~~~~~~~~~tl~G~Pv~~ 219 (311) T protein:vir:81 152 GTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLAT------------QRDSQGRKLYPELGFGTDVASFAGLNAAV 219 (311) T ss_pred cccchHHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHHh------------hhccCCCeeecCccccCCCceecceeEEe Confidence 111111122333333222211 22348999999999987 889999887532110000000 221111 Q ss_pred hhhhh---------------cc-c---ceechhhceeccceee---------------eecCceEEE--Eecccccceee Q lcl|Aclame:pro 348 PTVLV---------------DQ-K---YHIDMQDLTKVDAFEW---------------KTNSNMILV--ETLTSGHVETY 391 (400) Q Consensus 348 ~t~~v---------------d~-k---~~~~~~~~~~~~~~~~---------------~~~~~~ilv--e~~~~~~~~~~ 391 (400) ..-+- +. + +.-|++.|.-....++ -|.+|+|.+ +.--.+.|--- T Consensus 220 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~ 299 (311) T protein:vir:81 220 SDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMST 299 (311) T ss_pred cccccccccccccccchhcccCCccEEEEEecccEEEEEeccceEEEeccCCCCcchhhhhcCcEEEEEEEEeccEeecc Confidence 11000 00 0 2234443332111111 033344443 22223333222 Q ss_pred ccceeEeeC Q lcl|Aclame:pro 392 NAGAVITVS 400 (400) Q Consensus 392 ~~~~~~~~~ 400 (400) +|=++++-- T Consensus 300 ~a~~~l~~a 308 (311) T protein:vir:81 300 DAFAVVRDA 308 (311) T ss_pred cceEEEEee Confidence 222222222 No 103 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=97.63 E-value=1.3e-06 Score=52.80 Aligned_cols=313 Identities=14% Similarity=0.129 Sum_probs=136.0 Q ss_pred hhhhhcchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhHh-hhcchhHHHHHHHHHHHHHHHHHHHHHccCC--h Q lcl|Aclame:pro 31 KSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE-KPKGKDKMTNFIESQNAVTEFFDVLKKNSGK--S 107 (400) Q Consensus 31 Ks~i~~~~~~~~~~~~skieElektis~l~aEi~K~enEl~~~kE-k~K~k~emtEfLkTkqA~~dya~ll~~nqg~--k 107 (400) -.-.+++.++..... +.-+. ..| ++..+..+..|.+. ++...|. . T Consensus 1 ~a~~~a~~~~~~~~~--------------~~~~~--------~~~~~~~kg~~~~~~~~a----------~a~~~g~~~~ 48 (366) T protein:vir:57 1 MAAAVAVPVKAHSVA--------------PGIII--------KEELQQYKGAGMTRMVMS----------IAAGKGNLAD 48 (366) T ss_pred Ccccccccccccccc--------------ccccc--------ccccccccchhHHHHHHH----------HHhcccchhH Confidence 111112212111100 00000 001 11122223333321 2222222 0 Q ss_pred hHH---HHHHHHHHhCccchh--hhHhhcchhHHHHHHHHHHhhCccccceeeecc--cceeeEEeecc--ccccceecc Q lcl|Aclame:pro 108 EIK---NAWSAKLAENGVTIT--DTTFQLPRKLVESINTALLNTNPVFKVFHVTNV--GALLVSRSFDS--ANEAQVHKD 178 (400) Q Consensus 108 e~k---~AW~a~L~ekgV~~q--d~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~--~~~a~~i~l~n--a~~a~GHk~ 178 (400) ..+ ..+-+.-..+.+.++ +--.++|..+..-|-+.++++.. +..+....+ ..+.+.+--.+ ...+| ..- T Consensus 49 a~~~a~~~~~~~~~~~a~~~~~~~Gg~lvP~~~~~~ii~~l~~~s~-l~~lg~~~v~~~~g~~~~p~~t~~~~a~w-v~E 126 (366) T protein:vir:57 49 AAKFAATELGDTGLSMAISTAAGSGGALIPQNMQNEVIELLRDRTV-VRILGARSIPLPNGNLSMPRLSGGATAGY-VGE 126 (366) T ss_pred HHHHHHHhhcchhhhhhccccccCCccccchhHHHHHHHHHhhhcc-hhhhceeeeecCCCceEEEEEeCCcceee-ecc Confidence 111 111111111222211 11125798887777777886544 444432222 22323332222 23344 356 Q ss_pred cchhhhhhhhhhhhhccHHHHHHHHHHHHH-HHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhh Q lcl|Aclame:pro 179 GQTKTEQAATLTIDTLEPVMVYKLQSLAER-VKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEAD 257 (400) Q Consensus 179 ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~-~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D 257 (400) |+++.+...+|...++.|..++....+.+. +.+.. -++.+|+.++|+..+- |..+.+++.|||+.. +..-+... T Consensus 127 ~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~---~~~~~~i~~~l~~a~~-~~~d~a~l~G~G~~~-~p~Gi~~~ 201 (366) T protein:vir:57 127 GKDVVATGATFDDVKLSAKTMIALVPVSNQLIGRAG---FNVEQLLLGDILSAIA-TREDKAFLRDDGTGD-TPKGMKAV 201 (366) T ss_pred CccccccccceeEEEEeeEEEEEeehhhHHHHhhhh---HHHHHHHHHHHHHHHH-HHHHHHhhccCCCCc-cccceeec Confidence 778888999999999999888877777333 33332 3578999999999999 799999999999732 10001111 Q ss_pred hhhhhhh--hhhhhccCCCCCHHHHHHhhc--ee--cccccceEEEEecchhHHHHhhhhhccccccceeecCCcceeeh Q lcl|Aclame:pro 258 VKKIKKI--TTKAKSAGKTPFADAIEEAVD--FV--RPTAGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASE 331 (400) Q Consensus 258 ~~~ik~i--t~~at~~~~T~~~dal~Eald--~~--~~~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~ 331 (400) .. +... +.+++.. .....+++...++ +. .+-.+....++++.++.+|+. ++|.+|.+.++ T Consensus 202 ~~-~~~~~~~~~~t~~-~~~~~~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~------------lkd~~G~~l~~ 267 (366) T protein:vir:57 202 AT-AANRLVAWTGTAI-NLTTIDEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLFG------------LRDGNGNKVYP 267 (366) T ss_pred cc-cccceeecccccc-chhhHHHHHHHHHHhhhccccccccCEEEecHHHHHHHHh------------hhccCCceecc Confidence 00 0000 1111111 1112334444442 22 222344567899999988887 88888888875 Q ss_pred hhccccceecchhhh----chhhhh---cc--cceechhhceecccee---------------------eeecCceEEEE Q lcl|Aclame:pro 332 VGVDEIIVYTGSKAL----KPTVLV---DQ--KYHIDMQDLTKVDAFE---------------------WKTNSNMILVE 381 (400) Q Consensus 332 v~v~~~~~~tg~k~l----~~t~~v---d~--k~~~~~~~~~~~~~~~---------------------~~~~~~~ilve 381 (400) -.-+.+.. |.... +|..+- |+ =+-.|+++|.-.+..+ |..|+--|.+| T Consensus 268 ~~~~g~l~--G~Pvv~s~~ip~~~~~~~~~~~i~~gdfs~~~i~~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~ 345 (366) T protein:vir:57 268 EMSQGILK--GYPIQRTSAIPANLGDDGNESEIYFCDFNDVVIGEDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVV 345 (366) T ss_pred CCCCCeec--ceeeEEccccccccccCCCccEEEEEecceEEEEEecceEEEEeeccccccccccchhhhhcCceeEEee Confidence 22222211 32211 111110 00 1112333332111111 22222222233 Q ss_pred ecccccceeeccceeEeeC Q lcl|Aclame:pro 382 TLTSGHVETYNAGAVITVS 400 (400) Q Consensus 382 ~~~~~~~~~~~~~~~~~~~ 400 (400) .--.+.| .+..++..++ T Consensus 346 ~~~d~~v--~~~~a~~~lt 362 (366) T protein:vir:57 346 TEHDIGF--RHPEGLVLGT 362 (366) T ss_pred eeeCcEe--eccccEEEEe Confidence 2222222 3444554444 No 104 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=97.10 E-value=7.9e-06 Score=48.56 Aligned_cols=261 Identities=15% Similarity=0.092 Sum_probs=126.9 Q ss_pred HHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeEEeecc-ccccceecccchhhhhhhhhhhhhcc Q lcl|Aclame:pro 117 LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDS-ANEAQVHKDGQTKTEQAATLTIDTLE 195 (400) Q Consensus 117 L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~n-a~~a~GHk~ga~Kk~q~~~le~~ti~ 195 (400) +++. . +.+--..+|.-+..-|=..++++.++.+..++-..+.....+--.+ ...+.=+--|.+++.+..+|...++. T Consensus 1 Ma~~-~-~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~~~~ip~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~ 78 (315) T protein:vir:80 1 MADD-F-LSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSGVPRAKIVGEGEVKPSASVDVSAFTAQ 78 (315) T ss_pred CCCC-c-CCcCceEcchHHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeCCcceEEeeCCccccccccceeeeEee Confidence 2111 0 1111226888877777777777777776555554444433333333 23444455677889999999999999 Q ss_pred HHHHHHHHHH-HHHHHhhcCc-hhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCC Q lcl|Aclame:pro 196 PVMVYKLQSL-AERVKRLQMS-YSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGK 273 (400) Q Consensus 196 p~~VYkkq~L-ad~~k~l~g~-ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~ 273 (400) |..++....+ .++..+.... .+.|-+|+.++|+.++- |+++.|+.+|+|...-....-.... +.++++.... T Consensus 79 ~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~-~~~d~a~~~G~~~~~~~~~~~~~~~-----~~~~~~~~~~ 152 (315) T protein:vir:80 79 PIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIG-RAVDLIAFHGIDPATGKAASAVHTS-----LNKTKNIVDA 152 (315) T ss_pred eeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHH-HHHhhheeeccCCCCCccccccccc-----cccccceeec Confidence 9877776666 3344443332 23356889999999988 7999999999764221111111221 1111111111 Q ss_pred CCC-HHHHHHhh---ceecccccceEEEEecchhHHHHhhhhhccccccceeecCCc-----ceeehh---hccccceec Q lcl|Aclame:pro 274 TPF-ADAIEEAV---DFVRPTAGRRYLIVKAEDRKALLDELRQATANANVRIKNDDT-----EIASEV---GVDEIIVYT 341 (400) Q Consensus 274 T~~-~dal~Eal---d~~~~~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~-----~~~~~v---~v~~~~~~t 341 (400) +.. .+++..++ .-+.... ....++|+..+.+|+- |++.+| .+..+- |....+. T Consensus 153 ~~~~~~d~~~~~~~~~~~~~~~-~~~~imn~~~~~~L~~------------l~~~~g~~~~g~~~~~~~~~g~~~tl~-- 217 (315) T protein:vir:80 153 TDSATADLVKAVGLIAGAGLQV-PNGVALDPAFSFALST------------EVYPKGSPLAGQPMYPAAGFAGLDNWR-- 217 (315) T ss_pred cccchHHHHHHHHHHhhccCcc-ceEEEEcHHHHHHHHH------------HhhccCCcccccccccccccCCCceec-- Confidence 111 12333333 2222222 2347889999888875 333222 222110 0000111 Q ss_pred chhhh----ch------------hhhhccc-c--------eechhhceeccce---eeeecCceEEEEecccccceeecc Q lcl|Aclame:pro 342 GSKAL----KP------------TVLVDQK-Y--------HIDMQDLTKVDAF---EWKTNSNMILVETLTSGHVETYNA 393 (400) Q Consensus 342 g~k~l----~~------------t~~vd~k-~--------~~~~~~~~~~~~~---~~~~~~~~ilve~~~~~~~~~~~~ 393 (400) |-... +| -++-|-+ + .+++.++.+.+.. -|+.++-.+.++.--.|.|..-++ T Consensus 218 G~PV~~~~~~~~~~~~~~~~~~~~~~GDfs~~~~g~~~~~~i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a 297 (315) T protein:vir:80 218 GLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDS 297 (315) T ss_pred ceeeEecCcCCcccccccccccEEEEeecccEEEEEecCeeEEEeccccccCcccchhhcCcEEEEEEEEecceeecccc Confidence 21111 01 1111211 1 1222222111111 155555555666666666655444 Q ss_pred ceeEeeC Q lcl|Aclame:pro 394 GAVITVS 400 (400) Q Consensus 394 ~~~~~~~ 400 (400) =++++.. T Consensus 298 ~~~l~~~ 304 (315) T protein:vir:80 298 FAVVKEK 304 (315) T ss_pred eEEEeec Confidence 4444433 No 105 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=96.86 E-value=8.4e-05 Score=42.93 Aligned_cols=281 Identities=16% Similarity=0.087 Sum_probs=128.0 Q ss_pred cCChhHHHHHHHHHHhCccchhhhH--hhcchhHHHHHHHHHHhhCccccceeeecccceeeEEeeccccccceec---c Q lcl|Aclame:pro 104 SGKSEIKNAWSAKLAENGVTITDTT--FQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVSRSFDSANEAQVHK---D 178 (400) Q Consensus 104 qg~ke~k~AW~a~L~ekgV~~qd~~--eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~i~l~na~~a~GHk---~ 178 (400) =+.+.|.+-=.+.++.+++...|+. +.+|.++-..|-.++.+..++|+.+.|.....+.-.+..-+.....+.. . T Consensus 1 ~~~k~~~~~l~~~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~~~~~~~~~~~~~e~ 80 (321) T protein:vir:31 1 MASRTINNDLSRITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKTRIPTLNIGERHRRPQDEG 80 (321) T ss_pred CchHHHHHHHHHHHHhccccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcceeeeeeccCCccccccccc Confidence 1123444433333444455445544 3789888777777777788999988866655443333221111111111 1 Q ss_pred cchhhhhhhhhhhhhccHHHHHHHHHHH-HHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhh Q lcl|Aclame:pro 179 GQTKTEQAATLTIDTLEPVMVYKLQSLA-ERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEAD 257 (400) Q Consensus 179 ga~Kk~q~~~le~~ti~p~~VYkkq~La-d~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D 257 (400) +........+|..+.+....++-.-.+. +...+ +....++-+|++..+++.|= +.++++.++|||.....-. ..-+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d-~a~~~d~e~~i~~~ia~~~a-~~~~~~~~nGd~~~~~~~~-~~n~ 157 (321) T protein:vir:31 81 EWNENESDVSTGTIDISTEKATVAWDLPREVVQE-NPEGEALADRILNLMTDAWS-ADVEDLAANGDEDAEDSFE-NQND 157 (321) T ss_pred ccccccccceeeeeeeeeEEEEeehhccHHHHHh-hhcchhHHHHHHHHHHHHHH-HHHHhheeeccccCCCccc-ccch Confidence 1122233445666655554443332221 22222 21124689999999999998 7999999999997332100 1111 Q ss_pred hh--hhhhhhhhhhccCCCCCHHHHHHhhceeccc---ccceEEEEecchhHHHHhhhhhccccccceeecCCcceeehh Q lcl|Aclame:pro 258 VK--KIKKITTKAKSAGKTPFADAIEEAVDFVRPT---AGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEV 332 (400) Q Consensus 258 ~~--~ik~it~~at~~~~T~~~dal~Eald~~~~~---~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v 332 (400) =| .++.-..+.+..+.+...|.+...+.-..|. .++-+.++++..+.++++.| ++.+|.+..+. T Consensus 158 G~l~~a~~~~~~~~~~~~~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l-----------~~~~~~~~~~~ 226 (321) T protein:vir:31 158 GFITVAEGDVETIDAADDILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTL-----------TDRDTPLGDNV 226 (321) T ss_pred hhhhhhccccccccccccccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHH-----------hcCCCccccch Confidence 12 1111111111122333445555444332222 23346778888777887744 44445443333 Q ss_pred hcc--ccceecchhhhchhhhhccc--ceechhhce----eccceeeeecC-----ceEEEEeccc----ccceeeccce Q lcl|Aclame:pro 333 GVD--EIIVYTGSKALKPTVLVDQK--YHIDMQDLT----KVDAFEWKTNS-----NMILVETLTS----GHVETYNAGA 395 (400) Q Consensus 333 ~v~--~~~~~tg~k~l~~t~~vd~k--~~~~~~~~~----~~~~~~~~~~~-----~~ilve~~~~----~~~~~~~~~~ 395 (400) ..+ ..++ -|..... +..+..+ .--|++.|. ....+....+. ..+=++...+ .-||-|.+.+ T Consensus 227 l~~~~~~tl-~G~pvv~-~~~mP~~~il~t~~~nl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~~~a~a 304 (321) T protein:vir:31 227 IMGEADVNP-FSFPIIG-SGLWPDDKAMFTDPQNLIYALYRDLEIDVLTESDKVSERDLHARYFMRGDDDFAIENTEAVV 304 (321) T ss_pred hhccccccc-cceeEEE-cCCCCCCcEEEeccccEEEEEeeccEEEEeecCccccccceeeEeeeeeecceeEeccccEE Confidence 221 1122 1332221 1111111 111222221 11111111111 1122232222 2457787877 Q ss_pred eEee-C Q lcl|Aclame:pro 396 VITV-S 400 (400) Q Consensus 396 ~~~~-~ 400 (400) .++= - T Consensus 305 ~~~~i~ 310 (321) T protein:vir:31 305 LAEGLG 310 (321) T ss_pred EEecCC Confidence 7762 1 No 106 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=96.34 E-value=0.00073 Score=37.79 Aligned_cols=360 Identities=15% Similarity=0.111 Sum_probs=122.3 Q ss_pred Ccccccccchh-----------hHHHHHHHHHHHHHHHHHhhhhhhcch-hhhhhhhhhHHHHHHHHHH-HHHHHHHHHH Q lcl|Aclame:pro 1 MRISKRNMNKP-----------DLIEKQNRLAELKENNVSLKSQISGFE-VKNAIEDLPKVQELEKTLS-ENSIEIIKIE 67 (400) Q Consensus 1 ~~~s~~~~~k~-----------~~eekq~~lA~lKe~~~~~Ks~i~~~~-~~~~~~~~skieElektis-~l~aEi~K~e 67 (400) .|.++...+-+ ...+.+++.+.+-+ +..+.+.-+ ..+++..--.+++.+..+- .++.+-.+.. T Consensus 202 ~r~~~~~a~~~~~~~~~a~~~~~~~~E~~r~~eI~~----l~~~~~~~~~~~~ai~~g~sld~~ra~~ld~l~~~~~a~~ 277 (632) T protein:vir:96 202 TRGAETGAKNPAPAASGANENDILSRERTRISEITA----IGQQFSQRSLAQEAIQKGHTVDQFRALVLERMNPGQPGNF 277 (632) T ss_pred ccchhhcccccchhhhhhhhhhhhhhhHHHHHHHHH----HHHHhhhhhhHHHHHhccccHHHHHHHHHHHHhhhhhhhh Confidence 11111000000 00011111111110 111111000 0001111111122221111 1110000000 Q ss_pred H-----HHH---hhHh---hhc-----chhHHHHHHHHH------HH--HHHHHHHHHHccCChhHHHHH---HHHHHhC Q lcl|Aclame:pro 68 N-----ELN---AQEE---KPK-----GKDKMTNFIESQ------NA--VTEFFDVLKKNSGKSEIKNAW---SAKLAEN 120 (400) Q Consensus 68 n-----El~---~~kE---k~K-----~k~emtEfLkTk------qA--~~dya~ll~~nqg~ke~k~AW---~a~L~ek 120 (400) . .+. .+.. ... ....|...++.. .+ ...++.-++...|.. .+.| .+.|..+ T Consensus 278 ~~~~a~~~~~~~~~~~~~~i~~~~re~~~~~l~rai~a~a~~~~~~a~~~~e~a~~~a~~~G~~--arg~~~~~~~l~~r 355 (632) T protein:vir:96 278 EKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKE--ARGFYMPHEVLVQR 355 (632) T ss_pred hhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHhhhccchhhhhhhhHHHHHHHHhhhhh--hhhhhhhHHHHHHh Confidence 0 000 0000 000 000111111100 00 001111122222221 0111 1122222 Q ss_pred -ccchhhhH--hhcchhHH-HHHHHHHHhhCccccceeeecccce--eeEEeecc--ccccceecccchhhhhhhhhhhh Q lcl|Aclame:pro 121 -GVTITDTT--FQLPRKLV-ESINTALLNTNPVFKVFHVTNVGAL--LVSRSFDS--ANEAQVHKDGQTKTEQAATLTID 192 (400) Q Consensus 121 -gV~~qd~~--eiLP~~iI-~AIe~A~ed~d~vl~~fhV~n~~~~--a~~i~l~n--a~~a~GHk~ga~Kk~q~~~le~~ 192 (400) ...+++.+ .++|..++ .-|-+.+++ ..++..+.+...|.. .+.+--.+ ...+| ..-|.+++....+|... T Consensus 356 a~~~~t~~~gg~lvp~~~~~~~iie~lr~-~s~i~~l~~~~~~~~~g~~~ip~~~~~~~a~w-v~E~~~~~~s~~~f~~i 433 (632) T protein:vir:96 356 QLEKKTAGKGGELVATELLSEEFIDILRN-KAIIGQMGARMLPGLVGDVDIPKKTSGANFYW-IGEDEDVQDSDFDFTTL 433 (632) T ss_pred hhhcccccccccccccccchHHHHHHHhh-cchhhhhcceEeecCCcceEEEEEeCCceeEe-ecCCccccccccceeeE Confidence 12222221 13443332 122333343 445554543333322 22222222 23333 24566788899999999 Q ss_pred hccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccC Q lcl|Aclame:pro 193 TLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAG 272 (400) Q Consensus 193 ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~ 272 (400) ++.|..++....+...+-+-.+ -++.+||.++|+.++- ++++.+++.|||+... +.-+--...+..++. .+ T Consensus 434 ~l~~~k~~~~v~iS~ell~ds~--~~~~~~i~~~l~~a~~-~~~d~a~l~G~G~~~~--p~Gi~~~~~~~~~~~----~~ 504 (632) T protein:vir:96 434 SFSPKTIAGAVPVTRKLRKQSS--IHVENLIREDLIEGIG-VALDLAMLTGTGLAND--PVGLLNMTGVPALTY----PA 504 (632) T ss_pred EeeeeEEEEehhhHHHHHhccc--hHHHHHHHHHHHHHHH-HHHHHHhhcccCCCCc--cceeeecccccceec----cc Confidence 9999888887777444332222 3679999999999999 7999999999997431 100100001111111 11 Q ss_pred CCCCHHHH---HHhhceecccccceEEEEecchhHHHHhhhhhccccccceeecCCcceeehhhccccceecchhhhchh Q lcl|Aclame:pro 273 KTPFADAI---EEAVDFVRPTAGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKALKPT 349 (400) Q Consensus 273 ~T~~~dal---~Eald~~~~~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~tg~k~l~~t 349 (400) .+...+.+ ..++.-+....+.-..++++..+.++.- .+++|.+|.+...- +... |.-.. .+ T Consensus 505 ~~~~~~~i~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~----------~~l~d~~G~~i~~~---~~l~--G~pv~-~s 568 (632) T protein:vir:96 505 GGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKK----------AQVFDNTGERIWQN---NEVN--GYRAE-AS 568 (632) T ss_pred ccCCHHHHHHHHHHHhhcccccCccEEEEchhHHHHHHH----------HhccCCCCceeecC---Ceec--ccceE-ec Confidence 22222333 3333222222223345666655555443 23677888876531 1111 21111 11 Q ss_pred hhhcc--cceechhhceeccceeeeecCceEEEEecc-----ccc----ceeeccceeEeeC Q lcl|Aclame:pro 350 VLVDQ--KYHIDMQDLTKVDAFEWKTNSNMILVETLT-----SGH----VETYNAGAVITVS 400 (400) Q Consensus 350 ~~vd~--k~~~~~~~~~~~~~~~~~~~~~~ilve~~~-----~~~----~~~~~~~~~~~~~ 400 (400) ..+.. -...|.+.|.- ...+-+-|+... +|. +..++.++|.-.. T Consensus 569 ~~ip~~~~~~gd~s~~~i-------~~~~~~~i~~~~~~~~~~~~v~~~~~~~~d~~v~~~~ 623 (632) T protein:vir:96 569 NQIPADTWIFGDWSQIVI-------AMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKE 623 (632) T ss_pred cccccCcEEEeecceEEE-------EEecceEEEEccccccccCceEEEEEeecCceeechh Confidence 11111 11222222210 011112222211 121 1233333333222 No 107 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=95.17 E-value=0.00073 Score=37.78 Aligned_cols=276 Identities=12% Similarity=0.033 Sum_probs=118.8 Q ss_pred ChhHHHHHHHHHHhCccchhhhHh--hcchhHHHHHHHHHHhhCccccceeeec-ccceeeEE---eecc--ccccceec Q lcl|Aclame:pro 106 KSEIKNAWSAKLAENGVTITDTTF--QLPRKLVESINTALLNTNPVFKVFHVTN-VGALLVSR---SFDS--ANEAQVHK 177 (400) Q Consensus 106 ~ke~k~AW~a~L~ekgV~~qd~~e--iLP~~iI~AIe~A~ed~d~vl~~fhV~n-~~~~a~~i---~l~n--a~~a~GHk 177 (400) -.++++.-+ . .++++..|.-- +.|... ..+-+.+.++.++++..+|.+ .......+ +... ......-. T Consensus 1 ~~~~~~~~~-~--~k~it~~d~~gG~L~P~~~-~~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~~~~~~~ 76 (314) T protein:vir:41 1 MDFLNKPFQ-I--TPKIDVPDLGKGILAVQRF-GEFVREVRENSAIIKDARVLNALKSYEVDISRISLGVELEPGRNTSG 76 (314) T ss_pred CchhhhHHH-h--hcccccccCCCceeChHHH-HHHHHHHHhccchhhheeeecccCccceeecccccCccccccccccc Confidence 112222222 1 22333333322 456543 445566777899999777543 22221111 1111 11111111 Q ss_pred ccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccc-hhhh Q lcl|Aclame:pro 178 DGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSI-DKEA 256 (400) Q Consensus 178 ~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~-~~e~ 256 (400) .+.+-+.+..+|-...|....+.-.-++.+-.-+-+....++=+|+++.+++.|= +..+.+.++|||...+..+ ...- T Consensus 77 ~~~~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g-~~~~~~~~nGdg~~~s~~~~~~~p 155 (314) T protein:vir:41 77 TKVAPTADEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVT-YDLECFFLHADSSLTTGRELYRIN 155 (314) T ss_pred CCccCCcccccccceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHH-HHHHHHhhccccCCcCcccchhcc Confidence 2223455666777777766444433333222222222113678999999999988 7999999999996432211 0111 Q ss_pred hhhhhhh--hhhhhhccCCCCCHHHHHHhh-ceecc----cccceEEEEecchhHHHHhhhhhccccccceeecCCccee Q lcl|Aclame:pro 257 DVKKIKK--ITTKAKSAGKTPFADAIEEAV-DFVRP----TAGRRYLIVKAEDRKALLDELRQATANANVRIKNDDTEIA 329 (400) Q Consensus 257 D~~~ik~--it~~at~~~~T~~~dal~Eal-d~~~~----~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~ 329 (400) |=| +++ ..++..+.....+.+++.-++ .=..| -+++-..+++++.+.++|..+. ++|... T Consensus 156 ~G~-l~~a~~~~~~~~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~------------~~~~~l 222 (314) T protein:vir:41 156 DGW-MKLAGNQYTDAEPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLL------------VRETGL 222 (314) T ss_pred hhh-hhhcccceeecCccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHh------------ccCCcc Confidence 112 111 111111222233445544333 22222 2345578889999999988443 222111 Q ss_pred ---ehhhccccceec----chhhhchhhhhcccc--eechhhceeccceeeee-------cCceEEEEec-ccccceeec Q lcl|Aclame:pro 330 ---SEVGVDEIIVYT----GSKALKPTVLVDQKY--HIDMQDLTKVDAFEWKT-------NSNMILVETL-TSGHVETYN 392 (400) Q Consensus 330 ---~~v~v~~~~~~t----g~k~l~~t~~vd~k~--~~~~~~~~~~~~~~~~~-------~~~~ilve~~-~~~~~~~~~ 392 (400) +-.+-...++.- +...| |....+.+. --|.+.|+-+..+.++. +.....+-+. ...+++-.+ T Consensus 223 ~~~~~~~~~~~~l~G~PV~~~~~~-~~~~~~~~~i~fgd~~nlv~~~~~~ir~~~~~~a~~~~~~~~~~~r~d~~~~~~~ 301 (314) T protein:vir:41 223 GDSALIGATGLQYDGIPIQYVPAL-DALGDDKARALLTVPTNLVYGFWRNIRIEPKRDAAMRRTEYIASLRADCNYEDEN 301 (314) T ss_pred cchhhhCCCCceecceeeEecccc-cccCCCCceEEEechhheEEEeeceeEEeecccCcCCeEEEEEEEEeceEEEEcC Confidence 111111111100 11111 111111111 01222332222222221 1222333333 233566666 Q ss_pred cceeEeeC Q lcl|Aclame:pro 393 AGAVITVS 400 (400) Q Consensus 393 ~~~~~~~~ 400 (400) +.++.++- T Consensus 302 aa~~~~~~ 309 (314) T protein:vir:41 302 AAVAAVID 309 (314) T ss_pred cEEEEEee Confidence 76666665 No 108 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=92.32 E-value=0.0041 Score=33.68 Aligned_cols=241 Identities=13% Similarity=0.114 Sum_probs=107.2 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCcccccee-eecc-c--ce-eeEEee-ccccccc Q lcl|Aclame:pro 101 KKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFH-VTNV-G--AL-LVSRSF-DSANEAQ 174 (400) Q Consensus 101 ~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fh-V~n~-~--~~-a~~i~l-~na~~a~ 174 (400) |.+..+ +.-.-+.|+-+-..|...+.+. .+|..+- +.+. + ++ .+.+-. ..-..+. T Consensus 1 MA~~~T------------------~~~~~~iPev~s~~v~~~~~~~-~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~ 61 (272) T protein:vir:30 1 MAVGTT------------------KMAQMLDPEVLADMIDAEVGKA-IRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAE 61 (272) T ss_pred CCCccc------------------cchheechHHHHHHHHHHHHHH-hhhhccccccccccCCCCCEEEEEEecCCCCcc Confidence 332111 0000133333222232223221 1121111 1110 0 01 011111 1111222 Q ss_pred eecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchh Q lcl|Aclame:pro 175 VHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDK 254 (400) Q Consensus 175 GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~ 254 (400) --.-|.+.+.+..++....+.+..+++.-.+-|......+ +++++++.+.++.++- |.++.++...- T Consensus 62 ~v~eg~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~s~--~d~~~~~~~~~~~~~a-~~~d~~i~~~~---------- 128 (272) T protein:vir:30 62 DVAEGEAIPMTQLGFKKTTMTIKKAGKGVEITDEAILSGY--GDPVGQAAKQIVEAID-HKVDADVLDAL---------- 128 (272) T ss_pred cccCCCcccccccccceEEEEeeeeeeeeeecHHHHhhcc--ccHHHHHHHHHHHHHH-HHHHHHHHHHh---------- Confidence 2334567788888999999999887776666666665555 5689999999999987 67776654321 Q ss_pred hhhhhhhhhhhhhhhccCCCCCHHHHHHhh-ceecccccceEEEEecchhHHHHhh-hh---hccccccceeecCCccee Q lcl|Aclame:pro 255 EADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKAEDRKALLDE-LR---QATANANVRIKNDDTEIA 329 (400) Q Consensus 255 e~D~~~ik~it~~at~~~~T~~~dal~Eal-d~~~~~~~~~~l~~~~~d~~a~~~~-~~---~~~~~an~~lk~~d~~~~ 329 (400) +......+.+.+.|++..++ -|.......+++|+|+.+...|+.- +- .++...+-.+. +|.+. T Consensus 129 ----------~~a~~~~~~~~t~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~--~g~ig 196 (272) T protein:vir:30 129 ----------SKSTQTVEATATVDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVV--SGVYG 196 (272) T ss_pred ----------cccccccccccCHHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccccccc--cccch Confidence 11112223344567777776 3444556678999999999888641 11 11111111111 11111 Q ss_pred ehhhccccceecchhhhchhhhhcccceechhhceeccceeeeecCceEEEEeccccc-------------ceeecccee Q lcl|Aclame:pro 330 SEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGH-------------VETYNAGAV 396 (400) Q Consensus 330 ~~v~v~~~~~~tg~k~l~~t~~vd~k~~~~~~~~~~~~~~~~~~~~~~ilve~~~~~~-------------~~~~~~~~~ 396 (400) .-- |+. .+-|..++..+.+ +...+++++-. ++-+-||+-.+.- +-.+|..++ T Consensus 197 ~i~---------G~~-Vi~s~~~p~~t~~----~~~~~a~~~~~-~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~~v 261 (272) T protein:vir:30 197 EVL---------GVQ-IVRSRKCPKGTAY----MVRKGALRIML-KRNTMVETDRDITKAINQIVANKHYGVYLYKAEKA 261 (272) T ss_pred hhc---------Cee-EEEcCCCCcceEE----EEcCCeEEEEe-cCCceeeeccccccceeEEEEEEEEEEEEEcCCce Confidence 111 331 2222222222211 23344555432 3334455432110 112344334 Q ss_pred EeeC Q lcl|Aclame:pro 397 ITVS 400 (400) Q Consensus 397 ~~~~ 400 (400) +.+. T Consensus 262 v~~t 265 (272) T protein:vir:30 262 VKIT 265 (272) T ss_pred EEEE Confidence 3333 No 109 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=92.32 E-value=0.0041 Score=33.68 Aligned_cols=241 Identities=13% Similarity=0.114 Sum_probs=107.2 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCcccccee-eecc-c--ce-eeEEee-ccccccc Q lcl|Aclame:pro 101 KKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFH-VTNV-G--AL-LVSRSF-DSANEAQ 174 (400) Q Consensus 101 ~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fh-V~n~-~--~~-a~~i~l-~na~~a~ 174 (400) |.+..+ +.-.-+.|+-+-..|...+.+. .+|..+- +.+. + ++ .+.+-. ..-..+. T Consensus 1 MA~~~T------------------~~~~~~iPev~s~~v~~~~~~~-~~~~~~~~~~~~~~g~~G~tv~iP~~~~~~~a~ 61 (272) T protein:vir:98 1 MAVGTT------------------KMAQMLDPEVLADMIDAEVGKA-IRFAPLAEVDTTLEGQPGTTLTVPKWDYIGDAE 61 (272) T ss_pred CCCccc------------------cchheechHHHHHHHHHHHHHH-hhhhccccccccccCCCCCEEEEEEecCCCCcc Confidence 332111 0000133333222232223221 1121111 1110 0 01 011111 1111222 Q ss_pred eecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCCccccchh Q lcl|Aclame:pro 175 VHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTNGFKSIDK 254 (400) Q Consensus 175 GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~ 254 (400) --.-|.+.+.+..++....+.+..+++.-.+-|......+ +++++++.+.++.++- |.++.++...- T Consensus 62 ~v~eg~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~s~--~d~~~~~~~~~~~~~a-~~~d~~i~~~~---------- 128 (272) T protein:vir:98 62 DVAEGEAIPMTQLGFKKTTMTIKKAGKGVEITDEAILSGY--GDPVGQAAKQIVEAID-HKVDADVLDAL---------- 128 (272) T ss_pred cccCCCcccccccccceEEEEeeeeeeeeeecHHHHhhcc--ccHHHHHHHHHHHHHH-HHHHHHHHHHh---------- Confidence 2334567788888999999999887776666666665555 5689999999999987 67776654321 Q ss_pred hhhhhhhhhhhhhhhccCCCCCHHHHHHhh-ceecccccceEEEEecchhHHHHhh-hh---hccccccceeecCCccee Q lcl|Aclame:pro 255 EADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKAEDRKALLDE-LR---QATANANVRIKNDDTEIA 329 (400) Q Consensus 255 e~D~~~ik~it~~at~~~~T~~~dal~Eal-d~~~~~~~~~~l~~~~~d~~a~~~~-~~---~~~~~an~~lk~~d~~~~ 329 (400) +......+.+.+.|++..++ -|.......+++|+|+.+...|+.- +- .++...+-.+. +|.+. T Consensus 129 ----------~~a~~~~~~~~t~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~--~g~ig 196 (272) T protein:vir:98 129 ----------SKSTQTVEATATVDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGANRVV--SGVYG 196 (272) T ss_pred ----------cccccccccccCHHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhccccccccccccccccc--cccch Confidence 11112223344567777776 3444556678999999999888641 11 11111111111 11111 Q ss_pred ehhhccccceecchhhhchhhhhcccceechhhceeccceeeeecCceEEEEeccccc-------------ceeecccee Q lcl|Aclame:pro 330 SEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGH-------------VETYNAGAV 396 (400) Q Consensus 330 ~~v~v~~~~~~tg~k~l~~t~~vd~k~~~~~~~~~~~~~~~~~~~~~~ilve~~~~~~-------------~~~~~~~~~ 396 (400) .-- |+. .+-|..++..+.+ +...+++++-. ++-+-||+-.+.- +-.+|..++ T Consensus 197 ~i~---------G~~-Vi~s~~~p~~t~~----~~~~~a~~~~~-~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~~v 261 (272) T protein:vir:98 197 EVL---------GVQ-IVRSRKCPKGTAY----MVRKGALRIML-KRNTMVETDRDITKAINQIVANKHYGVYLYKAEKA 261 (272) T ss_pred hhc---------Cee-EEEcCCCCcceEE----EEcCCeEEEEe-cCCceeeeccccccceeEEEEEEEEEEEEEcCCce Confidence 111 331 2222222222211 23344555432 3334455432110 112344334 Q ss_pred EeeC Q lcl|Aclame:pro 397 ITVS 400 (400) Q Consensus 397 ~~~~ 400 (400) +.+. T Consensus 262 v~~t 265 (272) T protein:vir:98 262 VKIT 265 (272) T ss_pred EEEE Confidence 3333 No 110 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=91.47 E-value=0.016 Score=30.47 Aligned_cols=276 Identities=18% Similarity=0.215 Sum_probs=121.8 Q ss_pred HHccCChhHHHHHHHH-------------HHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceee----ecccceee Q lcl|Aclame:pro 101 KKNSGKSEIKNAWSAK-------------LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHV----TNVGALLV 163 (400) Q Consensus 101 ~~nqg~ke~k~AW~a~-------------L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV----~n~~~~a~ 163 (400) |-.--+-+..-.|+-. |++++.-.+|. +++.||+.+ .+++.+|..+-| .+..+|-. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~p~l~m~alTLaea~~l~~d~---~~~~VIE~l----~~~s~iL~~lpf~~ve~~~~~~~r 73 (330) T protein:vir:94 1 MVRICTPPLRGRWRTLTHQFPELKMPTVTLAESAKLSQDH---LVSGLIETI----VEVNPLYEMMPFTEIEGNALAYNR 73 (330) T ss_pred CceecCCccccceeehhccccccchhhhhhhHHhhcCchh---hHHHHHHhh----hccchHHhhcccccccCCcceeee Confidence 1111122333334322 34443322333 455555554 444555544333 33334432 Q ss_pred EEeeccccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeec Q lcl|Aclame:pro 164 SRSFDSANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG 243 (400) Q Consensus 164 ~i~l~na~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~g 243 (400) .-.+..+ +|-.-.+.-......+|+..+.+...+--.-..-....+|.|.+-+...+-++....++= +..+..+++| T Consensus 74 ~~~lp~a--~~r~~n~~~~~~~~~Tf~q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~-~~~e~~linG 150 (330) T protein:vir:94 74 ENVLGDV--QFLAVGGTITAKNPATFTKVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAKSIG-RQYQASMITG 150 (330) T ss_pred eecCCcc--eeeeccccccccCcceeeeeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHH-HHHHHHhhcc Confidence 2222222 222222221222233454445443333333333344555666553333333333333443 3677899999 Q ss_pred cCCC-cc-ccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceeccccc-ceEEEEecchhHHHHhhhhhccccccce Q lcl|Aclame:pro 244 DGTN-GF-KSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAG-RRYLIVKAEDRKALLDELRQATANANVR 320 (400) Q Consensus 244 DG~~-~t-~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~~~-~~~l~~~~~d~~a~~~~~~~~~~~an~~ 320 (400) |+.+ .| -........ +-|+ +-+-++.++.|+|-|.|+.+..+.| ..+|+.++-.+.+|+--.|+++...-.. T Consensus 151 Ds~~~~F~GL~~~~~~~---q~i~--tg~~gg~~T~d~LDeLl~~v~~~~g~~~~~l~n~a~~r~I~a~~R~~~~~~v~~ 225 (330) T protein:vir:94 151 DGTGNSFQGMMGLVAAS---QTIS--AGANGGTLTFELLDQLLDLVKDKDGQVDYLMSSFAMRRKYFSLLRALGGAAIGE 225 (330) T ss_pred CCCCccccchhhcCCcc---cEEe--cCCCCCCCCHHHHHHHHHHhcCCCCCCcEEEechhHHHHHHHHHHhccCCCCCC Confidence 8662 11 111111111 1111 2234567888999999999987765 5788888888888888888776544221 Q ss_pred -eecCCcceeehhhccccceecchhhhchhhhhcc------------cceechhh-ceeccceeee-ecCceEEEEec-- Q lcl|Aclame:pro 321 -IKNDDTEIASEVGVDEIIVYTGSKALKPTVLVDQ------------KYHIDMQD-LTKVDAFEWK-TNSNMILVETL-- 383 (400) Q Consensus 321 -lk~~d~~~~~~v~v~~~~~~tg~k~l~~t~~vd~------------k~~~~~~~-~~~~~~~~~~-~~~~~ilve~~-- 383 (400) -.|..|..+-.. -| -.+.|.+-++. =|++.+-+ ..+.|--|-. .....|-|+.+ T Consensus 226 ~~~~~~G~~v~~~--------~G-vPi~~~d~ip~~~~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~glsVr~~G~ 296 (330) T protein:vir:94 226 VMTLPSGRQIPTY--------RG-VPWFVNDFIPSNMTQGTATNATAIFAGTFDDGSNKYGIAGLTARGSAGLRVQNVGA 296 (330) T ss_pred cccccCCCEEeee--------CC-eEEEecccccCCCCcccCCCceeEEEEeecccccccceEeecCCCCCcceeeeCCC Confidence 233445443211 01 11112221111 02222210 0000000000 01234555553 Q ss_pred ------ccccceeeccceeEeeC Q lcl|Aclame:pro 384 ------TSGHVETYNAGAVITVS 400 (400) Q Consensus 384 ------~~~~~~~~~~~~~~~~~ 400 (400) .+++|+.|...||..-. T Consensus 297 ~~~k~v~~~~v~~y~~~av~~~~ 319 (330) T protein:vir:94 297 KENADETITRVKMYCGFANFSQL 319 (330) T ss_pred ccccceeeEEEEEeeeeEEechh Confidence 46889999665554322 No 111 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=90.62 E-value=0.0063 Score=32.65 Aligned_cols=275 Identities=14% Similarity=0.051 Sum_probs=119.4 Q ss_pred HH----HHccCChhHHHHHHHHHHhCccchhhhHh-hcchhHHHHHHHHHHhhCccccceeeec---ccceee-EEeecc Q lcl|Aclame:pro 99 VL----KKNSGKSEIKNAWSAKLAENGVTITDTTF-QLPRKLVESINTALLNTNPVFKVFHVTN---VGALLV-SRSFDS 169 (400) Q Consensus 99 ll----~~nqg~ke~k~AW~a~L~ekgV~~qd~~e-iLP~~iI~AIe~A~ed~d~vl~~fhV~n---~~~~a~-~i~l~n 169 (400) +| +.++.- ...++.-++ .|..- .|+......+-+.+.++.++++..+|-+ ....-+ .++... T Consensus 1 ~~~~~~~~~~~~-------~~~~k~~t~--~d~~Gg~l~P~~~~~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~ 71 (315) T protein:vir:41 1 MLTIEDIRGGKP-------FEIVPKIDV--PDLGRGVLSVDRFGEFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVL 71 (315) T ss_pred CcccchhhcCCh-------hhhhhhcCC--cCCCCceechHHHHHHHHHHHhhhhhhhhceeeeccccccccccccccCc Confidence 22 111111 112222233 33322 4555556667777777889998777532 111110 111111 Q ss_pred --ccccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeeccCCC Q lcl|Aclame:pro 170 --ANEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEGDGTN 247 (400) Q Consensus 170 --a~~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~ 247 (400) ..++..-..+.+.+++..+|....+.+..++-+-.+.+-.-+-+...-++-+|++.++++.|= |..+.+.++|||.. T Consensus 72 ~~~~g~~~~~~~~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a-~~~~~~~~nGdg~s 150 (315) T protein:vir:41 72 DVGPGRDETGQKLAPPESTAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGIS-YVLEKYYLHGDTSS 150 (315) T ss_pred ccccccccccCcCCCCCCccccceeeeceeeeeeeccccHHHHHhhhccccHHHHHHHHHHHHHH-HHHHHHhhccCCcC Confidence 112222223345566778888888887776655333222222121113678999999999998 79999999999973 Q ss_pred ccccchhhhhhhhhhhhhh--hhhcc--CCCCCHHHHHHhhceecc-----cccceEEEEecchhHHHHhhhhhcccccc Q lcl|Aclame:pro 248 GFKSIDKEADVKKIKKITT--KAKSA--GKTPFADAIEEAVDFVRP-----TAGRRYLIVKAEDRKALLDELRQATANAN 318 (400) Q Consensus 248 ~t~~~~~e~D~~~ik~it~--~at~~--~~T~~~dal~Eald~~~~-----~~~~~~l~~~~~d~~a~~~~~~~~~~~an 318 (400) ... ....-|=| ++..+. .++.. ..+.+..+.+-+|-++-| -+.+-..++++..+.++|. T Consensus 151 ~~p-~~~~~~G~-l~~a~~~~~~~~~~~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rk---------- 218 (315) T protein:vir:41 151 SDP-LLRMSDGW-LKLASEKLTESDVDPEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRD---------- 218 (315) T ss_pred cCc-cccccccc-eecccccccccccccccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHH---------- Confidence 110 00000101 011111 11111 111222222222212222 2334578999999999988 Q ss_pred ceeecCCcceeehhh--ccc-cceecchhhh-c---hhhhhccccee---chhhceeccceeeee------cCceE-EEE Q lcl|Aclame:pro 319 VRIKNDDTEIASEVG--VDE-IIVYTGSKAL-K---PTVLVDQKYHI---DMQDLTKVDAFEWKT------NSNMI-LVE 381 (400) Q Consensus 319 ~~lk~~d~~~~~~v~--v~~-~~~~tg~k~l-~---~t~~vd~k~~~---~~~~~~~~~~~~~~~------~~~~i-lve 381 (400) +++.+|.+.-.-+ .++ .++ -|---- + |... +.+..| |++.|+-...+++.. +++.+ .+- T Consensus 219 --lk~~~g~~lw~~~~~~g~~~tl-~G~PV~~~~~m~~~~-~~~~~ilf~d~~nl~~~~~~~i~i~~~~~a~~~~~~~~~ 294 (315) T protein:vir:41 219 --ALKGRETGLGDQALTGANSILY-DGRPVQYVPALEALN-DGKSRALFVVPTQLVYGFWRNIKVVPDYDAEMRLTKYVA 294 (315) T ss_pred --HhccCCCccccchhhcCCCcee-cccceEecccccccC-CCCccEEEecccceEEEeccccEEEeeecCCCCceEEEE Confidence 7888877653111 111 111 121100 0 0000 111111 222232222222221 11111 111 Q ss_pred ecc-cccc--eeeccceeEee Q lcl|Aclame:pro 382 TLT-SGHV--ETYNAGAVITV 399 (400) Q Consensus 382 ~~~-~~~~--~~~~~~~~~~~ 399 (400) ++. .|++ +-+-+-++++| T Consensus 295 ~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 295 SLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred EEEeceeEEeccceeEeeeeC Confidence 222 2322 33444555666 No 112 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=87.23 E-value=0.022 Score=29.63 Aligned_cols=241 Identities=14% Similarity=0.114 Sum_probs=104.3 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCcccccee-eecc-c--ce-eeEEeeccc-cccc Q lcl|Aclame:pro 101 KKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFH-VTNV-G--AL-LVSRSFDSA-NEAQ 174 (400) Q Consensus 101 ~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fh-V~n~-~--~~-a~~i~l~na-~~a~ 174 (400) |.|.-++ --+-+.|+-+-..+...+++. -+|.++= +.+. + ++ .+.+-.=+. ..++ T Consensus 1 ma~~~T~------------------~~~~iiPev~~~~v~~~~~~~-~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~ 61 (274) T protein:vir:93 1 MPQGITK------------------TSNQIIPEVLAPMMQAQLEKK-LRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQ 61 (274) T ss_pred CCcccee------------------hhheechHHHHHHHHHHHHhh-hhhcccccccccccCCCCCEEEEEeeccCCCcc Confidence 4443321 000022322222222222211 1111111 1110 0 01 011100011 1222 Q ss_pred eecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeec-cCCCccccch Q lcl|Aclame:pro 175 VHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG-DGTNGFKSID 253 (400) Q Consensus 175 GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~g-DG~~~t~~~~ 253 (400) -...|.+-+.+.++....++.....++-..+-|....+.+ ++++..+++.++.++- |-++..+... .|.+- T Consensus 62 ~~~eg~~i~~~~it~~~~~~~i~~~~~~~~i~D~~~~~~~--~d~~~~~~~~~~~~~a-~~~d~~~~~~~~~a~~----- 133 (274) T protein:vir:93 62 VVAEGEKIPTDILETKKREAKIRKIAKGTSITDEALLSGY--GDPQGEQVRQHGLAHA-NKVDNDVLEALMGAKL----- 133 (274) T ss_pred cccCCCcccccccccceeEEEeeeecccccccHHHHHhhc--cchHHHHHHHHHHHHH-HHHHHHHHHHHhcccc----- Confidence 3344555566667777777777666655556666655554 6779999999998888 5666555432 11100 Q ss_pred hhhhhhhhhhhhhhhhccCCCCCHHHHHHhh-ceecccccceEEEEecchhHHHHhh----hhhccccccceeecCCcce Q lcl|Aclame:pro 254 KEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKAEDRKALLDE----LRQATANANVRIKNDDTEI 328 (400) Q Consensus 254 ~e~D~~~ik~it~~at~~~~T~~~dal~Eal-d~~~~~~~~~~l~~~~~d~~a~~~~----~~~~~~~an~~lk~~d~~~ 328 (400) ++ .+.+...+++.+|+ -|-......++|+||+.+...|+.. .-.++...+-.+. +|.+ T Consensus 134 -----------~~----~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~--~G~i 196 (274) T protein:vir:93 134 -----------TV----NADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIV--KGAF 196 (274) T ss_pred -----------cc----cccccCHHHHHHHHHHhhhccCCccEEEeCHHHHHHHHhhhhhccccccccccccee--eccc Confidence 00 11122355565555 2223345678999999999998742 1112222111111 1222 Q ss_pred eehhhccccceecchhhhchhhhhcccceechhhceeccceeeeecCceEEEEecccc-------------cceeeccce Q lcl|Aclame:pro 329 ASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSG-------------HVETYNAGA 395 (400) Q Consensus 329 ~~~v~v~~~~~~tg~k~l~~t~~vd~k~~~~~~~~~~~~~~~~~~~~~~ilve~~~~~-------------~~~~~~~~~ 395 (400) ..-- |+. |++|.+-...-.=+...+++++- ++.-+.||+.... -+-.+|..+ T Consensus 197 g~~~---------G~~-----Vi~s~~~p~~t~~l~~~gai~~~-~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~ 261 (274) T protein:vir:93 197 GEAL---------GAI-----IVRTNKLEAGTAILAKKGAVKLI-LKRDFFLEVARDASTKTTALYSDKHYVAYLYDESK 261 (274) T ss_pred ceec---------Cee-----EEEcCCCCcceEEEEeCCeEEEE-ecCCcccccccchhhcccEEEEEEEEEEEEEcCCc Confidence 2222 321 11222111111224556777764 3444566765432 123345555 Q ss_pred eEeeC Q lcl|Aclame:pro 396 VITVS 400 (400) Q Consensus 396 ~~~~~ 400 (400) ++.+. T Consensus 262 ~v~~t 266 (274) T protein:vir:93 262 AVKIT 266 (274) T ss_pred eEEEe Confidence 55544 No 113 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=76.06 E-value=0.14 Score=25.27 Aligned_cols=245 Identities=13% Similarity=0.106 Sum_probs=108.3 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCcccccee-eeccc---ceeeEEeec-cc-cccc Q lcl|Aclame:pro 101 KKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFH-VTNVG---ALLVSRSFD-SA-NEAQ 174 (400) Q Consensus 101 ~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fh-V~n~~---~~a~~i~l~-na-~~a~ 174 (400) |.|.-+ +--+-|.|+=.=..|...++. ..+|.++= +++.- ++--+--|. +. ..++ T Consensus 1 ma~~~T------------------~~~d~iiPev~~~~v~~~~~~-~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~ 61 (272) T protein:vir:36 1 MSKQKT------------------TLADLVNPEVLAPIVSYELNK-ALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAA 61 (272) T ss_pred CCCcce------------------ehhhhhchHHHHHHHHHHHHh-hhhhccccccccccccCCCCEEEEeeeccCcccc Confidence 332222 111114454332233333332 23333322 11110 011111111 11 2334 Q ss_pred eecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeec-cCCCccccch Q lcl|Aclame:pro 175 VHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG-DGTNGFKSID 253 (400) Q Consensus 175 GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~g-DG~~~t~~~~ 253 (400) -+..|.+-+.+.++....++.....++-.+.-|....+.+ ++.+..++++++.++- |-++..+... .|. T Consensus 62 ~~~eg~~i~~~~lt~~~~~~~i~~~~k~~~vtD~~~~~~~--~d~~~~~~~~~a~~~a-~~~d~~i~~~l~~~------- 131 (272) T protein:vir:36 62 DVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGY--GDPIGESNKQLGLSLA-NKVDDDLLSAAKTT------- 131 (272) T ss_pred ccCCCCccChhhcCCcceeEeeehhhccccccHHHHhhcc--chHHHHHHHHHHHHHH-HHHHHHHHHHhccc------- Confidence 4666777777888877777777765554555565555544 7889999999998876 5666544322 221 Q ss_pred hhhhhhhhhhhhhhhhccCCCCCHHHHHHhhc-eecccccceEEEEecchhHHHHhhhhhccccccceeecC--Ccceee Q lcl|Aclame:pro 254 KEADVKKIKKITTKAKSAGKTPFADAIEEAVD-FVRPTAGRRYLIVKAEDRKALLDELRQATANANVRIKND--DTEIAS 330 (400) Q Consensus 254 ~e~D~~~ik~it~~at~~~~T~~~dal~Eald-~~~~~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~lk~~--d~~~~~ 330 (400) ....+..-+.|.+..|++ |-.-....+++|+|+.+...||...+ .....+. ++.+.. T Consensus 132 --------------~~~~~~~~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~------~~~~~~~~~~~~~~~ 191 (272) T protein:vir:36 132 --------------SQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDAN------AKNIGSEVGANALIN 191 (272) T ss_pred --------------cccccccccHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHhcccc------cccccccccccceee Confidence 112233445566666662 11112235799999999888764211 1111111 111100 Q ss_pred hhhccccceecchhhhchhhhhcccceechhhceeccceeeeecCceEEEEeccccc-------------ceeeccceeE Q lcl|Aclame:pro 331 EVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGH-------------VETYNAGAVI 397 (400) Q Consensus 331 ~v~v~~~~~~tg~k~l~~t~~vd~k~~~~~~~~~~~~~~~~~~~~~~ilve~~~~~~-------------~~~~~~~~~~ 397 (400) | .+--+-|.. .+-|.-+.....+...=+...+++++- ++.-+-||+-..-- +-.+|.-+|+ T Consensus 192 --G--~ig~~~G~~-Vv~s~~~p~~~~~~~~~~~~~gA~~~~-~~~~~~vE~~R~~~~~~d~i~~~~~y~~~v~~~~~vv 265 (272) T protein:vir:36 192 --G--TYADVLGAQ-IVRSKKLAEGSALMFKIVSNSPALKLV-LKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVV 265 (272) T ss_pred --e--ccceecCee-EEEeCCCCCCceeEEEEEecccceeee-ecCCcccccccchhhcCcEEEEEEEEEEEEEcCccEE Confidence 0 000111322 111222222211111112336777763 34445677643211 1234444444 Q ss_pred eeC Q lcl|Aclame:pro 398 TVS 400 (400) Q Consensus 398 ~~~ 400 (400) .++ T Consensus 266 ~~t 268 (272) T protein:vir:36 266 NIT 268 (272) T ss_pred EEe Confidence 444 No 114 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=74.71 E-value=0.16 Score=25.02 Aligned_cols=241 Identities=15% Similarity=0.139 Sum_probs=101.6 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCcccccee-eecc-c--ce-eeEEeeccc-cccc Q lcl|Aclame:pro 101 KKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFH-VTNV-G--AL-LVSRSFDSA-NEAQ 174 (400) Q Consensus 101 ~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fh-V~n~-~--~~-a~~i~l~na-~~a~ 174 (400) |.+..++ -.|+ +.|+=+-..+...++. -.+|+.+- +++. + ++ .+.+-.=+. -.+. T Consensus 1 ma~~~T~----------------~~d~--i~Pev~s~~v~~~~~~-~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~ 61 (274) T protein:vir:96 1 MAQGTTK----------------VSNL--IVPEVLAPMMQAELDK-KLRFAQFADIDSTLVGQPGDTLTFPAFTYSGDAQ 61 (274) T ss_pred CCccccc----------------hhhh--hhhHHHHHHHHHHHHh-hhhhcccccccccccCCCCCEEEEEeeccCCCcc Confidence 3332221 0111 3332222222222221 22223221 1110 0 01 011100011 1222 Q ss_pred eecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeec-cCCCccccch Q lcl|Aclame:pro 175 VHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG-DGTNGFKSID 253 (400) Q Consensus 175 GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~g-DG~~~t~~~~ 253 (400) -...|.+-+.++++.....+.....|+..++-|....+.+ ++.+..+.+.++.++- |-++..+... +|.+-+ T Consensus 62 ~~~~g~~i~~~~it~~~~~~~i~~~~~~~~i~D~~~~~~~--~d~~~~~~~~~~~~~a-~~~d~~i~~~l~~a~~~---- 134 (274) T protein:vir:96 62 VIAEGEKIPVDQIGTSKREAKVRKIGKGTELTDEAVLSGF--GDPQGEAVRQHGLAIA-NKVDNDVLEALKGATLT---- 134 (274) T ss_pred ccCCCCcCchhhcccceeEEEEEeeeceeeecHHHHHhhc--chHHHHHHHHHHHHHH-HHHHHHHHHHHhcCCCC---- Confidence 2333445555556655555555554555555666655544 7789999999998877 5666554433 222111 Q ss_pred hhhhhhhhhhhhhhhhccCCCCCHHHHHHhh-ceecccccceEEEEecchhHHHHhh----hhhccccccceeecCCcce Q lcl|Aclame:pro 254 KEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKAEDRKALLDE----LRQATANANVRIKNDDTEI 328 (400) Q Consensus 254 ~e~D~~~ik~it~~at~~~~T~~~dal~Eal-d~~~~~~~~~~l~~~~~d~~a~~~~----~~~~~~~an~~lk~~d~~~ 328 (400) -.+.+...|.+.+|+ -|-......++||+|+.+...|+.. .-+++...|-.+. +|.+ T Consensus 135 ----------------~~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~--~g~i 196 (274) T protein:vir:96 135 ----------------VEADITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIV--KGAF 196 (274) T ss_pred ----------------cCcccccHHHHHHHHHHhcccCCCceEEEeCHHHHHHHHhccccccccccccccccee--eccc Confidence 011122356666665 2223334679999999998888652 1122222222121 2222 Q ss_pred eehhhccccceecchhhhchhhhhcccceechhhceeccceeeeecCceEEEEeccccc-------------ceeeccce Q lcl|Aclame:pro 329 ASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGH-------------VETYNAGA 395 (400) Q Consensus 329 ~~~v~v~~~~~~tg~k~l~~t~~vd~k~~~~~~~~~~~~~~~~~~~~~~ilve~~~~~~-------------~~~~~~~~ 395 (400) ...- |+. |++|.+-..+-.=+...+++++-. +.-+.||+....- +-.+|... T Consensus 197 g~~~---------G~~-----Vi~s~~~p~~t~~l~~~gA~~~~~-~~~~~vE~~Rd~~~~~d~i~~~~~yg~~~~~~~~ 261 (274) T protein:vir:96 197 GEAL---------GAV-----IVRSNKLNKGEALLAKKGAVKLIT-KRDFFLEKDRDASRKSTALYSDKHYVAYLYDESK 261 (274) T ss_pred ceec---------Cee-----EEEcCCCCcceEEEEeCcceeeee-cCCcccccccchhhcccEEEEeeEEEEEEEcCcc Confidence 2222 221 111211111111244567777643 3334566543211 22334444 Q ss_pred eEeeC Q lcl|Aclame:pro 396 VITVS 400 (400) Q Consensus 396 ~~~~~ 400 (400) ++.+. T Consensus 262 vv~~t 266 (274) T protein:vir:96 262 VVKIT 266 (274) T ss_pred EEEEE Confidence 44333 No 115 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=61.76 E-value=0.16 Score=24.90 Aligned_cols=245 Identities=13% Similarity=0.057 Sum_probs=100.6 Q ss_pred HHHccCChhHHHHHHHHHHhCccchhhhHhhcc-hhHHHHHHHHHHhhCcccccee-eecccceeeEEee-cccccccee Q lcl|Aclame:pro 100 LKKNSGKSEIKNAWSAKLAENGVTITDTTFQLP-RKLVESINTALLNTNPVFKVFH-VTNVGALLVSRSF-DSANEAQVH 176 (400) Q Consensus 100 l~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP-~~iI~AIe~A~ed~d~vl~~fh-V~n~~~~a~~i~l-~na~~a~GH 176 (400) |.-.+++.-.+.+|.. . ..|+ .|+ +---.-|.++|.. ..+|..+| +-.+..+.-+.-+ =....+.+| T Consensus 1 m~~~~~~~~t~~~~~~------~-~~~~--~l~le~~~geV~~af~~-~s~~~~~~~~r~i~~G~s~~~~~iG~~~~~~~ 70 (334) T protein:vir:80 1 MTYPAANTHTRPGWGG------A-NSDV--SLHIEEHLGLVDASFMY-SSKFASWMNVRSLRGTNQLRVDRVGASTIAGR 70 (334) T ss_pred CCCCcCCCcccccccc------c-cchh--eehhhhhhhHHHHHHHH-hhhhhccceeeeccccceEEEeeecceeeeee Confidence 3333344455666651 1 1111 355 4455667888885 68888788 5555433222222 224578899 Q ss_pred cccchhhhhhhhhhhh-------hccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHH-HHhcceee------ Q lcl|Aclame:pro 177 KDGQTKTEQAATLTID-------TLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNK-IVDLALVE------ 242 (400) Q Consensus 177 k~ga~Kk~q~~~le~~-------ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~R-av~rAvv~------ 242 (400) +.|+.=..|.+.-... -.....||..-.. .---++.++|+.---|.+.+-.+.++-| ++..|.-. T Consensus 71 ~~g~~l~~~~~~~~~~~l~ID~~l~~~~~VddiD~~-q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~ 149 (334) T protein:vir:80 71 KAGEELVVQKNVSDKLNLTVDTVLYARHFFDKFDEW-TSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLK 149 (334) T ss_pred cCCCCCCCCCcccCceEEEEeeeeehhhhHhhHHHH-hcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc Confidence 9998444443333333 3444555554333 0011244444444455555555543323 22222211 Q ss_pred ---ccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHH-------HHHHhhceeccc---ccceEEEEecchhHHHHhh Q lcl|Aclame:pro 243 ---GDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFAD-------AIEEAVDFVRPT---AGRRYLIVKAEDRKALLDE 309 (400) Q Consensus 243 ---gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~d-------al~Eald~~~~~---~~~~~l~~~~~d~~a~~~~ 309 (400) +||..+ ..+. +++....-.++| +..+.|+...+. ..+||++|.+.-.-+|+. T Consensus 150 ~~~~~G~~~--~~~~------------~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~- 214 (334) T protein:vir:80 150 PAFHDGILL--PSTI------------SGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLE- 214 (334) T ss_pred ccccCCcce--eecc------------cccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhc- Confidence 133322 1111 111111112232 234445555444 679999999988888887 Q ss_pred hhhccccccceeecCC-cceeehhhcccccee--cc-----hhhhchhhhhcccceechhhceec--cceee----eecC Q lcl|Aclame:pro 310 LRQATANANVRIKNDD-TEIASEVGVDEIIVY--TG-----SKALKPTVLVDQKYHIDMQDLTKV--DAFEW----KTNS 375 (400) Q Consensus 310 ~~~~~~~an~~lk~~d-~~~~~~v~v~~~~~~--tg-----~k~l~~t~~vd~k~~~~~~~~~~~--~~~~~----~~~~ 375 (400) .. ||.|.| |...+--+..+..+. -| ++.| |+..+- .. |+..= .|.. T Consensus 215 ---~~-----r~~n~d~~~s~~~~~~~~g~i~~v~G~~V~~Sn~~-P~~~~t-----------~~~~g~~~~~~agd~t~ 274 (334) T protein:vir:80 215 ---HD-----RLMNVEFGAKEGGNSFVGGRIAMLNGVRVVETPRF-PQSAIT-----------ANALGADFNVTDAEVRR 274 (334) T ss_pred ---cc-----ccccceeccccccccccceeEEEEeceEEEeecCC-CCcccc-----------ccccccccccccccccc Confidence 22 233321 111110111111111 14 2333 322110 00 00000 0000 Q ss_pred ceEEEEecccccceeeccceeEeeC Q lcl|Aclame:pro 376 NMILVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 376 ~~ilve~~~~~~~~~~~~~~~~~~~ 400 (400) -+.| -|...|+.|+- T Consensus 275 ~~~~----------~~~~~Al~t~~ 289 (334) T protein:vir:80 275 KMIT----------FIPSMALISAQ 289 (334) T ss_pred eEEE----------EEeCceEEEEE Confidence 0111 01222333322 No 116 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=61.40 E-value=0.35 Score=23.06 Aligned_cols=241 Identities=15% Similarity=0.117 Sum_probs=100.6 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCcccccee-eecc-c--ceeeEEeec-cc-cccc Q lcl|Aclame:pro 101 KKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFH-VTNV-G--ALLVSRSFD-SA-NEAQ 174 (400) Q Consensus 101 ~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fh-V~n~-~--~~a~~i~l~-na-~~a~ 174 (400) |.|.-++ ..|+ +.|+-.-..+...+.+. -+|..+- +++. + ++--+--|. +. ..++ T Consensus 1 ma~~~T~----------------~~d~--iiPev~~~~v~~~~~~~-l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~ 61 (274) T protein:vir:97 1 MPQGLTK----------------TSDQ--IIPEVLAPMMQAQLEKK-LRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQ 61 (274) T ss_pred CCcccee----------------hhhe--echHHHHHHHHHhhhhh-hhhcccceecccccCCCCCEEEEeeecCCCccc Confidence 3333221 0011 33322222222222211 1111111 1110 0 011011110 11 1222 Q ss_pred eecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeec-cCCCccccch Q lcl|Aclame:pro 175 VHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG-DGTNGFKSID 253 (400) Q Consensus 175 GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~g-DG~~~t~~~~ 253 (400) -...|.+-+-+.++....++.....++-.+.-|...-+.+ ++.+..+++.++.++- |-++..+... .+.+- T Consensus 62 ~~~~g~~i~~~~lt~~~~~~~i~~~~~~~~i~D~~~~~~~--~dp~~~~~~~~a~a~a-~~vd~~~~~~l~~a~~----- 133 (274) T protein:vir:97 62 VVAEGEKIPTDILETKKREAKIRKIAKGTSITDEALLSGY--GDPQGEQVRQHGLAHA-NKVDNDVLEALMGAKL----- 133 (274) T ss_pred cccCCCcccccccccceeEEEeeeecceecccHHHHHhcc--chHHHHHHHHHHHHHH-HHHHHHHHHHHhccCc----- Confidence 2333444455555555555555444443444555554544 7778999999988887 4555443322 11100 Q ss_pred hhhhhhhhhhhhhhhhccCCCCCHHHHHHhh-ceecccccceEEEEecchhHHHHhh----hhhccccccceeecCCcce Q lcl|Aclame:pro 254 KEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKAEDRKALLDE----LRQATANANVRIKNDDTEI 328 (400) Q Consensus 254 ~e~D~~~ik~it~~at~~~~T~~~dal~Eal-d~~~~~~~~~~l~~~~~d~~a~~~~----~~~~~~~an~~lk~~d~~~ 328 (400) +. .+.+...+.+..|+ -|-......++|+||+.+...|+.. .-+++...+-.+. +|.+ T Consensus 134 -----------~~----~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~--~G~i 196 (274) T protein:vir:97 134 -----------TV----NADITKLNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIV--KGAF 196 (274) T ss_pred -----------cc----cccccCHHHHHHHHHHhhccCCCceEEEeCHHHHHHHHhhhhhhccccCccccccee--cccc Confidence 00 01122345666555 2223344678999999999988752 1122222221121 2222 Q ss_pred eehhhccccceecchhhhchhhhhcccceechhhceeccceeeeecCceEEEEecccc-------------cceeeccce Q lcl|Aclame:pro 329 ASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSG-------------HVETYNAGA 395 (400) Q Consensus 329 ~~~v~v~~~~~~tg~k~l~~t~~vd~k~~~~~~~~~~~~~~~~~~~~~~ilve~~~~~-------------~~~~~~~~~ 395 (400) ..-- |+ .|++|.+-..+-.=+...+++++- ++.-+.||+-..- -+-.+|... T Consensus 197 g~~~---------G~-----~Vi~s~~~p~~t~~l~~~gA~~~~-~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~ 261 (274) T protein:vir:97 197 GEAL---------GA-----IIVRTNKLEAGTAILAKKGAVKLI-LKRDFFLEVARDASTKTTALYSDKHYVAYLYDESK 261 (274) T ss_pred ceec---------Ce-----eEEEcCCCCcceEEEEeCcceEee-ecCCceeccccchhhcccEEEEEEEEEEEEEcCCc Confidence 2222 22 112222211222235567777763 4445667764321 123445555 Q ss_pred eEeeC Q lcl|Aclame:pro 396 VITVS 400 (400) Q Consensus 396 ~~~~~ 400 (400) |+.+. T Consensus 262 vv~~t 266 (274) T protein:vir:97 262 AVKIT 266 (274) T ss_pred eEEEe Confidence 55554 No 117 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=61.40 E-value=0.35 Score=23.06 Aligned_cols=241 Identities=15% Similarity=0.117 Sum_probs=100.6 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCcccccee-eecc-c--ceeeEEeec-cc-cccc Q lcl|Aclame:pro 101 KKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFH-VTNV-G--ALLVSRSFD-SA-NEAQ 174 (400) Q Consensus 101 ~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fh-V~n~-~--~~a~~i~l~-na-~~a~ 174 (400) |.|.-++ ..|+ +.|+-.-..+...+.+. -+|..+- +++. + ++--+--|. +. ..++ T Consensus 1 ma~~~T~----------------~~d~--iiPev~~~~v~~~~~~~-l~~~~~~~~d~~l~g~~G~tv~iP~~~~~g~a~ 61 (274) T protein:vir:94 1 MPQGLTK----------------TSDQ--IIPEVLAPMMQAQLEKK-LRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQ 61 (274) T ss_pred CCcccee----------------hhhe--echHHHHHHHHHhhhhh-hhhcccceecccccCCCCCEEEEeeecCCCccc Confidence 3333221 0011 33322222222222211 1111111 1110 0 011011110 11 1222 Q ss_pred eecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeec-cCCCccccch Q lcl|Aclame:pro 175 VHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG-DGTNGFKSID 253 (400) Q Consensus 175 GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~g-DG~~~t~~~~ 253 (400) -...|.+-+-+.++....++.....++-.+.-|...-+.+ ++.+..+++.++.++- |-++..+... .+.+- T Consensus 62 ~~~~g~~i~~~~lt~~~~~~~i~~~~~~~~i~D~~~~~~~--~dp~~~~~~~~a~a~a-~~vd~~~~~~l~~a~~----- 133 (274) T protein:vir:94 62 VVAEGEKIPTDILETKKREAKIRKIAKGTSITDEALLSGY--GDPQGEQVRQHGLAHA-NKVDNDVLEALMGAKL----- 133 (274) T ss_pred cccCCCcccccccccceeEEEeeeecceecccHHHHHhcc--chHHHHHHHHHHHHHH-HHHHHHHHHHHhccCc----- Confidence 2333444455555555555555444443444555554544 7778999999988887 4555443322 11100 Q ss_pred hhhhhhhhhhhhhhhhccCCCCCHHHHHHhh-ceecccccceEEEEecchhHHHHhh----hhhccccccceeecCCcce Q lcl|Aclame:pro 254 KEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKAEDRKALLDE----LRQATANANVRIKNDDTEI 328 (400) Q Consensus 254 ~e~D~~~ik~it~~at~~~~T~~~dal~Eal-d~~~~~~~~~~l~~~~~d~~a~~~~----~~~~~~~an~~lk~~d~~~ 328 (400) +. .+.+...+.+..|+ -|-......++|+||+.+...|+.. .-+++...+-.+. +|.+ T Consensus 134 -----------~~----~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~--~G~i 196 (274) T protein:vir:94 134 -----------TV----NADITKLNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIV--KGAF 196 (274) T ss_pred -----------cc----cccccCHHHHHHHHHHhhccCCCceEEEeCHHHHHHHHhhhhhhccccCccccccee--cccc Confidence 00 01122345666555 2223344678999999999988752 1122222221121 2222 Q ss_pred eehhhccccceecchhhhchhhhhcccceechhhceeccceeeeecCceEEEEecccc-------------cceeeccce Q lcl|Aclame:pro 329 ASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSG-------------HVETYNAGA 395 (400) Q Consensus 329 ~~~v~v~~~~~~tg~k~l~~t~~vd~k~~~~~~~~~~~~~~~~~~~~~~ilve~~~~~-------------~~~~~~~~~ 395 (400) ..-- |+ .|++|.+-..+-.=+...+++++- ++.-+.||+-..- -+-.+|... T Consensus 197 g~~~---------G~-----~Vi~s~~~p~~t~~l~~~gA~~~~-~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~ 261 (274) T protein:vir:94 197 GEAL---------GA-----IIVRTNKLEAGTAILAKKGAVKLI-LKRDFFLEVARDASTKTTALYSDKHYVAYLYDESK 261 (274) T ss_pred ceec---------Ce-----eEEEcCCCCcceEEEEeCcceEee-ecCCceeccccchhhcccEEEEEEEEEEEEEcCCc Confidence 2222 22 112222211222235567777763 4445667764321 123445555 Q ss_pred eEeeC Q lcl|Aclame:pro 396 VITVS 400 (400) Q Consensus 396 ~~~~~ 400 (400) |+.+. T Consensus 262 vv~~t 266 (274) T protein:vir:94 262 AVKIT 266 (274) T ss_pred eEEEe Confidence 55554 No 118 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=49.76 E-value=0.63 Score=21.69 Aligned_cols=265 Identities=15% Similarity=0.126 Sum_probs=109.2 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecccceeeE----Eeeccc-----c Q lcl|Aclame:pro 101 KKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALLVS----RSFDSA-----N 171 (400) Q Consensus 101 ~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a~~----i~l~na-----~ 171 (400) |- .- -|+|++.--+|. +..+|=+-|.+++.+|..+-|.++...... -.+..+ + T Consensus 1 mp-al----------tLaea~k~~~d~-------l~~~ViE~~~~~s~lL~~LpF~~veg~~~~ynR~~~~~~~~~~~v~ 62 (310) T protein:vir:97 1 MA-SV----------TLAESAKLAQDE-------LVAGVIENIITVNRMFDVLPFDSIEGNSLAYNRENVLGDVIMAGVG 62 (310) T ss_pred Cc-cc----------chHHHhhcCcch-------HHHHHHHHHhccchHHHhCCcccccCCcceeeEeeccCCccccccc Confidence 11 00 022222211232 333333344556777776555554443222 111111 1 Q ss_pred ccceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhc-CchhHHHHHHHHHHHHHHHHHHHhcceeeccCCC-cc Q lcl|Aclame:pro 172 EAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQ-MSYSELYNLIVAELTQAIVNKIVDLALVEGDGTN-GF 249 (400) Q Consensus 172 ~a~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~-g~ygalvnyvm~ELaq~fI~Rav~rAvv~gDG~~-~t 249 (400) ...+|.- . .....+|+.++-.-..+..--..-....++. +.+.+.+.+-++-...++- +-.+..+++||+.+ .| T Consensus 63 ~~~~~~g-~--~~~~~t~~~~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~-~~~e~~lINGD~a~n~F 138 (310) T protein:vir:97 63 TTFSGAG-A--GKAAATFTKVNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAG-RKYQDQLINGNGAGNEF 138 (310) T ss_pred ccccCCC-c--cccccccceeeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHH-HHHHHHhhccccCCCcc Confidence 2223321 1 1222333333222222222222212234553 3232223333333334444 46778899998742 22 Q ss_pred ccchhhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceecccc-cceEEEEecchhHHHHhhhhhccccccce-eecCCcc Q lcl|Aclame:pro 250 KSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTA-GRRYLIVKAEDRKALLDELRQATANANVR-IKNDDTE 327 (400) Q Consensus 250 ~~~~~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~~-~~~~l~~~~~d~~a~~~~~~~~~~~an~~-lk~~d~~ 327 (400) .--..--|.. +.|++-+ -++.++.|+|-|-||++.-+. .-.+|+.|+.-+.+|+--.|+++...-.- -.+..|. T Consensus 139 ~GL~~~~~~~--q~i~~~~--~gg~~t~d~LDeLl~~v~~~~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~~~~G~ 214 (310) T protein:vir:97 139 AGLIQLCASG--QKATTGA--TGSAISFAILDELMDLVVDKDGQVDYLTMHARTLRSYKALLRALGGASINEVVELPSGA 214 (310) T ss_pred cchhhcCCcc--ceeecCC--CCCCCCHHHHHHHHHHHhcCCCCCCEEEecHHHHHHHHHHHHHhcCCCCCCccccCCCC Confidence 1111111111 2222222 246677899999999985443 44789999988889998888876543221 2344444 Q ss_pred eeehhhccccceecchhhhchhhhhc------------ccceechhhc-eeccceeee-ecCceEEEEec--------cc Q lcl|Aclame:pro 328 IASEVGVDEIIVYTGSKALKPTVLVD------------QKYHIDMQDL-TKVDAFEWK-TNSNMILVETL--------TS 385 (400) Q Consensus 328 ~~~~v~v~~~~~~tg~k~l~~t~~vd------------~k~~~~~~~~-~~~~~~~~~-~~~~~ilve~~--------~~ 385 (400) .+-.. +--.+.|.+.+. .-|++.+-+- .+.|--|-. ..+.-|-|+.+ .+ T Consensus 215 ~v~~~---------~GiPi~~~d~ip~~~~~~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~G~~~~~~v~~ 285 (310) T protein:vir:97 215 EVPAY---------SGTPIFRNDYIPTNQTKGGTTGCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDVGESEDSDEHI 285 (310) T ss_pred EEeee---------CCeEEEEeCccCCCccccccCCceeEEEEeeCccccccceeccccCCccceeEEeCCcccCCccee Confidence 43111 111112221111 1122221100 000000100 11223556654 35 Q ss_pred ccceeeccceeEeeC Q lcl|Aclame:pro 386 GHVETYNAGAVITVS 400 (400) Q Consensus 386 ~~~~~~~~~~~~~~~ 400 (400) +.|+.|..-||..-. T Consensus 286 ~~V~~Y~~~av~~~~ 300 (310) T protein:vir:97 286 WRVKWYCGLALFSEK 300 (310) T ss_pred EEEEEeeeEEEeccc Confidence 667788544433221 No 119 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=44.90 E-value=0.68 Score=21.50 Aligned_cols=262 Identities=12% Similarity=-0.020 Sum_probs=92.1 Q ss_pred HHHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeecc---cce-eeEEeeccccccce Q lcl|Aclame:pro 100 LKKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNV---GAL-LVSRSFDSANEAQV 175 (400) Q Consensus 100 l~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~---~~~-a~~i~l~na~~a~G 175 (400) |+-.||. -.-..+|+..++++-++|+-.-..|...|+. ..+|..+-...+ ..+ .+.|--=..-.+.. T Consensus 1 ~~~~~~~--------~~~~~~~~~~t~~~~fiPev~s~~v~~~l~~-~lv~~~l~~~~~~~~~~GdTV~ip~~g~~~a~d 71 (381) T protein:vir:80 1 MATIQGT--------GGYKGSAVDLSNVQVFIPEVWSSEVRMFRDQ-KFAALEATKKIPFEGKKGDLIHIPNISRAAVYD 71 (381) T ss_pred Cceeccc--------ccccCcccchhhHHhhhhHHHHHHHHHHHHH-hhhhhhccccccceeecCceEEeeccCcceeee Confidence 4444443 1123567888888888898888888888874 555543211111 011 11111011223444 Q ss_pred ecccchhhhhhhhhhh-------hhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceee------ Q lcl|Aclame:pro 176 HKDGQTKTEQAATLTI-------DTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVE------ 242 (400) Q Consensus 176 Hk~ga~Kk~q~~~le~-------~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~------ 242 (400) |+.|..=..+...-.. ....+..|+..-.. ...| ++.+.+++++...+= |.+|.++.. T Consensus 72 ~~~g~~i~~~~~~~~~~~itID~~~~~~~~Idd~D~~-------~~~~-D~~~~~~~~~~~aLA-~~~D~~i~~~~~~~~ 142 (381) T protein:vir:80 72 KQPQTPVNLQARTDSEFTFTVTKYKESSFMIEDIVNT-------QASY-TLRQYYTKEAGYALA-RDMDNFALAHRAVIN 142 (381) T ss_pred ecCCCcccccccCCceEEEEEeeeeecceeechHHHH-------hhcc-ChHHHHHHHHHHHHH-HHHHHHHHHHHhhcc Confidence 5544422222222221 12233333332221 1111 334445555554442 333433321 Q ss_pred cc--CCCccccchhhhhhhhhhhhhhhhhccCCCC-CHHHHHHhhceecccccceEEEEecchhHHHHhhhhhcccc--c Q lcl|Aclame:pro 243 GD--GTNGFKSIDKEADVKKIKKITTKAKSAGKTP-FADAIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQATAN--A 317 (400) Q Consensus 243 gD--G~~~t~~~~~e~D~~~ik~it~~at~~~~T~-~~dal~Eald~~~~~~~~~~l~~~~~d~~a~~~~~~~~~~~--a 317 (400) .. +.+++.... +++.......+.... ..|+ ...++.++||=+.....+|||||++....+|+..-+-..+. + T Consensus 143 ~~~~~~~~t~~~~-i~~~~~~~~~t~~~~--~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~ 219 (381) T protein:vir:80 143 AFPSQRIYSYDTT-LGDGTVNAHLTGTPA--PLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQ 219 (381) T ss_pred ccccccccccccc-ccccccccccccchh--hHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhcc Confidence 11 111111111 111111110111000 0011 13455667765554445689999999999888643221111 0 Q ss_pred cceeecCCcceeehhhccccceecchhhhchhhhhcccceechhhceeccceeeeecCceEEEEecccccceeeccceeE Q lcl|Aclame:pro 318 NVRIKNDDTEIASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGHVETYNAGAVI 397 (400) Q Consensus 318 n~~lk~~d~~~~~~v~v~~~~~~tg~k~l~~t~~vd~k~~~~~~~~~~~~~~~~~~~~~~ilve~~~~~~~~~~~~~~~~ 397 (400) ..-| .+|.|..--|+ .|+. ++.| |..-. -.|++..- .-..-. ...--+...| -++|+++++- T Consensus 220 ~~~l--~~G~Ig~i~G~---~Vv~-Sn~l-p~~~~-t~~~~~ag--------ap~~~~-~~~~~~~~~g-~~s~~a~av~ 281 (381) T protein:vir:80 220 VKPV--TSGVVGTILGM---EVIV-TTQI-GINSL-TGYVNGQG--------APTQPT-PGVLGSPYLP-DQAGTANVVN 281 (381) T ss_pred chhh--hceeeeEEcce---EEEe-eccc-ccccc-cceeeecc--------cccccc-cccccccccc-ccccceeeee Confidence 0001 12333221111 1111 1111 11000 01111000 000000 0000000011 1233333333 Q ss_pred eeC Q lcl|Aclame:pro 398 TVS 400 (400) Q Consensus 398 ~~~ 400 (400) ++. T Consensus 282 ~~k 284 (381) T protein:vir:80 282 TGS 284 (381) T ss_pred eee Confidence 333 No 120 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=40.99 E-value=0.95 Score=20.71 Aligned_cols=257 Identities=12% Similarity=0.040 Sum_probs=112.9 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCcccccee-eecccceeeEEeec-cccccceecc Q lcl|Aclame:pro 101 KKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFH-VTNVGALLVSRSFD-SANEAQVHKD 178 (400) Q Consensus 101 ~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fh-V~n~~~~a~~i~l~-na~~a~GHk~ 178 (400) |.+-.. -.+-.| |-+.-|. .+.=+.--.-|.+||.. ..+|..+| +-.+..+.-..-+- ....+.+|+. T Consensus 1 ms~~~~-~tr~~~-------~~s~~d~-al~le~f~geV~~af~~-~s~~~~~~~~rti~~g~s~~~~~iG~~~~~~~~p 70 (335) T protein:vir:63 1 MSFLND-LTRPNY-------AGKNADV-DIHLEEHLGIVDKHFAY-TSKFAPLMNIRDLRGSNVVRLDRLGNVEAKGRRA 70 (335) T ss_pred CCCccc-chhhhc-------ccccchh-heehhhhhhhHHHHHHh-hhhhccccceeeeccceeEEEeeeeeeeeecccC Confidence 444422 334445 2222233 22225566778889985 88888788 55554443222222 2468899999 Q ss_pred cchhh-------hhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHH-HHhcceeeccCCCccc Q lcl|Aclame:pro 179 GQTKT-------EQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNK-IVDLALVEGDGTNGFK 250 (400) Q Consensus 179 ga~Kk-------~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~R-av~rAvv~gDG~~~t~ 250 (400) |+.=. +..++...+-+.+..||.+-..--| -++.++|+.---|...+..+.++-| ++..|-.....+.... T Consensus 71 G~~l~~~~~~~~k~~itVD~ll~a~~~I~dlDe~~~~-yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~ 149 (335) T protein:vir:63 71 GEELERSRVVNDKWNLTVDTLLYLRHQFDHQDEWTQS-FDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDA 149 (335) T ss_pred CcCcCCCCccccceEEEecceeechhhhhhHHHHhcC-chhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCC Confidence 98322 2244444555677778776444111 1234445555556666555554434 3333322221110000 Q ss_pred cchhhhhhhhhhhhhhhhhccCCCCCHHHHH-------Hhhceecc---cccceEEEEecchhHHHHhhhhhccccccce Q lcl|Aclame:pro 251 SIDKEADVKKIKKITTKAKSAGKTPFADAIE-------EAVDFVRP---TAGRRYLIVKAEDRKALLDELRQATANANVR 320 (400) Q Consensus 251 ~~~~e~D~~~ik~it~~at~~~~T~~~dal~-------Eald~~~~---~~~~~~l~~~~~d~~a~~~~~~~~~~~an~~ 320 (400) .-+-.+. .++ .+..+.+.++|+|- ++||-..+ ..++|+.+|.+.-.-+|+. +.+-.|.. T Consensus 150 ~~~G~~~-----~~~--~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~----~~~l~n~~ 218 (335) T protein:vir:63 150 FSPGVLE-----KLD--LTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLE----HDKLMNVE 218 (335) T ss_pred cCCCcce-----eee--eccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhc----cccccccc Confidence 0011111 111 11122233555543 55554443 2678999999888877776 44445554 Q ss_pred eec-------CCcceeehhhccccceecchhhhchhhhhcccceech-hhceeccceeeeecCceEEEEecccccceeec Q lcl|Aclame:pro 321 IKN-------DDTEIASEVGVDEIIVYTGSKALKPTVLVDQKYHIDM-QDLTKVDAFEWKTNSNMILVETLTSGHVETYN 392 (400) Q Consensus 321 lk~-------~d~~~~~~v~v~~~~~~tg~k~l~~t~~vd~k~~~~~-~~~~~~~~~~~~~~~~~ilve~~~~~~~~~~~ 392 (400) .-+ .+|.+..--|+. ++ .++.| |+....- | .+ +.+ ++.++ .+..-+.++ +. T Consensus 219 ~~~s~~~~~~~~g~v~~v~Gv~--V~--~sn~l-P~~~~t~--~-~lg~a~--n~~~~-d~~~~~~~~----------~~ 277 (335) T protein:vir:63 219 YQATGATNDYVKSRVAILNGVK--VL--ETPRF-ATKAIAA--H-PLGRHF--NVSAE-ESERQIALF----------LP 277 (335) T ss_pred cccccccccccCceeEEeeceE--EE--eeccC-CCCCccc--c-cccccC--Ccccc-ccceeEEEE----------Ee Confidence 321 123444444443 22 45555 5443211 1 01 000 11110 111111111 11 Q ss_pred cceeEeeC Q lcl|Aclame:pro 393 AGAVITVS 400 (400) Q Consensus 393 ~~~~~~~~ 400 (400) ..|++|+- T Consensus 278 ~~Al~t~~ 285 (335) T protein:vir:63 278 SKTLITAQ 285 (335) T ss_pred cceEEEEE Confidence 11222221 No 121 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=32.56 E-value=1.4 Score=19.75 Aligned_cols=241 Identities=14% Similarity=0.107 Sum_probs=95.9 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCcccccee-eecc---cceeeEEeec-cc-cccc Q lcl|Aclame:pro 101 KKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFH-VTNV---GALLVSRSFD-SA-NEAQ 174 (400) Q Consensus 101 ~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fh-V~n~---~~~a~~i~l~-na-~~a~ 174 (400) |.+.-++ ..|+ +.|+=.-..+...+.. .-+|+.+- +.+. .++--+--|. +. -.+. T Consensus 1 m~~~~T~----------------l~d~--i~Pev~~~~v~~~~~~-~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~ 61 (274) T protein:vir:96 1 MAQGMTK----------------LTNQ--IVPEVLAPMMQAELEK-KLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAK 61 (274) T ss_pred CCcceee----------------hhhe--echHHHHHHHHHHHHh-hhhccccceecccccCCCCCEEEeeeecCCCccc Confidence 2222210 0011 3343222222222221 22222221 1111 0000000000 00 0112 Q ss_pred eecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeec-cCCCccccch Q lcl|Aclame:pro 175 VHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG-DGTNGFKSID 253 (400) Q Consensus 175 GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~g-DG~~~t~~~~ 253 (400) -...|.+-+.+.++....++.-...|+-.+.-|...-+. +++.+..+++.++.++- |-++..+... .+...+. T Consensus 62 ~~~~g~~i~~~~lt~~~~~~~i~~~~~a~~i~D~~~~~~--~~d~~~~~~~~~~~~~a-~~vd~~i~~~l~~a~~~~--- 135 (274) T protein:vir:96 62 VVAEGEKIPTDILETKKREAKIRKIAKGTSISDEALLSG--YGDPQGEQVRQHGLAHA-NKVDDDVLEALKSAKLTV--- 135 (274) T ss_pred cccCCCccchhhcccceeEEEeeeeecceeehHHHHhhc--cchHHHHHHHHHHHHHH-HHHHHHHHHHHhcccccc--- Confidence 222233334444444444444444454444455554444 37788889998888877 4555443322 1111100 Q ss_pred hhhhhhhhhhhhhhhhccCCCCCHHHHHHhh-ceecccccceEEEEecchhHHHHhhh----hhccccccceeecCCcce Q lcl|Aclame:pro 254 KEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKAEDRKALLDEL----RQATANANVRIKNDDTEI 328 (400) Q Consensus 254 ~e~D~~~ik~it~~at~~~~T~~~dal~Eal-d~~~~~~~~~~l~~~~~d~~a~~~~~----~~~~~~an~~lk~~d~~~ 328 (400) .+++...+.+..|+ -|-......++||||+.....|+... -.++...+..+. +|.| T Consensus 136 -----------------~~~~~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~--~G~i 196 (274) T protein:vir:96 136 -----------------EADITKLTGLQTAIDKFNDEDLEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIV--KGAF 196 (274) T ss_pred -----------------cccccCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhhcccccccccccccccee--cccc Confidence 01122344455554 22223346789999999998887521 011111111110 1111 Q ss_pred eehhhccccceecchhhhchhhhhcccceechhhceeccceeeeecCceEEEEeccccc-------------ceeeccce Q lcl|Aclame:pro 329 ASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGH-------------VETYNAGA 395 (400) Q Consensus 329 ~~~v~v~~~~~~tg~k~l~~t~~vd~k~~~~~~~~~~~~~~~~~~~~~~ilve~~~~~~-------------~~~~~~~~ 395 (400) -. +-|+. |++|.+....-.=+...+++++- ++..+-||+....- +-.||... T Consensus 197 g~---------~~G~~-----Vi~s~~~~~~t~~l~~~gA~~~~-~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~ 261 (274) T protein:vir:96 197 GE---------ALGAV-----IVRSNKLEAGTAILAKKGAVKLI-TKRDFFLETDRDPSTKTTALYSDKHYVAYLYDESK 261 (274) T ss_pred ce---------ecCeE-----EEEeCCCCCceEEEEeccceeee-ecCCcccccccccccccCEEEEeEEEEEEEEcCCc Confidence 11 11332 22222222222235677888874 34456677754321 22344444 Q ss_pred eEeeC Q lcl|Aclame:pro 396 VITVS 400 (400) Q Consensus 396 ~~~~~ 400 (400) ++.+. T Consensus 262 ~v~~t 266 (274) T protein:vir:96 262 AVKIT 266 (274) T ss_pred EEEEE Confidence 44333 No 122 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=32.56 E-value=1.4 Score=19.75 Aligned_cols=241 Identities=14% Similarity=0.107 Sum_probs=95.9 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCcccccee-eecc---cceeeEEeec-cc-cccc Q lcl|Aclame:pro 101 KKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFH-VTNV---GALLVSRSFD-SA-NEAQ 174 (400) Q Consensus 101 ~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fh-V~n~---~~~a~~i~l~-na-~~a~ 174 (400) |.+.-++ ..|+ +.|+=.-..+...+.. .-+|+.+- +.+. .++--+--|. +. -.+. T Consensus 1 m~~~~T~----------------l~d~--i~Pev~~~~v~~~~~~-~l~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~ 61 (274) T protein:vir:95 1 MAQGMTK----------------LTNQ--IVPEVLAPMMQAELEK-KLRFASFAEIDNTLVGQPGDTLTFPAFIYSGDAK 61 (274) T ss_pred CCcceee----------------hhhe--echHHHHHHHHHHHHh-hhhccccceecccccCCCCCEEEeeeecCCCccc Confidence 2222210 0011 3343222222222221 22222221 1111 0000000000 00 0112 Q ss_pred eecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeec-cCCCccccch Q lcl|Aclame:pro 175 VHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG-DGTNGFKSID 253 (400) Q Consensus 175 GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~g-DG~~~t~~~~ 253 (400) -...|.+-+.+.++....++.-...|+-.+.-|...-+. +++.+..+++.++.++- |-++..+... .+...+. T Consensus 62 ~~~~g~~i~~~~lt~~~~~~~i~~~~~a~~i~D~~~~~~--~~d~~~~~~~~~~~~~a-~~vd~~i~~~l~~a~~~~--- 135 (274) T protein:vir:95 62 VVAEGEKIPTDILETKKREAKIRKIAKGTSISDEALLSG--YGDPQGEQVRQHGLAHA-NKVDDDVLEALKSAKLTV--- 135 (274) T ss_pred cccCCCccchhhcccceeEEEeeeeecceeehHHHHhhc--cchHHHHHHHHHHHHHH-HHHHHHHHHHHhcccccc--- Confidence 222233334444444444444444454444455554444 37788889998888877 4555443322 1111100 Q ss_pred hhhhhhhhhhhhhhhhccCCCCCHHHHHHhh-ceecccccceEEEEecchhHHHHhhh----hhccccccceeecCCcce Q lcl|Aclame:pro 254 KEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKAEDRKALLDEL----RQATANANVRIKNDDTEI 328 (400) Q Consensus 254 ~e~D~~~ik~it~~at~~~~T~~~dal~Eal-d~~~~~~~~~~l~~~~~d~~a~~~~~----~~~~~~an~~lk~~d~~~ 328 (400) .+++...+.+..|+ -|-......++||||+.....|+... -.++...+..+. +|.| T Consensus 136 -----------------~~~~~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~--~G~i 196 (274) T protein:vir:95 136 -----------------EADITKLTGLQTAIDKFNDEDLEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIV--KGAF 196 (274) T ss_pred -----------------cccccCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhhcccccccccccccccee--cccc Confidence 01122344455554 22223346789999999998887521 011111111110 1111 Q ss_pred eehhhccccceecchhhhchhhhhcccceechhhceeccceeeeecCceEEEEeccccc-------------ceeeccce Q lcl|Aclame:pro 329 ASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGH-------------VETYNAGA 395 (400) Q Consensus 329 ~~~v~v~~~~~~tg~k~l~~t~~vd~k~~~~~~~~~~~~~~~~~~~~~~ilve~~~~~~-------------~~~~~~~~ 395 (400) -. +-|+. |++|.+....-.=+...+++++- ++..+-||+....- +-.||... T Consensus 197 g~---------~~G~~-----Vi~s~~~~~~t~~l~~~gA~~~~-~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~ 261 (274) T protein:vir:95 197 GE---------ALGAV-----IVRSNKLEAGTAILAKKGAVKLI-TKRDFFLETDRDPSTKTTALYSDKHYVAYLYDESK 261 (274) T ss_pred ce---------ecCeE-----EEEeCCCCCceEEEEeccceeee-ecCCcccccccccccccCEEEEeEEEEEEEEcCCc Confidence 11 11332 22222222222235677888874 34456677754321 22344444 Q ss_pred eEeeC Q lcl|Aclame:pro 396 VITVS 400 (400) Q Consensus 396 ~~~~~ 400 (400) ++.+. T Consensus 262 ~v~~t 266 (274) T protein:vir:95 262 AVKIT 266 (274) T ss_pred EEEEE Confidence 44333 No 123 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=30.52 E-value=1.6 Score=19.51 Aligned_cols=240 Identities=17% Similarity=0.136 Sum_probs=110.2 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeeccccee-----eEEee-ccc-ccc Q lcl|Aclame:pro 101 KKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALL-----VSRSF-DSA-NEA 173 (400) Q Consensus 101 ~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a-----~~i~l-~na-~~a 173 (400) |.+.- ++--.-|.|+=+=-.+...+++ --+|+++- ..+..+. -+--| =+. ..+ T Consensus 1 Ma~~~------------------T~l~d~i~Pev~~~~v~~~~~~-~~~~~~~~-~~~~~l~g~~G~ti~iP~~~~igda 60 (276) T protein:vir:10 1 MAQGT------------------TTKSTQIVPEVLAPMMQAELDK-KLRFAQFA-DIDSTLVGQPGDTLTFPAFVYSGDA 60 (276) T ss_pred CCcce------------------eehhhhhchHHHHHHHHHHHHh-hhhhcccc-eecccccCCCCCEEEeeeecCCCcc Confidence 22211 1111114454433333444432 23333332 1111111 00000 011 122 Q ss_pred ceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeec-cCCCccccc Q lcl|Aclame:pro 174 QVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG-DGTNGFKSI 252 (400) Q Consensus 174 ~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~g-DG~~~t~~~ 252 (400) .-...|.+-+.+.++....++.....++--..-|....+.+ ++.+..+.+.++.++- |-++..+... +|.+ T Consensus 61 ~~~~eg~~i~~~~lt~~~~~a~i~~~~k~~~~tD~a~~~~~--~dp~~~~~~~~~~~~a-~~~d~~~~~~l~~~~----- 132 (276) T protein:vir:10 61 TVVPEGQKIPVDKIETNRREAKIHKIGKGTDITDEALLSGY--GDPQGEAVRQHGLAIA-NKVDNDVLEALRGTK----- 132 (276) T ss_pred ccccCCCccCccccccceeeEEeehccccccccHHHHHhhc--cchHHHHHHHHHHHHH-HHHHHHHHHHHhccc----- Confidence 22344555556666666666655555555555666665555 7789999999998877 4444332211 1111 Q ss_pred hhhhhhhhhhhhhhhhhccCCCCCHHHHHHhh-ceecccccceEEEEecchhHHHHh----hhhhccccccceeecCCcc Q lcl|Aclame:pro 253 DKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKAEDRKALLD----ELRQATANANVRIKNDDTE 327 (400) Q Consensus 253 ~~e~D~~~ik~it~~at~~~~T~~~dal~Eal-d~~~~~~~~~~l~~~~~d~~a~~~----~~~~~~~~an~~lk~~d~~ 327 (400) .+-.+++...|.+..|+ -|-.-....++|++|+.....|+. +.-+++...+..+. +|. T Consensus 133 ---------------~~~~~~~~t~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~--~G~ 195 (276) T protein:vir:10 133 ---------------LTVSADIGTLAGLEAAIDTFDDEDLEPMVLFINPKDAGKLRSSASDNFTRATELGDNIIV--KGA 195 (276) T ss_pred ---------------ccccccccCHHHHHHHHHHhccccCcccEEEEcHHHHHHHHHhcccccccccccccccee--ccc Confidence 11112233456666655 222223456899999999988874 22333333333332 233 Q ss_pred eeehhhccccceecchhhhchhhhhcccceechhhceeccceeeeecCceEEEEecccc-------------cceeeccc Q lcl|Aclame:pro 328 IASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSG-------------HVETYNAG 394 (400) Q Consensus 328 ~~~~v~v~~~~~~tg~k~l~~t~~vd~k~~~~~~~~~~~~~~~~~~~~~~ilve~~~~~-------------~~~~~~~~ 394 (400) |....|+ . .+-+.-++..+.+ +...|++++-. +..+.||+-... -+-.||+. T Consensus 196 ig~~~G~---------~-Vi~s~~~p~~t~~----l~~~gAi~~~~-~~~~~vE~dRd~~~~~d~i~~~~~y~~~~~~~~ 260 (276) T protein:vir:10 196 FGEALGA---------V-IVRSKKLDEGEAI----LAKRGAVKLIT-KRDFFLETDRDPSTKTTALYSDKHYVAYLYDES 260 (276) T ss_pred cceecce---------e-EEEcCCCCcceEE----EEeccceeeee-cCCceeecccchhhcccEEEEeeEEEEEEEcCc Confidence 3333332 1 1112222222222 66778888744 455667875432 12334555 Q ss_pred eeEeeC Q lcl|Aclame:pro 395 AVITVS 400 (400) Q Consensus 395 ~~~~~~ 400 (400) .|+.+. T Consensus 261 ~vv~~t 266 (276) T protein:vir:10 261 KAVKVT 266 (276) T ss_pred ceEEEe Confidence 555554 No 124 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=29.34 E-value=1.4 Score=19.85 Aligned_cols=280 Identities=15% Similarity=0.135 Sum_probs=102.0 Q ss_pred HHHccCCh-hHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCcccccee-eecccceeeEEeecc-cccccee Q lcl|Aclame:pro 100 LKKNSGKS-EIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFH-VTNVGALLVSRSFDS-ANEAQVH 176 (400) Q Consensus 100 l~~nqg~k-e~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fh-V~n~~~~a~~i~l~n-a~~a~GH 176 (400) |++..+++ --+..|. |. ..|.-.+.=+--+.-+..+|.. ..+|..+| +.++..+..+.-+.- ...+.+| T Consensus 1 m~~~~~~~~~t~~g~~------~~-~~d~~al~ik~f~~eV~~~f~~-~s~~~~~~~~r~i~~G~sv~i~~iG~~tv~~~ 72 (347) T protein:vir:94 1 MANVPGQKIGTDQGKG------KS-SSDALALFLKVFAGEVLTAFTR-RSVTADKHIVRTIQNGKSAQFPVMGRTSGVYL 72 (347) T ss_pred CCCCCccccccccccC------Cc-cccHHHHHHHHHhHHHHHHHHH-HHhhhcccccccccccceEEEecccceeeeee Confidence 33332321 1222332 11 1121111114455667778884 67887777 555554443333322 4678888 Q ss_pred cccchh-------hhhhhhhh--hhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHH--hcce----- Q lcl|Aclame:pro 177 KDGQTK-------TEQAATLT--IDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIV--DLAL----- 240 (400) Q Consensus 177 k~ga~K-------k~q~~~le--~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav--~rAv----- 240 (400) +.|+.= +....+|. ..-.....||..-+. .-.-++.++|..---|.+.+-.+.+|-+.+ -.+- T Consensus 73 t~G~~l~~~~~~~~~~e~~itID~~~~~~~~VddiD~~-q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~ 151 (347) T protein:vir:94 73 APGERLSDKRKGIKHTEKVITIDGLLTADVMIFDIEDA-MNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASN 151 (347) T ss_pred cCCCCcCCCCCCCCcceEEEEecchhhhhHHhhhHHHH-hcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 887732 12222233 222335555544333 111134444555556666666665552211 0111 Q ss_pred --eeccCCCccccchhhhhhhhhhhhhhhhhccCCCCCHHHH---HHhhceecccccceEEEEecchhHHHHhhhhhccc Q lcl|Aclame:pro 241 --VEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAI---EEAVDFVRPTAGRRYLIVKAEDRKALLDELRQATA 315 (400) Q Consensus 241 --v~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~T~~~dal---~Eald~~~~~~~~~~l~~~~~d~~a~~~~~~~~~~ 315 (400) +.|+|.+........+|..+. ........++| .+.||=+.+....||+||.+.-..+|+...+-... T Consensus 152 ~~~~g~~~~s~~~~~~~~~~~~~--------~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~ 223 (347) T protein:vir:94 152 ENIAGLGTASVLEVGKKADLDTP--------AKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAA 223 (347) T ss_pred cccCCCcccceeeccccccccch--------hhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhh Confidence 111111110000001110000 00000112333 45566666665679999999988888764321111 Q ss_pred cccceeecCCcceeehhhccccceecchhhhch----hhhhcc------------------cceechhhcee-------- Q lcl|Aclame:pro 316 NANVRIKNDDTEIASEVGVDEIIVYTGSKALKP----TVLVDQ------------------KYHIDMQDLTK-------- 365 (400) Q Consensus 316 ~an~~lk~~d~~~~~~v~v~~~~~~tg~k~l~~----t~~vd~------------------k~~~~~~~~~~-------- 365 (400) ....-.--.+|.|..--|+. ||. |+.|-. ....+. +|..|.+.-+. T Consensus 224 ~~~~~~~~~~G~Vg~i~G~~---V~~-Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~ 299 (347) T protein:vir:94 224 NYAALIDPETGNIRNVMGFV---VVE-VPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAV 299 (347) T ss_pred hccccccccccceEEEeceE---EEe-cCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhh Confidence 11111111234443333321 111 122210 011111 12222111110 Q ss_pred ---------ccceeeeecCceEEEEecccccceeec-cceeEeeC Q lcl|Aclame:pro 366 ---------VDAFEWKTNSNMILVETLTSGHVETYN-AGAVITVS 400 (400) Q Consensus 366 ---------~~~~~~~~~~~~ilve~~~~~~~~~~~-~~~~~~~~ 400 (400) ...+.-+-.++..++=.+.-||--..- +-++|+.+ T Consensus 300 ~~v~~~~~~~e~~r~~~~~~d~i~~~~~~G~~~~rP~~a~~~~~~ 344 (347) T protein:vir:94 300 GTVKLRDLALERDRDVDAQGDLIVGKYAMGHGGLRPEAAGALVFS 344 (347) T ss_pred hhhhcccccccchhchhhHHHHhhhhhhhcCcccccceeEEEEec Confidence 000000111111111112222211111 11223333 No 125 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=28.36 E-value=1.8 Score=19.24 Aligned_cols=280 Identities=12% Similarity=0.105 Sum_probs=105.4 Q ss_pred HHHccCCh--hHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCcccccee-eecccceeeEEeecc-ccccce Q lcl|Aclame:pro 100 LKKNSGKS--EIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFH-VTNVGALLVSRSFDS-ANEAQV 175 (400) Q Consensus 100 l~~nqg~k--e~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fh-V~n~~~~a~~i~l~n-a~~a~G 175 (400) |+..++.. .-+..|. |..+ |...+.=+.--.-|..+|.. ..+|.-+| +.++..+.-+.-+.- ...+.+ T Consensus 1 ma~~~~~~~~~t~~g~~------~~~~-d~~al~ie~~~geV~~~f~~-~s~~~~~~~~rti~~G~sv~~~~iG~~~~~~ 72 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKG------MSAG-DKLALFLKVFGGEVLTAFTR-TSVTMNKHLVRSIQSGKSAQFPVLGRTKAAY 72 (347) T ss_pred CCccccccccccccccC------Cccc-chHHHHHHHHhHHHHHHHHH-HHhhhhhhhheeccccceEEeeeccceeEee Confidence 22233322 2344443 2222 32222225566677888885 57777677 555544432222222 456677 Q ss_pred ecccchh--h-----hhhhhhhhhh--ccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHH-HHhccee---- Q lcl|Aclame:pro 176 HKDGQTK--T-----EQAATLTIDT--LEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNK-IVDLALV---- 241 (400) Q Consensus 176 Hk~ga~K--k-----~q~~~le~~t--i~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~R-av~rAvv---- 241 (400) |+.|+.= + ....++.+++ .....||..-+. .---++.+.|+.---|.|.+-.+.+|-+ +...|-. T Consensus 73 ~~~G~~l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~-q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~ 151 (347) T protein:vir:94 73 LQPGENLDDKRKDMKHTEKTINIDGLLTADVLIYDIEDA-MNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTAN 151 (347) T ss_pred eecCcCCCCCcCCccccceEEEEcchhhhhhhhhhHHHH-hcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc Confidence 7777632 2 2222233332 344455544322 0001344445555556666665544422 1111110 Q ss_pred --eccCCCccccchhhhhhhhhhhhhhhhhcc-CCCCCHHHHH---HhhceecccccceEEEEecchhHHHHhhhhhccc Q lcl|Aclame:pro 242 --EGDGTNGFKSIDKEADVKKIKKITTKAKSA-GKTPFADAIE---EAVDFVRPTAGRRYLIVKAEDRKALLDELRQATA 315 (400) Q Consensus 242 --~gDG~~~t~~~~~e~D~~~ik~it~~at~~-~~T~~~dal~---Eald~~~~~~~~~~l~~~~~d~~a~~~~~~~~~~ 315 (400) ..+|.+........+. .+.+++.. ......|+++ +.||=..+....||+||.+...-+|+.-++-.+. T Consensus 152 ~~~~~g~~~~~~v~i~~~------~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~~ 225 (347) T protein:vir:94 152 NENIAGLGKAHVLEVGDQ------ATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILAALMPNAA 225 (347) T ss_pred ccccccCCcceeEeeecc------ccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHHhhccccc Confidence 0111111001111111 01111000 0011123343 5555555555679999999999999876555555 Q ss_pred cccceeecCCcceeehhhcc----c-cceec-chhhhchh-----------hhhcccceechhhceeccceeeeecCceE Q lcl|Aclame:pro 316 NANVRIKNDDTEIASEVGVD----E-IIVYT-GSKALKPT-----------VLVDQKYHIDMQDLTKVDAFEWKTNSNMI 378 (400) Q Consensus 316 ~an~~lk~~d~~~~~~v~v~----~-~~~~t-g~k~l~~t-----------~~vd~k~~~~~~~~~~~~~~~~~~~~~~i 378 (400) +.|.-.-..+|.|..--|+. + ..+-. |...+-+. .-...+|.+|+++-+.+ -|..... T Consensus 226 ~~~~~~~~~~G~V~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l-----~~~~~A~ 300 (347) T protein:vir:94 226 NYQALIDPSTGSIRNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGL-----FNHRSAV 300 (347) T ss_pred ccccccccccceeEEeeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEE-----Eechhhh Confidence 55554333445555533321 1 11100 11111111 01123444443332200 0000000 Q ss_pred -EEEeccccccee-ecc---ceeEeeC Q lcl|Aclame:pro 379 -LVETLTSGHVET-YNA---GAVITVS 400 (400) Q Consensus 379 -lve~~~~~~~~~-~~~---~~~~~~~ 400 (400) .||+.. =.+|+ |.. |-+|.-. T Consensus 301 ~tv~~~~-~~~e~~~~~~~~~~~i~~~ 326 (347) T protein:vir:94 301 GTVKLKD-MALERARRANFQADQIIAK 326 (347) T ss_pred hhhhhcc-cceeeeechhhhhhhhhhh Confidence 011000 00000 000 0000000 No 126 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=27.56 E-value=1.8 Score=19.14 Aligned_cols=240 Identities=14% Similarity=0.102 Sum_probs=98.7 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCccccceeeeccccee-----eEEeec-cc-ccc Q lcl|Aclame:pro 101 KKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNVGALL-----VSRSFD-SA-NEA 173 (400) Q Consensus 101 ~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fhV~n~~~~a-----~~i~l~-na-~~a 173 (400) |.|.-+ +-.+=+.|+=.-..+...+.+ .-+|+.+= ..+..++ -+--|. +. -.+ T Consensus 1 ma~~~T------------------~l~d~iiPev~~~~v~~~~~~-~l~~~~~~-~~d~~l~g~~G~tv~iP~~~~ig~a 60 (274) T protein:vir:12 1 MAQGLT------------------KTSNQIIPEVLAPMMQAQLEK-KLRFASFA-EVDSTLQGQPGDTLTFPAFVYSGDA 60 (274) T ss_pred CCccee------------------ehhhhhchHHHHHHHHHHHHh-hhhhcccc-eecccccCCCCCEEEEeeecCCCcc Confidence 222221 111114454322222222222 22222221 1111111 000000 00 011 Q ss_pred ceecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeec-cCCCccccc Q lcl|Aclame:pro 174 QVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG-DGTNGFKSI 252 (400) Q Consensus 174 ~GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~g-DG~~~t~~~ 252 (400) .-...|.+-+.+.++....++.-...|+-...-|....+.+ ++.+..+++.++.++- |-++..+... .+.+. T Consensus 61 ~~~~~g~~i~~~~lt~~~~~~~i~~~~~~~~i~D~~~~~~~--~d~~~~~~~q~~~~~a-~~vd~~~l~~~~~a~~---- 133 (274) T protein:vir:12 61 QVVAEGEKIPTDILETKKREAKIRKIAKGTSITDEALLSGY--GDPQGEQVRQHGLAHA-NKVDNDVLEALMGAKL---- 133 (274) T ss_pred ccccCCCccchhhcccceeeEEeeeecceeeecHHHHHhcc--cchHHHHHHHHHHHHH-HHHHHHHHHHHhcccc---- Confidence 11222333344444444444444444444444555544444 7778888888888877 4555443322 11100 Q ss_pred hhhhhhhhhhhhhhhhhccCCCCCHHHHHHhh-ceecccccceEEEEecchhHHHHhhh----hhccccccceeecCCcc Q lcl|Aclame:pro 253 DKEADVKKIKKITTKAKSAGKTPFADAIEEAV-DFVRPTAGRRYLIVKAEDRKALLDEL----RQATANANVRIKNDDTE 327 (400) Q Consensus 253 ~~e~D~~~ik~it~~at~~~~T~~~dal~Eal-d~~~~~~~~~~l~~~~~d~~a~~~~~----~~~~~~an~~lk~~d~~ 327 (400) +...+....|.+..|+ -|-......++|+||+.+...|+... -.++.-.+-.+ .+|. T Consensus 134 ----------------~~~~~a~~~d~i~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~--~~G~ 195 (274) T protein:vir:12 134 ----------------TVNADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFTRATELGDDII--VKGA 195 (274) T ss_pred ----------------cccccccCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhhhhhhccccccccccce--eccc Confidence 0011122345555555 23333446789999999988887521 11111111001 0122 Q ss_pred eeehhhccccceecchhhhchhhhhcccceechhhceeccceeeeecCceEEEEeccccc-------------ceeeccc Q lcl|Aclame:pro 328 IASEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGH-------------VETYNAG 394 (400) Q Consensus 328 ~~~~v~v~~~~~~tg~k~l~~t~~vd~k~~~~~~~~~~~~~~~~~~~~~~ilve~~~~~~-------------~~~~~~~ 394 (400) |... -|+. |++|.+....-.=+...|++++-. +.-+-||+....- +-.||.. T Consensus 196 ig~~---------~G~~-----Vi~s~~~p~~t~~l~~~gA~~~~~-~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~ 260 (274) T protein:vir:12 196 FGEA---------LGAI-----IVRSNKLEAGTAILAKKGAVKLIL-KRDFFLEVARDASTKTTALYSDKHYVAYLYDES 260 (274) T ss_pred ceee---------cCee-----EEEeCCCCcceEEEEeccceeeee-cCCceeccccchhhcccEEEeeeEEEEEEEcCC Confidence 2111 1332 222222222222366788888754 4446678754321 2334555 Q ss_pred eeEeeC Q lcl|Aclame:pro 395 AVITVS 400 (400) Q Consensus 395 ~~~~~~ 400 (400) .|+.+. T Consensus 261 ~vv~~t 266 (274) T protein:vir:12 261 KAVKIT 266 (274) T ss_pred ceEEEE Confidence 555554 No 127 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=24.31 E-value=2.2 Score=18.71 Aligned_cols=242 Identities=16% Similarity=0.112 Sum_probs=105.8 Q ss_pred HHhCccchhhhHhhcchhHHHHHHHHHHhhCcccccee-eecccc---ee-eEEeeccc-cccceecccchhhhhhhhhh Q lcl|Aclame:pro 117 LAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFH-VTNVGA---LL-VSRSFDSA-NEAQVHKDGQTKTEQAATLT 190 (400) Q Consensus 117 L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fh-V~n~~~---~a-~~i~l~na-~~a~GHk~ga~Kk~q~~~le 190 (400) .+..+. ++--+-|.|+=.=..+...+.. .-+|+++- +.+.-. +- +.+-.=+. -.+.-...|.+=+.+.++.. T Consensus 1 ~~~~~~-T~l~d~i~PEv~~~~v~~~~~~-~~~~~~~~~~~~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~~lt~~ 78 (275) T protein:vir:96 1 MALENM-TKLANMVNPEVLAPMMQAELDK-KLKFAQFADIDNTLVGQPGNTITFPAFVYSGDAKVVPEGEEIPIDLIETK 78 (275) T ss_pred CCCccc-chhhhhhchHHHHHHHHHHHHH-hhhhcccceecccccCCCCCEEEeeeeccCCccccccCCCCcchhhcccc Confidence 222221 1222224454433333444432 33334332 222100 10 11100010 11222233334444555555 Q ss_pred hhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeec-cCCCccccchhhhhhhhhhhhhhhhh Q lcl|Aclame:pro 191 IDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG-DGTNGFKSIDKEADVKKIKKITTKAK 269 (400) Q Consensus 191 ~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~g-DG~~~t~~~~~e~D~~~ik~it~~at 269 (400) ..++.....|+..+.-|....+.+ ++.+..+.+.++.++- |-++-.+... .+.+. + T Consensus 79 ~~~~~i~~~~~~~~i~D~~~~~~~--~d~~~~~~~~~a~~~a-~~~d~~ll~~l~~a~~--------------------~ 135 (275) T protein:vir:96 79 KRQATIRKIGKGTVLTDEALLSGY--GDPKGEAVRQHGLAIA-NKVDNDVLEALQGATL--------------------K 135 (275) T ss_pred eeeEEeehhcccccccHHHHHhhc--cchHHHHHHHHHHHHH-HHHHHHHHHHHhcccc--------------------c Confidence 555555555555555666655554 6778888888888776 4444333211 11100 0 Q ss_pred ccCCCCCHHHHHHhh-ceecccccceEEEEecchhHHHHhh----hhhccccccceeecCCcceeehhhccccceecchh Q lcl|Aclame:pro 270 SAGKTPFADAIEEAV-DFVRPTAGRRYLIVKAEDRKALLDE----LRQATANANVRIKNDDTEIASEVGVDEIIVYTGSK 344 (400) Q Consensus 270 ~~~~T~~~dal~Eal-d~~~~~~~~~~l~~~~~d~~a~~~~----~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~tg~k 344 (400) ..++....|.+..|+ -|-......++|+||+++...||.. .-+++...+-.+. +|.|-..- |+. T Consensus 136 ~~~~~~~~d~i~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~--~G~ig~~~---------G~~ 204 (275) T protein:vir:96 136 VEADITKLAGLQTAIDKFNDEDLEPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIV--KGAFGEAL---------GAI 204 (275) T ss_pred ccccccCHHHHHHHHHHhccccCCccEEEeCHHHHHHHHhccccccccccccccccee--ccccceec---------Cee Confidence 011223456666666 3333445678999999999888652 1222222221111 22222222 332 Q ss_pred hhchhhhhcccceechhhceeccceeeeecCceEEEEecccc-------------cceeeccceeEeeC Q lcl|Aclame:pro 345 ALKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSG-------------HVETYNAGAVITVS 400 (400) Q Consensus 345 ~l~~t~~vd~k~~~~~~~~~~~~~~~~~~~~~~ilve~~~~~-------------~~~~~~~~~~~~~~ 400 (400) -+ -+.-++..+ .=+...+++++-. +..+.||+.... -+-.||...|+.+. T Consensus 205 Vi-~s~~~p~~t----~~i~~~gA~~~~~-~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t 267 (275) T protein:vir:96 205 IV-RSNKIKEGE----AILAKRGAVKLIT-KRDFFLETERHASHKSTALFSDKHYVAYLYDESKVVKIT 267 (275) T ss_pred EE-EeCCCCcce----EEEEeccceeeee-cCCcccccccchhhcCcEEEEeEEEEEEEEcCccEEEEE Confidence 11 111122222 1245677888744 444667775431 12334555555554 No 128 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=22.04 E-value=2.5 Score=18.39 Aligned_cols=249 Identities=13% Similarity=0.019 Sum_probs=94.2 Q ss_pred HHccCChhHHHHHHHHHHhCccchhhhHhhcchhHHHHHHHHHHhhCcccccee-eeccc---ceeeEEeec-cc-cccc Q lcl|Aclame:pro 101 KKNSGKSEIKNAWSAKLAENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFH-VTNVG---ALLVSRSFD-SA-NEAQ 174 (400) Q Consensus 101 ~~nqg~ke~k~AW~a~L~ekgV~~qd~~eiLP~~iI~AIe~A~ed~d~vl~~fh-V~n~~---~~a~~i~l~-na-~~a~ 174 (400) |.+..+ +--+-+.|+-+-..+...++. ..+|.++- +.+.- ++--+.-|. +. ..++ T Consensus 1 Ma~~~T------------------~~~~~iiPev~s~~v~~~~~~-~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~ 61 (278) T protein:vir:80 1 MADLTT------------------KLANLIDPEVMGPMISAKLPK-AIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQ 61 (278) T ss_pred CCCcce------------------ehhheecHHHHHHHHHHHHHH-hhhhcccceecccccCCCCCEEEEeeeccCCcce Confidence 333221 111113343333333333332 22222221 11100 000000000 00 0011 Q ss_pred eecccchhhhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHHHHHHHHHHHHHHHHHhcceeec-cCCCccccch Q lcl|Aclame:pro 175 VHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKIVDLALVEG-DGTNGFKSID 253 (400) Q Consensus 175 GHk~ga~Kk~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvnyvm~ELaq~fI~Rav~rAvv~g-DG~~~t~~~~ 253 (400) -=..|.+=+.+.++.....+.....++..+.-|+...+.+ ++++..++++++.++- |-++..+... .|.+-...-. T Consensus 62 ~~~~g~~i~~~~lt~~~~~~~i~~~~~a~~v~D~~~~~~~--~d~~~~~~~~~a~~~a-~~~d~~l~~~l~~a~~~~~~~ 138 (278) T protein:vir:80 62 DVAEGAAIDYSALETESVKHGIKKAGKGVKLTDESVLSGY--GDPVEEAQKQIRMAIA-SKVDNDILEEALTTTLEVKGA 138 (278) T ss_pred eecCCCcCcccccccceeeEeeehhhccccccHHHHhhcc--ccHHHHHHHHHHHHHH-HHHHHHHHHHHhccccccccc Confidence 1111222233344444444444443433444565555554 7789999999988887 5666544433 2211100000 Q ss_pred hhhhhhhhhhhhhhhhccCCCCCHHHHHHhhceecccccceEEEEecchhHHHHh----hhhhccccccceeecCCccee Q lcl|Aclame:pro 254 KEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKAEDRKALLD----ELRQATANANVRIKNDDTEIA 329 (400) Q Consensus 254 ~e~D~~~ik~it~~at~~~~T~~~dal~Eald~~~~~~~~~~l~~~~~d~~a~~~----~~~~~~~~an~~lk~~d~~~~ 329 (400) ...|- +.. ..-.|.|+ +.+|+.+.... .++|++|+.....|+. +..+++...|-.+. +|.+- T Consensus 139 ~t~~~---------~~~-~~~~~~da-~~~l~~~~~~~-~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~--~G~ig 204 (278) T protein:vir:80 139 INIGL---------IDK-IENTFTDA-PDAIEDESITT-TGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLV--KGAFG 204 (278) T ss_pred cccch---------hhh-HHHHHHHH-HHhhcccCCCc-ccEEEECHHHHHHHHhhhhhhcccccccccccee--eccce Confidence 00000 000 00123333 33565554432 4579999999888864 22223332222221 12222 Q ss_pred ehhhccccceecchhhhchhhhhcccceechhhceeccceeeeecCceEEEEeccccc-------------ceeecccee Q lcl|Aclame:pro 330 SEVGVDEIIVYTGSKALKPTVLVDQKYHIDMQDLTKVDAFEWKTNSNMILVETLTSGH-------------VETYNAGAV 396 (400) Q Consensus 330 ~~v~v~~~~~~tg~k~l~~t~~vd~k~~~~~~~~~~~~~~~~~~~~~~ilve~~~~~~-------------~~~~~~~~~ 396 (400) .-- |+.-++- .-++. .-.-+...|++++-..+ -+-||+....- +-.+|..++ T Consensus 205 ~~~---------G~~Vi~s-~~~p~----~t~~l~~~gAi~~~~~~-~~~vE~~Rd~~~~~d~i~~~~~yg~~v~~~~~~ 269 (278) T protein:vir:80 205 ELL---------GWEIVRT-KKLAD----GNALAVKAGALKTFLKR-NLLAESGRDMDHKLTKFNADQHYAVALVDETKA 269 (278) T ss_pred eec---------ceeEEEc-CCCCc----ceEEEEeccceeeeecC-CcccccccchhhccceeeeeeEEEEEEEcCcce Confidence 222 3221111 11111 11224456677754333 35567653221 123445555 Q ss_pred EeeC Q lcl|Aclame:pro 397 ITVS 400 (400) Q Consensus 397 ~~~~ 400 (400) +.++ T Consensus 270 v~it 273 (278) T protein:vir:80 270 VKVV 273 (278) T ss_pred EEEe Confidence 4444 No 129 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=21.87 E-value=2.5 Score=18.37 Aligned_cols=234 Identities=17% Similarity=0.164 Sum_probs=79.8 Q ss_pred cee-eecccceeeEEeeccccccceecccchh---------hhhhhhhhhhhccHHHHHHHHHHHHHHHhhcCchhHHHH Q lcl|Aclame:pro 152 VFH-VTNVGALLVSRSFDSANEAQVHKDGQTK---------TEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYN 221 (400) Q Consensus 152 ~fh-V~n~~~~a~~i~l~na~~a~GHk~ga~K---------k~q~~~le~~ti~p~~VYkkq~Lad~~k~l~g~ygalvn 221 (400) -.| +.+-..+ .+--=......+|+.|+.= .+..++.-..-.....||..-+.--| -++.+.|+.--- T Consensus 1 ~vr~i~~g~s~--~~~~iG~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~~qa~-~Dlr~e~s~~~G 77 (324) T protein:vir:99 1 MTRTITSGKSA--QFPVMGRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIEDAMNH-YDVRSEYSTQMG 77 (324) T ss_pred CeeeeecCceE--EEeeeeeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhhhhhhhHHHHhcC-ccchhHHHHHHH Confidence 112 3322222 1111113466778877732 12223333444445555555333101 124444555555 Q ss_pred HHHHHHHHHHHHHHHhcceeeccCCCccccchhhhhhhhhhhhhhhhhccCC--CCCHHHHH---HhhceecccccceEE Q lcl|Aclame:pro 222 LIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGK--TPFADAIE---EAVDFVRPTAGRRYL 296 (400) Q Consensus 222 yvm~ELaq~fI~Rav~rAvv~gDG~~~t~~~~~e~D~~~ik~it~~at~~~~--T~~~dal~---Eald~~~~~~~~~~l 296 (400) |.|.+-.+.+|-+ .++.+.....+.........+...+-.++..+...+. ....|+|+ +.||-..+....||+ T Consensus 78 ~aLA~~~Dq~i~~--~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~~gR~~ 155 (324) T protein:vir:99 78 EALAMAADVANYA--EMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIPAGDRTF 155 (324) T ss_pred HHHHHHHHHHHHH--HHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEE Confidence 6666665555422 1111111111111111111111111111111211111 11234444 566666655567999 Q ss_pred EEecchhHHHHhhhhhccccccceeecCCcceeehhhccccceecchhh-----------------hchhhhhc----cc Q lcl|Aclame:pro 297 IVKAEDRKALLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKA-----------------LKPTVLVD----QK 355 (400) Q Consensus 297 ~~~~~d~~a~~~~~~~~~~~an~~lk~~d~~~~~~v~v~~~~~~tg~k~-----------------l~~t~~vd----~k 355 (400) +|.++-.-+|++..+-...+.+.-----+|.|..--|+ .||. |+. ..+++.-| .| T Consensus 156 vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf---~V~~-Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~k 231 (324) T protein:vir:99 156 YTDPDTYSAILAALMPNAANYAALIDPETGNIRNVMGF---EVVE-TPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGK 231 (324) T ss_pred EeChHHHHHHhhcccccccccccccceecceEEEEece---EEEe-cCCccccccccccccccccccccccccccccccc Confidence 99999998888642211111110000012333222221 1111 011 11112212 34 Q ss_pred ceechhhceeccceeeeecCceE----------------------EEEecccccceeeccceeEeeC Q lcl|Aclame:pro 356 YHIDMQDLTKVDAFEWKTNSNMI----------------------LVETLTSGHVETYNAGAVITVS 400 (400) Q Consensus 356 ~~~~~~~~~~~~~~~~~~~~~~i----------------------lve~~~~~~~~~~~~~~~~~~~ 400 (400) |..|-+..+ +.-|..... ++=.+--||.- .+.-++-.|- T Consensus 232 y~~d~~~~~-----gl~~~~~a~~tv~~~~~~~e~~~~~~~~~d~i~~~~a~G~~~-lRPe~a~~v~ 292 (324) T protein:vir:99 232 MTVGADNVV-----GLFVHRSAVATLKLKDMALERARRPEYQADQIIAKYAMGHGG-LRPEAVGAII 292 (324) T ss_pred cccccCcee-----EEEEehhheEEEeeecceecceechhhHHHhhhhhhhhcCcc-cccceEEEEE Confidence 544432221 111111111 11111111111 1111111122 Done!