Query lcl|Aclame:protein:vir:9704|NCBI_annot:hypothetical protein|genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Match_columns 394 No_of_seqs 124 out of 972 Neff 10.7 Searched_HMMs 1612 Date Sat Nov 30 07:59:41 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_38 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_38_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:9704 Length: 394 # 100.0 5.2E-84 3.2E-87 477.2 42.5 394 1-394 1-394 (394) 2 protein:vir:3870 Length: 400 # 100.0 1.2E-74 7.1E-78 425.9 42.1 390 1-391 2-400 (400) 3 protein:vir:4700 Length: 415 # 100.0 3.9E-70 2.4E-73 401.1 40.1 386 3-394 1-408 (415) 4 protein:vir:4600 Length: 415 # 100.0 3.9E-70 2.4E-73 401.1 40.1 386 3-394 1-408 (415) 5 protein:vir:962 Length: 397 # 100.0 2.9E-70 1.8E-73 401.8 38.5 380 1-390 7-397 (397) 6 protein:vir:79987 Length: 415 100.0 1.1E-69 7.1E-73 398.5 40.1 386 3-394 1-408 (415) 7 protein:vir:98339 Length: 415 100.0 1.1E-69 7.1E-73 398.5 40.1 386 3-394 1-408 (415) 8 protein:vir:81100 Length: 415 100.0 1.1E-69 7.1E-73 398.5 40.1 386 3-394 1-408 (415) 9 protein:vir:9410 Length: 415 # 100.0 3.4E-69 2.1E-72 396.0 40.5 386 3-394 1-408 (415) 10 protein:vir:100884 Length: 389 100.0 2.6E-69 1.6E-72 396.6 37.3 366 3-394 1-386 (389) 11 protein:vir:100172 Length: 394 100.0 7.5E-69 4.6E-72 394.1 37.9 369 3-394 1-388 (394) 12 protein:vir:1025 Length: 408 # 100.0 2.6E-66 1.6E-69 380.1 38.4 374 1-394 3-397 (408) 13 protein:vir:4953 Length: 397 # 100.0 2E-66 1.2E-69 380.8 36.9 373 3-394 1-389 (397) 14 protein:vir:1084 Length: 437 # 100.0 6.5E-66 4E-69 378.0 38.4 390 4-394 1-431 (437) 15 protein:vir:1268 Length: 397 # 100.0 3.5E-66 2.2E-69 379.4 37.0 370 1-390 4-397 (397) 16 protein:vir:102873 Length: 392 100.0 5.2E-66 3.2E-69 378.5 36.2 372 2-394 1-391 (392) 17 protein:vir:102082 Length: 392 100.0 5.2E-66 3.2E-69 378.5 36.2 372 2-394 1-391 (392) 18 protein:vir:105004 Length: 392 100.0 5.2E-66 3.2E-69 378.5 36.2 372 2-394 1-391 (392) 19 protein:vir:107593 Length: 392 100.0 5.2E-66 3.2E-69 378.5 36.2 372 2-394 1-391 (392) 20 protein:vir:81160 Length: 371 100.0 7E-66 4.3E-69 377.8 35.8 353 1-390 1-371 (371) 21 protein:vir:485 Length: 407 # 100.0 4.7E-65 2.9E-68 373.2 37.9 370 3-394 1-404 (407) 22 protein:vir:7409 Length: 408 # 100.0 9.7E-65 6E-68 371.5 38.8 378 1-394 2-397 (408) 23 protein:vir:3991 Length: 404 # 100.0 1.2E-64 7.3E-68 371.1 39.1 375 1-394 2-397 (404) 24 protein:vir:4830 Length: 397 # 100.0 1.7E-64 1E-67 370.2 38.4 373 3-394 1-389 (397) 25 protein:vir:4456 Length: 401 # 100.0 2.7E-64 1.7E-67 369.1 38.7 367 1-390 1-401 (401) 26 protein:vir:1383 Length: 421 # 100.0 1.3E-64 8.1E-68 370.8 36.5 375 1-394 2-398 (421) 27 protein:vir:4997 Length: 397 # 100.0 2.4E-64 1.5E-67 369.4 37.7 369 3-394 1-389 (397) 28 protein:vir:3845 Length: 395 # 100.0 8.6E-64 5.4E-67 366.3 37.6 373 1-394 1-387 (395) 29 protein:vir:102119 Length: 404 100.0 6.4E-62 4E-65 356.1 39.0 376 1-394 1-404 (404) 30 protein:vir:6242 Length: 390 # 100.0 4.9E-62 3E-65 356.7 36.5 368 1-391 1-390 (390) 31 protein:vir:100247 Length: 425 100.0 5.6E-62 3.5E-65 356.4 36.7 363 1-391 20-425 (425) 32 protein:vir:4511 Length: 409 # 100.0 1.6E-61 1E-64 353.9 37.3 377 4-393 1-409 (409) 33 protein:vir:1328 Length: 392 # 100.0 1.3E-61 8.1E-65 354.4 36.7 367 1-391 1-392 (392) 34 protein:vir:10364 Length: 390 100.0 1.5E-60 9.5E-64 348.5 36.6 366 2-388 1-390 (390) 35 protein:vir:95376 Length: 425 100.0 8.2E-60 5.1E-63 344.5 37.2 380 1-394 7-425 (425) 36 protein:vir:100135 Length: 418 100.0 1.4E-59 8.8E-63 343.2 37.8 377 1-393 19-418 (418) 37 protein:vir:81070 Length: 390 100.0 1E-59 6.2E-63 344.0 35.9 366 2-388 1-390 (390) 38 protein:vir:191 Length: 385 # 100.0 7.7E-60 4.7E-63 344.7 34.9 362 3-391 1-385 (385) 39 protein:vir:1886 Length: 385 # 100.0 7.7E-60 4.7E-63 344.7 34.9 362 3-391 1-385 (385) 40 protein:vir:97053 Length: 390 100.0 2.4E-59 1.5E-62 341.9 36.9 366 2-388 1-390 (390) 41 protein:vir:2685 Length: 387 # 100.0 8.6E-60 5.3E-63 344.4 34.3 368 3-394 1-385 (387) 42 protein:vir:96978 Length: 387 100.0 8.6E-60 5.3E-63 344.4 34.3 368 3-394 1-385 (387) 43 protein:vir:94424 Length: 387 100.0 8.6E-60 5.3E-63 344.4 34.3 368 3-394 1-385 (387) 44 protein:vir:7855 Length: 497 # 100.0 1.8E-59 1.1E-62 342.6 35.6 390 1-394 6-497 (497) 45 protein:vir:101650 Length: 497 100.0 1.8E-59 1.1E-62 342.6 35.6 390 1-394 6-497 (497) 46 protein:vir:4339 Length: 395 # 100.0 1E-58 6.5E-62 338.5 36.7 368 1-390 3-395 (395) 47 protein:vir:6212 Length: 434 # 100.0 1.9E-58 1.2E-61 337.1 37.1 383 1-394 1-434 (434) 48 protein:vir:81227 Length: 413 100.0 2.7E-58 1.7E-61 336.2 37.9 380 1-393 1-413 (413) 49 protein:vir:9361 Length: 402 # 100.0 7.4E-59 4.6E-62 339.3 33.9 370 1-394 14-400 (402) 50 protein:vir:93881 Length: 387 100.0 1.3E-58 8.3E-62 337.9 35.0 367 3-394 1-385 (387) 51 protein:vir:101607 Length: 379 100.0 4.6E-58 2.8E-61 334.9 37.5 360 1-390 1-379 (379) 52 protein:vir:105038 Length: 428 100.0 8.1E-58 5E-61 333.6 36.6 377 3-390 1-428 (428) 53 protein:vir:1433 Length: 435 # 100.0 2.7E-57 1.7E-60 330.7 35.2 379 4-392 1-435 (435) 54 protein:vir:104256 Length: 458 100.0 4.9E-57 3E-60 329.3 36.0 383 1-390 3-458 (458) 55 protein:vir:80376 Length: 435 100.0 2E-56 1.3E-59 325.9 36.1 379 4-392 1-435 (435) 56 protein:vir:94673 Length: 419 100.0 7.6E-56 4.7E-59 322.8 38.4 383 1-392 2-419 (419) 57 protein:vir:8102 Length: 543 # 100.0 2.8E-55 1.7E-58 319.6 36.8 365 1-391 140-543 (543) 58 protein:vir:8420 Length: 477 # 100.0 5.3E-56 3.3E-59 323.6 32.2 390 2-394 1-475 (477) 59 protein:vir:4856 Length: 293 # 100.0 5E-57 3.1E-60 329.3 25.0 271 124-394 1-285 (293) 60 protein:vir:78640 Length: 352 100.0 4.7E-55 2.9E-58 318.4 30.6 335 3-394 1-350 (352) 61 protein:vir:98635 Length: 377 100.0 1.8E-54 1.1E-57 315.2 28.5 336 1-390 3-377 (377) 62 protein:vir:80128 Length: 466 100.0 2.2E-52 1.3E-55 303.8 35.5 381 1-394 7-451 (466) 63 protein:vir:4092 Length: 390 # 100.0 3E-52 1.8E-55 303.1 32.4 338 3-394 1-374 (390) 64 protein:vir:93616 Length: 645 100.0 2.4E-51 1.5E-54 298.1 33.4 386 1-394 194-643 (645) 65 protein:vir:9643 Length: 377 # 100.0 1.2E-51 7.6E-55 299.7 30.9 328 1-390 3-377 (377) 66 protein:vir:95963 Length: 395 100.0 1E-50 6.3E-54 294.6 26.7 341 3-394 1-380 (395) 67 protein:vir:97148 Length: 324 100.0 2.1E-50 1.3E-53 293.0 25.8 286 82-394 1-319 (324) 68 protein:vir:7771 Length: 330 # 100.0 1.3E-50 7.8E-54 294.2 24.0 271 121-394 1-327 (330) 69 protein:vir:41 Length: 299 # N 100.0 1.4E-50 8.8E-54 293.9 24.0 263 124-391 1-299 (299) 70 protein:vir:78350 Length: 383 100.0 6.9E-50 4.3E-53 290.1 27.5 346 1-394 1-379 (383) 71 protein:vir:9574 Length: 300 # 100.0 2.4E-50 1.5E-53 292.6 24.2 260 129-390 1-300 (300) 72 protein:vir:1638 Length: 298 # 100.0 5.1E-50 3.2E-53 290.8 24.3 256 132-389 1-298 (298) 73 protein:vir:2344 Length: 397 # 100.0 8E-50 4.9E-53 289.8 23.9 273 118-394 1-310 (397) 74 protein:vir:8187 Length: 311 # 100.0 1.4E-49 8.5E-53 288.5 23.9 259 130-391 1-311 (311) 75 protein:vir:99749 Length: 324 100.0 2.9E-49 1.8E-52 286.7 25.4 286 82-394 1-319 (324) 76 protein:vir:4226 Length: 326 # 100.0 1.6E-49 9.6E-53 288.2 23.3 281 109-393 1-326 (326) 77 protein:vir:95763 Length: 297 100.0 1.9E-49 1.2E-52 287.7 23.8 264 120-391 1-297 (297) 78 protein:vir:103955 Length: 324 100.0 4.1E-49 2.5E-52 285.9 25.5 285 82-394 1-319 (324) 79 protein:vir:9759 Length: 303 # 100.0 2.4E-49 1.5E-52 287.2 24.1 259 130-390 1-303 (303) 80 protein:vir:100632 Length: 381 100.0 3.3E-48 2E-51 280.9 30.0 328 1-394 1-372 (381) 81 protein:vir:105905 Length: 304 100.0 2.3E-49 1.4E-52 287.2 23.5 262 120-389 1-304 (304) 82 protein:vir:94142 Length: 304 100.0 2.3E-49 1.4E-52 287.2 23.5 262 120-389 1-304 (304) 83 protein:vir:96392 Length: 324 100.0 7.1E-49 4.4E-52 284.6 26.2 286 82-394 1-322 (324) 84 protein:vir:78830 Length: 324 100.0 7.1E-49 4.4E-52 284.6 26.2 286 82-394 1-322 (324) 85 protein:vir:9309 Length: 324 # 100.0 8.7E-49 5.4E-52 284.1 26.6 286 82-394 1-322 (324) 86 protein:vir:9509 Length: 381 # 100.0 2.7E-48 1.7E-51 281.4 29.2 327 1-394 1-372 (381) 87 protein:vir:101291 Length: 381 100.0 2.7E-48 1.7E-51 281.4 29.2 327 1-394 1-372 (381) 88 protein:vir:94771 Length: 298 100.0 3.8E-49 2.4E-52 286.0 24.1 256 132-389 1-298 (298) 89 protein:vir:80684 Length: 315 100.0 3.1E-49 1.9E-52 286.5 23.5 263 128-394 1-310 (315) 90 protein:vir:2430 Length: 318 # 100.0 5.2E-49 3.2E-52 285.3 23.5 277 115-394 1-317 (318) 91 protein:vir:78223 Length: 333 100.0 9.8E-49 6.1E-52 283.8 24.4 275 115-391 1-333 (333) 92 protein:vir:104085 Length: 320 100.0 8.9E-49 5.5E-52 284.0 23.8 276 115-393 1-320 (320) 93 protein:vir:5739 Length: 366 # 100.0 2.3E-48 1.4E-51 281.8 24.9 319 60-390 1-366 (366) 94 protein:vir:2504 Length: 305 # 100.0 1.6E-48 1E-51 282.6 23.4 258 128-394 1-304 (305) 95 protein:vir:78523 Length: 338 100.0 2.7E-48 1.7E-51 281.4 24.6 279 112-393 1-338 (338) 96 protein:vir:96223 Length: 324 100.0 5.2E-48 3.2E-51 279.8 25.7 284 82-394 1-322 (324) 97 protein:vir:99920 Length: 311 100.0 1E-47 6.2E-51 278.3 22.9 259 129-390 1-311 (311) 98 protein:vir:96762 Length: 632 100.0 6.3E-46 3.9E-49 268.4 31.3 377 1-389 185-632 (632) 99 protein:vir:97397 Length: 517 100.0 2.1E-39 1.3E-42 232.7 30.0 364 1-393 125-517 (517) 100 protein:vir:4159 Length: 315 # 100.0 2.3E-39 1.4E-42 232.4 20.6 275 108-387 1-315 (315) 101 protein:vir:4197 Length: 314 # 100.0 8.7E-38 5.4E-41 223.8 21.6 276 115-393 1-314 (314) 102 protein:vir:4074 Length: 480 # 100.0 1.4E-34 8.8E-38 206.2 24.3 348 1-393 111-480 (480) 103 protein:vir:3158 Length: 321 # 100.0 8.9E-33 5.5E-36 196.3 22.8 278 112-394 1-316 (321) 104 protein:vir:3033 Length: 272 # 99.9 2E-28 1.2E-31 172.4 22.1 261 128-393 1-272 (272) 105 protein:vir:9820 Length: 272 # 99.9 2E-28 1.2E-31 172.4 22.1 261 128-393 1-272 (272) 106 protein:vir:3613 Length: 272 # 99.8 4.3E-21 2.7E-24 132.2 20.2 259 128-390 1-272 (272) 107 protein:vir:93742 Length: 274 99.8 5.2E-20 3.2E-23 126.3 21.1 260 128-394 1-274 (274) 108 protein:vir:96833 Length: 275 99.7 1.4E-18 8.7E-22 118.5 20.6 263 127-394 1-275 (275) 109 protein:vir:105334 Length: 276 99.7 3.3E-18 2.1E-21 116.4 20.8 260 128-394 1-274 (276) 110 protein:vir:96123 Length: 274 99.7 3.4E-18 2.1E-21 116.3 20.2 260 128-394 1-274 (274) 111 protein:vir:95107 Length: 270 99.7 4.5E-18 2.8E-21 115.7 20.3 260 130-394 1-269 (270) 112 protein:vir:97433 Length: 274 99.7 1.1E-17 6.9E-21 113.5 21.2 260 128-394 1-274 (274) 113 protein:vir:94494 Length: 274 99.7 1.1E-17 6.9E-21 113.5 21.2 260 128-394 1-274 (274) 114 protein:vir:80930 Length: 278 99.7 1.2E-17 7.3E-21 113.4 20.7 259 128-391 1-278 (278) 115 protein:vir:93858 Length: 400 99.6 1.1E-15 6.8E-19 102.6 26.1 363 1-388 8-400 (400) 116 protein:vir:1239 Length: 274 # 99.6 1.2E-16 7.3E-20 107.9 20.1 259 128-394 1-274 (274) 117 protein:vir:79928 Length: 393 99.6 2.9E-16 1.8E-19 105.8 19.2 334 24-394 1-386 (393) 118 protein:vir:95898 Length: 274 99.6 5.3E-16 3.3E-19 104.3 20.4 260 128-394 1-274 (274) 119 protein:vir:96262 Length: 274 99.6 5.3E-16 3.3E-19 104.3 20.4 260 128-394 1-274 (274) 120 protein:vir:94933 Length: 330 99.6 3.9E-16 2.4E-19 105.0 18.9 287 94-391 1-330 (330) 121 protein:vir:8324 Length: 410 # 99.5 1.1E-14 6.8E-18 97.1 18.6 360 1-388 11-410 (410) 122 protein:vir:739 Length: 231 # 99.4 1.1E-14 7E-18 97.0 14.7 222 159-390 1-231 (231) 123 protein:vir:99424 Length: 360 99.3 4E-13 2.5E-16 88.5 19.7 282 97-393 1-360 (360) 124 protein:vir:97255 Length: 310 99.3 1.6E-12 1E-15 85.2 20.5 263 128-390 1-310 (310) 125 protein:vir:108211 Length: 318 99.2 2.6E-12 1.6E-15 84.1 15.7 264 125-391 1-318 (318) 126 protein:vir:7990 Length: 273 # 99.2 5.4E-12 3.3E-15 82.4 17.2 251 133-390 1-273 (273) 127 protein:vir:105822 Length: 273 99.1 1.2E-11 7.7E-15 80.4 17.4 251 133-390 1-273 (273) 128 protein:vir:102605 Length: 273 99.1 1.2E-11 7.7E-15 80.4 17.4 251 133-390 1-273 (273) 129 protein:vir:8885 Length: 347 # 99.1 2E-11 1.2E-14 79.3 15.5 272 118-391 1-347 (347) 130 protein:vir:94576 Length: 347 98.8 5.5E-10 3.4E-13 71.3 15.2 271 118-390 1-347 (347) 131 protein:vir:2201 Length: 345 # 98.8 8.7E-10 5.4E-13 70.3 15.4 273 112-390 1-345 (345) 132 protein:vir:10450 Length: 344 98.8 2.4E-10 1.5E-13 73.3 11.9 270 118-390 1-344 (344) 133 protein:vir:3136 Length: 322 # 98.7 7E-10 4.4E-13 70.8 13.5 264 127-394 1-322 (322) 134 protein:vir:94711 Length: 347 98.7 5.7E-10 3.6E-13 71.2 12.6 271 118-391 1-347 (347) 135 protein:vir:80213 Length: 334 98.7 8.4E-10 5.2E-13 70.4 13.4 272 118-392 1-334 (334) 136 protein:vir:100057 Length: 375 98.7 3.5E-09 2.2E-12 67.0 15.8 278 112-394 1-373 (375) 137 protein:vir:103323 Length: 364 98.7 7.2E-09 4.5E-12 65.2 17.3 271 118-394 1-343 (364) 138 protein:vir:78739 Length: 332 98.7 2.3E-09 1.4E-12 68.0 14.5 271 115-388 1-332 (332) 139 protein:vir:94622 Length: 341 98.7 1.7E-09 1.1E-12 68.6 13.6 270 118-392 1-341 (341) 140 protein:vir:5974 Length: 324 # 98.7 6.3E-09 3.9E-12 65.6 16.5 256 130-394 1-294 (324) 141 protein:vir:3364 Length: 347 # 98.6 4.2E-09 2.6E-12 66.5 14.6 272 118-392 1-347 (347) 142 protein:vir:102944 Length: 330 98.6 1.8E-08 1.1E-11 63.0 18.1 258 128-394 1-300 (330) 143 protein:vir:6324 Length: 335 # 98.6 1.1E-08 6.7E-12 64.3 16.5 273 118-394 1-332 (335) 144 protein:vir:1541 Length: 347 # 98.6 9.2E-09 5.7E-12 64.6 15.9 272 118-392 1-347 (347) 145 protein:vir:1583 Length: 351 # 98.6 2.3E-08 1.4E-11 62.4 17.0 255 128-394 1-298 (351) 146 protein:vir:78935 Length: 335 98.6 1.9E-08 1.2E-11 62.9 16.5 273 118-394 1-332 (335) 147 protein:vir:99675 Length: 324 98.5 1.2E-08 7.4E-12 64.0 14.6 230 161-394 1-300 (324) 148 protein:vir:80180 Length: 381 98.4 2.6E-08 1.6E-11 62.1 12.8 272 118-394 1-339 (381) 149 protein:vir:95318 Length: 328 98.4 1.8E-08 1.1E-11 63.0 11.8 212 121-336 1-328 (328) 150 protein:vir:105645 Length: 400 98.2 8.8E-08 5.5E-11 59.3 12.6 271 118-394 1-337 (400) 151 protein:vir:103759 Length: 330 98.1 1.4E-07 8.5E-11 58.2 12.3 213 118-336 1-330 (330) 152 protein:vir:107826 Length: 331 98.0 2.7E-07 1.7E-10 56.6 11.9 214 121-336 1-331 (331) 153 protein:vir:98525 Length: 331 98.0 2.7E-07 1.7E-10 56.6 11.9 214 121-336 1-331 (331) 154 protein:vir:107388 Length: 331 98.0 2.7E-07 1.7E-10 56.6 11.9 214 121-336 1-331 (331) 155 protein:vir:97031 Length: 402 97.9 4.2E-07 2.6E-10 55.5 11.5 271 118-394 1-337 (402) 156 protein:vir:7324 Length: 335 # 97.9 6.2E-07 3.8E-10 54.6 11.9 214 118-338 1-335 (335) 157 protein:vir:102655 Length: 322 97.9 2.4E-06 1.5E-09 51.4 15.1 271 116-391 1-322 (322) 158 protein:vir:103285 Length: 296 97.7 3.3E-06 2E-09 50.7 12.8 261 129-391 1-296 (296) 159 protein:vir:7019 Length: 401 # 97.7 2.6E-06 1.6E-09 51.2 12.2 271 118-394 1-344 (401) 160 protein:vir:8843 Length: 317 # 97.7 6.5E-06 4E-09 49.0 14.2 266 124-392 1-317 (317) 161 protein:vir:106647 Length: 303 97.6 5.5E-06 3.4E-09 49.4 13.1 257 126-394 1-303 (303) 162 protein:vir:9927 Length: 295 # 97.6 2.8E-06 1.7E-09 51.1 11.3 254 127-394 1-292 (295) 163 protein:vir:107687 Length: 319 97.5 1.6E-05 1E-08 46.8 14.8 278 80-388 1-319 (319) 164 protein:vir:95131 Length: 325 97.5 6.1E-05 3.8E-08 43.7 19.0 259 129-394 1-297 (325) 165 protein:vir:94800 Length: 319 97.4 8.1E-05 5E-08 43.0 18.5 280 76-394 1-301 (319) 166 protein:vir:97331 Length: 319 97.4 8.1E-05 5E-08 43.0 18.5 280 76-394 1-301 (319) 167 protein:vir:9875 Length: 296 # 97.3 2.5E-05 1.6E-08 45.8 13.6 260 120-391 1-296 (296) 168 protein:vir:96792 Length: 315 97.3 9.3E-05 5.8E-08 42.7 17.3 256 127-394 1-284 (315) 169 protein:vir:80068 Length: 301 97.3 3.8E-05 2.3E-08 44.8 14.3 254 130-388 1-301 (301) 170 protein:vir:107120 Length: 329 97.2 0.00014 8.4E-08 41.8 17.8 290 69-394 1-309 (329) 171 protein:vir:104342 Length: 314 97.1 6.4E-05 4E-08 43.6 13.2 280 110-391 1-314 (314) 172 protein:vir:93966 Length: 400 97.0 0.00021 1.3E-07 40.7 20.1 354 1-388 8-400 (400) 173 protein:vir:95451 Length: 313 96.8 0.00019 1.2E-07 41.0 13.8 262 129-392 1-313 (313) 174 protein:vir:79642 Length: 329 96.7 0.00016 1E-07 41.3 13.0 286 95-391 1-329 (329) 175 protein:vir:95875 Length: 401 96.7 8.8E-05 5.5E-08 42.8 11.1 270 117-394 1-401 (401) 176 protein:vir:99075 Length: 392 96.6 0.00042 2.6E-07 39.1 14.5 255 133-394 1-311 (392) 177 protein:vir:1663 Length: 393 # 96.5 0.00056 3.5E-07 38.4 18.1 354 1-388 1-393 (393) 178 protein:vir:79548 Length: 652 96.5 0.00057 3.6E-07 38.4 21.5 377 1-387 195-652 (652) 179 protein:vir:1383 Length: 421 # 95.9 0.0012 7.5E-07 36.6 22.1 360 1-394 9-404 (421) 180 protein:vir:80128 Length: 466 95.8 0.0014 8.6E-07 36.2 18.5 367 1-394 1-443 (466) 181 protein:vir:80446 Length: 367 95.5 0.0019 1.2E-06 35.5 18.6 257 126-394 1-339 (367) 182 protein:vir:78387 Length: 349 95.3 0.0023 1.4E-06 35.1 19.5 252 130-394 1-319 (349) 183 protein:vir:1781 Length: 221 # 95.0 0.0025 1.5E-06 34.9 12.2 170 211-394 1-208 (221) 184 protein:vir:98856 Length: 343 94.5 0.0042 2.6E-06 33.6 16.3 274 83-394 1-337 (343) 185 protein:vir:174 Length: 423 # 93.6 0.007 4.3E-06 32.4 16.6 260 133-394 1-320 (423) 186 protein:vir:108303 Length: 418 93.4 0.0075 4.7E-06 32.2 17.2 246 131-394 1-324 (418) 187 protein:vir:270 Length: 341 # 93.4 0.0076 4.7E-06 32.2 12.8 276 80-394 1-336 (341) 188 protein:vir:1153 Length: 338 # 93.4 0.0078 4.8E-06 32.1 16.2 278 83-392 1-338 (338) 189 protein:vir:100331 Length: 342 92.5 0.011 7E-06 31.3 16.3 279 83-394 1-342 (342) 190 protein:vir:79157 Length: 339 92.2 0.012 7.7E-06 31.0 14.9 276 83-391 1-339 (339) 191 protein:vir:2016 Length: 357 # 92.2 0.012 7.7E-06 31.0 15.9 280 83-394 1-349 (357) 192 protein:vir:94989 Length: 349 92.2 0.012 7.7E-06 31.0 20.5 252 130-394 1-319 (349) 193 protein:vir:1829 Length: 355 # 92.1 0.013 7.9E-06 31.0 16.4 279 83-394 1-352 (355) 194 protein:vir:78777 Length: 358 92.0 0.013 8.2E-06 30.9 15.7 278 80-394 1-347 (358) 195 protein:vir:104011 Length: 337 92.0 0.013 8.4E-06 30.8 17.7 278 83-393 1-337 (337) 196 protein:vir:3525 Length: 423 # 91.9 0.014 8.4E-06 30.8 17.4 253 133-394 1-320 (423) 197 protein:vir:105374 Length: 423 91.5 0.016 9.6E-06 30.5 17.3 257 133-394 1-320 (423) 198 protein:vir:79171 Length: 337 91.4 0.016 1E-05 30.4 17.6 278 83-393 1-337 (337) 199 protein:vir:99311 Length: 463 90.2 0.0096 6E-06 31.6 8.3 269 80-394 1-299 (463) 200 protein:vir:95603 Length: 463 90.2 0.0096 6E-06 31.6 8.3 269 80-394 1-299 (463) 201 protein:vir:5694 Length: 357 # 89.2 0.028 1.7E-05 29.1 15.7 280 83-394 1-349 (357) 202 protein:vir:5255 Length: 304 # 89.1 0.016 1E-05 30.4 8.7 251 133-387 1-304 (304) 203 protein:vir:95512 Length: 693 88.5 0.032 2E-05 28.8 24.2 379 1-388 227-693 (693) 204 protein:vir:6061 Length: 357 # 88.1 0.034 2.1E-05 28.6 17.6 280 83-394 1-349 (357) 205 protein:vir:98566 Length: 355 88.0 0.035 2.2E-05 28.5 18.9 279 83-394 1-352 (355) 206 protein:vir:78186 Length: 337 86.4 0.046 2.8E-05 27.9 16.6 278 83-393 1-337 (337) 207 protein:vir:105522 Length: 423 86.0 0.049 3E-05 27.8 17.6 255 132-394 1-320 (423) 208 protein:vir:100172 Length: 394 77.0 0.13 8E-05 25.5 16.3 332 1-375 3-394 (394) 209 protein:vir:80835 Length: 464 70.2 0.11 7E-05 25.8 6.2 281 80-394 1-296 (464) 210 protein:vir:79008 Length: 299 60.9 0.36 0.00023 23.0 20.0 255 133-392 1-299 (299) 211 protein:vir:3870 Length: 400 # 60.6 0.37 0.00023 23.0 21.1 349 1-381 9-400 (400) 212 protein:vir:861 Length: 318 # 60.2 0.38 0.00023 22.9 14.5 282 91-388 1-318 (318) 213 protein:vir:3783 Length: 336 # 56.2 0.46 0.00029 22.4 16.8 274 86-394 1-334 (336) 214 protein:vir:96666 Length: 462 53.1 0.54 0.00033 22.1 11.8 280 80-394 1-354 (462) 215 protein:vir:3746 Length: 336 # 52.2 0.56 0.00035 22.0 16.7 274 86-394 1-334 (336) 216 protein:vir:9704 Length: 394 # 50.2 0.62 0.00038 21.7 19.1 336 1-382 8-394 (394) 217 protein:vir:100884 Length: 389 44.3 0.81 0.0005 21.1 19.5 341 1-393 3-389 (389) 218 protein:vir:101557 Length: 336 40.4 0.97 0.0006 20.7 8.4 284 82-388 1-336 (336) 219 protein:vir:94070 Length: 339 38.5 1.1 0.00066 20.4 9.1 290 79-388 1-339 (339) 220 protein:vir:8846 Length: 705 # 38.2 1.1 0.00067 20.4 8.9 121 1-128 581-705 (705) 221 protein:vir:3643 Length: 336 # 37.1 1.1 0.0007 20.3 7.5 284 82-388 1-336 (336) 222 protein:vir:78558 Length: 336 36.9 1.1 0.00071 20.3 8.3 283 82-388 1-336 (336) 223 protein:vir:2736 Length: 348 # 34.8 1.3 0.00079 20.0 17.9 260 132-391 1-348 (348) 224 protein:vir:108295 Length: 711 34.5 1.3 0.0008 20.0 9.4 98 1-105 598-711 (711) 225 protein:vir:3845 Length: 395 # 30.8 1.5 0.00096 19.5 22.4 346 1-394 5-392 (395) 226 protein:vir:81070 Length: 390 29.5 1.7 0.001 19.4 18.5 349 6-394 1-382 (390) 227 protein:vir:63741 Length: 468 27.9 1.4 0.00084 19.8 4.7 286 73-394 1-331 (468) 228 protein:vir:1084 Length: 437 # 27.3 1.9 0.0011 19.1 23.2 370 1-392 5-437 (437) 229 protein:vir:962 Length: 397 # 26.2 2 0.0012 19.0 22.6 332 1-381 14-397 (397) 230 protein:vir:1886 Length: 385 # 25.6 2 0.0013 18.9 15.5 348 10-394 1-376 (385) 231 protein:vir:191 Length: 385 # 25.6 2 0.0013 18.9 15.5 348 10-394 1-376 (385) 232 protein:vir:80491 Length: 467 25.3 1.8 0.0011 19.2 4.8 284 72-394 1-330 (467) 233 protein:vir:10364 Length: 390 25.1 2.1 0.0013 18.8 19.3 344 6-394 1-382 (390) 234 protein:vir:94870 Length: 318 22.3 2.5 0.0015 18.4 14.0 286 88-388 1-318 (318) 235 protein:vir:8420 Length: 477 # 21.9 2.5 0.0016 18.4 22.1 374 1-388 7-477 (477) 236 protein:vir:102873 Length: 392 21.5 2.6 0.0016 18.3 17.1 334 1-381 4-392 (392) 237 protein:vir:102082 Length: 392 21.5 2.6 0.0016 18.3 17.1 334 1-381 4-392 (392) 238 protein:vir:105004 Length: 392 21.5 2.6 0.0016 18.3 17.1 334 1-381 4-392 (392) 239 protein:vir:107593 Length: 392 21.5 2.6 0.0016 18.3 17.1 334 1-381 4-392 (392) 240 protein:vir:106734 Length: 336 21.4 2.6 0.0016 18.3 7.5 283 82-388 1-336 (336) 241 protein:vir:97053 Length: 390 21.2 2.6 0.0016 18.3 18.8 350 6-394 1-384 (390) 242 protein:vir:102823 Length: 470 21.1 2.7 0.0016 18.3 6.1 255 83-394 1-331 (470) No 1 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=100.00 E-value=5.2e-84 Score=477.18 Aligned_cols=394 Identities=100% Similarity=1.362 Sum_probs=360.1 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEV 80 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) |++++|+||++++++++++++++.++++..+++++.+++++++++++++++++++++++++..+...+............ T Consensus 1 M~~~~l~el~~~l~e~~~~i~~~~~e~~~~~~~~~~~~~~~l~~eie~l~~ei~~l~~~~~~~e~~~e~~~~~~~~~~~~ 80 (394) T protein:vir:97 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEV 80 (394) T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999988888777666665555666 Q ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhh Q lcl|Aclame:pro 81 TQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKP 160 (394) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~ 160 (394) ..+.......+..+.+.................................+.+..+|++++|+++++.|++.+++.++|++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~ 160 (394) T protein:vir:97 81 TQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKP 160 (394) T ss_pred chhhHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHhhhhhhhhccccccccccccChHHHHHHHHHHhhhhhhhhh Confidence 66666666677777777776666666666666666666666666666777888889999999999999999999999999 Q ss_pred eeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHH Q lcl|Aclame:pro 161 FTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQI 240 (394) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~ 240 (394) +|++++++++++.+|+.+..++.++|++|++..++.++++|+.|++++++++++++||+||++|+.++|++||.+.|+++ T Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~i~is~ell~ds~~~~~~~i~~~la~~ 240 (394) T protein:vir:97 161 FTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQI 240 (394) T ss_pred hceeeeccCcceEEEEEecCCCccceecccccccccccccceeEEeehhheeeehhhHHHHHhhhhHHHHHHHHHHHHHH Confidence 99999999999999999877788888889988887788999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhccccccccccccHHHHHHHHHhhhhhhcccEEEEcHHHHHHHHhhhccCCceeecccccCCCccccccc Q lcl|Aclame:pro 241 KVNTTNDAIAKVLKSFTTKTVKNLDEIKALLNGGFDPAYNVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGK 320 (394) Q Consensus 241 ~~~~~~~a~~~g~~~~~~~~~~~~~~i~~~~~~~~~~~~~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~ 320 (394) ++++++.++++|++++++.+..++++++++++..++++++++|+|||++|..|++|+|++|+|||+|++.++.+++|+|+ T Consensus 241 ~~~~~~~~i~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~ 320 (394) T protein:vir:97 241 KVNTTNDAIAKVLKSFTTKTVKNLDEIKALLNGGFDPAYNVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGK 320 (394) T ss_pred HHHHHHHHHhhccccccccccccHHHHHHHHHhhhhhhhCCEEEEcHHHHHHHHHhhccCCCeeeecCcCCCCCceeccc Confidence 99999999999999999999999999999999888899999999999999999999999999999999999999999999 Q ss_pred ceEEecCcccccCceEEEeccccEEEEeecceEEEEeecccccceEEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 321 PVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEIYGQYLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 321 pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) ||+++++.+.++++++||||+++|++++|++++++++++.++.+.+|+++|+|++|.+|+||++++++++|+|+ T Consensus 321 pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~p~ 394 (394) T protein:vir:97 321 PVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEIYGQYLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) T ss_pred eeEEecccccCCccEEEeeccccEEEEEecceEEEEecccccceeEEEEEEEccEEecccceEEEEecccccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=100.00 E-value=1.2e-74 Score=425.94 Aligned_cols=390 Identities=45% Similarity=0.752 Sum_probs=316.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhh----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDD----LEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIG 76 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~----~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~ 76 (394) ||+++|+++++++.++++++.++.+++++++++.+ ..+.+.++++++++.+++++++++++..+...+........ T Consensus 2 ~l~e~i~e~~~~l~el~~~~~~~~~e~r~~~e~~~~~~~~~~~~e~~~~~~~l~~ei~~l~e~~~~~~~~~~~~~~~~~~ 81 (400) T protein:vir:38 2 TLDEKLAAVKKQLDEKRSALPAMKTELRSLLEGEDSEENLKKAEGVRAKYDKAGKEIKDLEEKRDLYEAALKGNEQSSGK 81 (400) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Confidence 99999999999999999999999999988665443 23456667888888888888888877766554433222211 Q ss_pred cccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHh Q lcl|Aclame:pro 77 GKEVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMP--INETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKT 154 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~ 154 (394) .............+....+..................... ............+.+...|+++||+++.+.|++.+++ T Consensus 82 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~ 160 (400) T protein:vir:38 82 -KPDHPEEHSYRDALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNAGVKAADAASTIPETISNTPQRELQT 160 (400) T ss_pred -cccchhhhhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhcccccCCcccccHHHHHHHHHHHHh Confidence 1111122222223333332222221111111111111111 1111111222234566778899999999999999999 Q ss_pred hhhhhheeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHH Q lcl|Aclame:pro 155 VVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVS 234 (394) Q Consensus 155 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~ 234 (394) .++|+++|++++++++++++|+.+..++.++||+|++..++.++++|++|++++++++++++||+||++||.++|++||. T Consensus 161 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~ 240 (400) T protein:vir:38 161 VVDLKPFTNVFQASTQKGTYPTVANATTKMVTVAELEKNPAMAKPEFKPVNWSVETYRQALPVSQESIDDSAIDLVGLIA 240 (400) T ss_pred hhhhhhcceeEeccCcceEEEEEecCCCccccccccccccccccccceeeEeehhheeeehhhHHHHHhhhHHHHHHHHH Confidence 99999999999999999999999888888999999999998889999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhhccccccccccccHHHHHHHHHhhhhhhcccEEEEcHHHHHHHHhhhccCCceeecccccCCCc Q lcl|Aclame:pro 235 ESISQIKVNTTNDAIAKVLKSFTTKTVKNLDEIKALLNGGFDPAYNVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSG 314 (394) Q Consensus 235 ~~l~~~~~~~~~~a~~~g~~~~~~~~~~~~~~i~~~~~~~~~~~~~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~ 314 (394) +.|+++++.+++.++++|++++++.+..+++++.+++....+++++++|+|||++|.+|++|+|++|+|||+|++.++.+ T Consensus 241 ~~l~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~ 320 (400) T protein:vir:38 241 QNGQQIKVNTTNGAVATLLKGFTAKTISSVDDLKHINNVDLDPAYSRVIIASQSFYNFLDTVKDGNGRYLLQDSILTPSG 320 (400) T ss_pred HHHHHHHHHHHHHhhhhccccccccccccHHHHHHHHHhhhhhhhCcEEEEcHHHHHHHHHhhccCCCeeeecCcCCCCc Confidence 99999999999999999999999999999999999999889999999999999999999999999999999999999999 Q ss_pred ccccccceEEecCccc---ccCceEEEeccccEEEEeecceEEEEeecccccceEEEEEEeccEEecccceEEEEecCcc Q lcl|Aclame:pro 315 KVLLGKPVFVLSDEVL---GANKAFIGDFKRGVLFADRKDLGLRWADNEIYGQYLQAVLRFGVSKVDDKAGYYVTFTPEP 391 (394) Q Consensus 315 ~~l~G~pV~~~~~~~~---~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~r~~~r~d~~v~~~~af~~l~~~~~~ 391 (394) ++|+|+||+++++++. ++..++||||+++|++++|++++++++++.+|.+.+|+++|+|++|.+|+||++|+++|+| T Consensus 321 ~~l~G~pv~~~~~~~~~~~g~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~a 400 (400) T protein:vir:38 321 KSVLGMPIAVVSDDTLGAAGEAHAFLGDIKRAILFANRADFMVRWVDDQIYGQFLQAGMRFGVSVADEKAGYFLTYTPKA 400 (400) T ss_pred cccccceeEEecccccCCCCceEEEEEeccccEEEEeecceEEEEecccccceeEEEEEEeccEEecccceEEEEeecCC Confidence 9999999999887654 3456899999999999999999999999999999999999999999999999999999999 No 3 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=100.00 E-value=3.9e-70 Score=401.08 Aligned_cols=386 Identities=27% Similarity=0.391 Sum_probs=306.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc Q lcl|Aclame:pro 3 EEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVTQ 82 (394) Q Consensus 3 ~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 82 (394) ++.++|+++++.++++++.+..++++..+++++.++++.+++++++|++++++++++++..++................. T Consensus 1 mk~~~em~~~l~el~~~~~~~~~e~~~~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:47 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchh Confidence 77999999999999999999999999999999999999999999999999999888877666544332222111111111 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-hhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhhe Q lcl|Aclame:pro 83 EEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETT-PVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPF 161 (394) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~ 161 (394) ................. .......+......... .........++..++.++|+++.+.|++.+++.++|+++ T Consensus 81 ~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~ 154 (415) T protein:vir:47 81 RTYRNQANINDLGISIQ------NTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKY 154 (415) T ss_pred hhhHHHHHHHHHHHhhh------hhhhhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhh Confidence 10000000000000000 00000001111111111 112222344566788899999999999999999999999 Q ss_pred eeeEeecCCceeEEEEecCCC-cccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHH Q lcl|Aclame:pro 162 TTVYQAKKASGKYPVLQRATT-KMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQI 240 (394) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~ 240 (394) |++++++++.+++|+.+..+. .+.|+.|++..++.+.++|++|++++++++++++||+|+++|+.++|++||.+.|+++ T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ 234 (415) T protein:vir:47 155 VTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMART 234 (415) T ss_pred cceeeccCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHH Confidence 999999999999999876554 4566677777777778899999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhccccccc----------------cccccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCce Q lcl|Aclame:pro 241 KVNTTNDAIAKVLKSFTT----------------KTVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRY 303 (394) Q Consensus 241 ~~~~~~~a~~~g~~~~~~----------------~~~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~ 303 (394) ++.+++.++++|+|++.+ .+..+++++.+++..+..+++ +++|||||++|..|++|+|++|+| T Consensus 235 i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~ 314 (415) T protein:vir:47 235 IAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNY 314 (415) T ss_pred HHHHHHHHHhhccccCCccccccccccccceeccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCe Confidence 999999999999887533 233568899999988887766 689999999999999999999999 Q ss_pred eecccccCCCcccccccceEEecCcccc---cCceEEEeccccEEEEeecceEEEEeecccccceEEEEEEeccEEeccc Q lcl|Aclame:pro 304 LLQDDITAVSGKVLLGKPVFVLSDEVLG---ANKAFIGDFKRGVLFADRKDLGLRWADNEIYGQYLQAVLRFGVSKVDDK 380 (394) Q Consensus 304 l~~~~~~~~~~~~l~G~pV~~~~~~~~~---~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~r~~~r~d~~v~~~~ 380 (394) ||+|++.++.+++|+|+||+++++++.+ +..++||||+++|++++|++++++++++.++.+++|+++|+|++|.+|+ T Consensus 315 i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~ 394 (415) T protein:vir:47 315 LIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYK 394 (415) T ss_pred eeccCcCCCCCccccceeeEEeccccccCCCccEEEEEehhccEEEEeecceEEEeeccccCceEEEEEEEeccEEeccc Confidence 9999999999999999999998876643 3468999999989999999999999999999999999999999999999 Q ss_pred ceEEEEecCccCCC Q lcl|Aclame:pro 381 AGYYVTFTPEPLPL 394 (394) Q Consensus 381 af~~l~~~~~~~~~ 394 (394) ||++++++++++|- T Consensus 395 a~~~~~~~~~~~~~ 408 (415) T protein:vir:47 395 SAIVIEYDDSERGE 408 (415) T ss_pred cEEEEEeeccCCCC Confidence 99999999999999 No 4 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=100.00 E-value=3.9e-70 Score=401.08 Aligned_cols=386 Identities=27% Similarity=0.391 Sum_probs=306.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc Q lcl|Aclame:pro 3 EEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVTQ 82 (394) Q Consensus 3 ~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 82 (394) ++.++|+++++.++++++.+..++++..+++++.++++.+++++++|++++++++++++..++................. T Consensus 1 mk~~~em~~~l~el~~~~~~~~~e~~~~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:46 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchh Confidence 77999999999999999999999999999999999999999999999999999888877666544332222111111111 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-hhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhhe Q lcl|Aclame:pro 83 EEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETT-PVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPF 161 (394) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~ 161 (394) ................. .......+......... .........++..++.++|+++.+.|++.+++.++|+++ T Consensus 81 ~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~ 154 (415) T protein:vir:46 81 RTYRNQANINDLGISIQ------NTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKY 154 (415) T ss_pred hhhHHHHHHHHHHHhhh------hhhhhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhh Confidence 10000000000000000 00000001111111111 112222344566788899999999999999999999999 Q ss_pred eeeEeecCCceeEEEEecCCC-cccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHH Q lcl|Aclame:pro 162 TTVYQAKKASGKYPVLQRATT-KMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQI 240 (394) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~ 240 (394) |++++++++.+++|+.+..+. .+.|+.|++..++.+.++|++|++++++++++++||+|+++|+.++|++||.+.|+++ T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ 234 (415) T protein:vir:46 155 VTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMART 234 (415) T ss_pred cceeeccCCceeEEEEEecCCcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHH Confidence 999999999999999876554 4566677777777778899999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhccccccc----------------cccccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCce Q lcl|Aclame:pro 241 KVNTTNDAIAKVLKSFTT----------------KTVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRY 303 (394) Q Consensus 241 ~~~~~~~a~~~g~~~~~~----------------~~~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~ 303 (394) ++.+++.++++|+|++.+ .+..+++++.+++..+..+++ +++|||||++|..|++|+|++|+| T Consensus 235 i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~ 314 (415) T protein:vir:46 235 IAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNY 314 (415) T ss_pred HHHHHHHHHhhccccCCccccccccccccceeccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCe Confidence 999999999999887533 233568899999988887766 689999999999999999999999 Q ss_pred eecccccCCCcccccccceEEecCcccc---cCceEEEeccccEEEEeecceEEEEeecccccceEEEEEEeccEEeccc Q lcl|Aclame:pro 304 LLQDDITAVSGKVLLGKPVFVLSDEVLG---ANKAFIGDFKRGVLFADRKDLGLRWADNEIYGQYLQAVLRFGVSKVDDK 380 (394) Q Consensus 304 l~~~~~~~~~~~~l~G~pV~~~~~~~~~---~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~r~~~r~d~~v~~~~ 380 (394) ||+|++.++.+++|+|+||+++++++.+ +..++||||+++|++++|++++++++++.++.+++|+++|+|++|.+|+ T Consensus 315 i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~ 394 (415) T protein:vir:46 315 LIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYK 394 (415) T ss_pred eeccCcCCCCCccccceeeEEeccccccCCCccEEEEEehhccEEEEeecceEEEeeccccCceEEEEEEEeccEEeccc Confidence 9999999999999999999998876643 3468999999989999999999999999999999999999999999999 Q ss_pred ceEEEEecCccCCC Q lcl|Aclame:pro 381 AGYYVTFTPEPLPL 394 (394) Q Consensus 381 af~~l~~~~~~~~~ 394 (394) ||++++++++++|- T Consensus 395 a~~~~~~~~~~~~~ 408 (415) T protein:vir:46 395 SAIVIEYDDSERGE 408 (415) T ss_pred cEEEEEeeccCCCC Confidence 99999999999999 No 5 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=100.00 E-value=2.9e-70 Score=401.79 Aligned_cols=380 Identities=34% Similarity=0.546 Sum_probs=292.0 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hchhh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc- Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNA---LESDD-LEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENI- 75 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~---~~~e~-~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~- 75 (394) ||+++++++++++++|+++.+++.++.+.. +++.. .++..++++++++|+++++.++++++++++.......... T Consensus 7 ~l~~~~~el~~~l~eL~e~~~~l~~~~~el~~~~ee~~~~e~~~~~~~~~~~l~~~i~~l~~~i~~~~~~~~~l~~~~~~ 86 (397) T protein:vir:96 7 ILNKQIKERSSEIDKLLSQRSDLEKQENDLERALEEAKTDEEISTVSDSADDLEKQVKDLDEKIAELQKEKQDLEDELAK 86 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 899999999999999888777666555443 22221 2334455666666666666666665554433221111000 Q ss_pred -ccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-hhhhhhhhcccccCCccccchhHHhHHHHHHH Q lcl|Aclame:pro 76 -GGKEVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINET-TPVEPQKDGIKKENAKPVSSEEILYTPAREVK 153 (394) Q Consensus 76 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~ 153 (394) ........................... .... ...... ........+.+...+++++|+++...|++ +. T Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~-~~ 156 (397) T protein:vir:96 87 AADPTDQKPKDGEKRKMKKFKVTEEELA------EKRS---AINAFVKSKGAEKRDGFTSVEGGALIPQELLQPQLE-PK 156 (397) T ss_pred hhhhhhhhhHHHHHHHHHHHhhhhHHHH------HHHH---HHHHHHHhhhhhhhhcccccccccchhHHHHHHHHH-hh Confidence 000000000011111111000000000 0000 000000 11111223456677889999999999988 46 Q ss_pred hhhhhhheeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHH Q lcl|Aclame:pro 154 TVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIV 233 (394) Q Consensus 154 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i 233 (394) +..+|+.+|++++++++++.+|+.+.++..++||.|++..++.++++|++|+++++++++++++|+++++|+.+++++|| T Consensus 157 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~~~~~~~~~s~ell~ds~~~l~~~i 236 (397) T protein:vir:96 157 DIVDLSKYVRSVPVNSASGKFPVISKSGSKMATVQQLEKNPQLANPKMVEIDYSVATRRGYIPISQEMIDDASYDVTGLI 236 (397) T ss_pred hhhhHHHhhhhccccccceeEEEEeccCCccccccccccccccccccccceeecHhHhhcchhhHHHHHhhhHHHHHHHH Confidence 77889999999999999999999988888899999999999888999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhccccccccccccHHHHHHHHHhhhhhhcccEEEEcHHHHHHHHhhhccCCceeecccccCCC Q lcl|Aclame:pro 234 SESISQIKVNTTNDAIAKVLKSFTTKTVKNLDEIKALLNGGFDPAYNVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVS 313 (394) Q Consensus 234 ~~~l~~~~~~~~~~a~~~g~~~~~~~~~~~~~~i~~~~~~~~~~~~~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~ 313 (394) .++|++.++.+++.++++|+|++++.+..++|++.+++....+++++++|||||++|..|++|+|++|+|||+|+++++. T Consensus 237 ~~~l~~~~~~~~~~~i~~g~g~~~~~~~~~~d~~~~~~~~~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~~~~~~~~~~~ 316 (397) T protein:vir:96 237 ADEIQDQSLNTKNADIAAVLKTATAKSVVGVDGLKDLINKEIKKVYDVKLFISASMYSELDKLKDKNGRYLLQDSITAAS 316 (397) T ss_pred HHHHHHHHHHHHHHHHhhcccccccccccchHHHHHHHHHhhhhhcCcEEEEcHHHHHHHHHhhccCCCeEeccCccCCC Confidence 99999999999999999999999999999999999999988889999999999999999999999999999999999999 Q ss_pred cccccccceEEecCcc----cccCceEEEeccccEEEEeecceEEEEeecccccceEEEEEEeccEEecccceEEEEecC Q lcl|Aclame:pro 314 GKVLLGKPVFVLSDEV----LGANKAFIGDFKRGVLFADRKDLGLRWADNEIYGQYLQAVLRFGVSKVDDKAGYYVTFTP 389 (394) Q Consensus 314 ~~~l~G~pV~~~~~~~----~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~r~~~r~d~~v~~~~af~~l~~~~ 389 (394) +++|+|+||+++++.. .+..+++||||+++|++++|+++++.++++.+|.+++|+++|+|++|++|+||+++++++ T Consensus 317 ~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~ 396 (397) T protein:vir:96 317 GKQLLGKEVVVLDDDVIGKSVGNVVGFIGDAKAFASFFDRKQVSVSWVDNNIYGQLLAGIIRYDVKATDKKAGFYVTFTI 396 (397) T ss_pred cccccccceEEecccccCCCCCceEEEEeehhcceEeEeecceEEEEecccccceeEEEEEEEccEEecccceEEEEeec Confidence 9999999999876533 334568999999999999999999999999999999999999999999999999999999 Q ss_pred c Q lcl|Aclame:pro 390 E 390 (394) Q Consensus 390 ~ 390 (394) + T Consensus 397 a 397 (397) T protein:vir:96 397 G 397 (397) T ss_pred C Confidence 9 No 6 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=100.00 E-value=1.1e-69 Score=398.54 Aligned_cols=386 Identities=27% Similarity=0.395 Sum_probs=304.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc Q lcl|Aclame:pro 3 EEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVTQ 82 (394) Q Consensus 3 ~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 82 (394) |+.++||++++.++++++.++.++++.++++++.+++++++.++++++++++++++.++.++.................. T Consensus 1 mk~~~el~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:79 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccchh Confidence 88999999999999999999999999999999999999999999999999999888777665443322211111111111 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-hhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhhe Q lcl|Aclame:pro 83 EEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETT-PVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPF 161 (394) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~ 161 (394) .................... .............. .........++++|++++|+++.+.|++.+++.++|+++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~ 154 (415) T protein:vir:79 81 RTYRNQANINDLGISIQNTK------VTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKY 154 (415) T ss_pred hhHHHHHHHHHHhhhhhhhh------hHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhh Confidence 00000000000000000000 00001111111111 112222345566788999999999999999999999999 Q ss_pred eeeEeecCCceeEEEEecCCC-cccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHH Q lcl|Aclame:pro 162 TTVYQAKKASGKYPVLQRATT-KMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQI 240 (394) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~ 240 (394) |++++++++++++|+.+.++. .+.|+.|++..++.+.++|+++++++++++++++||+||++|+.++|++||.+.|+++ T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ 234 (415) T protein:vir:79 155 VTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMART 234 (415) T ss_pred eeeeeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHH Confidence 999999999999998876554 4556667777776677899999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhccccccc----------------cccccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCce Q lcl|Aclame:pro 241 KVNTTNDAIAKVLKSFTT----------------KTVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRY 303 (394) Q Consensus 241 ~~~~~~~a~~~g~~~~~~----------------~~~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~ 303 (394) ++.+++.++++|+|++.+ .+..++++|.+++..+..+++ +++|+||+++|..|+++||++|+| T Consensus 235 ~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~ 314 (415) T protein:vir:79 235 IAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNY 314 (415) T ss_pred HHHHHHHHHhhccccCccccccccccccccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCce Confidence 999999999999877543 234578999999988877765 689999999999999999999999 Q ss_pred eecccccCCCcccccccceEEecCcccc---cCceEEEeccccEEEEeecceEEEEeecccccceEEEEEEeccEEeccc Q lcl|Aclame:pro 304 LLQDDITAVSGKVLLGKPVFVLSDEVLG---ANKAFIGDFKRGVLFADRKDLGLRWADNEIYGQYLQAVLRFGVSKVDDK 380 (394) Q Consensus 304 l~~~~~~~~~~~~l~G~pV~~~~~~~~~---~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~r~~~r~d~~v~~~~ 380 (394) ||+|++.++.+++|+|+||+++++++.+ +.+++||||+++|++++|.+++++++++.++.+++|+++|+|++|++|+ T Consensus 315 l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~ 394 (415) T protein:vir:79 315 LIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYK 394 (415) T ss_pred eeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEeccEEeccc Confidence 9999999999999999999998876543 4468999999989999999999999999999999999999999999999 Q ss_pred ceEEEEecCccCCC Q lcl|Aclame:pro 381 AGYYVTFTPEPLPL 394 (394) Q Consensus 381 af~~l~~~~~~~~~ 394 (394) ||+++++++++.|- T Consensus 395 a~~~~~~~~~~~~~ 408 (415) T protein:vir:79 395 SAIVIEYDDSERGE 408 (415) T ss_pred cEEEEEEeccCCCC Confidence 99999999999998 No 7 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=100.00 E-value=1.1e-69 Score=398.54 Aligned_cols=386 Identities=27% Similarity=0.395 Sum_probs=304.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc Q lcl|Aclame:pro 3 EEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVTQ 82 (394) Q Consensus 3 ~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 82 (394) |+.++||++++.++++++.++.++++.++++++.+++++++.++++++++++++++.++.++.................. T Consensus 1 mk~~~el~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:98 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccchh Confidence 88999999999999999999999999999999999999999999999999999888777665443322211111111111 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-hhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhhe Q lcl|Aclame:pro 83 EEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETT-PVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPF 161 (394) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~ 161 (394) .................... .............. .........++++|++++|+++.+.|++.+++.++|+++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~ 154 (415) T protein:vir:98 81 RTYRNQANINDLGISIQNTK------VTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKY 154 (415) T ss_pred hhHHHHHHHHHHhhhhhhhh------hHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhh Confidence 00000000000000000000 00001111111111 112222345566788999999999999999999999999 Q ss_pred eeeEeecCCceeEEEEecCCC-cccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHH Q lcl|Aclame:pro 162 TTVYQAKKASGKYPVLQRATT-KMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQI 240 (394) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~ 240 (394) |++++++++++++|+.+.++. .+.|+.|++..++.+.++|+++++++++++++++||+||++|+.++|++||.+.|+++ T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ 234 (415) T protein:vir:98 155 VTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMART 234 (415) T ss_pred eeeeeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHH Confidence 999999999999998876554 4556667777776677899999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhccccccc----------------cccccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCce Q lcl|Aclame:pro 241 KVNTTNDAIAKVLKSFTT----------------KTVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRY 303 (394) Q Consensus 241 ~~~~~~~a~~~g~~~~~~----------------~~~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~ 303 (394) ++.+++.++++|+|++.+ .+..++++|.+++..+..+++ +++|+||+++|..|+++||++|+| T Consensus 235 ~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~ 314 (415) T protein:vir:98 235 IAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNY 314 (415) T ss_pred HHHHHHHHHhhccccCccccccccccccccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCce Confidence 999999999999877543 234578999999988877765 689999999999999999999999 Q ss_pred eecccccCCCcccccccceEEecCcccc---cCceEEEeccccEEEEeecceEEEEeecccccceEEEEEEeccEEeccc Q lcl|Aclame:pro 304 LLQDDITAVSGKVLLGKPVFVLSDEVLG---ANKAFIGDFKRGVLFADRKDLGLRWADNEIYGQYLQAVLRFGVSKVDDK 380 (394) Q Consensus 304 l~~~~~~~~~~~~l~G~pV~~~~~~~~~---~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~r~~~r~d~~v~~~~ 380 (394) ||+|++.++.+++|+|+||+++++++.+ +.+++||||+++|++++|.+++++++++.++.+++|+++|+|++|++|+ T Consensus 315 l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~ 394 (415) T protein:vir:98 315 LIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYK 394 (415) T ss_pred eeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEeccEEeccc Confidence 9999999999999999999998876543 4468999999989999999999999999999999999999999999999 Q ss_pred ceEEEEecCccCCC Q lcl|Aclame:pro 381 AGYYVTFTPEPLPL 394 (394) Q Consensus 381 af~~l~~~~~~~~~ 394 (394) ||+++++++++.|- T Consensus 395 a~~~~~~~~~~~~~ 408 (415) T protein:vir:98 395 SAIVIEYDDSERGE 408 (415) T ss_pred cEEEEEEeccCCCC Confidence 99999999999998 No 8 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=100.00 E-value=1.1e-69 Score=398.54 Aligned_cols=386 Identities=27% Similarity=0.395 Sum_probs=304.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc Q lcl|Aclame:pro 3 EEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVTQ 82 (394) Q Consensus 3 ~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 82 (394) |+.++||++++.++++++.++.++++.++++++.+++++++.++++++++++++++.++.++.................. T Consensus 1 mk~~~el~~~l~el~~~~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:81 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccchh Confidence 88999999999999999999999999999999999999999999999999999888777665443322211111111111 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-hhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhhe Q lcl|Aclame:pro 83 EEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETT-PVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPF 161 (394) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~ 161 (394) .................... .............. .........++++|++++|+++.+.|++.+++.++|+++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~ 154 (415) T protein:vir:81 81 RTYRNQANINDLGISIQNTK------VTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKY 154 (415) T ss_pred hhHHHHHHHHHHhhhhhhhh------hHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhh Confidence 00000000000000000000 00001111111111 112222345566788999999999999999999999999 Q ss_pred eeeEeecCCceeEEEEecCCC-cccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHH Q lcl|Aclame:pro 162 TTVYQAKKASGKYPVLQRATT-KMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQI 240 (394) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~ 240 (394) |++++++++++++|+.+.++. .+.|+.|++..++.+.++|+++++++++++++++||+||++|+.++|++||.+.|+++ T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ 234 (415) T protein:vir:81 155 VTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMART 234 (415) T ss_pred eeeeeccCCceeEEEEeecCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHHHhhchHHHHHHHHHHHHHH Confidence 999999999999998876554 4556667777776677899999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhccccccc----------------cccccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCce Q lcl|Aclame:pro 241 KVNTTNDAIAKVLKSFTT----------------KTVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRY 303 (394) Q Consensus 241 ~~~~~~~a~~~g~~~~~~----------------~~~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~ 303 (394) ++.+++.++++|+|++.+ .+..++++|.+++..+..+++ +++|+||+++|..|+++||++|+| T Consensus 235 ~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~ 314 (415) T protein:vir:81 235 IAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNY 314 (415) T ss_pred HHHHHHHHHhhccccCccccccccccccccccccccccchhHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCce Confidence 999999999999877543 234578999999988877765 689999999999999999999999 Q ss_pred eecccccCCCcccccccceEEecCcccc---cCceEEEeccccEEEEeecceEEEEeecccccceEEEEEEeccEEeccc Q lcl|Aclame:pro 304 LLQDDITAVSGKVLLGKPVFVLSDEVLG---ANKAFIGDFKRGVLFADRKDLGLRWADNEIYGQYLQAVLRFGVSKVDDK 380 (394) Q Consensus 304 l~~~~~~~~~~~~l~G~pV~~~~~~~~~---~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~r~~~r~d~~v~~~~ 380 (394) ||+|++.++.+++|+|+||+++++++.+ +.+++||||+++|++++|.+++++++++.++.+++|+++|+|++|++|+ T Consensus 315 l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~ 394 (415) T protein:vir:81 315 LIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYK 394 (415) T ss_pred eeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEeccEEeccc Confidence 9999999999999999999998876543 4468999999989999999999999999999999999999999999999 Q ss_pred ceEEEEecCccCCC Q lcl|Aclame:pro 381 AGYYVTFTPEPLPL 394 (394) Q Consensus 381 af~~l~~~~~~~~~ 394 (394) ||+++++++++.|- T Consensus 395 a~~~~~~~~~~~~~ 408 (415) T protein:vir:81 395 SAIVIEYDDSERGE 408 (415) T ss_pred cEEEEEEeccCCCC Confidence 99999999999998 No 9 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=100.00 E-value=3.4e-69 Score=395.98 Aligned_cols=386 Identities=27% Similarity=0.395 Sum_probs=305.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc Q lcl|Aclame:pro 3 EEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVTQ 82 (394) Q Consensus 3 ~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 82 (394) ++.++||+++++++++++.+..++++.++++++.++++.+.+|++.|+++++++++.++..++................. T Consensus 1 mk~~~el~~~l~el~~~~~~~~~~~~~~~~~~~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (415) T protein:vir:94 1 MKTKEELQSEISDIKRQIDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEA 80 (415) T ss_pred CChHHHHHHHHHHHHHHHHHHHHHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccch Confidence 88999999999999999999999999999999999999999999999999998887776655543322211111111111 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-hhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhhe Q lcl|Aclame:pro 83 EEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTP-VEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPF 161 (394) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~ 161 (394) .................... ....+.......... ........++.+++.++|+++.+.|++.+++.++|+++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~------~~~~e~~~~~~~~~~~~~~~~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~ 154 (415) T protein:vir:94 81 STYRNQANINDLGISIQNTK------VTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKY 154 (415) T ss_pred hhHHHHHHHHHHHhhhhhhh------hhHHHHHHHHHHhhhhhhhhhhccccccccccCcHHHHHHHHHHHHhhhhhhhh Confidence 11111111111110000000 000011111111111 12223345566788999999999999999999999999 Q ss_pred eeeEeecCCceeEEEEecCCC-cccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHH Q lcl|Aclame:pro 162 TTVYQAKKASGKYPVLQRATT-KMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQI 240 (394) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~ 240 (394) |++++++++.+++|+...++. .+.|+.|++..++.+.++|++|++++++++++++||+|+++|+.++|++||.+.|+++ T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~ 234 (415) T protein:vir:94 155 VTVKRVTNGSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAKVNVLQELKLWMART 234 (415) T ss_pred cceeeccCCceeEEEEeecCCccceeccccccccccccccceeeEeeheeeeeechhhHHHHhhchHHHHHHHHHHHHHH Confidence 999999999999998876554 4556667777776677899999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhccccccc----------------cccccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCce Q lcl|Aclame:pro 241 KVNTTNDAIAKVLKSFTT----------------KTVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRY 303 (394) Q Consensus 241 ~~~~~~~a~~~g~~~~~~----------------~~~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~ 303 (394) ++.+++.++++|+|++.+ .+..+++++.+++..+..+++ +++|+|||++|..|+++||++|+| T Consensus 235 ~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~ 314 (415) T protein:vir:94 235 IAATRNKAIIDVITKGSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNY 314 (415) T ss_pred HHHHHHHHHhhccccCccccccccccccccccccccccchHHHHHHHHhhhhhccCCCEEEEcHHHHHHHHHhhccCCCe Confidence 999999999999887543 234568999999988877765 689999999999999999999999 Q ss_pred eecccccCCCcccccccceEEecCcccc---cCceEEEeccccEEEEeecceEEEEeecccccceEEEEEEeccEEeccc Q lcl|Aclame:pro 304 LLQDDITAVSGKVLLGKPVFVLSDEVLG---ANKAFIGDFKRGVLFADRKDLGLRWADNEIYGQYLQAVLRFGVSKVDDK 380 (394) Q Consensus 304 l~~~~~~~~~~~~l~G~pV~~~~~~~~~---~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~r~~~r~d~~v~~~~ 380 (394) ||.|++.++.+++|+|+||+++++++.+ +.+++||||+++|++++|.+++++++++.++.+++|+++|+|++|.+|+ T Consensus 315 l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~r~~~r~d~~~~~~~ 394 (415) T protein:vir:94 315 LIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYK 394 (415) T ss_pred eeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEeccEEeccc Confidence 9999999999999999999998876543 3468999999989999999999999999999999999999999999999 Q ss_pred ceEEEEecCccCCC Q lcl|Aclame:pro 381 AGYYVTFTPEPLPL 394 (394) Q Consensus 381 af~~l~~~~~~~~~ 394 (394) ||++++++++++|- T Consensus 395 a~~~~~~~~~~~~~ 408 (415) T protein:vir:94 395 SAIVIEYDDSERGE 408 (415) T ss_pred cEEEEEEeccCCCC Confidence 99999999999998 No 10 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=100.00 E-value=2.6e-69 Score=396.61 Aligned_cols=366 Identities=41% Similarity=0.676 Sum_probs=279.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhc--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|Aclame:pro 3 EEKIKEIKATIADLNNTIVTKTAQVKNALE--SDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEV 80 (394) Q Consensus 3 ~e~l~eL~~~~~el~~~~~~~~~e~~~~~~--~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) +++|+++.++++ ++++++.++++.... ....+++++++++++++.++++.++++++..+................ T Consensus 1 meeL~~~~~~~~---~~~~e~~~~l~~~~~~~~~~~e~~~~l~~ei~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~ 77 (389) T protein:vir:10 1 MDKLQTLFNDVS---AKCADLNAQLNAKLQDENASVDDFQKIKDDLTAAKARRDAINDQIKALEAEKPAEPKTEPKDDGS 77 (389) T ss_pred ChHHHHHHHHHH---HHHHHHHHHHHHHHHhHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc Confidence 334444444333 333333333332211 123455667778888888888888877777655433222111111111 Q ss_pred cchh-------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHH Q lcl|Aclame:pro 81 TQEE-------KTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVK 153 (394) Q Consensus 81 ~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~ 153 (394) .... ......+..+++.. ..........+++.|+++||+++...|++.++ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~lr~~-----------------------~~~~~~~~~~t~~~gg~~vP~~~~~~i~~~~~ 134 (389) T protein:vir:10 78 KKGTDLSKKPIDAKKKAINDFIHSH-----------------------GKVIDATSKVTSTEAGVLIPEEIIYDPTAEVN 134 (389) T ss_pred ccccccchhHHHHHHHHHHHHhhcc-----------------------hhhhhhhcccccCCcceeehHHHHHHHHHHHH Confidence 0000 00111111111110 00111123455677889999999999999999 Q ss_pred hhhhhhheeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHH Q lcl|Aclame:pro 154 TVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIV 233 (394) Q Consensus 154 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i 233 (394) +.++|+++|++++++++++++|+.+..++.++||.|++..++.++++|++++++++++++++++|+|+++||.++|++|| T Consensus 135 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i 214 (389) T protein:vir:10 135 SVVDLSTLVTKTPVTTPKGTYPILKRATDRFSSVAELAENPKLAEPEFNKVDWSVATYRGAIPLSEEAIADSAVDLTALV 214 (389) T ss_pred hhhhHHhhcceeeccCCeeEEEEEecCCCccccccccccccccccccceeeeeeheeeEeeehhhHHHHhhhhHHHHHHH Confidence 99999999999999999999999998888888999999999888999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhcccccccc---ccccHHHHHHHHHhhhhhhcccEEEEcHHHHHHHHhhhccCCceeeccccc Q lcl|Aclame:pro 234 SESISQIKVNTTNDAIAKVLKSFTTK---TVKNLDEIKALLNGGFDPAYNVSLIVSQSFYQTLDTLKDGNGRYLLQDDIT 310 (394) Q Consensus 234 ~~~l~~~~~~~~~~a~~~g~~~~~~~---~~~~~~~i~~~~~~~~~~~~~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~ 310 (394) .++|+++++.+++.+|++|++++.+. +..+++++.+++...++++++++|+||+++|..|++|||++|+|||+|++. T Consensus 215 ~~~la~~~~~~~~~~i~~g~~~~~~~~~~~~~~~d~l~~~~~~~~~~~~~a~~~~n~~~~~~L~~lkd~~G~~i~~~~~~ 294 (389) T protein:vir:10 215 GQSIKEKSVNTYNAMIAPVLQSFTAKKTTTDTLVDSLKHILNVDLDPAYSRALVVTQSLFNTLDTLKDKNGRYLLHDASD 294 (389) T ss_pred HHHHHHHHHHHHHHHHhhhhcccccccccccccHHHHHHHHHhhhhhhhCcEEEecHHHHHHHHHhhccCCCeeeecCcc Confidence 99999999999999999999876554 456789999998888888899999999999999999999999999998764 Q ss_pred C----CCcccccccceEEecCccc----ccCceEEEeccccEEEEeecceEEEEeecccccceEEEEEEeccEEecccce Q lcl|Aclame:pro 311 A----VSGKVLLGKPVFVLSDEVL----GANKAFIGDFKRGVLFADRKDLGLRWADNEIYGQYLQAVLRFGVSKVDDKAG 382 (394) Q Consensus 311 ~----~~~~~l~G~pV~~~~~~~~----~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~r~~~r~d~~v~~~~af 382 (394) + +.+++|||+||+++++... ++.+++||||+++|++++|++++|.++++.+|.+.+|+++|+|++|++|+|| T Consensus 295 ~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~d~~~~~~~a~ 374 (389) T protein:vir:10 295 SITDGTAKGTILGVPVYVVGDTLLGSLAGDQKAFVGDLKRGVLFTDRQQVTLAWEDSKIYGKYLGAAFRFGVQKADSKAG 374 (389) T ss_pred cccccccccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEeeccccccceEEEEEEeccEEecccce Confidence 4 4457999999998876433 2345899999998999999999999999999999999999999999999999 Q ss_pred EEEEecCccCCC Q lcl|Aclame:pro 383 YYVTFTPEPLPL 394 (394) Q Consensus 383 ~~l~~~~~~~~~ 394 (394) ++++++++|++- T Consensus 375 ~~~~~~~~~~~~ 386 (389) T protein:vir:10 375 YFVTNTDVPGSA 386 (389) T ss_pred EEEEeeccCCCC Confidence 999999888555 No 11 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=100.00 E-value=7.5e-69 Score=394.08 Aligned_cols=369 Identities=39% Similarity=0.653 Sum_probs=291.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc--c Q lcl|Aclame:pro 3 EEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKE--V 80 (394) Q Consensus 3 ~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~--~ 80 (394) |++|++|++++++..+++++..++.... .....++.+++.++++.+..+++.++++++.+++..+........... . T Consensus 1 M~~l~~l~~~~~~~~~e~~~~~~~~~~~-~~~~~ee~~~~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~ 79 (394) T protein:vir:10 1 MDKLQTLFNEVSAKCADLNAQLNAKLQD-ENASVDDFQKIKDDLTAAKARRDAINDQIKDLEAENKANSDPDKPVDNAQP 79 (394) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHhh-hhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhhhcc Confidence 6677777777666666665555443322 122235667777888888888888887777766554332211111000 0 Q ss_pred ------cchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHh Q lcl|Aclame:pro 81 ------TQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKT 154 (394) Q Consensus 81 ------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~ 154 (394) ........+.+..++++. .........+.+++.|++++|++++..|++.+++ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~l~~~----------------------~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~ 137 (394) T protein:vir:10 80 NGTDLKKKPIDAKKKAINDFIHSH----------------------GKVIDNAAGHVTSTEAGVLIPEEIIYDPTAEVNS 137 (394) T ss_pred cccchhhhHHHHHHHHHHHHHhcc----------------------chhhhhhhcccccccCceeccHHHHHHHHHHHHh Confidence 000001111121111111 1111223345667778899999999999999999 Q ss_pred hhhhhheeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHH Q lcl|Aclame:pro 155 VVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVS 234 (394) Q Consensus 155 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~ 234 (394) .++|+++|++++++++++.+|+.+..++.+.||.|++..++.++++|++|++++++++++++||+||++||.++|++||. T Consensus 138 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~ 217 (394) T protein:vir:10 138 VVDLSTLVTKTPVTTPKGTYPILKRATDRFSSVAELAENPALAEPEFEQVDWSVSTYRGAIPLSEEAIADSAVDLTSLVG 217 (394) T ss_pred hhhhhhhceeeeccCCceEEEEEecCCCccccccccccccccccccceeEEeeeeeeEeeehhHHHHHhhhhHHHHHHHH Confidence 99999999999999999999999887888899999999998788999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhhccccccccc---cccHHHHHHHHHhhhhhhcccEEEEcHHHHHHHHhhhccCCceeecccccC Q lcl|Aclame:pro 235 ESISQIKVNTTNDAIAKVLKSFTTKT---VKNLDEIKALLNGGFDPAYNVSLIVSQSFYQTLDTLKDGNGRYLLQDDITA 311 (394) Q Consensus 235 ~~l~~~~~~~~~~a~~~g~~~~~~~~---~~~~~~i~~~~~~~~~~~~~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~ 311 (394) ++|+++++.+++.++++|.|++++.+ ..++|++.+++...++++++++|||||++|.+|++|+|++|+|||+|++.+ T Consensus 218 ~~la~~~~~~~~~~il~g~g~~~~~~~~~~~~~d~l~~~~~~~~~~~~~a~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~ 297 (394) T protein:vir:10 218 QSINEKSVNTYNAMIAPVLQSFTAKATTTDTLVDSLKHILNVDLDPAYSRALVVTQSLFNTLDTLKDKNGRYLLHDASDS 297 (394) T ss_pred HHHHHHHHHHHHHHHhhcccccccccccccccHHHHHHHHHhhhhhhccCEEEecHHHHHHHHHhhccCCCeeeeccccc Confidence 99999999999999999999876654 456889999888888889999999999999999999999999999987644 Q ss_pred ----CCcccccccceEEecCccc----ccCceEEEeccccEEEEeecceEEEEeecccccceEEEEEEeccEEecccceE Q lcl|Aclame:pro 312 ----VSGKVLLGKPVFVLSDEVL----GANKAFIGDFKRGVLFADRKDLGLRWADNEIYGQYLQAVLRFGVSKVDDKAGY 383 (394) Q Consensus 312 ----~~~~~l~G~pV~~~~~~~~----~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~r~~~r~d~~v~~~~af~ 383 (394) +.+++|+|+||+++++... ++.+++||||+++|+++++++++++++++.+|.+++|+++|+|++|++|+||+ T Consensus 298 ~~~~~~~~~L~G~PV~~~~~~~~~~~~~~~~i~~gd~s~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~~~~~~ai~ 377 (394) T protein:vir:10 298 ITDGTAKGTVLGVPVYVVGDALLGSAAGDQKAFVGDLKRGVLFADRQQVTLAWEDSKIYGRYLGAAFRFGVKQADSNAGY 377 (394) T ss_pred cccCCcccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEEecccccceeEEEEEEeccEEeccccEE Confidence 4457999999999876433 34468999999999999999999999999999999999999999999999999 Q ss_pred EEEecCccCCC Q lcl|Aclame:pro 384 YVTFTPEPLPL 394 (394) Q Consensus 384 ~l~~~~~~~~~ 394 (394) +++++++++|. T Consensus 378 ~~~~~~~~~~~ 388 (394) T protein:vir:10 378 FVTNTDAASGS 388 (394) T ss_pred EEEeecccCCC Confidence 99999999887 No 12 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=100.00 E-value=2.6e-66 Score=380.14 Aligned_cols=374 Identities=21% Similarity=0.275 Sum_probs=290.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDD--LEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGK 78 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~--~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~ 78 (394) |-| .|+||++++.++.++++++.++++..+++++ .+++++++++++.+.+++++++.+++..+.............. T Consensus 3 ~~m-~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 81 (408) T protein:vir:10 3 VKL-TVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGP 81 (408) T ss_pred ccc-cHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc Confidence 333 5899999999999999999988877655443 4567788888888888888888777766554333222221111 Q ss_pred ccc---chhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh-hhhhhcccccCCccccchhHHhHHHHHHHh Q lcl|Aclame:pro 79 EVT---QEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPV-EPQKDGIKKENAKPVSSEEILYTPAREVKT 154 (394) Q Consensus 79 ~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~ 154 (394) ... .......+.+..+.+.... ..... ......++.+.|+++||+++++.|++.+++ T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~-------------------~~~~~~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~ 142 (408) T protein:vir:10 82 LNKSENELKDKFVKDFVNMVRNPMA-------------------FMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQ 142 (408) T ss_pred cccchhhhHHHHHHHHHHHhhcchh-------------------hhhhhhhhhhhcccccCCceeccHhHHHHHHHHHHh Confidence 111 1111111122222111100 00011 111234556668899999999999999999 Q ss_pred hhhhhheeeeEeecCCceeEEEEecCC--CcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHH Q lcl|Aclame:pro 155 VVDLKPFTTVYQAKKASGKYPVLQRAT--TKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGI 232 (394) Q Consensus 155 ~~~l~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~ 232 (394) .++|+++|+++++++.++++|+....+ ..+.|++|++..++.+.++|++|++++++++++++||+||++|+.++|.+| T Consensus 143 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~ 222 (408) T protein:vir:10 143 YDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAW 222 (408) T ss_pred hchhhhhcceeeccCCcceEEEeeccccccceeeecCccccccccCcceeeEEeeeeeEEeeehhHHHHHhhchHHHHHH Confidence 999999999999999888888876543 344566777777776779999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhcccccccc-ccccHHHHHHHHHhhhhhhc--ccEEEEcHHHHHHHHhhhccCCceeecccc Q lcl|Aclame:pro 233 VSESISQIKVNTTNDAIAKVLKSFTTK-TVKNLDEIKALLNGGFDPAY--NVSLIVSQSFYQTLDTLKDGNGRYLLQDDI 309 (394) Q Consensus 233 i~~~l~~~~~~~~~~a~~~g~~~~~~~-~~~~~~~i~~~~~~~~~~~~--~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~ 309 (394) |.+.|+++++.+++.+|++|+|++++. +..+++++.+++...+.+.+ +++|+||+++|.+|++++|++|+|||++++ T Consensus 223 i~~~l~~~~~~~~~~~il~g~g~~~~~~~~~~~~~l~~~~~~~~~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~ 302 (408) T protein:vir:10 223 LSSWIAKKVVVTRNQAIIEVMKAAPKKPTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDP 302 (408) T ss_pred HHHHHHHHHHHHHHHHHhhcccccccccccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceEeccCc Confidence 999999999999999999999998875 67789999998877666654 689999999999999999999999999999 Q ss_pred cCCCcccccccceEEecCcccc-----cCceEEEeccccEEEEeecceEEEEeeccc--c---cceEEEEEEeccEEecc Q lcl|Aclame:pro 310 TAVSGKVLLGKPVFVLSDEVLG-----ANKAFIGDFKRGVLFADRKDLGLRWADNEI--Y---GQYLQAVLRFGVSKVDD 379 (394) Q Consensus 310 ~~~~~~~l~G~pV~~~~~~~~~-----~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--~---~~~~r~~~r~d~~v~~~ 379 (394) .++.+++|+|+||+++++...+ ...++||||+++|++++|++++|.++++.+ | ...+|+++|+|++|++| T Consensus 303 ~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~ 382 (408) T protein:vir:10 303 TKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDS 382 (408) T ss_pred CCCCCceecceeeEEecccccCccCCCceEEEEEehhccEEEEEecceEEEEcccccchhhcCceEEEEEEeeccEEecc Confidence 9999999999999997754433 345899999999999999999999987654 3 45689999999999999 Q ss_pred cceEEEEecCccCCC Q lcl|Aclame:pro 380 KAGYYVTFTPEPLPL 394 (394) Q Consensus 380 ~af~~l~~~~~~~~~ 394 (394) +||+++++++++.+. T Consensus 383 ~a~~~~~~~~~~~~~ 397 (408) T protein:vir:10 383 EALVAGSFSAIADQV 397 (408) T ss_pred ccEEEEEeeccccCC Confidence 999999999976433 No 13 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=100.00 E-value=2e-66 Score=380.82 Aligned_cols=373 Identities=18% Similarity=0.213 Sum_probs=287.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|Aclame:pro 3 EEKIKEIKATIADLNNTIVTKTAQVKNALESDD--LEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEV 80 (394) Q Consensus 3 ~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~--~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) |.+++||++.+++++++++.+.++++....+++ .+++++++++++.++++++.++++++............... ... T Consensus 1 Mk~~~el~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ee~~~~~~~i~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~-~~~ 79 (397) T protein:vir:49 1 MKTSNELHDLWVAQGDKVENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDMFKEQYTEARANEVANMSEEEK-KPL 79 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccc-ccc Confidence 779999999999999999998888876554443 35677778888888887777776665544333222111111 111 Q ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhh Q lcl|Aclame:pro 81 TQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKP 160 (394) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~ 160 (394) ....... ...+.+........ . ...........+++.|++++|+++.+.|++.+++.++|++ T Consensus 80 ~~~~~~~---~~~~~~~~~~~l~~----~-----------~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~ 141 (397) T protein:vir:49 80 TKSEEEV---KAGFVKDFKNLVRG----R-----------YQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQE 141 (397) T ss_pred ccchhHH---HHHHHHHHHHHHhc----c-----------hhHHHHHhhccccccCcccccHhHHHHHHHHHHhhhhHHh Confidence 1111110 01111111110000 0 0001111234566778899999999999999999999999 Q ss_pred eeeeEeecCCceeEEEEecC--CCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHH Q lcl|Aclame:pro 161 FTTVYQAKKASGKYPVLQRA--TTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESIS 238 (394) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~--~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~ 238 (394) +|+++++++.++++++.... .+.+.|++|++..++.++++|+++++++++++++++||+||++|+.++|++||.+.|+ T Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~ 221 (397) T protein:vir:49 142 YVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADVDDPKLSLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIA 221 (397) T ss_pred hhceeecccCccceEEEeeccCCcceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhhhHHHHHHHHHHHHH Confidence 99999998887777766543 3446677777777777889999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhcccccccc-ccccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCceeecccccCCCccc Q lcl|Aclame:pro 239 QIKVNTTNDAIAKVLKSFTTK-TVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKV 316 (394) Q Consensus 239 ~~~~~~~~~a~~~g~~~~~~~-~~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~ 316 (394) ++++++++.++++|+|++++. +..++|++.+++..+...++ +++|+|||++|..|++|+|++|+|||+|++.++.+++ T Consensus 222 ~~~~~~~d~ai~~G~g~~~~~~~~~~~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~ 301 (397) T protein:vir:49 222 KKVVVTRNKAILEAIAALPTKPTLTKWDDIIDLEAKVDPAIKQTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGYS 301 (397) T ss_pred HHHHHHHHHHHHhhccccccccccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeeccCcCCCCCce Confidence 999999999999999987764 46689999998877665544 6899999999999999999999999999999999999 Q ss_pred ccccceEEecCcc-----cccCceEEEeccccEEEEeecceEEEEeecc--cc---cceEEEEEEeccEEecccceEEEE Q lcl|Aclame:pro 317 LLGKPVFVLSDEV-----LGANKAFIGDFKRGVLFADRKDLGLRWADNE--IY---GQYLQAVLRFGVSKVDDKAGYYVT 386 (394) Q Consensus 317 l~G~pV~~~~~~~-----~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~--~~---~~~~r~~~r~d~~v~~~~af~~l~ 386 (394) |+|+||+++++.. .+...++||||+++|++++|++++++++++. .| ...+|+++|+|+++++|+||++++ T Consensus 302 l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~ 381 (397) T protein:vir:49 302 IDGFAVKEVADRWLANGTGGAMPLYFGDLKQAVTLFDRQHMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPAS 381 (397) T ss_pred ecceeeEEecccccccccCCceeEEEeeccceEEEEeecceEEEEeccccchhhcCceeEEEEeeeCcEEecccceEEEE Confidence 9999999876533 3455699999999899999999999987754 24 345899999999999999999999 Q ss_pred ecCccCCC Q lcl|Aclame:pro 387 FTPEPLPL 394 (394) Q Consensus 387 ~~~~~~~~ 394 (394) ++++++|- T Consensus 382 ~~~~~~~~ 389 (397) T protein:vir:49 382 FKAIADQK 389 (397) T ss_pred eecccCCC Confidence 99999887 No 14 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=100.00 E-value=6.5e-66 Score=377.98 Aligned_cols=390 Identities=23% Similarity=0.339 Sum_probs=273.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc------ Q lcl|Aclame:pro 4 EKIKEIKATIADLNNTIVTKTAQVKNALESDD--LEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENI------ 75 (394) Q Consensus 4 e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~--~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~------ 75 (394) =||+||++++.+++++++++.+++++...+.+ .++.+...++++++.+++++++++++..+........... T Consensus 1 Mki~elk~el~~~~~el~~~~~elr~~~~~~~~~~~el~~~~~e~~~~~~ei~el~~~l~~~~~~~~~~~e~~~~~~~~~ 80 (437) T protein:vir:10 1 MKIEKLKKDLATKTAELNTKKAEIRSFTESEDKTIDEVKAGMTEIKEKEDEIKEIRSNIEVLEQASALKVEEKRDDSDLV 80 (437) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 14778888888888888777777776544332 2233444455555555555555444433322111000000 Q ss_pred ----ccccccchhhhHHH---HHHHH-HHHHHHHHHHH-----------HHH--HHHH-HHHHHHHhhhhhhh-hhhccc Q lcl|Aclame:pro 76 ----GGKEVTQEEKTYRE---SVNDF-IRSKGKIVNDS-----------LRF--EGKD-EVLMPINETTPVEP-QKDGIK 132 (394) Q Consensus 76 ----~~~~~~~~~~~~~~---~~~~~-~~~~~~~~~~~-----------~~~--~~~~-~~~~~~~~~~~~~~-~~~~~~ 132 (394) .............. ..... ........... ... .... .............. .....+ T Consensus 81 ~~e~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~ 160 (437) T protein:vir:10 81 APELEENSADNEEDDPEKLKTETKSEAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIADKKVTAFADYLKTGEVRDVTGIA 160 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHHhhhhhhHHHHHhhhhhhhhhcc Confidence 00000000000000 00000 00000000000 000 0000 00001111111111 123356 Q ss_pred ccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhh Q lcl|Aclame:pro 133 KENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYR 212 (394) Q Consensus 133 ~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~ 212 (394) ...+++++|+++...|.. +++.++|+.+|++++++++.+.+|+....+..++|+.|++..++.++++|++|++.+++++ T Consensus 161 ~~~~g~lvp~~~~~~i~~-~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~e~~~~~~~~v~~~~~k~~ 239 (437) T protein:vir:10 161 LKDGKVIIPETILTPEKE-VHQFPRLGSLVRTESVTTTTGKLPIFNNSTDLLTAHTEYGQTTKNATPVITPILWDLKTYT 239 (437) T ss_pred cccccccchHHHHHHHHH-hhhhhhhhhcceeEeeccCceeeEEeeccccccccccccccccccccccceeeeeehhhee Confidence 677889999999887655 5778899999999999999999999988888889999999998878899999999999999 Q ss_pred hhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc--cccHHHHHHHHHhhhhhhc--ccEEEEcHH Q lcl|Aclame:pro 213 GAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTKT--VKNLDEIKALLNGGFDPAY--NVSLIVSQS 288 (394) Q Consensus 213 ~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~--~~~~~~i~~~~~~~~~~~~--~a~~vm~~~ 288 (394) ++++||+|+++|+.++|.+||.++|+++++.+++.+|++|+|++.+.+ ..+++++.+++...++++| +++|+||++ T Consensus 240 ~~~~is~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ 319 (437) T protein:vir:10 240 GGYVFSQELISDSSYDWQAELQSRLIELRDNTDDSLIITALTDGIKKTTSTYLLGDLKKVLNVTLKPQDSAAASIVMSQS 319 (437) T ss_pred eehhhhHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccchhhHHHHHHhhhhhhhhcCCEEEEcHH Confidence 999999999999999999999999999999999999999998876654 4458889998876666654 689999999 Q ss_pred HHHHHHhhhccCCceeecccccCCCcccccccceEEecCccc-----ccCceEEEeccccEEEEeecceEEEEee-cccc Q lcl|Aclame:pro 289 FYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVL-----GANKAFIGDFKRGVLFADRKDLGLRWAD-NEIY 362 (394) Q Consensus 289 ~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~-----~~~~~~~gd~~~~~~~~~~~~~~i~~~~-~~~~ 362 (394) +|..|++|+|++|+|||+|+++++.+++|+|+||++++++.. ++.+++||||+++|++++|+++++.+++ ...+ T Consensus 320 ~~~~l~~lkd~~g~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~r~~~~~~~~~~~~~~ 399 (437) T protein:vir:10 320 AYNLFDMATDAMGRPLLQPNVTAATGYTLLGKTVVIVDDKLFPSASAGDVNIVVAPLKKAVINFKLTEITGQFQDTYDIW 399 (437) T ss_pred HHHHHHHhhccCCCeeeccCccCCCCcccccceeEEecccccCCcCCCceEEEEeeccccEEEEeeeceEEEEecccccc Confidence 999999999999999999999999999999999999876533 3445899999999999999999999875 4567 Q ss_pred cceEEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 363 GQYLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 363 ~~~~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) .+.+++++|+|++|++|+||++|+.+.++.+. T Consensus 400 ~~~~~~~~r~d~~~~~~~a~~~l~~~~~~~~~ 431 (437) T protein:vir:10 400 YKQLGIFLRQNVVQASKDLIVNLTGKLKAVTV 431 (437) T ss_pred cceeeEEEEEccEEecccceEEEEeecccccc Confidence 78899999999999999999999977655444 No 15 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=100.00 E-value=3.5e-66 Score=379.41 Aligned_cols=370 Identities=26% Similarity=0.352 Sum_probs=278.1 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc------- Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAE------- 73 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~------- 73 (394) |+.++|+||+++++++.++. ++++++++.++++++.++++.++++++.+++..+...+........ T Consensus 4 ~m~k~l~el~~~~~~~~~~~-------~~~~~~~~~ee~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (397) T protein:vir:12 4 QMSKKEIALRQQFTEKKQQA-------DKALQEGNTDEARALLDEVKQLKNQIELMTEGRSLDVPDLPGGVNFVPEQERN 76 (397) T ss_pred cHHHHHHHHHHHHHHHHHHH-------HHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhh Confidence 56666777777666665554 4445556667777888888888888887766554443322211110 Q ss_pred ccccccccchh----hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh-hhhhcccccCCccccchhHHhHH Q lcl|Aclame:pro 74 NIGGKEVTQEE----KTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVE-PQKDGIKKENAKPVSSEEILYTP 148 (394) Q Consensus 74 ~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~lvP~~~~~~I 148 (394) ........... ......+..+++.. ......+ ....... ....+.+++.|+++||+++.+.| T Consensus 77 ~~~~~~~~~~~~~~~~~~~~a~~~~~~~~----------~~~~~~~---~~~~~~~~~a~~~~~~~~gg~lvP~~~~~~i 143 (397) T protein:vir:12 77 PEGQRSQGQGNEERQQQYSKAFLKGLRGK----------RLTDEER---DLLDSPEFRAMSGINDEDGGILIPEDIGRQI 143 (397) T ss_pred hcccccccchhhHHHHHHHHHHHHHHhcc----------CCcHHHH---HHHhhhhhhhccccccccCcccCchhHHHHH Confidence 00000000000 00111111111110 0000000 0111111 12235566778899999999999 Q ss_pred HHHHHhhhhhhheeeeEeecCCceeEEEEecCC-CcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHH Q lcl|Aclame:pro 149 AREVKTVVDLKPFTTVYQAKKASGKYPVLQRAT-TKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADV 227 (394) Q Consensus 149 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~ 227 (394) ++.+++.++|+++|++++++++++.+++....+ +.+.|++|++..++.+.++|++|+++++++++++++|+|+++|+.+ T Consensus 144 i~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~is~e~l~ds~~ 223 (397) T protein:vir:12 144 HEFKRQFEPLEQYVTVEPVTTRSGTRLLEKNADMVPFSPVEELGNLPEIDQPRFTKVSYSIIDYGGIMTLSNSMLNDSDQ 223 (397) T ss_pred HHhhhhhhhHHhhcceeeccCCceeEEEEEecCCcceeeecccccccccccccceeEEeeheeeEeeehhhHHHHhhchH Confidence 999999999999999999998888888776544 4456667777777667799999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccccccccccccHHHHHHHHHhhhhhhc--ccEEEEcHHHHHHHHhhhccCCceee Q lcl|Aclame:pro 228 DLVGIVSESISQIKVNTTNDAIAKVLKSFTTKTVKNLDEIKALLNGGFDPAY--NVSLIVSQSFYQTLDTLKDGNGRYLL 305 (394) Q Consensus 228 ~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~~~~~~~i~~~~~~~~~~~~--~a~~vm~~~~~~~l~~lkd~~G~~l~ 305 (394) +|++||.+.|+++++++++.+|++|+|++.+.+..+++++.++++..+.+++ +++|+|||++|.+|++|+|++|+|+| T Consensus 224 ~l~~~i~~~l~~~~~~~~d~~il~G~g~~~~~g~~~~~~i~~~~~~~l~~~~~~~a~~~~n~~~~~~L~~lkd~~G~~l~ 303 (397) T protein:vir:12 224 AIMTYVAKWFAKKSVVTRNNLILAAIASLKKVDIDGLDGIKKALNVTLDPMVAPGSIVLTNQDGYDWLDTLKDGTGRYLL 303 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccccccccccccHHHHHHHHhhccchhhhCCCEEEEcHHHHHHHHHhhccCCceee Confidence 9999999999999999999999999999999999999999999886666654 68999999999999999999999999 Q ss_pred cccccCCCcccccccceEEecCcc----cccCceEEEeccccEEEEeecceEEEEeeccc--c---cceEEEEEEeccEE Q lcl|Aclame:pro 306 QDDITAVSGKVLLGKPVFVLSDEV----LGANKAFIGDFKRGVLFADRKDLGLRWADNEI--Y---GQYLQAVLRFGVSK 376 (394) Q Consensus 306 ~~~~~~~~~~~l~G~pV~~~~~~~----~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--~---~~~~r~~~r~d~~v 376 (394) +|++.++.+++|+|+||+++++.. .+...++||||+++|++++|++++|+++++.+ | ...+|+++|+|+++ T Consensus 304 ~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~ 383 (397) T protein:vir:12 304 QPDPTNPTKKLLDGRPVVPFTNRVLKTQKGKAPLIIGNLKEAIVLFDREQQSIASTDTGAGAFETNSTKVRGIEREDVRK 383 (397) T ss_pred cccccCCCCccccceeeEEecccccccCCCccEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEE Confidence 999999999999999999876532 34556899999998999999999999877643 3 34689999999999 Q ss_pred ecccceEEEEecCc Q lcl|Aclame:pro 377 VDDKAGYYVTFTPE 390 (394) Q Consensus 377 ~~~~af~~l~~~~~ 390 (394) ++|+||++++++.. T Consensus 384 ~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 384 WDEDAVVFGQITVE 397 (397) T ss_pred ecccceEEEEEeeC Confidence 99999999999999 No 16 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=100.00 E-value=5.2e-66 Score=378.51 Aligned_cols=372 Identities=23% Similarity=0.333 Sum_probs=274.9 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Q lcl|Aclame:pro 2 FEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVT 81 (394) Q Consensus 2 l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 81 (394) ..++|+||++++++++++ ++.++++++.++++++.+|++.|+++++..++..+...+.... .... ..... T Consensus 1 M~k~l~el~~~~~~~~~e-------~~~~~~~~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~-~~~~--~~~~~ 70 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEE-------VRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNN-GREV--ETRNV 70 (392) T ss_pred CcHHHHHHHHHHHHHHHH-------HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-cccc--cccCc Confidence 234456666655555544 4545556667788888889988888887655433322222111 1111 11111 Q ss_pred chhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhhe Q lcl|Aclame:pro 82 QEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPF 161 (394) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~ 161 (394) ....+....+..+++...... ..................+++.|++++|+++.+.|++.+++.++|+++ T Consensus 71 ~~~~~~~~~~~~~l~~~~~~~-----------~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~ 139 (392) T protein:vir:10 71 DGEMEYRDVFMKALRNKPLNA-----------EEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQY 139 (392) T ss_pred cchHHHHHHHHHHHhcccccH-----------HHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhh Confidence 111222222322222111000 000011111111122334556788899999999999999999999999 Q ss_pred eeeEeecCCceeEEEEecCCC-cccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHH Q lcl|Aclame:pro 162 TTVYQAKKASGKYPVLQRATT-KMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQI 240 (394) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~ 240 (394) |++++++++++++++....+. .+.|++|++..++.+.++|++|++++++++++++||+|+++||.++|.+||.+.|+++ T Consensus 140 ~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ 219 (392) T protein:vir:10 140 VTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKK 219 (392) T ss_pred ceeeeccCCceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHH Confidence 999999988888887766554 4556677777766667899999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhccccccccccccHHHHHHHHHhhhhhhc--ccEEEEcHHHHHHHHhhhccCCceeecccccCCCccccc Q lcl|Aclame:pro 241 KVNTTNDAIAKVLKSFTTKTVKNLDEIKALLNGGFDPAY--NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLL 318 (394) Q Consensus 241 ~~~~~~~a~~~g~~~~~~~~~~~~~~i~~~~~~~~~~~~--~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~ 318 (394) ++.+++.+|++|+|++++.+..+++++++++...+.+.+ +++|+|||++|..|++|||++|+|||+++++.+.+++|+ T Consensus 220 i~~~~d~~~~~g~g~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tll 299 (392) T protein:vir:10 220 SKVTRNVLILGVIEKLTKQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFA 299 (392) T ss_pred HHHHHHHHHhhccccccccCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCcccccc Confidence 999999999999999999999999999999876666654 689999999999999999999999999999999999999 Q ss_pred ccceEEec-Cc-------ccccCceEEEeccccEEEEeecceEEEEeecc--cc---cceEEEEEEeccEEecccceEEE Q lcl|Aclame:pro 319 GKPVFVLS-DE-------VLGANKAFIGDFKRGVLFADRKDLGLRWADNE--IY---GQYLQAVLRFGVSKVDDKAGYYV 385 (394) Q Consensus 319 G~pV~~~~-~~-------~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~--~~---~~~~r~~~r~d~~v~~~~af~~l 385 (394) |+|++++. +. ..+...++||||+++|++++|.+++++++++. .| ...+|+++|+|++|.+|+||+++ T Consensus 300 G~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l 379 (392) T protein:vir:10 300 GTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYG 379 (392) T ss_pred CcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEE Confidence 98766543 22 23455689999999999999999999998753 34 34689999999999999999999 Q ss_pred EecCcc---CCC Q lcl|Aclame:pro 386 TFTPEP---LPL 394 (394) Q Consensus 386 ~~~~~~---~~~ 394 (394) ++++++ +|- T Consensus 380 ~~~~~a~~~~~~ 391 (392) T protein:vir:10 380 EIDLSAPVEQPQ 391 (392) T ss_pred EecccccccCCC Confidence 997655 555 No 17 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=100.00 E-value=5.2e-66 Score=378.51 Aligned_cols=372 Identities=23% Similarity=0.333 Sum_probs=274.9 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Q lcl|Aclame:pro 2 FEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVT 81 (394) Q Consensus 2 l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 81 (394) ..++|+||++++++++++ ++.++++++.++++++.+|++.|+++++..++..+...+.... .... ..... T Consensus 1 M~k~l~el~~~~~~~~~e-------~~~~~~~~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~-~~~~--~~~~~ 70 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEE-------VRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNN-GREV--ETRNV 70 (392) T ss_pred CcHHHHHHHHHHHHHHHH-------HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-cccc--cccCc Confidence 234456666655555544 4545556667788888889988888887655433322222111 1111 11111 Q ss_pred chhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhhe Q lcl|Aclame:pro 82 QEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPF 161 (394) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~ 161 (394) ....+....+..+++...... ..................+++.|++++|+++.+.|++.+++.++|+++ T Consensus 71 ~~~~~~~~~~~~~l~~~~~~~-----------~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~ 139 (392) T protein:vir:10 71 DGEMEYRDVFMKALRNKPLNA-----------EEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQY 139 (392) T ss_pred cchHHHHHHHHHHHhcccccH-----------HHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhh Confidence 111222222322222111000 000011111111122334556788899999999999999999999999 Q ss_pred eeeEeecCCceeEEEEecCCC-cccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHH Q lcl|Aclame:pro 162 TTVYQAKKASGKYPVLQRATT-KMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQI 240 (394) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~ 240 (394) |++++++++++++++....+. .+.|++|++..++.+.++|++|++++++++++++||+|+++||.++|.+||.+.|+++ T Consensus 140 ~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ 219 (392) T protein:vir:10 140 VTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKK 219 (392) T ss_pred ceeeeccCCceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHH Confidence 999999988888887766554 4556677777766667899999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhccccccccccccHHHHHHHHHhhhhhhc--ccEEEEcHHHHHHHHhhhccCCceeecccccCCCccccc Q lcl|Aclame:pro 241 KVNTTNDAIAKVLKSFTTKTVKNLDEIKALLNGGFDPAY--NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLL 318 (394) Q Consensus 241 ~~~~~~~a~~~g~~~~~~~~~~~~~~i~~~~~~~~~~~~--~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~ 318 (394) ++.+++.+|++|+|++++.+..+++++++++...+.+.+ +++|+|||++|..|++|||++|+|||+++++.+.+++|+ T Consensus 220 i~~~~d~~~~~g~g~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tll 299 (392) T protein:vir:10 220 SKVTRNVLILGVIEKLTKQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFA 299 (392) T ss_pred HHHHHHHHHhhccccccccCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCcccccc Confidence 999999999999999999999999999999876666654 689999999999999999999999999999999999999 Q ss_pred ccceEEec-Cc-------ccccCceEEEeccccEEEEeecceEEEEeecc--cc---cceEEEEEEeccEEecccceEEE Q lcl|Aclame:pro 319 GKPVFVLS-DE-------VLGANKAFIGDFKRGVLFADRKDLGLRWADNE--IY---GQYLQAVLRFGVSKVDDKAGYYV 385 (394) Q Consensus 319 G~pV~~~~-~~-------~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~--~~---~~~~r~~~r~d~~v~~~~af~~l 385 (394) |+|++++. +. ..+...++||||+++|++++|.+++++++++. .| ...+|+++|+|++|.+|+||+++ T Consensus 300 G~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l 379 (392) T protein:vir:10 300 GTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYG 379 (392) T ss_pred CcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEE Confidence 98766543 22 23455689999999999999999999998753 34 34689999999999999999999 Q ss_pred EecCcc---CCC Q lcl|Aclame:pro 386 TFTPEP---LPL 394 (394) Q Consensus 386 ~~~~~~---~~~ 394 (394) ++++++ +|- T Consensus 380 ~~~~~a~~~~~~ 391 (392) T protein:vir:10 380 EIDLSAPVEQPQ 391 (392) T ss_pred EecccccccCCC Confidence 997655 555 No 18 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=100.00 E-value=5.2e-66 Score=378.51 Aligned_cols=372 Identities=23% Similarity=0.333 Sum_probs=274.9 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Q lcl|Aclame:pro 2 FEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVT 81 (394) Q Consensus 2 l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 81 (394) ..++|+||++++++++++ ++.++++++.++++++.+|++.|+++++..++..+...+.... .... ..... T Consensus 1 M~k~l~el~~~~~~~~~e-------~~~~~~~~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~-~~~~--~~~~~ 70 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEE-------VRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNN-GREV--ETRNV 70 (392) T ss_pred CcHHHHHHHHHHHHHHHH-------HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-cccc--cccCc Confidence 234456666655555544 4545556667788888889988888887655433322222111 1111 11111 Q ss_pred chhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhhe Q lcl|Aclame:pro 82 QEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPF 161 (394) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~ 161 (394) ....+....+..+++...... ..................+++.|++++|+++.+.|++.+++.++|+++ T Consensus 71 ~~~~~~~~~~~~~l~~~~~~~-----------~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~ 139 (392) T protein:vir:10 71 DGEMEYRDVFMKALRNKPLNA-----------EEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQY 139 (392) T ss_pred cchHHHHHHHHHHHhcccccH-----------HHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhh Confidence 111222222322222111000 000011111111122334556788899999999999999999999999 Q ss_pred eeeEeecCCceeEEEEecCCC-cccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHH Q lcl|Aclame:pro 162 TTVYQAKKASGKYPVLQRATT-KMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQI 240 (394) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~ 240 (394) |++++++++++++++....+. .+.|++|++..++.+.++|++|++++++++++++||+|+++||.++|.+||.+.|+++ T Consensus 140 ~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ 219 (392) T protein:vir:10 140 VTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKK 219 (392) T ss_pred ceeeeccCCceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHH Confidence 999999988888887766554 4556677777766667899999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhccccccccccccHHHHHHHHHhhhhhhc--ccEEEEcHHHHHHHHhhhccCCceeecccccCCCccccc Q lcl|Aclame:pro 241 KVNTTNDAIAKVLKSFTTKTVKNLDEIKALLNGGFDPAY--NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLL 318 (394) Q Consensus 241 ~~~~~~~a~~~g~~~~~~~~~~~~~~i~~~~~~~~~~~~--~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~ 318 (394) ++.+++.+|++|+|++++.+..+++++++++...+.+.+ +++|+|||++|..|++|||++|+|||+++++.+.+++|+ T Consensus 220 i~~~~d~~~~~g~g~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tll 299 (392) T protein:vir:10 220 SKVTRNVLILGVIEKLTKQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFA 299 (392) T ss_pred HHHHHHHHHhhccccccccCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCcccccc Confidence 999999999999999999999999999999876666654 689999999999999999999999999999999999999 Q ss_pred ccceEEec-Cc-------ccccCceEEEeccccEEEEeecceEEEEeecc--cc---cceEEEEEEeccEEecccceEEE Q lcl|Aclame:pro 319 GKPVFVLS-DE-------VLGANKAFIGDFKRGVLFADRKDLGLRWADNE--IY---GQYLQAVLRFGVSKVDDKAGYYV 385 (394) Q Consensus 319 G~pV~~~~-~~-------~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~--~~---~~~~r~~~r~d~~v~~~~af~~l 385 (394) |+|++++. +. ..+...++||||+++|++++|.+++++++++. .| ...+|+++|+|++|.+|+||+++ T Consensus 300 G~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l 379 (392) T protein:vir:10 300 GTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYG 379 (392) T ss_pred CcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEE Confidence 98766543 22 23455689999999999999999999998753 34 34689999999999999999999 Q ss_pred EecCcc---CCC Q lcl|Aclame:pro 386 TFTPEP---LPL 394 (394) Q Consensus 386 ~~~~~~---~~~ 394 (394) ++++++ +|- T Consensus 380 ~~~~~a~~~~~~ 391 (392) T protein:vir:10 380 EIDLSAPVEQPQ 391 (392) T ss_pred EecccccccCCC Confidence 997655 555 No 19 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=100.00 E-value=5.2e-66 Score=378.51 Aligned_cols=372 Identities=23% Similarity=0.333 Sum_probs=274.9 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc Q lcl|Aclame:pro 2 FEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVT 81 (394) Q Consensus 2 l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~ 81 (394) ..++|+||++++++++++ ++.++++++.++++++.+|++.|+++++..++..+...+.... .... ..... T Consensus 1 M~k~l~el~~~~~~~~~e-------~~~~~~~~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~-~~~~--~~~~~ 70 (392) T protein:vir:10 1 MSKELRELLAKLEGKKEE-------VRSLMGEDKVAEAEQMMEEVRSLQKKIDLQRSLDEAETEERNN-GREV--ETRNV 70 (392) T ss_pred CcHHHHHHHHHHHHHHHH-------HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc-cccc--cccCc Confidence 234456666655555544 4545556667788888889988888887655433322222111 1111 11111 Q ss_pred chhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhhe Q lcl|Aclame:pro 82 QEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPF 161 (394) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~ 161 (394) ....+....+..+++...... ..................+++.|++++|+++.+.|++.+++.++|+++ T Consensus 71 ~~~~~~~~~~~~~l~~~~~~~-----------~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~ 139 (392) T protein:vir:10 71 DGEMEYRDVFMKALRNKPLNA-----------EEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELARSFDALEQY 139 (392) T ss_pred cchHHHHHHHHHHHhcccccH-----------HHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHHhhhhhhhh Confidence 111222222322222111000 000011111111122334556788899999999999999999999999 Q ss_pred eeeEeecCCceeEEEEecCCC-cccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHH Q lcl|Aclame:pro 162 TTVYQAKKASGKYPVLQRATT-KMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQI 240 (394) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~ 240 (394) |++++++++++++++....+. .+.|++|++..++.+.++|++|++++++++++++||+|+++||.++|.+||.+.|+++ T Consensus 140 ~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ 219 (392) T protein:vir:10 140 VTVEPVRTRSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILKYVTKWLGKK 219 (392) T ss_pred ceeeeccCCceeEEEEeecCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHHHHHHHHHHH Confidence 999999988888887766554 4556677777766667899999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhccccccccccccHHHHHHHHHhhhhhhc--ccEEEEcHHHHHHHHhhhccCCceeecccccCCCccccc Q lcl|Aclame:pro 241 KVNTTNDAIAKVLKSFTTKTVKNLDEIKALLNGGFDPAY--NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLL 318 (394) Q Consensus 241 ~~~~~~~a~~~g~~~~~~~~~~~~~~i~~~~~~~~~~~~--~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~ 318 (394) ++.+++.+|++|+|++++.+..+++++++++...+.+.+ +++|+|||++|..|++|||++|+|||+++++.+.+++|+ T Consensus 220 i~~~~d~~~~~g~g~~~~~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tll 299 (392) T protein:vir:10 220 SKVTRNVLILGVIEKLTKQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFA 299 (392) T ss_pred HHHHHHHHHhhccccccccCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCcccccc Confidence 999999999999999999999999999999876666654 689999999999999999999999999999999999999 Q ss_pred ccceEEec-Cc-------ccccCceEEEeccccEEEEeecceEEEEeecc--cc---cceEEEEEEeccEEecccceEEE Q lcl|Aclame:pro 319 GKPVFVLS-DE-------VLGANKAFIGDFKRGVLFADRKDLGLRWADNE--IY---GQYLQAVLRFGVSKVDDKAGYYV 385 (394) Q Consensus 319 G~pV~~~~-~~-------~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~--~~---~~~~r~~~r~d~~v~~~~af~~l 385 (394) |+|++++. +. ..+...++||||+++|++++|.+++++++++. .| ...+|+++|+|++|.+|+||+++ T Consensus 300 G~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l 379 (392) T protein:vir:10 300 GTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYG 379 (392) T ss_pred CcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEE Confidence 98766543 22 23455689999999999999999999998753 34 34689999999999999999999 Q ss_pred EecCcc---CCC Q lcl|Aclame:pro 386 TFTPEP---LPL 394 (394) Q Consensus 386 ~~~~~~---~~~ 394 (394) ++++++ +|- T Consensus 380 ~~~~~a~~~~~~ 391 (392) T protein:vir:10 380 EIDLSAPVEQPQ 391 (392) T ss_pred EecccccccCCC Confidence 997655 555 No 20 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=100.00 E-value=7e-66 Score=377.78 Aligned_cols=353 Identities=24% Similarity=0.340 Sum_probs=276.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEV 80 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) |-+ .|++|.+++ ..+.++++.+.++++.++++++++|++.++++++.+++..+...+....... .... T Consensus 1 M~k-~l~~l~e~~-------~~~~~e~~~~~~~~~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~----~~~~ 68 (371) T protein:vir:81 1 MPK-ELRELLEQI-------NNKKEEARKLLAENKIEEAKKLKEEIVALQEKFDVAKELYEEQKQTIEDKEP----LKPT 68 (371) T ss_pred CcH-HHHHHHHHH-------HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc----cccc Confidence 553 344444444 4444445555556666778888888888888888776655544332221111 1111 Q ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhh Q lcl|Aclame:pro 81 TQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKP 160 (394) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~ 160 (394) ..........+..+++... .......++..|++++|+++++.|++.+++.++|++ T Consensus 69 ~~~~~~~~~~~~~~l~~~~-------------------------~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~ 123 (371) T protein:vir:81 69 VQVKENEVEAFVNHIRTRF-------------------------RNAMSEGSNQDGGYTVPQDIQTRINELRESKDALQN 123 (371) T ss_pred hhhHHHHHHHHHHHHHHHH-------------------------HHhhccCCCccCceeecHhHHHHHHHHHHhhhhhhh Confidence 1111222222222222110 001123455668899999999999999999999999 Q ss_pred eeeeEeecCCceeEEEEecCCC-cccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHH Q lcl|Aclame:pro 161 FTTVYQAKKASGKYPVLQRATT-KMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQ 239 (394) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~-~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~ 239 (394) ++++++++++++.+++....+. .+.|++|++..++.++++|+++++++++++++++||+|+++|+.++|++||.+.|++ T Consensus 124 ~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ 203 (371) T protein:vir:81 124 LITVEPVTTLSGSRVFKKRSQQTGFVEVAEGAAIGEKATPQFTLLQYQVKKYAGFFRVTNELLNDSTEAIVNTLVRWIGD 203 (371) T ss_pred hceeeeccCCceeEEEEeecCCcceeeeccccccccccccceeeEEeeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHH Confidence 9999999988888888776654 455666666677678899999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhccccccccccccHHHHHHHHHhhhhhhc--ccEEEEcHHHHHHHHhhhccCCceeecccccCCCcccc Q lcl|Aclame:pro 240 IKVNTTNDAIAKVLKSFTTKTVKNLDEIKALLNGGFDPAY--NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVL 317 (394) Q Consensus 240 ~~~~~~~~a~~~g~~~~~~~~~~~~~~i~~~~~~~~~~~~--~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l 317 (394) +++.+++.++++|+|++++.+..+++++..++...+.+.+ +++|+|||++|..|++|+|++|+|||++++.++.+++| T Consensus 204 a~~~~~~~~i~~g~g~~~~~~~~~~~~i~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~l 283 (371) T protein:vir:81 204 ESRVTRNGLIINVLNTKAKTAIADLDGLKQIINVQLDPVFRSTSSVIVNQDAFNWLDTLKDQNGQYLLQPSISSPTGRQL 283 (371) T ss_pred HHHHHHHHHHHhhcccccccccccHHHHHHHHHhhcchhhhcCCEEEEcHHHHHHHHHhhccCCCeeeecccCCCCCcee Confidence 9999999999999999999999999999998876666654 68999999999999999999999999999999999999 Q ss_pred cccceEEecCccc----------ccCceEEEeccccEEEEeecceEEEEeeccc--c---cceEEEEEEeccEEecccce Q lcl|Aclame:pro 318 LGKPVFVLSDEVL----------GANKAFIGDFKRGVLFADRKDLGLRWADNEI--Y---GQYLQAVLRFGVSKVDDKAG 382 (394) Q Consensus 318 ~G~pV~~~~~~~~----------~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--~---~~~~r~~~r~d~~v~~~~af 382 (394) +|+||+++++++. +...++||||+++|++++|.+++|+++++.. | ...+|+++|+|+++.+|+|| T Consensus 284 ~G~pV~~~~~~~~~~~~~~~~~~~~~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~v~~~~~~r~d~~~~~~~a~ 363 (371) T protein:vir:81 284 LGLPVVIVSNKVLANRVDGGTGAQFAPIIVGDLKEAVVMFDRQRTEIMSSNVAMDAFETDATLWRAIERMDVKMRDDEAF 363 (371) T ss_pred cceeEEEecccccCccccccccCCcceEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccce Confidence 9999999887653 4457899999999999999999999987642 3 45689999999999999999 Q ss_pred EEEEecCc Q lcl|Aclame:pro 383 YYVTFTPE 390 (394) Q Consensus 383 ~~l~~~~~ 390 (394) ++++++++ T Consensus 364 ~~~~~~~A 371 (371) T protein:vir:81 364 VFGEVQLA 371 (371) T ss_pred EEEEEecC Confidence 99999999 No 21 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=100.00 E-value=4.7e-65 Score=373.23 Aligned_cols=370 Identities=17% Similarity=0.098 Sum_probs=280.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc Q lcl|Aclame:pro 3 EEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVTQ 82 (394) Q Consensus 3 ~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 82 (394) +..+++|++.++++.+.+++++++.+..+++. .++..++..+++.++++++++++.....+........... ..... T Consensus 1 l~~~k~l~~~i~e~~~~~~~~k~~~~~~~~~~-e~~~~~l~~~~e~~~~~~~~~e~~~~~~~~~~~~~~~~~~--~~~~~ 77 (407) T protein:vir:48 1 MADVKDVEQVAQELQRKFDDFKEKNDKRIDAI-EQEKGKLAGEVETLNGKLAELENLKSDLEAELAEVKRPAG--GTQNK 77 (407) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc--ccccc Confidence 56789999999998888888777665544433 2455667777787777777777666655443322111111 11111 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhh-hhhcccccCCccccchhHHhHHHHHHHhhhhhhhe Q lcl|Aclame:pro 83 EEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEP-QKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPF 161 (394) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~ 161 (394) ...+..+++..+++..... .....+. .....+...||++||+++++.|++.++..++|+++ T Consensus 78 ~~~e~~~a~~~~l~~g~~~------------------~~~~~e~~a~~~~t~~~gG~~iP~~~~~~I~~~~~~~~~l~~~ 139 (407) T protein:vir:48 78 VASEHKEAFIGFMRKGRED------------------GLRELERKALQVGNDEDGGYAIPEELDRTILTLLKDEVVMRQE 139 (407) T ss_pred hhhHHHHHHHHHHhccchh------------------hhhHHHHHhhhcccCCCCcccccHhHHHHHHHHHHhhhhhhhh Confidence 2222333444443321100 0000111 12234556678999999999999999999999999 Q ss_pred eeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 162 TTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIK 241 (394) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~ 241 (394) |+++++.++.+.+|+.. ++..+.|+.|++..++.+.++|+++++++++++++++||+|+++|+.++|++||.++|++++ T Consensus 140 ~~~~~~~~~~~~~~~~~-~~~~a~~v~E~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~i 218 (407) T protein:vir:48 140 ATVITLGGSDYKKLVNL-GGTTSGWVGETDARPETATSKLGLIEPFMGEIYGNPQATQKMLDDAFFNVEDWINSELALEF 218 (407) T ss_pred ceeeecCCCceEEEEec-CCcceeeecccccccccccccceeEEeeeeeeEeehhhHHHHHhcchHHHHHHHHHHHHHHH Confidence 99999999888888765 34556677777777766778999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhhcccccccccc----------------------------ccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHH Q lcl|Aclame:pro 242 VNTTNDAIAKVLKSFTTKTV----------------------------KNLDEIKALLNGGFDPAY-NVSLIVSQSFYQT 292 (394) Q Consensus 242 ~~~~~~a~~~g~~~~~~~~~----------------------------~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~ 292 (394) +.+++.++++|+|++.|.+. .+++++++++..+...++ +++|+||+++|.. T Consensus 219 ~~~~~~a~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a~~v~n~~~~~~ 298 (407) T protein:vir:48 219 AEQEEIAFTSGDGSKKPKGFLAYESTDEDDKTRAFGKLQHIASGAASGVTADAIIKLIYTLRKAHRSGAKFMMNNSSLFA 298 (407) T ss_pred HHHHHhhhhccCCCCccceeeecccccccccccccccccccccccccccChHHHHHHHHhhchhhhcCCEEEEcHHHHHH Confidence 99999999999988655322 247889998887665555 6899999999999 Q ss_pred HHhhhccCCceeecccccCCCcccccccceEEecCcc---cccCceEEEeccccEEEEeecceEEEEeecc-cccceEEE Q lcl|Aclame:pro 293 LDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEV---LGANKAFIGDFKRGVLFADRKDLGLRWADNE-IYGQYLQA 368 (394) Q Consensus 293 l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~---~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-~~~~~~r~ 368 (394) |++|+|++|+|||+|+++.+.+++|+|+||+++++++ .+..+++||||+++|++++|.+++|..+++. .....+|+ T Consensus 299 L~~lkD~~Gr~l~~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~i~~d~~~~~~~~~~~~ 378 (407) T protein:vir:48 299 IRLLKDNDGNYLWRPGIELGQPSSLAGYGIVENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRDPYTNKPFVGFYT 378 (407) T ss_pred HHHhhccCCceeeccCcCCCCCceecceeeEEecCcCCccCCccEEEEEeccccEEEEEeeceEEEeeccccCCcEEEEE Confidence 9999999999999999999999999999999987654 3456688999999899999999888765442 23346899 Q ss_pred EEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 369 VLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 369 ~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) ++|+|++|++|+||++++++++++-- T Consensus 379 ~~r~d~~v~~~~a~~~l~~~aa~~~~ 404 (407) T protein:vir:48 379 TKRTGGMLVDSQAIKLMKIGAATRQK 404 (407) T ss_pred EEEeccEEecccceEEEEeeccCCCC Confidence 99999999999999999999988544 No 22 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=100.00 E-value=9.7e-65 Score=371.53 Aligned_cols=378 Identities=21% Similarity=0.275 Sum_probs=285.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDD--LEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGK 78 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~--~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~ 78 (394) =.+..|+||++++.++.++++++.++++...++++ .+++++++++++.+.++++.++++++..+............ . T Consensus 2 ~~~m~i~el~~~~~~~~~~~~~~~~e~~~~~~~~~~~~e~i~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 80 (408) T protein:vir:74 2 GVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEK-G 80 (408) T ss_pred ChhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-c Confidence 12226789999999999999998888877655443 45667778888888888888777776655433222111111 1 Q ss_pred cccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhh-hhcccccCCccccchhHHhHHHHHHHhhhh Q lcl|Aclame:pro 79 EVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQ-KDGIKKENAKPVSSEEILYTPAREVKTVVD 157 (394) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~ 157 (394) ......... ...+.+........ ........+.. ....+...|+++||+++++.|++.+++.++ T Consensus 81 ~~~~~~~~~---~~~~~~~~~~~~~~------------~~~~~~~~~~~a~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~ 145 (408) T protein:vir:74 81 PLNKSENEL---KDKFVKDFVNMVRN------------PMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDS 145 (408) T ss_pred cccchhhhh---HHHHHHHHHHHHhc------------chhhhhhhhhhhhcccccCCCceeechhHhhHHHHHHhhhcc Confidence 111111111 11111111000000 00001111111 223456667899999999999999999999 Q ss_pred hhheeeeEeecCCceeEEEEecCC-Ccc-cccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHH Q lcl|Aclame:pro 158 LKPFTTVYQAKKASGKYPVLQRAT-TKM-VTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSE 235 (394) Q Consensus 158 l~~~~~~~~~~~~~~~~~~~~~~~-~~~-~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~ 235 (394) |+++|++++++++++.+++....+ +.. .|++|++..++.++++|++|++++++++++++||+|+++|+.++|++||.+ T Consensus 146 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~ 225 (408) T protein:vir:74 146 LQQYVRVESVSTSSGSRVYEKWTDVTPLKAMDEEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSS 225 (408) T ss_pred hhhhcceeeccCCcceEEEEeecCCcccccccccccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHH Confidence 999999999998888887776543 333 455566666666889999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhhcccccccc-ccccHHHHHHHHHhhhhhhc--ccEEEEcHHHHHHHHhhhccCCceeecccccCC Q lcl|Aclame:pro 236 SISQIKVNTTNDAIAKVLKSFTTK-TVKNLDEIKALLNGGFDPAY--NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAV 312 (394) Q Consensus 236 ~l~~~~~~~~~~a~~~g~~~~~~~-~~~~~~~i~~~~~~~~~~~~--~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~ 312 (394) .|+++++.+++.++++|+|++++. +..+++++.+++...+++.+ +++|+|||++|..|++|||++|+|||++++..+ T Consensus 226 ~l~~~~~~~~d~~il~G~G~~~~~~~~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~ 305 (408) T protein:vir:74 226 WIAKKVVVTRNQAIIAAMGTVPKKPTIANFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKP 305 (408) T ss_pred HHHHHHHHHHHHHHhhcccccccccccccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceEeccCcCCC Confidence 999999999999999999998775 56789999998876666654 689999999999999999999999999999999 Q ss_pred CcccccccceEEecCcc-----cccCceEEEeccccEEEEeecceEEEEeecc-----cccceEEEEEEeccEEecccce Q lcl|Aclame:pro 313 SGKVLLGKPVFVLSDEV-----LGANKAFIGDFKRGVLFADRKDLGLRWADNE-----IYGQYLQAVLRFGVSKVDDKAG 382 (394) Q Consensus 313 ~~~~l~G~pV~~~~~~~-----~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~-----~~~~~~r~~~r~d~~v~~~~af 382 (394) .+++|+|+||+++++.. .+...++||||+++|++++|+++++.++++. .+...+|+++|+|+++++|+|| T Consensus 306 ~~~~l~G~pV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~ 385 (408) T protein:vir:74 306 NSYLIKGKQVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEAL 385 (408) T ss_pred CCceecceeeEEecCcccccccCCcceEEEEehhccEEEEEecceEEEEeccccchhhcceeeEEEEEeeCcEEecccce Confidence 99999999999876533 3445689999999999999999999998753 2345689999999999999999 Q ss_pred EEEEecCccCCC Q lcl|Aclame:pro 383 YYVTFTPEPLPL 394 (394) Q Consensus 383 ~~l~~~~~~~~~ 394 (394) +++++++++++. T Consensus 386 ~~~~~~~~~~~~ 397 (408) T protein:vir:74 386 VAGSFTAIADQV 397 (408) T ss_pred EEEEeecccCCC Confidence 999998887665 No 23 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=100.00 E-value=1.2e-64 Score=371.07 Aligned_cols=375 Identities=20% Similarity=0.262 Sum_probs=293.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDD--LEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGK 78 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~--~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~ 78 (394) =++..|+||++++++++++++++.++++..+.+++ .+++.++.++++.+.++++.++++++..+.............. T Consensus 2 ~~~m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 81 (404) T protein:vir:39 2 GVKLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGP 81 (404) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc Confidence 35557899999999999999999998887665543 4567777888888888888888777766554333222111111 Q ss_pred cc---cchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhh-hhhcccccCCccccchhHHhHHHHHHHh Q lcl|Aclame:pro 79 EV---TQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEP-QKDGIKKENAKPVSSEEILYTPAREVKT 154 (394) Q Consensus 79 ~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~lvP~~~~~~I~~~~~~ 154 (394) .. ........+.+..+++.... .....+. .....+.+.|++++|+++++.|++.+++ T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~-------------------~~~~~e~~a~~~~t~~~gg~~iP~~~~~~ii~~~~~ 142 (404) T protein:vir:39 82 LNKSEYELKDKFVKEFVNMVRNPMA-------------------FLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQ 142 (404) T ss_pred cccchhhhHHHHHHHHHHHHhcchh-------------------hhhhhhhhhhhcccccCCceeccHHHHHHHHHHHHh Confidence 11 11111111222222211100 0000111 1233456677899999999999999999 Q ss_pred hhhhhheeeeEeecCCceeEEEEecCC--CcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHH Q lcl|Aclame:pro 155 VVDLKPFTTVYQAKKASGKYPVLQRAT--TKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGI 232 (394) Q Consensus 155 ~~~l~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~ 232 (394) .++|+++|++++++++.+.+|+....+ +.+.|+.|++..++.++++|+++++++++++++++||+|+++|+.++|++| T Consensus 143 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~ 222 (404) T protein:vir:39 143 YDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAW 222 (404) T ss_pred hhhHHhhcceeeccCCcceEEEEeecCCccceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHH Confidence 999999999999998888888776543 345667777787777889999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHhhcccccccc-ccccHHHHHHHHHhhhhhhc--ccEEEEcHHHHHHHHhhhccCCceeecccc Q lcl|Aclame:pro 233 VSESISQIKVNTTNDAIAKVLKSFTTK-TVKNLDEIKALLNGGFDPAY--NVSLIVSQSFYQTLDTLKDGNGRYLLQDDI 309 (394) Q Consensus 233 i~~~l~~~~~~~~~~a~~~g~~~~~~~-~~~~~~~i~~~~~~~~~~~~--~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~ 309 (394) |.+.|++.++.+++.++++|+|++++. +..+++++.+++.....+.+ +++|+|||++|..|++|+|++|+|||++++ T Consensus 223 i~~~l~~~~~~~~d~~il~g~g~~~~~~~~~~~~~i~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~ 302 (404) T protein:vir:39 223 LSSWIAKKVVVTRNQAIIAAMGTVPKKPTIAKFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALVKTAEGKYLLEPDP 302 (404) T ss_pred HHHHHHHHHHHHHHHHHHhcccccccccccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCceeeccCc Confidence 999999999999999999999998774 46678999999887776655 689999999999999999999999999999 Q ss_pred cCCCcccccccceEEecCccc-----ccCceEEEeccccEEEEeecceEEEEeecc--cc---cceEEEEEEeccEEecc Q lcl|Aclame:pro 310 TAVSGKVLLGKPVFVLSDEVL-----GANKAFIGDFKRGVLFADRKDLGLRWADNE--IY---GQYLQAVLRFGVSKVDD 379 (394) Q Consensus 310 ~~~~~~~l~G~pV~~~~~~~~-----~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~--~~---~~~~r~~~r~d~~v~~~ 379 (394) ..+.+++|+|+||+++++... +...++||||+++|++++|+++++.++++. .| ...+|+++|+|+.+.+| T Consensus 303 ~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~ 382 (404) T protein:vir:39 303 TKPNSYLIKGKKVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKTTDS 382 (404) T ss_pred CCCCcceecceeEEEecccccCccCCCccEEEEEeccccEEEEeecceEEEEeccchhhhhhceeeEEEEeeeccEEecc Confidence 999999999999999876443 334689999999999999999999998764 23 45689999999999999 Q ss_pred cceEEEEecCccCCC Q lcl|Aclame:pro 380 KAGYYVTFTPEPLPL 394 (394) Q Consensus 380 ~af~~l~~~~~~~~~ 394 (394) +||+++++++++.|- T Consensus 383 ~a~~~~~~~~~a~~~ 397 (404) T protein:vir:39 383 EALVAGSFTAIADQV 397 (404) T ss_pred cceEEEEeeccccCC Confidence 999999999998777 No 24 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=100.00 E-value=1.7e-64 Score=370.21 Aligned_cols=373 Identities=18% Similarity=0.201 Sum_probs=285.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhch--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|Aclame:pro 3 EEKIKEIKATIADLNNTIVTKTAQVKNALES--DDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEV 80 (394) Q Consensus 3 ~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~--e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) |..++||++.++++.++++++.++++....+ ...+++++++++++.+.++++.+++..+..+............ ... T Consensus 1 Mk~~~el~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~ 79 (397) T protein:vir:48 1 MKTSNELHDLWVAQGDKVENLNEKLNVAMLDDSVTAEELQAIKNERDTAKMKRDMFKEQYTEARANEVVNMSEEEK-KPL 79 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhcc-ccc Confidence 7899999999999999998888777654332 2346677778888888887777776665544333222111111 101 Q ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhh Q lcl|Aclame:pro 81 TQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKP 160 (394) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~ 160 (394) .. ........+.+........ ............+++.+++++|+++++.|++.+++.++|++ T Consensus 80 ~~---~~~~~~~~~~~~~~~~~~~---------------~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~ 141 (397) T protein:vir:48 80 TK---SEEEVKAGFVKDFKNLVRG---------------RYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVRQYDSLQE 141 (397) T ss_pred cc---hhhHHHHHHHHHHHHHHhh---------------hhhHHHHHhhccCCccccccccHHHHHHHHHHHHHHHHHHh Confidence 00 1111111111111111100 00011111233455668899999999999999999999999 Q ss_pred eeeeEeecCCceeEEEEecCC--CcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHH Q lcl|Aclame:pro 161 FTTVYQAKKASGKYPVLQRAT--TKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESIS 238 (394) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~--~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~ 238 (394) +|++++++++++++++....+ +.+.|+.|++..++.++++|++|++++++++++++||+|+++|+.++|++||.+.|+ T Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~v~~~l~ 221 (397) T protein:vir:48 142 YVNVENVTTLTGSRVYEKWADITGLAKLDDEAGSIGTNDDPKLYPIRYAIKRYAGISTVTNSLLADSAENILAWLSGWIA 221 (397) T ss_pred hhceeeccCCcceEEEEeecCCCcceeeeccccccccccccceeeEEeeheeeeeehhhHHHHHhhchHHHHHHHHHHHH Confidence 999999999888888776543 234555666666665678999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhccccccc-cccccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCceeecccccCCCccc Q lcl|Aclame:pro 239 QIKVNTTNDAIAKVLKSFTT-KTVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKV 316 (394) Q Consensus 239 ~~~~~~~~~a~~~g~~~~~~-~~~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~ 316 (394) ++++.+++.++++|+|++++ .+..+++++.+++..+...++ +++|+|||++|..|++|||++|+|||++++.++.+++ T Consensus 222 ~~~~~~~d~~il~G~g~~~~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~ 301 (397) T protein:vir:48 222 KKVVVTRNKAILEAIATLPTKPTLTKWDDIIDLQAKVDPAIKQTSFFLTNTSGFTALKKVKNAFGDYLMERDVKSPTGYS 301 (397) T ss_pred HHHHHHHHHHHhhcccccccccccccHHHHHHHHHHhhhhhcCCCEEEECHHHHHHHHHhhcCCCceeeccCcCCCCCce Confidence 99999999999999999877 456789999998877665544 6899999999999999999999999999999999999 Q ss_pred ccccceEEecCcc-----cccCceEEEeccccEEEEeecceEEEEeecc--cc---cceEEEEEEeccEEecccceEEEE Q lcl|Aclame:pro 317 LLGKPVFVLSDEV-----LGANKAFIGDFKRGVLFADRKDLGLRWADNE--IY---GQYLQAVLRFGVSKVDDKAGYYVT 386 (394) Q Consensus 317 l~G~pV~~~~~~~-----~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~--~~---~~~~r~~~r~d~~v~~~~af~~l~ 386 (394) |+|+||+++++.. .+..+++||||+++|+++++.+++++.+++. +| ...+|+++|+|+++++|+||++++ T Consensus 302 l~G~PV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~ 381 (397) T protein:vir:48 302 IDGFAVKEVADRWLANASSGAMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKIRVIDRFDVVATDTESFVPAS 381 (397) T ss_pred eccceeEEecccccCCcCCCceEEEEEeccceEEEEeecceEEEEeccchhhhhcCceeEEEEeeeccEEecccceEEEE Confidence 9999999876533 3556789999999899999999999987754 23 356899999999999999999999 Q ss_pred ecCccCCC Q lcl|Aclame:pro 387 FTPEPLPL 394 (394) Q Consensus 387 ~~~~~~~~ 394 (394) ++++++|- T Consensus 382 ~~~~~~~~ 389 (397) T protein:vir:48 382 FKAIADQK 389 (397) T ss_pred ecccccCC Confidence 99999888 No 25 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=100.00 E-value=2.7e-64 Score=369.12 Aligned_cols=367 Identities=17% Similarity=0.096 Sum_probs=274.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEV 80 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) |-++ +++|++.+++|+++.++++++.+...++. .++..++.++++.+++++++++..+...++............ . T Consensus 1 m~~~-lk~l~~~~~el~~~~~~~k~~~~~~~~~~-e~~~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~ 76 (401) T protein:vir:44 1 MAVD-IKDVEQVAQELQQKFDDFKAKNDKRVEAI-EQEKGKLAGQVETLNGKLSELENLKSDLEKELLELKRPARGA--Q 76 (401) T ss_pred CCcc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--c Confidence 6665 77888877788777777665554432222 234445667777777777777666665554433221111111 1 Q ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhh-hhhcccccCCccccchhHHhHHHHHHHhhhhhh Q lcl|Aclame:pro 81 TQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEP-QKDGIKKENAKPVSSEEILYTPAREVKTVVDLK 159 (394) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~ 159 (394) .....+..+.+..++++...... ...+. ....++.+.|+++||+++.+.|++.++..++|+ T Consensus 77 ~~~~~e~~~a~~~~lr~~~~~~~------------------~~~e~~a~~~~~~~~GG~~iP~~~~~~ii~~~~~~~~l~ 138 (401) T protein:vir:44 77 NKVAAEHKDAFVGFLRKGREDGL------------------RDLERKALQVGTDEDGGYAVPEELDRSILSLLKDEVVMR 138 (401) T ss_pred cchhHHHHHHHHHHHhhhhhhhh------------------HHHHHHHhhcCCCCCCceeccHhHHHHHHHHHHhhhhhh Confidence 11122333445554432211000 00011 122345567789999999999999999999999 Q ss_pred heeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHH Q lcl|Aclame:pro 160 PFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQ 239 (394) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~ 239 (394) ++|+++++.++.+.+|+.. .+..+.|+.|++..+..+.++|++|++++|++++++++|+|+++|+.++|++||.++|++ T Consensus 139 ~~~~~~~~~~~~~~~~~~~-~~~~a~wv~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~ 217 (401) T protein:vir:44 139 QEATVITVGGSDYKKLVNL-GGTASGWVGETDTRSQTATSRLGLIEPFMGEIYGNPQATQKMLDDAFFNVEAWINSELAT 217 (401) T ss_pred hhceeeecCCCceEEEEec-CCccceeeccccccCccccccceeeeeehhheeeehhhhHHHHhcchHHHHHHHHHHHHH Confidence 9999999998888888765 344556777777777667789999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhcccccccccc----------------------------ccHHHHHHHHHhhhhhhc-ccEEEEcHHHH Q lcl|Aclame:pro 240 IKVNTTNDAIAKVLKSFTTKTV----------------------------KNLDEIKALLNGGFDPAY-NVSLIVSQSFY 290 (394) Q Consensus 240 ~~~~~~~~a~~~g~~~~~~~~~----------------------------~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~ 290 (394) +++.+++.++++|+|++.|.+. .+++++++++..+...+. +++|+||+++| T Consensus 218 ai~~~~~~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~ 297 (401) T protein:vir:44 218 EFAEQEEIAFTTGDGTKKPKGFLAYESTEESDKARAFGKLQHIVSGEATAVTADAIIKLIYTLRKAHRTGAKFMMNNNSL 297 (401) T ss_pred HHHHHHHhhhhccCCCCccceeeccccccccccccccccccccccccccccCHHHHHHHHHhcchhhhcCCEEEEcHHHH Confidence 9999999999999988655332 247888888887665544 68999999999 Q ss_pred HHHHhhhccCCceeecccccCCCcccccccceEEecCcc---cccCceEEEeccccEEEEeecceEEEEeeccc-ccceE Q lcl|Aclame:pro 291 QTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEV---LGANKAFIGDFKRGVLFADRKDLGLRWADNEI-YGQYL 366 (394) Q Consensus 291 ~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~---~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~-~~~~~ 366 (394) ..|++|+|++|+|||+|+++.+.+++|+|+||+++++++ .+..+++||||+++|++++|.++++..++... ....+ T Consensus 298 ~~L~~lkd~~G~~l~~~~~~~g~~~~l~G~PVv~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~~~~~~~~~~~~v~~ 377 (401) T protein:vir:44 298 FAIRLLKDTEGNYLWRPGLELGQPSSLAGYGIAENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRDPYTNKPFVGF 377 (401) T ss_pred HHHHHhhccCCceeecCCcCCCCCceecceeeEEecCcCCccCCccEEEEeehhccEEEEEecceEEeeeccccCCcEEE Confidence 999999999999999999999999999999999987643 34556889999998999999998887654432 23458 Q ss_pred EEEEEeccEEecccceEEEEecCc Q lcl|Aclame:pro 367 QAVLRFGVSKVDDKAGYYVTFTPE 390 (394) Q Consensus 367 r~~~r~d~~v~~~~af~~l~~~~~ 390 (394) |+++|+|++|++|+||++|+++++ T Consensus 378 ~a~~r~d~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 378 YTTKRTGGMLVDSQAIKLLKIAAA 401 (401) T ss_pred EEEEEeccEEecccceEEEEeecC Confidence 999999999999999999999999 No 26 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=100.00 E-value=1.3e-64 Score=370.81 Aligned_cols=375 Identities=20% Similarity=0.268 Sum_probs=289.0 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc- Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKE- 79 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~- 79 (394) .++++|++|+++++++.++.+++.++++...++.+.++++++.+++++++++++.++++++..++..+........... T Consensus 2 n~~e~lkel~~~~~el~~~~~~~~~~~~~~~~e~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 81 (421) T protein:vir:13 2 NLFERLKELRAKKKELEEKRCGIVEEIRSLAKEKKEEEARSKALEREKIEARMEIIEEEIESVMTAIDEERKNTNFTGGR 81 (421) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Confidence 5688999999999999999999999999988888778888899999999999998887777766544332221111111 Q ss_pred --ccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhh Q lcl|Aclame:pro 80 --VTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVD 157 (394) Q Consensus 80 --~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~ 157 (394) .......... ..+.+..... ............+.+++.|+++||+++++.|++.+++.++ T Consensus 82 ~~~~~~~~~~~~--~~~~~~~~~~----------------~~~~~~~~~~ra~~t~~~gg~liP~~~~~~Ii~~~~~~~~ 143 (421) T protein:vir:13 82 VIINGDSKEEKR--SLQLSAMSKT----------------IRGIQLSEEERDIMSSTNNGAVIPQEFVNEFEKLKEGYPS 143 (421) T ss_pred cccccchhHHHH--HHHHHHHHHh----------------hhccchhHHHhhccccCCcceecchhhHHHHHHHHHhhhh Confidence 1111111100 0000000000 0000011112234566778899999999999999999999 Q ss_pred hhheeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHH Q lcl|Aclame:pro 158 LKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESI 237 (394) Q Consensus 158 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l 237 (394) |+++|++++++++++.+|+........+.|..++...+.++++|++|++++++++++++||+|+++|+.++|++||.++| T Consensus 144 l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~E~~~~~~s~~~f~~i~~~~~k~~~~v~iS~ell~ds~~~l~~~i~~~l 223 (421) T protein:vir:13 144 LKEHCHVIPVNRNAGKMPVRAGASVDKLANLAKDTELVKAMLKTQPMAYDIDDYGLLAPIDNSLLEDSEINFLEFVNEEF 223 (421) T ss_pred hhhhceeeeccCCceEEEEeecCCccceeeccccccccccccceeEEEeeeeeeEeehhhhHHHHhhhHHHHHHHHHHHH Confidence 99999999999999999998876555444444444445688999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhcc-ccccccccccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCceeecccccCCCcc Q lcl|Aclame:pro 238 SQIKVNTTNDAIAKVL-KSFTTKTVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGK 315 (394) Q Consensus 238 ~~~~~~~~~~a~~~g~-~~~~~~~~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~ 315 (394) ++++..+++.++++.. |..+..+..++++|.+++..+...++ +++|+||+++|.+|++|+|++|+|||++ +..+.++ T Consensus 224 a~~~~~~~~~~i~~~~~g~~~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~-~~~~~~~ 302 (421) T protein:vir:13 224 AEFAVNTENAEIVKQAKAVLAEETINDYAGLVKTINSLVPNARKRAIIVTNSDGRAYLDGLMDKQGRPLLKE-LSDGGDL 302 (421) T ss_pred HHHHHHHhhhhHhhhhhhccccccccchHHHHHHHHHhhhhhcCCCEEEEcHHHHHHHHHhhcCCCceeecC-cCCCCCc Confidence 9999999999988754 33345667789999999988776665 6899999999999999999999999975 6777788 Q ss_pred cccccceEEecCcccc---cCceEEEeccccEEEEeecceEEEEeecccccc---eEEEEEEeccEEecccceEEEEecC Q lcl|Aclame:pro 316 VLLGKPVFVLSDEVLG---ANKAFIGDFKRGVLFADRKDLGLRWADNEIYGQ---YLQAVLRFGVSKVDDKAGYYVTFTP 389 (394) Q Consensus 316 ~l~G~pV~~~~~~~~~---~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~---~~r~~~r~d~~v~~~~af~~l~~~~ 389 (394) +|||+||+++++++.+ ...++||||+++|++++|++++|+++++.+|.+ .+|+++|+|+++++|+||+.+.+.+ T Consensus 303 tl~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~ 382 (421) T protein:vir:13 303 VFKGRPVIELEESIFDVGDETKFIVSDFKTLIKFMDRKQYLIDQSKEAGYTKNETIARIIERFDVNSPLDKSSDAEKIRK 382 (421) T ss_pred eecceeeEEeccccccCCCceEEEEEeccccEEEEEecceEEEeecccccccCeeEEEEEeeecceeecchhhheeeecc Confidence 9999999998876543 456899999999999999999999999988776 5899999999999999876655553 Q ss_pred cc-----------CCC Q lcl|Aclame:pro 390 EP-----------LPL 394 (394) Q Consensus 390 ~~-----------~~~ 394 (394) .. |+- T Consensus 383 ~~a~v~~~~~~~~~~~ 398 (421) T protein:vir:13 383 FGVIVKLQEVLKSSPR 398 (421) T ss_pred cceeeccccccCCCCc Confidence 22 222 No 27 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=100.00 E-value=2.4e-64 Score=369.36 Aligned_cols=369 Identities=18% Similarity=0.207 Sum_probs=283.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhchh--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|Aclame:pro 3 EEKIKEIKATIADLNNTIVTKTAQVKNALESD--DLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEV 80 (394) Q Consensus 3 ~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e--~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) |.+++||+++++++.++++++.+++.....++ ..+++++++++++.+.++++.++++++..+........... .... T Consensus 1 Mk~~~eL~~~~~~~~~~~~~l~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ 79 (397) T protein:vir:49 1 MKTSNELHDLWIAQGDKVENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDLFKEQYTEARANEVANMSEEE-KKPL 79 (397) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc-cccc Confidence 77999999999999999988877766543332 34567777888888877777776666554443322111111 1111 Q ss_pred cch----hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhh Q lcl|Aclame:pro 81 TQE----EKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVV 156 (394) Q Consensus 81 ~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~ 156 (394) ... .....+.+..+++... ..........+.+.|++++|+++...|++.+++.+ T Consensus 80 ~~~~~~~~~~~~~~~~~~l~~~~----------------------~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~ 137 (397) T protein:vir:49 80 TKNEEEVKANFVKDFKNLVRGRY----------------------QNLLDSKTDGSGSDAGLTIPQDIRTAINTLVRQFD 137 (397) T ss_pred cchhhHHHHHHHHHHHHHhhcch----------------------hhHHHhhhccCCccCcceecHHHHHHHHHHHHhhh Confidence 110 1111111221111100 00111123455667789999999999999999999 Q ss_pred hhhheeeeEeecCCceeEEEEecCC--CcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHH Q lcl|Aclame:pro 157 DLKPFTTVYQAKKASGKYPVLQRAT--TKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVS 234 (394) Q Consensus 157 ~l~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~ 234 (394) +|+++|++++++++++++++....+ +.+.|++|++..++.+.++|++|++++++++++++||+++++|+.++|++||. T Consensus 138 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~ 217 (397) T protein:vir:49 138 SLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEGGQIGQNDDPKLSLIRYAIKRYAGISTVTNSLLADSAENILAWLS 217 (397) T ss_pred hHhhhcceeeccCCcceEEEEeeccCCcceeeeccccccccccccceeeeEeeeeeeEeehhhHHHHHhhhhHHHHHHHH Confidence 9999999999998888887765533 34556666676776566899999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHhhcccccccc-ccccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCceeecccccCC Q lcl|Aclame:pro 235 ESISQIKVNTTNDAIAKVLKSFTTK-TVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAV 312 (394) Q Consensus 235 ~~l~~~~~~~~~~a~~~g~~~~~~~-~~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~ 312 (394) +.|+++++.+++.+|++|+|++++. +..++|++.+++..+...++ +++|+|||++|..|++|+|++|+|||+|++.++ T Consensus 218 ~~l~~~~~~~~d~ail~G~g~~~~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~g~~l~~~~~~~g 297 (397) T protein:vir:49 218 GWIAKKVVVTRNKAILEAIGTLPNKPTLAKWDDIIDLQAKVDPAIKQTSLFLTNTSGFTALKKVKNAMGDYLMERDVKSP 297 (397) T ss_pred HHHHHHHHHHHHHHHHhccccccccccccCHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceeecccccCC Confidence 9999999999999999999998775 56789999998877665554 689999999999999999999999999999999 Q ss_pred CcccccccceEEecCcc-----cccCceEEEeccccEEEEeecceEEEEeecc--cc---cceEEEEEEeccEEecccce Q lcl|Aclame:pro 313 SGKVLLGKPVFVLSDEV-----LGANKAFIGDFKRGVLFADRKDLGLRWADNE--IY---GQYLQAVLRFGVSKVDDKAG 382 (394) Q Consensus 313 ~~~~l~G~pV~~~~~~~-----~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~--~~---~~~~r~~~r~d~~v~~~~af 382 (394) .+++|+|+||+++++.. .+..+++||||+++|+++++++++|+++++. .| ...+|+++|+|+++++|+|| T Consensus 298 ~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~ 377 (397) T protein:vir:49 298 TGYSIDGFVVKEISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQHLSLLSTNIGGGAFETDTTKVRVIDRFDVVSTDTEAF 377 (397) T ss_pred CCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecccEEEEeccccchhhcCeeeEEEEEeeccEEecccce Confidence 99999999999876433 3456789999999999999999999987753 23 44689999999999999999 Q ss_pred EEEEecCccCCC Q lcl|Aclame:pro 383 YYVTFTPEPLPL 394 (394) Q Consensus 383 ~~l~~~~~~~~~ 394 (394) +++++++++++- T Consensus 378 ~~~~~~~~~~~~ 389 (397) T protein:vir:49 378 VPASFKAIADQK 389 (397) T ss_pred EEEEeccccccc Confidence 999999999755 No 28 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=100.00 E-value=8.6e-64 Score=366.32 Aligned_cols=373 Identities=20% Similarity=0.240 Sum_probs=273.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEV 80 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) |- ++||++++.++.++++++.+++++...++..+......++++.++++++++++..+..+................ T Consensus 1 M~---~~eL~~~~~~~~~~~~~l~e~~~~~~~~~~~~~~~~~~ee~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (395) T protein:vir:38 1 MN---INQLKDAFDMAGQKVQDLEDKRAQFAIDLGNDASSHSVDDINKLNASLKNAKMAQELAKSAYEDARANLNAEPVN 77 (395) T ss_pred CC---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccc Confidence 64 466777777777777777776666544444444445556666666666666555443333221111100000000 Q ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhh Q lcl|Aclame:pro 81 TQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKP 160 (394) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~ 160 (394) .... ...... ...+.. .... ..... ........+.+.|++++|+++++.|++.+++.++|++ T Consensus 78 ~~~~-~~~~~~-~~~~~~---~~~~------------~~~~~-~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~ 139 (395) T protein:vir:38 78 KKPL-PVKDGK-PDAQAM---KNQF------------VKDFK-NLVTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLES 139 (395) T ss_pred cccc-chhhhh-HHHHHH---HHHH------------HHHHH-HHHhhccCccCCCceecchhHhhHHHHHHHhhcchhh Confidence 0000 000000 000000 0000 00000 0111233455668899999999999999999999999 Q ss_pred eeeeEeecCCceeEEEEecCC--CcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHH Q lcl|Aclame:pro 161 FTTVYQAKKASGKYPVLQRAT--TKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESIS 238 (394) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~--~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~ 238 (394) +|++++++++.+.+++....+ ..+.|+.|++..++.+.++|++|++++++++++++||+|+++|+.++|++||.++|+ T Consensus 140 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la 219 (395) T protein:vir:38 140 LANVENVTTSHGSRVYEKLADITPLKDLDDESALIGDNDDPELTVVKYLIHRYAGITTVTNTLLKDTVDNIIQWLVNWAA 219 (395) T ss_pred hcceeeccCCcceEEEEeeccCCccccccccccccccccccceeeEEeeeeeeEeehhhHHHHHhhhHHHHHHHHHHHHH Confidence 999999988888887765433 334556666676666678999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhccccccc-cccccHHHHHHHHHhhhhhhc--ccEEEEcHHHHHHHHhhhccCCceeecccccCCCcc Q lcl|Aclame:pro 239 QIKVNTTNDAIAKVLKSFTT-KTVKNLDEIKALLNGGFDPAY--NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGK 315 (394) Q Consensus 239 ~~~~~~~~~a~~~g~~~~~~-~~~~~~~~i~~~~~~~~~~~~--~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~ 315 (394) +.++.+++.+|++|+|++.+ .+..+++++.+++...+.+.+ +++|+|||++|..|++|+|++|+|||++++.++.++ T Consensus 220 ~~~~~~~~~~il~g~g~~~~~~~~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~ 299 (395) T protein:vir:38 220 KKDVVTRNAKILEVMGKAPKKPTISQFDNIKDLENNTLDPAIESTSSFITNQSGYNILSKVKDADGRYLMQPDVTSPDKY 299 (395) T ss_pred HHHHHHHHHHHhhcccccccccccccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCcc Confidence 99999999999999998776 466789999998876666654 689999999999999999999999999999999999 Q ss_pred cccccceEEecCcc----cccCceEEEeccccEEEEeecceEEEEeecc--cc---cceEEEEEEeccEEecccceEEEE Q lcl|Aclame:pro 316 VLLGKPVFVLSDEV----LGANKAFIGDFKRGVLFADRKDLGLRWADNE--IY---GQYLQAVLRFGVSKVDDKAGYYVT 386 (394) Q Consensus 316 ~l~G~pV~~~~~~~----~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~--~~---~~~~r~~~r~d~~v~~~~af~~l~ 386 (394) +|+|+||+++++.. .++..++||||+++|+++++++++|+++++. +| ...+|++.|+|+++.+|+||++++ T Consensus 300 ~l~G~pV~~~~~~~~~~~~~~~~i~~gd~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~ 379 (395) T protein:vir:38 300 LIDGKPVIRIADKWLPDVSGSHPLYFGDLKQGITLFDRQQMQIDTTNVGAGSFEHDTTKLRFIDRFDVQLIDDGAFAAAS 379 (395) T ss_pred eeccceeEEecccccCcCCCcceEEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEE Confidence 99999999987643 3455689999999899999999999998753 24 346899999999999999999999 Q ss_pred ecCccCCC Q lcl|Aclame:pro 387 FTPEPLPL 394 (394) Q Consensus 387 ~~~~~~~~ 394 (394) +++++|.- T Consensus 380 ~~~~~~~~ 387 (395) T protein:vir:38 380 FKTVANQA 387 (395) T ss_pred eecccCCC Confidence 99998666 No 29 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=100.00 E-value=6.4e-62 Score=356.08 Aligned_cols=376 Identities=16% Similarity=0.142 Sum_probs=271.3 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEV 80 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) |-+ .|++|+++++++.++++++.++.. ...++++++.++++.|+++++..++..+................. T Consensus 1 M~k-~l~el~~~~~~~~~e~~~~~~~~~-----~~~ee~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~-- 72 (404) T protein:vir:10 1 MSK-ELRELLNQLDSKNKELNSLLNKDG-----VTAEELNKTSNEIDILQAKIEAQKRKENIENNFNEDNVKSLNTGK-- 72 (404) T ss_pred CcH-HHHHHHHHHHHHHHHHHHHHhhcC-----CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc-- Confidence 664 588888888888877766655322 122456677888888888887655444433332222111111111 Q ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhh Q lcl|Aclame:pro 81 TQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKP 160 (394) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~ 160 (394) ............++............ ...............+.+.|++++|+++.+.|++.+++.++|++ T Consensus 73 ---~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~e~~a~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~ 142 (404) T protein:vir:10 73 ---EENVIYNGALFVRAIADNLLKQKNQR-------GLNLSEKEINAISENIDEDGGYAVPEDIQTKINTRLKDTTDLYN 142 (404) T ss_pred ---chhhHHHHHHHHHHHHHHHHHHHHhh-------hhcchhhHHhhhccccCCCCceeechhHHHHHHHHHhhhhhHhh Confidence 11111111111111111100000000 00000001111223455677889999999999999999999999 Q ss_pred eeeeEeecCCceeEEEEecCC-Ccccccccccccccc-cccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHH Q lcl|Aclame:pro 161 FTTVYQAKKASGKYPVLQRAT-TKMVTVAELEKNPAL-AKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESIS 238 (394) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~e~~~~~~~-~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~ 238 (394) +|+++++++.++.+++....+ ..+.|+.|++..+.. ++++|+++++++++++++++||+|+++|+.++|.+||.+.|+ T Consensus 143 l~~~~~~~~~~g~~~~~~~~~~~~~~~v~e~~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la 222 (404) T protein:vir:10 143 MVDYEPVFTRSGSRTYEKRSKQKPMKPLSENQQIPTNGDNGKLERFNFKLKDLADFMSIPNDLLKFADKSLEDWIINWFV 222 (404) T ss_pred hhceeeccCCccceEEEEecCCcceeeccccccccccccccceeeeEeeheeeEeeehhhHHHHhhcHHHHHHHHHHHHH Confidence 999999987777666655444 456666777776654 468999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhcccccccc---------------ccccHHHHHHHHHhhhhhhc--ccEEEEcHHHHHHHHhhhccCC Q lcl|Aclame:pro 239 QIKVNTTNDAIAKVLKSFTTK---------------TVKNLDEIKALLNGGFDPAY--NVSLIVSQSFYQTLDTLKDGNG 301 (394) Q Consensus 239 ~~~~~~~~~a~~~g~~~~~~~---------------~~~~~~~i~~~~~~~~~~~~--~a~~vm~~~~~~~l~~lkd~~G 301 (394) ++++++++.+|+.|+|++.+. +..+++++.+++...+.+.+ +++|+|||++|..|++|||++| T Consensus 223 ~~~~~~~~~~il~G~g~~~~~~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G 302 (404) T protein:vir:10 223 DKVRITRNAEILYGAGGDEHATGIMTANKFKKITLPKSPALKDFKKCKNVELLNVFKATSSWIVNQDGFNYLDSLEDKTG 302 (404) T ss_pred HHHHHHHHHHHhhcCCCCCcccceeeccccceeeccccccHHHHHHHHHhhhhccccCCCEEEEcHHHHHHHHHhhccCC Confidence 999999999999999875432 22357888887765555554 5789999999999999999999 Q ss_pred ceeecccccCCCcccccccceEEecCc----ccccCceEEEeccccEEEEeecceEEEEeeccc--c---cceEEEEEEe Q lcl|Aclame:pro 302 RYLLQDDITAVSGKVLLGKPVFVLSDE----VLGANKAFIGDFKRGVLFADRKDLGLRWADNEI--Y---GQYLQAVLRF 372 (394) Q Consensus 302 ~~l~~~~~~~~~~~~l~G~pV~~~~~~----~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--~---~~~~r~~~r~ 372 (394) +|+|.|++.++.+++|+|+||+++++. ..+..+++||||+++|++++|.+++|.++++.+ | ...+|+++|+ T Consensus 303 ~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~ 382 (404) T protein:vir:10 303 RPYLQPDPKDPTQYRFLGLPVIELPNDLLLSTESAIPVLLGDTKEAYKYVSDGAYELATTNIGAGAFETNTTKARIIMRI 382 (404) T ss_pred ceeeccCcCCCCCccccceeeEEecccccCCCCCccEEEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEee Confidence 999999999999999999999876653 234566899999999999999999999877643 3 3468999999 Q ss_pred ccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 373 GVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 373 d~~v~~~~af~~l~~~~~~~~~ 394 (394) |++|.+|+||++++++++++|- T Consensus 383 d~~v~~~~a~~~~~~~~aa~~~ 404 (404) T protein:vir:10 383 DGNVKDSEALLIAEIPVESVQA 404 (404) T ss_pred ccEEecccceEEEEeecccCCC Confidence 9999999999999999999999 No 30 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=100.00 E-value=4.9e-62 Score=356.71 Aligned_cols=368 Identities=16% Similarity=0.133 Sum_probs=271.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHH-HhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKN-ALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKE 79 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~-~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~ 79 (394) |=..+|++|+++++++.++++.+.+++.. .++++..+++++++.+++.+++++++..+..+................. T Consensus 1 m~~~~l~~l~e~r~~~~~e~~~L~~~~~~~~lt~e~~~~~~~l~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~- 79 (390) T protein:vir:62 1 MDATTLSANFEARERATAELRTLTDEFAGKEMTDEAREKEERLITAVSDYDARIKRGIEAIKAIDPVTSLLSGLQGSGS- 79 (390) T ss_pred CChhHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc- Confidence 99999999999999999999888876653 4566777778888888888888887665554443322211111111111 Q ss_pred ccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhh Q lcl|Aclame:pro 80 VTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLK 159 (394) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~ 159 (394) ............+.++.... ..+ ..........+..++++++++|+.+...|.+.++..++|+ T Consensus 80 --~~~~~~~~~~~~~~r~~~~~-------~~r--------~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~ 142 (390) T protein:vir:62 80 --GAQRSADVDDDATLRAGNLG-------EAR--------SFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMR 142 (390) T ss_pred --cchhhcchHHHHHHhhhhhh-------hhH--------HHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhh Confidence 00111111111222221100 000 0000111122334445555666665666777888888899 Q ss_pred heeeeEeecCC-ceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHH Q lcl|Aclame:pro 160 PFTTVYQAKKA-SGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESIS 238 (394) Q Consensus 160 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~ 238 (394) .+|++++++++ .+.+|+.++ ...+.|++|++..+ .++++|+++++++|+++++++||+|+++||.++|++||.+.|+ T Consensus 143 ~~~~~~~~~~~~~~~~p~~~~-~~~a~wv~E~~~~~-~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~ 220 (390) T protein:vir:62 143 GGATTFTTSDANPLDFTVITG-RSSASIVGETAEIP-ESYPATAQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAG 220 (390) T ss_pred hcceeeecCCCceeEEEEEcC-Ccceeeeccccccc-ccccceeeeEeeeeeEEeehHHHHHHHhhhhHHHHHHHHHHHH Confidence 99999998765 467887653 34556666666666 5789999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhcccccc----------------ccccccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCC Q lcl|Aclame:pro 239 QIKVNTTNDAIAKVLKSFT----------------TKTVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNG 301 (394) Q Consensus 239 ~~~~~~~~~a~~~g~~~~~----------------~~~~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G 301 (394) ++++.+++.+|++|+|.+. ..+..+++++++++..+...+. +++|+||++++..|++|||++| T Consensus 221 ~~i~~~~d~~~l~G~G~p~Gi~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g 300 (390) T protein:vir:62 221 PAIGDAMGRHFITGTGQPRGILTDASPATATFLATDTDSKVSDALIDLFHEVPSAYRANAKYVVNDLRAAQMRKLKDANG 300 (390) T ss_pred HHHHHHHHhhhhccCCccccccccccccccceecccccccchHHHHHHHHhhhhhhhcCCEEEEchHHHHHHHHhhccCC Confidence 9999999999999877421 0123457889998877655543 6899999999999999999999 Q ss_pred ceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeecccccc---eEEEEEEeccEEec Q lcl|Aclame:pro 302 RYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEIYGQ---YLQAVLRFGVSKVD 378 (394) Q Consensus 302 ~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~---~~r~~~r~d~~v~~ 378 (394) +|||+|++..+.+.+|+|+||++++++ +++.++||||++ |+++++.+++++.+.+.+|.. .+|+++|+|++|++ T Consensus 301 ~~l~~~~~~~g~~~~l~G~Pv~~~~~~--p~~~i~~gd~s~-~~i~~~~~~~v~~~~~~~~~~~~~~~~~~~r~d~~~~~ 377 (390) T protein:vir:62 301 QYLWQSGLTVGAPSLFNGKVVETDDGM--PADKILFADLSK-YRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRADGLLVD 377 (390) T ss_pred CeeecCCcCCCccceecccceEEecCC--CCccEEEeeccc-eeEEeecceEEEeeccccccCCcEEEEEEEEeCcEeec Confidence 999999999999999999999997754 677899999997 678889999999998887755 47999999999999 Q ss_pred ccceEEEEecCcc Q lcl|Aclame:pro 379 DKAGYYVTFTPEP 391 (394) Q Consensus 379 ~~af~~l~~~~~~ 391 (394) |+||++|++++++ T Consensus 378 ~~A~~~l~~~~~a 390 (390) T protein:vir:62 378 ARGAKVLTVTPGA 390 (390) T ss_pred hhheEEEEeecCC Confidence 9999999999999 No 31 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=100.00 E-value=5.6e-62 Score=356.39 Aligned_cols=363 Identities=15% Similarity=0.140 Sum_probs=250.1 Q ss_pred ChHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhchhhHHHHHH---------HHHHHHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 1 MFEEKIKEIKATI-ADLNNTIVTKTAQVKNALESDDLEAARS---------IKAEVEQAKANLVEAENDLKLYESSVEVG 70 (394) Q Consensus 1 ~l~e~l~eL~~~~-~el~~~~~~~~~e~~~~~~~e~~~~~~~---------~~~ei~~l~~~i~~l~~~~~~~~~~~~~~ 70 (394) .+...|.|++++. +++++.++++.++.++. .++..+++++ ..++++.++.+++.++..+++........ T Consensus 20 ~~~~~l~e~ra~~~~e~~~l~~~~~~~~~~~-k~~~~~~~~~~~~~~~~~e~~~~~~~~~~ei~~~~~~~~~~~~~~~~~ 98 (425) T protein:vir:10 20 AVPRGIISVRAEGPTEVKALIENLQKAFHDF-KAEHTKQLDAVKAGLPTSDALAKVDKVSADLEALQAAVDEANIKIAAA 98 (425) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 1122222333221 22222222222222211 1111122222 22334444444444444433322221111 Q ss_pred cccccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHH Q lcl|Aclame:pro 71 GAENIGGKEVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAR 150 (394) Q Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~ 150 (394) .. ...........+..+.+..+++... . .......+++.|++++|++++..|++ T Consensus 99 ~~--~~~~~~~~~~~~~~~af~~~l~~~e--------------~----------~~al~~~t~~~gG~lvP~~~~~~ii~ 152 (425) T protein:vir:10 99 QM--GANGVKPLRDPEYTEAFKAHVKRGD--------------V----------QAALNKGEDSEGGYLTPIEWDRTITN 152 (425) T ss_pred hc--ccccccccccHHHHHHHHHHhhhhh--------------h----------HHHhhcCcCCCCceeccHhHHHHHHH Confidence 10 0011111111122222322222110 0 00112345677889999999999999 Q ss_pred HHHhhhhhhheeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHH Q lcl|Aclame:pro 151 EVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLV 230 (394) Q Consensus 151 ~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~ 230 (394) .++..++|+++|++++++++.+.+|+... +..+.|++|++..++...++|++++++++++++++++|+|+++|+.++|+ T Consensus 153 ~~~~~s~l~~l~~~~~~~~~~~~~~~~~~-~~~a~wv~E~~~~~~~~~~~f~~v~~~~~k~~~~i~iS~ell~ds~~~l~ 231 (425) T protein:vir:10 153 KLVLISPMRQLCRVQPVSKAGFSKLFNMG-GTTSGWVGEASQRPQTNAATFQPLSFASGEIYANPAATQQILDDAEIDLE 231 (425) T ss_pred HHHhhhhhhhhceeeeccCCceEEEEEcC-CcceeeeccccccccccccccceeeeeheeeEeehHhHHHHHhcchhHHH Confidence 99999999999999999999999998764 45667777777777656689999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHhhccccccccc----------------------------cccHHHHHHHHHhhhhhhc-cc Q lcl|Aclame:pro 231 GIVSESISQIKVNTTNDAIAKVLKSFTTKT----------------------------VKNLDEIKALLNGGFDPAY-NV 281 (394) Q Consensus 231 ~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~----------------------------~~~~~~i~~~~~~~~~~~~-~a 281 (394) +||.++|+++++.+++.+|++|+|++.|.| ..+++++++++..+...++ ++ T Consensus 232 ~~i~~~la~ai~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~~~a 311 (425) T protein:vir:10 232 SWLATEVQTEFAKQEGKAFLAGDGTNKPNGLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPSAFTGNA 311 (425) T ss_pred HHHHHHHHHHHHHHHHhhhhcccCCCCcceeeeccccccccccccccccccccccccccccHHHHHHHHhhhhhhhccCC Confidence 999999999999999999999988765532 2357888888877655554 68 Q ss_pred EEEEcHHHHHHHHhhhccCCceeecccccCCCcccccccceEEecCcc---cccCceEEEeccccEEEEeecceEEEEee Q lcl|Aclame:pro 282 SLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEV---LGANKAFIGDFKRGVLFADRKDLGLRWAD 358 (394) Q Consensus 282 ~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~---~~~~~~~~gd~~~~~~~~~~~~~~i~~~~ 358 (394) +|+|||++|..|++|+|++|+|||+|++..+.+++|+|+||+++++++ .+..+++||||+++|++++|.++++..+. T Consensus 312 ~~vmn~~~~~~L~~lkD~~G~~l~~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~v~~d~ 391 (425) T protein:vir:10 312 RFAMNRNTQRQVRKLKDGQGNYLWQPSYVAGQPATLAGYPVTEVPDMPDVAANSTPILFGDFQQTYLIIDRIGVRVLRDP 391 (425) T ss_pred EEEEchHHHHHHHHhhcCCCceeeccCccCCCCceecceeeEEecCcCCccCCccEEEEEehhccEEEEEecceEEEecc Confidence 999999999999999999999999999999999999999999987654 24456899999998999999998776544 Q ss_pred cc-cccceEEEEEEeccEEecccceEEEEecCcc Q lcl|Aclame:pro 359 NE-IYGQYLQAVLRFGVSKVDDKAGYYVTFTPEP 391 (394) Q Consensus 359 ~~-~~~~~~r~~~r~d~~v~~~~af~~l~~~~~~ 391 (394) +. .....+|++.|+|++|++|+||++++++++= T Consensus 392 ~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 392 YTAKPYVLFYTTKRVGGGLLNPEPMRAMKVAASE 425 (425) T ss_pred cccCCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 32 2334589999999999999999999999988 No 32 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=100.00 E-value=1.6e-61 Score=353.86 Aligned_cols=377 Identities=15% Similarity=0.140 Sum_probs=278.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH-HhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc---ccccc Q lcl|Aclame:pro 4 EKIKEIKATIADLNNTIVTKTAQVKN-ALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAEN---IGGKE 79 (394) Q Consensus 4 e~l~eL~~~~~el~~~~~~~~~e~~~-~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~---~~~~~ 79 (394) =+|+||+++++++.++++++.+++.+ .+++++.+++++++++++.++++++++++..+............. ..... T Consensus 1 M~l~eL~e~r~~l~~e~~~l~~k~~~~~~t~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (409) T protein:vir:45 1 MKLHELKQKRNTIATDMRALNEKIGDNAWTEEQRTEWNKAKSELEALDERIAREEELRRQDQAYIESNEEEQRQNLDPEN 80 (409) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcccCCCCC Confidence 13789999999999999998887654 467788888999999999999999887766655443332211111 11111 Q ss_pred ccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhh Q lcl|Aclame:pro 80 VTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLK 159 (394) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~ 159 (394) ...........+..+++....... ......... .......+...|+++||+++.+.|++.+++.++|+ T Consensus 81 ~~~~~~~~~~a~~~~l~~~~~~~~--------~~e~~~~~~----~~a~~~~~~~~gg~liP~~~~~~ii~~~~~~~~l~ 148 (409) T protein:vir:45 81 NSQQDEKRAQVFDKWMRHGASELT--------SEERKALRE----LRAQGVAQDEKGGYTVPETFLAKVVEKMKSYGGIA 148 (409) T ss_pred cchhhHHHHHHHHHHHHhhhhhcc--------HHHHHHHHH----HhhccCccCcCCceeccHhHHHHHHHHHHhhhhhh Confidence 112222233344444433221110 000111111 11122345566789999999999999999999999 Q ss_pred heeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhh-hhhhhhHHHHhccHHHHHHHHHHHHH Q lcl|Aclame:pro 160 PFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYR-GAIPLSQESIDDADVDLVGIVSESIS 238 (394) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~-~~~~vs~ell~ds~~~l~~~i~~~l~ 238 (394) ++|++++++++....+.........++|.+|++..+.++++|+++++.+++++ ++++||+|+++|+.++|++||.+.|+ T Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~f~~~~l~~~k~~~~~i~is~ell~ds~~~l~~~i~~~la 228 (409) T protein:vir:45 149 SVAQILTTSDGRTMEWATADGTSEVGVLLGENEEAGEEDTDFGMGSLGALKMTSKIIRVSNELLQDSAIDMEAYLARRIA 228 (409) T ss_pred hhceeeecCCCceEEEEeeccCccccccccccccccccccccceeeeeeeeeeeeehhhhHHHHhccHHHHHHHHHHHHH Confidence 99999999776544443333333444454444444568899999999999985 68999999999999999999999999 Q ss_pred HHHHHHHHHHHhhcccccc---cc---------------ccccHHHHHHHHHhhhhhhc-ccE--EEEcHHHHHHHHhhh Q lcl|Aclame:pro 239 QIKVNTTNDAIAKVLKSFT---TK---------------TVKNLDEIKALLNGGFDPAY-NVS--LIVSQSFYQTLDTLK 297 (394) Q Consensus 239 ~~~~~~~~~a~~~g~~~~~---~~---------------~~~~~~~i~~~~~~~~~~~~-~a~--~vm~~~~~~~l~~lk 297 (394) ++++.+++.+|++|+|++. |. +..+++++.+++..+...++ +++ |+||++++..|++|+ T Consensus 229 ~a~~~~~~~a~l~G~G~~~~~~p~Gil~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a~~~~~~n~~~~~~l~~lk 308 (409) T protein:vir:45 229 ERIGRGEARYLIQGTGAGTPKQPKGLAASVTGTTQTAAANAVKWQEILALKHSIDPAYRRGPKFRLAFNDNTLKLISEME 308 (409) T ss_pred HHHHHHHHHHhhccCCCCCccccceeeeccccccccccccccchHHHHHHHHhhhhhhccCCeEEEEECHHHHHHHHHhh Confidence 9999999999999988752 21 12357889998887655554 455 478999999999999 Q ss_pred ccCCceeecccccCCCcccccccceEEecCccc---ccCceEEEeccccEEEEeecceEEEEeecccccc---eEEEEEE Q lcl|Aclame:pro 298 DGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVL---GANKAFIGDFKRGVLFADRKDLGLRWADNEIYGQ---YLQAVLR 371 (394) Q Consensus 298 d~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~---~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~---~~r~~~r 371 (394) |++|+|||++++..+.+.+|+|+||+++++++. +..+++||||++ |++.++.+++++++.+.++.. .+|++.| T Consensus 309 d~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~-~~i~~~~~~~~~~~~d~~~~~~~~~~~~~~r 387 (409) T protein:vir:45 309 DGQGRPLWLPDIVGVAPASVLNVPYVIDQEIDDIGAGKKFMFCGDFDR-FIIRRVRYMILKRLVERYAEYDQTGFLAFHR 387 (409) T ss_pred cCCCceeeccCcCCCCCceecceeeEEecCcCCccCCccEEEEeehhh-hheeeccceEEEEeecccccCCcEEEEEEEE Confidence 999999999999999999999999999876643 445688999997 456789999999988877654 5899999 Q ss_pred eccEEecccceEEEEecCccCC Q lcl|Aclame:pro 372 FGVSKVDDKAGYYVTFTPEPLP 393 (394) Q Consensus 372 ~d~~v~~~~af~~l~~~~~~~~ 393 (394) +|+++++|+||+++++++++.- T Consensus 388 ~d~~~~~~~A~~~l~~k~s~~~ 409 (409) T protein:vir:45 388 FDCILEDTSAIKALVGKGSVGG 409 (409) T ss_pred eccEeechhheEEEEeccCCCC Confidence 9999999999999999998877 No 33 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=100.00 E-value=1.3e-61 Score=354.37 Aligned_cols=367 Identities=15% Similarity=0.125 Sum_probs=271.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHH-HhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKN-ALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKE 79 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~-~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~ 79 (394) |=..+|++|+++++++.++++++.+++.. .++++..+++++++.+++.+++++++..+.++.............. . T Consensus 1 m~~~~l~~l~e~r~~~~~e~~~l~~~~~~~~~~~e~~~~~~~l~~e~~~l~~~i~~~~e~~~~~~~~~~~~~~~~~---~ 77 (392) T protein:vir:13 1 MDATTLSANFEARERATAELRSLTDEFAGKEMTAEAREKEERLLTAVADFDGRIKRGIDAIKATDAVTSLLSGLQG---S 77 (392) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCC---c Confidence 99999999999999999999998887753 4556667778888888888888887654444443332222111111 1 Q ss_pred ccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHH-HHHHhhhhh Q lcl|Aclame:pro 80 VTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPA-REVKTVVDL 158 (394) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~-~~~~~~~~l 158 (394) ..............+.++... ...+ .......... .+.++++.++|+++...++ +.+...++| T Consensus 78 ~~~~~~~~~~~~~~~~r~g~~-----------~~~~----~~~~~~~~~~-~t~~~~g~~~~~~~~~~~i~~~~~~~~~l 141 (392) T protein:vir:13 78 GSGAQRSADHDDDAVLRAGNL-----------GEAR----SFEFAPEKRD-GTKAGNPNVLSRTLYGQLIAQAVERSAIM 141 (392) T ss_pred ccchhhhhhHHHHHHHhccch-----------hhhH----HHHhhhhhhc-ccccCCCccccccchHHHHHHHHhhhhhh Confidence 111111111112222221110 0000 0000111122 2333444455555555554 566777788 Q ss_pred hheeeeEeecCC-ceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHH Q lcl|Aclame:pro 159 KPFTTVYQAKKA-SGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESI 237 (394) Q Consensus 159 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l 237 (394) +.+++++++.++ .+.+|.... .+.+.|++|++..+ .++++|+++++++++++++++||+|+++|+.++|++||.+.| T Consensus 142 ~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~-~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l 219 (392) T protein:vir:13 142 RGGASTFTTSDANPMDFTVITG-RATAGIVGETAEIP-ESYPATTQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDA 219 (392) T ss_pred hhcceeeecCCCceeEEEEEcC-Ccceeeeccccccc-ccccceeeEEeeeeeEEeeehhHHHHHhcchHHHHHHHHHHH Confidence 999999988655 467776653 45566667766666 578999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhcccccccccc------------------ccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhc Q lcl|Aclame:pro 238 SQIKVNTTNDAIAKVLKSFTTKTV------------------KNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKD 298 (394) Q Consensus 238 ~~~~~~~~~~a~~~g~~~~~~~~~------------------~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd 298 (394) +++++.+++.++++|+|++.|.+. .+++++++++..+...++ +++|+||++++..|++|+| T Consensus 220 ~~~i~~~~d~~~l~G~Gt~~p~Gil~~~~~~~~~~~~~~~~~~~~d~l~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd 299 (392) T protein:vir:13 220 GPAIGDAMGRHFLTGTGTGQPRGILTDATGANAAFGEADADSKVSDALIDLFHEVPSAYRKNAKFVVNDLRAAQMRKLKD 299 (392) T ss_pred HHHHHHHHHHHHhcccCCccccccccccccccccccccccccccHHHHHHHHHhhhhhhhcCCEEEEcHHHHHHHHHhhc Confidence 999999999999999987655332 347888888877655554 6899999999999999999 Q ss_pred cCCceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeecccccc---eEEEEEEeccE Q lcl|Aclame:pro 299 GNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEIYGQ---YLQAVLRFGVS 375 (394) Q Consensus 299 ~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~---~~r~~~r~d~~ 375 (394) ++|+|||+|+++.+.+++|+|+||++++++ ++++++||||++ |+++++++++++.+.+.+|.+ .+|++.|+|++ T Consensus 300 ~~G~~l~~~~~~~g~~~~l~G~Pv~~~~~~--~~~~i~~Gdf~~-~~i~~~~~~~i~~~~~~~~~~~~~~~r~~~r~d~~ 376 (392) T protein:vir:13 300 ANGQYLWQSALTVGAPDTFNGKVVETDDGM--PADKVLFADLSK-YRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRADGL 376 (392) T ss_pred cCCceeecCCcCCCCCceecceeeEEcCCC--CCCcEEEeeccc-eeEEeecceEEEeeccccccCCcEEEEEEEEeccE Confidence 999999999999999999999999997644 678899999997 678899999999998888764 58999999999 Q ss_pred EecccceEEEEecCcc Q lcl|Aclame:pro 376 KVDDKAGYYVTFTPEP 391 (394) Q Consensus 376 v~~~~af~~l~~~~~~ 391 (394) +++|+||+.++++++| T Consensus 377 ~~~~~A~~~~~~~~aa 392 (392) T protein:vir:13 377 LVDARGAKVLTVTPAA 392 (392) T ss_pred EecccceEEEEeeccC Confidence 9999999999999999 No 34 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=100.00 E-value=1.5e-60 Score=348.51 Aligned_cols=366 Identities=12% Similarity=0.117 Sum_probs=270.8 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHH--hchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Q lcl|Aclame:pro 2 FEEKIKEIKATIADLNNTIVTKTAQVKNA--LESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKE 79 (394) Q Consensus 2 l~e~l~eL~~~~~el~~~~~~~~~e~~~~--~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~ 79 (394) +.+.+++|+++++++.++++++.+++... ++++....+++++++++++++++++++++++..+.............. T Consensus 1 m~e~~~~l~~~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~- 79 (390) T protein:vir:10 1 MTDITSKLEATLANVTDSLRAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSVG- 79 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccchh- Confidence 67778889999999988888887766542 445566677788888888888888777666655443322221111100 Q ss_pred ccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhh Q lcl|Aclame:pro 80 VTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLK 159 (394) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~ 159 (394) ........+..+.... .. ... ................++..++.++|+++...|++.+++.++|+ T Consensus 80 ---~~~~~~~~~~~~~~~~----~~---~~~-----~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~ 144 (390) T protein:vir:10 80 ---DLFVASEQFQASAGRW----ND---RSA-----RATMNIKAALNTASTDAAGSAGALTTPNRLPGFITQPDARLTVR 144 (390) T ss_pred ---hhhhhhHHHHHHHHhh----hh---hhh-----hhhhHHHHHHHhhhcccccccccccchhHHHHHHHHHHhhchhh Confidence 0000111111111000 00 000 00111111222233344455566778888999999999999999 Q ss_pred heeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHH Q lcl|Aclame:pro 160 PFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQ 239 (394) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~ 239 (394) ++|+++++.++.+++|+.+...+.+.|+.|++..+ .++++|++++++++++++++++|+++++|+. ++.+||.++|++ T Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~-~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-~l~~~i~~~l~~ 222 (390) T protein:vir:10 145 DLIGSGRTDSALIEYVQETGFVNNAAIVAEGALKP-ESSLKFAKKTDTTHVIAHTMKATRQILSDAP-QLASYMNNRLIR 222 (390) T ss_pred hhcceeeccCCceEEEEEecCCcceeeecCCcccc-ccccceeEEEEeeEEEEEeehhhHHHHHhHH-HHHHHHHHHHHH Confidence 99999999999999999876666777777777766 5789999999999999999999999999986 799999999999 Q ss_pred HHHHHHHHHHhhcccccc-cc----------------ccccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCC Q lcl|Aclame:pro 240 IKVNTTNDAIAKVLKSFT-TK----------------TVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNG 301 (394) Q Consensus 240 ~~~~~~~~a~~~g~~~~~-~~----------------~~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G 301 (394) +++.+++.++++|+|++. |. +...++++.+++..+...++ +++|+|||++|..|++|+|++| T Consensus 223 ~~~~~~~~~il~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g 302 (390) T protein:vir:10 223 GLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIELAKDANN 302 (390) T ss_pred HHHHHHHHHHhhcCCCCccccccccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCC Confidence 999999999999987654 11 11235777777776666655 5789999999999999999999 Q ss_pred ceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeec-ccccc---eEEEEEEeccEEe Q lcl|Aclame:pro 302 RYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADN-EIYGQ---YLQAVLRFGVSKV 377 (394) Q Consensus 302 ~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~-~~~~~---~~r~~~r~d~~v~ 377 (394) +|||++... +.+++|+|+||++++. .++++++||||+++|.++++++++|+++++ .+|.+ .+|+++|+|++|+ T Consensus 303 ~~l~~~~~~-~~~~~l~G~pv~~~~~--~p~~~~~~gdf~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~~~r~d~~v~ 379 (390) T protein:vir:10 303 QYLIGNARG-TLTPTLWGLPVVATQA--MAPGEFLVGAFDLAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVY 379 (390) T ss_pred ceeecCCcC-cCCceecceeeEEcCC--CCCCcEEEEeccceEEEEEecceEEEEeecccccccCcEEEEEEEeeccEEe Confidence 999987654 4456999999999764 467789999999989999999999998875 45544 5799999999999 Q ss_pred cccceEEEEec Q lcl|Aclame:pro 378 DDKAGYYVTFT 388 (394) Q Consensus 378 ~~~af~~l~~~ 388 (394) +|+||++++++ T Consensus 380 ~~~a~~~~~~a 390 (390) T protein:vir:10 380 RPEALISGSFA 390 (390) T ss_pred ccccEEEEEeC Confidence 99999999999 No 35 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=100.00 E-value=8.2e-60 Score=344.51 Aligned_cols=380 Identities=14% Similarity=0.098 Sum_probs=255.4 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHH---HHHHHHHhchhh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------Hhh Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTK---TAQVKNALESDD-LEAARSIKAEVEQAKANLVEAENDLKLYESS-------VEV 69 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~---~~e~~~~~~~e~-~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~-------~~~ 69 (394) ||..+++++++++.++.++.+++ .+++++.++..+ .++.+.+..+++.++++.+++.+..+.++.. .+. T Consensus 7 ~~~~el~~~~~~l~el~~~~~el~~~~~el~~~~e~ak~eee~~~l~~ei~~le~e~~~l~~~~~~le~~~~~~~~~l~~ 86 (425) T protein:vir:95 7 MLTKKIEQRKAALDELVKREQELQAKAAELEQAIEEAQTEEEVSAVEEEVAKLEDERNELNEKKSKLEGEIAQLEDELEQ 86 (425) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 78888888888777777665444 333433322222 2344455555555555555544444433322 111 Q ss_pred ccccccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHH Q lcl|Aclame:pro 70 GGAENIGGKEVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPA 149 (394) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~ 149 (394) .....................+..... ...+. ............ ...........++++++++||+++.+.|+ T Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~--~~~~~~~~~~~~--~~~~~~~~~~~~~~~gg~~vP~~~~~~Ii 159 (425) T protein:vir:95 87 INSKQPSNQSRQKMQGSKGDVVEMNRL---QVREM--LKTGEYYKRSEV--VEFYEKFRNLRAVAGGELTIPEVVVNRIM 159 (425) T ss_pred hhhhccchhhhhhhhhhhhhHHHHHHH---HHHHH--HhhhhhhhhhHH--HHHHHHHHhhcccccCceeccHHHHHHHH Confidence 110000000000000000000000000 00000 000000000000 11111112234456688899999999999 Q ss_pred HHHHhhhhhhheeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHH Q lcl|Aclame:pro 150 REVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDL 229 (394) Q Consensus 150 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l 229 (394) +.+++.++|+++|++++++ +...+|+.. +.+.+.|+.|++..++...++|++|++++++++++++||+|+++|+.++| T Consensus 160 ~~l~~~~~i~~~~~~~~~~-g~~~ip~~~-~~~~a~~v~E~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~l 237 (425) T protein:vir:95 160 DIMGDYTTLYPLVDKIRVK-GTTRILVDT-DTSPATWIEQSGALPTGDVGTIASIDFDGFKVGKVTFVDNYLLQDSIINL 237 (425) T ss_pred HHHHhhhhHHHhhceeecC-ceeEEEEec-CCccccccccccccccccccccceeeeeheeeeeeehhhHHHHhccHHHH Confidence 9999999999999999985 567888764 45666777777777766668999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhccccc--cccc----------------cccHHHHHHHHHhhhhhh---cccEEEEcHH Q lcl|Aclame:pro 230 VGIVSESISQIKVNTTNDAIAKVLKSF--TTKT----------------VKNLDEIKALLNGGFDPA---YNVSLIVSQS 288 (394) Q Consensus 230 ~~~i~~~l~~~~~~~~~~a~~~g~~~~--~~~~----------------~~~~~~i~~~~~~~~~~~---~~a~~vm~~~ 288 (394) ++||.+.|++.++.+++.++++|+|++ .|.| ..+++++.+++......+ .+++|+||+. T Consensus 238 ~~~i~~~l~~~i~~~~d~~il~G~G~~~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~ 317 (425) T protein:vir:95 238 DDYVTKKIARAIAKALDLAIVKGTGAANKQPLGIIPSLPPENQVTVEADNNLLKNLVKQIGLIDTGDDSVGEIVAVMKRS 317 (425) T ss_pred HHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccccccccccccchHHHHHHHHHhhhhhccccCceEEEEeCh Confidence 999999999999999999999999865 2222 224677777765544433 3578999999 Q ss_pred HH----HHHHhhhccCCceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeecccccc Q lcl|Aclame:pro 289 FY----QTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEIYGQ 364 (394) Q Consensus 289 ~~----~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~ 364 (394) ++ ..|++++|++|+|||++. .+..++|+|+||++++.+ +++.++||||++ |++++|++++|..+++.+|.+ T Consensus 318 ~~~~~l~~l~~~kd~~g~~i~~~~--~~~~~~l~G~pvv~~~~~--~~~~i~~Gd~~~-~~~~~~~~~~i~~~~~~~f~~ 392 (425) T protein:vir:95 318 TYYNRLVEFSIQVDSNGNVVGKLP--NLRTPDLLGLRVVFNNFL--DDDTVLFGEFEQ-YTLVERENITIDSSTHVKFTE 392 (425) T ss_pred HHHHHHHHHHhhcCCCCceeeccC--CCCCccccceeeEEcCcC--CCccEEEEeccc-EEEEeecceEEEeeccccccc Confidence 84 346788999999999753 344569999999987644 677899999997 678889999999999988765 Q ss_pred ---eEEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 365 ---YLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 365 ---~~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) .+|+++|+|+++++|+||+++++++...+- T Consensus 393 ~~~~~~~~~r~d~~~~~~~a~~~~~i~~~~~g~ 425 (425) T protein:vir:95 393 DQTAFRGKGRFDGKPVKPEAFVLVTITDPVQGA 425 (425) T ss_pred CceEEEEEEeeCcEeecccceEEEEecCcCCCC Confidence 489999999999999999999999988777 No 36 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=100.00 E-value=1.4e-59 Score=343.22 Aligned_cols=377 Identities=13% Similarity=0.111 Sum_probs=265.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEV 80 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) =+++++++++++++++.++++++.++........ .+...+.+++++++.+++++++++++++++............... T Consensus 19 el~~~~~e~~~~l~~~~~e~~~~~e~~~~e~~~~-~~~~~e~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 97 (418) T protein:vir:10 19 HPEQVLETVTKELKRIGDEVKSAGEKALAEAKRA-GDLGVETKATVDELLIKQGELQARLLEAEQKLARGGGSAELETPK 97 (418) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhh Confidence 2555566666666655555555444333211111 012233455666677777777666666555444322221111100 Q ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhh Q lcl|Aclame:pro 81 TQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKP 160 (394) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~ 160 (394) . . ..........+.... . .... .................+.+.+.+++++|++++..|++.+++.++|++ T Consensus 98 ~--~-~~~~~~~~~~~~~~~--~----~~~~-~~~~~~~~~~~~~~~~~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~ 167 (418) T protein:vir:10 98 T--L-GQLVTESEEMKGMDG--S----ARKS-VRVRVDRKSIMNVPATVGSGVSGSNSLVVADRQAGIIAPPQRKMTIRD 167 (418) T ss_pred h--h-hHHhhhHHHHHHHHH--H----Hhhh-hhhhhHHHHHHHhhhhccCCCCCCccccchhHHHHHHHHHhhhhhHHh Confidence 0 0 000000001110000 0 0000 000011111111122344566677889999999999999999999999 Q ss_pred eeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHH Q lcl|Aclame:pro 161 FTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQI 240 (394) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~ 240 (394) +|++++++++++.+|+....+..+.|+.|++..+ .++++|++|++++++++++++||+++++|+. +|++||.+.|+++ T Consensus 168 ~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~-~~~~~f~~v~~~~~k~~~~~~is~ell~ds~-~l~~~i~~~l~~a 245 (418) T protein:vir:10 168 LLMPGQTSSSSIEYTVETGFTNNAAAVAEGAQKP-TSDLKFNLKNQPVRTIAHLFKASRQILDDAP-ALQSYIDGRARYG 245 (418) T ss_pred hcceeeccCCceeEEEEecCCCceeeeccCcccc-ccccceeeEEEeeeeEEEeehhhHHHHHhHH-HHHHHHHHHHHHH Confidence 9999999988899999876666677777776665 5789999999999999999999999999985 7999999999999 Q ss_pred HHHHHHHHHhhcccccc-ccc----------------cccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCc Q lcl|Aclame:pro 241 KVNTTNDAIAKVLKSFT-TKT----------------VKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGR 302 (394) Q Consensus 241 ~~~~~~~a~~~g~~~~~-~~~----------------~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~ 302 (394) ++.+++.++++|+|++. |.+ ..+++++.+++..+...++ +++|+|||.+|..|++++|++|+ T Consensus 246 ~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~ 325 (418) T protein:vir:10 246 LQLTEEGQILKGDGTGANILGILPQASAFMPSITLANATPIDKIRLALLQAVLAEFPATGIVLNPIDWASIELTKDSQGR 325 (418) T ss_pred HHHHHHHHHhccCCCCccccccccccccccccccccccccHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCc Confidence 99999999999988754 211 2346778888777666655 56899999999999999999999 Q ss_pred eeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeeccc--cc---ceEEEEEEeccEEe Q lcl|Aclame:pro 303 YLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEI--YG---QYLQAVLRFGVSKV 377 (394) Q Consensus 303 ~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--~~---~~~r~~~r~d~~v~ 377 (394) |||. ++.++.+++|+|+||++++. .+.+.++||||+++|+++++.+++|.++++.. |. ..+|+++|+|++++ T Consensus 326 ~i~~-~~~~~~~~~l~G~pV~~~~~--~p~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~~~d~~~~ 402 (418) T protein:vir:10 326 YIVG-NPVNGTTPRLWNLPVVETQA--MTANEFLVGAFSMAAQIFDRMEIEVLLSTENVDDFEKNMVSIRAEERLALAVY 402 (418) T ss_pred eecc-ccccCCCceecceeeEEcCC--CCCCcEEEeeccceEEEEEecceEEEEecccchhhhcCceEEEEEEeeccEEe Confidence 9994 56777788999999999764 46778999999988899999999999887643 43 45899999999999 Q ss_pred cccceEEEEecCccCC Q lcl|Aclame:pro 378 DDKAGYYVTFTPEPLP 393 (394) Q Consensus 378 ~~~af~~l~~~~~~~~ 393 (394) +|+||+++++++++.= T Consensus 403 ~~~a~~~~~~~~~~~g 418 (418) T protein:vir:10 403 RPESFVTGALVEQAGG 418 (418) T ss_pred cccceEEEEeccCCCC Confidence 9999999999998888 No 37 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=100.00 E-value=1e-59 Score=344.03 Aligned_cols=366 Identities=12% Similarity=0.115 Sum_probs=269.4 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHH--HhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Q lcl|Aclame:pro 2 FEEKIKEIKATIADLNNTIVTKTAQVKN--ALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKE 79 (394) Q Consensus 2 l~e~l~eL~~~~~el~~~~~~~~~e~~~--~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~ 79 (394) ..+.+++|+++++++++++++..+++.. .+.++..+.++++.++++++.++++++++++...+............ T Consensus 1 m~~l~~~l~~~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~l~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~--- 77 (390) T protein:vir:81 1 MTDITSKLEATLANVTDSLRAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVS--- 77 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc--- Confidence 4455566888888888888777665543 24556667777778888888888777766655544332221111110 Q ss_pred ccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhh Q lcl|Aclame:pro 80 VTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLK 159 (394) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~ 159 (394) ..+.......+..+...... .. .................+++.++.++|+++...|++.+++.++|+ T Consensus 78 -~~~~~~~~~~~~~~~~~~~~-------~~-----~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~ 144 (390) T protein:vir:81 78 -VGDMFVASEQFQASAGRWND-------RS-----ARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVR 144 (390) T ss_pred -chhhhhhhHHHHHHHHHHhh-------hh-----hhhhhHHHHHHHhhccccccCCcceechhhhHHHHHHHhhhhhhh Confidence 00010111111111100000 00 000111112222233455666777889999999999999999999 Q ss_pred heeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHH Q lcl|Aclame:pro 160 PFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQ 239 (394) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~ 239 (394) ++|++++++++.+.+|..+...+.+.|++|++..+ .++++|+++++++++++++++||+|+++|+. ++++||.+.|++ T Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~-~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-~~~~~i~~~l~~ 222 (390) T protein:vir:81 145 DLIGSGRTDSALIEYVQETGFVNNAAIVAEGALKP-ESSLKFAKKTDTTHVIAHTMKATRQILSDAP-QLASYMNNRLIR 222 (390) T ss_pred hhcceeeccCCceEEEEEecCCcceeeecCCcccc-cccceeeEEEEeeeEEEEeehhhHHHHHhHH-HHHHHHHHHHHH Confidence 99999999999999998876666677777776666 5789999999999999999999999999985 799999999999 Q ss_pred HHHHHHHHHHhhccccccc-c----------------ccccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCC Q lcl|Aclame:pro 240 IKVNTTNDAIAKVLKSFTT-K----------------TVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNG 301 (394) Q Consensus 240 ~~~~~~~~a~~~g~~~~~~-~----------------~~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G 301 (394) +++++++.++++|+|++.. . +...++++.+++..+...++ +++|+|||++|..|++|+|++| T Consensus 223 ~~~~~~d~a~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G 302 (390) T protein:vir:81 223 GLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYNPSGIVINPIDWAAIELAKDANN 302 (390) T ss_pred HHHHHHHHHHHhcCCCCCcccceeecccccccccccccchhHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCC Confidence 9999999999999887542 1 12346778888777666655 5789999999999999999999 Q ss_pred ceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeec-ccccc---eEEEEEEeccEEe Q lcl|Aclame:pro 302 RYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADN-EIYGQ---YLQAVLRFGVSKV 377 (394) Q Consensus 302 ~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~-~~~~~---~~r~~~r~d~~v~ 377 (394) +|||++. ..+.+++|+|+||+++++ .++++++||||+++|++++|++++|+++++ .+|.+ .+|+++|+|++|. T Consensus 303 ~~l~~~~-~~~~~~~l~G~pv~~~~~--~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~v~~r~~~r~d~~v~ 379 (390) T protein:vir:81 303 QYLIGNA-RGTLTPTLWGLPVVATQA--MAPGEFLVGAFDLAAQIFDQWDARVEIGYVGEDFQRNMITVLAEERLALVVY 379 (390) T ss_pred ceeecCc-ccccCceecceeeEEcCC--CCCCcEEEEehhceEEEEEecceEEEEecccchhhcCcEEEEEEEeeccEEe Confidence 9999874 455667999999999764 467789999999988999999999999875 44544 5899999999999 Q ss_pred cccceEEEEec Q lcl|Aclame:pro 378 DDKAGYYVTFT 388 (394) Q Consensus 378 ~~~af~~l~~~ 388 (394) +|+||++++++ T Consensus 380 ~~~a~v~~t~a 390 (390) T protein:vir:81 380 RPEALISGSFA 390 (390) T ss_pred cccceEEEEeC Confidence 99999999999 No 38 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=100.00 E-value=7.7e-60 Score=344.68 Aligned_cols=362 Identities=14% Similarity=0.150 Sum_probs=264.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc Q lcl|Aclame:pro 3 EEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVTQ 82 (394) Q Consensus 3 ~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 82 (394) |++|++|+++++++.++++++.++.+....+ ..++.+++++++..+.++++++++.++..+........... . T Consensus 1 M~~l~el~~~~~~~~~e~~~l~~~~~~e~~~-~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~ 73 (385) T protein:vir:19 1 MSELALIQKAIEESQQKMTQLFDAQKAEIES-TGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPG------E 73 (385) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc------h Confidence 6689999999999999988877666543221 12344555566666666655555444443332221111100 0 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhhee Q lcl|Aclame:pro 83 EEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFT 162 (394) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~ 162 (394) .............+... .. ... ............+...++.++|+++...|++.++..++|+++| T Consensus 74 ~~~~~~~~~~~~~~~~~----~~---~~~--------~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~ 138 (385) T protein:vir:19 74 KKSFSERAAEELIKSWD----GK---QGT--------FGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLL 138 (385) T ss_pred hhhhHHHHHHHHHHHHH----Hh---hcc--------chhhHHHhhhccccccCCceecchhhhHHHHHhhhccchhhhc Confidence 00000011111111000 00 000 0000011122344455567888899999999999999999999 Q ss_pred eeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 163 TVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKV 242 (394) Q Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~ 242 (394) +++++.++++.+|+.+.....+.|+.|++..+ .++++|+++++++++++++++||+|+++|+. ++++||.+.|+++++ T Consensus 139 ~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~-~~~~~~~~~~~~~~k~~~~~~is~ell~d~~-~l~~~i~~~la~a~~ 216 (385) T protein:vir:19 139 AQGRTSSNALEYVREEVFTNNADVVAEKALKP-ESDITFSKQTANVKTIAHWVQASRQVMDDAP-MLQSYINNRLMYGLA 216 (385) T ss_pred ceecccCcceEEEEEecCCcceeeeccCcccc-ccccceeEEEEeeeeEEEeehhhHHHHhhHH-HHHHHHHHHHHHHHH Confidence 99999988899998876556666666665555 6789999999999999999999999999985 699999999999999 Q ss_pred HHHHHHHhhcccccccc-----------------ccccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCcee Q lcl|Aclame:pro 243 NTTNDAIAKVLKSFTTK-----------------TVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYL 304 (394) Q Consensus 243 ~~~~~a~~~g~~~~~~~-----------------~~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l 304 (394) .+++.++++|+|++.+. +...++++.+++..+...++ +++|+|||++|..|++++|++|+|| T Consensus 217 ~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l 296 (385) T protein:vir:19 217 LKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIALLKDNEGRYI 296 (385) T ss_pred HHHHHHHHhccCCCCcccccccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCcee Confidence 99999999998876542 12346788888877766655 5799999999999999999999999 Q ss_pred ecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeeccc--ccc---eEEEEEEeccEEecc Q lcl|Aclame:pro 305 LQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEI--YGQ---YLQAVLRFGVSKVDD 379 (394) Q Consensus 305 ~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--~~~---~~r~~~r~d~~v~~~ 379 (394) |.+ +..+++++|+|+||++++. .+++.++||||+++|+++++++++|+++++.. |.+ .+|+++|+|++|.+| T Consensus 297 ~~~-~~~~~~~~l~G~pV~~~~~--~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~ 373 (385) T protein:vir:19 297 FGG-PQAFTSNIMWGLPVVPTKA--QAAGTFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRP 373 (385) T ss_pred ccC-cccCCCceecceeeEEcCc--CCCCcEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecc Confidence 965 6677888999999998764 46788999999998999999999999887643 444 579999999999999 Q ss_pred cceEEEEecCcc Q lcl|Aclame:pro 380 KAGYYVTFTPEP 391 (394) Q Consensus 380 ~af~~l~~~~~~ 391 (394) +||+++++++++ T Consensus 374 ~a~~~~~~~aa~ 385 (385) T protein:vir:19 374 TAIIKGTFSSGS 385 (385) T ss_pred cceEEEEeccCC Confidence 999999999999 No 39 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=100.00 E-value=7.7e-60 Score=344.68 Aligned_cols=362 Identities=14% Similarity=0.150 Sum_probs=264.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc Q lcl|Aclame:pro 3 EEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVTQ 82 (394) Q Consensus 3 ~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 82 (394) |++|++|+++++++.++++++.++.+....+ ..++.+++++++..+.++++++++.++..+........... . T Consensus 1 M~~l~el~~~~~~~~~e~~~l~~~~~~e~~~-~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~ 73 (385) T protein:vir:18 1 MSELALIQKAIEESQQKMTQLFDAQKAEIES-TGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPG------E 73 (385) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc------h Confidence 6689999999999999988877666543221 12344555566666666655555444443332221111100 0 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhhee Q lcl|Aclame:pro 83 EEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFT 162 (394) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~ 162 (394) .............+... .. ... ............+...++.++|+++...|++.++..++|+++| T Consensus 74 ~~~~~~~~~~~~~~~~~----~~---~~~--------~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~~~~~~l~~~~ 138 (385) T protein:vir:18 74 KKSFSERAAEELIKSWD----GK---QGT--------FGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPGLRRLTIRDLL 138 (385) T ss_pred hhhhHHHHHHHHHHHHH----Hh---hcc--------chhhHHHhhhccccccCCceecchhhhHHHHHhhhccchhhhc Confidence 00000011111111000 00 000 0000011122344455567888899999999999999999999 Q ss_pred eeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 163 TVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKV 242 (394) Q Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~ 242 (394) +++++.++++.+|+.+.....+.|+.|++..+ .++++|+++++++++++++++||+|+++|+. ++++||.+.|+++++ T Consensus 139 ~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~-~~~~~~~~~~~~~~k~~~~~~is~ell~d~~-~l~~~i~~~la~a~~ 216 (385) T protein:vir:18 139 AQGRTSSNALEYVREEVFTNNADVVAEKALKP-ESDITFSKQTANVKTIAHWVQASRQVMDDAP-MLQSYINNRLMYGLA 216 (385) T ss_pred ceecccCcceEEEEEecCCcceeeeccCcccc-ccccceeEEEEeeeeEEEeehhhHHHHhhHH-HHHHHHHHHHHHHHH Confidence 99999988899998876556666666665555 6789999999999999999999999999985 699999999999999 Q ss_pred HHHHHHHhhcccccccc-----------------ccccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCcee Q lcl|Aclame:pro 243 NTTNDAIAKVLKSFTTK-----------------TVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYL 304 (394) Q Consensus 243 ~~~~~a~~~g~~~~~~~-----------------~~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l 304 (394) .+++.++++|+|++.+. +...++++.+++..+...++ +++|+|||++|..|++++|++|+|| T Consensus 217 ~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~~G~~l 296 (385) T protein:vir:18 217 LKEEGQLLNGDGTGDNLEGLNKVATAYDTSLNATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIALLKDNEGRYI 296 (385) T ss_pred HHHHHHHHhccCCCCcccccccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCCcee Confidence 99999999998876542 12346788888877766655 5799999999999999999999999 Q ss_pred ecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeeccc--ccc---eEEEEEEeccEEecc Q lcl|Aclame:pro 305 LQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEI--YGQ---YLQAVLRFGVSKVDD 379 (394) Q Consensus 305 ~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--~~~---~~r~~~r~d~~v~~~ 379 (394) |.+ +..+++++|+|+||++++. .+++.++||||+++|+++++++++|+++++.. |.+ .+|+++|+|++|.+| T Consensus 297 ~~~-~~~~~~~~l~G~pV~~~~~--~p~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~v~~~ 373 (385) T protein:vir:18 297 FGG-PQAFTSNIMWGLPVVPTKA--QAAGTFTVGGFDMASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALAHYRP 373 (385) T ss_pred ccC-cccCCCceecceeeEEcCc--CCCCcEEEeecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEecc Confidence 965 6677888999999998764 46788999999998999999999999887643 444 579999999999999 Q ss_pred cceEEEEecCcc Q lcl|Aclame:pro 380 KAGYYVTFTPEP 391 (394) Q Consensus 380 ~af~~l~~~~~~ 391 (394) +||+++++++++ T Consensus 374 ~a~~~~~~~aa~ 385 (385) T protein:vir:18 374 TAIIKGTFSSGS 385 (385) T ss_pred cceEEEEeccCC Confidence 999999999999 No 40 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=100.00 E-value=2.4e-59 Score=341.94 Aligned_cols=366 Identities=12% Similarity=0.119 Sum_probs=268.7 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHH--HhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Q lcl|Aclame:pro 2 FEEKIKEIKATIADLNNTIVTKTAQVKN--ALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKE 79 (394) Q Consensus 2 l~e~l~eL~~~~~el~~~~~~~~~e~~~--~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~ 79 (394) ..+..++|+++++++.++++++.++... .++++..+++++++++++.+++++++++++++............... T Consensus 1 m~~~~~~l~~~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~--- 77 (390) T protein:vir:97 1 MTDITAKLEATLANVTDSLKAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVS--- 77 (390) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc--- Confidence 4555567888888888887777665543 34556666777777777777777777665555444332221111110 Q ss_pred ccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhh Q lcl|Aclame:pro 80 VTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLK 159 (394) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~ 159 (394) . .........+..+..... .... ................++..++.++|++++..|++.++..++|+ T Consensus 78 ~-~~~~~~~~~~~~~~~~~~-------~~~~-----~~~~~~~~~~~~~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~ 144 (390) T protein:vir:97 78 V-GDMFVASEQFQASTGRWN-------DRSA-----RATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVR 144 (390) T ss_pred c-hhhhhhhHHHHHHHHHhh-------hhhh-----hhhhHHHHHHHhhhcccccccccccchhhhHHHHHHHhhhhhhH Confidence 0 000000111111110000 0000 00111111222233456677788999999999999999999999 Q ss_pred heeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHH Q lcl|Aclame:pro 160 PFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQ 239 (394) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~ 239 (394) ++|+++++.++.+++|..+..++.+.|+.|++..+ +++++|++++++++++++++++|+|+++|+. ++++||.+.|++ T Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~-~~~~~~~~i~~~~~k~~~~~~is~ell~ds~-~l~~~i~~~la~ 222 (390) T protein:vir:97 145 DLIGSGRTDSALIEYVQETGFVNNAAIVAEGALKP-ESSLKFAKKTDTTHVIAHTMKATRQILSDAP-QLASYMNNRLIR 222 (390) T ss_pred hhcceeeccCCceEEEEEecCCcceeeecCCcccc-ccccceeEEEEeeeeEEEeehhhHHHHHhHH-HHHHHHHHHHHH Confidence 99999999999999999876667777777777766 5789999999999999999999999999985 799999999999 Q ss_pred HHHHHHHHHHhhcccccc-ccc----------------cccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCC Q lcl|Aclame:pro 240 IKVNTTNDAIAKVLKSFT-TKT----------------VKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNG 301 (394) Q Consensus 240 ~~~~~~~~a~~~g~~~~~-~~~----------------~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G 301 (394) +++.+++.++++|+|++. |.+ ...++++.+++..+...++ +++|+|||++|..|++|+|++| T Consensus 223 a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G 302 (390) T protein:vir:97 223 GLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAIELAKDANN 302 (390) T ss_pred HHHHHHHHHHhhcCCCCccccceeeccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcCCC Confidence 999999999999987654 211 2235677777766655554 5789999999999999999999 Q ss_pred ceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeec-ccccce---EEEEEEeccEEe Q lcl|Aclame:pro 302 RYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADN-EIYGQY---LQAVLRFGVSKV 377 (394) Q Consensus 302 ~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~-~~~~~~---~r~~~r~d~~v~ 377 (394) +|||.+. ..+.+++|+|+||++++. .++++++||||+++|++++++++++.++++ .+|.++ +|+++|+|+.|. T Consensus 303 ~~l~~~~-~~~~~~~l~G~pV~~~~~--~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~f~~~~~~~r~~~r~d~~v~ 379 (390) T protein:vir:97 303 QYLIGNA-RGTLTPTLWGLPVVATQA--MAPGEFLVGAFDLAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVY 379 (390) T ss_pred ceeecCc-cCCCCceecceeeEEcCC--CCCCcEEEEeccceEEEEEecceEEEEeecccccccCcEEEEEEEeeccEEe Confidence 9999875 455667999999999764 467889999999888999999999999875 455554 799999999999 Q ss_pred cccceEEEEec Q lcl|Aclame:pro 378 DDKAGYYVTFT 388 (394) Q Consensus 378 ~~~af~~l~~~ 388 (394) +|+||++++++ T Consensus 380 ~~~a~v~~~~a 390 (390) T protein:vir:97 380 RPEALITGSFA 390 (390) T ss_pred ccccEEEEEeC Confidence 99999999999 No 41 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=100.00 E-value=8.6e-60 Score=344.41 Aligned_cols=368 Identities=13% Similarity=0.095 Sum_probs=267.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|Aclame:pro 3 EEKIKEIKATIADLNNTIVTKTAQVKNALESDD--LEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEV 80 (394) Q Consensus 3 ~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~--~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) |.+|.||++++.++.++++++.++++.+..+++ .+++++++++++.++++++.++++++..++............... T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:26 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 889999999999999999999998877655443 356778888999999999988888877765544332222222222 Q ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhh Q lcl|Aclame:pro 81 TQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKP 160 (394) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~ 160 (394) ..........+..+.++.............. .........++.+.||++||+++...|++.++++++|++ T Consensus 81 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~----------~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~ 150 (387) T protein:vir:26 81 LSDNEKMVKAKAEFYRHAILPNEFEKPSMEA----------QRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLRE 150 (387) T ss_pred CchhHHHHHHHHHHHHHHHhhhhHHHHHHHH----------HHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhh Confidence 2222223333444433322111110000000 001111223455667899999999999999999999999 Q ss_pred eeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHH Q lcl|Aclame:pro 161 FTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQI 240 (394) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~ 240 (394) +|++.++++ ..+|........+.|++|++..+ .++++|+++++.+++++++++||+|||+||.++|++||.+.|+++ T Consensus 151 ~~~~~~~~~--~~~p~~~~~~~~a~~v~Eg~~~~-~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~ 227 (387) T protein:vir:26 151 KARLTNIKG--LEIPRVSYTLDDDDFITDVETAK-ELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSG 227 (387) T ss_pred hceeeecCC--ceeeeeeccCCcccccccccccc-ccccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHH Confidence 999988854 56777665555566666666655 578999999999999999999999999999999999999999999 Q ss_pred HHHHHHH-HHhhcccccccc------------ccccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCceeec Q lcl|Aclame:pro 241 KVNTTND-AIAKVLKSFTTK------------TVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLLQ 306 (394) Q Consensus 241 ~~~~~~~-a~~~g~~~~~~~------------~~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l~~ 306 (394) ++++++. .+.+|.|++.+. +...+|++++++..+...++ |++|+||+.++..+..+++..|+|+|. T Consensus 228 ~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~ 307 (387) T protein:vir:26 228 LAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD 307 (387) T ss_pred HHHHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc Confidence 9999765 455565554332 22348899999887666654 789999999998877666666777774 Q ss_pred ccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeecccc-cceEEEEEEeccEEecccceEEE Q lcl|Aclame:pro 307 DDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEIY-GQYLQAVLRFGVSKVDDKAGYYV 385 (394) Q Consensus 307 ~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~-~~~~r~~~r~d~~v~~~~af~~l 385 (394) +.+.+|+|+||+++++ ..+++||||+++|.++ .++.++.+++..+ ...+++++|+|++|++|+||+++ T Consensus 308 -----~~~~~llG~PV~~~~~----~~~~~~GDf~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l 376 (387) T protein:vir:26 308 -----TPAEKVFGKPVVFTDA----AVKPIVGDFNYFGINY--DGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIA 376 (387) T ss_pred -----cCCccccccceEEecC----CCceeeechhhhhhhh--hhhhheecccccCCceEEEEEEEeCcEeechhheEEE Confidence 2456999999999875 3568999999877654 4566666655543 56789999999999999999999 Q ss_pred EecCccCCC Q lcl|Aclame:pro 386 TFTPEPLPL 394 (394) Q Consensus 386 ~~~~~~~~~ 394 (394) +++++..|. T Consensus 377 ~~ka~~~~~ 385 (387) T protein:vir:26 377 KAKENTGPL 385 (387) T ss_pred EeecCCCCC Confidence 999988777 No 42 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=100.00 E-value=8.6e-60 Score=344.41 Aligned_cols=368 Identities=13% Similarity=0.095 Sum_probs=267.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|Aclame:pro 3 EEKIKEIKATIADLNNTIVTKTAQVKNALESDD--LEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEV 80 (394) Q Consensus 3 ~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~--~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) |.+|.||++++.++.++++++.++++.+..+++ .+++++++++++.++++++.++++++..++............... T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:96 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 889999999999999999999998877655443 356778888999999999988888877765544332222222222 Q ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhh Q lcl|Aclame:pro 81 TQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKP 160 (394) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~ 160 (394) ..........+..+.++.............. .........++.+.||++||+++...|++.++++++|++ T Consensus 81 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~----------~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~ 150 (387) T protein:vir:96 81 LSDNEKMVKAKAEFYRHAILPNEFEKPSMEA----------QRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLRE 150 (387) T ss_pred CchhHHHHHHHHHHHHHHHhhhhHHHHHHHH----------HHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhh Confidence 2222223333444433322111110000000 001111223455667899999999999999999999999 Q ss_pred eeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHH Q lcl|Aclame:pro 161 FTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQI 240 (394) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~ 240 (394) +|++.++++ ..+|........+.|++|++..+ .++++|+++++.+++++++++||+|||+||.++|++||.+.|+++ T Consensus 151 ~~~~~~~~~--~~~p~~~~~~~~a~~v~Eg~~~~-~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~ 227 (387) T protein:vir:96 151 KARLTNIKG--LEIPRVSYTLDDDDFITDVETAK-ELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSG 227 (387) T ss_pred hceeeecCC--ceeeeeeccCCcccccccccccc-ccccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHH Confidence 999988854 56777665555566666666655 578999999999999999999999999999999999999999999 Q ss_pred HHHHHHH-HHhhcccccccc------------ccccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCceeec Q lcl|Aclame:pro 241 KVNTTND-AIAKVLKSFTTK------------TVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLLQ 306 (394) Q Consensus 241 ~~~~~~~-a~~~g~~~~~~~------------~~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l~~ 306 (394) ++++++. .+.+|.|++.+. +...+|++++++..+...++ |++|+||+.++..+..+++..|+|+|. T Consensus 228 ~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~ 307 (387) T protein:vir:96 228 LAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD 307 (387) T ss_pred HHHHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc Confidence 9999765 455565554332 22348899999887666654 789999999998877666666777774 Q ss_pred ccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeecccc-cceEEEEEEeccEEecccceEEE Q lcl|Aclame:pro 307 DDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEIY-GQYLQAVLRFGVSKVDDKAGYYV 385 (394) Q Consensus 307 ~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~-~~~~r~~~r~d~~v~~~~af~~l 385 (394) +.+.+|+|+||+++++ ..+++||||+++|.++ .++.++.+++..+ ...+++++|+|++|++|+||+++ T Consensus 308 -----~~~~~llG~PV~~~~~----~~~~~~GDf~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l 376 (387) T protein:vir:96 308 -----TPAEKVFGKPVVFTDA----AVKPIVGDFNYFGINY--DGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIA 376 (387) T ss_pred -----cCCccccccceEEecC----CCceeeechhhhhhhh--hhhhheecccccCCceEEEEEEEeCcEeechhheEEE Confidence 2456999999999875 3568999999877654 4566666655543 56789999999999999999999 Q ss_pred EecCccCCC Q lcl|Aclame:pro 386 TFTPEPLPL 394 (394) Q Consensus 386 ~~~~~~~~~ 394 (394) +++++..|. T Consensus 377 ~~ka~~~~~ 385 (387) T protein:vir:96 377 KAKENTGPL 385 (387) T ss_pred EeecCCCCC Confidence 999988777 No 43 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=100.00 E-value=8.6e-60 Score=344.41 Aligned_cols=368 Identities=13% Similarity=0.095 Sum_probs=267.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|Aclame:pro 3 EEKIKEIKATIADLNNTIVTKTAQVKNALESDD--LEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEV 80 (394) Q Consensus 3 ~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~--~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) |.+|.||++++.++.++++++.++++.+..+++ .+++++++++++.++++++.++++++..++............... T Consensus 1 Mk~l~el~~~~~~~~~~~~~~~~el~e~~~~~~~~~eei~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:94 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 889999999999999999999998877655443 356778888999999999988888877765544332222222222 Q ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhh Q lcl|Aclame:pro 81 TQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKP 160 (394) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~ 160 (394) ..........+..+.++.............. .........++.+.||++||+++...|++.++++++|++ T Consensus 81 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~----------~~~~~a~~~~~~~~gG~lIP~~~~~~Ii~~~~~~~~l~~ 150 (387) T protein:vir:94 81 LSDNEKMVKAKAEFYRHAILPNEFEKPSMEA----------QRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLRE 150 (387) T ss_pred CchhHHHHHHHHHHHHHHHhhhhHHHHHHHH----------HHHHhhhccCCCCCCceeechhHHHHHHHHHHhhchhhh Confidence 2222223333444433322111110000000 001111223455667899999999999999999999999 Q ss_pred eeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHH Q lcl|Aclame:pro 161 FTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQI 240 (394) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~ 240 (394) +|++.++++ ..+|........+.|++|++..+ .++++|+++++.+++++++++||+|||+||.++|++||.+.|+++ T Consensus 151 ~~~~~~~~~--~~~p~~~~~~~~a~~v~Eg~~~~-~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~ 227 (387) T protein:vir:94 151 KARLTNIKG--LEIPRVSYTLDDDDFITDVETAK-ELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSG 227 (387) T ss_pred hceeeecCC--ceeeeeeccCCcccccccccccc-ccccccceeeechheeeeechhhHHHHhhhHHHHHHHHHHHHHHH Confidence 999988854 56777665555566666666655 578999999999999999999999999999999999999999999 Q ss_pred HHHHHHH-HHhhcccccccc------------ccccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCceeec Q lcl|Aclame:pro 241 KVNTTND-AIAKVLKSFTTK------------TVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLLQ 306 (394) Q Consensus 241 ~~~~~~~-a~~~g~~~~~~~------------~~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l~~ 306 (394) ++++++. .+.+|.|++.+. +...+|++++++..+...++ |++|+||+.++..+..+++..|+|+|. T Consensus 228 ~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~~~~~~~~~ 307 (387) T protein:vir:94 228 LAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD 307 (387) T ss_pred HHHHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc Confidence 9999765 455565554332 22348899999887666654 789999999998877666666777774 Q ss_pred ccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeecccc-cceEEEEEEeccEEecccceEEE Q lcl|Aclame:pro 307 DDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEIY-GQYLQAVLRFGVSKVDDKAGYYV 385 (394) Q Consensus 307 ~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~-~~~~r~~~r~d~~v~~~~af~~l 385 (394) +.+.+|+|+||+++++ ..+++||||+++|.++ .++.++.+++..+ ...+++++|+|++|++|+||+++ T Consensus 308 -----~~~~~llG~PV~~~~~----~~~~~~GDf~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~~l 376 (387) T protein:vir:94 308 -----TPAEKVFGKPVVFTDA----AVKPIVGDFNYFGINY--DGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIA 376 (387) T ss_pred -----cCCccccccceEEecC----CCceeeechhhhhhhh--hhhhheecccccCCceEEEEEEEeCcEeechhheEEE Confidence 2456999999999875 3568999999877654 4566666655543 56789999999999999999999 Q ss_pred EecCccCCC Q lcl|Aclame:pro 386 TFTPEPLPL 394 (394) Q Consensus 386 ~~~~~~~~~ 394 (394) +++++..|. T Consensus 377 ~~ka~~~~~ 385 (387) T protein:vir:94 377 KAKENTGPL 385 (387) T ss_pred EeecCCCCC Confidence 999988777 No 44 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=100.00 E-value=1.8e-59 Score=342.65 Aligned_cols=390 Identities=14% Similarity=0.093 Sum_probs=241.3 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhh------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDD------------LEAARSIKAEVEQAKANLVEAENDLKLYESSVE 68 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~------------~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~ 68 (394) =|+.+.+++.+++++++++...+.+|.++.+++.. ..+.++...+++.+.+++++++..+.+.+.... T Consensus 6 ~l~~~~~~~~~~~~~~~~~~~~~~aE~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~e~~~~ 85 (497) T protein:vir:78 6 QLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNL 85 (497) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 24444444444444444444444444433322210 011112222333333333333333332221110 Q ss_pred hccccccccccccchhhhHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHH---HHHhhhhhhhhhhcccccCCccccch Q lcl|Aclame:pro 69 VGGAENIGGKEVTQEEKTYRESVNDFIRSKGKIVNDS---LRFEGKDEVLM---PINETTPVEPQKDGIKKENAKPVSSE 142 (394) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~lvP~ 142 (394) ..............+.......+....+......... ........... ..............++++.+++++|+ T Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~ 165 (497) T protein:vir:78 86 KQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILP 165 (497) T ss_pred hhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccccccch Confidence 0000000000000000000000000000000000000 00000000000 01111111122233455667889999 Q ss_pred hHHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHH Q lcl|Aclame:pro 143 EILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESI 222 (394) Q Consensus 143 ~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell 222 (394) ++...|++.+++.++|+++|++++++++++.+|..+...+.+.|++|++..++ ++++|++|++.+|+++++++||+||| T Consensus 166 ~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~-s~~~f~~i~~~~~k~a~~~~iS~ell 244 (497) T protein:vir:78 166 TFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVANALTITDEGL 244 (497) T ss_pred hhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceeeccCccccc-ccccceeeEeeeeeeEeecHhHHHHH Confidence 99999999999999999999999999999999987766667788888877774 78999999999999999999999999 Q ss_pred hccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccH-------------H------------------------ Q lcl|Aclame:pro 223 DDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTKTVKNL-------------D------------------------ 265 (394) Q Consensus 223 ~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~~~~~-------------~------------------------ 265 (394) +|++ +|++||.+.|+++++.+++.+|++|+|++.|.+..+. . T Consensus 245 ~d~~-~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 323 (497) T protein:vir:78 245 RDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQD 323 (497) T ss_pred HhHH-HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcccccchhhhhh Confidence 9986 6999999999999999999999999998765433210 0 Q ss_pred ---------------------------------HHHHHHHhhhhhh-c-ccEEEEcHHHHHHHHhhhccCCceeeccccc Q lcl|Aclame:pro 266 ---------------------------------EIKALLNGGFDPA-Y-NVSLIVSQSFYQTLDTLKDGNGRYLLQDDIT 310 (394) Q Consensus 266 ---------------------------------~i~~~~~~~~~~~-~-~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~ 310 (394) .+..++....... + ..+|+|||.+|..|++|||++|+|||++... T Consensus 324 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~ 403 (497) T protein:vir:78 324 TVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFG 403 (497) T ss_pred HHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhhcCCCceeccCccc Confidence 0001111111111 1 2379999999999999999999999987532 Q ss_pred C------CCcccccccceEEecCcccccCceEEEecccc-EEEEeecceEEEEeecc--ccc---ceEEEEEEeccEEec Q lcl|Aclame:pro 311 A------VSGKVLLGKPVFVLSDEVLGANKAFIGDFKRG-VLFADRKDLGLRWADNE--IYG---QYLQAVLRFGVSKVD 378 (394) Q Consensus 311 ~------~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~-~~~~~~~~~~i~~~~~~--~~~---~~~r~~~r~d~~v~~ 378 (394) . ..+++|||+||++++++ +.+.++||||+++ |.+++|++++|+++++. +|. ..+|+++|+|+.|++ T Consensus 404 ~~~~~~~~~~~~l~G~pV~~t~~~--~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~ 481 (497) T protein:vir:78 404 NAYGNPVNGGKNIWGVPVVTTPLI--PLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYR 481 (497) T ss_pred ccccccccCCceeeceeeEecCCC--CCCceEEeecccceEEEEEecccEEEeecccchhhhcCcEEEEEEEeecceeec Confidence 2 23459999999997755 5677899999985 56889999999998753 354 458999999999999 Q ss_pred ccceEEEEecCccCCC Q lcl|Aclame:pro 379 DKAGYYVTFTPEPLPL 394 (394) Q Consensus 379 ~~af~~l~~~~~~~~~ 394 (394) |+||++|+++++++-- T Consensus 482 p~A~~~l~~~~~~~~~ 497 (497) T protein:vir:78 482 PSAFQLIQLKKGATGS 497 (497) T ss_pred cccEEEEEecCCccCC Confidence 9999999999999777 No 45 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=100.00 E-value=1.8e-59 Score=342.65 Aligned_cols=390 Identities=14% Similarity=0.093 Sum_probs=241.3 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhh------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDD------------LEAARSIKAEVEQAKANLVEAENDLKLYESSVE 68 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~------------~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~ 68 (394) =|+.+.+++.+++++++++...+.+|.++.+++.. ..+.++...+++.+.+++++++..+.+.+.... T Consensus 6 ~l~~~~~~~~~~~~~~~~~~~~~~aE~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~e~~~~ 85 (497) T protein:vir:10 6 QLEAQGRQLAKSIKDINADETKTAAEKKEALAKIEPDFKAHQAEVEAHERAQEMLKSLGGADAAKDGLDNDIPEVEVRNL 85 (497) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 24444444444444444444444444433322210 011112222333333333333333332221110 Q ss_pred hccccccccccccchhhhHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHH---HHHhhhhhhhhhhcccccCCccccch Q lcl|Aclame:pro 69 VGGAENIGGKEVTQEEKTYRESVNDFIRSKGKIVNDS---LRFEGKDEVLM---PINETTPVEPQKDGIKKENAKPVSSE 142 (394) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~lvP~ 142 (394) ..............+.......+....+......... ........... ..............++++.+++++|+ T Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~ 165 (497) T protein:vir:10 86 KQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTGTFAPGILP 165 (497) T ss_pred hhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCcccccccch Confidence 0000000000000000000000000000000000000 00000000000 01111111122233455667889999 Q ss_pred hHHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHH Q lcl|Aclame:pro 143 EILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESI 222 (394) Q Consensus 143 ~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell 222 (394) ++...|++.+++.++|+++|++++++++++.+|..+...+.+.|++|++..++ ++++|++|++.+|+++++++||+||| T Consensus 166 ~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~~~~~~~~~~~~a~wv~E~~~~~~-s~~~f~~i~~~~~k~a~~~~iS~ell 244 (497) T protein:vir:10 166 TFLPGIVEQLFYELSLADLISSRPVTSPNLSYLTESAAHNNAAAVAEAGTYPF-SSEEFARVYEQVGKVANALTITDEGL 244 (497) T ss_pred hhhHHHHHHHHhhhhHHhhccccccCCCceEEEEEcCCCCcceeeccCccccc-ccccceeeEeeeeeeEeecHhHHHHH Confidence 99999999999999999999999999999999987766667788888877774 78999999999999999999999999 Q ss_pred hccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccH-------------H------------------------ Q lcl|Aclame:pro 223 DDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTKTVKNL-------------D------------------------ 265 (394) Q Consensus 223 ~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~~~~~-------------~------------------------ 265 (394) +|++ +|++||.+.|+++++.+++.+|++|+|++.|.+..+. . T Consensus 245 ~d~~-~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 323 (497) T protein:vir:10 245 RDAP-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQD 323 (497) T ss_pred HhHH-HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcccccchhhhhh Confidence 9986 6999999999999999999999999998765433210 0 Q ss_pred ---------------------------------HHHHHHHhhhhhh-c-ccEEEEcHHHHHHHHhhhccCCceeeccccc Q lcl|Aclame:pro 266 ---------------------------------EIKALLNGGFDPA-Y-NVSLIVSQSFYQTLDTLKDGNGRYLLQDDIT 310 (394) Q Consensus 266 ---------------------------------~i~~~~~~~~~~~-~-~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~ 310 (394) .+..++....... + ..+|+|||.+|..|++|||++|+|||++... T Consensus 324 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~ 403 (497) T protein:vir:10 324 TVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFG 403 (497) T ss_pred HHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhhcCCCceeccCccc Confidence 0001111111111 1 2379999999999999999999999987532 Q ss_pred C------CCcccccccceEEecCcccccCceEEEecccc-EEEEeecceEEEEeecc--ccc---ceEEEEEEeccEEec Q lcl|Aclame:pro 311 A------VSGKVLLGKPVFVLSDEVLGANKAFIGDFKRG-VLFADRKDLGLRWADNE--IYG---QYLQAVLRFGVSKVD 378 (394) Q Consensus 311 ~------~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~-~~~~~~~~~~i~~~~~~--~~~---~~~r~~~r~d~~v~~ 378 (394) . ..+++|||+||++++++ +.+.++||||+++ |.+++|++++|+++++. +|. ..+|+++|+|+.|++ T Consensus 404 ~~~~~~~~~~~~l~G~pV~~t~~~--~~~~~~~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~ 481 (497) T protein:vir:10 404 NAYGNPVNGGKNIWGVPVVTTPLI--PLGTILVGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYR 481 (497) T ss_pred ccccccccCCceeeceeeEecCCC--CCCceEEeecccceEEEEEecccEEEeecccchhhhcCcEEEEEEEeecceeec Confidence 2 23459999999997755 5677899999985 56889999999998753 354 458999999999999 Q ss_pred ccceEEEEecCccCCC Q lcl|Aclame:pro 379 DKAGYYVTFTPEPLPL 394 (394) Q Consensus 379 ~~af~~l~~~~~~~~~ 394 (394) |+||++|+++++++-- T Consensus 482 p~A~~~l~~~~~~~~~ 497 (497) T protein:vir:10 482 PSAFQLIQLKKGATGS 497 (497) T ss_pred cccEEEEEecCCccCC Confidence 9999999999999777 No 46 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=100.00 E-value=1e-58 Score=338.45 Aligned_cols=368 Identities=13% Similarity=0.118 Sum_probs=260.7 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEV 80 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) =++++|+||+++++++.++++++.+++.+...+.+ +..++++++++++.+++++++.++++.+................ T Consensus 3 ~~~k~l~el~~~~~~~~~~~~~~~e~~~~~~~~~~-~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 81 (395) T protein:vir:43 3 DFEKQIGELNASLKQVGDQIKSQAEQVNTQIANFG-EMNKETRAKVDELLTAQGELQARLSAAEQAMLANEKRDGGEEAP 81 (395) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccchh Confidence 23346777777777766666665555443221111 22334455555666666655555544433322211111111000 Q ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhh Q lcl|Aclame:pro 81 TQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKP 160 (394) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~ 160 (394) .............+... ... ...............++..++.++|++++..|++.+++.++|++ T Consensus 82 --~~~~~~~~~~~~~~~~~-------------~~~-~~~~~~~~~~~~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~ 145 (395) T protein:vir:43 82 --KTAGQMVAESLKEQGVT-------------SSL-RGSHRVSMPRSAITSIDGSGGALVAPDRRPGVVAAPQRRLTIRD 145 (395) T ss_pred --hhHHHHHHHHHHHHHHH-------------HHh-hhhhhhhhhhhhhcccCCCCccccchhhHHHHHHHHHhhhhHHh Confidence 00000000000000000 000 00111111223344566677889999999999999999999999 Q ss_pred eeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHH Q lcl|Aclame:pro 161 FTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQI 240 (394) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~ 240 (394) +|++++++++.+.+|+.+..+..+.|++|++..+ .++++|+++++++++++++++||+++++|+. ++++||.+.|+++ T Consensus 146 l~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~-~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-~l~~~v~~~la~a 223 (395) T protein:vir:43 146 LVAPGTTESNSVEYVRETGFVNNAAPVSEGTQKP-YSDLTFELENAPVRTIAHLFKASRQILDDAS-ALQSYIDARARYG 223 (395) T ss_pred hccceecCCCceEEEEEecCCCceeeecCCcccc-ccccceeEEEEeeeeEEEeehhhHHHHHhHH-HHHHHHHHHHHHH Confidence 9999999998899999877667777787777766 5789999999999999999999999999976 6999999999999 Q ss_pred HHHHHHHHHhhcccccccc-cc------------------ccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccC Q lcl|Aclame:pro 241 KVNTTNDAIAKVLKSFTTK-TV------------------KNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGN 300 (394) Q Consensus 241 ~~~~~~~a~~~g~~~~~~~-~~------------------~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~ 300 (394) ++.+++.++++|+|++.+. +. ..++++.+++..+...++ +++|+|||++|..|++++|++ T Consensus 224 ~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~ 303 (395) T protein:vir:43 224 LMLVEECQLLYGNGTGANLHGIIPQAQAYAPPSGVVVTAEQRIDRIRLAILQAQLAEFPASGIVLNPIDWALIELNKDAE 303 (395) T ss_pred HHHHHHHHHHhccCCCCccccccccccccccccccccccchhHHHHHHHHHhhccccCCCcEEEEcHHHHHHHHHhhccC Confidence 9999999999998876552 11 136677777766655554 578999999999999999999 Q ss_pred CceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeeccc--cc---ceEEEEEEeccE Q lcl|Aclame:pro 301 GRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEI--YG---QYLQAVLRFGVS 375 (394) Q Consensus 301 G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--~~---~~~r~~~r~d~~ 375 (394) |+|||.+ +.++.+++|+|+||++++. .+.+.++||||+++|++++|.+++|+++++.+ |. ..+|+++|+|++ T Consensus 304 G~~i~~~-~~~~~~~~l~G~pVv~~~~--~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~ 380 (395) T protein:vir:43 304 NRYIIGS-PQNGTTPTLWRLPVVETQA--ITQDEFLTGAFSLGAQIFDRMDIEVLVSTENDKDFENNMVTIRAEERLAFA 380 (395) T ss_pred Cceeccc-cccCCCceecceeeEEcCC--CCCCcEEEEeccceEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccE Confidence 9999965 6677778999999999764 46778999999998889999999999887643 43 358999999999 Q ss_pred EecccceEEEEecCc Q lcl|Aclame:pro 376 KVDDKAGYYVTFTPE 390 (394) Q Consensus 376 v~~~~af~~l~~~~~ 390 (394) +++|+||++++++++ T Consensus 381 v~~~~a~~~~~~taa 395 (395) T protein:vir:43 381 VYRPEAFVTGSLTAS 395 (395) T ss_pred EecccceEEEEeccC Confidence 999999999999999 No 47 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=100.00 E-value=1.9e-58 Score=337.05 Aligned_cols=383 Identities=14% Similarity=0.131 Sum_probs=242.3 Q ss_pred Ch-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc----- Q lcl|Aclame:pro 1 MF-EEKIKEIKATIADLNNTIVTKTAQVKNALESDD--LEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGA----- 72 (394) Q Consensus 1 ~l-~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~--~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~----- 72 (394) |- +|.++++++++++.+ .+++...++++ .++.+++++++++|+++++.+.++++.+++..+.... T Consensus 1 M~l~el~~~~~~~~~~~~-------a~l~~~~~~~~~~~ee~~~~~~e~~~l~~~~~~l~~~i~~le~~~~~~~~~~~~~ 73 (434) T protein:vir:62 1 MNLKEILNASLTRTKSRL-------AELQGKVEKNEVRSEELAAVKAEVEQLTKEIQTISEELAKLEEKEKEEDPAKKKD 73 (434) T ss_pred CCHHHHHHHHHHHHHHHH-------HHHHHHHhccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc Confidence 53 222233333333322 22222222211 2334445555555555555555554443322111100 Q ss_pred -cccccccccc-hh--------hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----hhhhhhhhhhcccccCCc Q lcl|Aclame:pro 73 -ENIGGKEVTQ-EE--------KTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPIN-----ETTPVEPQKDGIKKENAK 137 (394) Q Consensus 73 -~~~~~~~~~~-~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~ 137 (394) .......... +. .+....+............. ......+.+.... ........+.+.+++.|+ T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~--~~~~~~e~r~a~~~~l~~~~~~~e~~a~~~~t~~GG 151 (434) T protein:vir:62 74 DDPEKKEDPTAKENPNEKTELSEEQRSAISASIAAALSTKGH--RTNKETEIRSVFANYIVGNIDEKEARALGLVTGNGS 151 (434) T ss_pred chhhhhcchhhhcchhhhHHHHHHHHHHHHHHHHhhhhhccc--cchHHHHHHHHHHHHhccccchhhhhhhcccccccc Confidence 0000000000 00 00000111111110000000 0000001111100 011122233455666788 Q ss_pred cccchhHHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCccc-ccccccccccccccccceeeecHhhhhhhhh Q lcl|Aclame:pro 138 PVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMV-TVAELEKNPALAKPDFKDVAWNIDTYRGAIP 216 (394) Q Consensus 138 ~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~ 216 (394) ++||+++++.|++.++.+++|+++|++++++ ++.++|+........+ ++.+++...+.++++|++|++++|+++++++ T Consensus 152 ~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~~~-~~~~~p~~~~~~~a~~~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~ 230 (434) T protein:vir:62 152 VTIPDFLSKEIITYAQEENFLRRLGTGVKTK-ENIKYPVLVKKAEAQGHKNERTNNEMPETDIEFDEIELSPTEFDALAT 230 (434) T ss_pred eecchhhHHHHHHhhhhhhhhhhhcceeccC-CceEEEEEecCCcccceecccccccccccccceeeEEeeheeeEeehh Confidence 9999999999999999999999999998875 5678888764433222 2334455555688999999999999999999 Q ss_pred hhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--------------ccccHHHHHHHHHhhhhhhc-cc Q lcl|Aclame:pro 217 LSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTK--------------TVKNLDEIKALLNGGFDPAY-NV 281 (394) Q Consensus 217 vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~--------------~~~~~~~i~~~~~~~~~~~~-~a 281 (394) ||+||++|+.++|++||.+.|+++++.+++.+|++|+|++.+. +..++|+++++...+...++ ++ T Consensus 231 iS~ell~ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~~~a 310 (434) T protein:vir:62 231 VTKKLLARTGLPIEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKKAVEFKTDEKNLYDALVKMKNTPVKEVRKKA 310 (434) T ss_pred hHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeecccccccccccchhhHHHHHHhhcchhhhcCC Confidence 9999999999999999999999999999999999999876542 12347889998887665554 68 Q ss_pred EEEEcHHHHHHHHhhhccCCceeeccc--ccCCCcccccccceEEecCccccc----CceEEEeccccEEEEeec-ceEE Q lcl|Aclame:pro 282 SLIVSQSFYQTLDTLKDGNGRYLLQDD--ITAVSGKVLLGKPVFVLSDEVLGA----NKAFIGDFKRGVLFADRK-DLGL 354 (394) Q Consensus 282 ~~vm~~~~~~~l~~lkd~~G~~l~~~~--~~~~~~~~l~G~pV~~~~~~~~~~----~~~~~gd~~~~~~~~~~~-~~~i 354 (394) +|+||+.+|..|++|+|++|+|||+|. ..++.+.+|+|+||+++++++.+. ..++||||+++ ++++|. .+++ T Consensus 311 ~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~g~~~tl~G~pV~~~~~~~~~~~~~~~~i~~Gdfs~~-~i~~~~g~~~i 389 (434) T protein:vir:62 311 RWVLNTAALTKIETMKTDDGFPLLRPFNQAEGGIGYTLLGFPVEEEDAIDIPDSPDTPVFYFGDFSKF-YIQDVIGSLEV 389 (434) T ss_pred EEEEcHHHHHHHHHhhccCCCEeeccCCCccCCCCceecceeeEEecCccCccCCCceEEEEeeccce-EEEEeeceeEE Confidence 999999999999999999999999874 355777899999999987765432 23779999975 577775 5778 Q ss_pred EEeecccccc---eEEEEEEeccEEec-ccceEEEEec-CccCCC Q lcl|Aclame:pro 355 RWADNEIYGQ---YLQAVLRFGVSKVD-DKAGYYVTFT-PEPLPL 394 (394) Q Consensus 355 ~~~~~~~~~~---~~r~~~r~d~~v~~-~~af~~l~~~-~~~~~~ 394 (394) +.+.+.++.+ ++|++.|+|+++++ |.+++++++. .+||-- T Consensus 390 ~~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:62 390 QKLVELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAPTGA 434 (434) T ss_pred EeehhhhcccCceEEEEEeeecceeecCcccceEEEEEeccCCCC Confidence 8888876653 58999999999887 8887776555 222222 No 48 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=100.00 E-value=2.7e-58 Score=336.21 Aligned_cols=380 Identities=14% Similarity=0.088 Sum_probs=250.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEV 80 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) ||+|..+...+..+...+++++..++.+.. .+..+++.++++.+.+.++++++...................... T Consensus 1 ~~ke~~~~~~~~~~~~~~e~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 75 (413) T protein:vir:81 1 MVKEAGDAPTNAQVAEIAEVKSMVEQFKAD-----EDAKRERAKSVKANQDFLRELQEATAGSVDSEKSGELTRKGEGYK 75 (413) T ss_pred ChhhHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHhHHhHHHhhhHhhhhhhhh Confidence 999988877766555555554444444332 122222333343333333333322221111111100000000000 Q ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhh Q lcl|Aclame:pro 81 TQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKP 160 (394) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~ 160 (394) .. .....+........ ............................+.+...+++++|+++++.|++.+++.++|++ T Consensus 76 ~~-~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~ii~~~~~~~~l~~ 150 (413) T protein:vir:81 76 SI-GEFFAKRAGDQIKQ----QAGGAQLNYSVGEYVAPRVKAASDPASTATLTDEFQGGYGTTWNRNIIYRRREKLVVAD 150 (413) T ss_pred hh-hhhhhhhhhhHHHH----HHHHHHhhhhhhhhhhhHHHhhhhhhhhcccccccccccchhhHHHHHHHHhhhhhHHh Confidence 00 00000000000000 00000000000001111111112222344556677889999999999999999999999 Q ss_pred eeeeEeecCCceeEEEEecCC---CcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHH Q lcl|Aclame:pro 161 FTTVYQAKKASGKYPVLQRAT---TKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESI 237 (394) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~---~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l 237 (394) +|++++++++.+.+|+.+... ..+.|+.|++..++...+.|+++++++++++++++||+|||+|++ +|.+||.+.| T Consensus 151 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~-~l~~~i~~~l 229 (413) T protein:vir:81 151 LMDNLTMTNTTIKYLMEKANRVVEGGFKTVAEGGKKPYMRFADFDIVTESLSKIAGLTKITDEMIEDYD-FLVSYINARL 229 (413) T ss_pred hcceeeccCCceeEEEeccccccccccceecCcccccccCcccceeeEeeeeeEEEeehhhHHHHHHHH-HHHHHHHHHH Confidence 999999999999999876532 345677777777765457899999999999999999999999986 5999999999 Q ss_pred HHHHHHHHHHHHhhcccccccc-cc---------------ccHHHHHHHHHhhhhh-hc-ccEEEEcHHHHHHHHhhhcc Q lcl|Aclame:pro 238 SQIKVNTTNDAIAKVLKSFTTK-TV---------------KNLDEIKALLNGGFDP-AY-NVSLIVSQSFYQTLDTLKDG 299 (394) Q Consensus 238 ~~~~~~~~~~a~~~g~~~~~~~-~~---------------~~~~~i~~~~~~~~~~-~~-~a~~vm~~~~~~~l~~lkd~ 299 (394) +++++.+++.++++|+|++.+. +. ..++++.+++.....+ .+ ..+|+|||++|..|++|||+ T Consensus 230 a~~~~~~~d~~~l~G~G~~~~~~Gi~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~ 309 (413) T protein:vir:81 230 LEELAIEEERQLLLGDGTGNNLTGLLKRDGIQTLAVSNKDELADSIYKAMTNISLATPFQADALVINPLDYQELRLAKDA 309 (413) T ss_pred HHHHHHHHHHHHhccCCCCCcccccccccccccccccccchhHHHHHHHHHHhhhhccCCCcEEEEcHHHHHHHHHhhcc Confidence 9999999999999999877652 11 1244444544333222 23 34699999999999999999 Q ss_pred CCceeecccccCC-------CcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeecc--ccc---ceEE Q lcl|Aclame:pro 300 NGRYLLQDDITAV-------SGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNE--IYG---QYLQ 367 (394) Q Consensus 300 ~G~~l~~~~~~~~-------~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~--~~~---~~~r 367 (394) +|+|||.+..... .+++|||+||++++++ +++.++||||+++|++++|++++|+++++. +|. ..+| T Consensus 310 ~G~~l~~~~~~~~~~~~~~~~~~~l~G~pv~~s~~~--~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r 387 (413) T protein:vir:81 310 NGQYYGGGVFQGQYGSGGIMLDPAPWGLRTVQSQVV--PVGKPVVGAFRSAASVLRKGGVRIDSTNTNVDDFENNLITVR 387 (413) T ss_pred CCceeccccccccccccccccCceecceeeEEcCCC--CcccEEEEecccEEEEEEecceEEEEeccccchhhcCcEEEE Confidence 9999998754332 3458999999997644 578899999999899999999999998764 343 4689 Q ss_pred EEEEeccEEecccceEEEEecCccCC Q lcl|Aclame:pro 368 AVLRFGVSKVDDKAGYYVTFTPEPLP 393 (394) Q Consensus 368 ~~~r~d~~v~~~~af~~l~~~~~~~~ 393 (394) +++|+|+.+.+|+||+++++++++|| T Consensus 388 ~~~r~d~~~~~~~a~~~l~~~~~~~p 413 (413) T protein:vir:81 388 AEERVGLMVTFPEAIVQLDVAEVVTP 413 (413) T ss_pred EEEeeccEEecccceEEEEecCCCCC Confidence 99999999999999999999999999 No 49 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=100.00 E-value=7.4e-59 Score=339.28 Aligned_cols=370 Identities=13% Similarity=0.095 Sum_probs=264.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDD--LEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGK 78 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~--~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~ 78 (394) =.|.+|.||++++.+++++++++.++++....+.. .+++++++++++.+.+++++++++++..++............. T Consensus 14 ~~mk~l~el~~~~~e~~~~~~~~~~el~~~~~~~~~~~ee~~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~ 93 (402) T protein:vir:93 14 NEMPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAY 93 (402) T ss_pred CCChHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccC Confidence 23478999999999999999999888876555432 3567788888999999999888888777665443322222222 Q ss_pred cccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhh Q lcl|Aclame:pro 79 EVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDL 158 (394) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l 158 (394) ............+..+.++........... ............++++.|+++||++++..|++.++.+++| T Consensus 94 ~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~----------~~~~~~~~a~~~~t~~~GG~lIP~~~~~~Ii~~~~~~~~l 163 (402) T protein:vir:93 94 QSLSDNEKMVKAKAEFYRHAILPNEFEKPS----------MEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQL 163 (402) T ss_pred CCCchhHHHHHHHHHHHHHHHhhhhHHHHH----------HhHHHHHhhhccCCCcCCccccchhHHHHHHHhHHhhhhh Confidence 222222223333333333221111100000 0000111112234556678999999999999999999999 Q ss_pred hheeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHH Q lcl|Aclame:pro 159 KPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESIS 238 (394) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~ 238 (394) +++|+++++++ ..+|........+.|++|++..+ .++++|++|++++|+++++++||+|||+||.++|++||.+.|+ T Consensus 164 ~~~~~v~~~~~--~~~p~~~~~~~~a~~v~Eg~~~~-~~~~~f~~i~~~~~k~~~~i~iS~ell~Ds~~~l~~~i~~~la 240 (402) T protein:vir:93 164 REKARLTNIKG--LEIPRVSYTLDDDDFITDVETAK-ELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQ 240 (402) T ss_pred hhhceeeecCC--ceeeeeeccCCcccccccccccc-ccccccceeeecceeeeeechhhHHHHhhhHHHHHHHHHHHHH Confidence 99999988854 55777655555566666666655 5789999999999999999999999999999999999999999 Q ss_pred HHHHHHHHH-HHhhcccccccc------------ccccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCcee Q lcl|Aclame:pro 239 QIKVNTTND-AIAKVLKSFTTK------------TVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYL 304 (394) Q Consensus 239 ~~~~~~~~~-a~~~g~~~~~~~------------~~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l 304 (394) ++++++++. .+..|.|++.+. +...+|+++++++.+...++ |++|+||+.++..+..+++..|+|+ T Consensus 241 ~~~~~~e~~~~~~~g~g~g~p~g~~~~~~~~~~~~~~~~d~l~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~d~~~~~ 320 (402) T protein:vir:93 241 SGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNF 320 (402) T ss_pred HHHHHHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcc Confidence 999998765 456666554432 22347889998887665554 7899999999888765555566677 Q ss_pred ecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeecccc-cceEEEEEEeccEEecccceE Q lcl|Aclame:pro 305 LQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEIY-GQYLQAVLRFGVSKVDDKAGY 383 (394) Q Consensus 305 ~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~-~~~~r~~~r~d~~v~~~~af~ 383 (394) |. +.+.+|+|+||++++++ .+++||||+++|.+++ ++.++.+++..+ ...++++.|||++|++|+||+ T Consensus 321 ~~-----~~~~~llG~PV~~t~~~----~~i~~GDf~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~r~Dg~v~~~~A~~ 389 (402) T protein:vir:93 321 FD-----TPAEKVFGKPVVFTDAA----VKPIVGDFNYFGINYD--GTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFR 389 (402) T ss_pred cc-----cCCccccccceEEecCC----Cceeeechhhhhhhhh--hhhhhhhhcccCCceEEEEEEEeCcEEechhheE Confidence 64 24569999999998753 5689999998876654 455566555443 567899999999999999999 Q ss_pred EEEecCccCCC Q lcl|Aclame:pro 384 YVTFTPEPLPL 394 (394) Q Consensus 384 ~l~~~~~~~~~ 394 (394) .++++++..|. T Consensus 390 ~l~ik~~~~~~ 400 (402) T protein:vir:93 390 IAKAKENTGPL 400 (402) T ss_pred EEEeecCCCCC Confidence 99998887666 No 50 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=100.00 E-value=1.3e-58 Score=337.86 Aligned_cols=367 Identities=13% Similarity=0.109 Sum_probs=260.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|Aclame:pro 3 EEKIKEIKATIADLNNTIVTKTAQVKNALESDD--LEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEV 80 (394) Q Consensus 3 ~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~--~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) |.+|.||++++.+++++++.+.++++....+++ .+++++++++++.|+++++.++++++.++................ T Consensus 1 Mk~l~el~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 80 (387) T protein:vir:93 1 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVKDIEEKEKAKVKDTGEAYQS 80 (387) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccCCC Confidence 889999999999999999999999887665543 356777888899999999888887777654433222211111111 Q ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhh Q lcl|Aclame:pro 81 TQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKP 160 (394) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~ 160 (394) ..........+..+.++........ ... .............+.+.||++||+++.+.|++.+++.++|++ T Consensus 81 ~~~~~~~~~~~~~~~r~~~~~~~~~------~~~----~~~~~~~~al~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~l~~ 150 (387) T protein:vir:93 81 LNDHEKMVKAKAEFYRHAILPNEFE------KPS----MEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLRE 150 (387) T ss_pred cchhhHHHHHHHHHHHHHhhhhhhh------hhh----hhhHHHHHhhccCcCCCCceeechhHHHHHHHHHHhhchhhh Confidence 1111122223333333221111000 000 000001111223456677899999999999999999999999 Q ss_pred eeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHH Q lcl|Aclame:pro 161 FTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQI 240 (394) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~ 240 (394) +|+++++++ ..+|....+...+.|+.|++..+ .++++|++|++++++++++++||+|||+||.+||++||.+.|+++ T Consensus 151 ~~~v~~~~~--~~~p~~~~~~~~a~~v~E~~~~~-~~~~~f~~v~~~~~k~~~~~~iS~ell~Ds~~~l~~~i~~~la~~ 227 (387) T protein:vir:93 151 KARLTNIKG--LEIPRVSYTLDDDDFITDVETAK-ELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSG 227 (387) T ss_pred heeeeecCC--ceEEEEeecCCccccccCccccc-ccccccceeeeeheeeeeechhhHHHHhhhHHHHHHHHHHHHHHH Confidence 999988864 56776655555566666666655 578999999999999999999999999999999999999999999 Q ss_pred HHHHHHH-HHhhccccccccc------------cccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHH-HhhhccCCceee Q lcl|Aclame:pro 241 KVNTTND-AIAKVLKSFTTKT------------VKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTL-DTLKDGNGRYLL 305 (394) Q Consensus 241 ~~~~~~~-a~~~g~~~~~~~~------------~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l-~~lkd~~G~~l~ 305 (394) ++++++. ++.+|.|++.+.+ ...+|+|+++++.+...++ +++|+||+.++..+ ++++|++|.|++ T Consensus 228 ~~~~e~~~~~~~g~g~g~p~g~l~~~~~~~v~~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~~~~~~~d~~~~~~~ 307 (387) T protein:vir:93 228 LAAKERKDALAVSPKSGLDHMSFYNGSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFD 307 (387) T ss_pred HHHHHHHhHhhcCCCccccceeeeccccccccccchHHHHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccc Confidence 9999766 4556666554322 2347899999887666654 68999999998665 566676665443 Q ss_pred cccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeecccc-cceEEEEEEeccEEecccceEE Q lcl|Aclame:pro 306 QDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEIY-GQYLQAVLRFGVSKVDDKAGYY 384 (394) Q Consensus 306 ~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~-~~~~r~~~r~d~~v~~~~af~~ 384 (394) +.+.+|+|+||++++++ .+++||||+++|.++ .++.++...+... ...++++.|+|++|++|+||+. T Consensus 308 ------~~~~~llG~PV~~~~~~----~~~~~GDf~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~eA~~~ 375 (387) T protein:vir:93 308 ------TPAEKVFGKPVVFTDAA----VKPIVGDFNYFGINY--DGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRI 375 (387) T ss_pred ------cCCccccccceEEecCC----Cceeeeehhhhheeh--hhheeeecccccCCceeEEEEeeeCceeechhheEE Confidence 23569999999998753 467999999876553 4566665554433 3468889999999999999999 Q ss_pred EEecCccCCC Q lcl|Aclame:pro 385 VTFTPEPLPL 394 (394) Q Consensus 385 l~~~~~~~~~ 394 (394) +++++++.|. T Consensus 376 l~~k~~~~~~ 385 (387) T protein:vir:93 376 AKAKENTGSL 385 (387) T ss_pred EEeecCCCCC Confidence 9998888766 No 51 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=100.00 E-value=4.6e-58 Score=334.94 Aligned_cols=360 Identities=13% Similarity=0.145 Sum_probs=244.6 Q ss_pred ChHHHHHHHHHHHHHHHHHH----HHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTI----VTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIG 76 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~----~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~ 76 (394) |- ++|++++++++.+++ ++..++.+...+..+.+.......+++++++++..++++++..+............ T Consensus 1 m~---~~e~~~~~~~~~~~l~~~~~~~~~e~~~~~e~~~~~~~~~~~~~~~e~~~~~~~l~~~~~~~e~~~~~~~~~~~~ 77 (379) T protein:vir:10 1 ME---ALEIKVALEAIKGQVDSKSSAQALEVKGLIEALEAKMTSEKDLAVNELKSDMAALQAHADKLDVKLKEKAKSEDK 77 (379) T ss_pred CC---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 43 333444443333333 33333333322222211111223334455555555555554444333222211110 Q ss_pred cccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhh Q lcl|Aclame:pro 77 GKEVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVV 156 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~ 156 (394) .......+.... ......+.. . .......+...+++.++.++|+.+...|++.+++.+ T Consensus 78 -------~~~~~~~~~~~~-------~~~~~~~~~-------~-~~~~~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~ 135 (379) T protein:vir:10 78 -------SDSLVKSITENF-------NDIKEVRNG-------K-SIQVKAVGDMTLPVNLTGAQPKDYNFDVVLNPSQML 135 (379) T ss_pred -------chhHHHHHHHHH-------HhHHHHHhh-------h-hhhhhhhcccccCCCCccccchhhhhHHHHhHHhhh Confidence 000111111000 000000000 0 001112233345555566899999999999999999 Q ss_pred hhhheeeeEeecCCceeEEEEecCCCccccc-ccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHH Q lcl|Aclame:pro 157 DLKPFTTVYQAKKASGKYPVLQRATTKMVTV-AELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSE 235 (394) Q Consensus 157 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~ 235 (394) +|+++|++++++++++.+|+.+..+...+.| .|++..+ .++++|++|++++++++++++||+|||+|++ +|.+||.+ T Consensus 136 ~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Eg~~~~-~~~~~f~~i~~~~~k~~~~~~iS~ell~D~~-~l~~~i~~ 213 (379) T protein:vir:10 136 NVSDIVGAVSISGGTYTFVRENGAGEGAIGAQVEGATKG-QKDYDISMIDVNTDFIAGFTRYSKKMANNLP-FLTSFIPN 213 (379) T ss_pred hHHhhceeeeccCCceEEEEeecCCCcccccccCCcccc-ccccceeeeEeeeeeEEeeehhhHHHHhhHH-HHHHHHHH Confidence 9999999999999999999887655444444 5555555 5789999999999999999999999999986 59999999 Q ss_pred HHHHHHHHHHHHHHhhccccccccc------cccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCceeeccc Q lcl|Aclame:pro 236 SISQIKVNTTNDAIAKVLKSFTTKT------VKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLLQDD 308 (394) Q Consensus 236 ~l~~~~~~~~~~a~~~g~~~~~~~~------~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l~~~~ 308 (394) +|++.++.+++.++++|+++.++.+ ..+++++.+++..+..+++ +++|+|||++|..|++|||++|+|+|+|+ T Consensus 214 ~la~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~d~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~ 293 (379) T protein:vir:10 214 ALRRDYAKAENAAFNAVLAANATASTEIITNKNKVEMLINEIAKQENLDFPVTAIVLRPTDYYDILVTQKSVGAGYGLPG 293 (379) T ss_pred HHHHHHHHHHHHHHhcccccccccccccccCcccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCceeccCC Confidence 9999999999999999988654332 3457788888776666655 57899999999999999999999999987 Q ss_pred cc--CCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeecc--cccc---eEEEEEEeccEEecccc Q lcl|Aclame:pro 309 IT--AVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNE--IYGQ---YLQAVLRFGVSKVDDKA 381 (394) Q Consensus 309 ~~--~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~--~~~~---~~r~~~r~d~~v~~~~a 381 (394) +. .+.+.+|||+||++++. .+++.++||||+++++ ..|.++.|+++++. +|.+ .+|+++|+|++|++|+| T Consensus 294 ~~~~~~~~~~l~G~pvv~s~~--~~ag~~~~gdf~~~~~-~~~~~~~i~~~~~~~~~f~~~~~~~r~~~R~~~~v~~p~a 370 (379) T protein:vir:10 294 VVTQDNGVLRINGIPLFRATW--LAANKYYVGDWTRVTK-VTTEGLSLEFSEVEGTNFVKNNITARIEAQVALAVEQPAA 370 (379) T ss_pred ccCCCCCcceecceeeEecCC--CCCCceEEeecccEEE-EEEeceEEEEeecccccccCCcEEEEEEEEeccEEecCcc Confidence 64 45667999999998664 4677899999998654 46889999987654 3544 58999999999999999 Q ss_pred eEEEEecCc Q lcl|Aclame:pro 382 GYYVTFTPE 390 (394) Q Consensus 382 f~~l~~~~~ 390 (394) ||++++++. T Consensus 371 ~v~~~~~~~ 379 (379) T protein:vir:10 371 LIFGDFTAV 379 (379) T ss_pred EEEEEecCC Confidence 999999999 No 52 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=100.00 E-value=8.1e-58 Score=333.58 Aligned_cols=377 Identities=13% Similarity=0.140 Sum_probs=259.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH--HHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-cccccccccc Q lcl|Aclame:pro 3 EEKIKEIKATIADLNNTIVTKTAQVK--NALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEV-GGAENIGGKE 79 (394) Q Consensus 3 ~e~l~eL~~~~~el~~~~~~~~~e~~--~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~-~~~~~~~~~~ 79 (394) |.+|++|+++++++.++++++.+... ..+++++.++++++++++++|+++++++++..+........ .......... T Consensus 1 M~kl~~L~e~r~~l~~~~~~l~~~~~e~~~lt~ee~~~~~~l~~e~~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~~ 80 (428) T protein:vir:10 1 MPQIEELRRQRAGINEQIQALATIEATNGTLTAEQLTEFAGLQQQFTDISAKMDRMEATERAAALVAKPVKATQHGPAVI 80 (428) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhchhhccccc Confidence 77899999999999999988776432 24677788888999999999999988766443332221111 1111110000 Q ss_pred cc-chhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhh Q lcl|Aclame:pro 80 VT-QEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDL 158 (394) Q Consensus 80 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l 158 (394) .. .........+....+......... .......................++.|+++||+++.+.|++.+++.++| T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~gg~liP~~~~~~ii~~l~~~~~l 156 (428) T protein:vir:10 81 VKAEPKQYTGAGMTRMVMSIAAAQGNL----QDAAKFASDELNDQSVSMAISTAAGSGGVLIPQNIHSEVIELLRDRTIV 156 (428) T ss_pred cccccchhhhHHHHHHHHHHHHhhhhH----HHHHHHhhhhhhhhhHhhhhcccccCCccccchhHHHHHHHHHhhhchh Confidence 00 001111111111111110000000 0000000000011111222234445678899999999999999999999 Q ss_pred hhe-eeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHH Q lcl|Aclame:pro 159 KPF-TTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESI 237 (394) Q Consensus 159 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l 237 (394) +++ ++++++.++.+.+|+.+. ...+.|++|++..+ .++++|++|++++++++++++||+|+++||.++|++||.+.| T Consensus 157 ~~~~~~~~~~~~g~~~~p~~~~-~~~a~~v~Eg~~~~-~~~~~f~~i~~~~~k~~~~v~is~ell~ds~~~l~~~i~~~l 234 (428) T protein:vir:10 157 RKLGARSIPLPNGNMSLPRLAG-GATASYTGENQDAK-VSEARFDDVKLTAKTMIAMVPISNALIGRAGFNVEQLVLQDI 234 (428) T ss_pred hhhcceeeecCCcceEEEEEeC-CcceeeeccCcccc-ccccceeeEEeeeEEEEEeehhhHHHHhhhhHHHHHHHHHHH Confidence 998 788888888899998764 45566676666666 578999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhccccc-ccccc------------------ccHHHHHHHHHhhh------hh-hcccEEEEcHHHHH Q lcl|Aclame:pro 238 SQIKVNTTNDAIAKVLKSF-TTKTV------------------KNLDEIKALLNGGF------DP-AYNVSLIVSQSFYQ 291 (394) Q Consensus 238 ~~~~~~~~~~a~~~g~~~~-~~~~~------------------~~~~~i~~~~~~~~------~~-~~~a~~vm~~~~~~ 291 (394) +++++.+++.+|++|+|++ .|.|. .+++.+...+..+. .. ..+++|+||+.++. T Consensus 235 ~~ai~~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~ 314 (428) T protein:vir:10 235 LTAISVREDKAFMRDDGTGDTPIGMKARATQWNRLLPWAADAAVNLDTIDTYLDSIILMSMDGNSNMISSGWGMSNRTYM 314 (428) T ss_pred HHHHHHHHHHHHhccCCCCccccccccccccccccccccccccccHHHHHHHHHHHHHhhhccccccccCEEEEcHHHHH Confidence 9999999999999999875 23221 12333332222211 11 23689999999999 Q ss_pred HHHhhhccCCceeecccccCCCcccccccceEEecCccc------ccCceEEEeccccEEEEeecceEEEEeeccc---- Q lcl|Aclame:pro 292 TLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVL------GANKAFIGDFKRGVLFADRKDLGLRWADNEI---- 361 (394) Q Consensus 292 ~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~------~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~---- 361 (394) .|++|+|++|+|+|.+ . .+++|+|+||++++.++. +..+++||||++ |+++++++++++.+++.. T Consensus 315 ~L~~lkd~~G~~i~~~-~---~~g~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~-~~i~~~~~i~i~~~~~~~~~~~ 389 (428) T protein:vir:10 315 KLFGLRDGNGNKVYPE-M---AQGMLKGYPIQRTSAIPANLGEGGKESEIYFADFND-VVIGEDGNMKVDFSKEASYIDT 389 (428) T ss_pred HHHHhhccCCceeccC-C---CCCeeeceeeEEeccccccccCCCccceEEEEecce-EEEEEecceEEEeecccccccc Confidence 9999999999999964 2 234899999999775432 234689999996 667889999999887632 Q ss_pred -------cc---ceEEEEEEeccEEecccceEEEEecCc Q lcl|Aclame:pro 362 -------YG---QYLQAVLRFGVSKVDDKAGYYVTFTPE 390 (394) Q Consensus 362 -------~~---~~~r~~~r~d~~v~~~~af~~l~~~~~ 390 (394) |. ..+|+++|+|++|.+|+||+.++-..= T Consensus 390 ~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 390 DGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred cccccchhhcchhheeeeeeeCceeeccceEEEEeccCC Confidence 33 358999999999999999999988777 No 53 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=100.00 E-value=2.7e-57 Score=330.70 Aligned_cols=379 Identities=11% Similarity=0.143 Sum_probs=260.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH--HHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc----cc-c Q lcl|Aclame:pro 4 EKIKEIKATIADLNNTIVTKTAQVK--NALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAE----NI-G 76 (394) Q Consensus 4 e~l~eL~~~~~el~~~~~~~~~e~~--~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~----~~-~ 76 (394) =+|+||+++++++.++.+++.+... ..+++++.++++++++++++|+++|+++++..+............ .. . T Consensus 1 M~i~eL~e~r~~~~~~~~~l~~~~~e~~~lt~ee~~~~~~l~~ei~~l~~~I~~~e~~~~~~~~~~~~~~~~~~~~~~~~ 80 (435) T protein:vir:14 1 MNVNELRRERAAVNQRVQALAQIEVGGTALSVEQQAEFDQLSSKFSELTAQIERAEAAERMAAAAAVPVDPNPTAVAAPA 80 (435) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccchhhhhhhcc Confidence 2678888888888888877655432 246777788889999999999998887765433322211100000 00 0 Q ss_pred c---ccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHH Q lcl|Aclame:pro 77 G---KEVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVK 153 (394) Q Consensus 77 ~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~ 153 (394) . .............+..+.+............... .................+...|++++|+++...|++.++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~l~ 157 (435) T protein:vir:14 81 AAPVHAQPKALEVKGAKMARMVRALAAARGDAQLASKL---AIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLR 157 (435) T ss_pred ccccccccchhhhhHHHHHHHHHHHHhhcchhhHHHHH---HHhhhhhhhhhhhcccCCcCCCccccchhHHHHHHHHHh Confidence 0 0000111111112222222211111110000000 000000011111223445566788999999999999999 Q ss_pred hhhhhhhe-eeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccH--HHHH Q lcl|Aclame:pro 154 TVVDLKPF-TTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDAD--VDLV 230 (394) Q Consensus 154 ~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~--~~l~ 230 (394) ..++++.+ ++++++.++...+|+.+. ...+.|+.|++..+ .++++|++|+++++++++++++|+||++|+. ++|+ T Consensus 158 ~~~~i~~~~~~~~~~~~~~~~~p~~~~-~~~a~~v~E~~~~~-~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~~~l~ 235 (435) T protein:vir:14 158 PKSVVRKLGARTLPLSNGNITIPRLKG-GAIVGYIGADTDIP-TTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVD 235 (435) T ss_pred hhchhhhhcceeeecCCCceEEEEEeC-CcceeeeccCcccc-ccccceeEEEeeeEEEEEeehhhHHHHHhhccCHHHH Confidence 99999987 888999888899998764 45566666666665 6889999999999999999999999999985 4699 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcccccc-cccc---------------ccHH----HHHHHHHhhhhh---hcccEEEEcH Q lcl|Aclame:pro 231 GIVSESISQIKVNTTNDAIAKVLKSFT-TKTV---------------KNLD----EIKALLNGGFDP---AYNVSLIVSQ 287 (394) Q Consensus 231 ~~i~~~l~~~~~~~~~~a~~~g~~~~~-~~~~---------------~~~~----~i~~~~~~~~~~---~~~a~~vm~~ 287 (394) +||.+.|+++++.+++.+|++|+|++. |.+. .+++ ++.+++..+... ..+++|+||| T Consensus 236 ~~i~~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~v~n~ 315 (435) T protein:vir:14 236 QIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPSNVITASDASTLQKIETDLGKVILALENADANLTQPGWIMAP 315 (435) T ss_pred HHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccceeccccccchhhHHHHHHHHHHHhhhccccccCCEEEEcH Confidence 999999999999999999999988752 3222 1222 334444333322 2378999999 Q ss_pred HHHHHHHhhhccCCceeecccccCCCcccccccceEEecCccc------ccCceEEEeccccEEEEeecceEEEEeeccc Q lcl|Aclame:pro 288 SFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVL------GANKAFIGDFKRGVLFADRKDLGLRWADNEI 361 (394) Q Consensus 288 ~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~------~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~ 361 (394) ++|..|++++|++|+|+|. ... +++|+|+||++++.++. +.+.++||||++ |++++|+++++.++++.. T Consensus 316 ~~~~~L~~lkd~~G~~l~~-~~~---~g~l~G~Pv~~~~~~p~~~~~~~~~~~i~~gd~s~-~~i~~~~~~~~~~~~~~~ 390 (435) T protein:vir:14 316 RTFRFLEGLRDGNGNKVYP-ELA---NGMLKGYPVGKTTQVPINLGETGKESEIYFTDFGD-VFIGEEETLEIDYSKEAT 390 (435) T ss_pred HHHHHHHHhhccCCceecc-CCC---CCeeecceeEeeccccccccCCCccceEEEeeccc-EEEEEecccEEEEecccc Confidence 9999999999999999994 333 35899999999775432 334689999997 567899999999988643 Q ss_pred -----------c---cceEEEEEEeccEEecccceEEEEecCccC Q lcl|Aclame:pro 362 -----------Y---GQYLQAVLRFGVSKVDDKAGYYVTFTPEPL 392 (394) Q Consensus 362 -----------~---~~~~r~~~r~d~~v~~~~af~~l~~~~~~~ 392 (394) | ...+|+++|+|++|.+|+||+.|+-.+-.+ T Consensus 391 ~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:14 391 YKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVAWGA 435 (435) T ss_pred ccccccchhhhhhcChhheeeeeeeCceeecccceEEEecCCCCC Confidence 3 346899999999999999999999999999 No 54 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=100.00 E-value=4.9e-57 Score=329.32 Aligned_cols=383 Identities=10% Similarity=0.015 Sum_probs=234.3 Q ss_pred ChHHHHHH----------------------HHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MFEEKIKE----------------------IKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAEN 58 (394) Q Consensus 1 ~l~e~l~e----------------------L~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~ 58 (394) |-++++++ ++..+++.+.+....+.++......+..+..+...++++.+.++..+..+ T Consensus 3 ~~~~~~~~e~~~~e~a~~~~~~~~~~k~~e~~~~~ke~~~~~l~~~~e~~~k~~~E~~~~le~~~ee~k~l~ee~~~~~~ 82 (458) T protein:vir:10 3 IDINKLKEELGLGDLAKSLEGLTAAQKAQEAERMRKEQEEKELARMNDLVSKAVGEDRKRLEEALELVKSLDEKSKKSNE 82 (458) T ss_pred cchhhhhhhhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22222222 11111111111000000000000011111122222233333322222211 Q ss_pred HHHHHHHHHhh---------------cccccccc-ccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Q lcl|Aclame:pro 59 DLKLYESSVEV---------------GGAENIGG-KEVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETT 122 (394) Q Consensus 59 ~~~~~~~~~~~---------------~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 122 (394) ........... ........ ..................+ . ......... .......... T Consensus 83 ~~a~~~e~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~--~-~~~~~~~~~---~~~~~~~~~~ 156 (458) T protein:vir:10 83 LFAQTVEKQQETIVGLQDEIKSLLTAREGRSFVGDSVAKALYGTQENFEDEVEK--L-VLLSYVMEK---GVFETEHGQR 156 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccchhhhhhHHHHHHH--H-HHHHHHHhh---ccchhhhhhh Confidence 11110000000 00000000 0000000000000000000 0 000000000 0000000111 Q ss_pred hhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCcccccccccccccc-----c Q lcl|Aclame:pro 123 PVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPAL-----A 197 (394) Q Consensus 123 ~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~-----~ 197 (394) .......+.+...++.++|+.+.+.|++.+++.++|+++|++++++++...+|+.. ..+.+.|+.|++..++. + T Consensus 157 ~~~a~~~~~~~~~g~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~-~~~~a~~v~e~~~~~~~~~~~~~ 235 (458) T protein:vir:10 157 HLKAVNQSSSVEVSSESYETIFSQRIIRDLQKELVVGALFEELPMSSKILTMLVEP-DAGKATWVAASTYGTDTTTGEEV 235 (458) T ss_pred hhhhhhhcccCccccceehhhHhHHHHHHHHhhhhHHhhcceeecCCcceEEEEec-CCcceeecccccccccccccccc Confidence 11111223445567889999999999999999999999999999999888888764 34556666766666543 4 Q ss_pred ccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc---------------- Q lcl|Aclame:pro 198 KPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTKTV---------------- 261 (394) Q Consensus 198 ~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~~---------------- 261 (394) +++|+++++++++++++++||+++++|+.++|.+||.+.|+++++.+++.++++|+|++.|.+. T Consensus 236 ~~~~~~i~~~~~k~~~~v~is~ell~ds~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~ 315 (458) T protein:vir:10 236 KGALKEIHFSTYKLAAKSFITDETEEDAIFSLLPLLRKRLIEAHAVSIEEAFMTGDGSGKPKGLLTLASEDSAKVVTEAK 315 (458) T ss_pred cccceeeEeeeeeEEeeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceeeecccccccceeeccc Confidence 6789999999999999999999999999999999999999999999999999999988765332 Q ss_pred ------ccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCceeecccc----cCCCcccccccceEEecCccc Q lcl|Aclame:pro 262 ------KNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLLQDDI----TAVSGKVLLGKPVFVLSDEVL 330 (394) Q Consensus 262 ------~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~----~~~~~~~l~G~pV~~~~~~~~ 330 (394) .+++++++++..+...++ +++|+||+++|..|++|+|++|+|+|.+.. ..+.+++|||+||++++.++. T Consensus 316 ~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pv~~~~~~p~ 395 (458) T protein:vir:10 316 ADGSVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDSVKLQGQVGRIYGLPVVVSEYFPA 395 (458) T ss_pred ccccccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHhhcccCCceeeccccccccccCcCceecceeeEEcccccc Confidence 357889998877665554 689999999999999999999999997643 345567999999999876654 Q ss_pred --ccCceEEEeccccEEEEeecceEEEEeeccc-ccceEEEEEEeccEEecccceEEEEecCc Q lcl|Aclame:pro 331 --GANKAFIGDFKRGVLFADRKDLGLRWADNEI-YGQYLQAVLRFGVSKVDDKAGYYVTFTPE 390 (394) Q Consensus 331 --~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~-~~~~~r~~~r~d~~v~~~~af~~l~~~~~ 390 (394) +++.++||||+.+|+++++.+++|..+++.. ....+|+..|||+.|++|+||+++++++. T Consensus 396 ~~~~~~~~~~~f~~~~~~~~~~~~~v~~d~~~~~~~~~~~~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 396 KANSAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred ccCCcceEEEEecccEEEEEeeceEEEeecccCCCceEEEEEEEecceEecccceEEEeeccC Confidence 3467899999988999999999988765532 23468999999999999999999998888 No 55 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=100.00 E-value=2e-56 Score=325.90 Aligned_cols=379 Identities=11% Similarity=0.147 Sum_probs=262.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH--HhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-----ccc Q lcl|Aclame:pro 4 EKIKEIKATIADLNNTIVTKTAQVKN--ALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAE-----NIG 76 (394) Q Consensus 4 e~l~eL~~~~~el~~~~~~~~~e~~~--~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~-----~~~ 76 (394) =+|+||+++++++.++++++.+.... .+++++.++++++++++++|+++++++++..+............ ... T Consensus 1 M~l~eL~~~r~~~~~~~~~l~~~~~e~~~l~~ee~~~~~~l~~ei~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~~~ 80 (435) T protein:vir:80 1 MNVNELRRERAAVNQRVQALAQIEVGGTALSVEQQAEFDQLSSKFNELTAQIERAEAAERMAAAAAVPVDPNPAAVTASA 80 (435) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccchhhhhcccc Confidence 15788888888888888887654432 46778888999999999999999988765433222211110000 000 Q ss_pred cccc---cchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHH Q lcl|Aclame:pro 77 GKEV---TQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVK 153 (394) Q Consensus 77 ~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~ 153 (394) .... ..........+..+.+............... ... ..............+...|++++|+++.+.|++.++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~--~~~~~~~~~~~~~~~~~~gg~lvP~~~~~~ii~~l~ 157 (435) T protein:vir:80 81 AAPVYAQPKAPEVKGAKMARMVRALAAARGDAQLASKL-AIE--RGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLR 157 (435) T ss_pred ccccccccchhhhhHHHHHHHHHHHHhccchhHHHHHH-HHh--hhhhhhhhhhhcccCCCCCccccchhHHHHHHHHHh Confidence 0000 0001111111222222211111111100000 000 001111111122345566788999999999999999 Q ss_pred hhhhhhhe-eeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccH--HHHH Q lcl|Aclame:pro 154 TVVDLKPF-TTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDAD--VDLV 230 (394) Q Consensus 154 ~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~--~~l~ 230 (394) +.++|+++ ++++++.++...+|+.+. ...+.|+.|++..+ .++++|++|++.+++++++++||+|+++|+. ++|+ T Consensus 158 ~~~~i~~~~~~~v~~~~~~~~~p~~~~-~~~a~~v~E~~~~~-~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~l~ 235 (435) T protein:vir:80 158 PKSVVRKLGARTLPLSNGNITIPRLKG-GAIVGYIGADTDIP-TTQQQFDDLKLTAKKMAALVPIANDLIKYAGVNPNVD 235 (435) T ss_pred hhchhhhccceeeecCCCceEEEEEeC-CcceeeeccCcccc-ccccceeeEEEeeEEEEEeehhhHHHHHhhcccHHHH Confidence 99999998 889999999999998764 45566677766666 5889999999999999999999999999985 4799 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcccccc-cccc---------------ccHH----HHHHHHHhhhhh--h-cccEEEEcH Q lcl|Aclame:pro 231 GIVSESISQIKVNTTNDAIAKVLKSFT-TKTV---------------KNLD----EIKALLNGGFDP--A-YNVSLIVSQ 287 (394) Q Consensus 231 ~~i~~~l~~~~~~~~~~a~~~g~~~~~-~~~~---------------~~~~----~i~~~~~~~~~~--~-~~a~~vm~~ 287 (394) +||.+.|+++++.+++.++++|+|++. |.+. .+.+ ++.+++..+... + .+++|+||+ T Consensus 236 ~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~ 315 (435) T protein:vir:80 236 QIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPGNVITASDGSTLQKIETDLGKAILALENADANLTQPGWIMAP 315 (435) T ss_pred HHHHHHHHHHHHHHHHHHhhccCCCCCcccceeecccccceeecccccchhhHHHHHHHHHHHhhccccccccCEEEEcH Confidence 999999999999999999999988642 3222 1222 344444443322 2 368999999 Q ss_pred HHHHHHHhhhccCCceeecccccCCCcccccccceEEecCccc------ccCceEEEeccccEEEEeecceEEEEeeccc Q lcl|Aclame:pro 288 SFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVL------GANKAFIGDFKRGVLFADRKDLGLRWADNEI 361 (394) Q Consensus 288 ~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~------~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~ 361 (394) .++..|++++|++|+|+|. ... +++|+|+||++++.++. +...++||||++ |++++|++++|+++++.. T Consensus 316 ~~~~~L~~lkd~~G~~l~~-~~~---~~~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~-~~i~~~~~~~i~~~~~~~ 390 (435) T protein:vir:80 316 RTFRFLEGLRDGNGNKVYP-ELA---NGMLKGYPVGKTTQVPINLGEAGKESEIYFTDFGD-VFIGEEETLEIDYSKEAT 390 (435) T ss_pred HHHHHHHhhhccCCceecc-CCC---CCeEeeeeeEEeccccccccCCCCcceEEEEEccc-EEEEeecceEEEEecccc Confidence 9999999999999999994 333 35899999999775432 234689999997 568899999999988753 Q ss_pred -----------c---cceEEEEEEeccEEecccceEEEEecCccC Q lcl|Aclame:pro 362 -----------Y---GQYLQAVLRFGVSKVDDKAGYYVTFTPEPL 392 (394) Q Consensus 362 -----------~---~~~~r~~~r~d~~v~~~~af~~l~~~~~~~ 392 (394) | ...+|+++|+|++|.+|+||++|+-.+-++ T Consensus 391 ~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:80 391 YKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVAWGA 435 (435) T ss_pred ccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEeccCCCC Confidence 3 345899999999999999999999999888 No 56 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=100.00 E-value=7.6e-56 Score=322.77 Aligned_cols=383 Identities=13% Similarity=0.093 Sum_probs=257.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEV 80 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) =.++.|+|+++++++..+..++..++.++. .++.....++++++++++..+++.+++..+..+................ T Consensus 2 ~~~~~lee~~a~l~~~~~~~~~~~~~~~~~-~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (419) T protein:vir:94 2 PPTPTLEEQRAALLARLDDTSLTTEQVQEI-VAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAGT 80 (419) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Confidence 123344445554444444444444444432 2222233445566666666666665555544433322211111111100 Q ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhh Q lcl|Aclame:pro 81 TQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKP 160 (394) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~ 160 (394) .......... ....+...... ........................+.....+..++|+.+...|...++..+.|++ T Consensus 81 ~~~~~~~~~~-~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~ 156 (419) T protein:vir:94 81 FRSLAQRFAD-SDGLREYRARD---KRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPLLVAD 156 (419) T ss_pred ccchhhhhhh-HHHHHHHHHhh---hhhhhhHHHHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHhhhhhhhh Confidence 0000000000 00001000000 0000000111111111111222233445556678888888888888888899999 Q ss_pred eeeeEeecCCceeEEEEecCC-------CcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHH Q lcl|Aclame:pro 161 FTTVYQAKKASGKYPVLQRAT-------TKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIV 233 (394) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i 233 (394) +|+++++.++.+.+|..+... +.+.|++|++..+ .++++|+++++++++++++++||+|+++|+. +|++|| T Consensus 157 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~-~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-~l~~~i 234 (419) T protein:vir:94 157 LLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKP-QSTLSFDTITTTLKTVAHWLPITRQAADDNS-QLMGYI 234 (419) T ss_pred cceeeeccCCceeeeeeccccccccccCcccceecCCcccc-ccccceeeEEeeeeeEEEeehhhHHHHHhHH-HHHHHH Confidence 999999999888888765422 3345666666665 5889999999999999999999999999975 799999 Q ss_pred HHHHHHHHHHHHHHHHhhccccccccccc---------------------cHHHHHHHHHhhhhhhc-ccEEEEcHHHHH Q lcl|Aclame:pro 234 SESISQIKVNTTNDAIAKVLKSFTTKTVK---------------------NLDEIKALLNGGFDPAY-NVSLIVSQSFYQ 291 (394) Q Consensus 234 ~~~l~~~~~~~~~~a~~~g~~~~~~~~~~---------------------~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~ 291 (394) .+.|+++++.+++.+|++|+|++.|.+.. .++++.+++..+..+++ +++|+|||++|. T Consensus 235 ~~~la~a~~~~~d~aii~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~t~~~~~~~l~~~~~~~~~~~~~~~~~v~n~~~~~ 314 (419) T protein:vir:94 235 QGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAGFPPDGVVVHPQDWE 314 (419) T ss_pred HHHHHHHHHHHHHHHHHhccCcccccceecccccccccccccccccccchhHHHHHHHHHhhhhccCCCCEEEEcHHHHH Confidence 99999999999999999999987654431 25778888877776665 578999999999 Q ss_pred HHHhhhccCCc-eeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeeccc--cc---ce Q lcl|Aclame:pro 292 TLDTLKDGNGR-YLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEI--YG---QY 365 (394) Q Consensus 292 ~l~~lkd~~G~-~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--~~---~~ 365 (394) .|++++|++|+ |++++++.++.+++|+|+||++++++ ++++++||||+++|+++++++++++++++.. |. .. T Consensus 315 ~l~~~k~~~~~~~~~~~~~~~~~~~~l~G~pV~~~~~~--~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~ 392 (419) T protein:vir:94 315 SIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAI--AQGTALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLV 392 (419) T ss_pred HHHHHhhcCCCceeecCCcccCCCccccceeeEEcCCC--CCccEEEeeccceEEEEEecceEEEEeccccchhhcCcEE Confidence 99999998555 66888888888999999999997754 6788999999998899999999999987653 44 46 Q ss_pred EEEEEEeccEEecccceEEEEecCccC Q lcl|Aclame:pro 366 LQAVLRFGVSKVDDKAGYYVTFTPEPL 392 (394) Q Consensus 366 ~r~~~r~d~~v~~~~af~~l~~~~~~~ 392 (394) +|+++|+|++|++|+||+++++++++| T Consensus 393 ~r~~~r~d~~v~~~~a~~~~~~~aa~~ 419 (419) T protein:vir:94 393 ILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) T ss_pred EEEEEeeccEEeccccEEEEEeccCCC Confidence 899999999999999999999999999 No 57 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=100.00 E-value=2.8e-55 Score=319.65 Aligned_cols=365 Identities=14% Similarity=0.123 Sum_probs=239.2 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHH---HhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKN---ALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGG 77 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~---~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~ 77 (394) .-...+.+++.+.....+++++..++++. .+..+..++++++..+++.+++.+..+++.++..+......... T Consensus 140 ~~~~~l~e~~~~~~~~~~e~k~~~e~~~~e~~e~~~~~~~~~e~l~~~~e~~~~~~~~~~~~~d~~e~~~~~~~~~---- 215 (543) T protein:vir:81 140 LEPDSIEDCRFRDPWNLSEMRTFGRDAEEVKGELRARALSAIEKMQGASDNVRAAATKIIERFDDEDSTLARQCLA---- 215 (543) T ss_pred ccCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh---- Confidence 00011223333332222222222222211 11122223334444444444444433333333222221111100 Q ss_pred ccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHH-HHHHhhh Q lcl|Aclame:pro 78 KEVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPA-REVKTVV 156 (394) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~-~~~~~~~ 156 (394) .........+..+.+....... ..............+.+.+.++++||+++...|+ ..++..+ T Consensus 216 ----~~~~~~~~a~~~~~~~~~~~~l------------~~~e~~~~~~~~~~~~t~~~gg~lip~~~~~~ii~~~~~~~~ 279 (543) T protein:vir:81 216 ----TSSPAYLRAWSKMARNPHAAIL------------TEEEKRAINEVRAMGLTKADGGYLVPFQLDPTVIITSNGSLN 279 (543) T ss_pred ----hhhhhhhhHHHHHHHhhHHHHh------------hhhhhhhhhhhhhcccccccCcccCchhhhhHHHHHHHhhhc Confidence 0011111122222211110000 0001111122334556777889999999998876 6678889 Q ss_pred hhhheeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHH Q lcl|Aclame:pro 157 DLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSES 236 (394) Q Consensus 157 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~ 236 (394) +|+.++++.++ ++.+.+|+.. ....+.|++|++..+ .++++|+++++++++++++++||+++++|+ ++|.+||.+. T Consensus 280 ~l~~~~~~~~~-~g~~~~~~~~-~~~~a~~v~Eg~~~~-~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~~~~~i~~~ 355 (543) T protein:vir:81 280 DIRRFARQVVA-TGDVWHGVSS-AAVQWSWDAEFEEVS-DDSPEFGQPEIPVKKAQGFVPISIEALQDE-ANVTETVALL 355 (543) T ss_pred hhhhhcccccC-CcceEEEEec-CCcceeecccCcccc-ccccccceeeeeeeeeEeeehhhHHHHhcc-HHHHHHHHHH Confidence 99999998766 5667777754 345566666666665 688999999999999999999999999998 5899999999 Q ss_pred HHHHHHHHHHHHHhhccccc-cccc------------------cccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhh Q lcl|Aclame:pro 237 ISQIKVNTTNDAIAKVLKSF-TTKT------------------VKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTL 296 (394) Q Consensus 237 l~~~~~~~~~~a~~~g~~~~-~~~~------------------~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~l 296 (394) |+++++.+++.+|++|+|++ .|.| ..+++++.+++..+...+. +++|+|||++|..|+++ T Consensus 356 l~~~~~~~~d~ail~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~l~~l 435 (543) T protein:vir:81 356 FAEGKDELEAVTLTTGTGQGNQPTGIVTALAGTAAEIAPVTAETFALADVYAVYEQLAARHRRQGAWLANNLIYNKIRQF 435 (543) T ss_pred HHHHHHHHHHHHHhccCCCCcccccchhhcccccccccccccccccHHHHHHHHHhhhccccCCcEEEEcHHHHHHHHHh Confidence 99999999999999998865 2222 1246788888776655544 58999999999999999 Q ss_pred hccCCceeecccccCCCcccccccceEEecCccc--------ccCceEEEeccccEEEEeecceEEEEeecccc------ Q lcl|Aclame:pro 297 KDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVL--------GANKAFIGDFKRGVLFADRKDLGLRWADNEIY------ 362 (394) Q Consensus 297 kd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~--------~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~------ 362 (394) +|++|+|||.+ +..+.+++|+|+||+++++++. +..+++||||+. |+++++.+++|.++.+.+. T Consensus 436 kd~~G~~l~~~-~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~i~~gd~~~-~~i~~~~~~~i~~~~~~~~~~~~~~ 513 (543) T protein:vir:81 436 DTQGGAGLWTT-IGNGEPSQLLGRPVGEAEAMDANWNTSASADNFVLLYGNFQN-YVIADRIGMTVEFIPHLFGTNRRPN 513 (543) T ss_pred hcCCCceeccC-cCCCCCccccceeeEEeccccccccccccCCcceEEEeeccc-eeEEeecccEEEEeccccccchhhc Confidence 99999999976 5566778999999999886543 344589999985 7788999999998765432 Q ss_pred -cceEEEEEEeccEEecccceEEEEecCcc Q lcl|Aclame:pro 363 -GQYLQAVLRFGVSKVDDKAGYYVTFTPEP 391 (394) Q Consensus 363 -~~~~r~~~r~d~~v~~~~af~~l~~~~~~ 391 (394) ...+++++|+|+.|.+|+||+++++++++ T Consensus 514 ~~~~~~~~~r~d~~v~~~~A~~~l~~~~~a 543 (543) T protein:vir:81 514 GSRGWFAYYRMGADVVNPNAFRLLNVETAS 543 (543) T ss_pred CceEEEEEEeeccEeecccceEEEEecccC Confidence 34689999999999999999999999999 No 58 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=100.00 E-value=5.3e-56 Score=323.63 Aligned_cols=390 Identities=12% Similarity=0.052 Sum_probs=239.6 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhh--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHhhc Q lcl|Aclame:pro 2 FEEKIKEIKATIADLNNTIVTKTAQVKNALESDD--------LEAARSIKAEVEQAKANLVEAENDLKLYES---SVEVG 70 (394) Q Consensus 2 l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~--------~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~---~~~~~ 70 (394) ++++|+||++++++|+++..++.++++.++++.+ .++..+..+++++++++++.+++.++.+++ ..+.. T Consensus 1 ~~k~~eem~~~i~eL~e~r~~l~~e~~~l~d~ak~e~~~~~~~~e~~e~~a~~~el~~ei~~le~~~~~~~~~~~~~~~~ 80 (477) T protein:vir:84 1 MEKHLEELRALRAAAVEAVATLKAERQAIADGAKAEERAALSADETAEFRAKSASIKAELDKVEDLDEQIRELESEIERS 80 (477) T ss_pred CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 8888888888888888888777777766544322 112223334445555555544433322221 11100 Q ss_pred c-ccc----cccccccchh--hhHHHHHHHHHHHHHHHH---HHH----HHHH---HHHHHHHHHHhhhhh-hhhhhccc Q lcl|Aclame:pro 71 G-AEN----IGGKEVTQEE--KTYRESVNDFIRSKGKIV---NDS----LRFE---GKDEVLMPINETTPV-EPQKDGIK 132 (394) Q Consensus 71 ~-~~~----~~~~~~~~~~--~~~~~~~~~~~~~~~~~~---~~~----~~~~---~~~~~~~~~~~~~~~-~~~~~~~~ 132 (394) . ... .......... .........+.+...... ... .... ............... .......+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (477) T protein:vir:84 81 GKLEAETKTVRKATVEVNEALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYRDLDRN 160 (477) T ss_pred hcchhhhhhhcccccccccchhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhcccccc Confidence 0 000 0000000000 000000111111100000 000 0000 000000000111111 11222344 Q ss_pred ccCCccccchh-HHhHHHHHHHhhhhhhheeeeEeecCCc--eeEEEEecCCCccccccccccc-----cccccccccee Q lcl|Aclame:pro 133 KENAKPVSSEE-ILYTPAREVKTVVDLKPFTTVYQAKKAS--GKYPVLQRATTKMVTVAELEKN-----PALAKPDFKDV 204 (394) Q Consensus 133 ~~~~~~lvP~~-~~~~I~~~~~~~~~l~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~e~~~~-----~~~~~~~~~~v 204 (394) +..|++++|++ +.+.|++.+++.++|+++++++++++++ +.+|.... +...++|.+++.. .+.++++|+.+ T Consensus 161 ~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~-~~~~a~~~~Eg~~~~~~~~~~s~~~f~~i 239 (477) T protein:vir:84 161 GGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILT-GTSTAIQAADNAALTAPSAHEVDLTDGFV 239 (477) T ss_pred CCCcceeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEec-CcceeeeeccCcccccccccccccceeeE Confidence 55567777766 4678999999999999999999887654 56665543 3344555555432 23578899999 Q ss_pred eecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-cccccc---------------c----- Q lcl|Aclame:pro 205 AWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSF-TTKTVK---------------N----- 263 (394) Q Consensus 205 ~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~-~~~~~~---------------~----- 263 (394) ++++++++++++||+|||+||.+++++||.++|+++++.+++.+|++|+|++ .|.|+. + T Consensus 240 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~t~~~~~ 319 (477) T protein:vir:84 240 QANVKTIAGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGITQVTATSAGSALEKHQ 319 (477) T ss_pred EEeeeeEEeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeeccccccccccccccchhhHH Confidence 9999999999999999999999999999999999999999999999999864 333321 1 Q ss_pred --HHHHHHHHHhhhhhhc-c-cEEEEcHHHHHHHHhhhccCCceeeccc-------------ccCCCcccccccceEEec Q lcl|Aclame:pro 264 --LDEIKALLNGGFDPAY-N-VSLIVSQSFYQTLDTLKDGNGRYLLQDD-------------ITAVSGKVLLGKPVFVLS 326 (394) Q Consensus 264 --~~~i~~~~~~~~~~~~-~-a~~vm~~~~~~~l~~lkd~~G~~l~~~~-------------~~~~~~~~l~G~pV~~~~ 326 (394) ++++++++......++ + +.|+|||++|..|++|+|++|+|||+|+ +..+.+++|+|+||++++ T Consensus 320 ~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~pVv~s~ 399 (477) T protein:vir:84 320 IIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHGLPVVTDP 399 (477) T ss_pred HHHHHHHHHHhhccccccCCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhcccceEecC Confidence 2233444433333333 2 4799999999999999999999999875 334456799999999977 Q ss_pred Ccccc------cCceEEEeccccEEEEeecceEEEEeeccccc---ceEEEEEEecc-EEecccceEEEEecCccCCC Q lcl|Aclame:pro 327 DEVLG------ANKAFIGDFKRGVLFADRKDLGLRWADNEIYG---QYLQAVLRFGV-SKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 327 ~~~~~------~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~---~~~r~~~r~d~-~v~~~~af~~l~~~~~~~~~ 394 (394) .++.+ ...++||||+. ++++. .++.++.+++.+.. ..++++.++++ .+++|+||+.+|++++.+|- T Consensus 400 ~~p~~~~~~~d~~~i~~gd~~~-~~i~~-~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~r~~~afv~~t~~~~~~~~ 475 (477) T protein:vir:84 400 TLPTTLGTGTDQDVIHVLRASD-LALFE-SSVRMRALQETRAENLSVLLQVYGYLAFTAARFPQSVVEIGGTALTAPT 475 (477) T ss_pred cccccccccCCcceEEEEEece-EEEEe-eceeEEeccccccccceeeeeehhhhhhhhhccccceEEeecccccccc Confidence 65432 23689999986 55554 57778877665543 23566666665 55569999999999999999 No 59 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=100.00 E-value=5e-57 Score=329.27 Aligned_cols=271 Identities=20% Similarity=0.250 Sum_probs=235.4 Q ss_pred hhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecC--CCccccccccccccccccccc Q lcl|Aclame:pro 124 VEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRA--TTKMVTVAELEKNPALAKPDF 201 (394) Q Consensus 124 ~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~e~~~~~~~~~~~~ 201 (394) .-......+++.|++++|+++++.|++.+++.++|+++|+++++++.++++++.... ++.+.|++|+++.++.++++| T Consensus 1 ~l~~~~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~ 80 (293) T protein:vir:48 1 MLDSKTDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADIDDPKL 80 (293) T ss_pred CceeecccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCcccccccccce Confidence 333344566677889999999999999999999999999999998888887776543 344667777788877778999 Q ss_pred ceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-cccccHHHHHHHHHhhhhhhc- Q lcl|Aclame:pro 202 KDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTT-KTVKNLDEIKALLNGGFDPAY- 279 (394) Q Consensus 202 ~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~-~~~~~~~~i~~~~~~~~~~~~- 279 (394) +++++++|++++++++|+|+++|+.++|++||.++|+++++.+++.+|++|+++.++ .+..+++++.+++.++...++ T Consensus 81 ~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~~~~~~~d~i~~~~~~l~~~~~~ 160 (293) T protein:vir:48 81 SLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLPTKPTLTKWDDIIDLEAKVDPAIKQ 160 (293) T ss_pred eEEEEeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhccccccccccccCHHHHHHHHHhhhhhhcC Confidence 999999999999999999999999999999999999999999999999999987655 467789999998877665554 Q ss_pred ccEEEEcHHHHHHHHhhhccCCceeecccccCCCcccccccceEEecCccc-----ccCceEEEeccccEEEEeecceEE Q lcl|Aclame:pro 280 NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVL-----GANKAFIGDFKRGVLFADRKDLGL 354 (394) Q Consensus 280 ~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~-----~~~~~~~gd~~~~~~~~~~~~~~i 354 (394) +++|+||++++..|++|||++|+|||++++.++.+++|+|+||+++++... +...++||||+++|++++|+++++ T Consensus 161 ~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~l~G~Pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i 240 (293) T protein:vir:48 161 TSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGYSIAGFAVKEISDRWLPNASSGVMPLYFGDLKQAVTLFDRQQMSL 240 (293) T ss_pred CCEEEEcHHHHHHHHHhhccCCceEeecCcCCCCCceecceeeEEecccccCCccCCceEEEEEeccceEEEEEecceEE Confidence 689999999999999999999999999999999999999999998765443 344689999999999999999999 Q ss_pred EEeecc--cc---cceEEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 355 RWADNE--IY---GQYLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 355 ~~~~~~--~~---~~~~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) +.+++. .| ...+|+++|+|+++.+|+||++++++++++|- T Consensus 241 ~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~~~ 285 (293) T protein:vir:48 241 LSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQK 285 (293) T ss_pred EEecccchhhhcCeEEEEEEEeeCcEEecccceEEEEeeccccCC Confidence 987642 33 45689999999999999999999999999988 No 60 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=100.00 E-value=4.7e-55 Score=318.45 Aligned_cols=335 Identities=14% Similarity=0.091 Sum_probs=223.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc Q lcl|Aclame:pro 3 EEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVTQ 82 (394) Q Consensus 3 ~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 82 (394) ++.+++|++++++++++.+.+ +++++.++...+.............. T Consensus 1 ~eei~~l~~~~~~l~~~~~~l---------------------------------~~~~d~~e~e~~~~~~~~~~~~~~~~ 47 (352) T protein:vir:78 1 MEDIKQLETEKAGLQQRFNIV---------------------------------ERQVQDIEEKEKAKVKDKGEAYQSLN 47 (352) T ss_pred ChhHHHHHHHHHHHHHHHHHH---------------------------------HHHHHHHHHHHHHHhhhccccccccc Confidence 444444444444444333332 22222211111100000000000011 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhhee Q lcl|Aclame:pro 83 EEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFT 162 (394) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~ 162 (394) ........+..+.++......... ...............+.+.|+++||+++.+.|++.++.+++|+++| T Consensus 48 ~~~~~~~~~~~~~r~~~~~~~~~~----------~~~~~~~~~~al~~~~~~~gG~lIP~~~~~~Ii~~l~~~s~l~~~~ 117 (352) T protein:vir:78 48 DNEKLVKAKAEFYRHAILPNEFEK----------PSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQLREKA 117 (352) T ss_pred hhhhHHHHHHHHHHHHhhhhHHHH----------HHhhHHHHHHHhccCCCCCCceeccHhHHHHHHHHHHhhcchhhhe Confidence 111111223333332211111000 0000001111122345667889999999999999999999999999 Q ss_pred eeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 163 TVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKV 242 (394) Q Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~ 242 (394) ++.++++ ..+|....+...+.|++|++..+ .++++|++|++++|+++++++||+|||+||.+||++||.+.|+++++ T Consensus 118 ~v~~~~~--~~~p~~~~~~~~a~~v~E~~~~~-~~~~~f~~v~~~~~k~~~~i~is~ell~Ds~~~l~~~i~~~la~~~~ 194 (352) T protein:vir:78 118 RLTNIKG--LEIPRVSYTLDDDDFITDVETAK-ELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENALQSGLA 194 (352) T ss_pred eeEecCC--ceEEEEecCCCcccccccccccc-cccccceeeeecceeEEeechhhHHHHhhhhHHHHHHHHHHHHHHHH Confidence 9988754 56777665555666666666665 57899999999999999999999999999999999999999999999 Q ss_pred HHHHH-HHhhcccccccc------------ccccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCceeeccc Q lcl|Aclame:pro 243 NTTND-AIAKVLKSFTTK------------TVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLLQDD 308 (394) Q Consensus 243 ~~~~~-a~~~g~~~~~~~------------~~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l~~~~ 308 (394) .+++. .+..|.|++.+. +...+|++++++..+...++ +++|+||+.++..|.+++|.+|+|+|.. T Consensus 195 ~~e~~~~~~~g~g~~~~~g~l~~~~~~~~t~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~l~~~~~~~~~~~~~~- 273 (352) T protein:vir:78 195 AKERKDALAVSPKSGLEHMSFYNGSVKEVEGANMYDAIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDT- 273 (352) T ss_pred HHHHHhhhhcCCCCcccccceeccccccccccchHHHHHHHHhccChhhhcCCEEEEehHHHHHHHHHHhccCCccccc- Confidence 88655 555666554332 22347899999987655554 7899999999999999988899998853 Q ss_pred ccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeeccc-ccceEEEEEEeccEEecccceEEEEe Q lcl|Aclame:pro 309 ITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEI-YGQYLQAVLRFGVSKVDDKAGYYVTF 387 (394) Q Consensus 309 ~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~-~~~~~r~~~r~d~~v~~~~af~~l~~ 387 (394) .+.+|+|+||+++++ ..+++||||+++|+.. .++.++...+.. ....|++++|+|++|++|+||+.+++ T Consensus 274 ----~~~~llG~PV~~~~~----~~~~~~Gdf~~~~~~~--~~~~~~~~~~~~~g~~~f~~~~r~Dg~~~~~eA~~~l~~ 343 (352) T protein:vir:78 274 ----PAEKVFGKPVVFTDA----AVKPIVGDFNYFGINY--DGTTYDTDKDVKKGEYLFVLTAWYDQQRTLDSAFRIAKA 343 (352) T ss_pred ----CCccccccceEEecC----CCceeEeehhhhhhhh--hhheeeeeccccCCeeEEEEEeeeCceeechhheEEEEe Confidence 356899999999774 3568999999876553 456666655543 34578999999999999999999999 Q ss_pred cCccCCC Q lcl|Aclame:pro 388 TPEPLPL 394 (394) Q Consensus 388 ~~~~~~~ 394 (394) +|+++|+ T Consensus 344 ~a~~~~~ 350 (352) T protein:vir:78 344 KESTGSL 350 (352) T ss_pred ecccCCC Confidence 9999888 No 61 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=100.00 E-value=1.8e-54 Score=315.22 Aligned_cols=336 Identities=11% Similarity=0.027 Sum_probs=244.9 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEV 80 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) |.++++++++++++++.+.++..... +++ .+...+.+..+.+++.... .++.+.... ... ..... T Consensus 3 i~~k~~~~~~~~~~~l~~~~~~~~~~------ee~---~~~~~~~~~~~~~~~~~~~--~~e~~~~~~---~~~-~~~~l 67 (377) T protein:vir:98 3 INLKELPKYREAVAELSAKISAGATS------EEQ---EKLFEAAFTTMGDEILAKN--EEEMERMFD---LRD-KNREL 67 (377) T ss_pred CcHHHHHHHHHHHHHHHHHHHhhhhh------HHH---HHHHHHHHHhHHHHHHHHH--HHHHHHHHH---hcc-CCccc Confidence 88888888888777776655432111 111 1112222333333332211 111111100 000 00000 Q ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhh Q lcl|Aclame:pro 81 TQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKP 160 (394) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~ 160 (394) .. ++.+.++ ......+.++++++||+++.+.|++.+.+.++|++ T Consensus 68 t~---ee~~~~~---------------------------------~~~~~~~~~~gg~~vP~~~~~~I~~~l~~~s~i~~ 111 (377) T protein:vir:98 68 TA---EEIKFFN---------------------------------DIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLK 111 (377) T ss_pred CH---HHHHHHH---------------------------------HHHhccCCCCCccccCHHHHHHHHHHHHHhhhhhh Confidence 00 0000000 01123456678899999999999999999999999 Q ss_pred eeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHH Q lcl|Aclame:pro 161 FTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQI 240 (394) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~ 240 (394) +|++.+++ +..++|+.. +.+.+.|+.|+++.+++++++|+++++++|+++++++||++||+||.+||++||.+.|+++ T Consensus 112 ~~~v~~~~-~~~~~~~~~-~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~a~~~is~elL~ds~~~ie~~i~~~la~~ 189 (377) T protein:vir:98 112 VINFKNTS-LRLKALTAE-TSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWIKQFITEQLKEA 189 (377) T ss_pred heeeEecC-cceEEEEec-CCcceeEeecccccCcccCccceeEeecceeEEeeecccHHhhhccHhHHHHHHHHHHHHH Confidence 99999986 456888754 4556667677677776788999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhcccccccccccc---------------------HHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhc Q lcl|Aclame:pro 241 KVNTTNDAIAKVLKSFTTKTVKN---------------------LDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKD 298 (394) Q Consensus 241 ~~~~~~~a~~~g~~~~~~~~~~~---------------------~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd 298 (394) ++.+++.+|++|+|++.|.|..+ .+.+.++...+...++ +++|+||+.++..+++|+| T Consensus 190 ~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~klkd 269 (377) T protein:vir:98 190 IAVALELAIVKGDGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLTPDNAPKKLVPVMKHLSVNDKKRPLK 269 (377) T ss_pred HHHHHhhceEeccCCCcceeeeecccccccccccccccccccchhhhHhhhhhhchhHHHHHHHHHHHHHHHHHHhhhhc Confidence 99999999999999988766532 1345555555454454 5899999999999999999 Q ss_pred cCCceeecccc--------------cCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeeccccc- Q lcl|Aclame:pro 299 GNGRYLLQDDI--------------TAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEIYG- 363 (394) Q Consensus 299 ~~G~~l~~~~~--------------~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~- 363 (394) .+|+|+|..++ .+|.+.+++|+|+.++.+...+++.++||||++ |++++|.+++|+.+++.+|. T Consensus 270 ~~G~~i~~~n~~~~~~~~p~~~~~~~~G~~~t~lg~p~~vv~s~~~p~~~i~fgdf~~-Y~i~~r~~~~i~~~~~~~~~~ 348 (377) T protein:vir:98 270 IAGQVKLILNPEDRWALEAQFTSRNQFGEYVTVLPHGITILESLAVETGKAIAFVANR-YDAFMATASTIEEYDQTFAME 348 (377) T ss_pred cCCceEEEecccchhhccccccccCCCCccccccCCCceEEecCCCCcccEEEEEecc-eeEEeecceEEEeechhhhhc Confidence 99999994322 235566899999877767677888999999998 78899999999999988765 Q ss_pred --ceEEEEEEeccEEecccceEEEEecCc Q lcl|Aclame:pro 364 --QYLQAVLRFGVSKVDDKAGYYVTFTPE 390 (394) Q Consensus 364 --~~~r~~~r~d~~v~~~~af~~l~~~~~ 390 (394) +.+|+++|+||++++|+||++++++-= T Consensus 349 d~~~f~~~~r~dg~~~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 349 DLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred CceEEEEEEEEcCEEeccCcEEEEEEecC Confidence 468999999999999999999999988 No 62 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=100.00 E-value=2.2e-52 Score=303.82 Aligned_cols=381 Identities=13% Similarity=0.118 Sum_probs=239.1 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHH---Hh----chhhH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKN---AL----ESDDL----EAARSIKAEVEQAKANLVEAENDLKLYESSVEV 69 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~---~~----~~e~~----~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~ 69 (394) ||..+++++++++.+|.++.+++..+.+. .+ ++++. ++++++++++.++.+++..++++++.+++..+. T Consensus 7 ~l~~~~~~~~~~l~el~e~~~~l~k~~~el~~~l~ea~~~ee~~~~ee~i~~l~~~~~el~e~~~~l~~ei~~le~el~e 86 (466) T protein:vir:80 7 MLAKKIEQRKAALAELLEQEKALQKRSEELEAAIDEANTDEEIAVVEDEINKLEGEKTELEEKKSKLEGEIKELENELEQ 86 (466) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 88888888777777776665554333222 11 11111 233344444445545555555444444433322 Q ss_pred cccccccccccc-chhhhHHHHH---HHHHHHHHHHHHHHHH--HHHHHHHHHHHHhhhhhhhhhhcccccCCccccchh Q lcl|Aclame:pro 70 GGAENIGGKEVT-QEEKTYRESV---NDFIRSKGKIVNDSLR--FEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEE 143 (394) Q Consensus 70 ~~~~~~~~~~~~-~~~~~~~~~~---~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~ 143 (394) ............ .......... ....+..........+ ................ ......+.+++++++|++ T Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~g~~~~vP~~ 164 (466) T protein:vir:80 87 LNNKEPKNNSEPAQVSGARTQQFVGGETRMKGFFRNMPYEQRAALIARSEVKEFLAQVRT--LAQQKRAVSGAELTIPDV 164 (466) T ss_pred HHHhhhccCchhHHHHhhhhhHHhhHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHH--HhhhhhhhccccccccHH Confidence 111111111000 0000000000 0000000000000000 0001111111111111 112223345566899999 Q ss_pred HHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHh Q lcl|Aclame:pro 144 ILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESID 223 (394) Q Consensus 144 ~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ 223 (394) +.+.|++.+++.++|+++|++.++++ ...+++.. ....+.|+.|++..+ .++++|++|++++|+++++++||++||+ T Consensus 165 ~~~~i~~~l~~~~~l~~~~~v~~~~g-~~~~~~~~-~~~~a~wv~E~~~~~-~~~~~f~~i~~~~~k~~~~~~iS~ell~ 241 (466) T protein:vir:80 165 MLELLRDNMHRYSKLISKVRLRPLKG-TARQNIAG-AIPEGVWTEAVANLN-ELSLSFSQIEVDGYKVGGFIPIPNSTLE 241 (466) T ss_pred HHHHHHHhhhhhhhhhhheeeeecCc-eeEeeeec-CCcceeecccccccc-cccccccceeecceeeeeehhhhHHHHh Confidence 99999999999999999999999864 45666543 334445555555555 5789999999999999999999999999 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccH----------------------HHHHHH----------- Q lcl|Aclame:pro 224 DADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTKTVKNL----------------------DEIKAL----------- 270 (394) Q Consensus 224 ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~~~~~----------------------~~i~~~----------- 270 (394) ||.++|++||.++|+++++.+++.+|++|+|++.|.|..+. .++..+ T Consensus 242 ds~~~l~~~i~~~la~~~~~~~~~ail~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 321 (466) T protein:vir:80 242 DSDLNLADEILDAIGQAIGFALDKAILYGTGTKMPVGIVTRLAQTTQPPNWGTKAPAWTNLSTTNLLKIDPTGKSAEEFF 321 (466) T ss_pred cchHHHHHHHHHHHHHHHHHHHhhheeeccCCCCcceeeecccccccccccccccccccccchhhhhhhhhhccchhhHH Confidence 99999999999999999999999999999999877654211 111111 Q ss_pred ------HHhhhhhhc--ccEEEEcHHHHHHHHhhh---ccCCceeecccccCCCcccccccceEEecCcccccCceEEEe Q lcl|Aclame:pro 271 ------LNGGFDPAY--NVSLIVSQSFYQTLDTLK---DGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGD 339 (394) Q Consensus 271 ------~~~~~~~~~--~a~~vm~~~~~~~l~~lk---d~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd 339 (394) +......+. ++.|+||+.++..|..++ +++|.|++.+. + ...|+|+||++++++ +.+.+++|| T Consensus 322 ~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~~~~~g~~~~~~~--~--~~~i~G~pvv~s~~~--~~~~~~~g~ 395 (466) T protein:vir:80 322 SELVLKLSKARANYSNGMKFWAMSSNTHAVLMSKAITFNSAGALVASLN--N--TMPIVGGDIVILDFI--PDNDIIGGY 395 (466) T ss_pred HHHHHHHHhhhccccCCceeEEecchhHHHhhcccccccCCccccccCC--C--cccccccceeecCcc--Cccceeeec Confidence 111111222 246999999999999887 67888877542 2 236999999987754 566789999 Q ss_pred ccccEEEEeecceEEEEeecccccc---eEEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 340 FKRGVLFADRKDLGLRWADNEIYGQ---YLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 340 ~~~~~~~~~~~~~~i~~~~~~~~~~---~~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) |+. |++++|++++|..+++.+|.+ .+|+++|+||+|++|+||++++++... |- T Consensus 396 ~~~-y~i~~r~~~~i~~~~~~~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~~~-~~ 451 (466) T protein:vir:80 396 GSL-YLLAERADIKLAQSEHVRFIEDQTVFKGTARYDGKPVFGEGFVAVNIANAN-PT 451 (466) T ss_pred ccc-EEEEeecceEEEechhhhhhcCcEEEEEEEEEccEEeccCceEEEEecCCC-cc Confidence 986 678899999999998887644 589999999999999999999988753 33 No 63 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=100.00 E-value=3e-52 Score=303.09 Aligned_cols=338 Identities=8% Similarity=-0.015 Sum_probs=222.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc Q lcl|Aclame:pro 3 EEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVTQ 82 (394) Q Consensus 3 ~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 82 (394) ++.|.|+++++.++++ +..+.++..... .+. .+.++++...+.. +..++.+.... T Consensus 1 ik~L~e~~~e~~e~~~---~~~~~~~~~~~~--~e~----~~~~~~~~~~~~~--~~~~~~~~~~~-------------- 55 (390) T protein:vir:40 1 MNNLDKKDSETLNIST---AFLNAIKEGATE--AEQ----VTAFTNMAEQIQN--NIIAQARKEVN-------------- 55 (390) T ss_pred CchHHHHHHHHHHHHH---HHHHHHhhhhhH--HHH----HHHHHHHHHHHHH--HHHHHHHHHHH-------------- Confidence 3444444444444333 322222211000 011 1111111111110 00000000000 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhhee Q lcl|Aclame:pro 83 EEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFT 162 (394) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~ 162 (394) ............... ....+.+... .......+.+++++++|+++++.|++.++..++|+++| T Consensus 56 ---~~~~~~~~~~~~~~~--------~l~~~~r~~~------~~~~~~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~ 118 (390) T protein:vir:40 56 ---REMNDNNVLASRGAN--------ALTSDESKYY------NEVIAGNGFAGVTALLPPTVFERVFEDLTVEHPLLSKI 118 (390) T ss_pred ---HHHHHHHHHHhcCch--------hccHHHHHHH------HHHHhccCcccCcccccHHHHHHHHHHHHhhhhhhhhc Confidence 000000000000000 0000000000 11123345667889999999999999999999999999 Q ss_pred eeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 163 TVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKV 242 (394) Q Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~ 242 (394) ++++++++...+|+.. ..+.+.|+.|++..++.++++|+++++++|+++++++||+||++|+.++|++||.+.|+++++ T Consensus 119 ~~~~~~~~~~~i~~~~-~~~~a~~~~E~~~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~la~~i~ 197 (390) T protein:vir:40 119 NFVNTTATTEWIISVG-DVATAWWGPLCAEIKEVLDNGFDKIQTGMYKLSAYIPVCNAMLDLGPSWLDQYVRTILGEAMA 197 (390) T ss_pred eeeecCCceeEEEEEc-CCcceeeeccccccCccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHHHHHHHH Confidence 9999999888888764 345556666666667678899999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhhccccccccccc-------------------cHHHHHHHHHhhhhh--------hcccEEEEcHHHH-H--- Q lcl|Aclame:pro 243 NTTNDAIAKVLKSFTTKTVK-------------------NLDEIKALLNGGFDP--------AYNVSLIVSQSFY-Q--- 291 (394) Q Consensus 243 ~~~~~a~~~g~~~~~~~~~~-------------------~~~~i~~~~~~~~~~--------~~~a~~vm~~~~~-~--- 291 (394) .+++.+|++|+|++.|.|.. ++.++.+++..+... ..+++|+||+.++ . T Consensus 198 ~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~~i~n~~t~~~~l~ 277 (390) T protein:vir:40 198 LGLEAGIVNGSGKDQPIGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKSVSDAILVINPADYWSKIY 277 (390) T ss_pred HHHHhhhhcccCCCccceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchhhhhcCceEEEcchhHHHHHH Confidence 99999999999887765432 233444444443332 2378999999884 3 Q ss_pred HHHhhhccCCceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeecccccc---eEEE Q lcl|Aclame:pro 292 TLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEIYGQ---YLQA 368 (394) Q Consensus 292 ~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~---~~r~ 368 (394) .+++++|++|+|+|.. .++|+||++++. .+++.++||||++ |++++|++++|+++++.+|.+ .+|+ T Consensus 278 ~~~~~~d~~G~~v~~~--------~~~g~pvv~~~~--~p~~~i~~Gd~s~-~~i~~~~~~~v~~~~~~~f~~~~~~~r~ 346 (390) T protein:vir:40 278 AATSYMTPQGVWVTGI--------LPVPLEIVQSVA--VPVGKAVAGRAKD-YFMGIGSEQVIRTSTEYRLLDDETLYYA 346 (390) T ss_pred HHhhccCCCCcccccc--------CCCceeEEEcCC--CCCCcEEEEeece-EEEEeecceEEEecchhhhhcCcEEEEE Confidence 4558999999999743 357999998664 4677899999997 678899999999999887654 5899 Q ss_pred EEEeccEEecccceEEEEecCccC--CC Q lcl|Aclame:pro 369 VLRFGVSKVDDKAGYYVTFTPEPL--PL 394 (394) Q Consensus 369 ~~r~d~~v~~~~af~~l~~~~~~~--~~ 394 (394) ++|+|++|++|+||++++++++.- ++ T Consensus 347 ~~r~dg~v~~~~A~~~l~~~~~~~~~~~ 374 (390) T protein:vir:40 347 KQYANGRPKDNSSFLVFDITGLEGSPAI 374 (390) T ss_pred EEEeCCEEecccceEEEEeeccCCCCCC Confidence 999999999999999999998842 22 No 64 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=100.00 E-value=2.4e-51 Score=298.06 Aligned_cols=386 Identities=12% Similarity=0.036 Sum_probs=246.4 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHH---HhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc-cccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKN---ALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGA-ENIG 76 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~---~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~-~~~~ 76 (394) =..+++++|++++.++.++++++.+++.. .++.++.++++.+.+++++++.+++++++..+........... .... T Consensus 194 ~~~e~i~~l~~~ra~~~~~~~~l~~~a~~~g~~l~aee~~~~d~l~aei~~l~~~i~r~e~~e~~~a~~a~pv~~~~~~~ 273 (645) T protein:vir:93 194 NIGEQIKSFENKRAALAASLEEVMTKAAEEGRTLDVEEEEHYDNTAAEIRQVDAHLKRLRELEAGKAATAQPVKQAGNGN 273 (645) T ss_pred chhhhhhhhhHHHHHHHHHhhhhhhhHhhhccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccc Confidence 12356888888888888887777665543 4666777888888888888888887776432221111110000 0000 Q ss_pred cc-----c--ccchhhhHHHHHHHHHHHHHHH----HHHHHHHHHHHH-HHHHHHhhh-hhhhhhhcccccCCccccchh Q lcl|Aclame:pro 77 GK-----E--VTQEEKTYRESVNDFIRSKGKI----VNDSLRFEGKDE-VLMPINETT-PVEPQKDGIKKENAKPVSSEE 143 (394) Q Consensus 77 ~~-----~--~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~-~~~~~~~~~-~~~~~~~~~~~~~~~~lvP~~ 143 (394) .. . ...........|..+.+..... .......+.... ......... .............|++++|+. T Consensus 274 ~~~~~~~~~~~~~~~~~kg~~f~~~~~al~~~~g~~~~a~e~a~~~~~~~~~~~~~~~~a~~~~~~~~~~~~Gg~~vp~~ 353 (645) T protein:vir:93 274 VAAVASAPVIRVEQKLDKGIGFARFAKSLAAAKGVRSEALEVARRQYPDDSRLHHVLKSAVGAGTTTDPQWAGSLSEYQE 353 (645) T ss_pred cccccccccccchhhhhhhhhHHHHHHHHHhcccchhHHHHHHHhhcccchhhhhhhhhhhhccccccccccCCccCchh Confidence 00 0 0001111111222222221110 000000000000 000000000 111111122334578899999 Q ss_pred HHhHHHHHHHhhhhhhheeeeEeec----CCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhH Q lcl|Aclame:pro 144 ILYTPAREVKTVVDLKPFTTVYQAK----KASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQ 219 (394) Q Consensus 144 ~~~~I~~~~~~~~~l~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ 219 (394) +...|++.+++.++++.++...... .+...+|..+. +..+.|+.|++.. +.++++|+++++++|++++++++|+ T Consensus 354 ~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~~t~-~~~a~wv~Eg~~~-~~s~~~f~~v~l~~~kla~~~~iS~ 431 (645) T protein:vir:93 354 YAQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHAQVS-GGAAGWVGEGKTK-PLTKFDFESITFSHAKVSAIAVLTE 431 (645) T ss_pred hHHHHHHhhhhhhhHHhhccccccccccccCceeeeeeec-CcceEEeccCccc-cccccceeEEEEeeEEEEEeehhHH Confidence 9999999999999999886543222 23456666543 3445555555555 5688999999999999999999999 Q ss_pred HHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc----cccc-----------ccHHHHHHHHHhhhhhh---ccc Q lcl|Aclame:pro 220 ESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFT----TKTV-----------KNLDEIKALLNGGFDPA---YNV 281 (394) Q Consensus 220 ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~----~~~~-----------~~~~~i~~~~~~~~~~~---~~a 281 (394) |||+|+.+++++||++.|+++++.+++.+|++|++++. |.+. ..+.++..++..+.... .++ T Consensus 432 ell~ds~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~~p~gi~~~~~~~~~~~~~~~d~~~~~~~~~~a~~~~~~a 511 (645) T protein:vir:93 432 ELIRFSSPAADALVRNALAEAVVARLDTDFVDPKKAAVADVSPASITHDVKGTASSGNPDADAEAAFGQFVAANLQPTGA 511 (645) T ss_pred HHHhhchHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCCccccceeccccccccccchHHHHHHHHHHHHhcCCCcccc Confidence 99999999999999999999999999999998876542 2211 12345666655554432 358 Q ss_pred EEEEcHHHHHHHHhhhccCCceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeeccc Q lcl|Aclame:pro 282 SLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEI 361 (394) Q Consensus 282 ~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~ 361 (394) +|+|||.++..|++|+|++|+|+| |++. ..+++|+|+||++++.++ ..+++|||+. ++++++.++.|..+++.. T Consensus 512 ~~vmn~~~~~~L~~lkd~~G~~~~-~~~~-~~~~tL~G~PV~~s~~vp---~~~~~gd~s~-~~ig~~~~v~i~~s~~a~ 585 (645) T protein:vir:93 512 VWLMSSTNALALSMRKNALGQKEY-PDMT-LLGGSFQGLPVIVSQYVG---DQLVLVNAPD-IYLADDGGVAVDMSREAS 585 (645) T ss_pred EEEEcHHHHHHHHhccccCCceee-cCCC-CCCceeeceeeEEeccCC---cceeEecccc-EEEEEecceEEEeeccee Confidence 999999999999999999999998 4443 344699999999976543 3468899997 556778898888765533 Q ss_pred ----------------------cc---ceEEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 362 ----------------------YG---QYLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 362 ----------------------~~---~~~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) |+ ..+|+++|+|++++||+||++|+-.+==+.. T Consensus 586 ~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt~~~~g~~~ 643 (645) T protein:vir:93 586 LEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVITGVNYGSAS 643 (645) T ss_pred EEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEEEecccCCccc Confidence 32 3479999999999999999999844332333 No 65 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=100.00 E-value=1.2e-51 Score=299.69 Aligned_cols=328 Identities=12% Similarity=0.058 Sum_probs=234.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEV 80 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) |+++++++++++++++.++++....+. ++.+.+.+.++++.+++....+ ++.+.... ... ..... T Consensus 3 i~~~~~~~~~e~~~~l~~~~~~~~~~e---------~~~~~~~~~~~~~~~~~~~~~~--~e~~~~~~---~~~-~~~~l 67 (377) T protein:vir:96 3 INLKELPKYREAVAELSAKISAGATPE---------EQEKLFEAAFTTMGDEILAKNE--EEMERMFD---LRD-KNREL 67 (377) T ss_pred ccHHHHHHHHHHHHHHHHHHhhcccHH---------HHHHHHHHHHHHHHHHHHHHHH--HHHHHHHH---hcc-CCccc Confidence 898999998888777776655321111 1112222333333333322110 01111100 000 00001 Q ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhh Q lcl|Aclame:pro 81 TQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKP 160 (394) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~ 160 (394) ..+ +.+.+ . ......+.++|+++||+++.+.|++.+.+.+||++ T Consensus 68 t~e---e~~~~---------------------------~------~~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~~ 111 (377) T protein:vir:96 68 TAE---EIKFF---------------------------N------DIDKNVGGKDKFKLLPEETMVQVFDDLVAEHPLLK 111 (377) T ss_pred CHH---HHHHH---------------------------H------HHHhcCCCCCCceecCHHHHHHHHHHHHhhhhhhh Confidence 100 00000 0 01123466778999999999999999999999999 Q ss_pred eeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHH Q lcl|Aclame:pro 161 FTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQI 240 (394) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~ 240 (394) +|++.+++ +...+|+.+ +.+.+.|+.|+++.+++++++|+++++++|+++++++||++||+||.+||++||.+.|+++ T Consensus 112 ~~~v~~~~-~~~~i~~~~-~~~~a~wv~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~~ll~ds~~~le~~i~~~l~~~ 189 (377) T protein:vir:96 112 VINFKNTS-LRLKALTAE-TSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEA 189 (377) T ss_pred hceeEecC-CceEEEEec-CCcceeEeecccccccccCccceeEeeeeeeEEeechhhHHHhhcchhhHHHHHHHHHHHH Confidence 99999985 456788653 4455566666667776788999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhcccccccccccc--------------------------------HHHHHHHHHhhhhhh---------- Q lcl|Aclame:pro 241 KVNTTNDAIAKVLKSFTTKTVKN--------------------------------LDEIKALLNGGFDPA---------- 278 (394) Q Consensus 241 ~~~~~~~a~~~g~~~~~~~~~~~--------------------------------~~~i~~~~~~~~~~~---------- 278 (394) ++.+++.+|++|+|++.|.|..+ .+.+++++..+..++ T Consensus 190 ~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~ 269 (377) T protein:vir:96 190 IAVALELAIVKGNGLLQPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLK 269 (377) T ss_pred HHHHHhhceEeccCCCcceeeeeccccccccccccccccceeeccccccccccCChhHHHHHHHHHHHhhcccccccccc Confidence 99999999999999887755432 244455444433322 Q ss_pred --cccEEEEcHHHHHHHHhhhccCCceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEE Q lcl|Aclame:pro 279 --YNVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRW 356 (394) Q Consensus 279 --~~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~ 356 (394) .+++|+|||+++..+ .|+|.|++ .+|.+.+++|+|+.++.+...+++.++||||++ |++++|.+++|+. T Consensus 270 ~~~~a~~~mn~~t~~~~------~~~~~~~~--~~G~~~~~l~~p~~v~~s~~~p~~~i~fgdf~~-Y~i~~r~~~~i~~ 340 (377) T protein:vir:96 270 IAGQVKLLLNPEDRWTL------EAKFTSRN--QFGEYVTVLPHGITILESLAVETGKAIAFVANR-YDAFMATASTIEE 340 (377) T ss_pred ccCceEEEEchhhHHhc------cccccccC--CCCCceeccCCCceEEecCCCCcccEEEEEcCc-EEEEEecccEEEe Confidence 256899999997654 57777765 356677899999887777777888899999998 8899999999999 Q ss_pred eeccccc---ceEEEEEEeccEEecccceEEEEecCc Q lcl|Aclame:pro 357 ADNEIYG---QYLQAVLRFGVSKVDDKAGYYVTFTPE 390 (394) Q Consensus 357 ~~~~~~~---~~~r~~~r~d~~v~~~~af~~l~~~~~ 390 (394) ++|.+|. +.+|+++|+||++++++||++++++-- T Consensus 341 ~~~~~~~~d~~~f~~~~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 341 YDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred ehhhhhhcCCeEEEEEEEEcCEEecCCcEEEEEEecC Confidence 9987764 468999999999999999999999988 No 66 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=100.00 E-value=1e-50 Score=294.65 Aligned_cols=341 Identities=11% Similarity=-0.017 Sum_probs=222.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccc Q lcl|Aclame:pro 3 EEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVTQ 82 (394) Q Consensus 3 ~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ 82 (394) |.-..+++++++.+.+...++.+.+++ .... +.+...+++.+++++..+........ T Consensus 1 mt~~~~~~e~~~~~~e~~~~~~~~~~~----~~~~-----e~~~~~~~~~~~~~~~~~~~~~~~e~-------------- 57 (395) T protein:vir:95 1 MADMKQNNVKLKNYHEHKKQFANLVQN----GASD-----EEQSKAFGAMFDALSNDLQEEITAEI-------------- 57 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHhh----hhhH-----HHHHHHHHHHHHHHHHHHHHHHHHHH-------------- Confidence 666666666665555444444332222 2111 11222222222332222211100000 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhhee Q lcl|Aclame:pro 83 EEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFT 162 (394) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~ 162 (394) ............+.. .....+.+... ......+.+.|+++||+++.+.|++.++..++|+++| T Consensus 58 -~~~~~~~~~~~~r~~---------~~l~~ee~~~~-------~~~~~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~ 120 (395) T protein:vir:95 58 -NNRVVDNGILAKRSQ---------DPLTSEERKFF-------NDINYDVGYTDEKILPETVVERVFDDLQKDHPLLSKI 120 (395) T ss_pred -HHHHHHHHHHhhcCc---------cccchHHHHHH-------HHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhc Confidence 000000000000000 00000000000 1112245667889999999999999999999999999 Q ss_pred eeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 163 TVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKV 242 (394) Q Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~ 242 (394) +++++++ ...+|+.. ..+.+.|+.++++.+++++++|+++++++|+++++++||+|||+|+.+|+++||.+.|+++++ T Consensus 121 ~v~~~~~-~~~i~~~~-~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~iS~ell~ds~~~ie~~i~~~la~~ia 198 (395) T protein:vir:95 121 NFQNAGI-KTRVIKAD-PAGQAVWGKVFGEIKGQLDAAFREENFTQYKLTCFVVLPDDLSTFGPAWIERFVRTQIQEAIS 198 (395) T ss_pred eeEecCC-ceEEEEec-CCcceEEeecccccCccccccceeeeeceeeEEEeecccHHHHhcchhHHHHHHHHHHHHHHH Confidence 9999864 55677643 334444444556666678999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhhcccccc--ccccc-------------------cHHHHHHHHHhhhhh---------------hcccEEEEc Q lcl|Aclame:pro 243 NTTNDAIAKVLKSFT--TKTVK-------------------NLDEIKALLNGGFDP---------------AYNVSLIVS 286 (394) Q Consensus 243 ~~~~~a~~~g~~~~~--~~~~~-------------------~~~~i~~~~~~~~~~---------------~~~a~~vm~ 286 (394) .+++.+|++|+|++. |.|.. +++++..+...+.+. ..+++|+|| T Consensus 199 ~~~~~a~i~G~G~~~~qP~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn 278 (395) T protein:vir:95 199 VALESAIINGGGAAKTQPVGLMKDVNTNSGAVTDKASSGTLTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVN 278 (395) T ss_pred HHHhhheeeccCCCCcCceeeeecccccccccccccccchhhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEc Confidence 999999999999863 54432 123332222211110 125689999 Q ss_pred HHHHHHHHhhhccCCceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeecccccc-- Q lcl|Aclame:pro 287 QSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEIYGQ-- 364 (394) Q Consensus 287 ~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~-- 364 (394) ++++. |.+|+|+|+| .+|.+.+++|+|+.++.+...+++.++||||++ |++++|.+++|+.+++.+|.. T Consensus 279 ~~t~~------~~~g~~~~~~--~~G~~~~~lg~g~~v~~~~~~p~~~i~fgdfs~-y~i~~r~~~~i~~~~~~~~~~d~ 349 (395) T protein:vir:95 279 PRDSW------DVQARYTYLT--ANGGFVTVLPYNVTIITSEFVPEGKLVAFVTDR-YNAVRGGGLTVKKFDQTLALEDA 349 (395) T ss_pred chhhh------hcCCcceecc--CCCcceeccCCcceEEEcCCCCCCcEEEEeccc-EEEEEecceEEEeccchhhhCCc Confidence 99865 5679999987 466777888777644445566778899999998 788999999999999887654 Q ss_pred -eEEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 365 -YLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 365 -~~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) .+|+++|+||+|++++||++|+++.+-.|. T Consensus 350 ~~f~~~~r~dg~~~~~~A~~~l~i~~~~~~~ 380 (395) T protein:vir:95 350 VLFTAKTFAYGQPDDNKASAVYDLKVASAPR 380 (395) T ss_pred EEEEEEEEECCEEeccccEEEEEeeccCCCC Confidence 589999999999999999999998444444 No 67 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=100.00 E-value=2.1e-50 Score=292.98 Aligned_cols=286 Identities=14% Similarity=0.075 Sum_probs=223.1 Q ss_pred chhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhhe Q lcl|Aclame:pro 82 QEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPF 161 (394) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~ 161 (394) +++.+..+ ...+........... ........+..++.++|+++++.|++.+++.++|+++ T Consensus 1 ~~~~~~~~----------------~~~~~f~~~~~~~~~----~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~ 60 (324) T protein:vir:97 1 MEQTQKLK----------------LNLQHFASNNVKPQV----FNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQL 60 (324) T ss_pred CccchhHH----------------HHHHHHHHhhhhhhh----hccccccccCCCcceechhHHHHHHHHHHhhcchhhh Confidence 11110000 000000000000000 0011123445577899999999999999999999999 Q ss_pred eeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 162 TTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIK 241 (394) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~ 241 (394) |++++++++++++|+.+. .+.+.|+.|++..+ .++++|+.+++++++++++++||+|+++|+.++++++|.+.|++++ T Consensus 61 ~~~~~~~~~~~~ip~~~~-~~~a~~v~Eg~~~~-~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~ai 138 (324) T protein:vir:97 61 GKYEPMEGTEKKFTFWAD-KPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAF 138 (324) T ss_pred cceeeccCCceEEEEEec-CcceeEeccCcccc-ccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHH Confidence 999999998999998764 45566666666655 5789999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhhccccccc---------------cccccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCceee Q lcl|Aclame:pro 242 VNTTNDAIAKVLKSFTT---------------KTVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLL 305 (394) Q Consensus 242 ~~~~~~a~~~g~~~~~~---------------~~~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l~ 305 (394) +.+++.++++|+|++.. .+..+++++.+++..+...++ +++|+|||+++..|++++|++|+|+| T Consensus 139 a~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~~~ 218 (324) T protein:vir:97 139 YKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERI 218 (324) T ss_pred HHHHHHHhhccCCCCccCccccccccccceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCceee Confidence 99999999999876532 133468999998877776665 57999999999999999999999998 Q ss_pred cccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeeccc--------------c---cceEEE Q lcl|Aclame:pro 306 QDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEI--------------Y---GQYLQA 368 (394) Q Consensus 306 ~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--------------~---~~~~r~ 368 (394) .++ .+++|+|+||+++++...+.+.++||||++ ++++++++++|+.+++.+ | ...+|+ T Consensus 219 ~~~----~~~tl~G~PV~~~~~~~~~~~~~~~gd~~~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~ 293 (324) T protein:vir:97 219 YDR----NSDTLDGLPVVNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRA 293 (324) T ss_pred cCC----CCccccceeeEeecCCCCCcceEEEEeccc-EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEE Confidence 643 456899999999887777888899999997 567789999999987643 3 345899 Q ss_pred EEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 369 VLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 369 ~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) ++|+|+++.+|+||++|+.+.++++- T Consensus 294 ~~r~d~~v~~~~a~~~l~~~~~~~~~ 319 (324) T protein:vir:97 294 TMHVALHIADDKAFAKLVPADKKTDS 319 (324) T ss_pred EEEeccEEecccceEEEEeccCCCCC Confidence 99999999999999999999887544 No 68 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=100.00 E-value=1.3e-50 Score=294.16 Aligned_cols=271 Identities=14% Similarity=0.107 Sum_probs=218.2 Q ss_pred hhhhhhh-hhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCccccccccccccccccc Q lcl|Aclame:pro 121 TTPVEPQ-KDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKP 199 (394) Q Consensus 121 ~~~~~~~-~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~ 199 (394) ....... .....+..++.++|+++.+.|++.+++.++|+++++++++.++.+++|+... ...+.|+.|++..+ .+++ T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~-~~~~ 78 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTPEQSQDYFAEIEKTSIVQRIARKVPMGPTGISIPHWTG-AVSASWTGEAERKP-ITKG 78 (330) T ss_pred CcccccchhhccccCCCcceechhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcC-CcceeEecCCCccc-cccc Confidence 1111111 1233345556678888999999999999999999999999998899998764 45566666666665 5789 Q ss_pred ccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc------------------- Q lcl|Aclame:pro 200 DFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTKT------------------- 260 (394) Q Consensus 200 ~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~------------------- 260 (394) +|+++++++++++++++||+|+++|+.++++++|.++|+++++.+++.++++|+|++.+.. T Consensus 79 ~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~~~~~~~~~~ 158 (330) T protein:vir:77 79 SFGKQELEPVKITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVVSLADTNLTT 158 (330) T ss_pred eeeEEEEeEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccceeecccccc Confidence 9999999999999999999999999999999999999999999999999999988754421 Q ss_pred -----cccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCceeecccccCC-----CcccccccceEEecCcc Q lcl|Aclame:pro 261 -----VKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAV-----SGKVLLGKPVFVLSDEV 329 (394) Q Consensus 261 -----~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~-----~~~~l~G~pV~~~~~~~ 329 (394) ...++++.+++..+..... +++|+||+++|..|++|||++|+|||+++...+ .+++|+|+||+++++++ T Consensus 159 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~PV~~~~~~p 238 (330) T protein:vir:77 159 ASGPQGNAYLAVNNALSLLVNSGKKWTGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVGAIREGRILGRPTYVADNVV 238 (330) T ss_pred cccccchhHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHHhccCCceeecCccccccccccCCceecceeeEEecccc Confidence 0125667776666555544 578999999999999999999999999866544 44699999999988765 Q ss_pred cc----cCceEEEeccccEEEEeecceEEEEeeccc------------------c---cceEEEEEEeccEEecccceEE Q lcl|Aclame:pro 330 LG----ANKAFIGDFKRGVLFADRKDLGLRWADNEI------------------Y---GQYLQAVLRFGVSKVDDKAGYY 384 (394) Q Consensus 330 ~~----~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~------------------~---~~~~r~~~r~d~~v~~~~af~~ 384 (394) .+ ...+++|||++ ++++++++++|+.+++.+ | ...+|+++|+|++|.+|+||++ T Consensus 239 ~~~~~~~~~~~~gd~s~-~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~ 317 (330) T protein:vir:77 239 NGTVGNRVVGVMGDFSQ-VIWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEAEFAFMVNDKDAFVK 317 (330) T ss_pred CCCCCCccEEEEEecce-EEEEEecCcEEEEeecceeeecccccccccccccchhhcCcEEEEEEEEeccEEecccceEE Confidence 43 23488999998 567889999999887754 2 2347999999999999999999 Q ss_pred EEecCccCCC Q lcl|Aclame:pro 385 VTFTPEPLPL 394 (394) Q Consensus 385 l~~~~~~~~~ 394 (394) |+.+++.+|. T Consensus 318 i~~~~~~~~~ 327 (330) T protein:vir:77 318 LTDQVAGTDP 327 (330) T ss_pred EEeccCCcCC Confidence 9999988777 No 69 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=100.00 E-value=1.4e-50 Score=293.87 Aligned_cols=263 Identities=13% Similarity=0.135 Sum_probs=222.1 Q ss_pred hhhhh-hcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCcccccccccccccccccccc Q lcl|Aclame:pro 124 VEPQK-DGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFK 202 (394) Q Consensus 124 ~~~~~-~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~ 202 (394) ....+ .+.+++.++.+||+++++.|++.+++.++|+++|++++++++...+|... ...+.|+.|++..+ .++++|+ T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~--~~~a~~v~E~~~~~-~~~~~f~ 77 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKPEEEFTFMS--GVGAFWVDEAERIQ-TSKPTFT 77 (299) T ss_pred CCcCCCcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecCCCcEEEEEEc--CCceeeeecCcccc-cccccee Confidence 12222 23445566789999999999999999999999999999999988888764 45566777666665 5789999 Q ss_pred eeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc---------------cccHHHH Q lcl|Aclame:pro 203 DVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTKT---------------VKNLDEI 267 (394) Q Consensus 203 ~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~---------------~~~~~~i 267 (394) ++++.++++++++++|+|+++|+.++++++|.+.|+++++++++.++++|+|++.+.+ ..+++++ T Consensus 78 ~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~gil~~~~~~~~~~~~~~~~~~~l 157 (299) T protein:vir:41 78 KAKMRSKKMGVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNWNILKSATDASNLVEETANKYDDL 157 (299) T ss_pred EEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccccccccccceeeccccccHHHH Confidence 9999999999999999999999999999999999999999999999999998765532 2357899 Q ss_pred HHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCceeecccccCCCcccccccceEEecCcccccC--ceEEEeccccE Q lcl|Aclame:pro 268 KALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGAN--KAFIGDFKRGV 344 (394) Q Consensus 268 ~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~--~~~~gd~~~~~ 344 (394) .+++..+...++ +++|+|||+++.+|++++|++|+|||++++..+. ++|+|+||+++++++.+.+ .++||||++ + T Consensus 158 ~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~-~~l~G~PV~~~~~~~~~~~~~~~~~gdfs~-~ 235 (299) T protein:vir:41 158 NEAIGLIEAEDLEPNGIATIRKQRVKYRSTKDGNGMPIFNTATSNGV-DDVLGLPIAYTPKYTFGDKDISELVGDWNQ-A 235 (299) T ss_pred HHHHHhhhcccCCcCEEEEcHHHHHHHHHhhccCCceeecCCcCCCC-ceecceeeEEecccCCCCCceEEEEEeccc-E Confidence 999887665554 5789999999999999999999999998877654 5899999999987765543 488999987 5 Q ss_pred EEEeecceEEEEeeccc--------------cc---ceEEEEEEeccEEecccceEEEEecCcc Q lcl|Aclame:pro 345 LFADRKDLGLRWADNEI--------------YG---QYLQAVLRFGVSKVDDKAGYYVTFTPEP 391 (394) Q Consensus 345 ~~~~~~~~~i~~~~~~~--------------~~---~~~r~~~r~d~~v~~~~af~~l~~~~~~ 391 (394) +++++++++++.+++.+ |+ ..+|+++|+|+++.+|+||++|+.+++= T Consensus 236 ~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa~ 299 (299) T protein:vir:41 236 YYGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKAGN 299 (299) T ss_pred EEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccCC Confidence 67889999999988754 22 3479999999999999999999999888 No 70 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=100.00 E-value=6.9e-50 Score=290.11 Aligned_cols=346 Identities=11% Similarity=-0.040 Sum_probs=220.1 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEV 80 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) |-++ |++..+++. ++.+++.+.+++.... .++.+.+.+.++.+.+++.+. ..++.+... T Consensus 1 M~~k-l~~~~~~~~---e~~~~l~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~------------- 59 (383) T protein:vir:78 1 MTIK-LKNNLANYE---EKRTAFVNAVKNEDTQ--EIQNKAYVEMVDAMAADIMEQ--AKKEARQEA------------- 59 (383) T ss_pred Cchh-HHHHHHHHH---HHHHHHHHHHhccChH--HHHHHHHHHHHHHHHHHHHHH--HHHHHHHHH------------- Confidence 6643 433333333 3333333333221111 111112222222222222110 000000000 Q ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhh Q lcl|Aclame:pro 81 TQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKP 160 (394) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~ 160 (394) +.+....+.. .....+.+.... .....++++|+++||+++.+.|++.+.+.++|++ T Consensus 60 --------~~~~~~~~g~---------~~lt~~e~~~~~-------~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~l~~ 115 (383) T protein:vir:78 60 --------DAYISASRTD---------KNITNEEIKFFN-------DINKEVGYKEETLLPQTVVDEIFEDLTTEHPFLA 115 (383) T ss_pred --------HHHHHhcCCh---------hhhhHHHHHHHH-------HHhccCCCCCccccCHHHHHHHHHHHHhhcccee Confidence 0000000000 000000000001 1123456678899999999999999999999999 Q ss_pred eeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHH Q lcl|Aclame:pro 161 FTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQI 240 (394) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~ 240 (394) +|++.++++ ...+|+.+ +.+.+.|+.+.++.++.++++|+++++++|+++++++||++||+|+.+||++||.+.|+++ T Consensus 116 ~~~v~~~~~-~~~i~~~~-~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~i~is~ell~Ds~~~ie~~i~~~l~~~ 193 (383) T protein:vir:78 116 SIGMRTTGL-RTKFLKSE-TSGVAVWGKIFGEIKGQLDATFSDEESIQNKLTAFVVVPKDLEKFGPAWVKRFVVTQIEEA 193 (383) T ss_pred eeeeEecCC-ceEEEEEc-CCcceEEeecccccccccCcceeeEeecceeeEeeccchHHHhhccHHHHHHHHHHHHHHH Confidence 999999865 46788754 3445556666677777788999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhcccccccccccc-----------------------HHHHHHHHHhhhhhhcccEEEEcHHHHHHHHhhh Q lcl|Aclame:pro 241 KVNTTNDAIAKVLKSFTTKTVKN-----------------------LDEIKALLNGGFDPAYNVSLIVSQSFYQTLDTLK 297 (394) Q Consensus 241 ~~~~~~~a~~~g~~~~~~~~~~~-----------------------~~~i~~~~~~~~~~~~~a~~vm~~~~~~~l~~lk 297 (394) ++.+++.+|++|+|++.|.|..+ ++++..+...+..-+.++.|+||..++..+++++ T Consensus 194 ~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 273 (383) T protein:vir:78 194 FAVALESAYIVGDGNDKPIGLNRKVGKGSTVVDGVYAEKAATGTLTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVT 273 (383) T ss_pred HHHHHhhheEeccCCCCceeeeeccCCcccccccccccccccchhhhhhhHHHHHHHHHHHhccchhcccchhhhcCceE Confidence 99999999999999887655421 2333333333322223455666666666555544 Q ss_pred c---cCCceeecccc----cCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeeccccc---ceEE Q lcl|Aclame:pro 298 D---GNGRYLLQDDI----TAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEIYG---QYLQ 367 (394) Q Consensus 298 d---~~G~~l~~~~~----~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~---~~~r 367 (394) . ..+.|.|+|.. .+|.+.+++|+|+.++.+...+++.++||||++ |++++|++++|+.+++.+|. +.|| T Consensus 274 ~~~n~~~~~~~~~~~~~~~~~G~~~t~l~~~~~iv~s~~~p~~~iifgdfs~-Y~i~~r~~~~i~~~~~~~f~~d~~~f~ 352 (383) T protein:vir:78 274 LLVNPTDAWDVKKQYTSLNANGVYVTALPFNLNIIESLFVPEKKAISYVAER-YDALIGGPLDIGTYDQTLAIEDLNLYA 352 (383) T ss_pred EEEcCcchhhhccchhccCCCCceeeecCCCceEEecCCCCcccEEEeeccc-eEEEecccceEEecchhhhhcCceEEE Confidence 1 11222343332 234455788999876666667788899999998 78899999999999998775 4689 Q ss_pred EEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 368 AVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 368 ~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) +++|+||++++|+||+.++++.++.|- T Consensus 353 ~~~r~dG~~~~~~A~~vl~~~~~~~~~ 379 (383) T protein:vir:78 353 AKQFAYGKAKDDKAAAVWTLNINPAEQ 379 (383) T ss_pred EEEEEcCEEecCCeEEEEEEEecCCCC Confidence 999999999999999999988777555 No 71 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=100.00 E-value=2.4e-50 Score=292.58 Aligned_cols=260 Identities=14% Similarity=0.079 Sum_probs=214.0 Q ss_pred hcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecH Q lcl|Aclame:pro 129 DGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNI 208 (394) Q Consensus 129 ~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~ 208 (394) ...++..++.++|++++..|++.+++.++++++|+++++.++..++|+... ...+.|+.|++..+ +++++|+++++++ T Consensus 1 ma~~t~~~G~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~~p~~~~-~~~a~wv~Eg~~~~-~s~~~f~~v~l~~ 78 (300) T protein:vir:95 1 MSEAQLSKGNLFNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQREFVFDF-DSDIDIVAENGKKT-HGGVSLDPVTIVP 78 (300) T ss_pred CcccccCCcceechhhHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEec-CcceEEeeCCcccc-cccccceeeEeee Confidence 334455557789999999999999999999999999999999999998764 45566666666655 6889999999999 Q ss_pred hhhhhhhhhhHHHHh---ccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---------------------cccccH Q lcl|Aclame:pro 209 DTYRGAIPLSQESID---DADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTT---------------------KTVKNL 264 (394) Q Consensus 209 ~~~~~~~~vs~ell~---ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~---------------------~~~~~~ 264 (394) |+++++++||+||++ |+.++|+++|.++|+++++.+++.++++|.+.++. .+...+ T Consensus 79 ~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (300) T protein:vir:95 79 LKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVTQTVPFKDTNPD 158 (300) T ss_pred EEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccceeecccccchH Confidence 999999999999994 67789999999999999999999999988532111 112336 Q ss_pred HHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCceeecccccCCCcccccccceEEecCcccc----cCceEEEe Q lcl|Aclame:pro 265 DEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLG----ANKAFIGD 339 (394) Q Consensus 265 ~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~----~~~~~~gd 339 (394) +++.+++..+...++ +++|+|||+++..|++|||++|+|||.+...++.+++|+|+||++++....+ ...+++|| T Consensus 159 ~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~GD 238 (300) T protein:vir:95 159 ESMEDAVGMIDGSERDITGAILDPIFTTALSKMKNAEGGKLYPELAWGGVPDAINGLAVDKNRTVSYSQTDPKNTAIVGD 238 (300) T ss_pred HHHHHHHHHhhhcCCCccEEEECHHHHHHHHHhhccCCCeeccCccccCCCceecceeeEEecCCCCCCCCCccEEEEee Confidence 777787766655555 4689999999999999999999999988888888899999999997755432 23467899 Q ss_pred ccccEEEEeecceEEEEeecc--------ccc---ceEEEEEEeccEEecccceEEEEecCc Q lcl|Aclame:pro 340 FKRGVLFADRKDLGLRWADNE--------IYG---QYLQAVLRFGVSKVDDKAGYYVTFTPE 390 (394) Q Consensus 340 ~~~~~~~~~~~~~~i~~~~~~--------~~~---~~~r~~~r~d~~v~~~~af~~l~~~~~ 390 (394) |++++.+..|++++++++++. +|+ ..+|+++|+|+.|.+|+||++|+..+= T Consensus 239 f~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~g 300 (300) T protein:vir:95 239 FETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKTGG 300 (300) T ss_pred ccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecceeecccceEEEecCCC Confidence 998877778999999987542 133 468999999999999999999988877 No 72 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=100.00 E-value=5.1e-50 Score=290.82 Aligned_cols=256 Identities=14% Similarity=0.085 Sum_probs=211.6 Q ss_pred cccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhh Q lcl|Aclame:pro 132 KKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTY 211 (394) Q Consensus 132 ~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~ 211 (394) -...+++++|++++.+|++.+++.++|+++|+++++.++..++|+.+. ...+.|++|++..+ .++++|+++++++|++ T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip~~~~-~~~a~~v~E~~~~~-~~~~~f~~v~l~~~k~ 78 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTM-DSEIDVVAESGKKT-HGGVTLAPQTMVPIKV 78 (298) T ss_pred CcccCcceechhHHHHHHHHHHhhhhhhhhcceeeccCCceEEEEEec-CcceEEecCCcccc-ccccceeEEEEeeeeE Confidence 335557899999999999999999999999999999988899998764 45566777666666 5789999999999999 Q ss_pred hhhhhhhHHHHh---ccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--cc---------------------ccccHH Q lcl|Aclame:pro 212 RGAIPLSQESID---DADVDLVGIVSESISQIKVNTTNDAIAKVLKSFT--TK---------------------TVKNLD 265 (394) Q Consensus 212 ~~~~~vs~ell~---ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~--~~---------------------~~~~~~ 265 (394) ++++++|+|+++ |+..+|+++|.++|+++++++++.++++|.+.++ +. +...++ T Consensus 79 a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (298) T protein:vir:16 79 EYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNG 158 (298) T ss_pred EEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcccccccccccccccccccccccccccHHH Confidence 999999999995 5567899999999999999999999999853211 10 001145 Q ss_pred HHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCceeecccccCCCcccccccceEEecCccc----ccCceEEEec Q lcl|Aclame:pro 266 EIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVL----GANKAFIGDF 340 (394) Q Consensus 266 ~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~----~~~~~~~gd~ 340 (394) ++.+++..+....+ +++|+||++++..|++|||++|+|+|++.+..+.+++|+|+||++++.... +...+++||| T Consensus 159 ~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~v~~~~~~~~~~~~~GDf 238 (298) T protein:vir:16 159 AIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQDNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDF 238 (298) T ss_pred HHHHHHHHhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceecceeeEEecccccccCCCccEEEEeec Confidence 67777766665544 468999999999999999999999999988888899999999999775432 2346788999 Q ss_pred cccEEEEeecceEEEEeecc--------ccc---ceEEEEEEeccEEecccceEEEEecC Q lcl|Aclame:pro 341 KRGVLFADRKDLGLRWADNE--------IYG---QYLQAVLRFGVSKVDDKAGYYVTFTP 389 (394) Q Consensus 341 ~~~~~~~~~~~~~i~~~~~~--------~~~---~~~r~~~r~d~~v~~~~af~~l~~~~ 389 (394) ++++.++.+.+++++++++. +|+ ..+|+++|+|++|++|+||++|+..+ T Consensus 239 s~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 239 ANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred cceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeecC Confidence 99887888999999987642 233 45899999999999999999999998 No 73 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=100.00 E-value=8e-50 Score=289.76 Aligned_cols=273 Identities=11% Similarity=0.082 Sum_probs=220.2 Q ss_pred HHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCccccccccccccccc Q lcl|Aclame:pro 118 INETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALA 197 (394) Q Consensus 118 ~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~ 197 (394) +.+...........++.. +.++|+++...|++.+++.++|+++++++++.++.+++|+.+. ...+.|+.|++..+ .+ T Consensus 1 ~g~~~e~~~~~~~~t~~~-~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~-~~~a~wv~Eg~~~~-~s 77 (397) T protein:vir:23 1 MGFSADHSQIAQTKDTMF-TGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGIVIPHWTG-DVSAQWIGEGDMKP-IT 77 (397) T ss_pred CCcCHHHHHHhhccCCCC-ccccchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEcC-CcceEEecCCcccc-cc Confidence 222222222223333333 4456677899999999999999999999999988899998764 44556666655555 68 Q ss_pred ccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc--------------ccccc Q lcl|Aclame:pro 198 KPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTT--------------KTVKN 263 (394) Q Consensus 198 ~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~--------------~~~~~ 263 (394) +++|+++++++|+++++++||+||++|+.++++++|++.|+++++.+++.++++|+|++.+ .+... T Consensus 78 ~~~f~~v~l~~~k~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~~~~~~~~~~~~~~~~~~~ 157 (397) T protein:vir:23 78 KGNMTKRDVHPAKIATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGTNAPSAFQGYLDQSNKTQSISPNAY 157 (397) T ss_pred ccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccCCcccccccccccceeeecccch Confidence 8999999999999999999999999999999999999999999999999999999887543 12234 Q ss_pred HHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCceeecccccCCCc-----ccccccceEEecCcccccCceEE Q lcl|Aclame:pro 264 LDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSG-----KVLLGKPVFVLSDEVLGANKAFI 337 (394) Q Consensus 264 ~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~-----~~l~G~pV~~~~~~~~~~~~~~~ 337 (394) ++++.+++..+...++ +++|+||++++..|+++||++|+|+|+++...+.+ ++|+|+||++.++++.+...+++ T Consensus 158 ~~~~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~tl~G~Pv~~s~~~~~g~~~~~~ 237 (397) T protein:vir:23 158 QGLGVSGLTKLVTDGKKWTHTLLDDTVEPVLNGSVDANGRPLFVESTYESLTTPFREGRILGRPTILSDHVAEGDVVGYA 237 (397) T ss_pred hHHHHHHHHhhhhcccCCCEEEEcHHHHHHHHHhhccCCceeecccccccccccccCceeeeeeEEEeCCCCCCceEEEE Confidence 6677777776666655 58999999999999999999999999987765533 58999999998888777777899 Q ss_pred EeccccEEEEeecceEEEEeeccc--------------c---cceEEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 338 GDFKRGVLFADRKDLGLRWADNEI--------------Y---GQYLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 338 gd~~~~~~~~~~~~~~i~~~~~~~--------------~---~~~~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) |||++ +++.+++++.++.+++.+ | ...+|+++|+|+++++|+||++++..+..+.. T Consensus 238 gDfs~-~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~~~~~ 310 (397) T protein:vir:23 238 GDFSQ-IIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPVLTTY 310 (397) T ss_pred eecce-EEEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccceecccceEEEeecccccee Confidence 99997 457789999999887654 2 24589999999999999999999998887555 No 74 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=100.00 E-value=1.4e-49 Score=288.48 Aligned_cols=259 Identities=12% Similarity=0.043 Sum_probs=209.2 Q ss_pred cccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHh Q lcl|Aclame:pro 130 GIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNID 209 (394) Q Consensus 130 ~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~ 209 (394) ..+.+.|++++|+++.+.|++.+++.++|+++|+++++.++..++|+.+. ...+.|+.|++..+ .++++|+++++.++ T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~~~~p~~~~-~~~a~wv~Eg~~~~-~~~~~f~~v~l~~~ 78 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQQYMTLTA-PPRGEVVGEGAQKS-ESTATFAPVTAIPR 78 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhcceeecCCCceEEEEEeC-CceeEEeecCcccc-cccceeeEEEEeeE Confidence 44556678999999999999999999999999999999999999998753 44556666666665 68899999999999 Q ss_pred hhhhhhhhhHHHHh---ccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc---------------------cc-H Q lcl|Aclame:pro 210 TYRGAIPLSQESID---DADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTKTV---------------------KN-L 264 (394) Q Consensus 210 ~~~~~~~vs~ell~---ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~~---------------------~~-~ 264 (394) ++++++++|+|+++ |+..+|+++|.++|++++++.++.++++|+++++.... .. + T Consensus 79 kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~~~~~~~~~~~ 158 (311) T protein:vir:81 79 KVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVELTTGTSATPD 158 (311) T ss_pred EEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeeeecccccchHH Confidence 99999999999996 56678999999999999999999999999643322100 11 2 Q ss_pred HHHHHHHHhhhhhhcc-cEEEEcHHHHHHHHhhhccCCceeecccccCCCcccccccceEEecCcc-------------- Q lcl|Aclame:pro 265 DEIKALLNGGFDPAYN-VSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEV-------------- 329 (394) Q Consensus 265 ~~i~~~~~~~~~~~~~-a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~-------------- 329 (394) .++.+++.......++ .+|+||+.++.+|++|||++|+|+|.+....+.+++|+|+||++++.++ T Consensus 159 ~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~Pv~~~~~i~~~~~~~~~~~~~~~ 238 (311) T protein:vir:81 159 LAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVRGGPEAVTASTGVYR 238 (311) T ss_pred HHHHHHHHHhhhcCCCceEEEEcHHHHHHHHhhhccCCCeeecCccccCCCceecceeEEecccccccccccccccchhc Confidence 3344444333333344 3599999999999999999999999988888889999999999865432 Q ss_pred --cccCceEEEeccccEEEEeecceEEEEeeccc-------cc---ceEEEEEEeccEEecccceEEEEecCcc Q lcl|Aclame:pro 330 --LGANKAFIGDFKRGVLFADRKDLGLRWADNEI-------YG---QYLQAVLRFGVSKVDDKAGYYVTFTPEP 391 (394) Q Consensus 330 --~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~-------~~---~~~r~~~r~d~~v~~~~af~~l~~~~~~ 391 (394) .+...+++|||++ |++..+.+++++.+++.. |+ ..+|+++|+|++|++|+||++|+..+.+ T Consensus 239 ~~~~~~~~~~gDfs~-~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a~~~ 311 (311) T protein:vir:81 239 TTNPNVKAIAGDFSA-FRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) T ss_pred ccCCccEEEEEeccc-EEEEEeccceEEEeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 1234578999997 566778999999876531 33 3589999999999999999999999999 No 75 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=100.00 E-value=2.9e-49 Score=286.70 Aligned_cols=286 Identities=13% Similarity=0.072 Sum_probs=220.8 Q ss_pred chhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhhe Q lcl|Aclame:pro 82 QEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPF 161 (394) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~ 161 (394) +++.+..+ ...+............ ..........++.++|+++++.|++.+++.++|+++ T Consensus 1 ~~k~~~~~----------------~~~~~~~~~~~~~~~~----~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~ 60 (324) T protein:vir:99 1 MEQTQKLK----------------LNLQHFASNNVKPQVF----NPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMRL 60 (324) T ss_pred CCCchHhh----------------HHHHHHHHHhhhhhhc----cccceeccCCCcceechhHHHHHHHHHHhhchhhhh Confidence 11110000 0000000000000000 011122334445699999999999999999999999 Q ss_pred eeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 162 TTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIK 241 (394) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~ 241 (394) |+++++.++++.+|+.+. ...+.|++|++..+ .++++|++++++++++++++++|+|+++|+.+++++||.+.|++++ T Consensus 61 ~~~~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~-~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai 138 (324) T protein:vir:99 61 GKYEPMEGTEKKFTFWAD-KPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAF 138 (324) T ss_pred cceeeccCCceEEEEEec-CcceeEeccCcccc-ccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHH Confidence 999999999999998763 45566666666665 5789999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhhccccccc---------------cccccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCceee Q lcl|Aclame:pro 242 VNTTNDAIAKVLKSFTT---------------KTVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLL 305 (394) Q Consensus 242 ~~~~~~a~~~g~~~~~~---------------~~~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l~ 305 (394) +.+++.+++.|+|++.. .+..+++++.+++..+...++ +++|+|||++|..|++++|++|+|+| T Consensus 139 ~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~~~~ 218 (324) T protein:vir:99 139 YKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERI 218 (324) T ss_pred HHHHHHHhhhcCCCCccCccccccccccceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCceee Confidence 99999999988876532 123458889998877766554 57899999999999999999999998 Q ss_pred cccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeeccc--------------c---cceEEE Q lcl|Aclame:pro 306 QDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEI--------------Y---GQYLQA 368 (394) Q Consensus 306 ~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--------------~---~~~~r~ 368 (394) .+ +.+++|+|+||++++....+.+.+++|||++ ++++++++++|+.+++.+ | ...+|+ T Consensus 219 ~~----~~~~~l~G~PVv~~~~~~~~~~~~i~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~ 293 (324) T protein:vir:99 219 YD----RNSDTLDGLPVVNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRA 293 (324) T ss_pred cC----CCCccccceeEEeecCCCCCcceEEEEeccc-EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEE Confidence 54 3456899999999887777888899999997 567889999999988743 3 345899 Q ss_pred EEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 369 VLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 369 ~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) ++|+|++|.+|+||++|+.+++++.- T Consensus 294 ~~r~d~~v~~~~a~~~lt~a~~~~~~ 319 (324) T protein:vir:99 294 TMHVALHIADDKAFAKLVPADKKTDS 319 (324) T ss_pred EEEEccEEecccceEEEEeccCCCCC Confidence 99999999999999999998887544 No 76 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=100.00 E-value=1.6e-49 Score=288.17 Aligned_cols=281 Identities=11% Similarity=0.078 Sum_probs=216.5 Q ss_pred HHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCcccccc Q lcl|Aclame:pro 109 EGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVA 188 (394) Q Consensus 109 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 188 (394) .+.. ..+...+....+.++...+++.++.++|+++++.|++.+++.++|+++|+++++.++.+++|+.+. ...+.|++ T Consensus 1 ~~~~-~~r~~~~~~~~e~~a~~~~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~-~~~a~~v~ 78 (326) T protein:vir:42 1 MAVN-PDRTTPFLGVNDPKVAQTGDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMGTTGQKIPHWTG-DVSASWIG 78 (326) T ss_pred CCCC-ccchhhhcCcchhhheeccccCCcceechhhHHHHHHHHHhcchhhhhcceeeccCCceEEEEEeC-CcceEEec Confidence 0000 001111122222333344445556689999999999999999999999999999998999998763 45556666 Q ss_pred cccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc------ Q lcl|Aclame:pro 189 ELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTKTVK------ 262 (394) Q Consensus 189 e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~~~------ 262 (394) |++..+ +++++|++++++++++++++++|+|+++||.+++++||.++|+++++++++.++++|+|++.+.+.. T Consensus 79 Eg~~~~-~~~~~f~~i~~~~~k~~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p~gi~~~~~~~ 157 (326) T protein:vir:42 79 EGDMKP-ITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAINGTDSPFPTFLAQTTKEV 157 (326) T ss_pred CCcccc-ccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccccccc Confidence 665555 5789999999999999999999999999999999999999999999999999999999876553321 Q ss_pred --------------cHHHH--HHHHHhhhhhh-cccEEEEcHHHHHHHHhhhccCCceeecccccCCCc-----cccccc Q lcl|Aclame:pro 263 --------------NLDEI--KALLNGGFDPA-YNVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSG-----KVLLGK 320 (394) Q Consensus 263 --------------~~~~i--~~~~~~~~~~~-~~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~-----~~l~G~ 320 (394) ++.++ .+.+....... .+++|+|||+++..|++|||++|+|||++....+.+ ++|+|+ T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~ 237 (326) T protein:vir:42 158 SLVDPDGTGSNADLTVYDAVAVNALSLLVNAGKKWTHTLLDDITEPILNGAKDKSGRPLFIESTYTEENSPFRLGRIVAR 237 (326) T ss_pred ceeecccccccccchhHHHHHHHHHhhhhhhccCccEEEEeHHHHHHHHHhhccCCceeeccccccCccccccCceeeee Confidence 11111 12222222222 368899999999999999999999999987655543 479999 Q ss_pred ceEEecCcccccCceEEEeccccEEEEeecceEEEEeeccc--------------c---cceEEEEEEeccEEecccceE Q lcl|Aclame:pro 321 PVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEI--------------Y---GQYLQAVLRFGVSKVDDKAGY 383 (394) Q Consensus 321 pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--------------~---~~~~r~~~r~d~~v~~~~af~ 383 (394) ||+++++.+.+...+++|||+++ +++++++++|+.+++.+ | ...+|+++|+|++|.+|+||+ T Consensus 238 pv~~~~~~~~~~~~~~~Gd~s~~-~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~d~~v~~~~a~~ 316 (326) T protein:vir:42 238 PTILSDHVASGTVVGYQGDFRQL-VWGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEYAFHCNDKDAFV 316 (326) T ss_pred eEEEcCCCCCCceEEEEeecceE-EEEEecceEEEEeecceeeecccccccchhhhhcCcEEEEEEEEeccEEecccceE Confidence 99998877666666789999985 57789999999887654 2 345899999999999999999 Q ss_pred EEEecCccCC Q lcl|Aclame:pro 384 YVTFTPEPLP 393 (394) Q Consensus 384 ~l~~~~~~~~ 393 (394) +|+..+++-. T Consensus 317 ~l~~~~~~~~ 326 (326) T protein:vir:42 317 KLTNVDATEA 326 (326) T ss_pred EEeeccccCC Confidence 9999988866 No 77 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=100.00 E-value=1.9e-49 Score=287.66 Aligned_cols=264 Identities=13% Similarity=0.089 Sum_probs=215.9 Q ss_pred hhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCc-eeEEEEecCCCcccccccccccccccc Q lcl|Aclame:pro 120 ETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKAS-GKYPVLQRATTKMVTVAELEKNPALAK 198 (394) Q Consensus 120 ~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~e~~~~~~~~~ 198 (394) +...........+++.++.+||+++++.|++.+++.++|+++|+++++++.. ..+|+. .....+.|+.|++..+ .++ T Consensus 1 m~~~~~~~~~~~~t~~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~-~~~~~a~~v~Eg~~~~-~~~ 78 (297) T protein:vir:95 1 MTVQTFNPENVLVSQKKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQ-TDGISAYWVNETEKIK-TDK 78 (297) T ss_pred CCccccccccccccCCCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEE-cCCceeEEeecCcccc-ccc Confidence 1111111112234556677999999999999999999999999999997665 345544 3445566666666665 578 Q ss_pred cccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc--------------cccH Q lcl|Aclame:pro 199 PDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTKT--------------VKNL 264 (394) Q Consensus 199 ~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~--------------~~~~ 264 (394) ++|++++++++++++++++|+|+++|+.++++++|.+.|+++++++++.++++|+|++++.+ ..++ T Consensus 79 ~~f~~v~l~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~gi~~~~~~~~~~~~~~~t~ 158 (297) T protein:vir:95 79 PEVVPVTLKAHKLGIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFANSVAKAAKDANKVIGGPINY 158 (297) T ss_pred cceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccccccccccccceecccccCH Confidence 99999999999999999999999999999999999999999999999999999988755432 2468 Q ss_pred HHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCceeecccccCCCcccccccceEEecCcccccCceEEEecccc Q lcl|Aclame:pro 265 DEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRG 343 (394) Q Consensus 265 ~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~ 343 (394) +++.+++..+...++ +++|+||++++.+|++|+|++|+|+|++. +++|+|+||++..+...+.+.+++|||++ T Consensus 159 ~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~G~~i~~~~-----~~~l~G~Pv~~~~~~~~~~~~~~~gd~s~- 232 (297) T protein:vir:95 159 DNILKLQDALYDADVEPNAFVSKIQNRSALREARDGNKVSIYDKA-----ANTIDGITTVDLKSARFEKGDLLAGDFDN- 232 (297) T ss_pred HHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCceeecCC-----CCcccceeeEeecCCCCCCceEEEEeccc- Confidence 999999888776664 57999999999999999999999999653 46899999998877778889999999997 Q ss_pred EEEEeecceEEEEeeccc--------------cc---ceEEEEEEeccEEecccceEEEEecCcc Q lcl|Aclame:pro 344 VLFADRKDLGLRWADNEI--------------YG---QYLQAVLRFGVSKVDDKAGYYVTFTPEP 391 (394) Q Consensus 344 ~~~~~~~~~~i~~~~~~~--------------~~---~~~r~~~r~d~~v~~~~af~~l~~~~~~ 391 (394) ++++++++++++.+++.+ |+ ..+|+++|+|++|.+|+||++|+.+|.. T Consensus 233 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~a~~~l~~at~~ 297 (297) T protein:vir:95 233 LIYGVPYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTDAFAKLTPAERV 297 (297) T ss_pred EEEEEecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEeecCCC Confidence 567889999999887653 33 3589999999999999999999876666 No 78 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=100.00 E-value=4.1e-49 Score=285.89 Aligned_cols=285 Identities=13% Similarity=0.081 Sum_probs=221.4 Q ss_pred chhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhh Q lcl|Aclame:pro 82 QEEKTYRE-SVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKP 160 (394) Q Consensus 82 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~ 160 (394) .++.+..+ ....+. ......... ..........++.++|+++++.|++.+++.++|++ T Consensus 1 ~~~~~~~~~~~~~f~-----------------~~~~~~~~~----~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~ 59 (324) T protein:vir:10 1 MEQTQKLKLNLQHFA-----------------SNNVKPQVF----NPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQ 59 (324) T ss_pred CCCchHHHHHHHHHH-----------------HHhhcccee----cccceeccCCCcceechhHHHHHHHHHHhhchhhh Confidence 11100000 000000 000000000 01112233445569999999999999999999999 Q ss_pred eeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHH Q lcl|Aclame:pro 161 FTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQI 240 (394) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~ 240 (394) +|+++++.++++.+|+... ...+.|++|++..+ .++++|++++++++++++++++|+|+++|+.+++++||.+.|+++ T Consensus 60 ~~~~~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~-~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~a 137 (324) T protein:vir:10 60 LGKYEPMEGTEKKFTFWAD-KPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEA 137 (324) T ss_pred hcceeeccCCceEEEEEeC-CcceeEeccCcccc-ccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHH Confidence 9999999999999998753 45566666666665 578999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhccccccc---------------cccccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCcee Q lcl|Aclame:pro 241 KVNTTNDAIAKVLKSFTT---------------KTVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYL 304 (394) Q Consensus 241 ~~~~~~~a~~~g~~~~~~---------------~~~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l 304 (394) ++.+++.++++|+|++.. .+..+++++.+++..+...++ .++|+|||++|..|++++|++|+|+ T Consensus 138 i~~~~d~a~l~G~g~~~~~~~i~~~~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~~~ 217 (324) T protein:vir:10 138 FYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKER 217 (324) T ss_pred HHHHHHHHhhhcCCCCccCccccccccccceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCcee Confidence 999999999998876532 123468899998877766655 5789999999999999999999999 Q ss_pred ecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeeccc--------------c---cceEE Q lcl|Aclame:pro 305 LQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEI--------------Y---GQYLQ 367 (394) Q Consensus 305 ~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--------------~---~~~~r 367 (394) |.+ +.+++|+|+||+++++...+.+.+++|||++ ++++++++++|+.+++.+ | ...+| T Consensus 218 ~~~----~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r 292 (324) T protein:vir:10 218 IYD----RNSDTLDGLPVVNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALR 292 (324) T ss_pred ecC----CCCccccceeEEeecCCCCCcceEEEEeccc-EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEE Confidence 854 3456899999999887777888899999997 557789999999987743 2 34579 Q ss_pred EEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 368 AVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 368 ~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) +++|+|+.|.+|+||++|+..+++++- T Consensus 293 ~~~r~d~~v~~~~A~~~l~~a~~~~~~ 319 (324) T protein:vir:10 293 ATMHVALHIADDKAFAKLVPADKKTDS 319 (324) T ss_pred EEEEEccEEecccceEEEEeccCCCCC Confidence 999999999999999999999988654 No 79 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=100.00 E-value=2.4e-49 Score=287.18 Aligned_cols=259 Identities=15% Similarity=0.086 Sum_probs=213.5 Q ss_pred cccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHh Q lcl|Aclame:pro 130 GIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNID 209 (394) Q Consensus 130 ~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~ 209 (394) ..+.+.+++++|++++..|++.+++.++|+++|+++++.++..++|+.+. ...+.|+.|++..+ .++++|+++++++| T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip~~~~-~~~a~wv~E~~~~~-~s~~~f~~v~l~~~ 78 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEFTFTL-DSDIDVVAENGKKT-HGGLSLEPVTIVPI 78 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEec-CcceEEeecCcccc-ccccceeeEEeeeE Confidence 33555678899999999999999999999999999999999999998754 44566666666655 68899999999999 Q ss_pred hhhhhhhhhHHHHh---ccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc----------------------cccccH Q lcl|Aclame:pro 210 TYRGAIPLSQESID---DADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTT----------------------KTVKNL 264 (394) Q Consensus 210 ~~~~~~~vs~ell~---ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~----------------------~~~~~~ 264 (394) ++++++++|+||++ |+.++|.++|.+.|+++++++++.++++|+++.+. .+...+ T Consensus 79 kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (303) T protein:vir:97 79 KVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVTQVVKFTESEDAD 158 (303) T ss_pred EEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccccccccccccccccccccccccchH Confidence 99999999999994 67789999999999999999999999988532111 112236 Q ss_pred HHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCceeecccccC-CCcccccccceEEecCccc------ccCceE Q lcl|Aclame:pro 265 DEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITA-VSGKVLLGKPVFVLSDEVL------GANKAF 336 (394) Q Consensus 265 ~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~-~~~~~l~G~pV~~~~~~~~------~~~~~~ 336 (394) +++.+++..+...++ ++.|+|||+++.+|++|||++|+|+|+|++.. +.+++|+|+||++++.++. +...++ T Consensus 159 ~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~g~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~~ 238 (303) T protein:vir:97 159 ANIEAAVNLIQGAEGVVTGLAMDTEFSTALAKVTNGEMGPKMYPELAWGANPDSINGLKSSVNTTVGAGADEAESKDLVI 238 (303) T ss_pred HHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCCeEEecCccCCCCCceecceeeEEecccCCccccCCCccEEE Confidence 788888876665544 57899999999999999999999999987644 4567999999999765432 334589 Q ss_pred EEeccccEEEEeecceEEEEeecc--------ccc---ceEEEEEEeccEEecccceEEEEecCc Q lcl|Aclame:pro 337 IGDFKRGVLFADRKDLGLRWADNE--------IYG---QYLQAVLRFGVSKVDDKAGYYVTFTPE 390 (394) Q Consensus 337 ~gd~~~~~~~~~~~~~~i~~~~~~--------~~~---~~~r~~~r~d~~v~~~~af~~l~~~~~ 390 (394) ||||+..|.++.|+++++++++.. +|. ..+|+++|+|++|++|+||++|+-... T Consensus 239 ~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 239 IGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKGEV 303 (303) T ss_pred EeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCCCC Confidence 999988888888999999987532 133 458999999999999999999999888 No 80 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=100.00 E-value=3.3e-48 Score=280.90 Aligned_cols=328 Identities=11% Similarity=-0.039 Sum_probs=219.9 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEV 80 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) |-.+..+++++++.++.+.++....+.+. .+.+... +..+.++... +...+........+. .... T Consensus 1 m~~kl~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~---~~~~~~~~~~-~~~~e~~~~~~~~~~-----~~~l 65 (381) T protein:vir:10 1 MTINLSETFANAKNEFINAVNNGEPQERQ------NELYGDM---INQLFEETKL-QAKAEAERVSSLPKS-----AQTL 65 (381) T ss_pred CchhHHHHHHHHHHHHHHHHHhhhHHHHH------HHHHHHH---HHhhhhhHHH-HHHHHHHHHHHhccc-----cccc Confidence 88888888887777766655432111110 0001000 1111111100 000000000000000 0000 Q ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhh Q lcl|Aclame:pro 81 TQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKP 160 (394) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~ 160 (394) .. +....... ....+...|++++|+++.+.|++.+.+.+|||+ T Consensus 66 ~~------------------------------~e~~~~~~-------~~~~t~~~Gg~lvP~~~~~~I~~~l~~~spir~ 108 (381) T protein:vir:10 66 SA------------------------------NQRNFFMD-------INKSVGYKEEKLLPEETIDRIFEDLTTNHPLLA 108 (381) T ss_pred CH------------------------------HHHHHHHH-------HhhcCCCCCceecCHHHHHHHHHHHHhhcceee Confidence 00 00000000 122355677899999999999999999999999 Q ss_pred eeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHH Q lcl|Aclame:pro 161 FTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQI 240 (394) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~ 240 (394) +|++++++ +...+|+.+ ..+.+.|..+.++.+++++++|+++++++|+++++++||++||+|+.+||++||.+.|+++ T Consensus 109 ~a~v~~~~-~~~~i~~~~-~~~~a~W~~e~~~~~~~~~~~f~~i~l~~~kl~a~i~is~elL~Ds~~~le~~i~~~la~~ 186 (381) T protein:vir:10 109 DLGIKNAG-LRLKFLKSE-TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEA 186 (381) T ss_pred eeeeEecC-cceEEEeec-CCcceEEeecccccccccCccceeEeecceeEEeeccccHHHHhccHHHHHHHHHHHHHHH Confidence 99999985 456677653 3344444455566666788999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhcccccccccccc-----------------------HHHHHHHHHhhhh--------------hh-cccE Q lcl|Aclame:pro 241 KVNTTNDAIAKVLKSFTTKTVKN-----------------------LDEIKALLNGGFD--------------PA-YNVS 282 (394) Q Consensus 241 ~~~~~~~a~~~g~~~~~~~~~~~-----------------------~~~i~~~~~~~~~--------------~~-~~a~ 282 (394) |+.+++.+|++|+|++.|.|..+ +.++..+...+.. .+ .++. T Consensus 187 ~a~~~~~afi~GdG~~qP~Gil~~~~~~~~~~~g~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 266 (381) T protein:vir:10 187 FAVALETAFLKGTGKDQPIGLNRQVQKGVSVTDGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVT 266 (381) T ss_pred HHHHhhceeEecccCCCceeeeecCCccccccccccccccccccccccchhhHHHHHHHHHHhhhhhhccccccccCceE Confidence 99999999999999988765421 1112211111100 11 2578 Q ss_pred EEEcHHHHHHHHhh---hccCCceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeec Q lcl|Aclame:pro 283 LIVSQSFYQTLDTL---KDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADN 359 (394) Q Consensus 283 ~vm~~~~~~~l~~l---kd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~ 359 (394) |+|||.++..|+.+ +|++|+|+|..+ +|+||++++ ..+++.++||||++ |++++|.+++|+.++| T Consensus 267 ~vmn~~t~~~l~~~~~~~~~~G~~v~~lp---------~g~~vv~~~--~~p~~~i~fGDfs~-Y~i~~r~~~~i~~~~~ 334 (381) T protein:vir:10 267 MVVNPSDAFEVQAQYTHLNANGVYVTALP---------FNLNVIEST--VQEAGKVLTYVKGL-YDGYLAGGINVQKFKE 334 (381) T ss_pred EEEchhhHHhhccccccCCCCCceeecCC---------CCceeEEcC--CCCcCcEEEEEccc-EEEEEecccEEEeech Confidence 99999999888754 488999998532 477787755 45677899999997 8899999999999999 Q ss_pred ccccc---eEEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 360 EIYGQ---YLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 360 ~~~~~---~~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) .+|.+ .||++.|+||++++|+||+.++++.-=+|. T Consensus 335 ~~~~~d~~~f~a~~r~dG~~~~~~A~~v~~l~~~~~~~ 372 (381) T protein:vir:10 335 TLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKP 372 (381) T ss_pred hhhhcCceEEEEEEEEcCEEecCCcEEEEEEeecCCcc Confidence 88764 689999999999999999997777444444 No 81 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=100.00 E-value=2.3e-49 Score=287.23 Aligned_cols=262 Identities=17% Similarity=0.179 Sum_probs=216.8 Q ss_pred hhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCccccccccccccccccc Q lcl|Aclame:pro 120 ETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKP 199 (394) Q Consensus 120 ~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~ 199 (394) ............+++.+++++|+++++.|++.+++.++|+++|+++++.++.+++|+.+ +...+.|+.|++..+ .+++ T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~-~~~~a~~v~E~~~~~-~~~~ 78 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLA-KGVGAYWVSETERIQ-TSKP 78 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEe-CCcceEEeecCcccc-cccc Confidence 11111122233556667889999999999999999999999999999999889999876 345566666666665 5789 Q ss_pred ccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc-------------------- Q lcl|Aclame:pro 200 DFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTK-------------------- 259 (394) Q Consensus 200 ~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~-------------------- 259 (394) +|++++++++++++++++|+|+++||.++|++||.+.|+++++.+++.++++|+|++.+. T Consensus 79 ~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (304) T protein:vir:10 79 EYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGNVVTD 158 (304) T ss_pred eeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999999998865432 Q ss_pred ccccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCceeecccccCCCcccccccceEEecCccc--ccCceE Q lcl|Aclame:pro 260 TVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVL--GANKAF 336 (394) Q Consensus 260 ~~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~--~~~~~~ 336 (394) +...++++.+++..+...++ +++|+||++++..|++++|++|+|+|.++ +++|+|+||+++++++. +.+.++ T Consensus 159 ~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~-----~~~l~G~PV~~~~~~~~~~~~~~~~ 233 (304) T protein:vir:10 159 TNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDAN-----GNEIMGLPLSYTGADVYDKKKSLAL 233 (304) T ss_pred ccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCC-----CccccceeeEEecccccCCCCcEEE Confidence 12247888888877666554 57899999999999999999999999764 36899999999887654 345688 Q ss_pred EEeccccEEEEeecceEEEEeeccc----------------cc---ceEEEEEEeccEEecccceEEEEecC Q lcl|Aclame:pro 337 IGDFKRGVLFADRKDLGLRWADNEI----------------YG---QYLQAVLRFGVSKVDDKAGYYVTFTP 389 (394) Q Consensus 337 ~gd~~~~~~~~~~~~~~i~~~~~~~----------------~~---~~~r~~~r~d~~v~~~~af~~l~~~~ 389 (394) ||||++ ++++++++++++.+++.. |+ ..+|+++|+|++|++|+||++|+.+- T Consensus 234 ~gd~~~-~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 234 MGDWDY-ARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred EEehhh-EEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 999997 567889999999887632 33 45899999999999999999999999 No 82 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=100.00 E-value=2.3e-49 Score=287.23 Aligned_cols=262 Identities=17% Similarity=0.179 Sum_probs=216.8 Q ss_pred hhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCccccccccccccccccc Q lcl|Aclame:pro 120 ETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKP 199 (394) Q Consensus 120 ~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~ 199 (394) ............+++.+++++|+++++.|++.+++.++|+++|+++++.++.+++|+.+ +...+.|+.|++..+ .+++ T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~-~~~~a~~v~E~~~~~-~~~~ 78 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFTYLA-KGVGAYWVSETERIQ-TSKP 78 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhccchhhhcceeeccCCceEEEEEe-CCcceEEeecCcccc-cccc Confidence 11111122233556667889999999999999999999999999999999889999876 345566666666665 5789 Q ss_pred ccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc-------------------- Q lcl|Aclame:pro 200 DFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTK-------------------- 259 (394) Q Consensus 200 ~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~-------------------- 259 (394) +|++++++++++++++++|+|+++||.++|++||.+.|+++++.+++.++++|+|++.+. T Consensus 79 ~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (304) T protein:vir:94 79 EYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEEKGNVVTD 158 (304) T ss_pred eeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCccccccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999999998865432 Q ss_pred ccccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCceeecccccCCCcccccccceEEecCccc--ccCceE Q lcl|Aclame:pro 260 TVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVL--GANKAF 336 (394) Q Consensus 260 ~~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~--~~~~~~ 336 (394) +...++++.+++..+...++ +++|+||++++..|++++|++|+|+|.++ +++|+|+||+++++++. +.+.++ T Consensus 159 ~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~-----~~~l~G~PV~~~~~~~~~~~~~~~~ 233 (304) T protein:vir:94 159 TNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDAN-----GNEIMGLPLSYTGADVYDKKKSLAL 233 (304) T ss_pred ccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCC-----CccccceeeEEecccccCCCCcEEE Confidence 12247888888877666554 57899999999999999999999999764 36899999999887654 345688 Q ss_pred EEeccccEEEEeecceEEEEeeccc----------------cc---ceEEEEEEeccEEecccceEEEEecC Q lcl|Aclame:pro 337 IGDFKRGVLFADRKDLGLRWADNEI----------------YG---QYLQAVLRFGVSKVDDKAGYYVTFTP 389 (394) Q Consensus 337 ~gd~~~~~~~~~~~~~~i~~~~~~~----------------~~---~~~r~~~r~d~~v~~~~af~~l~~~~ 389 (394) ||||++ ++++++++++++.+++.. |+ ..+|+++|+|++|++|+||++|+.+- T Consensus 234 ~gd~~~-~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 234 MGDWDY-ARYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred EEehhh-EEEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 999997 567889999999887632 33 45899999999999999999999999 No 83 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=100.00 E-value=7.1e-49 Score=284.55 Aligned_cols=286 Identities=14% Similarity=0.091 Sum_probs=221.1 Q ss_pred chhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhhe Q lcl|Aclame:pro 82 QEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPF 161 (394) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~ 161 (394) +++.+..+ ...+.......... .........++.++.++|+++.+.|++.+++.++|+++ T Consensus 1 ~~~~~~~~----------------~~~~~~~~~~~~~~----~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l 60 (324) T protein:vir:96 1 MEQTQKLK----------------LNLQHFASNNVKPQ----VFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQL 60 (324) T ss_pred CCcchhhh----------------HHHHHHHHHhhhhh----hhccccccccCcCccccchhHHHHHHHHHHhhchhhhh Confidence 11100000 00000000000000 00011223455677899999999999999999999999 Q ss_pred eeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 162 TTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIK 241 (394) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~ 241 (394) ++++++.++.+++|+.+. .+.+.|+.|++..+ .++++|+++++++++++++++||+|+++|+.++++++|.+.|++++ T Consensus 61 ~~~~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~-~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai 138 (324) T protein:vir:96 61 GKYEPMEGTEKKFTFWAD-KPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAF 138 (324) T ss_pred cceeeccCCceEEEEEec-CcceeEecCCcccc-ccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHH Confidence 999999988899998764 45566666666665 5789999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhhccccccc---------------cccccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCceee Q lcl|Aclame:pro 242 VNTTNDAIAKVLKSFTT---------------KTVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLL 305 (394) Q Consensus 242 ~~~~~~a~~~g~~~~~~---------------~~~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l~ 305 (394) +++++.++++|+|++.. .+..+++++.+++..+...++ .++|+||+++|..|++++|++|+|++ T Consensus 139 ~~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~~~~ 218 (324) T protein:vir:96 139 YKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERI 218 (324) T ss_pred HHHHHHHHhccCCCCCcCccccccccccceeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCCeee Confidence 99999999998876432 122358889998877666655 46899999999999999999999998 Q ss_pred cccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeeccc--------------c---cceEEE Q lcl|Aclame:pro 306 QDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEI--------------Y---GQYLQA 368 (394) Q Consensus 306 ~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--------------~---~~~~r~ 368 (394) .+ +.+++|+|+||+++++...+.+.+++|||++ ++++++++++++.+++.+ | ...+|+ T Consensus 219 ~~----~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~-~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~ 293 (324) T protein:vir:96 219 YD----RNSDSLDGLPVVNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRA 293 (324) T ss_pred cC----CCCCcccceeeEeeCCCCCCcceEEEEecce-EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEE Confidence 53 3457999999999887777888999999997 567889999999987643 3 345899 Q ss_pred EEEeccEEecccceEEEEecC---ccCCC Q lcl|Aclame:pro 369 VLRFGVSKVDDKAGYYVTFTP---EPLPL 394 (394) Q Consensus 369 ~~r~d~~v~~~~af~~l~~~~---~~~~~ 394 (394) ++|+|+.|.+|+||++|+... ++||- T Consensus 294 ~~r~d~~v~~~~A~~~l~~a~~~~~~~~~ 322 (324) T protein:vir:96 294 TMHVALHIADDKAFAKLVPADKRTDSVPG 322 (324) T ss_pred EEEEccEEecccceEEEecccccCCCCCC Confidence 999999999999999999744 35677 No 84 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=100.00 E-value=7.1e-49 Score=284.55 Aligned_cols=286 Identities=14% Similarity=0.091 Sum_probs=221.1 Q ss_pred chhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhhe Q lcl|Aclame:pro 82 QEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPF 161 (394) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~ 161 (394) +++.+..+ ...+.......... .........++.++.++|+++.+.|++.+++.++|+++ T Consensus 1 ~~~~~~~~----------------~~~~~~~~~~~~~~----~~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l 60 (324) T protein:vir:78 1 MEQTQKLK----------------LNLQHFASNNVKPQ----VFNPDNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQL 60 (324) T ss_pred CCcchhhh----------------HHHHHHHHHhhhhh----hhccccccccCcCccccchhHHHHHHHHHHhhchhhhh Confidence 11100000 00000000000000 00011223455677899999999999999999999999 Q ss_pred eeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 162 TTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIK 241 (394) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~ 241 (394) ++++++.++.+++|+.+. .+.+.|+.|++..+ .++++|+++++++++++++++||+|+++|+.++++++|.+.|++++ T Consensus 61 ~~~~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~-~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai 138 (324) T protein:vir:78 61 GKYEPMEGTEKKFTFWAD-KPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAF 138 (324) T ss_pred cceeeccCCceEEEEEec-CcceeEecCCcccc-ccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHH Confidence 999999988899998764 45566666666665 5789999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhhccccccc---------------cccccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCceee Q lcl|Aclame:pro 242 VNTTNDAIAKVLKSFTT---------------KTVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLL 305 (394) Q Consensus 242 ~~~~~~a~~~g~~~~~~---------------~~~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l~ 305 (394) +++++.++++|+|++.. .+..+++++.+++..+...++ .++|+||+++|..|++++|++|+|++ T Consensus 139 ~~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~~~~ 218 (324) T protein:vir:78 139 YKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERI 218 (324) T ss_pred HHHHHHHHhccCCCCCcCccccccccccceeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCCeee Confidence 99999999998876432 122358889998877666655 46899999999999999999999998 Q ss_pred cccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeeccc--------------c---cceEEE Q lcl|Aclame:pro 306 QDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEI--------------Y---GQYLQA 368 (394) Q Consensus 306 ~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--------------~---~~~~r~ 368 (394) .+ +.+++|+|+||+++++...+.+.+++|||++ ++++++++++++.+++.+ | ...+|+ T Consensus 219 ~~----~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~-~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~ 293 (324) T protein:vir:78 219 YD----RNSDSLDGLPVVNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRA 293 (324) T ss_pred cC----CCCCcccceeeEeeCCCCCCcceEEEEecce-EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEE Confidence 53 3457999999999887777888999999997 567889999999987643 3 345899 Q ss_pred EEEeccEEecccceEEEEecC---ccCCC Q lcl|Aclame:pro 369 VLRFGVSKVDDKAGYYVTFTP---EPLPL 394 (394) Q Consensus 369 ~~r~d~~v~~~~af~~l~~~~---~~~~~ 394 (394) ++|+|+.|.+|+||++|+... ++||- T Consensus 294 ~~r~d~~v~~~~A~~~l~~a~~~~~~~~~ 322 (324) T protein:vir:78 294 TMHVALHIADDKAFAKLVPADKRTDSVPG 322 (324) T ss_pred EEEEccEEecccceEEEecccccCCCCCC Confidence 999999999999999999744 35677 No 85 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=100.00 E-value=8.7e-49 Score=284.07 Aligned_cols=286 Identities=14% Similarity=0.084 Sum_probs=220.3 Q ss_pred chhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhhe Q lcl|Aclame:pro 82 QEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPF 161 (394) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~ 161 (394) .++.... ....+............. .........++.++|+++.+.|++.+++.++|+++ T Consensus 1 ~~~~~~~----------------~~~~~~f~~~~~~~~~~~----a~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l 60 (324) T protein:vir:93 1 MEQTQKL----------------KLNLQHFASNNVKPQVFN----PDNVMMHEKKDGTLLNDFTTPILQEVMENSKIMQL 60 (324) T ss_pred CchhHHH----------------HHHHHHHHHhhhhhhhcc----cccccccCCCcceechhHHHHHHHHHHhhchhhhh Confidence 1110000 000001111111111110 11123334456699999999999999999999999 Q ss_pred eeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 162 TTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIK 241 (394) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~ 241 (394) |++++++++..++|+.+. ...+.|++|++..+ .++++|+++++++++++++++||+|+++||.++++++|.+.|++++ T Consensus 61 ~~~~~~~~~~~~ip~~~~-~~~a~~v~Eg~~~~-~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai 138 (324) T protein:vir:93 61 GKYEPMEGTEKKFTFWAD-KPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAF 138 (324) T ss_pred cceeeccCCceEEEEEec-CcceeeecCCcccc-ccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHH Confidence 999999998899998764 45566666666665 5789999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhhccccccc---------------cccccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCceee Q lcl|Aclame:pro 242 VNTTNDAIAKVLKSFTT---------------KTVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLL 305 (394) Q Consensus 242 ~~~~~~a~~~g~~~~~~---------------~~~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l~ 305 (394) +++++.+++.|+|++.. .+..+++++.+++..+...++ .++|+||+++|..|++++|++|+|++ T Consensus 139 a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~G~~~~ 218 (324) T protein:vir:93 139 YKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERI 218 (324) T ss_pred HHHHHHHHhcCCCCCCcCccccccccccceeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCCCeee Confidence 99999999988775422 123458899998877666654 57999999999999999999999998 Q ss_pred cccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeeccc--------------c---cceEEE Q lcl|Aclame:pro 306 QDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEI--------------Y---GQYLQA 368 (394) Q Consensus 306 ~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--------------~---~~~~r~ 368 (394) .+ +.+++|+|+||+++++...+.+.+++|||++ ++++++++++|+.+++.+ | ...+|+ T Consensus 219 ~~----~~~~~l~G~PVv~~~~~~~~~~~i~~gdfs~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~ 293 (324) T protein:vir:93 219 YD----RNSDSLDGLPVVNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRA 293 (324) T ss_pred cC----CCCCcccceeeEeecCCCCCcceEEEEecce-EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEE Confidence 54 3467899999999877777888899999997 567889999999987743 3 346899 Q ss_pred EEEeccEEecccceEEEEecCc---cCCC Q lcl|Aclame:pro 369 VLRFGVSKVDDKAGYYVTFTPE---PLPL 394 (394) Q Consensus 369 ~~r~d~~v~~~~af~~l~~~~~---~~~~ 394 (394) ++|+|+++.+|+||++|+...+ +||- T Consensus 294 ~~r~d~~v~~~~a~~~l~~a~~~~~~~~~ 322 (324) T protein:vir:93 294 TMHVALHIADDKAFAKLVPADKRTDSVPG 322 (324) T ss_pred EEEeccEEecccceEEEecccccCCCCCC Confidence 9999999999999999986544 4676 No 86 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=100.00 E-value=2.7e-48 Score=281.38 Aligned_cols=327 Identities=11% Similarity=-0.039 Sum_probs=219.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHhhcccccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLK-LYESSVEVGGAENIGGKE 79 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~-~~~~~~~~~~~~~~~~~~ 79 (394) |..+..+++++++.++.+.++..... ++..+.. .+-++.+.++.. .+.+ +..... ..... ... T Consensus 1 m~ik~~~~~~~~~~e~~~~~~~~~~~------~~~~~~~---~~~~~~~~~~~~---~~~~~e~~~~~---~~~~~-~~~ 64 (381) T protein:vir:95 1 MTINLSETFANAKNEFINAVNNGEPQ------ERQNELY---GDMINQLFEETK---LQAKAEAERVS---SLPKS-AQS 64 (381) T ss_pred CchhhHHHHHHHHHHHHHHHhhhhhh------HHHHHHH---HHHHHhhhhhHH---HHHHHHHHHHH---HhccC-ccc Confidence 88888888887776665555431110 0000000 011111111111 0000 000000 00000 000 Q ss_pred ccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhh Q lcl|Aclame:pro 80 VTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLK 159 (394) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~ 159 (394) ... .+ ..... .....+++.|+++||+++.+.|++.+.+.++|+ T Consensus 65 lt~---~e---------------------------~~~~~-------~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~ 107 (381) T protein:vir:95 65 LSA---NQ---------------------------RSFFM-------DINKNVNYKEEKLLPEETIDRIFEDLTTNHPLL 107 (381) T ss_pred ccH---HH---------------------------HHHHH-------HHhcccCCCCceecCHHHHHHHHHHHHhhccce Confidence 000 00 00000 012235567789999999999999999999999 Q ss_pred heeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHH Q lcl|Aclame:pro 160 PFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQ 239 (394) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~ 239 (394) ++|++.++++ ...+|+.. +.+.+.|+.+.++.+.+++++|+++++++|+++++++||++||+|+.++|++||.+.|++ T Consensus 108 ~~~~v~~~~~-~~~i~~~~-~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~ 185 (381) T protein:vir:95 108 ADLGIKNAGL-RLKFLKSE-TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEE 185 (381) T ss_pred eheeeEecCc-ceEEEEec-CCcceeeecccccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHH Confidence 9999999864 56777653 345555556656666677899999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhcccccccccccc------------------------------HHHHHHHHHhhhh-------hh-ccc Q lcl|Aclame:pro 240 IKVNTTNDAIAKVLKSFTTKTVKN------------------------------LDEIKALLNGGFD-------PA-YNV 281 (394) Q Consensus 240 ~~~~~~~~a~~~g~~~~~~~~~~~------------------------------~~~i~~~~~~~~~-------~~-~~a 281 (394) +++.+++.+|++|+|++.|.|..+ ++.+.+++..+.. .+ .++ T Consensus 186 ~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a 265 (381) T protein:vir:95 186 AFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNV 265 (381) T ss_pred HHHHHhhheeEeccCCCCceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCce Confidence 999999999999999987755421 1112222222211 11 257 Q ss_pred EEEEcHHHHHHHHhhh---ccCCceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEee Q lcl|Aclame:pro 282 SLIVSQSFYQTLDTLK---DGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWAD 358 (394) Q Consensus 282 ~~vm~~~~~~~l~~lk---d~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~ 358 (394) +|+|||.++..|+.++ +++|+|+|..+ +|.||++++ ..+++.++||||++ |++++|.+++++.++ T Consensus 266 ~~~mn~~t~~~l~~~~~~~~~~G~~v~~l~---------~g~~vv~s~--~~p~~~iifgDfs~-Y~i~~r~~~~i~~~~ 333 (381) T protein:vir:95 266 TMVVNPSDAFEVQAQYTHLNANGVYVTALP---------FNLNVIEST--VQEAGKVLTYVKGL-YDGYLAGGINVQKFK 333 (381) T ss_pred EEEEccccHHhhccccccCCCCCceeecCC---------CCceEEecC--CCCcCcEEEEeccc-EEEEEecccEEEeec Confidence 8999999999887655 67899887421 355565544 55678899999997 889999999999999 Q ss_pred cccccc---eEEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 359 NEIYGQ---YLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 359 ~~~~~~---~~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) +.+|.+ .||+++|+||++++++||++++++..-+|. T Consensus 334 ~~~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~~ 372 (381) T protein:vir:95 334 ETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKP 372 (381) T ss_pred hhHhhcCCeEEEEEEEEcCEEecCceEEEEEEEecCCCc Confidence 988754 689999999999999999998887644333 No 87 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=100.00 E-value=2.7e-48 Score=281.38 Aligned_cols=327 Identities=11% Similarity=-0.039 Sum_probs=219.8 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHhhcccccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLK-LYESSVEVGGAENIGGKE 79 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~-~~~~~~~~~~~~~~~~~~ 79 (394) |..+..+++++++.++.+.++..... ++..+.. .+-++.+.++.. .+.+ +..... ..... ... T Consensus 1 m~ik~~~~~~~~~~e~~~~~~~~~~~------~~~~~~~---~~~~~~~~~~~~---~~~~~e~~~~~---~~~~~-~~~ 64 (381) T protein:vir:10 1 MTINLSETFANAKNEFINAVNNGEPQ------ERQNELY---GDMINQLFEETK---LQAKAEAERVS---SLPKS-AQS 64 (381) T ss_pred CchhhHHHHHHHHHHHHHHHhhhhhh------HHHHHHH---HHHHHhhhhhHH---HHHHHHHHHHH---HhccC-ccc Confidence 88888888887776665555431110 0000000 011111111111 0000 000000 00000 000 Q ss_pred ccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhh Q lcl|Aclame:pro 80 VTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLK 159 (394) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~ 159 (394) ... .+ ..... .....+++.|+++||+++.+.|++.+.+.++|+ T Consensus 65 lt~---~e---------------------------~~~~~-------~~~~~~~~~gg~lvP~~~~~~I~~~l~~~s~i~ 107 (381) T protein:vir:10 65 LSA---NQ---------------------------RSFFM-------DINKNVNYKEEKLLPEETIDRIFEDLTTNHPLL 107 (381) T ss_pred ccH---HH---------------------------HHHHH-------HHhcccCCCCceecCHHHHHHHHHHHHhhccce Confidence 000 00 00000 012235567789999999999999999999999 Q ss_pred heeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHH Q lcl|Aclame:pro 160 PFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQ 239 (394) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~ 239 (394) ++|++.++++ ...+|+.. +.+.+.|+.+.++.+.+++++|+++++++|+++++++||++||+|+.++|++||.+.|++ T Consensus 108 ~~~~v~~~~~-~~~i~~~~-~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~ 185 (381) T protein:vir:10 108 ADLGIKNAGL-RLKFLKSE-TSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEE 185 (381) T ss_pred eheeeEecCc-ceEEEEec-CCcceeeecccccccccccccceeeeecceeEEeechhhHHHhhcCHHHHHHHHHHHHHH Confidence 9999999864 56777653 345555556656666677899999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhcccccccccccc------------------------------HHHHHHHHHhhhh-------hh-ccc Q lcl|Aclame:pro 240 IKVNTTNDAIAKVLKSFTTKTVKN------------------------------LDEIKALLNGGFD-------PA-YNV 281 (394) Q Consensus 240 ~~~~~~~~a~~~g~~~~~~~~~~~------------------------------~~~i~~~~~~~~~-------~~-~~a 281 (394) +++.+++.+|++|+|++.|.|..+ ++.+.+++..+.. .+ .++ T Consensus 186 ~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a 265 (381) T protein:vir:10 186 AFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNV 265 (381) T ss_pred HHHHHhhheeEeccCCCCceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCce Confidence 999999999999999987755421 1112222222211 11 257 Q ss_pred EEEEcHHHHHHHHhhh---ccCCceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEee Q lcl|Aclame:pro 282 SLIVSQSFYQTLDTLK---DGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWAD 358 (394) Q Consensus 282 ~~vm~~~~~~~l~~lk---d~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~ 358 (394) +|+|||.++..|+.++ +++|+|+|..+ +|.||++++ ..+++.++||||++ |++++|.+++++.++ T Consensus 266 ~~~mn~~t~~~l~~~~~~~~~~G~~v~~l~---------~g~~vv~s~--~~p~~~iifgDfs~-Y~i~~r~~~~i~~~~ 333 (381) T protein:vir:10 266 TMVVNPSDAFEVQAQYTHLNANGVYVTALP---------FNLNVIEST--VQEAGKVLTYVKGL-YDGYLAGGINVQKFK 333 (381) T ss_pred EEEEccccHHhhccccccCCCCCceeecCC---------CCceEEecC--CCCcCcEEEEeccc-EEEEEecccEEEeec Confidence 8999999999887655 67899887421 355565544 55678899999997 889999999999999 Q ss_pred cccccc---eEEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 359 NEIYGQ---YLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 359 ~~~~~~---~~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) +.+|.+ .||+++|+||++++++||++++++..-+|. T Consensus 334 ~~~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~~~~~ 372 (381) T protein:vir:10 334 ETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLDLKGHKP 372 (381) T ss_pred hhHhhcCCeEEEEEEEEcCEEecCceEEEEEEEecCCCc Confidence 988754 689999999999999999998887644333 No 88 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=100.00 E-value=3.8e-49 Score=286.04 Aligned_cols=256 Identities=15% Similarity=0.102 Sum_probs=211.7 Q ss_pred cccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhh Q lcl|Aclame:pro 132 KKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTY 211 (394) Q Consensus 132 ~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~ 211 (394) .+..++.++|+++...|++.+++.++|+++|++++++++..++|+... ...+.|+.|++..+ +++++|++++++++++ T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~-~~~~~f~~v~l~~~k~ 78 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVFTFTM-DSEIDVVAESGKKT-HGGVTLAPQTMVPIKV 78 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEec-CcceEEeeCCcccc-ccccceeEEEEeeeEE Confidence 334567899999999999999999999999999999999999998754 44566667666666 5889999999999999 Q ss_pred hhhhhhhHHHHh---ccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc----ccc-------------------ccccHH Q lcl|Aclame:pro 212 RGAIPLSQESID---DADVDLVGIVSESISQIKVNTTNDAIAKVLKSF----TTK-------------------TVKNLD 265 (394) Q Consensus 212 ~~~~~vs~ell~---ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~----~~~-------------------~~~~~~ 265 (394) ++++++|+|+++ |+..+|+++|+++|+++++++++.++++|.+.+ ... +...++ T Consensus 79 ~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (298) T protein:vir:94 79 EYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKVEAPRGIADPNG 158 (298) T ss_pred EEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccccccccccccccccccccHHH Confidence 999999999996 456789999999999999999999999884321 100 001256 Q ss_pred HHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCceeecccccCCCcccccccceEEecCccc----ccCceEEEec Q lcl|Aclame:pro 266 EIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVL----GANKAFIGDF 340 (394) Q Consensus 266 ~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~----~~~~~~~gd~ 340 (394) ++.+++..+....+ +++|+||++++..|++|+|++|+|+|++.+.++.+++|+|+||++++..+. +...+++||| T Consensus 159 ~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~PV~~~~~v~~~~~~~~~~~~~Gdf 238 (298) T protein:vir:94 159 AIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTVSDMSLTQRDRAIIGDF 238 (298) T ss_pred HHHHHHHhhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceecceeeEEecccccccCCCccEEEEeec Confidence 67777776665544 578999999999999999999999999988889999999999998775432 3346889999 Q ss_pred cccEEEEeecceEEEEeecc--------ccc---ceEEEEEEeccEEecccceEEEEecC Q lcl|Aclame:pro 341 KRGVLFADRKDLGLRWADNE--------IYG---QYLQAVLRFGVSKVDDKAGYYVTFTP 389 (394) Q Consensus 341 ~~~~~~~~~~~~~i~~~~~~--------~~~---~~~r~~~r~d~~v~~~~af~~l~~~~ 389 (394) ++++.++.+++++++++++. +|+ ..+|+++|+|+++.+|+||++|+..+ T Consensus 239 s~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 239 ANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred cceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEecC Confidence 99887888999999886532 243 35899999999999999999999999 No 89 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=100.00 E-value=3.1e-49 Score=286.52 Aligned_cols=263 Identities=13% Similarity=-0.008 Sum_probs=209.3 Q ss_pred hhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeec Q lcl|Aclame:pro 128 KDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWN 207 (394) Q Consensus 128 ~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~ 207 (394) +...+++.|++++|++++..|++.+++.++++++|+++++.++..++|+... ...+.|+.|++..+ .++++|++++++ T Consensus 1 Ma~~~~~~gg~~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~~~~ip~~~~-~~~a~wv~Eg~~~~-~s~~~f~~v~l~ 78 (315) T protein:vir:80 1 MADDFLSAGKLELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPVKGAVFSG-VPRAKIVGEGEVKP-SASVDVSAFTAQ 78 (315) T ss_pred CCCCcCCcCceEcchHHHHHHHHHHHhhchhhhhcceeecCCCceEEEEEeC-CcceEEeeCCcccc-ccccceeeeEee Confidence 3344566788999999999999999999999999999999998899998753 44556666666665 588999999999 Q ss_pred HhhhhhhhhhhHHHHhccHHH----HHHHHHHHHHHHHHHHHHHHHhhccccccc------------------cccccHH Q lcl|Aclame:pro 208 IDTYRGAIPLSQESIDDADVD----LVGIVSESISQIKVNTTNDAIAKVLKSFTT------------------KTVKNLD 265 (394) Q Consensus 208 ~~~~~~~~~vs~ell~ds~~~----l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~------------------~~~~~~~ 265 (394) +|+++++++||+||++|+..+ |+++|.+.|++++++.++.++++|++.++. .+...++ T Consensus 79 ~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 158 (315) T protein:vir:80 79 PIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTKNIVDATDSATA 158 (315) T ss_pred eeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccccccccccccccceeeccccchH Confidence 999999999999999988765 789999999999999999999998653221 1112367 Q ss_pred HHHHHHHhhhhhhc--ccEEEEcHHHHHHHHhhhccCC-----ceeecccccCCCcccccccceEEecCcccc------- Q lcl|Aclame:pro 266 EIKALLNGGFDPAY--NVSLIVSQSFYQTLDTLKDGNG-----RYLLQDDITAVSGKVLLGKPVFVLSDEVLG------- 331 (394) Q Consensus 266 ~i~~~~~~~~~~~~--~a~~vm~~~~~~~l~~lkd~~G-----~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~------- 331 (394) ++.+++..+....+ +++|+|||.++..|++|+|.+| +|+| +++..+.+++|+|+||+++++++.+ T Consensus 159 d~~~~~~~~~~~~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~-~~~~~g~~~tl~G~PV~~~~~~~~~~~~~~~~ 237 (315) T protein:vir:80 159 DLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMY-PAAGFAGLDNWRGLNVGASSTVSGAPEMSPAS 237 (315) T ss_pred HHHHHHHHHhhccCccceEEEEcHHHHHHHHHHhhccCCcccccccc-cccccCCCceecceeeEecCcCCccccccccc Confidence 77777766544322 4679999999999999987655 4555 4566667789999999998765432 Q ss_pred cCceEEEeccccEEEEeecceEEEEeeccc--------c---cceEEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 332 ANKAFIGDFKRGVLFADRKDLGLRWADNEI--------Y---GQYLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 332 ~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--------~---~~~~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) ...++||||++ +.+..+.+++++.+++.. | ...+|+++|+|++|++|+||++|+.+++|.|- T Consensus 238 ~~~~~~GDfs~-~~~g~~~~~~i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a~~~~ 310 (315) T protein:vir:80 238 GVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAPKPN 310 (315) T ss_pred ccEEEEeeccc-EEEEEecCeeEEEeccccccCcccchhhcCcEEEEEEEEecceeecccceEEEeeccCCCCC Confidence 23578899997 556678999998876532 3 34589999999999999999999988877555 No 90 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=100.00 E-value=5.2e-49 Score=285.31 Aligned_cols=277 Identities=13% Similarity=0.078 Sum_probs=220.5 Q ss_pred HHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCcccccccccccc Q lcl|Aclame:pro 115 LMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNP 194 (394) Q Consensus 115 ~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~ 194 (394) .+...............++..++.++|+++.+.|++.+++.++|+++|+++++.++.+.+|+.+. ...+.|+.|++..+ T Consensus 1 ~~~~~~~~~e~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip~~~~-~~~a~~v~Eg~~~~ 79 (318) T protein:vir:24 1 MAAGTAFAVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWVG-DVSAQWIGEGDMKP 79 (318) T ss_pred CCCCCCCCHHHHHhhcccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeC-CcceEEecCCcccc Confidence 11112222222223334455667789999999999999999999999999999999999998764 45566666666655 Q ss_pred cccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc------------ Q lcl|Aclame:pro 195 ALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTKTVK------------ 262 (394) Q Consensus 195 ~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~~~------------ 262 (394) .++++|+++++++|++++++++|+|+++||.++++++|.+.|+++++.+++.++++|+|++.+.+.. T Consensus 80 -~~~~~f~~i~~~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~ 158 (318) T protein:vir:24 80 -ITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTYIGQTTKAISIADTT 158 (318) T ss_pred -ccccceeEEEEeeEEEEEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCcccccccccccccccc Confidence 5789999999999999999999999999999999999999999999999999999998876542211 Q ss_pred ----cH-HHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCceeecccccCCCc-----ccccccceEEecCcccc Q lcl|Aclame:pro 263 ----NL-DEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSG-----KVLLGKPVFVLSDEVLG 331 (394) Q Consensus 263 ----~~-~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~-----~~l~G~pV~~~~~~~~~ 331 (394) .+ +.+.+++......++ +++|+|||+++..|+++||++|+|||++++..+.+ .+++|+||+++++.+.+ T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~~~ 238 (318) T protein:vir:24 159 GATTVYDQVAVNGLSLLVNDGKKWTHTLLDDITEPILNGAKDQNGRPLFIESTYGEAASPFRSGRIVARPTILSDHVVEG 238 (318) T ss_pred cccchHHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhccCCceeecCccccCccccccCceEEEEeeEEeCCCCCC Confidence 11 233444444333333 58999999999999999999999999987766544 47899999988877666 Q ss_pred cCceEEEeccccEEEEeecceEEEEeeccc--------------c---cceEEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 332 ANKAFIGDFKRGVLFADRKDLGLRWADNEI--------------Y---GQYLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 332 ~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--------------~---~~~~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) ...+++|||++ +++++++++.|+.+++.+ | ...+|+++|+|++|.+|+||++|+..++..-- T Consensus 239 ~~~~~~gdfs~-~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~a~~~~ 317 (318) T protein:vir:24 239 TTVGFMGDFSQ-LIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVALTNVVSGGGE 317 (318) T ss_pred ccEEEEeecce-EEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeeccCCCC Confidence 77789999997 567889999999887644 3 34589999999999999999999999888777 No 91 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=100.00 E-value=9.8e-49 Score=283.77 Aligned_cols=275 Identities=13% Similarity=0.055 Sum_probs=213.8 Q ss_pred HHHHHhhhhhh--hhhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCcccccccc-- Q lcl|Aclame:pro 115 LMPINETTPVE--PQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAEL-- 190 (394) Q Consensus 115 ~~~~~~~~~~~--~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~-- 190 (394) ...+++..... ....+...+.++.++|+++.+.|++.+++.++|+++|++++++++...+|+... ...+.|+.|+ T Consensus 1 ~a~l~el~~~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~~~~~~p~~~~-~~~a~~v~eg~~ 79 (333) T protein:vir:78 1 MATLNELLPNSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISYGETIIPTTVK-RPEVGQVGVGTS 79 (333) T ss_pred CchhHHhhhhcccccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeC-CceeEeecCccc Confidence 11122221111 111223334445699999999999999999999999999999999999998764 3333443332 Q ss_pred -----cccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc----- Q lcl|Aclame:pro 191 -----EKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTKT----- 260 (394) Q Consensus 191 -----~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~----- 260 (394) ++..+.++++|+++++++|++++++++|+|+++|+.+++++||++.|+++++++++.++++|+|++++.+ T Consensus 80 ~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~g~~ 159 (333) T protein:vir:78 80 NEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQGID 159 (333) T ss_pred ccccccccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCccccccc Confidence 3334468899999999999999999999999999999999999999999999999999999998654321 Q ss_pred ------------------cccHHHHHHHHHhhhhhh-cc-cEEEEcHHHHHHHH---hhhccCCceeecccccCCCcccc Q lcl|Aclame:pro 261 ------------------VKNLDEIKALLNGGFDPA-YN-VSLIVSQSFYQTLD---TLKDGNGRYLLQDDITAVSGKVL 317 (394) Q Consensus 261 ------------------~~~~~~i~~~~~~~~~~~-~~-a~~vm~~~~~~~l~---~lkd~~G~~l~~~~~~~~~~~~l 317 (394) ..+++++.+++....... ++ ++|+|||.++..|+ +++|++|+|+|.+.+..+.+++| T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~~~~~~~~~~~~l 239 (333) T protein:vir:78 160 TDNVIANTTNVDYLQETGDPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRDANGNVDPSRINLAAQTGDV 239 (333) T ss_pred ccccccccccccccccccchhHHHHHHHHHhhccccccCceEEEEcchHHHHHHHHhhhcCCCCceeecCccccCCCcee Confidence 123667777776554443 33 57999999988765 47899999999998888889999 Q ss_pred cccceEEecCcccc-------cCceEEEeccccEEEEeecceEEEEeeccc-----------c---cceEEEEEEeccEE Q lcl|Aclame:pro 318 LGKPVFVLSDEVLG-------ANKAFIGDFKRGVLFADRKDLGLRWADNEI-----------Y---GQYLQAVLRFGVSK 376 (394) Q Consensus 318 ~G~pV~~~~~~~~~-------~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~-----------~---~~~~r~~~r~d~~v 376 (394) +|+||++++..+.+ ...+++|||++ |+++++++++|+.+++.. | ...+|+++|+|++| T Consensus 240 ~G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~-~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v 318 (333) T protein:vir:78 240 LGLPAQFGRAVGGDLGAAVDSKTRIIGGDFSQ-LKFGFADEIRIKMSDTATLTDSGSATVSMWQTNQIAILIEVTFGWLL 318 (333) T ss_pred eceeeEEccccCCCccccCCCccEEEEEeccc-EEEEEeeccEEEEeccccccccccceeehhhcCcEEEEEEEEEccEE Confidence 99999987654322 34689999998 567789999999877632 2 34579999999999 Q ss_pred ecccceEEEEecCcc Q lcl|Aclame:pro 377 VDDKAGYYVTFTPEP 391 (394) Q Consensus 377 ~~~~af~~l~~~~~~ 391 (394) ++|+||++|+..++| T Consensus 319 ~~~~a~~~l~~~~a~ 333 (333) T protein:vir:78 319 GDKQAFVKFVDDEQP 333 (333) T ss_pred ecccceEEEeccCCC Confidence 999999999999998 No 92 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=100.00 E-value=8.9e-49 Score=284.00 Aligned_cols=276 Identities=13% Similarity=0.102 Sum_probs=212.8 Q ss_pred HHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCcccccccccccc Q lcl|Aclame:pro 115 LMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNP 194 (394) Q Consensus 115 ~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~ 194 (394) .................++..++.++|+++++.|++.+++.++|+++|+++++.+++.++|+.. ....+.|+.|++..+ T Consensus 1 ~~~~~~~~~~~~~~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~~~-~~~~a~~v~E~~~~~ 79 (320) T protein:vir:10 1 MAAGTAFQVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPHWI-GDVSAQWIGEGDMKP 79 (320) T ss_pred CCCCccCCHHHHHhhccccccccccccHHHHHHHHHHHHhccchhhhcceeeccCCceEEEEEe-CCcceEEecCCcccc Confidence 0000000111111222334445568999999999999999999999999999998899999876 345566777766666 Q ss_pred cccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc------------ Q lcl|Aclame:pro 195 ALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTKTVK------------ 262 (394) Q Consensus 195 ~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~~~------------ 262 (394) .++++|+++++++++++++++||+|+++|+.++++++|.+.|++++++.++.++++|+|++.+.+.. T Consensus 80 -~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~ 158 (320) T protein:vir:10 80 -ITKGNMTSQNIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTKSVSLADPG 158 (320) T ss_pred -ccccceeEEEEeeEEEEEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCcccccccccccceecc Confidence 5789999999999999999999999999999999999999999999999999999998875442211 Q ss_pred --cH------H-HHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCceeecccccCCC-----cccccccceEEecC Q lcl|Aclame:pro 263 --NL------D-EIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVS-----GKVLLGKPVFVLSD 327 (394) Q Consensus 263 --~~------~-~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~-----~~~l~G~pV~~~~~ 327 (394) ++ + .+.+++..+...++ +++|+|||+++..|++|||++|+|+|.+....+. .++++|+||++++. T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~ 238 (320) T protein:vir:10 159 GATASDLTAYDAVAVNGLSLLVNAKKKWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSPFRAGRIVSRPTILSDH 238 (320) T ss_pred cccccccccHHHHHHHHHhhhhcccCCCcEEEEcHHHHHHHHHhhccCCceeeccccccCccccccCceeeeeeeEecCC Confidence 11 1 23333333333333 6899999999999999999999999987655443 35799999999876 Q ss_pred cccccCceEEEeccccEEEEeecceEEEEeeccc--------------c---cceEEEEEEeccEEecccceEEEEecCc Q lcl|Aclame:pro 328 EVLGANKAFIGDFKRGVLFADRKDLGLRWADNEI--------------Y---GQYLQAVLRFGVSKVDDKAGYYVTFTPE 390 (394) Q Consensus 328 ~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--------------~---~~~~r~~~r~d~~v~~~~af~~l~~~~~ 390 (394) ++.+...++||||++ ++++.+++++++.+++.+ | ...+|+++|+|++|.+|+||++|+..++ T Consensus 239 ~~~~~~~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~~~~a 317 (320) T protein:vir:10 239 VADGTTVGYMGDFRN-VIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDAFVKLTNVVT 317 (320) T ss_pred CCCCceEEEEeecce-EEEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeeccEEecccceEEEEeccC Confidence 665655678999997 557889999999887754 2 2458999999999999999999997776 Q ss_pred cCC Q lcl|Aclame:pro 391 PLP 393 (394) Q Consensus 391 ~~~ 393 (394) |-. T Consensus 318 p~~ 320 (320) T protein:vir:10 318 PDA 320 (320) T ss_pred CCC Confidence 544 No 93 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=100.00 E-value=2.3e-48 Score=281.79 Aligned_cols=319 Identities=13% Similarity=0.104 Sum_probs=213.1 Q ss_pred HHHHHHHHhhccccccccccccch-hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCcc Q lcl|Aclame:pro 60 LKLYESSVEVGGAENIGGKEVTQE-EKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKP 138 (394) Q Consensus 60 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 138 (394) +.+..+........ ........+ .......+..+.+......... ... .................+.+.+.|++ T Consensus 1 ~a~~~a~~~~~~~~-~~~~~~~~~~~~~kg~~~~~~~~a~a~~~g~~--~~a--~~~a~~~~~~~~~~~a~~~~~~~Gg~ 75 (366) T protein:vir:57 1 MAAAVAVPVKAHSV-APGIIIKEELQQYKGAGMTRMVMSIAAGKGNL--ADA--AKFAATELGDTGLSMAISTAAGSGGA 75 (366) T ss_pred Cccccccccccccc-ccccccccccccccchhHHHHHHHHHhcccch--hHH--HHHHHHhhcchhhhhhccccccCCcc Confidence 11111111000000 000000011 0111112222222211111000 000 00000000011111223345567889 Q ss_pred ccchhHHhHHHHHHHhhhhhhhe-eeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhh Q lcl|Aclame:pro 139 VSSEEILYTPAREVKTVVDLKPF-TTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPL 217 (394) Q Consensus 139 lvP~~~~~~I~~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~v 217 (394) +||+++.+.|++.+++.++|+.+ ++++++.++...+|+.+. ...+.|+.|++..+ .++++|++++++++++++++++ T Consensus 76 lvP~~~~~~ii~~l~~~s~l~~lg~~~v~~~~g~~~~p~~t~-~~~a~wv~E~~~~~-~s~~~f~~i~~~~~k~~~~~~i 153 (366) T protein:vir:57 76 LIPQNMQNEVIELLRDRTVVRILGARSIPLPNGNLSMPRLSG-GATAGYVGEGKDVV-ATGATFDDVKLSAKTMIALVPV 153 (366) T ss_pred ccchhHHHHHHHHHhhhcchhhhceeeeecCCCceEEEEEeC-CcceeeeccCcccc-ccccceeEEEEeeEEEEEeehh Confidence 99999999999999999999998 899999888899998763 45556666665555 5789999999999999999999 Q ss_pred hHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-cccccc-----------------cHHH---HHHHHHhhh- Q lcl|Aclame:pro 218 SQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSF-TTKTVK-----------------NLDE---IKALLNGGF- 275 (394) Q Consensus 218 s~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~-~~~~~~-----------------~~~~---i~~~~~~~~- 275 (394) |+||++||.++++++|++.|+++++.+++.++++|+|++ .|.+.. +++. ..+.+.... T Consensus 154 S~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~ 233 (366) T protein:vir:57 154 SNQLIGRAGFNVEQLLLGDILSAIATREDKAFLRDDGTGDTPKGMKAVATAANRLVAWTGTAINLTTIDEYLDSLILKHM 233 (366) T ss_pred hHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccceeeccccccchhhHHHHHHHHHHhhh Confidence 999999999999999999999999999999999998864 332221 1222 223222222 Q ss_pred --hhh-cccEEEEcHHHHHHHHhhhccCCceeecccccCCCcccccccceEEecCccc------ccCceEEEeccccEEE Q lcl|Aclame:pro 276 --DPA-YNVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVL------GANKAFIGDFKRGVLF 346 (394) Q Consensus 276 --~~~-~~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~------~~~~~~~gd~~~~~~~ 346 (394) ..+ .++.|+||+.++..|++|+|++|+|+|.+ .. +++|+|+||++++.++. +...++||||++ |++ T Consensus 234 ~~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~l~~~-~~---~g~l~G~Pvv~s~~ip~~~~~~~~~~~i~~gdfs~-~~i 308 (366) T protein:vir:57 234 DSNSNMIRCGWGLSNRTYMTLFGLRDGNGNKVYPE-MS---QGILKGYPIQRTSAIPANLGDDGNESEIYFCDFND-VVI 308 (366) T ss_pred ccccccccCEEEecHHHHHHHHhhhccCCceeccC-CC---CCeecceeeEEccccccccccCCCccEEEEEecce-EEE Confidence 122 36899999999999999999999999953 32 35899999999775532 234689999997 568 Q ss_pred EeecceEEEEeecccc--------------cceEEEEEEeccEEecccceEEEEecCc Q lcl|Aclame:pro 347 ADRKDLGLRWADNEIY--------------GQYLQAVLRFGVSKVDDKAGYYVTFTPE 390 (394) Q Consensus 347 ~~~~~~~i~~~~~~~~--------------~~~~r~~~r~d~~v~~~~af~~l~~~~~ 390 (394) .++.+++|+.+++.++ ...+|+++|+|++|.||+||++++-..= T Consensus 309 ~~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 309 GEDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred EEecceEEEEeeccccccccccchhhhhcCceeEEeeeeeCcEeeccccEEEEecccC Confidence 8999999998877542 2358999999999999999999998777 No 94 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=100.00 E-value=1.6e-48 Score=282.61 Aligned_cols=258 Identities=19% Similarity=0.176 Sum_probs=209.0 Q ss_pred hhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCcccccccccccc----cccccccce Q lcl|Aclame:pro 128 KDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNP----ALAKPDFKD 203 (394) Q Consensus 128 ~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~----~~~~~~~~~ 203 (394) ...++++.++.++|+++++.|++.+++.++|+++++++++.+++.++|+.+. ...+.|+.|++..+ +.++++|++ T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~~-~~~a~wv~E~~~~~~~~~~~s~~~f~~ 79 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLAT-LPEADWVGESATDPKGVKPTSKVTWAN 79 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEEEEEeC-CcceEEeecccccccccccccccceee Confidence 5556777788999999999999999999999999999999999999998764 44556666665532 346899999 Q ss_pred eeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--------------------cccc Q lcl|Aclame:pro 204 VAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTK--------------------TVKN 263 (394) Q Consensus 204 v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~--------------------~~~~ 263 (394) +++++||++++++||+||++|+.+++++||++.|+++++.+++.++++|+|++.+. +... T Consensus 80 i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (305) T protein:vir:25 80 RTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPASWVSPALIPAAVTAGQAVEVVGGVAN 159 (305) T ss_pred EEeeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheeccCCCCCccccccccccccccccccccccchh Confidence 99999999999999999999999999999999999999999999999998764321 1112 Q ss_pred HHHHHHHHHhhhhh----hc-ccEEEEcHHHHHHHHhhhccCCceeecccccCCCcccccccceEEecCccc--ccCceE Q lcl|Aclame:pro 264 LDEIKALLNGGFDP----AY-NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVL--GANKAF 336 (394) Q Consensus 264 ~~~i~~~~~~~~~~----~~-~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~--~~~~~~ 336 (394) .+++.+.+...... .+ ...|+|||.++..|++++|++|+|+|+|+ +|+|+||++++..+. +.+.++ T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~-------~l~G~Pv~~~~~~~~~~~~~~~~ 232 (305) T protein:vir:25 160 ESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRDD-------SFAGFRTFFNRNGAWDADAAIEV 232 (305) T ss_pred hhHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHHhhccCCceeecCC-------cccccceEEcCccCCCCCccEEE Confidence 23344444333222 22 34699999999999999999999999763 899999999876543 445789 Q ss_pred EEeccccEEEEeecceEEEEeeccc----------c---cceEEEEEEeccEEecccceEEEEecCc--cCCC Q lcl|Aclame:pro 337 IGDFKRGVLFADRKDLGLRWADNEI----------Y---GQYLQAVLRFGVSKVDDKAGYYVTFTPE--PLPL 394 (394) Q Consensus 337 ~gd~~~~~~~~~~~~~~i~~~~~~~----------~---~~~~r~~~r~d~~v~~~~af~~l~~~~~--~~~~ 394 (394) +|||++ |+++++++++|+.+++.+ | ...+|+++|+|+.|.||+||++++.++. .+|- T Consensus 233 ~gd~s~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~~~~~pa 304 (305) T protein:vir:25 233 IADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPA 304 (305) T ss_pred EEecce-EEEEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEccccccccCCC Confidence 999997 567889999999887642 2 3358999999999999999999999866 3666 No 95 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=100.00 E-value=2.7e-48 Score=281.40 Aligned_cols=279 Identities=13% Similarity=0.036 Sum_probs=213.3 Q ss_pred HHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecC-------CCcc Q lcl|Aclame:pro 112 DEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRA-------TTKM 184 (394) Q Consensus 112 ~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-------~~~~ 184 (394) ......+....... ...+...+.++.++|+++++.|++.+++.++|+++|++++++++..++|+.... ...+ T Consensus 1 ~~~~~e~~~~~~~~-~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~~~~~ 79 (338) T protein:vir:78 1 MATLNELAPNTAGS-NHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPISYGETIIPTTVKRPEVGQVGVGTS 79 (338) T ss_pred CcchHHhhhhhccc-ccccceecccccccchHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEecCccceeeccccc Confidence 11111111111110 111223334566999999999999999999999999999999999999997642 2334 Q ss_pred cccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc--- Q lcl|Aclame:pro 185 VTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTKTV--- 261 (394) Q Consensus 185 ~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~~--- 261 (394) .|+.|++..+ .++++|++++++++++++++++|+|+++|+.+++++||.+.|+++++++++.++++|+|++++.+. T Consensus 80 ~~~~Eg~~~~-~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~~~gi 158 (338) T protein:vir:78 80 NEQREGGTKP-LSGTAWDTRSVAPIKLATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGSALQGI 158 (338) T ss_pred cccccccccc-ccccceeEEEEEEEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccccccc Confidence 4455555555 578999999999999999999999999999999999999999999999999999999886542211 Q ss_pred --------------------ccHHHHHHHHHhhhhhh--cccEEEEcHHHHHHH---HhhhccCCceeecccccCCCccc Q lcl|Aclame:pro 262 --------------------KNLDEIKALLNGGFDPA--YNVSLIVSQSFYQTL---DTLKDGNGRYLLQDDITAVSGKV 316 (394) Q Consensus 262 --------------------~~~~~i~~~~~~~~~~~--~~a~~vm~~~~~~~l---~~lkd~~G~~l~~~~~~~~~~~~ 316 (394) ..++++.+++....... ..++|+|||.++..| +.++|++|+|+|.+.+..+.+++ T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~~~~~~~~~~~ 238 (338) T protein:vir:78 159 DTNNVIVNTTNVDYLQTGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRDANGNVDPTRINLAASAGD 238 (338) T ss_pred ccccccccccccccccccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhccCCCceeecccccCCCCce Confidence 12345555544433222 245799999998876 45789999999998888888999 Q ss_pred ccccceEEecCccc-------ccCceEEEeccccEEEEeecceEEEEeeccc-----------------ccceEEEEEEe Q lcl|Aclame:pro 317 LLGKPVFVLSDEVL-------GANKAFIGDFKRGVLFADRKDLGLRWADNEI-----------------YGQYLQAVLRF 372 (394) Q Consensus 317 l~G~pV~~~~~~~~-------~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~-----------------~~~~~r~~~r~ 372 (394) |+|+||++++.++. ....++||||+. |+++++++++|+++++.. +...+|+++|+ T Consensus 239 l~G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~ 317 (338) T protein:vir:78 239 LLGLPVQFGKAVGGDLGAATDSKVRVVGGDFSQ-LKYGFADEIRVKMSDTATLTDNTSPTPQTVSMWQTNQIAILIEVTF 317 (338) T ss_pred eeeeeEEEccccCccccccCCcccEEEEEecce-EEEEeecccEEEEeecccccccccccccchhhhhcCcEEEEEEEEe Confidence 99999998765432 234588999987 678899999999887632 23458999999 Q ss_pred ccEEecccceEEEEecCccCC Q lcl|Aclame:pro 373 GVSKVDDKAGYYVTFTPEPLP 393 (394) Q Consensus 373 d~~v~~~~af~~l~~~~~~~~ 393 (394) |++|+||+||++|+-.++|-. T Consensus 318 d~~v~~~~a~~~l~~~~~~~~ 338 (338) T protein:vir:78 318 GWLLGDKQAFVKFVDDEDPDA 338 (338) T ss_pred ccEeecccceEEEecccCCCC Confidence 999999999999999999977 No 96 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=100.00 E-value=5.2e-48 Score=279.80 Aligned_cols=284 Identities=14% Similarity=0.094 Sum_probs=218.6 Q ss_pred chhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhh-cccccCCccccchhHHhHHHHHHHhhhhhh Q lcl|Aclame:pro 82 QEEKTYRE-SVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKD-GIKKENAKPVSSEEILYTPAREVKTVVDLK 159 (394) Q Consensus 82 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~ 159 (394) .++.+..+ ....+. ........ ..+. ......++.++|++++..|++.+++.++|+ T Consensus 1 ~~~~~~~~~~~~~f~-----------------~~~~~~~~-----~~a~~~~~~~~~~~lip~~~~~~ii~~~~~~s~l~ 58 (324) T protein:vir:96 1 MEQTQKLKLNLQHFA-----------------SNNVKPQV-----FNPDNVMMHEKKDGTLLNDFTTPILQEVMENSKIM 58 (324) T ss_pred CCcchhhhHHHHHHH-----------------Hhhhhhhh-----cccccccccCCCcceechhHHHHHHHHHHhhchhh Confidence 11100000 000000 00000000 0011 122344566999999999999999999999 Q ss_pred heeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHH Q lcl|Aclame:pro 160 PFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQ 239 (394) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~ 239 (394) +++++++++++.+++|+.+. .+.+.|++|++..+ .++++|+++++++++++++++||+|+++|+.++|+++|.+.|++ T Consensus 59 ~l~~~~~~~~~~~~~p~~~~-~~~a~~v~Eg~~~~-~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~ 136 (324) T protein:vir:96 59 QLGKYEPMEGTEKKFTFWAD-KPGAYWVGEGQKIE-TSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAE 136 (324) T ss_pred hhcceeeccCCceEEEEEec-CcceeeecCCcccc-ccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHH Confidence 99999999998899999764 44556666666665 57899999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhccccccc---------------cccccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCce Q lcl|Aclame:pro 240 IKVNTTNDAIAKVLKSFTT---------------KTVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRY 303 (394) Q Consensus 240 ~~~~~~~~a~~~g~~~~~~---------------~~~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~ 303 (394) +++.+++.+++.|++++.. .+..+++++.+++..+...++ .++|+||++++..|++++|++|+| T Consensus 137 aia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~i~~~~~~~~~~i~n~~~~~~L~~lkd~~G~~ 216 (324) T protein:vir:96 137 AFYKKFDEAGILNQGNNPFGKSIAQSIKKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKE 216 (324) T ss_pred HHHHHHHHHhhhcCCCCCcCccccccccccceecccccchHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCCCe Confidence 9999999999998776432 122358899998877766555 468999999999999999999999 Q ss_pred eecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeeccc--------------c---cceE Q lcl|Aclame:pro 304 LLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEI--------------Y---GQYL 366 (394) Q Consensus 304 l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~--------------~---~~~~ 366 (394) ++.+ +.+++|+|+||+++++...+.+.+++|||++ ++++++++++|+.+++.. | ...+ T Consensus 217 ~~~~----~~~~~l~G~PV~~~~~~~~~~~~~~~gd~s~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~ 291 (324) T protein:vir:96 217 RIYD----RNSDSLDGLPVVNLKSSNLKRGELITGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVAL 291 (324) T ss_pred eecC----CCCCcccceeeEeecCCCCCcceEEEEecce-EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEE Confidence 9853 3467999999999877777888899999997 567789999999987643 2 3458 Q ss_pred EEEEEeccEEecccceEEEEecCc---cCCC Q lcl|Aclame:pro 367 QAVLRFGVSKVDDKAGYYVTFTPE---PLPL 394 (394) Q Consensus 367 r~~~r~d~~v~~~~af~~l~~~~~---~~~~ 394 (394) |+++|+|+++.+|+||++|+.+.. +||- T Consensus 292 r~~~r~d~~v~~~~a~~~l~~a~~~~~~~~~ 322 (324) T protein:vir:96 292 RATMHVALHIADDKAFAKLVPADKRTDSVPG 322 (324) T ss_pred EEEEEeccEEecccceEEEecccccCCCCCC Confidence 999999999999999999996544 4666 No 97 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=100.00 E-value=1e-47 Score=278.26 Aligned_cols=259 Identities=12% Similarity=0.058 Sum_probs=205.9 Q ss_pred hcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecH Q lcl|Aclame:pro 129 DGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNI 208 (394) Q Consensus 129 ~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~ 208 (394) ..+.++++++++|+++++.|++.+++.++|+++|+++++.++..++|+.+. ...+.|++|++..+ .++++|+++++++ T Consensus 1 Mat~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~~~p~~~~-~~~a~wv~Eg~~~~-~~~~~f~~v~l~~ 78 (311) T protein:vir:99 1 MATFGTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNEDIITFNG-RPKAEFVGEGQQKS-STTGEFDFVTSTP 78 (311) T ss_pred CceecCCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEeC-CceeEEeecCcccc-cccceeeEEEEee Confidence 223445667899999999999999999999999999999998899998764 44566666766666 5789999999999 Q ss_pred hhhhhhhhhhHHHHh---ccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc---------------------cc- Q lcl|Aclame:pro 209 DTYRGAIPLSQESID---DADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTKTV---------------------KN- 263 (394) Q Consensus 209 ~~~~~~~~vs~ell~---ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~~---------------------~~- 263 (394) +++++++++|+||++ |+.++|.++|.+.|+++++++++.++++|+|++++.+. .. T Consensus 79 ~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~ 158 (311) T protein:vir:99 79 KKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVELTADTIANP 158 (311) T ss_pred EEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceeeccccccchh Confidence 999999999999994 77889999999999999999999999999875432111 01 Q ss_pred HHHHHHHHHhhhhh---hcccEEEEcHHHHHHHHhhhccCCceeecccccCCCcccccccceEEecCcc----------- Q lcl|Aclame:pro 264 LDEIKALLNGGFDP---AYNVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEV----------- 329 (394) Q Consensus 264 ~~~i~~~~~~~~~~---~~~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~----------- 329 (394) ++++.+++...... +....|+||+.++..|++|||++|+|+|++.+..+.+++|+|+||++++... T Consensus 159 ~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~Pv~~s~~i~~~~~~~~~~~~ 238 (311) T protein:vir:99 159 DLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYTDGRKKFPELGLGIGVSSFEGIDASVSDTVNGGDEADPDDED 238 (311) T ss_pred HHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhccCCCeeecCcccCCCCceecceeeEeecccccccccccccch Confidence 12333333322222 2234599999999999999999999999998888888999999999865332 Q ss_pred ---cccCceEEEeccccEEEEeecceEEEEeecc----------cccceEEEEEEeccEEecccceEEEEecCc Q lcl|Aclame:pro 330 ---LGANKAFIGDFKRGVLFADRKDLGLRWADNE----------IYGQYLQAVLRFGVSKVDDKAGYYVTFTPE 390 (394) Q Consensus 330 ---~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~----------~~~~~~r~~~r~d~~v~~~~af~~l~~~~~ 390 (394) .+...+++|||++++.+..+.+++++.+++. ++...+|+++|+|++|.|| +|++++..+| T Consensus 239 ~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~~~-~~v~~~~~~A 311 (311) T protein:vir:99 239 LDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYGWYVFTD-RFVVIENAVA 311 (311) T ss_pred hhccCcceEEEeeccccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEeecceecCh-hHeeeecccC Confidence 1234568899999888888999999987642 2345689999999999997 6777777776 No 98 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=100.00 E-value=6.3e-46 Score=268.38 Aligned_cols=377 Identities=11% Similarity=0.067 Sum_probs=216.6 Q ss_pred Ch-HHHHHHHH-HHHH----------------------HHHHHHHHHHHHHHHHh---chhh-HHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MF-EEKIKEIK-ATIA----------------------DLNNTIVTKTAQVKNAL---ESDD-LEAARSIKAEVEQAKAN 52 (394) Q Consensus 1 ~l-~e~l~eL~-~~~~----------------------el~~~~~~~~~e~~~~~---~~e~-~~~~~~~~~ei~~l~~~ 52 (394) |- ++...+.. +... +...+-.+...++..+. ..++ ..++-.....+++.+++ T Consensus 185 ~~~~~~~~~~~~~~~~~~r~~~~~a~~~~~~~~~a~~~~~~~~E~~r~~eI~~l~~~~~~~~~~~~ai~~g~sld~~ra~ 264 (632) T protein:vir:96 185 MPDKDKQTQTAGSQQTETRGAETGAKNPAPAASGANENDILSRERTRISEITAIGQQFSQRSLAQEAIQKGHTVDQFRAL 264 (632) T ss_pred ccchhhhhhccccccccccchhhcccccchhhhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhHHHHHhccccHHHHHHH Confidence 10 00000000 0000 00000000011111100 0000 00000000111111111 Q ss_pred HHHHHHHHHH--HHHHHh--hcccccccc-ccccc-hhhhHHHHHHHHHHHHHHHH--HHHHHHH-------HHHHHHHH Q lcl|Aclame:pro 53 LVEAENDLKL--YESSVE--VGGAENIGG-KEVTQ-EEKTYRESVNDFIRSKGKIV--NDSLRFE-------GKDEVLMP 117 (394) Q Consensus 53 i~~l~~~~~~--~~~~~~--~~~~~~~~~-~~~~~-~~~~~~~~~~~~~~~~~~~~--~~~~~~~-------~~~~~~~~ 117 (394) +.+.....+. ..+... ......... ..... ........+...++...... ....... ......+. T Consensus 265 ~ld~l~~~~~a~~~~~~a~~~~~~~~~~~~~~i~~~~re~~~~~l~rai~a~a~~~~~~a~~~~e~a~~~a~~~G~~arg 344 (632) T protein:vir:96 265 VLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARG 344 (632) T ss_pred HHHHHhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHhhhccchhhhhhhhHHHHHHHHhhhhhhhh Confidence 1000000000 000000 000000000 00000 00000011111111100000 0000000 00000000 Q ss_pred -HH-hhhhhhhhhhcccccCCccccchhH-HhHHHHHHHhhhhhhhe-eeeEeecCCceeEEEEecCCCccccccccccc Q lcl|Aclame:pro 118 -IN-ETTPVEPQKDGIKKENAKPVSSEEI-LYTPAREVKTVVDLKPF-TTVYQAKKASGKYPVLQRATTKMVTVAELEKN 193 (394) Q Consensus 118 -~~-~~~~~~~~~~~~~~~~~~~lvP~~~-~~~I~~~~~~~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~ 193 (394) .. ............+.+.|+++||+++ ...|++.++..++++.+ ++++++.++.+++|..+. +..+.|++|++.. T Consensus 345 ~~~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l~~~~~~~~~g~~~ip~~~~-~~~a~wv~E~~~~ 423 (632) T protein:vir:96 345 FYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQMGARMLPGLVGDVDIPKKTS-GANFYWIGEDEDV 423 (632) T ss_pred hhhhHHHHHHhhhhcccccccccccccccchHHHHHHHhhcchhhhhcceEeecCCcceEEEEEeC-CceeEeecCCccc Confidence 00 0001111223345567788999886 68899999999999887 788888888889998763 4556666666666 Q ss_pred ccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-cccc------------ Q lcl|Aclame:pro 194 PALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSF-TTKT------------ 260 (394) Q Consensus 194 ~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~-~~~~------------ 260 (394) + .++++|+++++++++++++++||+|||.|+.++++++|++.|+++++.+++.++++|+|++ .|.+ T Consensus 424 ~-~s~~~f~~i~l~~~k~~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~ 502 (632) T protein:vir:96 424 Q-DSDFDFTTLSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTY 502 (632) T ss_pred c-ccccceeeEEeeeeEEEEehhhHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccceeeecccccceec Confidence 6 5889999999999999999999999999999999999999999999999999999998853 2322 Q ss_pred ---cccHHHHHHHHHhhhhhh---cccEEEEcHHHHHHHHh--hhccCCceeecccccCCCcccccccceEEecCccccc Q lcl|Aclame:pro 261 ---VKNLDEIKALLNGGFDPA---YNVSLIVSQSFYQTLDT--LKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGA 332 (394) Q Consensus 261 ---~~~~~~i~~~~~~~~~~~---~~a~~vm~~~~~~~l~~--lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~ 332 (394) ..+++++.++...+...+ .+++|+||+.++..|.+ ++|++|+|||++ ++|+|+||++++ ..+. T Consensus 503 ~~~~~~~~~i~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~l~d~~G~~i~~~-------~~l~G~pv~~s~--~ip~ 573 (632) T protein:vir:96 503 PAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------NEVNGYRAEASN--QIPA 573 (632) T ss_pred ccccCCHHHHHHHHHHHhhcccccCccEEEEchhHHHHHHHHhccCCCCceeecC-------CeecccceEecc--cccc Confidence 235677877766554433 25789999998877765 789999999964 489999999865 4567 Q ss_pred CceEEEeccccEEEEeecceEEEEeecccccc---eEEEEEEeccEEecccceEEEEecC Q lcl|Aclame:pro 333 NKAFIGDFKRGVLFADRKDLGLRWADNEIYGQ---YLQAVLRFGVSKVDDKAGYYVTFTP 389 (394) Q Consensus 333 ~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~---~~r~~~r~d~~v~~~~af~~l~~~~ 389 (394) +.++||||+. |+++++.++.|.++++.++.+ .+|+++|+|++|++|++|+.++..+ T Consensus 574 ~~~~~gd~s~-~~i~~~~~~~i~~~~~~~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 574 DTWIFGDWSQ-IVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred CcEEEeecce-EEEEEecceEEEEccccccccCceEEEEEeecCceeechhhhhheeecC Confidence 7899999997 567789999999988876654 5899999999999999999999999 No 99 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=100.00 E-value=2.1e-39 Score=232.66 Aligned_cols=364 Identities=13% Similarity=0.075 Sum_probs=203.4 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHhhccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVE--------AENDLKLYESSVEVGGA 72 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~--------l~~~~~~~~~~~~~~~~ 72 (394) =-..++..+++..+....++.+..+..+... +..+..+++.++++++.+++.+ +..+++..+........ T Consensus 125 ~~~a~I~~vke~~~~e~~~~~~~~a~~ee~~--e~~~k~~el~a~l~~~~~~~~~~~~e~~~~l~a~~~~~~~~~~~~~~ 202 (517) T protein:vir:97 125 NKNAVVTYFREEKKKEENKMTFDQNLMQELL--DAKKLAADLNAKLKERENGGDNAALKTVSELAANLMKQRESEKILGV 202 (517) T ss_pred hhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh--hhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHhhhhccc Confidence 0001111111111111111111110011000 0011112222223222222211 11111111110000000 Q ss_pred cccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHH Q lcl|Aclame:pro 73 ENIGGKEVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREV 152 (394) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~ 152 (394) . .........+ +.......... ......... .............++++.|+.+...|...+ T Consensus 203 ~---~~~~~~~~~~-------~~~~~~~~~~~-----~~~~~~~~~----~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~ 263 (517) T protein:vir:97 203 E---ALKVTPEATE-------FLKTREAEVAY-----MSASLTKDP----KAAWTAELKERGISGMPAPAGILKRIQDAV 263 (517) T ss_pred c---cccccchhhH-------HHHHHHHHHHH-----HHhcccccc----cceeeeecccccccccccchHHHHHHHHhh Confidence 0 0000000000 00000000000 000000000 000001112334467889999999999999 Q ss_pred HhhhhhhheeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHH---- Q lcl|Aclame:pro 153 KTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVD---- 228 (394) Q Consensus 153 ~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~---- 228 (394) ...++++.++++.++. ....|.. .....+.++.|+ ..+++++++|+.++++++++++++++|++|++|+.+| T Consensus 264 ~~~~~i~~~~~~~~i~--~~~~~~~-~~~~~a~~~~eG-~~kp~s~~tf~~~~~~~~~ia~~~~~S~qll~Ds~~dd~~~ 339 (517) T protein:vir:97 264 NDEGSLLPFIRHENLP--TLVVGGD-NALTQGTGHTTG-TDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGA 339 (517) T ss_pred hhhccceeeeeecccc--ceeeecc-cccceeeeeecC-CcccccccceeeEEeeHhhhhhhhhhhHHHHHHhhhccHHH Confidence 8888888877765443 2334432 222333444444 4445689999999999999999999999999998777 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccccccc-------------ccccHH---HHHHHHHhhhhhhcccEEEEcHHHHHH Q lcl|Aclame:pro 229 LVGIVSESISQIKVNTTNDAIAKVLKSFTTK-------------TVKNLD---EIKALLNGGFDPAYNVSLIVSQSFYQT 292 (394) Q Consensus 229 l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~-------------~~~~~~---~i~~~~~~~~~~~~~a~~vm~~~~~~~ 292 (394) |++||.++|++.++..++.+|++|+|++... +..+.+ +++..+...+..+++++|||||.+|.. T Consensus 340 l~s~i~~~l~~~l~~~ee~a~l~GdGtg~~~~gi~~~a~~~~~~~~~~~~~~~d~i~~l~~a~~~a~~a~~vmn~~t~~~ 419 (517) T protein:vir:97 340 ILTYVMNRLPDMVIMAVNRAIIMGGVTGVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVATPKAADSTLVIHRNDLAA 419 (517) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcccCCCcccccccccccccccccccccchHHHHHHHHHHHhhhccCCEEEECHHHHHH Confidence 9999999999999999999999998875321 111223 334444444555568999999999999 Q ss_pred HHhhhccCCceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEE-EEeecccccceEEEEEE Q lcl|Aclame:pro 293 LDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGL-RWADNEIYGQYLQAVLR 371 (394) Q Consensus 293 l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i-~~~~~~~~~~~~r~~~r 371 (394) |++|||++|||||++.+..+.+.+++|..- +++....+.. .+++++ +|+++++.|+.+ +..+..++...|+..+| T Consensus 420 I~klKD~~G~Yl~~~~~~~~~~~~l~G~~~-~~~~~~~~~~--~~~~~~-~y~i~~~~g~~~~~~fd~~~n~~~f~~~~~ 495 (517) T protein:vir:97 420 IRFLKDKNGNYVFPVGVSNQTIATHFGFNR-LVQSVAVDEK--TAVSLS-GYVTNGSRGMEFEQGTILVENNKEYLFEMP 495 (517) T ss_pred HHHhhcCCCCeeccCcCCcccccccCCccc-cccccccCce--eEeecc-ccEEEeecceeeeeeeecccCceeEeeeee Confidence 999999999999999888888899999522 2333333333 344444 567777777653 22333456677899999 Q ss_pred eccEEecccceEEEEecCccCC Q lcl|Aclame:pro 372 FGVSKVDDKAGYYVTFTPEPLP 393 (394) Q Consensus 372 ~d~~v~~~~af~~l~~~~~~~~ 393 (394) ++|.|..|++|+++++.|+++- T Consensus 496 ~~g~i~~~~r~a~~~~~p~~~~ 517 (517) T protein:vir:97 496 ISGSLEYKGTTAYGTYTPPVAG 517 (517) T ss_pred eccccccccceEEEEEcCCCCC Confidence 9999999999999999999988 No 100 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=100.00 E-value=2.3e-39 Score=232.44 Aligned_cols=275 Identities=10% Similarity=0.048 Sum_probs=201.0 Q ss_pred HHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeec-CCceeEEEEecC--CCcc Q lcl|Aclame:pro 108 FEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAK-KASGKYPVLQRA--TTKM 184 (394) Q Consensus 108 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~--~~~~ 184 (394) ...-... ..+... ...++.+++..+|++++|+... .+++.+.+.++++++|++++.. +....++..... .... T Consensus 1 ~~~~~~~--~~~~~~-~~~k~~t~~d~~Gg~l~P~~~~-~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~~~~~g 76 (315) T protein:vir:41 1 MLTIEDI--RGGKPF-EIVPKIDVPDLGRGVLSVDRFG-EFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVLDVGPG 76 (315) T ss_pred Ccccchh--hcCChh-hhhhhcCCcCCCCceechHHHH-HHHHHHHhhhhhhhhceeeeccccccccccccccCcccccc Confidence 0000000 011111 1112344566678889998865 5889999999999999987643 333333322111 1122 Q ss_pred cccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHH--HHHHHHHHHHHHHHHHHHHHHHhhcccccc----- Q lcl|Aclame:pro 185 VTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADV--DLVGIVSESISQIKVNTTNDAIAKVLKSFT----- 257 (394) Q Consensus 185 ~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~--~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~----- 257 (394) .+|.++...+++++++|+++.+.++++++.+.+|+++|+|+.+ ||++||.+.++++++..++.++++|+|+.+ T Consensus 77 ~~~~~~~~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s~~p~~~ 156 (315) T protein:vir:41 77 RDETGQKLAPPESTAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDTSSSDPLLR 156 (315) T ss_pred cccccCcCCCCCCccccceeeeceeeeeeeccccHHHHHhhhccccHHHHHHHHHHHHHHHHHHHHhhccCCcCcCcccc Confidence 3455555555668899999999999999999999999999964 999999999999999999999999987421 Q ss_pred -cccc-------------------ccHHHHHHHHHhhhhhhc----ccEEEEcHHHHHHHHhhhccCCceeecccccCCC Q lcl|Aclame:pro 258 -TKTV-------------------KNLDEIKALLNGGFDPAY----NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVS 313 (394) Q Consensus 258 -~~~~-------------------~~~~~i~~~~~~~~~~~~----~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~ 313 (394) +.|. .+.+.+.+++..+...++ +++|+||+.++..+++++|++|+|+|+|.+..+. T Consensus 157 ~~~G~l~~a~~~~~~~~~~~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~~~g~~lw~~~~~~g~ 236 (315) T protein:vir:41 157 MSDGWLKLASEKLTESDVDPEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDALKGRETGLGDQALTGAN 236 (315) T ss_pred ccccceecccccccccccccccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHhccCCCccccchhhcCC Confidence 1111 124556777777777675 5689999999999999999999999999999999 Q ss_pred cccccccceEEecCcc---cccCceEEEeccccEEEEeecceEEEEeeccccc-ceEEEEEEeccEEecccc--eEEEEe Q lcl|Aclame:pro 314 GKVLLGKPVFVLSDEV---LGANKAFIGDFKRGVLFADRKDLGLRWADNEIYG-QYLQAVLRFGVSKVDDKA--GYYVTF 387 (394) Q Consensus 314 ~~~l~G~pV~~~~~~~---~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~-~~~r~~~r~d~~v~~~~a--f~~l~~ 387 (394) +.+|+|+||+.+++++ .+++.++||||+. ++++.+.++.++.++..... ..+.+.+|+|+.+..+++ .+.+++ T Consensus 237 ~~tl~G~PV~~~~~m~~~~~~~~~ilf~d~~n-l~~~~~~~i~i~~~~~a~~~~~~~~~~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 237 SILYDGRPVQYVPALEALNDGKSRALFVVPTQ-LVYGFWRNIKVVPDYDAEMRLTKYVASLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred CceecccceEecccccccCCCCccEEEecccc-eEEEeccccEEEeeecCCCCceEEEEEEEeceeEEeccceeEeeeeC Confidence 9999999999887664 4678899999987 56677888888877665443 346777899998887776 455555 No 101 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=100.00 E-value=8.7e-38 Score=223.78 Aligned_cols=276 Identities=10% Similarity=0.023 Sum_probs=210.3 Q ss_pred HHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEee-cCCceeEEEEecCCC--ccccccccc Q lcl|Aclame:pro 115 LMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQA-KKASGKYPVLQRATT--KMVTVAELE 191 (394) Q Consensus 115 ~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~-~~~~~~~~~~~~~~~--~~~~~~e~~ 191 (394) ...++.... ..+..++....|++++|+++. .+++.+++.++++++++++++ .+....+|....... ....|.++. T Consensus 1 ~~~~~~~~~-~~k~it~~d~~gG~L~P~~~~-~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i~~g~~~~~~~~~~~~~ 78 (314) T protein:vir:41 1 MDFLNKPFQ-ITPKIDVPDLGKGILAVQRFG-EFVREVRENSAIIKDARVLNALKSYEVDISRISLGVELEPGRNTSGTK 78 (314) T ss_pred CchhhhHHH-hhcccccccCCCceeChHHHH-HHHHHHHhccchhhheeeecccCccceeecccccCcccccccccccCC Confidence 111111111 122234566678899999975 688999999999999999864 566677776543221 122344444 Q ss_pred ccccccccccceeeecHhhhhhhhhhhHHHHhccHH--HHHHHHHHHHHHHHHHHHHHHHhhcccccc--------cccc Q lcl|Aclame:pro 192 KNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADV--DLVGIVSESISQIKVNTTNDAIAKVLKSFT--------TKTV 261 (394) Q Consensus 192 ~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~--~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~--------~~~~ 261 (394) ...+.++++|+++.+.+|++...++||+|+|+|+.+ ||+++|...++++++..+..++++|+|+.+ +.|. T Consensus 79 ~~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~~p~G~ 158 (314) T protein:vir:41 79 VAPTADEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELYRINDGW 158 (314) T ss_pred ccCCcccccccceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccchhcchhh Confidence 444568899999999999999999999999999975 999999999999999999999999988532 1111 Q ss_pred -----------------ccHHHHHHHHHhhhhhhc----ccEEEEcHHHHHHHHhhhccCCceeecccccCCCccccccc Q lcl|Aclame:pro 262 -----------------KNLDEIKALLNGGFDPAY----NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGK 320 (394) Q Consensus 262 -----------------~~~~~i~~~~~~~~~~~~----~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~ 320 (394) ...+.+.+++..+.++++ +++|+||+.++..++++++.+|+|+|++.+..+.+.+|+|+ T Consensus 159 l~~a~~~~~~~~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~~~~l~~~~~~~~~~~~l~G~ 238 (314) T protein:vir:41 159 MKLAGNQYTDAEPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVRETGLGDSALIGATGLQYDGI 238 (314) T ss_pred hhhcccceeecCccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHhccCCcccchhhhCCCCceecce Confidence 112345666777666665 46899999999999999999999999999999999999999 Q ss_pred ceEEecCc---ccccCceEEEeccccEEEEeecceEEEEeeccccc-ceEEEEEEeccEEecccceEEEEecCccCC Q lcl|Aclame:pro 321 PVFVLSDE---VLGANKAFIGDFKRGVLFADRKDLGLRWADNEIYG-QYLQAVLRFGVSKVDDKAGYYVTFTPEPLP 393 (394) Q Consensus 321 pV~~~~~~---~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~-~~~r~~~r~d~~v~~~~af~~l~~~~~~~~ 393 (394) ||+.++++ ..++.+++||||+.+ +++.+..+.+......... ..+.+.+|+|+.+..++|.++..+..+.+- T Consensus 239 PV~~~~~~~~~~~~~~~i~fgd~~nl-v~~~~~~ir~~~~~~a~~~~~~~~~~~r~d~~~~~~~aa~~~~~~~~~~~ 314 (314) T protein:vir:41 239 PIQYVPALDALGDDKARALLTVPTNL-VYGFWRNIRIEPKRDAAMRRTEYIASLRADCNYEDENAAVAAVIDMSSGG 314 (314) T ss_pred eeEecccccccCCCCceEEEechhhe-EEEeeceeEEeecccCcCCeEEEEEEEEeceEEEEcCcEEEEEeeccCCC Confidence 99987754 457899999999974 5566777777665554443 357888999999999999888888877777 No 102 >protein:vir:4074 Length: 480 # NCBI annotation: major capsid (head) protein # Family: family:all:11745 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043553;genbank:gi:9628687;genbank:GeneID:1261180 Probab=100.00 E-value=1.4e-34 Score=206.15 Aligned_cols=348 Identities=8% Similarity=0.024 Sum_probs=174.4 Q ss_pred ChHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Q lcl|Aclame:pro 1 MFEE-KIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKE 79 (394) Q Consensus 1 ~l~e-~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~ 79 (394) +.++ ++..+++........ ...++.++.. .+....+....++.+++++++..+++.+........ .... T Consensus 111 a~~~a~v~~vks~~~~~e~~--~~~~e~~e~~-----~e~~e~~~~~~el~akl~el~k~~ee~k~~~~~~~~---~~~~ 180 (480) T protein:vir:40 111 SNKGAKVTKVREENKGEQEQ--MGANETQEIM-----KQAIEAGVKVRELEAKVEELNKEREELKKEREASIP---SEKP 180 (480) T ss_pred cchhhhhhhhhhhhhhhhhh--hhhHHHHHHH-----HhhhhhhhhhhhHHHHHHHHHhHHHHHhhhhhhhcc---ccch Confidence 2211 111111110000000 0000000000 000001112222333333333222222211111110 0000 Q ss_pred ccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhh Q lcl|Aclame:pro 80 VTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLK 159 (394) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~ 159 (394) ........+.+..+.+... ...................++. +|+.+...+.......+++. T Consensus 181 -~~~~~~e~r~~~~~~~~~~-----------------e~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 241 (480) T protein:vir:40 181 -EDAERKFMRELGSKMAEMP-----------------EQGFLREFANGADLNVVNSLGS-ITSKYARKSGIYDGAMKARF 241 (480) T ss_pred -hhhhhHHHHHHHHHhccch-----------------hhhhhhhhhhhccccccccccc-cccchhhheeechhhhhhhh Confidence 0000001111111110000 0000111111111112222233 44444444444444444444 Q ss_pred heeeeEeecCCceeEEEEecCCCcccccccccccccc-cccccceeeec---HhhhhhhhhhhHHHHhccHHHHHHHHHH Q lcl|Aclame:pro 160 PFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPAL-AKPDFKDVAWN---IDTYRGAIPLSQESIDDADVDLVGIVSE 235 (394) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~-~~~~~~~v~~~---~~~~~~~~~vs~ell~ds~~~l~~~i~~ 235 (394) ..+.....+ + ....|.++......+ ....+....+. .++++...++|+++|+|+. +|++||.+ T Consensus 242 ~~~~~~~~g-~-----------~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~~~k~t~~lLDDa~-~l~~~i~~ 308 (480) T protein:vir:40 242 QGLTLAEDG-V-----------DDTFISGTFKAGTDKNKSQTATKRSLRPQMAEAYLQMDKATVRGVNDSG-ALSEYVMS 308 (480) T ss_pred hcceeeecc-c-----------cceeeeeeeecccccccccccccchhhHHHHHHHHHhHHHHHHHhhhhH-HHHHHHHH Confidence 443322211 1 111222222211111 11233444444 5788999999999999987 79999999 Q ss_pred HHHHHHHHHHHHHHhhccccccc-------------cccccHHHHHHHHHhhhhhhc-cc-EEEEcHHHHHHHHhhhccC Q lcl|Aclame:pro 236 SISQIKVNTTNDAIAKVLKSFTT-------------KTVKNLDEIKALLNGGFDPAY-NV-SLIVSQSFYQTLDTLKDGN 300 (394) Q Consensus 236 ~l~~~~~~~~~~a~~~g~~~~~~-------------~~~~~~~~i~~~~~~~~~~~~-~a-~~vm~~~~~~~l~~lkd~~ 300 (394) +|++.++..++.+|++|++++.. .+..+.|.+.+++.++..+++ ++ .||||+.+|+.|++|||++ T Consensus 309 ~l~~~~~~~ee~a~l~G~g~g~~~~~g~~~~~~~~~~~~~~~d~id~L~~al~~~y~~~a~~~vmn~~t~~~I~klKD~~ 388 (480) T protein:vir:40 309 EMVNRVIQKVEYNMILGSVDGSNGFYGLKTATDGWTKQIEYTDLFEGITDAVAECSISDAITIVMSPQTFAELRKAKGTD 388 (480) T ss_pred HHHHHHHHHHHHHhhccCCCCccccccceeecccccccchhHHHHHHHHHhhhHHhhCCCCEEEECHHHHHHHHHhhcCC Confidence 99999999999999999654421 112223334456666655554 56 6999999999999999999 Q ss_pred CceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeecc--cccceEEEEEEeccEEec Q lcl|Aclame:pro 301 GRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNE--IYGQYLQAVLRFGVSKVD 378 (394) Q Consensus 301 G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~--~~~~~~r~~~r~d~~v~~ 378 (394) |+|||+|+++.+.+.+|||+||++.+ ...+.+...+|.++.+++++|+.. .. +.++. .....+.+..|++|.|.+ T Consensus 389 G~Yi~q~~~~~~~~~~llG~pvv~~~-~~~~~~~~~~~~~~~~~~~~d~~~-~~-~~~~~~~~~~~~~~~e~~v~g~~~~ 465 (480) T protein:vir:40 389 GHSRFNELATKEQIAQSFGAVNLETR-VWMPKDEVAVYNHDEYVLIGDLNV-EN-YNDFDLRYNVEQWLSETLVGGSIRG 465 (480) T ss_pred CCeeccCcccccCcceecccceeeee-ccccCCcceeeeCCccEEEEeccc-ce-ecccccccchhhhhhhhhhceeeEc Confidence 99999999999999999999997754 334445556677777888988742 22 11111 222345778899999999 Q ss_pred ccceEEEEecCccCC Q lcl|Aclame:pro 379 DKAGYYVTFTPEPLP 393 (394) Q Consensus 379 ~~af~~l~~~~~~~~ 393 (394) |++|.+++...-.-- T Consensus 466 ~~~~~~~~~~~~~~~ 480 (480) T protein:vir:40 466 KNRSAYLKKKGSLGV 480 (480) T ss_pred cccEEEEEeccCcCC Confidence 999999887765433 No 103 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=100.00 E-value=8.9e-33 Score=196.31 Aligned_cols=278 Identities=13% Similarity=0.009 Sum_probs=202.0 Q ss_pred HHHHHHHHhhhhh-hhhhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCcccccccc Q lcl|Aclame:pro 112 DEVLMPINETTPV-EPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAEL 190 (394) Q Consensus 112 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~ 190 (394) ...+......... .....+.....+++++|+++...|++.+.+.++++++++++++.+..+.+|.... ++...|+.++ T Consensus 1 ~~~k~~~~~l~~~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~~~~-~~~~~~~~~e 79 (321) T protein:vir:31 1 MASRTINNDLSRITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKTRIPTLNI-GERHRRPQDE 79 (321) T ss_pred CchHHHHHHHHHHHHhccccccccCCcceeCHHHHHHHHHHHHHhhhhhhhceeeeccCcceeeeeecc-CCcccccccc Confidence 0000000000111 1112334556677899999999999999999999999999999988888886543 3334444433 Q ss_pred -cccccccccccceeeecHhhhhhhhhhhHHHHhccH--HHHHHHHHHHHHHHHHHHHHHHHhhcccccccc-------- Q lcl|Aclame:pro 191 -EKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDAD--VDLVGIVSESISQIKVNTTNDAIAKVLKSFTTK-------- 259 (394) Q Consensus 191 -~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~--~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~-------- 259 (394) ......++++|+++++..+++...++||+++|+|+. ++|+++|.+.++++++..++.++++|++++.+. T Consensus 80 ~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~~~~~n~G~ 159 (321) T protein:vir:31 80 GEWNENESDVSTGTIDISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDSFENQNDGF 159 (321) T ss_pred cccccccccceeeeeeeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCcccccchhh Confidence 333345789999999999999999999999999975 589999999999999999999999998765431 Q ss_pred ---------------ccccHHHHHHHHHhhhhhhc---ccEEEEcHHHHHHHHh-hhccCCceeecccccCCCccccccc Q lcl|Aclame:pro 260 ---------------TVKNLDEIKALLNGGFDPAY---NVSLIVSQSFYQTLDT-LKDGNGRYLLQDDITAVSGKVLLGK 320 (394) Q Consensus 260 ---------------~~~~~~~i~~~~~~~~~~~~---~a~~vm~~~~~~~l~~-lkd~~G~~l~~~~~~~~~~~~l~G~ 320 (394) +..+++.+.+++..+...++ +.+|+||+.++..++. |+|.+ .++|.+.+.++.+.+|+|+ T Consensus 160 l~~a~~~~~~~~~~~~~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~~~~-~~~~~~~l~~~~~~tl~G~ 238 (321) T protein:vir:31 160 ITVAEGDVETIDAADDILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLTDRD-TPLGDNVIMGEADVNPFSF 238 (321) T ss_pred hhhhccccccccccccccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHHhcCC-Cccccchhhccccccccce Confidence 11245677777766555554 4689999999887764 66654 5789888888888899999 Q ss_pred ceEEecCcccccCceEEEeccccEEEEeecceEEEEeecccc---c-ceEEEE--EEeccEEecccceEEEEecCcc-CC Q lcl|Aclame:pro 321 PVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEIY---G-QYLQAV--LRFGVSKVDDKAGYYVTFTPEP-LP 393 (394) Q Consensus 321 pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~---~-~~~r~~--~r~d~~v~~~~af~~l~~~~~~-~~ 393 (394) ||+.++.+ +++.++++||++.++ +.+.++.++....... . ..++.+ .++|+.|.+++|++.++.-+.| -| T Consensus 239 pvv~~~~m--P~~~il~t~~~nl~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~~~a~a~~~~i~~~~~~ 315 (321) T protein:vir:31 239 PIIGSGLW--PDDKAMFTDPQNLIY-ALYRDLEIDVLTESDKVSERDLHARYFMRGDDDFAIENTEAVVLAEGLGDPLEH 315 (321) T ss_pred eEEEcCCC--CCCcEEEeccccEEE-EEeeccEEEEeecCccccccceeeEeeeeeecceeEeccccEEEEecCCcchhc Confidence 99998754 677899999998543 3456777776544322 2 234444 4589999999999999966554 33 Q ss_pred C Q lcl|Aclame:pro 394 L 394 (394) Q Consensus 394 ~ 394 (394) + T Consensus 316 ~ 316 (321) T protein:vir:31 316 L 316 (321) T ss_pred c Confidence 3 No 104 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=99.94 E-value=2e-28 Score=172.44 Aligned_cols=261 Identities=13% Similarity=0.087 Sum_probs=199.7 Q ss_pred hhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEe----ecCCceeEEEEecCCCcccccccccccccccccccce Q lcl|Aclame:pro 128 KDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQ----AKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKD 203 (394) Q Consensus 128 ~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~ 203 (394) +...++..+..++|+.++..|.+.+...+.+.+++.+-. ..+.++++|..+ ..+.+.|+.|+...+ .++++++. T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~-~~~~a~~v~eg~~i~-~~~~~~~~ 78 (272) T protein:vir:30 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWD-YIGDAEDVAEGEAIP-MTQLGFKK 78 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEec-CCCCcccccCCCccc-ccccccce Confidence 333345556789999999999999999888888876533 233457888875 345566666666555 68899999 Q ss_pred eeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--cccccccHHHHHHHHHhhhhhhc-c Q lcl|Aclame:pro 204 VAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSF--TTKTVKNLDEIKALLNGGFDPAY-N 280 (394) Q Consensus 204 v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~--~~~~~~~~~~i~~~~~~~~~~~~-~ 280 (394) +++.+++++..+++|+++..++.+|+.+++.+.+++.+++..+..++....+. ...+..+++++.++...+-+... . T Consensus 79 ~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~~~~~~t~d~i~da~~~l~~~~~~~ 158 (272) T protein:vir:30 79 TTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQTVEATATVDGVSKALDIFNDEDDAE 158 (272) T ss_pred EEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHHHhccCCCc Confidence 99999999999999999999999999999999999999999999988776443 23456679999998776554443 4 Q ss_pred cEEEEcHHHHHHHHhhhcc---CCceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEe Q lcl|Aclame:pro 281 VSLIVSQSFYQTLDTLKDG---NGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWA 357 (394) Q Consensus 281 a~~vm~~~~~~~l~~lkd~---~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~ 357 (394) ..|+|||.++..|++.+.- .......+.+.++..++|+|+||++++.+ +.++.|+.+.. ++.++.+.+++++.. T Consensus 159 ~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~~--p~~t~~~~~~~-a~~~~~~~~~~ve~~ 235 (272) T protein:vir:30 159 TVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRKC--PKGTAYMVRKG-ALRIMLKRNTMVETD 235 (272) T ss_pred cEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCCC--CcceEEEEcCC-eEEEEecCCceeeec Confidence 6899999999999875322 11112223355566679999999997654 56666776655 456677888888887 Q ss_pred eccc-ccceEEEEEEeccEEecccceEEEEecCccCC Q lcl|Aclame:pro 358 DNEI-YGQYLQAVLRFGVSKVDDKAGYYVTFTPEPLP 393 (394) Q Consensus 358 ~~~~-~~~~~r~~~r~d~~v~~~~af~~l~~~~~~~~ 393 (394) ++.. +...+++..||++.+++|+++++++++++.=- T Consensus 236 r~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:30 236 RDITKAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred cccccceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 7654 44568999999999999999999999988866 No 105 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=99.94 E-value=2e-28 Score=172.44 Aligned_cols=261 Identities=13% Similarity=0.087 Sum_probs=199.7 Q ss_pred hhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEe----ecCCceeEEEEecCCCcccccccccccccccccccce Q lcl|Aclame:pro 128 KDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQ----AKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKD 203 (394) Q Consensus 128 ~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~ 203 (394) +...++..+..++|+.++..|.+.+...+.+.+++.+-. ..+.++++|..+ ..+.+.|+.|+...+ .++++++. T Consensus 1 MA~~~T~~~~~~iPev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~tv~iP~~~-~~~~a~~v~eg~~i~-~~~~~~~~ 78 (272) T protein:vir:98 1 MAVGTTKMAQMLDPEVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGTTLTVPKWD-YIGDAEDVAEGEAIP-MTQLGFKK 78 (272) T ss_pred CCCccccchheechHHHHHHHHHHHHHHhhhhccccccccccCCCCCEEEEEEec-CCCCcccccCCCccc-ccccccce Confidence 333345556789999999999999999888888876533 233457888875 345566666666555 68899999 Q ss_pred eeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--cccccccHHHHHHHHHhhhhhhc-c Q lcl|Aclame:pro 204 VAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSF--TTKTVKNLDEIKALLNGGFDPAY-N 280 (394) Q Consensus 204 v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~--~~~~~~~~~~i~~~~~~~~~~~~-~ 280 (394) +++.+++++..+++|+++..++.+|+.+++.+.+++.+++..+..++....+. ...+..+++++.++...+-+... . T Consensus 79 ~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~~~~~~t~d~i~da~~~l~~~~~~~ 158 (272) T protein:vir:98 79 TTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQTVEATATVDGVSKALDIFNDEDDAE 158 (272) T ss_pred EEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHHHhccCCCc Confidence 99999999999999999999999999999999999999999999988776443 23456679999998776554443 4 Q ss_pred cEEEEcHHHHHHHHhhhcc---CCceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEe Q lcl|Aclame:pro 281 VSLIVSQSFYQTLDTLKDG---NGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWA 357 (394) Q Consensus 281 a~~vm~~~~~~~l~~lkd~---~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~ 357 (394) ..|+|||.++..|++.+.- .......+.+.++..++|+|+||++++.+ +.++.|+.+.. ++.++.+.+++++.. T Consensus 159 ~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~~~~~g~ig~i~G~~Vi~s~~~--p~~t~~~~~~~-a~~~~~~~~~~ve~~ 235 (272) T protein:vir:98 159 TVIVMNPADASTLRLDAAKEWLGATEVGANRVVSGVYGEVLGVQIVRSRKC--PKGTAYMVRKG-ALRIMLKRNTMVETD 235 (272) T ss_pred cEEEEcHHHHHHHHHhccccccccccccccccccccchhhcCeeEEEcCCC--CcceEEEEcCC-eEEEEecCCceeeec Confidence 6899999999999875322 11112223355566679999999997654 56666776655 456677888888887 Q ss_pred eccc-ccceEEEEEEeccEEecccceEEEEecCccCC Q lcl|Aclame:pro 358 DNEI-YGQYLQAVLRFGVSKVDDKAGYYVTFTPEPLP 393 (394) Q Consensus 358 ~~~~-~~~~~r~~~r~d~~v~~~~af~~l~~~~~~~~ 393 (394) ++.. +...+++..||++.+++|+++++++++++.=- T Consensus 236 r~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:98 236 RDITKAINQIVANKHYGVYLYKAEKAVKITLKDAAKK 272 (272) T ss_pred cccccceeEEEEEEEEEEEEEcCCceEEEEecccccC Confidence 7654 44568999999999999999999999988866 No 106 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=99.80 E-value=4.3e-21 Score=132.22 Aligned_cols=259 Identities=15% Similarity=0.108 Sum_probs=186.9 Q ss_pred hhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeec----CCceeEEEEecCCCcccccccccccccccccccce Q lcl|Aclame:pro 128 KDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAK----KASGKYPVLQRATTKMVTVAELEKNPALAKPDFKD 203 (394) Q Consensus 128 ~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~ 203 (394) +....+.-...++|+.+++.+.+.+.....+.+++.+-... +..+++|.+.. .+.+.++.++...+ ..+.+.++ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~-~gda~~~~eg~~i~-~~~lt~~~ 78 (272) T protein:vir:36 1 MSKQKTTLADLVNPEVLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTY-IGDAADVAEGGEIS-LDKIGTTT 78 (272) T ss_pred CCCcceehhhhhchHHHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeecc-CccccccCCCCccC-hhhcCCcc Confidence 23334445567899999999998888887777877655532 34578888753 34445566666655 56788999 Q ss_pred eeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc--ccccccccHHHHHHHHHhhhhhhc-c Q lcl|Aclame:pro 204 VAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKS--FTTKTVKNLDEIKALLNGGFDPAY-N 280 (394) Q Consensus 204 v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~--~~~~~~~~~~~i~~~~~~~~~~~~-~ 280 (394) .++..++.+..+.++++....+..|+.+.+.++++..+++..+..++....+ .+..+..++|.+.++...+-+... . T Consensus 79 ~~~~i~~~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~~~~~~~~~~d~i~~A~~~lgd~~~~~ 158 (272) T protein:vir:36 79 KSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTSQTVSTKANVDGVQAALDIFNDEDAQA 158 (272) T ss_pred eeEeeehhhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccHHHHHHHHHHhhhcCCCc Confidence 9999999999999999988888889999999999999999999888766533 233456688999998876655443 3 Q ss_pred cEEEEcHHHHHHHHhhhccC--CceeecccccCCCcccccccceEEecCcccccCc---eEEEeccccEEEEeecceEEE Q lcl|Aclame:pro 281 VSLIVSQSFYQTLDTLKDGN--GRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANK---AFIGDFKRGVLFADRKDLGLR 355 (394) Q Consensus 281 a~~vm~~~~~~~l~~lkd~~--G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~---~~~gd~~~~~~~~~~~~~~i~ 355 (394) ..++|||.++..|++..... +.+...+.+.++.-++++|+||+++++.+.+.+. ++++ +.++-.+...++.++ T Consensus 159 ~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~Vv~s~~~p~~~~~~~~~~~~--~gA~~~~~~~~~~vE 236 (272) T protein:vir:36 159 YVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFKIVSN--SPALKLVLKRGVQVE 236 (272) T ss_pred eEEEEcHHHHHHHhcccccccccccccccceeeeccceecCeeEEEeCCCCCCceeEEEEEec--ccceeeeecCCcccc Confidence 57889999999997643221 1111112233445578999999998766544332 2333 223334556788888 Q ss_pred Eeeccc-ccceEEEEEEeccEEecccceEEEEecCc Q lcl|Aclame:pro 356 WADNEI-YGQYLQAVLRFGVSKVDDKAGYYVTFTPE 390 (394) Q Consensus 356 ~~~~~~-~~~~~r~~~r~d~~v~~~~af~~l~~~~~ 390 (394) ..++.. +...+++..+|+.++++|+++++++++.. T Consensus 237 ~~R~~~~~~d~i~~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 237 TDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred cccchhhcCcEEEEEEEEEEEEEcCccEEEEeecCC Confidence 777654 55678999999999999999999999999 No 107 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=99.77 E-value=5.2e-20 Score=126.30 Aligned_cols=260 Identities=13% Similarity=0.087 Sum_probs=189.2 Q ss_pred hhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeec----CCceeEEEEecCCCcccccccccccccccccccce Q lcl|Aclame:pro 128 KDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAK----KASGKYPVLQRATTKMVTVAELEKNPALAKPDFKD 203 (394) Q Consensus 128 ~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~ 203 (394) +....+.-...++|+.+++.+.+.+.....+.+++.+.... +..+++|.+.. .+.+.++.++...+ .++.+++. T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~-~g~~~~~~eg~~i~-~~~it~~~ 78 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIP-TDILETKK 78 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeecc-CCCcccccCCCccc-ccccccce Confidence 33344455567999999999998888877777777654322 23677888653 34555666666554 57889999 Q ss_pred eeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---cccccHHHHHHHHHhhhhhhc- Q lcl|Aclame:pro 204 VAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTT---KTVKNLDEIKALLNGGFDPAY- 279 (394) Q Consensus 204 v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~---~~~~~~~~i~~~~~~~~~~~~- 279 (394) .++..++.+..+.++++....+..|+.+.+.+.++..+++..+..++....++.. ....+++.+.++...+-+... T Consensus 79 ~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~~~~~~~~~~d~i~dA~~~l~d~~~~ 158 (274) T protein:vir:93 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLE 158 (274) T ss_pred eEEEeeeecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHHhhhccCC Confidence 9999999999999999988888888999999999999999999988777654432 234578999988876554433 Q ss_pred ccEEEEcHHHHHHHHhhhccCCceee-----cccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEE Q lcl|Aclame:pro 280 NVSLIVSQSFYQTLDTLKDGNGRYLL-----QDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGL 354 (394) Q Consensus 280 ~a~~vm~~~~~~~l~~lkd~~G~~l~-----~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i 354 (394) ...++|||..+..|++- ..-+++- .+.+..+.-++++|+||++++.. +.+..++.+.. ++..+.+.++.+ T Consensus 159 ~~~ivv~p~~~~~L~k~--~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~--p~~t~~l~~~g-ai~~~~~~~~~v 233 (274) T protein:vir:93 159 PMVLFINPLDAGKLRGD--ASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKL--EAGTAILAKKG-AVKLILKRDFFL 233 (274) T ss_pred ccEEEeCHHHHHHHHhh--hhhcccccccccccceeecccceecCeeEEEcCCC--CcceEEEEeCC-eEEEEecCCccc Confidence 35789999999998753 1111111 11234455678999999997654 45555665544 345566778888 Q ss_pred EEeeccc-ccceEEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 355 RWADNEI-YGQYLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 355 ~~~~~~~-~~~~~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) +..++.. +...+++..+|++++++|+++++++.+.+-+-. T Consensus 234 E~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~t~~~~s~~~ 274 (274) T protein:vir:93 234 EVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred ccccchhhcccEEEEEEEEEEEEEcCCceEEEeeCccccCC Confidence 8776654 556799999999999999999999977776666 No 108 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=99.72 E-value=1.4e-18 Score=118.46 Aligned_cols=263 Identities=14% Similarity=0.067 Sum_probs=187.3 Q ss_pred hhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEee----cCCceeEEEEecCCCcccccccccccccccccccc Q lcl|Aclame:pro 127 QKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQA----KKASGKYPVLQRATTKMVTVAELEKNPALAKPDFK 202 (394) Q Consensus 127 ~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~ 202 (394) ++....+.-...++|+.++..+.+.+.....+.+++.+-+. .+..+++|.+.. .+.+..+.++...+ ..+.+.+ T Consensus 1 ~~~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~iP~~~~-ig~a~~~~~g~~i~-~~~lt~~ 78 (275) T protein:vir:96 1 MALENMTKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNTLVGQPGNTITFPAFVY-SGDAKVVPEGEEIP-IDLIETK 78 (275) T ss_pred CCCcccchhhhhhchHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEeeeecc-CCccccccCCCCcc-hhhcccc Confidence 33333455556789999999999999988888888766554 233688888764 34555566666555 5678899 Q ss_pred eeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc---ccccccHHHHHHHHHhhhhhhc Q lcl|Aclame:pro 203 DVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFT---TKTVKNLDEIKALLNGGFDPAY 279 (394) Q Consensus 203 ~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~---~~~~~~~~~i~~~~~~~~~~~~ 279 (394) ..+...++.+..+.++++....+..|+.+.+.+.++..+++..+..++...++++ .....++|.+.++...+-+... T Consensus 79 ~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~~~~~~~~~~d~i~dA~~~lgd~~~ 158 (275) T protein:vir:96 79 KRQATIRKIGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATLKVEADITKLAGLQTAIDKFNDEDL 158 (275) T ss_pred eeeEEeehhcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHHhccccC Confidence 9999999999999999998877767888999999999999998888776654433 2345578999998877654433 Q ss_pred -ccEEEEcHHHHHHHHhhhccC---CceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEE Q lcl|Aclame:pro 280 -NVSLIVSQSFYQTLDTLKDGN---GRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLR 355 (394) Q Consensus 280 -~a~~vm~~~~~~~l~~lkd~~---G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~ 355 (394) ...++|||..+..|+++..-+ ....-.+.+.++.-++++|++|+++++.+. ...+++|.. ++.++.+.++.++ T Consensus 159 ~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~~p~-~t~~i~~~g--A~~~~~~~~~~vE 235 (275) T protein:vir:96 159 EPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGEALGAIIVRSNKIKE-GEAILAKRG--AVKLITKRDFFLE 235 (275) T ss_pred CccEEEeCHHHHHHHHhcccccccccccccccceeccccceecCeeEEEeCCCCc-ceEEEEecc--ceeeeecCCcccc Confidence 357889999999998753110 000011123455567899999998765432 233455532 3445567778888 Q ss_pred Eeeccc-ccceEEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 356 WADNEI-YGQYLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 356 ~~~~~~-~~~~~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) ..++.. +...+++..+|+.++++|+++++++++|+---. T Consensus 236 ~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 275 (275) T protein:vir:96 236 TERHASHKSTALFSDKHYVAYLYDESKVVKITKSASGLGV 275 (275) T ss_pred cccchhhcCcEEEEeEEEEEEEEcCccEEEEEecccccCC Confidence 776654 556789999999999999999999886655444 No 109 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=99.70 E-value=3.3e-18 Score=116.39 Aligned_cols=260 Identities=12% Similarity=0.090 Sum_probs=189.6 Q ss_pred hhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEee----cCCceeEEEEecCCCcccccccccccccccccccce Q lcl|Aclame:pro 128 KDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQA----KKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKD 203 (394) Q Consensus 128 ~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~ 203 (394) .....+.-...++|+.+++.+.+.+.....+.+++.+-.. .+..+++|.+... +.+..+.++...+ ..+.+.++ T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~i-gda~~~~eg~~i~-~~~lt~~~ 78 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYS-GDATVVPEGQKIP-VDKIETNR 78 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCC-CccccccCCCccC-ccccccce Confidence 2222444556789999999999999888888888865543 3446888887543 4555667776655 56788999 Q ss_pred eeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc---ccccccHHHHHHHHHhhhhhhc- Q lcl|Aclame:pro 204 VAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFT---TKTVKNLDEIKALLNGGFDPAY- 279 (394) Q Consensus 204 v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~---~~~~~~~~~i~~~~~~~~~~~~- 279 (394) .+...++.+..+.++++....+..|..+.+.+.++..+++..+..++....++. .....+++.+.++...+-+... T Consensus 79 ~~a~i~~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~~~~~~~t~d~i~~A~~~lgd~~~~ 158 (276) T protein:vir:10 79 REAKIHKIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKLTVSADIGTLAGLEAAIDTFDDEDLE 158 (276) T ss_pred eeEEeehccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHHhccccCc Confidence 999999999999999998888878899999999999999998887765543322 2334578888888876554333 Q ss_pred ccEEEEcHHHHHHHHhhhccCCceee-----cccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEE Q lcl|Aclame:pro 280 NVSLIVSQSFYQTLDTLKDGNGRYLL-----QDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGL 354 (394) Q Consensus 280 ~a~~vm~~~~~~~l~~lkd~~G~~l~-----~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i 354 (394) ..+++|||..+..|+++.+-. ++- .+.+.++.-++++|++|++++.. +.+..|+..-. ++.++...++.+ T Consensus 159 ~~~ivv~p~~~~~L~k~~~~~--f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~--p~~t~~l~~~g-Ai~~~~~~~~~v 233 (276) T protein:vir:10 159 PMVLFINPKDAGKLRSSASDN--FTRATELGDNIIVKGAFGEALGAVIVRSKKL--DEGEAILAKRG-AVKLITKRDFFL 233 (276) T ss_pred ccEEEEcHHHHHHHHHhcccc--ccccccccccceeccccceecceeEEEcCCC--CcceEEEEecc-ceeeeecCCcee Confidence 357889999999998754221 111 11234455678999999997654 34444443322 344566788888 Q ss_pred EEeeccc-ccceEEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 355 RWADNEI-YGQYLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 355 ~~~~~~~-~~~~~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) +..++.. +...+++..+|+.++++|+.+++++..+-.+|- T Consensus 234 E~dRd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (276) T protein:vir:10 234 ETDRDPSTKTTALYSDKHYVAYLYDESKAVKVTKGAGTTDS 274 (276) T ss_pred ecccchhhcccEEEEeeEEEEEEEcCcceEEEecCCcCCcC Confidence 8877654 455689999999999999999999999988888 No 110 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=99.70 E-value=3.4e-18 Score=116.34 Aligned_cols=260 Identities=13% Similarity=0.086 Sum_probs=184.2 Q ss_pred hhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEee----cCCceeEEEEecCCCcccccccccccccccccccce Q lcl|Aclame:pro 128 KDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQA----KKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKD 203 (394) Q Consensus 128 ~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~ 203 (394) .....+.....++|+.+++.+.+.+.....+.++++.... .+..+++|... ..+.+..+.++...+ ..+.+.+. T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~-~~g~~~~~~~g~~i~-~~~it~~~ 78 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFT-YSGDAQVIAEGEKIP-VDQIGTSK 78 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeec-cCCCccccCCCCcCc-hhhcccce Confidence 2333344457799999999999988887777777765432 23467888865 344555566666554 56788899 Q ss_pred eeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc---ccccccHHHHHHHHHhhhhhhc- Q lcl|Aclame:pro 204 VAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFT---TKTVKNLDEIKALLNGGFDPAY- 279 (394) Q Consensus 204 v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~---~~~~~~~~~i~~~~~~~~~~~~- 279 (394) .++..++.+..+.++++....+..|+.+.+.+.++..+++..+..++....+++ .....+++.+.++...+-+... T Consensus 79 ~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~~~~~~~~~~d~i~dA~~~l~d~~~~ 158 (274) T protein:vir:96 79 REAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATLTVEADITKLDGLQTAIDKFNDEDLE 158 (274) T ss_pred eEEEEEeeeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcCcccccHHHHHHHHHHhcccCCC Confidence 999999988889999998888878899999999999999999987766654332 2344568899988776554433 Q ss_pred ccEEEEcHHHHHHHHhhhccCCceee-----cccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEE Q lcl|Aclame:pro 280 NVSLIVSQSFYQTLDTLKDGNGRYLL-----QDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGL 354 (394) Q Consensus 280 ~a~~vm~~~~~~~l~~lkd~~G~~l~-----~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i 354 (394) ...++|||..+..|++... .+++- .+.+..+.-++++|++|+++++. +.+..|+..-. ++.++.+.++.+ T Consensus 159 ~~~ivv~p~~~~~L~k~~~--~~f~~~~~~g~~~~~~g~ig~~~G~~Vi~s~~~--p~~t~~l~~~g-A~~~~~~~~~~v 233 (274) T protein:vir:96 159 PMVLFVNPLDAGGLRTSAS--DNFTRPTQLGDNIIVKGAFGEALGAVIVRSNKL--NKGEALLAKKG-AVKLITKRDFFL 233 (274) T ss_pred ceEEEeCHHHHHHHHhccc--ccccccccccccceeecccceecCeeEEEcCCC--CcceEEEEeCc-ceeeeecCCccc Confidence 3578899999999987531 11111 11223445678999999887654 34444443322 344556777788 Q ss_pred EEeecc-cccceEEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 355 RWADNE-IYGQYLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 355 ~~~~~~-~~~~~~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) +..++. .+...+++.++|+.++++|+++++++..++=--. T Consensus 234 E~~Rd~~~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:96 234 EKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) T ss_pred ccccchhhcccEEEEeeEEEEEEEcCccEEEEEcCcccccC Confidence 776654 3556789999999999999999999988776555 No 111 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=99.69 E-value=4.5e-18 Score=115.66 Aligned_cols=260 Identities=12% Similarity=0.046 Sum_probs=185.4 Q ss_pred cccccCCccccchhHHhHHHHHHHhhhhhhheeeeEee----cCCceeEEEEecCCCcccccccccccccccccccceee Q lcl|Aclame:pro 130 GIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQA----KKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVA 205 (394) Q Consensus 130 ~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~ 205 (394) ...+.....++|+.+.+.+.+.......+.+++.+-+. .+..+++|.+. ..+.+..+.++...+ ..+.+.++.. T Consensus 1 Ma~T~~~d~I~Pev~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~-~igdae~~~eg~~i~-~~~lt~~~~~ 78 (270) T protein:vir:95 1 MTQTKKANLINPEVLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYA-YIGAAEDLQEGVAMD-TTQMSMTTTK 78 (270) T ss_pred CCceehhhhcchHHHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeec-CCCccccccCCCccc-hhhcccchhe Confidence 12223334579999999999998888777788765443 34468889876 455566677777665 5678889999 Q ss_pred ecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--cccccccHHHHHHHHHhhhhhhc-ccE Q lcl|Aclame:pro 206 WNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSF--TTKTVKNLDEIKALLNGGFDPAY-NVS 282 (394) Q Consensus 206 ~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~--~~~~~~~~~~i~~~~~~~~~~~~-~a~ 282 (394) ...++.+..+.++++....+..|..+.+.+.++..+++..+..++...... ......+++++.+++..+-+... ..+ T Consensus 79 a~i~~~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a~~~~~~~~t~~~~~dA~~~lgd~~~~~~~ 158 (270) T protein:vir:95 79 VTVKETGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELNKSKQTATVSADATGILDAIEVFNSENDEDYV 158 (270) T ss_pred eeeehhhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHHhccccCCCcE Confidence 999999999999998776665577888899999999999888766554332 22345678899998877655544 357 Q ss_pred EEEcHHHHHHHHhhhcc-CCceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeeccc Q lcl|Aclame:pro 283 LIVSQSFYQTLDTLKDG-NGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEI 361 (394) Q Consensus 283 ~vm~~~~~~~l~~lkd~-~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~ 361 (394) ++|||.++..|++...- ..++ -...+.++.-++++|++|++.+........++|+ +.++-++...++.++..++.. T Consensus 159 i~vhs~~~~~Lrk~~~~~~~~~-~~~~~~~G~ig~~~G~~Viv~s~~~~~~~~~l~~--~gAi~~~~~~~~~vEtdRd~~ 235 (270) T protein:vir:95 159 LYVNPKDYNKLVKSLFKVGGNV-QDRAISKGDLVEIVGVSDIVKSKRVSENTAFLQR--YGAMEIVNKKKPEAYTDFDIL 235 (270) T ss_pred EEEcHHHHHHHHhhhccccccc-ccchhcccccceecceeEEEeCCCCCceeEEEEe--ccceeeeecCCceeeeccchh Confidence 89999999999863211 1111 1122344566789999998876655444444444 223445667788888877765 Q ss_pred c-cceEEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 362 Y-GQYLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 362 ~-~~~~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) . ...+....+|+..+++|+.++++++.++.|-- T Consensus 236 ~~~d~i~~~~~y~v~~~~~skvv~~t~~~a~~~~ 269 (270) T protein:vir:95 236 KRTHLLSTNYHYSVNLKDETGVVKVTFKPSGSLE 269 (270) T ss_pred hcccEEEeeeEEEEEEEccceEEEEEecCCCCcC Confidence 4 45678889999999999999999998888665 No 112 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=99.68 E-value=1.1e-17 Score=113.52 Aligned_cols=260 Identities=14% Similarity=0.091 Sum_probs=186.0 Q ss_pred hhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEee----cCCceeEEEEecCCCcccccccccccccccccccce Q lcl|Aclame:pro 128 KDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQA----KKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKD 203 (394) Q Consensus 128 ~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~ 203 (394) .....+.-...++|+.|++.+.+.+.....+.+++.+-.. .+..+++|.+.. .+.+..+.++...+ ..+.+.+. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~-~g~a~~~~~g~~i~-~~~lt~~~ 78 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIP-TDILETKK 78 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecC-CCccccccCCCccc-ccccccce Confidence 2333445556799999999999888777666777765442 234678888653 34455566666554 56788899 Q ss_pred eeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc---ccccccHHHHHHHHHhhhhhhc- Q lcl|Aclame:pro 204 VAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFT---TKTVKNLDEIKALLNGGFDPAY- 279 (394) Q Consensus 204 v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~---~~~~~~~~~i~~~~~~~~~~~~- 279 (394) .++..++.+....++++....+..|+.+.+.+.++..+++..+..++....+++ .....+++.+.++...+-+... T Consensus 79 ~~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~~~~~~d~i~dA~~~l~d~~~~ 158 (274) T protein:vir:97 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLE 158 (274) T ss_pred eEEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccCHHHHHHHHHHhhccCCC Confidence 999999999899999998888878899999999999999999988776654432 2334568999998877654433 Q ss_pred ccEEEEcHHHHHHHHhhhccCCceee-----cccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEE Q lcl|Aclame:pro 280 NVSLIVSQSFYQTLDTLKDGNGRYLL-----QDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGL 354 (394) Q Consensus 280 ~a~~vm~~~~~~~l~~lkd~~G~~l~-----~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i 354 (394) ...++|||..+..|++. ..-+++- .+.+.++.-++++|++|+++++. +.+..++.... ++.++.+.++.+ T Consensus 159 ~~~ivv~p~~~~~L~k~--~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~--p~~t~~l~~~g-A~~~~~~~~~~v 233 (274) T protein:vir:97 159 PMVLFVNPLDAGKLRGD--ASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKL--EAGTAILAKKG-AVKLILKRDFFL 233 (274) T ss_pred ceEEEeCHHHHHHHHhh--hhhhccccCcccccceeccccceecCeeEEEcCCC--CcceEEEEeCc-ceEeeecCCcee Confidence 35678999999998752 1111111 11234455678999999997754 34444444333 345566778888 Q ss_pred EEeeccc-ccceEEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 355 RWADNEI-YGQYLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 355 ~~~~~~~-~~~~~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) +..++.. +...+++..+|++++++|+.+++++++.+-+-. T Consensus 234 E~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:97 234 EVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred ccccchhhcccEEEEEEEEEEEEEcCCceEEEecCcccccC Confidence 8777654 456789999999999999999999988877777 No 113 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=99.68 E-value=1.1e-17 Score=113.52 Aligned_cols=260 Identities=14% Similarity=0.091 Sum_probs=186.0 Q ss_pred hhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEee----cCCceeEEEEecCCCcccccccccccccccccccce Q lcl|Aclame:pro 128 KDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQA----KKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKD 203 (394) Q Consensus 128 ~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~ 203 (394) .....+.-...++|+.|++.+.+.+.....+.+++.+-.. .+..+++|.+.. .+.+..+.++...+ ..+.+.+. T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~-~g~a~~~~~g~~i~-~~~lt~~~ 78 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIP-TDILETKK 78 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhcccceecccccCCCCCEEEEeeecC-CCccccccCCCccc-ccccccce Confidence 2333445556799999999999888777666777765442 234678888653 34455566666554 56788899 Q ss_pred eeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc---ccccccHHHHHHHHHhhhhhhc- Q lcl|Aclame:pro 204 VAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFT---TKTVKNLDEIKALLNGGFDPAY- 279 (394) Q Consensus 204 v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~---~~~~~~~~~i~~~~~~~~~~~~- 279 (394) .++..++.+....++++....+..|+.+.+.+.++..+++..+..++....+++ .....+++.+.++...+-+... T Consensus 79 ~~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~~~~~~~~~~d~i~dA~~~l~d~~~~ 158 (274) T protein:vir:94 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLE 158 (274) T ss_pred eEEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCccccccccCHHHHHHHHHHhhccCCC Confidence 999999999899999998888878899999999999999999988776654432 2334568999998877654433 Q ss_pred ccEEEEcHHHHHHHHhhhccCCceee-----cccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEE Q lcl|Aclame:pro 280 NVSLIVSQSFYQTLDTLKDGNGRYLL-----QDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGL 354 (394) Q Consensus 280 ~a~~vm~~~~~~~l~~lkd~~G~~l~-----~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i 354 (394) ...++|||..+..|++. ..-+++- .+.+.++.-++++|++|+++++. +.+..++.... ++.++.+.++.+ T Consensus 159 ~~~ivv~p~~~~~L~k~--~~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~--p~~t~~l~~~g-A~~~~~~~~~~v 233 (274) T protein:vir:94 159 PMVLFVNPLDAGKLRGD--ASTNFTRATELGDDIIVKGAFGEALGAIIVRTNKL--EAGTAILAKKG-AVKLILKRDFFL 233 (274) T ss_pred ceEEEeCHHHHHHHHhh--hhhhccccCcccccceeccccceecCeeEEEcCCC--CcceEEEEeCc-ceEeeecCCcee Confidence 35678999999998752 1111111 11234455678999999997754 34444444333 345566778888 Q ss_pred EEeeccc-ccceEEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 355 RWADNEI-YGQYLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 355 ~~~~~~~-~~~~~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) +..++.. +...+++..+|++++++|+.+++++++.+-+-. T Consensus 234 E~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:94 234 EVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred ccccchhhcccEEEEEEEEEEEEEcCCceEEEecCcccccC Confidence 8777654 456789999999999999999999988877777 No 114 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=99.68 E-value=1.2e-17 Score=113.40 Aligned_cols=259 Identities=12% Similarity=0.023 Sum_probs=176.1 Q ss_pred hhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEee----cCCceeEEEEecCCCcccccccccccccccccccce Q lcl|Aclame:pro 128 KDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQA----KKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKD 203 (394) Q Consensus 128 ~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~ 203 (394) +...++..+..++|+.|++.+.+.+.....+.+++..... .+..+++|.+.. .+.+.++.++...+ ..+.+++. T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~-~g~a~~~~~g~~i~-~~~lt~~~ 78 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKY-IGDAQDVAEGAAID-YSALETES 78 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhcccceecccccCCCCCEEEEeeecc-CCcceeecCCCcCc-ccccccce Confidence 2223444566799999999999988887777777654432 233577888753 34445566666554 46788999 Q ss_pred eeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-----ccccc----HHHHHHHHHhh Q lcl|Aclame:pro 204 VAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTT-----KTVKN----LDEIKALLNGG 274 (394) Q Consensus 204 v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~-----~~~~~----~~~i~~~~~~~ 274 (394) .++..++.+..+.++++....+..|+.+.+.++++..+++..+..++....+... .+..+ ++.+.++..++ T Consensus 79 ~~~~i~~~~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~~~~~~t~~~~~~~~~~~~da~~~l 158 (278) T protein:vir:80 79 VKHGIKKAGKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTLEVKGAINIGLIDKIENTFTDAPDAI 158 (278) T ss_pred eeEeeehhhccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhhhHHHHHHHHHHhh Confidence 9999999999999999988888889999999999999999999877666432211 11112 34444544443 Q ss_pred hhhhc--ccEEEEcHHHHHHHHhhhccCC---ceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEee Q lcl|Aclame:pro 275 FDPAY--NVSLIVSQSFYQTLDTLKDGNG---RYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADR 349 (394) Q Consensus 275 ~~~~~--~a~~vm~~~~~~~l~~lkd~~G---~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~ 349 (394) ..... ...++|||..+..|++....+. ..+-.+.+.++.-++++|++|+++++.+ .+..|+-.-. ++-.+.. T Consensus 159 ~~~~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~~~G~~Vi~s~~~p--~~t~~l~~~g-Ai~~~~~ 235 (278) T protein:vir:80 159 EDESITTTGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGELLGWEIVRTKKLA--DGNALAVKAG-ALKTFLK 235 (278) T ss_pred cccCCCcccEEEECHHHHHHHHhhhhhhccccccccccceeeccceeecceeEEEcCCCC--cceEEEEecc-ceeeeec Confidence 32221 3468899999999976532110 0111122345556799999999977654 3333433222 3445567 Q ss_pred cceEEEEeeccc-ccceEEEEEEeccEEecccceEEEEecCcc Q lcl|Aclame:pro 350 KDLGLRWADNEI-YGQYLQAVLRFGVSKVDDKAGYYVTFTPEP 391 (394) Q Consensus 350 ~~~~i~~~~~~~-~~~~~r~~~r~d~~v~~~~af~~l~~~~~~ 391 (394) +++.++..++.. +...+++..+|+.++++|+++++++..+.- T Consensus 236 ~~~~vE~~Rd~~~~~d~i~~~~~yg~~v~~~~~~v~it~~a~~ 278 (278) T protein:vir:80 236 RNLLAESGRDMDHKLTKFNADQHYAVALVDETKAVKVVPVAGN 278 (278) T ss_pred CCcccccccchhhccceeeeeeEEEEEEEcCcceEEEeeccCC Confidence 788887766653 556789999999999999999999988877 No 115 >protein:vir:93858 Length: 400 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764266;genbank:gi:115315579;genbank:GeneID:5141552 Probab=99.62 E-value=1.1e-15 Score=102.61 Aligned_cols=363 Identities=12% Similarity=0.072 Sum_probs=206.0 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALE---SDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGG 77 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~---~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~ 77 (394) |-+-.|.|-+.++.++++..-.++..+....- -++....++++.-+.++..++.+.+..+...++. +.. T Consensus 8 ~~k~~~~ek~~~~~~~~e~~~~lks~~~g~~~~~~~~~~~k~~el~kT~Sel~~ei~k~e~eln~~~E~-------~Kg- 79 (400) T protein:vir:93 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEK-------PKG- 79 (400) T ss_pred cccchHHHHHHHHhhhhhhhhhhhhhhhccchhhhhhhchhHHHHHHHHHHhHHHHHHHhhhhhhhhhh-------ccc- Confidence 66667777777777777777676666655311 1334455566666666666665533332211110 000 Q ss_pred ccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhh Q lcl|Aclame:pro 78 KEVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVD 157 (394) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~ 157 (394) +..-.+--+..++...|.+- ..........+. .|.+.. .-.+.+........|..+...|.+.+....+ T Consensus 80 k~~mtefLkT~~A~~~fa~~-------l~~nsg~sd~kn---aW~A~l-~E~gvt~td~n~iLP~~il~aIq~al~~~~~ 148 (400) T protein:vir:93 80 KDKMTNFIESQNAVTEFFDV-------LKKNSGKSEIKN---AWSAKL-AENGVTITDTTFQLPRKLVESINTALLNTNP 148 (400) T ss_pred chhHHHhhhhHHHHHHHHHH-------HHhhcCCcchhh---hhhhhh-hhcccccCCchhhcchHHHHHHHHhhhccCC Confidence 00000111112222222111 011111111111 222111 1123332333347899999999999999999 Q ss_pred hhheeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhc--cHHHHHHHHHH Q lcl|Aclame:pro 158 LKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDD--ADVDLVGIVSE 235 (394) Q Consensus 158 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~d--s~~~l~~~i~~ 235 (394) ++++.+|..++..- .-....+.. .+|...-|..+..+..+|...++.|.-++.+..+.+-..++ +.-.|..||.+ T Consensus 149 ~~~f~~v~n~p~l~--V~~~~dt~~-qa~gHk~G~~K~eq~~tl~~rtL~P~~VYk~~~la~~~~~~~~tygaL~nYVm~ 225 (400) T protein:vir:93 149 VFKVFHVTNVGALL--VSRSFDSAN-EAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVA 225 (400) T ss_pred cccceeeecCCcee--eecchhhhc-ccceeccCCcccceeeeeeeeccCHHHHHHHhhhhhhhhhccccHHHHHHHHHH Confidence 99998887774322 211222223 34434444444567789999999999999999885444332 23458999999 Q ss_pred HHHHHHHH-HHHHHHhhccccccccccccH------------------HHHHHHHHhhh-----hhhcccEEEEcHHHHH Q lcl|Aclame:pro 236 SISQIKVN-TTNDAIAKVLKSFTTKTVKNL------------------DEIKALLNGGF-----DPAYNVSLIVSQSFYQ 291 (394) Q Consensus 236 ~l~~~~~~-~~~~a~~~g~~~~~~~~~~~~------------------~~i~~~~~~~~-----~~~~~a~~vm~~~~~~ 291 (394) +|...+.. ..+.+++-|+|+.+..+.... -++.+++..++ ..+++..+||+|.+|+ T Consensus 226 EL~q~vI~k~Ve~Aii~GdG~Ngf~~~dk~t~Ik~I~~dt~kt~~a~~~~~qdl~E~~~d~~~~~aad~~~Iv~s~d~~A 305 (400) T protein:vir:93 226 ELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKAEDRKA 305 (400) T ss_pred HHHHHHHHHHhhhheeecccccccCCCcchhhhhhhhhhhhhhhhcCCccHHHHHHHHHhhhhhccCCceeEEeccchHH Confidence 99999996 469999988877653222111 11222222222 2244678999999999 Q ss_pred HHHhhhccCCceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeec-ccccceEEEEE Q lcl|Aclame:pro 292 TLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADN-EIYGQYLQAVL 370 (394) Q Consensus 292 ~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~-~~~~~~~r~~~ 370 (394) .|+.|+|++|++.|.....+-+-.+-+|+--.+......-.++.+.-|=. +.+ +.+|++---+.. .+.+..+-++. T Consensus 306 ~L~~lk~a~~~a~f~~~n~d~~IA~~fGv~~Lv~~Tr~~~~kp~V~VDek--~~i-~~~~~~t~~sf~~~tNs~~ilvet 382 (400) T protein:vir:93 306 LLDELRQATANANVRIKNDDTEIASEVGVDEIIVYTGSKALKPTVLVDQK--YHI-DMQDLTKVDAFEWKTNSNMILVET 382 (400) T ss_pred HHHHhcCCcceeeeeeccccchhhhhcccceeeeeccCCCCCceeeeehh--hhc-cccCceeccceeeeeccceEEeee Confidence 99999999999998665555555566775444322222233344444533 333 344543111111 23334456777 Q ss_pred EeccEEecccceEEEEec Q lcl|Aclame:pro 371 RFGVSKVDDKAGYYVTFT 388 (394) Q Consensus 371 r~d~~v~~~~af~~l~~~ 388 (394) .++|.+.-|++-++++++ T Consensus 383 lv~Gsi~~~N~~ay~~v~ 400 (400) T protein:vir:93 383 LTSGHVETYNAGAVITVS 400 (400) T ss_pred eeccceecccceeeEeeC Confidence 789999999999999998 No 116 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=99.62 E-value=1.2e-16 Score=107.91 Aligned_cols=259 Identities=13% Similarity=0.062 Sum_probs=183.1 Q ss_pred hhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEee----cCCceeEEEEecCCCcccccccccccccccccccce Q lcl|Aclame:pro 128 KDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQA----KKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKD 203 (394) Q Consensus 128 ~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~ 203 (394) .....+.-...++|+.|+..+.+.+.....+.+++..-.. .+..+++|.+.. .+.+..+.++.... ..+.+.+. T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~-ig~a~~~~~g~~i~-~~~lt~~~ 78 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVY-SGDAQVVAEGEKIP-TDILETKK 78 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecC-CCccccccCCCccc-hhhcccce Confidence 2223444556789999999998888777666677665332 344778888753 34455566655554 56788888 Q ss_pred eeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---cccccHHHHHHHHHhhhhhhc- Q lcl|Aclame:pro 204 VAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTT---KTVKNLDEIKALLNGGFDPAY- 279 (394) Q Consensus 204 v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~---~~~~~~~~i~~~~~~~~~~~~- 279 (394) .+...++.+..+.++++....+..|+.+.+.+.++..+++..+..++....++.. ....+++.+.++...+-+... T Consensus 79 ~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~~~~~~a~~~d~i~dA~~~lgd~~~~ 158 (274) T protein:vir:12 79 REAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKLTVNADITKLNGLQSAIDKFNDEDLE 158 (274) T ss_pred eeEEeeeecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHHhcccccc Confidence 8999999999999999877666678889999999999999988887766544322 334578999998877554433 Q ss_pred ccEEEEcHHHHHHHHhhh------ccCCceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceE Q lcl|Aclame:pro 280 NVSLIVSQSFYQTLDTLK------DGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLG 353 (394) Q Consensus 280 ~a~~vm~~~~~~~l~~lk------d~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~ 353 (394) ...++|||..+..|++.. ++++. .+.+.++.-++++|++|++++..+. ...+++|.- ++..+...++. T Consensus 159 ~~~ivv~p~~~~~L~k~~~~~fv~~s~~g---~~~~~~G~ig~~~G~~Vi~s~~~p~-~t~~l~~~g--A~~~~~~~~~~ 232 (274) T protein:vir:12 159 PMVLFINPLDAGKLRGDASTNFTRATELG---DDIIVKGAFGEALGAIIVRSNKLEA-GTAILAKKG--AVKLILKRDFF 232 (274) T ss_pred ccEEEeCHHHHHHHHhhhhhhcccccccc---ccceecccceeecCeeEEEeCCCCc-ceEEEEecc--ceeeeecCCce Confidence 356889999999987631 12211 1123445567899999999765442 234555543 34445678888 Q ss_pred EEEeeccc-ccceEEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 354 LRWADNEI-YGQYLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 354 i~~~~~~~-~~~~~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) ++..++.. +...+++..+|+.++++|+.+++++...+-+-. T Consensus 233 vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:12 233 LEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred eccccchhhcccEEEeeeEEEEEEEcCCceEEEEcCCccccC Confidence 88877654 456789999999999999999999977766666 No 117 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=99.58 E-value=2.9e-16 Score=105.80 Aligned_cols=334 Identities=14% Similarity=0.154 Sum_probs=182.4 Q ss_pred HHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccchhhhHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 24 TAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVTQEEKTYRESVNDFIRSKGKIVN 103 (394) Q Consensus 24 ~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 103 (394) ++..-..+.+.... +....|..+|+.+.++- +-..+ ....+-....+.-+. ...+.+.. T Consensus 1 ~~~~~~~~~~~~~~--~~~~~e~k~lr~~me~~-------et~~e----~~~~~~~~~~~e~el---~E~f~Kmm----- 59 (393) T protein:vir:79 1 MENWLKQLKESGFT--ETQVQEQKSLRTRMERG-------ETLAE----ADANKLALNEEETQI---LESFAKMM----- 59 (393) T ss_pred CchHHHHHHhccCc--hhHHHHHHHHHHHhhhh-------hhhhh----hhhhhhhcchhHHHH---HHHHHHHh----- Confidence 11111111111111 12233333333333311 10000 000000111111111 11111100 Q ss_pred HHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCc-eeEEEEecCCC Q lcl|Aclame:pro 104 DSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKAS-GKYPVLQRATT 182 (394) Q Consensus 104 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~ 182 (394) +..-...+ .......++..+..+||+.+++.+.+.........++...+....+. -.+|- -+.- T Consensus 60 ---------~G~~p~~e----V~~~e~mtt~~a~IliP~vis~v~~Eaaepl~~~~kl~qk~~L~~Grsm~F~~--~g~~ 124 (393) T protein:vir:79 60 ---------EGETPTNE----VNLREFMATPSAQILIPRVIVGTMREAAEPLYIGTKMLQKIRLKSGQSMIFPS--IGIM 124 (393) T ss_pred ---------cCCCchhh----eehhhhhcCCCcceechhhhhhhhhhcccchhHHHHHHHHHhhhcCcceeccc--hhee Confidence 00000111 11122356677788999999999999766655555555555553332 22221 1244 Q ss_pred cccccccccccccc--cccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-- Q lcl|Aclame:pro 183 KMVTVAELEKNPAL--AKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTT-- 258 (394) Q Consensus 183 ~~~~~~e~~~~~~~--~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~-- 258 (394) .+..++|+++.++. +..+++.|++...+.+..+.+|+||+.||..|+.++......+++++..+..++++..+.++ T Consensus 125 Ra~~IgEGgE~~~~sld~~T~dsv~~~~gK~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~ghtv 204 (393) T protein:vir:79 125 RAYDVAEGQEIPEDSIDWQTHESPEIRVGKSGIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSHGHTV 204 (393) T ss_pred eeccccccccccccchhhhcCCceeEEechhhhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhccccee Confidence 55677888888864 44689999999999999999999999999999999999999999999998888877644322 Q ss_pred ---------------------cccccHHHHHHHHHhhhhhhcc-cEEEEcHHHHHHHHhh---hc----cCCceeecccc Q lcl|Aclame:pro 259 ---------------------KTVKNLDEIKALLNGGFDPAYN-VSLIVSQSFYQTLDTL---KD----GNGRYLLQDDI 309 (394) Q Consensus 259 ---------------------~~~~~~~~i~~~~~~~~~~~~~-a~~vm~~~~~~~l~~l---kd----~~G~~l~~~~~ 309 (394) .+....+++.+++.+...+-++ .+++|||-+|+.+.+- .. +-|+|--..-. T Consensus 205 fDa~st~t~ahptGr~~~~~qNGTlSleDllDm~~av~~~hyt~svi~MHPLAWnv~AKna~me~~~~na~gN~~~~~~~ 284 (393) T protein:vir:79 205 FDNYSTNKLAHTTGLDKNGVQNDTFSAEDFLDLIIAVMANEYTPSDLMMHPLAWTVFAKNELMGSLQANPYGNYPAKGAP 284 (393) T ss_pred eeccccCccceeecCCccccccccccHHHHHHHHHHHhcccCCcceEEEcCchhhhhhhhhhhcceeeccccccCccccc Confidence 1234579999999888887775 6899999999998652 22 22222100000 Q ss_pred cC--CCcccccc-----cceEEecCcccc----cCceEEEeccc-cEEEEeecceEEEEe-ecccccceEEEEEEeccEE Q lcl|Aclame:pro 310 TA--VSGKVLLG-----KPVFVLSDEVLG----ANKAFIGDFKR-GVLFADRKDLGLRWA-DNEIYGQYLQAVLRFGVSK 376 (394) Q Consensus 310 ~~--~~~~~l~G-----~pV~~~~~~~~~----~~~~~~gd~~~-~~~~~~~~~~~i~~~-~~~~~~~~~r~~~r~d~~v 376 (394) +. -+|..|.| +.|++++-.+.. ...++.-|=+. ++++. +-+++++.- +-....+.++...|+|+.| T Consensus 285 ts~algp~~i~~~~~~nlnv~~sPfvp~d~k~~rFd~~~Vd~NnvgvlLV-~D~i~tdq~ddk~rdiq~iKl~ERYG~gv 363 (393) T protein:vir:79 285 SSMALGPDSIQGRLPFNFNVNLSPFIPLDKKSRRFDVYAVDRNNVGVLLV-RDDLKTDQWDEKARGLQNIKMIERYGIGI 363 (393) T ss_pred hhhhhchhhhccccccceeEEEecccccccccceeeEEEeecCCceEEEE-ecCcceeccccccccceeeeeeeeeceee Confidence 00 12223333 456665532222 12233333222 33343 446655533 3334567889999999999 Q ss_pred ecc-cce---EEEEec-CccCCC Q lcl|Aclame:pro 377 VDD-KAG---YYVTFT-PEPLPL 394 (394) Q Consensus 377 ~~~-~af---~~l~~~-~~~~~~ 394 (394) ++. +|+ .-++++ ..|.|. T Consensus 364 Ln~gkaiavakNI~~~k~y~~P~ 386 (393) T protein:vir:79 364 LNEGKAIAVAKNISMDKSYAEPM 386 (393) T ss_pred eeCCceEEEEecceeecccccch Confidence 985 343 344443 334666 No 118 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=99.58 E-value=5.3e-16 Score=104.32 Aligned_cols=260 Identities=14% Similarity=0.084 Sum_probs=180.0 Q ss_pred hhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEee----cCCceeEEEEecCCCcccccccccccccccccccce Q lcl|Aclame:pro 128 KDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQA----KKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKD 203 (394) Q Consensus 128 ~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~ 203 (394) .....+.-...++|+.+++.+.+.+.....+.+++.+-+. .+..+++|.+.. .+.+..+.++.... ..+.+.+. T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~-ig~a~~~~~g~~i~-~~~lt~~~ 78 (274) T protein:vir:95 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIY-SGDAKVVAEGEKIP-TDILETKK 78 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecC-CCccccccCCCccc-hhhcccce Confidence 2222344456789999999999888887777777654442 344788888754 34445566655544 56788888 Q ss_pred eeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---cccccHHHHHHHHHhhhhhhc- Q lcl|Aclame:pro 204 VAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTT---KTVKNLDEIKALLNGGFDPAY- 279 (394) Q Consensus 204 v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~---~~~~~~~~i~~~~~~~~~~~~- 279 (394) .++..++.+..+.++++....+..|+.+.+.+.++..+++..+..++....++.. ....+++.+.++...+-+... T Consensus 79 ~~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~~~~~~~d~i~~A~~~lgd~~~~ 158 (274) T protein:vir:95 79 REAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVEADITKLTGLQTAIDKFNDEDLE 158 (274) T ss_pred eEEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHHhcccccc Confidence 8999899888999999877767678999999999999999988877766554432 334578999988877554432 Q ss_pred ccEEEEcHHHHHHHHhhhccCCceee-----cccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEE Q lcl|Aclame:pro 280 NVSLIVSQSFYQTLDTLKDGNGRYLL-----QDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGL 354 (394) Q Consensus 280 ~a~~vm~~~~~~~l~~lkd~~G~~l~-----~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i 354 (394) ...++|||..+..|++.. .-+++- .+.+.++.-++++|++|+++++.+ ....+++|.. ++..+...++.+ T Consensus 159 ~~~ivv~p~~~~~L~k~~--~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~-~~t~~l~~~g--A~~~~~~~~~~v 233 (274) T protein:vir:95 159 PMVLFISPLDAGKLRGDA--TTNFTRATELGDDVIVKGAFGEALGAVIVRSNKLE-AGTAILAKKG--AVKLITKRDFFL 233 (274) T ss_pred ccEEEeCHHHHHHHHhhc--cccccccccccccceeccccceecCeEEEEeCCCC-CceEEEEecc--ceeeeecCCccc Confidence 356889999999987631 111111 112344556789999999876543 3344566643 344456778888 Q ss_pred EEeeccc-ccceEEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 355 RWADNEI-YGQYLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 355 ~~~~~~~-~~~~~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) +..++.. +...+++.++|+.++++|+.+++++...=-.-. T Consensus 234 E~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~~~~~ 274 (274) T protein:vir:95 234 ETDRDPSTKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred ccccccccccCEEEEeEEEEEEEEcCCcEEEEEcCCccccC Confidence 8777654 556789999999999999999999833322222 No 119 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=99.58 E-value=5.3e-16 Score=104.32 Aligned_cols=260 Identities=14% Similarity=0.084 Sum_probs=180.0 Q ss_pred hhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEee----cCCceeEEEEecCCCcccccccccccccccccccce Q lcl|Aclame:pro 128 KDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQA----KKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKD 203 (394) Q Consensus 128 ~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~ 203 (394) .....+.-...++|+.+++.+.+.+.....+.+++.+-+. .+..+++|.+.. .+.+..+.++.... ..+.+.+. T Consensus 1 m~~~~T~l~d~i~Pev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G~tv~iP~~~~-ig~a~~~~~g~~i~-~~~lt~~~ 78 (274) T protein:vir:96 1 MAQGMTKLTNQIVPEVLAPMMQAELEKKLRFASFAEIDNTLVGQPGDTLTFPAFIY-SGDAKVVAEGEKIP-TDILETKK 78 (274) T ss_pred CCcceeehhheechHHHHHHHHHHHHhhhhccccceecccccCCCCCEEEeeeecC-CCccccccCCCccc-hhhcccce Confidence 2222344456789999999999888887777777654442 344788888754 34445566655544 56788888 Q ss_pred eeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---cccccHHHHHHHHHhhhhhhc- Q lcl|Aclame:pro 204 VAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTT---KTVKNLDEIKALLNGGFDPAY- 279 (394) Q Consensus 204 v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~---~~~~~~~~i~~~~~~~~~~~~- 279 (394) .++..++.+..+.++++....+..|+.+.+.+.++..+++..+..++....++.. ....+++.+.++...+-+... T Consensus 79 ~~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~~~~~~~~~~~d~i~~A~~~lgd~~~~ 158 (274) T protein:vir:96 79 REAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAKLTVEADITKLTGLQTAIDKFNDEDLE 158 (274) T ss_pred eEEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCHHHHHHHHHHhcccccc Confidence 8999899888999999877767678999999999999999988877766554432 334578999988877554432 Q ss_pred ccEEEEcHHHHHHHHhhhccCCceee-----cccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEE Q lcl|Aclame:pro 280 NVSLIVSQSFYQTLDTLKDGNGRYLL-----QDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGL 354 (394) Q Consensus 280 ~a~~vm~~~~~~~l~~lkd~~G~~l~-----~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i 354 (394) ...++|||..+..|++.. .-+++- .+.+.++.-++++|++|+++++.+ ....+++|.. ++..+...++.+ T Consensus 159 ~~~ivv~p~~~~~L~k~~--~~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s~~~~-~~t~~l~~~g--A~~~~~~~~~~v 233 (274) T protein:vir:96 159 PMVLFISPLDAGKLRGDA--TTNFTRATELGDDVIVKGAFGEALGAVIVRSNKLE-AGTAILAKKG--AVKLITKRDFFL 233 (274) T ss_pred ccEEEeCHHHHHHHHhhc--cccccccccccccceeccccceecCeEEEEeCCCC-CceEEEEecc--ceeeeecCCccc Confidence 356889999999987631 111111 112344556789999999876543 3344566643 344456778888 Q ss_pred EEeeccc-ccceEEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 355 RWADNEI-YGQYLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 355 ~~~~~~~-~~~~~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) +..++.. +...+++.++|+.++++|+.+++++...=-.-. T Consensus 234 E~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~~~~~~ 274 (274) T protein:vir:96 234 ETDRDPSTKTTALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred ccccccccccCEEEEeEEEEEEEEcCCcEEEEEcCCccccC Confidence 8777654 556789999999999999999999833322222 No 120 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=99.57 E-value=3.9e-16 Score=105.04 Aligned_cols=287 Identities=11% Similarity=0.011 Sum_probs=184.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCcee Q lcl|Aclame:pro 94 FIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGK 173 (394) Q Consensus 94 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~ 173 (394) ..+.-... .....+.......... ..+++-...+.+.|......|++.+.+.++|++.+++..+.++.+. T Consensus 1 ~~~~~~~~--------~~~~~~~~~~~~p~l~--m~alTLaea~~l~~d~~~~~VIE~l~~~s~iL~~lpf~~ve~~~~~ 70 (330) T protein:vir:94 1 MVRICTPP--------LRGRWRTLTHQFPELK--MPTVTLAESAKLSQDHLVSGLIETIVEVNPLYEMMPFTEIEGNALA 70 (330) T ss_pred CceecCCc--------cccceeehhccccccc--hhhhhhhHHhhcCchhhHHHHHHhhhccchHHhhcccccccCCcce Confidence 00000000 0000000000111111 1233344456788999999999999999999999988888888888 Q ss_pred EEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHh--ccHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 174 YPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESID--DADVDLVGIVSESISQIKVNTTNDAIAK 251 (394) Q Consensus 174 ~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~--ds~~~l~~~i~~~l~~~~~~~~~~a~~~ 251 (394) +++... -+++.|..-....++....+|.+++.+.+.+.+.+.|++.+.+ ....++..+-.+...+++......++++ T Consensus 71 ~~r~~~-lp~a~~r~~n~~~~~~~~~Tf~q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~~e~~lin 149 (330) T protein:vir:94 71 YNRENV-LGDVQFLAVGGTITAKNPATFTKVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQYQASMIT 149 (330) T ss_pred eeeeec-CCcceeeeccccccccCcceeeeeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 886543 3444454444444443345899999999999999999999954 4566888888888899999999999999 Q ss_pred cccccc-------------------ccccccHHHHHHHHHhhhh-hhcccEEEEcHHHHHHHHhhhccCCceeecccccC Q lcl|Aclame:pro 252 VLKSFT-------------------TKTVKNLDEIKALLNGGFD-PAYNVSLIVSQSFYQTLDTLKDGNGRYLLQDDITA 311 (394) Q Consensus 252 g~~~~~-------------------~~~~~~~~~i~~~~~~~~~-~~~~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~ 311 (394) |++++. ..+..+.|++-.++..... ......|+||++...+|+.+....|+|-..|...+ T Consensus 150 GDs~~~~F~GL~~~~~~~q~i~tg~~gg~~T~d~LDeLl~~v~~~~g~~~~~l~n~a~~r~I~a~~R~~~~~~v~~~~~~ 229 (330) T protein:vir:94 150 GDGTGNSFQGMMGLVAASQTISAGANGGTLTFELLDQLLDLVKDKDGQVDYLMSSFAMRRKYFSLLRALGGAAIGEVMTL 229 (330) T ss_pred cCCCCccccchhhcCCcccEEecCCCCCCCCHHHHHHHHHHhcCCCCCCcEEEechhHHHHHHHHHHhccCCCCCCcccc Confidence 865421 1233455665555544332 23457899999999999999888877665443222 Q ss_pred ---CCcccccccceEEecCcc--------cccCceEEEecc-----ccEEEEe---ecceEEEEee--cccccceEEEEE Q lcl|Aclame:pro 312 ---VSGKVLLGKPVFVLSDEV--------LGANKAFIGDFK-----RGVLFAD---RKDLGLRWAD--NEIYGQYLQAVL 370 (394) Q Consensus 312 ---~~~~~l~G~pV~~~~~~~--------~~~~~~~~gd~~-----~~~~~~~---~~~~~i~~~~--~~~~~~~~r~~~ 370 (394) ....++.|.||+.++-.+ .+...||+..|. +++.... ..|+.|+..- +.....-.++.+ T Consensus 230 ~~G~~v~~~~GvPi~~~d~ip~~~~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~glsVr~~G~~~~k~v~~~~v~~ 309 (330) T protein:vir:94 230 PSGRQIPTYRGVPWFVNDFIPSNMTQGTATNATAIFAGTFDDGSNKYGIAGLTARGSAGLRVQNVGAKENADETITRVKM 309 (330) T ss_pred cCCCEEeeeCCeEEEecccccCCCCcccCCCceeEEEEeecccccccceEeecCCCCCcceeeeCCCccccceeeEEEEE Confidence 223467899988765322 234567766653 3444332 2467666532 333344568899 Q ss_pred EeccEEecccceEEEEecCcc Q lcl|Aclame:pro 371 RFGVSKVDDKAGYYVTFTPEP 391 (394) Q Consensus 371 r~d~~v~~~~af~~l~~~~~~ 391 (394) +++.++.+|+|+.+|+.-..= T Consensus 310 y~~~av~~~~a~~~L~~V~~g 330 (330) T protein:vir:94 310 YCGFANFSQLGLAAIKGLIPG 330 (330) T ss_pred eeeeEEechhheeeeccccCC Confidence 999999999999998765544 No 121 >protein:vir:8324 Length: 410 # NCBI annotation: gp41 # Family: family:all:30827 # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817892;genbank:gi:29566325;genbank:GeneID:1259520 Probab=99.46 E-value=1.1e-14 Score=97.13 Aligned_cols=360 Identities=11% Similarity=0.071 Sum_probs=184.2 Q ss_pred ChHHHHHHHHH--------------HHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MFEEKIKEIKA--------------TIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESS 66 (394) Q Consensus 1 ~l~e~l~eL~~--------------~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~ 66 (394) ...++.+.|+. ++...+++-.+...|.+. ..+..+...+...++..+..+.+++...+...-. T Consensus 11 ~~RR~~~~L~~~EvSvv~~PAY~nA~vt~vRe~e~~~~~e~~~--~~e~~en~~e~~~~~~~~~~E~Rs~~~~i~~~~~- 87 (410) T protein:vir:83 11 YIRRLENELREKESLVRGIYDRANASNRDVNEEEGQMVAECRG--RMEQIKNQMEQAQEVNRIAFETRSKGQAVDAAIS- 87 (410) T ss_pred HHHHHHHHhhhhheeeeccccccccccccchhhhccccccccC--cccchhhhhHHHHHHHHHHHHHHHHHHHHHhhhc- Confidence 22222222210 001111110011111110 0111111222233344433333333222211111 Q ss_pred HhhccccccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHh Q lcl|Aclame:pro 67 VEVGGAENIGGKEVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILY 146 (394) Q Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~ 146 (394) ..... ......+- +...+++++.- ...+......+.+.... .......+.+....+|.++.. T Consensus 88 ---~~r~~--p~~~~vey----RSaGE~lkal~------~~~~Gd~~A~~~~e~~r---~a~~~~~Tgd~~~~i~~~~v~ 149 (410) T protein:vir:83 88 ---AMRGS--PVGTEVEY----RSAGEYMLDMW------NSAQGNASAADRLEVYA---RAADHQKTGDLQGVIPDPIVG 149 (410) T ss_pred ---cCcCC--CCCCCccc----ccHHHHHHHHh------ccCCchHHHHHHHHHHH---HhhccCcccccccccchhHhh Confidence 00000 01111111 11122222110 00000000000011100 001112222223456777898 Q ss_pred HHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCC-----cccccccccccccccccccceeeecHhhhhhhhhhhHHH Q lcl|Aclame:pro 147 TPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATT-----KMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQES 221 (394) Q Consensus 147 ~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~el 221 (394) ..++.+.+..++.++....|..+.++.+|+.+.... ....-..|+...+..+.+|+.-+-..++++++..+|++. T Consensus 150 d~i~li~q~r~i~slf~tLP~~g~T~eY~v~t~~~tV~~q~~~~kqa~EGd~L~~gKl~~~t~tA~ikTyGGyt~LSRQ~ 229 (410) T protein:vir:83 150 PVIDFIDSARPLVSTLGTLPLNNATFYRPIVSQRPAVGLQGVAGGASDEKTELDSQKMVIDRLTVNAKTLGGYVNVSRQA 229 (410) T ss_pred hHHHHHhhccchhhhhhhCCCCCCeeEEeeecccccccccccccccccccccccccceeeeeccceeehhcCccccccee Confidence 999999999999998888888888888887643321 122233445555567788888889999999999999999 Q ss_pred HhccHHHHHHHHHHHHHHHHHHHHHHH---Hhhcc-ccccccccccHHHHH----HHHHhhhhhhccc---EEEEcHHHH Q lcl|Aclame:pro 222 IDDADVDLVGIVSESISQIKVNTTNDA---IAKVL-KSFTTKTVKNLDEIK----ALLNGGFDPAYNV---SLIVSQSFY 290 (394) Q Consensus 222 l~ds~~~l~~~i~~~l~~~~~~~~~~a---~~~g~-~~~~~~~~~~~~~i~----~~~~~~~~~~~~a---~~vm~~~~~ 290 (394) ++.|.+++.+...+.|..++++..+.+ ++..+ ......+..+.+.+. ++.....+...+. .+.++|.++ T Consensus 230 IERs~v~~L~~~lraL~~AYA~atea~vra~L~~t~t~~~a~~~~Tad~~~~~i~da~~~v~da~~~~~~~~i~vS~DVl 309 (410) T protein:vir:83 230 IDFSSPSALDLVVNGLGQQYAIETEALVGAALASTSTGAVGYGNATADNVASAIWQAAGAVYTAVKGMGRLVIAIAPDVL 309 (410) T ss_pred eecCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccHHHHHHHHHHHHHHHhhhhccceeeeEEechhhh Confidence 999999999999999988877765543 22222 122223334555444 4344444443443 477999997 Q ss_pred HHHHhh-hccCCceeeccc-------ccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeecccc Q lcl|Aclame:pro 291 QTLDTL-KDGNGRYLLQDD-------ITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEIY 362 (394) Q Consensus 291 ~~l~~l-kd~~G~~l~~~~-------~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~ 362 (394) ..+..+ ++-+ +.|... +..+..+.|+|.||++.+ ..++++.+|-|.. ++..|...+-.++.++...+ T Consensus 310 ~~~~~~f~~~~--~~~~dt~Gfg~~~lg~gi~G~~~~ipVvm~~--~a~AgTA~f~~~~-Ai~~~eS~~gp~qL~d~~i~ 384 (410) T protein:vir:83 310 GDFGPLFAPVN--PTNAHSTGFEAGRFGQGVMGSISGIPVVMSA--ALGSGDAYLFSTA-AIECFEQRVGTLQVVEPSVF 384 (410) T ss_pred hhccceeeccC--CCCcccccccccccccchhhhhcccceEEec--CCCcCeeeEeccc-eeeeeecCCceeEeeCCchh Confidence 665433 3322 222211 112345789999999976 4577778888865 57777655445666655432 Q ss_pred --cceEEEEEEeccEEecccceEEEEec Q lcl|Aclame:pro 363 --GQYLQAVLRFGVSKVDDKAGYYVTFT 388 (394) Q Consensus 363 --~~~~r~~~r~d~~v~~~~af~~l~~~ 388 (394) ...+- .+|.+.+..+.+++-|..+ T Consensus 385 nLt~~yS--gY~a~a~~~~~gliPv~g~ 410 (410) T protein:vir:83 385 GLQVAYA--GYFSTLVVNEDAIVPLVGS 410 (410) T ss_pred hhhhhhe--eeeeeccccccceeeeccC Confidence 23444 4447788999999888887 No 122 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=99.40 E-value=1.1e-14 Score=97.04 Aligned_cols=222 Identities=15% Similarity=0.082 Sum_probs=159.5 Q ss_pred hheeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHH Q lcl|Aclame:pro 159 KPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESIS 238 (394) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~ 238 (394) .+-++ + +.++++|.+ .+.+..+.|+.+.+ ....+++..+...++.+..+.|+++-...+..|..+...+.++ T Consensus 1 ~~~~~---~-Gdtit~P~~---iGda~~v~eG~~i~-~~~l~~t~~~atIk~~gk~~~itD~a~l~~~gDp~~ea~~Q~~ 72 (231) T protein:vir:73 1 ENGIN---L-ANLCEYPND---IGDAADVAEGGEIS-LDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLG 72 (231) T ss_pred Ccccc---C-CceEEeccc---ccchhhhcCCCcCC-hhhccccceeeeEeeeccceeeeHHHHhhccCchHHHHHHHHH Confidence 22222 2 235667743 45556667777766 4668899999999999999999998777666688899999999 Q ss_pred HHHHHHHHHHHhhcccccc--ccccccHHHHHHHHHhhhhhhcc-cEEEEcHHHHHHHHhhhccCC--ceeecccccCCC Q lcl|Aclame:pro 239 QIKVNTTNDAIAKVLKSFT--TKTVKNLDEIKALLNGGFDPAYN-VSLIVSQSFYQTLDTLKDGNG--RYLLQDDITAVS 313 (394) Q Consensus 239 ~~~~~~~~~a~~~g~~~~~--~~~~~~~~~i~~~~~~~~~~~~~-a~~vm~~~~~~~l~~lkd~~G--~~l~~~~~~~~~ 313 (394) ..+++..|..++....+.. ..+..+++.|.+++..+-+.... .+++|||..+..|++..+..- ...-.+-+.+|. T Consensus 73 ~~iA~kvD~di~~~~~~a~l~~~~~~t~d~i~~A~~~fgde~~~~~vivv~p~~~~~Lrk~~~~~~~~~~~g~~i~~~G~ 152 (231) T protein:vir:73 73 LSLANKVDDDLLKAAKTTSQTVSTKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGT 152 (231) T ss_pred HHHHHhhhHHHHHhhccccccccccccHHHHHHHHHHhccccccceEEEEcchHHHhhhhccchhhhhhhhccceeeecc Confidence 9999999998776654433 34567899999888776655443 468899999999988543311 111122345556 Q ss_pred cccccccceEEecCcccccCceEEEe---ccccEEEEeecceEEEEeeccc-ccceEEEEEEeccEEecccceEEEEecC Q lcl|Aclame:pro 314 GKVLLGKPVFVLSDEVLGANKAFIGD---FKRGVLFADRKDLGLRWADNEI-YGQYLQAVLRFGVSKVDDKAGYYVTFTP 389 (394) Q Consensus 314 ~~~l~G~pV~~~~~~~~~~~~~~~gd---~~~~~~~~~~~~~~i~~~~~~~-~~~~~r~~~r~d~~v~~~~af~~l~~~~ 389 (394) -+++.|+||++++..+.++ .++.. -+-++.++...++.++..++.. +...+++.+.|+..+++|+.+|+++++- T Consensus 153 iG~i~G~~Vi~S~~~~~~~--~~~~~~i~~~gAl~~~~k~~~~vEtdRd~~~k~~~i~~~~~y~v~l~~~~~vv~~t~~g 230 (231) T protein:vir:73 153 YADVLGAQIVRSKKLAEGS--ALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFTG 230 (231) T ss_pred cceEcceEEEEcCCCCCCc--eeeeeEEeeccceeeeecccceeeccccccccccEEEEeEEEEEEEEcCccEEEEEeec Confidence 6799999999976554333 33211 1234556778888899887765 4456888899999999999999999999 Q ss_pred c Q lcl|Aclame:pro 390 E 390 (394) Q Consensus 390 ~ 390 (394) . T Consensus 231 ~ 231 (231) T protein:vir:73 231 V 231 (231) T ss_pred C Confidence 9 No 123 >protein:vir:99424 Length: 360 # NCBI annotation: hypothetical protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919080;genbank:gi:119757038;genbank:GeneID:4606077 Probab=99.34 E-value=4e-13 Score=88.54 Aligned_cols=282 Identities=11% Similarity=-0.026 Sum_probs=157.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCceeEEE Q lcl|Aclame:pro 97 SKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPV 176 (394) Q Consensus 97 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 176 (394) ...+...+.. .+.....-.........-++.+++......+++.+.+.+++++.++++++.+.+..++. T Consensus 1 ~~~~~~~~~~-----------~n~~~~~i~k~~it~~~l~~g~L~p~~a~~Fl~~v~~~t~iL~~~r~~~~~s~~~ei~k 69 (360) T protein:vir:99 1 MSSNSTIDSV-----------RNQNMNSLSQKDIGLAELDGFQLPVDVTEEFLERMQKGVQILGMADTMTLARLEMEVPQ 69 (360) T ss_pred CcchhHHHHH-----------hhhHHHHHHhhhccccccCceeecHHHHHHHHHHHhhccchhhhcceeecccccccccc Confidence 0000000100 01110111111222222345677778899999999999999999999999888887765 Q ss_pred EecCCCcccccccccccccccccccceeeec-HhhhhhhhhhhHHHHhcc----HHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 177 LQRATTKMVTVAELEKNPALAKPDFKDVAWN-IDTYRGAIPLSQESIDDA----DVDLVGIVSESISQIKVNTTNDAIAK 251 (394) Q Consensus 177 ~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~-~~~~~~~~~vs~ell~ds----~~~l~~~i~~~l~~~~~~~~~~a~~~ 251 (394) ..-+.-..-...|.+..++.++.+...+.+. .+++-....+..+-+++. ...+++.|.+.+++++++-...-.++ T Consensus 70 ig~G~r~~r~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~~i~~~~~~~n~~~~~~~f~~~i~~~~ae~~~~Dle~l~~~ 149 (360) T protein:vir:99 70 FGVPRLSGHTRDEEGSRTENSEAESGSVKFNATDKSYYILVEPKRDALKNTHYGPDQFGDYIVDQFIERYGNDLGLMGIR 149 (360) T ss_pred cccceeeccccccCCCCCcCCcCccccCccccccceeeEeechHHHHHhhhhcccchhHHHHHHHHHHHHHHHHHHHHhh Confidence 4322111112223333333333444444442 334444445555555543 23567899999999988876665555 Q ss_pred ccccccc----ccccc----------------------------------------------H----------HH-HHHH Q lcl|Aclame:pro 252 VLKSFTT----KTVKN----------------------------------------------L----------DE-IKAL 270 (394) Q Consensus 252 g~~~~~~----~~~~~----------------------------------------------~----------~~-i~~~ 270 (394) |+....- .+... + .. +.++ T Consensus 150 g~~ds~d~~~~~~~d~fl~~~dGwlKka~~~~~~id~a~d~t~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~lf~~~ 229 (360) T protein:vir:99 150 AGASSGNLQSIGGAAELDNTFKGWIARAEGDAQSVDDAGDSTRIGLEDTATADADSMPSIANTDGSGNPQPVDTSLFNET 229 (360) T ss_pred ccchhcccccCcccchhhhhhHHHHHHhhcccchhhccccccccccccccccccccchhhhccccccccccchHHHHHHH Confidence 5433110 00000 0 01 2344 Q ss_pred HHhhhhhhcc-----cEEEEcHHHHHH-HHhhhccCCceeecccccCCCcccccccceEEecCcccccCceEEEeccccE Q lcl|Aclame:pro 271 LNGGFDPAYN-----VSLIVSQSFYQT-LDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGV 344 (394) Q Consensus 271 ~~~~~~~~~~-----a~~vm~~~~~~~-l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~ 344 (394) +..++..+++ ..|+||+.+... .+.|.+-+. ++--.-+.++..-+.+|+||+.++. .+++.+++-+++..+ T Consensus 230 ~~~Lp~kyr~~~~~~~~~~~s~~~~~~yr~~L~~R~t-~LGd~~l~g~~~~~~~Gipi~~v~~--~pd~~~mlT~p~NLi 306 (360) T protein:vir:99 230 IQTLDSRYRESDAYSPVLMTSPNQVQSYTMSLTERED-PLGSAVIFGDSDITPFSYDLVGVNG--FPDEYMMFTDPNNLA 306 (360) T ss_pred HHhcchhhhcCcccceEEEccCchHHHHHHHHhccCc-ccchhheecccccccceeeeEEcCC--CCCCceEEeccCcee Confidence 4445555543 379999988544 445543332 2322223334444678999998884 467789999998754 Q ss_pred EEEeecceEEEEeec-ccc---cceEEEE--EEeccEEecccceEEEEecCccCC Q lcl|Aclame:pro 345 LFADRKDLGLRWADN-EIY---GQYLQAV--LRFGVSKVDDKAGYYVTFTPEPLP 393 (394) Q Consensus 345 ~~~~~~~~~i~~~~~-~~~---~~~~r~~--~r~d~~v~~~~af~~l~~~~~~~~ 393 (394) +. -...+.++.+.+ +.. ...++.+ ..+|+.+.+++|.|.++.-+.|+. T Consensus 307 ~g-~~~~iri~~~~e~~~~~~~~~~~~~~~~~~~D~~iee~~Av~~vt~~~~~~~ 360 (360) T protein:vir:99 307 FG-LYEEMELDQSTDTDKVHEQRLHSRNWLEGQFDFQIKEQQAGVLVTDLETPTA 360 (360) T ss_pred EE-eeeeeEEeecccchhhhhhceeeeEEEEEEeeEEEEecccEEEEecCCCCCC Confidence 43 345666665433 221 1124444 458999999999999999999988 No 124 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=99.30 E-value=1.6e-12 Score=85.23 Aligned_cols=263 Identities=11% Similarity=0.053 Sum_probs=165.7 Q ss_pred hhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCC-ccc--cccccccccccccccccee Q lcl|Aclame:pro 128 KDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATT-KMV--TVAELEKNPALAKPDFKDV 204 (394) Q Consensus 128 ~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~--~~~e~~~~~~~~~~~~~~v 204 (394) +.+++-.....+.+......|++.+...+.|+...+..++.++.+.+.....-.+ +.. .|.-...-...+..+|+++ T Consensus 1 mpaltLaea~k~~~d~l~~~ViE~~~~~s~lL~~LpF~~veg~~~~ynR~~~~~~~~~~~v~~~~~~~g~~~~~~t~~~~ 80 (310) T protein:vir:97 1 MASVTLAESAKLAQDELVAGVIENIITVNRMFDVLPFDSIEGNSLAYNRENVLGDVIMAGVGTTFSGAGAGKAAATFTKV 80 (310) T ss_pred CcccchHHHhhcCcchHHHHHHHHHhccchHHHhCCcccccCCcceeeEeeccCCcccccccccccCCCcccccccccee Confidence 1122222334577888899999999999999999888888777777766543211 111 2222222223467899999 Q ss_pred eecHhhhhhhhhhhHHHHhc--c-HHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-------------------cccc Q lcl|Aclame:pro 205 AWNIDTYRGAIPLSQESIDD--A-DVDLVGIVSESISQIKVNTTNDAIAKVLKSFTT-------------------KTVK 262 (394) Q Consensus 205 ~~~~~~~~~~~~vs~ell~d--s-~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~-------------------~~~~ 262 (394) +...+.+++.+.|.+.+.+- + ..+...+=.+...+++.......+++|+.+..+ .+.. T Consensus 81 ~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n~F~GL~~~~~~~q~i~~~~~gg~~ 160 (310) T protein:vir:97 81 NSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGAGNEFAGLIQLCASGQKATTGATGSAI 160 (310) T ss_pred eeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccCCCcccchhhcCCccceeecCCCCCCC Confidence 99999999999999866542 2 334444444555678888888888887654322 1223 Q ss_pred cHHHHHHHHHhhhhh-hcccEEEEcHHHHHHHHhh-hccCCceeeccc--ccCCCcccccccceEEecCcc--------c Q lcl|Aclame:pro 263 NLDEIKALLNGGFDP-AYNVSLIVSQSFYQTLDTL-KDGNGRYLLQDD--ITAVSGKVLLGKPVFVLSDEV--------L 330 (394) Q Consensus 263 ~~~~i~~~~~~~~~~-~~~a~~vm~~~~~~~l~~l-kd~~G~~l~~~~--~~~~~~~~l~G~pV~~~~~~~--------~ 330 (394) +.|++-.++...... .....++|||.+..+|+.+ +..+++.+..+. +.+....++.|.|++.++-.+ . T Consensus 161 t~d~LDeLl~~v~~~~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~~~~G~~v~~~~GiPi~~~d~ip~~~~~~~~~ 240 (310) T protein:vir:97 161 SFAILDELMDLVVDKDGQVDYLTMHARTLRSYKALLRALGGASINEVVELPSGAEVPAYSGTPIFRNDYIPTNQTKGGTT 240 (310) T ss_pred CHHHHHHHHHHHhcCCCCCCEEEecHHHHHHHHHHHHHhcCCCCCCccccCCCCEEeeeCCeEEEEeCccCCCccccccC Confidence 455555555443322 2346899999998888754 555555554332 233333588999998876433 2 Q ss_pred ccCceEEEecc-----ccEEEE---eecceEEEEee--cccccceEEEEEEeccEEecccceEEEEecCc Q lcl|Aclame:pro 331 GANKAFIGDFK-----RGVLFA---DRKDLGLRWAD--NEIYGQYLQAVLRFGVSKVDDKAGYYVTFTPE 390 (394) Q Consensus 331 ~~~~~~~gd~~-----~~~~~~---~~~~~~i~~~~--~~~~~~~~r~~~r~d~~v~~~~af~~l~~~~~ 390 (394) +...+|...|. +++... ...|+.|+..- ++....-.++.++++.+|..|+|+.+|..-.- T Consensus 241 gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~G~~~~~~v~~~~V~~Y~~~av~~~~A~a~L~~V~~ 310 (310) T protein:vir:97 241 GCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDVGESEDSDEHIWRVKWYCGLALFSEKGLACADGITN 310 (310) T ss_pred CceeEEEEeeCccccccceeccccCCccceeEEeCCcccCCcceeEEEEEeeeEEEecccceeeeccccC Confidence 34566655543 344432 13456666532 23344446888999999999999999987666 No 125 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=99.17 E-value=2.6e-12 Score=84.11 Aligned_cols=264 Identities=14% Similarity=0.071 Sum_probs=155.0 Q ss_pred hhhhhcccc--cCCc-----cc-cchhHHhHHHHHHHhhhhhhheeee-EeecCCceeEEEEe--cCCCccccccccccc Q lcl|Aclame:pro 125 EPQKDGIKK--ENAK-----PV-SSEEILYTPAREVKTVVDLKPFTTV-YQAKKASGKYPVLQ--RATTKMVTVAELEKN 193 (394) Q Consensus 125 ~~~~~~~~~--~~~~-----~l-vP~~~~~~I~~~~~~~~~l~~~~~~-~~~~~~~~~~~~~~--~~~~~~~~~~e~~~~ 193 (394) .....+..+ .++. .+ -|+.+-..|.+.+...-.--.+.+. ....++.+.+.... ...+....+.|+++. T Consensus 1 ~~~~~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~~a~~~~~v~f~~~~p~~~~~d~e~VaEggEi 80 (318) T protein:vir:10 1 MTAPTGIVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNGGANPNGVVAYNEGNPSFLEDDVADVAEFGEI 80 (318) T ss_pred CCCCCcceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhhcccccccceeEEEecccccccCcHhhccCcccc Confidence 001111111 1111 11 2555555566655443322222222 22334444443211 123566778999999 Q ss_pred ccccccccceeee-cHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc---cccH----- Q lcl|Aclame:pro 194 PALAKPDFKDVAW-NIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTKT---VKNL----- 264 (394) Q Consensus 194 ~~~~~~~~~~v~~-~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~---~~~~----- 264 (394) |. +..+++.-.+ ..+|.+..+.||+|++.++..+..+-....++..+.+..+..++....++.+.+ ..++ T Consensus 81 P~-~~~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~~s~~w~~~~~ 159 (318) T protein:vir:10 81 PV-SAGARGLPRTAFAVKKALGVRVSKEMIDENRVGAVNDQMLQLRNTFIRANDRSAKALLQSPIVPTLAVPTAWDNGGK 159 (318) T ss_pred cc-cCCCCCchhhhhhehhccceeccHHHHhhcChhHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccCCcCCCCccc Confidence 85 5567766555 667999999999999999999999999999999999988887666553322111 0111 Q ss_pred --HHHHHHHHh---hhh-------------hhc-ccEEEEcHHHHHHHHhhhc------cCCceeec-ccccCCCccccc Q lcl|Aclame:pro 265 --DEIKALLNG---GFD-------------PAY-NVSLIVSQSFYQTLDTLKD------GNGRYLLQ-DDITAVSGKVLL 318 (394) Q Consensus 265 --~~i~~~~~~---~~~-------------~~~-~a~~vm~~~~~~~l~~lkd------~~G~~l~~-~~~~~~~~~~l~ 318 (394) .++.++... ... -.| --.+||||.+|..|.+-.+ .++.+++. +..++..+++++ T Consensus 160 ~~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdtIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~~~~~tg~~~g~~l 239 (318) T protein:vir:10 160 VRTDIAIAIEQISTAAPTAYPAGVGSSDEYFGFIPDTIVMHYALLPILMDNENFMKVYERNANYVSTAPDWTGNFPGSVM 239 (318) T ss_pred ccccchhhhhhhhhhhhhhhhhhhhhhhhccCccceeeEECHHHHHHHhcchhhhhhhhccchhhhhcccccccccceee Confidence 122222211 000 012 2479999999999954333 24454442 234566677899 Q ss_pred ccceEEecCcccccCceEEEeccccEEEEeecceEEEEeecccc----cc----eEEEEEEeccEEecccceEEEEecCc Q lcl|Aclame:pro 319 GKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEIY----GQ----YLQAVLRFGVSKVDDKAGYYVTFTPE 390 (394) Q Consensus 319 G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~----~~----~~r~~~r~d~~v~~~~af~~l~~~~~ 390 (394) |+.|++++ ..+.+.+++.|=...=.++|-.+++....+.++. +. -.|+..+....|.+|+|+++||.--+ T Consensus 240 Gl~vi~s~--~~p~~~alvlq~g~vG~~~d~~pl~~t~~~~egg~~~g~~~~s~~~~~~~~~~~~V~~PkA~~~itgi~~ 317 (318) T protein:vir:10 240 GLNVIRSR--TFPIDRVLIMERGTVGFYSDTRPLQFTALYPEGNGPNGGPTESYRADASHKRALAVDQPKAALWLTGIVT 317 (318) T ss_pred ceEEeecC--ccCCCeeEEEecCCcceeeccccceeeecccCCCCCCCCcchhhheehheeeeeeeeCcceeEEEeeccC Confidence 99998855 4556777776633222345667777665543311 11 14556667889999999999998877 Q ss_pred c Q lcl|Aclame:pro 391 P 391 (394) Q Consensus 391 ~ 391 (394) | T Consensus 318 ~ 318 (318) T protein:vir:10 318 P 318 (318) T ss_pred C Confidence 7 No 126 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=99.17 E-value=5.4e-12 Score=82.38 Aligned_cols=251 Identities=10% Similarity=-0.035 Sum_probs=142.1 Q ss_pred ccCCccccchhHHhHHHHHHHhhhhhhheeeeEe----ecCCceeEEEEecCCCcccc-cccccccccccccccceeeec Q lcl|Aclame:pro 133 KENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQ----AKKASGKYPVLQRATTKMVT-VAELEKNPALAKPDFKDVAWN 207 (394) Q Consensus 133 ~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~----~~~~~~~~~~~~~~~~~~~~-~~e~~~~~~~~~~~~~~v~~~ 207 (394) -+. ..++|+.|+..+++.++....+.++++.-. ..+.++.+|... ..+... ..+++... ..+.....++++ T Consensus 1 MA~-~~~~pei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~--~~~~~d~~~~~~~~~-~~~~~~~~~~~t 76 (273) T protein:vir:79 1 MAF-NNFIPELWSDMLLEEWTAQTVFANLVNREYEGIASKGNVVHIAGVV--APTVKDYKAAGRQTS-ADAISDTGVDLL 76 (273) T ss_pred Ccc-hhhhHHHHHHHHHHHHHhhccchhhhhccccccccCCcEEEEeecC--cccccccccCCCccC-ccccccceEEEE Confidence 111 236799999999999888887777764321 123356666543 233332 33333332 334556666666 Q ss_pred Hhh-hhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc----ccccc----ccHHHHHHHHHhhhhh- Q lcl|Aclame:pro 208 IDT-YRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSF----TTKTV----KNLDEIKALLNGGFDP- 277 (394) Q Consensus 208 ~~~-~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~----~~~~~----~~~~~i~~~~~~~~~~- 277 (394) ..+ .+.-+.|++.-...+..++.++ .+....++++..|..++....+. ...+. ..++.+.++...+-.. T Consensus 77 id~~~~~~~~i~d~d~~~~~~~~~~~-~~~~~~ala~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~~~ld~~~ 155 (273) T protein:vir:79 77 IDQEKSIDFLVDDIDRVQVAGSLEAY-TRAGATALATDTDKFIADMLVDNGTALTGSAPSDADDAFDLIASALKELTKAN 155 (273) T ss_pred EeeecccceeeccHHHHhhcccHHHH-HHHHHHHHHHHHHHHHHHHHhhcccccccccccchhhHHHHHHHHHHHhhhcc Confidence 644 2444566653333345578774 45667788887776554333211 11111 1234555544332222 Q ss_pred --hcccEEEEcHHHHHHHHhhhcc--CCceeec-ccccCCCcccccccceEEecCcccccCc-eEEEeccccEEEEeecc Q lcl|Aclame:pro 278 --AYNVSLIVSQSFYQTLDTLKDG--NGRYLLQ-DDITAVSGKVLLGKPVFVLSDEVLGANK-AFIGDFKRGVLFADRKD 351 (394) Q Consensus 278 --~~~a~~vm~~~~~~~l~~lkd~--~G~~l~~-~~~~~~~~~~l~G~pV~~~~~~~~~~~~-~~~gd~~~~~~~~~~~~ 351 (394) ..+-.++++|..+..|.+..+- +..+... ..+.+|..++|+|++|+.+...+.+.+. .+.+.-+ ++.... +. T Consensus 156 vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~i~~s~~lp~~~~~~~~a~~~~-A~~~a~-~~ 233 (273) T protein:vir:79 156 VPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPS-AAAYVS-QI 233 (273) T ss_pred CCccCcEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEeeEEeceEEEecccccccCceEEEEEecc-ceeeee-eh Confidence 2245789999999988654321 1111111 1244566679999999987655444332 3333322 333333 23 Q ss_pred eEEEEeec-ccccceEEEEEEeccEEecccceEEEEecCc Q lcl|Aclame:pro 352 LGLRWADN-EIYGQYLQAVLRFGVSKVDDKAGYYVTFTPE 390 (394) Q Consensus 352 ~~i~~~~~-~~~~~~~r~~~r~d~~v~~~~af~~l~~~~~ 390 (394) ..++..+. ..|.+.+++.+++|..+++|++++.++-+.+ T Consensus 234 ~~~e~~r~~~~~~~~v~~~~~yg~~v~~p~~vv~~~~~g~ 273 (273) T protein:vir:79 234 DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred hhhhcccCcccceeeeeeeeeeeeEEecCceEEEEeccCC Confidence 34444333 4566778999999999999999999887777 No 127 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=99.13 E-value=1.2e-11 Score=80.38 Aligned_cols=251 Identities=10% Similarity=-0.045 Sum_probs=140.3 Q ss_pred ccCCccccchhHHhHHHHHHHhhhhhhheeeeE----eecCCceeEEEEecCCCcccc-cccccccccccccccceeeec Q lcl|Aclame:pro 133 KENAKPVSSEEILYTPAREVKTVVDLKPFTTVY----QAKKASGKYPVLQRATTKMVT-VAELEKNPALAKPDFKDVAWN 207 (394) Q Consensus 133 ~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~~~~-~~e~~~~~~~~~~~~~~v~~~ 207 (394) -+. ..++|+.|+..+.+.++..+.+.++++.- ...+.++.+|... ..+... ..+++... ..+.+-..++++ T Consensus 1 MA~-~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~--~~~~~d~~~~~~~~~-~~~~~~~~~~~t 76 (273) T protein:vir:10 1 MAF-NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVV--APTVKDYKAAGRQTS-ADAISDTGVDLL 76 (273) T ss_pred Ccc-hhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecc--cccccccccCCCccC-ccccccceEEEE Confidence 111 23679999999999988888877776431 1123356666643 222232 23333322 233344555555 Q ss_pred Hhhh-hhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc----cccccc----cHHHHHHHHHhhhhh- Q lcl|Aclame:pro 208 IDTY-RGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSF----TTKTVK----NLDEIKALLNGGFDP- 277 (394) Q Consensus 208 ~~~~-~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~----~~~~~~----~~~~i~~~~~~~~~~- 277 (394) ..+. +.-+.|++.-...+..++.+ +.+....+++...|..++....+. ...+.. .++.|.++...+-.. T Consensus 77 id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~~~ld~~~ 155 (273) T protein:vir:10 77 IDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTALTGSAPTDADDAFDLIAKALKELTKAN 155 (273) T ss_pred EeeeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHHHHHHHHHHHHhhhcC Confidence 4332 33345665323334456877 455667888888777554432211 111111 245555544333222 Q ss_pred --hcccEEEEcHHHHHHHHhhhccCCc--eee-cccccCCCcccccccceEEecCcccccC-ceEEEeccccEEEEeecc Q lcl|Aclame:pro 278 --AYNVSLIVSQSFYQTLDTLKDGNGR--YLL-QDDITAVSGKVLLGKPVFVLSDEVLGAN-KAFIGDFKRGVLFADRKD 351 (394) Q Consensus 278 --~~~a~~vm~~~~~~~l~~lkd~~G~--~l~-~~~~~~~~~~~l~G~pV~~~~~~~~~~~-~~~~gd~~~~~~~~~~~~ 351 (394) ..+-.++++|..+..|.+..+-..+ ... ...+.+|..++|+|++|+.+...+.+.+ ..+.+.-+ ++.... +. T Consensus 156 vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~~~-A~~~a~-q~ 233 (273) T protein:vir:10 156 VPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPS-AAAYVS-QI 233 (273) T ss_pred CCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCCccEEEEEecc-ceeeee-ee Confidence 2345789999999998764321101 010 1123455667999999998765544333 34555433 333333 33 Q ss_pred eEEEEeec-ccccceEEEEEEeccEEecccceEEEEecCc Q lcl|Aclame:pro 352 LGLRWADN-EIYGQYLQAVLRFGVSKVDDKAGYYVTFTPE 390 (394) Q Consensus 352 ~~i~~~~~-~~~~~~~r~~~r~d~~v~~~~af~~l~~~~~ 390 (394) ..++..+. .+|...+++.+.+|..|++|++++.++-+.+ T Consensus 234 ~~~e~~r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 234 DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred ehhhcccCCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 34444333 4566778999999999999999999887777 No 128 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=99.13 E-value=1.2e-11 Score=80.38 Aligned_cols=251 Identities=10% Similarity=-0.045 Sum_probs=140.3 Q ss_pred ccCCccccchhHHhHHHHHHHhhhhhhheeeeE----eecCCceeEEEEecCCCcccc-cccccccccccccccceeeec Q lcl|Aclame:pro 133 KENAKPVSSEEILYTPAREVKTVVDLKPFTTVY----QAKKASGKYPVLQRATTKMVT-VAELEKNPALAKPDFKDVAWN 207 (394) Q Consensus 133 ~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~----~~~~~~~~~~~~~~~~~~~~~-~~e~~~~~~~~~~~~~~v~~~ 207 (394) -+. ..++|+.|+..+.+.++..+.+.++++.- ...+.++.+|... ..+... ..+++... ..+.+-..++++ T Consensus 1 MA~-~~~~pe~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gdtv~ip~~~--~~~~~d~~~~~~~~~-~~~~~~~~~~~t 76 (273) T protein:vir:10 1 MAF-NNFIPELWSDMLLEEWTAQTVFANLVNREYEGTASKGNVVHIAGVV--APTVKDYKAAGRQTS-ADAISDTGVDLL 76 (273) T ss_pred Ccc-hhhhHHHHHHHHHHHHHhhhccchhhccccccccccCceEEEeecc--cccccccccCCCccC-ccccccceEEEE Confidence 111 23679999999999988888877776431 1123356666643 222232 23333322 233344555555 Q ss_pred Hhhh-hhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc----cccccc----cHHHHHHHHHhhhhh- Q lcl|Aclame:pro 208 IDTY-RGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSF----TTKTVK----NLDEIKALLNGGFDP- 277 (394) Q Consensus 208 ~~~~-~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~----~~~~~~----~~~~i~~~~~~~~~~- 277 (394) ..+. +.-+.|++.-...+..++.+ +.+....+++...|..++....+. ...+.. .++.|.++...+-.. T Consensus 77 id~~~~~~~~i~d~d~~~~~~~~~~-~~~~~~~alA~~vD~~i~~~~~~a~~~~~~~~~~~~~~~~~~i~~a~~~ld~~~ 155 (273) T protein:vir:10 77 IDQEKSIDFLVDDIDRVQVAGSLEA-YTRAGATALATDTDKFIADMLVDNGTALTGSAPTDADDAFDLIAKALKELTKAN 155 (273) T ss_pred EeeeeecceEeecHHHhhhhccHHH-HHHHHHHHHHHHHHHHHHHHHhccccccccccccchhHHHHHHHHHHHHhhhcC Confidence 4332 33345665323334456877 455667888888777554432211 111111 245555544333222 Q ss_pred --hcccEEEEcHHHHHHHHhhhccCCc--eee-cccccCCCcccccccceEEecCcccccC-ceEEEeccccEEEEeecc Q lcl|Aclame:pro 278 --AYNVSLIVSQSFYQTLDTLKDGNGR--YLL-QDDITAVSGKVLLGKPVFVLSDEVLGAN-KAFIGDFKRGVLFADRKD 351 (394) Q Consensus 278 --~~~a~~vm~~~~~~~l~~lkd~~G~--~l~-~~~~~~~~~~~l~G~pV~~~~~~~~~~~-~~~~gd~~~~~~~~~~~~ 351 (394) ..+-.++++|..+..|.+..+-..+ ... ...+.+|..++|+|++|+.+...+.+.+ ..+.+.-+ ++.... +. T Consensus 156 vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~lp~~~~~~~~~~~~~-A~~~a~-q~ 233 (273) T protein:vir:10 156 VPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNNLRDTDDEQFVAFHPS-AAAYVS-QI 233 (273) T ss_pred CCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecccccCCccEEEEEecc-ceeeee-ee Confidence 2345789999999998764321101 010 1123455667999999998765544333 34555433 333333 33 Q ss_pred eEEEEeec-ccccceEEEEEEeccEEecccceEEEEecCc Q lcl|Aclame:pro 352 LGLRWADN-EIYGQYLQAVLRFGVSKVDDKAGYYVTFTPE 390 (394) Q Consensus 352 ~~i~~~~~-~~~~~~~r~~~r~d~~v~~~~af~~l~~~~~ 390 (394) ..++..+. .+|...+++.+.+|..|++|++++.++-+.+ T Consensus 234 ~~~e~~r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 234 DTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred ehhhcccCCCcceeeeeeeeeeeeeEeccceEEEEeccCC Confidence 34444333 4566778999999999999999999887777 No 129 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=99.05 E-value=2e-11 Score=79.28 Aligned_cols=272 Identities=13% Similarity=0.054 Sum_probs=152.5 Q ss_pred HHhhhhhhh--hhhcccccCC--ccccchhHHhHHHHHHHhhhhhhheeeeEeecC-CceeEEEEecCCCcccccccccc Q lcl|Aclame:pro 118 INETTPVEP--QKDGIKKENA--KPVSSEEILYTPAREVKTVVDLKPFTTVYQAKK-ASGKYPVLQRATTKMVTVAELEK 192 (394) Q Consensus 118 ~~~~~~~~~--~~~~~~~~~~--~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~e~~~ 192 (394) +........ ...+.....+ -.+--+.++.++.......+.+++++++.++.+ .+..+|+... .+......+.. T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~G~sv~~~~iG~--~~~~~~~~g~~ 78 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAADKLALFLKVFGGEVLTAFVRRSVTMDKHMVRTIQNGKSASFPVMGR--TKGYYLAPGEN 78 (347) T ss_pred CCCcccchhhhccCCCCccccchHHHHHHHHHHHHHHHHHHHhhhhhccccccccCcceEEEeeecc--eeeeeeccccC Confidence 000000000 0111111111 234448889999988888899999999887654 4667775432 23333333333 Q ss_pred cccc-cccccceeeecHhhh-hhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcc----------------- Q lcl|Aclame:pro 193 NPAL-AKPDFKDVAWNIDTY-RGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVL----------------- 253 (394) Q Consensus 193 ~~~~-~~~~~~~v~~~~~~~-~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~----------------- 253 (394) .... -++..+++++...++ +.-..|.+--...+..|+.+.+.+..++++++..|..++... T Consensus 79 l~~~~~~~~~~~~~i~ID~~~y~~~~Vdd~D~~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~~~~~~~~~g~ 158 (347) T protein:vir:88 79 LDDKRKDIKHSEKVIQIDGLLTSDVLIYDIEDAMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLPAASNENIAGL 158 (347) T ss_pred CCCCCCCCccceEEEEEechhhhhhhhhhHHHHhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCCc Confidence 2211 234556666655544 333344433223345678999999999999999888765321 Q ss_pred cccccccc--------------ccHHHHHHHHHhhhhh---hcccEEEEcHHHHHHHHhh-hccCCceeecccccCCCcc Q lcl|Aclame:pro 254 KSFTTKTV--------------KNLDEIKALLNGGFDP---AYNVSLIVSQSFYQTLDTL-KDGNGRYLLQDDITAVSGK 315 (394) Q Consensus 254 ~~~~~~~~--------------~~~~~i~~~~~~~~~~---~~~a~~vm~~~~~~~l~~l-kd~~G~~l~~~~~~~~~~~ 315 (394) +++...+. .-++.|.++...+-.. ...-.+|++|..+..|..- +...+.|.-...+..+..+ T Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~vg 238 (347) T protein:vir:88 159 GQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGDRRFYCAPEDYSAILSALMPNAANYAALIDPETGNIR 238 (347) T ss_pred cccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCHHHHHHHhcchhhhhhhhccccchhcceee Confidence 11100000 0134455444333222 2245788999999887542 2333444433345556667 Q ss_pred cccccceEEecCccccc-----------------------CceEEEeccccEEEE-e--------ecceEEEEeec-ccc Q lcl|Aclame:pro 316 VLLGKPVFVLSDEVLGA-----------------------NKAFIGDFKRGVLFA-D--------RKDLGLRWADN-EIY 362 (394) Q Consensus 316 ~l~G~pV~~~~~~~~~~-----------------------~~~~~gd~~~~~~~~-~--------~~~~~i~~~~~-~~~ 362 (394) +++|++|+.+.+.+.+. ...+-+||+..+.++ - -.++.++..+. ..| T Consensus 239 ~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~v~~~d~~~e~~r~~~~~ 318 (347) T protein:vir:88 239 NVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGTVKLKDMALERARRPEFQ 318 (347) T ss_pred eeccceEEEeecccccccccccccccccccccccccccccccccccccCcEEEEEechhhhhheecccceeeeeechhhH Confidence 89999999977654311 112344555432221 1 23334444433 345 Q ss_pred cceEEEEEEeccEEecccceEEEEecCcc Q lcl|Aclame:pro 363 GQYLQAVLRFGVSKVDDKAGYYVTFTPEP 391 (394) Q Consensus 363 ~~~~r~~~r~d~~v~~~~af~~l~~~~~~ 391 (394) ...+++.+.+|..++||++.+.+++++++ T Consensus 319 ~d~i~~~~~~G~~~~rPe~a~~~~~~~a~ 347 (347) T protein:vir:88 319 ADQIIGKYAMGHGGLRPEAAGALVFTPAA 347 (347) T ss_pred HHHhhhhhhhcCceeccceEEEEEeCCCC Confidence 56788899999999999999999999999 No 130 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=98.81 E-value=5.5e-10 Score=71.34 Aligned_cols=271 Identities=12% Similarity=0.047 Sum_probs=151.2 Q ss_pred HHhhhhhh--hhhhcccccCC--ccccchhHHhHHHHHHHhhhhhhheeeeEeecC-CceeEEEEecCCCcccccccccc Q lcl|Aclame:pro 118 INETTPVE--PQKDGIKKENA--KPVSSEEILYTPAREVKTVVDLKPFTTVYQAKK-ASGKYPVLQRATTKMVTVAELEK 192 (394) Q Consensus 118 ~~~~~~~~--~~~~~~~~~~~--~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~e~~~ 192 (394) +....... ....+.....+ -.+--+.++.++.......+.+++++++.++.+ .+..+|+.. .........+.. T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~ie~~~geV~~~f~~~s~~~~~~~~rti~~G~sv~~~~iG--~~~~~~~~~G~~ 78 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFLKVFGGEVLTAFTRTSVTMNKHLVRSIQSGKSAQFPVLG--RTKAAYLQPGEN 78 (347) T ss_pred CCccccccccccccccCCcccchHHHHHHHHhHHHHHHHHHHHhhhhhhhheeccccceEEeeecc--ceeEeeeecCcC Confidence 00000000 00011111111 123348899999988888999999999888654 467777643 233344444444 Q ss_pred cccc-cccccceeeecHhhh-hhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcc---------------cc Q lcl|Aclame:pro 193 NPAL-AKPDFKDVAWNIDTY-RGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVL---------------KS 255 (394) Q Consensus 193 ~~~~-~~~~~~~v~~~~~~~-~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~---------------~~ 255 (394) .... .++..+++++...++ +.-..|.+-=-..+..|+.+.+.+..+.++++..|..|+.-. +. T Consensus 79 l~~~~~~~~~~e~~ltID~~~y~~~~VddiD~~q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~~~~~~~~g~ 158 (347) T protein:vir:94 79 LDDKRKDMKHTEKTINIDGLLTADVLIYDIEDAMNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPTANNENIAGL 158 (347) T ss_pred CCCCcCCccccceEEEEcchhhhhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccC Confidence 3321 235566655554443 222233332222345689999999999999999998775311 00 Q ss_pred ccc-------------ccccc----HHHHHHHHHhhhhhh---cccEEEEcHHHHHHHHhh-hccCCceeecccccCCCc Q lcl|Aclame:pro 256 FTT-------------KTVKN----LDEIKALLNGGFDPA---YNVSLIVSQSFYQTLDTL-KDGNGRYLLQDDITAVSG 314 (394) Q Consensus 256 ~~~-------------~~~~~----~~~i~~~~~~~~~~~---~~a~~vm~~~~~~~l~~l-kd~~G~~l~~~~~~~~~~ 314 (394) ++. ....+ ++.+.++...+-... ..-.++++|..+..|.+. ....+.+-...++..+.. T Consensus 159 ~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Lde~dVP~~~R~~vv~P~~y~~LLk~~~~~~~~~~~~~~~~~G~V 238 (347) T protein:vir:94 159 GKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLTGNYVPSSDRVFYTTPDNYSAILAALMPNAANYQALIDPSTGSI 238 (347) T ss_pred CcceeEeeeccccccccccccHHHHHHHHHHHHHHhhhcCCCCCCCEEEeChHHHHHHHHhhccccccccccccccccee Confidence 000 00111 344554444333222 234667899999887653 334444433334556667 Q ss_pred ccccccceEEecCccccc-----------------------CceEEEeccccEE-EE--------eecceEEEEeecc-c Q lcl|Aclame:pro 315 KVLLGKPVFVLSDEVLGA-----------------------NKAFIGDFKRGVL-FA--------DRKDLGLRWADNE-I 361 (394) Q Consensus 315 ~~l~G~pV~~~~~~~~~~-----------------------~~~~~gd~~~~~~-~~--------~~~~~~i~~~~~~-~ 361 (394) +++.|+||+.+.+.+... ..-|=+||+..+- ++ .-.++.++..++. + T Consensus 239 ~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv~~~~~~~e~~~~~~~ 318 (347) T protein:vir:94 239 RNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTVKLKDMALERARRANF 318 (347) T ss_pred EEeeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEEEechhhhhhhhhcccceeeeechhh Confidence 899999999876543211 1123345554322 22 1244455554443 3 Q ss_pred ccceEEEEEEeccEEecccceEEEEecCc Q lcl|Aclame:pro 362 YGQYLQAVLRFGVSKVDDKAGYYVTFTPE 390 (394) Q Consensus 362 ~~~~~r~~~r~d~~v~~~~af~~l~~~~~ 390 (394) +...+.+..-+|..++||++-+.++++.+ T Consensus 319 ~~~~i~~~~a~G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 319 QADQIIAKYAMGHGGLRPEACGALVFKKA 347 (347) T ss_pred hhhhhhhhhhhcCcccccceeEEEEecCC Confidence 44567788888999999999999999999 No 131 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=98.78 E-value=8.7e-10 Score=70.27 Aligned_cols=273 Identities=11% Similarity=0.030 Sum_probs=147.6 Q ss_pred HHHHHHHHhhhhhhhhhhccc-ccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCC-ceeEEEEecCCCccccccc Q lcl|Aclame:pro 112 DEVLMPINETTPVEPQKDGIK-KENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKA-SGKYPVLQRATTKMVTVAE 189 (394) Q Consensus 112 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~e 189 (394) ... .............+.. ....-.+--+.++.++.......+.+++++++.++.++ +.++|+. .......... T Consensus 1 ~~~--~~~~~~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~s~~~~~~~~r~i~~gks~~~~~i--G~~~~~~~~~ 76 (345) T protein:vir:22 1 MAS--MTGGQQMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL--GRTQAAYLAP 76 (345) T ss_pred Ccc--cccchhcccccccccccCCchhHHHHHHHhHHHHHHHHHHhhhcccceeeeccccceEEEeee--cceEEEeeec Confidence 000 0000000000000000 11111344588899999998889999999999888754 6667765 3344444554 Q ss_pred ccccccc-cccccce--eeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcc------------- Q lcl|Aclame:pro 190 LEKNPAL-AKPDFKD--VAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVL------------- 253 (394) Q Consensus 190 ~~~~~~~-~~~~~~~--v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~------------- 253 (394) +.+.... .++..++ ++++-.+++... |.+---.++..|+.+.+.++.+.++++..|.+++.-. T Consensus 77 G~~l~~~~~~~~~~e~~ltID~~~y~~~~-VddiD~~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~~~~~~~~ 155 (345) T protein:vir:22 77 GENLDDKRKDIKHTEKVITIDGLLTADVL-IYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESKYNEN 155 (345) T ss_pred CCCCCCCCCCcccceEEEEecchhhhhhh-HhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc Confidence 5443321 2355677 555554444422 2221112355689999999999999999998775211 Q ss_pred ----ccc---------cc--ccccc----HHHHHHHHHhhhhhh---cccEEEEcHHHHHHHHhhhc-cCCceeeccccc Q lcl|Aclame:pro 254 ----KSF---------TT--KTVKN----LDEIKALLNGGFDPA---YNVSLIVSQSFYQTLDTLKD-GNGRYLLQDDIT 310 (394) Q Consensus 254 ----~~~---------~~--~~~~~----~~~i~~~~~~~~~~~---~~a~~vm~~~~~~~l~~lkd-~~G~~l~~~~~~ 310 (394) +.+ .. ....+ ++.+.++...+-... ..-.+|++|..+..|..-+. .+..|.-..+.. T Consensus 156 ~~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~ 235 (345) T protein:vir:22 156 IEGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTKNYVPAADRVFYCDPDSYSAILAALMPNAANYAALIDPE 235 (345) T ss_pred ccccccccccccccccccccccccCHHHHHHHHHHHHHHhhhcCCCccCCEEEeChHHHHHHhccccccccccccccccc Confidence 110 00 01111 344444433332222 23468899999998854322 123333222334 Q ss_pred CCCcccccccceEEecCcccc---------------------cCce---------EEEeccccEEEEeecceEEEEeecc Q lcl|Aclame:pro 311 AVSGKVLLGKPVFVLSDEVLG---------------------ANKA---------FIGDFKRGVLFADRKDLGLRWADNE 360 (394) Q Consensus 311 ~~~~~~l~G~pV~~~~~~~~~---------------------~~~~---------~~gd~~~~~~~~~~~~~~i~~~~~~ 360 (394) .|..++++|++|+.+.+.+.. .... +|..-+ ++..+.-.++.++..++. T Consensus 236 ~G~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~~-A~~~v~~~~~~~e~~r~~ 314 (345) T protein:vir:22 236 KGSIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFMHRS-AVGTVKLRDLALERARRA 314 (345) T ss_pred cceEEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCceEEEEEehh-heeeeeeecceeeeeech Confidence 455678999999987543211 1111 111111 122233344555555443 Q ss_pred -cccceEEEEEEeccEEecccceEEEEecCc Q lcl|Aclame:pro 361 -IYGQYLQAVLRFGVSKVDDKAGYYVTFTPE 390 (394) Q Consensus 361 -~~~~~~r~~~r~d~~v~~~~af~~l~~~~~ 390 (394) ++...+++.+-+|..++||++.+.|+++-. T Consensus 315 ~~~~d~I~~~~a~G~~vlRPeaa~~i~~~~~ 345 (345) T protein:vir:22 315 NFQADQIIAKYAMGHGGLRPEAAGAVVFKVE 345 (345) T ss_pred hHHHHHHHHHHhcCCcccccceeEEEEEeeC Confidence 345567888889999999999999988888 No 132 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=98.77 E-value=2.4e-10 Score=73.31 Aligned_cols=270 Identities=11% Similarity=0.032 Sum_probs=144.0 Q ss_pred HHhhhhh---hhhhhc--ccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCC-ceeEEEEecCCCccccccccc Q lcl|Aclame:pro 118 INETTPV---EPQKDG--IKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKA-SGKYPVLQRATTKMVTVAELE 191 (394) Q Consensus 118 ~~~~~~~---~~~~~~--~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~e~~ 191 (394) +...... ...... ......-.+--+.++.++.......+.+++++++.++.++ +.++|+.. .........+. T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~ie~~~geV~~~f~~~s~~~~~~~~r~i~~g~s~~~~~iG--~~~~~~~~~G~ 78 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFLKVFGGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVLG--RTQAAYLAPGE 78 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHHHHHHHHHHHHHHHHhhhcccceeeeecccceEEEEeec--eeEEEeeecCC Confidence 0000000 000000 0001111233388899999998889999999999888754 66777653 33344444444 Q ss_pred ccccc-cccccceeeec--HhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcc--------------- Q lcl|Aclame:pro 192 KNPAL-AKPDFKDVAWN--IDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVL--------------- 253 (394) Q Consensus 192 ~~~~~-~~~~~~~v~~~--~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~--------------- 253 (394) +.... .++.-++++|. -.++.. ..|.+---..+..|+.+.+.++.+.++++..|..++.-. T Consensus 79 ~l~~t~~~~~~~e~~l~ID~~~y~~-~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~~~~~~~~~ 157 (344) T protein:vir:10 79 NLDDIRKDIKHTEKVITIDGLLTAD-VLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVESQYNENIT 157 (344) T ss_pred CCCCCCCCcccceEEEEEcchhhhh-hhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccc Confidence 44322 12344554444 433333 233322222355689999999999999999988764321 Q ss_pred --cccc--------c---ccccc----HHHHHHHHHhhhhh---hcccEEEEcHHHHHHHHhhhc-cCCceeecccccCC Q lcl|Aclame:pro 254 --KSFT--------T---KTVKN----LDEIKALLNGGFDP---AYNVSLIVSQSFYQTLDTLKD-GNGRYLLQDDITAV 312 (394) Q Consensus 254 --~~~~--------~---~~~~~----~~~i~~~~~~~~~~---~~~a~~vm~~~~~~~l~~lkd-~~G~~l~~~~~~~~ 312 (394) +++. . ....+ ++.+.++...+-.. ...-..|++|..+..|..-+. .++.|.-......| T Consensus 158 g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G 237 (344) T protein:vir:10 158 GLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILAALMPNAANYAALIDPEKG 237 (344) T ss_pred cccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhhcccccccccccccceeee Confidence 0000 0 01111 23333333222222 223467789999998754321 12222222223445 Q ss_pred CcccccccceEEecCcccc-----------c--------CceEEEeccccE-E--------EEeecceEEEEeec-cccc Q lcl|Aclame:pro 313 SGKVLLGKPVFVLSDEVLG-----------A--------NKAFIGDFKRGV-L--------FADRKDLGLRWADN-EIYG 363 (394) Q Consensus 313 ~~~~l~G~pV~~~~~~~~~-----------~--------~~~~~gd~~~~~-~--------~~~~~~~~i~~~~~-~~~~ 363 (394) ..++++|+||+.+.+.+.+ . ...+..||+..+ + .+.-.++.++..++ .++. T Consensus 238 ~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~~~~~~e~~r~~~~~~ 317 (344) T protein:vir:10 238 SIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKLRDLALERARRANFQA 317 (344) T ss_pred EEEEEeceEEEeccccccccCCcccccccCccccccCCcccceeeecceeEEEeechhhhhhhhhccceeecccchhHHH Confidence 5578999999987654321 0 111223443321 1 11224445554443 3455 Q ss_pred ceEEEEEEeccEEecccceEEEEecCc Q lcl|Aclame:pro 364 QYLQAVLRFGVSKVDDKAGYYVTFTPE 390 (394) Q Consensus 364 ~~~r~~~r~d~~v~~~~af~~l~~~~~ 390 (394) ..+++.+-+|.+++||++.+.+++++. T Consensus 318 d~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 318 DQIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred HHHHHHhhcccceecccceEEEEeecC Confidence 567888889999999999999999998 No 133 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=98.73 E-value=7e-10 Score=70.76 Aligned_cols=264 Identities=12% Similarity=0.033 Sum_probs=147.1 Q ss_pred hhhcccccCCcccc-chhHHhHHHHHHHhhhhhhheeeeEeecC-CceeEEEEecCCCcccccccccccc-cccccccce Q lcl|Aclame:pro 127 QKDGIKKENAKPVS-SEEILYTPAREVKTVVDLKPFTTVYQAKK-ASGKYPVLQRATTKMVTVAELEKNP-ALAKPDFKD 203 (394) Q Consensus 127 ~~~~~~~~~~~~lv-P~~~~~~I~~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~e~~~~~-~~~~~~~~~ 203 (394) .+.+..+++...++ |+.|+..|...+.+......++++...+. .++.+|.. +..+.......+... +.-+.+=.. T Consensus 1 ~~~~n~ts~~qafi~~EiWsa~il~~l~~~Lv~~~~~~~~d~g~GDtV~InsI--g~~tV~dY~~~~~i~~d~ltt~~~~ 78 (322) T protein:vir:31 1 MSTGNNTSNTQALIVSEIWADEIEDILHEKLLDVNIARVVDFPDGDKLTIPSV--GTPVVRSRPEQGDFTFDNLDTGEIS 78 (322) T ss_pred CCCCCCcccceEEeehhhhHHHHHHHhhhhhhhhhhhcccccCCCCeEEeccc--cccccccccCCCCcccccCCCceEE Confidence 44455666666655 99999999988877666556666444332 34555543 233333333333211 111222235 Q ss_pred eeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcc----------ccc-------------cccc Q lcl|Aclame:pro 204 VAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVL----------KSF-------------TTKT 260 (394) Q Consensus 204 v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~----------~~~-------------~~~~ 260 (394) +.++..|+.++. |++...+ ...+|.+...++.+++++...|..+.+-. ++. +... T Consensus 79 l~IDq~KYfaf~-VdDD~~Q-a~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~~~~~~~p~vin~~~~~iv~~gt~~ 156 (322) T protein:vir:31 79 IILRDEVYAGNA-ISKKLRQ-DSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQFAGQNDPNVINGVPHRFVGTGTDQ 156 (322) T ss_pred EEEehhhhhccc-cchhHHH-hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCcceecCCccceeccCCCc Confidence 667777777655 7775555 45689999999999888877666543311 110 0111 Q ss_pred cccHHHHHHHHHhhhhhh---cccEEEEcHHHHHHHHhhh-----ccCCcee--ecccccCC--CcccccccceEEecCc Q lcl|Aclame:pro 261 VKNLDEIKALLNGGFDPA---YNVSLIVSQSFYQTLDTLK-----DGNGRYL--LQDDITAV--SGKVLLGKPVFVLSDE 328 (394) Q Consensus 261 ~~~~~~i~~~~~~~~~~~---~~a~~vm~~~~~~~l~~lk-----d~~G~~l--~~~~~~~~--~~~~l~G~pV~~~~~~ 328 (394) ...|+.++++-.++-... ..-..|++|..+..|..+. -.++++. ...+...+ ..++++|+.|+++... T Consensus 157 ~~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~i~~~~~l~~D~rf~~i~~sG~a~g~~~Vg~~~GF~V~~SN~l 236 (322) T protein:vir:31 157 TMDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLETITNISNISNNPRWEGIVESGIAPDMQFVRSVYGIDLFVSNLL 236 (322) T ss_pred hhhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhhhhhhhhhhccccccccccccchhhHHHHHHHhceeeeeeccc Confidence 235777777755544333 2345678999887774321 1233432 22222222 1478999999887654 Q ss_pred ccccCceEEE---------eccccEEEEee----------cceEEEEe-ecccccceEEEEEEeccEEecccceEEEEec Q lcl|Aclame:pro 329 VLGANKAFIG---------DFKRGVLFADR----------KDLGLRWA-DNEIYGQYLQAVLRFGVSKVDDKAGYYVTFT 388 (394) Q Consensus 329 ~~~~~~~~~g---------d~~~~~~~~~~----------~~~~i~~~-~~~~~~~~~r~~~r~d~~v~~~~af~~l~~~ 388 (394) +.++-+++.| -++-+..+.+. +-.+-+-. ++..|...+|+.+|+|.++.+|+..+.|.-+ T Consensus 237 ~~~~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~~~l~~~e~~r~~~~~~d~~~~~~~~g~g~~r~e~l~~~~a~ 316 (322) T protein:vir:31 237 ADANETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEMPTTKSFIDDYNDDLNTATTARWGNGLVRDENLVCVLAN 316 (322) T ss_pred cccccccccCcccccccceeecccccccchhhhhhhhHhhhhhhhhcccCccccccceeeeeeecceeecccceEEEEec Confidence 3222222222 11111111111 11111111 1233566789999999999999999999888 Q ss_pred CccCCC Q lcl|Aclame:pro 389 PEPLPL 394 (394) Q Consensus 389 ~~~~~~ 394 (394) +.++-. T Consensus 317 ~~~~~~ 322 (322) T protein:vir:31 317 ADKVTF 322 (322) T ss_pred cccccC Confidence 888777 No 134 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=98.72 E-value=5.7e-10 Score=71.24 Aligned_cols=271 Identities=13% Similarity=0.083 Sum_probs=141.3 Q ss_pred HHhhhh-hhhhhhcccccCC--ccccchhHHhHHHHHHHhhhhhhheeeeEeecCC-ceeEEEEecCCCccccccccccc Q lcl|Aclame:pro 118 INETTP-VEPQKDGIKKENA--KPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKA-SGKYPVLQRATTKMVTVAELEKN 193 (394) Q Consensus 118 ~~~~~~-~~~~~~~~~~~~~--~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~e~~~~ 193 (394) +..... ......+.....+ -.+--+.+..+++......+.+++++++.++.++ ++.+|+. +.........+... T Consensus 1 m~~~~~~~~~t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~~~~~~~~r~i~~G~sv~i~~i--G~~tv~~~t~G~~l 78 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKSSSDALALFLKVFAGEVLTAFTRRSVTADKHIVRTIQNGKSAQFPVM--GRTSGVYLAPGERL 78 (347) T ss_pred CCCCCccccccccccCCccccHHHHHHHHHhHHHHHHHHHHHhhhcccccccccccceEEEecc--cceeeeeecCCCCc Confidence 000000 0000011111111 1333478888888888888889999988886643 5666654 23334444443333 Q ss_pred ccc-cccccce--eeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccc---------------- Q lcl|Aclame:pro 194 PAL-AKPDFKD--VAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLK---------------- 254 (394) Q Consensus 194 ~~~-~~~~~~~--v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~---------------- 254 (394) ... .+..-.+ ++++-.+++. ..|.+-=-..+..|+.+.+.++.+.++++..|..|+.-.. T Consensus 79 ~~~~~~~~~~e~~itID~~~~~~-~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~~~~~~~g~ 157 (347) T protein:vir:94 79 SDKRKGIKHTEKVITIDGLLTAD-VMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAASNENIAGL 157 (347) T ss_pred CCCCCCCCcceEEEEecchhhhh-HHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCC Confidence 211 1123344 4444444333 2232211112455788999999999999999987753110 Q ss_pred -cccc-----cc-----ccc----HHHHHHHHHhhhhhh---cccEEEEcHHHHHHHHhhhc-cCCceeecccccCCCcc Q lcl|Aclame:pro 255 -SFTT-----KT-----VKN----LDEIKALLNGGFDPA---YNVSLIVSQSFYQTLDTLKD-GNGRYLLQDDITAVSGK 315 (394) Q Consensus 255 -~~~~-----~~-----~~~----~~~i~~~~~~~~~~~---~~a~~vm~~~~~~~l~~lkd-~~G~~l~~~~~~~~~~~ 315 (394) .+.. .+ ..+ ++.+.++...+-... ..-..|++|..+..|..-+. .+..+.-..++..|.-+ T Consensus 158 ~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~~~~~~~~~~~~~~~~~~G~Vg 237 (347) T protein:vir:94 158 GTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILAALMPNAANYAALIDPETGNIR 237 (347) T ss_pred cccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhccchhhhhhccccccccccceE Confidence 0000 00 011 223333332322222 23478899999987753322 22223322334556667 Q ss_pred cccccceEEecCccccc----------------Cce--------EEEeccccEE-EE--------eecceEEEEeec-cc Q lcl|Aclame:pro 316 VLLGKPVFVLSDEVLGA----------------NKA--------FIGDFKRGVL-FA--------DRKDLGLRWADN-EI 361 (394) Q Consensus 316 ~l~G~pV~~~~~~~~~~----------------~~~--------~~gd~~~~~~-~~--------~~~~~~i~~~~~-~~ 361 (394) +++|++|+.+.+.+... ... +-|||+..+. ++ ...+++++..++ .+ T Consensus 238 ~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~~~ 317 (347) T protein:vir:94 238 NVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLFSHRSAVGTVKLRDLALERDRDVDA 317 (347) T ss_pred EEeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEEeehhhhhhhhcccccccchhchhh Confidence 99999999876554211 011 2233332211 11 122334443333 34 Q ss_pred ccceEEEEEEeccEEecccceEEEEecCcc Q lcl|Aclame:pro 362 YGQYLQAVLRFGVSKVDDKAGYYVTFTPEP 391 (394) Q Consensus 362 ~~~~~r~~~r~d~~v~~~~af~~l~~~~~~ 391 (394) +...+++.+.+|.+++||++.+.|+.++|= T Consensus 318 ~~d~i~~~~~~G~~~~rP~~a~~~~~~~A~ 347 (347) T protein:vir:94 318 QGDLIVGKYAMGHGGLRPEAAGALVFSPAE 347 (347) T ss_pred HHHHhhhhhhhcCcccccceeEEEEecCCC Confidence 556788999999999999999999998766 No 135 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=98.72 E-value=8.4e-10 Score=70.35 Aligned_cols=272 Identities=10% Similarity=0.020 Sum_probs=144.3 Q ss_pred HHhhhhhhhhhhcccccCCc-cccchhHHhHHHHHHHhhhhhhheeeeEeecCC-ceeEEEEecCCCccccccccccccc Q lcl|Aclame:pro 118 INETTPVEPQKDGIKKENAK-PVSSEEILYTPAREVKTVVDLKPFTTVYQAKKA-SGKYPVLQRATTKMVTVAELEKNPA 195 (394) Q Consensus 118 ~~~~~~~~~~~~~~~~~~~~-~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~e~~~~~~ 195 (394) +...............++.. .+--+.++.++.......+.++++.++.++.++ +..+|+.. .........+.+.. T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~s~~~~~iG--~~~~~~~~~g~~l~- 77 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTNQLRVDRVG--ASTIAGRKAGEELV- 77 (334) T ss_pred CCCCcCCCccccccccccchheehhhhhhhHHHHHHHHhhhhhccceeeeccccceEEEeeec--ceeeeeecCCCCCC- Confidence 11110000001111222222 233388999999998888999999999988754 67777653 23333444433333 Q ss_pred ccccccceeeecHhh-hhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcc----cc------------c-- Q lcl|Aclame:pro 196 LAKPDFKDVAWNIDT-YRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVL----KS------------F-- 256 (394) Q Consensus 196 ~~~~~~~~v~~~~~~-~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~----~~------------~-- 256 (394) .....-+++++.... ++.-..|.+---..+..|+.+.+.+..+.++++..|.+++... .. + T Consensus 78 ~~~~~~~~~~l~ID~~l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~~~G~~ 157 (334) T protein:vir:80 78 VQKNVSDKLNLTVDTVLYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAFHDGIL 157 (334) T ss_pred CCCcccCceEEEEeeeeehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccCCcc Confidence 233444665665555 2333334432222355689999999999999999998764221 00 0 Q ss_pred -------ccc-ccccHHHHHHH----HHhhhhhh------cccEEEEcHHHHHHHHhhhccCCc-eeecc---cccCCCc Q lcl|Aclame:pro 257 -------TTK-TVKNLDEIKAL----LNGGFDPA------YNVSLIVSQSFYQTLDTLKDGNGR-YLLQD---DITAVSG 314 (394) Q Consensus 257 -------~~~-~~~~~~~i~~~----~~~~~~~~------~~a~~vm~~~~~~~l~~lkd~~G~-~l~~~---~~~~~~~ 314 (394) +.. ..++.+.+.++ ...+.... ..-+.+++|..|..|..-..-..+ |...+ ....+.- T Consensus 158 ~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~~s~~~~~~~~g~i 237 (334) T protein:vir:80 158 LPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFGAKEGGNSFVGGRI 237 (334) T ss_pred eeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccceeccccccccccceeE Confidence 000 11223333332 22222221 224788999999998753221111 11111 1233445 Q ss_pred ccccccceEEecCccccc---------CceEEEeccccEEEE-ee--------cceEEEEeecc-cccceEEEEEEeccE Q lcl|Aclame:pro 315 KVLLGKPVFVLSDEVLGA---------NKAFIGDFKRGVLFA-DR--------KDLGLRWADNE-IYGQYLQAVLRFGVS 375 (394) Q Consensus 315 ~~l~G~pV~~~~~~~~~~---------~~~~~gd~~~~~~~~-~~--------~~~~i~~~~~~-~~~~~~r~~~r~d~~ 375 (394) .+++|+||+.+.+.+..+ ...+=|||++.+..+ -+ .+++.+..++. .+...+.+++-+|.+ T Consensus 238 ~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~d~i~~~~a~G~g 317 (334) T protein:vir:80 238 AMLNGVRVVETPRFPQSAITANALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEKKDFGHYLDTFQSYNIG 317 (334) T ss_pred EEEeceEEEeecCCCCccccccccccccccccccccceEEEEEeCceEEEEEEeecceeeeechhhHHHHHHHHHHcCCc Confidence 689999999876544221 234557776643222 22 12222222222 233345666678999 Q ss_pred EecccceEEEEecCccC Q lcl|Aclame:pro 376 KVDDKAGYYVTFTPEPL 392 (394) Q Consensus 376 v~~~~af~~l~~~~~~~ 392 (394) ++||+|.+.++++.+-- T Consensus 318 ~lRPeaa~vv~~~~~~~ 334 (334) T protein:vir:80 318 QRRPDAVAVHDITVTNP 334 (334) T ss_pred eeccceEEEEEEeeecC Confidence 99999988888776543 No 136 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=98.68 E-value=3.5e-09 Score=66.95 Aligned_cols=278 Identities=9% Similarity=-0.028 Sum_probs=142.2 Q ss_pred HHHHHH--HHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCC-ceeEEEEecCCCcccccc Q lcl|Aclame:pro 112 DEVLMP--INETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKA-SGKYPVLQRATTKMVTVA 188 (394) Q Consensus 112 ~~~~~~--~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 188 (394) ...... +..... ............-.+--+.++.++.......+.+++++++.++.++ +.++++.. ........ T Consensus 1 ~~~~~~~~~~~~n~-~t~~~~~~~~~~~al~le~f~geV~~~f~~~si~~~~~~~rti~~Gksv~f~~iG--~~t~~~~t 77 (375) T protein:vir:10 1 MANANQVALGRSNL-STGTGYGGATDKYALYLKLFSGEMFKGFQHETIARDLVTKRTLKNGKSLQFIYTG--RMTSSFHT 77 (375) T ss_pred CccccccccCcccc-CCccccccccchHHHHHHHHhHHHHHHHHHHHhhhccccccccccCceEEEEeee--eeEEeeec Confidence 000000 000000 0000000111112344588899999888889999999998887754 66677653 23333333 Q ss_pred ccccccc--ccccccce--eeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccc---------- Q lcl|Aclame:pro 189 ELEKNPA--LAKPDFKD--VAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLK---------- 254 (394) Q Consensus 189 e~~~~~~--~~~~~~~~--v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~---------- 254 (394) .+.+... ..+....+ ++++-.++... .|.+---.++..|+.+.+.++.+.++++..|..++.-.- T Consensus 78 ~G~~i~~~~~~d~~~te~~l~ID~~~y~~~-~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~~ 156 (375) T protein:vir:10 78 PGTPILGNADKAPPVAEKTIVMDDLLISSA-FVYDLDETLAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASPVS 156 (375) T ss_pred CCcCcCCccccCCCCCceEEEecchhhhhh-hHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccc Confidence 3333211 11222232 55555444432 233211223556899999999999999999987753210 Q ss_pred ---------------cccc-ccccc----HHHHHHHHHhhhhhh---cccEEEEcHHHHHHHHhhhccC----Cceeecc Q lcl|Aclame:pro 255 ---------------SFTT-KTVKN----LDEIKALLNGGFDPA---YNVSLIVSQSFYQTLDTLKDGN----GRYLLQD 307 (394) Q Consensus 255 ---------------~~~~-~~~~~----~~~i~~~~~~~~~~~---~~a~~vm~~~~~~~l~~lkd~~----G~~l~~~ 307 (394) +++. ...++ ++.+.++...+-... ..-.++++|..+..|.+-+|.+ ..+.-.. T Consensus 157 ~~~~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll~~~d~~~~~n~d~~~~~ 236 (375) T protein:vir:10 157 ATNFVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALIQDIGSNGLVNRDVQGSA 236 (375) T ss_pred cccccccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHHhcCCccceeeecccccc Confidence 0000 00112 444555443333322 2346889999999887655432 1111111 Q ss_pred cccCCCcccccccceEEecCccccc-----------------------------------CceEEEec---ccc-EEEEe Q lcl|Aclame:pro 308 DITAVSGKVLLGKPVFVLSDEVLGA-----------------------------------NKAFIGDF---KRG-VLFAD 348 (394) Q Consensus 308 ~~~~~~~~~l~G~pV~~~~~~~~~~-----------------------------------~~~~~gd~---~~~-~~~~~ 348 (394) ....+...+++|++|+.+.+.+... ...|-+|| ++. -+++. T Consensus 237 ~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~~~~~~~~ 316 (375) T protein:vir:10 237 LQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAKSCGLIFQ 316 (375) T ss_pred eeccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeeccccccccccccccCceEEEEEc Confidence 1122333589999998865433211 12344455 211 12222 Q ss_pred --------ecceEEEEee---ccccc-ceEEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 349 --------RKDLGLRWAD---NEIYG-QYLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 349 --------~~~~~i~~~~---~~~~~-~~~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) -.++.++.+. +..++ ..+.+.+=+|..+.||++.+.|+... +.|. T Consensus 317 ~~A~g~v~~~~~~~~~~~~~~~~~~q~~~i~~~~a~G~~~lrp~~av~l~~~~-~~~~ 373 (375) T protein:vir:10 317 KEAAGVVEAIGPQVQVTNGDVSVIYQGDVILGRMAMGADYLNPAAAVELYIGA-TAPS 373 (375) T ss_pred hhheeeeeeeccccccccchhhheeeeeeeeeeeeeccCccCceeEEEEecCc-Cccc Confidence 2344444432 22222 24566677899999999999998873 5666 No 137 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=98.67 E-value=7.2e-09 Score=65.23 Aligned_cols=271 Identities=8% Similarity=0.020 Sum_probs=144.3 Q ss_pred HHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCC-ceeEEEEecCCCcccccccccccccc Q lcl|Aclame:pro 118 INETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKA-SGKYPVLQRATTKMVTVAELEKNPAL 196 (394) Q Consensus 118 ~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~e~~~~~~~ 196 (394) +...........+ .+.....+--+.+..++.....+.+.++++.++.++.++ +.++|+..... ......|.. +.. T Consensus 1 ms~~n~~t~~~~~-~~~~~~al~le~f~geV~taf~~~s~~~~~~~~rti~~gkS~q~~~iG~~~--~~~~~~G~~-ld~ 76 (364) T protein:vir:10 1 MSNPNVLTQPAVS-ASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVVGTNSVSNKYIGETE--LQVLSPGKS-PDA 76 (364) T ss_pred CCCcccccccccc-cccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEeeeeeeeE--EeeeccCcc-cCC Confidence 1111111110111 111223344578888998888888999999999887765 67777763322 222222222 222 Q ss_pred cccccceeeecHhhhh-hhhhhhH--HHHhccHHH-HHHHHHHHHHHHHHHHHHHHHhhccc--------------cccc Q lcl|Aclame:pro 197 AKPDFKDVAWNIDTYR-GAIPLSQ--ESIDDADVD-LVGIVSESISQIKVNTTNDAIAKVLK--------------SFTT 258 (394) Q Consensus 197 ~~~~~~~v~~~~~~~~-~~~~vs~--ell~ds~~~-l~~~i~~~l~~~~~~~~~~a~~~g~~--------------~~~~ 258 (394) ..+.-++.++...++- .-..|.+ +.. +.++ +.+.+.+++++++++..|..++.-.- ...+ T Consensus 77 ~~~~~~k~~itID~ll~a~~~V~diDe~q--~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~~~~~~~~ 154 (364) T protein:vir:10 77 SPTEFDKNRLVVDTTVIARNTVAHFHDVQ--NDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIRKNPRVAG 154 (364) T ss_pred CCcccCcEEEEecceeeechhhhhHHHHh--cCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccCCcccC Confidence 3445556555555432 1122222 222 3455 67888899999998888887642110 0000 Q ss_pred -----------cc-cccHHHHH----HHHHhhhhh---hcccEEEEcHHHHHHHHhhhccCC-ceee--cccccCCCccc Q lcl|Aclame:pro 259 -----------KT-VKNLDEIK----ALLNGGFDP---AYNVSLIVSQSFYQTLDTLKDGNG-RYLL--QDDITAVSGKV 316 (394) Q Consensus 259 -----------~~-~~~~~~i~----~~~~~~~~~---~~~a~~vm~~~~~~~l~~lkd~~G-~~l~--~~~~~~~~~~~ 316 (394) .+ .+..+.+. ++...+-.. ...-+++++|..|..|.+-.+=-. .|.. ..+...+...+ T Consensus 155 ~g~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~~lvn~d~~~~~~~~~~~G~v~~ 234 (364) T protein:vir:10 155 HGFSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDADRIVDKSYTIAASDNTVDGFVLK 234 (364) T ss_pred CcceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCCccccccccccCCCccccceeEE Confidence 00 11112233 333222222 223578899999988876321110 0110 01233444568 Q ss_pred ccccceEEecCcccc---------------------cCceEEEecccc-EEEEee--------cceEEEEeeccc-ccce Q lcl|Aclame:pro 317 LLGKPVFVLSDEVLG---------------------ANKAFIGDFKRG-VLFADR--------KDLGLRWADNEI-YGQY 365 (394) Q Consensus 317 l~G~pV~~~~~~~~~---------------------~~~~~~gd~~~~-~~~~~~--------~~~~i~~~~~~~-~~~~ 365 (394) ++|+||+.+.+.+.. +...+.|||+.. .+++-+ .+++.+..++.. +... T Consensus 235 v~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~tv~~~~~t~e~~~~~~~~~~~ 314 (364) T protein:vir:10 235 SWNTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALLVGRTISITGDIFYEKKEKTWY 314 (364) T ss_pred EeceEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEEEEEEecceeeeeeccceeeee Confidence 999999886554321 111123555432 223333 455555444433 3334 Q ss_pred EEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 366 LQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 366 ~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) +.+++-+|..++||++.+.++...+.+|- T Consensus 315 ida~~a~G~g~lRPeaa~~i~~~~~~~~~ 343 (364) T protein:vir:10 315 IDTFLAEGAIPDRWEAVAVVTAADTAELA 343 (364) T ss_pred eeeehcccCcccCccceEEEEecCCCCCc Confidence 56677789999999999999999999998 No 138 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=98.67 E-value=2.3e-09 Score=67.98 Aligned_cols=271 Identities=10% Similarity=0.018 Sum_probs=141.7 Q ss_pred HHHHHhhhhhhhhhhcccccCCc---cccchhHHhHHHHHHHhhhhhhheeeeEeecC-CceeEEEEecCCCcccccccc Q lcl|Aclame:pro 115 LMPINETTPVEPQKDGIKKENAK---PVSSEEILYTPAREVKTVVDLKPFTTVYQAKK-ASGKYPVLQRATTKMVTVAEL 190 (394) Q Consensus 115 ~~~~~~~~~~~~~~~~~~~~~~~---~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~e~ 190 (394) ...+...+.-.....+...+.+. .+--+.++.++.......+.+++++++.++.+ .++.+|+.. ..+......+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~d~~~al~le~~~geV~~~f~~~s~~~~~~~~r~i~~G~tv~i~~ig--~~~~~~~~~g 78 (332) T protein:vir:78 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFNNASIFKGLVRSYDLRGGKSKQFMFTG--KLSAGYHTPG 78 (332) T ss_pred CcccccccCCccccCCccccccccchhhhhhhhhhhHHHHHHHHhhhhhccccccccccceEEEEecc--ceeEeeecCC Confidence 11111111111111122222221 24348899999999988899999998877654 366666643 3333443433 Q ss_pred cccccccccccce--eeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc------------ Q lcl|Aclame:pro 191 EKNPALAKPDFKD--VAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSF------------ 256 (394) Q Consensus 191 ~~~~~~~~~~~~~--v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~------------ 256 (394) .......++.-.+ ++++-.++..+ .|.+=--.++..++.+.+.++.+.++++..|..++.-...+ T Consensus 79 ~~l~~~~~~~~~~~~l~ID~~ky~~~-~VddiD~~q~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~~~~~~~~~g 157 (332) T protein:vir:78 79 TPIVGDAGIKANEKTLVMDDLLVSSQ-FVYSLDEIFSQYSTRAEVSKQIGEALATHYDERIARVLAKASAEASPVTGEPG 157 (332) T ss_pred CCCCCCCCCCCceEEEEEehhhhhHH-HHHhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccCccccccc Confidence 3332111222233 55555444442 23221112345689999999999999999888765432110 Q ss_pred ------ccccccc----HHHHHHHHHhhhhhhc---ccEEEEcHHHHHHHHhhhccC--Ccee-ec-ccccCC-Cccccc Q lcl|Aclame:pro 257 ------TTKTVKN----LDEIKALLNGGFDPAY---NVSLIVSQSFYQTLDTLKDGN--GRYL-LQ-DDITAV-SGKVLL 318 (394) Q Consensus 257 ------~~~~~~~----~~~i~~~~~~~~~~~~---~a~~vm~~~~~~~l~~lkd~~--G~~l-~~-~~~~~~-~~~~l~ 318 (394) +..+..+ ++.|.++...+-.... +-.+|++|..+..|.+..|.. .++. -. ..+.++ ..++++ T Consensus 158 ~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~d~~~~n~~~~~~~~~~~~g~~i~~i~ 237 (332) T protein:vir:78 158 GFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVLSPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIA 237 (332) T ss_pred ccccccCCccccCHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHHhhcCceeeeeeccccccceecceeeeEEe Confidence 1111222 3445554444333322 335778999999887643321 0000 00 012222 246899 Q ss_pred ccceEEecCccccc------------CceEEEeccccE-EEEeec--------ceEEEEee----cccccceEEEEEEec Q lcl|Aclame:pro 319 GKPVFVLSDEVLGA------------NKAFIGDFKRGV-LFADRK--------DLGLRWAD----NEIYGQYLQAVLRFG 373 (394) Q Consensus 319 G~pV~~~~~~~~~~------------~~~~~gd~~~~~-~~~~~~--------~~~i~~~~----~~~~~~~~r~~~r~d 373 (394) |++|+.+.+.+... ...+-|||+... +++-+. +..++... +.++...+++.+.+| T Consensus 238 G~~V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~h~~a~~~v~~~~~~~~~t~~~~~~~~~~d~i~~~~~~G 317 (332) T protein:vir:78 238 GIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIFHREAAGCIQSVAPTIQTTSGDFNVQYQGDLIVGKLAMG 317 (332) T ss_pred eeEEEecCccccCcccccccccccccccccccccccceEEeecccceeeeeeeccchhhhhcccchhhhHhhhhhhhhhc Confidence 99998876544221 123445555421 222222 22332221 233445677778899 Q ss_pred cEEecccceEEEEec Q lcl|Aclame:pro 374 VSKVDDKAGYYVTFT 388 (394) Q Consensus 374 ~~v~~~~af~~l~~~ 388 (394) .+++||++.+.|+-+ T Consensus 318 ~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 318 CGSLRTSVAGSFQAA 332 (332) T ss_pred CceecccceEEEeeC Confidence 999999999999877 No 139 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=98.66 E-value=1.7e-09 Score=68.64 Aligned_cols=270 Identities=11% Similarity=0.021 Sum_probs=141.1 Q ss_pred HHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEe---ecCCceeEEEEecCCCcccccccccccc Q lcl|Aclame:pro 118 INETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQ---AKKASGKYPVLQRATTKMVTVAELEKNP 194 (394) Q Consensus 118 ~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~e~~~~~ 194 (394) +....+.. ....++......+|+.|+..|.+.+.....+.++++-.+ ..+.++++|... ..+......++... T Consensus 1 ~~~~~~~~--~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~d~~~~~~~Gdtv~ip~~g--~~~~~d~~~~~~i~ 76 (341) T protein:vir:94 1 MALGNTIT--GPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVKTWGAQVKKGDTFHVPRIS--ELGVEDKATDVPVG 76 (341) T ss_pred Ccchhhhc--cccccchhHHHHHHHHHHHHHHHHHHhhcchhhccccccccccCCceEEEeccC--cceeeeecCCCccc Confidence 11111100 111233333457899999999999988887777765433 223467777542 33344444444332 Q ss_pred cccccccceeeecHhh-hhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc----ccc---------- Q lcl|Aclame:pro 195 ALAKPDFKDVAWNIDT-YRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSF----TTK---------- 259 (394) Q Consensus 195 ~~~~~~~~~v~~~~~~-~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~----~~~---------- 259 (394) ..+.+-..++++..+ .+.-+.|++.-..++..|+.+.+.+....++++..|..++...... ++. T Consensus 77 -~~~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~~~~~~~t 155 (341) T protein:vir:94 77 -VQPVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFSSSNGAIT 155 (341) T ss_pred -cccccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCccccCcccccc Confidence 233344555555533 3455667765444556789999999999999988887765432111 000 Q ss_pred ---ccccHHHHHHHHHhhhh---hhcccEEEEcHHHHHHHHhhhccCC-ceeecccccCCCcccccccceEEecCccccc Q lcl|Aclame:pro 260 ---TVKNLDEIKALLNGGFD---PAYNVSLIVSQSFYQTLDTLKDGNG-RYLLQDDITAVSGKVLLGKPVFVLSDEVLGA 332 (394) Q Consensus 260 ---~~~~~~~i~~~~~~~~~---~~~~a~~vm~~~~~~~l~~lkd~~G-~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~ 332 (394) ....++.+.++...+-. |...-.+|++|..+..|.+...-.. .+.-...+..|..++|+|++|+.+.+.+... T Consensus 156 ~~~~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~~~l~~G~ig~i~G~~V~~Sn~lp~~~ 235 (341) T protein:vir:94 156 GNGQAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFTIPQFISKDFINNAPIAQGQIGSLMGVRVIRTSLIGNNS 235 (341) T ss_pred CchhhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhhchhhhhhhccccchhheeeeeeEeceEEEEeccccccc Confidence 01234555554433322 2223467899999999865321100 1222223556666799999999876544322 Q ss_pred CceE---------------------E----EeccccE-EEEeecce-EEEE------------------eec-ccccceE Q lcl|Aclame:pro 333 NKAF---------------------I----GDFKRGV-LFADRKDL-GLRW------------------ADN-EIYGQYL 366 (394) Q Consensus 333 ~~~~---------------------~----gd~~~~~-~~~~~~~~-~i~~------------------~~~-~~~~~~~ 366 (394) ...+ + +|+.... +++-+.-+ .++. +.. .++...+ T Consensus 236 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i 315 (341) T protein:vir:94 236 ATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAAAVVSKAPRVTQSFENREQVWLM 315 (341) T ss_pred cccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhhccccccccccccchhhhhhhhh Confidence 2111 0 1111000 00101000 0000 000 1112234 Q ss_pred EEEEEeccEEecccceEEEEecCccC Q lcl|Aclame:pro 367 QAVLRFGVSKVDDKAGYYVTFTPEPL 392 (394) Q Consensus 367 r~~~r~d~~v~~~~af~~l~~~~~~~ 392 (394) ++-.-+|.+++||++.+.|....+.. T Consensus 316 ~~~~~~G~~~lrp~~~v~~~~~~~~~ 341 (341) T protein:vir:94 316 VGRQAYGARLYRPLHAVNIHTTGDTV 341 (341) T ss_pred hhhhhhcccccCcceeEEEecCcCCC Confidence 55556899999999988777666655 No 140 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=98.66 E-value=6.3e-09 Score=65.56 Aligned_cols=256 Identities=7% Similarity=-0.034 Sum_probs=141.5 Q ss_pred cccccCCccccchhHHhHHHHHHHhhhhhhh---------eeeeEe--ecCCceeEEEEecCCCcccccccccccccccc Q lcl|Aclame:pro 130 GIKKENAKPVSSEEILYTPAREVKTVVDLKP---------FTTVYQ--AKKASGKYPVLQRATTKMVTVAELEKNPALAK 198 (394) Q Consensus 130 ~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~---------~~~~~~--~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~ 198 (394) ..++-....++|+.+.+.+.+...+.+.|.+ +..... .++...++|++..-++.+-.+.++...+ ... T Consensus 1 MA~T~lsd~i~peVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~~~v~~~~~i~-~~~ 79 (324) T protein:vir:59 1 MAYTKISDVIVPELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDSQVLNDTDDLV-PQK 79 (324) T ss_pred CCceeeeceechhHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcccccCCCcccc-hhh Confidence 1123334668899998888776666655422 111111 2345678888766555555666665554 345 Q ss_pred cccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcc---------------cccccccccc Q lcl|Aclame:pro 199 PDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVL---------------KSFTTKTVKN 263 (394) Q Consensus 199 ~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~---------------~~~~~~~~~~ 263 (394) .+-....-..+..+....++++...-+.-|....|.++++....+..+..++..+ .++...+..+ T Consensus 80 l~t~~~~a~i~~~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~g~~~~~~~~~~~~dvsa~~~~~~s 159 (324) T protein:vir:59 80 INAGQDKAVLILRGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELAGVFSNDDMKDNKLDISGTADGIYS 159 (324) T ss_pred cccceeeEEEEeecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccceeeeeccccceec Confidence 5555555555566666677775444444567777888888777666555433221 1122223356 Q ss_pred HHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCceeecccccCCCcccccccceEEecCcccc----cC----c Q lcl|Aclame:pro 264 LDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLG----AN----K 334 (394) Q Consensus 264 ~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~----~~----~ 334 (394) ++.+.++...+-+... -++|+||+.++..|++..-. .++... -....-++++|++|++.+.++.. .. + T Consensus 160 ~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li--~~~~~s-~~~~~i~~~~G~~VivdD~~p~~~~~~~~~~y~s 236 (324) T protein:vir:59 160 AETFVDASYKLGDHESLLTAIGMHSATMASAVKQDLI--EFVKDS-QSGIRFPTYMNKRVIVDDSMPVETLEDGTKVFTS 236 (324) T ss_pred HHHHHHHHHHhCCcccCcEEEEEchHHHHHHHHhhhh--hhcccc-ccCceeeeecccEEEEeCCCCccccCCCCceEEE Confidence 7788888877555443 36899999999999865311 122111 11223468999999998765432 11 3 Q ss_pred eEEEeccccEEEEe-ecceEEEEeeccccc-ceEEEEEEeccEEecccceEEEEec-CccCCC Q lcl|Aclame:pro 335 AFIGDFKRGVLFAD-RKDLGLRWADNEIYG-QYLQAVLRFGVSKVDDKAGYYVTFT-PEPLPL 394 (394) Q Consensus 335 ~~~gd~~~~~~~~~-~~~~~i~~~~~~~~~-~~~r~~~r~d~~v~~~~af~~l~~~-~~~~~~ 394 (394) ++||.- ++...+ +.++.++..+..... ..+....+ -+++|.+|....-+ +..+|- T Consensus 237 ~l~~~G--Ai~~~~~~~~v~vE~dRd~~~g~~~l~~r~~---~~~~p~G~s~~~~~~~~~sPt 294 (324) T protein:vir:59 237 YLFGAG--ALGYAEGQPEVPTETARNALGSQDILINRKH---FVLHPRGVKFTENAMAGTTPT 294 (324) T ss_pred EEEecC--eEEEeecCCCcceecccCccccceEEEEeeE---EEeEeeeEEecccccCCCCCC Confidence 444421 122222 334556666554332 33444444 34666666654321 223444 No 141 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=98.62 E-value=4.2e-09 Score=66.52 Aligned_cols=272 Identities=11% Similarity=0.023 Sum_probs=144.1 Q ss_pred HHhhhhhh--hhhhcccccCC--ccccchhHHhHHHHHHHhhhhhhheeeeEeecC-CceeEEEEecCCCcccccccccc Q lcl|Aclame:pro 118 INETTPVE--PQKDGIKKENA--KPVSSEEILYTPAREVKTVVDLKPFTTVYQAKK-ASGKYPVLQRATTKMVTVAELEK 192 (394) Q Consensus 118 ~~~~~~~~--~~~~~~~~~~~--~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~e~~~ 192 (394) +....... ....+.....+ -.+--+.++.++.......+.+++++++.++.+ .+..+|+.. ..+......+.. T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~ie~~~g~V~~~f~~~s~~~~~v~~r~~~~G~sv~i~~iG--~~t~~~~~~g~~ 78 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIG--RTKAAYLKPGEN 78 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHHHHHHHHHHHHHHHHHhhhhhhccccccccceeEeeecc--ceeeeeecCCCC Confidence 00000000 00011111111 112338889999888888899999998877554 466666543 333344444443 Q ss_pred ccc-ccccccceeeec--HhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcc---------------- Q lcl|Aclame:pro 193 NPA-LAKPDFKDVAWN--IDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVL---------------- 253 (394) Q Consensus 193 ~~~-~~~~~~~~v~~~--~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~---------------- 253 (394) ... ..+....+.++. -.++.. ..|.+-=-.++..|+.+.+.+..+.++++..|..++.-. T Consensus 79 l~~~~~~~~~~e~~ltiD~~~y~~-~~VddiD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~ 157 (347) T protein:vir:33 79 LDDKRKDIKHTEKVIHIDGLLTAD-VLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNENIEG 157 (347) T ss_pred CCCCCCCCccceEEEEechhhhhh-HHHhhHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccc Confidence 321 122345564454 333332 223322222345678899999999999999888775211 Q ss_pred -----ccccc--ccc--c----c----HHHHHHHHHhhhhh---hcccEEEEcHHHHHHHHhhh-ccCCceeecccccCC Q lcl|Aclame:pro 254 -----KSFTT--KTV--K----N----LDEIKALLNGGFDP---AYNVSLIVSQSFYQTLDTLK-DGNGRYLLQDDITAV 312 (394) Q Consensus 254 -----~~~~~--~~~--~----~----~~~i~~~~~~~~~~---~~~a~~vm~~~~~~~l~~lk-d~~G~~l~~~~~~~~ 312 (394) +.... .+. . + ++.+.++...+-.. ...-..|++|..+..|..-. -.++.|.-...+..+ T Consensus 158 ~~~~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~~~~d~~~~~~~~~G 237 (347) T protein:vir:33 158 LGKPTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALLDPERG 237 (347) T ss_pred ccccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCcEEEeCHHHHHHHhccccccccccccccccccc Confidence 00000 000 0 1 23334333333222 22346889999999886532 223334322234556 Q ss_pred CcccccccceEEecCcccccC------------ce--------EEEeccccE-E--------EEeecceEEEEeecc-cc Q lcl|Aclame:pro 313 SGKVLLGKPVFVLSDEVLGAN------------KA--------FIGDFKRGV-L--------FADRKDLGLRWADNE-IY 362 (394) Q Consensus 313 ~~~~l~G~pV~~~~~~~~~~~------------~~--------~~gd~~~~~-~--------~~~~~~~~i~~~~~~-~~ 362 (394) ..++++|++|+.+.+.+.... .. +-++|+... + .+.-.++.++..++. ++ T Consensus 238 ~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~~~~~ 317 (347) T protein:vir:33 238 TIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRANYQ 317 (347) T ss_pred eeEEEeceeEEEecccccCccccccccccccccccccCCcccceeccccceeeeeecchhheeeeeeceeeeeccchhhh Confidence 667899999998775433211 11 112222110 1 122334455555443 34 Q ss_pred cceEEEEEEeccEEecccceEEEEecCccC Q lcl|Aclame:pro 363 GQYLQAVLRFGVSKVDDKAGYYVTFTPEPL 392 (394) Q Consensus 363 ~~~~r~~~r~d~~v~~~~af~~l~~~~~~~ 392 (394) ...+++.+.+|.+++||++.+.+++.-..- T Consensus 318 ~d~i~~~~~~G~~vlrP~~av~i~~~~~~~ 347 (347) T protein:vir:33 318 ADQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred hHhhhhhhhcCCceecccceEEEecCCCCC Confidence 566788888899999999999998877766 No 142 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=98.62 E-value=1.8e-08 Score=62.98 Aligned_cols=258 Identities=9% Similarity=-0.033 Sum_probs=133.5 Q ss_pred hhcccccCCccccchhHHhHHHHHHHhhhhhhh---------eeeeEeecCCceeEEEEecCCCcccccccccccccccc Q lcl|Aclame:pro 128 KDGIKKENAKPVSSEEILYTPAREVKTVVDLKP---------FTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAK 198 (394) Q Consensus 128 ~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~ 198 (394) +....+.-...++|+.+.+.+.+...+.+.|++ +......++...++|++..-++.+-.+.++....+... T Consensus 1 Ma~~~T~l~d~i~pevf~~yv~~~~~~~~~l~qSG~i~~~~~i~~~~~~~G~~i~~P~~~~l~G~~~~~~dg~~~i~~~k 80 (330) T protein:vir:10 1 MANELTKILDTITPQQYNAYMQQYTAAKSAFVQSGIAVSDERVSKNITSGGLLVNMPFWNDLTGDSEVLGNGDKALETGK 80 (330) T ss_pred CCCCceEeeeeechhHHHHHHHHHhHHhhhhhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCccccchhh Confidence 222234445678999998888777666554422 11112234667888988755555555555543333344 Q ss_pred cccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcc--------c-------------ccc Q lcl|Aclame:pro 199 PDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVL--------K-------------SFT 257 (394) Q Consensus 199 ~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~--------~-------------~~~ 257 (394) .+-..-.-..+..+....++++...-+..|....+.+++++...+..+..++... . ..+ T Consensus 81 i~t~~~~a~i~~~~k~~~~tD~a~~~~g~dp~~~i~~q~a~~w~~~~q~~lla~l~gvf~~~~~~~~~~~~~~~~~~~~~ 160 (330) T protein:vir:10 81 ITAGADIACVLYRGRGWAANELTGVVAGSDPVRAILNRIGAYWLREDQKALIATLNGIFATGTAGEKGALEETHVSDQSK 160 (330) T ss_pred cccceeEEEEEeecceeeehhhhhhhcchhHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhhhhcccchhhhhhheecccc Confidence 4444544555555555666655433344456666777776544443322211110 0 001 Q ss_pred ccccccHHHHHHHHHhhhhhhc-ccEEEEcHHHHHHHHhhhccCCceeecccccCCCcccccccceEEecCcccccC--- Q lcl|Aclame:pro 258 TKTVKNLDEIKALLNGGFDPAY-NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGAN--- 333 (394) Q Consensus 258 ~~~~~~~~~i~~~~~~~~~~~~-~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~--- 333 (394) .....+++.+.++...+-+... -.+|+||+.++..|++..-- .++- +.-.+..-++++|++|++++.++...+ T Consensus 161 ~~a~~s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~li--~~~~-~s~~~~~i~~~~G~~VivdD~~p~~~~~yt 237 (330) T protein:vir:10 161 ASTGIDAGMVLDAKQLLGDSADQVTAIAMHSAVYTKLQKDNLI--QYIQ-PTTATINIPTYLGYRVIIDDGIAPTGDIYT 237 (330) T ss_pred cccccCHHHHHHHHHHhccccccceEEEEcHHHHHHHHHhhhh--hhhc-ccccCcccccccceEEEEeCCCCCCCCcee Confidence 1223456777777666554443 36899999999999864211 1111 111223346899999999886654332 Q ss_pred ceEEEeccccEE-EEee---cceEEEEeeccccc-ceEEEEEEeccEEecccceEEEEec---CccCCC Q lcl|Aclame:pro 334 KAFIGDFKRGVL-FADR---KDLGLRWADNEIYG-QYLQAVLRFGVSKVDDKAGYYVTFT---PEPLPL 394 (394) Q Consensus 334 ~~~~gd~~~~~~-~~~~---~~~~i~~~~~~~~~-~~~r~~~r~d~~v~~~~af~~l~~~---~~~~~~ 394 (394) +.+|| .+.+ +.+. ..+.+++.++.... +.+....+ -++||..|..-.-. ...+|- T Consensus 238 ~yl~~---~GAi~~~~~~~~~~v~~EtdRd~~~g~~~l~~r~~---~~~hp~G~s~~~~~~~~~~~sPt 300 (330) T protein:vir:10 238 SYLFR---TGSIGLNTGNPSGLTTFETSREAAKGNDMIYTRRA---LVMHPYGVKWTGAEVDAGNITPS 300 (330) T ss_pred EEEEe---cCceeeecccCCccccccccCCccccceEEEEeeE---EEeeeeeeeecccccccCcCCcC Confidence 23343 2222 2221 12345555554332 33433333 44677777765321 123444 No 143 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=98.61 E-value=1.1e-08 Score=64.25 Aligned_cols=273 Identities=10% Similarity=0.010 Sum_probs=148.8 Q ss_pred HHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCC-ceeEEEEecCCCcccccccccccccc Q lcl|Aclame:pro 118 INETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKA-SGKYPVLQRATTKMVTVAELEKNPAL 196 (394) Q Consensus 118 ~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~e~~~~~~~ 196 (394) +...........+...+.- .+--+.++.++.......+.++++.++.++.++ +.++|+... ..+.....+.+.- . T Consensus 1 ms~~~~~tr~~~~~s~~d~-al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~iG~--~~~~~~~pG~~l~-~ 76 (335) T protein:vir:63 1 MSFLNDLTRPNYAGKNADV-DIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRLGN--VEAKGRRAGEELE-R 76 (335) T ss_pred CCCcccchhhhcccccchh-heehhhhhhhHHHHHHhhhhhccccceeeeccceeEEEeeeee--eeeecccCCcCcC-C Confidence 1111111111222222222 343489999999999999999999999987665 667776532 3333333333332 2 Q ss_pred cccccceeeecHhhh-hhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhc----cc------------cc--- Q lcl|Aclame:pro 197 AKPDFKDVAWNIDTY-RGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKV----LK------------SF--- 256 (394) Q Consensus 197 ~~~~~~~v~~~~~~~-~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g----~~------------~~--- 256 (394) ..+.-++.++....+ +.-..|.+---..+.+|+.+.+.+.+++++++..|.+++.. .. .+ T Consensus 77 ~~~~~~k~~itVD~ll~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~~~~~~G~~~ 156 (335) T protein:vir:63 77 SRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVLE 156 (335) T ss_pred CCccccceEEEecceeechhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccCCCcCCCcce Confidence 234445656655553 22222332222235568999999999999999999976411 00 01 Q ss_pred -----cccccccHHHHHHH----HHhhhhhhc------ccEEEEcHHHHHHHHhhhccCCc-eeec---ccccCCCcccc Q lcl|Aclame:pro 257 -----TTKTVKNLDEIKAL----LNGGFDPAY------NVSLIVSQSFYQTLDTLKDGNGR-YLLQ---DDITAVSGKVL 317 (394) Q Consensus 257 -----~~~~~~~~~~i~~~----~~~~~~~~~------~a~~vm~~~~~~~l~~lkd~~G~-~l~~---~~~~~~~~~~l 317 (394) +.......+.+.++ ...+..... .-+.+++|..|..|..-+.-..+ |... .+..++...++ T Consensus 157 ~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~~v 236 (335) T protein:vir:63 157 KLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGATNDYVKSRVAIL 236 (335) T ss_pred eeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhccccccccccccccccccccCceeEEe Confidence 00111234444433 333322221 24688999999998764322222 2111 12344556789 Q ss_pred cccceEEecCccc---------ccCceEEEeccccE-EEEee--------cceEEEEeecc-cccceEEEEEEeccEEec Q lcl|Aclame:pro 318 LGKPVFVLSDEVL---------GANKAFIGDFKRGV-LFADR--------KDLGLRWADNE-IYGQYLQAVLRFGVSKVD 378 (394) Q Consensus 318 ~G~pV~~~~~~~~---------~~~~~~~gd~~~~~-~~~~~--------~~~~i~~~~~~-~~~~~~r~~~r~d~~v~~ 378 (394) +|+||+.+.+.+. ++...+=|||.+.+ +++.+ .+++.+..++. .+...+.+++-+|..+.| T Consensus 237 ~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~~~~~~i~~~~a~G~g~lR 316 (335) T protein:vir:63 237 NGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNEKFSWVLDTFQMYNIGARR 316 (335) T ss_pred eceEEEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEeecccceeeccchhhHHhHHHHHcCCcccc Confidence 9999988765432 22234446665432 22222 22222222222 223345666668999999 Q ss_pred ccceEEEEecCccCCC Q lcl|Aclame:pro 379 DKAGYYVTFTPEPLPL 394 (394) Q Consensus 379 ~~af~~l~~~~~~~~~ 394 (394) |++.+.++++..++=- T Consensus 317 Pe~a~~i~~tg~~~~~ 332 (335) T protein:vir:63 317 PDTAGAIELKGIGAFD 332 (335) T ss_pred cceEEEEEEcCCCcee Confidence 9999999998776333 No 144 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=98.60 E-value=9.2e-09 Score=64.63 Aligned_cols=272 Identities=14% Similarity=0.049 Sum_probs=142.0 Q ss_pred HHhhhhhh--hhhhcccccCC--ccccchhHHhHHHHHHHhhhhhhheeeeEeecC-CceeEEEEecCCCcccccccccc Q lcl|Aclame:pro 118 INETTPVE--PQKDGIKKENA--KPVSSEEILYTPAREVKTVVDLKPFTTVYQAKK-ASGKYPVLQRATTKMVTVAELEK 192 (394) Q Consensus 118 ~~~~~~~~--~~~~~~~~~~~--~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~e~~~ 192 (394) +....... ....+.....+ ..+--+.++..+.......+.+++++++.++.+ .+..+|+.. .........+.. T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f~~~s~~~~~~~~~~~~~G~sv~i~~ig--~~t~~~~~~g~~ 78 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIG--RTKAAYLKPGEN 78 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHHHHHHHHHHHHHHHhhhhhhccccccccccceeEeeecc--ceeeeeeccCCC Confidence 00000000 00011111111 123347788888888888888999998877554 466666643 233343344433 Q ss_pred ccc-ccccccceeee--cHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-------------- Q lcl|Aclame:pro 193 NPA-LAKPDFKDVAW--NIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKS-------------- 255 (394) Q Consensus 193 ~~~-~~~~~~~~v~~--~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~-------------- 255 (394) ... ..+.+..++++ +-.++.. ..|.+---.++..|+.+.+.+..+.++++..|..++.-... T Consensus 79 l~~~~~~~~~~e~~ltID~~~~~~-~~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~ 157 (347) T protein:vir:15 79 LDDKRKDIKHTEKVIHIDGLLTAD-VLIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLPDASNENIEG 157 (347) T ss_pred CCCCCCCCccceEEEEechhhhhh-HHhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Confidence 321 12234556444 4433333 22322222235568999999999999999988877632110 Q ss_pred -c--c------ccc------cccHHHHHHHHH----hhhh---hhcccEEEEcHHHHHHHHhhhcc-CCceeecccccCC Q lcl|Aclame:pro 256 -F--T------TKT------VKNLDEIKALLN----GGFD---PAYNVSLIVSQSFYQTLDTLKDG-NGRYLLQDDITAV 312 (394) Q Consensus 256 -~--~------~~~------~~~~~~i~~~~~----~~~~---~~~~a~~vm~~~~~~~l~~lkd~-~G~~l~~~~~~~~ 312 (394) + + ..+ ...++.|.+++. .+-. |...-..|++|..+..|.+-.+. +..|.-...+..| T Consensus 158 ~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~~~~~~~~d~~~~~~~~~G 237 (347) T protein:vir:15 158 LGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILAALMPNAANYQALIDHERG 237 (347) T ss_pred cCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhcccccccccccccccccce Confidence 0 0 000 011233444332 2222 22234677899999988654322 2222222234455 Q ss_pred CcccccccceEEecCcccccC------------ceEE--------Eeccc---------cEEEEeecceEEEEeecc-cc Q lcl|Aclame:pro 313 SGKVLLGKPVFVLSDEVLGAN------------KAFI--------GDFKR---------GVLFADRKDLGLRWADNE-IY 362 (394) Q Consensus 313 ~~~~l~G~pV~~~~~~~~~~~------------~~~~--------gd~~~---------~~~~~~~~~~~i~~~~~~-~~ 362 (394) ..++++|++|+.+.+.+.... ..+- ++|.. ++-.+.-+++.++..++. ++ T Consensus 238 ~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~~~~~~~e~~~~~~~~ 317 (347) T protein:vir:15 238 TIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRANYQ 317 (347) T ss_pred EEEEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeEeeceeeeecccchhh Confidence 567899999998765442111 1111 11111 111222344455555443 34 Q ss_pred cceEEEEEEeccEEecccceEEEEecCccC Q lcl|Aclame:pro 363 GQYLQAVLRFGVSKVDDKAGYYVTFTPEPL 392 (394) Q Consensus 363 ~~~~r~~~r~d~~v~~~~af~~l~~~~~~~ 392 (394) ...+++...+|.+++||++.+.+++.-..- T Consensus 318 ~d~i~~~~~~G~~vlrP~~av~~~~~~~~~ 347 (347) T protein:vir:15 318 ADQIIAKYAMGHGGLRPEAAGAIVLPKVSE 347 (347) T ss_pred hhhhehhhhcCCceeccccEEEEecCCCCC Confidence 556788888899999999999998877666 No 145 >protein:vir:1583 Length: 351 # NCBI annotation: minor capsid protein # Family: family:all:1522 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695165;swissprot:trembl:o03966;genbank:gi:23455804;uniprot:O03966;genbank:GeneID:955561 Probab=98.56 E-value=2.3e-08 Score=62.45 Aligned_cols=255 Identities=6% Similarity=-0.054 Sum_probs=133.2 Q ss_pred hhcccccCCccccchhHHhHHHHHHHhhhhhhh---------eeeeEeecCCceeEEEEecCCCcccccccccccccccc Q lcl|Aclame:pro 128 KDGIKKENAKPVSSEEILYTPAREVKTVVDLKP---------FTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAK 198 (394) Q Consensus 128 ~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~ 198 (394) +. ++-....++|+.+.+.+.+...+.+.|.+ +......++...++|++..-++.+-.+.++..... .. T Consensus 1 MA--~T~lsd~i~PEvf~~yv~~~~~~~~~l~qSG~i~~~~~l~~~~~~~G~~it~P~~~~l~Gd~~~~~~~~~i~~-~k 77 (351) T protein:vir:15 1 MA--ETHLSDLIVPEVFGNYVVNQIIKTNRFVQSGILTPDPDLGPHLLEAGTRITVPFLNDLTGDPDNWTDSDDIDV-NN 77 (351) T ss_pred CC--ceeeeeeechhHHHHHHhhhhHHhhhHhhcccccccHHHHHHhhcCCCEEEecccccCCCcccccCCCcccch-he Confidence 11 23334568899888888766555444422 11111234567888987655555555666555443 34 Q ss_pred cccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcc------------------ccccccc Q lcl|Aclame:pro 199 PDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVL------------------KSFTTKT 260 (394) Q Consensus 199 ~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~------------------~~~~~~~ 260 (394) .+-..-.-..+..+....++++...-+.-|....|.++|+....+..+..++..+ ...+... T Consensus 78 itt~~~~a~i~~~~kg~~~tD~a~~~sg~dp~~~i~~q~a~~w~~~~q~~lla~l~gv~~~~~~~~~~~~d~t~~~~~~~ 157 (351) T protein:vir:15 78 LTSGKQQGIKFYQTKAYGYTDLGTMISGAPVQETIGNRFAAFWQRADQKTLLSVLKGVMGVTKIANSKVYDQTKVSPSEP 157 (351) T ss_pred ecccceeEEEEeeccceehhhhhHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhchhhcccceecccccccccc Confidence 4444444444555555666665433344466677777777666555444332211 0111233 Q ss_pred cccHHHHHHHHHhhhhhhcc--cEEEEcHHHHHHHHhhhccCCceeecccccCCCcccccccceEEecCcccc----cC- Q lcl|Aclame:pro 261 VKNLDEIKALLNGGFDPAYN--VSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLG----AN- 333 (394) Q Consensus 261 ~~~~~~i~~~~~~~~~~~~~--a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~----~~- 333 (394) ..+++.+.+++..+-+...+ ++|+||+.++..|++..-- .|+- +.-.+..-++++|++|++.+.++.. +. T Consensus 158 ~is~~~l~~A~~~~GD~~~~~~~~ivmhS~v~~~L~~~~li--~~~~-~s~~~~~i~t~~G~~VivdD~~p~~~~~~~~~ 234 (351) T protein:vir:15 158 MFGAKGFTGAIGLMGDLQDTAFGAIAVNSATYSLMKVQGLI--ETIQ-PQNGATPFEAYNGLRIVLDDDIEIDLTDKTKP 234 (351) T ss_pred ccCHHHHHHHHHHhccccccceEEEEEChHHHHHHHhhhhh--hhcc-ccccCcccceecceEEEEcCCCccccCCCCCc Confidence 45678888888777665433 7899999999999864311 1110 1111223468999999997765432 11 Q ss_pred ---ceEEEeccccEEEEeecceEEEEeecccc---cceEEEEEEeccEEecccceEEEEe---cCccCCC Q lcl|Aclame:pro 334 ---KAFIGDFKRGVLFADRKDLGLRWADNEIY---GQYLQAVLRFGVSKVDDKAGYYVTF---TPEPLPL 394 (394) Q Consensus 334 ---~~~~gd~~~~~~~~~~~~~~i~~~~~~~~---~~~~r~~~r~d~~v~~~~af~~l~~---~~~~~~~ 394 (394) +++||. +.+.+...+..+++.++... ...+....+ -++||..|..-.- +...+|- T Consensus 235 ~ytsyl~~~---GAi~~~~~~~~ve~~rd~~~~~g~d~l~~r~~---~~~hp~G~s~~~~~~~~~~~sPt 298 (351) T protein:vir:15 235 VSTSYIFAP---GAVRYSTNMRSTETKYDPLINGGQDVIVQKRV---GTIHVAGTSIKASFSPSKASFPT 298 (351) T ss_pred eeEEEEEec---ceeeeecCCcCcceeecccCCCCceEEEEeee---eeeeeeeeeecccccccCcCCcC Confidence 233332 32222223333444444322 222333222 3477777765421 1122344 No 146 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=98.55 E-value=1.9e-08 Score=62.90 Aligned_cols=273 Identities=10% Similarity=-0.002 Sum_probs=147.5 Q ss_pred HHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCC-ceeEEEEecCCCcccccccccccccc Q lcl|Aclame:pro 118 INETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKA-SGKYPVLQRATTKMVTVAELEKNPAL 196 (394) Q Consensus 118 ~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~e~~~~~~~ 196 (394) +...........+...+.- .+--+.++.++.......+.++++.++.++.++ +..+|+... ....+...+.+.- . T Consensus 1 ms~~~~~t~~~~~~s~~d~-al~le~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~iG~--~~~~~~~pG~~l~-~ 76 (335) T protein:vir:78 1 MSFLNDLTRPNYAGKNADV-DIHLEEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRLGN--VEAKGRRAGEELE-R 76 (335) T ss_pred CCccccccccccccccchh-hhhhhhhhhHHHHHHHHhhhhccccceeeeccceeEEEeeeee--eeecccccCcccC-C Confidence 1111111111222222222 344488999999999999999999999987665 677776532 2233333333332 2 Q ss_pred cccccceeeecHhhhh-hhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhc----cc------------ccc-- Q lcl|Aclame:pro 197 AKPDFKDVAWNIDTYR-GAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKV----LK------------SFT-- 257 (394) Q Consensus 197 ~~~~~~~v~~~~~~~~-~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g----~~------------~~~-- 257 (394) ..+.-++.++....+- .-..|.+---..+.+|+.+.+.+.+++++++..|.+++.. .. .+. T Consensus 77 ~~~~~~k~~itID~ll~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~~~~~~G~~~ 156 (335) T protein:vir:78 77 SRVVNDKWNLTVDTLLYLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLEDAFSPGVLE 156 (335) T ss_pred CCcccCCeEEEecceeechhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccCCCcCCCcce Confidence 3344566555555432 2222332112235668999999999999999999976411 10 010 Q ss_pred ------ccccccHHHHHHHHHh----hhhhhc------ccEEEEcHHHHHHHHhhhccCCc-eeec---ccccCCCcccc Q lcl|Aclame:pro 258 ------TKTVKNLDEIKALLNG----GFDPAY------NVSLIVSQSFYQTLDTLKDGNGR-YLLQ---DDITAVSGKVL 317 (394) Q Consensus 258 ------~~~~~~~~~i~~~~~~----~~~~~~------~a~~vm~~~~~~~l~~lkd~~G~-~l~~---~~~~~~~~~~l 317 (394) .......+.+.+++.. +..... .-+.+++|..|..|..-..-..+ |... .+...+....+ T Consensus 157 ~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~~v 236 (335) T protein:vir:78 157 KLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQATGATNDYVKSRVAIL 236 (335) T ss_pred eeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhcccccccccccccccccccccceeEEe Confidence 0011123334433322 221111 24789999999998764322111 2111 12344556789 Q ss_pred cccceEEecCcccc---------cCceEEEecccc-EEEEee--------cceEEEEeecc-cccceEEEEEEeccEEec Q lcl|Aclame:pro 318 LGKPVFVLSDEVLG---------ANKAFIGDFKRG-VLFADR--------KDLGLRWADNE-IYGQYLQAVLRFGVSKVD 378 (394) Q Consensus 318 ~G~pV~~~~~~~~~---------~~~~~~gd~~~~-~~~~~~--------~~~~i~~~~~~-~~~~~~r~~~r~d~~v~~ 378 (394) +|+||+.+.+.+.+ ++..+=+||.+- .+++.+ .++..+..++. .+...+.+.+-+|..+.| T Consensus 237 ~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~~~i~~~~a~G~g~lR 316 (335) T protein:vir:78 237 NGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHDQFSWVLDTFQMYNIGARR 316 (335) T ss_pred eceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEecccceeeccchhhHhhhHHHHcCCcccC Confidence 99999887655422 222333455442 222322 22232333222 233345666668999999 Q ss_pred ccceEEEEecCccCCC Q lcl|Aclame:pro 379 DKAGYYVTFTPEPLPL 394 (394) Q Consensus 379 ~~af~~l~~~~~~~~~ 394 (394) |++.+.++++..++=- T Consensus 317 Pe~a~~i~~tg~~~~~ 332 (335) T protein:vir:78 317 PDTAGAIELKGIEAFD 332 (335) T ss_pred cceEEEEEecCCCccc Confidence 9999999988877433 No 147 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=98.52 E-value=1.2e-08 Score=64.02 Aligned_cols=230 Identities=13% Similarity=0.069 Sum_probs=123.9 Q ss_pred eeeeEeecCCceeEEEEecCCCccccccccccccc-ccccccce--eeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHH Q lcl|Aclame:pro 161 FTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPA-LAKPDFKD--VAWNIDTYRGAIPLSQESIDDADVDLVGIVSESI 237 (394) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~-~~~~~~~~--v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l 237 (394) +++-+.- +.++++|+... ........|.+... ..+..-.+ ++++-.++....--.-.= ..+.+|+.+...++. T Consensus 1 ~vr~i~~-g~s~~~~~iG~--~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~~~VdDiD~-~qa~~Dlr~e~s~~~ 76 (324) T protein:vir:99 1 MTRTITS-GKSAQFPVMGR--TKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTDVLIYDIED-AMNHYDVRSEYSTQM 76 (324) T ss_pred Ceeeeec-CceEEEeeeee--eEeccccCCCCcCCCcCCcCcccEEEEecchhhhhhhhhhHHH-HhcCccchhHHHHHH Confidence 4443333 45777777532 33333333333211 11122333 555555555432222111 225568999999999 Q ss_pred HHHHHHHHHHHHhhcc----------------ccc---------ccc-cccc----HHHHHHHHHhhhhh---hcccEEE Q lcl|Aclame:pro 238 SQIKVNTTNDAIAKVL----------------KSF---------TTK-TVKN----LDEIKALLNGGFDP---AYNVSLI 284 (394) Q Consensus 238 ~~~~~~~~~~a~~~g~----------------~~~---------~~~-~~~~----~~~i~~~~~~~~~~---~~~a~~v 284 (394) +.++++..|..++.-. +.+ +.. ...+ ++.+.++...+-.. ...-..| T Consensus 77 G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~~gR~~v 156 (324) T protein:vir:99 77 GEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIPAGDRTFY 156 (324) T ss_pred HHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEE Confidence 9999999998764221 000 000 0111 33344333332222 2234688 Q ss_pred EcHHHHHHHHhhh-ccCCceeecccccCCCcccccccceEEecCcccccCc-----------------------eEEEec Q lcl|Aclame:pro 285 VSQSFYQTLDTLK-DGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANK-----------------------AFIGDF 340 (394) Q Consensus 285 m~~~~~~~l~~lk-d~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~-----------------------~~~gd~ 340 (394) ++|..+..|..-+ -.++.|.-...+..+.-++++|++|+.+.+.+...+. -|-+|| T Consensus 157 v~P~~y~~Ll~~~~~~~~~~~~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky~~d~ 236 (324) T protein:vir:99 157 TDPDTYSAILAALMPNAANYAALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKMTVGA 236 (324) T ss_pred eChHHHHHHhhcccccccccccccceecceEEEEeceEEEecCCcccccccccccccccccccccccccccccccccccc Confidence 9999998775322 2233443334456666788999999887654432111 133444 Q ss_pred cccE-EEEe--------ecceEEEEeec-ccccceEEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 341 KRGV-LFAD--------RKDLGLRWADN-EIYGQYLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 341 ~~~~-~~~~--------~~~~~i~~~~~-~~~~~~~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) +... +++. -.++.++..++ .++...+++.+-+|..++||++.+.+++..-+||- T Consensus 237 ~~~~gl~~~~~a~~tv~~~~~~~e~~~~~~~~~d~i~~~~a~G~~~lRPe~a~~v~l~~~~~~~ 300 (324) T protein:vir:99 237 DNVVGLFVHRSAVATLKLKDMALERARRPEYQADQIIAKYAMGHGGLRPEAVGAIIFEDGETPA 300 (324) T ss_pred CceeEEEEehhheEEEeeecceecceechhhHHHhhhhhhhhcCcccccceEEEEEEccCcccc Confidence 3321 1222 23334444333 34556678888899999999999999998888874 No 148 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=98.36 E-value=2.6e-08 Score=62.14 Aligned_cols=272 Identities=8% Similarity=-0.045 Sum_probs=129.3 Q ss_pred HHhhh-hhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEee---cCCceeEEEEecCCCccccccccccc Q lcl|Aclame:pro 118 INETT-PVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQA---KKASGKYPVLQRATTKMVTVAELEKN 193 (394) Q Consensus 118 ~~~~~-~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~e~~~~ 193 (394) +..+. ....+..+...+....++|+.|+..+.+.++....+..+++.... .+.++++|... ..+.....++... T Consensus 1 ~~~~~~~~~~~~~~~~~t~~~~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g--~~~a~d~~~g~~i 78 (381) T protein:vir:80 1 MATIQGTGGYKGSAVDLSNVQVFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNIS--RAAVYDKQPQTPV 78 (381) T ss_pred CceecccccccCcccchhhHHhhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccC--cceeeeecCCCcc Confidence 11100 011112223333445788999999999999888888777654332 23356666542 3344445544443 Q ss_pred ccccccccceeeecHhhh-hhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc----c------------ Q lcl|Aclame:pro 194 PALAKPDFKDVAWNIDTY-RGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKS----F------------ 256 (394) Q Consensus 194 ~~~~~~~~~~v~~~~~~~-~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~----~------------ 256 (394) . ....+...++++..+. +.-+.|++.-...+..|+.+.+.+.+..++++..|..++..... . T Consensus 79 ~-~~~~~~~~~~itID~~~~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~~~~~~~~~t~~~~i~ 157 (381) T protein:vir:80 79 N-LQARTDSEFTFTVTKYKESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVINAFPSQRIYSYDTTLG 157 (381) T ss_pred c-ccccCCceEEEEEeeeeecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccccccc Confidence 3 3344555555555332 34467776555556678999999999999999888877543210 0 Q ss_pred --cc-------cccccHHHHHHHHHhhhhh---hcccEEEEcHHHHHHHHhhhc-cCCceeecccccCCCcccccccceE Q lcl|Aclame:pro 257 --TT-------KTVKNLDEIKALLNGGFDP---AYNVSLIVSQSFYQTLDTLKD-GNGRYLLQDDITAVSGKVLLGKPVF 323 (394) Q Consensus 257 --~~-------~~~~~~~~i~~~~~~~~~~---~~~a~~vm~~~~~~~l~~lkd-~~G~~l~~~~~~~~~~~~l~G~pV~ 323 (394) .. ....+++.++++...+-.. ..+-.++++|..+..|.+... .+-.+.....+.++..++|+|++|+ T Consensus 158 ~~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~~~~~~~ad~~~~~~l~~G~Ig~i~G~~Vv 237 (381) T protein:vir:80 158 DGTVNAHLTGTPAPLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLSINQFISVDFSQVKPVTSGVVGTILGMEVI 237 (381) T ss_pred ccccccccccchhhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhhchhhhhhhhccchhhhceeeeEEcceEEE Confidence 00 0112345555554433222 223478899999999875321 1222333344666777899999999 Q ss_pred EecCcccccCceEEEeccccEEEEeecceEEE-E-eecccccceEEEEEEeccEEecc----------------cceEEE Q lcl|Aclame:pro 324 VLSDEVLGANKAFIGDFKRGVLFADRKDLGLR-W-ADNEIYGQYLQAVLRFGVSKVDD----------------KAGYYV 385 (394) Q Consensus 324 ~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~-~-~~~~~~~~~~r~~~r~d~~v~~~----------------~af~~l 385 (394) .++..+......+...+. .-......+.-. + -++.+....++....+|..+... ..-.+. T Consensus 238 ~Sn~lp~~~~t~~~~~ag--ap~~~~~~~~~~~~~g~~s~~a~av~~~k~yd~~~~~~~~~~~~~~g~~~~~~~~~~~~~ 315 (381) T protein:vir:80 238 VTTQIGINSLTGYVNGQG--APTQPTPGVLGSPYLPDQAGTANVVNTGSASDLAVSLSYFGLPVFSGAGATAADGGQTLG 315 (381) T ss_pred eecccccccccceeeecc--ccccccccccccccccccccceeeeeeeeeeceeeeeeeccceeeecceeeecCCCceee Confidence 876544322211111000 000000000000 0 00000011122222222222111 111111 Q ss_pred Eec---------------CccCCC Q lcl|Aclame:pro 386 TFT---------------PEPLPL 394 (394) Q Consensus 386 ~~~---------------~~~~~~ 394 (394) +++ +++-|- T Consensus 316 ~~~~~~~~~~~~~~~~~~~~~~~~ 339 (381) T protein:vir:80 316 SFGGANRWATAVVCHPDWLAVGVQ 339 (381) T ss_pred eehhhhhhhhhcccccccccccce Confidence 111 111111 No 149 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=98.36 E-value=1.8e-08 Score=63.03 Aligned_cols=212 Identities=8% Similarity=-0.032 Sum_probs=125.1 Q ss_pred hhhhhhhhhcccccC-CccccchhHHhHHHHHHHhhhhhhheeeeEeecCCc-eeEEEEecCCCcccccccccccccccc Q lcl|Aclame:pro 121 TTPVEPQKDGIKKEN-AKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKAS-GKYPVLQRATTKMVTVAELEKNPALAK 198 (394) Q Consensus 121 ~~~~~~~~~~~~~~~-~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~e~~~~~~~~~ 198 (394) ....... ..+-.. ...+.|......|++.+.+.++|+..+++....+++ +.+.+. .+.+.+.|..-++..+.++ T Consensus 1 m~~~~~~--~~TL~e~Akr~~~d~~~~~VIE~l~~~n~IL~~lpf~e~n~gt~~~~~v~--~~LP~~~fR~lN~g~~~s~ 76 (328) T protein:vir:95 1 MAVKGLT--ALTLADWGKRVDPNGKVDKIIELLGQTNPILQDMPFVEGNLPTGHRTTIR--SGLPSATWRLLNYGVQPSK 76 (328) T ss_pred CCccccc--cccHHHHHhhhCcchhHHHHHHHHhccchhHhhcceeecccCCcceeeEe--eccCCceeeecCCccCccc Confidence 0000000 001111 123556667778999999999999999999886443 445554 3445555655555555688 Q ss_pred cccceeeecHhhhhhhhhhhHHHHhccH--HHHHHHHHHHHHHHHHHHHHHHHhhccccccc------------------ Q lcl|Aclame:pro 199 PDFKDVAWNIDTYRGAIPLSQESIDDAD--VDLVGIVSESISQIKVNTTNDAIAKVLKSFTT------------------ 258 (394) Q Consensus 199 ~~~~~v~~~~~~~~~~~~vs~ell~ds~--~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~------------------ 258 (394) .++.+++-..+-+++.+.|.+.+.+... .++...-.....+++.......+++|+.+..| T Consensus 77 ~tt~q~t~~l~ilgg~~eVDr~la~~~Gn~~~~ra~q~~~~~ka~~~~~~~~~iyGdsa~~p~~F~GL~~R~~~~s~~~a 156 (328) T protein:vir:95 77 STTVQVTDSVGMLETYAEVDKSLADLNGNTAEFRLSEDRAFIEAMNQQMAQTLFYGDSSVNPQQFMGLSSRYSSLSAGNA 156 (328) T ss_pred ceeEEEEEEEEEEecceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCccCChhhhcchhhhcCccccccc Confidence 9999999999999999999998887553 22333333345556666665555555322100 Q ss_pred -------------------------------cc----------------------------------------------- Q lcl|Aclame:pro 259 -------------------------------KT----------------------------------------------- 260 (394) Q Consensus 259 -------------------------------~~----------------------------------------------- 260 (394) .+ T Consensus 157 ~qiidaGgtg~~~TSi~~v~~g~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~g~~y~~y~~~~~w~~Gl~i~d~r~vvrI 236 (328) T protein:vir:95 157 QNIIDAGGTGTDNTSIWLVVWGENTVHGIFPKGKKAGIQMEDKGQVTLEDANGGKYEGYRTHYKWDNGLALRDWRYVVRI 236 (328) T ss_pred cceeecccCCCCceEEEEEEEcCCeEEEecccccccCceeeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEE Confidence 00 Q ss_pred ----------cccHHHHHHH----HHhhhhh-hcccEEEEcHHHHHHHHhh-hccCCceeecccccCCCcccccccceEE Q lcl|Aclame:pro 261 ----------VKNLDEIKAL----LNGGFDP-AYNVSLIVSQSFYQTLDTL-KDGNGRYLLQDDITAVSGKVLLGKPVFV 324 (394) Q Consensus 261 ----------~~~~~~i~~~----~~~~~~~-~~~a~~vm~~~~~~~l~~l-kd~~G~~l~~~~~~~~~~~~l~G~pV~~ 324 (394) -++..+++++ +...+.. .-+++|+||+.....|++. .+....++-.........-.++|.||.. T Consensus 237 ~NId~~~l~~~~~~~~l~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~~~~~~~~g~~~t~~~gipir~ 316 (328) T protein:vir:95 237 ANIDVSNLSEPSSAANIAKLMVKALHRIPNRGMGRPVFYMNRTVGQALDLQSLEKTSLAISVKETEGEWWTSFRGVPIRE 316 (328) T ss_pred ecCcccccccccChhhHHHHHHHHHHHhccCCCCcceeehhHHHHHHHHHHHhcCcceeeeeeccCCcceeEECCeEEEE Confidence 0011223333 2222212 2357899999999999865 4444444433344555566899999998 Q ss_pred ecCcccccCceE Q lcl|Aclame:pro 325 LSDEVLGANKAF 336 (394) Q Consensus 325 ~~~~~~~~~~~~ 336 (394) ++........++ T Consensus 317 ~dai~~tE~~vv 328 (328) T protein:vir:95 317 TDALLETEARVV 328 (328) T ss_pred EeeeecCccccC Confidence 875443333333 No 150 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=98.21 E-value=8.8e-08 Score=59.25 Aligned_cols=271 Identities=10% Similarity=0.055 Sum_probs=143.3 Q ss_pred HHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCC-ceeEEEEecCCCcccccccccccccc Q lcl|Aclame:pro 118 INETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKA-SGKYPVLQRATTKMVTVAELEKNPAL 196 (394) Q Consensus 118 ~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~e~~~~~~~ 196 (394) +.......... ...+...-.+--+.+..++.....+.+.++++..+.++.++ ++++|+... ....+...|.+ +.. T Consensus 1 Ms~~n~~t~p~-~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~vRtI~~gkS~qf~~lG~--s~a~y~~pG~~-ldg 76 (400) T protein:vir:10 1 MSTPNNLTNVA-VSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYLGE--TELQVLAPGQS-PAA 76 (400) T ss_pred CCCCccccccc-cccccchhhhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEeee--eEEeeecCCCC-cCC Confidence 11110000000 11112223456678888888888888999999999998766 677776532 23333333333 333 Q ss_pred cccccceeeecHhhh-hhhhhhhHHHHhccHHH-HHHHHHHHHHHHHHHHHHHHHhhcc--c----c-------cc---- Q lcl|Aclame:pro 197 AKPDFKDVAWNIDTY-RGAIPLSQESIDDADVD-LVGIVSESISQIKVNTTNDAIAKVL--K----S-------FT---- 257 (394) Q Consensus 197 ~~~~~~~v~~~~~~~-~~~~~vs~ell~ds~~~-l~~~i~~~l~~~~~~~~~~a~~~g~--~----~-------~~---- 257 (394) +.+..++..+....+ ..-..|..=---++.+| +.+.+.+.+++++++..|..++.-. + + ++ T Consensus 77 ~~~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~g~~~g 156 (400) T protein:vir:10 77 TSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPRVKGHG 156 (400) T ss_pred CCcccCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCccccc Confidence 334555555555443 22223322111123456 7899999999999998888664211 0 0 00 Q ss_pred ---------ccccccHHHHH----HHHHhhhh---hhcccEEEEcHHHHHHHHhhh---ccCCceeec--ccccCCCccc Q lcl|Aclame:pro 258 ---------TKTVKNLDEIK----ALLNGGFD---PAYNVSLIVSQSFYQTLDTLK---DGNGRYLLQ--DDITAVSGKV 316 (394) Q Consensus 258 ---------~~~~~~~~~i~----~~~~~~~~---~~~~a~~vm~~~~~~~l~~lk---d~~G~~l~~--~~~~~~~~~~ 316 (394) ....++.+.+. ++...+.. |.-.-++++.|..|..|.... +.+ |-.. .+...+...+ T Consensus 157 ~s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~Ll~~dkLvnrd--f~~s~~g~~~~g~v~~ 234 (400) T protein:vir:10 157 FSVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKS--YTISQSGATIQGFVLS 234 (400) T ss_pred cceeecccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhCCcccchh--ccccCCCccccceEEE Confidence 00111222232 22222221 112346777777777775321 111 1111 1123334458 Q ss_pred ccccceEEecCccccc-------------C--ceEEEeccccE-EEEeecceE-EEE--------eecccccceEEEEEE Q lcl|Aclame:pro 317 LLGKPVFVLSDEVLGA-------------N--KAFIGDFKRGV-LFADRKDLG-LRW--------ADNEIYGQYLQAVLR 371 (394) Q Consensus 317 l~G~pV~~~~~~~~~~-------------~--~~~~gd~~~~~-~~~~~~~~~-i~~--------~~~~~~~~~~r~~~r 371 (394) ++|+||+.+.+.+..+ + .-+-|||+..+ ++|.+.-+- ++. .+...+...+.+++- T Consensus 235 v~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt~~~~~d~r~~~~~id~~~a 314 (400) T protein:vir:10 235 SYNCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVIGDIFYEKKEKTYYIDTFMS 314 (400) T ss_pred EeceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEeeccccccccchhhHHHHHHHHHH Confidence 9999998876543221 1 12447876643 233332221 221 122233345677777 Q ss_pred eccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 372 FGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 372 ~d~~v~~~~af~~l~~~~~~~~~ 394 (394) +|..+.||+|...++..-..||- T Consensus 315 ~G~g~~RPeaa~vv~~~~~~~~~ 337 (400) T protein:vir:10 315 EGAIPDRWEAVSVVTTKRQSTGA 337 (400) T ss_pred hCCcccchhheEEEEecCCcccc Confidence 89999999999999999999998 No 151 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=98.14 E-value=1.4e-07 Score=58.20 Aligned_cols=213 Identities=8% Similarity=-0.014 Sum_probs=124.5 Q ss_pred HHhhhhhhhhhhcccccC-CccccchhHHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCcccccccccccccc Q lcl|Aclame:pro 118 INETTPVEPQKDGIKKEN-AKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPAL 196 (394) Q Consensus 118 ~~~~~~~~~~~~~~~~~~-~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~ 196 (394) +. .... ...+-.. ...+.|......|+|.+.+.++|+..+++.....+++...... .+.+.+.|-.-++..+. T Consensus 1 m~---~~~~--~a~TL~e~AKr~~~d~~~~~IIE~l~~tn~IL~~lpf~e~N~~tg~~t~vr-t~LP~~~fR~lN~g~~~ 74 (330) T protein:vir:10 1 MA---TLST--NNPTMADVAKRLDPNGKVDIIVEMLNQTNPVLQDMTAIEGNLPTGHRTSVR-TGLPTPTWRKLYGGVLP 74 (330) T ss_pred CC---cCCC--CcccHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhhccCCcccceeEE-eecCCchhhhcCCcccc Confidence 00 0000 0000000 1234566666789999999999999988887655554433222 23344556555555556 Q ss_pred cccccceeeecHhhhhhhhhhhHHHHhccH--HHHHHHHHHHHHHHHHHHHHHHHhhcccccccc--------------- Q lcl|Aclame:pro 197 AKPDFKDVAWNIDTYRGAIPLSQESIDDAD--VDLVGIVSESISQIKVNTTNDAIAKVLKSFTTK--------------- 259 (394) Q Consensus 197 ~~~~~~~v~~~~~~~~~~~~vs~ell~ds~--~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~--------------- 259 (394) ++.++.+++-+.+-+.+...|.+.+.+... .++.........+++.......+++|+.+..|. T Consensus 75 s~~tt~qvt~~l~ilgg~~eVDr~la~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGD~a~~p~~F~GL~kR~~~~ta~ 154 (330) T protein:vir:10 75 NKSSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLFYGNDGIAPAEFTGLSPRYNSLSAE 154 (330) T ss_pred ccceEEEEEEEeEEecchhhhhhHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCChhhccchhhhcCCCCCC Confidence 889999999999999999999998876542 234444445556666666666666553221100 Q ss_pred -------------------------------------------------------------------------------- Q lcl|Aclame:pro 260 -------------------------------------------------------------------------------- 259 (394) Q Consensus 260 -------------------------------------------------------------------------------- 259 (394) T Consensus 155 ~~~qvIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~~~dg~gg~y~~~~~~~~w~~Gl~i~d~r~ 234 (330) T protein:vir:10 155 NKDNVIDAGGTGSDNASAWLVVWGPNTCHSIYPKGSKAGLSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRY 234 (330) T ss_pred chhheeeccccccCceEEEEEEEcCCeEEEEcccCccccceeeeccceeeecccCCCCceeEEeeeeeeeeeeEEeCccc Confidence Q ss_pred -------------ccccHHHHHHHHHhhhh----hh-cccEEEEcHHHHHHHHhh-hccCCceeecccccCCCccccccc Q lcl|Aclame:pro 260 -------------TVKNLDEIKALLNGGFD----PA-YNVSLIVSQSFYQTLDTL-KDGNGRYLLQDDITAVSGKVLLGK 320 (394) Q Consensus 260 -------------~~~~~~~i~~~~~~~~~----~~-~~a~~vm~~~~~~~l~~l-kd~~G~~l~~~~~~~~~~~~l~G~ 320 (394) +-...+++++++..+.. .. .+++|+||+.....|++. .+.+.-.+-...+.....-.++|. T Consensus 235 vvRI~NIdvs~l~~~~~~~~li~lm~~A~~~ip~~~~g~~~~y~n~~v~~~L~~q~~~k~n~~l~~~~~~g~~~t~~~gi 314 (330) T protein:vir:10 235 VARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGI 314 (330) T ss_pred EEEEeecccccCCCCccHHHHHHHHHHHHHhccCCCCCcceeeechHHHHHHHHHHhhcccceeeeeecCCeeeEEECCe Confidence 00012345554433222 22 257899999999999974 454443343333444444679999 Q ss_pred ceEEecCcccccCceE Q lcl|Aclame:pro 321 PVFVLSDEVLGANKAF 336 (394) Q Consensus 321 pV~~~~~~~~~~~~~~ 336 (394) ||..+|........++ T Consensus 315 pir~~Dail~tE~~vv 330 (330) T protein:vir:10 315 PVQRTDALLNTESRVV 330 (330) T ss_pred EEEEEeeeecCccccC Confidence 9999875444443333 No 152 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=98.03 E-value=2.7e-07 Score=56.61 Aligned_cols=214 Identities=10% Similarity=0.006 Sum_probs=119.3 Q ss_pred hhhhhhhhhcccccCCccccchh-HHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCccccccccccccccccc Q lcl|Aclame:pro 121 TTPVEPQKDGIKKENAKPVSSEE-ILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKP 199 (394) Q Consensus 121 ~~~~~~~~~~~~~~~~~~lvP~~-~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~ 199 (394) ........-++.. ....+-|.. +...|++.+.+.++|+..+++.....+.+..-. ...+.+.+.|..-++..+.++. T Consensus 1 m~~~~~~~~TL~e-~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~-vrt~LP~~~fR~lN~g~~~s~~ 78 (331) T protein:vir:10 1 MPTLSTTNPTLAD-VAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTT-VRSGLPTGTWRKLNYGVQPEKS 78 (331) T ss_pred CCccccCcccHHH-HHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceee-EEeccCCchhhccCCccCcccc Confidence 0000000000000 000111222 345699999999999999999887666543322 2234445666666666667889 Q ss_pred ccceeeecHhhhhhhhhhhHHHHhccH--HHHHHHHHHHHHHHHHHHHHHHHhhccccccc------------------- Q lcl|Aclame:pro 200 DFKDVAWNIDTYRGAIPLSQESIDDAD--VDLVGIVSESISQIKVNTTNDAIAKVLKSFTT------------------- 258 (394) Q Consensus 200 ~~~~v~~~~~~~~~~~~vs~ell~ds~--~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~------------------- 258 (394) ++.+++-..+-+++.+.|.+.+.+... .++...-.....+.+.......+++|+.+..| T Consensus 79 tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~ 158 (331) T protein:vir:10 79 RTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQ 158 (331) T ss_pred eeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhcccccccccc Confidence 999999999999999999998887543 22334344445566666666665555322100 Q ss_pred ------------------------------cc------------------------------------------------ Q lcl|Aclame:pro 259 ------------------------------KT------------------------------------------------ 260 (394) Q Consensus 259 ------------------------------~~------------------------------------------------ 260 (394) .+ T Consensus 159 q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~ 238 (331) T protein:vir:10 159 NIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) T ss_pred ceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEe Confidence 00 Q ss_pred ----------cccHHHHHHHHHhhhh----h-hcccEEEEcHHHHHHHHhh-hcc-CCceeecccccCCCcccccccceE Q lcl|Aclame:pro 261 ----------VKNLDEIKALLNGGFD----P-AYNVSLIVSQSFYQTLDTL-KDG-NGRYLLQDDITAVSGKVLLGKPVF 323 (394) Q Consensus 261 ----------~~~~~~i~~~~~~~~~----~-~~~a~~vm~~~~~~~l~~l-kd~-~G~~l~~~~~~~~~~~~l~G~pV~ 323 (394) ..+..++++++..+.. . ..+.+|+||+.....|++. .+. +.+.+-.....+...-.++|.||. T Consensus 239 NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir 318 (331) T protein:vir:10 239 NVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCR 318 (331) T ss_pred ccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEE Confidence 0000123333222221 1 1257899999999999875 343 323333223344455679999999 Q ss_pred EecCcccccCceE Q lcl|Aclame:pro 324 VLSDEVLGANKAF 336 (394) Q Consensus 324 ~~~~~~~~~~~~~ 336 (394) .++........++ T Consensus 319 ~~dai~~tE~~Vv 331 (331) T protein:vir:10 319 RTDALLLTEARVV 331 (331) T ss_pred EeeeeecCccccC Confidence 9875444333333 No 153 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=98.03 E-value=2.7e-07 Score=56.61 Aligned_cols=214 Identities=10% Similarity=0.006 Sum_probs=119.3 Q ss_pred hhhhhhhhhcccccCCccccchh-HHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCccccccccccccccccc Q lcl|Aclame:pro 121 TTPVEPQKDGIKKENAKPVSSEE-ILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKP 199 (394) Q Consensus 121 ~~~~~~~~~~~~~~~~~~lvP~~-~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~ 199 (394) ........-++.. ....+-|.. +...|++.+.+.++|+..+++.....+.+..-. ...+.+.+.|..-++..+.++. T Consensus 1 m~~~~~~~~TL~e-~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~-vrt~LP~~~fR~lN~g~~~s~~ 78 (331) T protein:vir:98 1 MPTLSTTNPTLAD-VAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTT-VRSGLPTGTWRKLNYGVQPEKS 78 (331) T ss_pred CCccccCcccHHH-HHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceee-EEeccCCchhhccCCccCcccc Confidence 0000000000000 000111222 345699999999999999999887666543322 2234445666666666667889 Q ss_pred ccceeeecHhhhhhhhhhhHHHHhccH--HHHHHHHHHHHHHHHHHHHHHHHhhccccccc------------------- Q lcl|Aclame:pro 200 DFKDVAWNIDTYRGAIPLSQESIDDAD--VDLVGIVSESISQIKVNTTNDAIAKVLKSFTT------------------- 258 (394) Q Consensus 200 ~~~~v~~~~~~~~~~~~vs~ell~ds~--~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~------------------- 258 (394) ++.+++-..+-+++.+.|.+.+.+... .++...-.....+.+.......+++|+.+..| T Consensus 79 tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~ 158 (331) T protein:vir:98 79 RTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQ 158 (331) T ss_pred eeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhcccccccccc Confidence 999999999999999999998887543 22334344445566666666665555322100 Q ss_pred ------------------------------cc------------------------------------------------ Q lcl|Aclame:pro 259 ------------------------------KT------------------------------------------------ 260 (394) Q Consensus 259 ------------------------------~~------------------------------------------------ 260 (394) .+ T Consensus 159 q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~ 238 (331) T protein:vir:98 159 NIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) T ss_pred ceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEe Confidence 00 Q ss_pred ----------cccHHHHHHHHHhhhh----h-hcccEEEEcHHHHHHHHhh-hcc-CCceeecccccCCCcccccccceE Q lcl|Aclame:pro 261 ----------VKNLDEIKALLNGGFD----P-AYNVSLIVSQSFYQTLDTL-KDG-NGRYLLQDDITAVSGKVLLGKPVF 323 (394) Q Consensus 261 ----------~~~~~~i~~~~~~~~~----~-~~~a~~vm~~~~~~~l~~l-kd~-~G~~l~~~~~~~~~~~~l~G~pV~ 323 (394) ..+..++++++..+.. . ..+.+|+||+.....|++. .+. +.+.+-.....+...-.++|.||. T Consensus 239 NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir 318 (331) T protein:vir:98 239 NVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCR 318 (331) T ss_pred ccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEE Confidence 0000123333222221 1 1257899999999999875 343 323333223344455679999999 Q ss_pred EecCcccccCceE Q lcl|Aclame:pro 324 VLSDEVLGANKAF 336 (394) Q Consensus 324 ~~~~~~~~~~~~~ 336 (394) .++........++ T Consensus 319 ~~dai~~tE~~Vv 331 (331) T protein:vir:98 319 RTDALLLTEARVV 331 (331) T ss_pred EeeeeecCccccC Confidence 9875444333333 No 154 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=98.03 E-value=2.7e-07 Score=56.61 Aligned_cols=214 Identities=10% Similarity=0.006 Sum_probs=119.3 Q ss_pred hhhhhhhhhcccccCCccccchh-HHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCccccccccccccccccc Q lcl|Aclame:pro 121 TTPVEPQKDGIKKENAKPVSSEE-ILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKP 199 (394) Q Consensus 121 ~~~~~~~~~~~~~~~~~~lvP~~-~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~ 199 (394) ........-++.. ....+-|.. +...|++.+.+.++|+..+++.....+.+..-. ...+.+.+.|..-++..+.++. T Consensus 1 m~~~~~~~~TL~e-~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~-vrt~LP~~~fR~lN~g~~~s~~ 78 (331) T protein:vir:10 1 MPTLSTTNPTLAD-VAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTT-VRSGLPTGTWRKLNYGVQPEKS 78 (331) T ss_pred CCccccCcccHHH-HHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceee-EEeccCCchhhccCCccCcccc Confidence 0000000000000 000111222 345699999999999999999887666543322 2234445666666666667889 Q ss_pred ccceeeecHhhhhhhhhhhHHHHhccH--HHHHHHHHHHHHHHHHHHHHHHHhhccccccc------------------- Q lcl|Aclame:pro 200 DFKDVAWNIDTYRGAIPLSQESIDDAD--VDLVGIVSESISQIKVNTTNDAIAKVLKSFTT------------------- 258 (394) Q Consensus 200 ~~~~v~~~~~~~~~~~~vs~ell~ds~--~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~------------------- 258 (394) ++.+++-..+-+++.+.|.+.+.+... .++...-.....+.+.......+++|+.+..| T Consensus 79 tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a~~~~ 158 (331) T protein:vir:10 79 RTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSAENGQ 158 (331) T ss_pred eeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhcccccccccc Confidence 999999999999999999998887543 22334344445566666666665555322100 Q ss_pred ------------------------------cc------------------------------------------------ Q lcl|Aclame:pro 259 ------------------------------KT------------------------------------------------ 260 (394) Q Consensus 259 ------------------------------~~------------------------------------------------ 260 (394) .+ T Consensus 159 q~IdaGgtG~~~TSI~~v~~~~~~~~giyPkG~~~Gl~~~d~g~~~~~~~~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~ 238 (331) T protein:vir:10 159 NIIDAGGTGSDNASIWLTVWGPNTLHTIYPKGSQAGLQSRDLGEDTLIDAAGGRYQGYRTHYKWDIGLTLRDWRYVVRIA 238 (331) T ss_pred ceeecCCCCCCceEEEEEEEcCCeeEEecccccccCceEeecCceeeecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEe Confidence 00 Q ss_pred ----------cccHHHHHHHHHhhhh----h-hcccEEEEcHHHHHHHHhh-hcc-CCceeecccccCCCcccccccceE Q lcl|Aclame:pro 261 ----------VKNLDEIKALLNGGFD----P-AYNVSLIVSQSFYQTLDTL-KDG-NGRYLLQDDITAVSGKVLLGKPVF 323 (394) Q Consensus 261 ----------~~~~~~i~~~~~~~~~----~-~~~a~~vm~~~~~~~l~~l-kd~-~G~~l~~~~~~~~~~~~l~G~pV~ 323 (394) ..+..++++++..+.. . ..+.+|+||+.....|++. .+. +.+.+-.....+...-.++|.||. T Consensus 239 NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir 318 (331) T protein:vir:10 239 NVDVSELTKNASAGADLIDLMTQAVELIPNVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCR 318 (331) T ss_pred ccchhccCCCcchhhhHHHHHHHHHHHhcccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEE Confidence 0000123333222221 1 1257899999999999875 343 323333223344455679999999 Q ss_pred EecCcccccCceE Q lcl|Aclame:pro 324 VLSDEVLGANKAF 336 (394) Q Consensus 324 ~~~~~~~~~~~~~ 336 (394) .++........++ T Consensus 319 ~~dai~~tE~~Vv 331 (331) T protein:vir:10 319 RTDALLLTEARVV 331 (331) T ss_pred EeeeeecCccccC Confidence 9875444333333 No 155 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=97.94 E-value=4.2e-07 Score=55.54 Aligned_cols=271 Identities=10% Similarity=0.035 Sum_probs=137.9 Q ss_pred HHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCC-ceeEEEEecCCCcccccccccccccc Q lcl|Aclame:pro 118 INETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKA-SGKYPVLQRATTKMVTVAELEKNPAL 196 (394) Q Consensus 118 ~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~e~~~~~~~ 196 (394) +...........+ .+.....+--+.+..++.....+.+.++++.++.++.++ +.++|+..... ..+...|.+ +.- T Consensus 1 Ms~~n~~t~~~~~-~s~~~~al~le~f~geV~taF~~~si~~~~~~vrti~~GkS~qf~~iG~~~--a~y~~~G~~-ldg 76 (402) T protein:vir:97 1 MSTPNTLTNVAVS-ASGEVDSLLIEKFNGKVNEQYLKGENILSYFDVQTVTGTNTVSNKYLGETE--LQVLAPGQS-PNA 76 (402) T ss_pred CCCcccccccccc-cccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeecccceEEEEEEeeeE--Eeeeccccc-cCC Confidence 1111111110111 111222344578888998888888999999999887765 67777763322 233322222 222 Q ss_pred cccccceeeecHhhhh-hhhhhhH--HHHhccHHH-HHHHHHHHHHHHHHHHHHHHHhhccc------cc---------- Q lcl|Aclame:pro 197 AKPDFKDVAWNIDTYR-GAIPLSQ--ESIDDADVD-LVGIVSESISQIKVNTTNDAIAKVLK------SF---------- 256 (394) Q Consensus 197 ~~~~~~~v~~~~~~~~-~~~~vs~--ell~ds~~~-l~~~i~~~l~~~~~~~~~~a~~~g~~------~~---------- 256 (394) ..+.-++..+....+- .-..|.+ +.. +.+| +.+.+.+.+++++++..|..++.-.- +. T Consensus 77 ~~~~~~k~~ItID~lL~a~~~V~diDeaq--~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~~~~~~~~~ 154 (402) T protein:vir:97 77 TPTQADKNQLVIDTTVIARNTVAHIHDVQ--GDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAERNKPRVKG 154 (402) T ss_pred CCcccccEEEEeCceeechhhhhhHHHHH--hcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccCcccc Confidence 3344555555554432 1122221 222 3455 68899999999999988887643110 00 Q ss_pred ----cccc------cccHHHHH----HHHHhhhh---hhcccEEEEcHHHHHHHHhhhccC-Cceeec--ccccCCCccc Q lcl|Aclame:pro 257 ----TTKT------VKNLDEIK----ALLNGGFD---PAYNVSLIVSQSFYQTLDTLKDGN-GRYLLQ--DDITAVSGKV 316 (394) Q Consensus 257 ----~~~~------~~~~~~i~----~~~~~~~~---~~~~a~~vm~~~~~~~l~~lkd~~-G~~l~~--~~~~~~~~~~ 316 (394) .+.+ .++.+.+. ++...+-. |...-+++++|..|..|.+-.+=. -.|... .....+...+ T Consensus 155 ~g~s~~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~rl~n~d~~~~~~g~~~~G~v~~ 234 (402) T protein:vir:97 155 HGFSINVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDADRIVDKTYTISQSGATINGFVLS 234 (402) T ss_pred cccccccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcccccchhhccccCCccccceeEE Confidence 0000 11223333 33222222 122357899999999887632210 011101 1133444568 Q ss_pred ccccceEEecCccccc-------------C--ceEEEecccc-EEEEeecceE-EEE-------e-ecccccceEEEEEE Q lcl|Aclame:pro 317 LLGKPVFVLSDEVLGA-------------N--KAFIGDFKRG-VLFADRKDLG-LRW-------A-DNEIYGQYLQAVLR 371 (394) Q Consensus 317 l~G~pV~~~~~~~~~~-------------~--~~~~gd~~~~-~~~~~~~~~~-i~~-------~-~~~~~~~~~r~~~r 371 (394) ++|+||+.+.+.+... + .-+-|||+.. .+++.+.-+. ++. . +...+...+..++- T Consensus 235 v~Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~~~vT~~~~~d~r~~~~~id~~~a 314 (402) T protein:vir:97 235 SYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRTIEVTGDIFYEKKEKTYYIDTFMA 314 (402) T ss_pred EeceEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEeeccccchhhchhHHHHHHHHHHH Confidence 9999998876543211 1 1233676643 2233332211 111 1 11122223556666 Q ss_pred eccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 372 FGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 372 ~d~~v~~~~af~~l~~~~~~~~~ 394 (394) +|..+.||++...++..--.||- T Consensus 315 ~G~g~~RPeaa~vv~~~~~~t~~ 337 (402) T protein:vir:97 315 EGAIPDRWEAVSVVTTKRDATTG 337 (402) T ss_pred hCCcccCccceEEEEEecccccc Confidence 89999999999988776644443 No 156 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=97.91 E-value=6.2e-07 Score=54.62 Aligned_cols=214 Identities=6% Similarity=-0.063 Sum_probs=120.0 Q ss_pred HHhhhhhhhhhhccccc-CCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCcccccccccccccc Q lcl|Aclame:pro 118 INETTPVEPQKDGIKKE-NAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPAL 196 (394) Q Consensus 118 ~~~~~~~~~~~~~~~~~-~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~ 196 (394) +. ..... ..+-. ....+.|......|+|.+.+.++|+..+++.....+++...... .+.+.+.|-.-++..+. T Consensus 1 m~---~~~~~--a~TL~E~Akr~~~d~~~~~IIE~l~~tneIL~~lpf~e~N~~tg~~~~vr-t~LP~~~fR~lN~g~~~ 74 (335) T protein:vir:73 1 MA---LIGQT--LPSLLDIYNRTDKNGRIARIVEQLAKTNDILTDAIYVPCNDGSKHKTTIR-AGIPEPVWRRYNQGVQP 74 (335) T ss_pred CC---cCCCC--chhHHHHHhhcCcchhHHHHHHHHhcCchHHhhcchhcccCCcccceeEE-EecCCchhhhcCCcccc Confidence 00 00000 00000 01123345556679999999999999988887655554433222 23344556555555566 Q ss_pred cccccceeeecHhhhhhhhhhhHHHHhccH--HHHHHHHHHHHHHHHHHHHHHHHhhccccccc---------------- Q lcl|Aclame:pro 197 AKPDFKDVAWNIDTYRGAIPLSQESIDDAD--VDLVGIVSESISQIKVNTTNDAIAKVLKSFTT---------------- 258 (394) Q Consensus 197 ~~~~~~~v~~~~~~~~~~~~vs~ell~ds~--~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~---------------- 258 (394) ++.++.+++-+.+-+.+...|.+.+.+... .++...-.....+.+.......+++|+.+..| T Consensus 75 s~~tt~qvt~~l~ilgg~~eVDr~La~~~Gn~a~~ra~e~~~~ikam~q~~~~~~iyGDsa~~p~~FdGL~kR~~~~st~ 154 (335) T protein:vir:73 75 TKTQTVPVTDTTGMLYDLGFVDKALADRSNNAAAFRVSENMGKLQGFNNKVARYSIYGNTDAEPEAFMGLAPRFNTLSTS 154 (335) T ss_pred ccceEEEEEEEEEEecchhhhhHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhccCCcCCChhhccchhhhhcCcccc Confidence 889999999999999999999997776443 23444444455666666666666555322110 Q ss_pred ------------------------------------cc------------------------------------------ Q lcl|Aclame:pro 259 ------------------------------------KT------------------------------------------ 260 (394) Q Consensus 259 ------------------------------------~~------------------------------------------ 260 (394) .+ T Consensus 155 ~a~~a~~iIdaGGtG~~~TSi~~v~wg~~~~~giyPkG~kaGl~~~d~g~~~~~d~~G~~y~~~~~~~~w~~Gl~i~d~r 234 (335) T protein:vir:73 155 KAASAENVFSAGGSGSTNTSIWFMSWGENTAHMIYPEGMVAGFQHEDLGDDLVSDGNGGQFRAYRDEFKWDIGLSVRDWR 234 (335) T ss_pred ccCcccceeeccccccCceEEEEEEEcCCeeEEEcccCccccceeeeccceeeecCCCCEEeEEEeeeeeeeeeEEeCcc Confidence 00 Q ss_pred ----------------cccHHHHHHHHHhhh----hhh-c--ccEEEEcHHHHHHHHhh-hccCCceeecccccCCCccc Q lcl|Aclame:pro 261 ----------------VKNLDEIKALLNGGF----DPA-Y--NVSLIVSQSFYQTLDTL-KDGNGRYLLQDDITAVSGKV 316 (394) Q Consensus 261 ----------------~~~~~~i~~~~~~~~----~~~-~--~a~~vm~~~~~~~l~~l-kd~~G~~l~~~~~~~~~~~~ 316 (394) ..+..++++++...+ .|. . +++|+||+.....|++. .+.....+-...+.+...-. T Consensus 235 ~vvRI~NIdvs~l~~d~~~~~~l~~lmi~a~~~~~ip~~~~~~~~~y~n~~v~~~L~~q~~~~~n~~l~~~~~~g~~~t~ 314 (335) T protein:vir:73 235 SISRICNIDVTTLTKDASTGADLISMMVDAYYARDVAMLGDGKEVIYANKTIHAWLHKQAMNAKNVNLTIEEYGGKKIVS 314 (335) T ss_pred cEEEEeecccccccccccchhhHHhhHHHHHHHHhccCCCCCceEEEechHHHHHHHHHHhccCceeeeeeccCCceeEE Confidence 000123334332222 121 2 37899999999999864 45544444333344444457 Q ss_pred ccccceEEecCcccccCceEEE Q lcl|Aclame:pro 317 LLGKPVFVLSDEVLGANKAFIG 338 (394) Q Consensus 317 l~G~pV~~~~~~~~~~~~~~~g 338 (394) ++|+||..+|........ ++. T Consensus 315 ~~gipir~~Dail~tE~~-v~~ 335 (335) T protein:vir:73 315 FLGIPIRRVDAILNTESA-VTA 335 (335) T ss_pred ECCeEEEEEeeeecCccc-ccC Confidence 899999988743333222 222 No 157 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=97.90 E-value=2.4e-06 Score=51.37 Aligned_cols=271 Identities=13% Similarity=0.081 Sum_probs=138.4 Q ss_pred HHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHH-hhhhhhheeeeEeecCCceeEEEEecCCCcccc-------c Q lcl|Aclame:pro 116 MPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVK-TVVDLKPFTTVYQAKKASGKYPVLQRATTKMVT-------V 187 (394) Q Consensus 116 ~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~-~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~ 187 (394) ..+...-. . -...+..-....+ ++|...+..... ..+.|++-++...-..++..+-.+......... . T Consensus 1 ~~~~~~~~--~-~~~Ms~~i~~~fv-~qy~~~v~~~~qq~~s~L~~tV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (322) T protein:vir:10 1 MKLNAIMS--M-LPLIAGDIDQAFV-QTYETTLRILSQQKSAKLKQYCQHKNESSESHNWETLASMDPDAVKRKRSRQQS 76 (322) T ss_pred Ccccceee--e-eeeeechhhhHHH-HHHHHHHHHHHHHhhhhhhcccccccccccccceeecccccccccccccccccc Confidence 00000000 0 0001112233344 666677665544 455577766544433333222222111100000 0 Q ss_pred ccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcc-ccc---cc----- Q lcl|Aclame:pro 188 AELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVL-KSF---TT----- 258 (394) Q Consensus 188 ~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~-~~~---~~----- 258 (394) ..+.-..+.....++.+.......+....|.+.-.-....|..+...+..+.++++..|..++.+. +.+ ++ T Consensus 77 ~d~~~dtp~~~~~~~~r~~~~~d~~~~~~VDd~D~~k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~~~~gt~v~ 156 (322) T protein:vir:10 77 ADGTYPTPVNNKPFAKRRTNVDTYDTGHVVEQEDISQMLLDPNSALITSQAYAMARKTDDLIIAGAWKPASIKGTGQPVE 156 (322) T ss_pred cCcccCCCccccccceEEEeecccccceecchHHHHHhhcCchHHHHHHHHHHhhhHHHHHHHhhhhccccccccccccc Confidence 011101111111233333333333445677776555566788899999999999998888776532 111 10 Q ss_pred ----------cccccHHHHHHHHHhhhhhhc----ccEEEEcHHHHHHHHhhhc-cCCceeecccc-cCCCcccccccce Q lcl|Aclame:pro 259 ----------KTVKNLDEIKALLNGGFDPAY----NVSLIVSQSFYQTLDTLKD-GNGRYLLQDDI-TAVSGKVLLGKPV 322 (394) Q Consensus 259 ----------~~~~~~~~i~~~~~~~~~~~~----~a~~vm~~~~~~~l~~lkd-~~G~~l~~~~~-~~~~~~~l~G~pV 322 (394) .+..+++.++++...+..... +-.++++|..|..|..... ++..|.-...+ ..|..++++|+.+ T Consensus 157 ~~ss~~i~~g~~g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~~l~~~G~ig~~lGf~~ 236 (322) T protein:vir:10 157 FLATQEIGDGTKPISFDYVTEITERFLENEIEPEVSKVIVIGPTQARKLLQITEATSADYTSAMDLQSKGIITNWMGYTW 236 (322) T ss_pred cCCCcccccCccchhHHHHHHHHHHHHhcCCCCCCCeEEEeCHHHHHHHhcchhhhhhhcccchhhhhcCeeeeeeeEEE Confidence 112345666665544333222 1257789999988765432 23334433334 3355679999999 Q ss_pred EEecCccccc----------------CceEEEeccccEEEEeecceEEEEeeccccc--ceEEEEEEeccEEecccceEE Q lcl|Aclame:pro 323 FVLSDEVLGA----------------NKAFIGDFKRGVLFADRKDLGLRWADNEIYG--QYLQAVLRFGVSKVDDKAGYY 384 (394) Q Consensus 323 ~~~~~~~~~~----------------~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~--~~~r~~~r~d~~v~~~~af~~ 384 (394) +.+...+... ...+++. ++++.++...+++.+.+..+... ..++..+-+|..+++|+.++. T Consensus 237 i~s~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~-k~Av~~a~~~dv~~~i~~~~~~~~a~~I~~~~~~Ga~ri~~~gVv~ 315 (322) T protein:vir:10 237 IVSTRLDKFDPTQWGMAAEDGPQGDEIWCIAMT-DMALGYHSCKDIWTKVAEDPSASFAWRIYSAFTADCVRVEDEHIFK 315 (322) T ss_pred EEeccCCccccccccccccCCCCccceeEEEEe-cCceeEEEeeeeeEEeeccCCcchhhhhhhhhhhCceEeccCcEEE Confidence 8865433111 1123333 23455665666666665544433 335566889999999999999 Q ss_pred EEecCcc Q lcl|Aclame:pro 385 VTFTPEP 391 (394) Q Consensus 385 l~~~~~~ 391 (394) +...-.. T Consensus 316 i~~~e~~ 322 (322) T protein:vir:10 316 LRLKNSL 322 (322) T ss_pred EEEeccC Confidence 9998887 No 158 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=97.69 E-value=3.3e-06 Score=50.66 Aligned_cols=261 Identities=10% Similarity=-0.031 Sum_probs=131.7 Q ss_pred hccc-ccCCccccc---hhHHhHHHHHHHhhhhhhheeeeEe---ecCCceeEEEEecCCCccccccccccccccccccc Q lcl|Aclame:pro 129 DGIK-KENAKPVSS---EEILYTPAREVKTVVDLKPFTTVYQ---AKKASGKYPVLQRATTKMVTVAELEKNPALAKPDF 201 (394) Q Consensus 129 ~~~~-~~~~~~lvP---~~~~~~I~~~~~~~~~l~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~ 201 (394) .+.. ...++.+.- +.+.+.+++.....-.-+.++.+.. -...++.+.+.. ..+.+.++..++...+..+..+ T Consensus 1 ~~~~~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~-~~G~a~~~~~~~~dip~v~~~~ 79 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFD-GVGIAQIVADYTDDLPLVDALA 79 (296) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeee-ccCceeEeCCCccccceeeccc Confidence 1111 111122222 2334455554444444444444332 111234444433 2344555565555444556677 Q ss_pred ceeeecHhhhhhhhhhhHHHHhcc---HHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc------------cc--- Q lcl|Aclame:pro 202 KDVAWNIDTYRGAIPLSQESIDDA---DVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTKTV------------KN--- 263 (394) Q Consensus 202 ~~v~~~~~~~~~~~~vs~ell~ds---~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~~------------~~--- 263 (394) +......+.++..+.++..=++.+ ..+|..--....+.++...+|..++.|....+..|. .+ T Consensus 80 ~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g~~GLlN~p~v~~~~~~~~W~~ 159 (296) T protein:vir:10 80 TERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHGIPSVFDYPNINNVVSGGSWSQ 159 (296) T ss_pred eeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEeecCCCccccccCCccC Confidence 888888888888888886444333 235777777777888888888888777643221111 11 Q ss_pred ----HHHHHHHHHhhhhhh---c-ccEEEEcHHHHHHHHhhhccCCceeecccccCCCcccccccceEEecCcccccCce Q lcl|Aclame:pro 264 ----LDEIKALLNGGFDPA---Y-NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKA 335 (394) Q Consensus 264 ----~~~i~~~~~~~~~~~---~-~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~ 335 (394) +++|..++..+.... . .-.++++|..+..|....+..|.-++..--.+..+.+|.+.|..... ...+.... T Consensus 160 ~t~i~~Di~~~~~~l~~~s~g~~~p~~l~L~p~~~~~L~~~~~~~~~t~l~~ik~~~~~l~i~~~~~l~~a-~~~g~~~~ 238 (296) T protein:vir:10 160 PTTAVSDITSLLDIIETSTNGQHRATHLLLPTTARRIMQNLVPGTSVSYGEFFRQNNSGVTVEFVQYLNDY-NGTGTSAA 238 (296) T ss_pred HHHHHHHHHHHHHHHHHhhCceecceeEEeCHHHHHHHhhccCCCCccHHHHHHHhcCCceEEEeeeeccC-CCCcceEE Confidence 455666655443321 1 24789999999988766555554333211111122344444432211 11122333 Q ss_pred EEEeccccEE-EEeecceEEEEeecccccceEEEEEEec-cEEecccceEEEEecCcc Q lcl|Aclame:pro 336 FIGDFKRGVL-FADRKDLGLRWADNEIYGQYLQAVLRFG-VSKVDDKAGYYVTFTPEP 391 (394) Q Consensus 336 ~~gd~~~~~~-~~~~~~~~i~~~~~~~~~~~~r~~~r~d-~~v~~~~af~~l~~~~~~ 391 (394) ++.+-+.-++ +.--++++............++++.|++ ..+.+|.||+.++.-+=+ T Consensus 239 v~~~~~~~~~~~~v~~~~~~~~~e~~~l~~~~~~~~~~~Gv~i~~P~ai~~~dGI~~~ 296 (296) T protein:vir:10 239 IAYEKDPNNMAIEIPEATNALPAQPKDLHFKIPVTSKATGLIVYRPLTMAVMKGITFA 296 (296) T ss_pred EEEEcCCceEEEEcCcceeeecccccCceEEEeeEeeEEEEEEECCceeEEEeeeecC Confidence 3333222222 2212333222221122223357788885 789999999999666555 No 159 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=97.69 E-value=2.6e-06 Score=51.20 Aligned_cols=271 Identities=10% Similarity=0.038 Sum_probs=133.8 Q ss_pred HHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCC-ceeEEEEecCCCcccccccccccccc Q lcl|Aclame:pro 118 INETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKA-SGKYPVLQRATTKMVTVAELEKNPAL 196 (394) Q Consensus 118 ~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~e~~~~~~~ 196 (394) +........... ..+...-.+--+.+..++.....+.+.++++..+.++.++ ++++|+... ........|.+ +.- T Consensus 1 Ms~~n~~t~~~~-~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~vRti~~gkS~qf~~~G~--s~~~~~~pG~~-ld~ 76 (401) T protein:vir:70 1 MSTPNNLTNVAV-SASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTGTNTVSNKYLGE--TELQVLAPGQS-PAA 76 (401) T ss_pred CCCCcccccccc-ccccchhHhHHhHhcchHHHHHHHHhhhcccceeeeecccceEEEEEeee--eEeeeecCCCC-cCC Confidence 111111000011 1112223466678888888888888999999999998766 677777532 22333333333 222 Q ss_pred cccccceeeecHhhh-hhhhhhhHHHHhccHHH-HHHHHHHHHHHHHHHHHHHHHhhccc------c----ccc------ Q lcl|Aclame:pro 197 AKPDFKDVAWNIDTY-RGAIPLSQESIDDADVD-LVGIVSESISQIKVNTTNDAIAKVLK------S----FTT------ 258 (394) Q Consensus 197 ~~~~~~~v~~~~~~~-~~~~~vs~ell~ds~~~-l~~~i~~~l~~~~~~~~~~a~~~g~~------~----~~~------ 258 (394) ..+.-++..+...++ +.-..|.+=---++.+| +.+.+.+.+++++++..|..++...- + ..+ T Consensus 77 ~~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~~~p~~~~~G 156 (401) T protein:vir:70 77 TSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKRTNPRVKGHG 156 (401) T ss_pred CCcccccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccCCCcCCCc Confidence 334555554444433 22222221111123455 67888899999998888876532220 0 000 Q ss_pred ----------cccccHHHHH----HHHHhhhh---hhcccEEEEcHHHHHHHHhh---hccCCceeec--ccccCCCccc Q lcl|Aclame:pro 259 ----------KTVKNLDEIK----ALLNGGFD---PAYNVSLIVSQSFYQTLDTL---KDGNGRYLLQ--DDITAVSGKV 316 (394) Q Consensus 259 ----------~~~~~~~~i~----~~~~~~~~---~~~~a~~vm~~~~~~~l~~l---kd~~G~~l~~--~~~~~~~~~~ 316 (394) .+.++.+.+. ++...+.. |.-.-++++.|.-|..|..- -|.. |-.. .....+...+ T Consensus 157 ~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~r~vvl~pp~~Ys~Ll~~d~L~nrd--~~~s~~g~~~~G~v~~ 234 (401) T protein:vir:70 157 FSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDKT--YTISQSGATIQGFTLS 234 (401) T ss_pred eEEeccccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhcCcccchh--hccccCCccccceEEE Confidence 0111222333 33322221 11234666777777666442 2111 1100 1123344468 Q ss_pred ccccceEEecCcccc---------------cCceEEEeccccE-EEEeecceE-EEE-------e-ecccccceEEEEEE Q lcl|Aclame:pro 317 LLGKPVFVLSDEVLG---------------ANKAFIGDFKRGV-LFADRKDLG-LRW-------A-DNEIYGQYLQAVLR 371 (394) Q Consensus 317 l~G~pV~~~~~~~~~---------------~~~~~~gd~~~~~-~~~~~~~~~-i~~-------~-~~~~~~~~~r~~~r 371 (394) +.|+||+.+.+.+.. ...-+-|||+..+ ++|.+.-+- ++. . +...+...+.+++- T Consensus 235 vaGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~~~~~d~r~~~~~id~~~a 314 (401) T protein:vir:70 235 SYNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVTGDIFYEKKEKTYYIDTFMA 314 (401) T ss_pred EeceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeeccccchhhhhhhhHHHHHHHHH Confidence 999999987654322 1122337776542 233332221 221 1 11122233566667 Q ss_pred eccEEecccceEEEEecCc---c----CCC Q lcl|Aclame:pro 372 FGVSKVDDKAGYYVTFTPE---P----LPL 394 (394) Q Consensus 372 ~d~~v~~~~af~~l~~~~~---~----~~~ 394 (394) +|..+.||+|.+.++.+-+ + +|+ T Consensus 315 ~g~g~~RPeaa~vv~~k~~~~~~~~~~~~~ 344 (401) T protein:vir:70 315 EGAIPDRWEAVSVVTTKRNTTTGAVEGTDG 344 (401) T ss_pred hCCcccchhheEEEeecCcccccccccCCc Confidence 8999999999888754433 2 332 No 160 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=97.68 E-value=6.5e-06 Score=49.02 Aligned_cols=266 Identities=9% Similarity=-0.036 Sum_probs=133.3 Q ss_pred hhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCcccccccccccccccccccce Q lcl|Aclame:pro 124 VEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKD 203 (394) Q Consensus 124 ~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~ 203 (394) ....+.+.. +......-..+++.|...-....|+..++......+..+.|+.-.-.........||...+......-.. T Consensus 1 ma~~~~~~~-t~~~~g~~~dl~~~I~~isp~dTPf~S~i~~~~a~~~~~~W~~d~l~~~~~~~~~EG~da~~~~~~~r~~ 79 (317) T protein:vir:88 1 MATPTNAVS-TVEINGKREDLIDIIYNIAPYDTPFMSAIGKGVATAITHEWQTDELRQPGKNTRVEGEDATIKAGSFTTM 79 (317) T ss_pred CCccccceE-eeeeeeeeechhhhheecCCccCcceeeecCceecccEEEEEeeecCCccccccccCcccccccccCCEE Confidence 000001111 1223345566788888887888899999888777777777776444444334445555444321111111 Q ss_pred eeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHH---HHHHHHHHHHHhhcccc-----cc------------------ Q lcl|Aclame:pro 204 VAWNIDTYRGAIPLSQESIDDADVDLVGIVSESIS---QIKVNTTNDAIAKVLKS-----FT------------------ 257 (394) Q Consensus 204 v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~---~~~~~~~~~a~~~g~~~-----~~------------------ 257 (394) +.=-..-+...+.||..+..-+.......+...++ ..+.+..+.++++|.-+ .+ T Consensus 80 ~~N~tQIf~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~~~i~t~~~~ 159 (317) T protein:vir:88 80 LNNYCQISDETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIFAYYKTNGSL 159 (317) T ss_pred eccEEEEEEeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHHHHhccCcee Confidence 11111122333344433322112222222222222 22333344444444211 00 Q ss_pred ------------------ccccccHHHHHHHHHhhhhhhcc-cEEEEcHHHHHHHHhhhccCCceeecccccCC------ Q lcl|Aclame:pro 258 ------------------TKTVKNLDEIKALLNGGFDPAYN-VSLIVSQSFYQTLDTLKDGNGRYLLQDDITAV------ 312 (394) Q Consensus 258 ------------------~~~~~~~~~i~~~~~~~~~~~~~-a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~------ 312 (394) .....+-+++.+++.++.....+ ..+++|+.....|..+...++.++..+.-... T Consensus 160 ~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~Gg~~~~i~v~a~~k~~i~~~~~~~~~~i~~~~~~~~~g~~v~ 239 (317) T protein:vir:88 160 GANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNGGQANSIQTSSSIKKAISKNMKGRATEITLDASDNRIAQTVD 239 (317) T ss_pred ccCccccccCCCccccccccccccHHHHHHHHHHHHhcCCCCCEEEeChHHHHHHHHHhcCCceeEEEcccCeEEEEEEE Confidence 01123556677888777776543 35788999999999885444555532211000 Q ss_pred CcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeecccccceEEEEEEeccEEecccceEEEEecCccC Q lcl|Aclame:pro 313 SGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEIYGQYLQAVLRFGVSKVDDKAGYYVTFTPEPL 392 (394) Q Consensus 313 ~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~r~~~r~d~~v~~~~af~~l~~~~~~~ 392 (394) ..-+=+|. |.++.+...+++.+++.|+.. +-+..-.++..+..--.+......++..++..+.+|+|..+++..++.- T Consensus 240 ~~~tdfG~-v~ii~~r~lp~~~~~~~D~~~-~~l~~Lr~~~~e~laKtGd~~k~~i~~E~tLe~~N~~a~a~i~~l~~~~ 317 (317) T protein:vir:88 240 VYESDFGK-YTIRANRWFHENTLFVFDPKM-HSLCYLRPFFQHELAKTGDSEKRQLLVEYTFRVNNEKSGALIRDVVAQL 317 (317) T ss_pred EEEeCCeE-EEEEeCCCCCCCeEEEEcccc-cceeecccceeeccCCCcccceeEEEEEEEEEEcCccceeEEEEecccC Confidence 00111341 333445666788888888874 2222112222222222223334567788999999999999999777666 No 161 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=97.62 E-value=5.5e-06 Score=49.42 Aligned_cols=257 Identities=16% Similarity=0.118 Sum_probs=126.9 Q ss_pred hhhh-cccc-cCCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCc----eeEEEEecCCCccccccccccccccccc Q lcl|Aclame:pro 126 PQKD-GIKK-ENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKAS----GKYPVLQRATTKMVTVAELEKNPALAKP 199 (394) Q Consensus 126 ~~~~-~~~~-~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~e~~~~~~~~~~ 199 (394) +... +++. ..-+..+--++.+.+-..+.+...+++..+..|+..++ +++|.. ...+.+..++||..+| .+.. T Consensus 1 M~~e~nl~~~~dL~~a~siDF~~~f~~~i~~L~~~LGv~r~~pla~Gt~iktyK~~~~-~y~gda~dVaEGe~Ip-lskv 78 (303) T protein:vir:10 1 MSAENNLINVEALGKAKSIDFANKLGVGLNKLFEALAIQNKIPMNVGSALKQYRFKVE-DSEKPNGDVAEGDVIP-LTKV 78 (303) T ss_pred CCCCcCCcchhhcccceeehhhhhhhhhHHHHHHHhhhhccccccCCceeeeeeeece-eeccccccccCCcccc-hhhh Confidence 1111 1111 11112223344555555555555555666777776653 233322 1234455666766666 5666 Q ss_pred ccc---eeeecHhhhhhhhhhhHHHHhccHH-HHHHHHHHHHHHHHHHHHHHHHhhcccccc------ccccccHHHHHH Q lcl|Aclame:pro 200 DFK---DVAWNIDTYRGAIPLSQESIDDADV-DLVGIVSESISQIKVNTTNDAIAKVLKSFT------TKTVKNLDEIKA 269 (394) Q Consensus 200 ~~~---~v~~~~~~~~~~~~vs~ell~ds~~-~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~------~~~~~~~~~i~~ 269 (394) +-+ ..+++.+|++.-+ |.|.++.+.+ +-...-.+.|...+.+..+..++....+++ ..+-++.+.+.. T Consensus 79 t~~~~~t~~~~~kK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~Iq~kIdnd~~~~lktaT~t~~~t~~t~~s~~glq~ 156 (303) T protein:vir:10 79 TREQVDITELQFAKYRKST--SAEAIQAHGYDLAINQTDNEMIKYVQKKFRAKFFETLKSAIENGKRTNKTKLSAENLQG 156 (303) T ss_pred eeeecceEEEEeecccccc--cHHHHHhhcCCchhHHHHHHHHHHHHhhhhHHHHHHHhhcccccccccceeecHHHHHH Confidence 643 4788888888765 9999865543 456777788888888888887776664442 334456777777 Q ss_pred HHHhhh-------hhhcccEEEEcHHHHHHHHhhhccCCc-eeecccccCCCcccccccceEEecCcccccCceE----- Q lcl|Aclame:pro 270 LLNGGF-------DPAYNVSLIVSQSFYQTLDTLKDGNGR-YLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAF----- 336 (394) Q Consensus 270 ~~~~~~-------~~~~~a~~vm~~~~~~~l~~lkd~~G~-~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~----- 336 (394) ++.... ....+.+.++||.+.+.++.-..-+.+ --| +++- --.++|.-|+.+... +.+.+| T Consensus 157 Al~~~~~kl~~~~ed~~~~V~FvNP~Daa~yl~~A~i~~~~t~f--G~n~--L~nfLG~~II~S~kv--~~G~~~~T~~~ 230 (303) T protein:vir:10 157 ALSKGRANLSVLLDDEITPIAFVNPNDTAEYLANGFINSTGAQF--GVNL--LTPYVGVKIVEFADV--PQGEVWMTVAE 230 (303) T ss_pred HHHhhhhhccccccccccEEEEEchHHHHHHhhcCCcchhhhhh--hhhh--hhhhhcceEEEeccC--CCceEEEeecc Confidence 665432 222357899999999887642211111 011 0000 013788877664432 333222 Q ss_pred -----E----EeccccEEEE-eecceEEEEeecccccceEEEE-EEecc---EEecccceEEEEecCcc---CCC Q lcl|Aclame:pro 337 -----I----GDFKRGVLFA-DRKDLGLRWADNEIYGQYLQAV-LRFGV---SKVDDKAGYYVTFTPEP---LPL 394 (394) Q Consensus 337 -----~----gd~~~~~~~~-~~~~~~i~~~~~~~~~~~~r~~-~r~d~---~v~~~~af~~l~~~~~~---~~~ 394 (394) + ||+++++.+. |.-|+ |-. .|.....++-+. .-+.+ =+-+++++++.++++.= +|- T Consensus 231 Ni~~ay~~~~g~l~~~f~~t~D~tgl-IGv-~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv~~ti~~~e~~~~~~ 303 (303) T protein:vir:10 231 NLNVAYANPRGELSRAFAFATDATGF-VGV-LHDIQPQRLTSDTIYASAISMFPENIDAVIKVTIKKDEAGELPS 303 (303) T ss_pred ceEEEEecCchhhhhhhhhccccccc-eEE-EeccccceeeehhHhHhHHHhcccccceEEEEEEeccccCCCCC Confidence 1 2222211111 11111 000 000000000000 00112 23446788999995544 444 No 162 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=97.61 E-value=2.8e-06 Score=51.06 Aligned_cols=254 Identities=16% Similarity=0.043 Sum_probs=127.6 Q ss_pred hhh-cccccCCccccchhH---HhHHHHHHHhhhhhhheeeeEeecCC-ceeEEEEecCCCccccccccccccccccccc Q lcl|Aclame:pro 127 QKD-GIKKENAKPVSSEEI---LYTPAREVKTVVDLKPFTTVYQAKKA-SGKYPVLQRATTKMVTVAELEKNPALAKPDF 201 (394) Q Consensus 127 ~~~-~~~~~~~~~lvP~~~---~~~I~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~ 201 (394) ++. +++. ...|.+... .+.+...+.+...+++..+..|+..+ ..++|.+. -.+.+..++||...| .+..+. T Consensus 1 mAe~nlt~--~~dL~~~~sidfv~~f~~~i~~L~~~Lgi~r~~p~a~G~tIt~pK~~-~tgda~dVaEGe~Ip-lskvt~ 76 (295) T protein:vir:99 1 MAEKNLNT--MADLGDIKSIDFVNKFSKNINDLLKLLGVTRRETLTNDLKIQTYKWE-VTLDQTDPGEGETIP-LSKVTR 76 (295) T ss_pred CCCccccc--HhhccCceeehhhHHhhhhHHHHHHHhccccccccccCCeEEeeeee-eecccccccCCcccc-hhhhee Confidence 111 1111 112332222 22333333344445555677787765 56666543 345556677777776 566665 Q ss_pred c---eeeecHhhhhhhhhhhHHHHhccHH-HHHHHHHHHHHHHHHHHHHHHHhhcccccccccc-c----cHHHHHHHHH Q lcl|Aclame:pro 202 K---DVAWNIDTYRGAIPLSQESIDDADV-DLVGIVSESISQIKVNTTNDAIAKVLKSFTTKTV-K----NLDEIKALLN 272 (394) Q Consensus 202 ~---~v~~~~~~~~~~~~vs~ell~ds~~-~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~~-~----~~~~i~~~~~ 272 (394) + ..+++.+|++.-+ |.|.++.|.+ +-...-.+.|...+.+..+..++....+++.+.. . .+..+++.+. T Consensus 77 ~~~~t~t~kikK~rK~t--TdEAIqlsGygdpvgead~qL~~~ia~kId~D~~~~lktat~t~tg~~lq~a~a~~~~al~ 154 (295) T protein:vir:99 77 TKDKDYTVKWFKKRRAT--TAEAIARHGAARAITEADKRIMRELQNGIKDAFFTFLKTKPTKVKGVGLQKALSASWAKLA 154 (295) T ss_pred eeeeeeEEEeeeecccc--cHHHHHhcCCCchhHHHHHHHHHHHHHhhhHHHHHHhccCceeeehhhHHHHHHHhhhhhh Confidence 4 3777778877754 9999865543 4567788889999999999988888766543321 1 1222222233 Q ss_pred hhhhhhc-ccEEEEcHHHHHHHHhhhccC--CceeecccccCCCcccccccc-eEEecCcccccCceE--------E--- Q lcl|Aclame:pro 273 GGFDPAY-NVSLIVSQSFYQTLDTLKDGN--GRYLLQDDITAVSGKVLLGKP-VFVLSDEVLGANKAF--------I--- 337 (394) Q Consensus 273 ~~~~~~~-~a~~vm~~~~~~~l~~lkd~~--G~~l~~~~~~~~~~~~l~G~p-V~~~~~~~~~~~~~~--------~--- 337 (394) ....... +.+.++||.+...++.-..-+ ..-.|.-.+- -.++|.- |+.+... +.+.+| + T Consensus 155 ~f~Ee~~~~~V~FVnP~D~a~yl~~A~~~~~~a~~fG~~~L----~nfLG~q~II~S~kv--~~G~~~aT~~~Ni~~ay~ 228 (295) T protein:vir:99 155 TFNEFEGSPLVSFVSPLDVANYLGDTKVGADASNVFGMTLL----KNFLGMQNVIVMPSV--PEGKIYSTAVENLVFASL 228 (295) T ss_pred hcccccCCceEEEEehHHHHHHHhccccccchhhhhhhhhh----hhhhccceEEEcccC--CCceEEEeeccceEEEEe Confidence 3222222 468999999998877532211 1101100000 1378987 5554432 222222 1 Q ss_pred ----EeccccEEEE-eecceE-EEEe---ecccccceEEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 338 ----GDFKRGVLFA-DRKDLG-LRWA---DNEIYGQYLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 338 ----gd~~~~~~~~-~~~~~~-i~~~---~~~~~~~~~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) ||+.+.+.+. |.-|+. +..+ ++..+.+.+ ..-.-.=+-+++++++.++....+|- T Consensus 229 ~~~~g~l~~~f~~~~D~tglIg~~h~~~~~~~t~et~~--~~~~~lfpE~~dgiv~~tI~~~~~~~ 292 (295) T protein:vir:99 229 NVKGGDLGGLFADFTDETGLIAAARNRQLSNLTYESVF--FGANVLFAEIPEGVVEATIEAAAVPG 292 (295) T ss_pred cCCchhhhhhhhhccCcccceEEEeccccceeeehhhh--HhHHHhcccccceEEEEEEecCcCCC Confidence 3333322211 111110 1110 011111110 00011123456889999999999998 No 163 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=97.55 E-value=1.6e-05 Score=46.81 Aligned_cols=278 Identities=9% Similarity=-0.028 Sum_probs=128.9 Q ss_pred ccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcc---cccCCccccc---hhHHhHHHHHHH Q lcl|Aclame:pro 80 VTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGI---KKENAKPVSS---EEILYTPAREVK 153 (394) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~lvP---~~~~~~I~~~~~ 153 (394) .+ ......+....+. .... ..+. .....+.+.. +.+.+.+++... T Consensus 1 ~~--~~~~~~~~~~~~~--------------------------~~~~-~~~~~~da~~~~g~~~~~ql~~id~~v~e~~~ 51 (319) T protein:vir:10 1 MT--TKKFDEADKSNVE--------------------------MYLI-QAGVKQDAAATMGIWTAQELHRIKSQSYEEDY 51 (319) T ss_pred CC--CcchhHHhhHHHH--------------------------HHHh-hccchhhhhhhhhhHHHHHHHHHHHHHHhhhh Confidence 00 0000000000000 0000 0000 0011111211 334445666555 Q ss_pred hhhhhhheeeeEe---ecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhcc---HH Q lcl|Aclame:pro 154 TVVDLKPFTTVYQ---AKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDA---DV 227 (394) Q Consensus 154 ~~~~l~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds---~~ 227 (394) ....-+.++.+.. -...++.+.... ..+.+.++..++...+..+..++......+.++..+.++..=++.+ .. T Consensus 52 ~~l~~~~~i~v~~~~~~~~~~~~~~~~~-~~G~a~~~~d~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~ 130 (319) T protein:vir:10 52 PVGSALRVFPVTTELSPTDKTFEYMTFD-KVGTAQIIADYTDDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGR 130 (319) T ss_pred cceechhhcccccCCCCceEEEEeeeec-cccceeeecCccccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCC Confidence 5555555554432 222234444433 2345556666665544556677777778888888888875433322 23 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccccccc-------------------ccccHHH----HHHHHHhhhhh--h--cc Q lcl|Aclame:pro 228 DLVGIVSESISQIKVNTTNDAIAKVLKSFTTK-------------------TVKNLDE----IKALLNGGFDP--A--YN 280 (394) Q Consensus 228 ~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~-------------------~~~~~~~----i~~~~~~~~~~--~--~~ 280 (394) +|..--....+.++...+|..++.|....+.. +..+.+. |..++..+... . .. T Consensus 131 ~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~~GLlN~p~~~~~~~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~~~p 210 (319) T protein:vir:10 131 PLSTRKASACQLAHDQLVNRLVFKGSAPHKIVSVFNHPNITKITSGKWIDVSTMKPETAEAELTQAIETIETITRGQHRA 210 (319) T ss_pred ChHHHHHHHHHHHHHHhhceEEEeecccccceeEEeCCCceeeecCCCCCccccCHHHHHHHHHHHHHHHHHhcCceeec Confidence 56777777778888888888877775432111 1112333 44444333322 1 12 Q ss_pred cEEEEcHHHHHHHHhhhccCCceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEE-eecceEEEEeec Q lcl|Aclame:pro 281 VSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFA-DRKDLGLRWADN 359 (394) Q Consensus 281 a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~-~~~~~~i~~~~~ 359 (394) -.++|+|+.+..|.......|.-++..--.+..+-+|.+.|..... ...+.+..++...+.-++-+ --+.++...... T Consensus 211 ~~L~L~p~~~~~L~~~~~~~~~t~l~~lk~~~~~l~I~~~pel~~a-g~~g~~~~v~y~~~~~~~~~~v~~~~~~~~~e~ 289 (319) T protein:vir:10 211 TNILIPPSMRKVLAIRMPETTMSYLDYFKSQNSGIEIDSIAELEDI-DGAGTKGVLVYEKNPMNMSIEIPEAFNMLPAQP 289 (319) T ss_pred eEEEecHHHHHhhhcccCCCCeeHHHHHHHhcCCceEEEeeeeccc-CCCcceEEEEEecCCceEEEecCcceeeeeeee Confidence 4789999999999766555554443211111122345555442211 11122333333322222221 112322221111 Q ss_pred ccccceEEEEEEec-cEEecccceEEEEec Q lcl|Aclame:pro 360 EIYGQYLQAVLRFG-VSKVDDKAGYYVTFT 388 (394) Q Consensus 360 ~~~~~~~r~~~r~d-~~v~~~~af~~l~~~ 388 (394) .........+.|++ ..+.+|.||++++.- T Consensus 290 ~~l~~~~~~~~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 290 KDLHFKVPCTSKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred cCceEEEeeeeeeEEEEEEccceeEeeecC Confidence 11112234567775 578889999999998 No 164 >protein:vir:95131 Length: 325 # NCBI annotation: hypothetical protein ORF010 # Family: family:all:47 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293417;genbank:gi:148912838;genbank:GeneID:5228206 Probab=97.47 E-value=6.1e-05 Score=43.71 Aligned_cols=259 Identities=9% Similarity=-0.015 Sum_probs=98.6 Q ss_pred hcccccCCccccchhHHhHHHHHHHhhhhhhheee-------eEeecCCceeEEEEecCCCcc---cccccccccccccc Q lcl|Aclame:pro 129 DGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTT-------VYQAKKASGKYPVLQRATTKM---VTVAELEKNPALAK 198 (394) Q Consensus 129 ~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~-------~~~~~~~~~~~~~~~~~~~~~---~~~~e~~~~~~~~~ 198 (394) ..+... ...-| .+....++.+.+...+++.+. -.+..+.-++.|.+..-.+.. -.+.+.+......- T Consensus 1 m~lsD~--~vfN~-~~~~a~~e~~~q~~~~fn~as~gai~l~~~~~~Gd~~~~pf~~~l~g~~~~~~~~~~~~~vt~~ki 77 (325) T protein:vir:95 1 MALSDL--AVYSE-YAYSAFSETLRQQVDLFNTATGGAIMLQSAAHQGDFSDVAFFAKVTGGLVRRRNAYGSGTVAEKVL 77 (325) T ss_pred Cchhhh--hhhhh-hhhhhhhhhhhhhHhhhhhcccceeEeccccccCceeeccccccccccccccccCCCCceecccee Confidence 000000 00001 111111222222222222111 111222334566654322211 11222222221122 Q ss_pred cccceeeecHhhhhhhhhhhHHHHh---ccHHHHHHHHHHHHHHHHHHHHHHHHhhcc----cc------------cccc Q lcl|Aclame:pro 199 PDFKDVAWNIDTYRGAIPLSQESID---DADVDLVGIVSESISQIKVNTTNDAIAKVL----KS------------FTTK 259 (394) Q Consensus 199 ~~~~~v~~~~~~~~~~~~vs~ell~---ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~----~~------------~~~~ 259 (394) .+...+......-.+++....+.+. +....+...|.+.+++......-..++.+. +. +... T Consensus 78 tt~~~~av~~~r~~g~~~~d~~~~~~g~~~~~~~~~~Ig~~~a~~~~~~~l~~~~~~l~~a~~~~~~~v~dis~~~~~~~ 157 (325) T protein:vir:95 78 KHLVDTSVKVAAGTPPVRLDPGQFRWIQQNPEVAGAAMGQQLAVDTMADMLNVGLGSVYSALSQVSDVVYDATANTDAAD 157 (325) T ss_pred ccccceeeEEecccCcccccHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccceeeeecccCccc Confidence 3444444444433343333332221 222223333333333322111111111111 11 0011 Q ss_pred ccccHHHHHHHHHhhhhhhcc-cEEEEcHHHHHHHHhhhccCCceeecccccCCCcccccccceEEecCcccccCc---- Q lcl|Aclame:pro 260 TVKNLDEIKALLNGGFDPAYN-VSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANK---- 334 (394) Q Consensus 260 ~~~~~~~i~~~~~~~~~~~~~-a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~---- 334 (394) ...+...+.++..++-+..-. +.|+||..++..|.+..-.+...++..+-.. .-++++|++|++.|+++..... T Consensus 158 ~~~s~~~l~~A~~klGD~~~~l~~~~MHS~v~~~L~~~~L~~~~~~~~~~g~~-~i~t~~G~~VIVdD~~p~~~~g~~~~ 236 (325) T protein:vir:95 158 KLPTWNNLNNGQAKFGDQSSQIAAWIMHSTPMHKLYGSNLTNGERLFTYGTVN-VVRDPFGKLLVMTDSPNLFAAGTPNV 236 (325) T ss_pred ccccHHHHHHHHHHhcccccceeEEEEchHHHHHHHHhhccccccccccCCcc-cccccCCcEEEEeCCCCCCCccCcee Confidence 123456666766665443333 6899999999999876554443343322122 2247899999998866543321 Q ss_pred ---eEEEeccccEEEEeecceEEEEeecccccceEEEEEEecc-EEecccceEEEEecCccCCC Q lcl|Aclame:pro 335 ---AFIGDFKRGVLFADRKDLGLRWADNEIYGQYLQAVLRFGV-SKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 335 ---~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~r~~~r~d~-~v~~~~af~~l~~~~~~~~~ 394 (394) .+||.= ++.+.+..+..... .+......+....|... -++||.++..-+-....+|- T Consensus 237 ytty~lg~G--Ai~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~tf~lhp~G~sw~~s~~g~sPt 297 (325) T protein:vir:95 237 YHILGLVPG--GVLIGQNNDFDANE-ETKNGDENIIRTYQAEWSYNIGVKGFAWDKANGGKSPT 297 (325) T ss_pred EEEEEEecC--eEEecCCCCccccc-cccCcccceeeeeeeeeeEEeecceeeeecccccCCcC Confidence 122210 11111212211111 11111222222223222 46799999885433345777 No 165 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=97.38 E-value=8.1e-05 Score=43.01 Aligned_cols=280 Identities=11% Similarity=0.012 Sum_probs=113.7 Q ss_pred ccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhh Q lcl|Aclame:pro 76 GGKEVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTV 155 (394) Q Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~ 155 (394) ..+. .......++....--............-+.++..+-+ +... T Consensus 1 ~~~~----------------------------------~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~-~~~~ 45 (319) T protein:vir:94 1 MNKT----------------------------------IKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILER-VTAV 45 (319) T ss_pred CCcc----------------------------------cccccceeEeehhhhhccCCCcchHHHHHHHHHHHHH-HHHH Confidence 0000 0000000000000000011111122222333333322 2222 Q ss_pred hhhhh--eee--eEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHH-H Q lcl|Aclame:pro 156 VDLKP--FTT--VYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDL-V 230 (394) Q Consensus 156 ~~l~~--~~~--~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l-~ 230 (394) ..+.. .++ +....+.++++|.....+.....- .++-.....+.+....+++-.+.-.+. |..-=...+...+ . T Consensus 46 ~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY~R-~~g~~~g~vt~~~~t~tidqdR~~~F~-VD~~D~~Etn~~l~a 123 (319) T protein:vir:94 46 NAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKR-NATNEFDHPKIEETTYFLDQEKYWGRF-VDALDRKDTEGNIDI 123 (319) T ss_pred hhhhhhcccCcceEeccCcEEEEeeecccccccccC-CCCcccCCcccceeEEEeecccccccc-cchhhHhhhhchhhH Confidence 22111 112 333456677788765432221111 222222223345555666655543332 1110011112222 1 Q ss_pred HHHHHHH-HHHHHHHHHH----HHhhccccccc---cccccHHHHHHHHHhhhhhh--cccEEEEcHHHHHHHHhhhccC Q lcl|Aclame:pro 231 GIVSESI-SQIKVNTTND----AIAKVLKSFTT---KTVKNLDEIKALLNGGFDPA--YNVSLIVSQSFYQTLDTLKDGN 300 (394) Q Consensus 231 ~~i~~~l-~~~~~~~~~~----a~~~g~~~~~~---~~~~~~~~i~~~~~~~~~~~--~~a~~vm~~~~~~~l~~lkd~~ 300 (394) +.+..+. ...+.-..|. .+..+.++... +...-++.|.++...+-... .+-.++++|..+..|..-..-. T Consensus 124 ~~i~~~~~~~~v~PEiDay~~skla~~a~~~~~~~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~ 203 (319) T protein:vir:94 124 NYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIAL 203 (319) T ss_pred HHHHHHHHHHHhhhhhhHHHHHHHHhhcccccccccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhh Confidence 2222222 2222212221 11122121111 11122455555443332221 1346789999999885532111 Q ss_pred Cc-eeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEee--cccccceEEEEEEeccEEe Q lcl|Aclame:pro 301 GR-YLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWAD--NEIYGQYLQAVLRFGVSKV 377 (394) Q Consensus 301 G~-~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~--~~~~~~~~r~~~r~d~~v~ 377 (394) .+ -+.+....++..++|.|+||+.+++.....-.+++|..+ ++... .+=..++... +..|...++....+|..|+ T Consensus 204 ~~~~~~~~~~~~g~Vg~idG~~Vi~vps~~~k~in~i~~h~~-A~~~~-~k~~~~~~~~p~~~~~a~~v~gr~y~d~~V~ 281 (319) T protein:vir:94 204 PQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGE-VLASP-IQADLAKTNSNIPGMFGTLAEQLLYTGAFVP 281 (319) T ss_pred ccccccccceeeeeceeecCeEEEEecccccccceEEEEcCC-eeeee-eeeeeeeccCCCccccceeeeeeeeeeeEEe Confidence 11 012223455566799999999887766556667778754 33332 2222233322 2335556788888999999 Q ss_pred cccceEEEEecCccC---CC Q lcl|Aclame:pro 378 DDKAGYYVTFTPEPL---PL 394 (394) Q Consensus 378 ~~~af~~l~~~~~~~---~~ 394 (394) +|++.......+++- +. T Consensus 282 ~~k~~~Iy~~~~~~~~~~~~ 301 (319) T protein:vir:94 282 EHLQKYIFTIGGTEVATKRD 301 (319) T ss_pred ccccceEEEeecCCcccCCC Confidence 999765555444432 22 No 166 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=97.38 E-value=8.1e-05 Score=43.01 Aligned_cols=280 Identities=11% Similarity=0.012 Sum_probs=113.7 Q ss_pred ccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhh Q lcl|Aclame:pro 76 GGKEVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTV 155 (394) Q Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~ 155 (394) ..+. .......++....--............-+.++..+-+ +... T Consensus 1 ~~~~----------------------------------~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~-~~~~ 45 (319) T protein:vir:97 1 MNKT----------------------------------IKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILER-VTAV 45 (319) T ss_pred CCcc----------------------------------cccccceeEeehhhhhccCCCcchHHHHHHHHHHHHH-HHHH Confidence 0000 0000000000000000011111122222333333322 2222 Q ss_pred hhhhh--eee--eEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhccHHHH-H Q lcl|Aclame:pro 156 VDLKP--FTT--VYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDL-V 230 (394) Q Consensus 156 ~~l~~--~~~--~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l-~ 230 (394) ..+.. .++ +....+.++++|.....+.....- .++-.....+.+....+++-.+.-.+. |..-=...+...+ . T Consensus 46 ~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~DY~R-~~g~~~g~vt~~~~t~tidqdR~~~F~-VD~~D~~Etn~~l~a 123 (319) T protein:vir:97 46 NAYSTPALISNDAIFMEGRSFTVMKGDTTELKDYKR-NATNEFDHPKIEETTYFLDQEKYWGRF-VDALDRKDTEGNIDI 123 (319) T ss_pred hhhhhhcccCcceEeccCcEEEEeeecccccccccC-CCCcccCCcccceeEEEeecccccccc-cchhhHhhhhchhhH Confidence 22111 112 333456677788765432221111 222222223345555666655543332 1110011112222 1 Q ss_pred HHHHHHH-HHHHHHHHHH----HHhhccccccc---cccccHHHHHHHHHhhhhhh--cccEEEEcHHHHHHHHhhhccC Q lcl|Aclame:pro 231 GIVSESI-SQIKVNTTND----AIAKVLKSFTT---KTVKNLDEIKALLNGGFDPA--YNVSLIVSQSFYQTLDTLKDGN 300 (394) Q Consensus 231 ~~i~~~l-~~~~~~~~~~----a~~~g~~~~~~---~~~~~~~~i~~~~~~~~~~~--~~a~~vm~~~~~~~l~~lkd~~ 300 (394) +.+..+. ...+.-..|. .+..+.++... +...-++.|.++...+-... .+-.++++|..+..|..-..-. T Consensus 124 ~~i~~~~~~~~v~PEiDay~~skla~~a~~~~~~~~t~~n~y~~i~~a~~~Lde~~VP~~Rvl~Vtp~~~~~L~~~~~f~ 203 (319) T protein:vir:97 124 NYVVARQGAEVVAPYLDNLRFATLARNKAKHLTVGTGSDAQYDAVLDVSVELDEIKAPENRVLFVSPTFYKGIKKFVIAL 203 (319) T ss_pred HHHHHHHHHHHhhhhhhHHHHHHHHhhcccccccccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHHHhhhhhh Confidence 2222222 2222212221 11122121111 11122455555443332221 1346789999999885532111 Q ss_pred Cc-eeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEee--cccccceEEEEEEeccEEe Q lcl|Aclame:pro 301 GR-YLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWAD--NEIYGQYLQAVLRFGVSKV 377 (394) Q Consensus 301 G~-~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~--~~~~~~~~r~~~r~d~~v~ 377 (394) .+ -+.+....++..++|.|+||+.+++.....-.+++|..+ ++... .+=..++... +..|...++....+|..|+ T Consensus 204 ~~~~~~~~~~~~g~Vg~idG~~Vi~vps~~~k~in~i~~h~~-A~~~~-~k~~~~~~~~p~~~~~a~~v~gr~y~d~~V~ 281 (319) T protein:vir:97 204 PQGDTRQQVLGKGVQGELDGFVIVKVPTKLLQGLQAIAVVGE-VLASP-IQADLAKTNSNIPGMFGTLAEQLLYTGAFVP 281 (319) T ss_pred ccccccccceeeeeceeecCeEEEEecccccccceEEEEcCC-eeeee-eeeeeeeccCCCccccceeeeeeeeeeeEEe Confidence 11 012223455566799999999887766556667778754 33332 2222233322 2335556788888999999 Q ss_pred cccceEEEEecCccC---CC Q lcl|Aclame:pro 378 DDKAGYYVTFTPEPL---PL 394 (394) Q Consensus 378 ~~~af~~l~~~~~~~---~~ 394 (394) +|++.......+++- +. T Consensus 282 ~~k~~~Iy~~~~~~~~~~~~ 301 (319) T protein:vir:97 282 EHLQKYIFTIGGTEVATKRD 301 (319) T ss_pred ccccceEEEeecCCcccCCC Confidence 999765555444432 22 No 167 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=97.34 E-value=2.5e-05 Score=45.77 Aligned_cols=260 Identities=11% Similarity=0.025 Sum_probs=128.4 Q ss_pred hhhhhhhhhhccccc-CCccccchhHHhHHHHHHHhhhhhhheeeeEeecCCceeE-EEE-ecCCCcccccccccccccc Q lcl|Aclame:pro 120 ETTPVEPQKDGIKKE-NAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKY-PVL-QRATTKMVTVAELEKNPAL 196 (394) Q Consensus 120 ~~~~~~~~~~~~~~~-~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~-~~~-~~~~~~~~~~~e~~~~~~~ 196 (394) ........-.+++.+ .-+...--++.+.+-..+.+...+++..+..|+..++ .+ .+. ....+.+..++||...| . T Consensus 1 ~~~~~~~~e~nlt~~~dl~~~~siDf~~~f~~~i~~L~~~LGv~r~~pla~Gs-tIkt~k~~~y~gda~dVaEGe~Ip-l 78 (296) T protein:vir:98 1 MVTSRTYPEENLIKSTDLKYPITIDVTNKFQENISKLLEMLGVTRKISVSEGM-TLKTYAGYDVTLAEGNVPEGEVIP-L 78 (296) T ss_pred CCCccccCcCCCcchhhhhhhhhhhhHHHHhhhHHHHHHHhhhcccccccCCC-EEeeccceeeeeccccccCCcccc-h Confidence 000000000111111 1112223345555555555555566666888887764 23 221 23445556667777666 5 Q ss_pred cccccc---eeeecHhhhhhhhhhhHHHHhccHH-HHHHHHHHHHHHHHHHHHHHHHhhccccccccccccHHHHHHHHH Q lcl|Aclame:pro 197 AKPDFK---DVAWNIDTYRGAIPLSQESIDDADV-DLVGIVSESISQIKVNTTNDAIAKVLKSFTTKTVKNLDEIKALLN 272 (394) Q Consensus 197 ~~~~~~---~v~~~~~~~~~~~~vs~ell~ds~~-~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~~~~~~~i~~~~~ 272 (394) +..+-+ ..+++.+|++.-+ |.|.++.|.. +-...-.+.|...+.+..+..++....+++.+.-.+.+.+.+++. T Consensus 79 skvt~~~~~t~t~~ikK~rK~t--TdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~LktaT~t~~~t~~~lQ~Ala 156 (296) T protein:vir:98 79 SKVERKIHSEKKIELKKYRKAT--TGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTGTGTQDALGAGLQGALA 156 (296) T ss_pred hhheeeecceEEEEeecccccc--CHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhcccceeeechhhHHHHHH Confidence 666654 3777778887775 9999865543 457788888999999999998888886665444345555555442 Q ss_pred h----hhh---h--hcccEEEEcHHHHHHHHhhhccCCceeecccccCCCcc-cccccceEEecCcccccCceEEE---e Q lcl|Aclame:pro 273 G----GFD---P--AYNVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGK-VLLGKPVFVLSDEVLGANKAFIG---D 339 (394) Q Consensus 273 ~----~~~---~--~~~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~-~l~G~pV~~~~~~~~~~~~~~~g---d 339 (394) . +.+ . ..+.+.++||.+.+.++. +++ +-.....+...- .++|.-|+.+.. .+.+.+|.- | T Consensus 157 ~~~~~l~~~feded~~~~V~FVnP~D~a~ylg--~a~---it~qt~fG~tyl~nfLG~~II~S~k--V~~G~~~~T~~~N 229 (296) T protein:vir:98 157 SAWGKLQVLFEDYGSERAIVFANSLDVAEYIA--KAG---ITTQTAFGLTYLVDFTGTVIISTND--VTKGEIWATVPEN 229 (296) T ss_pred HHhhhhhhhccccCCCceEEEEehHHHHHHhc--CCc---cchhheechhhhhhccccEEEEcCc--CCCceEEEeeecc Confidence 1 111 1 124689999999877542 221 101111111111 277875555433 333333211 1 Q ss_pred ccccEEEEeecceEEEEeecccccceEEEEE-------------Eecc---EEecccceEEEEecCcc Q lcl|Aclame:pro 340 FKRGVLFADRKDLGLRWADNEIYGQYLQAVL-------------RFGV---SKVDDKAGYYVTFTPEP 391 (394) Q Consensus 340 ~~~~~~~~~~~~~~i~~~~~~~~~~~~r~~~-------------r~d~---~v~~~~af~~l~~~~~~ 391 (394) ..-+|+-....++.-.+. .....+++.+.. -+.+ =+-+++++++.+++++. T Consensus 230 i~~ay~~~~~~~l~~~f~-~~~d~tglIGv~h~~~~~~~t~eT~~~~~~~lfpE~~dgiv~~tI~~~~ 296 (296) T protein:vir:98 230 IIFAYINPNNSELAKEFN-LYGDPTGYIGMNHFQENTTLTIQTLLVSGMLMYPERIDGIVKVTLTPGV 296 (296) T ss_pred eEEEeecccccchhhhhc-cccccccceEEEeccccceeeehhHhHhHHHhcccccceEEEEEecCCC Confidence 111111110011111110 000111211110 1122 23456889999998888 No 168 >protein:vir:96792 Length: 315 # NCBI annotation: major capsid protein # Family: family:all:47 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224246;genbank:gi:62362381;genbank:GeneID:3345731 Probab=97.33 E-value=9.3e-05 Score=42.69 Aligned_cols=256 Identities=9% Similarity=0.004 Sum_probs=89.5 Q ss_pred hhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEee-------cCCceeEEEEecCCCccc--cccccccccccc Q lcl|Aclame:pro 127 QKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQA-------KKASGKYPVLQRATTKMV--TVAELEKNPALA 197 (394) Q Consensus 127 ~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~-------~~~~~~~~~~~~~~~~~~--~~~e~~~~~~~~ 197 (394) .+.+..+. -.+--+.+....++.+.+...+++-+.-..+ .+.-...+.+. .++... -+...+...+.. T Consensus 1 ~~~t~~sd--l~vfn~~~~~a~~e~~~~~~~~Fnaas~Gai~l~~~~~~GDf~~~~ff~-i~~~~~~rnv~~~~~~t~~k 77 (315) T protein:vir:96 1 MATTVNSD--LVIYNDTAQTAYLERNMDNLAVFNENSRAAIGLNSELIEGDLKLRSFYK-VGGAIADRDVNSTATVAGTK 77 (315) T ss_pred Cceeeecc--eeeehhhhhhhHHhhhHHHHHHhhhhcCCcccccccccccccccccccc-cccchhhcccCCCcccccee Confidence 11111111 0111122233334444443333332111100 00000111111 010000 001111111100 Q ss_pred ccccceeeecHhhhhhhhhhhHHHHh---ccHHHHHHHHHHHHHHHHHHHHHH-------HHhhcccc---ccccccccH Q lcl|Aclame:pro 198 KPDFKDVAWNIDTYRGAIPLSQESID---DADVDLVGIVSESISQIKVNTTND-------AIAKVLKS---FTTKTVKNL 264 (394) Q Consensus 198 ~~~~~~v~~~~~~~~~~~~vs~ell~---ds~~~l~~~i~~~l~~~~~~~~~~-------a~~~g~~~---~~~~~~~~~ 264 (394) -.+...+..+..--.+-+..+...+. +.+.....-|...+..+.....-. +.+.+.+. ....+..+. T Consensus 78 it~~~dvaVk~~~~~~~~~~~~~~~a~~g~dp~~~~~~i~~~~~~~~l~~~l~~~l~~~~aai~~~t~~~~~~~~a~~~~ 157 (315) T protein:vir:96 78 IAADEMVSVKVPWKYGPYETTEEAFKRRARSPEEFSMLIGQDMADATMAGWIGYALNALQGAIGSNAGMNVSGELATEGK 157 (315) T ss_pred cccccceeEEEeecCCchhccHHHHHHhhcCHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcccccccccccccccCH Confidence 01122222221111121222333222 222222222333332222221111 11111111 112234456 Q ss_pred HHHHHHHHhhhhhhcc-cEEEEcHHHHHHHHhhhccCCceeeccc---ccCCCcccccccceEEecCcccccCceEEEec Q lcl|Aclame:pro 265 DEIKALLNGGFDPAYN-VSLIVSQSFYQTLDTLKDGNGRYLLQDD---ITAVSGKVLLGKPVFVLSDEVLGANKAFIGDF 340 (394) Q Consensus 265 ~~i~~~~~~~~~~~~~-a~~vm~~~~~~~l~~lkd~~G~~l~~~~---~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~ 340 (394) ..+.++..++-+..-+ +.|+||..++..|.+ +.= -..++..+ .....+. .+|+||++.|.++.. .+|| | T Consensus 158 ~~l~dA~~klGD~~~~l~~~vMHS~v~~~L~~-q~L-~~~~~~~~~~~~~~~~~~-~lGkrViVdD~~P~~---~~~g-l 230 (315) T protein:vir:96 158 KVLTKGLRTMGDKASSIAIWVMDSTSYFDIVD-EAI-DNKLYEEAGVVVYGGTPG-TLGKPVLVTDQCPAT---KIFG-L 230 (315) T ss_pred HHHHHHHHHhcccccCeeEEEEchHHHHHHHH-hhh-hhhcccccceeEecCcCc-ccccEEEEECCCCcc---eeee-e Confidence 6677766665444433 689999999999876 311 11222110 1122233 459999998866532 1222 2 Q ss_pred cccEEEEe-ecceEEEEeecccccceEEEEEEecc-EEecccceEEEEecCccCCC Q lcl|Aclame:pro 341 KRGVLFAD-RKDLGLRWADNEIYGQYLQAVLRFGV-SKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 341 ~~~~~~~~-~~~~~i~~~~~~~~~~~~r~~~r~d~-~v~~~~af~~l~~~~~~~~~ 394 (394) ..+.+.+. ..+. .....+.....++....|..+ -+++|..|..-+ ....+|- T Consensus 231 ~~GAi~~~~~~~~-~~~~~~~~g~e~l~~~~r~e~tf~l~p~G~sw~~-~~~~sPt 284 (315) T protein:vir:96 231 VAGAVMITESQAP-GMRSYQIDDQENLAIGFRAEGTANVEVLGYKWKT-KTNVNPA 284 (315) T ss_pred ecceeeecCCCcc-ccccccCCCcceeEEEEeeeeEeeeeeeeEEeec-CCCcCCC Confidence 22322221 1111 011112223345555555544 467788777743 2334565 No 169 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=97.33 E-value=3.8e-05 Score=44.84 Aligned_cols=254 Identities=11% Similarity=0.006 Sum_probs=126.6 Q ss_pred cccccCCccccc--hhHHhHHHHHHHhhhhhhheeeeEe---ecCCceeEEEEecCCCccccccccccccccccccccee Q lcl|Aclame:pro 130 GIKKENAKPVSS--EEILYTPAREVKTVVDLKPFTTVYQ---AKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDV 204 (394) Q Consensus 130 ~~~~~~~~~lvP--~~~~~~I~~~~~~~~~l~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v 204 (394) .++.+.+..++- +.+.+.+++.+......+.++.+.. .....+.+.... ..+.+.+...++...+..+..++.. T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~-~~G~~~~~~~~~~dip~~~~~~~~~ 79 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMT-RSGAAKIIANGADDLPLVDVDMVRK 79 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeec-cceeEEEecCcccccccccccceeE Confidence 333333332221 3345556666666666666554432 222233444332 2344445555555444555667777 Q ss_pred eecHhhhhhhhhhhHHHHhcc---HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc--------------------- Q lcl|Aclame:pro 205 AWNIDTYRGAIPLSQESIDDA---DVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTKT--------------------- 260 (394) Q Consensus 205 ~~~~~~~~~~~~vs~ell~ds---~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~--------------------- 260 (394) ......++.-+.++..=++.+ ..++..--....+.++...+|..++.|....+..| T Consensus 80 ~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g~~GLlN~p~~~~~~~~~~~~~~~~ 159 (301) T protein:vir:80 80 SVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYAIKGAFEATGIQIDVSPTTGVGNVS 159 (301) T ss_pred EEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccccceeeecCCCcccccccCccccccc Confidence 778888887777776433322 33577777777888888888888777754321100 Q ss_pred ---ccc----HHHHHHHHHhhhhh--hc--ccEEEEcHHHHHHHHhhh--ccCCceeeccccc-CCCcccccccceEEec Q lcl|Aclame:pro 261 ---VKN----LDEIKALLNGGFDP--AY--NVSLIVSQSFYQTLDTLK--DGNGRYLLQDDIT-AVSGKVLLGKPVFVLS 326 (394) Q Consensus 261 ---~~~----~~~i~~~~~~~~~~--~~--~a~~vm~~~~~~~l~~lk--d~~G~~l~~~~~~-~~~~~~l~G~pV~~~~ 326 (394) ..+ +++|..++.++... .. .-.++|+|+.+..|..-. +..|.-++.- +. +....+|.+.|-.... T Consensus 160 ~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~~~tvl~~-l~~~~~~~~I~~~p~L~~~ 238 (301) T protein:vir:80 160 KWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINKKRYSNEDSRSVLKV-LQDNAWFSAIVRVPDLAGM 238 (301) T ss_pred ccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhhccccCCCCeeHHHH-HHHHcCcceEEEcceeccC Confidence 112 34444455443322 11 246899999999997543 4445433321 11 1112244444432211 Q ss_pred CcccccCceEEEec-cccEEEEeecceEEEEeecccccceE--EEEEEe-ccEEecccceEEEEec Q lcl|Aclame:pro 327 DEVLGANKAFIGDF-KRGVLFADRKDLGLRWADNEIYGQYL--QAVLRF-GVSKVDDKAGYYVTFT 388 (394) Q Consensus 327 ~~~~~~~~~~~gd~-~~~~~~~~~~~~~i~~~~~~~~~~~~--r~~~r~-d~~v~~~~af~~l~~~ 388 (394) . ..+.+.+++-.- ...+-+.--++++ +.........+ ..+.|+ |..+.+|.||++++.- T Consensus 239 g-~~g~~~~v~~~~~~d~~~~~v~~~~~--~~~~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 239 G-TAGSDSFAVIHDSNETAELIIPMDIT--RHPEEYSFPRTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred C-CCcccEEEEEecCCcEEEEEecCcee--eecceecCceeEeeeeeeeEEEEEEccceEEEEecC Confidence 1 112222222211 1111111112222 21111111222 346777 4588999999999998 No 170 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=97.19 E-value=0.00014 Score=41.78 Aligned_cols=290 Identities=10% Similarity=0.017 Sum_probs=114.1 Q ss_pred hccccccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHH Q lcl|Aclame:pro 69 VGGAENIGGKEVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTP 148 (394) Q Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I 148 (394) ..+. |....+.... +.......++....--..-....+....-+.+...+ T Consensus 1 ~~~~------------------~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~L 50 (329) T protein:vir:10 1 MDGI------------------FITGVKTMNK------------EIKNATGKLKLNLQHFANKSVEPGDTLLKNKHVGIL 50 (329) T ss_pred CCce------------------EEechhhhhh------------hhhcccceeEEehhhhcCCccCCchhHHHHHHHHHH Confidence 0000 0000000000 000000000000000000011111112222333333 Q ss_pred HHHHHhhhhhh-heee--eEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhcc Q lcl|Aclame:pro 149 AREVKTVVDLK-PFTT--VYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDA 225 (394) Q Consensus 149 ~~~~~~~~~l~-~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds 225 (394) -+.....+--. .+++ +....+.++++|.....+.. .+-..++-.....+.++...+++-.+.-.+. |..-=...+ T Consensus 51 D~~~~~~~~s~~~~~N~~~e~~~g~tVkIp~i~~~gl~-DY~R~~g~~~g~vt~~~~t~tidqdR~~~F~-VD~~D~dEt 128 (329) T protein:vir:10 51 EKVTAANSYSAPAVISNDAIFMQGRSFTVIKGDVTELK-DYKRNATNEFDHPQIQETTYFLDQEKYWGRF-VDALDRRDT 128 (329) T ss_pred HHHHHhhceeeeeecccceeeccCcEEEEeeecccccc-cccCCCCccccccccceeEEEeecccceeee-cchhhHhhh Confidence 33222211100 0111 33455667888876543222 2211222222223345566666665544332 111001111 Q ss_pred HHHH-HHHHHHH-HHHHHHHHHHHH----Hhhcccccccccccc----HHHHHHHHHhhhhhhc--ccEEEEcHHHHHHH Q lcl|Aclame:pro 226 DVDL-VGIVSES-ISQIKVNTTNDA----IAKVLKSFTTKTVKN----LDEIKALLNGGFDPAY--NVSLIVSQSFYQTL 293 (394) Q Consensus 226 ~~~l-~~~i~~~-l~~~~~~~~~~a----~~~g~~~~~~~~~~~----~~~i~~~~~~~~~~~~--~a~~vm~~~~~~~l 293 (394) ...+ .+.+..+ ....+.-..|.- +..+.++. ..+..+ ++.|.++...+-.... +-.++++|..+..| T Consensus 129 n~~l~a~~i~~~~~~~~v~pEiDay~~skla~~a~~~-~~~~~t~~nay~~i~~a~~~Lde~~vp~~Rvl~VtP~~~~~L 207 (329) T protein:vir:10 129 EGNIDINYVVAKQASEVVAPYLDNLRFATLARNKAKH-LTVGSGADAQYDAVLDVSVELDEIGAGASRILFVTPKFYKGI 207 (329) T ss_pred hhhhhHHHHHHHHHHHHhhhHHHHHHHHHHHhhcccc-cccccCHHHHHHHHHHHHHHHHhcCCCCCcEEEeCHHHHHHH Confidence 1112 1122222 222222222221 11111211 111222 4445544433322211 33677999999988 Q ss_pred Hhhhc--cCCceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEee--cccccceEEEE Q lcl|Aclame:pro 294 DTLKD--GNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWAD--NEIYGQYLQAV 369 (394) Q Consensus 294 ~~lkd--~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~--~~~~~~~~r~~ 369 (394) .+-.. .++. .......++..++|.|+||+.+++.....-.+++|..+. +.... +=..++... +..|...++.. T Consensus 208 k~~~~f~~~~~-~~~~~~~~g~Vg~idG~~Ii~vps~~~k~in~ii~~~~A-~~~~~-K~~~~~~~~p~~~~~a~~v~gr 284 (329) T protein:vir:10 208 KKFVIELPQGD-NRQQVLGKGVQGELDGFTIVKVPSKMLQGVEAMAVIGEV-MASPI-QANEAKLNSNVPGMFGTLAEQM 284 (329) T ss_pred Hhhhhhhcccc-ccccceeeeeeeeecCeEEEEecCCcccceeEEEEcCCc-eeeee-eeeeeeeeCCCCccchheeeee Confidence 65211 0111 111223455567899999998876655555667776543 33322 222333322 23355567888 Q ss_pred EEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 370 LRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 370 ~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) ..+|..|++|++.........+.|. T Consensus 285 ~yyd~~V~~~k~~~I~~~~~~a~~~ 309 (329) T protein:vir:10 285 LYTGAFVPEHLQKYIFTIGGKEVET 309 (329) T ss_pred eeeeeEEEccccCEEEEecccCccc Confidence 8899999999976666555555444 No 171 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=97.07 E-value=6.4e-05 Score=43.56 Aligned_cols=280 Identities=9% Similarity=-0.053 Sum_probs=125.1 Q ss_pred HHHHHHHHHHhhhhhhhhhhcccccCC-ccccc--hhHHhHHHHHHHhhhhhhheeeeEee---cCCceeEEEEecCCCc Q lcl|Aclame:pro 110 GKDEVLMPINETTPVEPQKDGIKKENA-KPVSS--EEILYTPAREVKTVVDLKPFTTVYQA---KKASGKYPVLQRATTK 183 (394) Q Consensus 110 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~lvP--~~~~~~I~~~~~~~~~l~~~~~~~~~---~~~~~~~~~~~~~~~~ 183 (394) ...+......................+ ..++. +.+.+.|++.....-.-+.++.+.+. ...++.+.... ..+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~~~~et~~~~~~e-~~G~ 79 (314) T protein:vir:10 1 MAIKFDAEQAKITTHLEQMGVEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIPGHAKYFEYPEFD-GVGI 79 (314) T ss_pred CccchHHHHHHHHHHHHhhcccchhhhHHHHHHHHHHHHHHHhhhhccccccceeeccccCCCCceeEEEeeeec-cccc Confidence 000000000011111111111122222 22222 23444555544443333444333221 11134444443 3355 Q ss_pred ccccccccccccccccccceeeecHhhhhhhhhhhHHHHhcc---HHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc- Q lcl|Aclame:pro 184 MVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDA---DVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTK- 259 (394) Q Consensus 184 ~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds---~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~- 259 (394) +.++...+...+..+..++......+.++..+.++..=++-+ ..+|..--....+..+...+|..++.|....+.. T Consensus 80 a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~~G 159 (314) T protein:vir:10 80 AQIIADYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLVWSGSAPHGIVS 159 (314) T ss_pred eeeeCCcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeeccccccee Confidence 556666655454556678888888888888888875433322 2356676777777788888888777775322111 Q ss_pred --------------cccc----HHHHHHHHHhhhhh--hc--ccEEEEcHHHHHHHHhhhccCCceeecccccCCCcccc Q lcl|Aclame:pro 260 --------------TVKN----LDEIKALLNGGFDP--AY--NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVL 317 (394) Q Consensus 260 --------------~~~~----~~~i~~~~~~~~~~--~~--~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l 317 (394) .-++ ++||..++..+... .. .-.++|+|+.+..|....+..|.-++.-=-.++.+-+| T Consensus 160 LlN~p~v~~~~~~~~WaT~~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~~L~~~~~~~~~tvl~~l~~n~~~l~I 239 (314) T protein:vir:10 160 VFDQPNINNVVATPNWSVPQNAIDDVTAMIDAVESSTQGLHHVTDILLPASARRVMQGLVPQTNLSYGELFTRNNPGLTI 239 (314) T ss_pred EeecCCCccccCCCCcccHHHHHHHHHHHHHHHHHhcCccccceeEEecHHHHHhhcccccCCCccHHHHHHHhCCCcEE Confidence 1122 34444444443332 11 13689999998877654444444332110011122244 Q ss_pred cccceEEecCcccccCceEEEeccccEEEE-eecceEEEEeecccccceEEEEEEe-ccEEecccceEEEEecCcc Q lcl|Aclame:pro 318 LGKPVFVLSDEVLGANKAFIGDFKRGVLFA-DRKDLGLRWADNEIYGQYLQAVLRF-GVSKVDDKAGYYVTFTPEP 391 (394) Q Consensus 318 ~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~-~~~~~~i~~~~~~~~~~~~r~~~r~-d~~v~~~~af~~l~~~~~~ 391 (394) .+.|-..... ..+.+..++-+-+.-++-+ --..++....+..........+.|+ |..+.+|.||++++.-+=+ T Consensus 240 ~~~~el~~ag-~~g~~~~v~y~~~~~~~~~~vp~~~~~l~~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~dGI~~~ 314 (314) T protein:vir:10 240 RFLQFLDNYD-GAGGKAALAFEKSPLNMSIEIPEVTNVLPAQPKDLHFRYPVTSKATGLIVYRPLTMAVIKGITFA 314 (314) T ss_pred EEcccccccC-CCcceEEEEEecCCcEEEEecCccceeecceecCceEEEcceeeeEEEEEECcceeEeeeeeecC Confidence 4444322111 1111222222211112111 1122221111111111123456777 4688899999988777766 No 172 >protein:vir:93966 Length: 400 # NCBI annotation: structural protein # Family: family:all:2417 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764320;genbank:gi:115315634;genbank:GeneID:5176553 Probab=97.01 E-value=0.00021 Score=40.72 Aligned_cols=354 Identities=14% Similarity=0.115 Sum_probs=157.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALE---SDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGG 77 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~---~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~ 77 (394) |-+-.|.|-+.++.++++..-.++..+....- -++....++++.-+.++.-+|.+.+..+...+.. T Consensus 8 ~~K~~l~EK~~~~a~~~E~~~~LKS~~~G~evknaiedl~K~~EL~~TlS~~~iEI~~~en~LNa~~E~----------- 76 (400) T protein:vir:93 8 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEEK----------- 76 (400) T ss_pred cccchHHHHHHHHhhhhhhhhhhhhhhhcchhhhhhhhchhHHHHHHhHhhcchhhhhhhhhhhhhhhh----------- Confidence 66666777777777777777666666654311 1334455556666666665655554444311111 Q ss_pred ccccchhhhHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHh Q lcl|Aclame:pro 78 KEVTQEEKTYRESVNDFIRSKGKIVNDSLRF---EGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKT 154 (394) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~ 154 (394) ......+..++.+........... .+.++.+..+... .. -.|++-.......|..+...|-..+-. T Consensus 77 -------~KGK~kMt~~i~sq~A~~eF~~vL~~N~G~S~~k~AW~A~---L~-E~GVtiTD~~~~LP~~lv~sI~~A~~n 145 (400) T protein:vir:93 77 -------PKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWSAK---LA-ENGVTITDTTFQLPRKLVESINTALLN 145 (400) T ss_pred -------hhhhHHHHHHHhhHHHHHHHHHHHhccCCchhhhhhhhhh---Hh-hcCcceeccchhccHHHHHHHHHhhhc Confidence 011111223333333322221111 1122222222111 11 124444455568899999999999999 Q ss_pred hhhhhheeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHh---ccHHHHHH Q lcl|Aclame:pro 155 VVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESID---DADVDLVG 231 (394) Q Consensus 155 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~---ds~~~l~~ 231 (394) +.++++...|-.++.--++.. -.+...+-....|....+ ...+|.--++.+-.++....+- ++.. .+-..|.. T Consensus 146 ~n~v~~vfHVT~~~~~~V~~s--~~s~~~Aq~HkdGqTK~e-qa~~~~~~Tl~~~~VY~~~S~A-e~~K~~~~sYsel~N 221 (400) T protein:vir:93 146 TNPVFKVFHVTNVGALLVSRS--FDSANEAQVHKDGQTKTE-QAATLTIDTLEPVMVYKLQSLA-ERVKRLQMSYSELYN 221 (400) T ss_pred cCcceeeeeeccchhhhHHhh--hhhhhhhhhhccCCcccc-ceeeeeeechhHHHHHHHHHHH-HHHHHhhhhHHHHHH Confidence 999988665544433222211 123333333444444444 3456776777666555444442 3333 34445789 Q ss_pred HHHHHHHHHHH-HHHHHHHhhcccccccccccc----------------------HHHHHHHHHhhhhhhcccEEEEcHH Q lcl|Aclame:pro 232 IVSESISQIKV-NTTNDAIAKVLKSFTTKTVKN----------------------LDEIKALLNGGFDPAYNVSLIVSQS 288 (394) Q Consensus 232 ~i~~~l~~~~~-~~~~~a~~~g~~~~~~~~~~~----------------------~~~i~~~~~~~~~~~~~a~~vm~~~ 288 (394) ||..+|+.+|. +..+.+..-|+|+++....-. .|.|..++.-..+.+..-.+++... T Consensus 222 ~i~~ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~~~Ttkaksagktpfadaieeavdfvrptagrrylivkte 301 (400) T protein:vir:93 222 LIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKTE 301 (400) T ss_pred HHHHHHHHHHHHHHHHhhhheecCCCCccchhhHHHHHHHHHHhhhhhhcCCCchhHHHHHHHhhhccCCCceEEEEecc Confidence 99999999988 678889988988875332211 1222222211111111122333333 Q ss_pred H-HHHHHhhhccCCc---eeecccccCCCcccccccce-EE-ecCcccccCceEEEeccccEEEEeecceE-EEEeeccc Q lcl|Aclame:pro 289 F-YQTLDTLKDGNGR---YLLQDDITAVSGKVLLGKPV-FV-LSDEVLGANKAFIGDFKRGVLFADRKDLG-LRWADNEI 361 (394) Q Consensus 289 ~-~~~l~~lkd~~G~---~l~~~~~~~~~~~~l~G~pV-~~-~~~~~~~~~~~~~gd~~~~~~~~~~~~~~-i~~~~~~~ 361 (394) + .+-|..|+.+..+ .|-..+. .-.+--|+.- ++ +.+.+. .+-++.|-+. .+ +-++++ ++.....+ T Consensus 302 drkalldelrqatanahvrikndda---eiasevgvdeiivytgskal--kptvlvdqky--hi-dmqdltkvdafewkt 373 (400) T protein:vir:93 302 DRKALLDELRQATANAHVRIKNDDA---EIASEVGVDEIIVYTGSKAL--KPTVLVDQKY--HI-DMQDLTKVDAFEWKT 373 (400) T ss_pred chHHHHHHHHhhccccceEeecchh---hhhhhcCcceeeeeeccccc--cceeeecccc--cc-chhhhhhhhhheecc Confidence 3 3334444433222 1111111 0111122211 11 111221 2234444332 12 233332 11111111 Q ss_pred ccceEEEEEEeccEEecccceEEEEec Q lcl|Aclame:pro 362 YGQYLQAVLRFGVSKVDDKAGYYVTFT 388 (394) Q Consensus 362 ~~~~~r~~~r~d~~v~~~~af~~l~~~ 388 (394) +..-+.+.---.|-|---+|-+.++++ T Consensus 374 nsnmilvetltsghvetynagavitvs 400 (400) T protein:vir:93 374 NSNMILVETLTSGHVETYNAGAVITVS 400 (400) T ss_pred CCceEEEeecccCcceeeccceeEeeC Confidence 111122222223333333444445555 No 173 >protein:vir:95451 Length: 313 # NCBI annotation: hypothetical protein ORF044 # Family: family:all:11728 # MgeID: mge:1570 # MgeName: PA11 # Cross-refs: genbank:acc:YP_001294637;genbank:gi:149408203;genbank:GeneID:5237018 Probab=96.80 E-value=0.00019 Score=40.96 Aligned_cols=262 Identities=13% Similarity=0.095 Sum_probs=131.7 Q ss_pred hcccccCCccccchhHHhHHHHHHHhhhhhhheee-eEeecCCceeEEEEecCCCcccccccccccccccccccceeeec Q lcl|Aclame:pro 129 DGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTT-VYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWN 207 (394) Q Consensus 129 ~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~ 207 (394) ..+++.....++.+.+++.|...+.+..-=..+.+ |...+++ -++.+.+-.+...-.-.|.+... -....-++|++. T Consensus 1 ~~~TSNT~A~I~SE~~s~~I~~~LH~~LL~~~~~R~V~DF~~G-~~L~I~tiGs~~~~~~~E~~~~~-~~~i~TGEIt~~ 78 (313) T protein:vir:95 1 MQLTSNTRAFIESEQYSKFILLNLHDGLLPETFYRNVSDFGSG-ETLHIKTIGSVTLQEAEEDTPLI-YNPIETGEITFQ 78 (313) T ss_pred CcccccchheehhhhHHHHHHHHhhccccchhhhhhhccCCCC-CEEEecccCceeeeccccCCCee-ecccccceEEEE Confidence 33444444456677778888777666432223333 4555444 23333322222222222222221 223344567777 Q ss_pred Hhhhhhh-hhhhHHHHhccHH--HHHHHHHHHHHHHHHHHHHHHHhhcc-----ccc---------------cccccccH Q lcl|Aclame:pro 208 IDTYRGA-IPLSQESIDDADV--DLVGIVSESISQIKVNTTNDAIAKVL-----KSF---------------TTKTVKNL 264 (394) Q Consensus 208 ~~~~~~~-~~vs~ell~ds~~--~l~~~i~~~l~~~~~~~~~~a~~~g~-----~~~---------------~~~~~~~~ 264 (394) ...+++- -.||+.|-+|+-. ++.+.+..+-++++..+...-++... +.. ++.++-.. T Consensus 79 i~~Y~G~A~~vt~~LR~D~~~I~~~~A~~~AE~~RAI~E~~~TD~L~~G~~~FA~~~~P~~vNG~PH~~V~~~T~~~~~~ 158 (313) T protein:vir:95 79 ITEYKGDAWYVTDDLREDGTDIDRLMAERAAESTRAIQETFETDFLKTGAEYFAANPGPHNVNGFPHVIVSAETNGVFAL 158 (313) T ss_pred EEeecCChhhhhhhhhhcchhHHHHhhhcchhhHHHHHHHHhhHHHhhchhhhccCCCCcccccccceEEeccCCceehh Confidence 6666543 3699999999842 23333333334444444333322211 111 11111222 Q ss_pred HHHHHH---HHhhhhhhcccEEEEcHHHHHHHHhhhc------cCCceeecccccCCC--cccccccceEEec------- Q lcl|Aclame:pro 265 DEIKAL---LNGGFDPAYNVSLIVSQSFYQTLDTLKD------GNGRYLLQDDITAVS--GKVLLGKPVFVLS------- 326 (394) Q Consensus 265 ~~i~~~---~~~~~~~~~~a~~vm~~~~~~~l~~lkd------~~G~~l~~~~~~~~~--~~~l~G~pV~~~~------- 326 (394) .++..+ +.+..-|+..-++++.|+....|..+.+ .+|+.|...++..+. ...+.|..+.++. T Consensus 159 ~~~~~~~~~~~~a~~P~~G~v~IvDP~~~~~L~~l~~It~~vt~~~k~I~ESG~A~~~~Fi~~~YG~Di~~SN~L~~AN~ 238 (313) T protein:vir:95 159 KHLIAMRLAFDKANVPAEGRVFIVDPVAEATLNGLVTITHDVTDFGKMILESGMARGQRFIMNLYGWDILTSNRLHVANY 238 (313) T ss_pred hHHHHhhhhhhhccCCccceEEEEcchhhhhhhhhheeecccccccceeeeccCCchhHHHHHHhhhhhhhhhhhhhccc Confidence 333332 2333344555689999999888887652 367777766654443 2467787765532 Q ss_pred CcccccCceEEEeccccEEEEeecceEEEEeeccc---c------cceEEEEEEeccEEecccceEEEEecCccC Q lcl|Aclame:pro 327 DEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEI---Y------GQYLQAVLRFGVSKVDDKAGYYVTFTPEPL 392 (394) Q Consensus 327 ~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~---~------~~~~r~~~r~d~~v~~~~af~~l~~~~~~~ 392 (394) +.+...+..++|++-..+.-.--..+.+.|-+-+. + ...-.+..|+|..+.+.+..+.+--.+++- T Consensus 239 ~D~~tT~~G~~~NlFM~i~D~~~~P~~~AWr~MP~s~~~~~~~~~~~~~~~~~R~G~Gi~R~~~L~~~~~~A~~~ 313 (313) T protein:vir:95 239 NDGTTTGNGYVGNLFMCILDDQTKPIMGAWRRMPKSEGERNKDRARDEHVVRCRYGFGIQRLDTLGLLATSATAY 313 (313) T ss_pred cccccccCceeeeeeeeeecccccceeeeeccccccccccccccccccceeeeeecccceeecceeEEEeccccC Confidence 11122344577775443322112333344422111 1 112456788999999988887777666666 No 174 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=96.74 E-value=0.00016 Score=41.32 Aligned_cols=286 Identities=9% Similarity=-0.012 Sum_probs=126.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccc--cCCccccc---hhHHhHHHHHHHhhhhhhheeeeEe--- Q lcl|Aclame:pro 95 IRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKK--ENAKPVSS---EEILYTPAREVKTVVDLKPFTTVYQ--- 166 (394) Q Consensus 95 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~lvP---~~~~~~I~~~~~~~~~l~~~~~~~~--- 166 (394) +|. .... ++............. .....+.+. ...+...- +.+.+.|++........+.++.+.+ T Consensus 1 ~~~--~~~~-----~~~~~d~~~~~~~a~-~~~~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~ 72 (329) T protein:vir:79 1 MRG--NIMS-----KEMKYDEFEANVIAN-HMQLRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELS 72 (329) T ss_pred Ccc--chhh-----hhhccchhhhhhHhh-hcccccceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCC Confidence 000 0000 000000000000000 000111111 11122222 3345667765555555555554332 Q ss_pred ecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHhcc---HHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 167 AKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDA---DVDLVGIVSESISQIKVN 243 (394) Q Consensus 167 ~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~ds---~~~l~~~i~~~l~~~~~~ 243 (394) -...++.+..... .+.+.++...+...+..+..++.-....+.++..+.++..=++-+ ..+|..--....+.++.. T Consensus 73 ~~~~~~t~~~~~~-~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~ 151 (329) T protein:vir:79 73 DTDKTFEYQTFDK-VGHAKIIADYTDDLSTVDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQ 151 (329) T ss_pred CceeEEEeeeeec-ceeeeeecCcccccceeecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHH Confidence 2222444454432 344455555555444455566666667777777777765333322 235677777777788888 Q ss_pred HHHHHHhhccccccc---------------------cccccHH----HHHHHHHhhhhh--hc--ccEEEEcHHHHHHHH Q lcl|Aclame:pro 244 TTNDAIAKVLKSFTT---------------------KTVKNLD----EIKALLNGGFDP--AY--NVSLIVSQSFYQTLD 294 (394) Q Consensus 244 ~~~~a~~~g~~~~~~---------------------~~~~~~~----~i~~~~~~~~~~--~~--~a~~vm~~~~~~~l~ 294 (394) .+|.-++.|....+. -...+.+ +|..++.++... +. .-.++|+|+.+..|. T Consensus 152 ~~n~i~f~G~~~~g~~GLlN~p~v~~~~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~Lpp~~~~~L~ 231 (329) T protein:vir:79 152 LVNHLVFKGSKPHKIISVFEHPNLTTINSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILIPPSMRKVLM 231 (329) T ss_pred hhccEEEeecccccceeeecCCCccccccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHHhh Confidence 888877777542111 0011233 444444443322 11 246899999999887 Q ss_pred hhhccCCceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeecccccce--EEEEEEe Q lcl|Aclame:pro 295 TLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEIYGQY--LQAVLRF 372 (394) Q Consensus 295 ~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~--~r~~~r~ 372 (394) ....+.|.-++.---.+..+-+|.+.|-.... ...+.+.+++.+.+.-++-+. .++.+.+......... ...+.|+ T Consensus 232 ~~~~~~~~tvl~~lk~~~~~l~I~~~~el~~a-g~~g~~~~v~y~~~~~~~~~~-vp~~~~~l~~q~~~~~~~v~~~~r~ 309 (329) T protein:vir:79 232 VRMPETTMSYLDYFKQQNGGITIESISELEDI-DGAGTKAALVYEKDPMNMSIE-IPEAFNMLTAQPKDLHFKVPCTSKC 309 (329) T ss_pred cccCCCCccHHHHHHHhCCCcEEEEccccccc-CCCCceEEEEEecCCceEEEe-cCcceeeeeceecCceEEEceeeeE Confidence 65555554333210011112234444432211 111223334333332222111 1111222111111122 2446776 Q ss_pred c-cEEecccceEEEEecCcc Q lcl|Aclame:pro 373 G-VSKVDDKAGYYVTFTPEP 391 (394) Q Consensus 373 d-~~v~~~~af~~l~~~~~~ 391 (394) + ..+.+|.||++++.-..- T Consensus 310 ~Gv~i~~P~ai~~~dGI~~~ 329 (329) T protein:vir:79 310 TGLTIYRPLTLVLIKGLVVG 329 (329) T ss_pred EEEEEECcceeeeeeeeeeC Confidence 5 578889999998876666 No 175 >protein:vir:95875 Length: 401 # NCBI annotation: major coat protein # Family: family:all:10944 # MgeID: mge:1586 # MgeName: N4 # Cross-refs: genbank:acc:YP_950534;genbank:gi:119952248;genbank:GeneID:5075702 Probab=96.68 E-value=8.8e-05 Score=42.82 Aligned_cols=270 Identities=14% Similarity=0.055 Sum_probs=137.3 Q ss_pred HHHhhhhhhhhhhcccccCC---ccccchhH--HhHHHHHHHhhhhhhheeeeEeecCCceeEEEE-------ecCCCcc Q lcl|Aclame:pro 117 PINETTPVEPQKDGIKKENA---KPVSSEEI--LYTPAREVKTVVDLKPFTTVYQAKKASGKYPVL-------QRATTKM 184 (394) Q Consensus 117 ~~~~~~~~~~~~~~~~~~~~---~~lvP~~~--~~~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~-------~~~~~~~ 184 (394) .+..... ..+..++.. +.-+-+.+ ...+++... ...+.+++.+.+++...++--.+ +...... T Consensus 1 ~~~~~a~----~~~~~~s~~g~~~~~~~t~y~~~k~L~~Aa~-~lv~~~fA~~~piPkn~GkTIk~r~y~pl~~~~~pl~ 75 (401) T protein:vir:95 1 MLNYNAP----TDGQKSSIDGANSDQMQTFFWLKKAIITARK-EQYFMPLASVTNMPKHYGKTIKVYEYVPLLDDRNIND 75 (401) T ss_pred CCccCCC----cccccccccccccceeeehhhHHHHHhhhhh-hhhhhhcccccccccccCCeEEEEecccccccccchh Confidence 1111111 111111221 22233323 445555555 47788888888887765433222 1111111 Q ss_pred ccccccc-----------------------------ccccccccccceeeecHhhhhhhhhhhHHHHh-ccHHHHHHHH- Q lcl|Aclame:pro 185 VTVAELE-----------------------------KNPALAKPDFKDVAWNIDTYRGAIPLSQESID-DADVDLVGIV- 233 (394) Q Consensus 185 ~~~~e~~-----------------------------~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~-ds~~~l~~~i- 233 (394) -.+...+ ..+..-..+-..+..+.++++.|+.+|++++. +++.+|...| T Consensus 76 eGv~a~G~~~~~g~~y~~~rdv~~it~~m~~~t~~~~rvn~v~~~~~d~~g~l~qyG~~~e~Td~~~dt~~D~~l~~h~s 155 (401) T protein:vir:95 76 QGIDASGATIVNGNLYGSSKDIGNITSKLPLLTENGGRVNRVGFTRIAREGSIHKFGFFYEFTQESIDFDSDDGLMEHLS 155 (401) T ss_pred cCCCcccccccCccccccccccceeecccccccccccccccccceeeeeeeeeeeccCccchhhhhhhhhcchHHHHHHH Confidence 1111111 11111123334566678999999999997653 5666777765 Q ss_pred HHHHHHHHHHHHHH---HHhhcc----------------ccccccccccHHHHHHHHHhhhh-----------hhcc--- Q lcl|Aclame:pro 234 SESISQIKVNTTND---AIAKVL----------------KSFTTKTVKNLDEIKALLNGGFD-----------PAYN--- 280 (394) Q Consensus 234 ~~~l~~~~~~~~~~---a~~~g~----------------~~~~~~~~~~~~~i~~~~~~~~~-----------~~~~--- 280 (394) .+.|.-+...+++. .+++.. +.....+..+++++..+...+.. ..++ T Consensus 156 ~ell~g~~~~t~d~i~~dll~ag~~viyAg~ats~At~~~~~~~~t~vt~~~l~rl~~~L~~nRapk~t~~i~~s~~~dT 235 (401) T protein:vir:95 156 RELMNGATQITEAVLQKDLLAAAGTVLYAGAATSDATITGEGSTPSVVSYKNLMRLDQILTENRTPTQTTIITGSRMIDT 235 (401) T ss_pred HHHhhhhhhhHHHHHHHHHHhhcCeeecCCccceeeeccccccccceechhHHHHHHHHHHhcccccchhhhhhhhccCc Confidence 33333333332222 233221 12233455677887766544332 1111 Q ss_pred ----c--EEEEcHHHHHHHHhhhccCCceeeccc--------ccCCCcccccccceEEecCcc------c---cc----- Q lcl|Aclame:pro 281 ----V--SLIVSQSFYQTLDTLKDGNGRYLLQDD--------ITAVSGKVLLGKPVFVLSDEV------L---GA----- 332 (394) Q Consensus 281 ----a--~~vm~~~~~~~l~~lkd~~G~~l~~~~--------~~~~~~~~l~G~pV~~~~~~~------~---~~----- 332 (394) + +-++|+..-..|+.++|-.|.|-|.|- ...+.-+.|-++.+++++.+- . +. T Consensus 236 k~i~~s~va~~h~~L~~di~a~~D~~~~~~fi~v~kYa~~~~i~~gEiG~i~~vR~i~~p~~~~w~~ag~~a~~~~~~y~ 315 (401) T protein:vir:95 236 KVIGATRVMYVGSELVPELKAMKDLFGNKAFIETQHYADAGTIMNGEVGSIDKFRIIQVPEMLHWAGAGAQATGANPGYR 315 (401) T ss_pred cccccceEEEEecCchhHHHHHHHhcCCCCceehhhcCCccccccccccccCceeEEecccceeecCCcccccccccccc Confidence 1 356899999999999999888877542 234445677788777765311 1 00 Q ss_pred -------------CceEEEeccccEEEEeecceE------EE-------EeecccccceEEEE-EEeccEEecccceEEE Q lcl|Aclame:pro 333 -------------NKAFIGDFKRGVLFADRKDLG------LR-------WADNEIYGQYLQAV-LRFGVSKVDDKAGYYV 385 (394) Q Consensus 333 -------------~~~~~gd~~~~~~~~~~~~~~------i~-------~~~~~~~~~~~r~~-~r~d~~v~~~~af~~l 385 (394) ...++|+-..+.+-+...+.. ++ -+.+++.++++..| +.+++.+++++-++.| T Consensus 316 ~~~~~~gg~~dVyp~lV~G~dAf~~~~l~g~g~~~~~~~ivk~pG~~~ad~~DPlgQ~g~vgwK~~~a~~vL~~e~m~~i 395 (401) T protein:vir:95 316 TSMVSGQEHYDVYPMLVVGDDSFTSIGFQTDGKSLKFTVMTKMPGKETADRNDPYGETGFSSIKWYYGILVKRPERLALI 395 (401) T ss_pred cccccCCCcceeeeeeEEccccceecccccCCccccceeEeecCCcCCCCCCCcccceehhhhhhhhhhheeccceeEEE Confidence 013455432222212222211 11 12445556666666 5678999999999999 Q ss_pred EecCccCCC Q lcl|Aclame:pro 386 TFTPEPLPL 394 (394) Q Consensus 386 ~~~~~~~~~ 394 (394) +- ++|+ T Consensus 396 es---~a~~ 401 (401) T protein:vir:95 396 KT---VAPL 401 (401) T ss_pred Ee---ecCC Confidence 85 4455 No 176 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=96.62 E-value=0.00042 Score=39.07 Aligned_cols=255 Identities=9% Similarity=0.003 Sum_probs=107.6 Q ss_pred ccCCccccchhHHhHHHHHHHhhhhhhheeeeE---ee---cCCceeEEEEecCCCcccccc----cccccccccccccc Q lcl|Aclame:pro 133 KENAKPVSSEEILYTPAREVKTVVDLKPFTTVY---QA---KKASGKYPVLQRATTKMVTVA----ELEKNPALAKPDFK 202 (394) Q Consensus 133 ~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~---~~---~~~~~~~~~~~~~~~~~~~~~----e~~~~~~~~~~~~~ 202 (394) .+ ...++|+.|+..+++.++....+..+++.- .. .+.++++|.+.. ....... ..+......+..-+ T Consensus 1 Ma-~~~~~p~~~a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (392) T protein:vir:99 1 MA-NAFSKPTAVVDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAP--SRGHTRKLRGAGAERNLTVSDFTED 77 (392) T ss_pred Cc-cccccHHHHHHHHHHHHHhhccchhhhccccccccccCCCCeEEEeeccc--ccceeeeccccccCCcccccccccc Confidence 11 134889999999999998888877776432 11 123456665422 2222221 11111111222333 Q ss_pred e--eeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-----cc----ccccccHHHHHHHH Q lcl|Aclame:pro 203 D--VAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKS-----FT----TKTVKNLDEIKALL 271 (394) Q Consensus 203 ~--v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~-----~~----~~~~~~~~~i~~~~ 271 (394) . ++++-++. .-+.|+++-......++...+.+...++++...+.-+++.... .+ ......++.++++- T Consensus 78 ~~~~~id~~k~-~~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~~~~~~~~~~~~~~~~~~~i~~a~ 156 (392) T protein:vir:99 78 SFPVTLTDVAY-HLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAPYEAAGAVHEVAPDEFFKGVNGAR 156 (392) T ss_pred eEEEEEeeeee-cceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccccChhhhHHHHHHHH Confidence 3 44443444 3444555433333456777777777777777777665543211 11 11223466777665 Q ss_pred Hhhhhhh--cccEEEEcHHHHHHHHhhhc-cCCceee---cccccCCCcccccccceEEecCcccccCceEEEeccccEE Q lcl|Aclame:pro 272 NGGFDPA--YNVSLIVSQSFYQTLDTLKD-GNGRYLL---QDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVL 345 (394) Q Consensus 272 ~~~~~~~--~~a~~vm~~~~~~~l~~lkd-~~G~~l~---~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~ 345 (394) ..+-... ..-.++++|..+..|.+... .+-.+.- ...+.++.-+++.|++|+...+.... ..+.+..+. +. T Consensus 157 ~~L~~~~vP~~R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~~~~~~--t~~a~~~~a-~~ 233 (392) T protein:vir:99 157 RALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTLIPHG--DAYLYHPTA-FI 233 (392) T ss_pred HHHhhcCCCCCCEEEEcHHHHHHHhcccceeecccccchhhhhhhcceeeeeeeeEEEeecccccc--cceeeeccc-cc Confidence 4433221 13467899998888764310 0101111 01133455578999999886543222 222221111 11 Q ss_pred EEee-----------------cceEEEEee--cccccc------eEEEEEE----eccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 346 FADR-----------------KDLGLRWAD--NEIYGQ------YLQAVLR----FGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 346 ~~~~-----------------~~~~i~~~~--~~~~~~------~~r~~~r----~d~~v~~~~af~~l~~~~~~~~~ 394 (394) +..+ ..+...+.. ...+.. .+.+... .+........+.....+.+.+|+ T Consensus 234 ~at~a~v~~~~~~~~~s~s~~~~v~~~~~~~~~~t~~s~~~~v~~~~g~~~v~~~~~~~~~~~~~~~~~~~~v~v~~v 311 (392) T protein:vir:99 234 MATRAPAPPMGAVRSTAISGDQRIAMRWLVDYDSTITSNRSLIDTYFGLKVVEDPNGVGFVRARKIHLIPGSIEVAPE 311 (392) T ss_pred cccccccccccccceeEEecccceecceeecccceeeccccccceeEEEEEEeeccccceeeeeeeeeecceeeeeee Confidence 1111 011111110 000000 0000000 01111111111111111112222 No 177 >protein:vir:1663 Length: 393 # NCBI annotation: unknown # Family: family:all:2417 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044952;genbank:gi:9629659;genbank:GeneID:1261309 Probab=96.50 E-value=0.00056 Score=38.41 Aligned_cols=354 Identities=14% Similarity=0.126 Sum_probs=154.0 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNA---LESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGG 77 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~---~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~ 77 (394) |-+-.|-|-+.++++|++-.-.++..+-.- -.-++....++++.-+.++.-+|.+.+..+...+. T Consensus 1 mnkpdliekqnrlaelkennvslksqisgfevknaiedl~K~~ELe~TlSe~~iEI~k~en~LN~~eE------------ 68 (393) T protein:vir:16 1 MNKPDLIEKQNRLAELKENNVSLKSQISGFEVKNAIEDLPKVQELEKTLSENSIEIIKIENELNAQEE------------ 68 (393) T ss_pred CCCcchhhhhhhhhhhhhcccchhhhccchhhhhhhhhchhHHHHHHhHhhcchhhhhhhhhhhhhhh------------ Confidence 776667777777777776554544433221 01133445555666666666565554444332110 Q ss_pred ccccchhhhHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHh Q lcl|Aclame:pro 78 KEVTQEEKTYRESVNDFIRSKGKIVNDSLRF---EGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKT 154 (394) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~ 154 (394) .......+..++.+........... .+.++.+..+. +... -.|++-.......|..+...|...+-. T Consensus 69 ------~~KGK~kMt~~iesq~A~~eF~~vL~~N~G~S~~k~AW~---A~L~-E~GVtiTD~~~~LP~~lv~sI~~A~~n 138 (393) T protein:vir:16 69 ------KPKGKDKMTNFIESQNAVTEFFDVLKKNSGKSEIKNAWS---AKLA-ENGVTITDTTFQLPRKLVESINTALLN 138 (393) T ss_pred ------cchhhHHHHHHHhhHHHHHHHHHHHhccCCchhhhhhhh---hhHh-hcCcceeccchhccHHHHHHHHHhhhc Confidence 0011111223333333322221111 11222222221 1111 124444455568899999999999999 Q ss_pred hhhhhheeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHh---ccHHHHHH Q lcl|Aclame:pro 155 VVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESID---DADVDLVG 231 (394) Q Consensus 155 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~---ds~~~l~~ 231 (394) +.++++...|-.++.--++.. -.+...+-....|....+ ...+|.--++.+-.++....+ -++.. .+-..|.. T Consensus 139 ~n~v~~vfHVT~~~~~~V~~s--~~s~~eAq~HkdGqTK~e-qa~~~~~~Tl~~~~VY~~~S~-Ae~~K~~~~sYsel~N 214 (393) T protein:vir:16 139 TNPVFKVFHVTNVGALLVSRS--FDSANEAQVHKDGQTKTE-QAATLTIDTLEPVMVYKLQSL-AERVKRLQMSYSELYN 214 (393) T ss_pred cCcceeeeeeccchhhhHHhh--hhhhhhhhhhccCCcccc-ceeeeeeechhHHHHHHHHHH-HHHHHHhhhhHHHHHH Confidence 999988665544432222211 123333333444444444 345666666666555544444 23333 34445689 Q ss_pred HHHHHHHHHHH-HHHHHHHhhcccccccccccc----------------------HHHHHHHHHhhhhhhcccEEEEcHH Q lcl|Aclame:pro 232 IVSESISQIKV-NTTNDAIAKVLKSFTTKTVKN----------------------LDEIKALLNGGFDPAYNVSLIVSQS 288 (394) Q Consensus 232 ~i~~~l~~~~~-~~~~~a~~~g~~~~~~~~~~~----------------------~~~i~~~~~~~~~~~~~a~~vm~~~ 288 (394) ||..+|+.+|. +..+.+..-|+|+++-...-. .|.|..++.-..+.+..-.+++... T Consensus 215 ~i~~ELtQ~~vnk~Vd~AlV~GDG~N~f~~~DK~advK~I~k~Ttkaksagktpfadaieeavdfvrptagrrylivkte 294 (393) T protein:vir:16 215 LIVAELTQAIVNKIVDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKTE 294 (393) T ss_pred HHHHHHHHHHHHHHHHhhhheecCCCCccchhhHHHHHHHHHHhhhhhhcCCCchhHHHHHHHhhhccCCCceEEEEecc Confidence 99999999988 678889988988865332211 1222222211111111122333333 Q ss_pred H-HHHHHhhhcc--CCc-eeecccccCCCcccccccc-eEE-ecCcccccCceEEEeccccEEEEeecceE-EEEeeccc Q lcl|Aclame:pro 289 F-YQTLDTLKDG--NGR-YLLQDDITAVSGKVLLGKP-VFV-LSDEVLGANKAFIGDFKRGVLFADRKDLG-LRWADNEI 361 (394) Q Consensus 289 ~-~~~l~~lkd~--~G~-~l~~~~~~~~~~~~l~G~p-V~~-~~~~~~~~~~~~~gd~~~~~~~~~~~~~~-i~~~~~~~ 361 (394) + .+-|..|+.+ +.+ .|-..+ +. -.+--|+. +++ +.+.+. .+-++.|-+. .+ +-++++ ++.....+ T Consensus 295 drkalldelrqatananvrikndd-te--iasevgvdeiivytgskal--kptvlvdqky--hi-dmqdltkvdafewkt 366 (393) T protein:vir:16 295 DRKALLDELRQATANANVRIKNDD-TE--IASEVGVDEIIVYTGSKAL--KPTVLVDQKY--HI-DMQDLTKVDAFEWKT 366 (393) T ss_pred chHHHHHHHHhhhccCceeeeccc-hh--hhhhcCcceeeeeeccccc--cceeeecccc--cc-chhhhhhhhhheecc Confidence 2 3334444322 222 121111 00 01111221 111 111221 2234444332 12 233332 11111111 Q ss_pred ccceEEEEEEeccEEecccceEEEEec Q lcl|Aclame:pro 362 YGQYLQAVLRFGVSKVDDKAGYYVTFT 388 (394) Q Consensus 362 ~~~~~r~~~r~d~~v~~~~af~~l~~~ 388 (394) +..-+.+.---.|-|---+|-+.++++ T Consensus 367 nsnmilvetltsghvetynagavitvs 393 (393) T protein:vir:16 367 NSNMILVETLTSGHVETYNAGAVITVS 393 (393) T ss_pred CCceEEEeecccCcceeeccceeEeeC Confidence 111122222223333333444455555 No 178 >protein:vir:79548 Length: 652 # NCBI annotation: putative protease/scaffold protein # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272518;genbank:gi:148609387;genbank:GeneID:5204384 Probab=96.49 E-value=0.00057 Score=38.35 Aligned_cols=377 Identities=10% Similarity=0.038 Sum_probs=140.5 Q ss_pred ChHHH---------------------------HHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHH-HHHHHHH Q lcl|Aclame:pro 1 MFEEK---------------------------IKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAE-VEQAKAN 52 (394) Q Consensus 1 ~l~e~---------------------------l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~e-i~~l~~~ 52 (394) |+... -..++++ ...+.+++...++.+...-. .++..+..+ +.+..-. T Consensus 195 ~~~~~~~~~~~v~d~EPa~~~~pvqAaAP~~De~airAq---~~aeeraRi~~I~~l~a~Fg-gr~~~l~~~~l~d~~~s 270 (652) T protein:vir:79 195 MITPPRNSAPRVQDDEPAASRTPVQAAAPVVDENSIRAQ---VLAEQKARVNGINDLFAMFG-GRYQTLQAQCLADPECS 270 (652) T ss_pred HhcccccccccccccccccccccccccCCcCchhHHHHH---HHHHHHHHHHHHHHHHHhhc-cccchHHHHHhhccCCC Confidence 11000 0111111 11112222222322211000 001111111 1111123 Q ss_pred HHHHHHHHHHHHHHHhhccccccccccccchhhhHHHHHHHHHHHH--HHHHHH-------HHHHHHHHHHHHHHHhh-- Q lcl|Aclame:pro 53 LVEAENDLKLYESSVEVGGAENIGGKEVTQEEKTYRESVNDFIRSK--GKIVND-------SLRFEGKDEVLMPINET-- 121 (394) Q Consensus 53 i~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~-------~~~~~~~~~~~~~~~~~-- 121 (394) ++++++++-+.......+..................+.+...+... ...... .+....+......+... T Consensus 271 ~e~ar~~il~~l~~~~~p~~~~~~~~~~~~~g~~~~d~~~~aL~~R~g~~~~~~~~~~~g~~L~elAr~~L~~~G~~~~~ 350 (652) T protein:vir:79 271 LEQAREKLLNEMGRESTPSNKNTPAHIYAGNGNFVGDGIRQALMARAGFEKTERDNVYNGMTLREYARMSLTERGIGVSS 350 (652) T ss_pred HHHHHHHHHHHHHhhcCCCCCCcceeEeeccchhhHHHHHHHHHhhcCCcccccCccccCccHHHHHHHHHHhhccCCCC Confidence 3333333322221111111100000000000111111111111110 000000 00000000000000000 Q ss_pred -hhhhhhhh--cccccCCccccchhHHhHHHHHHHh-hhhhhheeeeEeecCCceeEEEEecCCCccccccccccccccc Q lcl|Aclame:pro 122 -TPVEPQKD--GIKKENAKPVSSEEILYTPAREVKT-VVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALA 197 (394) Q Consensus 122 -~~~~~~~~--~~~~~~~~~lvP~~~~~~I~~~~~~-~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~ 197 (394) ...+.... ..+++....+.-......+...-.. ....+..|+..+++--.-.-.+.-+..+..-.+.|+|+.+.. T Consensus 351 ~~~~~~v~~A~~hsTsDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~~L~~V~E~gEyk~~- 429 (652) T protein:vir:79 351 YNPMQMVGAAFTHSTSDFGNILLDVANKAILQGWEDAPETYEQWTRKGQLSDFKIAHRVGMGGFSALRQVREGAEYKYV- 429 (652) T ss_pred CCHHHHHHHHhhcCcchHHHHHHHHHHHHHHHHHhhhHHHHHHHhccCCCccccccceeecCCCCCccccCCCCcccee- Confidence 00011111 1123333333332323333322221 233566666555433221122223445666778899998763 Q ss_pred ccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc-------------------c Q lcl|Aclame:pro 198 KPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFT-------------------T 258 (394) Q Consensus 198 ~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~-------------------~ 258 (394) ...=..-++...+++..+.+|++++-.-+.+..+-|...++++..++++..+...+.++. + T Consensus 430 t~~e~~e~~~l~tyG~~~~iTRqaiINDDL~a~~~ip~~~g~aA~~~~~~~vy~~l~~Np~~~~DGk~LF~hA~H~Nl~~ 509 (652) T protein:vir:79 430 TTGDKQATIALATYGELFSITRQAIINDDLNMLTDVPMKLGRAAKSTIADLVYAILTSNPKISTDNVSLFDKAKHANVLE 509 (652) T ss_pred eecCccceeeeecccCeeeeehheeeccchhHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccccCCceeecccccccccc Confidence 333355788999999999999997643356788889999998888888775443322110 0 Q ss_pred cccccHHHHHHHH-----Hhhhhhhc---ccEEEEcHHHHHHHHhhhccCCceeecccccCCCccccccc-ceEEecCcc Q lcl|Aclame:pro 259 KTVKNLDEIKALL-----NGGFDPAY---NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGK-PVFVLSDEV 329 (394) Q Consensus 259 ~~~~~~~~i~~~~-----~~~~~~~~---~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~-pV~~~~~~~ 329 (394) .+..+.+.+-.+. ++.-.... ...|++.+.......++-.+...+ ..+...+...-+.|+ .|++.+-.. T Consensus 510 ~aa~~~~~l~~ar~aM~~Qk~g~~~l~i~P~~llvp~~le~~a~~ll~s~~v~--~a~~~~~~~Np~~~~~~~i~eprL~ 587 (652) T protein:vir:79 510 SAAMDVASLDKARQLMRVQKEGERHLNIRPAFVLVPTAMESVANQVIRSSSVK--GADINAGIINPVKDFATVIAEPRLD 587 (652) T ss_pred cccCCHHHHHHHHHHHHHhccCCccccccccEEEecchhHHHHHHHhccCCCc--ccccccccccccccccccccccccC Confidence 1122233222221 11111111 235666666665555553222111 001111222223343 333322110 Q ss_pred -cccCceEEEeccc------cEEEEeecceEEEEeeccccc---ceEEEEEEeccEEecccceEEEEe Q lcl|Aclame:pro 330 -LGANKAFIGDFKR------GVLFADRKDLGLRWADNEIYG---QYLQAVLRFGVSKVDDKAGYYVTF 387 (394) Q Consensus 330 -~~~~~~~~gd~~~------~~~~~~~~~~~i~~~~~~~~~---~~~r~~~r~d~~v~~~~af~~l~~ 387 (394) ......|+++-.. +|+-+. ++..|+. ..+|. .-++++..||.++++-.++++.+- T Consensus 588 ~~s~~~wylaa~~~~dtiev~yL~G~-~~P~ie~--~~gf~~dG~~~kvrlD~G~~~iD~RG~~k~t~ 652 (652) T protein:vir:79 588 DNSQTTFYLAASKGSDTIEVAYLNGV-DTPYIDQ--MEGFSVDGVTTKVRIDAGVAPVDHRGLVKCTA 652 (652) T ss_pred CCCcccEEEecCCCCCeEEEEEecCC-CCCeeee--cCCCCcceEEEEEEEeccCceeeccceeeecC Confidence 0111223332211 122222 2333332 23343 346888889999999999998876 No 179 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=95.95 E-value=0.0012 Score=36.57 Aligned_cols=360 Identities=10% Similarity=-0.032 Sum_probs=121.0 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc-c Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGK-E 79 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~-~ 79 (394) =|++++++|.++++.+.+++++..++.+....+...+++++++++++.++++++.+++.+.................. . T Consensus 9 el~~~~~el~~~~~~~~~~~~~~~~e~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 88 (421) T protein:vir:13 9 ELRAKKKELEEKRCGIVEEIRSLAKEKKEEEARSKALEREKIEARMEIIEEEIESVMTAIDEERKNTNFTGGRVIINGDS 88 (421) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccch Confidence 355778888888888877777665544332222334678888999999999988888777665554433333322222 2 Q ss_pred ccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhh Q lcl|Aclame:pro 80 VTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLK 159 (394) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~ 159 (394) ...........+..+++......... .. ... ..............+++.......+..+-...++. T Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~~r--a~-----------~t~-~~gg~liP~~~~~~Ii~~~~~~~~l~~l~~~~~~~ 154 (421) T protein:vir:13 89 KEEKRSLQLSAMSKTIRGIQLSEEER--DI-----------MSS-TNNGAVIPQEFVNEFEKLKEGYPSLKEHCHVIPVN 154 (421) T ss_pred hHHHHHHHHHHHHHhhhccchhHHHh--hc-----------ccc-CCcceecchhhHHHHHHHHHhhhhhhhhceeeecc Confidence 22223344455655555432111100 00 000 00000011111111222222222222222222332 Q ss_pred heeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhh-hh-hhhhhhHHHHhccHHHHHHHHHHHH Q lcl|Aclame:pro 160 PFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDT-YR-GAIPLSQESIDDADVDLVGIVSESI 237 (394) Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~-~~-~~~~vs~ell~ds~~~l~~~i~~~l 237 (394) .....+++......-....... .....+. .+.....++..-.+...- +. ....-|..-+. .+ +...|.+.+ T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~E--~~~~~~s--~~~f~~i~~~~~k~~~~v~iS~ell~ds~~~l~--~~-i~~~la~~~ 227 (421) T protein:vir:13 155 RNAGKMPVRAGASVDKLANLAK--DTELVKA--MLKTQPMAYDIDDYGLLAPIDNSLLEDSEINFL--EF-VNEEFAEFA 227 (421) T ss_pred CCceEEEEeecCCccceeeccc--ccccccc--ccceeEEEeeeeeeEeehhhhHHHHhhhHHHHH--HH-HHHHHHHHH Confidence 2222233222111100100111 1111211 122222222222221111 11 01111111111 11 334444444 Q ss_pred HHHHHHHHHHHHhhccccccccccccHHHHH--------------------HHHHhhhhhhcccEEEEcHHHHHHHHhhh Q lcl|Aclame:pro 238 SQIKVNTTNDAIAKVLKSFTTKTVKNLDEIK--------------------ALLNGGFDPAYNVSLIVSQSFYQTLDTLK 297 (394) Q Consensus 238 ~~~~~~~~~~a~~~g~~~~~~~~~~~~~~i~--------------------~~~~~~~~~~~~a~~vm~~~~~~~l~~lk 297 (394) ...+....-....+.....+..+...+-+++ ..+..+.+. +..+++.+.....- . T Consensus 228 ~~~~~~~i~~~~~g~~~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~--~G~~i~~~~~~~~~---~ 302 (421) T protein:vir:13 228 VNTENAEIVKQAKAVLAEETINDYAGLVKTINSLVPNARKRAIIVTNSDGRAYLDGLMDK--QGRPLLKELSDGGD---L 302 (421) T ss_pred HHHhhhhHhhhhhhccccccccchHHHHHHHHHhhhhhcCCCEEEEcHHHHHHHHHhhcC--CCceeecCcCCCCC---c Confidence 4443333323333333333332332222221 222222221 23344332110000 0 Q ss_pred ccCCceeeccc-ccCCCcccccccceEEecCcccccCceEEEeccccEEEEeec------ceEEEEeecccccceEEEEE Q lcl|Aclame:pro 298 DGNGRYLLQDD-ITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRK------DLGLRWADNEIYGQYLQAVL 370 (394) Q Consensus 298 d~~G~~l~~~~-~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~------~~~i~~~~~~~~~~~~r~~~ 370 (394) .=.|.|+...+ +..+..+ -.++++-+ -...+.+++....-+-..+. .+.++....-.+ ....... T Consensus 303 tl~G~pV~~~~~~~~~~~~---~~~~~~gd----~~~~~~~~~~~~~~v~~~~~~~f~~~~~~~r~~~r~d~-~~~~~~a 374 (421) T protein:vir:13 303 VFKGRPVIELEESIFDVGD---ETKFIVSD----FKTLIKFMDRKQYLIDQSKEAGYTKNETIARIIERFDV-NSPLDKS 374 (421) T ss_pred eecceeeEEeccccccCCC---ceEEEEEe----ccccEEEEEecceEEEeecccccccCeeEEEEEeeecc-eeecchh Confidence 01344443211 1000000 11222211 11123445543321111111 111111100000 0000011 Q ss_pred EeccEEecccceEEEE------ecCccCCC Q lcl|Aclame:pro 371 RFGVSKVDDKAGYYVT------FTPEPLPL 394 (394) Q Consensus 371 r~d~~v~~~~af~~l~------~~~~~~~~ 394 (394) -..+.+..+.+|+.++ .+..-+|- T Consensus 375 ~~~~~~~~~~a~v~~~~~~~~~~~~~~~~~ 404 (421) T protein:vir:13 375 SDAEKIRKFGVIVKLQEVLKSSPRSGKNKN 404 (421) T ss_pred hheeeecccceeeccccccCCCCcCCCCcc Confidence 1234555666666651 12222443 No 180 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=95.83 E-value=0.0014 Score=36.25 Aligned_cols=367 Identities=10% Similarity=0.016 Sum_probs=79.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhh--HHHHHH---------HHHHHHHHHHHHHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDD--LEAARS---------IKAEVEQAKANLVEAENDLKLYESSVEV 69 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~--~~~~~~---------~~~ei~~l~~~i~~l~~~~~~~~~~~~~ 69 (394) |-=.+++ |+++++++..++.++.++..++....+ ...+++ ..++++.+++++.+++++++.++...+. T Consensus 1 ~~~~~~~-l~~~~~~~~~~l~el~e~~~~l~k~~~el~~~l~ea~~~ee~~~~ee~i~~l~~~~~el~e~~~~l~~ei~~ 79 (466) T protein:vir:80 1 MALRQLM-LAKKIEQRKAALAELLEQEKALQKRSEELEAAIDEANTDEEIAVVEDEINKLEGEKTELEEKKSKLEGEIKE 79 (466) T ss_pred CchHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3222222 667777777777666655544322211 111111 1112222333333333222222221111 Q ss_pred cccc---ccccccccc-hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhh----hhhhcccccCCccccc Q lcl|Aclame:pro 70 GGAE---NIGGKEVTQ-EEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVE----PQKDGIKKENAKPVSS 141 (394) Q Consensus 70 ~~~~---~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~lvP 141 (394) .... ......... .............+......... .+................ ...........+ ..- T Consensus 80 le~el~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~g 156 (466) T protein:vir:80 80 LENELEQLNNKEPKNNSEPAQVSGARTQQFVGGETRMKGF--FRNMPYEQRAALIARSEVKEFLAQVRTLAQQKRA-VSG 156 (466) T ss_pred HHHHHHHHHHhhhccCchhHHHHhhhhhHHhhHHHHHHHH--HHhhhhhhHHHHHHHHHHHHHHHHHHHHhhhhhh-hcc Confidence 0000 000000000 00000000000000000000000 000000000000000000 000000000000 000 Q ss_pred h-hHHh-HHHHHHHhhhhhhheeeeEeecCCceeEEEEecCCCcccccc----cccccccccccccce--eeecHhhhhh Q lcl|Aclame:pro 142 E-EILY-TPAREVKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVA----ELEKNPALAKPDFKD--VAWNIDTYRG 213 (394) Q Consensus 142 ~-~~~~-~I~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----e~~~~~~~~~~~~~~--v~~~~~~~~~ 213 (394) . .+.+ .+...+... ++.. -++.+..-..|. + +..-+.. ..+....+.. .... .++..-.+.. T Consensus 157 ~~~~vP~~~~~~i~~~--l~~~---~~l~~~~~v~~~---~-g~~~~~~~~~~~~a~wv~E~~-~~~~~~~~f~~i~~~~ 226 (466) T protein:vir:80 157 AELTIPDVMLELLRDN--MHRY---SKLISKVRLRPL---K-GTARQNIAGAIPEGVWTEAVA-NLNELSLSFSQIEVDG 226 (466) T ss_pred ccccccHHHHHHHHHh--hhhh---hhhhhheeeeec---C-ceeEeeeecCCcceeeccccc-ccccccccccceeecc Confidence 0 0111 122221111 1111 111111001111 1 1111100 0111111110 1111 1111111110 Q ss_pred hhhhhHHHHhccHHHHH---HHHHHHHHHHHHHHHHHHHhhccccccccccccHHHHHHHHHhhhh-------------- Q lcl|Aclame:pro 214 AIPLSQESIDDADVDLV---GIVSESISQIKVNTTNDAIAKVLKSFTTKTVKNLDEIKALLNGGFD-------------- 276 (394) Q Consensus 214 ~~~vs~ell~ds~~~l~---~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~~~~~~~i~~~~~~~~~-------------- 276 (394) .. ++- ++.=|.--|. ..+...|...++.+.....-...-.|..++ ...+|...+..... T Consensus 227 ~k-~~~-~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~ail~G~G~~--~P~Gil~~~~~~~~~~~~~~~~~~~~~~ 302 (466) T protein:vir:80 227 YK-VGG-FIPIPNSTLEDSDLNLADEILDAIGQAIGFALDKAILYGTGTK--MPVGIVTRLAQTTQPPNWGTKAPAWTNL 302 (466) T ss_pred ee-eee-ehhhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhheeeccCCC--Ccceeeeccccccccccccccccccccc Confidence 00 000 0000000010 113333344444444444433333332221 22345332111000 Q ss_pred ---------hhc-ccEEEEcHHHH-HHHHhhhccCCceeecccccCCCcccccccceEEecCcc----cccCceEEEecc Q lcl|Aclame:pro 277 ---------PAY-NVSLIVSQSFY-QTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEV----LGANKAFIGDFK 341 (394) Q Consensus 277 ---------~~~-~a~~vm~~~~~-~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~----~~~~~~~~gd~~ 341 (394) +.. ++.+.++..++ ..+.+.++.+|.++|.++.. ....|.|..+....... .+.+..++|- T Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~--~~~~l~~~~~~~~~~g~~~~~~~~~~~i~G~-- 378 (466) T protein:vir:80 303 STTNLLKIDPTGKSAEEFFSELVLKLSKARANYSNGMKFWAMSSN--THAVLMSKAITFNSAGALVASLNNTMPIVGG-- 378 (466) T ss_pred chhhhhhhhhhccchhhHHHHHHHHHHhhhccccCCceeEEecch--hHHHhhcccccccCCccccccCCCccccccc-- Confidence 000 12233333332 22345556677777764322 12244444332211100 0001111110 Q ss_pred ccEEEEeec-ceEEEEeecccccceEEEEEEeccEEeccc---------ce---EEEEe----cCccCCC Q lcl|Aclame:pro 342 RGVLFADRK-DLGLRWADNEIYGQYLQAVLRFGVSKVDDK---------AG---YYVTF----TPEPLPL 394 (394) Q Consensus 342 ~~~~~~~~~-~~~i~~~~~~~~~~~~r~~~r~d~~v~~~~---------af---~~l~~----~~~~~~~ 394 (394) . ++..+.. .-.+-+-++ ..+....|-+..+.... .| .++.. ..+-..+ T Consensus 379 p-vv~s~~~~~~~~~~g~~----~~y~i~~r~~~~i~~~~~~~f~~d~~~~r~~~r~dg~~~~~~afv~~ 443 (466) T protein:vir:80 379 D-IVILDFIPDNDIIGGYG----SLYLLAERADIKLAQSEHVRFIEDQTVFKGTARYDGKPVFGEGFVAV 443 (466) T ss_pred c-eeecCccCccceeeecc----ccEEEEeecceEEEechhhhhhcCcEEEEEEEEEccEEeccCceEEE Confidence 0 0000000 000000011 11122223222222211 11 11111 1111111 No 181 >protein:vir:80446 Length: 367 # NCBI annotation: BcepGomrgp07 # Family: family:all:1522 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210227;genbank:gi:146329919;genbank:GeneID:5123555 Probab=95.53 E-value=0.0019 Score=35.52 Aligned_cols=257 Identities=9% Similarity=-0.005 Sum_probs=111.9 Q ss_pred hhhhcccccCCccccchhHHhHHHHHHHhhhhhhheee---------eEeecCCceeEEEEecCCCcccccccccc--cc Q lcl|Aclame:pro 126 PQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTT---------VYQAKKASGKYPVLQRATTKMVTVAELEK--NP 194 (394) Q Consensus 126 ~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~e~~~--~~ 194 (394) +......+.-...++|+.+...+.+...+.+.|.+-.= ....++..+++|+...-++....+.+... .. T Consensus 1 M~~~~~~T~l~Dii~pEvF~~Yv~~~~~e~~~l~qSGiv~~d~~l~~~~~~gG~~v~iPf~~~L~g~~~n~~~d~~~~~~ 80 (367) T protein:vir:80 1 MPDFNNQVRLVDAVIPEVYTSYTAIDRPELTAFFLSGAVASNDFLSQFLSAPGRLINIPFWRDLDSLEPNYGSDNPNVEA 80 (367) T ss_pred CcchhhhhhhhhccchhhhhHHHhhhhhhhhhhhhcceeecCHHHHHHhhcCCCEEEeeeeccCCCCccccCCCCCcccc Confidence 00000001111246777777766665544444332110 11244556788888665443332222111 01 Q ss_pred ccccc-ccce--eeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHH--------HHHHHHHHhhc----------- Q lcl|Aclame:pro 195 ALAKP-DFKD--VAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIK--------VNTTNDAIAKV----------- 252 (394) Q Consensus 195 ~~~~~-~~~~--v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~--------~~~~~~a~~~g----------- 252 (394) +..+. +..+ +.+..-+--+...++..+-- -|....|.++++.-- .......|... T Consensus 81 t~~kittg~~~a~v~~r~kaw~~~Dla~~lsG---~dpm~~Ia~qva~yW~r~~q~~Lla~L~Gvf~~~~a~~~~~~~~~ 157 (367) T protein:vir:80 81 PIDGLGSGEMKTTKTWLNKAYGAMDLTAELAG---SNPMTRIRNRFGVYWTRQWQRRIIAMAVGVYKSNLAGNFATIKTR 157 (367) T ss_pred cccccccchheeeeehhcccchhhhHHHHhhC---chHHHHHHHHHHHHhhhhhHHHHHHHHHHhhccccccchhhhhhh Confidence 11111 1121 22222233333344444432 244455555554322 22222111100 Q ss_pred ------------------c-ccccccccccHHHHHHHHHhhhhhhcc-cEEEEcHHHHHHHHhhhccCCceeecccccCC Q lcl|Aclame:pro 253 ------------------L-KSFTTKTVKNLDEIKALLNGGFDPAYN-VSLIVSQSFYQTLDTLKDGNGRYLLQDDITAV 312 (394) Q Consensus 253 ------------------~-~~~~~~~~~~~~~i~~~~~~~~~~~~~-a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~ 312 (394) + .++.+.+..+++.++++...+-+.... +.++||+.++..|++++= =.|+- +.-... T Consensus 158 ~~~~a~~~~~~~~~~~Dis~~t~~~~~~~s~~~~~~A~~~lGD~~~~l~~i~mHS~V~~~L~~~~l--i~~i~-~sd~~~ 234 (367) T protein:vir:80 158 GRVPAEVLGTAGDMVIDISGQTNPADAVFNREAFVDAAFTMGDHVGSIAAIAVHSMVYKRMTNNDE--IEFIP-DSKGQL 234 (367) T ss_pred hccccccccccCceeeeeeccCCCccceecHHHHHHHHHHhccccccccEEEEchHHHHHHHhccc--ccccc-CCCCcc Confidence 0 001122345567777775554443332 789999999999987541 01111 110112 Q ss_pred CcccccccceEEecCccccc---C----ceEEEeccccEEEEeecc--eEEEEeecccc-----cceEEEEEEeccEEec Q lcl|Aclame:pro 313 SGKVLLGKPVFVLSDEVLGA---N----KAFIGDFKRGVLFADRKD--LGLRWADNEIY-----GQYLQAVLRFGVSKVD 378 (394) Q Consensus 313 ~~~~l~G~pV~~~~~~~~~~---~----~~~~gd~~~~~~~~~~~~--~~i~~~~~~~~-----~~~~r~~~r~d~~v~~ 378 (394) .-++++|++|++.|+++... . +.+||. +.+.+...+ ..+++.+++.- ...+..+.| .++| T Consensus 235 ~i~ty~G~~VIvDD~~Pv~~~~a~~~yttYlfg~---GAi~~~~~~~~~~~E~~Rd~~~~~~gG~d~L~~Rr~---~~~h 308 (367) T protein:vir:80 235 TIPTYMGKVVIVDDGMPVFGTGADKTYLSILFGG---AAFGYADGAPQVPVAVGRRELRGNGSGLEYILERKE---WIVH 308 (367) T ss_pred ccceecceeEEEeCCCcccccCCCceEEEEEEec---ceeeecccCCccceecccchhhhcCCceEEEEeeee---EEee Confidence 34689999999988776532 1 234442 222221112 22344444322 122333333 6788 Q ss_pred ccceEEEEecC---------------ccCCC Q lcl|Aclame:pro 379 DKAGYYVTFTP---------------EPLPL 394 (394) Q Consensus 379 ~~af~~l~~~~---------------~~~~~ 394 (394) |..|....-.- ..+|- T Consensus 309 P~G~s~~~~~v~~~~~~~~~~~~~~~~~sPt 339 (367) T protein:vir:80 309 PGGFNWLDADVTIPDNTGSPSGITSGPPAIT 339 (367) T ss_pred cceeeecccccccccccccccccccccCCCC Confidence 88887754321 12233 No 182 >protein:vir:78387 Length: 349 # NCBI annotation: putative coat protein # Family: family:all:1522 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110837;genbank:gi:134288598;genbank:GeneID:5179650 Probab=95.33 E-value=0.0023 Score=35.07 Aligned_cols=252 Identities=8% Similarity=-0.043 Sum_probs=110.9 Q ss_pred cccccCCccccch--hHHhHHHHHHHhhhhhhheeeeE----------eecCCceeEEEEecCCC--ccccccccc--cc Q lcl|Aclame:pro 130 GIKKENAKPVSSE--EILYTPAREVKTVVDLKPFTTVY----------QAKKASGKYPVLQRATT--KMVTVAELE--KN 193 (394) Q Consensus 130 ~~~~~~~~~lvP~--~~~~~I~~~~~~~~~l~~~~~~~----------~~~~~~~~~~~~~~~~~--~~~~~~e~~--~~ 193 (394) ..++--.-..+|+ .+.+.+.+...+.+.|.+ +..+ ..++..+++|+...-++ ....+.... .. T Consensus 1 Ma~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~q-SGii~~d~~l~~~~~~gG~~~~iPf~~~L~g~~e~nv~~D~~~~~~ 79 (349) T protein:vir:78 1 MAITTIGDIVTGNIPVLASYMTEDPVEKTAFFD-SGILTSTPYAAEIANGPSNIANLPFWKAIDTSIEPNYSNDVYQDIA 79 (349) T ss_pred CCceEEeeeeccCHHHHHHHHHHhhHHhhhhhh-ccceeccHHHHHHhhcCCCEEEeeeeecCCCCcccccCCCCccccc Confidence 1122223346776 366666555544444333 1111 23455788898865433 222332221 11 Q ss_pred ccccccccceeeecH--hhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHH--------HHHHHhhccccc------- Q lcl|Aclame:pro 194 PALAKPDFKDVAWNI--DTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNT--------TNDAIAKVLKSF------- 256 (394) Q Consensus 194 ~~~~~~~~~~v~~~~--~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~--------~~~a~~~g~~~~------- 256 (394) ....-.+..++-... -+--....++.++-- -|..+.|.++++.-..+. ....|.....+. T Consensus 80 t~~kitt~~~~a~~~~r~kaw~~~Dla~~lsG---~dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~a~~~~~~~~ 156 (349) T protein:vir:78 80 TPRAIQTGEMMARVAYLNEGFGQADLTVELTS---QNPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHEQN 156 (349) T ss_pred ccccccccceeeeeeeeccccchhHHHHHhhC---chHHHHHHHHHHHHHhhHHHHHHHHHHHHhhcccccccchhhhcc Confidence 111112223322222 222233344444432 144555666655433322 222221111100 Q ss_pred ------cccccccHHHHHHHHHhhhhhhc------ccEEEEcHHHHHHHHhhhccCCceeecccccCCCcccccccceEE Q lcl|Aclame:pro 257 ------TTKTVKNLDEIKALLNGGFDPAY------NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFV 324 (394) Q Consensus 257 ------~~~~~~~~~~i~~~~~~~~~~~~------~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~ 324 (394) ...+..+.+.++++...+-+.+. =+.++||+.++..|++++-= .|+ ++.-....-.+++|++|++ T Consensus 157 ~~t~d~s~~a~~~~~~~~dA~~~lgda~~Gd~~~~lt~i~mHS~v~~~L~~~~li--~~i-~~s~~~~~i~ty~G~~Viv 233 (349) T protein:vir:78 157 DMVVDVSATLGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLI--DFI-RDAENNTMFATYQGYRVIV 233 (349) T ss_pred cceeeeccccCCChhhhhhhHHHHHHHhccccccceeEEEEchHHHHHHHhhhhh--hhc-cCcccCcccceecCeEEEE Confidence 01122355566666555444331 15799999999998764311 011 1111111235899999999 Q ss_pred ecCccccc-------CceEEEeccccEEEEeecc--eEEEEeecccc-----cceEEEEEEeccEEecccceEEEEecCc Q lcl|Aclame:pro 325 LSDEVLGA-------NKAFIGDFKRGVLFADRKD--LGLRWADNEIY-----GQYLQAVLRFGVSKVDDKAGYYVTFTPE 390 (394) Q Consensus 325 ~~~~~~~~-------~~~~~gd~~~~~~~~~~~~--~~i~~~~~~~~-----~~~~r~~~r~d~~v~~~~af~~l~~~~~ 390 (394) .|+++... .+.+||. +.+.+...+ ..+++.+++.. +..+....| -++||..|....-..+ T Consensus 234 DD~~Pv~~~g~~~~yttylfg~---GAi~~~~~~~~~~~et~rd~~~g~~~G~d~l~~R~~---~~~hp~G~s~~~a~v~ 307 (349) T protein:vir:78 234 DDSMTVVGQGAQRKFISIIFGQ---GAIGYGEGNPVMPLEYEREASRANGGGVETLWTRKT---WLLHPFGYRFTSAVIT 307 (349) T ss_pred eCCCccccCCCCceEEEEEeec---ceEEEccCCCccceeeecccccCCcceeEEEEEeeE---EEeeeeeeeecccccc Confidence 88776532 1234552 333222111 23455444422 223433333 4578877776642211 Q ss_pred --------cCCC Q lcl|Aclame:pro 391 --------PLPL 394 (394) Q Consensus 391 --------~~~~ 394 (394) .+|- T Consensus 308 ~~~~~~~~~sPt 319 (349) T protein:vir:78 308 GNGTETIARSAS 319 (349) T ss_pred CCccccccCCCC Confidence 2343 No 183 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=95.02 E-value=0.0025 Score=34.85 Aligned_cols=170 Identities=14% Similarity=0.039 Sum_probs=85.4 Q ss_pred hhhhhhhhHHHHh-----ccHHHHHHHHHHHHHHHHHHHHHHHHhhccc----------------cc--ccccccc---- Q lcl|Aclame:pro 211 YRGAIPLSQESID-----DADVDLVGIVSESISQIKVNTTNDAIAKVLK----------------SF--TTKTVKN---- 263 (394) Q Consensus 211 ~~~~~~vs~ell~-----ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~----------------~~--~~~~~~~---- 263 (394) +-+ .-+|.-++. ++..++.+...++.+++++...|..++.... +. .+....+ T Consensus 1 iD~-lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA~~~~p~~~~~~g~~~~~~a~~t~~~~~l 79 (221) T protein:vir:17 1 MDD-LLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASASIAAAPVTGQDGGFSVNIGAGNTNNAQAI 79 (221) T ss_pred CCc-chhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCcccccccCcceeccccccCCHHHH Confidence 111 123333333 2566889999999999999988887654221 00 0111122 Q ss_pred HHHHHHHHHhhhhhh---cccEEEEcHHHHHHHHhhhc-cCCceeecc---cccCC-CcccccccceEEecCcccccCce Q lcl|Aclame:pro 264 LDEIKALLNGGFDPA---YNVSLIVSQSFYQTLDTLKD-GNGRYLLQD---DITAV-SGKVLLGKPVFVLSDEVLGANKA 335 (394) Q Consensus 264 ~~~i~~~~~~~~~~~---~~a~~vm~~~~~~~l~~lkd-~~G~~l~~~---~~~~~-~~~~l~G~pV~~~~~~~~~~~~~ 335 (394) ++.+.++...+-... ..-.++++|..+..|.+-.| .-.++.+.. .+..+ .-.++.|++|+.+.+.+...++- T Consensus 80 ~dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~~d~~~~n~d~~~s~g~~~~g~~i~~v~G~~V~~SnnlP~~~gt~ 159 (221) T protein:vir:17 80 VDGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISSVDTNILNREIGNTQGDMNTGKGLYVNAGIRIYKSNVLASLYGTN 159 (221) T ss_pred HHHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHhcCcceeeeecccccccccccceeeeecCcEEEEeccCCcccccc Confidence 244444433332222 23467789998877654222 111222211 12222 24579999999987766544443 Q ss_pred EEEeccccEEE-EeecceEEEEeecccccceEEEEEEeccEEecccceEEEEecCccC--CC Q lcl|Aclame:pro 336 FIGDFKRGVLF-ADRKDLGLRWADNEIYGQYLQAVLRFGVSKVDDKAGYYVTFTPEPL--PL 394 (394) Q Consensus 336 ~~gd~~~~~~~-~~~~~~~i~~~~~~~~~~~~r~~~r~d~~v~~~~af~~l~~~~~~~--~~ 394 (394) +..+-..+..- .+.... ...|.+.. +.+.||+|...+++-..|+ |+ T Consensus 160 ~~~~ag~~~~~~~~~~~y------r~~fs~~~-------glv~~~~Avgtvkl~~~~~~~~~ 208 (221) T protein:vir:17 160 LVTDPGDATTSGENNGSY------RPAITDRA-------GLVFHKEAADTVEVLLPPSRPPL 208 (221) T ss_pred cccCCccccccccccccc------cccccceE-------EEEEcchheeeeeeecCCCCCce Confidence 33222211100 000111 11122212 7788999998888877773 44 No 184 >protein:vir:98856 Length: 343 # NCBI annotation: hypothetical protein # Family: family:all:201 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654732;genbank:gi:109302917;genbank:GeneID:4156061 Probab=94.50 E-value=0.0042 Score=33.60 Aligned_cols=274 Identities=11% Similarity=0.020 Sum_probs=121.7 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccc----ccCCccccchhHHhHHHHHHHhhhhh Q lcl|Aclame:pro 83 EEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIK----KENAKPVSSEEILYTPAREVKTVVDL 158 (394) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~lvP~~~~~~I~~~~~~~~~l 158 (394) ........++.++ .......++. ..+....|.+.....+...+.+.+.+ T Consensus 1 M~~~tr~~~~~y~---------------------------~~~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~F 53 (343) T protein:vir:98 1 MNKTAQELFYSLI---------------------------GDAAEYYGANPALALAGKQFSIEAPKESVLLGAIQQRSNF 53 (343) T ss_pred CChHHHHHHHHHH---------------------------HHHHHHhCCccchhccCceeeecHHHHHHHHHHHHHHHHH Confidence 0001111111111 0011111221 22334567778889999999999999 Q ss_pred hheeeeEeecCCceeEEEEecCCCccccc--ccccc--ccccc--ccccceeeecHhhhhhhhhhhHHHHh-cc-HHH-H Q lcl|Aclame:pro 159 KPFTTVYQAKKASGKYPVLQRATTKMVTV--AELEK--NPALA--KPDFKDVAWNIDTYRGAIPLSQESID-DA-DVD-L 229 (394) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~e~~~--~~~~~--~~~~~~v~~~~~~~~~~~~vs~ell~-ds-~~~-l 229 (394) ++.++++++..-.+.+-.... ++..+.- +.+.. ....+ .....++.+ .. .++.+.|. ++ .+| | T Consensus 54 L~~INvv~V~q~~g~v~~~~~-sg~~t~r~~t~~~~~~~~~~~~~~Y~c~qTn~-----dt--~i~Y~~lD~WA~~~deF 125 (343) T protein:vir:98 54 LEKINCVFSERYQRAIDLRSN-RKRHYGAHDRRTPIQQRWTRQVMSMNVSRQIQ-----AC--LIPWAKLDQWGHLKDKF 125 (343) T ss_pred hhcCceecchhhcceEEEeec-CccccCccccCCCccccccCCCCccEEEEeee-----ee--eccHHHHHHhhcChhHH Confidence 999999999766555543222 2111111 10100 10001 111111111 12 22222222 21 245 7 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhccccc-----------------------------------------cccccccHHHHH Q lcl|Aclame:pro 230 VGIVSESISQIKVNTTNDAIAKVLKSF-----------------------------------------TTKTVKNLDEIK 268 (394) Q Consensus 230 ~~~i~~~l~~~~~~~~~~a~~~g~~~~-----------------------------------------~~~~~~~~~~i~ 268 (394) ...+++.+.+.++.-.-...++|...+ ......+.|.++ T Consensus 126 ~~r~~~~i~~~~ALD~i~IGfNGts~A~~T~nPllqDVN~GWLQ~~Re~ap~rVm~~~~~~~~~~~~G~ggdy~NLDalV 205 (343) T protein:vir:98 126 ASLYAEFVQNQIALDMIKIGFYGTSVGTDTSDPNLADVNKGWIQFVRENKATQILTQGATSGEIRLFGEGADYVNLDELA 205 (343) T ss_pred HHHHHHHHHHHHhhccceecccceeeccCCCCcchhhcchHHHHHHHhcchhhhhccceeccceeEecCCCCcccHHHHH Confidence 777777777766532211112221111 001123445543 Q ss_pred HHHHhhhhhhc-c---cEEEEcHHHHHHH-HhhhccCCceeeccccc---CCCcccccccceEEecCcccccCceEEEec Q lcl|Aclame:pro 269 ALLNGGFDPAY-N---VSLIVSQSFYQTL-DTLKDGNGRYLLQDDIT---AVSGKVLLGKPVFVLSDEVLGANKAFIGDF 340 (394) Q Consensus 269 ~~~~~~~~~~~-~---a~~vm~~~~~~~l-~~lkd~~G~~l~~~~~~---~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~ 340 (394) .-+..++++.+ + -+.++.+..++.- ..|-+..+++--. .+. -....++-|+|.+..|.. |.+.+++=-| T Consensus 206 ~D~~~~I~~~~~~d~dLVvivG~dLla~~~~~l~n~~~~~ptE-k~Aa~~~~~~k~iGGl~a~~~PfF--P~~~llVT~L 282 (343) T protein:vir:98 206 YDLKQGLDARHRDAGDLVFLVGADLVAKEASLVYKGNGLIATE-KAALNTHDLMKSFGGMPAMIVPNM--PPRAAIVTSL 282 (343) T ss_pred HHHHhcCchHHhcCCCEEEEEchhhhhhhhhhhhhhcCCChHH-HHHHHHHHHHHhhCCCeeEEcccc--CCCceEEeec Confidence 22333566543 2 3677777765432 1222233332110 000 002247889999998844 6666676555 Q ss_pred cccEEEEeecceEEEEeecccccceEEEEEE-eccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 341 KRGVLFADRKDLGLRWADNEIYGQYLQAVLR-FGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 341 ~~~~~~~~~~~~~i~~~~~~~~~~~~r~~~r-~d~~v~~~~af~~l~~~~~~~~~ 394 (394) +..-+.+-+....=...+.+...+.--.+.| -|..|-+..+++.+.....+-|. T Consensus 283 ~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~Ne~YvVEd~~~~a~iE~i~v~~~~ 337 (343) T protein:vir:98 283 SNLSIYTQEGSMRRGMKDDDDKKAVRDSYYRNEAYAVEDCGKFMAVDFTKVKLSS 337 (343) T ss_pred cccEEEEecCcEEEEEEeccccccccchhhhcceeeeeccccEEEeeeeeeeecC Confidence 5433333333322222233222221111222 35567777888888777776666 No 185 >protein:vir:174 Length: 423 # NCBI annotation: capsid protein # Family: family:all:1412 # MgeID: mge:5 # MgeName: HK620 # Cross-refs: genbank:acc:NP_112079;genbank:gi:13559869;genbank:GeneID:920999 Probab=93.60 E-value=0.007 Score=32.40 Aligned_cols=260 Identities=10% Similarity=-0.090 Sum_probs=106.8 Q ss_pred ccCC-ccccchhHHhHHHHHHHhhhhhhheeeeEee-------cCCceeEEEEecCCCcc-cccccccccccccccccce Q lcl|Aclame:pro 133 KENA-KPVSSEEILYTPAREVKTVVDLKPFTTVYQA-------KKASGKYPVLQRATTKM-VTVAELEKNPALAKPDFKD 203 (394) Q Consensus 133 ~~~~-~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~-------~~~~~~~~~~~~~~~~~-~~~~e~~~~~~~~~~~~~~ 203 (394) ..+. ...+|+.++.+.++.++....+.++++.-.- .+.++++++........ ..+...+..++.-...--. T Consensus 1 MaN~llT~ip~iia~~al~~l~~~lV~~~lVnr~y~~e~~~~k~GDTV~I~~p~~~~~~~~~~~~~~~~~~~~l~e~~v~ 80 (423) T protein:vir:17 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQFSSLRTPTGDISGQNKNNLISGKAT 80 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCCcchhhcccCCEEEEeeCCcceeecccCcccCCcccCccccceeE Confidence 1111 1247999999999998888887777654221 12245555432211100 1111111112111112235 Q ss_pred eeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc------ccccc-cccHHHHHHHHHhhhh Q lcl|Aclame:pro 204 VAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKS------FTTKT-VKNLDEIKALLNGGFD 276 (394) Q Consensus 204 v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~------~~~~~-~~~~~~i~~~~~~~~~ 276 (394) ++++-++...+--=+.|+..+ ..+++.+++.. .++++...|..+...... +++.+ ...|+++.++-..+-. T Consensus 81 l~id~~k~va~~v~d~E~~~~-i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~a~~~~gt~~t~~~a~~~i~~a~~~Ld~ 158 (423) T protein:vir:17 81 GRVGNYITVAVEYQQLEEAIK-LNQLEEILAPV-RQRIVTDLETELAHFMMNNGALSLGSPNTPITKWSDVAQTASFLKD 158 (423) T ss_pred EEeeceeeeeeeecHHHHhcC-hhHHHHHHHHH-HHHHHHHHHHHHHHHHhhccccccccCCcccccHHHHHHHHHHHHh Confidence 777777766655444555533 44676666555 455666666654433111 11111 1247777765444333 Q ss_pred ---hhcccEEEEcHHHHHHHHhhhc--cCCceeecccccCCC-cccccccceEEecCcccccCce-----EE--Eec-cc Q lcl|Aclame:pro 277 ---PAYNVSLIVSQSFYQTLDTLKD--GNGRYLLQDDITAVS-GKVLLGKPVFVLSDEVLGANKA-----FI--GDF-KR 342 (394) Q Consensus 277 ---~~~~a~~vm~~~~~~~l~~lkd--~~G~~l~~~~~~~~~-~~~l~G~pV~~~~~~~~~~~~~-----~~--gd~-~~ 342 (394) |..+-..|++|..+..|..-.. ...+-.-...+-++. .+++.|+.|+.+.+.+...... .. +-. .. T Consensus 159 ~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdvy~Snnip~~T~gt~~~t~~~~~~~~v~~ 238 (423) T protein:vir:17 159 LGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTVTY 238 (423) T ss_pred ccCCcCCCEEEeChHHHHHHhccccceecccccchHHHhhccceeeecceEEEEeCCCccccccceeceeeecccccccc Confidence 2234567899999887753110 010111112233333 3689999988765433211100 00 000 00 Q ss_pred cEEEE---eecceEEEEeeccc-------cc-ceEEEEEEeccEEe-------------------cccceEEEEecCccC Q lcl|Aclame:pro 343 GVLFA---DRKDLGLRWADNEI-------YG-QYLQAVLRFGVSKV-------------------DDKAGYYVTFTPEPL 392 (394) Q Consensus 343 ~~~~~---~~~~~~i~~~~~~~-------~~-~~~r~~~r~d~~v~-------------------~~~af~~l~~~~~~~ 392 (394) +.... ...++...+..... +. -++....+....|+ +..+-..|++.|++- T Consensus 239 ~a~~~~~~~~~~~~~~~~~~~g~l~~GD~~t~aGv~~v~~~tk~v~~~~~t~~~~~~~v~~~~~~~a~~~~tv~i~p~~i 318 (423) T protein:vir:17 239 NAVKDSYQFTVTLTGATTSVTGFLKAGDQVKFTNTYWLQQQTKQALYNGATPISFTATVTADANSDSSGDVTVTLSGVPI 318 (423) T ss_pred cccccccceeeeeeeeeeeccCceeecceEEecceeeecccccccccccccccceEEEEEecccccccCceEEEecCccc Confidence 00000 00000000100000 00 00111111111100 111112355555443 Q ss_pred CC Q lcl|Aclame:pro 393 PL 394 (394) Q Consensus 393 ~~ 394 (394) |. T Consensus 319 ~~ 320 (423) T protein:vir:17 319 YD 320 (423) T ss_pred cc Confidence 33 No 186 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=93.45 E-value=0.0075 Score=32.23 Aligned_cols=246 Identities=9% Similarity=-0.033 Sum_probs=110.4 Q ss_pred ccccCCccccchhHHhHHHHHHHhhhhhhheeeeEe-----ecCCceeEEEEecCCCcccccccccccccccccccce-- Q lcl|Aclame:pro 131 IKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQ-----AKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKD-- 203 (394) Q Consensus 131 ~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~-- 203 (394) ........+-|+-|+.++++.++...++.++|+.-. -.+.++.+|+..... ...+.... ..+..=.. T Consensus 1 m~~~~N~~ltp~iia~~~l~~l~~~lV~~~lv~r~y~~e~~~~GDTV~I~vp~~~~-----v~dg~~~~-~~~~te~~v~ 74 (418) T protein:vir:10 1 MAVQDNNLLTDDVIAKEALRLLKNNLVMAKCVYRNYEKTFGKVGDTIRLKLPYRVK-----SASGRTLV-KQPMVDQTIP 74 (418) T ss_pred CCccccccccHHHHHHHHHHHHHHhccchhhhcCCCchHHhhCCCEEEEeeCCcee-----ecccCCcc-ccccccceEE Confidence 222223446699999999999999988877765421 112356666533211 11111111 11222233 Q ss_pred eeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc----c--ccccccHHHHHHHHHhhhhh Q lcl|Aclame:pro 204 VAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSF----T--TKTVKNLDEIKALLNGGFDP 277 (394) Q Consensus 204 v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~----~--~~~~~~~~~i~~~~~~~~~~ 277 (394) ++++-++... +.++.+-...+..++...+.+....+++...|..+....... + ......++++.++-..+-.. T Consensus 75 l~id~~k~~~-~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~~~a~~~~gt~gt~~~~~~~i~~a~~~Ld~~ 153 (418) T protein:vir:10 75 FKIAYQEHVG-LEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALTLKKAFHSSGTPGVRPGAFIDFANAGAKQTTY 153 (418) T ss_pred EEEecccccc-eeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccCCcCcchHHHHHHHHHHHHhc Confidence 5554444444 445543333344567666666677777777776654332111 1 11223478888765444333 Q ss_pred hc----ccEEEEcHHHHHHHHhhhccCCceeecc-----cccCCCcccccccceEEecCcccccC------ceEEEeccc Q lcl|Aclame:pro 278 AY----NVSLIVSQSFYQTLDTLKDGNGRYLLQD-----DITAVSGKVLLGKPVFVLSDEVLGAN------KAFIGDFKR 342 (394) Q Consensus 278 ~~----~a~~vm~~~~~~~l~~lkd~~G~~l~~~-----~~~~~~~~~l~G~pV~~~~~~~~~~~------~~~~gd~~~ 342 (394) .. +-..|++|..+..|.+ +.. .++.. .+.++.-+++.|+.|+.+.+.+.... ..+.|-... T Consensus 154 ~VP~~G~R~lVv~P~~~~~L~~--~~~--~~~~~~~~~~~lr~G~IG~i~GF~V~~S~nip~~tag~~~~t~~v~ga~~~ 229 (418) T protein:vir:10 154 AVPQDGMRHAVLDPFTCASLSD--EVT--KLFKESMVEQAYKMGYRGNVAAYEVYESQNLPKHTVGDHGGTPLVNGTVVN 229 (418) T ss_pred CCCCCCceEEEeCHHHHHHHhh--hcc--ccccccccchhhheeeeeeeeceEEEEecCCCcccccccccceeeeccccc Confidence 22 2356799998876643 221 12222 23455667899999988765432111 122222111 Q ss_pred cEEEEeecceEEEEe-ecc------ccc------------------ceEEEEEEeccEEecccceEEEEecCccC----- Q lcl|Aclame:pro 343 GVLFADRKDLGLRWA-DNE------IYG------------------QYLQAVLRFGVSKVDDKAGYYVTFTPEPL----- 392 (394) Q Consensus 343 ~~~~~~~~~~~i~~~-~~~------~~~------------------~~~r~~~r~d~~v~~~~af~~l~~~~~~~----- 392 (394) +..+. +...+. ... .|. +-+++.. |. .....+-..|++.|+.- T Consensus 230 ~~~~~----~~~~t~s~~g~l~~Gd~~ti~gv~~v~~~t~~~~~~~~~f~V~~--~~-~~~~~~~~tv~i~p~~~~~~~~ 302 (418) T protein:vir:10 230 GDTVG----FDGGTASTTGFLKAGDVITFGGVFGVNPQNYETTGLLQEFVVLE--DV-DTDAGGAGSIKISPSLNDGTAT 302 (418) T ss_pred ceeEE----EeecceeeccceeeccEEEECceeecccccccccccceEEEEEe--ec-cccccCcceeEecccccccccc Confidence 11110 000000 000 000 0011110 00 00111122355554431 Q ss_pred --------------------CC Q lcl|Aclame:pro 393 --------------------PL 394 (394) Q Consensus 393 --------------------~~ 394 (394) |- T Consensus 303 ~~~~~~~~~~~~~~~~v~a~~a 324 (418) T protein:vir:10 303 INNENGDPVSLTAYQNVTALPA 324 (418) T ss_pred ccccccccccccCCCccccccc Confidence 11 No 187 >protein:vir:270 Length: 341 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536650;genbank:gi:17975128;genbank:GeneID:929084 Probab=93.43 E-value=0.0076 Score=32.20 Aligned_cols=276 Identities=9% Similarity=0.053 Sum_probs=122.4 Q ss_pred ccc-hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhh Q lcl|Aclame:pro 80 VTQ-EEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDL 158 (394) Q Consensus 80 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l 158 (394) ..+ ........++.++ .......++........|-+.....+...+.+.+.+ T Consensus 1 m~~~m~~~tr~~~~~y~---------------------------~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~F 53 (341) T protein:vir:27 1 MSQILTQSAREYMDNFA---------------------------QQLAKSYGVSNVAELFNVSPQLETKLRAAITESAEF 53 (341) T ss_pred CcccccHHHHHHHHHHH---------------------------HHHHHHcCcccccceEeecHHHHHHHHHHHHhhHHh Confidence 000 0000011111111 001111222233334455557889999999999999 Q ss_pred hheeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHh-cc----HHHHHHHH Q lcl|Aclame:pro 159 KPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESID-DA----DVDLVGIV 233 (394) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~-ds----~~~l~~~i 233 (394) ++.++++++..-.+...-... ++..+.-...+-.+.. +.++.....-+..---+.++.+.|. ++ .++|...+ T Consensus 54 L~~Invv~V~e~~Ge~v~lg~-~g~iagrtdt~R~~r~--~~l~~~~Y~c~qtn~dt~i~y~~lDaWA~~g~~~dF~~r~ 130 (341) T protein:vir:27 54 LKMITVTTVDQIEGQVVDVGV-SGLYTGRKAGGRFTKQ--VGVGGHKYKLAETDSCAAITWAMLCQWANQGGRDQFMKHL 130 (341) T ss_pred hhcCccccccceeeeEeeccc-ccceeeccCCCceecc--cccCCcceEEEEeeeeeeecHHHHHHHHhcCCChHHHHHH Confidence 999999999877666554322 2222222222222221 2233322222222222233333332 22 36788888 Q ss_pred HHHHHHHHHHHHHHHHhhcccccc---------------------------------------ccccccHHH-HHHHHHh Q lcl|Aclame:pro 234 SESISQIKVNTTNDAIAKVLKSFT---------------------------------------TKTVKNLDE-IKALLNG 273 (394) Q Consensus 234 ~~~l~~~~~~~~~~a~~~g~~~~~---------------------------------------~~~~~~~~~-i~~~~~~ 273 (394) ++.+.++++.-.-.-.++|...+. .....+.|. +.++++. T Consensus 131 ~~~i~~~~ALD~i~IGfnGts~A~~Td~~anPllqDVNkGWlQ~~Re~a~~rVl~~~~~~~g~~gdy~nLDAlV~D~~~~ 210 (341) T protein:vir:27 131 TEFSNQMFALDIMRIGWNGVSAEADTDPSANPLGQDVNEGWIAFVKNRKASQVVDVDVYFDETNGDYRTLDAMASDIINN 210 (341) T ss_pred HHHHHHHHhhhhhhhcccceeeccCCChhhcccccccchhHHHHHHhhcccceeccceeeccCCCccccHHHHHHHHHhc Confidence 888888876543333333332111 111234555 4566667 Q ss_pred hhhhhc-c---cEEEEcHHHHHH--HHhhhccCC--ceeecccccCCCcccccccceEEecCcccccCceEEEeccccEE Q lcl|Aclame:pro 274 GFDPAY-N---VSLIVSQSFYQT--LDTLKDGNG--RYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVL 345 (394) Q Consensus 274 ~~~~~~-~---a~~vm~~~~~~~--l~~lkd~~G--~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~ 345 (394) ++++.+ + -+.++.+..++. +..+...+. .-+....+ ..+|-|+|.+..|. .|.+.+++=-|+..-+ T Consensus 211 lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~~ptE~~Aa~~i----~k~iGGlpa~~~Pf--fP~~~~lVT~L~NLsI 284 (341) T protein:vir:27 211 QIHPMFRNDPRLTVFVGSGLIGAAQAKLYDKADKPSEQIAAQKL----DKTIAGRPAYVPPF--LPDNAMVVTIPENLQV 284 (341) T ss_pred ccChHHhcCCCEEEEEchhhhhhhhhhhhccCCCCHHHHHHHHH----HHhhCCCeEEEccc--cCCCceEEeeccceEE Confidence 777754 2 377787776552 222221111 00000001 24899999999884 4666666655554332 Q ss_pred EEeecceEEEEeec------ccccceEEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 346 FADRKDLGLRWADN------EIYGQYLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 346 ~~~~~~~~i~~~~~------~~~~~~~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) .+-+....=.+.+. +.|..+++++- ......-.|..++..+.+--- T Consensus 285 Y~Q~gs~RR~~~d~p~r~rie~yes~YvVEd---yg~~~~~~~~~vkl~~~~~~~ 336 (341) T protein:vir:27 285 LTQHGTAQRKAKHESDRKRSKTHTGAWKVTQ---WVCWKRSPLTTQKKSTSALNH 336 (341) T ss_pred EEecCcEEEEEEeccccccccchhhhheeeh---hhhhhhccccccccCcccccc Confidence 23222222122222 22222222221 111222223333333332111 No 188 >protein:vir:1153 Length: 338 # NCBI annotation: predicted major capsid protein # Family: family:all:201 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490602;genbank:gi:17313222;genbank:GeneID:927319 Probab=93.37 E-value=0.0078 Score=32.14 Aligned_cols=278 Identities=9% Similarity=0.032 Sum_probs=122.8 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhhee Q lcl|Aclame:pro 83 EEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFT 162 (394) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~ 162 (394) ........++.+ ........++........|.+.....+...+.+.+.+++.+ T Consensus 1 M~~~tr~~~~~y---------------------------~~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~I 53 (338) T protein:vir:11 1 MRNETRKQFDAY---------------------------LAQLAKLNGVNSAVQTFAVEPSVQQKLEQRIQESSEFLKQI 53 (338) T ss_pred CCHHHHHHHHHH---------------------------HHHHHHHhCCCcccceeeeCHHHHHHHHHHHHHHHHhhccC Confidence 000000111111 11111122334444556677788999999999999999999 Q ss_pred eeEeecCCceeEEEEecCCCcccccc-c-ccccccccccccceeeecHhhhhhhhhhhHHHHh-c-cHHHHHHHHHHHHH Q lcl|Aclame:pro 163 TVYQAKKASGKYPVLQRATTKMVTVA-E-LEKNPALAKPDFKDVAWNIDTYRGAIPLSQESID-D-ADVDLVGIVSESIS 238 (394) Q Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~~-e-~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~-d-s~~~l~~~i~~~l~ 238 (394) ++++|..-.+...-...++.-++-.. . .++..+..-..++.....-+..---+.++.+.|. + ..++|...+++.+. T Consensus 54 nvv~V~e~~Ge~v~lg~~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~ 133 (338) T protein:vir:11 54 NVYGVDELQGEKIGIGVSGTIASRTDTTGDGVRKPRDVSALDNQRYECKHTDFDTAITYAMLDAWAKFPEFQALLRDAIL 133 (338) T ss_pred ceecccceeeeEeeeccCccccccccCCCCCccccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHH Confidence 99999877666544322221111111 0 0111100101222222222222222233333332 1 23478888888888 Q ss_pred HHHHHHHHHHHhhcccc---------------------------------------------ccccccccHHHH-HHHHH Q lcl|Aclame:pro 239 QIKVNTTNDAIAKVLKS---------------------------------------------FTTKTVKNLDEI-KALLN 272 (394) Q Consensus 239 ~~~~~~~~~a~~~g~~~---------------------------------------------~~~~~~~~~~~i-~~~~~ 272 (394) ++++.-.-.-.++|... ++.....+.|.+ .++++ T Consensus 134 k~~ALD~i~IGfnG~s~A~~Td~~~nPllqDVNkGWlQ~~Re~ap~rv~~~~~~~~~i~i~~g~~gdy~nLDalV~d~~~ 213 (338) T protein:vir:11 134 KRQALDRLMIGFNGTSAAATTNRAANPLLQDVNIGWFQQYRNNAPARVLKEGKTTGKVVVGNGADADYKNLDALVFDVVS 213 (338) T ss_pred HHHhhchhhhcccceeeccCCChhhCcCccccchhHHHHHHhhhhhhhhhcccccceeeecCCCCCccccHHHHHHHHHh Confidence 77654322222222111 111123345554 35666 Q ss_pred hhhhhhc-c---cEEEEcHHHHHH--HHhhhccCCceeecccccC---CCcccccccceEEecCcccccCceEEEecccc Q lcl|Aclame:pro 273 GGFDPAY-N---VSLIVSQSFYQT--LDTLKDGNGRYLLQDDITA---VSGKVLLGKPVFVLSDEVLGANKAFIGDFKRG 343 (394) Q Consensus 273 ~~~~~~~-~---a~~vm~~~~~~~--l~~lkd~~G~~l~~~~~~~---~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~ 343 (394) .++++.+ + -+.++.+..++. ...+...+ .|- ..+.. ....+|-|+|.+..|. .|.+.+++=-|+.. T Consensus 214 ~lI~~~~~~d~dLVvivG~dLladk~~~l~n~~~-~pt--E~~Aa~~~~s~k~iGGlpa~~~Pf--fP~~~~lVT~L~NL 288 (338) T protein:vir:11 214 SLIDPWHRRDPGLVVILGRELVHDKYFPMVNKDQ-PAT--EKIATDLILSQKRMGGLPPVEVPY--VPEKGLMVTTLKNL 288 (338) T ss_pred ccCChHHhcCCCEEEEEchhhhHHHHhHHHhcCC-ChH--HHHHHHHHHHhhhhCCceeEEccc--cCCCceEEeecccc Confidence 6777754 2 377788776542 22332211 111 00001 1134899999999884 46666666555543 Q ss_pred EEEEeecceEEEEeecccccceEEEEEE-eccEEecccceEEEEecCccC Q lcl|Aclame:pro 344 VLFADRKDLGLRWADNEIYGQYLQAVLR-FGVSKVDDKAGYYVTFTPEPL 392 (394) Q Consensus 344 ~~~~~~~~~~i~~~~~~~~~~~~r~~~r-~d~~v~~~~af~~l~~~~~~~ 392 (394) -+.+-+....=...+.+...+.--.+.| -|..|-++.+++.+....-+= T Consensus 289 sIY~Q~gs~RR~~~d~p~r~rie~y~s~Ne~YvVEd~~~~a~ieni~~~~ 338 (338) T protein:vir:11 289 SLYWQIGGRRRYLKEVPEKNRIENYESSNDAYVVEDYGLGCLVENIEVAE 338 (338) T ss_pred EEEEecCcEEEEEEeccccccccchhhhccceeeeccccEEEeecceecC Confidence 3333222222222222222111011111 133444444444444322222 No 189 >protein:vir:100331 Length: 342 # NCBI annotation: major capsid protein N # Family: family:all:201 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655472;genbank:gi:109289940;genbank:GeneID:4157374 Probab=92.47 E-value=0.011 Score=31.26 Aligned_cols=279 Identities=9% Similarity=0.034 Sum_probs=126.8 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccc----ccCCccccchhHHhHHHHHHHhhhhh Q lcl|Aclame:pro 83 EEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIK----KENAKPVSSEEILYTPAREVKTVVDL 158 (394) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~lvP~~~~~~I~~~~~~~~~l 158 (394) ........++.++ .......++. .......|-+.....+...+.+.+.+ T Consensus 1 M~~~tr~~~~~y~---------------------------~~~A~~ngv~~~~~~~~~~FsV~P~v~q~L~~~i~ess~F 53 (342) T protein:vir:10 1 MKDLTLEKYNAYL---------------------------ARQAELNNLPFNALATGIKFTVQPSVQQKLYEKVRESSDF 53 (342) T ss_pred CChHHHHHHHHHH---------------------------HHHHHHhCCChhHccccceeecChHHHHHHHHHHHHHHHH Confidence 0001111111111 0111112222 22223456667888999999999999 Q ss_pred hheeeeEeecCCceeEEEEecCCCccccccc--cc-ccccccccccceeeecHhhhhhhhhhhHHHHh-c-cHHHHHHHH Q lcl|Aclame:pro 159 KPFTTVYQAKKASGKYPVLQRATTKMVTVAE--LE-KNPALAKPDFKDVAWNIDTYRGAIPLSQESID-D-ADVDLVGIV 233 (394) Q Consensus 159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e--~~-~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~-d-s~~~l~~~i 233 (394) +..++++++..-.+...-...+ +..+.-+. +. ...+.....++.....-++.---+.++.+.|. + ..++|...+ T Consensus 54 L~~INvv~V~e~~Ge~i~lg~~-g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~ 132 (342) T protein:vir:10 54 LKSISFVFVDEQTGETLGLDSA-HTVASTTDTSGDGERKTTSIAKLVKQTYHCQQINFDTHINYKQLDMWAKFPDFQQKV 132 (342) T ss_pred hccCcccccccceeeEEecccC-cccccccccCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHH Confidence 9999999998777665543222 22222111 11 11111112223222222222222233333332 1 234688888 Q ss_pred HHHHHHHHHHHHHHHHhhccccc-------------------------------------------cccccccHHHH-HH Q lcl|Aclame:pro 234 SESISQIKVNTTNDAIAKVLKSF-------------------------------------------TTKTVKNLDEI-KA 269 (394) Q Consensus 234 ~~~l~~~~~~~~~~a~~~g~~~~-------------------------------------------~~~~~~~~~~i-~~ 269 (394) ++.+.+.++.-.-...++|...+ ......+.|.+ .+ T Consensus 133 ~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rv~~~~~~~~~i~iG~~gdy~NLDalV~D 212 (342) T protein:vir:10 133 ANVAAKQRKRDLIMIGFNGTSRAATSDRNSNPLLQDVAKGWLQKMREDAKERVMNGESTDNQVLVGKGQEYANLDALVMD 212 (342) T ss_pred HHHHHHHHhhccceecccceeeccCCChhhCcCccccchHHHHHHHhhhhhhhcccceeccceeecCCCCcccHHHHHHH Confidence 88887776542222112221110 11122345554 35 Q ss_pred HHHhhhhhhc-c---cEEEEcHHHHHH--HHhhhccCCceeecccccC---CCcccccccceEEecCcccccCceEEEec Q lcl|Aclame:pro 270 LLNGGFDPAY-N---VSLIVSQSFYQT--LDTLKDGNGRYLLQDDITA---VSGKVLLGKPVFVLSDEVLGANKAFIGDF 340 (394) Q Consensus 270 ~~~~~~~~~~-~---a~~vm~~~~~~~--l~~lkd~~G~~l~~~~~~~---~~~~~l~G~pV~~~~~~~~~~~~~~~gd~ 340 (394) +++.++++.+ + -+.++.+..++. +..+..++ .|- ..+.. ....++-|+|.+..|. .|.+.+++=-| T Consensus 213 ~~~~lI~~~~~~d~dLVvivG~dLladk~~~l~n~~~-~pt--E~~Aa~~i~s~k~iGGl~a~~~Pf--FP~~~ilVT~L 287 (342) T protein:vir:10 213 ATEELIDEWHRDDTDLVVITGRKLLADKYFPIVNQQN-APT--EELAADIVISQKRIGGLKAVRVPF--FPANAILITKL 287 (342) T ss_pred HHhccCChHHhcCCCEEEEEchhhhHHHHHHHHhcCC-ChH--HHHHHHHHHhhhhhcCceeEEccc--cCCCceEEeec Confidence 6667777754 2 477787777652 22332221 110 00000 1124788999999884 46666666555 Q ss_pred cccEEEEeecceEEEEeecccccceEEEEEE-eccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 341 KRGVLFADRKDLGLRWADNEIYGQYLQAVLR-FGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 341 ~~~~~~~~~~~~~i~~~~~~~~~~~~r~~~r-~d~~v~~~~af~~l~~~~~~~~~ 394 (394) +..-+.+-+....=...+.+...+.--.+.| -|..|-+..+++.+....-+=|- T Consensus 288 ~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~Ne~YvVEd~~~~a~iE~i~i~~~~ 342 (342) T protein:vir:10 288 ENLAIYVQEGTTRKHIENVPKKDRIETYESENIDYVVEDYGCAALIENITLKDKE 342 (342) T ss_pred cccEEEEecCcEEEEEEeccccccccchhhhccceeeeccccEEEeecceecCCC Confidence 5433333222222222222222211111122 24566667777777766555555 No 190 >protein:vir:79157 Length: 339 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165257;genbank:gi:145708082;genbank:GeneID:5247168 Probab=92.21 E-value=0.012 Score=31.05 Aligned_cols=276 Identities=10% Similarity=0.081 Sum_probs=120.2 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhhee Q lcl|Aclame:pro 83 EEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFT 162 (394) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~ 162 (394) ........++.+. .......++........|-+.....+...+.+.+.+++.+ T Consensus 1 M~~~tr~~~~~y~---------------------------~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~I 53 (339) T protein:vir:79 1 MRNDTRRLFAAYK---------------------------AAIAKLNGVERVDEKFSVAPSVQQKLETKVQESSDFLKSI 53 (339) T ss_pred CChHHHHHHHHHH---------------------------HHHHHHhCcccccceeeecHHHHHHHHHHHHHHHHHhccC Confidence 0001111111111 1111122233344455666788889999999999999999 Q ss_pred eeEeecCCceeEEEEecCCCcccccc--cccccccccccccceeeecHhhhhhhhhhhHHHHh-c-cHHHHHHHHHHHHH Q lcl|Aclame:pro 163 TVYQAKKASGKYPVLQRATTKMVTVA--ELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESID-D-ADVDLVGIVSESIS 238 (394) Q Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~~--e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~-d-s~~~l~~~i~~~l~ 238 (394) +++++..-.+...-...++ ..+.-+ .++...+..-..++.....-++.---+.++.+.|. + ..++|...+++.+. T Consensus 54 Nvv~V~e~~Ge~v~lg~~g-~iagrtdt~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~ 132 (339) T protein:vir:79 54 NFYGVPEQEGEKIGLGVSG-PVASTTDTTQQDRETSDISTMDGRRYRCEQTNSDTHITYQKLDAWAKFADFQTRIRDAII 132 (339) T ss_pred cccccccceeeEEeeccCc-ceeecccCCCCCcccccccccCCCccEEEEeeeeceecHHHHHHHhcChhHHHHHHHHHH Confidence 9999987766654332222 211111 11111100111222222222222112223333332 1 23468888888777 Q ss_pred HHHHHHHHHHHhhccc---------------------------------------c------ccccccccHHHH-HHHHH Q lcl|Aclame:pro 239 QIKVNTTNDAIAKVLK---------------------------------------S------FTTKTVKNLDEI-KALLN 272 (394) Q Consensus 239 ~~~~~~~~~a~~~g~~---------------------------------------~------~~~~~~~~~~~i-~~~~~ 272 (394) +.++.-.-...++|.. + +......+.|.+ .++++ T Consensus 133 ~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rV~~~g~~~s~~i~~~G~ggdy~NLDalV~d~~~ 212 (339) T protein:vir:79 133 KRQALDRIMIGFNGVSRAATSDRVANPMLQDVNKGWLQNLREQAPQRVMKEGKAAAGKITVGGAGADYGNLDALVYDITN 212 (339) T ss_pred HHHhhccceecccceeeecCCChhhCcCccccchhHHHHHHhhhhhhhhccceeccceeEeccCCCCcccHHHHHHHHHh Confidence 7665321111111111 1 111113345554 45666 Q ss_pred hhhhhhc-c---cEEEEcHHHHHH--HHhhhccCCceeecccccC---CCcccccccceEEecCcccccCceEEEecccc Q lcl|Aclame:pro 273 GGFDPAY-N---VSLIVSQSFYQT--LDTLKDGNGRYLLQDDITA---VSGKVLLGKPVFVLSDEVLGANKAFIGDFKRG 343 (394) Q Consensus 273 ~~~~~~~-~---a~~vm~~~~~~~--l~~lkd~~G~~l~~~~~~~---~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~ 343 (394) .++++.+ + -+.++.+..++. ...+..++ .|- ..+.. ....++-|+|.+..|. .|.+.+++=-|+.. T Consensus 213 ~lId~~~~~d~dLVvivG~dLla~k~~~l~n~~~-~pt--E~~Aa~~i~s~k~iGGl~a~~~Pf--FP~~~llVT~L~NL 287 (339) T protein:vir:79 213 HLVEPWYAEDPDLVVVCGRNLLSDKYFPLVNRDR-DPV--QQIAADLIISQKRIGNLPAIRVPY--FPANGLLVTRLDNL 287 (339) T ss_pred ccCChHHhcCCCEEEEEchhhhhhHhhhHhhcCC-ChH--HHHHHHHHHHhhhhCCceeEEccc--cCCCceEEeechhc Confidence 6777754 2 367777777652 33332221 210 00001 1124788999999884 46666676555543 Q ss_pred EEEEeecceEEEEeecccccceEEEEEE-eccEEecccceEEEE---ecCcc Q lcl|Aclame:pro 344 VLFADRKDLGLRWADNEIYGQYLQAVLR-FGVSKVDDKAGYYVT---FTPEP 391 (394) Q Consensus 344 ~~~~~~~~~~i~~~~~~~~~~~~r~~~r-~d~~v~~~~af~~l~---~~~~~ 391 (394) -+.+-+....=...+.+...+.--.+.| -|..|-+..+++.+. +..++ T Consensus 288 sIY~Q~gs~RR~~~d~p~r~rie~y~s~Ne~YvVEd~~~~a~iEni~~~~aa 339 (339) T protein:vir:79 288 SIYYQEGGRRRTILDNAKRDRIENYESSNDAYVIEDLACAAMAENIALAAAA 339 (339) T ss_pred EEEEecCcEEEEEEeccccccccchhhccceeeeeccccEEEeeeeecccCC Confidence 3333222222222222222111111111 133444555554443 33333 No 191 >protein:vir:2016 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046760;genbank:gi:9630331;genbank:GeneID:1261541 Probab=92.21 E-value=0.012 Score=31.04 Aligned_cols=280 Identities=13% Similarity=0.064 Sum_probs=120.7 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccc--ccCCccccchhHHhHHHHHHHhhhhhhh Q lcl|Aclame:pro 83 EEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIK--KENAKPVSSEEILYTPAREVKTVVDLKP 160 (394) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~lvP~~~~~~I~~~~~~~~~l~~ 160 (394) ........++.++. ......++. .......|-+.....+...+.+.+.++. T Consensus 1 M~~~tr~~~~~y~~---------------------------~~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~ 53 (357) T protein:vir:20 1 MRQETRFKFNAYLS---------------------------RVAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLT 53 (357) T ss_pred CChHHHHHHHHHHH---------------------------HHHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhc Confidence 00011111111110 011111221 1122445666778899999999999999 Q ss_pred eeeeEeecCCceeEEEEecCCCccccccc-c-cccccccccccceeeecHhhhhhhhhhhHHHHh-cc-HHHHHHHHHHH Q lcl|Aclame:pro 161 FTTVYQAKKASGKYPVLQRATTKMVTVAE-L-EKNPALAKPDFKDVAWNIDTYRGAIPLSQESID-DA-DVDLVGIVSES 236 (394) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e-~-~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~-ds-~~~l~~~i~~~ 236 (394) +++++++..-.+...-...++.-++.... . ....+..-..++.....-+..---+.++.+.|. ++ .++|...+++. T Consensus 54 ~INvv~V~e~~Ge~i~lg~~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~ 133 (357) T protein:vir:20 54 RINIVPVSEMKGEKIGIGVTGSIASTTDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRIRNA 133 (357) T ss_pred cCCccccccceeeEEecccCccccccccCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHH Confidence 99999998777665543222221111111 1 111000101222222222222112223333332 11 24688888888 Q ss_pred HHHHHHHHHHHHHhhccccc-------------------------------------------------cccccccHHHH Q lcl|Aclame:pro 237 ISQIKVNTTNDAIAKVLKSF-------------------------------------------------TTKTVKNLDEI 267 (394) Q Consensus 237 l~~~~~~~~~~a~~~g~~~~-------------------------------------------------~~~~~~~~~~i 267 (394) +.+.++.-.-...++|...+ ......+.|.+ T Consensus 134 i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDal 213 (357) T protein:vir:20 134 IIKRQSLDFIMAGFNGVKRAETSDRSSNPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGRTTSEVIRVGKGGDYASLDAL 213 (357) T ss_pred HHHHHhhccceecccceeeeccCChhhCcCccccchhHHHHHHhhchhhhhccccccccccccceeeecCCCCcccHHHH Confidence 87776542211111111100 01122345554 Q ss_pred -HHHHHhhhhhhc-c---cEEEEcHHHHHH--HHhhhccCCceeecccccCC---CcccccccceEEecCcccccCceEE Q lcl|Aclame:pro 268 -KALLNGGFDPAY-N---VSLIVSQSFYQT--LDTLKDGNGRYLLQDDITAV---SGKVLLGKPVFVLSDEVLGANKAFI 337 (394) Q Consensus 268 -~~~~~~~~~~~~-~---a~~vm~~~~~~~--l~~lkd~~G~~l~~~~~~~~---~~~~l~G~pV~~~~~~~~~~~~~~~ 337 (394) .++++.++++.+ + -+.++.+..++. +..+. ..+.|- ..+... ...+|-|+|.+..|. .+.+.+++ T Consensus 214 V~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n-~~~~pt--E~~Aa~~i~s~k~iGGl~a~~~Pf--FP~~~ilV 288 (357) T protein:vir:20 214 VMDATNNLIEPWYQEDPDLVVIVGRQLLADKYFPIVN-KEQDNS--EMLAADVIISQKRIGNLPAVRVPY--FPADAMLI 288 (357) T ss_pred HHHHHhccCChHHhcCCCEEEEEchhhhhhhhhhHhh-ccCChH--HHHHHHHHHHhhhhCCceeEEccc--cCCCceEE Confidence 356667777754 2 477777777653 23332 222211 001110 124788999999884 46666666 Q ss_pred EeccccEEEEeecceEEEEeecccccceEEEEEE-eccEEecccceEEEE---ecCccCCC Q lcl|Aclame:pro 338 GDFKRGVLFADRKDLGLRWADNEIYGQYLQAVLR-FGVSKVDDKAGYYVT---FTPEPLPL 394 (394) Q Consensus 338 gd~~~~~~~~~~~~~~i~~~~~~~~~~~~r~~~r-~d~~v~~~~af~~l~---~~~~~~~~ 394 (394) =-|+..-+.+-+....=...+.+...+.--.+.| -|..|-+..+++.+. +..++.|. T Consensus 289 T~L~NLsIY~Q~gs~RR~~~d~p~r~riE~y~s~Ne~YvVEd~~~~a~iE~i~~~~~~~p~ 349 (357) T protein:vir:20 289 TKLENLSIYYMDDSHRRVIEENPKLDRVENYESMNIDYVVEDYAAGCLVEKIKVGDFSTPA 349 (357) T ss_pred eeccccEEEEecCcEEEEEEeccccccccchhhhcceeeeeccccEEEeeeeeeccccCCc Confidence 5555433333222222222222222111111111 233444555554444 44444444 No 192 >protein:vir:94989 Length: 349 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224029;genbank:gi:62327316;genbank:GeneID:5176817 Probab=92.20 E-value=0.012 Score=31.03 Aligned_cols=252 Identities=8% Similarity=-0.018 Sum_probs=111.3 Q ss_pred cccccCCccccch--hHHhHHHHHHHhhhhhhheeeeE----------eecCCceeEEEEecCCCc--cccccccc--cc Q lcl|Aclame:pro 130 GIKKENAKPVSSE--EILYTPAREVKTVVDLKPFTTVY----------QAKKASGKYPVLQRATTK--MVTVAELE--KN 193 (394) Q Consensus 130 ~~~~~~~~~lvP~--~~~~~I~~~~~~~~~l~~~~~~~----------~~~~~~~~~~~~~~~~~~--~~~~~e~~--~~ 193 (394) ..++--.-..+|+ .+.+.+.+...+.+.|.+ +..+ ..++..+++|+...-.+. ...+.... .. T Consensus 1 Ma~T~l~D~iipe~~vf~~Yv~~~~~e~~~l~q-SGii~~d~~l~~~~~~gG~~~~iPf~~~l~g~~e~n~~~dt~~~~~ 79 (349) T protein:vir:94 1 MAITTIGNIVTGNIPVLASYMTEDPVEKTAFFN-SGILTPTPYAAEIARGPSNIANLPFWKAIDTSIEPNYSNDVYQDIA 79 (349) T ss_pred CCceEEeeeeccChHHHHHHHHHhHHHhhhhhh-ccceeccHHHHHHHhcCCCEEEeeeeecCCCCcccccCCCCccccc Confidence 1122223346776 366666655555444443 2222 233556788887653332 22222211 11 Q ss_pred cccccccccee--eecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHH---hhcc-c---cc-------- Q lcl|Aclame:pro 194 PALAKPDFKDV--AWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAI---AKVL-K---SF-------- 256 (394) Q Consensus 194 ~~~~~~~~~~v--~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~---~~g~-~---~~-------- 256 (394) ....-.+..++ .+-.-+--....++.++--+ |..+.|.++++.-..+.....+ +.|. + ++ T Consensus 80 t~~kit~~~~~a~~~~r~kaw~~~Dla~~lsG~---dpm~~Ia~~va~yW~r~~q~~Lia~L~Gvf~~~~~~~~~~~~~~ 156 (349) T protein:vir:94 80 TPRAIQTGEMMARVAYLNEGFGQADLTVELTSQ---NPLQSVASRLDNFWQRQAQRRLIATALGLYNDNVSATDAYHEQN 156 (349) T ss_pred ccccccccceeeeeeeeccccchhHHHHHhhCc---hHHHHHHHHHHHHHhhHHHHHHHHHHHhhhcccccccccccccC Confidence 11111122222 22222222333445444321 3455555555544333222211 1111 0 00 Q ss_pred ------cccccccHHHHHHHHHhhhhhhc------ccEEEEcHHHHHHHHhhhccCCceeecccccCCCcccccccceEE Q lcl|Aclame:pro 257 ------TTKTVKNLDEIKALLNGGFDPAY------NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFV 324 (394) Q Consensus 257 ------~~~~~~~~~~i~~~~~~~~~~~~------~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~ 324 (394) ...+..+...++++...+-+.+. =+.++||+.++..|.+++-=. | +++.-....-++++|++|++ T Consensus 157 ~~~~d~~~~a~~~~~~~~~A~~~~Gdaa~Gd~~~~lt~i~mHS~v~~~L~~~~li~--~-i~~s~~~~~i~ty~G~~Viv 233 (349) T protein:vir:94 157 DMVVDVSATSGFDAGAFIDATQTMGDALMGNGGEVLGAIAMHSFVYAQARKAQLID--F-IRDAENNTMFATYQGYRVIV 233 (349) T ss_pred ceeEEecccCCCChhhHHHHHHHHHHHhccccccceeEEEEchHHHHHHHhcchhh--h-ccCcccCcccceecCcEEEE Confidence 01122345666666655444432 157999999999987653210 1 11111111236899999999 Q ss_pred ecCccccc-------CceEEEeccccEEEEeec--ceEEEEeecccc-----cceEEEEEEeccEEecccceEEEEecCc Q lcl|Aclame:pro 325 LSDEVLGA-------NKAFIGDFKRGVLFADRK--DLGLRWADNEIY-----GQYLQAVLRFGVSKVDDKAGYYVTFTPE 390 (394) Q Consensus 325 ~~~~~~~~-------~~~~~gd~~~~~~~~~~~--~~~i~~~~~~~~-----~~~~r~~~r~d~~v~~~~af~~l~~~~~ 390 (394) .|+++... .+.+||. +.+.+... ...+++.+++.. +..+.... ..++||..|....-..+ T Consensus 234 DD~~Pv~~~g~~~~yttylfg~---GAi~~~~~~~~~~~E~~rd~~~g~~~G~d~L~~R~---~~~~hp~G~s~~~a~v~ 307 (349) T protein:vir:94 234 DDSMTVVGQDTSRKFISIIFGQ---GAIGYGEGNPEMPLEYEREASRANGGGVETLWTRK---TWLLHPFGYSFTSAVIT 307 (349) T ss_pred eCCCccccCCCCceEEEEEeec---ceEEeecCCCCcceeeecccccCCcceeEEEEEee---EEEeeeeeeeecccccC Confidence 88776522 1234542 32222222 223455544432 12333333 24578888777652211 Q ss_pred --------cCCC Q lcl|Aclame:pro 391 --------PLPL 394 (394) Q Consensus 391 --------~~~~ 394 (394) .+|- T Consensus 308 ~~~~~~~~~sPt 319 (349) T protein:vir:94 308 GNGTETIARSAS 319 (349) T ss_pred CCccccccCCCC Confidence 2344 No 193 >protein:vir:1829 Length: 355 # NCBI annotation: major capsid protein # Family: family:all:201 # MgeID: mge:324 # MgeName: 186 # Cross-refs: genbank:acc:NP_052253;genbank:gi:9634060;genbank:GeneID:1262428 Probab=92.11 E-value=0.013 Score=30.96 Aligned_cols=279 Identities=13% Similarity=0.090 Sum_probs=120.5 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccc--ccCCccccchhHHhHHHHHHHhhhhhhh Q lcl|Aclame:pro 83 EEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIK--KENAKPVSSEEILYTPAREVKTVVDLKP 160 (394) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~lvP~~~~~~I~~~~~~~~~l~~ 160 (394) ........++.++.. .....++. .......|-+.....+.+.+.+.+.+++ T Consensus 1 M~~~tr~~~~~y~~~---------------------------~A~~ngv~~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~ 53 (355) T protein:vir:18 1 MRQETRFKFNAYLTQ---------------------------LAKLNGISVDDVSKKFTVEPSVTQTLMNTVQASSAFLQ 53 (355) T ss_pred CChHHHHHHHHHHHH---------------------------HHHHhCCChhHccceeccCHHHHHHHHHHHHHHHHHhh Confidence 001111111111110 00111221 1223445566778899999999999999 Q ss_pred eeeeEeecCCceeEEEEecCCCcccccccc-c-ccccccccccceeeecHhhhhhhhhhhHHHHh-cc-HHHHHHHHHHH Q lcl|Aclame:pro 161 FTTVYQAKKASGKYPVLQRATTKMVTVAEL-E-KNPALAKPDFKDVAWNIDTYRGAIPLSQESID-DA-DVDLVGIVSES 236 (394) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~-~-~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~-ds-~~~l~~~i~~~ 236 (394) .+++++|..-.+...-...++.-++-.... + .........++.....-+..---+.++.+.|. ++ .++|...+++. T Consensus 54 ~INvv~V~e~~Ge~i~lgv~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~ 133 (355) T protein:vir:18 54 MINILPVAEMKGEKIGVGVTGTIASTTDTSGDKERQTADFTALESNKYECNQINFDFHLTYKRLDLWARFQDFQRRIRDA 133 (355) T ss_pred cCceeccccceeeEEeeccCcceeeccccCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHH Confidence 999999987776655433222222111111 0 10001111222222222222222233333332 11 24788888888 Q ss_pred HHHHHHHHHHHHHhhccccc-------------------------------------------------cccccccHHHH Q lcl|Aclame:pro 237 ISQIKVNTTNDAIAKVLKSF-------------------------------------------------TTKTVKNLDEI 267 (394) Q Consensus 237 l~~~~~~~~~~a~~~g~~~~-------------------------------------------------~~~~~~~~~~i 267 (394) +.++++.-.-...++|...+ ......+.|.+ T Consensus 134 i~k~~ALD~i~IGfNG~s~A~~Td~~~nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~~~~~~i~~G~~gdy~NLDAl 213 (355) T protein:vir:18 134 IVQRQALDFIMAGFNGTTRADTSDRVKNPMLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENLDAL 213 (355) T ss_pred HHHHHhhchhhhcccceeeeccCChhhCcCccccchhHHHHHHhcchhhhhccccccccccccceeeecCCCCcccHHHH Confidence 88777643222222222111 00112334554 Q ss_pred -HHHHHhhhhhhc-c---cEEEEcHHHHHH--HHhhhccCCceeecccccCC---CcccccccceEEecCcccccCceEE Q lcl|Aclame:pro 268 -KALLNGGFDPAY-N---VSLIVSQSFYQT--LDTLKDGNGRYLLQDDITAV---SGKVLLGKPVFVLSDEVLGANKAFI 337 (394) Q Consensus 268 -~~~~~~~~~~~~-~---a~~vm~~~~~~~--l~~lkd~~G~~l~~~~~~~~---~~~~l~G~pV~~~~~~~~~~~~~~~ 337 (394) .++++.++++.+ + -+.++.+..++. ...+.. .+.|- ..+... ...+|-|+|.+..|. .+.+.+++ T Consensus 214 V~d~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~-~~~pt--E~~Aa~~i~s~k~iGGlpa~~~Pf--fP~~~~lV 288 (355) T protein:vir:18 214 VMDGTNTLIDEIYQDDPKLVAIVGRKLLADKYFPLVNK-QQENT--ESLAADIIISQKRIGNLPAVRVPY--FPANAVFV 288 (355) T ss_pred HHHHHhccCChHHhcCCCEEEEEchhhhHHHHhHHhhc-cCChH--HHHHHHHHHHHHhhCCceeEEccc--cCCCceEE Confidence 356666667654 3 377788776542 333332 22221 001111 124899999999884 46666666 Q ss_pred EeccccEEEEeecceEEEEeecccccc-------e-EEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 338 GDFKRGVLFADRKDLGLRWADNEIYGQ-------Y-LQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 338 gd~~~~~~~~~~~~~~i~~~~~~~~~~-------~-~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) =-|+..-+.+-+....=...+.+...+ + -.+...++....-. .+...+..++++|- T Consensus 289 T~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~Ne~YvVEd~~~~a~ie-ni~~~~~~~~~~~~ 352 (355) T protein:vir:18 289 TTLENLSIYFMDESHRRSIDENPKKDRVENYESMNIDYVVEAYAAGCLLE-NITLGDFTAPAAPE 352 (355) T ss_pred eeccccEEEEecCcEEEEEEeccccccccchhhhcceeeeeccccEEEEe-eeeecCCCCccccc Confidence 555543333322222222222222111 1 12222233322222 33333333333444 No 194 >protein:vir:78777 Length: 358 # NCBI annotation: putative major capsid protein # Family: family:all:201 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285647;genbank:gi:148727153;genbank:GeneID:5220125 Probab=92.02 E-value=0.013 Score=30.89 Aligned_cols=278 Identities=12% Similarity=0.121 Sum_probs=125.5 Q ss_pred ccc-hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcc--cccCCccccchhHHhHHHHHHHhhh Q lcl|Aclame:pro 80 VTQ-EEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGI--KKENAKPVSSEEILYTPAREVKTVV 156 (394) Q Consensus 80 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~lvP~~~~~~I~~~~~~~~ 156 (394) ... ........++.++. ......++ ........|.+.....+...+.+.+ T Consensus 1 m~~~M~~~tr~~~~~y~~---------------------------~~A~~ngv~~~~~~~~Fsv~p~v~q~L~~~i~ess 53 (358) T protein:vir:78 1 MSQTLTVQAEQRLNKYCD---------------------------ALAKAYGIDISKLDKQFSVTGPVETTLRSALLASV 53 (358) T ss_pred CcccccHHHHHHHHHHHH---------------------------HHHHHhCCChhHccceeeeChHHHHHHHHHHHHHH Confidence 000 00000011111110 00111122 1223345677788889999999999 Q ss_pred hhhheeeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHh-cc----HHHHHH Q lcl|Aclame:pro 157 DLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESID-DA----DVDLVG 231 (394) Q Consensus 157 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~-ds----~~~l~~ 231 (394) .++..++++++..-.+...-.. .++..+.-+........ ..++.....-+..---+.++.+.|. ++ ..+|.. T Consensus 54 ~FL~~INvv~V~e~~Ge~v~lg-~~g~iagrt~tr~~~~~--~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~f~~~~dF~~ 130 (358) T protein:vir:78 54 EFLGLITCLDVDQIKGQVVQVG-VGQLYTGRKKGGRFKGK--VGVDGNTYELTETDSCASLDWATLCTWANAGSEGEFIK 130 (358) T ss_pred HHhhcCcccccccceeeEEeec-CCcccceecCCCccccc--cccCCCccEEEEeceeeeccHHHHHHHHhCCChhHHHH Confidence 9999999999988777664332 22222222222222211 2222222222222222233443332 22 225788 Q ss_pred HHHHHHHHHHHHHHHHHHhhccc---------------------------------------------cccccccccHHH Q lcl|Aclame:pro 232 IVSESISQIKVNTTNDAIAKVLK---------------------------------------------SFTTKTVKNLDE 266 (394) Q Consensus 232 ~i~~~l~~~~~~~~~~a~~~g~~---------------------------------------------~~~~~~~~~~~~ 266 (394) .+++.+.+.++.-.-...++|.. .++.....+.|. T Consensus 131 r~~~~i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~Re~a~~~v~~~~~~~~~i~ig~g~~Gdy~NLDa 210 (358) T protein:vir:78 131 LVGEFVNKAFALDMLRVGWNGVSAADDTDPTANPLGQDVNKGWHQLAREWKGGSQIIKAAAGEKIYFDPDGKGEYKTLDE 210 (358) T ss_pred HHHHHHHHHHhhccceecccceeeccCCChhhCcCccccchHHHHHHHhhchhhhhccccccCceeecCCCCCccccHHH Confidence 88887777665422111111111 011122344565 Q ss_pred HH-HHHHhhhhhhc-c---cEEEEcHHHHHH--HHhhhccCCce---eecccccCCCcccccccceEEecCcccccCceE Q lcl|Aclame:pro 267 IK-ALLNGGFDPAY-N---VSLIVSQSFYQT--LDTLKDGNGRY---LLQDDITAVSGKVLLGKPVFVLSDEVLGANKAF 336 (394) Q Consensus 267 i~-~~~~~~~~~~~-~---a~~vm~~~~~~~--l~~lkd~~G~~---l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~ 336 (394) ++ ++++.++++.+ + -+.++.+..++. ...+. ..+.| +....+ ..+|-|+|.+..+. .+.+.++ T Consensus 211 lV~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n-~~~~pTE~~Aa~~i----~k~iGGlpa~~~Pf--FP~~~il 283 (358) T protein:vir:78 211 MASDLINTTIDPLFQQDPRLVVLVGTDLVAAAQAKLYS-EATKPSEQIAAQQL----AKSIAGRKAYIPPF--FPGKRMV 283 (358) T ss_pred HHHHHHhccCChHHhcCCCEEEEEchhhhhHHhhhHhh-cCCCcHHHHHHHHH----HHHhCCCeEEEccc--cCCCceE Confidence 44 56677777754 2 477787777653 33333 22222 111111 14788999999884 4666666 Q ss_pred EEeccccEEEEeecceEEEEeecccccceEEEEEE-eccEEecccceEEE-----EecCccCCC Q lcl|Aclame:pro 337 IGDFKRGVLFADRKDLGLRWADNEIYGQYLQAVLR-FGVSKVDDKAGYYV-----TFTPEPLPL 394 (394) Q Consensus 337 ~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~r~~~r-~d~~v~~~~af~~l-----~~~~~~~~~ 394 (394) +=-|+..-+.+-+....=...+.+...+.--.+.| -|..|-+..+++.+ ++.+.|+|. T Consensus 284 VT~L~NLsIY~Q~gs~RR~~~d~p~r~riE~y~s~Ne~YvVEd~~~~a~iE~i~v~~~~~pa~~ 347 (358) T protein:vir:78 284 VTTLDNLHCYTQRGTRKRKADDNQDSKSFDNQYWRMEGYALGEHKAYGGFEEADIEIGADPAVL 347 (358) T ss_pred EeeccccEEEEecCcEEEEEEeccccccccchhhhcceeeeeccccEEEEeeeeeeeCCCCCcc Confidence 65555433333222222222222222111111111 23344555555443 344555555 No 195 >protein:vir:104011 Length: 337 # NCBI annotation: P2 family phage major capsid protein # Family: family:all:201 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293748;genbank:gi:72537718;genbank:GeneID:3608142 Probab=91.96 E-value=0.013 Score=30.84 Aligned_cols=278 Identities=9% Similarity=0.041 Sum_probs=122.0 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhhee Q lcl|Aclame:pro 83 EEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFT 162 (394) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~ 162 (394) ........++.+.. ......++........|-+.....+...+.+.+.+++.+ T Consensus 1 M~~~tr~~~~~y~~---------------------------~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~I 53 (337) T protein:vir:10 1 MRKETRQAYEKYAA---------------------------QIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRI 53 (337) T ss_pred CChHHHHHHHHHHH---------------------------HHHHhcChhhhcceeeecHHHHHHHHHHHHHHHHhhccC Confidence 00011111111110 011111222333344455578888999999999999999 Q ss_pred eeEeecCCceeEEEEecCCCccccccccccc--ccccccccceeeecHhhhhhhhhhhHHHHh-c-cHHHHHHHHHHHHH Q lcl|Aclame:pro 163 TVYQAKKASGKYPVLQRATTKMVTVAELEKN--PALAKPDFKDVAWNIDTYRGAIPLSQESID-D-ADVDLVGIVSESIS 238 (394) Q Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~--~~~~~~~~~~v~~~~~~~~~~~~vs~ell~-d-s~~~l~~~i~~~l~ 238 (394) +++++..-.+...-...++ ..+.-+..+.. .+.....++.....-+..---+.++.+.|. + ..++|...+++.+. T Consensus 54 nvv~V~e~~Ge~v~lg~~g-~iagrt~t~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~ 132 (337) T protein:vir:10 54 NVLPVTELEGEKLGLSVSG-PIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVIL 132 (337) T ss_pred ceeccccceeeEEeeccCc-ceeeeecCCCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHH Confidence 9999987766654432222 22211111111 101111222222222222222233443332 1 23478888888888 Q ss_pred HHHHHHHHHHHhhccccc--------------------------------------------cccccccHHH-HHHHHHh Q lcl|Aclame:pro 239 QIKVNTTNDAIAKVLKSF--------------------------------------------TTKTVKNLDE-IKALLNG 273 (394) Q Consensus 239 ~~~~~~~~~a~~~g~~~~--------------------------------------------~~~~~~~~~~-i~~~~~~ 273 (394) ++++.-.-.-.++|...+ +.....+.|. +.++++. T Consensus 133 ~~~ALD~i~IGfnG~s~A~~Td~~~nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~~ 212 (337) T protein:vir:10 133 NQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSS 212 (337) T ss_pred HHHhhchhhhcccceeeccCCChhhCcCccccchhHHHHHHhcchhhhhccccccCcceeecCCCCcccHHHHHHHHHhc Confidence 777543222222222111 0112234555 4566767 Q ss_pred hhhhhc-c---cEEEEcHHHHHH--HHhhhccCCceeecccccC---CCcccccccceEEecCcccccCceEEEeccccE Q lcl|Aclame:pro 274 GFDPAY-N---VSLIVSQSFYQT--LDTLKDGNGRYLLQDDITA---VSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGV 344 (394) Q Consensus 274 ~~~~~~-~---a~~vm~~~~~~~--l~~lkd~~G~~l~~~~~~~---~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~ 344 (394) ++++.+ + -+.++.+..++. ...+..+ ..|- ..+.. ....+|-|+|.+..|. .|.+.+++=-|+..- T Consensus 213 lI~~~~~~d~~LVvivG~dLladk~~~l~n~~-~~pt--E~~Aa~~i~s~k~iGGlpa~~~Pf--fP~~~~lVT~L~NLs 287 (337) T protein:vir:10 213 MIDPWFQEDTGLVVICGRELLHDKYFPIVNAT-QAPT--ERLAADLIVSQKRIGNLPAVRVPF--FPKRALMVTKLSNLS 287 (337) T ss_pred cCChHHhcCCCEEEEEchhhhhHHhhHHhccC-CCcH--HHHHHHHHHHhhhhCCceeEEccc--cCCCceEEeechhcE Confidence 777754 2 477777777652 2222222 1210 00000 0124889999999884 466666765555533 Q ss_pred EEEeecceEEEEeecccccceEEEEEE-eccEEecccceEEEEecCccCC Q lcl|Aclame:pro 345 LFADRKDLGLRWADNEIYGQYLQAVLR-FGVSKVDDKAGYYVTFTPEPLP 393 (394) Q Consensus 345 ~~~~~~~~~i~~~~~~~~~~~~r~~~r-~d~~v~~~~af~~l~~~~~~~~ 393 (394) +.+-+....=...+.+...+.--.+.| -|..|-+..+++.+..-.-+-. T Consensus 288 IY~Q~gs~RR~~~d~p~r~rie~y~s~Ne~YvVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:10 288 IYYQEGARRRTLKEVPERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred EEEecCcEEEEEEEccccccccchhhccceeeeeccccEEEEeceeecCC Confidence 333222222222222222111111111 2344455555555442222211 No 196 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=91.93 E-value=0.014 Score=30.82 Aligned_cols=253 Identities=11% Similarity=-0.071 Sum_probs=105.6 Q ss_pred ccCC-ccccchhHHhHHHHHHHhhhhhhheeeeEeec-------CCceeEEEEecCCCcccccccc---ccccccccccc Q lcl|Aclame:pro 133 KENA-KPVSSEEILYTPAREVKTVVDLKPFTTVYQAK-------KASGKYPVLQRATTKMVTVAEL---EKNPALAKPDF 201 (394) Q Consensus 133 ~~~~-~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~e~---~~~~~~~~~~~ 201 (394) ..+. ...+|+.++...++.++...++-++++.-.-. +.++++++... ........+ .-.++.....= T Consensus 1 MAN~llT~iP~iia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~--~~v~d~~~~~~~~~~~~~~~e~~ 78 (423) T protein:vir:35 1 MANNLESNISQIVLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQ--FKSERTETGDITGKDKNGLFSAK 78 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCc--ceeecccCcCCCCccccccccce Confidence 1111 12579999999999999988888876542211 23455555432 111111111 11111111111 Q ss_pred ceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccc-----c-ccccc-cccHHHHHHHHHhh Q lcl|Aclame:pro 202 KDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLK-----S-FTTKT-VKNLDEIKALLNGG 274 (394) Q Consensus 202 ~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~-----~-~~~~~-~~~~~~i~~~~~~~ 274 (394) -+++++-++...+--=..|..++ ..+|+.++.... .+++...+..+....- . +++.+ ...|+++.++-..+ T Consensus 79 v~l~id~~k~~a~~v~d~e~~l~-i~~~~~~l~~a~-~ala~~vd~~l~~~l~~~a~~~vgt~~t~~~~~~~i~~a~~~L 156 (423) T protein:vir:35 79 ATGKVGKYITVAVEWTQIEEALK-LNQLDQILSPIH-ERMVTDLETELAHFMMNNGALSLGSPNTAIKKWADVAQTASFI 156 (423) T ss_pred eeEEeccceeccceeCHHHHHhh-HHHHHHHHHHHH-HHHHHHHHHHHHHHHhhccccccccccCCcchHHHHHHHHHHH Confidence 33666666665544444455443 446766666554 3444444444433211 1 11111 23467777765443 Q ss_pred hh---hhcccEEEEcHHHHHHHHh----hhccCCceeecccccCCC-cccccccceEEecCccccc-----CceEE---- Q lcl|Aclame:pro 275 FD---PAYNVSLIVSQSFYQTLDT----LKDGNGRYLLQDDITAVS-GKVLLGKPVFVLSDEVLGA-----NKAFI---- 337 (394) Q Consensus 275 ~~---~~~~a~~vm~~~~~~~l~~----lkd~~G~~l~~~~~~~~~-~~~l~G~pV~~~~~~~~~~-----~~~~~---- 337 (394) -. |..+-..|++|..+..|.. +...+ -.-...+..+. .+++.|+.|+.+.+.+... +..++ T Consensus 157 d~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~--~~~~~alr~g~i~G~i~GFdv~~Snnvp~~T~gt~~~~~~v~~a~ 234 (423) T protein:vir:35 157 KDIGIKTGENYAIMDPWSAQRLADAQSGLHAAD--QLVRTAWENAQISGNFGGIRALMSNGLASRKQGDFDGAITVKTAP 234 (423) T ss_pred HHhcCCcCCCEEEeCHHHHHHHhccccceeccc--cchhHHHhhccceeeecceEEEEcCCCccccccccccceeecccc Confidence 33 2234467899999887653 11111 01112233333 3689999988865433211 11110 Q ss_pred -------EeccccEEEEeecceEEEEe------ecccccceEEE------------------EEEe-ccEEecccceEEE Q lcl|Aclame:pro 338 -------GDFKRGVLFADRKDLGLRWA------DNEIYGQYLQA------------------VLRF-GVSKVDDKAGYYV 385 (394) Q Consensus 338 -------gd~~~~~~~~~~~~~~i~~~------~~~~~~~~~r~------------------~~r~-d~~v~~~~af~~l 385 (394) .+.+..+.. ..+..+... +.-.|. ++.. ..++ .-......+-..| T Consensus 235 ~v~~~a~~~~~~~~~~--~~~~~~~~~g~l~~GD~~t~a-Gv~~v~~~t~~~~~~~~t~~~~~~~V~~~~~~~a~g~~~v 311 (423) T protein:vir:35 235 NVDYLSVKDSYQFTVA--LTGATPSKTGFLKAGDQLKFT-STHWLNQQSKQTLYNGSTAMSFTATVLEETNSTASGDVTV 311 (423) T ss_pred ccccccccccccceee--eeeeeeccCCcEEecceEEee-eeeeccccccceeecccCCceeEEEEeccccccccCceeE Confidence 111111000 000000000 000000 0000 0000 0000001111236 Q ss_pred EecCccCCC Q lcl|Aclame:pro 386 TFTPEPLPL 394 (394) Q Consensus 386 ~~~~~~~~~ 394 (394) ++.|++.|. T Consensus 312 ~i~p~~~~~ 320 (423) T protein:vir:35 312 KLSGVPIYD 320 (423) T ss_pred Ecccccccc Confidence 666655444 No 197 >protein:vir:105374 Length: 423 # NCBI annotation: gene 5 protein # Family: family:all:1412 # MgeID: mge:1556 # MgeName: Sf6 # Cross-refs: genbank:acc:NP_958181;genbank:gi:41057283;genbank:GeneID:2716621 Probab=91.51 E-value=0.016 Score=30.51 Aligned_cols=257 Identities=10% Similarity=-0.050 Sum_probs=109.3 Q ss_pred ccCC-ccccchhHHhHHHHHHHhhhhhhheeeeEe---e----cCCceeEEEEecCCCccccccc--c-ccccccccccc Q lcl|Aclame:pro 133 KENA-KPVSSEEILYTPAREVKTVVDLKPFTTVYQ---A----KKASGKYPVLQRATTKMVTVAE--L-EKNPALAKPDF 201 (394) Q Consensus 133 ~~~~-~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~---~----~~~~~~~~~~~~~~~~~~~~~e--~-~~~~~~~~~~~ 201 (394) ..+. ...+|+.++..+++.++...++.++++.-. . .+.++++++... ........ + ...++.....- T Consensus 1 MaN~llT~~p~iia~~aL~~l~~~lV~~~lVnr~y~~ef~~~k~GDTV~I~~p~~--~~~~d~~~~~~~~~~~~dl~e~~ 78 (423) T protein:vir:10 1 MPNNLDSNVSQIVLKKFLPGFMSDLVLAKTVDRQLLAGEINSSTGDSVSFKRPHQ--FSSLRTPTGDISGQNKNNLISGK 78 (423) T ss_pred CccchhhhhHHHHHHHHHHHHHhhcccchhhcccCCCcccccccCCEEEEeeCCc--eeeeccCCccccccccCccccce Confidence 1121 224799999999999999888877765422 1 123455554422 11211111 1 11122122222 Q ss_pred ceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc------ccccc-cccHHHHHHHHHhh Q lcl|Aclame:pro 202 KDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKS------FTTKT-VKNLDEIKALLNGG 274 (394) Q Consensus 202 ~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~------~~~~~-~~~~~~i~~~~~~~ 274 (394) -.++++-++...+--=+.|+..+ ..+++.+++.. .++++...|..+...... +++.+ ...|+++.++-..+ T Consensus 79 v~l~id~~k~va~~v~d~E~~~~-i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~~~~~~gt~~t~~~a~~~i~~a~~~L 156 (423) T protein:vir:10 79 ATGRVGNYITVAVEYQQLEEAIK-LNQLEEILAPV-RQRIVTDLETELAHFMMNNGALSLGSPNTPITKWSDVAQTASFL 156 (423) T ss_pred eEEEeeceeeeeeeechHHHhcC-hhhHHHHHHHH-HHHHHHHHHHHHHHHHhhccccccccCCcccchHHHHHHHHHHH Confidence 35777777766655445565543 44677666555 456666666655432111 11111 12467776654333 Q ss_pred hh---hhcccEEEEcHHHHHHHHhhhc--cCCceeecccccCCC-cccccccceEEecCcccccCc-----e-------E Q lcl|Aclame:pro 275 FD---PAYNVSLIVSQSFYQTLDTLKD--GNGRYLLQDDITAVS-GKVLLGKPVFVLSDEVLGANK-----A-------F 336 (394) Q Consensus 275 ~~---~~~~a~~vm~~~~~~~l~~lkd--~~G~~l~~~~~~~~~-~~~l~G~pV~~~~~~~~~~~~-----~-------~ 336 (394) -. |..+-..|++|..+..|.+-.. ..++-.-...+..++ .+++.|+.|+.+.+.+..... + + T Consensus 157 d~~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnip~~T~gt~~~t~~~~~~~~v 236 (423) T protein:vir:10 157 KDLGVNEGENYAVMDPWSAQRLADAQTGLHASDQLVRTAWENAQIPTNFGGIRALMSNGLASRTQGAFGGTLTVKTQPTV 236 (423) T ss_pred HhccCCcCCCEEEeChHHHHHHhccccceecccccchhhhhhccceeeecceEEEEeCCCccccccccccceeeeeccee Confidence 32 3334567899999887753110 111111122333333 368999998886543321110 0 0 Q ss_pred ----EEeccccEEEEeecce----EEEEeecccccceEEEEEEecc-------------------EEecccceEEEEecC Q lcl|Aclame:pro 337 ----IGDFKRGVLFADRKDL----GLRWADNEIYGQYLQAVLRFGV-------------------SKVDDKAGYYVTFTP 389 (394) Q Consensus 337 ----~gd~~~~~~~~~~~~~----~i~~~~~~~~~~~~r~~~r~d~-------------------~v~~~~af~~l~~~~ 389 (394) ..+-++..+...+..+ .+..-+.-.| -++....+... ......+...|++.| T Consensus 237 ~~~a~~~a~~~~~~~~~~~~~~~~~l~~GD~~t~-aGv~~v~~~tk~~~~~~~t~~~~~~~v~a~~~~~~~g~~tv~i~p 315 (423) T protein:vir:10 237 TYNAVKDSYQFTVTLTGATASVTGFLKAGDQVKF-TNTYWLQQQTKQALYNGATPISFTATVTADANSDSGGDVTVTLSG 315 (423) T ss_pred ccccccccceeeeeeeeccccccCceeecceEEe-cceeeecccccccccccccCcceEEEEEeeeeeccCCceeeeccC Confidence 0000000000000000 0000000000 00011111111 111112233455555 Q ss_pred ccCCC Q lcl|Aclame:pro 390 EPLPL 394 (394) Q Consensus 390 ~~~~~ 394 (394) ++-|. T Consensus 316 ~~i~~ 320 (423) T protein:vir:10 316 VPIYD 320 (423) T ss_pred ccccc Confidence 44333 No 198 >protein:vir:79171 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111033;genbank:gi:134288740;genbank:GeneID:4960690 Probab=91.36 E-value=0.016 Score=30.39 Aligned_cols=278 Identities=9% Similarity=0.038 Sum_probs=121.5 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhhee Q lcl|Aclame:pro 83 EEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFT 162 (394) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~ 162 (394) ........++.+.. ......++........|-+.....+...+.+.+.+++.+ T Consensus 1 M~~~tr~~~~~y~~---------------------------~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~I 53 (337) T protein:vir:79 1 MRKETRQAYEKYAA---------------------------QIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRI 53 (337) T ss_pred CChHHHHHHHHHHH---------------------------HHHHhcChhhhcceeeecHHHHHHHHHHHHHHHHhhccC Confidence 00011111111110 011112222233334455578888999999999999999 Q ss_pred eeEeecCCceeEEEEecCCCccccccccccc--ccccccccceeeecHhhhhhhhhhhHHHHh-c-cHHHHHHHHHHHHH Q lcl|Aclame:pro 163 TVYQAKKASGKYPVLQRATTKMVTVAELEKN--PALAKPDFKDVAWNIDTYRGAIPLSQESID-D-ADVDLVGIVSESIS 238 (394) Q Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~--~~~~~~~~~~v~~~~~~~~~~~~vs~ell~-d-s~~~l~~~i~~~l~ 238 (394) +++++..-.+...-...++ ..+.-+..+.. .+.....++.....-+..---+.++.+.|. + ..++|...+++.+. T Consensus 54 nvv~V~e~~Ge~v~lg~~g-~iagrt~t~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~i~ 132 (337) T protein:vir:79 54 NVLPVTELEGEKLGLSVSG-PIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDAWAKFADFQQRIRDVIL 132 (337) T ss_pred ceeccccceeeEEeeccCc-ceeeeecCCCCccccccccccCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHH Confidence 9999987766654432222 22211111111 101111222222222222222233443332 1 23478888888888 Q ss_pred HHHHHHHHHHHhhccccc--------------------------------------------cccccccHHH-HHHHHHh Q lcl|Aclame:pro 239 QIKVNTTNDAIAKVLKSF--------------------------------------------TTKTVKNLDE-IKALLNG 273 (394) Q Consensus 239 ~~~~~~~~~a~~~g~~~~--------------------------------------------~~~~~~~~~~-i~~~~~~ 273 (394) ++++.-.-.-.++|...+ +.....+.|. +.++++. T Consensus 133 ~~~ALD~i~IGfnG~s~A~~Td~~~nPllqDVNkGWlQ~~Re~ap~rV~~~~~~~~~~i~iG~~gdy~nLDalV~D~~~~ 212 (337) T protein:vir:79 133 NQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLVGKAGDYENLDALVMDIVSS 212 (337) T ss_pred HHHhhchhhhcccceeeccCCChhhCcCccccchhHHHHHHhcchhhhhccccccCcceeecCCCCcccHHHHHHHHHhc Confidence 777643222222222111 1112234555 4566767 Q ss_pred hhhhhc-c---cEEEEcHHHHHH--HHhhhccCCceeecccccC---CCcccccccceEEecCcccccCceEEEeccccE Q lcl|Aclame:pro 274 GFDPAY-N---VSLIVSQSFYQT--LDTLKDGNGRYLLQDDITA---VSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGV 344 (394) Q Consensus 274 ~~~~~~-~---a~~vm~~~~~~~--l~~lkd~~G~~l~~~~~~~---~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~ 344 (394) ++++.+ + -+.++.+..++. ...+..+ ..|- ..+.. ....+|-|+|.+..|. .|.+.+++=-|+..- T Consensus 213 lI~~~~~~d~~LVvivG~dLladk~~~l~n~~-~~pt--E~~Aa~~i~s~k~iGGlpa~~~Pf--fP~~~~lVT~L~NLs 287 (337) T protein:vir:79 213 MIDPWFQEDTGLVAICGRELLHDKYFPIVNAT-QAPT--ERLAADLIVSQKRIGNLPAVRVPF--FPKRALMVTKLSNLS 287 (337) T ss_pred cCChHHhcCCCEEEEEchhhhhHHhhHHhccC-CCcH--HHHHHHHHHHhhhhCCceeEEccc--cCCCceEEeechhcE Confidence 777754 2 477777777652 2222222 1210 00000 0124889999999884 466666765555533 Q ss_pred EEEeecceEEEEeecccccceEEEEEE-eccEEecccceEEEEecCccCC Q lcl|Aclame:pro 345 LFADRKDLGLRWADNEIYGQYLQAVLR-FGVSKVDDKAGYYVTFTPEPLP 393 (394) Q Consensus 345 ~~~~~~~~~i~~~~~~~~~~~~r~~~r-~d~~v~~~~af~~l~~~~~~~~ 393 (394) +.+-+....=...+.+...+.--.+.| -|..|-+..+++.+..-.-+-. T Consensus 288 IY~Q~gs~RR~~~d~p~r~rie~y~s~Ne~YvVEd~~~~a~ienI~~~~a 337 (337) T protein:vir:79 288 IYYQEGARRRTLKEVPERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred EEEecCcEEEEEEEccccccccchhhccceeeeeccccEEEEeceeecCC Confidence 333222222222222222111111111 2334455555555442222211 No 199 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=90.19 E-value=0.0096 Score=31.63 Aligned_cols=269 Identities=13% Similarity=0.024 Sum_probs=115.7 Q ss_pred ccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhh--h Q lcl|Aclame:pro 80 VTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVV--D 157 (394) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~--~ 157 (394) ...+.+ ....-...+.+..+.. ...+.+.. .....+..+++++--+.+.+.|..+..... . T Consensus 1 ~~~~~~-~~~~~~~~~~~~~e~~---------------~KS~~tg~-g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~ 63 (463) T protein:vir:99 1 MTIEKN-LSDVQQKYADQFQEDV---------------VKSFQTGY-GITPDTQIDAGALRREILDDQITMLTWTNEDLI 63 (463) T ss_pred CCcccc-cchHHHHHHhhhhHHH---------------HHHhhcCC-ccCCccccCcchhhhhhhhhhhheeeecccchh Confidence 110000 0000000111000000 01111111 001112233444544455555443322221 2 Q ss_pred hhheeeeEeecCCceeEEEEecCCC-cccccccccccccccccccceeeecHhhhhhhhhhhHH-HHhccHHHHHHHHHH Q lcl|Aclame:pro 158 LKPFTTVYQAKKASGKYPVLQRATT-KMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQE-SIDDADVDLVGIVSE 235 (394) Q Consensus 158 l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~e-ll~ds~~~l~~~i~~ 235 (394) +.+-+.+.++.+.-..+......+. +.+-+..|...+..+++.+...+..+|-++....+|.- -+.++..+......+ T Consensus 64 ~~~~i~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~ 143 (463) T protein:vir:99 64 FYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTE 143 (463) T ss_pred hhhhcCCchhhhhhhhheeeeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHH Confidence 3333455555555444444333332 33344445555568999999999999999999988863 345666677888888 Q ss_pred HHHHHHHHHHHHHHhhcccccccc---ccccHHHHHHHHHhh--hh--------------------hhcc-cEEEEcHHH Q lcl|Aclame:pro 236 SISQIKVNTTNDAIAKVLKSFTTK---TVKNLDEIKALLNGG--FD--------------------PAYN-VSLIVSQSF 289 (394) Q Consensus 236 ~l~~~~~~~~~~a~~~g~~~~~~~---~~~~~~~i~~~~~~~--~~--------------------~~~~-a~~vm~~~~ 289 (394) .-.-.++.+++.+++.|+..-.+. -...+|.+.++++.- ++ .+.+ .-++|+.-+ T Consensus 144 dai~~ia~tiE~a~FyGds~l~~~~~~~gleFDGl~~lId~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~v 223 (463) T protein:vir:99 144 DAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGV 223 (463) T ss_pred HHHHHHHHHHHHHHhhhhhccCCCcCccccchhhhhhhcCCCCeeecCCCcccHHHHhhhhhhhhcccCChhheecchHH Confidence 888889999999999998776553 234567766655210 00 0111 125556555 Q ss_pred HHHHHhhhccCCceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeecccccceEEEE Q lcl|Aclame:pro 290 YQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEIYGQYLQAV 369 (394) Q Consensus 290 ~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~r~~ 369 (394) .+.|..---..-+.+..++ .+....|+||- .+ ...+..+.+..+........+. T Consensus 224 ka~f~~~~l~~qrv~~~~N----~~~~~~G~~v~-------------------~f-~s~~G~I~L~~s~~m~~~~il~-- 277 (463) T protein:vir:99 224 HADFVNSILGRQMQLMQDN----SGNVNTGYSVN-------------------GF-YSSRGFIKLHGSTVMENELILD-- 277 (463) T ss_pred HHHHHHHhcCceEEEEcCC----CCceeeeeecc-------------------ce-eeeeeeeeeCCceecCCccccc-- Confidence 5555421111111111111 11123344331 01 1112222222222111100000 Q ss_pred EEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 370 LRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 370 ~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) --....|+||+..+++.+++|- T Consensus 278 ---~~~~~~p~ap~~~~~tatv~~~ 299 (463) T protein:vir:99 278 ---ESLQPLPNAPQPAKVTATVETK 299 (463) T ss_pred ---chhhcCCCCccCceeEEEEeec Confidence 0011234444443333333332 No 200 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=90.19 E-value=0.0096 Score=31.63 Aligned_cols=269 Identities=13% Similarity=0.024 Sum_probs=115.7 Q ss_pred ccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhh--h Q lcl|Aclame:pro 80 VTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVV--D 157 (394) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~--~ 157 (394) ...+.+ ....-...+.+..+.. ...+.+.. .....+..+++++--+.+.+.|..+..... . T Consensus 1 ~~~~~~-~~~~~~~~~~~~~e~~---------------~KS~~tg~-g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~ 63 (463) T protein:vir:95 1 MTIEKN-LSDVQQKYADQFQEDV---------------VKSFQTGY-GITPDTQIDAGALRREILDDQITMLTWTNEDLI 63 (463) T ss_pred CCcccc-cchHHHHHHhhhhHHH---------------HHHhhcCC-ccCCccccCcchhhhhhhhhhhheeeecccchh Confidence 110000 0000000111000000 01111111 001112233444544455555443322221 2 Q ss_pred hhheeeeEeecCCceeEEEEecCCC-cccccccccccccccccccceeeecHhhhhhhhhhhHH-HHhccHHHHHHHHHH Q lcl|Aclame:pro 158 LKPFTTVYQAKKASGKYPVLQRATT-KMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQE-SIDDADVDLVGIVSE 235 (394) Q Consensus 158 l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~e-ll~ds~~~l~~~i~~ 235 (394) +.+-+.+.++.+.-..+......+. +.+-+..|...+..+++.+...+..+|-++....+|.- -+.++..+......+ T Consensus 64 ~~~~i~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~ 143 (463) T protein:vir:95 64 FYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTE 143 (463) T ss_pred hhhhcCCchhhhhhhhheeeeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHH Confidence 3333455555555444444333332 33344445555568999999999999999999988863 345666677888888 Q ss_pred HHHHHHHHHHHHHHhhcccccccc---ccccHHHHHHHHHhh--hh--------------------hhcc-cEEEEcHHH Q lcl|Aclame:pro 236 SISQIKVNTTNDAIAKVLKSFTTK---TVKNLDEIKALLNGG--FD--------------------PAYN-VSLIVSQSF 289 (394) Q Consensus 236 ~l~~~~~~~~~~a~~~g~~~~~~~---~~~~~~~i~~~~~~~--~~--------------------~~~~-a~~vm~~~~ 289 (394) .-.-.++.+++.+++.|+..-.+. -...+|.+.++++.- ++ .+.+ .-++|+.-+ T Consensus 144 dai~~ia~tiE~a~FyGds~l~~~~~~~gleFDGl~~lId~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~v 223 (463) T protein:vir:95 144 DAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGV 223 (463) T ss_pred HHHHHHHHHHHHHHhhhhhccCCCcCccccchhhhhhhcCCCCeeecCCCcccHHHHhhhhhhhhcccCChhheecchHH Confidence 888889999999999998776553 234567766655210 00 0111 125556555 Q ss_pred HHHHHhhhccCCceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeecccccceEEEE Q lcl|Aclame:pro 290 YQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEIYGQYLQAV 369 (394) Q Consensus 290 ~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~r~~ 369 (394) .+.|..---..-+.+..++ .+....|+||- .+ ...+..+.+..+........+. T Consensus 224 ka~f~~~~l~~qrv~~~~N----~~~~~~G~~v~-------------------~f-~s~~G~I~L~~s~~m~~~~il~-- 277 (463) T protein:vir:95 224 HADFVNSILGRQMQLMQDN----SGNVNTGYSVN-------------------GF-YSSRGFIKLHGSTVMENELILD-- 277 (463) T ss_pred HHHHHHHhcCceEEEEcCC----CCceeeeeecc-------------------ce-eeeeeeeeeCCceecCCccccc-- Confidence 5555421111111111111 11123344331 01 1112222222222111100000 Q ss_pred EEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 370 LRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 370 ~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) --....|+||+..+++.+++|- T Consensus 278 ---~~~~~~p~ap~~~~~tatv~~~ 299 (463) T protein:vir:95 278 ---ESLQPLPNAPQPAKVTATVETK 299 (463) T ss_pred ---chhhcCCCCccCceeEEEEeec Confidence 0011234444443333333332 No 201 >protein:vir:5694 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839853;genbank:gi:30065708;genbank:GeneID:1260602 Probab=89.15 E-value=0.028 Score=29.10 Aligned_cols=280 Identities=13% Similarity=0.053 Sum_probs=118.3 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccc--ccCCccccchhHHhHHHHHHHhhhhhhh Q lcl|Aclame:pro 83 EEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIK--KENAKPVSSEEILYTPAREVKTVVDLKP 160 (394) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~lvP~~~~~~I~~~~~~~~~l~~ 160 (394) ........++.++. ......++. .......|-+.....+...+.+.+.++. T Consensus 1 M~~~tr~~~~~y~~---------------------------~~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~ 53 (357) T protein:vir:56 1 MRQETRFKFNAYLS---------------------------RVAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLT 53 (357) T ss_pred CChHHHHHHHHHHH---------------------------HHHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhc Confidence 00011111111110 011111222 1122445666778899999999999999 Q ss_pred eeeeEeecCCceeEEEEecCCCccccccc-c-cccccccccccceeeecHhhhhhhhhhhHHHHh-cc-HHHHHHHHHHH Q lcl|Aclame:pro 161 FTTVYQAKKASGKYPVLQRATTKMVTVAE-L-EKNPALAKPDFKDVAWNIDTYRGAIPLSQESID-DA-DVDLVGIVSES 236 (394) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e-~-~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~-ds-~~~l~~~i~~~ 236 (394) +++++++..-.+...-...++.-++.... . ....+..-..++.....-+..---+.++.+.|. ++ .++|...+++. T Consensus 54 ~INvv~V~e~~Ge~i~lg~~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~ 133 (357) T protein:vir:56 54 RINIVPVSEMKGEKIGIGVTGSIASTTDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNA 133 (357) T ss_pred cCCccccccceeeEEecccCccccccccCCCCCCcccccccccCCCccEEEEeeecccccHHHHHHHhcChhHHHHHHHH Confidence 99999998877665543222221111111 1 111000101222222222222112223333332 11 24688888888 Q ss_pred HHHHHHHHHHHHHhhcccc---------------------------------------c----------cccccccHHHH Q lcl|Aclame:pro 237 ISQIKVNTTNDAIAKVLKS---------------------------------------F----------TTKTVKNLDEI 267 (394) Q Consensus 237 l~~~~~~~~~~a~~~g~~~---------------------------------------~----------~~~~~~~~~~i 267 (394) +.+.++.-.-...++|... + ......+.|.+ T Consensus 134 i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDal 213 (357) T protein:vir:56 134 IIKRQSLDFIMAGFNGVKRAETSDRSSNPMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDAL 213 (357) T ss_pred HHHHHhhccceecccceeeeccCChhhCcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCcccHHHH Confidence 8777654221111111110 0 01122345554 Q ss_pred -HHHHHhhhhhhc-c---cEEEEcHHHHHH--HHhhhccCCceeecccccCC---CcccccccceEEecCcccccCceEE Q lcl|Aclame:pro 268 -KALLNGGFDPAY-N---VSLIVSQSFYQT--LDTLKDGNGRYLLQDDITAV---SGKVLLGKPVFVLSDEVLGANKAFI 337 (394) Q Consensus 268 -~~~~~~~~~~~~-~---a~~vm~~~~~~~--l~~lkd~~G~~l~~~~~~~~---~~~~l~G~pV~~~~~~~~~~~~~~~ 337 (394) .++++.++++.+ + -+.++.+..++. +..+. ..+.|- ..+... ...+|-|+|.+..+.. +.+.+++ T Consensus 214 V~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n-~~~~pT--E~~Aa~~i~s~k~iGGl~a~~~PfF--P~~~llV 288 (357) T protein:vir:56 214 VMDATNNLIEPWYQEDPDLVVIVGRQLLADKYFPIVN-KEQDNS--EMLAADVIISQKRIGNLPAVRVPYF--PADAMLI 288 (357) T ss_pred HHHHHhccCChHHhcCCCEEEEEchhhhhhhhhhHhh-ccCChH--HHHHHHHHHHhhhhCCceeEEcccc--CCCceEE Confidence 356667777754 2 477777777653 23332 222211 001110 1247889999998844 5666666 Q ss_pred EeccccEEEEeecceEEEEeecccccceEEEEEE-eccEEecccceEEEE---ecCccCCC Q lcl|Aclame:pro 338 GDFKRGVLFADRKDLGLRWADNEIYGQYLQAVLR-FGVSKVDDKAGYYVT---FTPEPLPL 394 (394) Q Consensus 338 gd~~~~~~~~~~~~~~i~~~~~~~~~~~~r~~~r-~d~~v~~~~af~~l~---~~~~~~~~ 394 (394) =-|+..-+.+-+....=...+.+...+.--.+.| -|..|-+..+++.+. +.-++.|. T Consensus 289 T~L~NLsIY~Q~gs~RR~~~d~p~r~riE~y~s~Ne~YvVEd~~~~a~iE~i~i~~~~~~~ 349 (357) T protein:vir:56 289 TKLENLSIYYMDDSHRRVIEENPKLDRVENYESMNIDYVVEDYAAGCLVEKIKVGDFSTPA 349 (357) T ss_pred eeccccEEEEecCcEEEEEEeccccccccchhhhcceeeeeccccEEEeeeeeeccCCCCc Confidence 5555433333222222222222222111001111 123344444444333 22222333 No 202 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=89.09 E-value=0.016 Score=30.37 Aligned_cols=251 Identities=10% Similarity=0.033 Sum_probs=110.9 Q ss_pred ccCCccccc--hhHHhHHHHHHHhhhh---hhheeeeEeecCCceeEEEEecCCCccc--ccccccccccccccccceee Q lcl|Aclame:pro 133 KENAKPVSS--EEILYTPAREVKTVVD---LKPFTTVYQAKKASGKYPVLQRATTKMV--TVAELEKNPALAKPDFKDVA 205 (394) Q Consensus 133 ~~~~~~lvP--~~~~~~I~~~~~~~~~---l~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~e~~~~~~~~~~~~~~v~ 205 (394) .+....++. +.+.+.|.+.....-. +.++.+..+-...++.+..... .+.+. |...++...+..+..+++-. T Consensus 1 ~~~lafl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~~d~-~G~a~~~~i~~~a~dip~vd~~~~~~~ 79 (304) T protein:vir:52 1 MSLLAYVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYGADE-HGSLDDGLITVGTSTLDQVEVGFTPTR 79 (304) T ss_pred CchHHHHHHHHHHHhhhhhccccccchhhhhccccCCCCcccceEEEeeeec-cCcccccccCCcCCccceeecccceeE Confidence 000011111 1112222221112111 2222222333333444444432 23333 55566565556667788888 Q ss_pred ecHhhhhhhhhhhHHHHhccH---HHHHHHHHHHHHHHHHHHHHHHHhhcccc--c-----------------ccc---- Q lcl|Aclame:pro 206 WNIDTYRGAIPLSQESIDDAD---VDLVGIVSESISQIKVNTTNDAIAKVLKS--F-----------------TTK---- 259 (394) Q Consensus 206 ~~~~~~~~~~~vs~ell~ds~---~~l~~~i~~~l~~~~~~~~~~a~~~g~~~--~-----------------~~~---- 259 (394) .+.+.++..+.+|.+=++-+. .+|.+-=.+...+++....+...+.|... + +.. T Consensus 80 ~~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g~~GllN~p~v~~~~~~~~~a~~~w 159 (304) T protein:vir:52 80 SYIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKDSRLTGLLNNKSVEVYAIKGAAQNTKV 159 (304) T ss_pred EEEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeeccccceEEEEeCCCcceeeecCCccCCcc Confidence 888888877777754333222 23444444444455555556555555321 0 000 Q ss_pred ccccHHHHHHHHHhhhhhhc--------ccEEEEcHHHHHHHHhhh-ccCCceeecccccCCCcccccccceEE--ec-- Q lcl|Aclame:pro 260 TVKNLDEIKALLNGGFDPAY--------NVSLIVSQSFYQTLDTLK-DGNGRYLLQDDITAVSGKVLLGKPVFV--LS-- 326 (394) Q Consensus 260 ~~~~~~~i~~~~~~~~~~~~--------~a~~vm~~~~~~~l~~lk-d~~G~~l~~~~~~~~~~~~l~G~pV~~--~~-- 326 (394) ...+.+.|++-++.++.... .-.++|.|+.+..|.... ...|.-++.- +....+ ...|.|+-+ +. T Consensus 160 ~~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp~~~~~l~~~~~~~~~~Tvl~~-l~~n~~-~~~g~~l~I~~v~~~ 237 (304) T protein:vir:52 160 QAMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDSLDLAHLALVQRANTDTTALEF-LTKHLS-AAAGRQVAIKALPSN 237 (304) T ss_pred ccCCHHHHHHHHHHHHHHHHhccCceecCceEEeCHHHHHHHhhccCCCCCchHHHH-HHHhcc-cccCCcceEEEeccc Confidence 11245556655554443321 236899999999886542 2222222210 111111 123444322 11 Q ss_pred --Ccc-cccCceEEEeccccEEEEeecceEEEEeeccc-ccceE--EEEEEecc-EEecccceEEEEe Q lcl|Aclame:pro 327 --DEV-LGANKAFIGDFKRGVLFADRKDLGLRWADNEI-YGQYL--QAVLRFGV-SKVDDKAGYYVTF 387 (394) Q Consensus 327 --~~~-~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~-~~~~~--r~~~r~d~-~v~~~~af~~l~~ 387 (394) ... .+.+..++.+-+.-++-+. ..+.+++..... ....+ -++.|++| .+.+|.+|+++++ T Consensus 238 ~~~~g~~g~~r~vvY~~d~~~~~~~-vP~p~~~l~~q~~~~~~~~vp~~~r~gGv~v~~P~a~~y~D~ 304 (304) T protein:vir:52 238 YGTRVTDGKTRAMVYVNSKEHVIFD-VPMSPTVLDAQPKGLLAFESGLRMAFGGVTFMEPDSALYVDY 304 (304) T ss_pred ccccCCCCceEEEEEecChhheEEe-cCccccccchhhcCCceEEecceeeeeeEEEEccceeeeecC Confidence 111 1223344444444333332 222223222211 11122 25678755 7888999999999 No 203 >protein:vir:95512 Length: 693 # NCBI annotation: Putative Clp protease # Family: family:all:62 # ACLAME annotation(s): go:0008236 - serine-type peptidase activity; phi:0000017 - phage prohead/capsid assembly # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293349;genbank:gi:148912770;genbank:GeneID:5228164 Probab=88.49 E-value=0.032 Score=28.78 Aligned_cols=379 Identities=9% Similarity=-0.006 Sum_probs=136.4 Q ss_pred ChHH-----------------------------HHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 1 MFEE-----------------------------KIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKA 51 (394) Q Consensus 1 ~l~e-----------------------------~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~ 51 (394) ++.. .-..++++.. +...++.+.++.+...-.....+-+.+.+.+..- T Consensus 227 ~~~~~~~~p~~~~~~PaPTPaaaaPaaP~aaap~~adirA~~~---aae~~r~aaI~a~fa~f~~~~a~l~a~~l~d~~~ 303 (693) T protein:vir:95 227 LLAPRAQTPAAPANTPAPTPASAAPAAPVAAAPTEADIRARIL---AEESGRRSAITAAFGAFSTGHAELLATCLNDMNI 303 (693) T ss_pred HHhhhcccccccccCcccCccCCCCCCCccCCCCcchhhHHHH---HHHHHHHHHHHHHHHhccCChHHHHHHHHhhcCC Confidence 0000 0001111110 0001111111111110000001111222223333 Q ss_pred HHHHHHHHHHHHHHHHhhcccccccccccc-chhhhHHHHHHHHHHH--HH-HHHHH------HHHHHHHHHHHHHH--- Q lcl|Aclame:pro 52 NLVEAENDLKLYESSVEVGGAENIGGKEVT-QEEKTYRESVNDFIRS--KG-KIVND------SLRFEGKDEVLMPI--- 118 (394) Q Consensus 52 ~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~--~~-~~~~~------~~~~~~~~~~~~~~--- 118 (394) .+++.++++-+.......+........... .......+.+...+.. .. ....+ .+....+......+ T Consensus 304 s~d~ar~~lL~~l~~~~~p~~~~~~~~~~~~~~g~~~~d~~~~al~~R~g~~~~~~~n~~~g~~L~elAr~~L~~rg~~~ 383 (693) T protein:vir:95 304 TVDQAREKLLAAIGADTQPAAALSAGAHIHAGNGNLVGDSVRASVLARIGRGERQADNAYNGMTLRELARASLVDRGIGV 383 (693) T ss_pred CHHHHHHHHHHHHhhccCCCCCcCcCccccCCchhHHHHHHHHHHHHhcCcccccCCccccCCcHHHHHHHHHHhcCCcc Confidence 344443333222211111111100000000 0000111111111111 00 00000 00000000000000 Q ss_pred --HhhhhhhhhhhcccccCCccccchhHHhHHHHH-HHhhhhhhheeeeEeecCCceeEEEEecCCCccccccccccccc Q lcl|Aclame:pro 119 --NETTPVEPQKDGIKKENAKPVSSEEILYTPARE-VKTVVDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPA 195 (394) Q Consensus 119 --~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~-~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~ 195 (394) .........+...+++....+.-......+.+. -.........++..+++.-.-.-.+.-+..+..-.+.|+++.+. T Consensus 384 ~~~~~~~~~~~a~~htTSDFp~IL~~~~nk~l~~~y~~a~~t~~~~~~~~~~~DFk~~~~~~lg~~~~L~~V~E~gEyk~ 463 (693) T protein:vir:95 384 ASLNAPQMVGLAFTHTSSDFGLILLDVANKSVLAGWEEAEETFPLWTKSGILTDFKPARRVGLGEFSSLRQVREGAEYKY 463 (693) T ss_pred CCCCHHHHHHHHHhcCcchhHHHHHHHHHHHHHHHHHhhhhHHHHHhccCCCCcccccceeecCCCCChhhcCCCCceee Confidence 000011111111233333333322222222221 11123344555544333211111222234455567788888764 Q ss_pred ccccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccc------------------- Q lcl|Aclame:pro 196 LAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSF------------------- 256 (394) Q Consensus 196 ~~~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~------------------- 256 (394) ....=..-++...+++..+.||++++-.-+.++.+-|...++++..++++..+...+..+ T Consensus 464 -~t~~e~~e~~~l~tyG~~~~iTRqaiINDDLga~~~ip~~~g~aA~~~~~~~vy~~L~~Np~m~DGk~LFhadH~Nl~t 542 (693) T protein:vir:95 464 -VTLGERGEQIILATYGELFSITRQAIINDDLQMLSDIPFKLGQAAKATIGDLVYAVLTGNPAMSDGKTLFHADHSNLLT 542 (693) T ss_pred -eecCCccceeehhhcCCeeeecHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCccccCCcceeecccccccc Confidence 222223356788999999999999775446678888999999999888887654443211 Q ss_pred cccccccHHHHHHHHHhhhh------h----h---cccEEEEcHHHHHHHHhhhccCCceeecccccCCCccccccc-ce Q lcl|Aclame:pro 257 TTKTVKNLDEIKALLNGGFD------P----A---YNVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGK-PV 322 (394) Q Consensus 257 ~~~~~~~~~~i~~~~~~~~~------~----~---~~a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~-pV 322 (394) +..+..+.+.+..+...... . . +...|++.+........+-.+...+- -+...+...-+.|+ .| T Consensus 543 ga~sals~~sl~~a~~am~~qk~~~~~~~g~~L~i~P~~llvP~~le~~a~~l~~s~~~~~--a~~~~~~~NP~~~~~~v 620 (693) T protein:vir:95 543 GAASALSIDSLSKAKTQMATQKAQVEKGKGRTLNIRPGFVLTPVALEDKANQIINSESVPG--ADVNSGIVNPIRAFAQV 620 (693) T ss_pred ccccccChHHHHHHHHHHHHhhcchhccCCceeecccceEEecchHHHHHHHHhccccccc--cccccccccchhccccc Confidence 01123344444333211111 0 1 12356666666666666554432111 01111111223343 23 Q ss_pred EEecCcccccCc--eEEEeccc-----cEEEEeecceEEEEeeccccc---ceEEEEEEeccEEecccceEEEEec Q lcl|Aclame:pro 323 FVLSDEVLGANK--AFIGDFKR-----GVLFADRKDLGLRWADNEIYG---QYLQAVLRFGVSKVDDKAGYYVTFT 388 (394) Q Consensus 323 ~~~~~~~~~~~~--~~~gd~~~-----~~~~~~~~~~~i~~~~~~~~~---~~~r~~~r~d~~v~~~~af~~l~~~ 388 (394) ++.+-....+.+ .++.|... +|+-+. ++-.|+ ...+|. ..++++..||.++++-.++++=... T Consensus 621 i~~prL~~~s~~~Wyl~a~~~~dtie~~yL~G~-~~P~ie--~~~gf~~dG~~~kvr~D~G~~~iD~Rg~~kn~GA 693 (693) T protein:vir:95 621 IGEPRLDDASATAWYMAAKKGSDTIEVAYLDGV-DTPYLE--QQEGFTVDGVASKVRIDAGVAPLDFRGLQKSNGA 693 (693) T ss_pred cccceecCCCCCceEEecCCCCCeEEEEEecCC-CCCeEe--ecCCCCcceEEEEEEEeccCceeeccccccCCCC Confidence 332211001111 22222111 122221 222232 223343 3468888888999998887775555 No 204 >protein:vir:6061 Length: 357 # NCBI annotation: gpN # Family: family:all:201 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878202;genbank:gi:33438901;genbank:GeneID:1457736 Probab=88.14 E-value=0.034 Score=28.62 Aligned_cols=280 Identities=13% Similarity=0.061 Sum_probs=119.8 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccc--ccCCccccchhHHhHHHHHHHhhhhhhh Q lcl|Aclame:pro 83 EEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIK--KENAKPVSSEEILYTPAREVKTVVDLKP 160 (394) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~lvP~~~~~~I~~~~~~~~~l~~ 160 (394) ........++.++. ......++. .......|-+.....+...+.+.+.++. T Consensus 1 M~~~tr~~~~~y~~---------------------------~~A~~ngv~~~d~~~~FsV~P~v~q~L~~~i~ess~FL~ 53 (357) T protein:vir:60 1 MRQETRFKFNAYLS---------------------------RVAELNGIDAGDVSKKFTVEPSVTQTLMNTMQESSDFLT 53 (357) T ss_pred CChHHHHHHHHHHH---------------------------HHHHHhCCChHHhcceeecCHHHHHHHHHHHHHHHHHhc Confidence 00011111111110 011111222 1122445666778899999999999999 Q ss_pred eeeeEeecCCceeEEEEecCCCccccccc-c-cccccccccccceeeecHhhhhhhhhhhHHHHh-cc-HHHHHHHHHHH Q lcl|Aclame:pro 161 FTTVYQAKKASGKYPVLQRATTKMVTVAE-L-EKNPALAKPDFKDVAWNIDTYRGAIPLSQESID-DA-DVDLVGIVSES 236 (394) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e-~-~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~-ds-~~~l~~~i~~~ 236 (394) +++++++..-.+...-...++.-++.... . ....+..-..++.....-+..---+.++.+.|. ++ .++|...+++. T Consensus 54 ~INvv~V~e~~Ge~i~lg~~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~ 133 (357) T protein:vir:60 54 RINIVPVSEMKGEKIGIGVTGSIASTTDTAGGTERQPKDFSKLASNKYECDQINFDFYIRYKTLDLWARYQDFQLRVRNA 133 (357) T ss_pred cCCccccccceeeEEecccCcccccccccCCCCCcccccccccCCCccEEEEeeeeccccHHHHHHHhcChhHHHHHHHH Confidence 99999998777665543222221111111 1 111000101222222222222112223333332 11 24688888888 Q ss_pred HHHHHHHHHHHHHhhcccc---------------------------------------c----------cccccccHHHH Q lcl|Aclame:pro 237 ISQIKVNTTNDAIAKVLKS---------------------------------------F----------TTKTVKNLDEI 267 (394) Q Consensus 237 l~~~~~~~~~~a~~~g~~~---------------------------------------~----------~~~~~~~~~~i 267 (394) +.+.++.-.-...++|... + ......+.|.+ T Consensus 134 i~~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVm~~~~~~~g~~~~~~i~~G~~gdy~NLDal 213 (357) T protein:vir:60 134 IIKRQSLDLIMAGFNGVRRAETSDRSSNQMLQDVAVGWLQKYRNEAPARVMSKVTDEEGHTTSEVIRVGKGGDYASLDAL 213 (357) T ss_pred HHHHHhhccceecccceeeeccCChhhCcCccccchhHHHHHHhhchhhhhccccccCCccccceeeecCCCCcccHHHH Confidence 8777654221111111110 0 01122345554 Q ss_pred -HHHHHhhhhhhc-c---cEEEEcHHHHHH--HHhhhccCCceeecccccC---CCcccccccceEEecCcccccCceEE Q lcl|Aclame:pro 268 -KALLNGGFDPAY-N---VSLIVSQSFYQT--LDTLKDGNGRYLLQDDITA---VSGKVLLGKPVFVLSDEVLGANKAFI 337 (394) Q Consensus 268 -~~~~~~~~~~~~-~---a~~vm~~~~~~~--l~~lkd~~G~~l~~~~~~~---~~~~~l~G~pV~~~~~~~~~~~~~~~ 337 (394) .++++.++++.+ + -+.++.+..++. +..+. ..+.|- ..+.. ....+|-|+|.+..|.. +.+.+++ T Consensus 214 V~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n-~~~~pT--E~~Aa~~i~s~k~iGGl~a~~~PfF--P~~~llV 288 (357) T protein:vir:60 214 VMDATNNLIEPWYQEDPDLVVIVGRQLLADKYFPIVN-REQDNS--EMLAADVIISQKRIGNLPAVRVPYF--PADAMLI 288 (357) T ss_pred HHHHHhccCChHHhcCCCEEEEEchhhhhHHhhhHhh-cCCChH--HHHHHHHHHHhhhhcCcceEEcccc--CCCceEE Confidence 356667777754 2 477787777653 33332 222210 00101 11247889999998844 5666666 Q ss_pred EeccccEEEEeecceEEEEeecccccceEEEEEE-eccEEecccceEEEE---ecCccCCC Q lcl|Aclame:pro 338 GDFKRGVLFADRKDLGLRWADNEIYGQYLQAVLR-FGVSKVDDKAGYYVT---FTPEPLPL 394 (394) Q Consensus 338 gd~~~~~~~~~~~~~~i~~~~~~~~~~~~r~~~r-~d~~v~~~~af~~l~---~~~~~~~~ 394 (394) =-|+..-+.+-+....=...+.+...+.--.+.| -|..|-+..+++.+. +..++.|. T Consensus 289 T~L~NLsIY~Q~gs~RR~~~d~p~r~riE~y~s~Ne~YvVEd~~~~a~iE~i~~~~~~~pa 349 (357) T protein:vir:60 289 TKLENLSIYYMDDSHRRVIEENPKLDRVENYESMNIDYVVEDYAAGCLVEKIKVGDFSTPA 349 (357) T ss_pred eeccccEEEEecCcEEEEEEeccccccccchhhhcceeeeeccccEEEeeeeeeccCcccc Confidence 5555433333222222222222222111011111 133444444444443 33333344 No 205 >protein:vir:98566 Length: 355 # NCBI annotation: gp5 # Family: family:all:201 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958060;genbank:gi:41057357;genbank:GeneID:2744237 Probab=87.96 E-value=0.035 Score=28.54 Aligned_cols=279 Identities=12% Similarity=0.057 Sum_probs=120.3 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccc--cCCccccchhHHhHHHHHHHhhhhhhh Q lcl|Aclame:pro 83 EEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKK--ENAKPVSSEEILYTPAREVKTVVDLKP 160 (394) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~lvP~~~~~~I~~~~~~~~~l~~ 160 (394) ........++.++. ......++.. ......|-+.....+.+.+.+.+.+++ T Consensus 1 M~~~tr~~~~~y~~---------------------------~~A~~ngv~~~~~~~~FsV~P~v~q~L~~~i~ess~FL~ 53 (355) T protein:vir:98 1 MRPETRFKFNAYLT---------------------------RVAELNNISTDDVSKKFTVEPSVTQTLMNTVQASSAFLK 53 (355) T ss_pred CChHHHHHHHHHHH---------------------------HHHHHhCCChhHccceeecCHHHHHHHHHHHHHHHHHhh Confidence 00011111111111 0111112211 222344556778899999999999999 Q ss_pred eeeeEeecCCceeEEEEecCCCcccccccc-c-ccccccccccceeeecHhhhhhhhhhhHHHHh-cc-HHHHHHHHHHH Q lcl|Aclame:pro 161 FTTVYQAKKASGKYPVLQRATTKMVTVAEL-E-KNPALAKPDFKDVAWNIDTYRGAIPLSQESID-DA-DVDLVGIVSES 236 (394) Q Consensus 161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~-~-~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~-ds-~~~l~~~i~~~ 236 (394) .+++++|..-.+...-...++.-++-.... + ...+.....++.....-+..---+.++.+.|. ++ .++|...+++. T Consensus 54 ~INvv~V~e~~Ge~i~lgv~g~iagrtdT~~~~~R~~~~~~~l~~~~Y~c~qtn~dt~i~y~~LD~WA~~~dF~~r~~~~ 133 (355) T protein:vir:98 54 TINILPVAEMKGEKIGVGVTGTIASTTDTSGDKERQTADFTALESSKYECNQINFDFHLKYKTLDLWARFQDFQRRIRDA 133 (355) T ss_pred cCceeccccceeeEeeeccCccccccccCCCCCCcccccccccCCCccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHH Confidence 999999988776655433222222211111 0 11001111222222222222222233333332 11 24788888888 Q ss_pred HHHHHHHHHHHHHhhccccc-------------------------------------------------cccccccHHHH Q lcl|Aclame:pro 237 ISQIKVNTTNDAIAKVLKSF-------------------------------------------------TTKTVKNLDEI 267 (394) Q Consensus 237 l~~~~~~~~~~a~~~g~~~~-------------------------------------------------~~~~~~~~~~i 267 (394) +.++++.-.-...++|...+ ......+.|.+ T Consensus 134 i~k~~ALD~i~IGfNG~s~A~~Td~~~nPllqDVNkGWlQ~~Re~ap~~v~~~~~~~~~~~~~~~i~~G~~gdy~NLDAl 213 (355) T protein:vir:98 134 IVKRQALDLIMAGFNGTTRADTSDRTKNTLLQDVAVGWLQKYRNEAPARVMSNITDADGKVVSAVIRVGKNGDYENIDAL 213 (355) T ss_pred HHHHHhhchhhhcccceeeeccCChhhCcCccccchhHHHHHHhcchhhhhhhhcccCccccccceeeCCCCCcccHHHH Confidence 88777543222222222110 01112334554 Q ss_pred -HHHHHhhhhhhc-c---cEEEEcHHHHHH--HHhhhccCCcee---ecccccCCCcccccccceEEecCcccccCceEE Q lcl|Aclame:pro 268 -KALLNGGFDPAY-N---VSLIVSQSFYQT--LDTLKDGNGRYL---LQDDITAVSGKVLLGKPVFVLSDEVLGANKAFI 337 (394) Q Consensus 268 -~~~~~~~~~~~~-~---a~~vm~~~~~~~--l~~lkd~~G~~l---~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~ 337 (394) .++++.++++.+ + -+.++.+..++. ...+...+ .|- ....+. ...+|-|+|.+..|.. +.+.+++ T Consensus 214 V~D~~~~lI~~~~~~d~dLVvivG~dLla~k~~~l~n~~~-~ptE~~Aa~~i~--s~k~iGGlpa~~~Pff--P~~~~lV 288 (355) T protein:vir:98 214 VMDATNNLIDEVYQDDPNLVAIVGRKLLADKYFPLVNKQQ-ENSESLAADIII--SQKRIGNLPAVRVPYF--PANAVLV 288 (355) T ss_pred HHHHHhccCChHHhcCCCEEEEEchhhhHHHhhhHhhccC-CcHHHHHHHHHH--HhhhhCCceeEEcccc--CCCceEE Confidence 356666777654 3 377788776543 33333221 210 000011 1248899999998844 6666666 Q ss_pred EeccccEEEEeecceEEEEeeccccc-------ce-EEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 338 GDFKRGVLFADRKDLGLRWADNEIYG-------QY-LQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 338 gd~~~~~~~~~~~~~~i~~~~~~~~~-------~~-~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) =-|+..-+.+-+....=...+.+... .+ -.+...++....-. .+...+..++++|- T Consensus 289 T~L~NLsIY~Q~gs~RR~~~d~p~r~rie~y~s~Ne~YvVEd~~~~a~ie-nI~~~~~~~~~~~~ 352 (355) T protein:vir:98 289 TTLENLSIYFMDESHRRSIDENPKKDRVENYESMNIDYVVEVYAAGCLLE-NITLGDFTAPAAPE 352 (355) T ss_pred eeccccEEEEecCcEEEEEEeccccccccchhhhcceeeeeccccEEEee-ceeeeCCCCCcccc Confidence 55554333332222222222222111 11 12222233322222 34444443444444 No 206 >protein:vir:78186 Length: 337 # NCBI annotation: gp2, phage major capsid protein, P2 family # Family: family:all:201 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111152;genbank:gi:134288735;genbank:GeneID:4960646 Probab=86.44 E-value=0.046 Score=27.94 Aligned_cols=278 Identities=8% Similarity=0.035 Sum_probs=120.0 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhhee Q lcl|Aclame:pro 83 EEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFT 162 (394) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~ 162 (394) ........++.+. .......++........|-+.....+...+.+.+.++..+ T Consensus 1 M~~~tr~~~~~y~---------------------------~~~A~~ngv~~~~~~FsV~P~v~q~L~~~i~ess~FL~~I 53 (337) T protein:vir:78 1 MRKETRQAYEKYA---------------------------AQIAKLNDTGDVSKKFAVEPTVQQRLETKMQESSEFLKRI 53 (337) T ss_pred CChHHHHHHHHHH---------------------------HHHHHhcChhhhcceeecChHHHHHHHHHHHHHHHHhccC Confidence 0001111111111 0111112223333445566678889999999999999999 Q ss_pred eeEeecCCceeEEEEecCCCccccccccc--ccccccccccceeeecHhhhhhhhhhhHHHHh-c-cHHHHHHHHHHHHH Q lcl|Aclame:pro 163 TVYQAKKASGKYPVLQRATTKMVTVAELE--KNPALAKPDFKDVAWNIDTYRGAIPLSQESID-D-ADVDLVGIVSESIS 238 (394) Q Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~~e~~--~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~-d-s~~~l~~~i~~~l~ 238 (394) +++++..-.+...-...+ +..+.-+..+ ...+.....++.....-++.---+.++.+.|. + ..++|...+++.+. T Consensus 54 Nvv~V~e~~Ge~v~lg~~-g~iagrtdt~~~~R~~~~~~~l~~~~Y~c~qTn~dt~i~Y~~lD~WA~~~dF~~r~~~~i~ 132 (337) T protein:vir:78 54 NVLPVTELEGEKLGLSVS-GPIASRTDTTKAARQPIDPTALDSNRYRCEKTDYDTAIPYRKLDMWAKFADFQQRIRDVIL 132 (337) T ss_pred CccccccceeeEEecccC-cceeeeecCCCcccccccccccCCCccEEEEeceecccCHHHHHHHhcChhHHHHHHHHHH Confidence 999998776655433222 2222111111 11100111122222211111111233333332 1 23468888888887 Q ss_pred HHHHHHHHHHHhhcccc--------------------------------------------ccccccccHHHH-HHHHHh Q lcl|Aclame:pro 239 QIKVNTTNDAIAKVLKS--------------------------------------------FTTKTVKNLDEI-KALLNG 273 (394) Q Consensus 239 ~~~~~~~~~a~~~g~~~--------------------------------------------~~~~~~~~~~~i-~~~~~~ 273 (394) +.++.-.-...++|... ++.....+.|.+ .++++. T Consensus 133 ~~~ALD~i~IGfNGts~A~~Td~~~nPllqDVN~GWlQ~~Re~ap~rVl~~~~~~~~~i~iG~~gdy~NLDalV~d~~~~ 212 (337) T protein:vir:78 133 NQGALDRIMIGWNGVKAAATTDRQANPLLQDVNIGWLQQYRERAAQRVLHEGAKQAGKVLIGKAGDYENLDALVMDIVSS 212 (337) T ss_pred HHHhhccceecccceeeccCCChhhCcCccccchHHHHHHHhcchhhhhccccccCCceeecCCCCcccHHHHHHHHHhc Confidence 77654221111122111 011122344554 456666 Q ss_pred hhhhhc-c---cEEEEcHHHHHH--HHhhhccCCceeecccccC---CCcccccccceEEecCcccccCceEEEeccccE Q lcl|Aclame:pro 274 GFDPAY-N---VSLIVSQSFYQT--LDTLKDGNGRYLLQDDITA---VSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGV 344 (394) Q Consensus 274 ~~~~~~-~---a~~vm~~~~~~~--l~~lkd~~G~~l~~~~~~~---~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~ 344 (394) ++++.+ + -+.++.+..++. ...+..+ +.|- ..+.. ....++-|+|.+..|. .|.+.+++=-|+..- T Consensus 213 lI~~~~~~d~dLVvivG~dLladk~~~l~n~~-~~pt--E~~Aa~~i~s~k~iGGl~a~~~Pf--FP~~~ilVT~L~NLs 287 (337) T protein:vir:78 213 MIDPWFQEDTGLVVICGRELLHDKYFPIVNAT-QAPT--ERLAADLIVSQKRIGNLPAVRVPF--FPKRALMVTKLSNLS 287 (337) T ss_pred cCChHHhcCCCEEEEEchhhhHHHHHHHHhcC-CCcH--HHHHHHHHHHhhhhcCcceEEccc--cCCCceEEeechhcE Confidence 777754 2 477787777653 2223322 1210 00000 1124788999999884 466666765555433 Q ss_pred EEEeecceEEEEeecccccceEEEEEE-eccEEecccceEEEEecCccCC Q lcl|Aclame:pro 345 LFADRKDLGLRWADNEIYGQYLQAVLR-FGVSKVDDKAGYYVTFTPEPLP 393 (394) Q Consensus 345 ~~~~~~~~~i~~~~~~~~~~~~r~~~r-~d~~v~~~~af~~l~~~~~~~~ 393 (394) +.+-+....=...+.+...+.--.+.| -|..|-+..+++.+..-.-+-. T Consensus 288 IY~Q~gs~RR~~~d~p~r~rie~y~s~Ne~YvVEd~~~~a~iEnI~~~~a 337 (337) T protein:vir:78 288 IYYQEGARRRTLKEVPERDRIENYESSNDAYVVEDFGCGCVAENIELAAA 337 (337) T ss_pred EEEecCcEEEEEEeccccccccchhhccceeeeeccccEEEEeceeecCC Confidence 333222222222222222111111111 2344455555555442222211 No 207 >protein:vir:105522 Length: 423 # NCBI annotation: phage major head protein # Family: family:all:1412 # MgeID: mge:1463 # MgeName: phiSG1 # Cross-refs: genbank:acc:YP_516191;genbank:gi:89885994;genbank:GeneID:3964382 Probab=86.00 E-value=0.049 Score=27.78 Aligned_cols=255 Identities=9% Similarity=-0.092 Sum_probs=106.9 Q ss_pred cccCCccccchhHHhHHHHHHHhhhhhhheeeeEee-------cCCceeEEEEecCCC-cccccccccccccccccccce Q lcl|Aclame:pro 132 KKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQA-------KKASGKYPVLQRATT-KMVTVAELEKNPALAKPDFKD 203 (394) Q Consensus 132 ~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~-------~~~~~~~~~~~~~~~-~~~~~~e~~~~~~~~~~~~~~ 203 (394) ....-..++|+-++.++++.++...++.++++.-.- .+.++++|++..... ......-....++.....--. T Consensus 1 MANsl~~l~p~iia~~al~~l~~~lV~~~lV~r~y~~ef~~ak~GDTV~I~~P~~~~~~d~~~~~~t~~~~~~l~e~~v~ 80 (423) T protein:vir:10 1 MANNLDANVSQIVLKKFLPGFMSDLVLCKTVDRQLLAGEINSSTGDSVSFKRPHQFKSERTMDGDITGKSKNSLISAKAT 80 (423) T ss_pred CccccccccHHHHHHHHHHHHHhhcccchhhccCCCccccccccCCEEEEeeCCceeeecccCcccCcccccccccceEE Confidence 111112389999999999999998888887665221 123555554332111 000000001111111111235 Q ss_pred eeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhcccc------cccc-ccccHHHHHHHHHhhh- Q lcl|Aclame:pro 204 VAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKS------FTTK-TVKNLDEIKALLNGGF- 275 (394) Q Consensus 204 v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~------~~~~-~~~~~~~i~~~~~~~~- 275 (394) ++++-++...+--=+.|+. .+..+++.+++.. .++++...+..+...... +++. ....|+++.++-..+. T Consensus 81 l~id~~k~~a~~v~d~E~~-l~i~~~~~~l~~A-~~aLA~~vd~~ia~~~~~~~~~~vgt~~t~~~a~~~~a~a~~~L~~ 158 (423) T protein:vir:10 81 GEVGNYITVAVEYRQIEEA-LKLNQLDQILVPI-NERMVTDLETELALFMMKHGALSLGSPNTPIKKWSDVAQTASFLKD 158 (423) T ss_pred EEecceeeeeeeeChHHHh-cChhHHHHHHHHH-HHHHHHHHHHHHHHHhhhcccccccccccccccHHHHHHHHHHHhh Confidence 6666666555444445554 3455677666554 455666666555322111 1111 1224677766533332 Q ss_pred --hhhcccEEEEcHHHHHHHHh----hhccCCceeecccccCC-CcccccccceEEecCccc---ccCceEEEeccccEE Q lcl|Aclame:pro 276 --DPAYNVSLIVSQSFYQTLDT----LKDGNGRYLLQDDITAV-SGKVLLGKPVFVLSDEVL---GANKAFIGDFKRGVL 345 (394) Q Consensus 276 --~~~~~a~~vm~~~~~~~l~~----lkd~~G~~l~~~~~~~~-~~~~l~G~pV~~~~~~~~---~~~~~~~gd~~~~~~ 345 (394) -|..+-..|++|..+..|.. +...++- -...+-.+ ..+++.|+.++.+.+.+. +.....+ -.+.++. T Consensus 159 ~~vP~~~R~~Vv~p~~~a~Ll~~~~~~~~~~~~--~~~alr~~~i~G~~~GFdi~~Sn~vp~~T~g~~~ga~-~~~~~~~ 235 (423) T protein:vir:10 159 LGINSGENYAVMDPWAAQRLADAQSGLHVSEQL--VRTAWENAQISGNFGGIRALMSNGLASRTQGAFGGKL-TVKGTPE 235 (423) T ss_pred ccCCcCCCEEEeCHHHHHHHhhhhhhhcccccc--chHHHHhcccceeecceEEEEecCCccccccccccee-eeeeeeE Confidence 23334577899999888753 2221111 11122233 336899999888654431 1111000 0000000 Q ss_pred EEeecc-------------------eEEEEeecccccceEEEEEEecc--------------------EEecccceEEEE Q lcl|Aclame:pro 346 FADRKD-------------------LGLRWADNEIYGQYLQAVLRFGV--------------------SKVDDKAGYYVT 386 (394) Q Consensus 346 ~~~~~~-------------------~~i~~~~~~~~~~~~r~~~r~d~--------------------~v~~~~af~~l~ 386 (394) + .+-. ..|..-+.-.+. ++....++.. ...-+.++. |+ T Consensus 236 v-t~a~~~~~~~~~~~~~~~T~s~~g~l~~GD~~t~a-Gv~~v~~~tk~~l~~~~~~~~~~~~V~~~~~~~a~~~~t-v~ 312 (423) T protein:vir:10 236 V-NYDSVKDSYAFTATLTGATASKKGFLKVGDQLQFD-DTHWLNQQSKQTLYNGASALSFTATVMEDANAHSSGDVT-VK 312 (423) T ss_pred E-EecccccccccccceeeccceeceeEEecceEeec-ceeeecccccceeecccCCcceEEEEEecccccccCceE-EE Confidence 0 0000 001110000000 0001111111 111223332 45 Q ss_pred ecCccCCC Q lcl|Aclame:pro 387 FTPEPLPL 394 (394) Q Consensus 387 ~~~~~~~~ 394 (394) +.|++-|. T Consensus 313 i~p~~~~~ 320 (423) T protein:vir:10 313 ISGVPIFD 320 (423) T ss_pred eccccccc Confidence 55544222 No 208 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=76.99 E-value=0.13 Score=25.45 Aligned_cols=332 Identities=12% Similarity=0.060 Sum_probs=95.7 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhch--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-----cccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALES--DDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEV-----GGAE 73 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~--e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~-----~~~~ 73 (394) =|++.+++++++++++++++++..++.+...++ +...+++.++.+++.++++++.+++..+........ .... T Consensus 3 ~l~~l~~~~~~~~~e~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~ 82 (394) T protein:vir:10 3 KLQTLFNEVSAKCADLNAQLNAKLQDENASVDDFQKIKDDLTAAKARRDAINDQIKDLEAENKANSDPDKPVDNAQPNGT 82 (394) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhhhccccc Confidence 888889999999999999988776665433221 223567778888888888888876655443322111 1111 Q ss_pred ccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHH Q lcl|Aclame:pro 74 NIGGKEVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVK 153 (394) Q Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~ 153 (394) ................................. . ......+...-...++.......++..+- T Consensus 83 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t----------------~-~~gg~~vP~~~~~~ii~~~~~~~~l~~~~ 145 (394) T protein:vir:10 83 DLKKKPIDAKKKAINDFIHSHGKVIDNAAGHVT----------------S-TEAGVLIPEEIIYDPTAEVNSVVDLSTLV 145 (394) T ss_pred chhhhHHHHHHHHHHHHHhccchhhhhhhcccc----------------c-ccCceeccHHHHHHHHHHHHhhhhhhhhc Confidence 111122222222222222221221111000000 0 00000000000000111111111121111 Q ss_pred hhhhhhheeeeEee---cCCceeEEEEecCCCcccccccccccccccccccce------eeecHhhhhhh-hhhhHHHHh Q lcl|Aclame:pro 154 TVVDLKPFTTVYQA---KKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKD------VAWNIDTYRGA-IPLSQESID 223 (394) Q Consensus 154 ~~~~l~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~------v~~~~~~~~~~-~~vs~ell~ 223 (394) ...++......+++ .++...+. ... ....+.. .+.....++.. +.++-.-+-.. +.+-.. T Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~~~~--~E~----~~~~~~~-~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~--- 215 (394) T protein:vir:10 146 TKTPVTTPKGTYPILKRATDRFSSV--AEL----AENPALA-EPEFEQVDWSVSTYRGAIPLSEEAIADSAVDLTSL--- 215 (394) T ss_pred eeeeccCCceEEEEEecCCCccccc--ccc----ccccccc-cccceeEEeeeeeeEeeehhHHHHHhhhhHHHHHH--- Confidence 11122111111221 11211111 111 1111111 11111112222 11211111110 111111 Q ss_pred ccHHHHHHHHHHHHH----HHHHHHHHHHHhhccccccccccccH-------------------HHHHHHHHhhhhhhcc Q lcl|Aclame:pro 224 DADVDLVGIVSESIS----QIKVNTTNDAIAKVLKSFTTKTVKNL-------------------DEIKALLNGGFDPAYN 280 (394) Q Consensus 224 ds~~~l~~~i~~~l~----~~~~~~~~~a~~~g~~~~~~~~~~~~-------------------~~i~~~~~~~~~~~~~ 280 (394) +...|...++ .++..........+..+. .+...+ ......+..+.+.. T Consensus 216 -----i~~~la~~~~~~~~~~il~g~g~~~~~~~~~~--~~~d~l~~~~~~~~~~~~~a~~vmn~~~~~~l~~lkd~~-- 286 (394) T protein:vir:10 216 -----VGQSINEKSVNTYNAMIAPVLQSFTAKATTTD--TLVDSLKHILNVDLDPAYSRALVVTQSLFNTLDTLKDKN-- 286 (394) T ss_pred -----HHHHHHHHHHHHHHHHHhhccccccccccccc--ccHHHHHHHHHhhhhhhccCEEEecHHHHHHHHHhhccC-- Confidence 3333333333 333333322222211111 111111 11122232222222 Q ss_pred cEEEEcHHHHHHHH-hhhc-cCCceeec-ccccCCCccccccc-ceEEecCcccccCceEEEecc-------------cc Q lcl|Aclame:pro 281 VSLIVSQSFYQTLD-TLKD-GNGRYLLQ-DDITAVSGKVLLGK-PVFVLSDEVLGANKAFIGDFK-------------RG 343 (394) Q Consensus 281 a~~vm~~~~~~~l~-~lkd-~~G~~l~~-~~~~~~~~~~l~G~-pV~~~~~~~~~~~~~~~gd~~-------------~~ 343 (394) ..+++.+....... ...+ =.|.|+.. ++... +.-.|- ++++-+ -+..++++|.. .+ T Consensus 287 G~~i~~~~~~~~~~~~~~~~L~G~PV~~~~~~~~---~~~~~~~~i~~gd----~s~~~~~~~~~~~~v~~~~~~~~~~~ 359 (394) T protein:vir:10 287 GRYLLHDASDSITDGTAKGTVLGVPVYVVGDALL---GSAAGDQKAFVGD----LKRGVLFADRQQVTLAWEDSKIYGRY 359 (394) T ss_pred CCeeeeccccccccCCcccccccceeEEeccccc---CCCCCceEEEEee----ccccEEEEeecceEEEEeccccccee Confidence 22222222111000 0000 14555532 11100 111111 122211 00111122211 11 Q ss_pred EEEEeecceEEEEeecccc---cceEEEEEEeccE Q lcl|Aclame:pro 344 VLFADRKDLGLRWADNEIY---GQYLQAVLRFGVS 375 (394) Q Consensus 344 ~~~~~~~~~~i~~~~~~~~---~~~~r~~~r~d~~ 375 (394) +..+.|-+.-+.....-.+ .......-+.+|. T Consensus 360 ~~~~~r~d~~~~~~~ai~~~~~~~~~~~~~~~~~~ 394 (394) T protein:vir:10 360 LGAAFRFGVKQADSNAGYFVTNTDAASGSTSGTGK 394 (394) T ss_pred EEEEEEeccEEeccccEEEEEeecccCCCCCCCCC Confidence 1122222221111000000 0000111122222 No 209 >protein:vir:80835 Length: 464 # NCBI annotation: putative major capsid protein # Family: family:all:2450 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504125;genbank:gi:158079312;genbank:GeneID:5666484 Probab=70.19 E-value=0.11 Score=25.79 Aligned_cols=281 Identities=14% Similarity=0.037 Sum_probs=102.8 Q ss_pred ccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhh--h Q lcl|Aclame:pro 80 VTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVV--D 157 (394) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~--~ 157 (394) .+.++ . ....+....+ .....+.+.. .....+..+++++--+.+.+.|..+..... . T Consensus 1 ~~~~~-n----~~~~~~~~~e---------------~~~Ks~ttgy-~~~p~~q~~~~AlRrEsL~~~i~~Lt~~~~~f~ 59 (464) T protein:vir:80 1 MTEKK-N----TERQLTSVQE---------------EVIKGFTTGY-GITPESQTDAAALRREFLDDQITMLTWADGDLS 59 (464) T ss_pred CCcch-h----hHhhcCcccH---------------HHHHHHHhCC-ccCcccccCcchhhhhhhhhhhheeeecccchh Confidence 00000 0 0000000000 0001111100 001112233444554555555443322222 2 Q ss_pred hhheeeeEeecCCceeEEEEecCCC-cccccccccccccccccccceeeecHhhhhhhhhhhHHH-HhccHHHHHHHHHH Q lcl|Aclame:pro 158 LKPFTTVYQAKKASGKYPVLQRATT-KMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQES-IDDADVDLVGIVSE 235 (394) Q Consensus 158 l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~el-l~ds~~~l~~~i~~ 235 (394) +..-+++.++.+.-.++......+. +.+-+..|...+..+++.+...+...+-+...-.+|-.+ |.++..+-.....+ T Consensus 60 f~~di~k~~a~STV~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~Kfl~~~r~vsia~~lvn~~~d~~~~~~~ 139 (464) T protein:vir:80 60 FYRDITKRPATSTVAKYDVYLAHGRVGHTRFTREIGVAPISDPNLRQKTVNMKYVSDTKNMSIATGLVNNIEDPMRILTD 139 (464) T ss_pred hhhhcCCchhhhhhhhhheeeccCccccccccccccccccCCCceEEEEEEeeeeecceeeeeehhhhcchhhHHHHHHH Confidence 3333455555555444443333332 333444455555679999999999988776555555432 23344444455555 Q ss_pred HHHHHHHHHHHHHHhhcccccccc----ccccHHHHHHHHHhh--hhhhcccEEEEcHHHHHHHHhh-hccCCce--eec Q lcl|Aclame:pro 236 SISQIKVNTTNDAIAKVLKSFTTK----TVKNLDEIKALLNGG--FDPAYNVSLIVSQSFYQTLDTL-KDGNGRY--LLQ 306 (394) Q Consensus 236 ~l~~~~~~~~~~a~~~g~~~~~~~----~~~~~~~i~~~~~~~--~~~~~~a~~vm~~~~~~~l~~l-kd~~G~~--l~~ 306 (394) .-.-.++.+++.+++.|+..-.+. -..-+|.+.+++..- ++. +. .. ++...+++.... .-+-|.+ +|- T Consensus 140 dai~~va~tiE~a~FyGds~l~~~~~~~~gleFDGl~~lI~~~NViDa-rG-~~-Ls~~~ln~Aa~~i~~~fGt~TD~~l 216 (464) T protein:vir:80 140 DAISVVAKTIEWASFYGDSDLSENPDAGSGLEFDGLAKLIDKHNVLDA-KG-AS-LTEALLNQASVLVGKGYGTPTDAYM 216 (464) T ss_pred HHHHHHHHHHHHHHhhhccccCCCCCCccccchhhhHhhcCCCceeec-CC-CC-cCHHHHhhhhhhhhcccCChhhccc Confidence 555567888999999998765543 234577777665221 110 10 00 233333322211 1111111 011 Q ss_pred ccccC-CCcccccccceEEecCcccccCceEEE-eccccEEEEeecceEEEEeecccccceEEEEEEeccEEecccceEE Q lcl|Aclame:pro 307 DDITA-VSGKVLLGKPVFVLSDEVLGANKAFIG-DFKRGVLFADRKDLGLRWADNEIYGQYLQAVLRFGVSKVDDKAGYY 384 (394) Q Consensus 307 ~~~~~-~~~~~l~G~pV~~~~~~~~~~~~~~~g-d~~~~~~~~~~~~~~i~~~~~~~~~~~~r~~~r~d~~v~~~~af~~ 384 (394) |.... ..-...++.-.++.. +.+... .+| |.. +++- -+..+.+..+........+. .. ....|.|++. T Consensus 217 p~~v~a~f~n~~l~~q~~~~~--~n~~~~-~~G~~v~-~f~s-a~G~i~L~~s~~m~~~~~ld-~~----~~~~~~apaa 286 (464) T protein:vir:80 217 PIGVQADFVNQQLDRQVQVIS--DNGQNA-TMGFNVK-GFNS-ARGFIRLHGSTVMELEQILD-EN----RMQLPNAPQK 286 (464) T ss_pred chhHHHHHHhhhcCceeEEEc--CCCCcc-eeeeecc-cccc-cccceeccCccccCcccccc-cc----cccCCCCcCC Confidence 11000 000111221111111 001000 011 111 1111 12233322222111111000 00 0112334333 Q ss_pred EEecCccCCC Q lcl|Aclame:pro 385 VTFTPEPLPL 394 (394) Q Consensus 385 l~~~~~~~~~ 394 (394) -+++.|++|- T Consensus 287 psvt~tv~~~ 296 (464) T protein:vir:80 287 ATVKATLEAG 296 (464) T ss_pred ceeEEEecCC Confidence 3444333333 No 210 >protein:vir:79008 Length: 299 # NCBI annotation: putative main capsid protein # Family: family:all:701 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110725;genbank:gi:134287342;genbank:GeneID:4955182 Probab=60.88 E-value=0.36 Score=22.99 Aligned_cols=255 Identities=6% Similarity=-0.014 Sum_probs=98.9 Q ss_pred ccCCccccchhHHhHHHHHHHhhhhhhheee------eEeecCCceeEEEEecCCCcccccccccccccccccccceeee Q lcl|Aclame:pro 133 KENAKPVSSEEILYTPAREVKTVVDLKPFTT------VYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAW 206 (394) Q Consensus 133 ~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~ 206 (394) .+.-. . .+.++..+.+.....+....+++ +...++.++++|.....+.....-...+.....-+.++...++ T Consensus 1 MA~~n-~-a~~~~~~Ld~~~~~~l~~~~L~~~~~~~~v~~~gg~tVkI~~i~~~gl~DY~R~~~g~~~g~~~~~~~t~~l 78 (299) T protein:vir:79 1 MAALN-Y-AKEYSNVLAQAYPYTLNFGDLYATPNNGRYRWTGSKTIEIPTISTTGRVDSNRDTIAVAQRNYDNAWEPKVL 78 (299) T ss_pred Cccch-h-HHHHHHHHHHHHHhhceeeeeccCcccceeeecCCCEEEEeccccccccccccCCCcccccccCcceeEEEe Confidence 11101 2 36677777776666544333322 1223345677886643322222112223332223456667777 Q ss_pred cHhhhhhhhhhhHHHHhccHHHH-HHHHHHHHHH-HHHHHHHHH----Hh---hccccccccccccH----HHHHHHHHh Q lcl|Aclame:pro 207 NIDTYRGAIPLSQESIDDADVDL-VGIVSESISQ-IKVNTTNDA----IA---KVLKSFTTKTVKNL----DEIKALLNG 273 (394) Q Consensus 207 ~~~~~~~~~~vs~ell~ds~~~l-~~~i~~~l~~-~~~~~~~~a----~~---~g~~~~~~~~~~~~----~~i~~~~~~ 273 (394) +-.+.-.+. |..-=...+...+ .+.+..++.+ ...-..|.- +. .+.|+.+..+..+. +.|.++... T Consensus 79 dqdr~~~f~-vD~~Dvdet~~~~~~a~v~~~~~~~~v~pEiDay~~skl~~~a~~~g~~~~~~~~T~~n~y~~i~~~~~~ 157 (299) T protein:vir:79 79 TNQRKWSTL-VHPADINQTNYVASIGNITKVYNEEQKFPEMDAYCISKIYADWTALGNTADTTVLTTTNVLEVFDKLMEK 157 (299) T ss_pred eccccceec-cchhhHHHHhhhhHHHHHHHHHHHHHhhhHhhHHHHHHHHHhhhhcCCcccccccCHHHHHHHHHHHHHH Confidence 766654432 1100000111111 1122222211 111111110 00 11122222222333 444444433 Q ss_pred hhhhhc---ccEEEEcHHHHHHHHhhhc--cCCceeecccccCCCcccccccceEEecCccccc---------------- Q lcl|Aclame:pro 274 GFDPAY---NVSLIVSQSFYQTLDTLKD--GNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGA---------------- 332 (394) Q Consensus 274 ~~~~~~---~a~~vm~~~~~~~l~~lkd--~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~---------------- 332 (394) +-.... +-.++++|..+..|..... ............++..++|.|+||+.+++.-... T Consensus 158 lde~~vP~~~rvl~vtp~~~~~L~~~~~f~k~~~~~~~~~~~~g~Vg~idG~~Ii~Vps~r~~t~~~~~~G~~~~~~ak~ 237 (299) T protein:vir:79 158 MTEARVPENGRILYVTPVVNTLIKNAKEIQRTVNIKDAGTSLNRQTTDIDTVKIIKVPSNLMKTAYDFTTGWKVGAGAKQ 237 (299) T ss_pred HHhcCCCCCCeEEEeCHHHHHHHhhchhhhcccccccccceeeeeeeeecceEEEEechhhcCccceeccCccccCcccc Confidence 332222 3467799999998875321 1122122222345556789999999866422111 Q ss_pred CceEEEeccccEEEEeecceEEEEeecccccc--eEEEEEE-eccEEecc-cceEEEEecCccC Q lcl|Aclame:pro 333 NKAFIGDFKRGVLFADRKDLGLRWADNEIYGQ--YLQAVLR-FGVSKVDD-KAGYYVTFTPEPL 392 (394) Q Consensus 333 ~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~--~~r~~~r-~d~~v~~~-~af~~l~~~~~~~ 392 (394) -.+++...+. .+...+--.++......++. .+.-+.+ .|.=|.+. ..-+++.+++|=+ T Consensus 238 in~ii~~~~a--~~~~~K~~~~~~~~P~~~~~~~~~~~~r~y~d~~v~~nk~~~i~~~~~~a~~ 299 (299) T protein:vir:79 238 IFMSLVHPSA--IITPVSYQFSKLDEPTAVTEGKYFYFEESFEDVFILNKKADAIQFVVEGAGA 299 (299) T ss_pred cceEEEcCCe--eeeeEeeeeEEeecCCCCCccceeeeeeeeeeeeeeccccCeEEEEeeecCC Confidence 0234443321 11111111222222222222 2322222 24434432 2233455555444 No 211 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=60.56 E-value=0.37 Score=22.95 Aligned_cols=349 Identities=9% Similarity=-0.015 Sum_probs=98.7 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHH----hchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNA----LESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIG 76 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~----~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~ 76 (394) =++.+++||+++++++.++++++.++.+.. ..++..++++++..++++++++++..+...+..+............ T Consensus 9 e~~~~l~el~~~~~~~~~e~r~~~e~~~~~~~~~~~~e~~~~~~~l~~ei~~l~e~~~~~~~~~~~~~~~~~~~~~~~~~ 88 (400) T protein:vir:38 9 AVKKQLDEKRSALPAMKTELRSLLEGEDSEENLKKAEGVRAKYDKAGKEIKDLEEKRDLYEAALKGNEQSSGKKPDHPEE 88 (400) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhh Confidence 235678889999988888888776655421 1223346688889999999988887766655544333222211111 Q ss_pred cccccchhhhHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccc---ccCCccccchhHHhHHHHHH Q lcl|Aclame:pro 77 GKEVTQEEKTYRESVNDFIRSKG-KIVNDSLRFEGKDEVLMPINETTPVEPQKDGIK---KENAKPVSSEEILYTPAREV 152 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~lvP~~~~~~I~~~~ 152 (394) .... .................. ..............................+.. ......++..-....++..+ T Consensus 89 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~ 167 (400) T protein:vir:38 89 HSYR-DALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNAGVKAADAASTIPETISNTPQRELQTVVDLKPF 167 (400) T ss_pred hhHH-HHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhcccccCCcccccHHHHHHHHHHHHhhhhhhhc Confidence 1110 000000000000000000 000000000000000001111111111111111 11111122222222222222 Q ss_pred HhhhhhhheeeeEeecC-CceeEEEEecCCCcccccccccccccccccccce------eeecHhhhhhhhhhhHHHHhcc Q lcl|Aclame:pro 153 KTVVDLKPFTTVYQAKK-ASGKYPVLQRATTKMVTVAELEKNPALAKPDFKD------VAWNIDTYRGAIPLSQESIDDA 225 (394) Q Consensus 153 ~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~------v~~~~~~~~~~~~vs~ell~ds 225 (394) -...++......+++.. .+........ .....+. ..+.....++.. +.++-.-+. -|.--+. T Consensus 168 ~~~~~~~~~~~~~~~~~~~~~~~~~~~E----~~~~~~~-~~~~f~~i~~~~~k~~~~~~is~ell~----ds~~~~~-- 236 (400) T protein:vir:38 168 TNVFQASTQKGTYPTVANATTKMVTVAE----LEKNPAM-AKPEFKPVNWSVETYRQALPVSQESID----DSAIDLV-- 236 (400) T ss_pred ceeEeccCcceEEEEEecCCCccccccc----ccccccc-ccccceeeEeehhheeeehhhHHHHHh----hhHHHHH-- Confidence 22222221111233221 1111111111 1111111 111122222222 122111111 1111011 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccHH-------------------HHHHHHHhhhhhhcccEEEEc Q lcl|Aclame:pro 226 DVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTKTVKNLD-------------------EIKALLNGGFDPAYNVSLIVS 286 (394) Q Consensus 226 ~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~~~~~~-------------------~i~~~~~~~~~~~~~a~~vm~ 286 (394) .+ +...+.+.+.......+-...-.+. ..+..+...+. .....+..+.+. +..++.. T Consensus 237 ~~-i~~~l~~~~~~~~~~~i~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~l~~lkd~--~G~~i~~ 312 (400) T protein:vir:38 237 GL-IAQNGQQIKVNTTNGAVATLLKGFT-AKTISSVDDLKHINNVDLDPAYSRVIIASQSFYNFLDTVKDG--NGRYLLQ 312 (400) T ss_pred HH-HHHHHHHHHHHHHHHhhhhcccccc-ccccccHHHHHHHHHhhhhhhhCcEEEEcHHHHHHHHHhhcc--CCCeeee Confidence 11 3334444444433332222111111 11111211111 122223222222 2334433 Q ss_pred HHHHHHHHhhhccCCceeeccc-ccCCCccccccc-ceEEecCcccccCceEEEeccccEEEEeecceE---EEE-eecc Q lcl|Aclame:pro 287 QSFYQTLDTLKDGNGRYLLQDD-ITAVSGKVLLGK-PVFVLSDEVLGANKAFIGDFKRGVLFADRKDLG---LRW-ADNE 360 (394) Q Consensus 287 ~~~~~~l~~lkd~~G~~l~~~~-~~~~~~~~l~G~-pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~---i~~-~~~~ 360 (394) |..-..- -..=.|.|+...+ ...+. -|- ++++-+ -+..++++|.....+-+.+.... +.. .+.. T Consensus 313 ~~~~~~~--~~~l~G~pv~~~~~~~~~~----~g~~~~~~gd----~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~d 382 (400) T protein:vir:38 313 DSILTPS--GKSVLGMPIAVVSDDTLGA----AGEAHAFLGD----IKRAILFANRADFMVRWVDDQIYGQFLQAGMRFG 382 (400) T ss_pred cCcCCCC--ccccccceeEEecccccCC----CCceEEEEEe----ccccEEEEeecceEEEEecccccceeEEEEEEec Confidence 3210000 0001455554211 10000 011 111111 00112223222111111110000 000 0000 Q ss_pred c---ccceEEEEEEeccEEecccc Q lcl|Aclame:pro 361 I---YGQYLQAVLRFGVSKVDDKA 381 (394) Q Consensus 361 ~---~~~~~r~~~r~d~~v~~~~a 381 (394) . ....++. .-+-|.| T Consensus 383 ~~~~~~~a~~~------l~~~~~a 400 (400) T protein:vir:38 383 VSVADEKAGYF------LTYTPKA 400 (400) T ss_pred cEEecccceEE------EEeecCC Confidence 0 0011111 1123344 No 212 >protein:vir:861 Length: 318 # NCBI annotation: putative minor structural protein # Family: family:all:2417 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047120;genbank:gi:9630573;genbank:GeneID:1261764 Probab=60.19 E-value=0.38 Score=22.90 Aligned_cols=282 Identities=15% Similarity=0.113 Sum_probs=116.5 Q ss_pred HHHHHHHHHHHHHHH---HHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEee Q lcl|Aclame:pro 91 VNDFIRSKGKIVNDS---LRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQA 167 (394) Q Consensus 91 ~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~ 167 (394) +..++.+++...... ....+.++.+..+..... -.|++-.......|..+...|...+-.+.++++...|-.. T Consensus 1 mtn~iesq~A~~eF~~vL~~N~G~S~~k~AW~A~L~----E~GVtiTD~~~~LP~~lv~sI~~A~~n~n~v~~vfHVT~~ 76 (318) T protein:vir:86 1 MTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLA----ENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNV 76 (318) T ss_pred CcchhhhhHHHHHHHHHHhccCCchhhhhhhhhhhh----hcCceeeccchhccHHHHHHHHHhhhccCcceeeeeeccc Confidence 112222222211111 111111222222221111 1234444555688999999999999999999886655444 Q ss_pred cCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHh---ccHHHHHHHHHHHHHHHHH-H Q lcl|Aclame:pro 168 KKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESID---DADVDLVGIVSESISQIKV-N 243 (394) Q Consensus 168 ~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~---ds~~~l~~~i~~~l~~~~~-~ 243 (394) +.--++.. -.++..+-....|..+.+ ...+|..-++.+-.++....+ -++.. .+-..|..||..+|+..|. + T Consensus 77 ~~~~V~~s--~~s~AeAq~HkdGqTK~e-qa~~~~~~Tl~~~~VY~~~S~-Ae~~K~~~~sYsel~N~i~~ELtQ~~vnk 152 (318) T protein:vir:86 77 GALLVSRS--FDSSAEAQVHKDGQTKTE-QAATLTIDTLEPVMVYKLQSL-AERVKRLQMSYSELYNLIVAELTQAIVNK 152 (318) T ss_pred hhhhhhhh--hhhhhhhhhhccCCcccc-ceeeeeeechhHHHHHHHHHH-HHHHHHhhhhHHHHHHHHHHHHHHHHHHH Confidence 33222222 123333333344444443 345666667766555544444 23333 3444568999999999988 6 Q ss_pred HHHHHHhhccccccccccccHHHH---HHHHHh----hhhhhcc---------------cEEEEcHHH-HHHHHhhhccC Q lcl|Aclame:pro 244 TTNDAIAKVLKSFTTKTVKNLDEI---KALLNG----GFDPAYN---------------VSLIVSQSF-YQTLDTLKDGN 300 (394) Q Consensus 244 ~~~~a~~~g~~~~~~~~~~~~~~i---~~~~~~----~~~~~~~---------------a~~vm~~~~-~~~l~~lkd~~ 300 (394) ..+.+..-|+|.++-...-...+| .....+ ...|+.| -.+++...+ .+-|..|+.+. T Consensus 153 ~Vd~AlV~GDG~N~f~~~DK~advK~I~k~Ttkaksagttpfanaieeavdfvrptagrrylivkaedrkalldelrqat 232 (318) T protein:vir:86 153 IVDLALVEGDGSNGFKSIDKEADVKKIKKITTKAKSAGTTPFANAIEEAVDFVRPTAGRRYLIVKAEDRKALLDELRQAT 232 (318) T ss_pred HHHhhheeecCCCCccchhhHHHHHHHHHHhhhhhccCCCchhhHHHHHHhhhccCCCceEEEEeecchHHHHHHHHhhc Confidence 788898888888653332211111 111111 1111111 123333222 33344444322 Q ss_pred Cc---eeecccccCCCcccccccc-eEE-ecCcccccCceEEEeccccEEEEeecceE-EEEeecccccceEEEEEEecc Q lcl|Aclame:pro 301 GR---YLLQDDITAVSGKVLLGKP-VFV-LSDEVLGANKAFIGDFKRGVLFADRKDLG-LRWADNEIYGQYLQAVLRFGV 374 (394) Q Consensus 301 G~---~l~~~~~~~~~~~~l~G~p-V~~-~~~~~~~~~~~~~gd~~~~~~~~~~~~~~-i~~~~~~~~~~~~r~~~r~d~ 374 (394) .+ .|-..+ +. -.+--|+. +++ +.+.+. .+-++-|-+ |.+ +-++++ ++.....++..-+.+.---.| T Consensus 233 anahvrikndd-te--iasevgvdeiivytgskal--kptvlvdqk--yhi-dmqdltkvdafewktnsnmilvetltsg 304 (318) T protein:vir:86 233 ANAHVRIKNDD-TE--IASEVGVDEIIVYTGSKAL--KPTVLVDQK--YHI-DMQDLTKVDAFEWKTNSNMILVETLTSG 304 (318) T ss_pred ccceeEEeccc-hh--hhhhcCcceeeeeeccccc--cceeeeccc--eec-chhhhhhhhcceeccCCceEEEeecccC Confidence 21 111111 00 01111211 111 111111 122444433 222 233332 111111111111222222233 Q ss_pred EEecccceEEEEec Q lcl|Aclame:pro 375 SKVDDKAGYYVTFT 388 (394) Q Consensus 375 ~v~~~~af~~l~~~ 388 (394) -|---+|-+.++++ T Consensus 305 hvetynagavitvs 318 (318) T protein:vir:86 305 HVETYNAGAVITVS 318 (318) T ss_pred cceeecCceeEEeC Confidence 33333444455555 No 213 >protein:vir:3783 Length: 336 # NCBI annotation: capsid # Family: family:all:201 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536823;genbank:gi:17981832;genbank:GeneID:929211 Probab=56.24 E-value=0.46 Score=22.43 Aligned_cols=274 Identities=12% Similarity=0.013 Sum_probs=116.9 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccc----ccCCccccchhHHhHHHHHHHhhhhhhhe Q lcl|Aclame:pro 86 TYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIK----KENAKPVSSEEILYTPAREVKTVVDLKPF 161 (394) Q Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~lvP~~~~~~I~~~~~~~~~l~~~ 161 (394) -....++.+ ........++. ..+....|.+.....+.+.+.+.+.+++. T Consensus 1 mtr~~~~~y---------------------------~~~~A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~ 53 (336) T protein:vir:37 1 MNKQAYYAL---------------------------AAALAKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKG 53 (336) T ss_pred CcHHHHHHH---------------------------HHHHHHHhCCChhhhcccceeecCHHHHHHHHHHHHHHHHHhhc Confidence 000001111 00111112222 12234567778889999999999999999 Q ss_pred eeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHh-cc-HHHHH-HHHHHHHH Q lcl|Aclame:pro 162 TTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESID-DA-DVDLV-GIVSESIS 238 (394) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~-ds-~~~l~-~~i~~~l~ 238 (394) ++++++..-.+...-...++.-++. ...+...... .++.-...-+..---+.++.+.|. ++ .+|+. ..+...+. T Consensus 54 INvv~V~e~~Ge~v~lg~~g~iagr-tdt~r~r~~~--~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~d~~~~~~~~~~~ 130 (336) T protein:vir:37 54 INMVQVAHTKGTKLFGATEKGVTGR-KQTGRNLATL--DHSQNGYELSETDSGILVNWSLFDSFAIFKDRLVELYSEYFQ 130 (336) T ss_pred CceeecccccceEEeeccCcccccc-cCCCCCcccc--CCCCCccEEEEeeeeeeccHHHHHHHhcChhHHHHHHHHHHH Confidence 9999998877665543322222221 1111111111 112111111111111223333332 11 12322 22333333 Q ss_pred HHHHHHHHHHHhhccc------------------------------------c------ccccccccHHH-HHHHHHhhh Q lcl|Aclame:pro 239 QIKVNTTNDAIAKVLK------------------------------------S------FTTKTVKNLDE-IKALLNGGF 275 (394) Q Consensus 239 ~~~~~~~~~a~~~g~~------------------------------------~------~~~~~~~~~~~-i~~~~~~~~ 275 (394) +.++.-.-.-.++|.. + ++.....+.|. +.++++ ++ T Consensus 131 r~iALD~i~IGfnG~s~A~~TdnPllqDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~-~I 209 (336) T protein:vir:37 131 NQVALDILQIGWNGQSVATNTTKTDLSDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQ-GL 209 (336) T ss_pred HHHhcchhhhcccceeeccCCCCccccccchhHHHHHHhccchhhcccccccCCceEEecCCCCcccHHHHHHHHHh-cc Confidence 3332211111111110 0 01111334565 345665 45 Q ss_pred hhhc-c---cEEEEcHHHHHH-HHhhhccCCc-eeecccccC---CCcccccccceEEecCcccccCceEEEeccccEEE Q lcl|Aclame:pro 276 DPAY-N---VSLIVSQSFYQT-LDTLKDGNGR-YLLQDDITA---VSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLF 346 (394) Q Consensus 276 ~~~~-~---a~~vm~~~~~~~-l~~lkd~~G~-~l~~~~~~~---~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~ 346 (394) ++.+ + -+.++.+..++. -..|-+.+|. |= ..+.. -...++-|+|.+..|.. |.+.+++=-|+..-+. T Consensus 210 ~~~~~~d~dLVvivG~dLla~~~~~l~~~~~~~Pt--E~~Aa~~~~~~k~iGGlpa~~~Pff--P~~~~lVT~L~NLsIY 285 (336) T protein:vir:37 210 DFRHQNRNDLVFLVGADLVSKETKLIQQKHGLTPT--EKAALGSHNLMGSFGGMNAITPPNF--PARAAAVTTLKNLSVY 285 (336) T ss_pred chHHhcCCCeEEEEchhhhhhhhhhhhhhcCCCHH--HHHHHHHHHHHHhhCCceEEEcccc--CCCceEEeeccccEEE Confidence 5543 2 367777766433 1223232222 20 00100 02357899999998844 6666676555553333 Q ss_pred EeecceEEEEeecccccceEEEEEE-eccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 347 ADRKDLGLRWADNEIYGQYLQAVLR-FGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 347 ~~~~~~~i~~~~~~~~~~~~r~~~r-~d~~v~~~~af~~l~~~~~~~~~ 394 (394) +-+....=...+.+...+.--.+.| -|..|-+..+++.+......-|- T Consensus 286 ~Q~gs~RR~~~d~p~r~rie~y~s~Ne~YvVEd~~~~a~iE~i~v~~~~ 334 (336) T protein:vir:37 286 TEAESVRRSLRNDEDKKGLVTSYYRQEGYVVEDLGLMTAIDHTKVKLNG 334 (336) T ss_pred EecCcEEEEEEEccccccccchhhhcceeeeeccccEEEeeeeeeeccc Confidence 3222322222233222221111222 24566677777777776666555 No 214 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=53.13 E-value=0.54 Score=22.07 Aligned_cols=280 Identities=11% Similarity=0.022 Sum_probs=110.8 Q ss_pred ccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhh--h Q lcl|Aclame:pro 80 VTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVV--D 157 (394) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~--~ 157 (394) ...+..... ..+.. +. .. .......+.+.. .....+..+++++--+.+.+.|..+..... . T Consensus 1 ~~~~~~~~~-~~~~~--------~~-----~~--~e~~~KS~~tg~-g~~p~~q~~~gAlR~esL~~~i~~Lt~~~~~~~ 63 (462) T protein:vir:96 1 MHKDTNLTA-EQNKY--------AD-----KF--QEEVMKSYQTGY-GITPDTQVDAGALRREILDDQITMLTWTQDDLI 63 (462) T ss_pred Cccccccch-hhhhh--------hc-----hh--hHHHHHHHhcCC-CcCCccccccchhhhhhhhhhhheeeecccchh Confidence 000000000 00000 00 00 000001111111 001112223344444455555443322222 1 Q ss_pred hhheeeeEeecCCceeEEEEecCCC-cccccccccccccccccccceeeecHhhhhhhhhhhHHH-HhccHHHHHHHHHH Q lcl|Aclame:pro 158 LKPFTTVYQAKKASGKYPVLQRATT-KMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQES-IDDADVDLVGIVSE 235 (394) Q Consensus 158 l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~el-l~ds~~~l~~~i~~ 235 (394) +..-+.+.++.+.-.++......+. +.+-+..|...+..+++.+...+..+|-++....+|... +..+..+..+...+ T Consensus 64 ~~~~i~k~~a~sTv~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~R~~~~~k~l~~t~~vsi~~tl~n~~~d~~~~~~~ 143 (462) T protein:vir:96 64 FYREISRRPAQSTVQKYDVYLRHGNVGHSRFVREVGVAPVSDPNIRQKTVEMKYVSDTKNLSIASTLVNNIQDPMQILTE 143 (462) T ss_pred hhhhcCCchhhhhhhhheeeeccCccccccccccccccccCCCceEEEEEEEEEEeeeeeechhhhhccchhhHHHHHHH Confidence 2333445555554444443333332 333444455555679999999999999999988888643 24445566677777 Q ss_pred HHHHHHHHHHHHHHhhcccccccc---ccccHHHHHHHHHh---------------------hhh-hhccc-EEEEcHHH Q lcl|Aclame:pro 236 SISQIKVNTTNDAIAKVLKSFTTK---TVKNLDEIKALLNG---------------------GFD-PAYNV-SLIVSQSF 289 (394) Q Consensus 236 ~l~~~~~~~~~~a~~~g~~~~~~~---~~~~~~~i~~~~~~---------------------~~~-~~~~a-~~vm~~~~ 289 (394) .-.-.++.+++.+++.|+..-.+. ....+|.+.+++.. ... .+.++ -++|+.-+ T Consensus 144 dai~~~a~tiE~a~Fygds~l~~~~~~~gleFDGl~~lI~~~NViDarG~~Ls~~~ln~aa~~i~~~fGt~TD~~~p~~v 223 (462) T protein:vir:96 144 DAIAVVAKTIEWASFYGDASLTADPTGQGLEFDGLAKLIDKDNVIDAKGESLTETLLNRSAVLIGKSFGTATDAYMPIGV 223 (462) T ss_pred HHHHHHHHHHHHHHhhhhcccCCCccccccchhhhhhhcCCCceeecCCCCccHHHHhhhhhhcccccCChhheecchHH Confidence 777788899999999998765542 24556766555421 000 11111 37788888 Q ss_pred HHHHHhhhccCCceeecccccC------------------CCcccccccceEEecCc------cccc---------CceE Q lcl|Aclame:pro 290 YQTLDTLKDGNGRYLLQDDITA------------------VSGKVLLGKPVFVLSDE------VLGA---------NKAF 336 (394) Q Consensus 290 ~~~l~~lkd~~G~~l~~~~~~~------------------~~~~~l~G~pV~~~~~~------~~~~---------~~~~ 336 (394) .+.|..---..-+.+..++..+ =.+.++++-|-+...+. +.+. .... T Consensus 224 ~a~f~~~~l~~qrv~~~~n~g~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~i~~~~~~~~p~ap~~~~vsaTv~t~~~g~ 303 (462) T protein:vir:96 224 HADFVNSVLGRQMQLMQDNSGNVNAGYNVQGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPATVKATVETGKKGL 303 (462) T ss_pred HHHHHHhhcCceEEEEcCCCCceeeeeeccceeeeeeeeeeCCceecCcccccccccccCCCCCCCCceeEEEEeCCCCC Confidence 8777631111111111111100 00112222222221100 0000 0001 Q ss_pred EEeccccEEEEeecceEEEEeecccccceEEEEEEeccEEecccce-----------EEEEecCccCCC Q lcl|Aclame:pro 337 IGDFKRGVLFADRKDLGLRWADNEIYGQYLQAVLRFGVSKVDDKAG-----------YYVTFTPEPLPL 394 (394) Q Consensus 337 ~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~r~~~r~d~~v~~~~af-----------~~l~~~~~~~~~ 394 (394) |||-. ....-.+++...-+..--.|... ++|+++.+|... T Consensus 304 f~~~~------------------d~~~y~Y~V~avs~dgeS~PS~~VtaTva~~~~gv~ltIt~~a~~~ 354 (462) T protein:vir:96 304 FTDEH------------------DRAELTYKVVVNSDDAQSAPSEAVTATVNNATDGVKLEISVNAMYQ 354 (462) T ss_pred CCCcc------------------CceeEEEEEEEECCCCccccceeeEeeeecccccceEEEEEcCCcc Confidence 11110 00000011111110000112222 222333222111 No 215 >protein:vir:3746 Length: 336 # NCBI annotation: orf15 # Family: family:all:201 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043487;genbank:gi:9628622;genbank:GeneID:1261135 Probab=52.15 E-value=0.56 Score=21.96 Aligned_cols=274 Identities=12% Similarity=0.027 Sum_probs=116.3 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccc----ccCCccccchhHHhHHHHHHHhhhhhhhe Q lcl|Aclame:pro 86 TYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIK----KENAKPVSSEEILYTPAREVKTVVDLKPF 161 (394) Q Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~lvP~~~~~~I~~~~~~~~~l~~~ 161 (394) -....++.++ .......++. ..+....|.+.....+.+.+.+.+.+++. T Consensus 1 mtr~~~~~y~---------------------------~~~A~~ngv~~a~~~~~~~Fsv~P~v~q~L~~~i~ess~FL~~ 53 (336) T protein:vir:37 1 MNKQAYYALA---------------------------AALAKHFNQPLDSVLRGESFALKAPEAALLGENIQQRSDFLKQ 53 (336) T ss_pred CcHHHHHHHH---------------------------HHHHHHhCCChhhhccCceeecCHHHHHHHHHHHHHHHHHhhc Confidence 0000011110 0111112222 12234567778899999999999999999 Q ss_pred eeeEeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHHHh-cc-HHHHH-HHHHHHHH Q lcl|Aclame:pro 162 TTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESID-DA-DVDLV-GIVSESIS 238 (394) Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~ell~-ds-~~~l~-~~i~~~l~ 238 (394) ++++++..-.+...-...++ ..+.-...+-.+ .+..++.....-+..---+.++.+.|. ++ .+|+. ..+...+. T Consensus 54 INvv~V~e~~Ge~v~lg~~g-~iagrtdt~R~~--~~~~l~~~~Y~c~qTn~dt~i~y~~LD~WA~~~df~~~~~~~~~~ 130 (336) T protein:vir:37 54 INMIQVAHTKGQKLFGATEK-GVTGRKQTGRNL--ANLDHTQNGFELAETDSGIIVPWALFDSFAIFKDRLVELYSEYFQ 130 (336) T ss_pred CceeecccccceEeeeccCc-ccccccCCCccc--cccCcCCcccEEEEeeeeeeecHHHHHHHhcChhHHHHHHHHHHH Confidence 99999988776655433222 222211111111 112222222222222222233333332 11 13332 22333334 Q ss_pred HHHHHHHHHHHhhccc------------------------------------c------ccccccccHHH-HHHHHHhhh Q lcl|Aclame:pro 239 QIKVNTTNDAIAKVLK------------------------------------S------FTTKTVKNLDE-IKALLNGGF 275 (394) Q Consensus 239 ~~~~~~~~~a~~~g~~------------------------------------~------~~~~~~~~~~~-i~~~~~~~~ 275 (394) +.++.-.-.-.++|.. + ++.....+.|. +.++++ ++ T Consensus 131 r~iALD~i~IGfnG~s~A~~TdnPllqDVNkGWlQ~~Re~a~~~v~~~~~~~~g~i~~~G~~gdy~NLDalV~D~~~-~I 209 (336) T protein:vir:37 131 NQVALDILQIGWNGQSVADNTTKADLSDVNKGWLKLLQEQRAANFMTESTKSSGKITIFGDNADYANLDDLAFDLKQ-GL 209 (336) T ss_pred HHHhhchhhhcccceeeccCCCCCcccccchhHHHHHHhccchhhcccccccCCceEEecCCCCcccHHHHHHHHHh-cC Confidence 4332211111111111 0 01111334565 345665 45 Q ss_pred hhhc-c---cEEEEcHHHHHH-HHhhhccCCc-eeecccccC---CCcccccccceEEecCcccccCceEEEeccccEEE Q lcl|Aclame:pro 276 DPAY-N---VSLIVSQSFYQT-LDTLKDGNGR-YLLQDDITA---VSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLF 346 (394) Q Consensus 276 ~~~~-~---a~~vm~~~~~~~-l~~lkd~~G~-~l~~~~~~~---~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~ 346 (394) ++.+ + -+.++.+..++. -..|-+.+|. |= ..+.. -...++-|+|.+..|. .|.+.+++=-|+..-+. T Consensus 210 ~~~~~~d~dLVvivG~dLla~~~~~l~~~~~~~Pt--E~~Aa~~~~~~k~iGGlpa~~~Pf--fP~~~~lVT~L~NLsIY 285 (336) T protein:vir:37 210 DFRHQNRNDLVFLVGADLVSKETKLIQQKHGLTPT--EKAALGSHNLMGSFGGMNAITPPN--FPARAAAVTTLKNLSVY 285 (336) T ss_pred chHHhcCCCeEEEEchhhhhhhhhhhhhhcCCCHH--HHHHHHHHHHHHhhCCceeEEccc--cCCCceEEeechhcEEE Confidence 5543 2 367777766433 2223333322 20 00100 0224889999999884 46666676655553333 Q ss_pred EeecceEEEEeecccccceEEEEEE-eccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 347 ADRKDLGLRWADNEIYGQYLQAVLR-FGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 347 ~~~~~~~i~~~~~~~~~~~~r~~~r-~d~~v~~~~af~~l~~~~~~~~~ 394 (394) +-+....=...+.+...+.--.+.| -|..|-+..+++.+.....--|- T Consensus 286 ~Q~gs~RR~~~d~p~r~rie~y~s~Ne~YvVEd~~~~a~iE~i~v~~~~ 334 (336) T protein:vir:37 286 TEAESVRRSLRNDEDKKGLVTSYYRQEGYVVEDLGLMTAIDHTKVKLNG 334 (336) T ss_pred EecCcEEEEEEEccccccccchhhhcceeeeeccccEEEeeeeeeeecC Confidence 3222222222222222211111112 24456666666666655554444 No 216 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=50.19 E-value=0.62 Score=21.73 Aligned_cols=336 Identities=11% Similarity=0.066 Sum_probs=86.7 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEV 80 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) =|+++++++++++.++.++.+....+.+...-++-..+++.+.+++++++++++.++...+................... T Consensus 8 el~~~l~e~~~~i~~~~~e~~~~~~~~~~~~~~~l~~eie~l~~ei~~l~~~~~~~e~~~e~~~~~~~~~~~~~~~~~~~ 87 (394) T protein:vir:97 8 EIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVTQEEKTY 87 (394) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccchhhHHH Confidence 44455566666666555544433332222111122356778889999999888877766554332221111111111111 Q ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcc--c-ccCCccccchhHHhHHHHHHHhhhh Q lcl|Aclame:pro 81 TQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGI--K-KENAKPVSSEEILYTPAREVKTVVD 157 (394) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~-~~~~~~lvP~~~~~~I~~~~~~~~~ 157 (394) ................................... .............+. . ......++........+..+-...+ T Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~ 166 (394) T protein:vir:97 88 RESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETT-PVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQ 166 (394) T ss_pred HHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHhhh-hhhhhccccccccccccChHHHHHHHHHHhhhhhhhhhhceeee Confidence 01111111111111111111111111011111000 011111111111111 1 1111111111111112222211222 Q ss_pred hhheeeeEee---cCCceeEEEEecCCCcccccccccccccccccccce------eeecHhhhhhh-hhhhHHHHhccHH Q lcl|Aclame:pro 158 LKPFTTVYQA---KKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKD------VAWNIDTYRGA-IPLSQESIDDADV 227 (394) Q Consensus 158 l~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~------v~~~~~~~~~~-~~vs~ell~ds~~ 227 (394) +......+++ .++...+. .. + ....+. ..+.....++.. +.++-.-+... +.+-.. T Consensus 167 ~~~~~~~~~~~~~~~~~~~~v--~E--~--~~~~~~-~~~~~~~v~l~~~k~~~~i~is~ell~ds~~~~~~~------- 232 (394) T protein:vir:97 167 AKKASGKYPVLQRATTKMVTV--AE--L--EKNPAL-AKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGI------- 232 (394) T ss_pred ccCcceEEEEEecCCCcccee--cc--c--cccccc-ccccceeEEeehhheeeehhhHHHHHhhhhHHHHHH------- Confidence 2221122222 12222221 11 1 111111 111111112222 22222211111 111111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccccccccccccH-------------------HHHHHHHHhhhhhhcccEEEEcHH Q lcl|Aclame:pro 228 DLVGIVSESISQIKVNTTNDAIAKVLKSFTTKTVKNL-------------------DEIKALLNGGFDPAYNVSLIVSQS 288 (394) Q Consensus 228 ~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~~~~~-------------------~~i~~~~~~~~~~~~~a~~vm~~~ 288 (394) +...|.+.++......+-...-.+. +.+..+.... ......+..+.+. +..++..|. T Consensus 233 -i~~~la~~~~~~~~~~i~~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~l~~lkd~--~G~~i~~~~ 308 (394) T protein:vir:97 233 -VSESISQIKVNTTNDAIAKVLKSFT-TKTVKNLDEIKALLNGGFDPAYNVSLIVSQSFYQTLDTLKDG--NGRYLLQDD 308 (394) T ss_pred -HHHHHHHHHHHHHHHHHhhcccccc-ccccccHHHHHHHHHhhhhhhhCCEEEEcHHHHHHHHHhhcc--CCCeeeecC Confidence 3344444444433332222111111 1111111111 1112222222222 223333322 Q ss_pred HHHHHHhhhcc-----CCceeec-ccccCCCcccccccceEEecCcccccCceEEEecccc-------------EEEEee Q lcl|Aclame:pro 289 FYQTLDTLKDG-----NGRYLLQ-DDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRG-------------VLFADR 349 (394) Q Consensus 289 ~~~~l~~lkd~-----~G~~l~~-~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~-------------~~~~~~ 349 (394) +.+. .|.|+.. ++...+....++| +- +..+++++.... +....| T Consensus 309 -------~~~~~~~~l~G~pv~~~~~~~~~~~~~~~g------d~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r 371 (394) T protein:vir:97 309 -------ITAVSGKVLLGKPVFVLSDEVLGANKAFIG------DF----KRGVLFADRKDLGLRWADNEIYGQYLQAVLR 371 (394) T ss_pred -------cCCCCCceeccceeEEecccccCCccEEEe------ec----cccEEEEEecceEEEEecccccceeEEEEEE Confidence 1222 3444332 1111111111222 10 001122222111 111111 Q ss_pred cceEEEEeecccccceEEEEEEeccEEecccce Q lcl|Aclame:pro 350 KDLGLRWADNEIYGQYLQAVLRFGVSKVDDKAG 382 (394) Q Consensus 350 ~~~~i~~~~~~~~~~~~r~~~r~d~~v~~~~af 382 (394) -+..+. +. ..+...- + -.-|..| T Consensus 372 ~d~~v~--~~----~a~~~~~---~-~~~~~p~ 394 (394) T protein:vir:97 372 FGVSKV--DD----KAGYYVT---F-TPEPLPL 394 (394) T ss_pred EccEEe--cc----cceEEEE---e-cccccCC Confidence 111111 00 1111110 0 0011112 No 217 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=44.29 E-value=0.81 Score=21.08 Aligned_cols=341 Identities=14% Similarity=0.040 Sum_probs=94.1 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhch--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc---cc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALES--DDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAE---NI 75 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~--e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~---~~ 75 (394) =|++.+++++++++++++++++...+.+...++ +..++++++++++++++++++.++................ .. T Consensus 3 eL~~~~~~~~~~~~e~~~~l~~~~~~~~~~~e~~~~l~~ei~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 82 (389) T protein:vir:10 3 KLQTLFNDVSAKCADLNAQLNAKLQDENASVDDFQKIKDDLTAAKARRDAINDQIKALEAEKPAEPKTEPKDDGSKKGTD 82 (389) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccc Confidence 777888999999999988888766555433222 2234567778888888888888766554433222211111 11 Q ss_pred ccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhh Q lcl|Aclame:pro 76 GGKEVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTV 155 (394) Q Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~ 155 (394) ...............+.................... ..+...-...++.......++..+-.. T Consensus 83 ~~~~~~~~~~~~~~~~lr~~~~~~~~~~~~t~~~gg-----------------~~vP~~~~~~i~~~~~~~~~l~~~~~~ 145 (389) T protein:vir:10 83 LSKKPIDAKKKAINDFIHSHGKVIDATSKVTSTEAG-----------------VLIPEEIIYDPTAEVNSVVDLSTLVTK 145 (389) T ss_pred cchhHHHHHHHHHHHHhhcchhhhhhhcccccCCcc-----------------eeehHHHHHHHHHHHHhhhhHHhhcce Confidence 111111111111111111111111111000000000 000000000011111111111111111 Q ss_pred hhhhheeeeEeecC-CceeEEEEecCCCcccccccccccccccccc-----cceeeecHhhhhhhhhhhHHHHhccHHHH Q lcl|Aclame:pro 156 VDLKPFTTVYQAKK-ASGKYPVLQRATTKMVTVAELEKNPALAKPD-----FKDVAWNIDTYRGAIPLSQESIDDADVDL 229 (394) Q Consensus 156 ~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~-----~~~v~~~~~~~~~~~~vs~ell~ds~~~l 229 (394) .++......+++.. ........... +...+.. .+.....+ +..+.--.+++-.-..+ .+. .+ + T Consensus 146 ~~~~~~~~~~~~~~~~~~~~~~~~E~----~~~~~~~-~~~~~~i~~~~~k~~~~~~iS~ell~ds~~--~l~---~~-i 214 (389) T protein:vir:10 146 TPVTTPKGTYPILKRATDRFSSVAEL----AENPKLA-EPEFNKVDWSVATYRGAIPLSEEAIADSAV--DLT---AL-V 214 (389) T ss_pred eeccCCeeEEEEEecCCCcccccccc----ccccccc-cccceeeeeeheeeEeeehhhHHHHhhhhH--HHH---HH-H Confidence 11211111111111 11111111111 1111111 11111111 11111111111111111 011 11 3 Q ss_pred HHHHHHHHHH----HHHHHHHHHHhhccccccccccccH-------------------HHHHHHHHhhhhhhcccEEEEc Q lcl|Aclame:pro 230 VGIVSESISQ----IKVNTTNDAIAKVLKSFTTKTVKNL-------------------DEIKALLNGGFDPAYNVSLIVS 286 (394) Q Consensus 230 ~~~i~~~l~~----~~~~~~~~a~~~g~~~~~~~~~~~~-------------------~~i~~~~~~~~~~~~~a~~vm~ 286 (394) ...|.+.++. .+..........+.. +..+...+ ......+..+.+. +..++.+ T Consensus 215 ~~~la~~~~~~~~~~i~~g~~~~~~~~~~--~~~~~d~l~~~~~~~~~~~~~a~~~~n~~~~~~L~~lkd~--~G~~i~~ 290 (389) T protein:vir:10 215 GQSIKEKSVNTYNAMIAPVLQSFTAKKTT--TDTLVDSLKHILNVDLDPAYSRALVVTQSLFNTLDTLKDK--NGRYLLH 290 (389) T ss_pred HHHHHHHHHHHHHHHHhhhhccccccccc--ccccHHHHHHHHHhhhhhhhCcEEEecHHHHHHHHHhhcc--CCCeeee Confidence 3334444433 333332222212111 11111111 1112222222222 2234433 Q ss_pred HHHHHHHH-hhh-ccCCceee-cccccCCCcccccc-cceEEecCcccccCceEEEeccccEEEEee-----cceEEEEe Q lcl|Aclame:pro 287 QSFYQTLD-TLK-DGNGRYLL-QDDITAVSGKVLLG-KPVFVLSDEVLGANKAFIGDFKRGVLFADR-----KDLGLRWA 357 (394) Q Consensus 287 ~~~~~~l~-~lk-d~~G~~l~-~~~~~~~~~~~l~G-~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~-----~~~~i~~~ 357 (394) +....... .-. .=.|.|+. .++...+ .--| .++++-+ -+..++++|.....+-+.+ ..+.+ .. T Consensus 291 ~~~~~~~~~~~~~~l~G~pV~~~~~~~~~---~~~~~~~~~~gd----~~~~~~~~~~~~~~i~~~~~~~~~~~~~~-~~ 362 (389) T protein:vir:10 291 DASDSITDGTAKGTILGVPVYVVGDTLLG---SLAGDQKAFVGD----LKRGVLFTDRQQVTLAWEDSKIYGKYLGA-AF 362 (389) T ss_pred cCcccccccccccccccceeEEecccccC---CCCCceEEEEee----ccccEEEEeecceEEEeeccccccceEEE-EE Confidence 33211000 000 01466653 2211111 1111 1222211 0011223332211111111 00000 00 Q ss_pred eccc---ccceEEEEEEeccEEecccceEEEEecCccCC Q lcl|Aclame:pro 358 DNEI---YGQYLQAVLRFGVSKVDDKAGYYVTFTPEPLP 393 (394) Q Consensus 358 ~~~~---~~~~~r~~~r~d~~v~~~~af~~l~~~~~~~~ 393 (394) +.+. ....++.. .+.. +-+++|+- T Consensus 363 r~d~~~~~~~a~~~~-~~~~-----------~~~~~~~~ 389 (389) T protein:vir:10 363 RFGVQKADSKAGYFV-TNTD-----------VPGSALGK 389 (389) T ss_pred EeccEEecccceEEE-Eeec-----------cCCCCCCC Confidence 0000 01111111 1110 11111111 No 218 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=40.42 E-value=0.97 Score=20.65 Aligned_cols=284 Identities=8% Similarity=-0.012 Sum_probs=113.2 Q ss_pred chhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHh----HHHHHHHhhhh Q lcl|Aclame:pro 82 QEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILY----TPAREVKTVVD 157 (394) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~----~I~~~~~~~~~ 157 (394) .........+..+.- .+.... .....+... +...+ ...+.++ ...+...+|..+.. .+++-+..... T Consensus 1 ~~~~~~~~~l~~~gi----~~~~~~-~~~~~~~~~-~~~da--~d~~~~~-~~~~~~~i~~~l~~~i~p~~~~~~~~p~~ 71 (336) T protein:vir:10 1 MRDAQRIQNLARAGV----ILPRSV-QNVSTPLTE-YAMDA--ADLSPHL-SSTGSSGIPNYLTTYVDPAVIDILVAPMK 71 (336) T ss_pred CchHHHHHHHhhcCe----eecchh-hhhhhhHHH-hhhhh--hhccCcc-ccCCCchhHHHHHhhcccceeeehhhhhh Confidence 111111111111000 000000 000000000 00000 0000111 11223345554443 33444444444 Q ss_pred hhheeeeEeecC---CceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhH-HHHhcc--HHHHHH Q lcl|Aclame:pro 158 LKPFTTVYQAKK---ASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQ-ESIDDA--DVDLVG 231 (394) Q Consensus 158 l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~-ell~ds--~~~l~~ 231 (394) ...++.+.+++. ....+++.. ..+.+...+.....| .++...+.-+-+.+.++..+.++. |+-.-. -.++.+ T Consensus 72 a~~l~pv~t~g~W~~~~~~~~~~e-~~G~a~~ygd~~D~P-~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~ 149 (336) T protein:vir:10 72 AAELVGESKKGDWTTLVAAFITAE-PTTKVATYGDYSSDG-DSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLAS 149 (336) T ss_pred hhhhccccccCCccceeEEEeeee-ceeeEEEeeccCCCc-eeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHH Confidence 444554433321 122334432 345555566666554 455556666667788888888884 444322 235666 Q ss_pred HHHHHHHHHHHHHHHHHHhhcccccc------------------c-ccccc----HHHHHHHHHhhhhhhc-------cc Q lcl|Aclame:pro 232 IVSESISQIKVNTTNDAIAKVLKSFT------------------T-KTVKN----LDEIKALLNGGFDPAY-------NV 281 (394) Q Consensus 232 ~i~~~l~~~~~~~~~~a~~~g~~~~~------------------~-~~~~~----~~~i~~~~~~~~~~~~-------~a 281 (394) --+...++++....|.-.+.|....+ + ...++ ++||..++..+..... .- T Consensus 150 ~Ka~aA~~ale~~~N~i~~~Gd~~~~~yGllN~P~l~a~~t~~t~~~~~~t~eei~~Di~~~~~~l~~qs~G~i~~~~~~ 229 (336) T protein:vir:10 150 ELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVL 229 (336) T ss_pred HHHHHHHHHHHHhhCcEEEEeccccceEEEEeCCCCccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeecccCcc Confidence 66666677777777765554432211 0 01122 3445555544443321 24 Q ss_pred EEEEcHHHHHHHHhhhccCCceeecccccCCCcccccccceEEecC--cccccCceEEEeccccEEEEee-cceEEEEe- Q lcl|Aclame:pro 282 SLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSD--EVLGANKAFIGDFKRGVLFADR-KDLGLRWA- 357 (394) Q Consensus 282 ~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~--~~~~~~~~~~gd~~~~~~~~~~-~~~~i~~~- 357 (394) .++|.++.+..|.. .+..|.-++.- +... +-++.++..+. .+.+....++-+- .+. ....+.+. T Consensus 230 tL~LP~~~~~~Ls~-~n~~g~Tvl~~-lk~n----~Pnl~i~t~pEl~~a~G~~~~l~~~~------~~~~~t~~~~~p~ 297 (336) T protein:vir:10 230 RMGLPPTAMSDLSK-TNQYGLAAAAK-LKDI----FPKLEFVTIPEYDTASGRLVQLWAPR------VEGKDTATCGFTE 297 (336) T ss_pred eEEecHHHHHhccC-CCccCccHHHH-HHHh----cCccEEEEccccccCCCceEEEEEEe------cCCCcceeeecch Confidence 68899998887754 33334323210 1111 11111222221 1112111111100 000 11111110 Q ss_pred ---eccc--ccce--EEEEEEec-cEEecccceEEEEec Q lcl|Aclame:pro 358 ---DNEI--YGQY--LQAVLRFG-VSKVDDKAGYYVTFT 388 (394) Q Consensus 358 ---~~~~--~~~~--~r~~~r~d-~~v~~~~af~~l~~~ 388 (394) .+.- .... .-+..|.+ ..+.+|-||++++.- T Consensus 298 ~~~~l~vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:10 298 KMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred hhhccceeecCceeEeccccceeeeeeeccchheeeecC Confidence 1100 0111 23446664 477789999999988 No 219 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=38.50 E-value=1.1 Score=20.44 Aligned_cols=290 Identities=10% Similarity=-0.025 Sum_probs=111.2 Q ss_pred cccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCc--cccchhHHhHHHHHHHhhh Q lcl|Aclame:pro 79 EVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAK--PVSSEEILYTPAREVKTVV 156 (394) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~lvP~~~~~~I~~~~~~~~ 156 (394) -...........+.++... ++............ .+..... .......+....+ .-.++.+.+.|++...... T Consensus 1 ~~~~~~~~~~~~l~~~g~~----~~~~~~~~~~~~~~-~~a~d~~-~~~~~~~~~~~~~i~a~~~~~i~~~vy~~~~~~~ 74 (339) T protein:vir:94 1 MSINNDRTDIKQLEKVGII----FDGYSPKSISSEVS-AYAMDAV-NLTPTLQTTANAGIPAWMTTFVDRRVIDIQLAPM 74 (339) T ss_pred CceechHHHHHHHHhhcee----eccchhhhcchhhH-hhhcccc-ccccccccccccchhhhhhhhhchhheeeccccc Confidence 0000000011111111000 00000000000000 0000000 0000011111111 1233444455666666666 Q ss_pred hhhheeeeEeecC---CceeEEEEecCCCcccccccccccccc-cccccceeeecHhhhhhhhhhh-HHHHhc--cHHHH Q lcl|Aclame:pro 157 DLKPFTTVYQAKK---ASGKYPVLQRATTKMVTVAELEKNPAL-AKPDFKDVAWNIDTYRGAIPLS-QESIDD--ADVDL 229 (394) Q Consensus 157 ~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~e~~~~~~~-~~~~~~~v~~~~~~~~~~~~vs-~ell~d--s~~~l 229 (394) ..+.++++.+.+. .++.+++.. ..+.+...+.....|-. -+..+...++....++ +.++ .|+-.- ...++ T Consensus 75 ~~~~l~pv~t~g~w~~~t~~y~~~e-~~G~a~~ygd~ad~Pl~~~~v~~~~~~v~~~~~g--~~y~~~E~~~A~~~g~~l 151 (339) T protein:vir:94 75 AAAKIFPEVKKGDWTTTYGVFIIAE-PVGQVATYSDWSANGMSKANVNFESRQNYRYQTW--TEYGDLEMATYGEAGIDY 151 (339) T ss_pred chhhhcccccCCCCcccEEEEeeee-cccceEEcccccCCCcccccceeeEEeEEEEEEE--EeecHHHHHHHHhhCCCh Confidence 6666666655543 245555544 34555566666666432 2244555444443333 3344 233322 12356 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccccc------------ccc------cccHHHHH----HHHHhhhhhhc-----c-- Q lcl|Aclame:pro 230 VGIVSESISQIKVNTTNDAIAKVLKSFT------------TKT------VKNLDEIK----ALLNGGFDPAY-----N-- 280 (394) Q Consensus 230 ~~~i~~~l~~~~~~~~~~a~~~g~~~~~------------~~~------~~~~~~i~----~~~~~~~~~~~-----~-- 280 (394) .+--....++++....|...+.|....+ ..+ ..+.+.|+ .++..+....- + T Consensus 152 ~~~Ka~aA~~al~~~~N~i~~~Gd~~~~~~GLlN~P~l~~~v~~s~~Wa~kT~~eI~~Di~~~~~~l~~~s~g~~~~~~~ 231 (339) T protein:vir:94 152 VARQEISASLVMAKFANSSYLLGVAGIANYGLMNDPSLPAPVAATVNWATAAPEDIANDVVAMVGRLISQSGGLITGQER 231 (339) T ss_pred HHHHHHHHHHHHHHhhceEEeeeecccceEEEEeCCCccccccCCCCcccCCHHHHHHHHHHHHHHHHHhcCCeeeeccC Confidence 6666666667777777766555543211 011 12344444 44433322211 2 Q ss_pred cEEEEcHHHHHHHHhhhccCCceeecccccCCCcccccccceEEecC--cccccCceEEEeccccEEEEeecceEEEEe- Q lcl|Aclame:pro 281 VSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSD--EVLGANKAFIGDFKRGVLFADRKDLGLRWA- 357 (394) Q Consensus 281 a~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~--~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~- 357 (394) -+++|.++.+..|..- +..|.-++.- +... +.+..++..+. .+.+....++-+. ..+..-+.+.+. T Consensus 232 ~~L~LP~~~~~~L~~~-n~~~~Tvl~~-lk~n----~pnl~i~~~~el~~a~g~~~~~~~~~-----~~~~~~~~~~~p~ 300 (339) T protein:vir:94 232 MVMALAPSALNNVNRT-NNFGLSAGAK-IAQT----YPNIQFVAVPEFDTASGRLVQLWVPE-----VNGQPTGEVAFAE 300 (339) T ss_pred cEEEecHHHHHhcccC-CcCCccHHHH-HHHh----cCCcEEEEccccccCCCceEEEEEEe-----ccCCcceEEEcch Confidence 2688999999988653 4444333210 1111 11222222221 1111111111100 000111111111 Q ss_pred -----ecccccce--EEEEEEe-ccEEecccceEEEEec Q lcl|Aclame:pro 358 -----DNEIYGQY--LQAVLRF-GVSKVDDKAGYYVTFT 388 (394) Q Consensus 358 -----~~~~~~~~--~r~~~r~-d~~v~~~~af~~l~~~ 388 (394) ........ .-+..|. |..+++|.||++++.- T Consensus 301 ~~~~lpvq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 301 KLRSHSIERYSTTTRQKHSGATFGAVIYQPWAVTQELGV 339 (339) T ss_pred hhhccccEEcCceEEecceeeeeeEEEEccceeeeeecC Confidence 00011111 2445675 5578889999999988 No 220 >protein:vir:8846 Length: 705 # NCBI annotation: putative portal protein # Family: family:all:1548 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775254;genbank:gi:27476052;genbank:GeneID:2700600 Probab=38.21 E-value=1.1 Score=20.40 Aligned_cols=121 Identities=12% Similarity=-0.006 Sum_probs=13.3 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHH--HHhchhhH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVK--NALESDDL--EAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIG 76 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~--~~~~~e~~--~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~ 76 (394) .++++..++++...+...+......+.+ ....+... .+......|.+...++++..+.+....++..+...+.... T Consensus 581 ~~e~~~~~~~~~q~e~~~~~~~~~~q~e~~k~q~e~~~~q~e~q~~q~E~q~~q~e~e~~~~~~~~~~~e~~~~~a~~~~ 660 (705) T protein:vir:88 581 SPEALQAKAIREQKEAQPKPEDIKAQADAQRAQSDALAKQAEAQMKQVEAQIRLAEIELKKQEAVLQQREMALKEAELQL 660 (705) T ss_pred hHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2222222222222222211111111110 00000000 0000001111111111111000000000000000000000 Q ss_pred cccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhh Q lcl|Aclame:pro 77 GKEVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQK 128 (394) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 128 (394) . ..............+...... .....................+. T Consensus 661 ~----~~~~e~e~~~~e~e~~~e~~q---~~~~~~~~~~~~~~~k~~~~~rr 705 (705) T protein:vir:88 661 E----RDRFTWERARNEAEYHLEATQ---ARAAYIGDGKVPETKKPTKAVRR 705 (705) T ss_pred H----HHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHhHHHHHHHHHHhcC Confidence 0 000000000000000000000 00000000000000000111111 No 221 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=37.12 E-value=1.1 Score=20.28 Aligned_cols=284 Identities=8% Similarity=-0.011 Sum_probs=112.7 Q ss_pred chhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHh----HHHHHHHhhhh Q lcl|Aclame:pro 82 QEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILY----TPAREVKTVVD 157 (394) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~----~I~~~~~~~~~ 157 (394) .........+..+.- .+.... .....+... +...+ ......+. ......+|..+.. .+++.+..... T Consensus 1 ~~~~~~~~~l~~~gi----~~~~~~-~~~~~~~~~-~~~da--~d~~~~~~-~~~~~~~~~~l~~~i~p~~~~~~~~~~~ 71 (336) T protein:vir:36 1 MRDAQRIQNLARAGV----ILPRSV-QNVSTPLTE-YAMDA--ADLSPHLS-STGSSGIPNYLTTYVDPSVIDILVAPMK 71 (336) T ss_pred CchHHHHHHHhhcCe----eecchh-hhhhhHHHH-hhhhh--hhccCccc-cCCCcchHHHHHHhhccceEeeecchhh Confidence 111111111111000 000000 000000000 00000 00001111 1223345555544 33444444444 Q ss_pred hhheeeeEeecC---CceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhh-HHHHhcc--HHHHHH Q lcl|Aclame:pro 158 LKPFTTVYQAKK---ASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLS-QESIDDA--DVDLVG 231 (394) Q Consensus 158 l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs-~ell~ds--~~~l~~ 231 (394) ...++.+.+++. ....+++.. ..+.+...+.....| .++...+.-+-+.+.++..+.++ .|+..-. ..++.+ T Consensus 72 ~~~l~pv~t~g~W~~~~~~~~~~e-~~G~a~~ygd~~D~P-~~d~~~~~~~~~v~~~~~g~~yg~~E~~~Aa~~~~~l~~ 149 (336) T protein:vir:36 72 AAELVGESKKGDWTTLVAAFITAE-PTTKVATYGDYSSDG-DSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLAS 149 (336) T ss_pred hhhhccccccCCccceeEEEeeee-ceeeEEEeeccCCCc-eeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHH Confidence 445554433321 122334332 345555566666554 45555666666778888888887 4554422 235666 Q ss_pred HHHHHHHHHHHHHHHHHHhhcccccc------------------c-ccccc----HHHHHHHHHhhhhhhc-------cc Q lcl|Aclame:pro 232 IVSESISQIKVNTTNDAIAKVLKSFT------------------T-KTVKN----LDEIKALLNGGFDPAY-------NV 281 (394) Q Consensus 232 ~i~~~l~~~~~~~~~~a~~~g~~~~~------------------~-~~~~~----~~~i~~~~~~~~~~~~-------~a 281 (394) --+...++++....|.-.+.|....+ + .+.++ ++||..++..+..... .- T Consensus 150 ~Ka~aA~~ale~~~N~i~~~Gd~~~~~yGllNdP~l~a~~t~~t~~~~~~t~~ei~~Di~~~~~~l~~qt~G~i~~~~~~ 229 (336) T protein:vir:36 150 ELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVL 229 (336) T ss_pred HHHHHHHHHHHHhhCcEEEEeccccceEEEEecCCCccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeeeecccc Confidence 66666666777666665544432211 0 01122 3444444444433221 23 Q ss_pred EEEEcHHHHHHHHhhhccCCceeecccccCCCcccccccceEEecC--cccccCceEEEeccccEEEEee-cceEEEEe- Q lcl|Aclame:pro 282 SLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSD--EVLGANKAFIGDFKRGVLFADR-KDLGLRWA- 357 (394) Q Consensus 282 ~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~--~~~~~~~~~~gd~~~~~~~~~~-~~~~i~~~- 357 (394) .++|.++.+..|.. .+..|.-++.- +... +-++.++..+. .+.+....++-+- .+. ....+.+. T Consensus 230 tL~LP~~~~~~Ls~-~n~~g~Tvl~~-lk~n----~Pnl~i~t~pEl~~a~g~~~~l~~~~------~~~~~t~~~~~p~ 297 (336) T protein:vir:36 230 RMGLPPTAMSDLSK-TNQYGLAAAAK-LKDI----FPKLEFVTIPEYDTASGRLVQLWAPR------VEGKDTATCGFTE 297 (336) T ss_pred EEEechHHHHhccC-CCccCccHHHH-HHHh----cCccEEEEccccccCCCceEEEEEEe------cCCCcceeeecch Confidence 68899998887754 33334323210 1111 11111222221 1112111111100 001 11111110 Q ss_pred ---eccc--ccce--EEEEEEec-cEEecccceEEEEec Q lcl|Aclame:pro 358 ---DNEI--YGQY--LQAVLRFG-VSKVDDKAGYYVTFT 388 (394) Q Consensus 358 ---~~~~--~~~~--~r~~~r~d-~~v~~~~af~~l~~~ 388 (394) .+.- .... .-+..|.+ ..+.+|-||++++.- T Consensus 298 ~~~~l~vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:36 298 KMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred hhhccceeecCceeEeccccceeeeeeeccchheeeecC Confidence 0100 0111 23446664 477789999999988 No 222 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=36.87 E-value=1.1 Score=20.25 Aligned_cols=283 Identities=8% Similarity=-0.001 Sum_probs=113.4 Q ss_pred chhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHH----hHHHHHHHhhhh Q lcl|Aclame:pro 82 QEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEIL----YTPAREVKTVVD 157 (394) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~----~~I~~~~~~~~~ 157 (394) ....+....+..+.-. +.. .......+. .... ......+.++. +.....+|..+. +.+++.+..... T Consensus 1 ~~~~~~~~~l~~~gi~----~~~-~~~~~~~~~-~~~a--~da~d~~~~~~-t~~~~g~~~~l~~~i~p~~~~~~~~~~~ 71 (336) T protein:vir:78 1 MRDAQRIQNLARAGVI----LPR-SVKNVSTPL-AEYA--MDAADLSPHLS-STGSSGIPNYLTTYVDPSVIDILVAPMK 71 (336) T ss_pred CchHHHHHHHhccCee----cch-hhhhhhHHH-HHHH--Hhhhhhccccc-cCCCcchHHHHHHhcccceeeehhhhhh Confidence 1111111111111000 000 000000000 0000 00000011111 111223444433 344444444444 Q ss_pred hhheeeeEeecC---CceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhH-HHHhc--cHHHHHH Q lcl|Aclame:pro 158 LKPFTTVYQAKK---ASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQ-ESIDD--ADVDLVG 231 (394) Q Consensus 158 l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~-ell~d--s~~~l~~ 231 (394) ...++.+.+++. ....+++.. ..+.+...+.....| ..+...+.-+-+.+.++..+.++. |+-.- .-.++.+ T Consensus 72 ~~~l~~v~t~g~W~~~~~~~~~~e-~~G~a~~ygd~~D~P-~vd~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~ 149 (336) T protein:vir:78 72 AAELVGESKKGDWTTLVAAFITAE-PTTTVATYGDYSSDG-DSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLAS 149 (336) T ss_pred hhhhcccccCCCccccEEEEeeee-cceeeEEeecccCCC-eeecceeeEEEEEEEEEeeeeecHHHHHHHHHhCCCcHH Confidence 445555444321 123444433 345555666666664 466677788888888888888885 34332 1235666 Q ss_pred HHHHHHHHHHHHHHHHHHhhccccccc---------------c----ccccHHHHHHHHHhhhh----hhc-----c--c Q lcl|Aclame:pro 232 IVSESISQIKVNTTNDAIAKVLKSFTT---------------K----TVKNLDEIKALLNGGFD----PAY-----N--V 281 (394) Q Consensus 232 ~i~~~l~~~~~~~~~~a~~~g~~~~~~---------------~----~~~~~~~i~~~~~~~~~----~~~-----~--a 281 (394) --+...++++....|.-.+.|....+. . ...+.+.|++-++.++. ... + - T Consensus 150 ~Ka~aA~~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~~~w~~~T~~~I~~Di~~~~~~l~~qt~g~~~~~~~~ 229 (336) T protein:vir:78 150 ELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVL 229 (336) T ss_pred HHHHHHHHHHHHhhCeEEEEeccccceEEEEeCCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccce Confidence 666666667766666655444322100 0 11233444443333222 221 2 2 Q ss_pred EEEEcHHHHHHHHhhhccCCceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEee---cc-eEEEEe Q lcl|Aclame:pro 282 SLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADR---KD-LGLRWA 357 (394) Q Consensus 282 ~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~---~~-~~i~~~ 357 (394) .++|.++.+..|.. .+..|.-++.- +....| ++.++..+.. .+++ |+-. +++... .+ +.+.+. T Consensus 230 tL~Lp~~~~~~L~~-~n~~g~tv~~~-lk~n~P----nl~i~t~pel-~~Ag----g~~~--~~~~~~~~~~~t~~~~~p 296 (336) T protein:vir:78 230 HMGLPPTAMSDLSK-TNQYGLSAAAK-LKEIFP----KLEFVTIPEY-DTAS----GRLV--QLWAPRVEGKDTATCGFT 296 (336) T ss_pred EEEechHHHHhccC-CCccCccHHHH-HHHhcC----ccEEEEcccc-cccC----cceE--EEEEeeccCCcceeeecc Confidence 68899999988864 33334322210 111111 1122222211 1111 1110 111110 01 111110 Q ss_pred ----ecc--cccce--EEEEEEe-ccEEecccceEEEEec Q lcl|Aclame:pro 358 ----DNE--IYGQY--LQAVLRF-GVSKVDDKAGYYVTFT 388 (394) Q Consensus 358 ----~~~--~~~~~--~r~~~r~-d~~v~~~~af~~l~~~ 388 (394) .++ ..... .-...|. |..+.+|-||++++.- T Consensus 297 ~~f~~lpvq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:78 297 EKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred hhhhccceeecCceeEeccccceeeeeeeccchheeeccC Confidence 000 00011 2344565 4477789999999988 No 223 >protein:vir:2736 Length: 348 # NCBI annotation: putative structural protein # Family: family:all:1083 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695109;genbank:gi:23455878;genbank:GeneID:955608 Probab=34.79 E-value=1.3 Score=20.01 Aligned_cols=260 Identities=7% Similarity=-0.040 Sum_probs=92.5 Q ss_pred cccCCccccchhHHhHHHHHHHhhhhhh--heeeeEeecCCceeEEEEecCCCc-ccccccccccccccccccceeeecH Q lcl|Aclame:pro 132 KKENAKPVSSEEILYTPAREVKTVVDLK--PFTTVYQAKKASGKYPVLQRATTK-MVTVAELEKNPALAKPDFKDVAWNI 208 (394) Q Consensus 132 ~~~~~~~lvP~~~~~~I~~~~~~~~~l~--~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~e~~~~~~~~~~~~~~v~~~~ 208 (394) -..-...+-|..+...|.++.....+++ .+.+...+....+........... +..+......+-.....+...++.+ T Consensus 1 M~~i~d~f~~~~l~~~v~~~~~~~~~~l~~~~Fp~~~~~~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~r~~~~~~~~~~ 80 (348) T protein:vir:27 1 MGLIYDKVTASNIAGYFNALQENVSSTLGESIFPARKQLGTKLSYIKGASGQSVALKAAAFDTNVTIRDRVSAEMHDEQM 80 (348) T ss_pred CcchhhhcCHHHHHHHHHhccchhhhhhHhhcCCCccccceeEEEEeeccCceeEeeeecCCCCcceecccceeeeeeec Confidence 0000112223333333333322222221 122222222222221111111111 1112111111111223344445555 Q ss_pred hhhhhhhhhhHHHHh------cc-HHH----HHHHHHH---HHHHHHHHHHHHHHhhcccccc----------------- Q lcl|Aclame:pro 209 DTYRGAIPLSQESID------DA-DVD----LVGIVSE---SISQIKVNTTNDAIAKVLKSFT----------------- 257 (394) Q Consensus 209 ~~~~~~~~vs~ell~------ds-~~~----l~~~i~~---~l~~~~~~~~~~a~~~g~~~~~----------------- 257 (394) -.+.-...++..-++ ++ ..+ +...|.+ .|.+.+.+.++..+...+.+|. T Consensus 81 p~i~~~~~i~~~d~~~~~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~~al~~Gki~i~~~~~~~~vdfg~~ 160 (348) T protein:vir:27 81 PFFKEAMLVKENDRQQLNLVKDSGNAVLVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDGVNKDIDYGVK 160 (348) T ss_pred CccccccccCHHHHHHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEEecCCeeEEEeecCC Confidence 444444444432211 11 011 1122222 2233344444443322221110 Q ss_pred ------------ccccccHHHHHHHHHhhhhhhcc-cEEEEcHHHHHHHHh---hhccC----Cce-eecccccCCCccc Q lcl|Aclame:pro 258 ------------TKTVKNLDEIKALLNGGFDPAYN-VSLIVSQSFYQTLDT---LKDGN----GRY-LLQDDITAVSGKV 316 (394) Q Consensus 258 ------------~~~~~~~~~i~~~~~~~~~~~~~-a~~vm~~~~~~~l~~---lkd~~----G~~-l~~~~~~~~~~~~ 316 (394) ..+..-+.+|.++...+-..... ..++|++.+|..|.. +++.- +.. ...+......-++ T Consensus 161 ~~~~~t~~~~W~~~~adp~~di~~~~~~~~~~G~~~~~ii~~~~~~~~l~~~~~v~~~~~~~~~~~~~i~~~~~~~~~~~ 240 (348) T protein:vir:27 161 PDHKKQVSKSWAEPGATPLADLEDAIETARELGLNPERAVMNAKTFGLIRKAASTVKVIKPLAGDGSAVTKAELENYIAD 240 (348) T ss_pred cccceeeeeccCCCCCCHHHHHHHHHHHHHhcCCcccEEEECHHHHHHHhcCHHHHHHhcccCccccccCHHHHHHHHHh Confidence 01111235555555433333333 468899999999864 33321 111 1111111111134 Q ss_pred ccccceEEecCccc----------ccCceEE-EeccccEEEE--e--e-----------------cceEEE-Eeeccccc Q lcl|Aclame:pro 317 LLGKPVFVLSDEVL----------GANKAFI-GDFKRGVLFA--D--R-----------------KDLGLR-WADNEIYG 363 (394) Q Consensus 317 l~G~pV~~~~~~~~----------~~~~~~~-gd~~~~~~~~--~--~-----------------~~~~i~-~~~~~~~~ 363 (394) +.|.+|++.+.... +++.+++ .+-..+.+.+ . . .++.+. +.+.+-.. T Consensus 241 ~~g~~i~~yd~~y~d~~G~~~~~~p~~~vvl~~~~~~G~~~yG~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dP~~ 320 (348) T protein:vir:27 241 NFGVSIVLENGTYRNDKGEVSKFYPDGHLTLIPNGPLGNTVFGTTPEESDLFADNTVNAEVEIVDNGIAVTTTKTTDPVN 320 (348) T ss_pred hcCceEEEEeeEEEcCCCcCcccccCCeEEEEcCCcceeEEeccCcchhhhhhccccccceeeeCCeeEEEeeecCCCce Confidence 66778877553221 2233333 2111121111 0 0 001111 01111112 Q ss_pred ceEEEEEEeccEEecccceEEEEecCcc Q lcl|Aclame:pro 364 QYLQAVLRFGVSKVDDKAGYYVTFTPEP 391 (394) Q Consensus 364 ~~~r~~~r~d~~v~~~~af~~l~~~~~~ 391 (394) ..+.+..+.=-.+.+|+++.++++.++. T Consensus 321 ~~~~~~s~~lPv~~~~~~~~~a~Vl~~~ 348 (348) T protein:vir:27 321 VQTKVSMVALPSFERLDDVYMLTVIPAV 348 (348) T ss_pred EEEEEeeeeeccccCCCcEEEEEEecCC Confidence 2234445555566779999999999999 No 224 >protein:vir:108295 Length: 711 # NCBI annotation: hypothetical protein # Family: family:all:487 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552284;genbank:gi:160700609;genbank:GeneID:5758811 Probab=34.49 E-value=1.3 Score=19.98 Aligned_cols=98 Identities=14% Similarity=0.198 Sum_probs=12.9 Q ss_pred ChHHHH--------------HHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHH--HHHHHHH Q lcl|Aclame:pro 1 MFEEKI--------------KEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAE--NDLKLYE 64 (394) Q Consensus 1 ~l~e~l--------------~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~--~~~~~~~ 64 (394) -|.+++ .+.++...+...+..+...+...+.......+++..+++.+.++.+.+..+ .++...+ T Consensus 598 el~e~lr~~~~~~~~~~~~~~~~qq~~~e~qq~~~~~q~~~~~~q~~~~qa~ae~~~Aqae~~qa~~e~~~~q~q~~~~~ 677 (711) T protein:vir:10 598 VIAERLKKIVPPNVLSKDEREAIEEDMPEQTEPTPEQQVEMAKSQADMAQAEADTAQAQADMLKAQLETEEAQKQLAMIE 677 (711) T ss_pred HHHHHHHhhcCcccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 111111 111111111111111100000000000000111122222222222222111 1111111 Q ss_pred HHHhhccccccccccccchhhhHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 65 SSVEVGGAENIGGKEVTQEEKTYRESVNDFIRSKGKIVNDS 105 (394) Q Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 105 (394) ......... ..............+.......... T Consensus 678 ~~aq~~~~~-------~qq~~~~l~~~qaelq~~q~~~~q~ 711 (711) T protein:vir:10 678 DMAQGGDVV-------YQQVRELVAQALAEITASQANVTEQ 711 (711) T ss_pred HHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHhhcC Confidence 110000000 0000010111111111111001100 No 225 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=30.84 E-value=1.5 Score=19.55 Aligned_cols=346 Identities=11% Similarity=0.052 Sum_probs=108.5 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEV 80 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~ 80 (394) =|++++.++.++++++.+++++...+.+........++++++.++++.+++..+..+....................... T Consensus 5 eL~~~~~~~~~~~~~l~e~~~~~~~~~~~~~~~~~~ee~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 84 (395) T protein:vir:38 5 QLKDAFDMAGQKVQDLEDKRAQFAIDLGNDASSHSVDDINKLNASLKNAKMAQELAKSAYEDARANLNAEPVNKKPLPVK 84 (395) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccchh Confidence 77899999999999999888877666555444555677888889999998887776666555444443333333222211 Q ss_pred cchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhc---ccccCCccccchhHHhHHHHHHHhhhh Q lcl|Aclame:pro 81 TQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDG---IKKENAKPVSSEEILYTPAREVKTVVD 157 (394) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~lvP~~~~~~I~~~~~~~~~ 157 (394) . ...........+.+........ . ......+ +.......++.......++..+-..-+ T Consensus 85 ~-~~~~~~~~~~~~~~~~~~~~~~-----~-------------~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~ 145 (395) T protein:vir:38 85 D-GKPDAQAMKNQFVKDFKNLVTS-----G-------------TTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVEN 145 (395) T ss_pred h-hhHHHHHHHHHHHHHHHHHHhh-----c-------------cCccCCCceecchhHhhHHHHHHHhhcchhhhcceee Confidence 1 1112222222222221111000 0 0000011 111111111111111122222211122 Q ss_pred hhheeee--EeecCCceeEEEEecCCCcccccccccccccccccccceeeecHhh-hh-hhhhhhHHHHhccHHHHHHHH Q lcl|Aclame:pro 158 LKPFTTV--YQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDT-YR-GAIPLSQESIDDADVDLVGIV 233 (394) Q Consensus 158 l~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~-~~-~~~~vs~ell~ds~~~l~~~i 233 (394) +...... +............ ........+.. .+......+....+...- +. .++.-|..-+.. + +...| T Consensus 146 ~~~~~~~~~~~~~~~~~~~a~~---v~E~~~~~~~~-~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~--~-i~~~l 218 (395) T protein:vir:38 146 VTTSHGSRVYEKLADITPLKDL---DDESALIGDND-DPELTVVKYLIHRYAGITTVTNTLLKDTVDNIIQ--W-LVNWA 218 (395) T ss_pred ccCCcceEEEEeeccCCccccc---ccccccccccc-ccceeeEEeeeeeeEeehhhHHHHHhhhHHHHHH--H-HHHHH Confidence 2211111 1111111111100 11112222211 111122222222211111 11 111112211111 1 33444 Q ss_pred HHHHHH----HHHHHHHHHHhhccccccccccccHH---------------------HHHHHHHhhhhhhcccEEEEcHH Q lcl|Aclame:pro 234 SESISQ----IKVNTTNDAIAKVLKSFTTKTVKNLD---------------------EIKALLNGGFDPAYNVSLIVSQS 288 (394) Q Consensus 234 ~~~l~~----~~~~~~~~a~~~g~~~~~~~~~~~~~---------------------~i~~~~~~~~~~~~~a~~vm~~~ 288 (394) .+.++. .+....+.. .+..+..+..... .....+..+.+. +..++..+. T Consensus 219 a~~~~~~~~~~il~g~g~~----~~~~~~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~--~G~~l~~~~ 292 (395) T protein:vir:38 219 AKKDVVTRNAKILEVMGKA----PKKPTISQFDNIKDLENNTLDPAIESTSSFITNQSGYNILSKVKDA--DGRYLMQPD 292 (395) T ss_pred HHHHHHHHHHHHhhccccc----ccccccccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcc--CCceeeccC Confidence 444444 443332221 1111111111111 112222222222 222333222 Q ss_pred HHH-HHHhhhccCCceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeec--------ceEEEEe-e Q lcl|Aclame:pro 289 FYQ-TLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRK--------DLGLRWA-D 358 (394) Q Consensus 289 ~~~-~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~--------~~~i~~~-~ 358 (394) ... .-.. =.|.|++..+- ...+..-.-.++++-+ -...++++|.....+-+.+. ...+... + T Consensus 293 ~~~~~~~~---l~G~pV~~~~~-~~~~~~~~~~~i~~gd----~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r 364 (395) T protein:vir:38 293 VTSPDKYL---IDGKPVIRIAD-KWLPDVSGSHPLYFGD----LKQGITLFDRQQMQIDTTNVGAGSFEHDTTKLRFIDR 364 (395) T ss_pred cCCCCcce---eccceeEEecc-cccCcCCCcceEEEEe----ccccEEEEEecceEEEEeccccchhhcCceEEEEEEe Confidence 100 0000 14556543210 0000000112222211 01223344433221111111 1111110 0 Q ss_pred cccccceEEEEEEeccEEecccceEEEEecCccCCC Q lcl|Aclame:pro 359 NEIYGQYLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) Q Consensus 359 ~~~~~~~~r~~~r~d~~v~~~~af~~l~~~~~~~~~ 394 (394) ...-...=.++..+.+... .-.+++|-. T Consensus 365 ~d~~~~~~~a~~~~~~~~~--------~~~~~~~~~ 392 (395) T protein:vir:38 365 FDVQLIDDGAFAAASFKTV--------ANQAQGTAG 392 (395) T ss_pred eccEEecccceEEEEeecc--------cCCCCCccC Confidence 0000000011111111111 122222222 No 226 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=29.50 E-value=1.7 Score=19.39 Aligned_cols=349 Identities=8% Similarity=-0.052 Sum_probs=70.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccch Q lcl|Aclame:pro 6 IKEIKATIADLNNTIVTKTAQVKNALESDD--LEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVTQE 83 (394) Q Consensus 6 l~eL~~~~~el~~~~~~~~~e~~~~~~~e~--~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (394) +.+|+++++ ++++++.++++...+... .+...+.+++++++.++++++++++++.+................ T Consensus 1 m~~l~~~l~---~~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~l~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~--- 74 (390) T protein:vir:81 1 MTDITSKLE---ATLANVTDSLRAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQ--- 74 (390) T ss_pred ChHHHHHHH---HHHHHHHHHHHHHHHHHHhhcCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc--- Confidence 444444333 333333333333222111 112234456666777777777666666554433222111111100 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCcc-ccchhHHhHHHHHHHhhhhhhhee Q lcl|Aclame:pro 84 EKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKP-VSSEEILYTPAREVKTVVDLKPFT 162 (394) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-lvP~~~~~~I~~~~~~~~~l~~~~ 162 (394) ...... ........+..........................+.. --...+-+.++..+-. .++... T Consensus 75 ~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~--~~~~~~ 141 (390) T protein:vir:81 75 HVSVGD-----------MFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFIT--PPDARL 141 (390) T ss_pred cccchh-----------hhhhhHHHHHHHHHHhhhhhhhhhHHHHHHHhhccccccCCcceechhhhHHHHH--HHhhhh Confidence 000000 00000000000000001111111111111111000000 0000111111111111 111211 Q ss_pred eeEeecCCceeEEEEecCCCccccccc-----cccccccc-ccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHH Q lcl|Aclame:pro 163 TVYQAKKASGKYPVLQRATTKMVTVAE-----LEKNPALA-KPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSES 236 (394) Q Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~~e-----~~~~~~~~-~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~ 236 (394) . +....-.+|+. ++...+... ......+. .......++....+....--..--+-+...+-...+... T Consensus 142 ~---l~~~~~~~~~~---~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~~~~~~ 215 (390) T protein:vir:81 142 T---VRDLIGSGRTD---SALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAPQLASY 215 (390) T ss_pred h---hhhhcceeecc---CCceEEEEEecCCcceeeecCCcccccccceeeEEEEeeeEEEEeehhhHHHHHhHHHHHHH Confidence 1 11111112211 111111110 01111110 001111111111111100000000001111111222223 Q ss_pred HHHHHHHHHHHHHhhccccccccccccHHHHHHHHHhhhhhhcccEEEEcHHHHHHHHhhhccCC---ceeeccc----- Q lcl|Aclame:pro 237 ISQIKVNTTNDAIAKVLKSFTTKTVKNLDEIKALLNGGFDPAYNVSLIVSQSFYQTLDTLKDGNG---RYLLQDD----- 308 (394) Q Consensus 237 l~~~~~~~~~~a~~~g~~~~~~~~~~~~~~i~~~~~~~~~~~~~a~~vm~~~~~~~l~~lkd~~G---~~l~~~~----- 308 (394) +...++.....++-...-.+..+ ......|.............+....-......+..+..... .+++.|. T Consensus 216 i~~~l~~~~~~~~d~a~l~G~g~-~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l 294 (390) T protein:vir:81 216 MNNRLIRGLKVKEDAEILRGTGA-NDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYNPSGIVINPIDWAAI 294 (390) T ss_pred HHHHHHHHHHHHHHHHHHhcCCC-CCcccceeecccccccccccccchhHHHHHHHHHhhccccCCCCEEEEcHHHHHHH Confidence 33333333333333322222111 11122221110000000000000000111112223322211 1222221 Q ss_pred --ccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecc-eEEEEeecccccceEEEEEEeccEEec---cc-- Q lcl|Aclame:pro 309 --ITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKD-LGLRWADNEIYGQYLQAVLRFGVSKVD---DK-- 380 (394) Q Consensus 309 --~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~-~~i~~~~~~~~~~~~r~~~r~d~~v~~---~~-- 380 (394) +.+.. |.|+...+ ..+....++|-. ++..+..+ -.+-+-+ |...+..+.|-+..+.. +. T Consensus 295 ~~lkd~~-----G~~l~~~~--~~~~~~~l~G~p---v~~~~~~p~~~~~~gd---~~~~~~~~~~~~~~v~~~~~~~~~ 361 (390) T protein:vir:81 295 ELAKDAN-----NQYLIGNA--RGTLTPTLWGLP---VVATQAMAPGEFLVGA---FDLAAQIFDQWDARVEIGYVGEDF 361 (390) T ss_pred HHhhcCC-----CceeecCc--ccccCceeccee---eEEcCCCCCCcEEEEe---hhceEEEEEecceEEEEecccchh Confidence 11111 23322111 011111122211 00000000 0000000 01111111111111110 00 Q ss_pred --------ceEEEEecCccCCC Q lcl|Aclame:pro 381 --------AGYYVTFTPEPLPL 394 (394) Q Consensus 381 --------af~~l~~~~~~~~~ 394 (394) ++..+.+... -|- T Consensus 362 ~~~~v~~r~~~r~d~~v~-~~~ 382 (390) T protein:vir:81 362 QRNMITVLAEERLALVVY-RPE 382 (390) T ss_pred hcCcEEEEEEEeeccEEe-ccc Confidence 1111111000 011 No 227 >protein:vir:63741 Length: 468 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547622;genbank:GeneID:3783474 Probab=27.92 E-value=1.4 Score=19.85 Aligned_cols=286 Identities=11% Similarity=-0.015 Sum_probs=99.6 Q ss_pred cccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHH Q lcl|Aclame:pro 73 ENIGGKEVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREV 152 (394) Q Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~ 152 (394) .+...++............+...++ +.+.. .....+..+++++--+.+.+.|..+. T Consensus 1 ~~~~~~~~~~~~~~~~~~~e~~~Ks-----------------------~~agy-~~~p~~q~~~~AlR~EsL~~~i~~L~ 56 (468) T protein:vir:63 1 MPKNNKEEEVKEVNLNSVQEDALKS-----------------------FTTGY-GITPDTQTDAGALRREFLDDQISMLT 56 (468) T ss_pred CCCCcchhhccccChhHHHHHHHHH-----------------------HHcCc-ccCCccccCcchhhhhhhhhhhheee Confidence 1111111000000000000111110 00000 00111223344444444444444332 Q ss_pred Hhhhh--hhheeeeEeecCCceeEEEEecCCC-cccccccccccccccccccceeeecHhhhhhhhhhhHHH-HhccHHH Q lcl|Aclame:pro 153 KTVVD--LKPFTTVYQAKKASGKYPVLQRATT-KMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQES-IDDADVD 228 (394) Q Consensus 153 ~~~~~--l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~el-l~ds~~~ 228 (394) ..... +..-+.+.+..+.-..+......+. +.+-+..|...+..+++.+...+..+|-++....+|.-+ +..+..+ T Consensus 57 ~~~~~f~~~~di~k~~a~stv~~y~~~~~~G~~g~~~f~~E~g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~i~d 136 (468) T protein:vir:63 57 WTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQD 136 (468) T ss_pred ecccchhhhhhcccchhhhhhhhheeeeccCccccccccccccccccCCCceEEEEEEeeeeeeeeeehhhhhhhcchhh Confidence 22222 2222334444444333433333332 333444455555679999999999999999988777642 2333445 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccccccc----ccccHHHHHHHHHhhhhhhc--cc--------------------- Q lcl|Aclame:pro 229 LVGIVSESISQIKVNTTNDAIAKVLKSFTTK----TVKNLDEIKALLNGGFDPAY--NV--------------------- 281 (394) Q Consensus 229 l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~----~~~~~~~i~~~~~~~~~~~~--~a--------------------- 281 (394) ..+...+.-.-.++.+++.+++.|+..-.+. -..-+|.+..+++. ..-+ +. T Consensus 137 ~~~~~~~~ai~~~a~tiE~a~FyGds~l~~s~~~~~glqfDGi~~li~~--enviDa~G~~ls~~~lneaa~~i~~gfG~ 214 (468) T protein:vir:63 137 PMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQ--DNVHDARGASLTESLLNQAAVMISKGYGT 214 (468) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhcccccccCCCccccccccceeEEecC--CceeccCCCccCHHHHHHHhhhccccccC Confidence 5666666666778889999999988765321 12334544433311 1111 11 Q ss_pred --EEEEcHHHHHHH-HhhhccCCceeecccccCCCcccccccceE--EecCcc-cccCceEEEeccccEEEEeecceEEE Q lcl|Aclame:pro 282 --SLIVSQSFYQTL-DTLKDGNGRYLLQDDITAVSGKVLLGKPVF--VLSDEV-LGANKAFIGDFKRGVLFADRKDLGLR 355 (394) Q Consensus 282 --~~vm~~~~~~~l-~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~--~~~~~~-~~~~~~~~gd~~~~~~~~~~~~~~i~ 355 (394) -++|+.-+.+.| ...- ..++.+.. +.......|+||- ++..-. .-.+..++||.. ++--++.+.... T Consensus 215 ~td~~~~~~v~a~~~~~~L--~~q~~v~~---~n~~~~~~G~~v~g~~sa~G~I~l~gs~il~~~~--~l~~~~~~~~~A 287 (468) T protein:vir:63 215 PTDAYMPVGVQADFVNQQL--SKQTQLVR---DNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQ--ILDERILALPTA 287 (468) T ss_pred hhhhhcchhHHhhhhhhhc--CceEEEEc---CCCCceeeeecccceecceeeeeecCceeecccc--CCCccccccccc Confidence 133333333322 1100 01111110 0111122333331 000000 000111222221 111111111110 Q ss_pred Eee------cccccceEEEEEEeccEEecccceEEE--EecCccCCC Q lcl|Aclame:pro 356 WAD------NEIYGQYLQAVLRFGVSKVDDKAGYYV--TFTPEPLPL 394 (394) Q Consensus 356 ~~~------~~~~~~~~r~~~r~d~~v~~~~af~~l--~~~~~~~~~ 394 (394) .+. +..... ...+-+..-.+.=+|+.+ ..-.+|+|+ T Consensus 288 psp~~vsaT~~~~~~---g~~~~~~~a~y~Y~v~~vs~~GES~pS~~ 331 (468) T protein:vir:63 288 PQPAKVTATQEAGKK---GQFRAEDLAAHEYKVVVSSDDAESIASEV 331 (468) T ss_pred ccCCccceeeecccC---CcccCCCcceEEEEEEEECCCCccccccc Confidence 000 000000 000000000011112222 222333333 No 228 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=27.33 E-value=1.9 Score=19.11 Aligned_cols=370 Identities=9% Similarity=0.019 Sum_probs=81.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhch--hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------H----H Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALES--DDLEAARSIKAEVEQAKANLVEAENDLKLYES-------S----V 67 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~--e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~-------~----~ 67 (394) -|+++|.++++++++..+++++..++......+ ...++++++.++++++++++++++...+.... . . T Consensus 5 elk~el~~~~~el~~~~~elr~~~~~~~~~~~el~~~~~e~~~~~~ei~el~~~l~~~~~~~~~~~e~~~~~~~~~~~e~ 84 (437) T protein:vir:10 5 KLKKDLATKTAELNTKKAEIRSFTESEDKTIDEVKAGMTEIKEKEDEIKEIRSNIEVLEQASALKVEEKRDDSDLVAPEL 84 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 455555556666655555555444443332211 11234455556666655555544332221110 0 0 Q ss_pred hhcccc---cccccc---ccch--------hhhHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHhhhhhhhhhhcc Q lcl|Aclame:pro 68 EVGGAE---NIGGKE---VTQE--------EKTYRESVNDFIRSKGKIV--NDSLRFEGKDEVLMPINETTPVEPQKDGI 131 (394) Q Consensus 68 ~~~~~~---~~~~~~---~~~~--------~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 131 (394) +..... ...... .... ................... ............................. T Consensus 85 ~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~ 164 (437) T protein:vir:10 85 EENSADNEEDDPEKLKTETKSEAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIADKKVTAFADYLKTGEVRDVTGIALKDG 164 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHHhhhhhhHHHHHhhhhhhhhhcccccc Confidence 000000 000000 0000 0000000000000000000 00000111111000000000000000000 Q ss_pred cccCCccccchhHHhH-HHHHHHhh---hhhhheeeeEeecCCceeEEEEecCCCcccccccccccccccccccce---- Q lcl|Aclame:pro 132 KKENAKPVSSEEILYT-PAREVKTV---VDLKPFTTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKD---- 203 (394) Q Consensus 132 ~~~~~~~lvP~~~~~~-I~~~~~~~---~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~---- 203 (394) ...... .+...+... -...++.. .++......+++............. .....+. ..+.....+|.. T Consensus 165 g~lvp~-~~~~~i~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e---~~~~~e~-~~~~~~~v~~~~~k~~ 239 (437) T protein:vir:10 165 KVIIPE-TILTPEKEVHQFPRLGSLVRTESVTTTTGKLPIFNNSTDLLTAHTE---YGQTTKN-ATPVITPILWDLKTYT 239 (437) T ss_pred cccchH-HHHHHHHHhhhhhhhhhcceeEeeccCceeeEEeeccccccccccc---ccccccc-ccccceeeeeehhhee Confidence 000000 011111110 01111111 1111111122222111100000000 1111111 111111111111 Q ss_pred --eeecHhhhhhh-hhhhHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccH---------------- Q lcl|Aclame:pro 204 --VAWNIDTYRGA-IPLSQESIDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTKTVKNL---------------- 264 (394) Q Consensus 204 --v~~~~~~~~~~-~~vs~ell~ds~~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~~~~~---------------- 264 (394) +.++-..+... +.+...+.+ .|..-+...+..++..+.+.+...+.++.+....... T Consensus 240 ~~~~is~ell~ds~~~~~~~i~~----~l~~~~~~~~~~~i~~g~g~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~ 315 (437) T protein:vir:10 240 GGYVFSQELISDSSYDWQAELQS----RLIELRDNTDDSLIITALTDGIKKTTSTYLLGDLKKVLNVTLKPQDSAAASIV 315 (437) T ss_pred eehhhhHHHHhhhHHHHHHHHHH----HHHHHHHHHHHHHHhhhhcccccccccccchhhHHHHHHhhhhhhhhcCCEEE Confidence 12222211111 111111111 1222333333444445444443333333222221110 Q ss_pred --HHHHHHHHhhhhhhcccEEEEcHHHHHHHHhhhccCCceeecc-cccCCCcccccccceEEecCcccccCceEEEecc Q lcl|Aclame:pro 265 --DEIKALLNGGFDPAYNVSLIVSQSFYQTLDTLKDGNGRYLLQD-DITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFK 341 (394) Q Consensus 265 --~~i~~~~~~~~~~~~~a~~vm~~~~~~~l~~lkd~~G~~l~~~-~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~ 341 (394) ......+..+.+. +..+++.|..-..- -..=.|+|+... +...+.. .--..++++-+- +..++++|-. T Consensus 316 ~~~~~~~~l~~lkd~--~g~~~~~~~~~~~~--~~~l~G~pv~~~~~~~~~~~-~~~~~~~~~gd~----~~~~~~~~r~ 386 (437) T protein:vir:10 316 MSQSAYNLFDMATDA--MGRPLLQPNVTAAT--GYTLLGKTVVIVDDKLFPSA-SAGDVNIVVAPL----KKAVINFKLT 386 (437) T ss_pred EcHHHHHHHHHhhcc--CCCeeeccCccCCC--CcccccceeEEecccccCCc-CCCceEEEEeec----cccEEEEeee Confidence 1111122222221 23334333210000 000145665421 1100000 001122222110 0111222211 Q ss_pred ccEEEEeecceEEEEeecccccceEEE----EEEeccEEecccceEEEEecCccC Q lcl|Aclame:pro 342 RGVLFADRKDLGLRWADNEIYGQYLQA----VLRFGVSKVDDKAGYYVTFTPEPL 392 (394) Q Consensus 342 ~~~~~~~~~~~~i~~~~~~~~~~~~r~----~~r~d~~v~~~~af~~l~~~~~~~ 392 (394) .+.+-...... .....+.-.+|+ ..--.+..+-.+.-+..+..+|+. T Consensus 387 -~~~~~~~~~~~---~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~~~~~~~~~~~ 437 (437) T protein:vir:10 387 -EITGQFQDTYD---IWYKQLGIFLRQNVVQASKDLIVNLTGKLKAVTVVQSTAV 437 (437) T ss_pred -ceEEEEecccc---cccceeeEEEEEccEEecccceEEEEeeccccccCCCCCC Confidence 11110000000 000011001111 110111111112222222222222 No 229 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=26.19 E-value=2 Score=18.97 Aligned_cols=332 Identities=12% Similarity=0.078 Sum_probs=94.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHhchh----hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVKNALESD----DLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIG 76 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~~~~~~e----~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~ 76 (394) =+++++++|+++++++.++.+++.+.++....++ ..+++++++.+++.++++++++++....+............. T Consensus 14 el~~~l~eL~e~~~~l~~~~~el~~~~ee~~~~e~~~~~~~~~~~l~~~i~~l~~~i~~~~~~~~~l~~~~~~~~~~~~~ 93 (397) T protein:vir:96 14 ERSSEIDKLLSQRSDLEKQENDLERALEEAKTDEEISTVSDSADDLEKQVKDLDEKIAELQKEKQDLEDELAKAADPTDQ 93 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhh Confidence 4567788999988888888888776665433322 345677888999999999998887777665544332222111 Q ss_pred cccc-cchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhh Q lcl|Aclame:pro 77 GKEV-TQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTV 155 (394) Q Consensus 77 ~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~ 155 (394) .... ............................... ... ... ................++...-...+...... T Consensus 94 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~---~~~-~~~~~~~~vp~~~~~~i~~~~~~~~l~~~~~~- 167 (397) T protein:vir:96 94 KPKDGEKRKMKKFKVTEEELAEKRSAINAFVKSKGA-EKR---DGF-TSVEGGALIPQELLQPQLEPKDIVDLSKYVRS- 167 (397) T ss_pred hhHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHhhhh-hhh---hcc-cccccccchhHHHHHHHHHhhhhhhHHHhhhh- Confidence 1111 0000011111111111111111111000000 000 000 00000000000000011100001111111111 Q ss_pred hhhhhe---eeeEeecCCceeEEEEecCCCcccccccccccccccccccce------eeecHhhhhhhhhhhHHHHhccH Q lcl|Aclame:pro 156 VDLKPF---TTVYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKD------VAWNIDTYRGAIPLSQESIDDAD 226 (394) Q Consensus 156 ~~l~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~------v~~~~~~~~~~~~vs~ell~ds~ 226 (394) .++... ..+....++...+.. .. ....+. ..+.....++.. +.++-.-+-. |..-+. . T Consensus 168 ~~~~~~~~~~~~~~~~~~~~~~~~--E~----~~~~~~-~~~~~~~i~~~~~~~~~~~~~s~ell~d----s~~~l~--~ 234 (397) T protein:vir:96 168 VPVNSASGKFPVISKSGSKMATVQ--QL----EKNPQL-ANPKMVEIDYSVATRRGYIPISQEMIDD----ASYDVT--G 234 (397) T ss_pred ccccccceeEEEEeccCCcccccc--cc----cccccc-ccccccceeecHhHhhcchhhHHHHHhh----hHHHHH--H Confidence 111111 111112122211111 11 111111 111111111111 1121111111 110011 1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccH-------------------HHHHHHHHhhhhhhcccEEEEcH Q lcl|Aclame:pro 227 VDLVGIVSESISQIKVNTTNDAIAKVLKSFTTKTVKNL-------------------DEIKALLNGGFDPAYNVSLIVSQ 287 (394) Q Consensus 227 ~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~~~~~-------------------~~i~~~~~~~~~~~~~a~~vm~~ 287 (394) + +...+.+.++......+-...-.+. ..+..+.... ......+..+.+. +..+++.| T Consensus 235 ~-i~~~l~~~~~~~~~~~i~~g~g~~~-~~~~~~~d~~~~~~~~~~~~~~~a~~v~n~~~~~~l~~lkd~--~G~~~~~~ 310 (397) T protein:vir:96 235 L-IADEIQDQSLNTKNADIAAVLKTAT-AKSVVGVDGLKDLINKEIKKVYDVKLFISASMYSELDKLKDK--NGRYLLQD 310 (397) T ss_pred H-HHHHHHHHHHHHHHHHHhhcccccc-cccccchHHHHHHHHHhhhhhcCcEEEEcHHHHHHHHHhhcc--CCCeEecc Confidence 1 3333444443333222211111111 1111111111 1112222222222 22344333 Q ss_pred HHHHHHHhhhcc-----CCceeecccccCCCccccccc-ceEEecCcccccCceEEEeccc-------------cEEEEe Q lcl|Aclame:pro 288 SFYQTLDTLKDG-----NGRYLLQDDITAVSGKVLLGK-PVFVLSDEVLGANKAFIGDFKR-------------GVLFAD 348 (394) Q Consensus 288 ~~~~~l~~lkd~-----~G~~l~~~~~~~~~~~~l~G~-pV~~~~~~~~~~~~~~~gd~~~-------------~~~~~~ 348 (394) .. .+. .|.|+...+ ...++.-.|- ++++-+ -+..+.++|... ++..+. T Consensus 311 ~~-------~~~~~~~l~G~pv~~~~--~~~~~~~~~~~~~~~gd----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 377 (397) T protein:vir:96 311 SI-------TAASGKQLLGKEVVVLD--DDVIGKSVGNVVGFIGD----AKAFASFFDRKQVSVSWVDNNIYGQLLAGII 377 (397) T ss_pred Cc-------cCCCcccccccceEEec--ccccCCCCCceEEEEee----hhcceEeEeecceEEEEecccccceeEEEEE Confidence 21 122 355553211 0000110111 111111 000112233221 111111 Q ss_pred ecceEEEEeecccccceEEEEEEeccEEecccc Q lcl|Aclame:pro 349 RKDLGLRWADNEIYGQYLQAVLRFGVSKVDDKA 381 (394) Q Consensus 349 ~~~~~i~~~~~~~~~~~~r~~~r~d~~v~~~~a 381 (394) |-|..+. + ...+ ..+-+ +.| T Consensus 378 r~d~~~~--~----~~a~---~~~~~----~~a 397 (397) T protein:vir:96 378 RYDVKAT--D----KKAG---FYVTF----TIG 397 (397) T ss_pred EEccEEe--c----ccce---EEEEe----ecC Confidence 1111111 0 0111 11111 111 No 230 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=25.63 E-value=2 Score=18.89 Aligned_cols=348 Identities=9% Similarity=-0.026 Sum_probs=88.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccchhhhHHH Q lcl|Aclame:pro 10 KATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVTQEEKTYRE 89 (394) Q Consensus 10 ~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 89 (394) ..++++|+++++++.++.+.+.++. .++.+++.++.+.++++++++.+.++..+...+...... .... . ... T Consensus 1 M~~l~el~~~~~~~~~e~~~l~~~~-~~e~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~-~----~~~ 72 (385) T protein:vir:18 1 MSELALIQKAIEESQQKMTQLFDAQ-KAEIESTGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKL--ASGA-E----NPG 72 (385) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--hccc-c----ccc Confidence 5568889999988888888765443 345556666666666666666555554433322111000 0000 0 000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccc-cchhHHhHHHHHHHhhhhhhheeeeEeec Q lcl|Aclame:pro 90 SVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPV-SSEEILYTPAREVKTVVDLKPFTTVYQAK 168 (394) Q Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-vP~~~~~~I~~~~~~~~~l~~~~~~~~~~ 168 (394) ... ................................+.. ....+.+.+... .+..+...-++. T Consensus 73 ~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~-----ii~~~~~~~~l~ 135 (385) T protein:vir:18 73 EKK------------SFSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPG-----IIMPGLRRLTIR 135 (385) T ss_pred hhh------------hhHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchhhhH-----HHHHhhhccchh Confidence 000 00000001111111111000000000000000000 000111111111 111111111121 Q ss_pred CCceeEEEEecCCCccccccc-----cccccccc-ccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 169 KASGKYPVLQRATTKMVTVAE-----LEKNPALA-KPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKV 242 (394) Q Consensus 169 ~~~~~~~~~~~~~~~~~~~~e-----~~~~~~~~-~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~ 242 (394) ...-.+|+ .++....... .+....+. .......++....+........--+-+...+-...+...|...++ T Consensus 136 ~~~~~~~~---~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~~~l~~~i~~~la 212 (385) T protein:vir:18 136 DLLAQGRT---SSNALEYVREEVFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQVMDDAPMLQSYINNRLM 212 (385) T ss_pred hhcceecc---cCcceEEEEEecCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHHHHhhHHHHHHHHHHHHH Confidence 11111222 1111111111 11111111 111111222222111110000000111111111223444444445 Q ss_pred HHHHHHHhhccccccccccccHHHHHHHHHhhhhhhcccEEEEcHHHHHHHHhhhccCCc---eeecccccCC--Ccccc Q lcl|Aclame:pro 243 NTTNDAIAKVLKSFTTKTVKNLDEIKALLNGGFDPAYNVSLIVSQSFYQTLDTLKDGNGR---YLLQDDITAV--SGKVL 317 (394) Q Consensus 243 ~~~~~a~~~g~~~~~~~~~~~~~~i~~~~~~~~~~~~~a~~vm~~~~~~~l~~lkd~~G~---~l~~~~~~~~--~~~~l 317 (394) .+....+-...=.+...+ .....|.............+....-......+..+....+. +++.|..... .-.-- T Consensus 213 ~a~~~~~d~~~l~G~g~~-~~~~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~ 291 (385) T protein:vir:18 213 YGLALKEEGQLLNGDGTG-DNLEGLNKVATAYDTSLNATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIALLKDN 291 (385) T ss_pred HHHHHHHHHHHHhccCCC-CcccccccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcC Confidence 555544443333332221 22222222111100000001111111111223344433222 3332211000 00001 Q ss_pred cccceEEecCcccccCceEEEeccccEEEEeecc-eEEEEeecccccceEEEEEEeccEEe--c-------cc--ceE-- Q lcl|Aclame:pro 318 LGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKD-LGLRWADNEIYGQYLQAVLRFGVSKV--D-------DK--AGY-- 383 (394) Q Consensus 318 ~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~-~~i~~~~~~~~~~~~r~~~r~d~~v~--~-------~~--af~-- 383 (394) .|.|+...+ ..+....++|-. ++..+..+ -.+-+-+ |..++....|-|..+. + -+ +|. T Consensus 292 ~G~~l~~~~--~~~~~~~l~G~p---V~~~~~~p~~~~~~gd---~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~ 363 (385) T protein:vir:18 292 EGRYIFGGP--QAFTSNIMWGLP---VVPTKAQAAGTFTVGG---FDMASQVWDRMDATVEVSREDRDNFVKNMLTILCE 363 (385) T ss_pred CCceeccCc--ccCCCceeccee---eEEcCcCCCCcEEEee---cccEEEEEEecceEEEEeccccchhhcCcEEEEEE Confidence 244442211 111122233311 11101000 0011111 1111212212111111 0 00 110 Q ss_pred -EEEecC-ccCCC Q lcl|Aclame:pro 384 -YVTFTP-EPLPL 394 (394) Q Consensus 384 -~l~~~~-~~~~~ 394 (394) .+.+.. -|..+ T Consensus 364 ~r~~~~v~~~~a~ 376 (385) T protein:vir:18 364 ERLALAHYRPTAI 376 (385) T ss_pred EeeccEEecccce Confidence 011100 01111 No 231 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=25.63 E-value=2 Score=18.89 Aligned_cols=348 Identities=9% Similarity=-0.026 Sum_probs=88.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccchhhhHHH Q lcl|Aclame:pro 10 KATIADLNNTIVTKTAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVTQEEKTYRE 89 (394) Q Consensus 10 ~~~~~el~~~~~~~~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 89 (394) ..++++|+++++++.++.+.+.++. .++.+++.++.+.++++++++.+.++..+...+...... .... . ... T Consensus 1 M~~l~el~~~~~~~~~e~~~l~~~~-~~e~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~-~----~~~ 72 (385) T protein:vir:19 1 MSELALIQKAIEESQQKMTQLFDAQ-KAEIESTGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKL--ASGA-E----NPG 72 (385) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh--hccc-c----ccc Confidence 5568889999988888888765443 345556666666666666666555554433322111000 0000 0 000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccc-cchhHHhHHHHHHHhhhhhhheeeeEeec Q lcl|Aclame:pro 90 SVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPV-SSEEILYTPAREVKTVVDLKPFTTVYQAK 168 (394) Q Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-vP~~~~~~I~~~~~~~~~l~~~~~~~~~~ 168 (394) ... ................................+.. ....+.+.+... .+..+...-++. T Consensus 73 ~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~-----ii~~~~~~~~l~ 135 (385) T protein:vir:19 73 EKK------------SFSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPG-----IIMPGLRRLTIR 135 (385) T ss_pred hhh------------hhHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchhhhH-----HHHHhhhccchh Confidence 000 00000001111111111000000000000000000 000111111111 111111111121 Q ss_pred CCceeEEEEecCCCccccccc-----cccccccc-ccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 169 KASGKYPVLQRATTKMVTVAE-----LEKNPALA-KPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSESISQIKV 242 (394) Q Consensus 169 ~~~~~~~~~~~~~~~~~~~~e-----~~~~~~~~-~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~l~~~~~ 242 (394) ...-.+|+ .++....... .+....+. .......++....+........--+-+...+-...+...|...++ T Consensus 136 ~~~~~~~~---~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~~~l~~~i~~~la 212 (385) T protein:vir:19 136 DLLAQGRT---SSNALEYVREEVFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQVMDDAPMLQSYINNRLM 212 (385) T ss_pred hhcceecc---cCcceEEEEEecCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHHHHhhHHHHHHHHHHHHH Confidence 11111222 1111111111 11111111 111111222222111110000000111111111223444444445 Q ss_pred HHHHHHHhhccccccccccccHHHHHHHHHhhhhhhcccEEEEcHHHHHHHHhhhccCCc---eeecccccCC--Ccccc Q lcl|Aclame:pro 243 NTTNDAIAKVLKSFTTKTVKNLDEIKALLNGGFDPAYNVSLIVSQSFYQTLDTLKDGNGR---YLLQDDITAV--SGKVL 317 (394) Q Consensus 243 ~~~~~a~~~g~~~~~~~~~~~~~~i~~~~~~~~~~~~~a~~vm~~~~~~~l~~lkd~~G~---~l~~~~~~~~--~~~~l 317 (394) .+....+-...=.+...+ .....|.............+....-......+..+....+. +++.|..... .-.-- T Consensus 213 ~a~~~~~d~~~l~G~g~~-~~~~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lkd~ 291 (385) T protein:vir:19 213 YGLALKEEGQLLNGDGTG-DNLEGLNKVATAYDTSLNATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIALLKDN 291 (385) T ss_pred HHHHHHHHHHHHhccCCC-CcccccccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhcC Confidence 555544443333332221 22222222111100000001111111111223344433222 3332211000 00001 Q ss_pred cccceEEecCcccccCceEEEeccccEEEEeecc-eEEEEeecccccceEEEEEEeccEEe--c-------cc--ceE-- Q lcl|Aclame:pro 318 LGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKD-LGLRWADNEIYGQYLQAVLRFGVSKV--D-------DK--AGY-- 383 (394) Q Consensus 318 ~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~-~~i~~~~~~~~~~~~r~~~r~d~~v~--~-------~~--af~-- 383 (394) .|.|+...+ ..+....++|-. ++..+..+ -.+-+-+ |..++....|-|..+. + -+ +|. T Consensus 292 ~G~~l~~~~--~~~~~~~l~G~p---V~~~~~~p~~~~~~gd---~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~ 363 (385) T protein:vir:19 292 EGRYIFGGP--QAFTSNIMWGLP---VVPTKAQAAGTFTVGG---FDMASQVWDRMDATVEVSREDRDNFVKNMLTILCE 363 (385) T ss_pred CCceeccCc--ccCCCceeccee---eEEcCcCCCCcEEEee---cccEEEEEEecceEEEEeccccchhhcCcEEEEEE Confidence 244442211 111122233311 11101000 0011111 1111212212111111 0 00 110 Q ss_pred -EEEecC-ccCCC Q lcl|Aclame:pro 384 -YVTFTP-EPLPL 394 (394) Q Consensus 384 -~l~~~~-~~~~~ 394 (394) .+.+.. -|..+ T Consensus 364 ~r~~~~v~~~~a~ 376 (385) T protein:vir:19 364 ERLALAHYRPTAI 376 (385) T ss_pred EeeccEEecccce Confidence 011100 01111 No 232 >protein:vir:80491 Length: 467 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468466;genbank:gi:157325041;genbank:GeneID:5601449 Probab=25.26 E-value=1.8 Score=19.23 Aligned_cols=284 Identities=11% Similarity=-0.015 Sum_probs=99.8 Q ss_pred ccccccccccchhh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHH Q lcl|Aclame:pro 72 AENIGGKEVTQEEK-TYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAR 150 (394) Q Consensus 72 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~ 150 (394) .....+....++.. ...+. ..+ .+.+.. .....+..+++++--+.+.+.|.. T Consensus 1 ~~~~~~~~~~~~n~~~~~e~---~~K-----------------------s~~agy-~~~p~tq~~~~AlR~EsL~~~i~~ 53 (467) T protein:vir:80 1 MPKNNKEEVKEVNLNSVQED---ALK-----------------------SFTTGY-GITPDTQTDAGALRREFLDDQISM 53 (467) T ss_pred CCCcchhhhhhcccccCHHH---HHH-----------------------HHHccc-ccCCccccCcchhhhhhhhhhhhe Confidence 00000000000000 00000 000 000000 001112233444544555555544 Q ss_pred HHHhhhh--hhheeeeEeecCCceeEEEEecCCC-cccccccccccccccccccceeeecHhhhhhhhhhhHHH-HhccH Q lcl|Aclame:pro 151 EVKTVVD--LKPFTTVYQAKKASGKYPVLQRATT-KMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQES-IDDAD 226 (394) Q Consensus 151 ~~~~~~~--l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~el-l~ds~ 226 (394) +...... +..-+.+.+..+.-.++......+. +.+-+..|...+..+++.+...+..+|-++....+|.-+ +..+. T Consensus 54 Lt~~~~~f~~~~di~k~~a~stv~~y~~~~~~G~~g~~~f~~E~g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~i 133 (467) T protein:vir:80 54 LTWTENDLTFYKDIAKKPATSTVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNI 133 (467) T ss_pred eeccccchhhhhhcccchhhhhhhhheeeeccCccccccccccccccccCCCceEEEEEEeeeeeeeeeehhhhhhhcch Confidence 3322222 2222334444444334433333332 333444455555679999999999999999988777642 23334 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcccccccc----ccccHHHHHHHHHhhhhhhc--cc------------------- Q lcl|Aclame:pro 227 VDLVGIVSESISQIKVNTTNDAIAKVLKSFTTK----TVKNLDEIKALLNGGFDPAY--NV------------------- 281 (394) Q Consensus 227 ~~l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~----~~~~~~~i~~~~~~~~~~~~--~a------------------- 281 (394) .+..+...+.-.-.++.+++.+++.|+..-.+. -..-+|.+..+++. ..-+ +. T Consensus 134 ~d~~~~~~~~ai~~~a~tiE~a~FyGds~l~~s~~~~~glqfDGi~~li~~--enviDa~G~~ls~~~lneaa~~i~~gf 211 (467) T protein:vir:80 134 QDPMQILTDDAIVNIAKTIEWASFFGDSDLSDSPEPQAGLEFDGLAKLINQ--DNVHDARGASLTESLLNQAAVMISKGY 211 (467) T ss_pred hhHHHHHHHHHHHHHHHHHHHHhhhcccccccCCCccccccccceeEEecC--CceeccCCCccCHHHHHHHhhhccccc Confidence 456666666666778889999999988765321 12334554433311 1111 11 Q ss_pred ----EEEEcHHHHHHH-HhhhccCCceeecccccCCCcccccccceE--EecCcc-cccCceEEEeccccEEEEeecceE Q lcl|Aclame:pro 282 ----SLIVSQSFYQTL-DTLKDGNGRYLLQDDITAVSGKVLLGKPVF--VLSDEV-LGANKAFIGDFKRGVLFADRKDLG 353 (394) Q Consensus 282 ----~~vm~~~~~~~l-~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~--~~~~~~-~~~~~~~~gd~~~~~~~~~~~~~~ 353 (394) -++|+.-+.+.+ ...- ..++.+.. +.......|+||- +...-. .-.+..++||.. ++--++.+.. T Consensus 212 G~~td~~~p~~v~a~~~~~~L--~~q~~v~~---~n~~~~~~G~~v~g~~sa~G~I~l~gs~il~~~~--~l~~~~~~~~ 284 (467) T protein:vir:80 212 GTPTDAYMPVGVQADFVNQQL--SKQTQLVR---DNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQ--ILDERILALP 284 (467) T ss_pred cChhhhhcchhHHhhhhhhhc--CceEEEEc---CCCCceeeeecccceecceeeeeecCceeecccc--CCCccccccc Confidence 133333333322 1100 01111110 0111122333331 000000 000111222221 1111111111 Q ss_pred EEEee------cccccceEEEEEEeccEEecccceEEE--EecCccCCC Q lcl|Aclame:pro 354 LRWAD------NEIYGQYLQAVLRFGVSKVDDKAGYYV--TFTPEPLPL 394 (394) Q Consensus 354 i~~~~------~~~~~~~~r~~~r~d~~v~~~~af~~l--~~~~~~~~~ 394 (394) ...+. +..... ...+-+..-.+.=+|+.+ ..-.+|+|+ T Consensus 285 ~Apsp~~vsaT~~~~~~---g~~~~~~~a~y~Y~v~~vs~~GES~pS~~ 330 (467) T protein:vir:80 285 TAPQPAKVTATQEAGKK---GQFRAEDLAAHEYKVVVSSDDAESIASEV 330 (467) T ss_pred ccccCCccceeeecccC---CcccCCCcceEEEEEEEECCCCccccccc Confidence 10000 000000 000000000011112222 222333333 No 233 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=25.07 E-value=2.1 Score=18.82 Aligned_cols=344 Identities=8% Similarity=-0.043 Sum_probs=74.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccch Q lcl|Aclame:pro 6 IKEIKATIADLNNTIVTKTAQVKNALESDD--LEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVTQE 83 (394) Q Consensus 6 l~eL~~~~~el~~~~~~~~~e~~~~~~~e~--~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (394) ++|+. ++|+++++++.++++...+... .+..++.+++++++.++++++++++++.++..+.......... .. T Consensus 1 m~e~~---~~l~~~~~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~---~~ 74 (390) T protein:vir:10 1 MTDIT---SKLEATLANVTDSLRAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGD---VQ 74 (390) T ss_pred ChHHH---HHHHHHHHHHHHHHHHHHHHHHhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc---cc Confidence 33333 3344444444444443322211 1122345566777777777777776665554333222111111 00 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccc-cCCccccchhHHhHHHHHHHhhhhhhhee Q lcl|Aclame:pro 84 EKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKK-ENAKPVSSEEILYTPAREVKTVVDLKPFT 162 (394) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~lvP~~~~~~I~~~~~~~~~l~~~~ 162 (394) ...... ................................. ..+...-...+-+.++..+- ..++... T Consensus 75 ~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii--~~~~~~~ 141 (390) T protein:vir:10 75 HVSVGD-----------LFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFI--TQPDARL 141 (390) T ss_pred ccchhh-----------hhhhhHHHHHHHHhhhhhhhhhhhHHHHHHHhhhcccccccccccchhHHHHHH--HHHHhhc Confidence 000000 000000000000000001111111111111111 11111111112222222111 1122211 Q ss_pred eeEeecCCceeEEEEecCCCcccccc-----cccccccccc-cccceeeecHhhhhhhhhhhH-HHHhccHHHHHHHHHH Q lcl|Aclame:pro 163 TVYQAKKASGKYPVLQRATTKMVTVA-----ELEKNPALAK-PDFKDVAWNIDTYRGAIPLSQ-ESIDDADVDLVGIVSE 235 (394) Q Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~~-----e~~~~~~~~~-~~~~~v~~~~~~~~~~~~vs~-ell~ds~~~l~~~i~~ 235 (394) . +...--.+|+. ++...+.. .......+.. ..-...++....+... +++- --+-+...+-...+.. T Consensus 142 ~---l~~~~~~~~~~---~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~-k~~~~~~is~ell~d~~~l~~ 214 (390) T protein:vir:10 142 T---VRDLIGSGRTD---SALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTH-VIAHTMKATRQILSDAPQLAS 214 (390) T ss_pred h---hhhhcceeecc---CCceEEEEEecCCcceeeecCCccccccccceeEEEEeeE-EEEEeehhhHHHHHhHHHHHH Confidence 1 11111112211 11111111 1111111110 0001111111111111 0000 0000111111112233 Q ss_pred HHHHHHHHHHHHHHhhccccccccccccHHHHHHHHHhhhhhhcccEEEEcHHHHHHHHhhhccCCc---eeeccc---- Q lcl|Aclame:pro 236 SISQIKVNTTNDAIAKVLKSFTTKTVKNLDEIKALLNGGFDPAYNVSLIVSQSFYQTLDTLKDGNGR---YLLQDD---- 308 (394) Q Consensus 236 ~l~~~~~~~~~~a~~~g~~~~~~~~~~~~~~i~~~~~~~~~~~~~a~~vm~~~~~~~l~~lkd~~G~---~l~~~~---- 308 (394) .+...++......+-...=.+..++ .....|.............+....-......+..+.+.... .++.|. T Consensus 215 ~i~~~l~~~~~~~~~~~il~G~G~~-~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~ 293 (390) T protein:vir:10 215 YMNNRLIRGLKVKEDAEILRGTGAN-DGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAA 293 (390) T ss_pred HHHHHHHHHHHHHHHHHHhhcCCCC-ccccccccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHH Confidence 3333333333333333222222111 11222221110000000001111111112222333332111 122221 Q ss_pred ---ccCCCcccccccceEEecCccccc----CceEEEec--cccEEEEeecceEEEEeecccccceEEEEEEeccEEe-- Q lcl|Aclame:pro 309 ---ITAVSGKVLLGKPVFVLSDEVLGA----NKAFIGDF--KRGVLFADRKDLGLRWADNEIYGQYLQAVLRFGVSKV-- 377 (394) Q Consensus 309 ---~~~~~~~~l~G~pV~~~~~~~~~~----~~~~~gd~--~~~~~~~~~~~~~i~~~~~~~~~~~~r~~~r~d~~v~-- 377 (394) +.+.. |.|+...+....+. .++++-++ ..-++++|.. .++..+.|-+..+. T Consensus 294 L~~lkd~~-----g~~l~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gdf~-------------~~~~~~~~~~~~i~~~ 355 (390) T protein:vir:10 294 IELAKDAN-----NQYLIGNARGTLTPTLWGLPVVATQAMAPGEFLVGAFD-------------LAAQIFDQWDARVEIG 355 (390) T ss_pred HHHhhcCC-----CceeecCCcCcCCceecceeeEEcCCCCCCcEEEEecc-------------ceEEEEEecceEEEEe Confidence 11111 22322111000000 01111111 0001111111 11111111111110 Q ss_pred --------cccceEE-EEecCcc-CCC Q lcl|Aclame:pro 378 --------DDKAGYY-VTFTPEP-LPL 394 (394) Q Consensus 378 --------~~~af~~-l~~~~~~-~~~ 394 (394) +--+|.. ..+.-.+ -|- T Consensus 356 ~~~~~~~~~~~~~r~~~r~d~~v~~~~ 382 (390) T protein:vir:10 356 YVNDDFQRNMVTVLAEERLALVVYRPE 382 (390) T ss_pred ecccccccCcEEEEEEEeeccEEeccc Confidence 0001100 0011000 000 No 234 >protein:vir:94870 Length: 318 # NCBI annotation: putative structural protein # Family: family:all:2417 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762518;genbank:gi:115304217;genbank:GeneID:5141183 Probab=22.34 E-value=2.5 Score=18.44 Aligned_cols=286 Identities=13% Similarity=0.061 Sum_probs=120.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHhhhhhhheeeeEee Q lcl|Aclame:pro 88 RESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQA 167 (394) Q Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~~~~l~~~~~~~~~ 167 (394) ...|........+.++-.....+..+....+..... -.|++-.....-.|..+...|-..+-..+|++....+-.+ T Consensus 1 mtnfiesqnavteffdvlkknsgkseiknawnakla----engvtitdttfqlprklvesintallntnpvfkvfhvtnv 76 (318) T protein:vir:94 1 MTNFIESQNAVTEFFDVLKKNSGKSEIKNAWNAKLA----ENGVTITDTTFQLPRKLVESINTALLNTNPVFKVFHVTNV 76 (318) T ss_pred CccchhhhhhHHHHHHHHhcccChhhhhhhhhhhhh----hCCceeecchhhhHHHHHHhhhhhhccCCcceeeeeehhh Confidence 111111111112222222222222333333322211 1234444445568888888888888888888887766555 Q ss_pred cCCceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHH--HHhccHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 168 KKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQE--SIDDADVDLVGIVSESISQIKVNTT 245 (394) Q Consensus 168 ~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~e--ll~ds~~~l~~~i~~~l~~~~~~~~ 245 (394) +.--++ ....+...+.+...|....+...++.--++.|-.++.+...... -+++|...|-..|..+|..++.+.+ T Consensus 77 gallvs---rsfdssneaqvhkdgqtkteqaatltidtlepvmvyklqslaervkrlqmsyselynlivaeltqaivnki 153 (318) T protein:vir:94 77 GALLVS---RSFDSSNEAQVHKDGQTKTEQAATLTIDTLEPVMVYKLQSLAERVKRLQMSYSELYNLIVAELTQAIVNKI 153 (318) T ss_pred hheeee---ccccccchhhhhcccccccccceeeeecccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHhhh Confidence 432111 11223333444444444444556677777778777777666543 4566666677888888888877654 Q ss_pred -HHHHhhccccccccccccH----------------------HHHHHHHHhhhhhhcccEEEEcHHH-HHHHHhhhcc-- Q lcl|Aclame:pro 246 -NDAIAKVLKSFTTKTVKNL----------------------DEIKALLNGGFDPAYNVSLIVSQSF-YQTLDTLKDG-- 299 (394) Q Consensus 246 -~~a~~~g~~~~~~~~~~~~----------------------~~i~~~~~~~~~~~~~a~~vm~~~~-~~~l~~lkd~-- 299 (394) +-+...|+|+.+-...-.- |.|..++.-..+.+..-..++...+ .+-|..|+.+ T Consensus 154 vdlalvegdgtngfksidkeadvkkikkittkaksagktpfadaieeavdfvrptagrrylivktedrkalldelrqata 233 (318) T protein:vir:94 154 VDLALVEGDGTNGFKSIDKEADVKKIKKITTKAKSAGKTPFADAIEEAVDFVRPTAGRRYLIVKTEDRKALLDELRQATA 233 (318) T ss_pred hheeeeecCCcchhhhhchhhhHHHHHHhhhhhhhcCCCchhHHHHHHHhhhccCCCceEEEEeccchHHHHHHHHhhhc Confidence 5566677777654433211 2222222111111111223333333 3334444322 Q ss_pred CCc-eeecccccCCCcccccccc-eEE-ecCcccccCceEEEeccccEEEEeecceE-EEEeecccccceEEEEEEeccE Q lcl|Aclame:pro 300 NGR-YLLQDDITAVSGKVLLGKP-VFV-LSDEVLGANKAFIGDFKRGVLFADRKDLG-LRWADNEIYGQYLQAVLRFGVS 375 (394) Q Consensus 300 ~G~-~l~~~~~~~~~~~~l~G~p-V~~-~~~~~~~~~~~~~gd~~~~~~~~~~~~~~-i~~~~~~~~~~~~r~~~r~d~~ 375 (394) +.+ .|-..+ +. -.+--|.. +++ +.+.+. .+-++-|-+ |.+ +-++++ ++.....++..-+.+.---.|- T Consensus 234 nanvrikndd-te--iasevgvdeiivytgskav--kptvlvdqk--yhi-dmqdltkvdafewktnsnmilvetltsgh 305 (318) T protein:vir:94 234 NANVRIKNDD-TE--IASEVGVDEIIVYTGSKAV--KPTVLVDQK--YHI-DMQDLTKVDAFEWKTNSNMILVETLTSGH 305 (318) T ss_pred ccceEEeccc-hh--hhhhcCcceeEEeeccccc--cceeEeccc--eec-chhhhhhhhceeeccCCceEEEEecccCc Confidence 222 122111 00 01111221 111 111111 223444433 222 333332 1111111111112222222333 Q ss_pred EecccceEEEEec Q lcl|Aclame:pro 376 KVDDKAGYYVTFT 388 (394) Q Consensus 376 v~~~~af~~l~~~ 388 (394) |---+|-+.++++ T Consensus 306 vetynagavitvs 318 (318) T protein:vir:94 306 VETYNAGAVITVS 318 (318) T ss_pred ceeecCceeEEeC Confidence 3333444455555 No 235 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=21.89 E-value=2.5 Score=18.37 Aligned_cols=374 Identities=13% Similarity=0.016 Sum_probs=94.1 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHH----HHhchhhHHHH----HHHHHHHHHHH---HHHHHHHHHHHHHH---HH Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTKTAQVK----NALESDDLEAA----RSIKAEVEQAK---ANLVEAENDLKLYE---SS 66 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~~~e~~----~~~~~e~~~~~----~~~~~ei~~l~---~~i~~l~~~~~~~~---~~ 66 (394) =+..+|++|+++++++.++++++.++++ ....+++.+++ ++++++++.+. ++++++.+.++... .. T Consensus 7 em~~~i~eL~e~r~~l~~e~~~l~d~ak~e~~~~~~~~e~~e~~a~~~el~~ei~~le~~~~~~~~~~~~~~~~~~~~~~ 86 (477) T protein:vir:84 7 ELRALRAAAVEAVATLKAERQAIADGAKAEERAALSADETAEFRAKSASIKAELDKVEDLDEQIRELESEIERSGKLEAE 86 (477) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhh Confidence 3556799999999999999888777664 23555554443 44555555443 44444433332211 11 Q ss_pred Hh-hccccccccccccch---hhhHHHHHHHHH------HHHHHHHH---HHHHHHHHHHHHHHHHhhhhhhhhhhcccc Q lcl|Aclame:pro 67 VE-VGGAENIGGKEVTQE---EKTYRESVNDFI------RSKGKIVN---DSLRFEGKDEVLMPINETTPVEPQKDGIKK 133 (394) Q Consensus 67 ~~-~~~~~~~~~~~~~~~---~~~~~~~~~~~~------~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 133 (394) .. ............... .......+.... ........ ...........................+.. T Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~ 166 (477) T protein:vir:84 87 TKTVRKATVEVNEALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYRDLDRNGGTGGY 166 (477) T ss_pred hhhhcccccccccchhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhccccccCCCcce Confidence 11 111111111110000 001100000000 00000000 001111111111111111111111111111 Q ss_pred cCCccccchhHHhH-----HHHHHHhhhhhhheee--eEe-ecCCce-eEEEEecCCCccccccccccccccccccccee Q lcl|Aclame:pro 134 ENAKPVSSEEILYT-----PAREVKTVVDLKPFTT--VYQ-AKKASG-KYPVLQRATTKMVTVAELEKNPALAKPDFKDV 204 (394) Q Consensus 134 ~~~~~lvP~~~~~~-----I~~~~~~~~~l~~~~~--~~~-~~~~~~-~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v 204 (394) ....-.++..+... ++..+-...++..... .++ +.++.. .+...-+...+.....+ ..+.....++..- T Consensus 167 lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~--s~~~f~~i~~~~~ 244 (477) T protein:vir:84 167 AVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHE--VDLTDGFVQANVK 244 (477) T ss_pred eeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccCcccccccccc--cccceeeEEEeee Confidence 11111233333322 2222222222222211 122 222221 11211111111111111 1122222333332 Q ss_pred eecHhhhhhhhhhhH---HHHhccHHHHHHHHHHHHHHHHHHHHHH--HHhhcc---c--------cccc---------- Q lcl|Aclame:pro 205 AWNIDTYRGAIPLSQ---ESIDDADVDLVGIVSESISQIKVNTTND--AIAKVL---K--------SFTT---------- 258 (394) Q Consensus 205 ~~~~~~~~~~~~vs~---ell~ds~~~l~~~i~~~l~~~~~~~~~~--a~~~g~---~--------~~~~---------- 258 (394) ++........--+.+ .+...-.-.|..-+...+..++..+.+. ...+.. + ++.+ T Consensus 245 k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~t~~~~~~~~~~ 324 (477) T protein:vir:84 245 TIAGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGITQVTATSAGSALEKHQIIYQK 324 (477) T ss_pred eEEeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeeccccccccccccccchhhHHHHHHH Confidence 222111111111111 1111101123333333333344332221 111110 0 0000 Q ss_pred ----------------cccccHHHHHHHHHhhhhhhcccEEEEcHHHHHHHHhh----hc-----cCCceeecccccCCC Q lcl|Aclame:pro 259 ----------------KTVKNLDEIKALLNGGFDPAYNVSLIVSQSFYQTLDTL----KD-----GNGRYLLQDDITAVS 313 (394) Q Consensus 259 ----------------~~~~~~~~i~~~~~~~~~~~~~a~~vm~~~~~~~l~~l----kd-----~~G~~l~~~~~~~~~ 313 (394) ............+..+.+..-+..|.-+.........+ .. =.|.|+...+..... T Consensus 325 i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~pVv~s~~~p~~ 404 (477) T protein:vir:84 325 IADAIQRVHTSRFLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHGLPVVTDPTLPTT 404 (477) T ss_pred HHHHHhhccccccCCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhcccceEecCccccc Confidence 00111122223333333333334443322121111110 00 135555432110000 Q ss_pred cccc-cccceEEecCcccccCceEEEeccccEEEEeec------ceEEEEeecccccceEE---EEEEeccEEecccceE Q lcl|Aclame:pro 314 GKVL-LGKPVFVLSDEVLGANKAFIGDFKRGVLFADRK------DLGLRWADNEIYGQYLQ---AVLRFGVSKVDDKAGY 383 (394) Q Consensus 314 ~~~l-~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~------~~~i~~~~~~~~~~~~r---~~~r~d~~v~~~~af~ 383 (394) .+.. ....+++-+ -..+++++.. ..+..+.. ...++......+ ..+| .+..+-+ .|-. T Consensus 405 ~~~~~d~~~i~~gd-----~~~~~i~~~~-~~~~~~~~~~~~~~~~~~~v~~~~~~-~~~r~~~afv~~t~-----~~~~ 472 (477) T protein:vir:84 405 LGTGTDQDVIHVLR-----ASDLALFESS-VRMRALQETRAENLSVLLQVYGYLAF-TAARFPQSVVEIGG-----TALT 472 (477) T ss_pred ccccCCcceEEEEE-----eceEEEEeec-eeEEeccccccccceeeeeehhhhhh-hhhccccceEEeec-----cccc Confidence 0000 000122211 0112233211 00111000 000000000000 0001 1111111 1111 Q ss_pred EEEec Q lcl|Aclame:pro 384 YVTFT 388 (394) Q Consensus 384 ~l~~~ 388 (394) .=++. T Consensus 473 ~~~~~ 477 (477) T protein:vir:84 473 APTFA 477 (477) T ss_pred ccccC Confidence 11111 No 236 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=21.52 E-value=2.6 Score=18.32 Aligned_cols=334 Identities=11% Similarity=0.115 Sum_probs=81.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTK-TAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKE 79 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~-~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~ 79 (394) -|+|.++++.+..+++++..++- .++++. -.++.++++++++...+ +.+.+................. ... T Consensus 4 ~l~el~~~~~~~~~e~~~~~~~~~~~e~~~-----~~~e~~~l~~~i~~~~~-~~~~~~~~~~~~~~~~~~~~~~--~~~ 75 (392) T protein:vir:10 4 ELRELLAKLEGKKEEVRSLMGEDKVAEAEQ-----MMEEVRSLQKKIDLQRS-LDEAETEERNNGREVETRNVDG--EME 75 (392) T ss_pred HHHHHHHHHHHHHHHHHHHhhHHHHHHHHH-----HHHHHHHHHHHHHHHHH-HHHHHHHHhhccccccccCccc--hHH Confidence 88888888888888888876542 222322 23345566667665433 2222222111111111111000 000 Q ss_pred ccchhhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhc--ccccCCccccchhHHhHHHHHHHhhh Q lcl|Aclame:pro 80 VTQEEKTYRESVND-FIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDG--IKKENAKPVSSEEILYTPAREVKTVV 156 (394) Q Consensus 80 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~lvP~~~~~~I~~~~~~~~ 156 (394) . .......+.. ............. .... .... ......+ +...-...++-.......+..+-... T Consensus 76 ~---~~~~~~~l~~~~~~~~~~~~~~~~-~~~~-----~~~~---~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~ 143 (392) T protein:vir:10 76 Y---RDVFMKALRNKPLNAEEREFLEDD-LEQR-----AMSG---LTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE 143 (392) T ss_pred H---HHHHHHHHhcccccHHHHHHHhhh-hhhh-----hccc---cccCCCceecchhHHHHHHHHHHhhhhhhhhceee Confidence 0 0001111110 0000000000000 0000 0000 0000000 11111111111111112222222222 Q ss_pred hhhheee--eEeecCCceeEEEEecCCCcccccccccccccccccccce------eeecHhhhhhhhhhhHHHHhccHHH Q lcl|Aclame:pro 157 DLKPFTT--VYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKD------VAWNIDTYRGAIPLSQESIDDADVD 228 (394) Q Consensus 157 ~l~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~------v~~~~~~~~~~~~vs~ell~ds~~~ 228 (394) ++..... .++...+.......... +...+. ..+.....++.. +.++-+-+.. |..-+. .+ T Consensus 144 ~~~~~~~~~~~~~~~~~~~a~~v~E~----~~~~~~-~~~~~~~v~l~~~k~~~~~~iS~ell~d----s~~~l~--~~- 211 (392) T protein:vir:10 144 PVRTRSGSRVLEKNSDMIPFAEITEM----GEIPET-DNPKFSNVQYAVKDRAGILPLSRSLLQD----SDQNIL--KY- 211 (392) T ss_pred eccCCceeEEEEeecCCccceeeccc----cccccc-ccccceeEEeeeeeEEEeehhhHHHHhh----hHHHHH--HH- Confidence 2221111 12222222222211111 111111 111122222222 2222222221 211111 11 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccccccccccccHH---------------------HHHHHHHhhhhhhcccEEEEcH Q lcl|Aclame:pro 229 LVGIVSESISQIKVNTTNDAIAKVLKSFTTKTVKNLD---------------------EIKALLNGGFDPAYNVSLIVSQ 287 (394) Q Consensus 229 l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~~~~~~---------------------~i~~~~~~~~~~~~~a~~vm~~ 287 (394) |...|.+.++..+....-...-.+. ..+..+..+.. .....+..+.+..-+..|..+. T Consensus 212 i~~~l~~~i~~~~d~~~~~g~g~~~-~~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~ 290 (392) T protein:vir:10 212 VTKWLGKKSKVTRNVLILGVIEKLT-KQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDP 290 (392) T ss_pred HHHHHHHHHHHHHHHHHhhcccccc-ccCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCc Confidence 4445555555444333222211111 11111111111 1122222222221122332222 Q ss_pred HHHHHHHhhhccCCce-eec-ccccCCCccccccc-ceEEecCcccccCceEEEeccccE------------------EE Q lcl|Aclame:pro 288 SFYQTLDTLKDGNGRY-LLQ-DDITAVSGKVLLGK-PVFVLSDEVLGANKAFIGDFKRGV------------------LF 346 (394) Q Consensus 288 ~~~~~l~~lkd~~G~~-l~~-~~~~~~~~~~l~G~-pV~~~~~~~~~~~~~~~gd~~~~~------------------~~ 346 (394) ..-. -.. =.|.| ++. ++.....++.-.|- ++++-+ -+..++++++...- .. T Consensus 291 ~~~~-~~t---llG~~~v~~~~~~~~~~~~~~~~~~~~~~gd----fs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~ 362 (392) T protein:vir:10 291 TQKN-KKL---FAGTNPVVVVSNRFLKSKGTTAKKAPLIIGD----LKEAIVLFKREDMELASTDVGGKAFTRNTLDLRA 362 (392) T ss_pred cCCc-ccc---ccCcccEEEecccccCCCcccCCceEEEEEe----hhceEEEEeecceEEEEeccccchhhcCceEEEE Confidence 1100 000 02322 110 01000001111111 111100 00111222222111 11 Q ss_pred EeecceEEEEeecccccceEEEEEEeccEEec-ccc Q lcl|Aclame:pro 347 ADRKDLGLRWADNEIYGQYLQAVLRFGVSKVD-DKA 381 (394) Q Consensus 347 ~~~~~~~i~~~~~~~~~~~~r~~~r~d~~v~~-~~a 381 (394) ..|.|..+. +. ..+...---..+|.- |.+ T Consensus 363 ~~r~d~~v~--~~----~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 363 IQRDDVQMW--DN----EAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred EEeeccEEe--cc----cceEEEEecccccccCCCC Confidence 111111111 11 111111111112222 333 No 237 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=21.52 E-value=2.6 Score=18.32 Aligned_cols=334 Identities=11% Similarity=0.115 Sum_probs=81.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTK-TAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKE 79 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~-~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~ 79 (394) -|+|.++++.+..+++++..++- .++++. -.++.++++++++...+ +.+.+................. ... T Consensus 4 ~l~el~~~~~~~~~e~~~~~~~~~~~e~~~-----~~~e~~~l~~~i~~~~~-~~~~~~~~~~~~~~~~~~~~~~--~~~ 75 (392) T protein:vir:10 4 ELRELLAKLEGKKEEVRSLMGEDKVAEAEQ-----MMEEVRSLQKKIDLQRS-LDEAETEERNNGREVETRNVDG--EME 75 (392) T ss_pred HHHHHHHHHHHHHHHHHHHhhHHHHHHHHH-----HHHHHHHHHHHHHHHHH-HHHHHHHHhhccccccccCccc--hHH Confidence 88888888888888888876542 222322 23345566667665433 2222222111111111111000 000 Q ss_pred ccchhhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhc--ccccCCccccchhHHhHHHHHHHhhh Q lcl|Aclame:pro 80 VTQEEKTYRESVND-FIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDG--IKKENAKPVSSEEILYTPAREVKTVV 156 (394) Q Consensus 80 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~lvP~~~~~~I~~~~~~~~ 156 (394) . .......+.. ............. .... .... ......+ +...-...++-.......+..+-... T Consensus 76 ~---~~~~~~~l~~~~~~~~~~~~~~~~-~~~~-----~~~~---~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~ 143 (392) T protein:vir:10 76 Y---RDVFMKALRNKPLNAEEREFLEDD-LEQR-----AMSG---LTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE 143 (392) T ss_pred H---HHHHHHHHhcccccHHHHHHHhhh-hhhh-----hccc---cccCCCceecchhHHHHHHHHHHhhhhhhhhceee Confidence 0 0001111110 0000000000000 0000 0000 0000000 11111111111111112222222222 Q ss_pred hhhheee--eEeecCCceeEEEEecCCCcccccccccccccccccccce------eeecHhhhhhhhhhhHHHHhccHHH Q lcl|Aclame:pro 157 DLKPFTT--VYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKD------VAWNIDTYRGAIPLSQESIDDADVD 228 (394) Q Consensus 157 ~l~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~------v~~~~~~~~~~~~vs~ell~ds~~~ 228 (394) ++..... .++...+.......... +...+. ..+.....++.. +.++-+-+.. |..-+. .+ T Consensus 144 ~~~~~~~~~~~~~~~~~~~a~~v~E~----~~~~~~-~~~~~~~v~l~~~k~~~~~~iS~ell~d----s~~~l~--~~- 211 (392) T protein:vir:10 144 PVRTRSGSRVLEKNSDMIPFAEITEM----GEIPET-DNPKFSNVQYAVKDRAGILPLSRSLLQD----SDQNIL--KY- 211 (392) T ss_pred eccCCceeEEEEeecCCccceeeccc----cccccc-ccccceeEEeeeeeEEEeehhhHHHHhh----hHHHHH--HH- Confidence 2221111 12222222222211111 111111 111122222222 2222222221 211111 11 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccccccccccccHH---------------------HHHHHHHhhhhhhcccEEEEcH Q lcl|Aclame:pro 229 LVGIVSESISQIKVNTTNDAIAKVLKSFTTKTVKNLD---------------------EIKALLNGGFDPAYNVSLIVSQ 287 (394) Q Consensus 229 l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~~~~~~---------------------~i~~~~~~~~~~~~~a~~vm~~ 287 (394) |...|.+.++..+....-...-.+. ..+..+..+.. .....+..+.+..-+..|..+. T Consensus 212 i~~~l~~~i~~~~d~~~~~g~g~~~-~~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~ 290 (392) T protein:vir:10 212 VTKWLGKKSKVTRNVLILGVIEKLT-KQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDP 290 (392) T ss_pred HHHHHHHHHHHHHHHHHhhcccccc-ccCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCc Confidence 4445555555444333222211111 11111111111 1122222222221122332222 Q ss_pred HHHHHHHhhhccCCce-eec-ccccCCCccccccc-ceEEecCcccccCceEEEeccccE------------------EE Q lcl|Aclame:pro 288 SFYQTLDTLKDGNGRY-LLQ-DDITAVSGKVLLGK-PVFVLSDEVLGANKAFIGDFKRGV------------------LF 346 (394) Q Consensus 288 ~~~~~l~~lkd~~G~~-l~~-~~~~~~~~~~l~G~-pV~~~~~~~~~~~~~~~gd~~~~~------------------~~ 346 (394) ..-. -.. =.|.| ++. ++.....++.-.|- ++++-+ -+..++++++...- .. T Consensus 291 ~~~~-~~t---llG~~~v~~~~~~~~~~~~~~~~~~~~~~gd----fs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~ 362 (392) T protein:vir:10 291 TQKN-KKL---FAGTNPVVVVSNRFLKSKGTTAKKAPLIIGD----LKEAIVLFKREDMELASTDVGGKAFTRNTLDLRA 362 (392) T ss_pred cCCc-ccc---ccCcccEEEecccccCCCcccCCceEEEEEe----hhceEEEEeecceEEEEeccccchhhcCceEEEE Confidence 1100 000 02322 110 01000001111111 111100 00111222222111 11 Q ss_pred EeecceEEEEeecccccceEEEEEEeccEEec-ccc Q lcl|Aclame:pro 347 ADRKDLGLRWADNEIYGQYLQAVLRFGVSKVD-DKA 381 (394) Q Consensus 347 ~~~~~~~i~~~~~~~~~~~~r~~~r~d~~v~~-~~a 381 (394) ..|.|..+. +. ..+...---..+|.- |.+ T Consensus 363 ~~r~d~~v~--~~----~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 363 IQRDDVQMW--DN----EAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred EEeeccEEe--cc----cceEEEEecccccccCCCC Confidence 111111111 11 111111111112222 333 No 238 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=21.52 E-value=2.6 Score=18.32 Aligned_cols=334 Identities=11% Similarity=0.115 Sum_probs=81.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTK-TAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKE 79 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~-~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~ 79 (394) -|+|.++++.+..+++++..++- .++++. -.++.++++++++...+ +.+.+................. ... T Consensus 4 ~l~el~~~~~~~~~e~~~~~~~~~~~e~~~-----~~~e~~~l~~~i~~~~~-~~~~~~~~~~~~~~~~~~~~~~--~~~ 75 (392) T protein:vir:10 4 ELRELLAKLEGKKEEVRSLMGEDKVAEAEQ-----MMEEVRSLQKKIDLQRS-LDEAETEERNNGREVETRNVDG--EME 75 (392) T ss_pred HHHHHHHHHHHHHHHHHHHhhHHHHHHHHH-----HHHHHHHHHHHHHHHHH-HHHHHHHHhhccccccccCccc--hHH Confidence 88888888888888888876542 222322 23345566667665433 2222222111111111111000 000 Q ss_pred ccchhhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhc--ccccCCccccchhHHhHHHHHHHhhh Q lcl|Aclame:pro 80 VTQEEKTYRESVND-FIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDG--IKKENAKPVSSEEILYTPAREVKTVV 156 (394) Q Consensus 80 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~lvP~~~~~~I~~~~~~~~ 156 (394) . .......+.. ............. .... .... ......+ +...-...++-.......+..+-... T Consensus 76 ~---~~~~~~~l~~~~~~~~~~~~~~~~-~~~~-----~~~~---~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~ 143 (392) T protein:vir:10 76 Y---RDVFMKALRNKPLNAEEREFLEDD-LEQR-----AMSG---LTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE 143 (392) T ss_pred H---HHHHHHHHhcccccHHHHHHHhhh-hhhh-----hccc---cccCCCceecchhHHHHHHHHHHhhhhhhhhceee Confidence 0 0001111110 0000000000000 0000 0000 0000000 11111111111111112222222222 Q ss_pred hhhheee--eEeecCCceeEEEEecCCCcccccccccccccccccccce------eeecHhhhhhhhhhhHHHHhccHHH Q lcl|Aclame:pro 157 DLKPFTT--VYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKD------VAWNIDTYRGAIPLSQESIDDADVD 228 (394) Q Consensus 157 ~l~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~------v~~~~~~~~~~~~vs~ell~ds~~~ 228 (394) ++..... .++...+.......... +...+. ..+.....++.. +.++-+-+.. |..-+. .+ T Consensus 144 ~~~~~~~~~~~~~~~~~~~a~~v~E~----~~~~~~-~~~~~~~v~l~~~k~~~~~~iS~ell~d----s~~~l~--~~- 211 (392) T protein:vir:10 144 PVRTRSGSRVLEKNSDMIPFAEITEM----GEIPET-DNPKFSNVQYAVKDRAGILPLSRSLLQD----SDQNIL--KY- 211 (392) T ss_pred eccCCceeEEEEeecCCccceeeccc----cccccc-ccccceeEEeeeeeEEEeehhhHHHHhh----hHHHHH--HH- Confidence 2221111 12222222222211111 111111 111122222222 2222222221 211111 11 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccccccccccccHH---------------------HHHHHHHhhhhhhcccEEEEcH Q lcl|Aclame:pro 229 LVGIVSESISQIKVNTTNDAIAKVLKSFTTKTVKNLD---------------------EIKALLNGGFDPAYNVSLIVSQ 287 (394) Q Consensus 229 l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~~~~~~---------------------~i~~~~~~~~~~~~~a~~vm~~ 287 (394) |...|.+.++..+....-...-.+. ..+..+..+.. .....+..+.+..-+..|..+. T Consensus 212 i~~~l~~~i~~~~d~~~~~g~g~~~-~~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~ 290 (392) T protein:vir:10 212 VTKWLGKKSKVTRNVLILGVIEKLT-KQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDP 290 (392) T ss_pred HHHHHHHHHHHHHHHHHhhcccccc-ccCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCc Confidence 4445555555444333222211111 11111111111 1122222222221122332222 Q ss_pred HHHHHHHhhhccCCce-eec-ccccCCCccccccc-ceEEecCcccccCceEEEeccccE------------------EE Q lcl|Aclame:pro 288 SFYQTLDTLKDGNGRY-LLQ-DDITAVSGKVLLGK-PVFVLSDEVLGANKAFIGDFKRGV------------------LF 346 (394) Q Consensus 288 ~~~~~l~~lkd~~G~~-l~~-~~~~~~~~~~l~G~-pV~~~~~~~~~~~~~~~gd~~~~~------------------~~ 346 (394) ..-. -.. =.|.| ++. ++.....++.-.|- ++++-+ -+..++++++...- .. T Consensus 291 ~~~~-~~t---llG~~~v~~~~~~~~~~~~~~~~~~~~~~gd----fs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~ 362 (392) T protein:vir:10 291 TQKN-KKL---FAGTNPVVVVSNRFLKSKGTTAKKAPLIIGD----LKEAIVLFKREDMELASTDVGGKAFTRNTLDLRA 362 (392) T ss_pred cCCc-ccc---ccCcccEEEecccccCCCcccCCceEEEEEe----hhceEEEEeecceEEEEeccccchhhcCceEEEE Confidence 1100 000 02322 110 01000001111111 111100 00111222222111 11 Q ss_pred EeecceEEEEeecccccceEEEEEEeccEEec-ccc Q lcl|Aclame:pro 347 ADRKDLGLRWADNEIYGQYLQAVLRFGVSKVD-DKA 381 (394) Q Consensus 347 ~~~~~~~i~~~~~~~~~~~~r~~~r~d~~v~~-~~a 381 (394) ..|.|..+. +. ..+...---..+|.- |.+ T Consensus 363 ~~r~d~~v~--~~----~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 363 IQRDDVQMW--DN----EAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred EEeeccEEe--cc----cceEEEEecccccccCCCC Confidence 111111111 11 111111111112222 333 No 239 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=21.52 E-value=2.6 Score=18.32 Aligned_cols=334 Identities=11% Similarity=0.115 Sum_probs=81.6 Q ss_pred ChHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc Q lcl|Aclame:pro 1 MFEEKIKEIKATIADLNNTIVTK-TAQVKNALESDDLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKE 79 (394) Q Consensus 1 ~l~e~l~eL~~~~~el~~~~~~~-~~e~~~~~~~e~~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~ 79 (394) -|+|.++++.+..+++++..++- .++++. -.++.++++++++...+ +.+.+................. ... T Consensus 4 ~l~el~~~~~~~~~e~~~~~~~~~~~e~~~-----~~~e~~~l~~~i~~~~~-~~~~~~~~~~~~~~~~~~~~~~--~~~ 75 (392) T protein:vir:10 4 ELRELLAKLEGKKEEVRSLMGEDKVAEAEQ-----MMEEVRSLQKKIDLQRS-LDEAETEERNNGREVETRNVDG--EME 75 (392) T ss_pred HHHHHHHHHHHHHHHHHHHhhHHHHHHHHH-----HHHHHHHHHHHHHHHHH-HHHHHHHHhhccccccccCccc--hHH Confidence 88888888888888888876542 222322 23345566667665433 2222222111111111111000 000 Q ss_pred ccchhhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhc--ccccCCccccchhHHhHHHHHHHhhh Q lcl|Aclame:pro 80 VTQEEKTYRESVND-FIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDG--IKKENAKPVSSEEILYTPAREVKTVV 156 (394) Q Consensus 80 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~lvP~~~~~~I~~~~~~~~ 156 (394) . .......+.. ............. .... .... ......+ +...-...++-.......+..+-... T Consensus 76 ~---~~~~~~~l~~~~~~~~~~~~~~~~-~~~~-----~~~~---~t~~~gg~~vP~~~~~~ii~~~~~~s~l~~~~~~~ 143 (392) T protein:vir:10 76 Y---RDVFMKALRNKPLNAEEREFLEDD-LEQR-----AMSG---LTGEDGGLVIPQDIQTQINELARSFDALEQYVTVE 143 (392) T ss_pred H---HHHHHHHHhcccccHHHHHHHhhh-hhhh-----hccc---cccCCCceecchhHHHHHHHHHHhhhhhhhhceee Confidence 0 0001111110 0000000000000 0000 0000 0000000 11111111111111112222222222 Q ss_pred hhhheee--eEeecCCceeEEEEecCCCcccccccccccccccccccce------eeecHhhhhhhhhhhHHHHhccHHH Q lcl|Aclame:pro 157 DLKPFTT--VYQAKKASGKYPVLQRATTKMVTVAELEKNPALAKPDFKD------VAWNIDTYRGAIPLSQESIDDADVD 228 (394) Q Consensus 157 ~l~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~------v~~~~~~~~~~~~vs~ell~ds~~~ 228 (394) ++..... .++...+.......... +...+. ..+.....++.. +.++-+-+.. |..-+. .+ T Consensus 144 ~~~~~~~~~~~~~~~~~~~a~~v~E~----~~~~~~-~~~~~~~v~l~~~k~~~~~~iS~ell~d----s~~~l~--~~- 211 (392) T protein:vir:10 144 PVRTRSGSRVLEKNSDMIPFAEITEM----GEIPET-DNPKFSNVQYAVKDRAGILPLSRSLLQD----SDQNIL--KY- 211 (392) T ss_pred eccCCceeEEEEeecCCccceeeccc----cccccc-ccccceeEEeeeeeEEEeehhhHHHHhh----hHHHHH--HH- Confidence 2221111 12222222222211111 111111 111122222222 2222222221 211111 11 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhccccccccccccHH---------------------HHHHHHHhhhhhhcccEEEEcH Q lcl|Aclame:pro 229 LVGIVSESISQIKVNTTNDAIAKVLKSFTTKTVKNLD---------------------EIKALLNGGFDPAYNVSLIVSQ 287 (394) Q Consensus 229 l~~~i~~~l~~~~~~~~~~a~~~g~~~~~~~~~~~~~---------------------~i~~~~~~~~~~~~~a~~vm~~ 287 (394) |...|.+.++..+....-...-.+. ..+..+..+.. .....+..+.+..-+..|..+. T Consensus 212 i~~~l~~~i~~~~d~~~~~g~g~~~-~~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~ 290 (392) T protein:vir:10 212 VTKWLGKKSKVTRNVLILGVIEKLT-KQAIKSLDDIKDVLNVKLDPAISPNAILLTNQDGFNYLDKLKDKDGKYILQSDP 290 (392) T ss_pred HHHHHHHHHHHHHHHHHhhcccccc-ccCccCHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhhccCCCeEeecCc Confidence 4445555555444333222211111 11111111111 1122222222221122332222 Q ss_pred HHHHHHHhhhccCCce-eec-ccccCCCccccccc-ceEEecCcccccCceEEEeccccE------------------EE Q lcl|Aclame:pro 288 SFYQTLDTLKDGNGRY-LLQ-DDITAVSGKVLLGK-PVFVLSDEVLGANKAFIGDFKRGV------------------LF 346 (394) Q Consensus 288 ~~~~~l~~lkd~~G~~-l~~-~~~~~~~~~~l~G~-pV~~~~~~~~~~~~~~~gd~~~~~------------------~~ 346 (394) ..-. -.. =.|.| ++. ++.....++.-.|- ++++-+ -+..++++++...- .. T Consensus 291 ~~~~-~~t---llG~~~v~~~~~~~~~~~~~~~~~~~~~~gd----fs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~ 362 (392) T protein:vir:10 291 TQKN-KKL---FAGTNPVVVVSNRFLKSKGTTAKKAPLIIGD----LKEAIVLFKREDMELASTDVGGKAFTRNTLDLRA 362 (392) T ss_pred cCCc-ccc---ccCcccEEEecccccCCCcccCCceEEEEEe----hhceEEEEeecceEEEEeccccchhhcCceEEEE Confidence 1100 000 02322 110 01000001111111 111100 00111222222111 11 Q ss_pred EeecceEEEEeecccccceEEEEEEeccEEec-ccc Q lcl|Aclame:pro 347 ADRKDLGLRWADNEIYGQYLQAVLRFGVSKVD-DKA 381 (394) Q Consensus 347 ~~~~~~~i~~~~~~~~~~~~r~~~r~d~~v~~-~~a 381 (394) ..|.|..+. +. ..+...---..+|.- |.+ T Consensus 363 ~~r~d~~v~--~~----~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 363 IQRDDVQMW--DN----EAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred EEeeccEEe--cc----cceEEEEecccccccCCCC Confidence 111111111 11 111111111112222 333 No 240 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=21.38 E-value=2.6 Score=18.30 Aligned_cols=283 Identities=9% Similarity=0.003 Sum_probs=108.0 Q ss_pred chhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHh----HHHHHHHhhhh Q lcl|Aclame:pro 82 QEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILY----TPAREVKTVVD 157 (394) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~----~I~~~~~~~~~ 157 (394) ....+....+..+.-. +.. .......+. ..........+.++ .+.....+|..+.. .+++.+..... T Consensus 1 ~~~~~~~~~l~~~gi~----~~~-~~~~~~~~~---~~~a~da~d~~~~~-~t~~~~g~~~~l~~~i~p~~~~~~~~~~~ 71 (336) T protein:vir:10 1 MRDAQRIQNLARAGVI----LPR-SVKNVSTPL---AEYAMDAADLSPHL-SSTGSSGIPNYLTTYVDPSVIDILVAPMK 71 (336) T ss_pred CchHHHHHHHhccCee----cch-hhhhhhHHH---HHHHHhhhhhcccc-ccCCCcchHHHHHhhcCcceeeeeechhc Confidence 1111111111111000 000 000000000 00000000001111 11122234444443 33333333333 Q ss_pred hhheeeeEeecCC---ceeEEEEecCCCcccccccccccccccccccceeeecHhhhhhhhhhhH-HHHhc--cHHHHHH Q lcl|Aclame:pro 158 LKPFTTVYQAKKA---SGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQ-ESIDD--ADVDLVG 231 (394) Q Consensus 158 l~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~-ell~d--s~~~l~~ 231 (394) ...++.+.+.+.- ...+++.. ..+.+...+.....| ..+...+.-+-+.+.++..+.++. |+-.- .-.++.+ T Consensus 72 ~~~l~~v~t~g~w~~~~~~~~~~e-~~G~a~~ygd~~d~P-~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~ 149 (336) T protein:vir:10 72 AAELVGESKKGDWTTLVAAFITAE-PTTKVATYGDYSSDG-DSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLAS 149 (336) T ss_pred hhhhcccccCCCcceeeEEEEeee-eeeeEEEccccCCCc-ceeeeeeeeeeeEEEEEEEEeeCHHHHHHHHHhCCCcHH Confidence 3444444332111 12223322 334445555555554 455556666777788888888885 34332 1235666 Q ss_pred HHHHHHHHHHHHHHHHHHhhccccccc---------------c----ccccHHHH----HHHHHhhhhhhc-----c--c Q lcl|Aclame:pro 232 IVSESISQIKVNTTNDAIAKVLKSFTT---------------K----TVKNLDEI----KALLNGGFDPAY-----N--V 281 (394) Q Consensus 232 ~i~~~l~~~~~~~~~~a~~~g~~~~~~---------------~----~~~~~~~i----~~~~~~~~~~~~-----~--a 281 (394) --+...++++....|.-.+.|....+. . ..++.+.| ..++..+..... + - T Consensus 150 ~Ka~aA~~ale~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~~~w~~~T~~eI~~Di~~~~~~l~~qt~g~i~~~~~~ 229 (336) T protein:vir:10 150 ELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVL 229 (336) T ss_pred HHHHHHHHHHHHhhCeEEEEeecccceEEEeecCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccce Confidence 666666666766666654444322100 0 11233444 444433322221 2 2 Q ss_pred EEEEcHHHHHHHHhhhccCCceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEee---cc-eEEEE- Q lcl|Aclame:pro 282 SLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADR---KD-LGLRW- 356 (394) Q Consensus 282 ~~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~---~~-~~i~~- 356 (394) .++|.++.+..|.. .+..|.-++.- +....| ++.++..+.. .+++ |+- .+++... .+ +.+.+ T Consensus 230 tL~Lp~~~~~~L~~-~n~~g~tv~~~-lk~n~P----nl~i~t~pel-~~Ag----g~~--~~~~~~~~~~~~t~~~~~P 296 (336) T protein:vir:10 230 HMGLPPTAMSDLSK-TNQYGLSAAAK-LKEIFP----KLEFVTIPEY-DTAS----GRL--VQLWAPRVEGKDTATCGFT 296 (336) T ss_pred EEEechHHHHhccC-CCccCccHHHH-HHHhCC----ccEEEEcccc-cccC----Cce--EEEEEecccCCcceeeecC Confidence 68899999988864 33344323210 111111 1222222211 1111 111 0111110 01 11111 Q ss_pred ---eeccc--ccce--EEEEEEe-ccEEecccceEEEEec Q lcl|Aclame:pro 357 ---ADNEI--YGQY--LQAVLRF-GVSKVDDKAGYYVTFT 388 (394) Q Consensus 357 ---~~~~~--~~~~--~r~~~r~-d~~v~~~~af~~l~~~ 388 (394) ..++- .... .-+..|. |..+.+|-||++++.- T Consensus 297 ~~f~~lpvq~~~~~~~v~~~~rt~Gv~i~rP~ai~~~~GI 336 (336) T protein:vir:10 297 EKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMLGV 336 (336) T ss_pred hhhhccceeecCceeEeccccceeeeeeeccchheeeccC Confidence 00000 0011 2344565 4477789999999988 No 241 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=21.25 E-value=2.6 Score=18.28 Aligned_cols=350 Identities=9% Similarity=-0.061 Sum_probs=69.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhchhh--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccccch Q lcl|Aclame:pro 6 IKEIKATIADLNNTIVTKTAQVKNALESDD--LEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVTQE 83 (394) Q Consensus 6 l~eL~~~~~el~~~~~~~~~e~~~~~~~e~--~~~~~~~~~ei~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (394) +.++.+++.+.+ +++.++++...+... .+...+..++++++.++++.+++++++.+...+............ T Consensus 1 m~~~~~~l~~~~---~~~~~~~~~~~e~~~~~~~~~~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~--- 74 (390) T protein:vir:97 1 MTDITAKLEATL---ANVTDSLKAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQ--- 74 (390) T ss_pred ChHHHHHHHHHH---HHHHHHHHHHHHHHHhhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccc--- Confidence 444444333332 222233332211110 112233455566666666666666665554433322211111110 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccccc-CCccccchhHHhHHHHHHHhhhhhhhee Q lcl|Aclame:pro 84 EKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKE-NAKPVSSEEILYTPAREVKTVVDLKPFT 162 (394) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~lvP~~~~~~I~~~~~~~~~l~~~~ 162 (394) ........ ......+......................... .+..-....+-..+...+- ..++... T Consensus 75 ~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lip~~~~~~ii--~~~~~~~ 141 (390) T protein:vir:97 75 HVSVGDMF-----------VASEQFQASTGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGFI--TPPDARL 141 (390) T ss_pred cccchhhh-----------hhhHHHHHHHHHhhhhhhhhhhHHHHHHHhhhcccccccccccchhhhHHHH--HHHhhhh Confidence 00000000 00000000000000000000000000000000 0000001111111111111 1112211 Q ss_pred eeEeecCCceeEEEEecCCCccccccc-----cccccccc-ccccceeeecHhhhhhhhhhhHHHHhccHHHHHHHHHHH Q lcl|Aclame:pro 163 TVYQAKKASGKYPVLQRATTKMVTVAE-----LEKNPALA-KPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVGIVSES 236 (394) Q Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~~e-----~~~~~~~~-~~~~~~v~~~~~~~~~~~~vs~ell~ds~~~l~~~i~~~ 236 (394) .+... .-.+|+. ++...+... ......++ ...-...++....+....--..--+.+...+-...+... T Consensus 142 ~i~~~---~~~~~~~---~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~~l~~~ 215 (390) T protein:vir:97 142 TVRDL---IGSGRTD---SALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAPQLASY 215 (390) T ss_pred hhHhh---cceeecc---CCceEEEEEecCCcceeeecCCccccccccceeEEEEeeeeEEEeehhhHHHHHhHHHHHHH Confidence 11111 1112221 111111111 01111111 011111122211111100000000001111111223333 Q ss_pred HHHHHHHHHHHHHhhccccccccccccHHHHHHHHHhhhhhhcccEEEEcHHHHHHHHhhhccCCc---eeeccc----- Q lcl|Aclame:pro 237 ISQIKVNTTNDAIAKVLKSFTTKTVKNLDEIKALLNGGFDPAYNVSLIVSQSFYQTLDTLKDGNGR---YLLQDD----- 308 (394) Q Consensus 237 l~~~~~~~~~~a~~~g~~~~~~~~~~~~~~i~~~~~~~~~~~~~a~~vm~~~~~~~l~~lkd~~G~---~l~~~~----- 308 (394) +...++.+...++-...-.+..++ .....|.............+....-......+..+...... .++.|. T Consensus 216 i~~~la~a~~~~~d~a~l~G~g~~-~~p~Gi~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~v~n~~~~~~L 294 (390) T protein:vir:97 216 MNNRLIRGLKVKEDAEILRGTGAN-DGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAAI 294 (390) T ss_pred HHHHHHHHHHHHHHHHHhhcCCCC-ccccceeeccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHH Confidence 333444444444433322221111 11222221110000000000000001111222233322111 222111 Q ss_pred --ccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecc-eEEEEeecccccceEEEEEEeccEEec------- Q lcl|Aclame:pro 309 --ITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKD-LGLRWADNEIYGQYLQAVLRFGVSKVD------- 378 (394) Q Consensus 309 --~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~-~~i~~~~~~~~~~~~r~~~r~d~~v~~------- 378 (394) +.+. .|.|++..+ ..+....++|-. ++..+... -.+-+-+ |..++....|-|..+.. T Consensus 295 ~~lkd~-----~G~~l~~~~--~~~~~~~l~G~p---V~~~~~~~~~~~~~gd---~~~~~~~~~~~~~~i~~~~~~~~f 361 (390) T protein:vir:97 295 ELAKDA-----NNQYLIGNA--RGTLTPTLWGLP---VVATQAMAPGEFLVGA---FDLAAQIFDQWDARVEIGYVNDDF 361 (390) T ss_pred HHhhcC-----CCceeecCc--cCCCCceeccee---eEEcCCCCCCcEEEEe---ccceEEEEEecceEEEEeeccccc Confidence 1111 233332110 001111111110 00000000 0000000 11111111111111100 Q ss_pred -cc--ceE---EEEecCc-cCCC Q lcl|Aclame:pro 379 -DK--AGY---YVTFTPE-PLPL 394 (394) Q Consensus 379 -~~--af~---~l~~~~~-~~~~ 394 (394) .+ +|. .+.+... |..+ T Consensus 362 ~~~~~~~r~~~r~d~~v~~~~a~ 384 (390) T protein:vir:97 362 QRNMVTVLAEERLALVVYRPEAL 384 (390) T ss_pred ccCcEEEEEEEeeccEEeccccE Confidence 00 010 0000000 0000 No 242 >protein:vir:102823 Length: 470 # NCBI annotation: major structural protein # Family: family:all:2450 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874086;genbank:gi:118197693;genbank:GeneID:4496015 Probab=21.10 E-value=2.7 Score=18.26 Aligned_cols=255 Identities=11% Similarity=-0.006 Sum_probs=93.5 Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccccCCccccchhHHhHHHHHHHh--hhhhhh Q lcl|Aclame:pro 83 EEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKT--VVDLKP 160 (394) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lvP~~~~~~I~~~~~~--~~~l~~ 160 (394) ...++. ++-.+.....+.. ++. .|+++--+.+.+.+..+... .-.+.. T Consensus 1 ~~~~~~-------~~~~~a~~~al~~--------------------a~~---~g~AlR~EsLd~~l~~lt~~~~~ftf~~ 50 (470) T protein:vir:10 1 MPYEHL-------KHLDEATLKALNA--------------------AGQ---VAESLEREDLEPEVTQLNVLDTPLTDLL 50 (470) T ss_pred CChhHh-------hhhhHHHHHHHHH--------------------hhh---cchhhhhhhhccceeEeeecCccchhhh Confidence 000000 0111111110000 000 00111111111111110000 011112 Q ss_pred eeeeEeecCCceeEEE--EecCCCcccccccccccccccccccceeeecHhhhhhhhhhhHHH---HhccHHHHHHHHHH Q lcl|Aclame:pro 161 FTTVYQAKKASGKYPV--LQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQES---IDDADVDLVGIVSE 235 (394) Q Consensus 161 ~~~~~~~~~~~~~~~~--~~~~~~~~~~~~e~~~~~~~~~~~~~~v~~~~~~~~~~~~vs~el---l~ds~~~l~~~i~~ 235 (394) -+...++.+.-..+.. ..+...+.+.+ .|+..++.+++.+...+..++-++....||.-. ++....++.....+ T Consensus 51 ~i~k~~a~STV~ey~~~~~rhG~~g~s~~-~E~~l~~~~d~~~~Rr~v~~K~l~~~~~VT~~a~~~~~n~v~d~~~~~~~ 129 (470) T protein:vir:10 51 SKNAVKAKAYEHEYNVVTARHDKIGYAAF-REGGLPRTVEVNVVRRRIRPMLVGHRITVTELATRTTQNGVMQIDELVKR 129 (470) T ss_pred hcCCchhhhHhhhhhhhccccccccceee-cccccCccCCCceEEEEEEEEEEeecchhhhhhhhhhhccccchHHHHHH Confidence 2233344333333322 11222222334 466666789999999999999999999999764 33344577777777 Q ss_pred HHHHHHHHHHHHHHhhcccccc---c--cccccHHHHHHHHHhh-hhhhcc--c-------------------------E Q lcl|Aclame:pro 236 SISQIKVNTTNDAIAKVLKSFT---T--KTVKNLDEIKALLNGG-FDPAYN--V-------------------------S 282 (394) Q Consensus 236 ~l~~~~~~~~~~a~~~g~~~~~---~--~~~~~~~~i~~~~~~~-~~~~~~--a-------------------------~ 282 (394) .---.++++++.+++.|+..-+ + ....-+|.+.+++... ...... . - T Consensus 130 dai~~ia~tiE~a~FyGDs~l~s~~~g~~~gleFDGl~~lId~~~~~NViDarG~~Ls~~~L~~aa~~I~~~~~fGt~TD 209 (470) T protein:vir:10 130 EKMIAVANEFEYLAFYGDNLLGDDVPGSPNNLQQDGIINIIKRGAPQNVLDAGGRPLSIDLLWEAESRVVSTQAFANPTA 209 (470) T ss_pred HHHHHHHHHHHhhhhhhccccccccCcccCceeccchhhhccCCCCccccccCCCCccHHHHHHHHhhhcccccccChhh Confidence 7677788899999999976332 2 2233467666655321 111111 1 1 Q ss_pred EEEcHHHHHHHHhhhccCCceeecccccCCCcccccccceEEecCcccccCceEEEeccccEEEEeecceEEEEeecccc Q lcl|Aclame:pro 283 LIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKAFIGDFKRGVLFADRKDLGLRWADNEIY 362 (394) Q Consensus 283 ~vm~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~ 362 (394) ++|+.-+.+.|..=-...-|.+..++ .+....|+||- +++- -+..+.+..+..... T Consensus 210 ~~lp~~vka~f~~~~~~~qRv~~~~N----~~~~~~G~~v~-------------------~f~s-a~G~I~L~~s~~m~~ 265 (470) T protein:vir:10 210 VFISYVDKLNLQASFYQISRVMTTAD----RRAGLLGADAQ-------------------SYIG-VRGEHSLYPSQFLGD 265 (470) T ss_pred hccchhHHHHHHHhhcCceEEEEecC----CCceeeeeecc-------------------ceee-eeeeeeecccccccc Confidence 33444444433322111111111111 11122333331 1111 112222211111000 Q ss_pred cceEEEE-EEeccEE---ecccceEEEEe--------------------------------cCccCCC Q lcl|Aclame:pro 363 GQYLQAV-LRFGVSK---VDDKAGYYVTF--------------------------------TPEPLPL 394 (394) Q Consensus 363 ~~~~r~~-~r~d~~v---~~~~af~~l~~--------------------------------~~~~~~~ 394 (394) .-+.. .|++..+ .-|..++.+.. .-.++|+ T Consensus 266 --~~k~~p~~l~~~v~~~aAP~~~~tv~~t~~~~a~~~~sk~g~~~~~~v~sy~y~v~~~~gds~s~~ 331 (470) T protein:vir:10 266 --FHKFNPARFGAEVGDFAAPSNSWTVSTTDNFVTLPYNSGLGDPANTTVYSYAFKAANFYGESAAKY 331 (470) T ss_pred --hhhcCcccCCcccCCcccCceeEEeecCCCceeecccCCCCcccCcceeEEEEEEEEecCCCCcce Confidence 00000 1111111 11221111111 1112333 Done!